WO1998043478A1 - Identification of polynucleotides encoding novel helicobacter polypeptides in the helicobacter genome - Google Patents

Identification of polynucleotides encoding novel helicobacter polypeptides in the helicobacter genome Download PDF

Info

Publication number
WO1998043478A1
WO1998043478A1 PCT/US1998/006371 US9806371W WO9843478A1 WO 1998043478 A1 WO1998043478 A1 WO 1998043478A1 US 9806371 W US9806371 W US 9806371W WO 9843478 A1 WO9843478 A1 WO 9843478A1
Authority
WO
WIPO (PCT)
Prior art keywords
seq
ghpo
gly
leu
asn
Prior art date
Application number
PCT/US1998/006371
Other languages
French (fr)
Inventor
Harold Kleanthous
Amal Al-Garawi
Charles Miller
Jean-François TOMB
Raymond Peter Oomen
Original Assignee
Merieux Oravax
Human Genome Science, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Merieux Oravax, Human Genome Science, Inc. filed Critical Merieux Oravax
Priority to EP98917972A priority Critical patent/EP0977482A4/en
Priority to CA002286306A priority patent/CA2286306A1/en
Priority to KR1019997008969A priority patent/KR20010005893A/en
Priority to JP54194798A priority patent/JP2001527393A/en
Priority to AU70995/98A priority patent/AU756010B2/en
Priority to NZ338039A priority patent/NZ338039A/en
Publication of WO1998043478A1 publication Critical patent/WO1998043478A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K16/00Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
    • C07K16/12Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria
    • C07K16/1203Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria from Gram-negative bacteria
    • C07K16/121Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria from Gram-negative bacteria from Helicobacter (Campylobacter) (G)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P1/00Drugs for disorders of the alimentary tract or the digestive system
    • A61P1/04Drugs for disorders of the alimentary tract or the digestive system for ulcers, gastritis or reflux esophagitis, e.g. antacids, inhibitors of acid secretion, mucosal protectants
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P31/00Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
    • A61P31/04Antibacterial agents
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • C07K14/205Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Campylobacter (G)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A50/00TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
    • Y02A50/30Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change

Definitions

  • the invention relates to Helicobacter antigens and corresponding polynucleotide molecules that can be used in methods to prevent or treat Helicobacter infection in mammals, such as humans.
  • Helicobacter is a genus of spiral, gram-negative bacteria that colonize the gastrointestinal tracts of mammals. Several species colonize the stomach, most notably H. pylori, H. heilmanii, H. felis, and H. mustelae. Although H. pylori is the species most commonly associated with human infection, H. heilmanii and H. felis have also been isolated from humans, but at lower frequencies than H. pylori. Helicobacter infects over 50% of adult populations in developed countries and nearly 100% in developing countries and some Pacific rim countries, making it one ofthe most prevalent infections worldwide. Helicobacter is routinely recovered from gastric biopsies of humans with histological evidence of gastritis and peptic ulceration. Indeed, H.
  • pylori is now recognized as an important pathogen of humans, in that the chronic gastritis it causes is a risk factor for the development of peptic ulcer diseases and gastric carcinoma. It is thus highly desirable to develop safe and effective vaccines for preventing and treating Helicobacter infection.
  • Helicobacter antigens have been characterized or isolated. These include urease, which is composed of two structural subunits of approximately 30 and 67 kDa (Hu et al, Infect. Immun. 58:992, 1990; Dunn et al, J. Biol. Chem. 265:9464, 1990; Evans et al, Microbial Pathogenesis 10:15, 1991; Labigne et al, J. Bact, 173: 1920, 1991); the 87 kDa vacuolar cytotoxin (VacA) (Cover et al, J. Biol. Chem. 267:10570, 1992; Phadnis et al, Infect. Immun.
  • urease which is composed of two structural subunits of approximately 30 and 67 kDa (Hu et al, Infect. Immun. 58:992, 1990; Dunn et al, J. Biol. Chem. 265:9464, 1990; Evans et al, Microbial Pathogenesis 10:15,
  • the invention provides polynucleotide molecules that encode Helicobacter polypeptides, designated GHPO 35 (SEQ ID NO:2), GHPO 55 (SEQ ID NO:4), GHPO 78 (SEQ ID NO:6), GHPO 89 (SEQ ID NO: 8), GHPO 129 (SEQ ID NO: 10), GHPO 541 (SEQ ID NO: 12), GHPO 607 (SEQ ID NO:14), GHPO 635 (SEQ ID NO:16), GHPO 701 (SEQ ID NO:18), GHPO
  • GHPO 237 (SEQ ID NO: 130), GHPO 290 (SEQ ID NO: 132), GHPO 293 (SEQ ID NO:134), GHPO 335 (SEQ ID NO:136), GHPO 374 (SEQ ID NO: 138), GHPO 442 (SEQ ID NO:140), GHPO 480 (SEQ ID NO: 142), GHPO 523 (SEQ ID NO: 144), GHPO 610 (SEQ ID NO:146), GHPO 675 (SEQ ID NO: 148), GHPO 690 (SEQ ID NO: 150), GHPO 829 (SEQ ID NO: 152), GHPO
  • GHPO 1084 (SEQ ID NO:236), GHPO 1329 (SEQ ID NO:238), GHPO 1330 (SEQ ID NO:240), GHPO 1346 (SEQ ID NO:242), GHPO 1360 (SEQ ID NO:244), GHPO 1388 (SEQ ID NO:246), GHPO 1411 (SEQ ID NO:248), GHPO 1419 (SEQ ID NO:250), GHPO 1446 (SEQ ID NO:252), GHPO 1469 (SEQ ID NO:254), GHPO 1501 (SEQ ID NO:256), GHPO 1505 (SEQ ID NO:
  • GHPO 1522 (SEQ ID NO:260), GHPO 1525 (SEQ ID NO:262), GHPO 1615 (SEQ ID NO:264), GHPO 1689 (SEQ ID NO:266), GHPO 1733 (SEQ ID NO:268), GHPO 18 (SEQ ID NO:270), GHPO 139 (SEQ ID NO:272), GHPO 142 (SEQ ID NO:274), GHPO 250 (SEQ ID NO:276), GHPO 257 (SEQ ID NO:278), GHPO 325 (SEQ ID NO:280), GHPO 355 (SEQ ID NO:260), GHPO 1525 (SEQ ID NO:262), GHPO 1615 (SEQ ID NO:264), GHPO 1689 (SEQ ID NO:266), GHPO 1733 (SEQ ID NO:268), GHPO 18 (SEQ ID NO:270), GHPO 139 (SEQ ID NO:272), GHPO 142 (SEQ ID NO:27
  • GHPO 1272 (SEQ ID NO:318), GHPO 1345 (SEQ ID NO:320), GHPO 1377 (SEQ ID NO:322), GHPO 1424 (SEQ ID NO:324), GHPO 1430 (SEQ ID NO:326), GHPO 1502 (SEQ ID NO:328), GHPO 1600 (SEQ ID NO:330), GHPO 1714 (SEQ ID NO:332), GHPO 359 (SEQ ID NO:334), GHPO 678 (SEQ ID NO:336), GHPO 708 (SEQ ID NO:338), GHPO 759 (SEQ ID NO:
  • GHPO 847 (SEQ ID NO:342), GHPO 1050 (SEQ ID NO:344), GHPO 1101 (SEQ ID NO:346), GHPO 1120 (SEQ ID NO:348), GHPO 1138 (SEQ ID NO:350), GHPO 1310 (SEQ ID NO:352), GHPO 1320 (SEQ ID NO:354), GHPO 1375 (SEQ ID NO:356), GHPO 1432 (SEQ ID NO:358), GHPO 21 (SEQ ID NO:360), GHPO 282 (SEQ ID NO:362), GHPO 1089
  • GHPO 86 (SEQ ID NO:390), GHPO 99 (SEQ ID NO:392), GHPO 106 (SEQ ID NO:394), GHPO 118 (SEQ ID NO:396), GHPO 122 (SEQ ID NO:398), GHPO 128 (SEQ ID NO:400), GHPO 138 (SEQ ID NO:402), GHPO 153 (SEQ ID NO:404), GHPO 160 (SEQ ID NO:406), GHPO 168 (SEQ ID NO:408), GHPO 179 (SEQ ID NO:410), GHPO 189 (SEQ ID NO:412), GHPO
  • GHPO 382 (SEQ ID NO:450), GHPO 384 (SEQ ID NO:452), GHPO 398 (SEQ ID NO:454), GHPO 409 (SEQ ID NO:456), GHPO 422 (SEQ ID NO:458), GHPO 430 (SEQ ID NO:460), GHPO 446 (SEQ ID NO:462), GHPO 447 (SEQ ID NO:464), GHPO 450 (SEQ ID NO:466), GHPO 451 (SEQ ID NO:468), GHPO 452 (SEQ ID NO:470), GHPO 456 (SEQ ID NO:472), GHPO
  • GHPO 580 (SEQ ID NO:500), GHPO 585 (SEQ ID NO:502), GHPO 599 (SEQ ID NO:504), GHPO 639 (SEQ ID NO:506), GHPO 642 (SEQ ID NO:508), GHPO 647 (SEQ ID NO:510), GHPO 654 (SEQ ID NO:512), GHPO 669 (SEQ ID NO:514), GHPO 710 (SEQ ID NO:516), GHPO 713 (SEQ ID NO:518), GHPO 716 (SEQ ID NO:520), GHPO 718 (SEQ ID NO:522), GHPO
  • GHPO 1301 (SEQ ID NO:630), GHPO 1304 (SEQ ID NO:632), GHPO 1315 (SEQ ID NO:634), GHPO 1319 (SEQ ID NO:636), GHPO 1323 (SEQ ID NO:638), GHPO 1331 (SEQ ID NO:640), GHPO 1332 (SEQ ID NO:642), GHPO 1347 (SEQ ID NO:644), GHPO 1373 (SEQ ID NO:646), GHPO 1376 (SEQ ID NO:648), GHPO 1380 (SEQ ID NO:650), GHPO 1394 (SEQ ID NO:630), GHPO 1304 (SEQ ID NO:632), GHPO 1315 (SEQ ID NO:634), GHPO 1319 (SEQ ID NO:636), GHPO 1323 (SEQ ID NO:638), GHPO 1331 (SEQ ID NO:640), GHPO 1332 (SEQ ID NO:642), GHPO 1347 (S
  • GHPO 846 (SEQ ID NO: 1050), GHPO 875 (SEQ ID NO: 1052), GHPO 892 (SEQ ID NO: 1054), GHPO 902 (SEQ ID NO: 1056), GHPO 904 (SEQ ID NO: 1058), GHPO 906 (SEQ ID NO: 1060), GHPO 908 (SEQ ID NO: 1062), GHPO 921 (SEQ ID NO: 1064), GHPO 923 (SEQ ID NO: 1066), GHPO 926 (SEQ ID NO: 1068), GHPO 933 (SEQ ID NO: 1070), GHPO 939 (SEQ ID NO: 1050), GHPO 875 (SEQ ID NO: 1052), GHPO 892 (SEQ ID NO: 1054), GHPO 902 (SEQ ID NO: 1056), GHPO 904 (SEQ ID NO: 1058), GHPO 906 (SEQ ID NO: 1060), GHPO 908 (SEQ ID NO: 1062), GH
  • GHPO 1103 (SEQ ID NO: 1128), GHPO 1113 (SEQ ID NO:l 130), GHPO 1116 (SEQ ID NO: 1132), GHPO 1123 (SEQ ID NO: 1134), GHPO 1125 (SEQ ID NO:1136), GHPO 1129 (SEQ ID NO:1138), GHPO 1130 (SEQ ID NO:l 140), GHPO 1134 (SEQ ID NO: 1142), GHPO 1161 (SEQ ID NO: 1144), GHPO 1166 (SEQ ID NO: 1146), GHPO 1170 (SEQ ID NO: 1148), GHPO
  • GHPO 1460 (SEQ ID NO:1226), GHPO 1381 (SEQ ID NO:1228), GHPO 1401 (SEQ ID NO: 1230), GHPO 1402 (SEQ ID NO: 1232), GHPO 1403 (SEQ ID NO: 1234), GHPO 1408 (SEQ ID NO: 1236), GHPO 1416 (SEQ ID NO:1238), GHPO 1420 (SEQ ID NO: 1240), GHPO 1428 (SEQ ID NO: 1242), GHPO 1437 (SEQ ID NO: 1244), GHPO 1439 (SEQ ID NO: 1246), GHPO 1460 (SEQ ID NO:1226), GHPO 1381 (SEQ ID NO:1228), GHPO 1401 (SEQ ID NO: 1230), GHPO 1402 (SEQ ID NO: 1232), GHPO 1403 (SEQ ID NO: 1234), GHPO 1408 (SEQ ID NO: 1236), GHPO 1416 (SEQ ID NO:1238), GH
  • GHPO 1504 (SEQ ID NO:1272), GHPO 1510 (SEQ ID NO:1274), GHPO 1518 (SEQ ID NO: 1276), GHPO 1533 (SEQ ID NO: 1278), GHPO 1541 (SEQ ID NO: 1280), GHPO 1544 (SEQ ID NO: 1282), GHPO 1548 (SEQ ID NO:1284), GHPO 1565 (SEQ ID NO:1286), GHPO 1575 (SEQ ID NO:1288), GHPO 1582 (SEQ ID NO:1290), GHPO 1595 (SEQ ID NO:1292), GHPO
  • the invention includes the corresponding polypeptides (i.e., polypeptides encoded by the polynucleotide molecules ofthe invention, or fragments thereof), and monospecific antibodies that specifically bind to these polypeptides.
  • the polypeptides ofthe invention include those having the amino acid sequences shown in the sequence listing (even numbers, up to SEQ ID NO: 1363), as well as mature forms of proteins having sequences shown in the sequence listing in their unprocessed forms, and fragments thereof.
  • the present invention has many applications and includes expression cassettes, vectors, and cells transformed or transfected with the polynucleotides of the invention.
  • the present invention provides (i) methods for producing polypeptides ofthe invention in recombinant host systems and related expression cassettes, vectors, and transformed or transfected cells; (ii) live vaccine vectors, such as pox virus, Salmonella typhimurium, and Vibrio cholerae vectors, that contain polynucleotides ofthe invention (such vaccine vectors being useful in, e.g., methods for preventing or treating Helicobacter infection) in combination with a diluent or carrier, and related pharmaceutical compositions and associated therapeutic and/or prophylactic methods; (iii) therapeutic and/or prophylactic methods involving administration of polynucleotide molecules, either in a naked form or formulated with a delivery vehicle, polypeptides or mixtures of polypeptides, or monospecific antibodies ofthe invention, and related pharmaceutical compositions; (iv) methods for detecting the presence of Helicobacter in biological samples, which can involve the use of polynucleotide molecules, monospecific antibodies, or polypeptide
  • Open reading frames encoding new polypeptides, designated GHPO 35 (SEQ ID NO:2), GHPO 55 (SEQ ID NO:4), GHPO 78 (SEQ ID NO:6), GHPO 89 (SEQ ID NO:8), GHPO 129 (SEQ ID NO: 10), GHPO 541
  • GHPO 28 (SEQ ID NO: 122), GHPO 86 (SEQ ID NO: 124), GHPO 155 (SEQ ID NO: 126), GHPO 157 (SEQ ID NO: 128), GHPO 237 (SEQ ID NO: 130), GHPO 290 (SEQ ID NO: 132), GHPO 293 (SEQ ID NO: 134), GHPO 335 (SEQ ID NO: 136), GHPO 374 (SEQ ID NO: 138), GHPO 442 (SEQ ID NO: 140), GHPO 480 (SEQ ID NO: 142), GHPO 523 (SEQ ID NO: 144), GHPO
  • GHPO 208 (SEQ ID NO:204), GHPO 219 (SEQ ID NO:206), GHPO 445 (SEQ ID NO:208), GHPO 479 (SEQ ID NO:210), GHPO 525 (SEQ ID NO:212), GHPO 535 (SEQ ID NO:214), GHPO 731 (SEQ ID NO:216), GHPO 836 (SEQ ID NO:218), GHPO 879 (SEQ ID NO:220), GHPO 881 (SEQ ID NO:222), GHPO 886 (SEQ ID NO:224), GHPO 893 (SEQ ID NO:226), GHPO
  • GHPO 1446 (SEQ ID NO:252), GHPO 1469 (SEQ ID NO:254), GHPO 1501 (SEQ ID NO:256), GHPO 1505 (SEQ ID NO:258), GHPO 1522 (SEQ ID NO:260), GHPO 1525 (SEQ ID NO:262), GHPO 1615 (SEQ ID NO:264), GHPO 1689 (SEQ ID NO:266), GHPO 1733 (SEQ ID NO:268), GHPO 18 (SEQ ID NO:270), GHPO 139 (SEQ ID NO:272), GHPO 142 (SEQ ID NO:252), GHPO 142 (SEQ ID NO:
  • GHPO 250 (SEQ ID NO:276), GHPO 257 (SEQ ID NO:278), GHPO 325 (SEQ ID NO:280), GHPO 355 (SEQ ID NO:282), GHPO 357 (SEQ ID NO:284), GHPO 454 (SEQ ID NO:286), GHPO 475 (SEQ ID NO:288), GHPO 515 (SEQ ID NO:290), GHPO 527 (SEQ ID NO:292), GHPO 551 (SEQ ID NO:294), GHPO 602 (SEQ ID NO:296), GHPO 626 (SEQ ID NO:298), GHPO
  • GHPO 359 (SEQ ID NO:334), GHPO 678 (SEQ ID NO:336), GHPO 708 (SEQ ID NO:338), GHPO 759 (SEQ ID NO:340), GHPO 847 (SEQ ID NO:342), GHPO 1050 (SEQ ID NO:344), GHPO 1101 (SEQ ID NO:346), GHPO 1120 (SEQ ID NO:348), GHPO 1138 (SEQ ID NO:350), GHPO 1310 (SEQ ID NO:352), GHPO 1320 (SEQ ID NO:354), GHPO 1375 (SEQ ID NO:
  • GHPO 996 (SEQ ID NO:568), GHPO 997 (SEQ ID NO:570), GHPO 1002 (SEQ ID NO:572), GHPO 1026 (SEQ ID NO:574), GHPO 1028 (SEQ ID NO:576), GHPO 1034 (SEQ ID NO:578), GHPO 1038 (SEQ ID NO:580), GHPO 1059 (SEQ ID NO:582), GHPO 1065 (SEQ ID NO:584), GHPO 1072 (SEQ ID NO:586), GHPO 1073 (SEQ ID NO:588), GHPO 1088 (SEQ ID NO:590), GHPO 1091 (SEQ ID NO:592), GHPO 1105 (SEQ ID NO:594), GHPO 1115 (SEQ ID NO: 596), GHPO 1159 (SEQ ID NO:598), GHPO 1177
  • GHPO 1570 (SEQ ID NO:694), GHPO 1588 (SEQ ID NO:696), GHPO 1604 (SEQ ID NO:698), GHPO 1605 (SEQ ID NO:700), GHPO 1619 (SEQ ID NO:702), GHPO 1629 (SEQ ID NO:704), GHPO 1642 (SEQ ID NO:706), GHPO 1654 (SEQ ID NO:708), GHPO 1661 (SEQ ID NO:710), GHPO 1673 (SEQ ID NO:712), GHPO 1687 (SEQ ID NO:714), GHPO 1692 (SEQ ID NO:716), GHPO 1693 (SEQ ID NO:718), GHPO 1699 (SEQ ID NO:720), GHPO 1738 (SEQ ID NO:722), GHPO 1745 (SEQ ID NO:724), GHPO 1746
  • GHPO 30 (SEQ ID NO:752), GHPO 37 (SEQ ID NO:754), GHPO 49 (SEQ ID NO:756), GHPO 51 (SEQ ID NO:758), GHPO 54 (SEQ ID NO:760), GHPO 65 (SEQ ID NO:762), GHPO 66 (SEQ ID NO:764), GHPO 68 (SEQ ID NO:766), GHPO 70 (SEQ ID NO:768), GHPO 77 (SEQ ID NO:770), GHPO 83 (SEQ ID NO:772), GHPO 85 (SEQ ID NO:774), GHPO
  • GHPO 131 (SEQ ID NO:802), GHPO 133 (SEQ ID NO:804), GHPO 140 (SEQ ID NO:806), GHPO 141 (SEQ ID NO:808), GHPO 145 (SEQ ID NO:810), GHPO 147 (SEQ ID NO:812), GHPO 166 (SEQ ID NO:814), GHPO 181 (SEQ ID NO:816), GHPO 187 (SEQ ID NO:818), GHPO 188 (SEQ ID NO: 820), GHPO 192 (SEQ ID NO: 822), GHPO 202 (SEQ ID NO: 824), GHPO
  • GHPO 453 (SEQ ID NO:912), GHPO 455 (SEQ ID NO:914), GHPO 464 (SEQ ID NO:916), GHPO 467 (SEQ ID NO:918), GHPO 468 (SEQ ID NO:920), GHPO 470 (SEQ ID NO:922), GHPO 486 (SEQ ID NO:924), GHPO 487 (SEQ ID NO:926), GHPO 488 (SEQ ID NO:928), GHPO 489 (SEQ ID NO:930), GHPO 498 (SEQ ID NO:932), GHPO 501 (SEQ ID NO:934), GHPO
  • GHPO 763 (SEQ ID NO: 1020), GHPO 771 (SEQ ID NO: 1022), GHPO 774 (SEQ ID NO: 1024), GHPO 776 (SEQ ID NO: 1026), GHPO 783 (SEQ ID NO:1028), GHPO 800 (SEQ ID NO:1030), GHPO 806 (SEQ ID NO:1032), GHPO 807 (SEQ ID NO:1034), GHPO 808 (SEQ ID NO:1036), GHPO 809 (SEQ ID NO: 1038), GHPO 811 (SEQ ID NO: 1040), GHPO 815 (SEQ ID NO:
  • GHPO 1001 (SEQ ID NO: 1090), GHPO 1005 (SEQ ID NO: 1092), GHPO 1033 (SEQ ID NO: 1094), GHPO 1039 (SEQ ID NO: 1096), GHPO 1041 (SEQ ID NO: 1098), GHPO 1043 (SEQ ID NO: 1100), GHPO 1044 (SEQ ID NO: 1102), GHPO 1051 (SEQ ID NO: 1104), GHPO 1058 (SEQ ID NO: 1106), GHPO 1060 (SEQ ID NO:l 108), GHPO 1075 (SEQ ID NO: 1110), GHPO 1077 (SEQ ID NO:1112), GHPO 1082 (SEQ ID NO:1114), GHPO 1083 (SEQ ID NO:1116), GHPO 1086 (SEQ ID NO: 1118), GHPO 1087 (SEQ ID NO:
  • GHPO 1161 (SEQ ID NO: 1144), GHPO 1166 (SEQ ID NO: 1146), GHPO 1170 (SEQ ID NO: 1148), GHPO 1175 (SEQ ID NO: 1150), GHPO 1181 (SEQ ID NO: 1152), GHPO 1186 (SEQ ID NO: 1154), GHPO 1188 (SEQ ID NO: 1156), GHPO 1191 (SEQ ID NO:1158), GHPO 1193 (SEQ ID NO: 1160), GHPO 1196 (SEQ ID NO: 1162), GHPO 1204 (SEQ ID NO: 1164), GHPO
  • GHPO 1575 (SEQ ID NO:1288), GHPO 1582 (SEQ ID NO:1290), GHPO 1595 (SEQ ID NO:1292), GHPO 1597 (SEQ ID NO:1294), GHPO 1599 (SEQ ID NO:1296), GHPO 1601 (SEQ ID NO: 1298), GHPO 1609 (SEQ ID NO:1300), GHPO 1613 (SEQ ID NO: 1302), GHPO 1614 (SEQ ID NO: 1304), GHPO 1626 (SEQ ID NO: 1306), GHPO 1628 (SEQ ID NO: 1308), GHPO
  • GHPO 1695 (SEQ ID NO:1334), GHPO 1697 (SEQ ID NO:1336), GHPO 1701 (SEQ ID NO:1338), GHPO 1719 (SEQ ID NO:1340), GHPO 1723 (SEQ ID NO:1342), GHPO 1732 (SEQ ID NO:1344), GHPO 1739 (SEQ ID NO: 1346), GHPO 1741 (SEQ ID NO:1348), GHPO 1747 (SEQ ID NO:1350), GHPO 1749 (SEQ ID NO:1352), GHPO 1750 (SEQ ID NO: 1354), GHPO 1751 (SEQ ID NO:1356), GHPO 1755 (SEQ ID NO:1358), GHPO 1771 (SEQ ID NO:1360), GHPO 1786 (SEQ ID NO: 1362), and GHPO
  • GHPO 1275, GHPO 1308, GHPO 1600, GHPO 1615, GHPO 536, GHPO 66, GHPO 1363, GHPO 1595, and GHPO 1166 have been shown to be protective antigens that can be used in methods for preventing Helicobacter infection.
  • protective antigen is meant an antigen that is capable of reducing the infection level after challenge, relative to a positive control. Absolute protection from infection, although included in the invention, is not required.
  • Some ofthe new polypeptides are secreted polypeptides that can be produced in their mature forms (i.e., as polypeptides that have been exported through class II or class III secretion pathways) or as precursors that include signal peptides, which can be removed in the course of excretion/secretion by cleavage at the N-terminal end ofthe mature form. (The cleavage site is located at the C-terminal end ofthe signal peptide, adjacent to the mature form.)
  • GHPO proteins listed above. Examples of such polynucleotides are those encoding GHPO 35 (SEQ ID NO:l), GHPO 55 (SEQ ID NO:3), GHPO 78 (SEQ ID NO:5), GHPO 89 (SEQ ID NO:7), GHPO 129 (SEQ ID NO:9), GHPO 541 (SEQ ID NO: 11), GHPO 607 (SEQ ID NO: 13), GHPO 635 (SEQ ID NO: 15), GHPO 701 (SEQ ID NO: 17), GHPO 712 (SEQ ID NO:19), GHPO 761 (SEQ ID NO:21), GHPO 838 (SEQ ID NO:23), GHPO 1034 (SEQ ID NO:25), GHPO 1085 (SEQ ID NO:27), GHPO 1213 (SEQ ID NO:29), GHPO
  • GHPO 1490 (SEQ ID NO: 107), GHPO 1559 (SEQ ID NO: 109), GHPO 1651 (SEQ ID NO: 111), GHPO 1726 (SEQ ID NO: 113), GHPO 1780 (SEQ ID NO:l 15), GHPO 895 (SEQ ID NO:l 17), GHPO 1447 (SEQ ID NO:l 19), GHPO 28 (SEQ ID NO:121), GHPO 86 (SEQ ID NO:123), GHPO 155 (SEQ ID NO: 125), GHPO 157 (SEQ ID NO: 127), GHPO 237 (SEQ ID NO: 129),
  • GHPO 290 (SEQ ID NO: 131), GHPO 293 (SEQ ID NO: 133), GHPO 335 (SEQ ID NO: 135), GHPO 374 (SEQ ID NO: 137), GHPO 442 (SEQ ID NO: 139), GHPO 480 (SEQ ID NO: 141), GHPO 523 (SEQ ID NO: 143), GHPO 610 (SEQ ID NO: 145), GHPO 675 (SEQ ID NO: 147), GHPO 690 (SEQ ID NO: 149), GHPO 829 (SEQ ID NO: 151), GHPO 850 (SEQ ID NO: 153), GHPO 876 (SEQ ID NO: 155), GHPO 984 (SEQ ID NO: 157), GHPO 989 (SEQ ID NO: 159), GHPO 1111 (SEQ ID NO:161), GHPO 1145 (SEQ ID NO:163),
  • GHPO 1256 (SEQ ID NO: 165), GHPO 1264 (SEQ ID NO: 167), GHPO 1316 (SEQ ID NO: 169), GHPO 1368 (SEQ ID NO: 171), GHPO 1442 (SEQ ID NO: 173), GHPO 1506 (SEQ ID NO: 175), GHPO 1543 (SEQ ID NO: 177), GHPO 1574 (SEQ ID NO: 179), GHPO 1627 (SEQ ID NO: 181), GHPO 1657 (SEQ ID NO: 183), GHPO 1664 (SEQ ID NO: 185), GHPO 1694 (SEQ ID NO: 165), GHPO 1264 (SEQ ID NO: 167), GHPO 1316 (SEQ ID NO: 169), GHPO 1368 (SEQ ID NO: 171), GHPO 1442 (SEQ ID NO: 173), GHPO 1506 (SEQ ID NO: 175), GHPO 1543 (SEQ ID NO: 177), GH
  • GHPO 1345 (SEQ ID NO:319), GHPO 1377 (SEQ ID NO:321), GHPO 1424 (SEQ ID NO:323), GHPO 1430 (SEQ ID NO:325), GHPO 1502 (SEQ ID NO:327), GHPO 1600 (SEQ ID NO:329), GHPO 1714 (SEQ ID NO:331), GHPO 359 (SEQ ID NO:333), GHPO 678 (SEQ ID NO:335), GHPO 708 (SEQ ID NO:337), GHPO 759 (SEQ ID NO:339), GHPO 847 (SEQ ID NO:319), GHPO 1377 (SEQ ID NO:321), GHPO 1424 (SEQ ID NO:323), GHPO 1430 (SEQ ID NO:325), GHPO 1502 (SEQ ID NO:327), GHPO 1600 (SEQ ID NO:329), GHPO 1714 (SEQ ID NO:331), GHPO 359
  • GHPO 1050 (SEQ ID NO:343), GHPO 1101 (SEQ ID NO:345), GHPO 1120 (SEQ ID NO:347), GHPO 1138 (SEQ ID NO:349), GHPO 1310 (SEQ ID NO:351), GHPO 1320 (SEQ ID NO:353), GHPO 1375 (SEQ ID NO:355), GHPO 1432 (SEQ ID NO:357), GHPO 21 (SEQ ID NO:359), GHPO 282 (SEQ ID NO:361), GHPO 1089 (SEQ ID NO:363), GHPO 1141 (SEQ ID NO:341), GHPO 1050 (SEQ ID NO:343), GHPO 1101 (SEQ ID NO:345), GHPO 1120 (SEQ ID NO:347), GHPO 1138 (SEQ ID NO:349), GHPO 1310 (SEQ ID NO:351), GHPO 1320 (SEQ ID NO:353), GHPO
  • GHPO 284 (SEQ ID NO:427), GHPO 296 (SEQ ID NO:429), GHPO 300 (SEQ ID NO:431), GHPO 305 (SEQ ID NO:433), GHPO 319 (SEQ ID NO:435), GHPO 330 (SEQ ID NO:437), GHPO 340 (SEQ ID NO:439), GHPO 342 (SEQ ID NO:441), GHPO 344 (SEQ ID NO:443), GHPO 358 (SEQ ID NO:445), GHPO 373 (SEQ ID NO:447), GHPO 382 (SEQ ID NO:449), GHPO
  • GHPO 478 SEQ ID NO:477), GHPO 491 (SEQ ID NO:479), GHPO 511 (SEQ ID NO:481), GHPO 519 (SEQ ID NO:483), GHPO 526 (SEQ ID NO:485), GHPO 534 (SEQ ID NO:487), GHPO 536 (SEQ ID NO:489), GHPO 542 (SEQ ID NO:491), GHPO 544 (SEQ ID NO:493), GHPO 576 (SEQ ID NO:495), GHPO 578 (SEQ ID NO:497), GHPO 580 (SEQ ID NO:499), GHPO
  • GHPO 1225 (SEQ ID NO:609), GHPO 1228 (SEQ ID NO:611), GHPO 1229 (SEQ ID NO:613), GHPO 1231 (SEQ ID NO:615), GHPO 1236 (SEQ ID NO:617), GHPO 1242 (SEQ ID NO:619), GHPO 1248 (SEQ ID NO:621), GHPO 1270 (SEQ ID NO:623), GHPO 1271 (SEQ ID NO:625), GHPO 1298 (SEQ ID NO:627), GHPO 1301 (SEQ ID NO:629), GHPO 1304 (SEQ ID NO:609), GHPO 1228 (SEQ ID NO:611), GHPO 1229 (SEQ ID NO:613), GHPO 1231 (SEQ ID NO:615), GHPO 1236 (SEQ ID NO:617), GHPO 1242 (SEQ ID NO:619), GHPO 1248 (SEQ ID NO:621), GH
  • GHPO 1315 (SEQ ID NO:633), GHPO 1319 (SEQ ID NO:635), GHPO 1323 (SEQ ID NO:637), GHPO 1331 (SEQ ID NO:639), GHPO 1332 (SEQ ID NO:641), GHPO 1347 (SEQ ID NO:643), GHPO 1373 (SEQ ID NO:645), GHPO 1376 (SEQ ID NO:647), GHPO 1380 (SEQ ID NO:649), GHPO 1394 (SEQ ID NO:651), GHPO 1407 (SEQ ID NO:653), GHPO 1415
  • GHPO 1560 SEQ ID NO:689
  • GHPO 1564 SEQ ID NO:691
  • GHPO 1570 SEQ ID NO:693
  • GHPO 1588 SEQ ID NO:695
  • GHPO 1604 SEQ ID NO:697
  • GHPO 1605 SEQ ID NO:699
  • GHPO 1619 SEQ ID NO:701
  • GHPO 1629 SEQ ID NO:703
  • GHPO 1642 SEQ ID NO:705
  • GHPO 1654 SEQ ID NO:707
  • GHPO 1661 SEQ ID NO:709
  • GHPO 7 (SEQ ID NO:735), GHPO 8 (SEQ ID NO:737), GHPO 9 (SEQ ID NO:739), GHPO 10 (SEQ ID NO:741), GHPO 12 (SEQ ID NO:743), GHPO 25 (SEQ ID NO:745), GHPO 27 (SEQ ID NO:747), GHPO 29 (SEQ ID NO:749), GHPO 30 (SEQ ID NO:751), GHPO 37 (SEQ ID NO:753), GHPO 49 (SEQ ID NO:755), GHPO 51 (SEQ ID NO:757), GHPO 54 (SEQ ID NO:
  • GHPO 65 (SEQ ID NO:761), GHPO 66 (SEQ ID NO:763), GHPO 68 (SEQ ID NO:765), GHPO 70 (SEQ ID NO:767), GHPO 77 (SEQ ID NO:769), GHPO 83 (SEQ ID NO:771), GHPO 85 (SEQ ID NO:773), GHPO 87 (SEQ ID NO:775), GHPO 91 (SEQ ID NO:777), GHPO 92 (SEQ ID NO:779), GHPO 96 (SEQ ID NO:781), GHPO 97 (SEQ ID NO:783), GHPO
  • GHPO 192 (SEQ ID NO:821), GHPO 202 (SEQ ID NO:823), GHPO 204 (SEQ ID NO:825), GHPO 205 (SEQ ID NO:827), GHPO 212 (SEQ ID NO:829), GHPO 218 (SEQ ID NO:831), GHPO 226 (SEQ ID NO:833), GHPO 231 (SEQ ID NO:835), GHPO 236 (SEQ ID NO:837), GHPO 239 (SEQ ID NO:839), GHPO 245 (SEQ ID NO:841), GHPO 246 (SEQ ID NO:843), GHPO
  • GHPO 326 SEQ ID NO:871
  • GHPO 331 SEQ ID NO:873
  • GHPO 343 SEQ ID NO: 875
  • GHPO 345 SEQ ID NO: 877
  • GHPO 346 SEQ ID NO:879
  • GHPO 352 SEQ ID NO:881
  • GHPO 355 SEQ ID NO:883
  • GHPO 363 SEQ ID NO:885
  • GHPO 369 SEQ ID NO:887
  • GHPO 376 SEQ ID NO:889
  • GHPO 378 SEQ ID NO:891
  • GHPO 388 SEQ ID NO:893
  • GHPO 470 (SEQ ID NO:921), GHPO 486 (SEQ ID NO:923), GHPO 487 (SEQ ID NO:925), GHPO 488 (SEQ ID NO:927), GHPO 489 (SEQ ID NO:929), GHPO 498 (SEQ ID NO:931), GHPO 501 (SEQ ID NO:933), GHPO 504 (SEQ ID NO:935), GHPO 512 (SEQ ID NO:937), GHPO 517 (SEQ ID NO:939), GHPO 520 (SEQ ID NO:941), GHPO 528 (SEQ ID NO:943), GHPO 530 (SEQ ID NO:945), GHPO 532 (SEQ ID NO:947), GHPO 548 (SEQ ID NO:949), GHPO 561 (SEQ ID NO:951), GHPO 564 (SEQ ID NO:953), GHPO
  • GHPO 612 (SEQ ID NO:981), GHPO 615 (SEQ ID NO:983), GHPO 632 (SEQ ID NO:985), GHPO 633 (SEQ ID NO:987), GHPO 637 (SEQ ID NO:989), GHPO 651 (SEQ ID NO:991), GHPO 663 (SEQ ID NO:993), GHPO 686 (SEQ ID NO:995), GHPO 693 (SEQ ID NO:997), GHPO 698 (SEQ ID NO:999), GHPO 703 (SEQ ID NO: 1001), GHPO 704 (SEQ ID NO: 1003),
  • GHPO 705 (SEQ ID NO: 1005), GHPO 707 (SEQ ID NO: 1007), GHPO 721 (SEQ ID NO: 1009), GHPO 727 (SEQ ID NO: 1011), GHPO 728 (SEQ ID NO: 1013), GHPO 733 (SEQ ID NO: 1015), GHPO 758 (SEQ ID NO: 1017), GHPO 763 (SEQ ID NO: 1019), GHPO 771 (SEQ ID NO: 1021), GHPO 774 (SEQ ID NO: 1023), GHPO 776 (SEQ ID NO: 1025), GHPO 783 (SEQ ID NO: 1005), GHPO 707 (SEQ ID NO: 1007), GHPO 721 (SEQ ID NO: 1009), GHPO 727 (SEQ ID NO: 1011), GHPO 728 (SEQ ID NO: 1013), GHPO 733 (SEQ ID NO: 1015), GHPO 758 (SEQ ID NO: 1017), GH
  • GHPO 991 SEQ ID NO:1085)
  • GHPO 998 SEQ ID NO:1087
  • GHPO 1001 SEQ ID NO: 1089
  • GHPO 1005 SEQ ID NO: 1091
  • GHPO 1033 SEQ ID NO: 1093
  • GHPO 1039 SEQ ID NO: 1095
  • GHPO 1041 SEQ ID NO: 1097
  • GHPO 1043 SEQ ID NO: 1099
  • GHPO 1044 SEQ ID NO: 1101
  • GHPO 1051 SEQ ID NO: 1103
  • GHPO 1058 SEQ ID NO: 1105)
  • GHPO 1060 (SEQ ID NO:1107), GHPO 1075 (SEQ ID NO:1109), GHPO 1077 (SEQ ID NO:l l l l), GHPO 1082 (SEQ ID NO:1113), GHPO 1083 (SEQ ID NO:1115), GHPO 1086 (SEQ ID NO:1117), GHPO 1087 (SEQ ID NO: 1119), GHPO 1090 (SEQ ID NO: 1121), GHPO 1097 (SEQ ID NO: 1123), GHPO 1098 (SEQ ID NO: 1125), GHPO 1103 (SEQ ID NO: 1127), GHPO
  • GHPO 1401 (SEQ ID NO:1229), GHPO 1402 (SEQ ID NO: 1231), GHPO 1403 (SEQ ID NO: 1233), GHPO 1408 (SEQ ID NO: 1235), GHPO 1416 (SEQ ID NO: 1237), GHPO 1420 (SEQ ID NO: 1239), GHPO 1428 (SEQ ID NO: 1241), GHPO 1437 (SEQ ID NO: 1243), GHPO 1439 (SEQ ID NO: 1245), GHPO 1460 (SEQ ID NO: 1247), GHPO 1463 (SEQ ID NO: 1249),
  • GHPO 1472 (SEQ ID NO: 1251), GHPO 1474 (SEQ ID NO: 1253), GHPO 1484 (SEQ ID NO: 1255), GHPO 1489 (SEQ ID NO: 1257), GHPO 1494 (SEQ ID NO: 1259), GHPO 1495 (SEQ ID NO: 1261), GHPO 1498 (SEQ ID NO: 1263), GHPO 1499 (SEQ ID NO: 1265), GHPO 1500 (SEQ ID NO: 1267), GHPO 1503 (SEQ ID NO:1269), GHPO 1504 (SEQ ID NO:1271), GHPO
  • GHPO 1749 (SEQ ID NO:1351), GHPO 1750 (SEQ ID NO: 1353), GHPO 1751 (SEQ ID NO:1355), GHPO 1755 (SEQ ID NO:1357), GHPO 1771 (SEQ ID NO:1359), GHPO 1786 (SEQ ID NO:1361), and GHPO 1789 (SEQ ID NO:1363).
  • An isolated polynucleotide ofthe invention encodes (i) a polypeptide having an amino acid sequence that is homologous to a Helicobacter amino acid sequence of a polypeptide, the Helicobacter amino acid sequence being selected from the group consisting ofthe amino acid sequences shown in the sequence listing (even numbers, up to SEQ ID NO: 1364), or (ii) a derivative of the polypeptide.
  • polynucleotides included in the invention can also encode polypeptides that lack signal sequences, as well as other polypeptide or peptide fragments ofthe full-length polypeptides.
  • isolated polynucleotide is defined as a polynucleotide that is removed from the environment in which it naturally occurs.
  • a naturally-occurring DNA molecule present in the genome of a living bacteria or as part of a gene bank is not isolated, but the same molecule, separated from the remaining part ofthe bacterial genome, as a result of, e.g., a cloning event (amplification), is "isolated.”
  • an isolated DNA molecule is free from DNA regions (e.g., coding regions) with which it is immediately contiguous, at the 5 ' or 3' ends, in the naturally occurring genome.
  • isolated polynucleotides can be part of a vector or a composition and still be isolated, as such a vector or composition is not part of its natural environment.
  • a polynucleotide ofthe invention can consist of RNA or DNA (e.g., cDNA, genomic DNA, or synthetic DNA), or modifications or combinations of RNA or DNA.
  • the polynucleotide can be double-stranded or single-stranded and, if single-stranded, can be the coding (sense) strand or the non-coding (anti- sense) strand.
  • sequences that encode polypeptides ofthe invention can be (a) the coding sequence as shown in any ofthe nucleotide sequences ofthe sequence listing (odd numbers, up to SEQ ID NO: 1363); (b) a ribonucleotide sequence derived by transcription of (a); or (c) a different coding sequence that, as a result ofthe redundancy or degeneracy ofthe genetic code, encodes the same polypeptides as the polynucleotide molecules having the sequences illustrated in any ofthe nucleotide sequences ofthe sequence listing (odd numbers, up to SEQ ID NO: 1363).
  • the polypeptide can be one that is naturally secreted or excreted by, e.g., H. felis, H. mustelae, H. heilmanii, or H. pylori.
  • polypeptide or “protein” is meant any chain of amino acids, regardless of length or post-translational modification (e.g., glycosylation or phosphorylation). Both terms are used interchangeably in the present application.
  • homologous amino acid sequence is meant an amino acid sequence that differs from an amino acid sequence shown in the sequence listing (even numbers, up to SEQ ID NO: 1364), or an amino acid sequence encoded by a nucleotide sequence shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363), by one or more non-conservative amino acid substitutions, deletions, or additions located at positions at which they do not destroy the specific antigenicity ofthe polypeptide.
  • such a sequence is at least 75%o, more preferably at least 80%, and most preferably at least 90% identical to an amino acid sequence shown in the sequence listing (even numbers, up to SEQ ID NO: 1364).
  • Homologous amino acid sequences include sequences that are identical or substantially identical to an amino acid sequence as shown in the sequence listing (even numbers, up to SEQ ID NO: 1364).
  • amino acid sequence that is substantially identical is meant a sequence that is at least 90%), preferably at least 95%, more preferably at least 97%, and most preferably at least 99% identical to an amino acid sequence of reference and that differs from the sequence of reference, if at all, by a majority of conservative amino acid substitutions.
  • Conservative amino acid substitutions typically include substitutions among amino acids ofthe same class. These classes include, for example, amino acids having uncharged polar side chains, such as asparagine, glutamine, serine, threonine, and tyrosine; amino acids having basic side chains, such as lysine, arginine, and histidine; amino acids having acidic side chains, such as aspartic acid and glutamic acid; and amino acids having nonpolar side chains, such as glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, and cysteine. Homology can be measured using sequence analysis software (e.g.,
  • homologous polynucleotide sequences are defined in a similar way.
  • a homologous sequence is one that is at least 45%, more preferably at least 60%>, and most preferably at least 85% identical to a coding sequence of any ofthe nucleotide sequences set forth in the sequence listing (odd numbers, up to SEQ ID NO: 1363).
  • Polypeptides having a sequence homologous to any one ofthe sequences shown in the sequence listing include naturally-occurring allelic variants, as well as mutants or any other non- naturally occurring variants that are analogous in terms of antigenicity, to a polypeptide having a sequence as shown in the sequence listing (even numbers, up to SEQ ID NO: 1364).
  • an allelic variant is an alternate form of a polypeptide that is characterized as having a substitution, deletion, or addition of one or more amino acids that does not alter the biological function ofthe polypeptide.
  • biological function is meant a function ofthe polypeptide in the cells in which it naturally occurs, even if the function is not necessary for the growth or survival ofthe cells.
  • the biological function of a porin is to allow the entry into cells of compounds present in the extracellular medium.
  • the biological function is distinct from the antigenic function.
  • a polypeptide can have more than one biological function.
  • Allelic variants are very common in nature. For example, a bacterial species, e.g., H.
  • pylori is usually represented by a variety of strains that differ from each other by minor allelic variations. Indeed, a polypeptide that fulfills the same biological function in different strains can have an amino acid sequence that is not identical in each ofthe strains. Such an allelic variation can be equally reflected at the polynucleotide level.
  • allelic variants of polypeptide antigens comes from, e.g., studies ofthe Helicobacter urease antigen.
  • the amino acid sequence of Helicobacter urease varies widely from species to species, yet cross-species protection occurs, indicating that the urease molecule, when used as an immunogen, is highly tolerant of amino acid variations. Even among different strains ofthe single species H. pylori, there are amino acid sequence variations.
  • H. pylori urease protects mice from H. felis infection (Michetti et al, Gasfroenterology 107:1002, 1994).
  • UreA and UreB which contain distinct amino acid sequences, are both protective antigens against Helicobacter infection (Michetti et al, supra).
  • H. pylori strain CPM630 H. pylori strain CPM630; Lee et al, J. Infect. Dis.l72:161, 1995); recombinant UreA + UreB apoenzyme expressed from pORN214 (UreA and UreB sequences differ from H. pylori strain CPM630 by one and two amino acid changes, respectively; Lee et al, supra, 1995); a UreA-glutathione-S- transferase fusion protein (UreA sequence from H. pylori strain ATCC 43504; Thomas et al, Acta Gastro-Enterologica Belgica 56:54, 1993); UreA + UreB holoenzyme purified from H. pylori strain ⁇ CTC11637 (Marchetti et al,
  • UreA-MBP fusion protein (UreA from H. pylori strain 85P; Ferrero et al, Infection and Immunity 62:4981, 1994); a UreB-MBP fusion protein (UreB from H. pylori strain 85P; Ferrero et al, supra); a UreA- MBP fusion protein (UreA from H felis strain ATCC 49179; Ferrero et al, supra); a UreB-MBP fusion protein (UreB from H. felis strain ATCC 49179;
  • Polynucleotides, e.g., D ⁇ A molecules, encoding allelic variants can easily be obtained by polymerase chain reaction (PCR) amplification of genomic bacterial D ⁇ A extracted by conventional methods.
  • PCR polymerase chain reaction
  • Suitable primers can be designed based on the nucleotide sequence information provided in the sequence listing (odd numbers, up to SEQ ID NO: 1363).
  • a primer consists of 10 to 40, preferably 15 to 25 nucleotides.
  • primers containing C and G nucleotides in proportions sufficient to ensure efficient hybridization, e.g., an amount of C and G nucleotides of at least 40%, preferably 50%, ofthe total nucleotide amount.
  • primers that can be used to isolate the polynucleotides ofthe invention from different Helicobacter strains can readily design primers that can be used to isolate the polynucleotides ofthe invention from different Helicobacter strains.
  • Experimental conditions for carrying out PCR can readily be determined by one skilled in the art and an illustration of carrying out PCR is provided in Example 2.
  • restriction endonuclease recognition sites that contain, typically, 4 to 6 nucleotides (for example, the sequences 5'-
  • GGATCC-3' (BamHI) or 5'-CTCGAG-3' (Xhol)
  • Restriction sites can be selected by those skilled in the art so that the amplified DNA can be conveniently cloned into an appropriately digested vector, such as a plasmid.
  • Useful homologs that do not occur naturally can be designed using known methods for identifying regions of an antigen that are likely to be tolerant of amino acid sequence changes and/or deletions. For example, sequences ofthe antigen from different species can be compared to identify conserved sequences.
  • Polypeptide derivatives that are encoded by polynucleotides of the invention include, e.g., fragments, polypeptides having large internal deletions derived from full-length polypeptides, and fusion proteins.
  • Polypeptide fragments ofthe invention can be derived from a polypeptide having a sequence homologous to any ofthe sequences ofthe sequence listing (even numbers, up to SEQ ID NO: 1364), to the extent that the fragments retain the substantial antigenicity ofthe parent polypeptide (specific antigenicity).
  • Polypeptide derivatives can also be constructed by large internal deletions that remove a substantial part ofthe parent polypeptide, while retaining specific antigenicity.
  • polypeptide derivatives should be about at least 12 amino acids in length to maintain antigenicity.
  • they can be at least 20 amino acids, preferably at least 50 amino acids, more preferably at least 75 amino acids, and most preferably at least 100 amino acids in length.
  • polypeptide derivatives e.g., polypeptide fragments
  • polypeptide fragments can be designed using computer-assisted analysis of amino acid sequences in order to identify sites in protein antigens having potential as surface-exposed, antigenic regions (Hughes et al, Infect. Immun. 60(9):3497, 1992). For example, the
  • Laser Gene Program from DNA Star can be used to obtain hydrophilicity, antigenic index, and intensity index plots for the polypeptides ofthe invention.
  • This program can also be used to obtain information about homologies ofthe polypeptides with known protein motifs.
  • One skilled in the art can readily use the information provided in such plots to select peptide fragments for use as vaccine antigens. For example, fragments spanning regions ofthe plots in which the antigenic index is relatively high can be selected. One can also select fragments spanning regions in which both the antigenic index and the intensity plots are relatively high. Fragments containing conserved sequences, particularly hydrophilic conserved sequences, can also be selected.
  • Polypeptide fragments and polypeptides having large internal deletions can be used for revealing epitopes that are otherwise masked in the parent polypeptide and that may be of importance for inducing a protective T cell- dependent immune response. Deletions can also remove immunodominant regions of high variability among strains.
  • Polynucleotides encoding polypeptide fragments and polypeptides having large internal deletions can be constructed using standard methods (see, e.g., Ausubel et al, Current Protocols in Molecular Biology, John Wiley & Sons Inc., 1994), for example, by PCR, including inverse PCR, by restriction enzyme treatment ofthe cloned DNA molecules, or by the method of Kunkel et al. (Proc. Natl. Acad. Sci. USA 82:448, 1985; biological material available at Stratagene).
  • a polypeptide derivative can also be produced as a fusion polypeptide that contains a polypeptide or a polypeptide derivative ofthe invention fused, e.g., at the N- or C-terminal end, to any other polypeptide (hereinafter referred to as a peptide tail).
  • a product can be easily obtained by translation of a genetic fusion, i.e., a hybrid gene.
  • Vectors for expressing fusion polypeptides are commercially available, and include the pMal-c2 or pMal-p2 systems of New England Biolabs, in which the peptide tail is a maltose binding protein, the glutathione-S-transferase system of Pharmacia, or the His-Tag system available from Novagen. These and other expression systems provide convenient means for further purification of polypeptides and derivatives ofthe invention.
  • fusion polypeptides included in invention includes a polypeptide or polypeptide derivative ofthe invention fused to a polypeptide having adjuvant activity, such as, e.g., subunit B of either cholera toxin or E. coli heat-labile toxin.
  • a polypeptide having adjuvant activity such as, e.g., subunit B of either cholera toxin or E. coli heat-labile toxin.
  • the polypeptide ofthe invention can be fused to the N-terminal end or, preferably, to the C-terminal end ofthe polypeptide having adjuvant activity.
  • a polypeptide fragment ofthe invention can be fused within the amino acid sequence ofthe polypeptide having adjuvant activity. Spacer sequences can also be included, if desired.
  • the polynucleotides ofthe invention encode Helicobacter polypeptides in precursor or mature form. They can also encode hybrid precursors containing heterologous signal peptides, which can mature into polypeptides ofthe invention.
  • heterologous signal peptide is meant a signal peptide that is not found in the naturally-occurring precursor of a polypeptide ofthe invention.
  • a polynucleotide ofthe invention hybridizes, preferably under stringent conditions, to a polynucleotide having a sequence as shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363).
  • Hybridization procedures are, e.g., described by Ausubel et al. (supra); Silhavy et al. (Experiments with Gene
  • Tm melting temperature
  • hybridization temperature (Th) is approximately 20 to 40°C, 20 to 25 °C, or, preferably, 30 to 40°C below the calculated Tm.
  • optimal temperature and salt conditions can be readily determined empirically in preliminary experiments using conventional procedures. For example, stringent conditions can be achieved, both for pre-hybridizing and hybridizing incubations, (i) within 4-16 hours at 42 °C, in 6 x SSC containing 50% formamide or (ii) within 4-16 hours at 65 °C in an aqueous 6 x SSC solution (1 M NaCl, 0.1 M sodium citrate (pH 7.0)).
  • Tm 4 x (G+C) + 2 (A+T).
  • a polynucleotide molecule ofthe invention containing RNA, DNA, or modifications or combinations thereof, can have various applications.
  • a polynucleotide molecule can be used (i) in a process for producing the encoded polypeptide in a recombinant host system, (ii) in the construction of vaccine vectors such as poxviruses, which are further used in methods and compositions for preventing and/or treating Helicobacter infection, (iii) as a vaccine agent, in a naked form or formulated with a delivery vehicle and, (iv) in the construction of attenuated Helicobacter strains that can over-express a polynucleotide ofthe invention or express it in a non-toxic, mutated form.
  • vaccine vectors such as poxviruses
  • elements e.g., a promoter
  • a recombinant expression system can be selected from procaryotic and eucaryotic hosts.
  • Eucaryotic hosts include, for example, yeast cells (e.g., Saccharomyces cerevisiae or Pichia Pastoris), mammalian cells (e.g., COS1,
  • NIH3T3, or JEG3 cells NIH3T3, or JEG3 cells
  • arthropods cells e.g., Spodoptera frugiperda (SF9) cells
  • plant cells e.g., a procaryotic host such as E. coli is used.
  • Bacterial and eucaryotic cells are available from a number of different sources that are known to those skilled in the art, e.g., the American Type Culture Collection (ATCC; Rockville, Maryland).
  • an expression cassette includes a constitutive or inducible promoter that is functional in the selected host system; a ribosome binding site; a start codon (ATG); if necessary, a region encoding a signal peptide, e.g., a lipidation signal peptide; a polynucleotide molecule ofthe invention; a stop codon; and, optionally, a 3' terminal region (translation and/or transcription terminator).
  • the signal peptide-encoding region is adjacent to the polynucleotide ofthe invention and is placed in the proper reading frame.
  • the signal peptide-encoding region can be homologous or heterologous to the polynucleotide molecule encoding the mature polypeptide and it can be specific to the secretion apparatus ofthe host used for expression.
  • the open reading frame constituted by the polynucleotide molecule ofthe invention, alone or together with the signal peptide, is placed under the control ofthe promoter so that transcription and translation occur in the host system.
  • Promoters and signal peptide-encoding regions are widely known and available to those skilled in the art and include, for example, the promoter of Salmonella typhimurium (and derivatives) that is inducible by arabinose (promoter araB) and is functional in Gram-negative bacteria such as E. coli (U.S. Patent No. 5,028,530; Cagnon et al, Protein Engineering 4(7) : 843 , 1991 ); the promoter of the bacteriophage T7 RNA polymerase gene, which is functional in a number of E. coli strains expressing T7 polymerase (U.S. Patent No. 4,952,496); the OspA lipidation signal peptide; and RlpB lipidation signal peptide (Takase et al, J. Bact. 169:5692, 1987).
  • the expression cassette is typically part of an expression vector, which is selected for its ability to replicate in the chosen expression system.
  • Expression vectors e.g., plasmids or viral vectors
  • plasmids or viral vectors can be chosen from, for example, those described in Pouwels et al. (Cloning Vectors: A Laboratory Manual, 1985, Supp. 1987) and can purchased from various commercial sources. Methods for transforming or transfecting host cells with expression vectors are well known in the art and will depend on the host system selected, as described in Ausubel et al (supra).
  • a recombinant polypeptide ofthe invention (or a polypeptide derivative) is produced and remains in the intracellular compartment, is secreted/excreted in the extracellular medium or in the periplasmic space, or is embedded in the cellular membrane.
  • the polypeptide can then be recovered in a substantially purified form from the cell extract or from the supernatant after centrifugation ofthe cell culture.
  • the recombinant polypeptide can be purified by antibody-based affinity purification or by any other method known to a person skilled in the art, such as by genetic fusion to a small affinity-binding domain.
  • Antibody-based affinity purification methods are also available for purifying a polypeptide ofthe invention extracted from a Helicobacter strain. Antibodies useful for immunoaffinity purification ofthe polypeptides ofthe invention can be obtained using methods described below.
  • Polynucleotides ofthe invention can also be used in DNA vaccination methods, using either a viral or bacterial host as gene delivery vehicle (live vaccine vector) or administering the gene in a free form, e.g., inserted into a plasmid.
  • Therapeutic or prophylactic efficacy of a polynucleotide ofthe invention can be evaluated as is described below.
  • a vaccine vector such as a poxvirus, containing a polynucleotide molecule ofthe invention placed under the control of elements required for expression;
  • a pharmaceutical composition containing a therapeutically or prophylactically effective amount of a vaccine vector ofthe invention (iv) a method for inducing an immune response against Helicobacter in a mammal (e.g., a human; alternatively, the method can be used in veterinary applications for treating or preventing Helicobacter infection of animals, e.g., cats or birds), which involves administering to the mammal an immunogenically effective amount of a vaccine vector ofthe invention to elicit an immune response, e.g., a protective or therapeutic immune response to Helicobacter; and (v) a method for preventing and/or treating a Helicobacter
  • the third aspect ofthe invention encompasses the use of a vaccine vector ofthe invention in the preparation of a medicament for preventing and/or treating Helicobacter infection.
  • a vaccine vector ofthe invention can express one or several polypeptides or derivatives ofthe invention, as well as at least one additional Helicobacter antigen such as a urease apoenzyme or a subunit, fragment, homolog, mutant, or derivative thereof.
  • a vaccine vector can include an additional polynucleotide molecules encoding, e.g., urease subunit A, B, or both, or a cytokine, placed under the control of elements required for expression in a mammalian cell.
  • composition ofthe invention can include several vaccine vectors, each of which being capable of expressing a polypeptide or derivative ofthe invention.
  • a composition can also contain a vaccine vector capable of expressing an additional Helicobacter antigen such as urease apoenzyme, a subunit, fragment, homolog, mutant, or derivative thereof, or a cytokine such as IL-2 or IL-12.
  • a vaccine vector ofthe invention can be administered by any conventional route in use in the vaccine field, for example, to a mucosal (e.g., ocular, intranasal, oral, gastric, pulmonary, intestinal, rectal, vaginal, or urinary tract) surface or via a parenteral (e.g., subcutaneous, intradermal, intramuscular, intravenous, or intraperitoneal) route.
  • a mucosal e.g., ocular, intranasal, oral, gastric, pulmonary, intestinal, rectal, vaginal, or urinary tract
  • parenteral e.g., subcutaneous, intradermal, intramuscular, intravenous, or intraperitoneal
  • the administration can be achieved in a single dose or repeated at intervals.
  • the appropriate dosage depends on various parameters that are understood by those skilled in the art, such as the nature ofthe vaccine vector itself, the route of administration, and the condition ofthe mammal to be vaccinated (e.g., the weight, age, and general health ofthe mammal).
  • Live vaccine vectors that can be used in the invention include viral vectors, such as adenoviruses and poxviruses, as well as bacterial vectors, e.g.,
  • adenovirus vector as well as a method for constructing an adenovirus vector capable of expressing a polynucleotide molecule ofthe invention, is described in U.S. Patent No. 4,920,209.
  • Poxvirus vectors that can be used in the invention include, e.g., vaccinia and canary pox viruses, which are described in U.S. Patent No. 4,722,848 and U.S. Patent No.
  • Poxvirus vectors capable of expressing a polynucleotide ofthe invention can be obtained by homologous recombination, as described in Kieny et al. (Nature 312:163, 1984) so that the polynucleotide ofthe invention is inserted in the viral genome under appropriate conditions for expression in mammalian cells.
  • the dose of viral vector vaccine for therapeutic or prophylactic use, can be from about lxlO 4 to about lxlO 11 , advantageously from about lxl 0 7 to about lxl 0 10 , or, preferably, from about lxl 0 7 to about lxl 0 9 plaque-forming units per kilogram.
  • viral vectors are administered parenterally, for example, in 3 doses that are 4 weeks apart. Those skilled in the art will recognize that it is preferable to avoid adding a chemical adjuvant to a composition containing a viral vector ofthe invention and thereby minimizing the immune response to the viral vector itself.
  • Non-toxicogenic Vibrio cholerae mutant strains that can be used in live oral vaccines are described by Mekalanos et al. (Nature 306:551, 1983) and in U.S. Patent No. 4,882,278 (strain in which a substantial amount ofthe coding sequence of each ofthe two ctxA alleles has been deleted so that no functional cholerae toxin is produced); WO 92/11354 (strain in which the irgA locus is inactivated by mutation; this mutation can be combined in a single strain with ctxA mutations); and WO 94/1533 (deletion mutant lacking functional ctxA and attRSl DNA sequences).
  • An effective vaccine dose of a V. cholerae strain capable of expressing a polypeptide or polypeptide derivative encoded by a polynucleotide molecule ofthe invention can contain, e.g., about lxlO 5 to about lxlO 9 , preferably about lxlO 6 to about lxl 0 8 viable bacteria in an appropriate volume for the selected route of administration.
  • Preferred routes of administration include all mucosal routes, but, most preferably, these vectors are administered intranasally or orally.
  • Attenuated Salmonella typhimurium strains genetically engineered for recombinant expression of heterologous antigens, and their use as oral vaccines, are described by Nakayama et al. (Bio/Technology 6:693, 1988) and in WO 92/11361.
  • Preferred routes of administration for these vectors include all mucosal routes. Most preferably, the vectors are administered intranasally or orally.
  • a polynucleotide ofthe invention can be inserted into the bacterial genome or it can remain in a free state, for example, carried on a plasmid.
  • An adjuvant can also be added to a composition containing a bacterial vector vaccine.
  • a number of adjuvants that can be used are known to those skilled in the art.
  • preferred adjuvants can be selected from the list provided below.
  • a composition of matter containing a polynucleotide ofthe invention, together with a diluent or carrier containing a therapeutically or prophylactically effective amount of a polynucleotide ofthe invention
  • a method for inducing an immune response against a polynucleotide ofthe invention containing a therapeutically or prophylactically effective amount of a polynucleotide ofthe invention
  • Helicobacter in a mammal, by administering to the mammal an immunogenically effective amount of a polynucleotide ofthe invention to elicit an immune response, e.g., a protective immune response to Helicobacter; and (iv) a method for preventing and or treating a Helicobacter (e.g., H. pylori, H. felis, H. mustelae, or H. heilmanii) infection, by administering a prophylactic or therapeutic amount of a polynucleotide ofthe invention to an individual in need of such treatment.
  • a Helicobacter e.g., H. pylori, H. felis, H. mustelae, or H. heilmanii
  • the fourth aspect ofthe invention encompasses the use of a polynucleotide ofthe invention in the preparation of a medicament for preventing and/or treating Helicobacter infection.
  • the fourth aspect ofthe invention preferably includes the use of a polynucleotide molecule placed under conditions for expression in a mammalian cell, e.g., in a plasmid that is unable to replicate in mammalian cells and to substantially integrate into a mammalian genome.
  • Polynucleotides (for example, DNA or RNA molecules) ofthe invention can also be administered as such to a mammal as a vaccine.
  • a DNA molecule ofthe invention When a DNA molecule ofthe invention is used, it can be in the form of a plasmid that is unable to replicate in a mammalian cell and unable to integrate into the mammalian genome.
  • a DNA molecule is placed under the control of a promoter suitable for expression in a mammalian cell.
  • the promoter can function ubiquitously or tissue-specifically. Examples of non-tissue specific promoters include the early Cytomegalovirus (CMV) promoter (U.S. Patent No. 4,168,062) and the Rous Sarcoma Virus promoter (Norton et al, Molec.
  • the desmin promoter (Li et al, Gene 78:243, 1989; Li et al, J. Biol. Chem. 266:6562, 1991; Li et al, J. Biol. Chem. 268: 10403, 1993) is tissue-specific and drives expression in muscle cells. More generally, useful promoters and vectors are described, e.g., in WO 94/21797 and by Hartikka et al (Human Gene Therapy 7:1205, 1996).
  • the polynucleotide ofthe invention can encode a precursor or a mature form of a polypeptide ofthe invention.
  • the precursor sequence can be homologous or heterologous.
  • a eucaryotic leader sequence can be used, such as the leader sequence ofthe tissue-type plasminogen factor (tPA).
  • a composition ofthe invention can contain one or several polynucleotides ofthe invention. It can also contain at least one additional polynucleotide encoding another Helicobacter antigen, such as urease subunit A, B, or both, or a fragment, derivative, mutant, or analog thereof.
  • DNA molecules ofthe invention and/or additional DNA molecules to be included in the same composition are carried in the same plasmid.
  • Standard methods can be used in the preparation of therapeutic polynucleotides ofthe invention.
  • a polynucleotide can be used in a naked form, free of any delivery vehicles, such as anionic liposomes, cationic lipids, microparticles, e.g., gold microparticles, precipitating agents, e.g., calcium phosphate, or any other fransfection-facilitating agent.
  • the polynucleotide can be simply diluted in a physiologically acceptable solution, such as sterile saline or sterile buffered saline, with or without a carrier.
  • a physiologically acceptable solution such as sterile saline or sterile buffered saline
  • the carrier preferably is isotonic, hypotonic, or weakly hypertonic, and has a relatively low ionic strength, such as provided by a sucrose solution, e.g., a solution containing 20% sucrose.
  • a polynucleotide can be associated with agents that assist in cellular uptake.
  • It can be, e.g., (i) complemented with a chemical agent that modifies cellular permeability, such as bupivacaine (see, e.g., WO 94/16737), (ii) encapsulated into liposomes, or (iii) associated with cationic lipids or silica, gold, or tungsten microparticles.
  • a chemical agent that modifies cellular permeability such as bupivacaine (see, e.g., WO 94/16737), (ii) encapsulated into liposomes, or (iii) associated with cationic lipids or silica, gold, or tungsten microparticles.
  • Anionic and neutral liposomes are well-known in the art (see, e.g.,
  • Liposomes A Practical Approach, RPC New Ed, IRL Press, 1990, for a detailed description of methods for making liposomes) and are useful for delivering a large range of products, including polynucleotides.
  • Cationic lipids can also be used for gene delivery.
  • Such lipids include, for example, LipofectinTM, which is also known as DOTMA (N-[l-(2,3- dioleyloxy)propyl]-N,N,N-trimethylammonium chloride), DOTAP (1,2- bis(oleyloxy)-3-(trimethylammonio)propane), DDAB (dimethyldioctadecylammonium bromide), DOGS (dioctadecylamidologlycyl spermine), and cholesterol derivatives.
  • DOTMA N-[l-(2,3- dioleyloxy)propyl]-N,N,N-trimethylammonium chloride
  • DOTAP 1,2- bis(oleyloxy)-3-(trimethylammonio)propane
  • DDAB dimethyldioctadecylammonium bromide
  • DOGS dioctadecylamid
  • Cationic lipids for gene delivery are preferably used in association with a neutral lipid such as DOPE (dioleyl phosphatidylethanolamine; WO 90/11092).
  • DOPE dioleyl phosphatidylethanolamine
  • Other transfection- facilitating compounds can be added to a formulation containing cationic liposomes. A number of them are described in, e.g., WO 93/18759, WO 93/19768, WO 94/25608, and WO 95/2397.
  • spermine derivatives useful for facilitating the transport of DNA through the nuclear membrane see, for example, WO 93/18759
  • membrane-permeabilizing compounds such as GALA, Gramicidine S, and cationic bile salts (see, for example, WO 93/19768).
  • Gold or tungsten microparticles can also be used for gene delivery, as described in WO 91/359, WO 93/17706, and by Tang et al (Nature 356:152,
  • the microparticle-coated polynucleotides can be injected via intradermal or intraepidermal routes using a needleless injection device ("gene gun"), such as those described in U.S. Patent No. 4,945,050, U.S. Patent No. 5,015,580, and WO 94/24263.
  • the amount of DNA to be used in a vaccine recipient depends, e.g., on the strength ofthe promoter used in the DNA construct, the immunogenicity of the expressed gene product, the condition ofthe mammal intended for administration (e.g., the weight, age, and general health ofthe mammal), the mode of administration, and the type of formulation. In general, a therapeutically or prophylactically effective dose from about 1 ⁇ g to about
  • 1 mg preferably, from about 10 ⁇ g to about 800 ⁇ g, and, more preferably, from about 25 ⁇ g to about 250 ⁇ g, can be administered to human adults.
  • the administration can be achieved in a single dose or repeated at intervals.
  • the route of administration can be any conventional route used in the vaccine field.
  • a polynucleotide ofthe invention can be administered via a mucosal surface, e.g., an ocular, intranasal, pulmonary, oral, intestinal, rectal, vaginal, or urinary tract surface, or via a parenteral route, e.g., by an intravenous, subcutaneous, intraperitoneal, intradermal, intraepidermal, or intramuscular route.
  • the choice of administration route will depend on, e.g., the formulation that is selected.
  • a polynucleotide formulated in association with bupivacaine is advantageously administered into muscle.
  • the formulation can be advantageously injected via intravenous, intranasal (for example, by aerosolization), intramuscular, intradermal, and subcutaneous routes.
  • a polynucleotide in a naked form can advantageously be administered via the intramuscular, intradermal, or subcutaneous routes.
  • such a composition can also contain an adjuvant.
  • a systemic adjuvant that does not require concomitant administration in order to exhibit an adjuvant effect is preferable.
  • nucleotide probe or primer having a sequence found in, or derived by degeneracy ofthe genetic code from, a sequence shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363).
  • probe refers to DNA (preferably single stranded) or RNA molecules (or modifications or combinations thereof) that hybridize under the stringent conditions, as defined above, to polynucleotide molecules having sequences homologous to any of those shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363), or to a complementary or anti-sense sequence of any of those shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363).
  • probes are significantly shorter than the full-length sequences shown in the sequence listing. For example, they can contain from about 5 to about 100, preferably from about 10 to about 80 nucleotides.
  • probes have sequences that are at least 75%, preferably at least 85%, more preferably 95% homologous to a portion of a sequence as shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363), or a sequence complementary to any of such sequences.
  • Probes can contain modified bases, such as inosine, methyl-5- deoxycytidine, deoxyuridine, dimethylamino-5-deoxyuridine, or diamino-2, 6- purine.
  • Sugar or phosphate residues can also be modified or substituted.
  • a deoxyribose residue can be replaced by a polyamide (Nielsen et al, Science 254: 1497, 1991) and phosphate residues can be replaced by ester groups such as diphosphate, alkyl, arylphosphonate, and phosphorothioate esters.
  • the 2'-hydroxyl group on ribonucleotides can be modified by addition of, e.g., alkyl groups.
  • Probes ofthe invention can be used in diagnostic tests, or as capture or detection probes. Such capture probes can be immobilized on solid supports, directly or indirectly, by covalent means or by passive adsorption.
  • a detection probe can be labeled by a detectable label, for example a label selected from radioactive isotopes; enzymes, such as peroxidase and alkaline phosphatase; enzymes that are able to hydrolyze a chromogenic, fluorogenic, or luminescent substrate; compounds that are chromogenic, fluorogenic, or luminescent; nucleotide base analogs; and biotin.
  • Probes ofthe invention can be used in any conventional hybridization method, such as in dot blot methods (Maniatis et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1982), Southern blot methods (Southern, J. Mol. Biol.
  • Primers used in the invention usually contain about 10 to 40 nucleotides and are used to initiate enzymatic polymerization of DNA in an amplification process (e.g., PCR), an elongation process, or a reverse transcription method. In a diagnostic method involving PCR, the primers can be labeled.
  • the invention also encompasses (i) a reagent containing a probe of the invention for detecting and/or identifying the presence of Helicobacter in a biological material; (ii) a method for detecting and/or identifying the presence of Helicobacter in a biological material, in which (a) a sample is recovered or derived from the biological material, (b) DNA or RNA is extracted from the material and denatured, and (c) the sample is exposed to a probe ofthe invention, for example, a capture probe, a detection probe, or both, under stringent hybridization conditions, so that hybridization is detected; and (iii) a method for detecting and/or identifying the presence of Helicobacter in a biological material, in which (a) a sample is recovered or derived from the biological material, (b) DNA is extracted therefrom, (c) the extracted DNA is contacted with at least one, or, preferably two, primers ofthe invention, and amplified by the polymerase chain reaction, and (d) an amplified DNA
  • a sixth aspect ofthe invention features a substantially purified polypeptide or polypeptide derivative having an amino acid sequence encoded by a polynucleotide ofthe invention.
  • a "substantially purified polypeptide” is defined as a polypeptide that is separated from the environment in which it naturally occurs and or a polypeptide that is free of most ofthe other polypeptides that are present in the environment in which it was synthesized.
  • the polypeptides ofthe invention can be purified from a natural source, such as a Helicobacter strain, or can be produced using recombinant methods.
  • Homologous polypeptides or polypeptide derivatives encoded by polynucleotides ofthe invention can be screened for specific antigenicity by testing cross-reactivity with an antiserum raised against a polypeptide having an amino acid sequence as shown in the sequence listing (even numbers, up to SEQ ID NO: 1364). Briefly, a monospecific hyperimmune antiserum can be raised against a purified reference polypeptide as such or as a fusion polypeptide, for example, an expression product of MBP, GST, or His-tag systems, or a synthetic peptide predicted to be antigenic. The homologous polypeptide or derivative that is screened for specific antigenicity can be produced as such or as a fusion polypeptide.
  • the material After being transferred to a filter, such as a nitrocellulose membrane, the material is incubated with the monospecific hyperimmune antiserum, which is diluted in a range of dilutions from about 1 : 50 to about 1 :5000, preferably from about
  • the product to be screened can be used as the coating antigen.
  • a purified preparation is preferred, but a whole cell extract can also be used. Briefly, about 100 ⁇ l of a preparation of about 10 ⁇ g protein/ml is distributed into wells of a 96-well ELISA plate. The plate is incubated for about 2 hours at 37°C, then overnight at 4°C. The plate is washed with phosphate buffer saline (PBS) contaimng 0.05% Tween 20 (PBS/Tween buffer) and the wells are saturated with 250 ⁇ l PBS containing 1% bovine serum albumin (BSA), to prevent non-specific antibody binding.
  • PBS phosphate buffer saline
  • BSA bovine serum albumin
  • the plate After 1 hour of incubation at 37 °C, the plate is washed with PBS/Tween buffer. The antiserum is serially diluted in PBS/Tween buffer containing 0.5% BSA, and 100 ⁇ l dilutions are added to each well. The plate is incubated for 90 minutes at 37°C, washed, and evaluated using standard methods. For example, a goat anti-rabbit peroxidase conjugate can be added to the wells when the specific antibodies used were raised in rabbits. Incubation is carried out for about 90 minutes at
  • a purified product is preferred, although a whole cell extract can be used.
  • a solution ofthe product at a concentration of about 100 ⁇ g/ml is serially diluted two-fold with 50 mM Tris-HCl (pH 7.5).
  • a filter such as a 0.45 ⁇ m nitrocellulose membrane, set in a 96-well dot blot apparatus (Biorad).
  • the buffer is removed by applying vacuum to the system.
  • Wells are washed by addition of 50 mM Tris-HCl (pH 7.5) and the membrane is air-dried.
  • the membrane is saturated in blocking buffer (50 mM Tris-HCl (pH 7.5), 0.15 M NaCl, 10 g/L skim milk) and incubated with an antiserum diluted from about 1:50 to about 1:5000, preferably about 1:500.
  • the reaction is detected using standard methods. For example, a goat anti-rabbit peroxidase conjugate can be added to the wells when rabbit antibodies are used. Incubation is carried out for about 90 minutes at 37 °C and the blot is washed. The reaction is developed with the appropriate substrate and stopped. The reaction is then measured visually by the appearance of a colored spot, e.g., by colorimetry.
  • a positive reaction is associated with detection of a colored spot for reactions carried out with a dilution of at least about 1 :50, preferably, of at least about 1 :500.
  • Therapeutic or prophylactic efficacy of a polypeptide or polypeptide derivative ofthe invention can be evaluated as described below.
  • a composition of matter containing a polypeptide ofthe invention together with a diluent or carrier containing a therapeutically or prophylactically effective amount of a polypeptide ofthe invention
  • a pharmaceutical composition containing a therapeutically or prophylactically effective amount of a polypeptide ofthe invention containing a therapeutically or prophylactically effective amount of a polypeptide ofthe invention
  • a method for inducing an immune response against Helicobacter in a mammal by administering to the mammal an immunogenically effective amount of a polypeptide ofthe invention to elicit an immune response, e.g., a protective immune response to Helicobacter
  • a method for preventing and/or treating a Helicobacter e.g., H. pylori, H. felis, H.
  • this aspect ofthe invention includes the use of a polypeptide of the invention in the preparation of a medicament for preventing and/or treating Helicobacter infection.
  • the immunogenic compositions ofthe invention can be administered by any conventional route in use in the vaccine field, for example, to a mucosal (e.g., ocular, intranasal, pulmonary, oral, gastric, intestinal, rectal, vaginal, or urinary tract) surface or via a parenteral (e.g., subcutaneous, intradermal, intramuscular, intravenous, or intraperitoneal) route.
  • a mucosal e.g., ocular, intranasal, pulmonary, oral, gastric, intestinal, rectal, vaginal, or urinary tract
  • a parenteral e.g., subcutaneous, intradermal, intramuscular, intravenous, or intraperitoneal
  • the choice ofthe administration route depends upon a number of parameters, such as the adjuvant used. For example, if a mucosal adjuvant is used, the intranasal or oral route will be preferred, and if a lipid formulation or an aluminum compound is used,
  • the subcutaneous or intramuscular route is most preferred.
  • the choice of administration route can also depend upon the nature ofthe vaccine agent.
  • a polypeptide ofthe invention fused to CTB or to LTB will be best administered to a mucosal surface.
  • a composition ofthe invention can contain one or several polypeptides or derivatives ofthe invention. It can also contain at least one additional Helicobacter antigen, such as the urease apoenzyme, or a subunit, fragment, homolog, mutant, or derivative thereof.
  • a polypeptide or polypeptide derivative can be formulated into or with liposomes, such as neutral or anionic liposomes, microspheres, ISCOMS, or virus-like particles (VLPs), to facilitate delivery and/or enhance the immune response.
  • liposomes such as neutral or anionic liposomes, microspheres, ISCOMS, or virus-like particles (VLPs)
  • VLPs virus-like particles
  • Adjuvants other than liposomes can also be used in the invention and are well known in the art (see, for example, the list provided below).
  • Administration can be achieved in a single dose or repeated as necessary at intervals that can be determined by one skilled in the art. For example, a priming dose can be followed by three booster doses at weekly or monthly intervals.
  • a vaccine antigen ofthe invention can be administered mucosally in an amount ranging from about 10 ⁇ g to about 500 mg, preferably from about 1 mg to about 200 mg.
  • the dose usually should not exceed about 1 mg, and is, preferably, about 100 ⁇ g.
  • the polynucleotides and polypeptides ofthe invention can be used sequentially as part of a multi-step immunization process.
  • a mammal can be initially primed with a vaccine vector ofthe invention, such as a pox virus, e.g., via a parenteral route, and then boosted twice with a polypeptide encoded by the vaccine vector, e.g., via the mucosal route.
  • liposomes associated with a polypeptide or polypeptide derivative ofthe invention can be used for priming, with boosting being carried out mucosally using a soluble polypeptide or polypeptide derivative ofthe invention, in combination with a mucosal adjuvant (e.g., LT).
  • a mucosal adjuvant e.g., LT
  • Polypeptides and polypeptide derivatives ofthe invention can also be used as diagnostic reagents for detecting the presence of an ⁇ -Helicobacter antibodies, e.g., in blood samples.
  • Such polypeptides can be about 5 to about 80, preferably, about 10 to about 50 amino acids in length and can be labeled or unlabeled, depending upon the diagnostic method. Diagnostic methods involving such a reagent are described below.
  • a polypeptide or polypeptide derivative is produced and can be purified using known methods.
  • the polypeptide or polypeptide derivative can be produced as a fusion protein containing a fused tail that facilitates purification.
  • the fusion product can be used to immunize a small mammal, e.g., a mouse or a rabbit, in order to raise monospecific antibodies against the polypeptide or polypeptide derivative.
  • the eighth aspect ofthe invention thus provides a monospecific antibody that binds to a polypeptide or polypeptide derivative of the invention.
  • monospecific antibody an antibody that is capable of reacting with a unique, naturally-occurring Helicobacter polypeptide.
  • An antibody ofthe invention can be polyclonal or monoclonal.
  • Monospecific antibodies can be recombinant, e.g., chimeric (e.g., consisting of a variable region of murine origin and a human constant region), humanized (e.g., a human immunoglobulin constant region and a variable region of animal, e.g., murine, origin), and/or single chain.
  • Both polyclonal and monospecific antibodies can also be in the form of immunoglobulin fragments, e.g., F(ab)'2 or Fab fragments.
  • the antibodies ofthe invention can be of any isotype, e.g., IgG or IgA, and polyclonal antibodies can be of a single isotype or can contain a mixture of isotypes.
  • the antibodies ofthe invention which can be raised to a polypeptide or polypeptide derivative ofthe invention, can be produced and identified using standard immunological assays, e.g., Western blot assays, dot blot assays, or ELISA (see, e.g., Coligan et al., Current Protocols in Immunology, John Wiley
  • the antibodies can be used in diagnostic methods to detect the presence of Helicobacter antigens in a sample, such as a biological sample.
  • the antibodies can also be used in affinity chromatography methods for purifying a polypeptide or polypeptide derivative ofthe invention. As is discussed further below, the antibodies can also be used in prophylactic and therapeutic passive immunization methods.
  • a ninth aspect ofthe invention provides (i) a reagent for detecting the presence of Helicobacter in a biological sample that contains an antibody, polypeptide, or polypeptide derivative ofthe invention; and (ii) a diagnostic method for detecting the presence of Helicobacter in a biological sample, by contacting the biological sample with an antibody, a polypeptide, or a polypeptide derivative ofthe invention, so that an immune complex is formed, and detecting the complex as an indication ofthe presence of Helicobacter in the sample or the organism from which the sample was derived.
  • the immune complex is formed between a component ofthe sample and the antibody, polypeptide, or polypeptide derivative, and that any unbound material can be removed prior to detecting the complex.
  • a polypeptide reagent can be used for detecting the presence of anti-Helicobacter antibodies in a sample, e.g., a blood sample, while an antibody ofthe invention can be used for screening a sample, such as a gastric extract or biopsy sample, for the presence of Helicobacter polypeptides.
  • the reagent e.g., the antibody, polypeptide, or polypeptide derivative ofthe invention
  • the reagent can be in a free state or can be immobilized on a solid support, such as, for example, on the interior surface of a tube or on the surface, or within pores, of a bead. Immobilization can be achieved using direct or indirect means.
  • Direct means include passive adsorption (i.e., non-covalent binding) or covalent binding between the support and the reagent.
  • indirect means is meant that an anti-reagent compound that interacts with the reagent is first attached to the solid support.
  • an anti-reagent compound that interacts with the reagent can serve as an anti-reagent, provided that it binds to an epitope that is not involved in recognition of antibodies in biological samples.
  • Indirect means can also employ a ligand-receptor system, for example, a molecule, such as a vitamin, can be grafted onto the polypeptide reagent and the corresponding receptor can be immobilized on the solid phase.
  • a process for purifying, from a biological sample, a polypeptide or polypeptide derivative ofthe invention which involves carrying out antibody-based affinity chromatography with the biological sample, wherein the antibody is a monospecific antibody ofthe invention.
  • the antibody can be polyclonal or monospecific, and preferably is ofthe IgG type.
  • Purified IgGs can be prepared from an antiserum using standard methods (see, e.g., Coligan et al, supra). Conventional chromatography supports, as well as standard methods for grafting antibodies, are described, for example, by Harlow et al.
  • a biological sample such as an H. pylori extract, preferably in a buffer solution
  • a chromatography material which is, preferably, equilibrated with the buffer used to dilute the biological sample, so that the polypeptide or polypeptide derivative ofthe invention (i.e., the antigen) is allowed to adsorb onto the material.
  • the chromatography material such as a gel or a resin coupled to an antibody ofthe invention, can be in batch form or in a column.
  • the unbound components are washed off and the antigen is eluted with an appropriate elution buffer, such as a glycine buffer, a buffer containing a chaotropic agent, e.g., guanidine HC1, or a buffer having high salt concentration (e.g., 3 M MgCl 2 ).
  • an appropriate elution buffer such as a glycine buffer, a buffer containing a chaotropic agent, e.g., guanidine HC1, or a buffer having high salt concentration (e.g., 3 M MgCl 2 ).
  • Eluted fractions are recovered and the presence ofthe antigen is detected, e.g., by measuring the absorbance at 280 nm.
  • An antibody ofthe invention can be screened for therapeutic efficacy as follows.
  • a composition of matter containing a monospecific antibody ofthe invention together with a diluent or carrier;
  • a pharmaceutical composition containing a therapeutically or prophylactically effective amount of a monospecific antibody ofthe invention and
  • a method for treating or preventing Helicobacter e.g., H. pylori, H felis, H. mustelae, or H. heilmanii
  • the eleventh aspect ofthe invention includes the use of a monospecific antibody ofthe invention in the preparation of a medicament for treating or preventing Helicobacter infection.
  • the monospecific antibody can be polyclonal or monoclonal, and is, preferably, predominantly ofthe IgA isotype.
  • the antibody is administered to a mucosal surface of a mammal, e.g., the gastric mucosa, e.g., orally or intragastrically, optionally, in the presence of a bicarbonate buffer.
  • systemic administration not requiring a bicarbonate buffer, can be carried out.
  • a monospecific antibody ofthe invention can be administered as a single active agent or as a mixture with at least one additional monospecific antibody specific for a different Helicobacter polypeptide.
  • the amount of antibody and the particular regimen used can be readily determined by one skilled in the art. For example, daily administration of about 100 to 1,000 mg of antibody over one week, or three doses per day of about 100 to 1,000 mg of antibody over two or three days, can be effective regimens for most purposes.
  • Therapeutic or prophylactic efficacy can be evaluated using standard methods in the art, e.g., by measuring induction of a mucosal immune response or induction of protective and/or therapeutic immunity, using, e.g., the H. felis mouse model and the procedures described by Lee et al. (Eur. J. Gasfroenterology & Hepatology 7:303, 1995) or Lee et al (J. Infect. Dis.
  • H felis strain of the model can be replaced with another Helicobacter strain.
  • the efficacy of polynucleotide molecules and polypeptides from H pylori is, preferably, evaluated in a mouse model using an H. pylori strain. Protection can be determined by comparing the degree of Helicobacter infection in the gastric tissue assessed by, for example, urease activity, bacterial counts, or gastritis, to that of a control group. Protection is shown when infection is reduced by comparison to the control group.
  • Such an evaluation can be made for polynucleotides, vaccine vectors, polypeptides, and polypeptide derivatives, as well as for antibodies ofthe invention.
  • an antibody ofthe invention can be administered to the gastric mucosa of mice previously challenged with an H pylori strain, as described, e.g., by Lee et al (supra). Then, after an appropriate period of time, the bacterial load ofthe mucosa can be estimated by assessing urease activity, as compared to a control. Reduced urease activity indicates that the antibody is therapeutically effective.
  • Adjuvants that can be used in any ofthe vaccine compositions described above are described as follows.
  • Adjuvants for parenteral administration include, for example, aluminum compounds, such as aluminum hydroxide, aluminum phosphate, and aluminum hydroxy phosphate. The antigen can be precipitated with, or adsorbed onto, the aluminum compound using standard methods. Other adjuvants, such as RIBI (ImmunoChem, Hamilton, MT), can also be used in parenteral administration.
  • Adjuvants that can be used for mucosal administration include, for example, bacterial toxins, e.g., the cholera toxin (CT), the E. coli heat-labile toxin (LT), the Clostridium difficile toxin A, the pertussis toxin (PT), and combinations, subunits, toxoids, or mutants thereof.
  • CT cholera toxin
  • LT E. coli heat-labile toxin
  • PT pertussis toxin
  • a purified preparation of native cholera toxin subunit B (CTB) can be used. Fragments, homologs, derivatives, and fusions to any of these toxins can also be used, provided that they retain adjuvant activity.
  • CTB native cholera toxin subunit B
  • Fragments, homologs, derivatives, and fusions to any of these toxins can also be used, provided that they retain adjuvant activity.
  • a mutant having reduced toxicity is used. Suitable
  • Additional LT mutants that can be used in the methods and compositions ofthe invention include, e.g., Ser-63-Lys, Ala-69-Gly, Glu-110- Asp, and Glu-112-Asp mutants.
  • Other adjuvants such as the bacterial monophosphoryl lipid A (MPLA) of, e.g., E. coli, Salmonella minnesota, Salmonella typhimurium, or Shigella flexneri; saponins, and polylactide glycolide (PLGA) microspheres, can also be used in mucosal administration.
  • MPLA bacterial monophosphoryl lipid A
  • PLGA polylactide glycolide
  • Adjuvants useful for both mucosal and parenteral administrations can also be used.
  • Any pharmaceutical composition ofthe invention, containing a polynucleotide, polypeptide, polypeptide derivative, or antibody ofthe invention can be manufactured using standard methods. It can be formulated with a pharmaceutically acceptable diluent or carrier, e.g., water or a saline solution, such as phosphate buffer saline, optionally, including a bicarbonate salt, such as sodium bicarbonate, e.g., 0.1 to 0.5 M. Bicarbonate can advantageously be added to compositions intended for oral or intragastric administration.
  • a diluent or carrier can be selected on the basis of the mode and route of administration, and standard pharmaceutical practice.
  • Suitable pharmaceutical carriers and diluents, as well as pharmaceutical necessities for their use in pharmaceutical formulations, are described in Remington's Pharmaceutical Sciences, a standard reference text in this field and in the USP/NF.
  • the invention also includes methods in which gastroduodenal infections, such as Helicobacter infection, are treated by oral administration of a Helicobacter polypeptide ofthe invention and a mucosal adjuvant, in combination with an antibiotic, an antisecretory agent, a bismuth salt, an antacid, sucralfate, or a combination thereof.
  • antibiotics including, e.g., macrolides, tetracyclines, ⁇ -lactams, aminoglycosides, quinolones, penicillins, and derivatives thereof
  • antibiotics include, e.g., amoxicillin, clarithromycin, tetracycline, metronidizole, erythromycin, cefuroxime, and erythromycin
  • antisecretory agents including, e.g., H 2 - receptor antagonists (e.g., cimetidine, ranitidine, famotidine, nizatidine, and roxatidine), proton pump inhibitors (e.g., omeprazole, lansoprazole, and pantoprazole), prostaglandin analogs (e.g., misoprostil and enprostil), and anticholinergic agents (e.g., pirenzepin
  • compositions for carrying out these methods i.e., compositions containing a Helicobacter antigen (or antigens) ofthe invention, an adjuvant, and one or more ofthe above-listed compounds, in a pharmaceutically acceptable carrier or diluent.
  • Amounts ofthe above-listed compounds used in the methods and compositions ofthe invention can readily be determined by one skilled in the art. In addition, one skilled in the art can readily design treatment/immunization schedules.
  • the non- vaccine components can be administered on days 1-14, and the vaccine antigen + adjuvant can be administered on days 7, 14, 21, and 28.
  • Methods and pharmaceutical compositions ofthe invention can be used to treat or to prevent Helicobacter infections and, accordingly, gastroduodenal diseases associated with these infections, including acute, chronic, and afrophic gastritis, and peptic ulcer diseases, e.g., gastric and duodenal ulcers.
  • Example 1 describes identification of genes, such as genes that encode the polypeptides ofthe invention, in the Helicobacter genome, as well as identification of signal sequences, and primer design for amplification of genes lacking signal sequences.
  • Example 2 describes cloning of DNA molecules encoding polypeptides ofthe invention into a vector that provides a histidine tag, and production and purification ofthe resulting his-tagged fusion proteins.
  • Example 3 describes methods for cloning DNA encoding the polypeptides of the invention so that they can be produced without his-tags
  • Example 4 describes methods for purifying recombinantly produced polypeptides ofthe invention.
  • EXAMPLE 1 Identification of genes in the H. pylori genome, identification of signal sequences, and primer design for amplification of genes lacking signal sequences l.A. Creating H. pylori genomic databases
  • the H. pylori genome was provided as a text file containing a single contiguous string of nucleotides that had been determined to be 1.76 Megabases in length.
  • the complete genome was split into 17 separate files using the program SPLIT (Creativity in Action), giving rise to 16 contigs, each containing 100,000 nucleotides, and a 17 th contig containing the remaining 76,000 nucleotides.
  • a header was added to each ofthe 17 files using the format: >hpg0.txt (representing contig 1), .hpgl.txt (representing contig 2), etc.
  • the resulting 17 files, named hpgO through hpg 16 were then copied together to form one file that represented the plus strand ofthe complete H.
  • a negative strand database ofthe H. pylori genome was created similarly by first creating a reverse complement ofthe positive strand using the program SeqPup (D.G. Gilbert, Indiana University Biology Department) and then performing the same procedure as described above for the plus strand. This database was given the designation "N.”
  • ORFs open reading frames
  • FASTA Pearson et al, Proc. Natl. Acad. Sci. USA 85:2444-2448, 1988.
  • FASTA was used for searching either a DNA sequence against either of the gene databases (" ⁇ " and/or "N"), or a peptide sequence against the ORF library ("O").
  • TFASTX was used to search a peptide sequence against all possible reading frames of a DNA database (" ⁇ " and/or "N” libraries). Potential frameshifts also being resolved, FASTX was used for searching the translated reading frames of a DNA sequence against either a DNA database, or a peptide sequence against the protein database.
  • the FASTA searches against the constructed DNA databases identified exact nucleotide coordinates on one or more ofthe isolated contigs, and therefore the location ofthe target DNA. Once the exact location ofthe target sequence was known, the contig identified to carry the gene was exported into the software package MapDraw (DNAStar, Inc.) and the gene was isolated. Gene sequences with flanking DNA was then excised and copied into the EditSeq. Software package (DNAStar, Inc.) for further analysis. l.D. Identification of signal sequences
  • the deduced protein encoded by a target gene sequence is analyzed using the PROTEAN software package (DNAStar, Inc.). This analysis predicts those areas ofthe protein that are hydrophobic by using the Kyte-Doolittle algorithm, and identifies any potential polar residues preceding the hydrophobic core region, which is typical for many signal sequences. For confirmation, the target protein is then searched against a PROSITE database (DNAStar, Inc.) consisting of motifs and signatures. Characteristic of many signal sequences and hydrophobic regions in general, is the identification of predicted prokaryotic lipid attachment sites. Where confirmation between the two approaches is apparent at the N-terminus of any protein, putative cleavage sites are sought.
  • this includes the presence of either an Alanine (A), Serine (S), or Glycine (G) residue immediately after the core hydrophobic region.
  • A Alanine
  • S Serine
  • G Glycine
  • C Cysteine
  • the gene sequence that specifies the signal sequence is omitted.
  • the 5'-end ofthe gene- specific portion ofthe N-terminal primer is designed to start at the first codon beyond the cleavage site. In the case of lipoproteins, the 5'-end ofthe N- terminal primer begins at the second codon, immediately after the modifiable residue at position +1 post-cleavage.
  • EXAMPLE 2 Preparation of isolated DNA encoding the polypeptides of the invention, and production of these polypeptides as histidine-tagged fusion proteins
  • H. pylori strain ORV2001 stored in LB medium containing 50% glycerol at -70 °C, is grown on Colombia agar containing 7% sheep blood for 48 hours under microaerophilic conditions (8-10% C0 2 , 5-7% 0 2 , 85-87% N 2 ). Cells are harvested, washed with phosphate buffer saline (PBS) (pH 7.2), and
  • DNA is then extracted from the cells using the Rapid Prep Genomic DNA Isolation kit (Pharmacia Biotech).
  • PCR amplification DNA molecules encoding the polypeptides ofthe invention are amplified from genomic DNA, as can be prepared as is described above, by the Polymerase Chain Reaction (PCR) using primers that can readily be designed by one skilled in the art. Specific examples of primers that can be used in the invention are shown in Table 1. As specific examples, to amplify genes encoding GHPO 147, GHPO 615, GHPO 961, GHPO 1282, GHPO 296, and
  • GHPO 840 the following primers can be used:
  • GHPO 147 5'-CTGAATTCGAATGAAAAGAATTTTAGTCTCT-3' (SEQ ID NO: 1365), and 5'-CCGCTCGAGTTAAAACTCATAATTCAAAT-3' (SEQ ID NO: 1366).
  • GHPO 615 5 * -CGCGGATCCGAAGACATGTGCAACCGATG-3' (SEQ ID NO: 1367), and
  • GHPO 1282 5'-GCGGATCCTTTTCTTCAATGTTTG-3" (SEQ ID NO:1371), and
  • GHPO 296 5'-CCGAATTCGGTTATAAAGCCCCT-3' (SEQ ID NO: 1373), and
  • GHPO 840 5'-CGCGGATCCGAGGAAATAGCATGTTAATAACC-3' (SEQ ID NO: 1375), and
  • the N-terminal and C-terminal primers for each clone can each include a 5' clamp and a restriction enzyme recognition sequence for cloning purposes (for example, Bam ⁇ l (GGATCC) and Xhol (CTCGAG) recognition sequences).
  • GGATCC Bam ⁇ l
  • CTCGAG Xhol
  • Amplification of gene-specific DNA is carried out using Vent DNA Polymerase (New England Biolabs) or Taq DNA polymerase (Appligene), according to the manufacturer's instructions.
  • the reaction mixture which is brought to a final volume of 100 ⁇ l with distilled water, is as follows: dNTPs mix 200 ⁇ M lOx ThermoPol buffer 10 ⁇ l primers 300 nM each
  • Appropriate amplification reaction conditions can readily be determined by one skilled in the art. For example, the following conditions can be used for amplification of DNA encoding GHPO 615 using the primers set forth above: initial denaturation at 94°C for 5 minutes, 25 cycles of denaturation at 97°C for 30 seconds, hybridization at 55°C for 1 minute, and elongation at 72°C for 2 minutes, using Vent DNA polymerase.
  • the following conditions can be used: initial denaturation at 94°C for 5 minutes, 25 cycles of denaturation at 94°C for 30 seconds, hybridization at 45°C for 30 seconds, and elongation at 72°C for 30 seconds, followed by a final elongation at 72°C for 7 minutes, using Vent DNA polymerase.
  • the following conditions can be used for amplification of DNA encoding GHPO 840 using the primers set forth above: 25 cycles of denaturation at 97°C for 30 seconds, hybridization at 55°C for 1 minute, and elongation at 72°C for 2 minutes using Vent DNA polymerase.
  • Table 1 sets forth conditions for using the primers listed therein.
  • a single PCR product is thus amplified and then is digested at 37 °C for 2 hours with BamHI and Xhol together in a 20 ⁇ l reaction volume.
  • the digested product is ligated to similarly cleaved pET28a (Novagen) that is dephosphorylated prior to the ligation by treatment with Calf Intestinal Alkaline Phosphatase (CIP).
  • CIP Calf Intestinal Alkaline Phosphatase
  • the ligation reaction (20 ⁇ l) is carried out at 14 °C overnight and then is used to transform 100 ⁇ l fresh E. coli XL 1 -blue competent cells (Novagen). The cells are incubated on ice for 2 hours, heat-shocked at 42 °C for 30 seconds, and returned to ice for 90 seconds. The samples are then added to
  • PCR is performed with the gene-specific primers under the conditions set forth above and transformant DNA is confirmed to contain the desired insert. If PCR-positive, one ofthe five plasmid DNA samples (500 ng) extracted from the E. coli XL 1 -blue cells is used to transform competent BL21 ( ⁇ DE3) E. coli competent cells (Novagen; as described previously). Transformants (10) are picked, plated onto selective kanamycin (50 ⁇ g/ml)- containing LB agar plates, and stored as a research stock in LB containing
  • One ml of frozen glycerol stock prepared as described in 2.C. is used to inoculate 50 ml of LB medium containing 25 ⁇ g/ml kanamycin in a 250 ml
  • the flask is incubated at 37°C for 2 hours or until the absorbance at 600 nm (OD 600 ) reaches 0.4-1.0.
  • the culture is stopped from growing by placing the flask at 4°C overnight. The following day, 10 ml ofthe overnight culture is used to inoculate 240 ml LB medium containing kanamycin (25 ⁇ g/ml), with the initial OD 600 being about 0.02-0.04.
  • Four flasks are inoculated for each ORF.
  • the cells are grown to an OD 600 of 1.0 (about 2 hours at 37°C), a 1 ml sample is harvested by centrifugation, and the sample is analyzed by SDS-PAGE to detect any leaky expression.
  • the remaining culture is induced with 1 mM IPTG and the induced cultures are grown for an additional 2 hours at 37°C.
  • the final OD 600 reading is taken and the cells are harvested by centrifugation at 5,000 x g for 15 minutes at 4°C. The supernatant is discarded and the pellets are resuspended in 50 mM Tris-HCl (pH 8.0), 2 mM EDTA. Two hundred and fifty ml of buffer are used for each 1 L of culture and the cells are recovered by centrifugation at 12,000 x g for 20 minutes. The supernatant is discarded and the pellets are stored at -45°C.
  • Pellets obtained using the methods described in 2.D. are thawed and resuspended in 95 ml of 50 mM Tris-HCl (pH 8.0). Pefabloc and lysozyme are added to final concentrations of 100 ⁇ M and 100 ⁇ g/ml, respectively. The mixture is homogenized with magnetic stirring at 5°C for 30 minutes.
  • Benzonase (Merck) is added to a final concentration of 1 U/ml, in the presence of 10 mM MgCl2, to ensure total digestion ofthe DNA.
  • the suspension is sonicated (Branson Sonifier 450) for 3 cycles of 2 minutes each at maximum output.
  • the homogenate is centrifuged at 19,000 x g for 15 minutes and both the supernatant and the pellet are analyzed by SDS-PAGE to detect the cellular location ofthe target protein in the soluble or insoluble fractions, as is described further below.
  • the elution profile is monitored by measuring the absorbance ofthe fractions at 280 nm. Fractions corresponding to the protein peak are pooled, dialyzed against PBS containing 0.5 M arginine, filtered through a 0.22 ⁇ m membrane, and stored at -45°C.
  • the target protein is expressed in the insoluble fraction (pellets obtained using the methods described in 2.E.)
  • purification is conducted under denaturing conditions. NaCl, imidazole, and urea are added to the resuspended pellet to final concentrations of 50 mM Tris-HCl (pH 8.0), 0.5 M NaCl, 10 mM imidazole, and 6 M urea (buffer C). After complete solubilization, the mixture is filtered through a 0.45 ⁇ m membrane and loaded onto an IMAC column.
  • the purification procedures on the IMAC column are the same as are described in 2.E.I., except that 6 M urea is included in all ofthe buffers used and 10 column volumes of buffer C are used to wash the column after protein loading, instead of 50 column volumes.
  • the protein fractions eluted from the IMAC column with buffer D (buffer C containing 500 mM imidazole) are pooled.
  • Arginine is added to the solution to a final concentration of 0.5 M, and the mixture is dialyzed against PBS containing 0.5 M arginine and various concentrations of urea (4 M, 3 M, 2 M, 1 M, and 0.5 M) to progressively decrease the concentration of urea.
  • the final dialysate is filtered through a 0.22 ⁇ m membrane and stored at -45 °C.
  • a first alternative involves the use of a mild denaturant, N-octyl glucoside (NOG). Briefly, a pellet obtained as is described in 2.E. is homogenized in a solution of 5 mM imidazole, 500 mM sodium chloride, and
  • the pellet is dissolved in 8 M urea, 50 mM Tris (pH 8.0).
  • the urea-solubilized protein is diluted with an equal volume of 2 M arginine, 50 mM Tris (pH 8.0), and is dialyzed against 1 M arginine for 24-48 hours to remove the urea.
  • the final dialysate is filtered through a 0.22 ⁇ m membrane and stored at -45°C.
  • a second alternative involves the use of a strong denaturant, such as guanidine hydrochloride. Briefly, a pellet obtained as is described in 2.E.
  • ⁇ -mercaptoethanol is added to the eluted protein to a final concentration of 1 mM, and then the eluted protein is passed through a Sephadex G-25 column equilibrated in 0.1 M acetic acid. Protein eluted from the column is slowly added to 4 volumes of 50 mM phosphate buffer (pH 7.0), and the protein remains in solution.
  • mice Groups of 10 OF 1 mice (IFFA Credo) are immunized rectally with 25 ⁇ g ofthe purified recombinant protein, admixed with 1 ⁇ g of cholera toxin (Berna) in physiological buffer. Mice are immunized on days 0, 7, 14, and 21. Fourteen days after the last immunization, the mice are challenged with H. pylori strain ORV2001, grown in liquid media (the cells are grown on agar plates, as described in 2.A., and, after harvest, are resuspended in Brucella broth; the flasks are then incubated overnight at 37 °C). Fourteen days after challenge, the mice are sacrificed and their stomachs are removed. The amount of H. pylori is determined by measuring the urease activity in the stomach and by culture.
  • 2.G. Production of monospecific polyclonal antibodies 2.G.I. Hyperimmune rabbit antiserum New Zealand rabbits are injected both subcutaneously and intramuscularly with 100 ⁇ g of a purified fusion polypeptide, as obtained using the methods described in 2.E.I. or 2.E.2., in the presence of Freund's complete adjuvant and in a total volume of approximately 2 ml. Twenty one and 42 days after the initial injection, booster doses, which are identical to the priming doses, except that Freund's incomplete adjuvant is used, are administered in the same way. Fifteen days after the last injection, animal serum is recovered, decomplemented, and filtered through a 0.45 ⁇ m membrane.
  • mice are injected subcutaneously with 10-50 ⁇ g of a purified fusion polypeptide as obtained using the methods described in 2.E.1. or 2.E.2., in the presence of Freund's complete adjuvant and in a volume of approximately 200 ⁇ l. Seven and 14 days after the initial injection, booster doses, which are identical to the priming doses, except that Freund's incomplete adjuvant is used, are administered in the same way. Twenty one and 28 days after the initial infection, mice receive 50 ⁇ g ofthe antigen alone infraperitoneally.
  • mice are also injected infraperitoneally with sarcoma 180/TG cells CM26684 (Lennette et al, Diagnostic Procedures for Viral, Rickettsial, and Chlamydial Infections, 5th Ed. Washington DC, American Public Health Association, 1979). Ascites fluid is collected 10-13 days after the last injection.
  • the N-terminal primers are designed to include the ribosome binding site ofthe target gene, the ATG start site, and any signal sequence and cleavage site.
  • the N-terminal primers can include a 5' clamp and a restriction endonuclease recognition site, such as that for BamHI (GGATCC), which facilitates subsequent cloning.
  • the C-terminal primers can include a restriction endonuclease recognition site, such as that for Xhol (CTCGAG), which can be used in subsequent cloning, and a TAA stop codon.
  • Amplification of genes encoding the polypeptides ofthe invention can be carried out using Thermalase DNA Polymerase under the conditions described above in Example 2.
  • Vent DNA polymerase New England Biolabs
  • Pwo DNA polymerase Boehringer Mannheim
  • Taq Taq
  • DNA polymerase (Appligene) can be used, according to instructions provided by the manufacturers.
  • a single PCR product for each clone is amplified and cloned into appropriately cleaved pET 24 (e.g., BamHl-Xh ⁇ cleaved pET 24), resulting in the construction of a franscriptional fusion that permits expression of the proteins without His-tags.
  • the expressed products can be purified as denatured proteins that are refolded by dialysis into 1 M arginine.
  • Cloning into pET 24 allows transcription ofthe genes from the T7 promoter, which is supplied by the vector, but relies upon binding ofthe RNA- specific DNA polymerase to the intrinsic ribosome binding sites ofthe genes, and thereby expression ofthe complete ORF.
  • the amplification, digestion, and cloning protocols that can be used in this method are as described above for constructing translational fusions.
  • EXAMPLE 4 Purification of the polypeptides of the invention by immunoaffinity
  • An immune serum as prepared as is described in section 2.G., is applied to a protein A Sepharose Fast Flow column (Pharmacia) equilibrated in 100 mM Tris-HCl (pH 8.0). The resin is washed by applying 10 column volumes of 100 mM Tris-HCl and 10 volumes of 10 mM Tris-HCl (pH 8.0) to the column.
  • IgG antibodies are eluted with 0.1 M glycine buffer (pH 3.0) and are collected as 5 ml fractions to each of which is added 0.25 ml 1 M Tris-HCl (pH 8.0). The optical density ofthe eluate is measured at 280 nm and fractions containing the IgG antibodies are pooled, dialyzed against 50 mM Tris-HCl
  • CNBr-activated Sepharose 4B gel (1 g of dried gel provides for approximately 3.5 ml of hydrated gel; gel capacity is from 5 to 10 mg coupled IgG/ml of gel) manufactured by Pharmacia (17-0430- 01) is suspended in 1 mM HCl buffer and washed with a buchner by adding small quantities of 1 mM HCl buffer. The total volume of buffer is 200 ml per gram of gel. Purified IgG antibodies are dialyzed for 4 hours at 20 ⁇ 5°C against
  • IgG antibodies are mixed with the gel overnight at 5 ⁇ 3°C.
  • the gel is packed into a chromatography column and is washed with 2 column volumes of
  • the gel is then transferred to a tube, mixed with 100 mM ethanolamine (pH 7.5) for 4 hours at room temperature, and washed twice with 2 column volumes of PBS. The gel is then stored in 1/10,000 PBS/merthiolate. The amount of IgG antibodies coupled to the gel is determined by measuring the optical density (OD) at 280 nm ofthe IgG solution and the direct eluate, plus washings.
  • OD optical density
  • the adsorbed gel is washed with 2 to 6 volumes of 10 mM sodium phosphate buffer (pH 6.8) and the antigen is eluted with 100 mM glycine buffer (pH 2.5).
  • the eluate is recovered in 3 ml fractions, to each of which is added 150 ⁇ l of 1 M sodium phosphate buffer (pH 8.0). Absorption is measured at 280 nm for each fraction; those fractions containing the antigen are pooled and stored at -20 °C.
  • MOLECULE TYPE Genomic DNA
  • FEATURE
  • MOLECULE TYPE protein
  • FRAGMENT TYPE internal
  • MOLECULE TYPE Genomic DNA
  • FEATURE
  • GAG CCT AAA AAA AGT CAT ATT TAT TTT GGG GCT ATG GTG GGT TTA GCT 152 Glu Pro Lys Lys Ser His lie Tyr Phe Gly Ala Met Val Gly Leu Ala 20 25 30
  • AGC ACG ATC GAT CGC CAC CAC CGC ATA GAG CTT GGG GCT AAA ATC CCT 536 Ser Thr lie Asp Arg His His Arg lie Glu Leu Gly Ala Lys lie Pro 150 155 160
  • MOLECULE TYPE protein
  • FRAGMENT TYPE internal
  • MOLECULE TYPE Genomic DNA
  • FEATURE
  • MOLECULE TYPE protein
  • FRAGMENT TYPE internal
  • MOLECULE TYPE Genomic DNA
  • FEATURE
  • MOLECULE TYPE protein
  • FRAGMENT TYPE internal
  • MOLECULE TYPE Genomic DNA
  • FEATURE
  • MOLECULE TYPE protein
  • FRAGMENT TYPE internal
  • MOLECULE TYPE Genomic RNA
  • FEATURE
  • MOLECULE TYPE protein
  • FRAGMENT TYPE internal
  • MOLECULE TYPE Genomic DNA
  • FEATURE
  • AAA AAG ATT GAT ATA GCT AGG GGG ATT TAT CCT ACA GAG ACT TTT GTA 255 Lys Lys He Asp He Ala Arg Gly He Tyr Pro Thr Glu Thr Phe Val 50 55 60
  • GGC AAG GTG ATT GAT TCT ATA GCG TGC GGG AAC GCT AGG GCC AAT AAA 543 Gly Lys Val He Asp Ser He Ala Cys Gly Asn Ala Arg Ala Asn Lys 145 150 155
  • MOLECULE TYPE protein
  • FRAGMENT TYPE internal
  • MOLECULE TYPE Genomic DNA
  • FEATURE
  • MOLECULE TYPE protein
  • FRAGMENT TYPE internal
  • MOLECULE TYPE Genomic DNA
  • FEATURE (A) NAME/KEY: Coding Sequence
  • GGC ATG GTG GGA TCT ATT TTC TAT GAT GGC ACG AAG AAG TTT GAA GAC 344 Gly Met Val Gly Ser He Phe Tyr Asp Gly Thr Lys Lys Phe Glu Asp 85 90 95
  • AAA CCT CAT CGT TTC CTC ATA GAA GGC TTT TAT TAC CTT TCG
  • MOLECULE TYPE protein
  • FRAGMENT TYPE internal

Landscapes

  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Molecular Biology (AREA)
  • Genetics & Genomics (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • General Chemical & Material Sciences (AREA)
  • Public Health (AREA)
  • Veterinary Medicine (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Animal Behavior & Ethology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Engineering & Computer Science (AREA)
  • Oncology (AREA)
  • Communicable Diseases (AREA)
  • Immunology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Peptides Or Proteins (AREA)
  • Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Abstract

The invention provides Helicobacter polypeptides that can be used in vaccination methods for preventing or treating Helicobacter infection, and polynucleotides that encode these polypeptides. The invention also provides diagnostic methods employing these polypeptides.

Description

IDENTIFICATION OF POLYNUCLEOTIDES ENCODING NOVEL HELICOBACTER POLYPEPTIDES IN THE HELICOBACTER GENOME
The invention relates to Helicobacter antigens and corresponding polynucleotide molecules that can be used in methods to prevent or treat Helicobacter infection in mammals, such as humans.
Background ofthe Invention
Helicobacter is a genus of spiral, gram-negative bacteria that colonize the gastrointestinal tracts of mammals. Several species colonize the stomach, most notably H. pylori, H. heilmanii, H. felis, and H. mustelae. Although H. pylori is the species most commonly associated with human infection, H. heilmanii and H. felis have also been isolated from humans, but at lower frequencies than H. pylori. Helicobacter infects over 50% of adult populations in developed countries and nearly 100% in developing countries and some Pacific rim countries, making it one ofthe most prevalent infections worldwide. Helicobacter is routinely recovered from gastric biopsies of humans with histological evidence of gastritis and peptic ulceration. Indeed, H. pylori is now recognized as an important pathogen of humans, in that the chronic gastritis it causes is a risk factor for the development of peptic ulcer diseases and gastric carcinoma. It is thus highly desirable to develop safe and effective vaccines for preventing and treating Helicobacter infection.
A number of Helicobacter antigens have been characterized or isolated. These include urease, which is composed of two structural subunits of approximately 30 and 67 kDa (Hu et al, Infect. Immun. 58:992, 1990; Dunn et al, J. Biol. Chem. 265:9464, 1990; Evans et al, Microbial Pathogenesis 10:15, 1991; Labigne et al, J. Bact, 173: 1920, 1991); the 87 kDa vacuolar cytotoxin (VacA) (Cover et al, J. Biol. Chem. 267:10570, 1992; Phadnis et al, Infect. Immun. 62: 1557, 1994; WO 93/18150); a 128 kDa immunodominant antigen associated with the cytotoxin (CagA, also called TagA; WO 93/18150; U.S. Patent No. 5,403,924); 13 and 58 kDa heat shock proteins HspA and HspB
(Suerbaum et al, Mol. Microbiol. 14:959, 1994; WO 93/18150); a 54 kDa catalase (Hazell et al, J. Gen. Microbiol.137: 57, 1991); a 15 kDa histidine-rich protein (Hpn) (Gilbert et al, Infect. Immun. 63:2682, 1995); a 20 kDa membrane-associated lipoprotein (Kostrcynska et al, J. Bact. 176:5938, 1994); a 30 kDa outer membrane protein (Bόlin et al, J. Clin. Microbiol. 33:381,
1995); a lactoferrin receptor (FR 2,724,936); and several porins, designated HopA, HopB, HopC, HopD, and HopE, which have molecular weights of 48-67 kDa (Exner et al, Infect. Immun. 63:1567, 1995; Doig et al, J. Bact. 177:5447, 1995). Some of these proteins have been proposed as potential vaccine antigens. In particular, urease is believed to be a vaccine candidate
(WO 94/9823; WO 95/22987; WO 95/3824; Michetti et al, Gastroenterology 107: 1002, 1994). Nevertheless, it is thought that several antigens may ultimately be necessary in a vaccine.
Summary ofthe Invention
The invention provides polynucleotide molecules that encode Helicobacter polypeptides, designated GHPO 35 (SEQ ID NO:2), GHPO 55 (SEQ ID NO:4), GHPO 78 (SEQ ID NO:6), GHPO 89 (SEQ ID NO: 8), GHPO 129 (SEQ ID NO: 10), GHPO 541 (SEQ ID NO: 12), GHPO 607 (SEQ ID NO:14), GHPO 635 (SEQ ID NO:16), GHPO 701 (SEQ ID NO:18), GHPO
712 (SEQ ID NO:20), GHPO 761 (SEQ ID NO:22), GHPO 838 (SEQ ID NO:24), GHPO 1034 (SEQ ID NO:26), GHPO 1085 (SEQ ID NO:28), GHPO 1213 (SEQ ID NO:30), GHPO 1255 (SEQ ID NO:32), GHPO 1308 (SEQ ID NO:34), GHPO 1389 (SEQ ID NO:36), GHPO 1706 (SEQ ID NO:38), GHPO 234 (SEQ ID NO:40), GHPO 314 (SEQ ID NO:42), GHPO 510 (SEQ ID NO:44), GHPO 603 (SEQ ID NO:46), GHPO 937 (SEQ ID NO:48), GHPO 1027 (SEQ ID NO:50), GHPO 1099 (SEQ ID NO:52), GHPO 1151 (SEQ ID
NO:54), GHPO 1275 (SEQ ID NO:56), GHPO 1365 (SEQ ID NO:58), GHPO 1578 (SEQ ID NO:60), GHPO 22 (SEQ ID NO:62), GHPO 58 (SEQ ID NO:64), GHPO 200 (SEQ ID NO:66), GHPO 558 (SEQ ID NO:68), GHPO 563 (SEQ ID NO:70), GHPO 695 (SEQ ID NO:72), GHPO 699 (SEQ ID NO:74), GHPO 702 (SEQ ID NO:76), GHPO 709 (SEQ ID NO:78), GHPO
741 (SEQ ID NO:80), GHPO 762 (SEQ ID NO:82), GHPO 827 (SEQ ID NO:84), GHPO 852 (SEQ ID NO:86), GHPO 1013 (SEQ ID NO:88), GHPO 1020 (SEQ ID NO:90), GHPO 1031 (SEQ ID NO:92), GHPO 1052 (SEQ ID NO:94), GHPO 1127 (SEQ ID NO:96), GHPO 1149 (SEQ ID NO:98), GHPO 1176 (SEQ ID NO: 100), GHPO 1250 (SEQ ID NO: 102), GHPO 1312 (SEQ ID
NO: 104), GHPO 1358 (SEQ ID NO: 106), GHPO 1490 (SEQ ID NO: 108), GHPO 1559 (SEQ ID NO: 110), GHPO 1651 (SEQ ID NO: 112), GHPO 1726 (SEQ ID NO:114), GHPO 1780 (SEQ ID NO: 116), GHPO 895 (SEQ ID NO: 118), GHPO 1447 (SEQ ID NO: 120), GHPO 28 (SEQ ID NO: 122), GHPO 86 (SEQ ID NO: 124), GHPO 155 (SEQ ID NO: 126), GHPO 157 (SEQ ID
NO: 128), GHPO 237 (SEQ ID NO: 130), GHPO 290 (SEQ ID NO: 132), GHPO 293 (SEQ ID NO:134), GHPO 335 (SEQ ID NO:136), GHPO 374 (SEQ ID NO: 138), GHPO 442 (SEQ ID NO:140), GHPO 480 (SEQ ID NO: 142), GHPO 523 (SEQ ID NO: 144), GHPO 610 (SEQ ID NO:146), GHPO 675 (SEQ ID NO: 148), GHPO 690 (SEQ ID NO: 150), GHPO 829 (SEQ ID NO: 152), GHPO
850 (SEQ ID NO: 154), GHPO 876 (SEQ ID NO: 156), GHPO 984 (SEQ ID NO:158), GHPO 989 (SEQ ID NO: 160), GHPO 1111 (SEQ ID NO: 162), GHPO 1145 (SEQ ID NO: 164), GHPO 1256 (SEQ ID NO: 166), GHPO 1264 (SEQ ID NO: 168), GHPO 1316 (SEQ ID NO: 170), GHPO 1368 (SEQ ID NO: 172), GHPO 1442 (SEQ ID NO: 174), GHPO 1506 (SEQ ID NO: 176), GHPO 1543 (SEQ ID NO: 178), GHPO 1574 (SEQ ID NO: 180), GHPO 1627 (SEQ ID NO: 182), GHPO 1657 (SEQ ID NO: 184), GHPO 1664 (SEQ ID
NO: 186), GHPO 1694 (SEQ ID NO: 188), GHPO 1704 (SEQ ID NO: 190), GHPO 1763 (SEQ ID NO: 192), GHPO 616 (SEQ ID NO: 194), GHPO 76 (SEQ ID NO: 196), GHPO 109 (SEQ ID NO: 198), GHPO 163 (SEQ ID NO:200), GHPO 169 (SEQ ID NO:202), GHPO 208 (SEQ ID NO:204), GHPO 219 (SEQ ID NO:206), GHPO 445 (SEQ ID NO:208), GHPO 479 (SEQ ID
NO:210), GHPO 525 (SEQ ID NO:212), GHPO 535 (SEQ ID NO:214), GHPO 731 (SEQ ID NO:216), GHPO 836 (SEQ ID NO:218), GHPO 879 (SEQ ID NO:220), GHPO 881 (SEQ ID NO:222), GHPO 886 (SEQ ID NO:224), GHPO 893 (SEQ ID NO:226), GHPO 894 (SEQ ID NO:228), GHPO 976 (SEQ ID NO:230), GHPO 1011 (SEQ ID NO:232), GHPO 1024 (SEQ ID NO:234),
GHPO 1084 (SEQ ID NO:236), GHPO 1329 (SEQ ID NO:238), GHPO 1330 (SEQ ID NO:240), GHPO 1346 (SEQ ID NO:242), GHPO 1360 (SEQ ID NO:244), GHPO 1388 (SEQ ID NO:246), GHPO 1411 (SEQ ID NO:248), GHPO 1419 (SEQ ID NO:250), GHPO 1446 (SEQ ID NO:252), GHPO 1469 (SEQ ID NO:254), GHPO 1501 (SEQ ID NO:256), GHPO 1505 (SEQ ID
NO:258), GHPO 1522 (SEQ ID NO:260), GHPO 1525 (SEQ ID NO:262), GHPO 1615 (SEQ ID NO:264), GHPO 1689 (SEQ ID NO:266), GHPO 1733 (SEQ ID NO:268), GHPO 18 (SEQ ID NO:270), GHPO 139 (SEQ ID NO:272), GHPO 142 (SEQ ID NO:274), GHPO 250 (SEQ ID NO:276), GHPO 257 (SEQ ID NO:278), GHPO 325 (SEQ ID NO:280), GHPO 355 (SEQ ID
NO:282), GHPO 357 (SEQ ID NO:284), GHPO 454 (SEQ ID NO:286), GHPO 475 (SEQ ID NO:288), GHPO 515 (SEQ ID NO:290), GHPO 527 (SEQ ID NO:292), GHPO 551 (SEQ ID NO:294), GHPO 602 (SEQ ID NO:296), GHPO 626 (SEQ ID NO:298), GHPO 646 (SEQ ID NO:300), GHPO 653 (SEQ ID NO:302), GHPO 655 (SEQ ID NO:304), GHPO 670 (SEQ ID NO:306), GHPO 739 (SEQ ID NO:308), GHPO 798 (SEQ ID NO:310), GHPO 1102 (SEQ ID NO:312), GHPO 1114 (SEQ ID NO:314), GHPO 1152 (SEQ ID NO:316),
GHPO 1272 (SEQ ID NO:318), GHPO 1345 (SEQ ID NO:320), GHPO 1377 (SEQ ID NO:322), GHPO 1424 (SEQ ID NO:324), GHPO 1430 (SEQ ID NO:326), GHPO 1502 (SEQ ID NO:328), GHPO 1600 (SEQ ID NO:330), GHPO 1714 (SEQ ID NO:332), GHPO 359 (SEQ ID NO:334), GHPO 678 (SEQ ID NO:336), GHPO 708 (SEQ ID NO:338), GHPO 759 (SEQ ID
NO:340), GHPO 847 (SEQ ID NO:342), GHPO 1050 (SEQ ID NO:344), GHPO 1101 (SEQ ID NO:346), GHPO 1120 (SEQ ID NO:348), GHPO 1138 (SEQ ID NO:350), GHPO 1310 (SEQ ID NO:352), GHPO 1320 (SEQ ID NO:354), GHPO 1375 (SEQ ID NO:356), GHPO 1432 (SEQ ID NO:358), GHPO 21 (SEQ ID NO:360), GHPO 282 (SEQ ID NO:362), GHPO 1089
(SEQ ID NO:364), GHPO 1141 (SEQ ID NO:366), GHPO 1280 (SEQ ID NO:368), GHPO 1608 (SEQ ID NO:370), GHPO 15 (SEQ ID NO:372), GHPO 16 (SEQ ID NO:374), GHPO 36 (SEQ ID NO:376), GHPO 38 (SEQ ID NO:378), GHPO 52 (SEQ ID NO:380), GHPO 57 (SEQ ID NO:382), GHPO 64 (SEQ ID NO:384), GHPO 79 (SEQ ID NO:386), GHPO 84 (SEQ ID
NO:388), GHPO 86 (SEQ ID NO:390), GHPO 99 (SEQ ID NO:392), GHPO 106 (SEQ ID NO:394), GHPO 118 (SEQ ID NO:396), GHPO 122 (SEQ ID NO:398), GHPO 128 (SEQ ID NO:400), GHPO 138 (SEQ ID NO:402), GHPO 153 (SEQ ID NO:404), GHPO 160 (SEQ ID NO:406), GHPO 168 (SEQ ID NO:408), GHPO 179 (SEQ ID NO:410), GHPO 189 (SEQ ID NO:412), GHPO
229 (SEQ ID NO:414), GHPO 243 (SEQ ID NO:416), GHPO 244 (SEQ ID NO:418), GHPO 251 (SEQ ID NO:420), GHPO 267 (SEQ ID NO:422), GHPO 269 (SEQ ID NO:424), GHPO 279 (SEQ ID NO:426), GHPO 284 (SEQ ID NO:428), GHPO 296 (SEQ ID NO:430), GHPO 300 (SEQ ID NO:432), GHPO 305 (SEQ ID NO:434), GHPO 319 (SEQ ID NO:436), GHPO 330 (SEQ ID NO:438), GHPO 340 (SEQ ID NO:440), GHPO 342 (SEQ ID NO:442), GHPO 344 (SEQ ID NO:444), GHPO 358 (SEQ ID NO:446), GHPO 373 (SEQ ID
NO:448), GHPO 382 (SEQ ID NO:450), GHPO 384 (SEQ ID NO:452), GHPO 398 (SEQ ID NO:454), GHPO 409 (SEQ ID NO:456), GHPO 422 (SEQ ID NO:458), GHPO 430 (SEQ ID NO:460), GHPO 446 (SEQ ID NO:462), GHPO 447 (SEQ ID NO:464), GHPO 450 (SEQ ID NO:466), GHPO 451 (SEQ ID NO:468), GHPO 452 (SEQ ID NO:470), GHPO 456 (SEQ ID NO:472), GHPO
461 (SEQ ID NO:474), GHPO 476 (SEQ ID NO:476), GHPO 478 (SEQ ID NO:478), GHPO 491 (SEQ ID NO:480), GHPO 511 (SEQ ID NO:482), GHPO 519 (SEQ ID NO:484), GHPO 526 (SEQ ID NO:486), GHPO 534 (SEQ ID NO:488), GHPO 536 (SEQ ID NO:490), GHPO 542 (SEQ ID NO:492), GHPO 544 (SEQ ID NO:494), GHPO 576 (SEQ ID NO:496), GHPO 578 (SEQ ID
NO:498), GHPO 580 (SEQ ID NO:500), GHPO 585 (SEQ ID NO:502), GHPO 599 (SEQ ID NO:504), GHPO 639 (SEQ ID NO:506), GHPO 642 (SEQ ID NO:508), GHPO 647 (SEQ ID NO:510), GHPO 654 (SEQ ID NO:512), GHPO 669 (SEQ ID NO:514), GHPO 710 (SEQ ID NO:516), GHPO 713 (SEQ ID NO:518), GHPO 716 (SEQ ID NO:520), GHPO 718 (SEQ ID NO:522), GHPO
726 (SEQ ID NO:524), GHPO 734 (SEQ ID NO:526), GHPO 740 (SEQ ID NO:528), GHPO 770 (SEQ ID NO:530), GHPO 782 (SEQ ID NO:532), GHPO 786 (SEQ ID NO:534), GHPO 792 (SEQ ID NO:536), GHPO 797 (SEQ ID NO:538), GHPO 816 (SEQ ID NO:540), GHPO 828 (SEQ ID NO:542), GHPO 839 (SEQ ID NO:544), GHPO 840 (SEQ ID NO:546), GHPO 842 (SEQ ID
NO:548), GHPO 885 (SEQ ID NO:550), GHPO 889 (SEQ ID NO:552), GHPO 903 (SEQ ID NO:554), GHPO 912 (SEQ ID NO:556), GHPO 946 (SEQ ID NO:558), GHPO 958 (SEQ ID NO:560), GHPO 968 (SEQ ID NO:562), GHPO 987 (SEQ ID NO:564), GHPO 992 (SEQ ID NO:566), GHPO 996 (SEQ ID NO:568), GHPO 997 (SEQ ID NO:570), GHPO 1002 (SEQ ID NO:572), GHPO 1026 (SEQ ID NO:574), GHPO 1028 (SEQ ID NO:576), GHPO 1034 (SEQ ID NO:578), GHPO 1038 (SEQ ID NO:580), GHPO 1059 (SEQ ID
NO:582), GHPO 1065 (SEQ ID NO:584), GHPO 1072 (SEQ ID NO:586), GHPO 1073 (SEQ ID NO:588), GHPO 1088 (SEQ ID NO:590), GHPO 1091 (SEQ ID NO:592), GHPO 1105 (SEQ ID NO:594), GHPO 1115 (SEQ ID NO:596), GHPO 1159 (SEQ ID NO:598), GHPO 1177 (SEQ ID NO:600), GHPO 1187 (SEQ ID NO:602), GHPO 1192 (SEQ ID NO:604), GHPO 1195
(SEQ ID NO:606), GHPO 1224 (SEQ ID NO:608), GHPO 1225 (SEQ ID NO:610), GHPO 1228 (SEQ ID NO:612), GHPO 1229 (SEQ ID NO:614), GHPO 1231 (SEQ ID NO:616), GHPO 1236 (SEQ ID NO:618), GHPO 1242 (SEQ ID NO:620), GHPO 1248 (SEQ ID NO:622), GHPO 1270 (SEQ ID NO:624), GHPO 1271 (SEQ ID NO:626), GHPO 1298 (SEQ ID NO:628),
GHPO 1301 (SEQ ID NO:630), GHPO 1304 (SEQ ID NO:632), GHPO 1315 (SEQ ID NO:634), GHPO 1319 (SEQ ID NO:636), GHPO 1323 (SEQ ID NO:638), GHPO 1331 (SEQ ID NO:640), GHPO 1332 (SEQ ID NO:642), GHPO 1347 (SEQ ID NO:644), GHPO 1373 (SEQ ID NO:646), GHPO 1376 (SEQ ID NO:648), GHPO 1380 (SEQ ID NO:650), GHPO 1394 (SEQ ID
NO:652), GHPO 1407 (SEQ ID NO:654), GHPO 1415 (SEQ ID NO:656), GHPO 1425 (SEQ ID NO:658), GHPO 1427 (SEQ ID NO:660), GHPO 1444 (SEQ ID NO:662), GHPO 1449 (SEQ ID NO:664), GHPO 1465 (SEQ ID NO:666), GHPO 1475 (SEQ ID NO:668), GHPO 1479 (SEQ ID NO:670), GHPO 1483 (SEQ ID NO:672), GHPO 1488 (SEQ ID NO:674), GHPO 1496
(SEQ ID NO:676), GHPO 1524 (SEQ ID NO:678), GHPO 1536 (SEQ ID NO:680), GHPO 1539 (SEQ ID NO:682), GHPO 1540 (SEQ ID NO:684), GHPO 1542 (SEQ ID NO:686), GHPO 1555 (SEQ ID NO:688), GHPO 1560 (SEQ ID NO:690), GHPO 1564 (SEQ ID NO:692), GHPO 1570 (SEQ ID NO:694), GHPO 1588 (SEQ ID NO:696), GHPO 1604 (SEQ ID NO:698), GHPO 1605 (SEQ ID NO:700), GHPO 1619 (SEQ ID NO:702), GHPO 1629 (SEQ ID NO:704), GHPO 1642 (SEQ ID NO:706), GHPO 1654 (SEQ ID
NO:708), GHPO 1661 (SEQ ID NO:710), GHPO 1673 (SEQ ID NO:712), GHPO 1687 (SEQ ID NO:714), GHPO 1692 (SEQ ID NO:716), GHPO 1693 (SEQ ID NO:718), GHPO 1699 (SEQ ID NO:720), GHPO 1738 (SEQ ID NO:722), GHPO 1745 (SEQ ID NO:724), GHPO 1746 (SEQ ID NO:726), GHPO 1754 (SEQ ID NO:728), GHPO 1792 (SEQ ID NO:730), GHPO 1795
(SEQ ID NO:732), GHPO 1796 (SEQ ID NO:734), GHPO 7 (SEQ ID NO:736), GHPO 8 (SEQ ID NO:738), GHPO 9 (SEQ ID NO:740), GHPO 10 (SEQ ID NO:742), GHPO 12 (SEQ ID NO:744), GHPO 25 (SEQ ID NO:746), GHPO 27 (SEQ ID NO:748), GHPO 29 (SEQ ID NO:750), GHPO 30 (SEQ ID NO:752), GHPO 37 (SEQ ID NO:754), GHPO 49 (SEQ ID NO:756), GHPO
51 (SEQ ID NO:758), GHPO 54 (SEQ ID NO:760), GHPO 65 (SEQ ID NO:762), GHPO 66 (SEQ ID NO:764), GHPO 68 (SEQ ID NO:766), GHPO 70 (SEQ ID NO:768), GHPO 77 (SEQ ID NO:770), GHPO 83 (SEQ ID NO:772), GHPO 85 (SEQ ID NO:774), GHPO 87 (SEQ ID NO:776), GHPO 91 (SEQ ID NO:778), GHPO 92 (SEQ ID NO:780), GHPO 96 (SEQ ID
NO:782), GHPO 97 (SEQ ID NO:784), GHPO 111 (SEQ ID NO:786), GHPO 115 (SEQ ID NO:788), GHPO 117 (SEQ ID NO:790), GHPO 123 (SEQ ID NO:792), GHPO 124 (SEQ ID NO:794), GHPO 126 (SEQ ID NO:796), GHPO 127 (SEQ ID NO:798), GHPO 128 (SEQ ID NO:800), GHPO 131 (SEQ ID NO:802), GHPO 133 (SEQ ID NO:804), GHPO 140 (SEQ ID NO:806), GHPO
141 (SEQ ID NO:808), GHPO 145 (SEQ ID NO:810), GHPO 147 (SEQ ID NO:812), GHPO 166 (SEQ ID NO:814), GHPO 181 (SEQ ID NO:816), GHPO 187 (SEQ ID NO:818), GHPO 188 (SEQ ID NO:820), GHPO 192 (SEQ ID NO:822), GHPO 202 (SEQ ID NO:824), GHPO 204 (SEQ ID NO:826), GHPO 205 (SEQ ID NO:828), GHPO 212 (SEQ ID NO:830), GHPO 218 (SEQ ID NO:832), GHPO 226 (SEQ ID NO:834), GHPO 231 (SEQ ID NO:836), GHPO 236 (SEQ ID NO:838), GHPO 239 (SEQ ID NO:840), GHPO 245 (SEQ ID
NO:842), GHPO 246 (SEQ ID NO:844), GHPO 248 (SEQ ID NO:846), GHPO 253 (SEQ ID NO:848), GHPO 265 (SEQ ID NO:850), GHPO 266 (SEQ ID NO:852), GHPO 271 (SEQ ID NO:854), GHPO 272 (SEQ ID NO:856), GHPO 286 (SEQ ID NO:858), GHPO 291 (SEQ ID NO:860), GHPO 292 (SEQ ID NO:862), GHPO 297 (SEQ ID NO:864), GHPO 304 (SEQ ID NO:866), GHPO
307 (SEQ ID NO:868), GHPO 324 (SEQ ID NO:870), GHPO 326 (SEQ ID NO:872), GHPO 331 (SEQ ID NO:874), GHPO 343 (SEQ ID NO:876), GHPO 345 (SEQ ID NO:878), GHPO 346 (SEQ ID NO:880), GHPO 352 (SEQ ID NO:882), GHPO 355 (SEQ ID NO:884), GHPO 363 (SEQ ID NO:886), GHPO 369 (SEQ ID NO:888), GHPO 376 (SEQ ID NO:890), GHPO 378 (SEQ ID
NO:892), GHPO 388 (SEQ ID NO:894), GHPO 396 (SEQ ID NO:896), GHPO 403 (SEQ ID NO:898), GHPO 410 (SEQ ID NO:900), GHPO 415 (SEQ ID NO:902), GHPO 421 (SEQ ID NO:904), GHPO 439 (SEQ ID NO:906), GHPO 441 (SEQ ID NO:908), GHPO 443 (SEQ ID NO:910), GHPO 453 (SEQ ID NO:912), GHPO 455 (SEQ ID NO:914), GHPO 464 (SEQ ID NO:916), GHPO
467 (SEQ ID NO:918), GHPO 468 (SEQ ID NO:920), GHPO 470 (SEQ ID NO:922), GHPO 486 (SEQ ID NO:924), GHPO 487 (SEQ ID NO:926), GHPO 488 (SEQ ID NO:928), GHPO 489 (SEQ ID NO:930), GHPO 498 (SEQ ID NO:932), GHPO 501 (SEQ ID NO:934), GHPO 504 (SEQ ID NO:936), GHPO 512 (SEQ ID NO:938), GHPO 517 (SEQ ID NO:940), GHPO 520 (SEQ ID
NO:942), GHPO 528 (SEQ ID NO:944), GHPO 530 (SEQ ID NO:946), GHPO 532 (SEQ ID NO:948), GHPO 548 (SEQ ID NO:950), GHPO 561 (SEQ ID NO:952), GHPO 564 (SEQ ID NO:954), GHPO 572 (SEQ ID NO:956), GHPO 573 (SEQ ID NO:958), GHPO 574 (SEQ ID NO:960), GHPO 577 (SEQ ID NO:962), GHPO 579 (SEQ ID NO:964), GHPO 583 (SEQ ID NO:966), GHPO 588 (SEQ ID NO:968), GHPO 593 (SEQ ID NO:970), GHPO 597 (SEQ ID NO:972), GHPO 598 (SEQ ID NO:974), GHPO 604 (SEQ ID NO:976), GHPO
606 (SEQ ID NO:978), GHPO 611 (SEQ ID NO:980), GHPO 612 (SEQ ID NO:982), GHPO 615 (SEQ ID NO:984), GHPO 632 (SEQ ID NO:986), GHPO 633 (SEQ ID NO:988), GHPO 637 (SEQ ID NO:990), GHPO 651 (SEQ ID NO:992), GHPO 663 (SEQ ID NO:994), GHPO 686 (SEQ ID NO:996), GHPO 693 (SEQ ID NO:998), GHPO 698 (SEQ ID NO:1000), GHPO 703 (SEQ ID
NO: 1002), GHPO 704 (SEQ ID NO: 1004), GHPO 705 (SEQ ID NO: 1006), GHPO 707 (SEQ ID NO:1008), GHPO 721 (SEQ ID NO:1010), GHPO 727 (SEQ ID NO: 1012), GHPO 728 (SEQ ID NO:1014), GHPO 733 (SEQ ID NO: 1016), GHPO 758 (SEQ ID NO: 1018), GHPO 763 (SEQ ID NO: 1020), GHPO 771 (SEQ ID NO: 1022), GHPO 774 (SEQ ID NO: 1024), GHPO 776
(SEQ ID NO: 1026), GHPO 783 (SEQ ID NO: 1028), GHPO 800 (SEQ ID NO: 1030), GHPO 806 (SEQ ID NO: 1032), GHPO 807 (SEQ ID NO: 1034), GHPO 808 (SEQ ID NO: 1036), GHPO 809 (SEQ ID NO:1038), GHPO 811 (SEQ ID NO:1040), GHPO 815 (SEQ ID NO:1042), GHPO 819 (SEQ ID NO:1044), GHPO 841 (SEQ ID NO:1046), GHPO 843 (SEQ ID NO:1048),
GHPO 846 (SEQ ID NO: 1050), GHPO 875 (SEQ ID NO: 1052), GHPO 892 (SEQ ID NO: 1054), GHPO 902 (SEQ ID NO: 1056), GHPO 904 (SEQ ID NO: 1058), GHPO 906 (SEQ ID NO: 1060), GHPO 908 (SEQ ID NO: 1062), GHPO 921 (SEQ ID NO: 1064), GHPO 923 (SEQ ID NO: 1066), GHPO 926 (SEQ ID NO: 1068), GHPO 933 (SEQ ID NO: 1070), GHPO 939 (SEQ ID
NO: 1072), GHPO 940 (SEQ ID NO: 1074), GHPO 943 (SEQ ID NO: 1076), GHPO 951 (SEQ ID NO: 1078), GHPO 961 (SEQ ID NO: 1080), GHPO 965 (SEQ ID NO: 1082), GHPO 990 (SEQ ID NO: 1084), GHPO 991 (SEQ ID NO: 1086), GHPO 998 (SEQ ID NO: 1088), GHPO 1001 (SEQ ID NO: 1090), GHPO 1005 (SEQ ID NO: 1092), GHPO 1033 (SEQ ID NO: 1094), GHPO 1039 (SEQ ID NO:1096), GHPO 1041 (SEQ ID NO:1098), GHPO 1043 (SEQ ID NO: 1100), GHPO 1044 (SEQ ID NO: 1102), GHPO 1051 (SEQ ID
NO:l 104), GHPO 1058 (SEQ ID NO:l 106), GHPO 1060 (SEQ ID NO: 1108), GHPO 1075 (SEQ ID NO:l 110), GHPO 1077 (SEQ ID NO:l 112), GHPO 1082 (SEQ ID NO: 1114), GHPO 1083 (SEQ ID NO:1116), GHPO 1086 (SEQ ID NO: 1118), GHPO 1087 (SEQ ID NO: 1120), GHPO 1090 (SEQ ID NO:1122), GHPO 1097 (SEQ ID NO: 1124), GHPO 1098 (SEQ ID NO: 1126),
GHPO 1103 (SEQ ID NO: 1128), GHPO 1113 (SEQ ID NO:l 130), GHPO 1116 (SEQ ID NO: 1132), GHPO 1123 (SEQ ID NO: 1134), GHPO 1125 (SEQ ID NO:1136), GHPO 1129 (SEQ ID NO:1138), GHPO 1130 (SEQ ID NO:l 140), GHPO 1134 (SEQ ID NO: 1142), GHPO 1161 (SEQ ID NO: 1144), GHPO 1166 (SEQ ID NO: 1146), GHPO 1170 (SEQ ID NO: 1148), GHPO
1175 (SEQ ID NO: 1150), GHPO 1181 (SEQ ID NO: 1152), GHPO 1186 (SEQ ID NO: 1154), GHPO 1188 (SEQ ID NO: 1156), GHPO 1191 (SEQ ID NO:1158), GHPO 1193 (SEQ ID NO: 1160), GHPO 1196 (SEQ ID NO: 1162), GHPO 1204 (SEQ ID NO:1164), GHPO 1210 (SEQ ID NO:1166), GHPO 1211 (SEQ ID NO:1168), GHPO 1216 (SEQ ID NO:1170), GHPO 1218 (SEQ
ID NO: 1172), GHPO 1220 (SEQ ID NO: 1174), GHPO 1223 (SEQ ID NO: 1176), GHPO 1226 (SEQ ID NO: 1178), GHPO 1240 (SEQ ID NO: 1180), GHPO 1246 (SEQ ID NO: 1182), GHPO 1251 (SEQ ID NO: 1184), GHPO 1252 (SEQ ID NO: 1186), GHPO 1261 (SEQ ID NO: 1188), GHPO 1265 (SEQ ID NO: 1190), GHPO 1267 (SEQ ID NO: 1192), GHPO 1278 (SEQ ID
NO: 1194), GHPO 1282 (SEQ ID NO: 1196), GHPO 1283 (SEQ ID NO: 1198), GHPO 1287 (SEQ ID NO: 1200), GHPO 1292 (SEQ ID NO: 1202), GHPO 1293 (SEQ ID NO:1204), GHPO 1302 (SEQ ID NO: 1206), GHPO 1309 (SEQ ID NO: 1208), GHPO 1317 (SEQ ID NO: 1210), GHPO 1318 (SEQ ID NO:1212), GHPO 1321 (SEQ ID NO:1214), GHPO 1325 (SEQ ID NO: 1216), GHPO 1341 (SEQ ID NO:1218), GHPO 1351 (SEQ ID NO:1220), GHPO 1354 (SEQ ID NO: 1222), GHPO 1363 (SEQ ID NO: 1224), GHPO 1371 (SEQ
ID NO:1226), GHPO 1381 (SEQ ID NO:1228), GHPO 1401 (SEQ ID NO: 1230), GHPO 1402 (SEQ ID NO: 1232), GHPO 1403 (SEQ ID NO: 1234), GHPO 1408 (SEQ ID NO: 1236), GHPO 1416 (SEQ ID NO:1238), GHPO 1420 (SEQ ID NO: 1240), GHPO 1428 (SEQ ID NO: 1242), GHPO 1437 (SEQ ID NO: 1244), GHPO 1439 (SEQ ID NO: 1246), GHPO 1460 (SEQ ID
NO: 1248), GHPO 1463 (SEQ ID NO: 1250), GHPO 1472 (SEQ ID NO: 1252), GHPO 1474 (SEQ ID NO: 1254), GHPO 1484 (SEQ ID NO: 1256), GHPO 1489 (SEQ ID NO: 1258), GHPO 1494 (SEQ ID NO: 1260), GHPO 1495 (SEQ ID NO: 1262), GHPO 1498 (SEQ ID NO: 1264), GHPO 1499 (SEQ ID NO:1266), GHPO 1500 (SEQ ID NO:1268), GHPO 1503 (SEQ ID NO:1270),
GHPO 1504 (SEQ ID NO:1272), GHPO 1510 (SEQ ID NO:1274), GHPO 1518 (SEQ ID NO: 1276), GHPO 1533 (SEQ ID NO: 1278), GHPO 1541 (SEQ ID NO: 1280), GHPO 1544 (SEQ ID NO: 1282), GHPO 1548 (SEQ ID NO:1284), GHPO 1565 (SEQ ID NO:1286), GHPO 1575 (SEQ ID NO:1288), GHPO 1582 (SEQ ID NO:1290), GHPO 1595 (SEQ ID NO:1292), GHPO
1597 (SEQ ID NO:1294), GHPO 1599 (SEQ ID NO: 1296), GHPO 1601 (SEQ ID NO:1298), GHPO 1609 (SEQ ID NO:1300), GHPO 1613 (SEQ ID NO:1302), GHPO 1614 (SEQ ID NO:1304), GHPO 1626 (SEQ ID NO:1306), GHPO 1628 (SEQ ID NO: 1308), GHPO 1639 (SEQ ID NO: 1310), GHPO 1640 (SEQ ID NO:1312), GHPO 1641 (SEQ ID NO: 1314), GHPO 1646 (SEQ
ID NO: 1316), GHPO 1662 (SEQ ID NO: 1318), GHPO 1667 (SEQ ID NO:1320), GHPO 1668 (SEQ ID NO: 1322), GHPO 1670 (SEQ ID NO: 1324), GHPO 1671 (SEQ ID NO: 1326), GHPO 1672 (SEQ ID NO: 1328), GHPO 1678 (SEQ ID NO: 1330), GHPO 1684 (SEQ ID NO: 1332), GHPO 1695 (SEQ ID NO: 1334), GHPO 1697 (SEQ ID NO: 1336), GHPO 1701 (SEQ ID NO: 1338), GHPO 1719 (SEQ ID NO: 1340), GHPO 1723 (SEQ ID NO: 1342), GHPO 1732 (SEQ ID NO:1344), GHPO 1739 (SEQ ID NO:1346), GHPO
1741 (SEQ ID NO: 1348), GHPO 1747 (SEQ ID NO: 1350), GHPO 1749 (SEQ ID NO: 1352), GHPO 1750 (SEQ ID NO: 1354), GHPO 1751 (SEQ ID NO: 1356), GHPO 1755 (SEQ ID NO:1358), GHPO 1771 (SEQ ID NO:1360), GHPO 1786 (SEQ ID NO:1362), and GHPO 1789 (SEQ ID NO:1364), which can be used, e.g., in methods to prevent, treat, or diagnose Helicobacter infection. The sequences of polynucleotides that encode these polypeptides are shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363). Those skilled in the art will understand that the invention also includes polynucleotide molecules that encode mutants and derivatives of these polypeptides, which can result from the addition, deletion, or substitution of non-essential amino acids, as is described further below.
In addition to the polynucleotide molecules described above, the invention includes the corresponding polypeptides (i.e., polypeptides encoded by the polynucleotide molecules ofthe invention, or fragments thereof), and monospecific antibodies that specifically bind to these polypeptides. The polypeptides ofthe invention include those having the amino acid sequences shown in the sequence listing (even numbers, up to SEQ ID NO: 1363), as well as mature forms of proteins having sequences shown in the sequence listing in their unprocessed forms, and fragments thereof. The present invention has many applications and includes expression cassettes, vectors, and cells transformed or transfected with the polynucleotides of the invention. Accordingly, the present invention provides (i) methods for producing polypeptides ofthe invention in recombinant host systems and related expression cassettes, vectors, and transformed or transfected cells; (ii) live vaccine vectors, such as pox virus, Salmonella typhimurium, and Vibrio cholerae vectors, that contain polynucleotides ofthe invention (such vaccine vectors being useful in, e.g., methods for preventing or treating Helicobacter infection) in combination with a diluent or carrier, and related pharmaceutical compositions and associated therapeutic and/or prophylactic methods; (iii) therapeutic and/or prophylactic methods involving administration of polynucleotide molecules, either in a naked form or formulated with a delivery vehicle, polypeptides or mixtures of polypeptides, or monospecific antibodies ofthe invention, and related pharmaceutical compositions; (iv) methods for detecting the presence of Helicobacter in biological samples, which can involve the use of polynucleotide molecules, monospecific antibodies, or polypeptides ofthe invention; and (v) methods for purifying polypeptides of the invention by antibody-based affinity chromatography.
Detailed Description Open reading frames (ORFs) encoding new polypeptides, designated GHPO 35 (SEQ ID NO:2), GHPO 55 (SEQ ID NO:4), GHPO 78 (SEQ ID NO:6), GHPO 89 (SEQ ID NO:8), GHPO 129 (SEQ ID NO: 10), GHPO 541
(SEQ ID NO:12), GHPO 607 (SEQ ID NO:14), GHPO 635 (SEQ ID NO:16), GHPO 701 (SEQ ID NO:18), GHPO 712 (SEQ ID NO:20), GHPO 761 (SEQ ID NO:22), GHPO 838 (SEQ ID NO:24), GHPO 1034 (SEQ ID NO:26), GHPO 1085 (SEQ ID NO:28), GHPO 1213 (SEQ ID NO:30), GHPO 1255 (SEQ ID NO:32), GHPO 1308 (SEQ ID NO:34), GHPO 1389 (SEQ ID
NO:36), GHPO 1706 (SEQ ID NO:38), GHPO 234 (SEQ ID NO:40), GHPO 314 (SEQ ID NO:42), GHPO 510 (SEQ ID NO:44), GHPO 603 (SEQ ID NO:46), GHPO 937 (SEQ ID NO:48), GHPO 1027 (SEQ ID NO:50), GHPO 1099 (SEQ ID NO:52), GHPO 1151 (SEQ ID NO:54), GHPO 1275 (SEQ ID NO:56), GHPO 1365 (SEQ ID NO:58), GHPO 1578 (SEQ ID NO:60), GHPO 22 (SEQ ID NO:62), GHPO 58 (SEQ ID NO:64), GHPO 200 (SEQ ID NO:66), GHPO 558 (SEQ ID NO:68), GHPO 563 (SEQ ID NO:70), GHPO 695 (SEQ
ID NO:72), GHPO 699 (SEQ ID NO:74), GHPO 702 (SEQ ID NO:76), GHPO 709 (SEQ ID NO:78), GHPO 741 (SEQ ID NO:80), GHPO 762 (SEQ ID NO:82), GHPO 827 (SEQ ID NO:84), GHPO 852 (SEQ ID NO:86), GHPO 1013 (SEQ ID NO: 88), GHPO 1020 (SEQ ID NO:90), GHPO 1031 (SEQ ID NO:92), GHPO 1052 (SEQ ID NO:94), GHPO 1127 (SEQ ID NO:96), GHPO
1149 (SEQ ID NO:98), GHPO 1176 (SEQ ID NO: 100), GHPO 1250 (SEQ ID NO: 102), GHPO 1312 (SEQ ID NO:104), GHPO 1358 (SEQ ID NO: 106), GHPO 1490 (SEQ ID NO: 108), GHPO 1559 (SEQ ID NO: 110), GHPO 1651 (SEQ ID NO: 112), GHPO 1726 (SEQ ID NO:114), GHPO 1780 (SEQ ID NO: 116), GHPO 895 (SEQ ID NO: 118), GHPO 1447 (SEQ ID NO: 120),
GHPO 28 (SEQ ID NO: 122), GHPO 86 (SEQ ID NO: 124), GHPO 155 (SEQ ID NO: 126), GHPO 157 (SEQ ID NO: 128), GHPO 237 (SEQ ID NO: 130), GHPO 290 (SEQ ID NO: 132), GHPO 293 (SEQ ID NO: 134), GHPO 335 (SEQ ID NO: 136), GHPO 374 (SEQ ID NO: 138), GHPO 442 (SEQ ID NO: 140), GHPO 480 (SEQ ID NO: 142), GHPO 523 (SEQ ID NO: 144), GHPO
610 (SEQ ID NO:146), GHPO 675 (SEQ ID NO:148), GHPO 690 (SEQ ID NO: 150), GHPO 829 (SEQ ID NO: 152), GHPO 850 (SEQ ID NO: 154), GHPO 876 (SEQ ID NO: 156), GHPO 984 (SEQ ID NO: 158), GHPO 989 (SEQ ID NO: 160), GHPO 1111 (SEQ ID NO: 162), GHPO 1145 (SEQ ID NO: 164), GHPO 1256 (SEQ ID NO: 166), GHPO 1264 (SEQ ID NO: 168), GHPO 1316
(SEQ ID NO: 170), GHPO 1368 (SEQ ID NO: 172), GHPO 1442 (SEQ ID NO: 174), GHPO 1506 (SEQ ID NO: 176), GHPO 1543 (SEQ ID NO: 178), GHPO 1574 (SEQ ID NO:180), GHPO 1627 (SEQ ID NO:182), GHPO 1657 (SEQ ID NO: 184), GHPO 1664 (SEQ ID NO: 186), GHPO 1694 (SEQ ID NO: 188), GHPO 1704 (SEQ ID NO: 190), GHPO 1763 (SEQ ID NO: 192), GHPO 616 (SEQ ID NO: 194), GHPO 76 (SEQ ID NO: 196), GHPO 109 (SEQ ID NO: 198), GHPO 163 (SEQ ID NO:200), GHPO 169 (SEQ ID NO:202),
GHPO 208 (SEQ ID NO:204), GHPO 219 (SEQ ID NO:206), GHPO 445 (SEQ ID NO:208), GHPO 479 (SEQ ID NO:210), GHPO 525 (SEQ ID NO:212), GHPO 535 (SEQ ID NO:214), GHPO 731 (SEQ ID NO:216), GHPO 836 (SEQ ID NO:218), GHPO 879 (SEQ ID NO:220), GHPO 881 (SEQ ID NO:222), GHPO 886 (SEQ ID NO:224), GHPO 893 (SEQ ID NO:226), GHPO
894 (SEQ ID NO:228), GHPO 976 (SEQ ID NO:230), GHPO 1011 (SEQ ID NO:232), GHPO 1024 (SEQ ID NO:234), GHPO 1084 (SEQ ID NO:236), GHPO 1329 (SEQ ID NO:238), GHPO 1330 (SEQ ID NO:240), GHPO 1346 (SEQ ID NO:242), GHPO 1360 (SEQ ID NO:244), GHPO 1388 (SEQ ID NO:246), GHPO 1411 (SEQ ID NO:248), GHPO 1419 (SEQ ID NO:250),
GHPO 1446 (SEQ ID NO:252), GHPO 1469 (SEQ ID NO:254), GHPO 1501 (SEQ ID NO:256), GHPO 1505 (SEQ ID NO:258), GHPO 1522 (SEQ ID NO:260), GHPO 1525 (SEQ ID NO:262), GHPO 1615 (SEQ ID NO:264), GHPO 1689 (SEQ ID NO:266), GHPO 1733 (SEQ ID NO:268), GHPO 18 (SEQ ID NO:270), GHPO 139 (SEQ ID NO:272), GHPO 142 (SEQ ID
NO:274), GHPO 250 (SEQ ID NO:276), GHPO 257 (SEQ ID NO:278), GHPO 325 (SEQ ID NO:280), GHPO 355 (SEQ ID NO:282), GHPO 357 (SEQ ID NO:284), GHPO 454 (SEQ ID NO:286), GHPO 475 (SEQ ID NO:288), GHPO 515 (SEQ ID NO:290), GHPO 527 (SEQ ID NO:292), GHPO 551 (SEQ ID NO:294), GHPO 602 (SEQ ID NO:296), GHPO 626 (SEQ ID NO:298), GHPO
646 (SEQ ID NO:300), GHPO 653 (SEQ ID NO:302), GHPO 655 (SEQ ID NO:304), GHPO 670 (SEQ ID NO:306), GHPO 739 (SEQ ID NO:308), GHPO 798 (SEQ ID NO:310), GHPO 1102 (SEQ ID NO:312), GHPO 1114 (SEQ ID NO:314), GHPO 1152 (SEQ ID NO:316), GHPO 1272 (SEQ ID NO:318), GHPO 1345 (SEQ ID NO:320), GHPO 1377 (SEQ ID NO:322), GHPO 1424 (SEQ ID NO:324), GHPO 1430 (SEQ ID NO:326), GHPO 1502 (SEQ ID NO:328), GHPO 1600 (SEQ ID NO:330), GHPO 1714 (SEQ ID NO:332),
GHPO 359 (SEQ ID NO:334), GHPO 678 (SEQ ID NO:336), GHPO 708 (SEQ ID NO:338), GHPO 759 (SEQ ID NO:340), GHPO 847 (SEQ ID NO:342), GHPO 1050 (SEQ ID NO:344), GHPO 1101 (SEQ ID NO:346), GHPO 1120 (SEQ ID NO:348), GHPO 1138 (SEQ ID NO:350), GHPO 1310 (SEQ ID NO:352), GHPO 1320 (SEQ ID NO:354), GHPO 1375 (SEQ ID
NO:356), GHPO 1432 (SEQ ID NO:358), GHPO 21 (SEQ ID NO:360), GHPO 282 (SEQ ID NO:362), GHPO 1089 (SEQ ID NO:364), GHPO 1141 (SEQ ID NO:366), GHPO 1280 (SEQ ID NO:368), GHPO 1608 (SEQ ID NO:370), GHPO 15 (SEQ ID NO:372), GHPO 16 (SEQ ID NO:374), GHPO 36 (SEQ ID NO:376), GHPO 38 (SEQ ID NO:378), GHPO 52 (SEQ ID NO:380), GHPO
57 (SEQ ID NO:382), GHPO 64 (SEQ ID NO:384), GHPO 79 (SEQ ID NO:386), GHPO 84 (SEQ ID NO:388), GHPO 86 (SEQ ID NO:390), GHPO 99 (SEQ ID NO:392), GHPO 106 (SEQ ID NO:394), GHPO 118 (SEQ ID NO:396), GHPO 122 (SEQ ID NO:398), GHPO 128 (SEQ ID NO:400), GHPO 138 (SEQ ID NO:402), GHPO 153 (SEQ ID NO:404), GHPO 160 (SEQ ID
NO:406), GHPO 168 (SEQ ID NO:408), GHPO 179 (SEQ ID NO:410), GHPO 189 (SEQ ID NO:412), GHPO 229 (SEQ ID NO:414), GHPO 243 (SEQ ID NO:416), GHPO 244 (SEQ ID NO:418), GHPO 251 (SEQ ID NO:420), GHPO 267 (SEQ ID NO:422), GHPO 269 (SEQ ID NO:424), GHPO 279 (SEQ ID NO:426), GHPO 284 (SEQ ID NO:428), GHPO 296 (SEQ ID NO:430), GHPO
300 (SEQ ID NO:432), GHPO 305 (SEQ ID NO:434), GHPO 319 (SEQ ID NO:436), GHPO 330 (SEQ ID NO:438), GHPO 340 (SEQ ID NO:440), GHPO 342 (SEQ ID NO:442), GHPO 344 (SEQ ID NO:444), GHPO 358 (SEQ ID NO:446), GHPO 373 (SEQ ID NO:448), GHPO 382 (SEQ ID NO:450), GHPO 384 (SEQ ID NO:452), GHPO 398 (SEQ ID NO:454), GHPO 409 (SEQ ID NO:456), GHPO 422 (SEQ ID NO:458), GHPO 430 (SEQ ID NO:460), GHPO 446 (SEQ ID NO:462), GHPO 447 (SEQ ID NO:464), GHPO 450 (SEQ ID
NO:466), GHPO 451 (SEQ ID NO:468), GHPO 452 (SEQ ID NO:470), GHPO 456 (SEQ ID NO:472), GHPO 461 (SEQ ID NO:474), GHPO 476 (SEQ ID NO:476), GHPO 478 (SEQ ID NO:478), GHPO 491 (SEQ ID NO:480), GHPO 511 (SEQ ID NO:482), GHPO 519 (SEQ ID NO:484), GHPO 526 (SEQ ID NO:486), GHPO 534 (SEQ ID NO:488), GHPO 536 (SEQ ID NO:490), GHPO
542 (SEQ ID NO:492), GHPO 544 (SEQ ID NO:494), GHPO 576 (SEQ ID NO:496), GHPO 578 (SEQ ID NO:498), GHPO 580 (SEQ ID NO:500), GHPO 585 (SEQ ID NO:502), GHPO 599 (SEQ ID NO:504), GHPO 639 (SEQ ID NO:506), GHPO 642 (SEQ ID NO:508), GHPO 647 (SEQ ID NO:510), GHPO 654 (SEQ ID NO:512), GHPO 669 (SEQ ID NO:514), GHPO 710 (SEQ ID
NO:516), GHPO 713 (SEQ ID NO:518), GHPO 716 (SEQ ID NO:520), GHPO 718 (SEQ ID NO:522), GHPO 726 (SEQ ID NO:524), GHPO 734 (SEQ ID NO:526), GHPO 740 (SEQ ID NO:528), GHPO 770 (SEQ ID NO:530), GHPO 782 (SEQ ID NO:532), GHPO 786 (SEQ ID NO:534), GHPO 792 (SEQ ID NO:536), GHPO 797 (SEQ ID NO:538), GHPO 816 (SEQ ID NO:540), GHPO
828 (SEQ ID NO:542), GHPO 839 (SEQ ID NO:544), GHPO 840 (SEQ ID NO:546), GHPO 842 (SEQ ID NO:548), GHPO 885 (SEQ ID NO:550), GHPO 889 (SEQ ID NO:552), GHPO 903 (SEQ ID NO:554), GHPO 912 (SEQ ID NO:556), GHPO 946 (SEQ ID NO:558), GHPO 958 (SEQ ID NO:560), GHPO 968 (SEQ ID NO:562), GHPO 987 (SEQ ID NO:564), GHPO 992 (SEQ ID
NO:566), GHPO 996 (SEQ ID NO:568), GHPO 997 (SEQ ID NO:570), GHPO 1002 (SEQ ID NO:572), GHPO 1026 (SEQ ID NO:574), GHPO 1028 (SEQ ID NO:576), GHPO 1034 (SEQ ID NO:578), GHPO 1038 (SEQ ID NO:580), GHPO 1059 (SEQ ID NO:582), GHPO 1065 (SEQ ID NO:584), GHPO 1072 (SEQ ID NO:586), GHPO 1073 (SEQ ID NO:588), GHPO 1088 (SEQ ID NO:590), GHPO 1091 (SEQ ID NO:592), GHPO 1105 (SEQ ID NO:594), GHPO 1115 (SEQ ID NO: 596), GHPO 1159 (SEQ ID NO:598), GHPO 1177
(SEQ ID NO:600), GHPO 1187 (SEQ ID NO:602), GHPO 1192 (SEQ ID NO:604), GHPO 1195 (SEQ ID NO:606), GHPO 1224 (SEQ ID NO:608), GHPO 1225 (SEQ ID NO:610), GHPO 1228 (SEQ ID NO:612), GHPO 1229 (SEQ ID NO:614), GHPO 1231 (SEQ ID NO:616), GHPO 1236 (SEQ ID NO:618), GHPO 1242 (SEQ ID NO:620), GHPO 1248 (SEQ ID NO:622),
GHPO 1270 (SEQ ID NO:624), GHPO 1271 (SEQ ID NO:626), GHPO 1298 (SEQ ID NO:628), GHPO 1301 (SEQ ID NO:630), GHPO 1304 (SEQ ID NO:632), GHPO 1315 (SEQ ID NO:634), GHPO 1319 (SEQ ID NO:636), GHPO 1323 (SEQ ID NO:638), GHPO 1331 (SEQ ID NO:640), GHPO 1332 (SEQ ID NO:642), GHPO 1347 (SEQ ID NO:644), GHPO 1373 (SEQ ID
NO:646), GHPO 1376 (SEQ ID NO:648), GHPO 1380 (SEQ ID NO:650), GHPO 1394 (SEQ ID NO:652), GHPO 1407 (SEQ ID NO:654), GHPO 1415 (SEQ ID NO:656), GHPO 1425 (SEQ ID NO:658), GHPO 1427 (SEQ ID NO:660), GHPO 1444 (SEQ ID NO:662), GHPO 1449 (SEQ ID NO:664), GHPO 1465 (SEQ ID NO:666), GHPO 1475 (SEQ ID NO:668), GHPO 1479
(SEQ ID NO:670), GHPO 1483 (SEQ ID NO:672), GHPO 1488 (SEQ ID NO:674), GHPO 1496 (SEQ ID NO:676), GHPO 1524 (SEQ ID NO:678), GHPO 1536 (SEQ ID NO:680), GHPO 1539 (SEQ ID NO:682), GHPO 1540 (SEQ ID NO:684), GHPO 1542 (SEQ ID NO:686), GHPO 1555 (SEQ ID NO:688), GHPO 1560 (SEQ ID NO:690), GHPO 1564 (SEQ ID NO:692),
GHPO 1570 (SEQ ID NO:694), GHPO 1588 (SEQ ID NO:696), GHPO 1604 (SEQ ID NO:698), GHPO 1605 (SEQ ID NO:700), GHPO 1619 (SEQ ID NO:702), GHPO 1629 (SEQ ID NO:704), GHPO 1642 (SEQ ID NO:706), GHPO 1654 (SEQ ID NO:708), GHPO 1661 (SEQ ID NO:710), GHPO 1673 (SEQ ID NO:712), GHPO 1687 (SEQ ID NO:714), GHPO 1692 (SEQ ID NO:716), GHPO 1693 (SEQ ID NO:718), GHPO 1699 (SEQ ID NO:720), GHPO 1738 (SEQ ID NO:722), GHPO 1745 (SEQ ID NO:724), GHPO 1746
(SEQ ID NO:726), GHPO 1754 (SEQ ID NO:728), GHPO 1792 (SEQ ID NO:730), GHPO 1795 (SEQ ID NO:732), GHPO 1796 (SEQ ID NO:734), GHPO 7 (SEQ ID NO:736), GHPO 8 (SEQ ID NO:738), GHPO 9 (SEQ ID NO:740), GHPO 10 (SEQ ID NO:742), GHPO 12 (SEQ ID NO:744), GHPO 25 (SEQ ID NO:746), GHPO 27 (SEQ ID NO:748), GHPO 29 (SEQ ID
NO:750), GHPO 30 (SEQ ID NO:752), GHPO 37 (SEQ ID NO:754), GHPO 49 (SEQ ID NO:756), GHPO 51 (SEQ ID NO:758), GHPO 54 (SEQ ID NO:760), GHPO 65 (SEQ ID NO:762), GHPO 66 (SEQ ID NO:764), GHPO 68 (SEQ ID NO:766), GHPO 70 (SEQ ID NO:768), GHPO 77 (SEQ ID NO:770), GHPO 83 (SEQ ID NO:772), GHPO 85 (SEQ ID NO:774), GHPO
87 (SEQ ID NO:776), GHPO 91 (SEQ ID NO:778), GHPO 92 (SEQ ID NO:780), GHPO 96 (SEQ ID NO:782), GHPO 97 (SEQ ID NO:784), GHPO 111 (SEQ ID NO:786), GHPO 115 (SEQ ID NO:788), GHPO 117 (SEQ ID NO:790), GHPO 123 (SEQ ID NO:792), GHPO 124 (SEQ ID NO:794), GHPO 126 (SEQ ID NO:796), GHPO 127 (SEQ ID NO:798), GHPO 128 (SEQ ID
NO:800), GHPO 131 (SEQ ID NO:802), GHPO 133 (SEQ ID NO:804), GHPO 140 (SEQ ID NO:806), GHPO 141 (SEQ ID NO:808), GHPO 145 (SEQ ID NO:810), GHPO 147 (SEQ ID NO:812), GHPO 166 (SEQ ID NO:814), GHPO 181 (SEQ ID NO:816), GHPO 187 (SEQ ID NO:818), GHPO 188 (SEQ ID NO: 820), GHPO 192 (SEQ ID NO: 822), GHPO 202 (SEQ ID NO: 824), GHPO
204 (SEQ ID NO:826), GHPO 205 (SEQ ID NO:828), GHPO 212 (SEQ ID NO:830), GHPO 218 (SEQ ID NO:832), GHPO 226 (SEQ ID NO:834), GHPO 231 (SEQ ID NO:836), GHPO 236 (SEQ ID NO:838), GHPO 239 (SEQ ID NO: 840), GHPO 245 (SEQ ID NO: 842), GHPO 246 (SEQ ID NO: 844), GHPO 248 (SEQ ID NO:846), GHPO 253 (SEQ ID NO:848), GHPO 265 (SEQ ID NO:850), GHPO 266 (SEQ ID NO:852), GHPO 271 (SEQ ID NO:854), GHPO 272 (SEQ ID NO:856), GHPO 286 (SEQ ID NO:858), GHPO 291 (SEQ ID
NO:860), GHPO 292 (SEQ ID NO:862), GHPO 297 (SEQ ID NO:864), GHPO 304 (SEQ ID NO: 866), GHPO 307 (SEQ ID NO: 868), GHPO 324 (SEQ ID NO:870), GHPO 326 (SEQ ID NO:872), GHPO 331 (SEQ ID NO:874), GHPO 343 (SEQ ID NO:876), GHPO 345 (SEQ ID NO:878), GHPO 346 (SEQ ID NO:880), GHPO 352 (SEQ ID NO:882), GHPO 355 (SEQ ID NO:884), GHPO
363 (SEQ ID NO:886), GHPO 369 (SEQ ID NO:888), GHPO 376 (SEQ ID NO:890), GHPO 378 (SEQ ID NO:892), GHPO 388 (SEQ ID NO:894), GHPO 396 (SEQ ID NO:896), GHPO 403 (SEQ ID NO:898), GHPO 410 (SEQ ID NO:900), GHPO 415 (SEQ ID NO:902), GHPO 421 (SEQ ID NO:904), GHPO 439 (SEQ ID NO:906), GHPO 441 (SEQ ID NO:908), GHPO 443 (SEQ ID
NO:910), GHPO 453 (SEQ ID NO:912), GHPO 455 (SEQ ID NO:914), GHPO 464 (SEQ ID NO:916), GHPO 467 (SEQ ID NO:918), GHPO 468 (SEQ ID NO:920), GHPO 470 (SEQ ID NO:922), GHPO 486 (SEQ ID NO:924), GHPO 487 (SEQ ID NO:926), GHPO 488 (SEQ ID NO:928), GHPO 489 (SEQ ID NO:930), GHPO 498 (SEQ ID NO:932), GHPO 501 (SEQ ID NO:934), GHPO
504 (SEQ ID NO:936), GHPO 512 (SEQ ID NO:938), GHPO 517 (SEQ ID NO:940), GHPO 520 (SEQ ID NO:942), GHPO 528 (SEQ ID NO:944), GHPO 530 (SEQ ID NO:946), GHPO 532 (SEQ ID NO:948), GHPO 548 (SEQ ID NO:950), GHPO 561 (SEQ ID NO:952), GHPO 564 (SEQ ID NO:954), GHPO 572 (SEQ ID NO:956), GHPO 573 (SEQ ID NO:958), GHPO 574 (SEQ ID
NO:960), GHPO 577 (SEQ ID NO:962), GHPO 579 (SEQ ID NO:964), GHPO 583 (SEQ ID NO:966), GHPO 588 (SEQ ID NO:968), GHPO 593 (SEQ ID NO:970), GHPO 597 (SEQ ID NO:972), GHPO 598 (SEQ ID NO:974), GHPO 604 (SEQ ID NO:976), GHPO 606 (SEQ ID NO:978), GHPO 611 (SEQ ID NO:980), GHPO 612 (SEQ ID NO:982), GHPO 615 (SEQ ID NO:984), GHPO 632 (SEQ ID NO:986), GHPO 633 (SEQ ID NO:988), GHPO 637 (SEQ ID NO:990), GHPO 651 (SEQ ID NO:992), GHPO 663 (SEQ ID NO:994), GHPO
686 (SEQ ID NO:996), GHPO 693 (SEQ ID NO:998), GHPO 698 (SEQ ID NO:1000), GHPO 703 (SEQ ID NO:1002), GHPO 704 (SEQ ID NO:1004), GHPO 705 (SEQ ID NO:1006), GHPO 707 (SEQ ID NO:1008), GHPO 721 (SEQ ID NO: 1010), GHPO 727 (SEQ ID NO: 1012), GHPO 728 (SEQ ID NO:1014), GHPO 733 (SEQ ID NO:1016), GHPO 758 (SEQ ID NO:1018),
GHPO 763 (SEQ ID NO: 1020), GHPO 771 (SEQ ID NO: 1022), GHPO 774 (SEQ ID NO: 1024), GHPO 776 (SEQ ID NO: 1026), GHPO 783 (SEQ ID NO:1028), GHPO 800 (SEQ ID NO:1030), GHPO 806 (SEQ ID NO:1032), GHPO 807 (SEQ ID NO:1034), GHPO 808 (SEQ ID NO:1036), GHPO 809 (SEQ ID NO: 1038), GHPO 811 (SEQ ID NO: 1040), GHPO 815 (SEQ ID
NO:1042), GHPO 819 (SEQ ID NO:1044), GHPO 841 (SEQ ID NO:1046), GHPO 843 (SEQ ID NO: 1048), GHPO 846 (SEQ ID NO: 1050), GHPO 875 (SEQ ID NO: 1052), GHPO 892 (SEQ ID NO: 1054), GHPO 902 (SEQ ID NO:1056), GHPO 904 (SEQ ID NO:1058), GHPO 906 (SEQ ID NO:1060), GHPO 908 (SEQ ID NO: 1062), GHPO 921 (SEQ ID NO: 1064), GHPO 923
(SEQ ID NO: 1066), GHPO 926 (SEQ ID NO: 1068), GHPO 933 (SEQ ID NO: 1070), GHPO 939 (SEQ ID NO: 1072), GHPO 940 (SEQ ID NO: 1074), GHPO 943 (SEQ ID NO: 1076), GHPO 951 (SEQ ID NO: 1078), GHPO 961 (SEQ ID NO: 1080), GHPO 965 (SEQ ID NO: 1082), GHPO 990 (SEQ ID NO: 1084), GHPO 991 (SEQ ID NO: 1086), GHPO 998 (SEQ ID NO: 1088),
GHPO 1001 (SEQ ID NO: 1090), GHPO 1005 (SEQ ID NO: 1092), GHPO 1033 (SEQ ID NO: 1094), GHPO 1039 (SEQ ID NO: 1096), GHPO 1041 (SEQ ID NO: 1098), GHPO 1043 (SEQ ID NO: 1100), GHPO 1044 (SEQ ID NO: 1102), GHPO 1051 (SEQ ID NO: 1104), GHPO 1058 (SEQ ID NO: 1106), GHPO 1060 (SEQ ID NO:l 108), GHPO 1075 (SEQ ID NO: 1110), GHPO 1077 (SEQ ID NO:1112), GHPO 1082 (SEQ ID NO:1114), GHPO 1083 (SEQ ID NO:1116), GHPO 1086 (SEQ ID NO: 1118), GHPO 1087 (SEQ ID
NO:1120), GHPO 1090 (SEQ ID NO: 1122), GHPO 1097 (SEQ ID NO: 1124), GHPO 1098 (SEQ ID NO: l 126), GHPO 1103 (SEQ ID NO: 1128), GHPO 1113 (SEQ ID NO:1130), GHPO 1116 (SEQ ID NO:1132), GHPO 1123 (SEQ ID NO: 1134), GHPO 1125 (SEQ ID NO:1136), GHPO 1129 (SEQ ID NO: 1138), GHPO 1130 (SEQ ID NO: 1140), GHPO 1134 (SEQ ID NO: 1142),
GHPO 1161 (SEQ ID NO: 1144), GHPO 1166 (SEQ ID NO: 1146), GHPO 1170 (SEQ ID NO: 1148), GHPO 1175 (SEQ ID NO: 1150), GHPO 1181 (SEQ ID NO: 1152), GHPO 1186 (SEQ ID NO: 1154), GHPO 1188 (SEQ ID NO: 1156), GHPO 1191 (SEQ ID NO:1158), GHPO 1193 (SEQ ID NO: 1160), GHPO 1196 (SEQ ID NO: 1162), GHPO 1204 (SEQ ID NO: 1164), GHPO
1210 (SEQ ID NO: 1166), GHPO 1211 (SEQ ID NO: 1168), GHPO 1216 (SEQ ID NO:1170), GHPO 1218 (SEQ ID NO:1172), GHPO 1220 (SEQ ID NO:l 174), GHPO 1223 (SEQ ID NO: 1176), GHPO 1226 (SEQ ID NO: 1178), GHPO 1240 (SEQ ID NO:1180), GHPO 1246 (SEQ ID NO: 1182), GHPO 1251 (SEQ ID NO:1184), GHPO 1252 (SEQ ID NO:1186), GHPO 1261 (SEQ
ID NO: 1188), GHPO 1265 (SEQ ID NO: 1190), GHPO 1267 (SEQ ID NO: 1192), GHPO 1278 (SEQ ID NO: 1194), GHPO 1282 (SEQ ID NO: 1196), GHPO 1283 (SEQ ID NO:l 198), GHPO 1287 (SEQ ID NO: 1200), GHPO 1292 (SEQ ID NO: 1202), GHPO 1293 (SEQ ID NO: 1204), GHPO 1302 (SEQ ID NO: 1206), GHPO 1309 (SEQ ID NO:1208), GHPO 1317 (SEQ ID
NO:1210), GHPO 1318 (SEQ ID NO:1212), GHPO 1321 (SEQ ID NO:1214), GHPO 1325 (SEQ ID NO:1216), GHPO 1341 (SEQ ID NO:1218), GHPO 1351 (SEQ ID NO: 1220), GHPO 1354 (SEQ ID NO: 1222), GHPO 1363 (SEQ ID NO:1224), GHPO 1371 (SEQ ID NO:1226), GHPO 1381 (SEQ ID NO: 1228), GHPO 1401 (SEQ ID NO: 1230), GHPO 1402 (SEQ ID NO: 1232), GHPO 1403 (SEQ ID NO: 1234), GHPO 1408 (SEQ ID NO: 1236), GHPO 1416 (SEQ ID NO:1238), GHPO 1420 (SEQ ID NO:1240), GHPO 1428 (SEQ
ID NO: 1242), GHPO 1437 (SEQ ID NO: 1244), GHPO 1439 (SEQ ID NO: 1246), GHPO 1460 (SEQ ID NO: 1248), GHPO 1463 (SEQ ID NO: 1250), GHPO 1472 (SEQ ID NO: 1252), GHPO 1474 (SEQ ID NO: 1254), GHPO 1484 (SEQ ID NO: 1256), GHPO 1489 (SEQ ID NO: 1258), GHPO 1494 (SEQ ID NO: 1260), GHPO 1495 (SEQ ID NO: 1262), GHPO 1498 (SEQ ID
NO: 1264), GHPO 1499 (SEQ ID NO: 1266), GHPO 1500 (SEQ ID NO: 1268), GHPO 1503 (SEQ ID NO:1270), GHPO 1504 (SEQ ID NO: 1272), GHPO 1510 (SEQ ID NO: 1274), GHPO 1518 (SEQ ID NO: 1276), GHPO 1533 (SEQ ID NO: 1278), GHPO 1541 (SEQ ID NO: 1280), GHPO 1544 (SEQ ID NO: 1282), GHPO 1548 (SEQ ID NO: 1284), GHPO 1565 (SEQ ID NO:1286),
GHPO 1575 (SEQ ID NO:1288), GHPO 1582 (SEQ ID NO:1290), GHPO 1595 (SEQ ID NO:1292), GHPO 1597 (SEQ ID NO:1294), GHPO 1599 (SEQ ID NO:1296), GHPO 1601 (SEQ ID NO: 1298), GHPO 1609 (SEQ ID NO:1300), GHPO 1613 (SEQ ID NO: 1302), GHPO 1614 (SEQ ID NO: 1304), GHPO 1626 (SEQ ID NO: 1306), GHPO 1628 (SEQ ID NO: 1308), GHPO
1639 (SEQ ID NO:1310), GHPO 1640 (SEQ ID NO: 1312), GHPO 1641 (SEQ ID NO: 1314), GHPO 1646 (SEQ ID NO: 1316), GHPO 1662 (SEQ ID NO: 1318), GHPO 1667 (SEQ ID NO: 1320), GHPO 1668 (SEQ ID NO: 1322), GHPO 1670 (SEQ ID NO: 1324), GHPO 1671 (SEQ ID NO: 1326), GHPO 1672 (SEQ ID NO: 1328), GHPO 1678 (SEQ ID NO: 1330), GHPO 1684 (SEQ
ID NO: 1332), GHPO 1695 (SEQ ID NO:1334), GHPO 1697 (SEQ ID NO:1336), GHPO 1701 (SEQ ID NO:1338), GHPO 1719 (SEQ ID NO:1340), GHPO 1723 (SEQ ID NO:1342), GHPO 1732 (SEQ ID NO:1344), GHPO 1739 (SEQ ID NO: 1346), GHPO 1741 (SEQ ID NO:1348), GHPO 1747 (SEQ ID NO:1350), GHPO 1749 (SEQ ID NO:1352), GHPO 1750 (SEQ ID NO: 1354), GHPO 1751 (SEQ ID NO:1356), GHPO 1755 (SEQ ID NO:1358), GHPO 1771 (SEQ ID NO:1360), GHPO 1786 (SEQ ID NO: 1362), and GHPO
1789 (SEQ ID NO: 1364), have been identified in the H. pylori genome. These polypeptides can be used, for example, in vaccination methods for preventing or treating Helicobacter infection. For example, GHPO 1320, GHPO 523, GHPO 792, GHPO 639, GHPO 669, GHPO 992, GHPO 576, GHPO 109, GHPO 129, GHPO 234, GHPO 257, GHPO 525, GHPO 626, GHPO 1034,
GHPO 1275, GHPO 1308, GHPO 1600, GHPO 1615, GHPO 536, GHPO 66, GHPO 1363, GHPO 1595, and GHPO 1166 have been shown to be protective antigens that can be used in methods for preventing Helicobacter infection. By "protective antigen" is meant an antigen that is capable of reducing the infection level after challenge, relative to a positive control. Absolute protection from infection, although included in the invention, is not required.
Some ofthe new polypeptides are secreted polypeptides that can be produced in their mature forms (i.e., as polypeptides that have been exported through class II or class III secretion pathways) or as precursors that include signal peptides, which can be removed in the course of excretion/secretion by cleavage at the N-terminal end ofthe mature form. (The cleavage site is located at the C-terminal end ofthe signal peptide, adjacent to the mature form.)
According to a first aspect ofthe invention, there are provided isolated polynucleotides that encode the precursor and mature forms ofthe Helicobacter
GHPO proteins listed above. Examples of such polynucleotides are those encoding GHPO 35 (SEQ ID NO:l), GHPO 55 (SEQ ID NO:3), GHPO 78 (SEQ ID NO:5), GHPO 89 (SEQ ID NO:7), GHPO 129 (SEQ ID NO:9), GHPO 541 (SEQ ID NO: 11), GHPO 607 (SEQ ID NO: 13), GHPO 635 (SEQ ID NO: 15), GHPO 701 (SEQ ID NO: 17), GHPO 712 (SEQ ID NO:19), GHPO 761 (SEQ ID NO:21), GHPO 838 (SEQ ID NO:23), GHPO 1034 (SEQ ID NO:25), GHPO 1085 (SEQ ID NO:27), GHPO 1213 (SEQ ID NO:29), GHPO
1255 (SEQ ID NO:31), GHPO 1308 (SEQ ID NO:33), GHPO 1389 (SEQ ID NO:35), GHPO 1706 (SEQ ID NO:37), GHPO 234 (SEQ ID NO:39), GHPO 314 (SEQ ID NO:41), GHPO 510 (SEQ ID NO:43), GHPO 603 (SEQ ID NO:45), GHPO 937 (SEQ ID NO:47), GHPO 1027 (SEQ ID NO:49), GHPO 1099 (SEQ ID NO:51), GHPO 1151 (SEQ ID NO:53), GHPO 1275 (SEQ ID
NO:55), GHPO 1365 (SEQ ID NO:57), GHPO 1578 (SEQ ID NO:59), GHPO 22 (SEQ ID NO:61), GHPO 58 (SEQ ID NO:63), GHPO 200 (SEQ ID NO:65), GHPO 558 (SEQ ID NO:67), GHPO 563 (SEQ ID NO:69), GHPO 695 (SEQ ID NO:71), GHPO 699 (SEQ ID NO:73), GHPO 702 (SEQ ID NO:75), GHPO 709 (SEQ ID NO:77), GHPO 741 (SEQ ID NO:79), GHPO 762 (SEQ ID
NO:81), GHPO 827 (SEQ ID NO:83), GHPO 852 (SEQ ID NO:85), GHPO 1013 (SEQ ID NO: 87), GHPO 1020 (SEQ ID NO: 89), GHPO 1031 (SEQ ID NO:91), GHPO 1052 (SEQ ID NO:93), GHPO 1127 (SEQ ID NO:95), GHPO 1149 (SEQ ID NO:97), GHPO 1176 (SEQ ID NO:99), GHPO 1250 (SEQ ID NO: 101), GHPO 1312 (SEQ ID NO:103), GHPO 1358 (SEQ ID NO: 105),
GHPO 1490 (SEQ ID NO: 107), GHPO 1559 (SEQ ID NO: 109), GHPO 1651 (SEQ ID NO: 111), GHPO 1726 (SEQ ID NO: 113), GHPO 1780 (SEQ ID NO:l 15), GHPO 895 (SEQ ID NO:l 17), GHPO 1447 (SEQ ID NO:l 19), GHPO 28 (SEQ ID NO:121), GHPO 86 (SEQ ID NO:123), GHPO 155 (SEQ ID NO: 125), GHPO 157 (SEQ ID NO: 127), GHPO 237 (SEQ ID NO: 129),
GHPO 290 (SEQ ID NO: 131), GHPO 293 (SEQ ID NO: 133), GHPO 335 (SEQ ID NO: 135), GHPO 374 (SEQ ID NO: 137), GHPO 442 (SEQ ID NO: 139), GHPO 480 (SEQ ID NO: 141), GHPO 523 (SEQ ID NO: 143), GHPO 610 (SEQ ID NO: 145), GHPO 675 (SEQ ID NO: 147), GHPO 690 (SEQ ID NO: 149), GHPO 829 (SEQ ID NO: 151), GHPO 850 (SEQ ID NO: 153), GHPO 876 (SEQ ID NO: 155), GHPO 984 (SEQ ID NO: 157), GHPO 989 (SEQ ID NO: 159), GHPO 1111 (SEQ ID NO:161), GHPO 1145 (SEQ ID NO:163),
GHPO 1256 (SEQ ID NO: 165), GHPO 1264 (SEQ ID NO: 167), GHPO 1316 (SEQ ID NO: 169), GHPO 1368 (SEQ ID NO: 171), GHPO 1442 (SEQ ID NO: 173), GHPO 1506 (SEQ ID NO: 175), GHPO 1543 (SEQ ID NO: 177), GHPO 1574 (SEQ ID NO: 179), GHPO 1627 (SEQ ID NO: 181), GHPO 1657 (SEQ ID NO: 183), GHPO 1664 (SEQ ID NO: 185), GHPO 1694 (SEQ ID
NO:187), GHPO 1704 (SEQ ID NO:189), GHPO 1763 (SEQ ID NO:191), GHPO 616 (SEQ ID NO:193), GHPO 76 (SEQ ID NO:195), GHPO 109 (SEQ ID NO:197), GHPO 163 (SEQ ID NO: 199), GHPO 169 (SEQ ID NO:201), GHPO 208 (SEQ ID NO:203), GHPO 219 (SEQ ID NO:205), GHPO 445 (SEQ ID NO:207), GHPO 479 (SEQ ID NO:209), GHPO 525 (SEQ ID
NO:211), GHPO 535 (SEQ ID NO:213), GHPO 731 (SEQ ID NO:215), GHPO 836 (SEQ ID NO:217), GHPO 879 (SEQ ID NO:219), GHPO 881 (SEQ ID NO:221), GHPO 886 (SEQ ID NO:223), GHPO 893 (SEQ ID NO:225), GHPO 894 (SEQ ID NO:227), GHPO 976 (SEQ ID NO:229), GHPO 1011 (SEQ ID NO:231), GHPO 1024 (SEQ ID NO:233), GHPO 1084 (SEQ ID NO:235),
GHPO 1329 (SEQ ID NO:237), GHPO 1330 (SEQ ID NO:239), GHPO 1346 (SEQ ID NO:241), GHPO 1360 (SEQ ID NO:243), GHPO 1388 (SEQ ID NO:245), GHPO 1411 (SEQ ID NO:247), GHPO 1419 (SEQ ID NO:249), GHPO 1446 (SEQ ID NO:251), GHPO 1469 (SEQ ID NO:253), GHPO 1501 (SEQ ID NO:255), GHPO 1505 (SEQ ID NO:257), GHPO 1522 (SEQ ID
NO:259), GHPO 1525 (SEQ ID NO:261), GHPO 1615 (SEQ ID NO:263), GHPO 1689 (SEQ ID NO:265), GHPO 1733 (SEQ ID NO:267), GHPO 18 (SEQ ID NO:269), GHPO 139 (SEQ ID NO:271), GHPO 142 (SEQ ID NO:273), GHPO 250 (SEQ ID NO:275), GHPO 257 (SEQ ID NO:277), GHPO 325 (SEQ ID NO:279), GHPO 355 (SEQ ID NO:281), GHPO 357 (SEQ ID NO:283), GHPO 454 (SEQ ID NO:285), GHPO 475 (SEQ ID NO:287), GHPO 515 (SEQ ID NO:289), GHPO 527 (SEQ ID NO:291), GHPO 551 (SEQ ID
NO:293), GHPO 602 (SEQ ID NO:295), GHPO 626 (SEQ ID NO:297), GHPO 646 (SEQ ID NO:299), GHPO 653 (SEQ ID NO:301), GHPO 655 (SEQ ID NO:303), GHPO 670 (SEQ ID NO:305), GHPO 739 (SEQ ID NO:307), GHPO 798 (SEQ ID NO:309), GHPO 1102 (SEQ ID NO:311), GHPO 1114 (SEQ ID NO:313), GHPO 1152 (SEQ ID NO:315), GHPO 1272 (SEQ ID NO:317),
GHPO 1345 (SEQ ID NO:319), GHPO 1377 (SEQ ID NO:321), GHPO 1424 (SEQ ID NO:323), GHPO 1430 (SEQ ID NO:325), GHPO 1502 (SEQ ID NO:327), GHPO 1600 (SEQ ID NO:329), GHPO 1714 (SEQ ID NO:331), GHPO 359 (SEQ ID NO:333), GHPO 678 (SEQ ID NO:335), GHPO 708 (SEQ ID NO:337), GHPO 759 (SEQ ID NO:339), GHPO 847 (SEQ ID
NO:341), GHPO 1050 (SEQ ID NO:343), GHPO 1101 (SEQ ID NO:345), GHPO 1120 (SEQ ID NO:347), GHPO 1138 (SEQ ID NO:349), GHPO 1310 (SEQ ID NO:351), GHPO 1320 (SEQ ID NO:353), GHPO 1375 (SEQ ID NO:355), GHPO 1432 (SEQ ID NO:357), GHPO 21 (SEQ ID NO:359), GHPO 282 (SEQ ID NO:361), GHPO 1089 (SEQ ID NO:363), GHPO 1141 (SEQ ID
NO:365), GHPO 1280 (SEQ ID NO:367), GHPO 1608 (SEQ ID NO:369), GHPO 15 (SEQ ID NO:371), GHPO 16 (SEQ ID NO:373), GHPO 36 (SEQ ID NO:375), GHPO 38 (SEQ ID NO:377), GHPO 52 (SEQ ID NO:379), GHPO 57 (SEQ ID NO:381), GHPO 64 (SEQ ID NO:383), GHPO 79 (SEQ ID NO:385), GHPO 84 (SEQ ID NO:387), GHPO 86 (SEQ ID NO:389), GHPO
99 (SEQ ID NO:391), GHPO 106 (SEQ ID NO:393), GHPO 118 (SEQ ID NO:395), GHPO 122 (SEQ ID NO:397), GHPO 128 (SEQ ID NO:399), GHPO 138 (SEQ ID NO:401), GHPO 153 (SEQ ID NO:403), GHPO 160 (SEQ ID NO:405), GHPO 168 (SEQ ID NO:407), GHPO 179 (SEQ ID NO:409), GHPO 189 (SEQ ID NO:411), GHPO 229 (SEQ ID NO:413), GHPO 243 (SEQ ID NO:415), GHPO 244 (SEQ ID NO:417), GHPO 251 (SEQ ID NO:419), GHPO 267 (SEQ ID NO:421), GHPO 269 (SEQ ID NO:423), GHPO 279 (SEQ ID
NO:425), GHPO 284 (SEQ ID NO:427), GHPO 296 (SEQ ID NO:429), GHPO 300 (SEQ ID NO:431), GHPO 305 (SEQ ID NO:433), GHPO 319 (SEQ ID NO:435), GHPO 330 (SEQ ID NO:437), GHPO 340 (SEQ ID NO:439), GHPO 342 (SEQ ID NO:441), GHPO 344 (SEQ ID NO:443), GHPO 358 (SEQ ID NO:445), GHPO 373 (SEQ ID NO:447), GHPO 382 (SEQ ID NO:449), GHPO
384 (SEQ ID NO:451), GHPO 398 (SEQ ID NO:453), GHPO 409 (SEQ ID NO:455), GHPO 422 (SEQ ID NO:457), GHPO 430 (SEQ ID NO:459), GHPO 446 (SEQ ID NO:461), GHPO 447 (SEQ ID NO:463), GHPO 450 (SEQ ID NO:465), GHPO 451 (SEQ ID NO:467), GHPO 452 (SEQ ID NO:469), GHPO 456 (SEQ ID NO:471), GHPO 461 (SEQ ID NO:473), GHPO 476 (SEQ ID
NO:475), GHPO 478 (SEQ ID NO:477), GHPO 491 (SEQ ID NO:479), GHPO 511 (SEQ ID NO:481), GHPO 519 (SEQ ID NO:483), GHPO 526 (SEQ ID NO:485), GHPO 534 (SEQ ID NO:487), GHPO 536 (SEQ ID NO:489), GHPO 542 (SEQ ID NO:491), GHPO 544 (SEQ ID NO:493), GHPO 576 (SEQ ID NO:495), GHPO 578 (SEQ ID NO:497), GHPO 580 (SEQ ID NO:499), GHPO
585 (SEQ ID NO:501), GHPO 599 (SEQ ID NO:503), GHPO 639 (SEQ ID NO:505), GHPO 642 (SEQ ID NO:507), GHPO 647 (SEQ ID NO:509), GHPO 654 (SEQ ID NO:511), GHPO 669 (SEQ ID NO:513), GHPO 710 (SEQ ID NO:515), GHPO 713 (SEQ ID NO:517), GHPO 716 (SEQ ID NO:519), GHPO 718 (SEQ ID NO: 521), GHPO 726 (SEQ ID NO: 523), GHPO 734 (SEQ ID
NO:525), GHPO 740 (SEQ ID NO:527), GHPO 770 (SEQ ID NO:529), GHPO 782 (SEQ ID NO:531), GHPO 786 (SEQ ID NO:533), GHPO 792 (SEQ ID NO:535), GHPO 797 (SEQ ID NO:537), GHPO 816 (SEQ ID NO:539), GHPO 828 (SEQ ID NO:541), GHPO 839 (SEQ ID NO:543), GHPO 840 (SEQ ID NO:545), GHPO 842 (SEQ ID NO:547), GHPO 885 (SEQ ID NO:549), GHPO 889 (SEQ ID NO:551), GHPO 903 (SEQ ID NO:553), GHPO 912 (SEQ ID NO:555), GHPO 946 (SEQ ID NO:557), GHPO 958 (SEQ ID NO:559), GHPO
968 (SEQ ID NO:561), GHPO 987 (SEQ ID NO:563), GHPO 992 (SEQ ID NO:565), GHPO 996 (SEQ ID NO:567), GHPO 997 (SEQ ID NO:569), GHPO 1002 (SEQ ID NO:571), GHPO 1026 (SEQ ID NO:573), GHPO 1028 (SEQ ID NO:575), GHPO 1034 (SEQ ID NO:577), GHPO 1038 (SEQ ID NO:579), GHPO 1059 (SEQ ID NO:581), GHPO 1065 (SEQ ID NO:583), GHPO 1072
(SEQ ID NO:585), GHPO 1073 (SEQ ID NO:587), GHPO 1088 (SEQ ID NO:589), GHPO 1091 (SEQ ID NO:591), GHPO 1105 (SEQ ID NO:593), GHPO 1115 (SEQ ID NO:595), GHPO 1159 (SEQ ID NO:597), GHPO 1177 (SEQ ID NO:599), GHPO 1187 (SEQ ID NO:601), GHPO 1192 (SEQ ID NO:603), GHPO 1195 (SEQ ID NO:605), GHPO 1224 (SEQ ID NO:607),
GHPO 1225 (SEQ ID NO:609), GHPO 1228 (SEQ ID NO:611), GHPO 1229 (SEQ ID NO:613), GHPO 1231 (SEQ ID NO:615), GHPO 1236 (SEQ ID NO:617), GHPO 1242 (SEQ ID NO:619), GHPO 1248 (SEQ ID NO:621), GHPO 1270 (SEQ ID NO:623), GHPO 1271 (SEQ ID NO:625), GHPO 1298 (SEQ ID NO:627), GHPO 1301 (SEQ ID NO:629), GHPO 1304 (SEQ ID
NO:631), GHPO 1315 (SEQ ID NO:633), GHPO 1319 (SEQ ID NO:635), GHPO 1323 (SEQ ID NO:637), GHPO 1331 (SEQ ID NO:639), GHPO 1332 (SEQ ID NO:641), GHPO 1347 (SEQ ID NO:643), GHPO 1373 (SEQ ID NO:645), GHPO 1376 (SEQ ID NO:647), GHPO 1380 (SEQ ID NO:649), GHPO 1394 (SEQ ID NO:651), GHPO 1407 (SEQ ID NO:653), GHPO 1415
(SEQ ID NO:655), GHPO 1425 (SEQ ID NO:657), GHPO 1427 (SEQ ID NO:659), GHPO 1444 (SEQ ID NO:661), GHPO 1449 (SEQ ID NO:663), GHPO 1465 (SEQ ID NO:665), GHPO 1475 (SEQ ID NO:667), GHPO 1479 (SEQ ID NO:669), GHPO 1483 (SEQ ID NO:671), GHPO 1488 (SEQ ID NO:673), GHPO 1496 (SEQ ID NO:675), GHPO 1524 (SEQ ID NO:677), GHPO 1536 (SEQ ID NO:679), GHPO 1539 (SEQ ID NO:681), GHPO 1540 (SEQ ID NO:683), GHPO 1542 (SEQ ID NO:685), GHPO 1555 (SEQ ID
NO:687), GHPO 1560 (SEQ ID NO:689), GHPO 1564 (SEQ ID NO:691), GHPO 1570 (SEQ ID NO:693), GHPO 1588 (SEQ ID NO:695), GHPO 1604 (SEQ ID NO:697), GHPO 1605 (SEQ ID NO:699), GHPO 1619 (SEQ ID NO:701), GHPO 1629 (SEQ ID NO:703), GHPO 1642 (SEQ ID NO:705), GHPO 1654 (SEQ ID NO:707), GHPO 1661 (SEQ ID NO:709), GHPO 1673
(SEQ ID NO:711), GHPO 1687 (SEQ ID NO:713), GHPO 1692 (SEQ ID NO:715), GHPO 1693 (SEQ ID NO:717), GHPO 1699 (SEQ ID NO:719), GHPO 1738 (SEQ ID NO:721), GHPO 1745 (SEQ ID NO:723), GHPO 1746 (SEQ ID NO:725), GHPO 1754 (SEQ ID NO:727), GHPO 1792 (SEQ ID NO:729), GHPO 1795 (SEQ ID NO:731), GHPO 1796 (SEQ ID NO:733),
GHPO 7 (SEQ ID NO:735), GHPO 8 (SEQ ID NO:737), GHPO 9 (SEQ ID NO:739), GHPO 10 (SEQ ID NO:741), GHPO 12 (SEQ ID NO:743), GHPO 25 (SEQ ID NO:745), GHPO 27 (SEQ ID NO:747), GHPO 29 (SEQ ID NO:749), GHPO 30 (SEQ ID NO:751), GHPO 37 (SEQ ID NO:753), GHPO 49 (SEQ ID NO:755), GHPO 51 (SEQ ID NO:757), GHPO 54 (SEQ ID
NO:759), GHPO 65 (SEQ ID NO:761), GHPO 66 (SEQ ID NO:763), GHPO 68 (SEQ ID NO:765), GHPO 70 (SEQ ID NO:767), GHPO 77 (SEQ ID NO:769), GHPO 83 (SEQ ID NO:771), GHPO 85 (SEQ ID NO:773), GHPO 87 (SEQ ID NO:775), GHPO 91 (SEQ ID NO:777), GHPO 92 (SEQ ID NO:779), GHPO 96 (SEQ ID NO:781), GHPO 97 (SEQ ID NO:783), GHPO
111 (SEQ ID NO:785), GHPO 115 (SEQ ID NO:787), GHPO 117 (SEQ ID NO:789), GHPO 123 (SEQ ID NO:791), GHPO 124 (SEQ ID NO:793), GHPO 126 (SEQ ID NO:795), GHPO 127 (SEQ ID NO:797), GHPO 128 (SEQ ID NO:799), GHPO 131 (SEQ ID NO:801), GHPO 133 (SEQ ID NO:803), GHPO 140 (SEQ ID NO:805), GHPO 141 (SEQ ID NO:807), GHPO 145 (SEQ ID NO:809), GHPO 147 (SEQ ID NO:811), GHPO 166 (SEQ ID NO:813), GHPO 181 (SEQ ID NO:815), GHPO 187 (SEQ ID NO:817), GHPO 188 (SEQ ID
NO:819), GHPO 192 (SEQ ID NO:821), GHPO 202 (SEQ ID NO:823), GHPO 204 (SEQ ID NO:825), GHPO 205 (SEQ ID NO:827), GHPO 212 (SEQ ID NO:829), GHPO 218 (SEQ ID NO:831), GHPO 226 (SEQ ID NO:833), GHPO 231 (SEQ ID NO:835), GHPO 236 (SEQ ID NO:837), GHPO 239 (SEQ ID NO:839), GHPO 245 (SEQ ID NO:841), GHPO 246 (SEQ ID NO:843), GHPO
248 (SEQ ID NO:845), GHPO 253 (SEQ ID NO:847), GHPO 265 (SEQ ID NO:849), GHPO 266 (SEQ ID NO:851), GHPO 271 (SEQ ID NO:853), GHPO 272 (SEQ ID NO:855), GHPO 286 (SEQ ID NO:857), GHPO 291 (SEQ ID NO:859), GHPO 292 (SEQ ID NO:861), GHPO 297 (SEQ ID NO:863), GHPO 304 (SEQ ID NO:865), GHPO 307 (SEQ ID NO:867), GHPO 324 (SEQ ID
NO:869), GHPO 326 (SEQ ID NO:871), GHPO 331 (SEQ ID NO:873), GHPO 343 (SEQ ID NO: 875), GHPO 345 (SEQ ID NO: 877), GHPO 346 (SEQ ID NO:879), GHPO 352 (SEQ ID NO:881), GHPO 355 (SEQ ID NO:883), GHPO 363 (SEQ ID NO:885), GHPO 369 (SEQ ID NO:887), GHPO 376 (SEQ ID NO:889), GHPO 378 (SEQ ID NO:891), GHPO 388 (SEQ ID NO:893), GHPO
396 (SEQ ID NO:895), GHPO 403 (SEQ ID NO:897), GHPO 410 (SEQ ID NO:899), GHPO 415 (SEQ ID NO:901), GHPO 421 (SEQ ID NO:903), GHPO 439 (SEQ ID NO:905), GHPO 441 (SEQ ID NO:907), GHPO 443 (SEQ ID NO:909), GHPO 453 (SEQ ID NO:911), GHPO 455 (SEQ ID NO:913), GHPO 464 (SEQ ID NO:915), GHPO 467 (SEQ ID NO:917), GHPO 468 (SEQ ID
NO:919), GHPO 470 (SEQ ID NO:921), GHPO 486 (SEQ ID NO:923), GHPO 487 (SEQ ID NO:925), GHPO 488 (SEQ ID NO:927), GHPO 489 (SEQ ID NO:929), GHPO 498 (SEQ ID NO:931), GHPO 501 (SEQ ID NO:933), GHPO 504 (SEQ ID NO:935), GHPO 512 (SEQ ID NO:937), GHPO 517 (SEQ ID NO:939), GHPO 520 (SEQ ID NO:941), GHPO 528 (SEQ ID NO:943), GHPO 530 (SEQ ID NO:945), GHPO 532 (SEQ ID NO:947), GHPO 548 (SEQ ID NO:949), GHPO 561 (SEQ ID NO:951), GHPO 564 (SEQ ID NO:953), GHPO
572 (SEQ ID NO:955), GHPO 573 (SEQ ID NO:957), GHPO 574 (SEQ ID NO:959), GHPO 577 (SEQ ID NO:961), GHPO 579 (SEQ ID NO:963), GHPO 583 (SEQ ID NO:965), GHPO 588 (SEQ ID NO:967), GHPO 593 (SEQ ID NO:969), GHPO 597 (SEQ ID NO:971), GHPO 598 (SEQ ID NO:973), GHPO 604 (SEQ ID NO:975), GHPO 606 (SEQ ID NO:977), GHPO 611 (SEQ ID
NO:979), GHPO 612 (SEQ ID NO:981), GHPO 615 (SEQ ID NO:983), GHPO 632 (SEQ ID NO:985), GHPO 633 (SEQ ID NO:987), GHPO 637 (SEQ ID NO:989), GHPO 651 (SEQ ID NO:991), GHPO 663 (SEQ ID NO:993), GHPO 686 (SEQ ID NO:995), GHPO 693 (SEQ ID NO:997), GHPO 698 (SEQ ID NO:999), GHPO 703 (SEQ ID NO: 1001), GHPO 704 (SEQ ID NO: 1003),
GHPO 705 (SEQ ID NO: 1005), GHPO 707 (SEQ ID NO: 1007), GHPO 721 (SEQ ID NO: 1009), GHPO 727 (SEQ ID NO: 1011), GHPO 728 (SEQ ID NO: 1013), GHPO 733 (SEQ ID NO: 1015), GHPO 758 (SEQ ID NO: 1017), GHPO 763 (SEQ ID NO: 1019), GHPO 771 (SEQ ID NO: 1021), GHPO 774 (SEQ ID NO: 1023), GHPO 776 (SEQ ID NO: 1025), GHPO 783 (SEQ ID
NO: 1027), GHPO 800 (SEQ ID NO: 1029), GHPO 806 (SEQ ID NO: 1031), GHPO 807 (SEQ ID NO: 1033), GHPO 808 (SEQ ID NO: 1035), GHPO 809 (SEQ ID NO: 1037), GHPO 811 (SEQ ID NO: 1039), GHPO 815 (SEQ ID NO:1041), GHPO 819 (SEQ ID NO:1043), GHPO 841 (SEQ ID NO: 1045), GHPO 843 (SEQ ID NO: 1047), GHPO 846 (SEQ ID NO: 1049), GHPO 875
(SEQ ID NO:1051), GHPO 892 (SEQ ID NO:1053), GHPO 902 (SEQ ID NO:1055), GHPO 904 (SEQ ID NO:1057), GHPO 906 (SEQ ID NO:1059), GHPO 908 (SEQ ID NO: 1061), GHPO 921 (SEQ ID NO: 1063), GHPO 923 (SEQ ID NO: 1065), GHPO 926 (SEQ ID NO: 1067), GHPO 933 (SEQ ID NO: 1069), GHPO 939 (SEQ ID NO: 1071), GHPO 940 (SEQ ID NO: 1073), GHPO 943 (SEQ ID NO:1075), GHPO 951 (SEQ ID NO:1077), GHPO 961 (SEQ ID NO: 1079), GHPO 965 (SEQ ID NO:1081), GHPO 990 (SEQ ID
NO:1083), GHPO 991 (SEQ ID NO:1085), GHPO 998 (SEQ ID NO:1087), GHPO 1001 (SEQ ID NO: 1089), GHPO 1005 (SEQ ID NO: 1091), GHPO 1033 (SEQ ID NO: 1093), GHPO 1039 (SEQ ID NO: 1095), GHPO 1041 (SEQ ID NO: 1097), GHPO 1043 (SEQ ID NO: 1099), GHPO 1044 (SEQ ID NO: 1101), GHPO 1051 (SEQ ID NO: 1103), GHPO 1058 (SEQ ID NO: 1105),
GHPO 1060 (SEQ ID NO:1107), GHPO 1075 (SEQ ID NO:1109), GHPO 1077 (SEQ ID NO:l l l l), GHPO 1082 (SEQ ID NO:1113), GHPO 1083 (SEQ ID NO:1115), GHPO 1086 (SEQ ID NO:1117), GHPO 1087 (SEQ ID NO: 1119), GHPO 1090 (SEQ ID NO: 1121), GHPO 1097 (SEQ ID NO: 1123), GHPO 1098 (SEQ ID NO: 1125), GHPO 1103 (SEQ ID NO: 1127), GHPO
1113 (SEQ ID NO: 1129), GHPO 1116 (SEQ ID NO:1131), GHPO 1123 (SEQ ID NO:1133), GHPO 1125 (SEQ ID NO:1135), GHPO 1129 (SEQ ID NO:1137), GHPO 1130 (SEQ ID NO:1139), GHPO 1134 (SEQ ID NO:1141), GHPO 1161 (SEQ ID NO: 1143), GHPO 1166 (SEQ ID NO: 1145), GHPO 1170 (SEQ ID NO: 1147), GHPO 1175 (SEQ ID NO: 1149), GHPO 1181 (SEQ
ID NO: 1151), GHPO 1186 (SEQ ID NO: 1153), GHPO 1188 (SEQ ID NO:l 155), GHPO 1191 (SEQ ID NO: 1157), GHPO 1193 (SEQ ID NO: 1159), GHPO 1196 (SEQ ID NO: 1161), GHPO 1204 (SEQ ID NO: 1163), GHPO 1210 (SEQ ID NO:1165), GHPO 1211 (SEQ ID NO:1167), GHPO 1216 (SEQ ID NO:1169), GHPO 1218 (SEQ ID NO:l 171), GHPO 1220 (SEQ ID
NO:l 173), GHPO 1223 (SEQ ID NO: 1175), GHPO 1226 (SEQ ID NO: 1177), GHPO 1240 (SEQ ID NO: 1179), GHPO 1246 (SEQ ID NO: 1181), GHPO 1251 (SEQ ID NO:l 183), GHPO 1252 (SEQ ID NO: 1185), GHPO 1261 (SEQ ID NO: 1187), GHPO 1265 (SEQ ID NO: 1189), GHPO 1267 (SEQ ID NO:l 191), GHPO 1278 (SEQ ID NO: 1193), GHPO 1282 (SEQ ID NO: 1195), GHPO 1283 (SEQ ID NO: 1197), GHPO 1287 (SEQ ID NO: 1199), GHPO 1292 (SEQ ID NO: 1201), GHPO 1293 (SEQ ID NO: 1203), GHPO 1302 (SEQ
ID NO:1205), GHPO 1309 (SEQ ID NO:1207), GHPO 1317 (SEQ ID NO:1209), GHPO 1318 (SEQ ID NO: 1211), GHPO 1321 (SEQ ID NO: 1213), GHPO 1325 (SEQ ID NO:1215), GHPO 1341 (SEQ ID NO:1217), GHPO 1351 (SEQ ID NO:1219), GHPO 1354 (SEQ ID NO:1221), GHPO 1363 (SEQ ID NO: 1223), GHPO 1371 (SEQ ID NO: 1225), GHPO 1381 (SEQ ID
NO:1227), GHPO 1401 (SEQ ID NO:1229), GHPO 1402 (SEQ ID NO: 1231), GHPO 1403 (SEQ ID NO: 1233), GHPO 1408 (SEQ ID NO: 1235), GHPO 1416 (SEQ ID NO: 1237), GHPO 1420 (SEQ ID NO: 1239), GHPO 1428 (SEQ ID NO: 1241), GHPO 1437 (SEQ ID NO: 1243), GHPO 1439 (SEQ ID NO: 1245), GHPO 1460 (SEQ ID NO: 1247), GHPO 1463 (SEQ ID NO: 1249),
GHPO 1472 (SEQ ID NO: 1251), GHPO 1474 (SEQ ID NO: 1253), GHPO 1484 (SEQ ID NO: 1255), GHPO 1489 (SEQ ID NO: 1257), GHPO 1494 (SEQ ID NO: 1259), GHPO 1495 (SEQ ID NO: 1261), GHPO 1498 (SEQ ID NO: 1263), GHPO 1499 (SEQ ID NO: 1265), GHPO 1500 (SEQ ID NO: 1267), GHPO 1503 (SEQ ID NO:1269), GHPO 1504 (SEQ ID NO:1271), GHPO
1510 (SEQ ID NO:1273), GHPO 1518 (SEQ ID NO: 1275), GHPO 1533 (SEQ ID NO:1277), GHPO 1541 (SEQ ID NO:1279), GHPO 1544 (SEQ ID NO:1281), GHPO 1548 (SEQ ID NO:1283), GHPO 1565 (SEQ ID NO:1285), GHPO 1575 (SEQ ID NO: 1287), GHPO 1582 (SEQ ID NO: 1289), GHPO 1595 (SEQ ID NO:1291), GHPO 1597 (SEQ ID NO:1293), GHPO 1599 (SEQ
ID NO: 1295), GHPO 1601 (SEQ ID NO: 1297), GHPO 1609 (SEQ ID NO:1299), GHPO 1613 (SEQ ID NO:1301), GHPO 1614 (SEQ ID NO:1303), GHPO 1626 (SEQ ID NO: 1305), GHPO 1628 (SEQ ID NO: 1307), GHPO 1639 (SEQ ID NO:1309), GHPO 1640 (SEQ ID NO:1311), GHPO 1641 (SEQ ID NO: 1313), GHPO 1646 (SEQ ID NO: 1315), GHPO 1662 (SEQ ID NO:1317), GHPO 1667 (SEQ ID NO: 1319), GHPO 1668 (SEQ ID NO: 1321), GHPO 1670 (SEQ ID NO: 1323), GHPO 1671 (SEQ ID NO: 1325), GHPO
1672 (SEQ ID NO: 1327), GHPO 1678 (SEQ ID NO: 1329), GHPO 1684 (SEQ ID NO:1331), GHPO 1695 (SEQ ID NO:1333), GHPO 1697 (SEQ ID NO:1335), GHPO 1701 (SEQ ID NO:1337), GHPO 1719 (SEQ ID NO: 1339), GHPO 1723 (SEQ ID NO:1341), GHPO 1732 (SEQ ID NO:1343), GHPO 1739 (SEQ ID NO:1345), GHPO 1741 (SEQ ID NO:1347), GHPO 1747 (SEQ
ID NO: 1349), GHPO 1749 (SEQ ID NO:1351), GHPO 1750 (SEQ ID NO: 1353), GHPO 1751 (SEQ ID NO:1355), GHPO 1755 (SEQ ID NO:1357), GHPO 1771 (SEQ ID NO:1359), GHPO 1786 (SEQ ID NO:1361), and GHPO 1789 (SEQ ID NO:1363). An isolated polynucleotide ofthe invention encodes (i) a polypeptide having an amino acid sequence that is homologous to a Helicobacter amino acid sequence of a polypeptide, the Helicobacter amino acid sequence being selected from the group consisting ofthe amino acid sequences shown in the sequence listing (even numbers, up to SEQ ID NO: 1364), or (ii) a derivative of the polypeptide.
In addition to the full-length polypeptides encoded by the polynucleotides ofthe invention, as set forth above, polynucleotides included in the invention can also encode polypeptides that lack signal sequences, as well as other polypeptide or peptide fragments ofthe full-length polypeptides. The term "isolated polynucleotide" is defined as a polynucleotide that is removed from the environment in which it naturally occurs. For example, a naturally-occurring DNA molecule present in the genome of a living bacteria or as part of a gene bank is not isolated, but the same molecule, separated from the remaining part ofthe bacterial genome, as a result of, e.g., a cloning event (amplification), is "isolated." Typically, an isolated DNA molecule is free from DNA regions (e.g., coding regions) with which it is immediately contiguous, at the 5 ' or 3' ends, in the naturally occurring genome. Such isolated polynucleotides can be part of a vector or a composition and still be isolated, as such a vector or composition is not part of its natural environment.
A polynucleotide ofthe invention can consist of RNA or DNA (e.g., cDNA, genomic DNA, or synthetic DNA), or modifications or combinations of RNA or DNA. The polynucleotide can be double-stranded or single-stranded and, if single-stranded, can be the coding (sense) strand or the non-coding (anti- sense) strand. The sequences that encode polypeptides ofthe invention, as shown in the sequence listing (even numbers, up to SEQ ID NO: 1364), can be (a) the coding sequence as shown in any ofthe nucleotide sequences ofthe sequence listing (odd numbers, up to SEQ ID NO: 1363); (b) a ribonucleotide sequence derived by transcription of (a); or (c) a different coding sequence that, as a result ofthe redundancy or degeneracy ofthe genetic code, encodes the same polypeptides as the polynucleotide molecules having the sequences illustrated in any ofthe nucleotide sequences ofthe sequence listing (odd numbers, up to SEQ ID NO: 1363). The polypeptide can be one that is naturally secreted or excreted by, e.g., H. felis, H. mustelae, H. heilmanii, or H. pylori.
By "polypeptide" or "protein" is meant any chain of amino acids, regardless of length or post-translational modification (e.g., glycosylation or phosphorylation). Both terms are used interchangeably in the present application. By "homologous amino acid sequence" is meant an amino acid sequence that differs from an amino acid sequence shown in the sequence listing (even numbers, up to SEQ ID NO: 1364), or an amino acid sequence encoded by a nucleotide sequence shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363), by one or more non-conservative amino acid substitutions, deletions, or additions located at positions at which they do not destroy the specific antigenicity ofthe polypeptide. Preferably, such a sequence is at least 75%o, more preferably at least 80%, and most preferably at least 90% identical to an amino acid sequence shown in the sequence listing (even numbers, up to SEQ ID NO: 1364). Homologous amino acid sequences include sequences that are identical or substantially identical to an amino acid sequence as shown in the sequence listing (even numbers, up to SEQ ID NO: 1364). By "amino acid sequence that is substantially identical" is meant a sequence that is at least 90%), preferably at least 95%, more preferably at least 97%, and most preferably at least 99% identical to an amino acid sequence of reference and that differs from the sequence of reference, if at all, by a majority of conservative amino acid substitutions.
Conservative amino acid substitutions typically include substitutions among amino acids ofthe same class. These classes include, for example, amino acids having uncharged polar side chains, such as asparagine, glutamine, serine, threonine, and tyrosine; amino acids having basic side chains, such as lysine, arginine, and histidine; amino acids having acidic side chains, such as aspartic acid and glutamic acid; and amino acids having nonpolar side chains, such as glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, and cysteine. Homology can be measured using sequence analysis software (e.g.,
Sequence Analysis Software Package ofthe Genetics Computer Group, University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, WI 53705). Similar amino acid sequences are aligned to obtain the maximum degree of homology (i.e., identity). To this end, it may be necessary to artificially introduce gaps into the sequence. Once the optimal alignment has been set up, the degree of homology (i.e., identity) is established by recording all ofthe positions in which the amino acids of both sequences are identical, relative to the total number of positions.
Homologous polynucleotide sequences are defined in a similar way. Preferably, a homologous sequence is one that is at least 45%, more preferably at least 60%>, and most preferably at least 85% identical to a coding sequence of any ofthe nucleotide sequences set forth in the sequence listing (odd numbers, up to SEQ ID NO: 1363).
Polypeptides having a sequence homologous to any one ofthe sequences shown in the sequence listing (even numbers, up to SEQ ID NO: 1364), include naturally-occurring allelic variants, as well as mutants or any other non- naturally occurring variants that are analogous in terms of antigenicity, to a polypeptide having a sequence as shown in the sequence listing (even numbers, up to SEQ ID NO: 1364).
As is known in the art, an allelic variant is an alternate form of a polypeptide that is characterized as having a substitution, deletion, or addition of one or more amino acids that does not alter the biological function ofthe polypeptide. By "biological function" is meant a function ofthe polypeptide in the cells in which it naturally occurs, even if the function is not necessary for the growth or survival ofthe cells. For example, the biological function of a porin is to allow the entry into cells of compounds present in the extracellular medium. The biological function is distinct from the antigenic function. A polypeptide can have more than one biological function. Allelic variants are very common in nature. For example, a bacterial species, e.g., H. pylori, is usually represented by a variety of strains that differ from each other by minor allelic variations. Indeed, a polypeptide that fulfills the same biological function in different strains can have an amino acid sequence that is not identical in each ofthe strains. Such an allelic variation can be equally reflected at the polynucleotide level.
Support for the use of allelic variants of polypeptide antigens comes from, e.g., studies ofthe Helicobacter urease antigen. The amino acid sequence of Helicobacter urease varies widely from species to species, yet cross-species protection occurs, indicating that the urease molecule, when used as an immunogen, is highly tolerant of amino acid variations. Even among different strains ofthe single species H. pylori, there are amino acid sequence variations.
For example, although the amino acid sequences ofthe UreA and UreB subunits of H. pylori and H. felis ureases differ from one another by 26.5% and 11.8%), respectively (Ferrero et al., Molecular Microbiology 9(2):323-333,
1993), it has been shown that H. pylori urease protects mice from H. felis infection (Michetti et al, Gasfroenterology 107:1002, 1994). In addition, it has been shown that the individual structural subunits of urease, UreA and UreB, which contain distinct amino acid sequences, are both protective antigens against Helicobacter infection (Michetti et al, supra). Similarly, Cuenca et al.
(Gasfroenterology 110:1770, 1996) showed that therapeutic immunization of H. mustelae-infecteά ferrets with H. pylori urease was effective at eradicating H. mustelae infection. Further, several urease variants have been reported to be effective vaccine antigens, including, e.g., recombinant UreA + UreB apoenzyme expressed from pORN142 (UreA and UreB sequences derived from
H. pylori strain CPM630; Lee et al, J. Infect. Dis.l72:161, 1995); recombinant UreA + UreB apoenzyme expressed from pORN214 (UreA and UreB sequences differ from H. pylori strain CPM630 by one and two amino acid changes, respectively; Lee et al, supra, 1995); a UreA-glutathione-S- transferase fusion protein (UreA sequence from H. pylori strain ATCC 43504; Thomas et al, Acta Gastro-Enterologica Belgica 56:54, 1993); UreA + UreB holoenzyme purified from H. pylori strain ΝCTC11637 (Marchetti et al,
Science 267:1655, 1995); a UreA-MBP fusion protein (UreA from H. pylori strain 85P; Ferrero et al, Infection and Immunity 62:4981, 1994); a UreB-MBP fusion protein (UreB from H. pylori strain 85P; Ferrero et al, supra); a UreA- MBP fusion protein (UreA from H felis strain ATCC 49179; Ferrero et al, supra); a UreB-MBP fusion protein (UreB from H. felis strain ATCC 49179;
Ferrero et al, supra); and a 37 kDa fragment of UreB containing amino acids 220-569 (Dore-Davin et al, "A 37 kD fragment of UreB is sufficient to confer protection against Helicobacter felis infection in mice"). Finally, Thomas et al. (supra) showed that oral immunization of mice with crude sonicates of H. pylori protected mice from subsequent challenge with H. felis.
Polynucleotides, e.g., DΝA molecules, encoding allelic variants can easily be obtained by polymerase chain reaction (PCR) amplification of genomic bacterial DΝA extracted by conventional methods. This involves the use of synthetic oligonucleotide primers matching sequences that are upstream and downstream ofthe 5' and 3' ends ofthe coding region. Suitable primers can be designed based on the nucleotide sequence information provided in the sequence listing (odd numbers, up to SEQ ID NO: 1363). Typically, a primer consists of 10 to 40, preferably 15 to 25 nucleotides. It can also be advantageous to select primers containing C and G nucleotides in proportions sufficient to ensure efficient hybridization, e.g., an amount of C and G nucleotides of at least 40%, preferably 50%, ofthe total nucleotide amount. Those skilled in the art can readily design primers that can be used to isolate the polynucleotides ofthe invention from different Helicobacter strains. Experimental conditions for carrying out PCR can readily be determined by one skilled in the art and an illustration of carrying out PCR is provided in Example 2. As is well known in the art, restriction endonuclease recognition sites that contain, typically, 4 to 6 nucleotides (for example, the sequences 5'-
GGATCC-3' (BamHI) or 5'-CTCGAG-3' (Xhol)), can be included on the 5' ends ofthe primers. Restriction sites can be selected by those skilled in the art so that the amplified DNA can be conveniently cloned into an appropriately digested vector, such as a plasmid. Useful homologs that do not occur naturally can be designed using known methods for identifying regions of an antigen that are likely to be tolerant of amino acid sequence changes and/or deletions. For example, sequences ofthe antigen from different species can be compared to identify conserved sequences. Polypeptide derivatives that are encoded by polynucleotides of the invention include, e.g., fragments, polypeptides having large internal deletions derived from full-length polypeptides, and fusion proteins. Polypeptide fragments ofthe invention can be derived from a polypeptide having a sequence homologous to any ofthe sequences ofthe sequence listing (even numbers, up to SEQ ID NO: 1364), to the extent that the fragments retain the substantial antigenicity ofthe parent polypeptide (specific antigenicity). Polypeptide derivatives can also be constructed by large internal deletions that remove a substantial part ofthe parent polypeptide, while retaining specific antigenicity. Generally, polypeptide derivatives should be about at least 12 amino acids in length to maintain antigenicity. Advantageously, they can be at least 20 amino acids, preferably at least 50 amino acids, more preferably at least 75 amino acids, and most preferably at least 100 amino acids in length.
Useful polypeptide derivatives, e.g., polypeptide fragments, can be designed using computer-assisted analysis of amino acid sequences in order to identify sites in protein antigens having potential as surface-exposed, antigenic regions (Hughes et al, Infect. Immun. 60(9):3497, 1992). For example, the
Laser Gene Program from DNA Star can be used to obtain hydrophilicity, antigenic index, and intensity index plots for the polypeptides ofthe invention. This program can also be used to obtain information about homologies ofthe polypeptides with known protein motifs. One skilled in the art can readily use the information provided in such plots to select peptide fragments for use as vaccine antigens. For example, fragments spanning regions ofthe plots in which the antigenic index is relatively high can be selected. One can also select fragments spanning regions in which both the antigenic index and the intensity plots are relatively high. Fragments containing conserved sequences, particularly hydrophilic conserved sequences, can also be selected.
Polypeptide fragments and polypeptides having large internal deletions can be used for revealing epitopes that are otherwise masked in the parent polypeptide and that may be of importance for inducing a protective T cell- dependent immune response. Deletions can also remove immunodominant regions of high variability among strains.
It is an accepted practice in the field of immunology to use fragments and variants of protein immunogens as vaccines, as all that is required to induce an immune response to a protein is a small (e.g., 8 to 10 amino acids) immunogenic region ofthe protein. This has been done for a number of vaccines against pathogens other than Helicobacter. For example, short synthetic peptides corresponding to surface-exposed antigens of pathogens such as murine mammary tumor virus (peptide containing 11 amino acids; Dion et al, Virology 179:474-477, 1990), Semliki Forest virus (peptide containing 16 amino acids; Snijders et al, J. Gen. Virol. 72:557-565, 1991), and canine parvovirus (2 overlapping peptides, each containing 15 amino acids; Langeveld et al, Vaccine 12(15): 1473-1480, 1994) have been shown to be effective vaccine antigens against their respective pathogens.
Polynucleotides encoding polypeptide fragments and polypeptides having large internal deletions can be constructed using standard methods (see, e.g., Ausubel et al, Current Protocols in Molecular Biology, John Wiley & Sons Inc., 1994), for example, by PCR, including inverse PCR, by restriction enzyme treatment ofthe cloned DNA molecules, or by the method of Kunkel et al. (Proc. Natl. Acad. Sci. USA 82:448, 1985; biological material available at Stratagene).
A polypeptide derivative can also be produced as a fusion polypeptide that contains a polypeptide or a polypeptide derivative ofthe invention fused, e.g., at the N- or C-terminal end, to any other polypeptide (hereinafter referred to as a peptide tail). Such a product can be easily obtained by translation of a genetic fusion, i.e., a hybrid gene. Vectors for expressing fusion polypeptides are commercially available, and include the pMal-c2 or pMal-p2 systems of New England Biolabs, in which the peptide tail is a maltose binding protein, the glutathione-S-transferase system of Pharmacia, or the His-Tag system available from Novagen. These and other expression systems provide convenient means for further purification of polypeptides and derivatives ofthe invention.
Another particular example of fusion polypeptides included in invention includes a polypeptide or polypeptide derivative ofthe invention fused to a polypeptide having adjuvant activity, such as, e.g., subunit B of either cholera toxin or E. coli heat-labile toxin. Several possibilities can be used for producing such fusion proteins. First, the polypeptide ofthe invention can be fused to the N-terminal end or, preferably, to the C-terminal end ofthe polypeptide having adjuvant activity. Second, a polypeptide fragment ofthe invention can be fused within the amino acid sequence ofthe polypeptide having adjuvant activity. Spacer sequences can also be included, if desired.
As stated above, the polynucleotides ofthe invention encode Helicobacter polypeptides in precursor or mature form. They can also encode hybrid precursors containing heterologous signal peptides, which can mature into polypeptides ofthe invention. By "heterologous signal peptide" is meant a signal peptide that is not found in the naturally-occurring precursor of a polypeptide ofthe invention.
A polynucleotide ofthe invention hybridizes, preferably under stringent conditions, to a polynucleotide having a sequence as shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363). Hybridization procedures are, e.g., described by Ausubel et al. (supra); Silhavy et al. (Experiments with Gene
Fusions, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1984); and Davis et al. (A Manual for Genetic Engineering: Advanced Bacterial Genetics, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1980). Important parameters that can be considered for optimizing hybridization conditions are reflected in the following formula, which facilitates calculation ofthe melting temperature (Tm), which is the temperature above which two complementary DNA strands separate from one another (Casey et al, Nucl. Acid Res. 4:1539, 1977): Tm = 81.5 + 0.5 x (% G+C) + 1.6 log (positive ion concentration) - 0.6 x (% formamide). Under appropriate stringency conditions, hybridization temperature (Th) is approximately 20 to 40°C, 20 to 25 °C, or, preferably, 30 to 40°C below the calculated Tm. Those skilled in the art will understand that optimal temperature and salt conditions can be readily determined empirically in preliminary experiments using conventional procedures. For example, stringent conditions can be achieved, both for pre-hybridizing and hybridizing incubations, (i) within 4-16 hours at 42 °C, in 6 x SSC containing 50% formamide or (ii) within 4-16 hours at 65 °C in an aqueous 6 x SSC solution (1 M NaCl, 0.1 M sodium citrate (pH 7.0)). For polynucleotides containing 30 to 600 nucleotides, the above formula is used and then is corrected by subtracting (600/polynucleotide size in base pairs). Stringency conditions are defined by a Th that is 5 to 10 °C below Tm. Hybridization conditions with oligonucleotides shorter than 20-30 bases do not precisely follow the rules set forth above. In such cases, the formula for calculating the Tm is as follows: Tm = 4 x (G+C) + 2 (A+T). For example, an 18 nucleotide fragment of 50% G+C would have an approximate Tm of 54°C. A polynucleotide molecule ofthe invention, containing RNA, DNA, or modifications or combinations thereof, can have various applications. For example, a polynucleotide molecule can be used (i) in a process for producing the encoded polypeptide in a recombinant host system, (ii) in the construction of vaccine vectors such as poxviruses, which are further used in methods and compositions for preventing and/or treating Helicobacter infection, (iii) as a vaccine agent, in a naked form or formulated with a delivery vehicle and, (iv) in the construction of attenuated Helicobacter strains that can over-express a polynucleotide ofthe invention or express it in a non-toxic, mutated form.
According to a second aspect ofthe invention, there is therefore provided (i) an expression cassette containing a polynucleotide molecule ofthe invention placed under the control of elements (e.g., a promoter) required for expression; (ii) an expression vector containing an expression cassette ofthe invention; (iii) a procaryotic or eucaryotic cell transformed or fransfected with an expression cassette and/or vector ofthe invention, as well as (iv) a process for producing a polypeptide or polypeptide derivative encoded by a polynucleotide ofthe invention, which involves culturing a procaryotic or eucaryotic cell transformed or fransfected with an expression cassette and/or vector ofthe invention, under conditions that allow expression ofthe polynucleotide molecule ofthe invention and, recovering the encoded polypeptide or polypeptide derivative from the cell culture.
A recombinant expression system can be selected from procaryotic and eucaryotic hosts. Eucaryotic hosts include, for example, yeast cells (e.g., Saccharomyces cerevisiae or Pichia Pastoris), mammalian cells (e.g., COS1,
NIH3T3, or JEG3 cells), arthropods cells (e.g., Spodoptera frugiperda (SF9) cells), and plant cells. Preferably, a procaryotic host such as E. coli is used. Bacterial and eucaryotic cells are available from a number of different sources that are known to those skilled in the art, e.g., the American Type Culture Collection (ATCC; Rockville, Maryland).
The choice ofthe expression cassette will depend on the host system selected, as well as the features desired for the expressed polypeptide. For example, it may be useful to produce a polypeptide ofthe invention in a particular lipidated form or any other form. Typically, an expression cassette includes a constitutive or inducible promoter that is functional in the selected host system; a ribosome binding site; a start codon (ATG); if necessary, a region encoding a signal peptide, e.g., a lipidation signal peptide; a polynucleotide molecule ofthe invention; a stop codon; and, optionally, a 3' terminal region (translation and/or transcription terminator). The signal peptide-encoding region is adjacent to the polynucleotide ofthe invention and is placed in the proper reading frame. The signal peptide-encoding region can be homologous or heterologous to the polynucleotide molecule encoding the mature polypeptide and it can be specific to the secretion apparatus ofthe host used for expression. The open reading frame constituted by the polynucleotide molecule ofthe invention, alone or together with the signal peptide, is placed under the control ofthe promoter so that transcription and translation occur in the host system. Promoters and signal peptide-encoding regions are widely known and available to those skilled in the art and include, for example, the promoter of Salmonella typhimurium (and derivatives) that is inducible by arabinose (promoter araB) and is functional in Gram-negative bacteria such as E. coli (U.S. Patent No. 5,028,530; Cagnon et al, Protein Engineering 4(7) : 843 , 1991 ); the promoter of the bacteriophage T7 RNA polymerase gene, which is functional in a number of E. coli strains expressing T7 polymerase (U.S. Patent No. 4,952,496); the OspA lipidation signal peptide; and RlpB lipidation signal peptide (Takase et al, J. Bact. 169:5692, 1987).
The expression cassette is typically part of an expression vector, which is selected for its ability to replicate in the chosen expression system.
Expression vectors (e.g., plasmids or viral vectors) can be chosen from, for example, those described in Pouwels et al. (Cloning Vectors: A Laboratory Manual, 1985, Supp. 1987) and can purchased from various commercial sources. Methods for transforming or transfecting host cells with expression vectors are well known in the art and will depend on the host system selected, as described in Ausubel et al (supra).
Upon expression, a recombinant polypeptide ofthe invention (or a polypeptide derivative) is produced and remains in the intracellular compartment, is secreted/excreted in the extracellular medium or in the periplasmic space, or is embedded in the cellular membrane. The polypeptide can then be recovered in a substantially purified form from the cell extract or from the supernatant after centrifugation ofthe cell culture. Typically, the recombinant polypeptide can be purified by antibody-based affinity purification or by any other method known to a person skilled in the art, such as by genetic fusion to a small affinity-binding domain. Antibody-based affinity purification methods are also available for purifying a polypeptide ofthe invention extracted from a Helicobacter strain. Antibodies useful for immunoaffinity purification ofthe polypeptides ofthe invention can be obtained using methods described below.
Polynucleotides ofthe invention can also be used in DNA vaccination methods, using either a viral or bacterial host as gene delivery vehicle (live vaccine vector) or administering the gene in a free form, e.g., inserted into a plasmid. Therapeutic or prophylactic efficacy of a polynucleotide ofthe invention can be evaluated as is described below.
Accordingly, in a third aspect ofthe invention, there is provided (i) a vaccine vector such as a poxvirus, containing a polynucleotide molecule ofthe invention placed under the control of elements required for expression; (ii) a composition of matter containing a vaccine vector ofthe invention, together with a diluent or carrier; (iii) a pharmaceutical composition containing a therapeutically or prophylactically effective amount of a vaccine vector ofthe invention; (iv) a method for inducing an immune response against Helicobacter in a mammal (e.g., a human; alternatively, the method can be used in veterinary applications for treating or preventing Helicobacter infection of animals, e.g., cats or birds), which involves administering to the mammal an immunogenically effective amount of a vaccine vector ofthe invention to elicit an immune response, e.g., a protective or therapeutic immune response to Helicobacter; and (v) a method for preventing and/or treating a Helicobacter
(e.g., H. pylori, H. felis, H. mustelae, or H. heilmanii) infection, which involves administering a prophylactic or therapeutic amount of a vaccine vector ofthe invention to an individual in need. Additionally, the third aspect ofthe invention encompasses the use of a vaccine vector ofthe invention in the preparation of a medicament for preventing and/or treating Helicobacter infection. A vaccine vector ofthe invention can express one or several polypeptides or derivatives ofthe invention, as well as at least one additional Helicobacter antigen such as a urease apoenzyme or a subunit, fragment, homolog, mutant, or derivative thereof. In addition, it can express a cytokine, such as interleukin-2 (IL-2) or interleukin-12 (IL-12), that enhances the immune response. Thus, a vaccine vector can include an additional polynucleotide molecules encoding, e.g., urease subunit A, B, or both, or a cytokine, placed under the control of elements required for expression in a mammalian cell.
Alternatively, a composition ofthe invention can include several vaccine vectors, each of which being capable of expressing a polypeptide or derivative ofthe invention. A composition can also contain a vaccine vector capable of expressing an additional Helicobacter antigen such as urease apoenzyme, a subunit, fragment, homolog, mutant, or derivative thereof, or a cytokine such as IL-2 or IL-12. In vaccination methods for treating or preventing infection in a mammal, a vaccine vector ofthe invention can be administered by any conventional route in use in the vaccine field, for example, to a mucosal (e.g., ocular, intranasal, oral, gastric, pulmonary, intestinal, rectal, vaginal, or urinary tract) surface or via a parenteral (e.g., subcutaneous, intradermal, intramuscular, intravenous, or intraperitoneal) route. Preferred routes depend upon the choice ofthe vaccine vector. The administration can be achieved in a single dose or repeated at intervals. The appropriate dosage depends on various parameters that are understood by those skilled in the art, such as the nature ofthe vaccine vector itself, the route of administration, and the condition ofthe mammal to be vaccinated (e.g., the weight, age, and general health ofthe mammal).
Live vaccine vectors that can be used in the invention include viral vectors, such as adenoviruses and poxviruses, as well as bacterial vectors, e.g.,
Shigella, Salmonella, Vibrio cholerae, Lactobacillus, Bacille bilie de Calmette- Guerin (BCG), and Streptococcus. An example of an adenovirus vector, as well as a method for constructing an adenovirus vector capable of expressing a polynucleotide molecule ofthe invention, is described in U.S. Patent No. 4,920,209. Poxvirus vectors that can be used in the invention include, e.g., vaccinia and canary pox viruses, which are described in U.S. Patent No. 4,722,848 and U.S. Patent No. 5,364,773, respectively (also see, e.g., Tartaglia et al, Virology 188:217, 1992, for a description of a vaccinia virus vector, and Taylor et al, Vaccine 13:539, 1995, for a description of a canary poxvirus vector). Poxvirus vectors capable of expressing a polynucleotide ofthe invention can be obtained by homologous recombination, as described in Kieny et al. (Nature 312:163, 1984) so that the polynucleotide ofthe invention is inserted in the viral genome under appropriate conditions for expression in mammalian cells. Generally, the dose of viral vector vaccine, for therapeutic or prophylactic use, can be from about lxlO4 to about lxlO11, advantageously from about lxl 07 to about lxl 010, or, preferably, from about lxl 07 to about lxl 09 plaque-forming units per kilogram. Preferably, viral vectors are administered parenterally, for example, in 3 doses that are 4 weeks apart. Those skilled in the art will recognize that it is preferable to avoid adding a chemical adjuvant to a composition containing a viral vector ofthe invention and thereby minimizing the immune response to the viral vector itself. Non-toxicogenic Vibrio cholerae mutant strains that can be used in live oral vaccines are described by Mekalanos et al. (Nature 306:551, 1983) and in U.S. Patent No. 4,882,278 (strain in which a substantial amount ofthe coding sequence of each ofthe two ctxA alleles has been deleted so that no functional cholerae toxin is produced); WO 92/11354 (strain in which the irgA locus is inactivated by mutation; this mutation can be combined in a single strain with ctxA mutations); and WO 94/1533 (deletion mutant lacking functional ctxA and attRSl DNA sequences). These strains can be genetically engineered to express heterologous antigens, as described in WO 94/19482. An effective vaccine dose of a V. cholerae strain capable of expressing a polypeptide or polypeptide derivative encoded by a polynucleotide molecule ofthe invention can contain, e.g., about lxlO5 to about lxlO9, preferably about lxlO6 to about lxl 08 viable bacteria in an appropriate volume for the selected route of administration. Preferred routes of administration include all mucosal routes, but, most preferably, these vectors are administered intranasally or orally. Attenuated Salmonella typhimurium strains, genetically engineered for recombinant expression of heterologous antigens, and their use as oral vaccines, are described by Nakayama et al. (Bio/Technology 6:693, 1988) and in WO 92/11361. Preferred routes of administration for these vectors include all mucosal routes. Most preferably, the vectors are administered intranasally or orally.
Others bacterial strains useful as vaccine vectors are described by High et al. (EMBO 11:1991, 1992) and Sizemore et al (Science 270:299, 1995; Shigellaflexneri); Medaglini et al. (Proc. Natl. Acad. Sci. USA 92:6868, 1995; (Streptococcus gordonii); Flynn (Cell. Mol. Biol. 40 (suppl. I):31, 1194), and in WO 88/6626, WO 90/0594, WO 91/13157, WO 92/1796, and WO 92/21376
(Bacille Calmette Guerin). In bacterial vectors, a polynucleotide ofthe invention can be inserted into the bacterial genome or it can remain in a free state, for example, carried on a plasmid.
An adjuvant can also be added to a composition containing a bacterial vector vaccine. A number of adjuvants that can be used are known to those skilled in the art. For example, preferred adjuvants can be selected from the list provided below.
According to a fourth aspect ofthe invention, there is also provided (i) a composition of matter containing a polynucleotide ofthe invention, together with a diluent or carrier; (ii) a pharmaceutical composition containing a therapeutically or prophylactically effective amount of a polynucleotide ofthe invention; (iii) a method for inducing an immune response against
Helicobacter, in a mammal, by administering to the mammal an immunogenically effective amount of a polynucleotide ofthe invention to elicit an immune response, e.g., a protective immune response to Helicobacter; and (iv) a method for preventing and or treating a Helicobacter (e.g., H. pylori, H. felis, H. mustelae, or H. heilmanii) infection, by administering a prophylactic or therapeutic amount of a polynucleotide ofthe invention to an individual in need of such treatment. Additionally, the fourth aspect ofthe invention encompasses the use of a polynucleotide ofthe invention in the preparation of a medicament for preventing and/or treating Helicobacter infection. The fourth aspect ofthe invention preferably includes the use of a polynucleotide molecule placed under conditions for expression in a mammalian cell, e.g., in a plasmid that is unable to replicate in mammalian cells and to substantially integrate into a mammalian genome.
Polynucleotides (for example, DNA or RNA molecules) ofthe invention can also be administered as such to a mammal as a vaccine. When a DNA molecule ofthe invention is used, it can be in the form of a plasmid that is unable to replicate in a mammalian cell and unable to integrate into the mammalian genome. Typically, a DNA molecule is placed under the control of a promoter suitable for expression in a mammalian cell. The promoter can function ubiquitously or tissue-specifically. Examples of non-tissue specific promoters include the early Cytomegalovirus (CMV) promoter (U.S. Patent No. 4,168,062) and the Rous Sarcoma Virus promoter (Norton et al, Molec.
Cell Biol. 5:281, 1985). The desmin promoter (Li et al, Gene 78:243, 1989; Li et al, J. Biol. Chem. 266:6562, 1991; Li et al, J. Biol. Chem. 268: 10403, 1993) is tissue-specific and drives expression in muscle cells. More generally, useful promoters and vectors are described, e.g., in WO 94/21797 and by Hartikka et al (Human Gene Therapy 7:1205, 1996).
For DNA/RNA vaccination, the polynucleotide ofthe invention can encode a precursor or a mature form of a polypeptide ofthe invention. When it encodes a precursor form, the precursor sequence can be homologous or heterologous. In the latter case, a eucaryotic leader sequence can be used, such as the leader sequence ofthe tissue-type plasminogen factor (tPA).
A composition ofthe invention can contain one or several polynucleotides ofthe invention. It can also contain at least one additional polynucleotide encoding another Helicobacter antigen, such as urease subunit A, B, or both, or a fragment, derivative, mutant, or analog thereof. A polynucleotide encoding a cytokine, such as interleukin-2 (IL-2) or interleukin-
12 (IL-12), can also be added to the composition so that the immune response is enhanced. These additional polynucleotides are placed under appropriate control for expression. Advantageously, DNA molecules ofthe invention and/or additional DNA molecules to be included in the same composition are carried in the same plasmid. Standard methods can be used in the preparation of therapeutic polynucleotides ofthe invention. For example, a polynucleotide can be used in a naked form, free of any delivery vehicles, such as anionic liposomes, cationic lipids, microparticles, e.g., gold microparticles, precipitating agents, e.g., calcium phosphate, or any other fransfection-facilitating agent. In this case, the polynucleotide can be simply diluted in a physiologically acceptable solution, such as sterile saline or sterile buffered saline, with or without a carrier. When present, the carrier preferably is isotonic, hypotonic, or weakly hypertonic, and has a relatively low ionic strength, such as provided by a sucrose solution, e.g., a solution containing 20% sucrose. Alternatively, a polynucleotide can be associated with agents that assist in cellular uptake. It can be, e.g., (i) complemented with a chemical agent that modifies cellular permeability, such as bupivacaine (see, e.g., WO 94/16737), (ii) encapsulated into liposomes, or (iii) associated with cationic lipids or silica, gold, or tungsten microparticles. Anionic and neutral liposomes are well-known in the art (see, e.g.,
Liposomes: A Practical Approach, RPC New Ed, IRL Press, 1990, for a detailed description of methods for making liposomes) and are useful for delivering a large range of products, including polynucleotides.
Cationic lipids can also be used for gene delivery. Such lipids include, for example, Lipofectin™, which is also known as DOTMA (N-[l-(2,3- dioleyloxy)propyl]-N,N,N-trimethylammonium chloride), DOTAP (1,2- bis(oleyloxy)-3-(trimethylammonio)propane), DDAB (dimethyldioctadecylammonium bromide), DOGS (dioctadecylamidologlycyl spermine), and cholesterol derivatives. A description of these cationic lipids can be found in EP 187,702, WO 90/11092, U.S. Patent No. 5,283,185,
WO 91/15501, WO 95/26356, and U.S. Patent No. 5,527,928. Cationic lipids for gene delivery are preferably used in association with a neutral lipid such as DOPE (dioleyl phosphatidylethanolamine; WO 90/11092). Other transfection- facilitating compounds can be added to a formulation containing cationic liposomes. A number of them are described in, e.g., WO 93/18759, WO 93/19768, WO 94/25608, and WO 95/2397. They include, e.g., spermine derivatives useful for facilitating the transport of DNA through the nuclear membrane (see, for example, WO 93/18759) and membrane-permeabilizing compounds such as GALA, Gramicidine S, and cationic bile salts (see, for example, WO 93/19768).
Gold or tungsten microparticles can also be used for gene delivery, as described in WO 91/359, WO 93/17706, and by Tang et al (Nature 356:152,
1992). In this case, the microparticle-coated polynucleotides can be injected via intradermal or intraepidermal routes using a needleless injection device ("gene gun"), such as those described in U.S. Patent No. 4,945,050, U.S. Patent No. 5,015,580, and WO 94/24263. The amount of DNA to be used in a vaccine recipient depends, e.g., on the strength ofthe promoter used in the DNA construct, the immunogenicity of the expressed gene product, the condition ofthe mammal intended for administration (e.g., the weight, age, and general health ofthe mammal), the mode of administration, and the type of formulation. In general, a therapeutically or prophylactically effective dose from about 1 μg to about
1 mg, preferably, from about 10 μg to about 800 μg, and, more preferably, from about 25 μg to about 250 μg, can be administered to human adults. The administration can be achieved in a single dose or repeated at intervals.
The route of administration can be any conventional route used in the vaccine field. As general guidance, a polynucleotide ofthe invention can be administered via a mucosal surface, e.g., an ocular, intranasal, pulmonary, oral, intestinal, rectal, vaginal, or urinary tract surface, or via a parenteral route, e.g., by an intravenous, subcutaneous, intraperitoneal, intradermal, intraepidermal, or intramuscular route. The choice of administration route will depend on, e.g., the formulation that is selected. A polynucleotide formulated in association with bupivacaine is advantageously administered into muscle. When a neutral or anionic liposome or a cationic lipid, such as DOTMA, is used, the formulation can be advantageously injected via intravenous, intranasal (for example, by aerosolization), intramuscular, intradermal, and subcutaneous routes. A polynucleotide in a naked form can advantageously be administered via the intramuscular, intradermal, or subcutaneous routes. Although not absolutely required, such a composition can also contain an adjuvant. A systemic adjuvant that does not require concomitant administration in order to exhibit an adjuvant effect is preferable.
The sequence information provided in the present application enables the design of specific nucleotide probes and primers that can be used in diagnostic methods. Accordingly, in a fifth aspect ofthe invention, there is provided a nucleotide probe or primer having a sequence found in, or derived by degeneracy ofthe genetic code from, a sequence shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363).
The term "probe" as used in the present application refers to DNA (preferably single stranded) or RNA molecules (or modifications or combinations thereof) that hybridize under the stringent conditions, as defined above, to polynucleotide molecules having sequences homologous to any of those shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363), or to a complementary or anti-sense sequence of any of those shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363). Generally, probes are significantly shorter than the full-length sequences shown in the sequence listing. For example, they can contain from about 5 to about 100, preferably from about 10 to about 80 nucleotides. In particular, probes have sequences that are at least 75%, preferably at least 85%, more preferably 95% homologous to a portion of a sequence as shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363), or a sequence complementary to any of such sequences.
Probes can contain modified bases, such as inosine, methyl-5- deoxycytidine, deoxyuridine, dimethylamino-5-deoxyuridine, or diamino-2, 6- purine. Sugar or phosphate residues can also be modified or substituted. For example, a deoxyribose residue can be replaced by a polyamide (Nielsen et al, Science 254: 1497, 1991) and phosphate residues can be replaced by ester groups such as diphosphate, alkyl, arylphosphonate, and phosphorothioate esters. In addition, the 2'-hydroxyl group on ribonucleotides can be modified by addition of, e.g., alkyl groups.
Probes ofthe invention can be used in diagnostic tests, or as capture or detection probes. Such capture probes can be immobilized on solid supports, directly or indirectly, by covalent means or by passive adsorption. A detection probe can be labeled by a detectable label, for example a label selected from radioactive isotopes; enzymes, such as peroxidase and alkaline phosphatase; enzymes that are able to hydrolyze a chromogenic, fluorogenic, or luminescent substrate; compounds that are chromogenic, fluorogenic, or luminescent; nucleotide base analogs; and biotin.
Probes ofthe invention can be used in any conventional hybridization method, such as in dot blot methods (Maniatis et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1982), Southern blot methods (Southern, J. Mol. Biol.
98:503, 1975), northern blot methods (identical to Southern blot to the exception that RNA is used as a target), or a sandwich method (Dunn et al, Cell 12:23, 1977). As is known in the art, the latter technique involves the use of a specific capture probe and a specific detection probe that have nucleotide sequences that are at least partially different from each other.
Primers used in the invention usually contain about 10 to 40 nucleotides and are used to initiate enzymatic polymerization of DNA in an amplification process (e.g., PCR), an elongation process, or a reverse transcription method. In a diagnostic method involving PCR, the primers can be labeled.
Thus, the invention also encompasses (i) a reagent containing a probe of the invention for detecting and/or identifying the presence of Helicobacter in a biological material; (ii) a method for detecting and/or identifying the presence of Helicobacter in a biological material, in which (a) a sample is recovered or derived from the biological material, (b) DNA or RNA is extracted from the material and denatured, and (c) the sample is exposed to a probe ofthe invention, for example, a capture probe, a detection probe, or both, under stringent hybridization conditions, so that hybridization is detected; and (iii) a method for detecting and/or identifying the presence of Helicobacter in a biological material, in which (a) a sample is recovered or derived from the biological material, (b) DNA is extracted therefrom, (c) the extracted DNA is contacted with at least one, or, preferably two, primers ofthe invention, and amplified by the polymerase chain reaction, and (d) an amplified DNA molecule is produced.
As mentioned above, polypeptides that can be produced by expression ofthe polynucleotides ofthe invention can be used as vaccine antigens. Accordingly, a sixth aspect ofthe invention features a substantially purified polypeptide or polypeptide derivative having an amino acid sequence encoded by a polynucleotide ofthe invention. A "substantially purified polypeptide" is defined as a polypeptide that is separated from the environment in which it naturally occurs and or a polypeptide that is free of most ofthe other polypeptides that are present in the environment in which it was synthesized. The polypeptides ofthe invention can be purified from a natural source, such as a Helicobacter strain, or can be produced using recombinant methods.
Homologous polypeptides or polypeptide derivatives encoded by polynucleotides ofthe invention can be screened for specific antigenicity by testing cross-reactivity with an antiserum raised against a polypeptide having an amino acid sequence as shown in the sequence listing (even numbers, up to SEQ ID NO: 1364). Briefly, a monospecific hyperimmune antiserum can be raised against a purified reference polypeptide as such or as a fusion polypeptide, for example, an expression product of MBP, GST, or His-tag systems, or a synthetic peptide predicted to be antigenic. The homologous polypeptide or derivative that is screened for specific antigenicity can be produced as such or as a fusion polypeptide. In the latter case, and if the antiserum is also raised against a fusion polypeptide, two different fusion systems are employed. Specific antigenicity can be determined using a number of methods, including Western blot (Towbin et al, Proc. Natl. Acad. Sci. USA 76:4350, 1979), dot blot, and ELISA methods, as described below. In a Western blot assay, the product to be screened, either as a purified preparation or a total E. coli extract, is fractionated by SDS-PAGE, as described, for example, by Laemmli (Nature 227:680, 1970). After being transferred to a filter, such as a nitrocellulose membrane, the material is incubated with the monospecific hyperimmune antiserum, which is diluted in a range of dilutions from about 1 : 50 to about 1 :5000, preferably from about
1 : 100 to about 1 :500. Specific antigenicity is shown once a band corresponding to the product exhibits reactivity at any ofthe dilutions in the range.
In an ELISA assay, the product to be screened can be used as the coating antigen. A purified preparation is preferred, but a whole cell extract can also be used. Briefly, about 100 μl of a preparation of about 10 μg protein/ml is distributed into wells of a 96-well ELISA plate. The plate is incubated for about 2 hours at 37°C, then overnight at 4°C. The plate is washed with phosphate buffer saline (PBS) contaimng 0.05% Tween 20 (PBS/Tween buffer) and the wells are saturated with 250 μl PBS containing 1% bovine serum albumin (BSA), to prevent non-specific antibody binding. After 1 hour of incubation at 37 °C, the plate is washed with PBS/Tween buffer. The antiserum is serially diluted in PBS/Tween buffer containing 0.5% BSA, and 100 μl dilutions are added to each well. The plate is incubated for 90 minutes at 37°C, washed, and evaluated using standard methods. For example, a goat anti-rabbit peroxidase conjugate can be added to the wells when the specific antibodies used were raised in rabbits. Incubation is carried out for about 90 minutes at
37 °C and the plate is washed. The reaction is developed with the appropriate substrate and the reaction is measured by colorimetry (absorbance measured spectrophotometrically). Under these experimental conditions, a positive reaction is shown once an O.D. value of 1.0 is detected with a dilution of at least about 1:50, preferably of at least about 1 :500.
In a dot blot assay, a purified product is preferred, although a whole cell extract can be used. Briefly, a solution ofthe product at a concentration of about 100 μg/ml is serially diluted two-fold with 50 mM Tris-HCl (pH 7.5). One hundred μl of each dilution is applied to a filter, such as a 0.45 μm nitrocellulose membrane, set in a 96-well dot blot apparatus (Biorad). The buffer is removed by applying vacuum to the system. Wells are washed by addition of 50 mM Tris-HCl (pH 7.5) and the membrane is air-dried. The membrane is saturated in blocking buffer (50 mM Tris-HCl (pH 7.5), 0.15 M NaCl, 10 g/L skim milk) and incubated with an antiserum diluted from about 1:50 to about 1:5000, preferably about 1:500. The reaction is detected using standard methods. For example, a goat anti-rabbit peroxidase conjugate can be added to the wells when rabbit antibodies are used. Incubation is carried out for about 90 minutes at 37 °C and the blot is washed. The reaction is developed with the appropriate substrate and stopped. The reaction is then measured visually by the appearance of a colored spot, e.g., by colorimetry. Under these experimental conditions, a positive reaction is associated with detection of a colored spot for reactions carried out with a dilution of at least about 1 :50, preferably, of at least about 1 :500. Therapeutic or prophylactic efficacy of a polypeptide or polypeptide derivative ofthe invention can be evaluated as described below.
According to a seventh aspect ofthe invention, there is provided (i) a composition of matter containing a polypeptide ofthe invention together with a diluent or carrier; (ii) a pharmaceutical composition containing a therapeutically or prophylactically effective amount of a polypeptide ofthe invention; (iii) a method for inducing an immune response against Helicobacter in a mammal by administering to the mammal an immunogenically effective amount of a polypeptide ofthe invention to elicit an immune response, e.g., a protective immune response to Helicobacter; and (iv) a method for preventing and/or treating a Helicobacter (e.g., H. pylori, H. felis, H. mustelae, or H. heilmanii) infection, by administering a prophylactic or therapeutic amount of a polypeptide ofthe invention to an individual in need of such treatment. Additionally, this aspect ofthe invention includes the use of a polypeptide of the invention in the preparation of a medicament for preventing and/or treating Helicobacter infection.
The immunogenic compositions ofthe invention can be administered by any conventional route in use in the vaccine field, for example, to a mucosal (e.g., ocular, intranasal, pulmonary, oral, gastric, intestinal, rectal, vaginal, or urinary tract) surface or via a parenteral (e.g., subcutaneous, intradermal, intramuscular, intravenous, or intraperitoneal) route. The choice ofthe administration route depends upon a number of parameters, such as the adjuvant used. For example, if a mucosal adjuvant is used, the intranasal or oral route will be preferred, and if a lipid formulation or an aluminum compound is used, a parenteral route will be preferred. In the latter case, the subcutaneous or intramuscular route is most preferred. The choice of administration route can also depend upon the nature ofthe vaccine agent. For example, a polypeptide ofthe invention fused to CTB or to LTB will be best administered to a mucosal surface. A composition ofthe invention can contain one or several polypeptides or derivatives ofthe invention. It can also contain at least one additional Helicobacter antigen, such as the urease apoenzyme, or a subunit, fragment, homolog, mutant, or derivative thereof.
For use in a composition ofthe invention, a polypeptide or polypeptide derivative can be formulated into or with liposomes, such as neutral or anionic liposomes, microspheres, ISCOMS, or virus-like particles (VLPs), to facilitate delivery and/or enhance the immune response. These compounds are readily available to those skilled in the art; for example, see Liposomes: A Practical Approach (supra). Adjuvants other than liposomes can also be used in the invention and are well known in the art (see, for example, the list provided below). Administration can be achieved in a single dose or repeated as necessary at intervals that can be determined by one skilled in the art. For example, a priming dose can be followed by three booster doses at weekly or monthly intervals. An appropriate dose depends on various parameters, including the nature ofthe recipient (e.g., whether the recipient is an adult or an infant), the particular vaccine antigen, the route and frequency of administration, the presence/absence or type of adjuvant, and the desired effect (e.g., protection and/or treatment), and can be readily determined by one skilled in the art. In general, a vaccine antigen ofthe invention can be administered mucosally in an amount ranging from about 10 μg to about 500 mg, preferably from about 1 mg to about 200 mg. For a parenteral route of administration, the dose usually should not exceed about 1 mg, and is, preferably, about 100 μg.
When used as components of a vaccine, the polynucleotides and polypeptides ofthe invention can be used sequentially as part of a multi-step immunization process. For example, a mammal can be initially primed with a vaccine vector ofthe invention, such as a pox virus, e.g., via a parenteral route, and then boosted twice with a polypeptide encoded by the vaccine vector, e.g., via the mucosal route. In another example, liposomes associated with a polypeptide or polypeptide derivative ofthe invention can be used for priming, with boosting being carried out mucosally using a soluble polypeptide or polypeptide derivative ofthe invention, in combination with a mucosal adjuvant (e.g., LT).
Polypeptides and polypeptide derivatives ofthe invention can also be used as diagnostic reagents for detecting the presence of anύ-Helicobacter antibodies, e.g., in blood samples. Such polypeptides can be about 5 to about 80, preferably, about 10 to about 50 amino acids in length and can be labeled or unlabeled, depending upon the diagnostic method. Diagnostic methods involving such a reagent are described below.
Upon expression of a polynucleotide molecule ofthe invention, a polypeptide or polypeptide derivative is produced and can be purified using known methods. For example, the polypeptide or polypeptide derivative can be produced as a fusion protein containing a fused tail that facilitates purification.
The fusion product can be used to immunize a small mammal, e.g., a mouse or a rabbit, in order to raise monospecific antibodies against the polypeptide or polypeptide derivative. The eighth aspect ofthe invention thus provides a monospecific antibody that binds to a polypeptide or polypeptide derivative of the invention.
By "monospecific antibody" is meant an antibody that is capable of reacting with a unique, naturally-occurring Helicobacter polypeptide. An antibody ofthe invention can be polyclonal or monoclonal. Monospecific antibodies can be recombinant, e.g., chimeric (e.g., consisting of a variable region of murine origin and a human constant region), humanized (e.g., a human immunoglobulin constant region and a variable region of animal, e.g., murine, origin), and/or single chain. Both polyclonal and monospecific antibodies can also be in the form of immunoglobulin fragments, e.g., F(ab)'2 or Fab fragments. The antibodies ofthe invention can be of any isotype, e.g., IgG or IgA, and polyclonal antibodies can be of a single isotype or can contain a mixture of isotypes.
The antibodies ofthe invention, which can be raised to a polypeptide or polypeptide derivative ofthe invention, can be produced and identified using standard immunological assays, e.g., Western blot assays, dot blot assays, or ELISA (see, e.g., Coligan et al., Current Protocols in Immunology, John Wiley
& Sons, Inc., New York, NY, 1994). The antibodies can be used in diagnostic methods to detect the presence of Helicobacter antigens in a sample, such as a biological sample. The antibodies can also be used in affinity chromatography methods for purifying a polypeptide or polypeptide derivative ofthe invention. As is discussed further below, the antibodies can also be used in prophylactic and therapeutic passive immunization methods. Accordingly, a ninth aspect ofthe invention provides (i) a reagent for detecting the presence of Helicobacter in a biological sample that contains an antibody, polypeptide, or polypeptide derivative ofthe invention; and (ii) a diagnostic method for detecting the presence of Helicobacter in a biological sample, by contacting the biological sample with an antibody, a polypeptide, or a polypeptide derivative ofthe invention, so that an immune complex is formed, and detecting the complex as an indication ofthe presence of Helicobacter in the sample or the organism from which the sample was derived. The immune complex is formed between a component ofthe sample and the antibody, polypeptide, or polypeptide derivative, and that any unbound material can be removed prior to detecting the complex. A polypeptide reagent can be used for detecting the presence of anti-Helicobacter antibodies in a sample, e.g., a blood sample, while an antibody ofthe invention can be used for screening a sample, such as a gastric extract or biopsy sample, for the presence of Helicobacter polypeptides. For use in diagnostic methods, the reagent (e.g., the antibody, polypeptide, or polypeptide derivative ofthe invention) can be in a free state or can be immobilized on a solid support, such as, for example, on the interior surface of a tube or on the surface, or within pores, of a bead. Immobilization can be achieved using direct or indirect means. Direct means include passive adsorption (i.e., non-covalent binding) or covalent binding between the support and the reagent. By "indirect means" is meant that an anti-reagent compound that interacts with the reagent is first attached to the solid support. For example, if a polypeptide reagent is used, an antibody that binds to it can serve as an anti-reagent, provided that it binds to an epitope that is not involved in recognition of antibodies in biological samples. Indirect means can also employ a ligand-receptor system, for example, a molecule, such as a vitamin, can be grafted onto the polypeptide reagent and the corresponding receptor can be immobilized on the solid phase. This concept is illustrated by the well known biotin-streptavidin system. Alternatively, indirect means can be used, e.g., by adding to the reagent a peptide tail, chemically or by genetic engineering, and immobilizing the grafted or fused product by passive adsorption or covalent linkage ofthe peptide tail.
According to a tenth aspect ofthe invention, there is provided a process for purifying, from a biological sample, a polypeptide or polypeptide derivative ofthe invention, which involves carrying out antibody-based affinity chromatography with the biological sample, wherein the antibody is a monospecific antibody ofthe invention.
For use in a purification process ofthe invention, the antibody can be polyclonal or monospecific, and preferably is ofthe IgG type. Purified IgGs can be prepared from an antiserum using standard methods (see, e.g., Coligan et al, supra). Conventional chromatography supports, as well as standard methods for grafting antibodies, are described, for example, by Harlow et al.
(Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1988).
Briefly, a biological sample, such as an H. pylori extract, preferably in a buffer solution, is applied to a chromatography material, which is, preferably, equilibrated with the buffer used to dilute the biological sample, so that the polypeptide or polypeptide derivative ofthe invention (i.e., the antigen) is allowed to adsorb onto the material. The chromatography material, such as a gel or a resin coupled to an antibody ofthe invention, can be in batch form or in a column. The unbound components are washed off and the antigen is eluted with an appropriate elution buffer, such as a glycine buffer, a buffer containing a chaotropic agent, e.g., guanidine HC1, or a buffer having high salt concentration (e.g., 3 M MgCl2). Eluted fractions are recovered and the presence ofthe antigen is detected, e.g., by measuring the absorbance at 280 nm.
An antibody ofthe invention can be screened for therapeutic efficacy as follows. According to an eleventh aspect ofthe invention, there is provided (i) a composition of matter containing a monospecific antibody ofthe invention, together with a diluent or carrier; (ii) a pharmaceutical composition containing a therapeutically or prophylactically effective amount of a monospecific antibody ofthe invention, and (iii) a method for treating or preventing Helicobacter (e.g., H. pylori, H felis, H. mustelae, or H. heilmanii) infection, by administering a therapeutic or prophylactic amount of a monospecific antibody ofthe invention to an individual in need of such treatment. In addition, the eleventh aspect ofthe invention includes the use of a monospecific antibody ofthe invention in the preparation of a medicament for treating or preventing Helicobacter infection. The monospecific antibody can be polyclonal or monoclonal, and is, preferably, predominantly ofthe IgA isotype. In passive immunization methods, the antibody is administered to a mucosal surface of a mammal, e.g., the gastric mucosa, e.g., orally or intragastrically, optionally, in the presence of a bicarbonate buffer. Alternatively, systemic administration, not requiring a bicarbonate buffer, can be carried out. A monospecific antibody ofthe invention can be administered as a single active agent or as a mixture with at least one additional monospecific antibody specific for a different Helicobacter polypeptide. The amount of antibody and the particular regimen used can be readily determined by one skilled in the art. For example, daily administration of about 100 to 1,000 mg of antibody over one week, or three doses per day of about 100 to 1,000 mg of antibody over two or three days, can be effective regimens for most purposes.
Therapeutic or prophylactic efficacy can be evaluated using standard methods in the art, e.g., by measuring induction of a mucosal immune response or induction of protective and/or therapeutic immunity, using, e.g., the H. felis mouse model and the procedures described by Lee et al. (Eur. J. Gasfroenterology & Hepatology 7:303, 1995) or Lee et al (J. Infect. Dis.
172:161, 1995). Those skilled in the art will recognize that the H felis strain of the model can be replaced with another Helicobacter strain. For example, the efficacy of polynucleotide molecules and polypeptides from H pylori is, preferably, evaluated in a mouse model using an H. pylori strain. Protection can be determined by comparing the degree of Helicobacter infection in the gastric tissue assessed by, for example, urease activity, bacterial counts, or gastritis, to that of a control group. Protection is shown when infection is reduced by comparison to the control group. Such an evaluation can be made for polynucleotides, vaccine vectors, polypeptides, and polypeptide derivatives, as well as for antibodies ofthe invention.
For example, various doses of an antibody ofthe invention can be administered to the gastric mucosa of mice previously challenged with an H pylori strain, as described, e.g., by Lee et al (supra). Then, after an appropriate period of time, the bacterial load ofthe mucosa can be estimated by assessing urease activity, as compared to a control. Reduced urease activity indicates that the antibody is therapeutically effective. Adjuvants that can be used in any ofthe vaccine compositions described above are described as follows. Adjuvants for parenteral administration include, for example, aluminum compounds, such as aluminum hydroxide, aluminum phosphate, and aluminum hydroxy phosphate. The antigen can be precipitated with, or adsorbed onto, the aluminum compound using standard methods. Other adjuvants, such as RIBI (ImmunoChem, Hamilton, MT), can also be used in parenteral administration.
Adjuvants that can be used for mucosal administration include, for example, bacterial toxins, e.g., the cholera toxin (CT), the E. coli heat-labile toxin (LT), the Clostridium difficile toxin A, the pertussis toxin (PT), and combinations, subunits, toxoids, or mutants thereof. For example, a purified preparation of native cholera toxin subunit B (CTB) can be used. Fragments, homologs, derivatives, and fusions to any of these toxins can also be used, provided that they retain adjuvant activity. Preferably, a mutant having reduced toxicity is used. Suitable mutants are described, e.g., in WO 95/17211 (Arg-7-Lys CT mutant), WO 96/6627 (Arg- 192-Gly LT mutant), and WO
95/34323 (Arg-9-Lys and Glu-129-Gly PT mutant). Additional LT mutants that can be used in the methods and compositions ofthe invention include, e.g., Ser-63-Lys, Ala-69-Gly, Glu-110- Asp, and Glu-112-Asp mutants. Other adjuvants, such as the bacterial monophosphoryl lipid A (MPLA) of, e.g., E. coli, Salmonella minnesota, Salmonella typhimurium, or Shigella flexneri; saponins, and polylactide glycolide (PLGA) microspheres, can also be used in mucosal administration. Adjuvants useful for both mucosal and parenteral administrations, such as polyphosphazene (WO 95/2415), can also be used. Any pharmaceutical composition ofthe invention, containing a polynucleotide, polypeptide, polypeptide derivative, or antibody ofthe invention, can be manufactured using standard methods. It can be formulated with a pharmaceutically acceptable diluent or carrier, e.g., water or a saline solution, such as phosphate buffer saline, optionally, including a bicarbonate salt, such as sodium bicarbonate, e.g., 0.1 to 0.5 M. Bicarbonate can advantageously be added to compositions intended for oral or intragastric administration. In general, a diluent or carrier can be selected on the basis of the mode and route of administration, and standard pharmaceutical practice.
Suitable pharmaceutical carriers and diluents, as well as pharmaceutical necessities for their use in pharmaceutical formulations, are described in Remington's Pharmaceutical Sciences, a standard reference text in this field and in the USP/NF. The invention also includes methods in which gastroduodenal infections, such as Helicobacter infection, are treated by oral administration of a Helicobacter polypeptide ofthe invention and a mucosal adjuvant, in combination with an antibiotic, an antisecretory agent, a bismuth salt, an antacid, sucralfate, or a combination thereof. Examples of such compounds that can be administered with the vaccine antigen and an adjuvant are antibiotics, including, e.g., macrolides, tetracyclines, β-lactams, aminoglycosides, quinolones, penicillins, and derivatives thereof (specific examples of antibiotics that can be used in the invention include, e.g., amoxicillin, clarithromycin, tetracycline, metronidizole, erythromycin, cefuroxime, and erythromycin); antisecretory agents, including, e.g., H2- receptor antagonists (e.g., cimetidine, ranitidine, famotidine, nizatidine, and roxatidine), proton pump inhibitors (e.g., omeprazole, lansoprazole, and pantoprazole), prostaglandin analogs (e.g., misoprostil and enprostil), and anticholinergic agents (e.g., pirenzepine, telenzepine, carbenoxolone, and proglumide); and bismuth salts, including colloidal bismuth subcitrate, tripotassium dicitrate bismuthate, bismuth subsalicylate, bicitropeptide, and pepto-bismol (see, e.g., Goodwin et al, Helicobacter pylori, Biology and Clinical Practice, CRC Press, Boca Raton, FL, pp 366-395, 1993; Physicians' Desk Reference, 49th edn., Medical Economics Data Production Company, Monrvale, New Jersey, 1995). In addition, compounds containing more than one ofthe above-listed components coupled together, e.g., ranitidine coupled to bismuth subcitrate, can be used. The invention also includes compositions for carrying out these methods, i.e., compositions containing a Helicobacter antigen (or antigens) ofthe invention, an adjuvant, and one or more ofthe above-listed compounds, in a pharmaceutically acceptable carrier or diluent. Amounts ofthe above-listed compounds used in the methods and compositions ofthe invention can readily be determined by one skilled in the art. In addition, one skilled in the art can readily design treatment/immunization schedules. For example, the non- vaccine components can be administered on days 1-14, and the vaccine antigen + adjuvant can be administered on days 7, 14, 21, and 28. Methods and pharmaceutical compositions ofthe invention can be used to treat or to prevent Helicobacter infections and, accordingly, gastroduodenal diseases associated with these infections, including acute, chronic, and afrophic gastritis, and peptic ulcer diseases, e.g., gastric and duodenal ulcers.
The invention is further illustrated by the following examples. Example 1 describes identification of genes, such as genes that encode the polypeptides ofthe invention, in the Helicobacter genome, as well as identification of signal sequences, and primer design for amplification of genes lacking signal sequences. Example 2 describes cloning of DNA molecules encoding polypeptides ofthe invention into a vector that provides a histidine tag, and production and purification ofthe resulting his-tagged fusion proteins.
Example 3 describes methods for cloning DNA encoding the polypeptides of the invention so that they can be produced without his-tags, and Example 4 describes methods for purifying recombinantly produced polypeptides ofthe invention.
EXAMPLE 1: Identification of genes in the H. pylori genome, identification of signal sequences, and primer design for amplification of genes lacking signal sequences l.A. Creating H. pylori genomic databases
The H. pylori genome was provided as a text file containing a single contiguous string of nucleotides that had been determined to be 1.76 Megabases in length. The complete genome was split into 17 separate files using the program SPLIT (Creativity in Action), giving rise to 16 contigs, each containing 100,000 nucleotides, and a 17th contig containing the remaining 76,000 nucleotides. A header was added to each ofthe 17 files using the format: >hpg0.txt (representing contig 1), .hpgl.txt (representing contig 2), etc. The resulting 17 files, named hpgO through hpg 16, were then copied together to form one file that represented the plus strand ofthe complete H. pylori genome. The constructed database was given the designation "Η." A negative strand database ofthe H. pylori genome was created similarly by first creating a reverse complement ofthe positive strand using the program SeqPup (D.G. Gilbert, Indiana University Biology Department) and then performing the same procedure as described above for the plus strand. This database was given the designation "N."
The regions predicted to encode open reading frames (ORFs) were defined for the complete H. pylori genome using the program GENEMARK™ (Borodovsky et al., Comp. Chem. 17:123, 1993). A database was created from a text file containing an annotated version of all ORFs predicted to be encoded by the H. pylori genome for both the plus and minus strands, and was given the designation "O." Each ORF was assigned a number indicating its location on the genome and its position relative to other genes. No manipulation ofthe text file was required.
l.B. Searching the H. pylori databases
The databases constructed as is described above were searched using the program FASTA (Pearson et al, Proc. Natl. Acad. Sci. USA 85:2444-2448, 1988). FASTA was used for searching either a DNA sequence against either of the gene databases ("Η" and/or "N"), or a peptide sequence against the ORF library ("O"). TFASTX was used to search a peptide sequence against all possible reading frames of a DNA database ("Η" and/or "N" libraries). Potential frameshifts also being resolved, FASTX was used for searching the translated reading frames of a DNA sequence against either a DNA database, or a peptide sequence against the protein database.
l.C. Isolation of DNA sequences from the H. pylori genome
The FASTA searches against the constructed DNA databases identified exact nucleotide coordinates on one or more ofthe isolated contigs, and therefore the location ofthe target DNA. Once the exact location ofthe target sequence was known, the contig identified to carry the gene was exported into the software package MapDraw (DNAStar, Inc.) and the gene was isolated. Gene sequences with flanking DNA was then excised and copied into the EditSeq. Software package (DNAStar, Inc.) for further analysis. l.D. Identification of signal sequences
The deduced protein encoded by a target gene sequence is analyzed using the PROTEAN software package (DNAStar, Inc.). This analysis predicts those areas ofthe protein that are hydrophobic by using the Kyte-Doolittle algorithm, and identifies any potential polar residues preceding the hydrophobic core region, which is typical for many signal sequences. For confirmation, the target protein is then searched against a PROSITE database (DNAStar, Inc.) consisting of motifs and signatures. Characteristic of many signal sequences and hydrophobic regions in general, is the identification of predicted prokaryotic lipid attachment sites. Where confirmation between the two approaches is apparent at the N-terminus of any protein, putative cleavage sites are sought. Specifically, this includes the presence of either an Alanine (A), Serine (S), or Glycine (G) residue immediately after the core hydrophobic region. In the case of lipoproteins, a Cysteine (C) residue would be identified as the +1 residue, post-cleavage.
I.E. Rational design of PCR primers based on the identification of signal sequences
In order to clone gene sequences as N-terminus translational fusions for the generation of recombinant proteins with N-terminal Histidine tags, the gene sequence that specifies the signal sequence is omitted. The 5'-end ofthe gene- specific portion ofthe N-terminal primer is designed to start at the first codon beyond the cleavage site. In the case of lipoproteins, the 5'-end ofthe N- terminal primer begins at the second codon, immediately after the modifiable residue at position +1 post-cleavage. The omission ofthe signal sequence from the recombinant allows for one-step purification, and potential problems associated with insertion of signal sequences in the membrane ofthe host strain carrying the hybrid construct are avoided.
EXAMPLE 2: Preparation of isolated DNA encoding the polypeptides of the invention, and production of these polypeptides as histidine-tagged fusion proteins
2.A. Preparation of genomic DNA from Helicobacter pylori
H. pylori strain ORV2001, stored in LB medium containing 50% glycerol at -70 °C, is grown on Colombia agar containing 7% sheep blood for 48 hours under microaerophilic conditions (8-10% C02, 5-7% 02, 85-87% N2). Cells are harvested, washed with phosphate buffer saline (PBS) (pH 7.2), and
DNA is then extracted from the cells using the Rapid Prep Genomic DNA Isolation kit (Pharmacia Biotech).
2.B. PCR amplification DNA molecules encoding the polypeptides ofthe invention are amplified from genomic DNA, as can be prepared as is described above, by the Polymerase Chain Reaction (PCR) using primers that can readily be designed by one skilled in the art. Specific examples of primers that can be used in the invention are shown in Table 1. As specific examples, to amplify genes encoding GHPO 147, GHPO 615, GHPO 961, GHPO 1282, GHPO 296, and
GHPO 840 the following primers can be used:
GHPO 147: 5'-CTGAATTCGAATGAAAAGAATTTTAGTCTCT-3' (SEQ ID NO: 1365), and 5'-CCGCTCGAGTTAAAACTCATAATTCAAAT-3' (SEQ ID NO: 1366).
GHPO 615: 5*-CGCGGATCCGAAGACATGTGCAACCGATG-3' (SEQ ID NO: 1367), and
5'-CCGCTCGAGCTAAAAGTTTTGCAAAATCAC-3' (SEQ ID NO:1368). GHPO 961: 5'-CGCGGATCCGATTTTACTTGAAAAATTTAAAC-3* (SEQ ID NO: 1369), and 5'-CCGCTCGAGTTAGAAAGTGTAGTTCAAATAC-3' (SEQ ID
NO: 1370). GHPO 1282: 5'-GCGGATCCTTTTCTTCAATGTTTG-3" (SEQ ID NO:1371), and
5'-CCGCTCGAGTCAAAGTTTTAAACAAATTC-3' (SEQ ID NO: 1372).
GHPO 296: 5'-CCGAATTCGGTTATAAAGCCCCT-3' (SEQ ID NO: 1373), and
5'-CCGCTCGAGTTAAGGCTGATTTAA-3' (SEQ ID NO: 1374). GHPO 840: 5'-CGCGGATCCGAGGAAATAGCATGTTAATAACC-3' (SEQ ID NO: 1375), and
5'-CCGCTCGAGTCACTGCTTGCATGACTTATTCCA-3' (SEQ ID NO:1376).
The N-terminal and C-terminal primers for each clone can each include a 5' clamp and a restriction enzyme recognition sequence for cloning purposes (for example, BamΗl (GGATCC) and Xhol (CTCGAG) recognition sequences).
Amplification of gene-specific DNA is carried out using Vent DNA Polymerase (New England Biolabs) or Taq DNA polymerase (Appligene), according to the manufacturer's instructions. The reaction mixture, which is brought to a final volume of 100 μl with distilled water, is as follows: dNTPs mix 200 μM lOx ThermoPol buffer 10 μl primers 300 nM each
DNA template 50 ng
Heat-stable DNA polymerase 2 units
Appropriate amplification reaction conditions can readily be determined by one skilled in the art. For example, the following conditions can be used for amplification of DNA encoding GHPO 615 using the primers set forth above: initial denaturation at 94°C for 5 minutes, 25 cycles of denaturation at 97°C for 30 seconds, hybridization at 55°C for 1 minute, and elongation at 72°C for 2 minutes, using Vent DNA polymerase. In the case of amplifying DNA encoding GHPO 1282 with the primers set forth above, the following conditions can be used: initial denaturation at 94°C for 5 minutes, 25 cycles of denaturation at 94°C for 30 seconds, hybridization at 45°C for 30 seconds, and elongation at 72°C for 30 seconds, followed by a final elongation at 72°C for 7 minutes, using Vent DNA polymerase. The following conditions can be used for amplification of DNA encoding GHPO 840 using the primers set forth above: 25 cycles of denaturation at 97°C for 30 seconds, hybridization at 55°C for 1 minute, and elongation at 72°C for 2 minutes using Vent DNA polymerase. Table 1 sets forth conditions for using the primers listed therein.
2.C. Transformation and selection of transformants
A single PCR product is thus amplified and then is digested at 37 °C for 2 hours with BamHI and Xhol together in a 20 μl reaction volume. The digested product is ligated to similarly cleaved pET28a (Novagen) that is dephosphorylated prior to the ligation by treatment with Calf Intestinal Alkaline Phosphatase (CIP). The gene fusion constructed in this manner allows one-step affinity purification ofthe resulting fusion protein because ofthe presence of histidine residues at the N-terminus ofthe fusion protein, which are encoded by the vector.
The ligation reaction (20 μl) is carried out at 14 °C overnight and then is used to transform 100 μl fresh E. coli XL 1 -blue competent cells (Novagen). The cells are incubated on ice for 2 hours, heat-shocked at 42 °C for 30 seconds, and returned to ice for 90 seconds. The samples are then added to
1 ml LB broth in the absence of selection and grown at 37° C for 2 hours. The cells are plated out on LB agar containing kanamycin (50 μg/ml) at a 1 Ox and neat dilution and incubated overnight at 37 °C. The following day, 50 colonies are picked, plated onto secondary plates, and incubated at 37 °C overnight. Five colonies are picked, grown in 3 ml LB broth supplemented with kanamycin (100 μg/ml), and grown overnight at 37°C. Plasmid DNA is extracted using the Quiagen mini-prep method and is quantitated by agarose gel electrophoresis.
PCR is performed with the gene-specific primers under the conditions set forth above and transformant DNA is confirmed to contain the desired insert. If PCR-positive, one ofthe five plasmid DNA samples (500 ng) extracted from the E. coli XL 1 -blue cells is used to transform competent BL21 (λDE3) E. coli competent cells (Novagen; as described previously). Transformants (10) are picked, plated onto selective kanamycin (50 μg/ml)- containing LB agar plates, and stored as a research stock in LB containing
50% glycerol.
2.D. Purification of recombinant proteins
One ml of frozen glycerol stock prepared as described in 2.C. is used to inoculate 50 ml of LB medium containing 25 μg/ml kanamycin in a 250 ml
Erlenmeyer flask. The flask is incubated at 37°C for 2 hours or until the absorbance at 600 nm (OD600) reaches 0.4-1.0. The culture is stopped from growing by placing the flask at 4°C overnight. The following day, 10 ml ofthe overnight culture is used to inoculate 240 ml LB medium containing kanamycin (25 μg/ml), with the initial OD600 being about 0.02-0.04. Four flasks are inoculated for each ORF. The cells are grown to an OD600 of 1.0 (about 2 hours at 37°C), a 1 ml sample is harvested by centrifugation, and the sample is analyzed by SDS-PAGE to detect any leaky expression. The remaining culture is induced with 1 mM IPTG and the induced cultures are grown for an additional 2 hours at 37°C.
The final OD600 reading is taken and the cells are harvested by centrifugation at 5,000 x g for 15 minutes at 4°C. The supernatant is discarded and the pellets are resuspended in 50 mM Tris-HCl (pH 8.0), 2 mM EDTA. Two hundred and fifty ml of buffer are used for each 1 L of culture and the cells are recovered by centrifugation at 12,000 x g for 20 minutes. The supernatant is discarded and the pellets are stored at -45°C.
2. E. Protein purification
Pellets obtained using the methods described in 2.D. are thawed and resuspended in 95 ml of 50 mM Tris-HCl (pH 8.0). Pefabloc and lysozyme are added to final concentrations of 100 μM and 100 μg/ml, respectively. The mixture is homogenized with magnetic stirring at 5°C for 30 minutes.
Benzonase (Merck) is added to a final concentration of 1 U/ml, in the presence of 10 mM MgCl2, to ensure total digestion ofthe DNA. The suspension is sonicated (Branson Sonifier 450) for 3 cycles of 2 minutes each at maximum output. The homogenate is centrifuged at 19,000 x g for 15 minutes and both the supernatant and the pellet are analyzed by SDS-PAGE to detect the cellular location ofthe target protein in the soluble or insoluble fractions, as is described further below.
2.E.I. Soluble fraction If the target protein is produced in a soluble form (i.e., in the supernatant obtained using the methods described in 2.E.) NaCl and imidazole are added to the supernatant to final concentrations of 50 mM Tris-HCl (pH 8.0), 0.5 M
NaCl, and 10 mM imidazole (buffer A). The mixture is filtered through a 0.45 μm membrane and loaded onto an IMAC column (Pharmacia HiTrap chelating Sepharose; 1 ml), which has been charged with nickel ions according to the manufacturer's recommendations. After loading, the column is washed with 50 column volumes of buffer A and the recombinant protein is eluted with 5 ml of buffer B (50 mM Tris-HCl (pH 8.0), 0.5 M NaCl, 500 mM imidazole).
The elution profile is monitored by measuring the absorbance ofthe fractions at 280 nm. Fractions corresponding to the protein peak are pooled, dialyzed against PBS containing 0.5 M arginine, filtered through a 0.22 μm membrane, and stored at -45°C.
2.E.2. Insoluble fraction
If the target protein is expressed in the insoluble fraction (pellets obtained using the methods described in 2.E.), purification is conducted under denaturing conditions. NaCl, imidazole, and urea are added to the resuspended pellet to final concentrations of 50 mM Tris-HCl (pH 8.0), 0.5 M NaCl, 10 mM imidazole, and 6 M urea (buffer C). After complete solubilization, the mixture is filtered through a 0.45 μm membrane and loaded onto an IMAC column. The purification procedures on the IMAC column are the same as are described in 2.E.I., except that 6 M urea is included in all ofthe buffers used and 10 column volumes of buffer C are used to wash the column after protein loading, instead of 50 column volumes. The protein fractions eluted from the IMAC column with buffer D (buffer C containing 500 mM imidazole) are pooled. Arginine is added to the solution to a final concentration of 0.5 M, and the mixture is dialyzed against PBS containing 0.5 M arginine and various concentrations of urea (4 M, 3 M, 2 M, 1 M, and 0.5 M) to progressively decrease the concentration of urea. The final dialysate is filtered through a 0.22 μm membrane and stored at -45 °C.
Alternatively, when the above-described purification process is not as efficient as it should be, two other processes can be used and are described as follows. A first alternative involves the use of a mild denaturant, N-octyl glucoside (NOG). Briefly, a pellet obtained as is described in 2.E. is homogenized in a solution of 5 mM imidazole, 500 mM sodium chloride, and
20 mM Tris-HCl (pH 7.9) by microfluidization at a pressure of 15,000 psi, and is clarified by centrifugation at 4,000-5,000 x g. The pellet is recovered, resuspended in 50 mM NaP04 (pH 7.5) containing 1-2% weight /volume NOG, and homogenized. The NOG-soluble impurities are removed by centrifugation. The pellet is extracted once more by repeating the preceding extraction step.
The pellet is dissolved in 8 M urea, 50 mM Tris (pH 8.0). The urea-solubilized protein is diluted with an equal volume of 2 M arginine, 50 mM Tris (pH 8.0), and is dialyzed against 1 M arginine for 24-48 hours to remove the urea. The final dialysate is filtered through a 0.22 μm membrane and stored at -45°C. A second alternative involves the use of a strong denaturant, such as guanidine hydrochloride. Briefly, a pellet obtained as is described in 2.E. is homogenized in a solution of 5 mM imidazole, 500 mM sodium chloride, and 20 mM Tris-HCl (pH 7.9) by microfluidization at a pressure of 15,000 psi, and is clarified by centrifugation at 4,000-5,000 x g. The pellet is recovered, resuspended in 6 M guanidine hydrochloride, and passed through an IMAC column charged with Ni"". The bound antigen is eluted with 8 M urea (pH 8.5). β-mercaptoethanol is added to the eluted protein to a final concentration of 1 mM, and then the eluted protein is passed through a Sephadex G-25 column equilibrated in 0.1 M acetic acid. Protein eluted from the column is slowly added to 4 volumes of 50 mM phosphate buffer (pH 7.0), and the protein remains in solution.
2.F. Evaluation of the protective activity of the purified protein
Groups of 10 OF 1 mice (IFFA Credo) are immunized rectally with 25 μg ofthe purified recombinant protein, admixed with 1 μg of cholera toxin (Berna) in physiological buffer. Mice are immunized on days 0, 7, 14, and 21. Fourteen days after the last immunization, the mice are challenged with H. pylori strain ORV2001, grown in liquid media (the cells are grown on agar plates, as described in 2.A., and, after harvest, are resuspended in Brucella broth; the flasks are then incubated overnight at 37 °C). Fourteen days after challenge, the mice are sacrificed and their stomachs are removed. The amount of H. pylori is determined by measuring the urease activity in the stomach and by culture.
2.G. Production of monospecific polyclonal antibodies 2.G.I. Hyperimmune rabbit antiserum New Zealand rabbits are injected both subcutaneously and intramuscularly with 100 μg of a purified fusion polypeptide, as obtained using the methods described in 2.E.I. or 2.E.2., in the presence of Freund's complete adjuvant and in a total volume of approximately 2 ml. Twenty one and 42 days after the initial injection, booster doses, which are identical to the priming doses, except that Freund's incomplete adjuvant is used, are administered in the same way. Fifteen days after the last injection, animal serum is recovered, decomplemented, and filtered through a 0.45 μm membrane.
2.G.2. Mouse hyperimmune ascites fluid
Ten mice are injected subcutaneously with 10-50 μg of a purified fusion polypeptide as obtained using the methods described in 2.E.1. or 2.E.2., in the presence of Freund's complete adjuvant and in a volume of approximately 200 μl. Seven and 14 days after the initial injection, booster doses, which are identical to the priming doses, except that Freund's incomplete adjuvant is used, are administered in the same way. Twenty one and 28 days after the initial infection, mice receive 50 μg ofthe antigen alone infraperitoneally. On day 21, mice are also injected infraperitoneally with sarcoma 180/TG cells CM26684 (Lennette et al, Diagnostic Procedures for Viral, Rickettsial, and Chlamydial Infections, 5th Ed. Washington DC, American Public Health Association, 1979). Ascites fluid is collected 10-13 days after the last injection.
EXAMPLE 3: Methods for producing transcriptional fusions lacking His- tags
Methods for amplification and cloning of DNA encoding the polypeptides ofthe invention as transcriptional fusions lacking His-tags are described as follows. Two PCR primers for each clone are designed based upon the sequences ofthe polynucleotides that encode them (see the attached sequence listing, odd numbers, up to SEQ ID NO: 1363). These primers can be used to amplify DNA encoding the polypeptides ofthe invention from any H. pylori strain, including, for example, ORV2001 and the strain deposited as ATCC deposit number 43579, as well as from other Helicobacter species. The N-terminal primers are designed to include the ribosome binding site ofthe target gene, the ATG start site, and any signal sequence and cleavage site. The N-terminal primers can include a 5' clamp and a restriction endonuclease recognition site, such as that for BamHI (GGATCC), which facilitates subsequent cloning. Similarly, the C-terminal primers can include a restriction endonuclease recognition site, such as that for Xhol (CTCGAG), which can be used in subsequent cloning, and a TAA stop codon.
Amplification of genes encoding the polypeptides ofthe invention can be carried out using Thermalase DNA Polymerase under the conditions described above in Example 2. Alternatively, Vent DNA polymerase (New England Biolabs), Pwo DNA polymerase (Boehringer Mannheim), or Taq
DNA polymerase (Appligene) can be used, according to instructions provided by the manufacturers.
A single PCR product for each clone is amplified and cloned into appropriately cleaved pET 24 (e.g., BamHl-Xhό cleaved pET 24), resulting in the construction of a franscriptional fusion that permits expression of the proteins without His-tags. The expressed products can be purified as denatured proteins that are refolded by dialysis into 1 M arginine.
Cloning into pET 24 allows transcription ofthe genes from the T7 promoter, which is supplied by the vector, but relies upon binding ofthe RNA- specific DNA polymerase to the intrinsic ribosome binding sites ofthe genes, and thereby expression ofthe complete ORF. The amplification, digestion, and cloning protocols that can be used in this method are as described above for constructing translational fusions. EXAMPLE 4: Purification of the polypeptides of the invention by immunoaffinity
4.A. Purification of specific IgGs
An immune serum, as prepared as is described in section 2.G., is applied to a protein A Sepharose Fast Flow column (Pharmacia) equilibrated in 100 mM Tris-HCl (pH 8.0). The resin is washed by applying 10 column volumes of 100 mM Tris-HCl and 10 volumes of 10 mM Tris-HCl (pH 8.0) to the column. IgG antibodies are eluted with 0.1 M glycine buffer (pH 3.0) and are collected as 5 ml fractions to each of which is added 0.25 ml 1 M Tris-HCl (pH 8.0). The optical density ofthe eluate is measured at 280 nm and fractions containing the IgG antibodies are pooled, dialyzed against 50 mM Tris-HCl
(pH 8.0), and, if necessary, stored frozen at -70°C.
4.B. Preparation of the column
An appropriate amount of CNBr-activated Sepharose 4B gel (1 g of dried gel provides for approximately 3.5 ml of hydrated gel; gel capacity is from 5 to 10 mg coupled IgG/ml of gel) manufactured by Pharmacia (17-0430- 01) is suspended in 1 mM HCl buffer and washed with a buchner by adding small quantities of 1 mM HCl buffer. The total volume of buffer is 200 ml per gram of gel. Purified IgG antibodies are dialyzed for 4 hours at 20±5°C against
50 volumes of 500 mM sodium phosphate buffer (pH 7.5). The antibodies are then diluted in 500 mM phosphate buffer (pH 7.5) to a final concentration of 3 mg/ml.
IgG antibodies are mixed with the gel overnight at 5±3°C. The gel is packed into a chromatography column and is washed with 2 column volumes of
500 mM phosphate buffer (pH 7.5), and 1 column volume of 50 mM sodium phosphate buffer, containing 500 mM NaCl (pH 7.5). The gel is then transferred to a tube, mixed with 100 mM ethanolamine (pH 7.5) for 4 hours at room temperature, and washed twice with 2 column volumes of PBS. The gel is then stored in 1/10,000 PBS/merthiolate. The amount of IgG antibodies coupled to the gel is determined by measuring the optical density (OD) at 280 nm ofthe IgG solution and the direct eluate, plus washings.
4.C. Adsorption and elution of the antigen
An antigen solution in 50 mM Tris-HCl (pH 8.0), 2 mM EDTA, for example, the supernatant or the solubilized pellet obtained using the methods described in 3.E., after centrifugation and filtration through a 0.45 μm membrane, is applied to a column equilibrated with 50 mM Tris-HCl (pH 8.0), 2 mM EDTA, at a flow rate of about 10 ml/hour. The column is then washed with 20 volumes of 50 mM Tris-HCl (pH 8.0), 2 mM EDTA. Alternatively, adsorption can be achieved by mixing overnight at 5±3°C. The adsorbed gel is washed with 2 to 6 volumes of 10 mM sodium phosphate buffer (pH 6.8) and the antigen is eluted with 100 mM glycine buffer (pH 2.5). The eluate is recovered in 3 ml fractions, to each of which is added 150 μl of 1 M sodium phosphate buffer (pH 8.0). Absorption is measured at 280 nm for each fraction; those fractions containing the antigen are pooled and stored at -20 °C.
Figure imgf000090_0001
Figure imgf000091_0001
Figure imgf000091_0002
- 90 -
Figure imgf000092_0001
Figure imgf000093_0001
Figure imgf000093_0002
V r
Figure imgf000094_0001
Figure imgf000094_0002
MISSING UPON TIME OF PUBLICATION
SEQUENCE LISTING (1) GENERAL INFORMATION
(i) APPLICANT: MERIEUX ORAVAX SOCIETE EN NOM COLLECTIF PASTEUR MERIEUX SERUMS ET VACCINS S.A. HUMAN GENOME SCIENCES, INC.
(ii) TITLE OF THE INVENTION: Identification of Polynucleotides
Encoding Novel Helicobacter Polypeptides in the Helicobacter Genome
(iii) NUMBER OF SEQUENCES: 1376
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Clark & Elbing LLP
(B) STREET: 176 Federal Street
(C) CITY: Boston
(D) STATE: MA
(E) COUNTRY: USA
(F) ZIP: 02110
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Diskette
(B) COMPUTER: IBM Compatible
(C) OPERATING SYSTEM: DOS
(D) SOFTWARE: FastSEQ for Windows Version 2.0
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: PCT/US98/
(B) FILING DATE: 01-APR-98
(C) CLASSIFICATION:
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 08/833,457
(B) FILING DATE: 01-APR-1997
(vii) PRIOR APPLICATION DATA: 08/881,227
(A) APPLICATION NUMBER: 24-JUN-1997
(B) FILING DATE:
(vii) PRIOR APPLICATION DATA: 08/902,615
(A) APPLICATION NUMBER: 29-JUL-1997
(B) FILING DATE:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Clark, Paul T.
(B) REGISTRATION NUMBER: 30,162
(C) REFERENCE/DOCKET NUMBER: 06132/041WO1
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 617-428-0200
(B) TELEFAX: 617-428-7045
(C) TELEX: (2) INFORMATION FOR SEQ ID NO : 1 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 265 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...212 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 :
TTTTTTAGTT TGTTTTTGAG TATAATCCTA CGAAAATTTT AAGGAACGGC ATG GAG 56
Met Glu 1
TTT TTG GGA CTG ATT TTA AGT CTG GCC GCT ATT TTG ATA GCG TTT AAA 104 Phe Leu Gly Leu lie Leu Ser Leu Ala Ala lie Leu lie Ala Phe Lys 5 10 15
AAG CCT GAA AAA GAA AAT TGG GCG TTT GGG ATT TTG ATG GTG GTG TGG 152 Lys Pro Glu Lys Glu Asn Trp Ala Phe Gly lie Leu Met Val Val Trp 20 25 30
TTA GTG GAG CTT ATT ATT TTT ATA GCC CAC AGC TCT AGC GTT TTG CCT 200 Leu Val Glu Leu lie lie Phe lie Ala His Ser Ser Ser Val Leu Pro 35 40 45 50
AAC ATG AAT CTA TAAGGGGGAT GCATGGATAA AGAAACCCGA TTTTACAACC TTTTT 257 Asn Met Asn Leu
TCTTTGGC 265
(2) INFORMATION FOR SEQ ID NO : 2 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 54 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : Met Glu Phe Leu Gly Leu lie Leu Ser Leu Ala Ala lie Leu lie Ala
1 5 10 15
Phe Lys Lys Pro Glu Lys Glu Asn Trp Ala Phe Gly lie Leu Met Val
20 25 30
Val Trp Leu Val Glu Leu lie lie Phe lie Ala His Ser Ser Ser Val
35 40 45
Leu Pro Asn Met Asn Leu 50
(2) INFORMATION FOR SEQ ID NO : 3 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 670 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...617 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 :
CCCATAGACG ACAAAATCAA GCGGTTTTAT CAAAACCAAA AAACTTTAGA ATG AAA 56
Met Lys 1
AAA ATT GCT TTC ATT TTG GCT TTA TGG GTG GGC TTG TTA GGG GCG TTT 104 Lys lie Ala Phe lie Leu Ala Leu Trp Val Gly Leu Leu Gly Ala Phe 5 10 15
GAG CCT AAA AAA AGT CAT ATT TAT TTT GGG GCT ATG GTG GGT TTA GCT 152 Glu Pro Lys Lys Ser His lie Tyr Phe Gly Ala Met Val Gly Leu Ala 20 25 30
CCT ATT AAA ATA ACC CCA AAA CCG GCT AGT GAT TCT TCT TAT ACG GCT 200 Pro lie Lys lie Thr Pro Lys Pro Ala Ser Asp Ser Ser Tyr Thr Ala 35 40 45 50
TTT TTA TGG GGG GCT AAA GGA GGG TAT CAA TTC GCT TTT TTT AAA GCT 248 Phe Leu Trp Gly Ala Lys Gly Gly Tyr Gin Phe Ala Phe Phe Lys Ala 55 60 65
CTA GCG TTA AGG GGT GAA TTT TCC TAC CTT ATG GCA ATC AAA CCC ACC 296 Leu Ala Leu Arg Gly Glu Phe Ser Tyr Leu Met Ala lie Lys Pro Thr 70 75 80
GCA CTG CAC ACG ATT AAC ACT TCT TTA TTG AGC TTA AAT ATT GAT GTG 344 Ala Leu His Thr lie Asn Thr Ser Leu Leu Ser Leu Asn lie Asp Val 85 90 95 TTA AGC GAT TTT TAC ACT TAC AAA AAA TAC AGC TTT GGG GTG TAT GGG 392 Leu Ser Asp Phe Tyr Thr Tyr Lys Lys Tyr Ser Phe Gly Val Tyr Gly 100 105 110
GGG CTT GGG ATA GGG TAT TTT TAT CAA AGC AAC CAT TTA GGC ATG AAA 440 Gly Leu Gly lie Gly Tyr Phe Tyr Gin Ser Asn His Leu Gly Met Lys 115 120 125 130
AAT AGT TCG TTT ATG GGT TAT AAC GGC TTG TTT AAT GTG GGG CTT GGC 488 Asn Ser Ser Phe Met Gly Tyr Asn Gly Leu Phe Asn Val Gly Leu Gly 135 140 145
AGC ACG ATC GAT CGC CAC CAC CGC ATA GAG CTT GGG GCT AAA ATC CCT 536 Ser Thr lie Asp Arg His His Arg lie Glu Leu Gly Ala Lys lie Pro 150 155 160
TTT TCA AAG ACT AGA AAT TCT TTT AAA AAT CCT TAT TTT TTA GAG AGC 584 Phe Ser Lys Thr Arg Asn Ser Phe Lys Asn Pro Tyr Phe Leu Glu Ser 165 170 175
GTT TTT ATC CAT GCG ACT TAT AGC TAT ATG TTT TAAGAGAGAA TAGCCTATTA 637 Val Phe lie His Ala Thr Tyr Ser Tyr Met Phe 180 185
GTGGTCGTTA TCAATAAGAT AAGATCCTTA ATG 670
(2) INFORMATION FOR SEQ ID NO : 4 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 189 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 :
Met Lys Lys lie Ala Phe lie Leu Ala Leu Trp Val Gly Leu Leu Gly
1 5 10 15
Ala Phe Glu Pro Lys Lys Ser His lie Tyr Phe Gly Ala Met Val Gly
20 25 30
Leu Ala Pro lie Lys lie Thr Pro Lys Pro Ala Ser Asp Ser Ser Tyr
35 40 45
Thr Ala Phe Leu Trp Gly Ala Lys Gly Gly Tyr Gin Phe Ala Phe Phe
50 55 60
Lys Ala Leu Ala Leu Arg Gly Glu Phe Ser Tyr Leu Met Ala lie Lys 65 70 75 80
Pro Thr Ala Leu His Thr lie Asn Thr Ser Leu Leu Ser Leu Asn lie
85 90 95
Asp Val Leu Ser Asp Phe Tyr Thr Tyr Lys Lys Tyr Ser Phe Gly Val
100 105 110
Tyr Gly Gly Leu Gly He Gly Tyr Phe Tyr Gin Ser Asn His Leu Gly 115 120 125 Met Lys Asn Ser Ser Phe Met Gly Tyr Asn Gly Leu Phe Asn Val Gly
130 135 140
Leu Gly Ser Thr He Asp Arg His His Arg He Glu Leu Gly Ala Lys 145 150 155 160
He Pro Phe Ser Lys Thr Arg Asn Ser Phe Lys Asn Pro Tyr Phe Leu
165 170 175
Glu Ser Val Phe He His Ala Thr Tyr Ser Tyr Met Phe 180 185
(2) INFORMATION FOR SEQ ID NO : 5 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 434 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...380 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 :
AGCGTGAAAA AAATTGAGTT GAATCAAAAC CTGCATTAAG GATTAAAAGA ATG CTC 56
Met Leu 1
AAA AAA AGT TTG TTA TTG CTT GTT TTT TTA GTC TTA CAG CTT AGC GGC 104 Lys Lys Ser Leu Leu Leu Leu Val Phe Leu Val Leu Gin Leu Ser Gly 5 10 15
GCT GAA GAA AAC AAT CAA GCC CCA AAA AAC ACG CCC CCT GAA TTA AAC 152 Ala Glu Glu Asn Asn Gin Ala Pro Lys Asn Thr Pro Pro Glu Leu Asn 20 25 30
CCC GCT AAC GCT AAG GGC GCG CCA AAC TCT AAC ACC CAG ATC ACC CCT 200 Pro Ala Asn Ala Lys Gly Ala Pro Asn Ser Asn Thr Gin He Thr Pro 35 40 45 50
AAA AAC GAT AAC TCT AAC CTG TTA GAC AAA TTA GGT TCG CCT GAA AAC 248 Lys Asn Asp Asn Ser Asn Leu Leu Asp Lys Leu Gly Ser Pro Glu Asn 55 60 65
GCT CAA ACC GAG CTT TCT GCC GGT ATT GAT TTG GCT AAA AAG GGC GAT 296 Ala Gin Thr Glu Leu Ser Ala Gly He Asp Leu Ala Lys Lys Gly Asp 70 75 80
TAT CAA GGG GCT TTC AAG CTT TTT TCC CAA TCG TGC GAT AAT GGT AAT 344 Tyr Gin Gly Ala Phe Lys Leu Phe Ser Gin Ser Cys Asp Asn Gly Asn 85 90 95
GCG GCC GGG TGT TTT GCA AGT GGG GGC GAT GTA TGC TAATGGGGTA GGGATC 396 Ala Ala Gly Cys Phe Ala Ser Gly Gly Asp Val Cys 100 105 110
CAAACCAACA GATTAAAAGC CGCTCGCTAT TATGAATG 434
(2) INFORMATION FOR SEQ ID NO : 6 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 110 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 :
Met Leu Lys Lys Ser Leu Leu Leu Leu Val Phe Leu Val Leu Gin Leu
1 5 10 15
Ser Gly Ala Glu Glu Asn Asn Gin Ala Pro Lys Asn Thr Pro Pro Glu
20 25 30
Leu Asn Pro Ala Asn Ala Lys Gly Ala Pro Asn Ser Asn Thr Gin He
35 40 45
Thr Pro Lys Asn Asp Asn Ser Asn Leu Leu Asp Lys Leu Gly Ser Pro
50 55 60
Glu Asn Ala Gin Thr Glu Leu Ser Ala Gly He Asp Leu Ala Lys Lys 65 70 75 80
Gly Asp Tyr Gin Gly Ala Phe Lys Leu Phe Ser Gin Ser Cys Asp Asn
85 90 95
Gly Asn Ala Ala Gly Cys Phe Ala Ser Gly Gly Asp Val Cys 100 105 110
(2) INFORMATION FOR SEQ ID NO : 7 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 575 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 73...522 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : CCACAAAAGC TTAATGAATA TTAAATTAAA AACATGTTAA TCTTTAGTTA TTTTTAAAAT 60 TTAGGAAATC CC ATG CAT CAA AAC AAT AAA ACT TTT TTA CCC AGC CAA TCC 111 Met His Gin Asn Asn Lys Thr Phe Leu Pro Ser Gin Ser 1 5 10
GCT CAC CTC TCT AAA ATC ATT CTT TTT TTA AAC ACC GGC TTT TTA GCC 159 Ala His Leu Ser Lys He He Leu Phe Leu Asn Thr Gly Phe Leu Ala 15 20 25
TAT CTG TTA AGC GCT TGT GGG GCG AAT GTG CCT ATA GAA GAA GTG TTG 207 Tyr Leu Leu Ser Ala Cys Gly Ala Asn Val Pro He Glu Glu Val Leu 30 35 40 45
GTT AAA GAT CCT AAA GAG ACC AAA GCC CAA GAA GTC GCC AGA GAA GAA 255 Val Lys Asp Pro Lys Glu Thr Lys Ala Gin Glu Val Ala Arg Glu Glu 50 55 60
AAG GCT ATC CAG CAA GAA AAC GCC ACT ATT GAT GCG CGC ACC ACG CCT 303 Lys Ala He Gin Gin Glu Asn Ala Thr He Asp Ala Arg Thr Thr Pro 65 70 75
TTA ATC AAT CGT TTC ACT AAT TAT AGC GCT TAT GGC TCT TTA AAC GGC 351 Leu He Asn Arg Phe Thr Asn Tyr Ser Ala Tyr Gly Ser Leu Asn Gly 80 85 90
TTT TAC AAT TCA GTG GAT AAT CTC AAT TCG CCC ATG CAA AAC GGG ATG 399 Phe Tyr Asn Ser Val Asp Asn Leu Asn Ser Pro Met Gin Asn Gly Met 95 100 105
TAT GGA GGC TAT TAC ATG CCT TAT TAT TAC ATG CCC TAT GGT TTC ATG 447 Tyr Gly Gly Tyr Tyr Met Pro Tyr Tyr Tyr Met Pro Tyr Gly Phe Met 110 115 120 125
CCT TAT GGG TCA GGT CTT ATG CCT TAT GGG CCT TAT GGG TAT GGA GCG 495 Pro Tyr Gly Ser Gly Leu Met Pro Tyr Gly Pro Tyr Gly Tyr Gly Ala 130 135 140
CCT GGA TAC TTC CCT TAC GCT TTT TAT TGATTGAGTG GCTTTAGAAA GCGTGGT 549 Pro Gly Tyr Phe Pro Tyr Ala Phe Tyr 145 150
GGTGTTGGTG TTTTTACTCA AACACG 575
(2) INFORMATION FOR SEQ ID NO : 8 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 150 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : Met His Gin Asn Asn Lys Thr Phe Leu Pro Ser Gin Ser Ala His Leu
1 5 10 15
Ser Lys He He Leu Phe Leu Asn Thr Gly Phe Leu Ala Tyr Leu Leu
20 25 30
Ser Ala Cys Gly Ala Asn Val Pro He Glu Glu Val Leu Val Lys Asp
35 40 45
Pro Lys Glu Thr Lys Ala Gin Glu Val Ala Arg Glu Glu Lys Ala He
50 55 60
Gin Gin Glu Asn Ala Thr He Asp Ala Arg Thr Thr Pro Leu He Asn 65 70 75 80
Arg Phe Thr Asn Tyr Ser Ala Tyr Gly Ser Leu Asn Gly Phe Tyr Asn
85 90 95
Ser Val Asp Asn Leu Asn Ser Pro Met Gin Asn Gly Met Tyr Gly Gly
100 105 110
Tyr Tyr Met Pro Tyr Tyr Tyr Met Pro Tyr Gly Phe Met Pro Tyr Gly
115 120 125
Ser Gly Leu Met Pro Tyr Gly Pro Tyr Gly Tyr Gly Ala Pro Gly Tyr
130 135 140
Phe Pro Tyr Ala Phe Tyr 145 150
(2) INFORMATION FOR SEQ ID NO : 9 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 910 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...860 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 :
ATTATGTTAT TATTATACGA AATGTAGACT TTTAGAAGGA AAAATGTTGT ATG AAA 56
Met Lys 1
AAG TTT GTA GTG TTT AAA ACG CTC TGT TTA TCG GTA GTG TTA GGT AAT 104 Lys Phe Val Val Phe Lys Thr Leu Cys Leu Ser Val Val Leu Gly Asn 5 10 15
AGT CTT GTG GCA GCA GAA GGC AGC ACA GAA GTG CAA AAG CAA TTG GAA 152 Ser Leu Val Ala Ala Glu Gly Ser Thr Glu Val Gin Lys Gin Leu Glu 20 25 30
AAG CCA AAA GAG TAT AAA GCA GTG AAA GGC GAG AAA AAC GCT TGG TAT 200 Lys Pro Lys Glu Tyr Lys Ala Val Lys Gly Glu Lys Asn Ala Trp Tyr 35 40 45 50
TTG GGG ATT AGC TAT CAA GTC GGT CAG GCT TCG CAA AGC GTT AAA AAC 248 Leu Gly He Ser Tyr Gin Val Gly Gin Ala Ser Gin Ser Val Lys Asn 55 60 65
CCC CCC AAA AGC AGT GAA TTT AAC TAC CCT AAG TTC CCT GTG GGT AAA 296 Pro Pro Lys Ser Ser Glu Phe Asn Tyr Pro Lys Phe Pro Val Gly Lys 70 75 80
ACC GAC TAT CTG GCC GTT ATG CAA GGC TTA GGG CTT ACT GTG GGT TAT 344 Thr Asp Tyr Leu Ala Val Met Gin Gly Leu Gly Leu Thr Val Gly Tyr 85 90 95
AAG CAG TTT TTC GGG GAA AAG AGA TGG TTT GGT GCA CGC TAT TAC GGC 392 Lys Gin Phe Phe Gly Glu Lys Arg Trp Phe Gly Ala Arg Tyr Tyr Gly 100 105 110
TTC ATG GAT TAT GGG CAT GCC GTA TTT GGA GCG AAC GCT TTA ACA TCG 440 Phe Met Asp Tyr Gly His Ala Val Phe Gly Ala Asn Ala Leu Thr Ser 115 120 125 130
GAT AAT GGT GGG GTG TGT GAG CTT CAC CAA CCA TGT GCG ACC AAA GTA 488 Asp Asn Gly Gly Val Cys Glu Leu His Gin Pro Cys Ala Thr Lys Val 135 140 145
GGG ACA ATG GGC AAT CTG TCT GAC ATG TTC ACT TAT GGT GTG GGT ATT 536 Gly Thr Met Gly Asn Leu Ser Asp Met Phe Thr Tyr Gly Val Gly He 150 155 160
GAC ACT TTA TAC AAT GTC ATC AAT AAA GAA GAT GCG AGT TTT GGT TTC 584 Asp Thr Leu Tyr Asn Val He Asn Lys Glu Asp Ala Ser Phe Gly Phe 165 170 175
TTT TTT GGG GCT CAA ATC GCG GGT AAC TCT TGG GGT AAT ACG ACA GGG 632 Phe Phe Gly Ala Gin He Ala Gly Asn Ser Trp Gly Asn Thr Thr Gly 180 185 190
GCC TTT TTG GAA ACT AAA AGC CCT TAT AAG CAC ACT TCC TAT AGC CTT 680 Ala Phe Leu Glu Thr Lys Ser Pro Tyr Lys His Thr Ser Tyr Ser Leu 195 200 205 210
GAT CCG GCG ATT TTC CAG TTC CTT TTT AAT TTA GGG ATC CGC ACC CAT 728 Asp Pro Ala He Phe Gin Phe Leu Phe Asn Leu Gly He Arg Thr His 215 220 225
ATT GGC CGG CAT CAA GAA TTT GAC TTT GGC GTG AAG ATT CCC ACT ATC 776 He Gly Arg His Gin Glu Phe Asp Phe Gly Val Lys He Pro Thr He 230 235 240
AAT GTT TAT TAT TTT AAC CAT GGG AAT TTG AGC TTC ACT TAC CGC CGT 824 Asn Val Tyr Tyr Phe Asn His Gly Asn Leu Ser Phe Thr Tyr Arg Arg 245 250 255
CAA TAC AGC CTT TAT GTG GGG TAT CGT TAC AAT TTC TGATTTAAAA CGCTTG 876 Gin Tyr Ser Leu Tyr Val Gly Tyr Arg Tyr Asn Phe 260 265 270
TTTTTCTCTA ATTGAATTTT CAATTAGAGT TTTC 910
(2) INFORMATION FOR SEQ ID NO: 10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 270 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
Met Lys Lys Phe Val Val Phe Lys Thr Leu Cys Leu Ser Val Val Leu
1 5 10 15
Gly Asn Ser Leu Val Ala Ala Glu Gly Ser Thr Glu Val Gin Lys Gin
20 25 30
Leu Glu Lys Pro Lys Glu Tyr Lys Ala Val Lys Gly Glu Lys Asn Ala
35 40 45
Trp Tyr Leu Gly He Ser Tyr Gin Val Gly Gin Ala Ser Gin Ser Val
50 55 60
Lys Asn Pro Pro Lys Ser Ser Glu Phe Asn Tyr Pro Lys Phe Pro Val 65 70 75 80
Gly Lys Thr Asp Tyr Leu Ala Val Met Gin Gly Leu Gly Leu Thr Val
85 90 95
Gly Tyr Lys Gin Phe Phe Gly Glu Lys Arg Trp Phe Gly Ala Arg Tyr
100 105 110
Tyr Gly Phe Met Asp Tyr Gly His Ala Val Phe Gly Ala Asn Ala Leu
115 120 125
Thr Ser Asp Asn Gly Gly Val Cys Glu Leu His Gin Pro Cys Ala Thr
130 135 140
Lys Val Gly Thr Met Gly Asn Leu Ser Asp Met Phe Thr Tyr Gly Val 145 150 155 160
Gly He Asp Thr Leu Tyr Asn Val He Asn Lys Glu Asp Ala Ser Phe
165 170 175
Gly Phe Phe Phe Gly Ala Gin He Ala Gly Asn Ser Trp Gly Asn Thr
180 185 190
Thr Gly Ala Phe Leu Glu Thr Lys Ser Pro Tyr Lys His Thr Ser Tyr
195 200 205
Ser Leu Asp Pro Ala He Phe Gin Phe Leu Phe Asn Leu Gly He Arg
210 215 220
Thr His He Gly Arg His Gin Glu Phe Asp Phe Gly Val Lys He Pro 225 230 235 240
Thr He Asn Val Tyr Tyr Phe Asn His Gly Asn Leu Ser Phe Thr Tyr
245 250 255
Arg Arg Gin Tyr Ser Leu Tyr Val Gly Tyr Arg Tyr Asn Phe 260 265 270
(2) INFORMATION FOR SEQ ID NO: 11: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1357 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic RNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 58...1305 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
CGCAATTAAA AGGAATTTTA ACTAAAATAT TGAGTTTAAA TCCACGATGA GTTTTTA ATG 60
Met
1
CAA TAT AAG AAA AAT AAG AAA AGA TAT TAT TAT TTA GCG TTA GGG ATC 108 Gin Tyr Lys Lys Asn Lys Lys Arg Tyr Tyr Tyr Leu Ala Leu Gly He 5 10 15
TTT TTT TTA AAT GGT CTG TCT TTG AAA GCT TTA GAA ATC GCC GTC AAA 156 Phe Phe Leu Asn Gly Leu Ser Leu Lys Ala Leu Glu He Ala Val Lys 20 25 30
CCT TTT GGC TAT CTG GGG CTA TTA TAT AAT CAA GGG GCG CAA AAA AAC 204 Pro Phe Gly Tyr Leu Gly Leu Leu Tyr Asn Gin Gly Ala Gin Lys Asn 35 40 45
CCT CAC AGC TAT GTG GGG GCT TTA GCG CGT CTT GGG GTG GAT TTT TCT 252 Pro His Ser Tyr Val Gly Ala Leu Ala Arg Leu Gly Val Asp Phe Ser 50 55 60 65
TAT AGC AAC GGG TGG TCC TTT GGT ATT GGA GCG ATT GGG GCT TGG AAT 300 Tyr Ser Asn Gly Trp Ser Phe Gly He Gly Ala He Gly Ala Trp Asn 70 75 80
ATT TAT AAC AAA CAG CGT TTG GCT AAC CTT TAT ATC AGT CTA GGG AAT 348 He Tyr Asn Lys Gin Arg Leu Ala Asn Leu Tyr He Ser Leu Gly Asn 85 90 95
TTT TTT GGT AGT TCT AAA AAT GTT AAA CCT TAT TTG AGC GCT GGC GAT 396 Phe Phe Gly Ser Ser Lys Asn Val Lys Pro Tyr Leu Ser Ala Gly Asp 100 105 110
GTT TCT GAT GCG TAT GTT CAA TAC ACT AAC CAG CGT TTT AAA ATC GCT 444 Val Ser Asp Ala Tyr Val Gin Tyr Thr Asn Gin Arg Phe Lys He Ala 115 120 125
TTA GGG CGT TTC AAT ACC GAT TTT GTG GAT TTT GAT TGG ATA GGG GGC 492 Leu Gly Arg Phe Asn Thr Asp Phe Val Asp Phe Asp Trp He Gly Gly 130 135 140 145
AAT ATT CAA GGG GTT TCT GTA GCT TTT AAG CAA AAT TCC ATG CGT TAT 540 Asn He Gin Gly Val Ser Val Ala Phe Lys Gin Asn Ser Met Arg Tyr 150 155 160
TTT GGG ATT TTT ATG GAT AGC ATG CTT TAT AAT GGG CAT CAA ATC AAC 588 Phe Gly He Phe Met Asp Ser Met Leu Tyr Asn Gly His Gin He Asn 165 170 175
AAA GAG CAA GGG AAT CGG ATC GCT ACT TCC CTA AAC GCT CTA GCG TCT 636 Lys Glu Gin Gly Asn Arg He Ala Thr Ser Leu Asn Ala Leu Ala Ser 180 185 190
TAT GAC CCT GTG TCT AAA CGC TTG TAT GTG GGG GGG GAA GTG TTT GTT 684 Tyr Asp Pro Val Ser Lys Arg Leu Tyr Val Gly Gly Glu Val Phe Val 195 200 205
TTA GGT GCA GAA TAC AGG CAT GAA AAT CTT AAA GTG GTG CCT TTT ATT 732 Leu Gly Ala Glu Tyr Arg His Glu Asn Leu Lys Val Val Pro Phe He 210 215 220 225
TTA ACG GAC ACC CGC TTG CCT TTA TCC ACC CAA AAT GTT TTA GTG CAA 780 Leu Thr Asp Thr Arg Leu Pro Leu Ser Thr Gin Asn Val Leu Val Gin 230 235 240
GTG GGG GGT AAG TTG GAG TAT GAC GCT TCT TTA GCT AAG GGT TTC ACT 828 Val Gly Gly Lys Leu Glu Tyr Asp Ala Ser Leu Ala Lys Gly Phe Thr 245 250 255
TCG CAC ACT CTA GTG CAT GGC ATG TAT CAA TAC GGC AAC ACT GAT GCG 876 Ser His Thr Leu Val His Gly Met Tyr Gin Tyr Gly Asn Thr Asp Ala 260 265 270
GCT ACA AGC GTT AAA AAT GCC GGC TTG TTT TTG ATC GAT CAA ACT TTT 924 Ala Thr Ser Val Lys Asn Ala Gly Leu Phe Leu He Asp Gin Thr Phe 275 280 285
AAA TAC AAA ATT TTT AAT TTT GGA ACG GGT TTT TAT ATC GTT CCG GCA 972 Lys Tyr Lys He Phe Asn Phe Gly Thr Gly Phe Tyr He Val Pro Ala 290 295 300 305
AGA AAC AAT AAG GGC TAT CTA TGG ACT TTT AAT GAC AGG ACT AAA TTC 1020 Arg Asn Asn Lys Gly Tyr Leu Trp Thr Phe Asn Asp Arg Thr Lys Phe 310 315 320
TAT GGC CGT GGG ATC AAT GCG CCC GGC GTG CCA GCG ATT TAT TTT GCA 1068 Tyr Gly Arg Gly He Asn Ala Pro Gly Val Pro Ala He Tyr Phe Ala 325 330 335
AAC TCT AGC ATT TCA GGC TAT GTT TTT TTA GGG CTT AAG ACT AAA AGG 1116 Asn Ser Ser He Ser Gly Tyr Val Phe Leu Gly Leu Lys Thr Lys Arg 340 345 350 GTG CGT TTA GAC GCG ATG GTG GCT TTT GGG GAT TAC CAA GAA TAT TCT 1164 Val Arg Leu Asp Ala Met Val Ala Phe Gly Asp Tyr Gin Glu Tyr Ser 355 360 365
TTA ATG AGC AGT TTT AGG GTT TGG ACT TAT AGG AGT TTG TCT TTT GAT 1212 Leu Met Ser Ser Phe Arg Val Trp Thr Tyr Arg Ser Leu Ser Phe Asp 370 375 380 385
ATG GGT GGG GGG TAT GTG TAT GCT TAC AAT TCT AAA GCC ACG AGA AAA 1260 Met Gly Gly Gly Tyr Val Tyr Ala Tyr Asn Ser Lys Ala Thr Arg Lys 390 395 400
AGT CTT GGA AAT AGT TCT TTT GTC TTT TTT GGG AAG TTT TTG TTT TAAAA 1310 Ser Leu Gly Asn Ser Ser Phe Val Phe Phe Gly Lys Phe Leu Phe 405 410 415
AATACCATTT CTACAATCAA TAGTGAAGAG TTTGCAATAA AGTAAGC 1357
(2) INFORMATION FOR SEQ ID NO: 12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 416 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS : single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 :
Met Gin Tyr Lys Lys Asn Lys Lys Arg Tyr Tyr Tyr Leu Ala Leu Gly
1 5 10 15
He Phe Phe Leu Asn Gly Leu Ser Leu Lys Ala Leu Glu He Ala Val
20 25 30
Lys Pro Phe Gly Tyr Leu Gly Leu Leu Tyr Asn Gin Gly Ala Gin Lys
35 40 45
Asn Pro His Ser Tyr Val Gly Ala Leu Ala Arg Leu Gly Val Asp Phe
50 55 60
Ser Tyr Ser Asn Gly Trp Ser Phe Gly He Gly Ala He Gly Ala Trp 65 70 75 80
Asn He Tyr Asn Lys Gin Arg Leu Ala Asn Leu Tyr He Ser Leu Gly
85 90 95
Asn Phe Phe Gly Ser Ser Lys Asn Val Lys Pro Tyr Leu Ser Ala Gly
100 105 110
Asp Val Ser Asp Ala Tyr Val Gin Tyr Thr Asn Gin Arg Phe Lys He
115 120 125
Ala Leu Gly Arg Phe Asn Thr Asp Phe Val Asp Phe Asp Trp He Gly
130 135 140
Gly Asn He Gin Gly Val Ser Val Ala Phe Lys Gin Asn Ser Met Arg 145 150 155 160
Tyr Phe Gly He Phe Met Asp Ser Met Leu Tyr Asn Gly His Gin He
165 170 175
Asn Lys Glu Gin Gly Asn Arg He Ala Thr Ser Leu Asn Ala Leu Ala 180 185 190 Ser Tyr Asp Pro Val Ser Lys Arg Leu Tyr Val Gly Gly Glu Val Phe
195 200 205
Val Leu Gly Ala Glu Tyr Arg His Glu Asn Leu Lys Val Val Pro Phe
210 215 220
He Leu Thr Asp Thr Arg Leu Pro Leu Ser Thr Gin Asn Val Leu Val 225 230 235 240
Gin Val Gly Gly Lys Leu Glu Tyr Asp Ala Ser Leu Ala Lys Gly Phe
245 250 255
Thr Ser His Thr Leu Val His Gly Met Tyr Gin Tyr Gly Asn Thr Asp
260 265 270
Ala Ala Thr Ser Val Lys Asn Ala Gly Leu Phe Leu He Asp Gin Thr
275 280 285
Phe Lys Tyr Lys He Phe Asn Phe Gly Thr Gly Phe Tyr He Val Pro
290 295 300
Ala Arg Asn Asn Lys Gly Tyr Leu Trp Thr Phe Asn Asp Arg Thr Lys 305 310 315 320
Phe Tyr Gly Arg Gly He Asn Ala Pro Gly Val Pro Ala He Tyr Phe
325 330 335
Ala Asn Ser Ser He Ser Gly Tyr Val Phe Leu Gly Leu Lys Thr Lys
340 345 350
Arg Val Arg Leu Asp Ala Met Val Ala Phe Gly Asp Tyr Gin Glu Tyr
355 360 365
Ser Leu Met Ser Ser Phe Arg Val Trp Thr Tyr Arg Ser Leu Ser Phe
370 375 380
Asp Met Gly Gly Gly Tyr Val Tyr Ala Tyr Asn Ser Lys Ala Thr Arg 385 390 395 400
Lys Ser Leu Gly Asn Ser Ser Phe Val Phe Phe Gly Lys Phe Leu Phe 405 410 415
(2) INFORMATION FOR SEQ ID NO: 13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1562 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 73...1509 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
TTATGCTTCT TTGTTTTTAG ATCAGTTAAG AATTGTAGTC TTTAAGATGT ATTGGCTATT 60 AAAAGGAAAA AA ATG AAA AAT AGC ACG CCT TTA AAG AAT CAA GTT TTT TGT 111 Met Lys Asn Ser Thr Pro Leu Lys Asn Gin Val Phe Cys 1 5 10
GGG TTA TAT GTT TTA AGT TTG AGC GCT TCT TTG CAA GCG TTT GAT TAT 159 Gly Leu Tyr Val Leu Ser Leu Ser Ala Ser Leu Gin Ala Phe Asp Tyr 15 20 25
AAA ATT GAA GTT TCA GCG GAG TCC TTT TCT AAA GTT GGC TTT AAT AAA 207 Lys He Glu Val Ser Ala Glu Ser Phe Ser Lys Val Gly Phe Asn Lys 30 35 40 45
AAA AAG ATT GAT ATA GCT AGG GGG ATT TAT CCT ACA GAG ACT TTT GTA 255 Lys Lys He Asp He Ala Arg Gly He Tyr Pro Thr Glu Thr Phe Val 50 55 60
ACC GCT GTA GGG CAG GGC AAT ATC TAT GCG GAT TTT TTA CCC AAA GGC 303 Thr Ala Val Gly Gin Gly Asn He Tyr Ala Asp Phe Leu Pro Lys Gly 65 70 75
CTT AAA GAT CAA GGG CAT GTT TTA GAG GGA AAA ATC GGT GGC ACG CTA 351 Leu Lys Asp Gin Gly His Val Leu Glu Gly Lys He Gly Gly Thr Leu 80 85 90
GGA GGG GTC GCT TAT GAT AGC ACG AAA TTC AAT CAA GGC GGA TCG GTT 399 Gly Gly Val Ala Tyr Asp Ser Thr Lys Phe Asn Gin Gly Gly Ser Val 95 100 105
ATT TAT AAC TAC ATC GGT TAT TGG GAT GGC TAT TTA GGG GGT AAA AGA 447 He Tyr Asn Tyr He Gly Tyr Trp Asp Gly Tyr Leu Gly Gly Lys Arg 110 115 120 125
GCC TTG CTT GAT GGC ACG AGT ATC CAT GAG TGC GCG CTT GGA TCT GAT 495 Ala Leu Leu Asp Gly Thr Ser He His Glu Cys Ala Leu Gly Ser Asp 130 135 140
GGC AAG GTG ATT GAT TCT ATA GCG TGC GGG AAC GCT AGG GCC AAT AAA 543 Gly Lys Val He Asp Ser He Ala Cys Gly Asn Ala Arg Ala Asn Lys 145 150 155
ATC CGC CGT AAT TAC TTG ATG AAT AAC GCT TTT TTA GAA TAC CGC TAT 591 He Arg Arg Asn Tyr Leu Met Asn Asn Ala Phe Leu Glu Tyr Arg Tyr 160 165 170
AAA GAT ATT TTT TTA GCT AAG GGA GGG CGT TAT CAA TCC AAT GCT CCT 639 Lys Asp He Phe Leu Ala Lys Gly Gly Arg Tyr Gin Ser Asn Ala Pro 175 180 185
TAT ATG AGC GGT TAC ACG CAA GGC TTT GAA ATC AGC GCT AAA GTC AAG 687 Tyr Met Ser Gly Tyr Thr Gin Gly Phe Glu He Ser Ala Lys Val Lys 190 195 200 205
GAT AAA AAT GAA GGA ATC CAC AAA TTA TGG TGG TTT AGC TCA TGG GGT 735 Asp Lys Asn Glu Gly He His Lys Leu Trp Trp Phe Ser Ser Trp Gly 210 215 220
AGG GCG TTC GCT TAT GGG GAG TGG ATT TAT GAT TTT TAT TCT CCA AGA 783 Arg Ala Phe Ala Tyr Gly Glu Trp He Tyr Asp Phe Tyr Ser Pro Arg 225 230 235 ACC GTG GTT AAA AAC GGG CGC ACT TTG AAT TAT GGT ATC CAT TTA GTG 831 Thr Val Val Lys Asn Gly Arg Thr Leu Asn Tyr Gly He His Leu Val 240 245 250
AAT TAT ACT TAT GAA AGA AAA GGG GTT AGC GTT AGC CCT TTT TTC CAA 879 Asn Tyr Thr Tyr Glu Arg Lys Gly Val Ser Val Ser Pro Phe Phe Gin 255 260 265
TTT TCG CCT GGG ACT TAT TAT AGC CCT GGG GTG GTT GTA GGC TAT GAT 927 Phe Ser Pro Gly Thr Tyr Tyr Ser Pro Gly Val Val Val Gly Tyr Asp 270 275 280 285
AGT AAC CCT AAT TTT AAC GGC GTT GGC TTT AGA TCC GAA ACA AAA GCT 975 Ser Asn Pro Asn Phe Asn Gly Val Gly Phe Arg Ser Glu Thr Lys Ala 290 295 300
TAT ATT TTG CTC CCT GTC CAT GAC CCC TTA AGA AGG GAT ACT TAT CGT 1023 Tyr He Leu Leu Pro Val His Asp Pro Leu Arg Arg Asp Thr Tyr Arg 305 310 315
TAC GCT ATA AAG GCT GGC ACT GCC GGG CAA AGC TTG CTC ATT AGG CAA 1071 Tyr Ala He Lys Ala Gly Thr Ala Gly Gin Ser Leu Leu He Arg Gin 320 325 330
CGA TTT GAT TAC AAT GAA TTT AAT TTT GGG GGA GCG TTT TAT AAA GTA 1119 Arg Phe Asp Tyr Asn Glu Phe Asn Phe Gly Gly Ala Phe Tyr Lys Val 335 340 345
TGG AAA AAC GCA AAC GCT TAC ATC GGC ACG ACA GGA AAC CCT TTA GGC 1167 Trp Lys Asn Ala Asn Ala Tyr He Gly Thr Thr Gly Asn Pro Leu Gly 350 355 360 365
ATT GAT TTT TGG ACC AAT AGC GTT TAT GAT ATA GGG CAA GCT TTA AGC 1215 He Asp Phe Trp Thr Asn Ser Val Tyr Asp He Gly Gin Ala Leu Ser 370 375 380
CAT GTG GTA ACC GCT GAT GCC GTC TCT GGT TGG GTT TTT GGT GGG GGC 1263 His Val Val Thr Ala Asp Ala Val Ser Gly Trp Val Phe Gly Gly Gly 385 390 395
GTG CAT AAA AAG TGG CTG TGG GGG ACT TTA TGG CGT TGG ACT AGC GGC 1311 Val His Lys Lys Trp Leu Trp Gly Thr Leu Trp Arg Trp Thr Ser Gly 400 405 410
ACT TTA GCC AAT GAA GCG AGT GCG GCT GTT AAT GTG GGC TAT AAG ATC 1359 Thr Leu Ala Asn Glu Ala Ser Ala Ala Val Asn Val Gly Tyr Lys He 415 420 425
AGT AAG AGT TTG ACA GCG AGC GTG AAA TTA GAA TAT TTG GGC GTG ATG 1407 Ser Lys Ser Leu Thr Ala Ser Val Lys Leu Glu Tyr Leu Gly Val Met 430 435 440 445
ACG CAT GCA GGC TTT ACG GTA GGG AGT TAC AGG CCC ACG CCC GGC TCT 1455 Thr His Ala Gly Phe Thr Val Gly Ser Tyr Arg Pro Thr Pro Gly Ser 450 455 460 AAA GCG CTT TAT TCA GAC AGG AGT CAT TTG ATG ACA ACT CTT AGC GCT 1503 Lys Ala Leu Tyr Ser Asp Arg Ser His Leu Met Thr Thr Leu Ser Ala 465 470 475
AAA TTC TAACCAATCG CTTTAAGCTG TTTATTAAAG CGTTAAAAAT CCCTTAATAA AA 1561 Lys Phe
1562
(2) INFORMATION FOR SEQ ID NO : 14 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 479 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 :
Met Lys Asn Ser Thr Pro Leu Lys Asn Gin Val Phe Cys Gly Leu Tyr
1 5 10 15
Val Leu Ser Leu Ser Ala Ser Leu Gin Ala Phe Asp Tyr Lys He Glu
20 25 30
Val Ser Ala Glu Ser Phe Ser Lys Val Gly Phe Asn Lys Lys Lys He
35 40 45
Asp He Ala Arg Gly He Tyr Pro Thr Glu Thr Phe Val Thr Ala Val
50 55 60
Gly Gin Gly Asn He Tyr Ala Asp Phe Leu Pro Lys Gly Leu Lys Asp 65 70 75 80
Gin Gly His Val Leu Glu Gly Lys He Gly Gly Thr Leu Gly Gly Val
85 90 95
Ala Tyr Asp Ser Thr Lys Phe Asn Gin Gly Gly Ser Val He Tyr Asn
100 105 110
Tyr He Gly Tyr Trp Asp Gly Tyr Leu Gly Gly Lys Arg Ala Leu Leu
115 120 125
Asp Gly Thr Ser He His Glu Cys Ala Leu Gly Ser Asp Gly Lys Val
130 135 140
He Asp Ser He Ala Cys Gly Asn Ala Arg Ala Asn Lys He Arg Arg 145 150 155 160
Asn Tyr Leu Met Asn Asn Ala Phe Leu Glu Tyr Arg Tyr Lys Asp He
165 170 175
Phe Leu Ala Lys Gly Gly Arg Tyr Gin Ser Asn Ala Pro Tyr Met Ser
180 185 190
Gly Tyr Thr Gin Gly Phe Glu He Ser Ala Lys Val Lys Asp Lys Asn
195 200 205
Glu Gly He His Lys Leu Trp Trp Phe Ser Ser Trp Gly Arg Ala Phe
210 215 220
Ala Tyr Gly Glu Trp He Tyr Asp Phe Tyr Ser Pro Arg Thr Val Val 225 230 235 240
Lys Asn Gly Arg Thr Leu Asn Tyr Gly He His Leu Val Asn Tyr Thr 245 250 255 Tyr Glu Arg Lys Gly Val Ser Val Ser Pro Phe Phe Gin Phe Ser Pro
260 265 270
Gly Thr Tyr Tyr Ser Pro Gly Val Val Val Gly Tyr Asp Ser Asn Pro
275 280 285
Asn Phe Asn Gly Val Gly Phe Arg Ser Glu Thr Lys Ala Tyr He Leu
290 295 300
Leu Pro Val His Asp Pro Leu Arg Arg Asp Thr Tyr Arg Tyr Ala He 305 310 315 320
Lys Ala Gly Thr Ala Gly Gin Ser Leu Leu He Arg Gin Arg Phe Asp
325 330 335
Tyr Asn Glu Phe Asn Phe Gly Gly Ala Phe Tyr Lys Val Trp Lys Asn
340 345 350
Ala Asn Ala Tyr He Gly Thr Thr Gly Asn Pro Leu Gly He Asp Phe
355 360 365
Trp Thr Asn Ser Val Tyr Asp He Gly Gin Ala Leu Ser His Val Val
370 375 380
Thr Ala Asp Ala Val Ser Gly Trp Val Phe Gly Gly Gly Val His Lys 385 390 395 400
Lys Trp Leu Trp Gly Thr Leu Trp Arg Trp Thr Ser Gly Thr Leu Ala
405 410 415
Asn Glu Ala Ser Ala Ala Val Asn Val Gly Tyr Lys He Ser Lys Ser
420 425 430
Leu Thr Ala Ser Val Lys Leu Glu Tyr Leu Gly Val Met Thr His Ala
435 440 445
Gly Phe Thr Val Gly Ser Tyr Arg Pro Thr Pro Gly Ser Lys Ala Leu
450 455 460
Tyr Ser Asp Arg Ser His Leu Met Thr Thr Leu Ser Ala Lys Phe 465 470 475
(2) INFORMATION FOR SEQ ID NO: 15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 810 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 98...757 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 15 :
CTAATATTTA TTTTAAACTT TGTTATTATT TAAGGTGTGA TTTGATTTTA GTCTGTATGG 60 GGCAAGTGTG GGGCAGGATA ACATAAGGAA TTGGGTT ATG AAT AAA ACA ACG GTT 115
Met Asn Lys Thr Thr Val 1 5
AAA ATA TTA ATG GGC ATG GCG TTA TTA TCA TCG CTT CAA GCC GCA GAG 163 Lys He Leu Met Gly Met Ala Leu Leu Ser Ser Leu Gin Ala Ala Glu 10 15 20
GCA GAG CTT GAT GAA AAA TCA AAA AAA CCT AAA TTT GCG GAC AGG AAT 211 Ala Glu Leu Asp Glu Lys Ser Lys Lys Pro Lys Phe Ala Asp Arg Asn 25 30 35
ACA TTT TAT TTA GGG GTT GGG TAT CAA CTT AGT GCG ATC AAC ACA TCT 259 Thr Phe Tyr Leu Gly Val Gly Tyr Gin Leu Ser Ala He Asn Thr Ser 40 45 50
TTT AGC ACC GAG TCT GTA GAT AAA TCG TAT TTT ATG ACC GGC AAT GGC 307 Phe Ser Thr Glu Ser Val Asp Lys Ser Tyr Phe Met Thr Gly Asn Gly 55 60 65 70
TTT GGT GTG GTG TTA GGG GGG AAA TTT GTG GCT AAA ACG CAA GCT GTA 355 Phe Gly Val Val Leu Gly Gly Lys Phe Val Ala Lys Thr Gin Ala Val 75 80 85
GAG CAT GTG GGT TTC CGT TAC GGG TTG TTT TAT GAT CAG ACC TTT TCT 403 Glu His Val Gly Phe Arg Tyr Gly Leu Phe Tyr Asp Gin Thr Phe Ser 90 95 100
TCT CAC AAA TCC TAT ATT TCT ACC TAT GGT TTA GAA TTT AGC GGT TTG 451 Ser His Lys Ser Tyr He Ser Thr Tyr Gly Leu Glu Phe Ser Gly Leu 105 110 115
TGG GAC GCT TTC AAT TCG CCA AAG ATG TTT TTA GGG TTA GAG TTT GGC 499 Trp Asp Ala Phe Asn Ser Pro Lys Met Phe Leu Gly Leu Glu Phe Gly 120 125 130
TTA GGC ATC GCT GGG GCG ACT TAT ATG CCA GGA GGG GCT ATG CAT GGG 547 Leu Gly He Ala Gly Ala Thr Tyr Met Pro Gly Gly Ala Met His Gly 135 140 145 150
ATT ATC GCT CAA AAT TTA GGC AAA GAA AAT TCG CTT TTC CAA TTG CTT 595 He He Ala Gin Asn Leu Gly Lys Glu Asn Ser Leu Phe Gin Leu Leu 155 160 165
GTG AAA GTG GGT TTT CGT TTT GGC TTT TTG CAC AAT GAA ATC ACT TTC 643 Val Lys Val Gly Phe Arg Phe Gly Phe Leu His Asn Glu He Thr Phe 170 175 180
GGG TTG AAA TTC CCT GTC ATT CCT AAC AAA AGA ACG GAA ATC ATT GAT 691 Gly Leu Lys Phe Pro Val He Pro Asn Lys Arg Thr Glu He He Asp 185 190 195
GGC TTG AGC ACG ACT ACT TTA TGG CAC CGC TTA CCG GTA GCT TAT TTC 739 Gly Leu Ser Thr Thr Thr Leu Trp His Arg Leu Pro Val Ala Tyr Phe 200 205 210
AAT TAT ATC TAT AAT TTT TAGATATGGT TATTTAGAGG TTTTAGATTT GACAAAAT 795 Asn Tyr He Tyr Asn Phe 215 220 CAATCAACTC TCGTG 810
(2) INFORMATION FOR SEQ ID NO : 16 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 220 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
Met Asn Lys Thr Thr Val Lys He Leu Met Gly Met Ala Leu Leu Ser
1 5 10 15
Ser Leu Gin Ala Ala Glu Ala Glu Leu Asp Glu Lys Ser Lys Lys Pro
20 25 30
Lys Phe Ala Asp Arg Asn Thr Phe Tyr Leu Gly Val Gly Tyr Gin Leu
35 40 45
Ser Ala He Asn Thr Ser Phe Ser Thr Glu Ser Val Asp Lys Ser Tyr
50 55 60
Phe Met Thr Gly Asn Gly Phe Gly Val Val Leu Gly Gly Lys Phe Val 65 70 75 80
Ala Lys Thr Gin Ala Val Glu His Val Gly Phe Arg Tyr Gly Leu Phe
85 90 r 95
Tyr Asp Gin Thr Phe Ser Ser His Lys Ser Tyr He Ser Thr Tyr Gly
100 105 110
Leu Glu Phe Ser Gly Leu Trp Asp Ala Phe Asn Ser Pro Lys Met Phe
115 120 125
Leu Gly Leu Glu Phe Gly Leu Gly He Ala Gly Ala Thr Tyr Met Pro
130 135 140
Gly Gly Ala Met His Gly He He Ala Gin Asn Leu Gly Lys Glu Asn 145 150 155 160
Ser Leu Phe Gin Leu Leu Val Lys Val Gly Phe Arg Phe Gly Phe Leu
165 170 175
His Asn Glu He Thr Phe Gly Leu Lys Phe Pro Val He Pro Asn Lys
180 185 190
Arg Thr Glu He He Asp Gly Leu Ser Thr Thr Thr Leu Trp His Arg
195 200 205
Leu Pro Val Ala Tyr Phe Asn Tyr He Tyr Asn Phe 210 215 220
(2) INFORMATION FOR SEQ ID NO : 17 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1516 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1463 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
TTTATCTTTA AAAGTATTTG CATTTATCAA TCTCATTTTA GGAGGCATGC ATG AAA 56
Met Lys 1
AAG GCA AGT CAG GTT TTA TTC TTT GGG GCA TTT TTA AGC TCT TCT TTG 104 Lys Ala Ser Gin Val Leu Phe Phe Gly Ala Phe Leu Ser Ser Ser Leu 5 10 15
CAA GGT TTT GAA GCT AAG CTC AAC GGC TTT GTG GAT CAA TCC AGC ACT 152 Gin Gly Phe Glu Ala Lys Leu Asn Gly Phe Val Asp Gin Ser Ser Thr 20 25 30
ATC GGT TTT AAC CAG CAT AAA ATC AAT AAA GAA AGA GGT ATC TAC CCT 200 He Gly Phe Asn Gin His Lys He Asn Lys Glu Arg Gly He Tyr Pro 35 40 45 50
ATG CAG CAA TTC GCA ACG ATT GCG GGC TAT TTA GGG CTT GGT TTT AGC 248 Met Gin Gin Phe Ala Thr He Ala Gly Tyr Leu Gly Leu Gly Phe Ser 55 60 65
CTG TTA CCC AAA AAG GTT TCA GAC CAT GTT CTA AAA GGC AAA ATA GGA 296 Leu Leu Pro Lys Lys Val Ser Asp His Val Leu Lys Gly Lys He Gly 70 75 80
GGC ATG GTG GGA TCT ATT TTC TAT GAT GGC ACG AAG AAG TTT GAA GAC 344 Gly Met Val Gly Ser He Phe Tyr Asp Gly Thr Lys Lys Phe Glu Asp 85 90 95
AGC TCT GTA GCT TAC AAC CTC TTT GGT TAT TAT GAT GGG TTC ATG GGG 392 Ser Ser Val Ala Tyr Asn Leu Phe Gly Tyr Tyr Asp Gly Phe Met Gly 100 105 110
GGT TAT ACA AAC ATC TTA CAA AGC GAT GAT TTA GCG ACA CAA AAC ATG 440 Gly Tyr Thr Asn He Leu Gin Ser Asp Asp Leu Ala Thr Gin Asn Met 115 120 125 130
AAA TAC AAT AAA AAT GTC CGC AAC TAT GTC TTT AGC GAC GCG TAT TTA 488 Lys Tyr Asn Lys Asn Val Arg Asn Tyr Val Phe Ser Asp Ala Tyr Leu 135 140 145
GAA TAC GCT TAT AAG AAT TAT TTT GAA ATA AAA GCC GGG CGC TAT TTA 536 Glu Tyr Ala Tyr Lys Asn Tyr Phe Glu He Lys Ala Gly Arg Tyr Leu 150 155 160
TCC ACT ATG CCT TAT AAA AGC GGT CAA ACG CAA GGC TTT CAA ATT TCT 584 Ser Thr Met Pro Tyr Lys Ser Gly Gin Thr Gin Gly Phe Gin He Ser 165 170 175
GGG CAA TAC AAG AAA GCG CGC TTG ACT TGG TTT AGC TCT TTT GGG AGG 632 Gly Gin Tyr Lys Lys Ala Arg Leu Thr Trp Phe Ser Ser Phe Gly Arg 180 185 190
GCG TTC GCT TAC GGC TCG TTT TTG ATG GAT TGG TTT GCC GCT AGG ACC 680 Ala Phe Ala Tyr Gly Ser Phe Leu Met Asp Trp Phe Ala Ala Arg Thr 195 200 205 210
ACT TAT AGC GGA GGT TTT ACC AAA AAC GAT AAG GGA GGT TAT GAT AGC 728 Thr Tyr Ser Gly Gly Phe Thr Lys Asn Asp Lys Gly Gly Tyr Asp Ser 215 220 225
CAT GGG CGA AAG GTG CTT TAT GGC ACG CAT GCG GTG CAA CTC ACC TAT 776 His Gly Arg Lys Val Leu Tyr Gly Thr His Ala Val Gin Leu Thr Tyr 230 235 240
AAA CCT CAT CGT TTC CTC ATA GAA GGC TTT TAT TAC CTT TCG CCT CAA 824 Lys Pro His Arg Phe Leu He Glu Gly Phe Tyr Tyr Leu Ser Pro Gin 245 250 255
ATC TTT AAC GCT CCG GGC GTT AAG ATT GGT TGG GAT TCT AAC CCT AAT 872 He Phe Asn Ala Pro Gly Val Lys He Gly Trp Asp Ser Asn Pro Asn 260 265 270
TTT AGC GGC ACA GGC TTT CGC TCT GAT ACA GCT ATC ATA GGG TTT TTC 920 Phe Ser Gly Thr Gly Phe Arg Ser Asp Thr Ala He He Gly Phe Phe 275 280 285 290
CCC ATT TAC TAC CCT TGG ATG ATC GTT AAA TCC AAT GGA AGC CCG GTC 968 Pro He Tyr Tyr Pro Trp Met He Val Lys Ser Asn Gly Ser Pro Val 295 300 305
TAT AAA TAC GAC ACG CCT GCC ACT CAA AAT GGG CAA AAC CTC ATT ATC 1016 Tyr Lys Tyr Asp Thr Pro Ala Thr Gin Asn Gly Gin Asn Leu He He 310 315 320
CTC CAA CGC TTT GAC ATC AAC AAT TAC AAT GTT TCC ATC GCT TTT TAT 1064 Leu Gin Arg Phe Asp He Asn Asn Tyr Asn Val Ser He Ala Phe Tyr 325 330 335
AAA GTC TTT CAA AAC GCT AAT GGT TGG ATA GGC AAC ATG GGG AAT CCA 1112 Lys Val Phe Gin Asn Ala Asn Gly Trp He Gly Asn Met Gly Asn Pro 340 345 350
AGC GGT GTG ATC ATG GGG AGT AAC AGC GTC TAT GCG GGT TTT ACA GGC 1160 Ser Gly Val He Met Gly Ser Asn Ser Val Tyr Ala Gly Phe Thr Gly 355 360 365 370
ACA GCC CTT AAA AGA GAT GCC GCT ACC ATT TTC CTT TCT TGT GGC GGC 1208 Thr Ala Leu Lys Arg Asp Ala Ala Thr He Phe Leu Ser Cys Gly Gly 375 380 385
ACT CAT TTT GCC AAA AAA TTC ACA TGG AAA TTC GCC ACG CAA TAC TCC 1256 Thr His Phe Ala Lys Lys Phe Thr Trp Lys Phe Ala Thr Gin Tyr Ser 390 395 400
AAT TCA GTG GTT TCT TGG GAA GCG AGA GCG ATG ATC TCT TTA GGT TAT 1304 Asn Ser Val Val Ser Trp Glu Ala Arg Ala Met He Ser Leu Gly Tyr 405 410 415
AAA TTC ACT GAA TAC TTG AGC GGT AGC GTG GAT CTT GCA TAT TAT GGC 1352 Lys Phe Thr Glu Tyr Leu Ser Gly Ser Val Asp Leu Ala Tyr Tyr Gly 420 425 430
GTG TAT ACT AAC AAA GGA TTT AAA CCG GGT GAA AAC GGG CCT GTG CCT 1400 Val Tyr Thr Asn Lys Gly Phe Lys Pro Gly Glu Asn Gly Pro Val Pro 435 440 445 450
AAA GAC TTC CCC GCC CTT TAT TCT GAC AGG AGC GCG TTA TAC ACG GCT 1448 Lys Asp Phe Pro Ala Leu Tyr Ser Asp Arg Ser Ala Leu Tyr Thr Ala 455 460 465
CTA GTA GCA TCT TTT TGATGCTACC CTATGATTAT GGTGGGCGTC TTTTGATGCT G 1504 Leu Val Ala Ser Phe 470
TTTCTCTAGT CT 1516
(2) INFORMATION FOR SEQ ID NO: 18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 471 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
Met Lys Lys Ala Ser Gin Val Leu Phe Phe Gly Ala Phe Leu Ser Ser
1 5 10 15
Ser Leu Gin Gly Phe Glu Ala Lys Leu Asn Gly Phe Val Asp Gin Ser
20 25 30
Ser Thr He Gly Phe Asn Gin His Lys He Asn Lys Glu Arg Gly He
35 40 45
Tyr Pro Met Gin Gin Phe Ala Thr He Ala Gly Tyr Leu Gly Leu Gly
50 55 60
Phe Ser Leu Leu Pro Lys Lys Val Ser Asp His Val Leu Lys Gly Lys 65 70 75 80
He Gly Gly Met Val Gly Ser He Phe Tyr Asp Gly Thr Lys Lys Phe
85 90 95
Glu Asp Ser Ser Val Ala Tyr Asn Leu Phe Gly Tyr Tyr Asp Gly Phe
100 105 110
Met Gly Gly Tyr Thr Asn He Leu Gin Ser Asp Asp Leu Ala Thr Gin
115 120 125
Asn Met Lys Tyr Asn Lys Asn Val Arg Asn Tyr Val Phe Ser Asp Ala 130 135 140
Tyr Leu Glu Tyr Ala Tyr Lys Asn Tyr Phe Glu He Lys Ala Gly Arg 145 150 155 160
Tyr Leu Ser Thr Met Pro Tyr Lys Ser Gly Gin Thr Gin Gly Phe Gin
165 170 175
He Ser Gly Gin Tyr Lys Lys Ala Arg Leu Thr Trp Phe Ser Ser Phe
180 185 190
Gly Arg Ala Phe Ala Tyr Gly Ser Phe Leu Met Asp Trp Phe Ala Ala
195 200 205
Arg Thr Thr Tyr Ser Gly Gly Phe Thr Lys Asn Asp Lys Gly Gly Tyr
210 215 220
Asp Ser His Gly Arg Lys Val Leu Tyr Gly Thr His Ala Val Gin Leu 225 230 235 240
Thr Tyr Lys Pro His Arg Phe Leu He Glu Gly Phe Tyr Tyr Leu Ser
245 250 255
Pro Gin He Phe Asn Ala Pro Gly Val Lys He Gly Trp Asp Ser Asn
260 265 270
Pro Asn Phe Ser Gly Thr Gly Phe Arg Ser Asp Thr Ala He He Gly
275 280 285
Phe Phe Pro He Tyr Tyr Pro Trp Met He Val Lys Ser Asn Gly Ser
290 295 300
Pro Val Tyr Lys Tyr Asp Thr Pro Ala Thr Gin Asn Gly Gin Asn Leu 305 310 315 320
He He Leu Gin Arg Phe Asp He Asn Asn Tyr Asn Val Ser He Ala
325 330 335
Phe Tyr Lys Val Phe Gin Asn Ala Asn Gly Trp He Gly Asn Met Gly
340 345 350
Asn Pro Ser Gly Val He Met Gly Ser Asn Ser Val Tyr Ala Gly Phe
355 360 365
Thr Gly Thr Ala Leu Lys Arg Asp Ala Ala Thr He Phe Leu Ser Cys
370 375 380
Gly Gly Thr His Phe Ala Lys Lys Phe Thr Trp Lys Phe Ala Thr Gin 385 390 395 400
Tyr Ser Asn Ser Val Val Ser Trp Glu Ala Arg Ala Met He Ser Leu
405 410 415
Gly Tyr Lys Phe Thr Glu Tyr Leu Ser Gly Ser Val Asp Leu Ala Tyr
420 425 430
Tyr Gly Val Tyr Thr Asn Lys Gly Phe Lys Pro Gly Glu Asn Gly Pro
435 440 445
Val Pro Lys Asp Phe Pro Ala Leu Tyr Ser Asp Arg Ser Ala Leu Tyr
450 455 460
Thr Ala Leu Val Ala Ser Phe 465 470
(2) INFORMATION FOR SEQ ID NO: 19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 377 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 87...323 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
AAATTTATGT TATAATTAAA CGCATTGTAA ATAAATTCTC ATTTTGATAC ATTTTTACAA 60 TAAAACATTA CTTTAAGGAA CATCTT ATG AAA AAA ACG AAA AAA ACG ATT CTG 113
Met Lys Lys Thr Lys Lys Thr He Leu 1 5
CTT TCT CTA ACT CTC GCG GCG TCA TTG CTC CAT GCT GAA GAC AAC GGC 161 Leu Ser Leu Thr Leu Ala Ala Ser Leu Leu His Ala Glu Asp Asn Gly 10 15 20 25
GTT TTT TTA AGC GTG GGT TAT CAA ATC GGT GAA GCG GTT CAA AAA GTG 209 Val Phe Leu Ser Val Gly Tyr Gin He Gly Glu Ala Val Gin Lys Val 30 35 40
AAA AAC GCC GAC AAG GTG CAA AAA CTT TCA GAC ACT TAT GAA CAA TTA 257 Lys Asn Ala Asp Lys Val Gin Lys Leu Ser Asp Thr Tyr Glu Gin Leu 45 50 55
AGC CGG CTT TTA ACC AAC GAT AAT GGC ACA AAC TCA AAG ACA AGC GCG 305 Ser Arg Leu Leu Thr Asn Asp Asn Gly Thr Asn Ser Lys Thr Ser Ala 60 65 70
CAA NAT CAA CCA AGC GGT TAATAATTTG AACGAACGCG CAAAAACTTT AGCCGGTG 361 Gin Xaa Gin Pro Ser Gly 75
GGACAACCAA TTCCCC 377
(2) INFORMATION FOR SEQ ID NO: 20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 79 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:
Met Lys Lys Thr Lys Lys Thr He Leu Leu Ser Leu Thr Leu Ala Ala
1 5 10 15
Ser Leu Leu His Ala Glu Asp Asn Gly Val Phe Leu Ser Val Gly Tyr
20 25 30
Gin He Gly Glu Ala Val Gin Lys Val Lys Asn Ala Asp Lys Val Gin 35 40 45 Lys Leu Ser Asp Thr Tyr Glu Gin Leu Ser Arg Leu Leu Thr Asn Asp
50 55 60
Asn Gly Thr Asn Ser Lys Thr Ser Ala Gin Xaa Gin Pro Ser Gly 65 70 75
(2) INFORMATION FOR SEQ ID NO: 21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2169 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 60...2039 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: CCCTATCATA GGGCGTGGCA TGAAGAAAAA AGCAAAAGTC TTTTGGTATT GTTTTAATC 59
ATG ATT TAT TGG TTG TAT TTG GCG GTC TTT TTT TTG TTG AGC GCA TTA 107 Met He Tyr Trp Leu Tyr Leu Ala Val Phe Phe Leu Leu Ser Ala Leu 1 5 10 15
GAC GCT AAA GAA ATC GCT ATG CAA CGA TTT GAC AAA CAA AAC CAT AAG 155 Asp Ala Lys Glu He Ala Met Gin Arg Phe Asp Lys Gin Asn His Lys 20 25 30
ATT TTT GAA ATC CTT GCG GAT AAA GTG AGC GCT AAA GAC AAT GTG ATA 203 He Phe Glu He Leu Ala Asp Lys Val Ser Ala Lys Asp Asn Val He 35 40 45
ACC GCA TCA GGG AAT GCG ATC TTA TTG AAT TAT GAT GTG TAT ATT CTA 251 Thr Ala Ser Gly Asn Ala He Leu Leu Asn Tyr Asp Val Tyr He Leu 50 55 60
GCG GAC AAG GTG CGT TAT GAC ACT AAA ACC AAA GAA GCG TTA TTA GAG 299 Ala Asp Lys Val Arg Tyr Asp Thr Lys Thr Lys Glu Ala Leu Leu Glu 65 70 75 80
GGG AAT ATC AAG GTT TAT AGG GGC GAG GGT TTG CTC GTT AAA ACC GAT 347 Gly Asn He Lys Val Tyr Arg Gly Glu Gly Leu Leu Val Lys Thr Asp 85 90 95
TAC GTG AAA TTG AGT TTG AAT GAA AAA TAT GAA ATC ATT TTC CCC TTT 395 Tyr Val Lys Leu Ser Leu Asn Glu Lys Tyr Glu He He Phe Pro Phe 100 105 110
TAT GTC CAA GAC AGC GTG AGC GGG ATT TGG GTG AGC GCG GAT ATT GCC 443 Tyr Val Gin Asp Ser Val Ser Gly He Trp Val Ser Ala Asp He Ala 115 120 125
AGC GGA AAG GAT CAA AAA TAT AAG GTT AAA AAC ATG AGC ACT TCA GGG 491 Ser Gly Lys Asp Gin Lys Tyr Lys Val Lys Asn Met Ser Thr Ser Gly 130 135 140
TGC AGC ATT GAT AAC CCC ATT TGG CAT GTC AAT GCG ACT TCA GGC TCA 539 Cys Ser He Asp Asn Pro He Trp His Val Asn Ala Thr Ser Gly Ser 145 150 155 160
TTC AAC ATG CAA AAA TCG CAT TTG TCT ATG TGG AAT CCT AAG ATC TAT 587 Phe Asn Met Gin Lys Ser His Leu Ser Met Trp Asn Pro Lys He Tyr 165 170 175
GTC GGT GAT ATT CCT GTA TTG TAT TTG CCC TAT ATT TTC ATG TCC ACG 635 Val Gly Asp He Pro Val Leu Tyr Leu Pro Tyr He Phe Met Ser Thr 180 185 190
AGC AAT AAA AGA ACT ACT GGG TTT TTA TAC CCT GAG TTT GGC ACT TCC 683 Ser Asn Lys Arg Thr Thr Gly Phe Leu Tyr Pro Glu Phe Gly Thr Ser 195 200 205
AAC TTA GAC GGC TTT ATT TAT TTG CAA CCC TTT TAT TTA GCC CCC AAA 731 Asn Leu Asp Gly Phe He Tyr Leu Gin Pro Phe Tyr Leu Ala Pro Lys 210 215 220
AAC TCA TGG GAT ATG ACC TTT ACC CCA CAA ATC CGC TAT AAA AGG GGT 779 Asn Ser Trp Asp Met Thr Phe Thr Pro Gin He Arg Tyr Lys Arg Gly 225 230 235 240
TTT GGC TTG AAT TTT GAA GCG CGC TAC ATT AAC TCT AAA AAC GAC AGG 827 Phe Gly Leu Asn Phe Glu Ala Arg Tyr He Asn Ser Lys Asn Asp Arg 245 250 255
TTT TTA TTC AAC GCG CGC TAT TTT AGG AAT TAC ACC CAA TAT GTC AAA 875 Phe Leu Phe Asn Ala Arg Tyr Phe Arg Asn Tyr Thr Gin Tyr Val Lys 260 265 270
CGC TAC GAT TTG AGG AAT CAA AAT ATC TAC GGG TTT GAA TTT TTA AGC 923 Arg Tyr Asp Leu Arg Asn Gin Asn He Tyr Gly Phe Glu Phe Leu Ser 275 280 285
TCT AGC AGG GAC ACT TTA CAA AAA TAC TTC CAC CTT AAG TCT AAT ATT 971 Ser Ser Arg Asp Thr Leu Gin Lys Tyr Phe His Leu Lys Ser Asn He 290 295 300
GAC AAC GGG CAT TAC ATT GAC TTT TTA TAC ATG AAC GAT TTG GAC TAT 1019 Asp Asn Gly His Tyr He Asp Phe Leu Tyr Met Asn Asp Leu Asp Tyr 305 310 315 320
GTG CGT TTT GAA AAG GTT AAT AAG CGT ATC ACA GAC GCC ACG CAC ATG 1067 Val Arg Phe Glu Lys Val Asn Lys Arg He Thr Asp Ala Thr His Met 325 330 335 TCT AGG GCG AAT TAC TAT TTG CAA ACA GAA AAC AAT TAT TAC GGC TTG 1115 Ser Arg Ala Asn Tyr Tyr Leu Gin Thr Glu Asn Asn Tyr Tyr Gly Leu 340 345 350
AAT ATC AAG TAT TTT TTA AAC CTG AAT AAA ATC AAC AAT AAC CGC ACT 1163 Asn He Lys Tyr Phe Leu Asn Leu Asn Lys He Asn Asn Asn Arg Thr 355 360 365
TTC CAA TCT GTC CCT AAT TTG CAA TAC CAT AAA TAT TTA AAT TCT TTG 1211 Phe Gin Ser Val Pro Asn Leu Gin Tyr His Lys Tyr Leu Asn Ser Leu 370 375 380
TAT TTT AGA AAT TTG TTG TAT TCG GTG GAT TAT CAG TTT AGA AAC ACC 1259 Tyr Phe Arg Asn Leu Leu Tyr Ser Val Asp Tyr Gin Phe Arg Asn Thr 385 390 395 400
GCA AGA GAG ATT GGT TAT GGC TAT GTG CAA AAC GCT TTG AAT GTG CCG 1307 Ala Arg Glu He Gly Tyr Gly Tyr Val Gin Asn Ala Leu Asn Val Pro 405 410 415
GTG GGC TTG CAA TTT TCT TTG TTT AAA AAG TAT TTG TCT TTA GGG CTT 1355 Val Gly Leu Gin Phe Ser Leu Phe Lys Lys Tyr Leu Ser Leu Gly Leu 420 425 430
TGG AAT GAT CTC CAA CTA TCT AAT GTG GCT TTA ATG CAA TCT AAA AAT 1403 Trp Asn Asp Leu Gin Leu Ser Asn Val Ala Leu Met Gin Ser Lys Asn 435 440 445
TCC TTC GTG CCT ACG ATC CCT AAT GAA TCA AGG GAA TTT GGG AAT TTT 1451 Ser Phe Val Pro Thr He Pro Asn Glu Ser Arg Glu Phe Gly Asn Phe 450 455 460
GTG TCT TCA AAT TTT TCC ATG TAT GTC AAT ACG GAT TTG GCT AGA GAA 1499 Val Ser Ser Asn Phe Ser Met Tyr Val Asn Thr Asp Leu Ala Arg Glu 465 470 475 480
TAC AAC AAG CTT TTC CAC ACG ATC CAA CTA GAA GCG ATT TTC AAC ATC 1547 Tyr Asn Lys Leu Phe His Thr He Gin Leu Glu Ala He Phe Asn He 485 490 495
CCT TAT TAC ACC TTT AAA AAC GGC TTA TTT TCT CAA AAC ATG TAT GCT 1595 Pro Tyr Tyr Thr Phe Lys Asn Gly Leu Phe Ser Gin Asn Met Tyr Ala 500 505 510
TTA AGC GCG CAA GCC TTA AAC AGC TAC ACT TCG CCT TTA TTG AGA GAT 1643 Leu Ser Ala Gin Ala Leu Asn Ser Tyr Thr Ser Pro Leu Leu Arg Asp 515 520 525
TAT GAT TAT CAA GGG CGT TTG TAT GAC TCG GTG TGG AAT CCT AGC AGT 1691 Tyr Asp Tyr Gin Gly Arg Leu Tyr Asp Ser Val Trp Asn Pro Ser Ser 530 535 540
ATT TTA CCT AGC AAT GCG AGC AAC AAG ACG GTG GAT TTA ACC CTA ACG 1739 He Leu Pro Ser Asn Ala Ser Asn Lys Thr Val Asp Leu Thr Leu Thr 545 550 555 560 CAA TAC CTT TAT GGC TTA GGG GGG CAA GAG TTA TTG TAT TTT AAA ATA 1787 Gin Tyr Leu Tyr Gly Leu Gly Gly Gin Glu Leu Leu Tyr Phe Lys He
565 570
575
TCG CAA CTC ATC AAT CTT GAC GAT AAA GTT TCG CCC TTT AGA ATG CCA 1835 Ser Gin Leu He Asn Leu Asp Asp Lys Val Ser Pro Phe Arg Met Pro 580 585 590
CTA GAG AGC AAG ATC GGG TTT TCG CCC TTA ACG GGA TTG AAC ATC TTT 1883 Leu Glu Ser Lys He Gly Phe Ser Pro Leu Thr Gly Leu Asn He Phe 595 600 605
GGG AAT GTC TTT TAT TCG TTT TAT CAA AAC CGC TTA GAA GAA ATC TCT 1931 Gly Asn Val Phe Tyr Ser Phe Tyr Gin Asn Arg Leu Glu Glu He Ser 610 615 620
GTG AAC GCC AAT TAC CAA CGC AAG TTT TTA AGC TTT AAC CTC TCT TAT 1979 Val Asn Ala Asn Tyr Gin Arg Lys Phe Leu Ser Phe Asn Leu Ser Tyr 625 630 635 640
TTT TTA AAA AAC AAT TTT AGC AGT GGG ATT AAT AGC ATT GTA GAA AAT 2027 Phe Leu Lys Asn Asn Phe Ser Ser Gly He Asn Ser He Val Glu Asn 645 650 655
CTG CGG ATT ATT TAAAGGCGGG TTTTAGCAAC GACTTTGGCT ATTTTTCCAT GAGCGC 2085 Leu Arg He He 660
GGATGTGGGT TATGATATTA GAAACAATGT GGTTTTAAAT TGGAATGTGG GGATTTATAA 2145 AAAAATCCGT TGTTTTGGGA TTGG 2169
(2) INFORMATION FOR SEQ ID NO: 22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 660 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:
Met He Tyr Trp Leu Tyr Leu Ala Val Phe Phe Leu Leu Ser Ala Leu
1 5 10 15
Asp Ala Lys Glu He Ala Met Gin Arg Phe Asp Lys Gin Asn His Lys
20 25 30
He Phe Glu He Leu Ala Asp Lys Val Ser Ala Lys Asp Asn Val He
35 40 45
Thr Ala Ser Gly Asn Ala He Leu Leu Asn Tyr Asp Val Tyr He Leu
50 55 60
Ala Asp Lys Val Arg Tyr Asp Thr Lys Thr Lys Glu Ala Leu Leu Glu 65 70 75 80 Gly Asn He Lys Val Tyr Arg Gly Glu Gly Leu Leu Val Lys Thr Asp
85 90 95
Tyr Val Lys Leu Ser Leu Asn Glu Lys Tyr Glu He He Phe Pro Phe
100 105 110
Tyr Val Gin Asp Ser Val Ser Gly He Trp Val Ser Ala Asp He Ala
115 120 125
Ser Gly Lys Asp Gin Lys Tyr Lys Val Lys Asn Met Ser Thr Ser Gly
130 135 140
Cys Ser He Asp Asn Pro He Trp His Val Asn Ala Thr Ser Gly Ser 145 150 155 160
Phe Asn Met Gin Lys Ser His Leu Ser Met Trp Asn Pro Lys He Tyr
165 170 175
Val Gly Asp He Pro Val Leu Tyr Leu Pro Tyr He Phe Met Ser Thr
180 185 190
Ser Asn Lys Arg Thr Thr Gly Phe Leu Tyr Pro Glu Phe Gly Thr Ser
195 200 205
Asn Leu Asp Gly Phe He Tyr Leu Gin Pro Phe Tyr Leu Ala Pro Lys
210 215 220
Asn Ser Trp Asp Met Thr Phe Thr Pro Gin He Arg Tyr Lys Arg Gly 225 230 235 240
Phe Gly Leu Asn Phe Glu Ala Arg Tyr He Asn Ser Lys Asn Asp Arg
245 250 255
Phe Leu Phe Asn Ala Arg Tyr Phe Arg Asn Tyr Thr Gin Tyr Val Lys
260 265 270
Arg Tyr Asp Leu Arg Asn Gin Asn He Tyr Gly Phe Glu Phe Leu Ser
275 280 285
Ser Ser Arg Asp Thr Leu Gin Lys Tyr Phe His Leu Lys Ser Asn He
290 295 300
Asp Asn Gly His Tyr He Asp Phe Leu Tyr Met Asn Asp Leu Asp Tyr 305 310 315 320
Val Arg Phe Glu Lys Val Asn Lys Arg He Thr Asp Ala Thr His Met
325 330 335
Ser Arg Ala Asn Tyr Tyr Leu Gin Thr Glu Asn Asn Tyr Tyr Gly Leu
340 345 350
Asn He Lys Tyr Phe Leu Asn Leu Asn Lys He Asn Asn Asn Arg Thr
355 360 365
Phe Gin Ser Val Pro Asn Leu Gin Tyr His Lys Tyr Leu Asn Ser Leu
370 375 380
Tyr Phe Arg Asn Leu Leu Tyr Ser Val Asp Tyr Gin Phe Arg Asn Thr 385 390 395 400
Ala Arg Glu He Gly Tyr Gly Tyr Val Gin Asn Ala Leu Asn Val Pro
405 410 415
Val Gly Leu Gin Phe Ser Leu Phe Lys Lys Tyr Leu Ser Leu Gly Leu
420 425 430
Trp Asn Asp Leu Gin Leu Ser Asn Val Ala Leu Met Gin Ser Lys Asn
435 440 445
Ser Phe Val Pro Thr He Pro Asn Glu Ser Arg Glu Phe Gly Asn Phe
450 455 460
Val Ser Ser Asn Phe Ser Met Tyr Val Asn Thr Asp Leu Ala Arg Glu 465 470 475 480
Tyr Asn Lys Leu Phe His Thr He Gin Leu Glu Ala He Phe Asn He
485 490 495
Pro Tyr Tyr Thr Phe Lys Asn Gly Leu Phe Ser Gin Asn Met Tyr Ala
500 505 510
Leu Ser Ala Gin Ala Leu Asn Ser Tyr Thr Ser Pro Leu Leu Arg Asp 515 520 525
Tyr Asp Tyr Gin Gly Arg Leu Tyr Asp Ser Val Trp Asn Pro Ser Ser
530 535 540
He Leu Pro Ser Asn Ala Ser Asn Lys Thr Val Asp Leu Thr Leu Thr 545 550 555 560
Gin Tyr Leu Tyr Gly Leu Gly Gly Gin Glu Leu Leu Tyr Phe Lys He
565 570 575
Ser Gin Leu He Asn Leu Asp Asp Lys Val Ser Pro Phe Arg Met Pro
580 585 590
Leu Glu Ser Lys He Gly Phe Ser Pro Leu Thr Gly Leu Asn He Phe
595 600 605
Gly Asn Val Phe Tyr Ser Phe Tyr Gin Asn Arg Leu Glu Glu He Ser
610 615 620
Val Asn Ala Asn Tyr Gin Arg Lys Phe Leu Ser Phe Asn Leu Ser Tyr 625 630 635 640
Phe Leu Lys Asn Asn Phe Ser Ser Gly He Asn Ser He Val Glu Asn
645 650 655
Leu Arg He He 660
(2) INFORMATION FOR SEQ ID NO: 23:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 454 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...401 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:
TCAAGGTGTG CCAAACATGC CTTGAAACTC AATTTTTGAA TCTCAATTTT ATG AAA 56
Met Lys
1
GGA TTT GTT ATG AGT GGA TTA AAA GCA TTT AGT TGT GTA GTG GTT TTA 104 Gly Phe Val Met Ser Gly Leu Lys Ala Phe Ser Cys Val Val Val Leu 5 10 15
TGC GGT GCA ATG GCT AAT ACG GCT ATA GCT GGT CCT AAA ATA GAA GCA 152 Cys Gly Ala Met Ala Asn Thr Ala He Ala Gly Pro Lys He Glu Ala 20 25 30
AGG GGT GAG TTT GGC AGA TTT TGG GGG GGA GCT GTT GGT GGT GCA ATT 200 Arg Gly Glu Phe Gly Arg Phe Trp Gly Gly Ala Val Gly Gly Ala He 35 40 45 50 GGG GGT GGT GTT GGT GGT GCA GTG GGG GGA GCT GTT GGT GGT CCT GCG 248 Gly Gly Gly Val Gly Gly Ala Val Gly Gly Ala Val Gly Gly Pro Ala 55 60 65
GGT GGT TGG GCT GGC AGA TTA GTT GGT GGT TCT GTG GGG AGA GAG TTT 296 Gly Gly Trp Ala Gly Arg Leu Val Gly Gly Ser Val Gly Arg Glu Phe 70 75 80
GGT CGG GAA ATA GGC GAT AGG GTA GAA GAT TAC ATC CGT GGC GTT GAT 344 Gly Arg Glu He Gly Asp Arg Val Glu Asp Tyr He Arg Gly Val Asp 85 90 95
AGA GAG CCA CAA GCC CCA AGA GAA CCC ACC TAT GAT CGT CAT TTC GTG 392 Arg Glu Pro Gin Ala Pro Arg Glu Pro Thr Tyr Asp Arg His Phe Val 100 105 110
TAT GAC AGG TAGCTTTGGG CGAGAAAGGA GAGAGCATGA ATGTCAAAAA TCGTTTGAG 450
Tyr Asp Arg
115
CGAT 454
(2) INFORMATION FOR SEQ ID NO: 24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 117 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
Met Lys Gly Phe Val Met Ser Gly Leu Lys Ala Phe Ser Cys Val Val
1 5 10 15
Val Leu Cys Gly Ala Met Ala Asn Thr Ala He Ala Gly Pro Lys He
20 25 30
Glu Ala Arg Gly Glu Phe Gly Arg Phe Trp Gly Gly Ala Val Gly Gly
35 40 45
Ala He Gly Gly Gly Val Gly Gly Ala Val Gly Gly Ala Val Gly Gly
50 55 60
Pro Ala Gly Gly Trp Ala Gly Arg Leu Val Gly Gly Ser Val Gly Arg 65 70 75 80
Glu Phe Gly Arg Glu He Gly Asp Arg Val Glu Asp Tyr He Arg Gly
85 90 95
Val Asp Arg Glu Pro Gin Ala Pro Arg Glu Pro Thr Tyr Asp Arg His
100 105 110
Phe Val Tyr Asp Arg 115
(2) INFORMATION FOR SEQ ID NO: 25:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 856 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 59...802 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: GCCATTTTAA GCTAATATAA TATAGAGCGA TTATCAAAAA ATAAAGGGAA AAGACTGA 58
ATG TTG AAA AGA ATG ATA TTA TTA GGG GCT TTG GGT GTT TTA GCG AGC 106 Met Leu Lys Arg Met He Leu Leu Gly Ala Leu Gly Val Leu Ala Ser 1 5 10 15
GCT GAA GAG AGT GCG GCT TTT GTG GGA GTC AAT TAC CAG GTG AGC ATG 154 Ala Glu Glu Ser Ala Ala Phe Val Gly Val Asn Tyr Gin Val Ser Met 20 25 30
ATA CAA AAT CAG ACT AAA ATG GTG AAT GAC AAC GGC TTG CAA AAG CCT 202 He Gin Asn Gin Thr Lys Met Val Asn Asp Asn Gly Leu Gin Lys Pro 35 40 45
TTG ATA AAG TTT CCG CCT TAC GCA GGA GCG GGT TTT GAA GTG GGC TAT 250 Leu He Lys Phe Pro Pro Tyr Ala Gly Ala Gly Phe Glu Val Gly Tyr 50 55 60
AAG CAA TTT TTT GGT AAG AAA AAA TGG TTT GGC ATG CGT TAT TAT GGG 298 Lys Gin Phe Phe Gly Lys Lys Lys Trp Phe Gly Met Arg Tyr Tyr Gly 65 70 75 80
TTT TTT GAC TAC GCG CAC AAC CGC TTT GGC GTG ATG AAA AAG GGC ATT 346 Phe Phe Asp Tyr Ala His Asn Arg Phe Gly Val Met Lys Lys Gly He 85 90 95
CCG GTG GGC GAT AGT GGG TTT ATT TAC AAT AGT TTT AGT TTT GGA GGG 394 Pro Val Gly Asp Ser Gly Phe He Tyr Asn Ser Phe Ser Phe Gly Gly 100 105 110
AAC ACT TTA ACG GAA AGG GAT TCC TAT CAG GGG CAA TAC TAT GTC AAT 442 Asn Thr Leu Thr Glu Arg Asp Ser Tyr Gin Gly Gin Tyr Tyr Val Asn 115 120 125
TTA TTC ACT TAT GGC GTG GGG TTA GAT ACG CTG TGG AAT TTT GTG AAT 490 Leu Phe Thr Tyr Gly Val Gly Leu Asp Thr Leu Trp Asn Phe Val Asn 130 135 140 AAA GAA AAC ATG GTT TTT GGT TTT GTG GTG GGG ATC CAA TTA GCG GGG 538 Lys Glu Asn Met Val Phe Gly Phe Val Val Gly He Gin Leu Ala Gly 145 150 155 160
GAT AGT TGG GCA ACG AGC ATC AGT AAA GAA ATC GCT CAT TAT GCA AAA 586 Asp Ser Trp Ala Thr Ser He Ser Lys Glu He Ala His Tyr Ala Lys 165 170 175
CAC CAC AGC AAT TCC AGT TAT AGC CCG GCC AAT TTC CAG TTT TTA TGG 634 His His Ser Asn Ser Ser Tyr Ser Pro Ala Asn Phe Gin Phe Leu Trp 180 185 190
AAG TTT GGG GTC CGC ACC CAT ATC GCT AAA CAC AAT AGC CTA GAA TTA 682 Lys Phe Gly Val Arg Thr His He Ala Lys His Asn Ser Leu Glu Leu 195 200 205
GGG ATT AAA GTG CCT ACG ATC ACA CAC CAG CTT TTC TCT CTT ACC AAC 730 Gly He Lys Val Pro Thr He Thr His Gin Leu Phe Ser Leu Thr Asn 210 215 220
GAA AAG GGA TAC ACC TTA CAG GCT GAT GTG CGT AGA GTT TAT GCG TTT 778 Glu Lys Gly Tyr Thr Leu Gin Ala Asp Val Arg Arg Val Tyr Ala Phe 225 230 235 240
CAA ATC AGT TAC TTG AGG GAT TTT TAACCCCTTT TTAGATACAA TCACGCCTGA AA 834 Gin He Ser Tyr Leu Arg Asp Phe 245
CTATCCATTT AAAGGTGTGA AA 856
(2) INFORMATION FOR SEQ ID NO: 26:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 248 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:
Met Leu Lys Arg Met He Leu Leu Gly Ala Leu Gly Val Leu Ala Ser
1 5 10 15
Ala Glu Glu Ser Ala Ala Phe Val Gly Val Asn Tyr Gin Val Ser Met
20 25 30
He Gin Asn Gin Thr Lys Met Val Asn Asp Asn Gly Leu Gin Lys Pro
35 40 45
Leu He Lys Phe Pro Pro Tyr Ala Gly Ala Gly Phe Glu Val Gly Tyr
50 55 60
Lys Gin Phe Phe Gly Lys Lys Lys Trp Phe Gly Met Arg Tyr Tyr Gly 65 70 75 80
Phe Phe Asp Tyr Ala His Asn Arg Phe Gly Val Met Lys Lys Gly He 85 90 95
Pro Val Gly Asp Ser Gly Phe He Tyr Asn Ser Phe Ser Phe Gly Gly
100 105 110
Asn Thr Leu Thr Glu Arg Asp Ser Tyr Gin Gly Gin Tyr Tyr Val Asn
115 120 125
Leu Phe Thr Tyr Gly Val Gly Leu Asp Thr Leu Trp Asn Phe Val Asn
130 135 140
Lys Glu Asn Met Val Phe Gly Phe Val Val Gly He Gin Leu Ala Gly 145 150 155 160
Asp Ser Trp Ala Thr Ser He Ser Lys Glu He Ala His Tyr Ala Lys
165 170 175
His His Ser Asn Ser Ser Tyr Ser Pro Ala Asn Phe Gin Phe Leu Trp
180 185 190
Lys Phe Gly Val Arg Thr His He Ala Lys His Asn Ser Leu Glu Leu
195 200 205
Gly He Lys Val Pro Thr He Thr His Gin Leu Phe Ser Leu Thr Asn
210 215 220
Glu Lys Gly Tyr Thr Leu Gin Ala Asp Val Arg Arg Val Tyr Ala Phe 225 230 235 240
Gin He Ser Tyr Leu Arg Asp Phe 245
(2) INFORMATION FOR SEQ ID NO: 27:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2750 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 69...2699 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:
TAATAATAAC CTTAGGTTTA AAACTTGACT AAATTTTTAG AAAAAAGTAA ATAAAAAGGC 60 TAAAAGAA ATG CTT AGA AAT CAA TTT CGT ATC GTG TTT GTC TCT TGT ATT 110 Met Leu Arg Asn Gin Phe Arg He Val Phe Val Ser Cys He 1 5 10
GTC GCT AGC AAT TTG CAA GCT CAA GAA ACA ACC CAC ACT TTG GGT AAG 158 Val Ala Ser Asn Leu Gin Ala Gin Glu Thr Thr His Thr Leu Gly Lys 15 20 25 30
GTA ACC ACT AAG GGT GAA AGG ACT TTT GAA TAC AAC AAT AAA ATG TAT 206 Val Thr Thr Lys Gly Glu Arg Thr Phe Glu Tyr Asn Asn Lys Met Tyr 35 40 45 ATT GAC AGA AAA GAG CTC CAA CAA CGC CAA AGT AAC CAA ATC CGT GAT 254 He Asp Arg Lys Glu Leu Gin Gin Arg Gin Ser Asn Gin He Arg Asp 50 55 60
ATT TTT AGG ACT AGA GCG GAT GTG AAT GTG GCC AGT GGG GGC TTG ATG 302 He Phe Arg Thr Arg Ala Asp Val Asn Val Ala Ser Gly Gly Leu Met 65 70 75
GCG CAA AAG ATC TAT GTT AGG GGG ATT GAG AGC CGT CTC TTA AGG GTA 350 Ala Gin Lys He Tyr Val Arg Gly He Glu Ser Arg Leu Leu Arg Val 80 85 90
ACA ATA GAT GGC GTC GCC CAA AAT GGT AAC ATT TTC CAC CAT GAC GCT 398 Thr He Asp Gly Val Ala Gin Asn Gly Asn He Phe His His Asp Ala 95 100 105 110
AAC ACC GTG ATC GAT CCT AAC ATG ATT AAA GAA GTG GAA GTG ATC AAG 446 Asn Thr Val He Asp Pro Asn Met He Lys Glu Val Glu Val He Lys 115 120 125
GGG GCG GCG AAC GCT TCA GCA GGC CCA GGT GCG GTG GCG GGT AAA TTG 494 Gly Ala Ala Asn Ala Ser Ala Gly Pro Gly Ala Val Ala Gly Lys Leu 130 135 140
TCT TTC ACC ACG ATT GAC GCT AAC GAC TTC TTA AGA AAG AAT CAA ACT 542 Ser Phe Thr Thr He Asp Ala Asn Asp Phe Leu Arg Lys Asn Gin Thr 145 150 155
TAT GGC GCT AAA GCG GAA GCG GCC TTT TAT ACC AAC TTC GGG TAT CGC 590 Tyr Gly Ala Lys Ala Glu Ala Ala Phe Tyr Thr Asn Phe Gly Tyr Arg 160 165 170
ATG AAC GCC ACT GCG GCT TAC CGG GGG AAA AAC TGG GAC ATC CTA GCC 638 Met Asn Ala Thr Ala Ala Tyr Arg Gly Lys Asn Trp Asp He Leu Ala 175 180 185 190
TAT TAC AAC CAT CAA AAT ATT TTT TAC TAC AGA GAC GGG AAC AAC GCT 686 Tyr Tyr Asn His Gin Asn He Phe Tyr Tyr Arg Asp Gly Asn Asn Ala 195 200 205
TTT AGG AAT GTC TTC CAC CCT AAC TAC GAT TTA CAA GAT CCA AGC AAT 734 Phe Arg Asn Val Phe His Pro Asn Tyr Asp Leu Gin Asp Pro Ser Asn 210 215 220
AGC GAT ATG AGC GTA GGG ACT CCC AGT GAA GTC AAT AGC GTT TTA GCT 782 Ser Asp Met Ser Val Gly Thr Pro Ser Glu Val Asn Ser Val Leu Ala 225 230 235
AAA ATT AAT GGC TAT ATC AAC GAA ACA GAC AGC ATT AGC GTG AGC TAC 830 Lys He Asn Gly Tyr He Asn Glu Thr Asp Ser He Ser Val Ser Tyr 240 245 250
AAC CTC ACA CGA GAC AAT TCT ACA AGG CTT TTA CGC CCT AAC ACC ACT 878 Asn Leu Thr Arg Asp Asn Ser Thr Arg Leu Leu Arg Pro Asn Thr Thr 255 260 265 270 TCA GCC CTC TCT AAA GCC AAT GAC CCA GGA AGC CAG CCA GCC CCC TTT 926 Ser Ala Leu Ser Lys Ala Asn Asp Pro Gly Ser Gin Pro Ala Pro Phe 275 280 285
GTG ATT GAC TTT GGG AAA GAA TTA GCC CAT ACG ATC AAC TTC AAC CAC 974 Val He Asp Phe Gly Lys Glu Leu Ala His Thr He Asn Phe Asn His 290 295 300
AAT TTG AGC TTG AAA TAC AAG CAT GAA GGC GGC CCT AAT TTT AAC CAG 1022 Asn Leu Ser Leu Lys Tyr Lys His Glu Gly Gly Pro Asn Phe Asn Gin 305 310 315
CCG CGC GTT GAA TCC ACC GCC TTT TTA GGG GTA AGG GGG GGC AAT TAT 1070 Pro Arg Val Glu Ser Thr Ala Phe Leu Gly Val Arg Gly Gly Asn Tyr 320 325 330
AAC CCT GTG GTG AAT CCT TTC GCT TAC AAT TCT AAC GAG CCG GCT AAC 1118 Asn Pro Val Val Asn Pro Phe Ala Tyr Asn Ser Asn Glu Pro Ala Asn 335 340 345 350
CCA GAT TAT ATC CCT GAA GTG AAA GAG TGG TGT AAT AAC CCA GAT AAT 1166 Pro Asp Tyr He Pro Glu Val Lys Glu Trp Cys Asn Asn Pro Asp Asn 355 360 365
ATC AGC CAG TGC ACG CAA GGG GCT ATC AGG CCT TCT AAT GGA GGC TAT 1214 He Ser Gin Cys Thr Gin Gly Ala He Arg Pro Ser Asn Gly Gly Tyr 370 375 380
CAA ATA GGC TAT GGC ACG CCT AAT TCT ATT AAT TGG CAA GGG ACT AGC 1262 Gin He Gly Tyr Gly Thr Pro Asn Ser He Asn Trp Gin Gly Thr Ser 385 390 395
GAT TCT AGT GGA GGG GCG CAA GCA GGG TAT GGG CAG CTT AAC GCT ATT 1310 Asp Ser Ser Gly Gly Ala Gin Ala Gly Tyr Gly Gin Leu Asn Ala He 400 405 410
TCT ACA AGC GCG AAC GTT TAT CAT GGG CTT GTC CCT AAA AAT CCT GAT 1358 Ser Thr Ser Ala Asn Val Tyr His Gly Leu Val Pro Lys Asn Pro Asp 415 420 425 430
TAT GAC ATG ACC CCC CCT AAC GCT CAA AAC CCT AGC GCA AAC GAT TGG 1406 Tyr Asp Met Thr Pro Pro Asn Ala Gin Asn Pro Ser Ala Asn Asp Trp 435 440 445
ACT TTA GGG AAT GCG GAC GCT GAG GGG ACT TTA GCC AGA AGG ATT TTT 1454 Thr Leu Gly Asn Ala Asp Ala Glu Gly Thr Leu Ala Arg Arg He Phe 450 455 460
TTA ATC AAC TCG GGC GTT AAT TTT AAA GTA ACC CAC CCC ATT AGC GAA 1502 Leu He Asn Ser Gly Val Asn Phe Lys Val Thr His Pro He Ser Glu 465 470 475
GAT TAT GGG AAT GTG TTT GAA TAC GGC ATG ATT TAT CAA AAC CTG AGC 1550 Asp Tyr Gly Asn Val Phe Glu Tyr Gly Met He Tyr Gin Asn Leu Ser 480 485 490 GTT TTC TCT GGA TTG GAT AAA GGC AAA AAC GGC TAT TAT AAA AAC AAC 1598 Val Phe Ser Gly Leu Asp Lys Gly Lys Asn Gly Tyr Tyr Lys Asn Asn 495 500 505 510
ATT GAT CCT AAC GAC CCT AAC GGG CCG GGC TTG CCT TAC CGC CAT TAC 1646 He Asp Pro Asn Asp Pro Asn Gly Pro Gly Leu Pro Tyr Arg His Tyr 515 520 525
TAC ACC GAT CAA AGC TCC CAA TAC CCC CAA AAT CTC AAC ACC CCT AAC 1694 Tyr Thr Asp Gin Ser Ser Gin Tyr Pro Gin Asn Leu Asn Thr Pro Asn 530 535 540
CCG CTC TAT CGT AAC ATG CCC CAA AAT TCG CAT GCG ATC GGC AAT ATC 1742 Pro Leu Tyr Arg Asn Met Pro Gin Asn Ser His Ala He Gly Asn He 545 550 555
ATC GGA GGG TTT ATG CAA GCA AAC TAC AAC ATT TTA AGC AAT GTG ATC 1790 He Gly Gly Phe Met Gin Ala Asn Tyr Asn He Leu Ser Asn Val He 560 565 570
GTG GGT GCG GGA ACT CGT TAT GAT ATT TAC ACC TTG CTA GAC AAA AAC 1838 Val Gly Ala Gly Thr Arg Tyr Asp He Tyr Thr Leu Leu Asp Lys Asn 575 580 585 590
GGC CGC ACG CAT GTA ACT TCT GGT TTC TCG CCT TCT GCA ACC GTG CTT 1886 Gly Arg Thr His Val Thr Ser Gly Phe Ser Pro Ser Ala Thr Val Leu 595 600 605
TAT AAC CCC ATT GAA AGC ATT GGC TTG AAA GTG AGT TAT GCG TAT GTA 1934 Tyr Asn Pro He Glu Ser He Gly Leu Lys Val Ser Tyr Ala Tyr Val 610 615 620
ACT AAG GGG GCT TTG CCT GGC GAT GGC GTT TTG ATG CGC GAT CCT ACG 1982 Thr Lys Gly Ala Leu Pro Gly Asp Gly Val Leu Met Arg Asp Pro Thr 625 630 635
GTG ATT TAT CAA AGG AAT TTG CGC CCT GCG ATC GGT CAA AAT GTG GAA 2030 Val He Tyr Gin Arg Asn Leu Arg Pro Ala He Gly Gin Asn Val Glu 640 645 650
TTT AAT GTG GAT TTC AAC AGC AAG TAT TTC AAT GTG CGC GGG GCA GCG 2078 Phe Asn Val Asp Phe Asn Ser Lys Tyr Phe Asn Val Arg Gly Ala Ala 655 660 665 670
TTC TAT CAA GTC ATC AAT AAT TTC ATC AAC AGC TAC GGG CAA GAC ACT 2126 Phe Tyr Gin Val He Asn Asn Phe He Asn Ser Tyr Gly Gin Asp Thr 675 680 685
TCT AAA AAT GGA GGG GGT AAC GCA ACC GCA AAA AAC ATG TCA GGG AAT 2174 Ser Lys Asn Gly Gly Gly Asn Ala Thr Ala Lys Asn Met Ser Gly Asn 690 695 700
TTA CCC GAA ACC ATT AAC ATT TAT GGT TAT GAA GTT TCA GGG AAT GTG 2222 Leu Pro Glu Thr He Asn He Tyr Gly Tyr Glu Val Ser Gly Asn Val 705 710 715 AGG TAT AAG AAT TTC TTA GGG ACT TTC TCA GTG GCT CGC TCT TGG CCA 2270 Arg Tyr Lys Asn Phe Leu Gly Thr Phe Ser Val Ala Arg Ser Trp Pro 720 725 730
ACG GCT AGG GGG CAT TTA TTA GCG GAC ACT TAC GCT CTA GCT GCA ACG 2318 Thr Ala Arg Gly His Leu Leu Ala Asp Thr Tyr Ala Leu Ala Ala Thr 735 740 745 750
ACT GGG AAT GTG TTT ATT TTA AAA GCC GAT TAT GAT GTT CGC AGG TGG 2366 Thr Gly Asn Val Phe He Leu Lys Ala Asp Tyr Asp Val Arg Arg Trp 755 760 765
GGG CTT ACT TTA ACC TGG CTC TCG CGC TTT GTA ACT AAC ATG TAT TAT 2414 Gly Leu Thr Leu Thr Trp Leu Ser Arg Phe Val Thr Asn Met Tyr Tyr 770 775 780
GAG GGC TAT TCT ATC TAT TAC CCG CAA TAC GGC TTG ATC AAA ATC CAT 2462 Glu Gly Tyr Ser He Tyr Tyr Pro Gin Tyr Gly Leu He Lys He His 785 790 795
AAA CCC GGG TAT GGC GTG CAT AAT GTC TTT ATC AAC TGG ACT CCG CCT 2510 Lys Pro Gly Tyr Gly Val His Asn Val Phe He Asn Trp Thr Pro Pro 800 805 810
TCT AAA AAA TGG CAG GGT TTA AGG ATT TCA GCC GTG TTT AAT AAT ATC 2558 Ser Lys Lys Trp Gin Gly Leu Arg He Ser Ala Val Phe Asn Asn He 815 820 825 830
TTA AAC AAG CAA TAT GTG GAT CAA ACT TCT GTG TTT CAA GCG AGC GCG 2606 Leu Asn Lys Gin Tyr Val Asp Gin Thr Ser Val Phe Gin Ala Ser Ala 835 840 845
GAC GCT CCA GCG AGC GAT ATG ATC CCT AAA GGT AAG CGC ATG GCG CTC 2654 Asp Ala Pro Ala Ser Asp Met He Pro Lys Gly Lys Arg Met Ala Leu 850 855 860
CCG GCT CCT GGA TTT AAC GCG CGT TTT GAG GTA TCC TAT CAG TTC TAAAA 2704 Pro Ala Pro Gly Phe Asn Ala Arg Phe Glu Val Ser Tyr Gin Phe 865 870 875
TGAAAGGAAT CTTAGGATTT CTTTTTGAAT TTTGAACATG GAAACA 2750
(2) INFORMATION FOR SEQ ID NO: 28:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 877 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: Met Leu Arg Asn Gin Phe Arg He Val Phe Val Ser Cys He Val Ala
1 5 10 15
Ser Asn Leu Gin Ala Gin Glu Thr Thr His Thr Leu Gly Lys Val Thr
20 25 30
Thr Lys Gly Glu Arg Thr Phe Glu Tyr Asn Asn Lys Met Tyr He Asp
35 40 45
Arg Lys Glu Leu Gin Gin Arg Gin Ser Asn Gin He Arg Asp He Phe
50 55 60
Arg Thr Arg Ala Asp Val Asn Val Ala Ser Gly Gly Leu Met Ala Gin 65 70 75 80
Lys He Tyr Val Arg Gly He Glu Ser Arg Leu Leu Arg Val Thr He
85 90 95
Asp Gly Val Ala Gin Asn Gly Asn He Phe His His Asp Ala Asn Thr
100 105 110
Val He Asp Pro Asn Met He Lys Glu Val Glu Val He Lys Gly Ala
115 120 125
Ala Asn Ala Ser Ala Gly Pro Gly Ala Val Ala Gly Lys Leu Ser Phe
130 135 140
Thr Thr He Asp Ala Asn Asp Phe Leu Arg Lys Asn Gin Thr Tyr Gly 145 150 155 160
Ala Lys Ala Glu Ala Ala Phe Tyr Thr Asn Phe Gly Tyr Arg Met Asn
165 170 175
Ala Thr Ala Ala Tyr Arg Gly Lys Asn Trp Asp He Leu Ala Tyr Tyr
180 185 190
Asn His Gin Asn He Phe Tyr Tyr Arg Asp Gly Asn Asn Ala Phe Arg
195 200 205
Asn Val Phe His Pro Asn Tyr Asp Leu Gin Asp Pro Ser Asn Ser Asp
210 215 220
Met Ser Val Gly Thr Pro Ser Glu Val Asn Ser Val Leu Ala Lys He 225 230 235 240
Asn Gly Tyr He Asn Glu Thr Asp Ser He Ser Val Ser Tyr Asn Leu
245 250 255
Thr Arg Asp Asn Ser Thr Arg Leu Leu Arg Pro Asn Thr Thr Ser Ala
260 265 270
Leu Ser Lys Ala Asn Asp Pro Gly Ser Gin Pro Ala Pro Phe Val He
275 280 285
Asp Phe Gly Lys Glu Leu Ala His Thr He Asn Phe Asn His Asn Leu
290 295 300
Ser Leu Lys Tyr Lys His Glu Gly Gly Pro Asn Phe Asn Gin Pro Arg 305 310 315 320
Val Glu Ser Thr Ala Phe Leu Gly Val Arg Gly Gly Asn Tyr Asn Pro
325 330 335
Val Val Asn Pro Phe Ala Tyr Asn Ser Asn Glu Pro Ala Asn Pro Asp
340 345 350
Tyr He Pro Glu Val Lys Glu Trp Cys Asn Asn Pro Asp Asn He Ser
355 360 365
Gin Cys Thr Gin Gly Ala He Arg Pro Ser Asn Gly Gly Tyr Gin He
370 375 380
Gly Tyr Gly Thr Pro Asn Ser He Asn Trp Gin Gly Thr Ser Asp Ser 385 390 395 400
Ser Gly Gly Ala Gin Ala Gly Tyr Gly Gin Leu Asn Ala He Ser Thr
405 410 415
Ser Ala Asn Val Tyr His Gly Leu Val Pro Lys Asn Pro Asp Tyr Asp
420 425 430
Met Thr Pro Pro Asn Ala Gin Asn Pro Ser Ala Asn Asp Trp Thr Leu 435 440 445
Gly Asn Ala Asp Ala Glu Gly Thr Leu Ala Arg Arg He Phe Leu He
450 455 460
Asn Ser Gly Val Asn Phe Lys Val Thr His Pro He Ser Glu Asp Tyr 465 470 475 480
Gly Asn Val Phe Glu Tyr Gly Met He Tyr Gin Asn Leu Ser Val Phe
485 490 495
Ser Gly Leu Asp Lys Gly Lys Asn Gly Tyr Tyr Lys Asn Asn He Asp
500 505 510
Pro Asn Asp Pro Asn Gly Pro Gly Leu Pro Tyr Arg His Tyr Tyr Thr
515 520 525
Asp Gin Ser Ser Gin Tyr Pro Gin Asn Leu Asn Thr Pro Asn Pro Leu
530 535 540
Tyr Arg Asn Met Pro Gin Asn Ser His Ala He Gly Asn He He Gly 545 550 555 560
Gly Phe Met Gin Ala Asn Tyr Asn He Leu Ser Asn Val He Val Gly
565 570 575
Ala Gly Thr Arg Tyr Asp He Tyr Thr Leu Leu Asp Lys Asn Gly Arg
580 585 590
Thr His Val Thr Ser Gly Phe Ser Pro Ser Ala Thr Val Leu Tyr Asn
595 600 605
Pro He Glu Ser He Gly Leu Lys Val Ser Tyr Ala Tyr Val Thr Lys
610 615 620
Gly Ala Leu Pro Gly Asp Gly Val Leu Met Arg Asp Pro Thr Val He 625 630 635 640
Tyr Gin Arg Asn Leu Arg Pro Ala He Gly Gin Asn Val Glu Phe Asn
645 650 655
Val Asp Phe Asn Ser Lys Tyr Phe Asn Val Arg Gly Ala Ala Phe Tyr
660 665 670
Gin Val He Asn Asn Phe He Asn Ser Tyr Gly Gin Asp Thr Ser Lys
675 680 685
Asn Gly Gly Gly Asn Ala Thr Ala Lys Asn Met Ser Gly Asn Leu Pro
690 695 700
Glu Thr He Asn He Tyr Gly Tyr Glu Val Ser Gly Asn Val Arg Tyr 705 710 715 720
Lys Asn Phe Leu Gly Thr Phe Ser Val Ala Arg Ser Trp Pro Thr Ala
725 730 735
Arg Gly His Leu Leu Ala Asp Thr Tyr Ala Leu Ala Ala Thr Thr Gly
740 745 750
Asn Val Phe He Leu Lys Ala Asp Tyr Asp Val Arg Arg Trp Gly Leu
755 760 765
Thr Leu Thr Trp Leu Ser Arg Phe Val Thr Asn Met Tyr Tyr Glu Gly
770 775 780
Tyr Ser He Tyr Tyr Pro Gin Tyr Gly Leu He Lys He His Lys Pro 785 790 795 800
Gly Tyr Gly Val His Asn Val Phe He Asn Trp Thr Pro Pro Ser Lys
805 810 815
Lys Trp Gin Gly Leu Arg He Ser Ala Val Phe Asn Asn He Leu Asn
820 825 830
Lys Gin Tyr Val Asp Gin Thr Ser Val Phe Gin Ala Ser Ala Asp Ala
835 840 845
Pro Ala Ser Asp Met He Pro Lys Gly Lys Arg Met Ala Leu Pro Ala
850 855 860
Pro Gly Phe Asn Ala Arg Phe Glu Val Ser Tyr Gin Phe 865 870 875 (2) INFORMATION FOR SEQ ID NO: 29:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 370 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...317 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:
TTTAAAATCA CCCGTTACAG CATCACTGAA ATCACTAATA GGGGTGATTG ATG CGT 56
Met Arg 1
AAG GTT TTA TAC GCT CTT GTG GGC TTT TTG TTG GCT TTT AGC GCT TTA 104 Lys Val Leu Tyr Ala Leu Val Gly Phe Leu Leu Ala Phe Ser Ala Leu 5 10 15
AAA GCC GAT GAT TTT TTA GAA GAA GCG AAC GAA ACA GCC CCG GCG CAT 152 Lys Ala Asp Asp Phe Leu Glu Glu Ala Asn Glu Thr Ala Pro Ala His 20 25 30
TTA AAC CAC CCT ATG CAG GAT TTA AAC GCC ATT CAA GGG AGC TTT TTT 200 Leu Asn His Pro Met Gin Asp Leu Asn Ala He Gin Gly Ser Phe Phe 35 40 45 50
GAC AAA AAC CGC TCA AAA ATG TCC AAC ACT TTG AAC ATT GAT TAC TTT 248 Asp Lys Asn Arg Ser Lys Met Ser Asn Thr Leu Asn He Asp Tyr Phe 55 60 65
CAA GGG CAA ACT TAT AAA ATC CCG CTT GCG TTA TGC GAT GGC GMC CTT 296 Gin Gly Gin Thr Tyr Lys He Pro Leu Ala Leu Cys Asp Gly Xaa Leu 70 75 80
ATT GTT TTT TTC AAA ACC CAT TAGCGATTTT GTTTTAGGGG ATAAGGTGGG TTTT 351 He Val Phe Phe Lys Thr His 85
GATGCGAAAA TTTTAGAAA 370
(2) INFORMATION FOR SEQ ID NO: 30:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 89 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
Met Arg Lys Val Leu Tyr Ala Leu Val Gly Phe Leu Leu Ala Phe Ser
1 5 10 15
Ala Leu Lys Ala Asp Asp Phe Leu Glu Glu Ala Asn Glu Thr Ala Pro
20 25 30
Ala His Leu Asn His Pro Met Gin Asp Leu Asn Ala He Gin Gly Ser
35 40 45
Phe Phe Asp Lys Asn Arg Ser Lys Met Ser Asn Thr Leu Asn He Asp
50 55 60
Tyr Phe Gin Gly Gin Thr Tyr Lys He Pro Leu Ala Leu Cys Asp Gly 65 70 75 80
Xaa Leu He Val Phe Phe Lys Thr His 85
(2) INFORMATION FOR SEQ ID NO: 31:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 357 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...305 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:
ATGCAATAAA AAAAGAAATT CTTAGGATTT CTCACATTAA GGAGTTTTAA ATG AAA 56
Met Lys 1
AAG GTT TTT TTA GGT ATG GCA TTA GCC TTT AGT GTG TCC ATG GCA GAA 104 Lys Val Phe Leu Gly Met Ala Leu Ala Phe Ser Val Ser Met Ala Glu 5 10 15
AAA AGT GGC GCG TTT TTA GGA GGG GGG TTT CAA TAT TCT AAT TTA GAA 152 Lys Ser Gly Ala Phe Leu Gly Gly Gly Phe Gin Tyr Ser Asn Leu Glu 20 25 30
AAC CAA AAC ACC ACC CGC ACC CCA GGC GCT AAC AAT AAC ACC CCG ATA 200 Asn Gin Asn Thr Thr Arg Thr Pro Gly Ala Asn Asn Asn Thr Pro He 35 40 45 50
GAC ACT TCA ATG TTT GGC AGC AAC AAA ACA GCT CCA GCC CAA GAA ACG 248 Asp Thr Ser Met Phe Gly Ser Asn Lys Thr Ala Pro Ala Gin Glu Thr 55 60 65
CAA AGC GCT TCC AAA CCG GAC ACT AAA GTC AAT CCA AGC GCA AGT TGG 296 Gin Ser Ala Ser Lys Pro Asp Thr Lys Val Asn Pro Ser Ala Ser Trp 70 75 80
ATG AAA AAA TAAGAAGGAA GTTATGAAAA AGTCATTCAA AAAATTAGGC TTTGTCTCT 354 Met Lys Lys 85
TTA 357
(2) INFORMATION FOR SEQ ID NO: 32:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:
Met Lys Lys Val Phe Leu Gly Met Ala Leu Ala Phe Ser Val Ser Met
1 5 10 15
Ala Glu Lys Ser Gly Ala Phe Leu Gly Gly Gly Phe Gin Tyr Ser Asn
20 25 30
Leu Glu Asn Gin Asn Thr Thr Arg Thr Pro Gly Ala Asn Asn Asn Thr
35 40 45
Pro He Asp Thr Ser Met Phe Gly Ser Asn Lys Thr Ala Pro Ala Gin
50 55 60
Glu Thr Gin Ser Ala Ser Lys Pro Asp Thr Lys Val Asn Pro Ser Ala 65 70 75 80
Ser Trp Met Lys Lys 85
(2) INFORMATION FOR SEQ ID NO: 33:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 961 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...908 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:
GAATGTAGCA TTTAGAACTC AAGTAGAGAA AATGTAGAAG GAAGGAATAC ATG AAG 56
Met Lys 1
AAA TCT GTT ATA GTA GGT GCT ATC TCT CTA GCA ATG ACA AGC TTA TTG 104 Lys Ser Val He Val Gly Ala He Ser Leu Ala Met Thr Ser Leu Leu 5 10 15
TCA GCA GAG ACC CCT AAG CAA GAA AAA GCT ATT AAG ACT AGC CCT ACC 152 Ser Ala Glu Thr Pro Lys Gin Glu Lys Ala He Lys Thr Ser Pro Thr 20 25 30
AAA AAA GGT GAA AGA AAT GCT GCT TTT ATA GGG ATT GAT TAC CAG TTG 200 Lys Lys Gly Glu Arg Asn Ala Ala Phe He Gly He Asp Tyr Gin Leu 35 40 45 50
GGT ATG CTC AGC ACT ACC GCT CAA AAT TGT TCC CAT GGG AAT TGC AAT 248 Gly Met Leu Ser Thr Thr Ala Gin Asn Cys Ser His Gly Asn Cys Asn 55 60 65
GGT AAT CAA AGT GGG GCT TAC GGC TCT AAT ACG CCT AAC ATG CCT ACA 296 Gly Asn Gin Ser Gly Ala Tyr Gly Ser Asn Thr Pro Asn Met Pro Thr 70 75 80
GCG TCA AAC CCA ACA GGA GGG TTT ACT CAT GGC GCT CTA GGG ACT CGT 344 Ala Ser Asn Pro Thr Gly Gly Phe Thr His Gly Ala Leu Gly Thr Arg 85 90 95
GGG TAT AAA GGC TTA AGC AAC CAA CAA TAC GCT ATC AAT GGT TTT GGT 392 Gly Tyr Lys Gly Leu Ser Asn Gin Gin Tyr Ala He Asn Gly Phe Gly 100 105 110
TTT GTT GTA GGG TAT AAG CAT TTT TTC AAG AAA TCC CCG CAA TTT GGA 440 Phe Val Val Gly Tyr Lys His Phe Phe Lys Lys Ser Pro Gin Phe Gly 115 120 125 130
ATG CGT TAT TAC GGA TTC TTT GAT TTT GCA AGC TCT TAT TAT AAG TAT 488 Met Arg Tyr Tyr Gly Phe Phe Asp Phe Ala Ser Ser Tyr Tyr Lys Tyr 135 140 145
TAC ACT TAT AAT GAT TAT GGC ATG AGA GAC GCT CGC AAG GGT TCT CAA 536 Tyr Thr Tyr Asn Asp Tyr Gly Met Arg Asp Ala Arg Lys Gly Ser Gin 150 155 160
AGT TTC ATG TTT GGC TAT GGG GCT GGC ACA GAT GTG TTG TTT AAC CCG 584 Ser Phe Met Phe Gly Tyr Gly Ala Gly Thr Asp Val Leu Phe Asn Pro 165 170 175 GCT ATT TTC AAT CGT GAG AAC TTG CAT TTT GGG TTT TTC TTG GGC GTT 632 Ala He Phe Asn Arg Glu Asn Leu His Phe Gly Phe Phe Leu Gly Val 180 185 190
GCG ATC GGT GGC ACC TCT TGG GGT CCA ACA AAC TAT TAT TTT AAG GAC 680 Ala He Gly Gly Thr Ser Trp Gly Pro Thr Asn Tyr Tyr Phe Lys Asp 195 200 205 210
TTG GCT GAT GAG TAT AGA GGG AGT TTC CAC CCA TCA AAT TTC CAG GTC 728 Leu Ala Asp Glu Tyr Arg Gly Ser Phe His Pro Ser Asn Phe Gin Val 215 220 225
TTA GTT AAT GGT GGG ATT CGC TTA GGC ACT AAA CAC CAA GGT TTT GAA 776 Leu Val Asn Gly Gly He Arg Leu Gly Thr Lys His Gin Gly Phe Glu 230 235 240
ATT GGC TTG AAA ATC CAA ACC ATC CGC AAC AAT TAC TAC ACC GCT AGT 824 He Gly Leu Lys He Gin Thr He Arg Asn Asn Tyr Tyr Thr Ala Ser 245 250 255
GCG GAT AAT GTG CCT GAA GGG ACT ACT TAT AGA TTC ACT TTC CAC CGC 872 Ala Asp Asn Val Pro Glu Gly Thr Thr Tyr Arg Phe Thr Phe His Arg 260 265 270
CCC TAT GCC TTT TAT TGG CGT TAC ATT GTA AGC TTT TAAGGTGTTT TAGGGC 924 Pro Tyr Ala Phe Tyr Trp Arg Tyr He Val Ser Phe 275 280 285
TAATCTTATG GGGGCATAGA AAAGGGCTTT TGCTCTT 961
(2) INFORMATION FOR SEQ ID NO: 34:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 286 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:
Met Lys Lys Ser Val He Val Gly Ala He Ser Leu Ala Met Thr Ser
1 5 10 15
Leu Leu Ser Ala Glu Thr Pro Lys Gin Glu Lys Ala He Lys Thr Ser
20 25 30
Pro Thr Lys Lys Gly Glu Arg Asn Ala Ala Phe He Gly He Asp Tyr
35 40 45
Gin Leu Gly Met Leu Ser Thr Thr Ala Gin Asn Cys Ser His Gly Asn
50 55 60
Cys Asn Gly Asn Gin Ser Gly Ala Tyr Gly Ser Asn Thr Pro Asn Met 65 70 75 80
Pro Thr Ala Ser Asn Pro Thr Gly Gly Phe Thr His Gly Ala Leu Gly 85 90 95 Thr Arg Gly Tyr Lys Gly Leu Ser Asn Gin Gin Tyr Ala He Asn Gly
100 105 110
Phe Gly Phe Val Val Gly Tyr Lys His Phe Phe Lys Lys Ser Pro Gin
115 120 125
Phe Gly Met Arg Tyr Tyr Gly Phe Phe Asp Phe Ala Ser Ser Tyr Tyr
130 135 140
Lys Tyr Tyr Thr Tyr Asn Asp Tyr Gly Met Arg Asp Ala Arg Lys Gly 145 150 155 160
Ser Gin Ser Phe Met Phe Gly Tyr Gly Ala Gly Thr Asp Val Leu Phe
165 170 175
Asn Pro Ala He Phe Asn Arg Glu Asn Leu His Phe Gly Phe Phe Leu
180 185 190
Gly Val Ala He Gly Gly Thr Ser Trp Gly Pro Thr Asn Tyr Tyr Phe
195 200 205
Lys Asp Leu Ala Asp Glu Tyr Arg Gly Ser Phe His Pro Ser Asn Phe
210 215 220
Gin Val Leu Val Asn Gly Gly He Arg Leu Gly Thr Lys His Gin Gly 225 230 235 240
Phe Glu He Gly Leu Lys He Gin Thr He Arg Asn Asn Tyr Tyr Thr
245 250 255
Ala Ser Ala Asp Asn Val Pro Glu Gly Thr Thr Tyr Arg Phe Thr Phe
260 265 270
His Arg Pro Tyr Ala Phe Tyr Trp Arg Tyr He Val Ser Phe 275 280 285
(2) INFORMATION FOR SEQ ID NO: 35:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 289 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...236 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:
GGGATTTTAT TTCTTATAGC AGAAATTATT TTTAAAGTAA AAGACAAATC ATG TTT 56
Met Phe 1
AGA GAT ATA GTA GAT ATT TTA ATA TCT GTT GTT ATT ATT GGA TTA GTA 104 Arg Asp He Val Asp He Leu He Ser Val Val He He Gly Leu Val 5 10 15
TTA ACA GCT ATT AGA GCT ACT ATA ATG GCG TTT AAA GGC GAT ACT GAT 152 Leu Thr Ala He Arg Ala Thr He Met Ala Phe Lys Gly Asp Thr Asp 20 25 30
GAT GAT GAA GTT GAG AGT GAT GGG TTT TTT AGT AGA ATA TGG GAT AAA 200 Asp Asp Glu Val Glu Ser Asp Gly Phe Phe Ser Arg He Trp Asp Lys 35 40 45 50
TTC GTT GAA TAT TTC GGC TAT ACT CTA GTT ACT ATA TAATGTTTTT TCCTTA 252 Phe Val Glu Tyr Phe Gly Tyr Thr Leu Val Thr He 55 60
TATAATTGGA CCAGTTATCG CTTTAATTTT TATATTT 289
(2) INFORMATION FOR SEQ ID NO: 36:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 62 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:
Met Phe Arg Asp He Val Asp He Leu He Ser Val Val He He Gly
1 5 10 15
Leu Val Leu Thr Ala He Arg Ala Thr He Met Ala Phe Lys Gly Asp
20 25 30
Thr Asp Asp Asp Glu Val Glu Ser Asp Gly Phe Phe Ser Arg He Trp
35 40 45
Asp Lys Phe Val Glu Tyr Phe Gly Tyr Thr Leu Val Thr He 50 55 60
(2) INFORMATION FOR SEQ ID NO: 37:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1544 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...1491 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: GACACACATT AGTTATAGTT TCTAAGAGAG TTCTCCCCCT ATCTCTTAGA T ATG CCT 57 Met Pro 1
TTT TGT ATT TTT ATT TTA ATA TCT TTG GGA GTT AGG GTT TTG GAA ATT 105 Phe Cys He Phe He Leu He Ser Leu Gly Val Arg Val Leu Glu He 5 10 15
AAG AAA TAT TTT TCT TAC TCT CTA TTT TTT TTG CTT TTT TCT AGT CTC 153 Lys Lys Tyr Phe Ser Tyr Ser Leu Phe Phe Leu Leu Phe Ser Ser Leu 20 25 30
TTT TTA TCC AAA CTT CAA GCT TAT AAA TTC AAC ATG AGC ATT GTT GGA 201 Phe Leu Ser Lys Leu Gin Ala Tyr Lys Phe Asn Met Ser He Val Gly 35 40 45 50
AAG GTG AGC AGC TAT ACC AAG TTT GGC TTT AAC AAC CAA AGA TAC CAG 249 Lys Val Ser Ser Tyr Thr Lys Phe Gly Phe Asn Asn Gin Arg Tyr Gin 55 60 65
CCT TCT AAA GAC ATT TAT CCT ACA GGT AGC TAC ACT TCT TTA CTC GGC 297 Pro Ser Lys Asp He Tyr Pro Thr Gly Ser Tyr Thr Ser Leu Leu Gly 70 75 80
GAA TTG AAT TTG AGC ATG GGT TTA TAC AAG GGT TTG AGA GCG GAA GTG 345 Glu Leu Asn Leu Ser Met Gly Leu Tyr Lys Gly Leu Arg Ala Glu Val 85 90 95
GGG GCT ATG ATG GCA GCG CTC CCC TAT GAC TCT ACC GCC TAT CAA GGC 393 Gly Ala Met Met Ala Ala Leu Pro Tyr Asp Ser Thr Ala Tyr Gin Gly 100 105 110
AAC AAT ATC CCT AAC GGC CAG CCC GGC TCT AGG ACC GAT CCT TTT GGG 441 Asn Asn He Pro Asn Gly Gin Pro Gly Ser Arg Thr Asp Pro Phe Gly 115 120 125 130
GCG GGT ATC TTT TGG CAA TAT ATT GGT TGG TAT GCG GGG CAT AGT GGT 489 Ala Gly He Phe Trp Gin Tyr He Gly Trp Tyr Ala Gly His Ser Gly 135 140 145
TTG CAA GTG CAA AAA CCT CGT TTA GCC ATG GTG CAT AAC GCT TTT TTG 537 Leu Gin Val Gin Lys Pro Arg Leu Ala Met Val His Asn Ala Phe Leu 150 155 160
AGC TAC AAC TAC AAA AAA GAC AAA TTC AGT TTT GGC GTG AAA GGG GGG 585 Ser Tyr Asn Tyr Lys Lys Asp Lys Phe Ser Phe Gly Val Lys Gly Gly 165 170 175
CGC TAT GAC GCT GAA GAG TAT GAT TGG TTC ACT TCT TAC ACT CAA GGG 633 Arg Tyr Asp Ala Glu Glu Tyr Asp Trp Phe Thr Ser Tyr Thr Gin Gly 180 185 190
GTT GAA GGC TTT GTC AAA TAT AAA GAC ACC AGA TTC AGG GTG ATG TAT 681 Val Glu Gly Phe Val Lys Tyr Lys Asp Thr Arg Phe Arg Val Met Tyr 195 200 205 210 TCA GAC GCT AGG GCT TCA GCG TCA AGC GAC TGG TTT TGG TAT TTT GGG 729 Ser Asp Ala Arg Ala Ser Ala Ser Ser Asp Trp Phe Trp Tyr Phe Gly 215 220 225
CGT TAC TAT ACA AGC GGT AAG GCT CTA ATG GTA GCT GAT TTG AAA TAT 777 Arg Tyr Tyr Thr Ser Gly Lys Ala Leu Met Val Ala Asp Leu Lys Tyr 230 235 240
GAA AAA GAC AAC CTA AAA ATC AAC CCT TAT TTT TAT GCG ATC TTT CAA 825 Glu Lys Asp Asn Leu Lys He Asn Pro Tyr Phe Tyr Ala He Phe Gin 245 250 255
AGA ATG TAT GCG CCA GGC ATT AAT ATC ACT TAT GAC ACC AAC CCT AAT 873 Arg Met Tyr Ala Pro Gly He Asn He Thr Tyr Asp Thr Asn Pro Asn 260 265 270
TTC AAC AAT AAG GGT TTT CGT TTT GTA GGC ACT TTC GTA GGG TTT TTC 921 Phe Asn Asn Lys Gly Phe Arg Phe Val Gly Thr Phe Val Gly Phe Phe 275 280 285 290
CCC ATT TTT GCC ACT CCG GCT AAT CAA AAT GAT ATT ATC CTC TTC CAA 969 Pro He Phe Ala Thr Pro Ala Asn Gin Asn Asp He He Leu Phe Gin 295 300 305
CAA GTG CCA TTA GGC AAG AGT GGG CAA ACT TAT TTC TTC CGC ACT CGT 1017 Gin Val Pro Leu Gly Lys Ser Gly Gin Thr Tyr Phe Phe Arg Thr Arg 310 315 320
TTT TAC TAT AAT AAG TGG CAA TTT GGG GGC AGC GTC TAT AAA AAC ATC 1065 Phe Tyr Tyr Asn Lys Trp Gin Phe Gly Gly Ser Val Tyr Lys Asn He 325 330 335
GGT AAC GCT AAT GGT GAT ATA GGT ATT TAT GGC GAC CCT TTG GGG TAT 1113 Gly Asn Ala Asn Gly Asp He Gly He Tyr Gly Asp Pro Leu Gly Tyr 340 345 350
AAC ATT TGG ACG AAT AGT ATT TAT GAC GCA GAA ATT AAC AAT ATT GTT 1161 Asn He Trp Thr Asn Ser He Tyr Asp Ala Glu He Asn Asn He Val 355 360 365 370
GGC GCT GAT GTT ATT AAC GGG TTT TTG TAT GTA GGC TCA CAA TAT AGA 1209 Gly Ala Asp Val He Asn Gly Phe Leu Tyr Val Gly Ser Gin Tyr Arg 375 380 385
GGG TTT AGT TGG AAA ATT TTA GGC CGT TGG ACG GAT AGC CCA AGG GCT 1257 Gly Phe Ser Trp Lys He Leu Gly Arg Trp Thr Asp Ser Pro Arg Ala 390 395 400
GAT GAA AGG AGT CTC GCG CTC TTT TTG AGT TAT TTT TCT AAT AAG TAT 1305 Asp Glu Arg Ser Leu Ala Leu Phe Leu Ser Tyr Phe Ser Asn Lys Tyr 405 410 415
AAT ATT AGA ATG GAT TTA AAA CTA GAA TAT TAT GGC AAT ATC ACC AAA 1353 Asn He Arg Met Asp Leu Lys Leu Glu Tyr Tyr Gly Asn He Thr Lys 420 425 430 AAA GGC TAT TGT ATT GGG TAT TGT GGC ATG TAT GTT CCA GTC GAT CCT 1401 Lys Gly Tyr Cys He Gly Tyr Cys Gly Met Tyr Val Pro Val Asp Pro 435 440 445 450
AAC GGG CCT GGG ACA CAG CCT TTA ACG CAC AAT GTG TAT TCT GAC AGG 1449 Asn Gly Pro Gly Thr Gin Pro Leu Thr His Asn Val Tyr Ser Asp Arg 455 460 465
AGC CAT ATA ATG TTT AAC ATT GCT TAC GGT TTT AGG ATT TAC TAGCATTTT 1500 Ser His He Met Phe Asn He Ala Tyr Gly Phe Arg He Tyr 470 475 480
ATCCTTAATG GATATTTTTG ATTAGCCTTT TTAAAATATT GAAA 1544
(2) INFORMATION FOR SEQ ID NO: 38:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 480 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:
Met Pro Phe Cys He Phe He Leu He Ser Leu Gly Val Arg Val Leu
1 5 10 15
Glu He Lys Lys Tyr Phe Ser Tyr Ser Leu Phe Phe Leu Leu Phe Ser
20 25 30
Ser Leu Phe Leu Ser Lys Leu Gin Ala Tyr Lys Phe Asn Met Ser He
35 40 45
Val Gly Lys Val Ser Ser Tyr Thr Lys Phe Gly Phe Asn Asn Gin Arg
50 55 60
Tyr Gin Pro Ser Lys Asp He Tyr Pro Thr Gly Ser Tyr Thr Ser Leu 65 70 75 80
Leu Gly Glu Leu Asn Leu Ser Met Gly Leu Tyr Lys Gly Leu Arg Ala
85 90 95
Glu Val Gly Ala Met Met Ala Ala Leu Pro Tyr Asp Ser Thr Ala Tyr
100 105 110
Gin Gly Asn Asn He Pro Asn Gly Gin Pro Gly Ser Arg Thr Asp Pro
115 120 125
Phe Gly Ala Gly He Phe Trp Gin Tyr He Gly Trp Tyr Ala Gly His
130 135 140
Ser Gly Leu Gin Val Gin Lys Pro Arg Leu Ala Met Val His Asn Ala 145 150 155 160
Phe Leu Ser Tyr Asn Tyr Lys Lys Asp Lys Phe Ser Phe Gly Val Lys
165 170 175
Gly Gly Arg Tyr Asp Ala Glu Glu Tyr Asp Trp Phe Thr Ser Tyr Thr
180 185 190
Gin Gly Val Glu Gly Phe Val Lys Tyr Lys Asp Thr Arg Phe Arg Val
195 200 205
Met Tyr Ser Asp Ala Arg Ala Ser Ala Ser Ser Asp Trp Phe Trp Tyr 210 215 220 Phe Gly Arg Tyr Tyr Thr Ser Gly Lys Ala Leu Met Val Ala Asp Leu 225 230 235 240
Lys Tyr Glu Lys Asp Asn Leu Lys He Asn Pro Tyr Phe Tyr Ala He
245 250 255
Phe Gin Arg Met Tyr Ala Pro Gly He Asn He Thr Tyr Asp Thr Asn
260 265 270
Pro Asn Phe Asn Asn Lys Gly Phe Arg Phe Val Gly Thr Phe Val Gly
275 280 285
Phe Phe Pro He Phe Ala Thr Pro Ala Asn Gin Asn Asp He He Leu
290 295 300
Phe Gin Gin Val Pro Leu Gly Lys Ser Gly Gin Thr Tyr Phe Phe Arg 305 310 315 320
Thr Arg Phe Tyr Tyr Asn Lys Trp Gin Phe Gly Gly Ser Val Tyr Lys
325 330 335
Asn He Gly Asn Ala Asn Gly Asp He Gly He Tyr Gly Asp Pro Leu
340 345 350
Gly Tyr Asn He Trp Thr Asn Ser He Tyr Asp Ala Glu He Asn Asn
355 360 365
He Val Gly Ala Asp Val He Asn Gly Phe Leu Tyr Val Gly Ser Gin
370 375 380
Tyr Arg Gly Phe Ser Trp Lys He Leu Gly Arg Trp Thr Asp Ser Pro 385 390 395 400
Arg Ala Asp Glu Arg Ser Leu Ala Leu Phe Leu Ser Tyr Phe Ser Asn
405 410 415
Lys Tyr Asn He Arg Met Asp Leu Lys Leu Glu Tyr Tyr Gly Asn He
420 425 430
Thr Lys Lys Gly Tyr Cys He Gly Tyr Cys Gly Met Tyr Val Pro Val
435 440 445
Asp Pro Asn Gly Pro Gly Thr Gin Pro Leu Thr His Asn Val Tyr Ser
450 455 460
Asp Arg Ser His He Met Phe Asn He Ala Tyr Gly Phe Arg He Tyr 465 470 475 480
(2) INFORMATION FOR SEQ ID NO: 39:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 658 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...605 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:
AATTTTAGGT TATTAGTTAC CATTTTATTA TTCTTAAGGA TGTGTTTATA ATG AGA 56
Met Arg ATT AAG GCT TAT TTT TTG CGT TTT ATC GCG CTG GTT TTG ATT GTT TTG 104 He Lys Ala Tyr Phe Leu Arg Phe He Ala Leu Val Leu He Val Leu 5 10 15
TTA GGT TTT AGT GCT TGT AAA AAT TCT CAA AAA TCT CAA GAT TCT CAA 152 Leu Gly Phe Ser Ala Cys Lys Asn Ser Gin Lys Ser Gin Asp Ser Gin 20 25 30
AAC AAT ACC CCC CAA CAA GAT AGC CCT AAA ACC TAC ACC GCT ATG GAT 200 Asn Asn Thr Pro Gin Gin Asp Ser Pro Lys Thr Tyr Thr Ala Met Asp 35 40 45 50
TTG AAT AAC CAA GAA TAC ACC ATC ACA GGC GAT TTA GAT TCT CTC AAT 248 Leu Asn Asn Gin Glu Tyr Thr He Thr Gly Asp Leu Asp Ser Leu Asn 55 60 65
ATC AGC CCG GAT TCC AAC ACC CCT ACC CTA TTA GTT TTA AGC GCT TTA 296 He Ser Pro Asp Ser Asn Thr Pro Thr Leu Leu Val Leu Ser Ala Leu 70 75 80
GAT AAT TCT TTA AAA GAT TAC GCC CCC AGC TTT AAC ATC TTA AAA AAA 344 Asp Asn Ser Leu Lys Asp Tyr Ala Pro Ser Phe Asn He Leu Lys Lys 85 90 95
ACT TTT AAA GAT CGT TTG AGG GTG CTT ATT TTA CTC AAT AAA CCC TAT 392 Thr Phe Lys Asp Arg Leu Arg Val Leu He Leu Leu Asn Lys Pro Tyr 100 105 110
TCA AGC GAT GCA ATC AAA GAC TTT AGC GCG CAT TTT CAA GCT GAT TTG 440 Ser Ser Asp Ala He Lys Asp Phe Ser Ala His Phe Gin Ala Asp Leu 115 120 125 130
ATG ATT TTA AAC CCT AAA GAT ACC GCT CTT TTT GAT CAT TTA AAG TAT 488 Met He Leu Asn Pro Lys Asp Thr Ala Leu Phe Asp His Leu Lys Tyr 135 140 145
GAC GCT TTA AAC CAT TCT TTT AAC ATG CTC TTA TAC CAC AAA CAC CAA 536 Asp Ala Leu Asn His Ser Phe Asn Met Leu Leu Tyr His Lys His Gin 150 155 160
TTG ATC AAA ATG TAT CAA GGG ATC GTG CCA ATA GAA ATG CTC CAA TTT 584 Leu He Lys Met Tyr Gin Gly He Val Pro He Glu Met Leu Gin Phe 165 170 175
GAT ATT TCC AAT TTA AAG GAT TAAAAAAAAC CATGTTTAAT TTTTTCAAAA AAAT 639 Asp He Ser Asn Leu Lys Asp 180 185
TGTCAATAAA ATTAAGGGT 658
(2) INFORMATION FOR SEQ ID NO: 40:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 185 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:
Met Arg He Lys Ala Tyr Phe Leu Arg Phe He Ala Leu Val Leu He
1 5 10 15
Val Leu Leu Gly Phe Ser Ala Cys Lys Asn Ser Gin Lys Ser Gin Asp
20 25 30
Ser Gin Asn Asn Thr Pro Gin Gin Asp Ser Pro Lys Thr Tyr Thr Ala
35 40 45
Met Asp Leu Asn Asn Gin Glu Tyr Thr He Thr Gly Asp Leu Asp Ser
50 55 60
Leu Asn He Ser Pro Asp Ser Asn Thr Pro Thr Leu Leu Val Leu Ser 65 70 75 80
Ala Leu Asp Asn Ser Leu Lys Asp Tyr Ala Pro Ser Phe Asn He Leu
85 90 95
Lys Lys Thr Phe Lys Asp Arg Leu Arg Val Leu He Leu Leu Asn Lys
100 105 110
Pro Tyr Ser Ser Asp Ala He Lys Asp Phe Ser Ala His Phe Gin Ala
115 120 125
Asp Leu Met He Leu Asn Pro Lys Asp Thr Ala Leu Phe Asp His Leu
130 135 140
Lys Tyr Asp Ala Leu Asn His Ser Phe Asn Met Leu Leu Tyr His Lys 145 150 155 160
His Gin Leu He Lys Met Tyr Gin Gly He Val Pro He Glu Met Leu
165 170 175
Gin Phe Asp He Ser Asn Leu Lys Asp 180 185
(2) INFORMATION FOR SEQ ID NO: 41:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 460 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...407 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: AATCCCTTCA AAAATGATAT AATAGACTTG ATGAACTCAT TTTAAGGAAA ATG CCC 56
Met Pro 1
ATG CGT TTG CAC ACT GCC TTT TTT GGT ATT AAT TCA TTG CTT GTT GCC 104 Met Arg Leu His Thr Ala Phe Phe Gly He Asn Ser Leu Leu Val Ala 5 10 15
TCT CTT TTG ATA AGC GGT TGC AGT CTC TTT AAA AAG CGT AAC ACT AAC 152 Ser Leu Leu He Ser Gly Cys Ser Leu Phe Lys Lys Arg Asn Thr Asn 20 25 30
GCC CAG CTA ATC CCC CCT TCA GCT AAT GGC TTG CAA GCC CCC ATT TAT 200 Ala Gin Leu He Pro Pro Ser Ala Asn Gly Leu Gin Ala Pro He Tyr 35 40 45 50
CCC CCA ACC AAT TTC ACC CCT AGA AAG AGC ATT CAG CCT CTC CCA AGC 248 Pro Pro Thr Asn Phe Thr Pro Arg Lys Ser He Gin Pro Leu Pro Ser 55 60 65
CCT CGC CTT GAG AAT AAC GAT CAG CCC GTC ATT AGT TCT AAC CCC ACT 296 Pro Arg Leu Glu Asn Asn Asp Gin Pro Val He Ser Ser Asn Pro Thr 70 75 80
AAC GCT ATC CCT AAC ACC CCC ATT CTC ACG CCT AAT AAT GTC ATT GAA 344 Asn Ala He Pro Asn Thr Pro He Leu Thr Pro Asn Asn Val He Glu 85 90 95
TTG AAC GCA TGG GCA TGG GCG TGG CTC CAG AAT CCA CCA TTT CAC CCT 392 Leu Asn Ala Trp Ala Trp Ala Trp Leu Gin Asn Pro Pro Phe His Pro 100 105 110
CTC AAG CCC TGG CTC TAGCCAAGCG GGCGGCTATC GTTGATGGCT ACCGCCAGTT G 448
Leu Lys Pro Trp Leu
115
GGTGAAAAAA TG 460
(2) INFORMATION FOR SEQ ID NO: 42:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 119 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:
Met Pro Met Arg Leu His Thr Ala Phe Phe Gly He Asn Ser Leu Leu
1 5 10 15
Val Ala Ser Leu Leu He Ser Gly Cys Ser Leu Phe Lys Lys Arg Asn
20 25 30 Thr Asn Ala Gin Leu He Pro Pro Ser Ala Asn Gly Leu Gin Ala Pro
35 40 45
He Tyr Pro Pro Thr Asn Phe Thr Pro Arg Lys Ser He Gin Pro Leu
50 55 60
Pro Ser Pro Arg Leu Glu Asn Asn Asp Gin Pro Val He Ser Ser Asn 65 70 75 80
Pro Thr Asn Ala He Pro Asn Thr Pro He Leu Thr Pro Asn Asn Val
85 90 95
He Glu Leu Asn Ala Trp Ala Trp Ala Trp Leu Gin Asn Pro Pro Phe
100 105 110
His Pro Leu Lys Pro Trp Leu 115
(2) INFORMATION FOR SEQ ID NO:43:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1285 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1232 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:
GTTGTGATTT TATTGTGTTT TCATATCAAT TTTCATATCA AGGAGTTTAA ATG AAA 56
Met Lys
1
GAA ACA AGA CTT TTA AAA TTG AGA GCG TTG AGC TTA GCA TGT TTA ATG 104 Glu Thr Arg Leu Leu Lys Leu Arg Ala Leu Ser Leu Ala Cys Leu Met 5 10 15
GGA TTA GGC GTG AGT GGG TGC GCG TTT TTA GAT AAG CAA ATC TTA AAC 152 Gly Leu Gly Val Ser Gly Cys Ala Phe Leu Asp Lys Gin He Leu Asn 20 25 30
GAC CAT TTG ACT AAA GCT AAA AAT AAC CCA AAA TAC GAT TGC CAA AAA 200 Asp His Leu Thr Lys Ala Lys Asn Asn Pro Lys Tyr Asp Cys Gin Lys 35 40 45 50
GAA ATG TGG TCT TTC CCT AAA AAA TAC GAT GGG ATA AAT CAG TGT TTA 248 Glu Met Trp Ser Phe Pro Lys Lys Tyr Asp Gly He Asn Gin Cys Leu 55 60 65
AAG GCT CAA GAA GAG CTT ATT GAA CCA ATC ATC ACT AAA AAG ATC GAT 296 Lys Ala Gin Glu Glu Leu He Glu Pro He He Thr Lys Lys He Asp 70 75 80
CAG TAT CAA TGC GAT GAT TTC ACT AAT GAA GGC TTA AAA GAT AAG TGT 344 Gin Tyr Gin Cys Asp Asp Phe Thr Asn Glu Gly Leu Lys Asp Lys Cys 85 90 95
TTC AAA AGA AAC GAT GCC TAC TTA AAC ACC CTT TTA ACG CCC ATC ATT 392 Phe Lys Arg Asn Asp Ala Tyr Leu Asn Thr Leu Leu Thr Pro He He 100 105 110
CAA AAA CAA GAG CGT CGT TTT AGC TGC TCT GAT TTC CAT AAC CCA GAG 440 Gin Lys Gin Glu Arg Arg Phe Ser Cys Ser Asp Phe His Asn Pro Glu 115 120 125 130
CTA AAA GAA CAA TGC ATG GAT AAA ACT AAC GCT TAT GAA AAG CAA AAA 488 Leu Lys Glu Gin Cys Met Asp Lys Thr Asn Ala Tyr Glu Lys Gin Lys 135 140 145
GAC CGA CAA AAA AGA CTA ATT AAT CTC GTG CAA TTA GAA GCG TTT GAA 536 Asp Arg Gin Lys Arg Leu He Asn Leu Val Gin Leu Glu Ala Phe Glu 150 155 160
AAA GAA TAC GCG CAA TAT AAA CCA TAC ATT ATC CCT TAC TTC ACC AAA 584 Lys Glu Tyr Ala Gin Tyr Lys Pro Tyr He He Pro Tyr Phe Thr Lys 165 170 175
GAA TGC GTT AAA AAT GCG CCC CAT TTA GCC AAC AAG GAA AGA CTA TGC 632 Glu Cys Val Lys Asn Ala Pro His Leu Ala Asn Lys Glu Arg Leu Cys 180 185 190
CAA AAA GAA GTG CAT GAA AAA TTT GAC GAC CCT TAT TCT AGC TCT AAA 680 Gin Lys Glu Val His Glu Lys Phe Asp Asp Pro Tyr Ser Ser Ser Lys 195 200 205 210
GAA TTG AGC GTT CAA TCG GCT ATT TCT TTT TGC ATT AAA AAA GTT GAT 728 Glu Leu Ser Val Gin Ser Ala He Ser Phe Cys He Lys Lys Val Asp 215 220 225
GCT AAA TTA GAA AAA GCC GCT CTT ATG AAT GGC GTT TAT ATA AGC CCT 776 Ala Lys Leu Glu Lys Ala Ala Leu Met Asn Gly Val Tyr He Ser Pro 230 235 240
TAT AAA AAA TCC ACC CAT TGC CAA AGA ACG CAT TTG GAA AAT AAG AGC 824 Tyr Lys Lys Ser Thr His Cys Gin Arg Thr His Leu Glu Asn Lys Ser 245 250 255
TTG AAA GAA ATC GCT TTA AAT ATG AAC CCT AAA TTA GAA AAG CAA AGC 872 Leu Lys Glu He Ala Leu Asn Met Asn Pro Lys Leu Glu Lys Gin Ser 260 265 270
CCT TTT ATT GAT GCG GAT AAA ATG GCT ATG CAA TCT GCG GGG TTA TTG 920 Pro Phe He Asp Ala Asp Lys Met Ala Met Gin Ser Ala Gly Leu Leu 275 280 285 290
AGA AAG AAT AAA GGT GTC TTG ATT GCT TTT GCT ACA GAT ATT TGC ATG 968 Arg Lys Asn Lys Gly Val Leu He Ala Phe Ala Thr Asp He Cys Met 295 300 305
GAG CGT AAC GAA CAT AAA AAA GAA GAG TTT ATC AGC CTT AAA GAT AGT 1016 Glu Arg Asn Glu His Lys Lys Glu Glu Phe He Ser Leu Lys Asp Ser 310 315 320
TGC ACC CAA TCG CAA GCC AAA ATC TAT AAC AAC AAG GAG CGC TTT GAC 1064 Cys Thr Gin Ser Gin Ala Lys He Tyr Asn Asn Lys Glu Arg Phe Asp 325 330 335
AAA TTC ATA CAA GAT TAC CAA AAA GAC TTA AAA ACT TGT CTT TTA GAC 1112 Lys Phe He Gin Asp Tyr Gin Lys Asp Leu Lys Thr Cys Leu Leu Asp 340 345 350
ACT TCT AAC ACT AAA GAA GAA GTG GAG CAA AAT TTT TCA CAA TGC CAA 1160 Thr Ser Asn Thr Lys Glu Glu Val Glu Gin Asn Phe Ser Gin Cys Gin 355 360 365 370
AAA GAG CAA TTG AGA GAT GAT AAC AAA GGC TTG GGT TTC ACT TTA GAA 1208 Lys Glu Gin Leu Arg Asp Asp Asn Lys Gly Leu Gly Phe Thr Leu Glu 375 380 385
GAA TTG GTT AAA AAA TAC GCT AAG TAAAGTTATT TAATTTTATG GATGGTTTTA 1262 Glu Leu Val Lys Lys Tyr Ala Lys 390
AAAATCCATT CCATAGTTAT TGT 1285
(2) INFORMATION FOR SEQ ID NO: 44:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 394 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:
Met Lys Glu Thr Arg Leu Leu Lys Leu Arg Ala Leu Ser Leu Ala Cys
1 5 10 15
Leu Met Gly Leu Gly Val Ser Gly Cys Ala Phe Leu Asp Lys Gin He
20 25 30
Leu Asn Asp His Leu Thr Lys Ala Lys Asn Asn Pro Lys Tyr Asp Cys
35 40 45
Gin Lys Glu Met Trp Ser Phe Pro Lys Lys Tyr Asp Gly He Asn Gin
50 55 60
Cys Leu Lys Ala Gin Glu Glu Leu He Glu Pro He He Thr Lys Lys 65 70 75 80
He Asp Gin Tyr Gin Cys Asp Asp Phe Thr Asn Glu Gly Leu Lys Asp
85 90 95
Lys Cys Phe Lys Arg Asn Asp Ala Tyr Leu Asn Thr Leu Leu Thr Pro 100 105 110
He He Gin Lys Gin Glu Arg Arg Phe Ser Cys Ser Asp Phe His Asn
115 120 125
Pro Glu Leu Lys Glu Gin Cys Met Asp Lys Thr Asn Ala Tyr Glu Lys
130 135 140
Gin Lys Asp Arg Gin Lys Arg Leu He Asn Leu Val Gin Leu Glu Ala 145 150 155 160
Phe Glu Lys Glu Tyr Ala Gin Tyr Lys Pro Tyr He He Pro Tyr Phe
165 170 175
Thr Lys Glu Cys Val Lys Asn Ala Pro His Leu Ala Asn Lys Glu Arg
180 185 190
Leu Cys Gin Lys Glu Val His Glu Lys Phe Asp Asp Pro Tyr Ser Ser
195 200 205
Ser Lys Glu Leu Ser Val Gin Ser Ala He Ser Phe Cys He Lys Lys
210 215 220
Val Asp Ala Lys Leu Glu Lys Ala Ala Leu Met Asn Gly Val Tyr He 225 230 235 240
Ser Pro Tyr Lys Lys Ser Thr His Cys Gin Arg Thr His Leu Glu Asn
245 250 255
Lys Ser Leu Lys Glu He Ala Leu Asn Met Asn Pro Lys Leu Glu Lys
260 265 270
Gin Ser Pro Phe He Asp Ala Asp Lys Met Ala Met Gin Ser Ala Gly
275 280 285
Leu Leu Arg Lys Asn Lys Gly Val Leu He Ala Phe Ala Thr Asp He
290 295 300
Cys Met Glu Arg Asn Glu His Lys Lys Glu Glu Phe He Ser Leu Lys 305 310 315 320
Asp Ser Cys Thr Gin Ser Gin Ala Lys He Tyr Asn Asn Lys Glu Arg
325 330 335
Phe Asp Lys Phe He Gin Asp Tyr Gin Lys Asp Leu Lys Thr Cys Leu
340 345 350
Leu Asp Thr Ser Asn Thr Lys Glu Glu Val Glu Gin Asn Phe Ser Gin
355 360 365
Cys Gin Lys Glu Gin Leu Arg Asp Asp Asn Lys Gly Leu Gly Phe Thr
370 375 380
Leu Glu Glu Leu Val Lys Lys Tyr Ala Lys 385 390
(2) INFORMATION FOR SEQ ID NO: 45:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 835 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic RNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 84...704 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:
TTGATCATTC TTATTTCGCA CAACCCAAGC ACGCTAAAAT TAGCCACTAA GCATGTGAAA 60 TTAGAGCATG GGCGTTTGAC AGA ATG CTA AGG GTT TTA AGC GTT GGT GTT GCT 113
Met Leu Arg Val Leu Ser Val Gly Val Ala 1 5 10
TTT ATT TTA CTA GGG TGT CAG TTT TTC AAC AAA ACG ACG CTG CAT TTA 161 Phe He Leu Leu Gly Cys Gin Phe Phe Asn Lys Thr Thr Leu His Leu 15 20 25
AAA TAT AAA GAT TAC CCC AAA AAT AGC GCT TTA AAA ACC GCT TTC ACT 209 Lys Tyr Lys Asp Tyr Pro Lys Asn Ser Ala Leu Lys Thr Ala Phe Thr 30 35 40
TTA ACC CCC CCT AAA ATC TTT TTT AAC GCC CGT TTT GTG CCG CCC TTT 257 Leu Thr Pro Pro Lys He Phe Phe Asn Ala Arg Phe Val Pro Pro Phe 45 50 55
TAC CAA AAA GAA TTT AAA AAA GCG ATC ACC CAA CAA ATC GCT TAT TTT 305 Tyr Gin Lys Glu Phe Lys Lys Ala He Thr Gin Gin He Ala Tyr Phe 60 65 70
TTA AAA GAT AAA AGT GCT TTT ATT CTC AAT GTT TCA GGC AAT GTT TTT 353 Leu Lys Asp Lys Ser Ala Phe He Leu Asn Val Ser Gly Asn Val Phe 75 80 85 90
TTT TCT TTT GAA GAG AAT CCT AAA GAT TTA AAA GCC ATT AAA GAA AGG 401 Phe Ser Phe Glu Glu Asn Pro Lys Asp Leu Lys Ala He Lys Glu Arg 95 100 105
CTT AAA AAG ACG ATT GAG CCT AAC GCT GAC CCA AAA GCC GTC ATG CGT 449 Leu Lys Lys Thr He Glu Pro Asn Ala Asp Pro Lys Ala Val Met Arg 110 115 120
TTT TTA AAC CTT CAA GCG AGC TTG ATT TTA GAA TGC GTC CCG CAA ACC 497 Phe Leu Asn Leu Gin Ala Ser Leu He Leu Glu Cys Val Pro Gin Thr 125 130 135
ACT TGC CCG TTT GAC ACC CTT TTA ATC CCC ACC GCT TTC AGC GTG CCT 545 Thr Cys Pro Phe Asp Thr Leu Leu He Pro Thr Ala Phe Ser Val Pro 140 145 150
GTT TAT TAC GCT AAT CGT TTG GGC GAT AAC CCC TCT CTT TTT TCC CAA 593 Val Tyr Tyr Ala Asn Arg Leu Gly Asp Asn Pro Ser Leu Phe Ser Gin 155 160 165 170
GAG GAT AAA ACC TAT CAT AAC GCT TTG ATC AAA GCC CTT AAT AAG GCT 641 Glu Asp Lys Thr Tyr His Asn Ala Leu He Lys Ala Leu Asn Lys Ala 175 180 185
TAC TAT TCT CTT ATG GAG GGT TTA GAA AAG CGT TTG AAC GCT ATA AAA 689 Tyr Tyr Ser Leu Met Glu Gly Leu Glu Lys Arg Leu Asn Ala He Lys 190 195 200 AAT GCA GAG TGG CTT TAAGGCATGA AAAAGATTGC ATTTTTTATT TTTGTCATTT T 745 Asn Ala Glu Trp Leu 205
GTTTTCGGTA GGGATTTATT TAATTTGGCA TGTTTTATTG GAAAAAGCCC TAGAATTGAA 805 ATTAGCAACC TCAGCTAATG ATTTGCTTTT 835
(2) INFORMATION FOR SEQ ID NO: 46:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 207 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS : single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:
Met Leu Arg Val Leu Ser Val Gly Val Ala Phe He Leu Leu Gly Cys
1 5 10 15
Gin Phe Phe Asn Lys Thr Thr Leu His Leu Lys Tyr Lys Asp Tyr Pro
20 25 30
Lys Asn Ser Ala Leu Lys Thr Ala Phe Thr Leu Thr Pro Pro Lys He
35 40 45
Phe Phe Asn Ala Arg Phe Val Pro Pro Phe Tyr Gin Lys Glu Phe Lys
50 55 60
Lys Ala He Thr Gin Gin He Ala Tyr Phe Leu Lys Asp Lys Ser Ala 65 70 75 80
Phe He Leu Asn Val Ser Gly Asn Val Phe Phe Ser Phe Glu Glu Asn
85 90 95
Pro Lys Asp Leu Lys Ala He Lys Glu Arg Leu Lys Lys Thr He Glu
100 105 110
Pro Asn Ala Asp Pro Lys Ala Val Met Arg Phe Leu Asn Leu Gin Ala
115 120 125
Ser Leu He Leu Glu Cys Val Pro Gin Thr Thr Cys Pro Phe Asp Thr
130 135 140
Leu Leu He Pro Thr Ala Phe Ser Val Pro Val Tyr Tyr Ala Asn Arg 145 150 155 160
Leu Gly Asp Asn Pro Ser Leu Phe Ser Gin Glu Asp Lys Thr Tyr His
165 170 175
Asn Ala Leu He Lys Ala Leu Asn Lys Ala Tyr Tyr Ser Leu Met Glu
180 185 190
Gly Leu Glu Lys Arg Leu Asn Ala He Lys Asn Ala Glu Trp Leu 195 200 205
(2) INFORMATION FOR SEQ ID NO: 47:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 763 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...710 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:
AAAAAATAAC CATGAGTTAT TCAAAAATTT AACTTTATAA GACAGGTGGC ATG CGT 56
Met Arg 1
TTA AAA CAT TTT AAA ACT TTC CTT TTT ATC ACA ATG GCA ATC ATT GTA 104 Leu Lys His Phe Lys Thr Phe Leu Phe He Thr Met Ala He He Val 5 10 15
ATA GGT ACC GGT TGC GCG AAC AAA AAG AAA AAA AAA GAC GAA TAC AAC 152 He Gly Thr Gly Cys Ala Asn Lys Lys Lys Lys Lys Asp Glu Tyr Asn 20 25 30
AAA CCG GCG ATC TTT TGG TAT CAA GGG ATT TTG AGA GAA ATC CTT TTT 200 Lys Pro Ala He Phe Trp Tyr Gin Gly He Leu Arg Glu He Leu Phe 35 40 45 50
GCT AAT TTA GAA ACA GCG GAC AAT TAC TAT TCT TCT TTA CAA AGC GAA 248 Ala Asn Leu Glu Thr Ala Asp Asn Tyr Tyr Ser Ser Leu Gin Ser Glu 55 60 65
CAC ATC AAT TCC CCC CTT GTC CCA GAA GCG ATG CTA GCT TTA GGG CAA 296 His He Asn Ser Pro Leu Val Pro Glu Ala Met Leu Ala Leu Gly Gin 70 75 80
GCG CAC ATG AAA AAG AAA GAG TAT GTT TTA GCG TCT TTT TAC TTT GAT 344 Ala His Met Lys Lys Lys Glu Tyr Val Leu Ala Ser Phe Tyr Phe Asp 85 90 95
GAA TAC ATC AAG CGC TTT GGG ACT AAG GAC AAT GTG GAT TAT TTG ACT 392 Glu Tyr He Lys Arg Phe Gly Thr Lys Asp Asn Val Asp Tyr Leu Thr 100 105 110
TTT TTA AAA TTG CAA TCG CAT TAT TAC GCT TTC AAA AAC CAT TCT AAA 440 Phe Leu Lys Leu Gin Ser His Tyr Tyr Ala Phe Lys Asn His Ser Lys 115 120 125 130
GAC CAG GAA TTT ATC TCT AAT TCT ATT GTG AGT TTA GGC GAA TTT ATA 488 Asp Gin Glu Phe He Ser Asn Ser He Val Ser Leu Gly Glu Phe He 135 140 145
GAA AAA TAC CCT AAC AGC CGT TAC CGC CCC TAT GTA GAA TAC ATG CAA 536 Glu Lys Tyr Pro Asn Ser Arg Tyr Arg Pro Tyr Val Glu Tyr Met Gin 150 155 160 ATC AAA TTC ATT TTA GGG CAA AAT GAG CTC AAT CGC GCG ATC GCG AAT 584 He Lys Phe He Leu Gly Gin Asn Glu Leu Asn Arg Ala He Ala Asn 165 170 175
GTC TAT AAA AAA CGC CAC AAG CCT GAG GGC GTG AAA CGC TAT TTA GAA 632 Val Tyr Lys Lys Arg His Lys Pro Glu Gly Val Lys Arg Tyr Leu Glu 180 185 190
AGG ATA GAT GAG ACT TTA GAA AAA GAG ACT AAA CCC AAA CCA TCG CAC 680 Arg He Asp Glu Thr Leu Glu Lys Glu Thr Lys Pro Lys Pro Ser His 195 200 205 210
ATG CCT TGG TAT GTG TTA ATT TTT GAT TGG TAGGATATTT CAAAACCATA CAC 733 Met Pro Trp Tyr Val Leu He Phe Asp Trp 215 220
ATTATAACAG AGAGATGAAA AATGACTGAA 763
(2) INFORMATION FOR SEQ ID NO: 48:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 220 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:
Met Arg Leu Lys His Phe Lys Thr Phe Leu Phe He Thr Met Ala He
1 5 10 15
He Val He Gly Thr Gly Cys Ala Asn Lys Lys Lys Lys Lys Asp Glu
20 25 30
Tyr Asn Lys Pro Ala He Phe Trp Tyr Gin Gly He Leu Arg Glu He
35 40 45
Leu Phe Ala Asn Leu Glu Thr Ala Asp Asn Tyr Tyr Ser Ser Leu Gin
50 55 60
Ser Glu His He Asn Ser Pro Leu Val Pro Glu Ala Met Leu Ala Leu 65 70 75 80
Gly Gin Ala His Met Lys Lys Lys Glu Tyr Val Leu Ala Ser Phe Tyr
85 90 95
Phe Asp Glu Tyr He Lys Arg Phe Gly Thr Lys Asp Asn Val Asp Tyr
100 105 110
Leu Thr Phe Leu Lys Leu Gin Ser His Tyr Tyr Ala Phe Lys Asn His
115 120 125
Ser Lys Asp Gin Glu Phe He Ser Asn Ser He Val Ser Leu Gly Glu
130 135 140
Phe He Glu Lys Tyr Pro Asn Ser Arg Tyr Arg Pro Tyr Val Glu Tyr 145 150 155 160
Met Gin He Lys Phe He Leu Gly Gin Asn Glu Leu Asn Arg Ala He
165 170 175
Ala Asn Val Tyr Lys Lys Arg His Lys Pro Glu Gly Val Lys Arg Tyr 180 185 190 Leu Glu Arg He Asp Glu Thr Leu Glu Lys Glu Thr Lys Pro Lys Pro
195 200 205
Ser His Met Pro Trp Tyr Val Leu He Phe Asp Trp 210 215 220
(2) INFORMATION FOR SEQ ID NO: 49:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 801 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS : single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 75...749 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:
GAAAAAGGCT CTGCTTTGAT AGATAAATTT GACGCTAACC CCTATAAAAC GATTTTTGGA 60 GAAAGGAAAT AATC ATG AGA GCT ACG GCG ATA AAA ATC TTT TCA CTC TCA 110 Met Arg Ala Thr Ala He Lys He Phe Ser Leu Ser 1 5 10
TCA GCA TTA GCC CTA TTG CTT CAT GGT TGC TTG AGC ATC AAT TTA AAA 158 Ser Ala Leu Ala Leu Leu Leu His Gly Cys Leu Ser He Asn Leu Lys 15 20 25
CAA ATG CTA CCA GAG ATC AGA ACT TAC GAT TTG AAT GCG AGT TCT TTT 206 Gin Met Leu Pro Glu He Arg Thr Tyr Asp Leu Asn Ala Ser Ser Phe 30 35 40
GAA ATC ACG CAA TGC GCT AAA CCT TTG ACT GAA GTG AGG CTC ATT AGT 254 Glu He Thr Gin Cys Ala Lys Pro Leu Thr Glu Val Arg Leu He Ser 45 50 55 60
ATT TTG AGC GCG GAT TTA TTC AAC ACT AAA GAG ATC GTT TTT AAA GCC 302 He Leu Ser Ala Asp Leu Phe Asn Thr Lys Glu He Val Phe Lys Ala 65 70 75
AAA GAC GGG CAG ATC ACG CAT GGG AAG CAC CAA AAA TGG ATA GAC TTG 350 Lys Asp Gly Gin He Thr His Gly Lys His Gin Lys Trp He Asp Leu 80 85 90
CCT CGC AAC ATG CTA AAA ACC ATG TTC ATG CAA GAA GCG CAA AAA GCA 398 Pro Arg Asn Met Leu Lys Thr Met Phe Met Gin Glu Ala Gin Lys Ala 95 100 105
TGC TTA GGC GTG GCT TTG CCT CCT TAT GGC GCG GGT GCA CCC ACT TAT 446 Cys Leu Gly Val Ala Leu Pro Pro Tyr Gly Ala Gly Ala Pro Thr Tyr 110 115 120
GCG GTT CGT TTT ACG ATT TTA TCG TTT TCT CTT TTA GAA AAA GAA AAT 494 Ala Val Arg Phe Thr He Leu Ser Phe Ser Leu Leu Glu Lys Glu Asn 125 130 135 140
TCT ACC TAT AGG GCG GAA TTT GCA CTA GGC TAT GAC ATT AGC GTG AAA 542 Ser Thr Tyr Arg Ala Glu Phe Ala Leu Gly Tyr Asp He Ser Val Lys 145 150 155
GGC GAT TCG CAT TCT GGG GTG ATC ATT AAG CAT GAA AAT ATT TCT AGC 590 Gly Asp Ser His Ser Gly Val He He Lys His Glu Asn He Ser Ser 160 165 170
TTG GAA AAT AAA ACG ACC AAA ACG AGT AAA AAT GGC AAT CAA GAT TTT 638 Leu Glu Asn Lys Thr Thr Lys Thr Ser Lys Asn Gly Asn Gin Asp Phe 175 180 185
CAA GAA AGC GCG ATA CAA TCT CTC CAA CAT GTA AGC GTG CAA GCG ATT 686 Gin Glu Ser Ala He Gin Ser Leu Gin His Val Ser Val Gin Ala He 190 195 200
CAA GAA GCG GTT TCT TTG ATT AAA AAA GCC ATT GAA GCG CAA AGC GTA 734 Gin Glu Ala Val Ser Leu He Lys Lys Ala He Glu Ala Gin Ser Val 205 210 215 220
AGC CCG TTA AAA AAA TAAAAAATAA GGAGGAATTG TTTGATTTTA CGATTGGCTG G 790 Ser Pro Leu Lys Lys 225
AGCAAGCGTT T 801
(2) INFORMATION FOR SEQ ID NO: 50:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 225 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:
Met Arg Ala Thr Ala He Lys He Phe Ser Leu Ser Ser Ala Leu Ala
1 5 10 15
Leu Leu Leu His Gly Cys Leu Ser He Asn Leu Lys Gin Met Leu Pro
20 25 30
Glu He Arg Thr Tyr Asp Leu Asn Ala Ser Ser Phe Glu He Thr Gin
35 40 45
Cys Ala Lys Pro Leu Thr Glu Val Arg Leu He Ser He Leu Ser Ala
50 55 60
Asp Leu Phe Asn Thr Lys Glu He Val Phe Lys Ala Lys Asp Gly Gin 65 70 75 80
He Thr His Gly Lys His Gin Lys Trp He Asp Leu Pro Arg Asn Met
85 90 95
Leu Lys Thr Met Phe Met Gin Glu Ala Gin Lys Ala Cys Leu Gly Val
100 105 110
Ala Leu Pro Pro Tyr Gly Ala Gly Ala Pro Thr Tyr Ala Val Arg Phe
115 120 125
Thr He Leu Ser Phe Ser Leu Leu Glu Lys Glu Asn Ser Thr Tyr Arg
130 135 140
Ala Glu Phe Ala Leu Gly Tyr Asp He Ser Val Lys Gly Asp Ser His 145 150 155 160
Ser Gly Val He He Lys His Glu Asn He Ser Ser Leu Glu Asn Lys
165 170 175
Thr Thr Lys Thr Ser Lys Asn Gly Asn Gin Asp Phe Gin Glu Ser Ala
180 185 190
He Gin Ser Leu Gin His Val Ser Val Gin Ala He Gin Glu Ala Val
195 200 205
Ser Leu He Lys Lys Ala He Glu Ala Gin Ser Val Ser Pro Leu Lys
210 215 220
Lys 225
(2) INFORMATION FOR SEQ ID NO: 51:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 448 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...395 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:
TTTTATGATA AGATAGTCAA ATTATACATT GACTTAAGGA AATTTAATTG ATG AAA 56
Met Lys 1
TCT AAA ATC ACT CAT TTT ATC GCT ATC TCT TTT GTT TTA AGC CTG TTT 104 Ser Lys He Thr His Phe He Ala He Ser Phe Val Leu Ser Leu Phe 5 10 15
AGC GCA TGC AAA GAC GAG CCT AAA AAA TCG TCT CAA TCG CAC CAA AAC 152 Ser Ala Cys Lys Asp Glu Pro Lys Lys Ser Ser Gin Ser His Gin Asn 20 25 30
AAC ACT AAA ATC ACT AAA AAC AAT CCA ATC AAT CAA GCG AAT AAT GAT 200 Asn Thr Lys He Thr Lys Asn Asn Pro He Asn Gin Ala Asn Asn Asp 35 40 45 50
ATA AGA AAA ATT GAG CAT GAA GAA GAA GAT GAA AAA GCC ACC AAA GAA 248 He Arg Lys He Glu His Glu Glu Glu Asp Glu Lys Ala Thr Lys Glu 55 60 65
GTG AAC GAT TTG ATC AAT AAC GAA AAT AAA ATT GAT GAA ATC AAT AAT 296 Val Asn Asp Leu He Asn Asn Glu Asn Lys He Asp Glu He Asn Asn 70 75 80
GAA GAA AAC GCT GAT CCT TCG CAA AAA AGA ACG AAC AAC GTT TTG CAA 344 Glu Glu Asn Ala Asp Pro Ser Gin Lys Arg Thr Asn Asn Val Leu Gin 85 90 95
CGA GCC ACT AAC CAC CAA GAC AAT CTC AAT TCC CCA CTC AAC AGG AAG 392 Arg Ala Thr Asn His Gin Asp Asn Leu Asn Ser Pro Leu Asn Arg Lys 100 105 110
TAT TAAAGTGTGA AACTTTTTTC AAAGGATTTA TTTAAAAAAG TAACCCCTTT ATT 448
Tyr
115
(2) INFORMATION FOR SEQ ID NO: 52:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 115 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:
Met Lys Ser Lys He Thr His Phe He Ala He Ser Phe Val Leu Ser
1 5 10 15
Leu Phe Ser Ala Cys Lys Asp Glu Pro Lys Lys Ser Ser Gin Ser His
20 25 30
Gin Asn Asn Thr Lys He Thr Lys Asn Asn Pro He Asn Gin Ala Asn
35 40 45
Asn Asp He Arg Lys He Glu His Glu Glu Glu Asp Glu Lys Ala Thr
50 55 60
Lys Glu Val Asn Asp Leu He Asn Asn Glu Asn Lys He Asp Glu He 65 70 75 80
Asn Asn Glu Glu Asn Ala Asp Pro Ser Gin Lys Arg Thr Asn Asn Val
85 90 95
Leu Gin Arg Ala Thr Asn His Gin Asp Asn Leu Asn Ser Pro Leu Asn
100 105 110
Arg Lys Tyr 115
(2) INFORMATION FOR SEQ ID NO: 53: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1121 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 121...1065 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:
CTATCAAGTC AGTATTTCCA ATATCCAATT AGCCAATGAT CTCAAAGATT CTAATATTTT 60
TATCCACCAG CGTTTAATCA TCCCCACCAA CAAAAAATTA CTCGCTACAA GGGAATTTTA 120
ATG GGT TTG GCG TTG GAA AAA GTT TGT TTT TTA GGC GTT ATT TTT TTG 168 Met Gly Leu Ala Leu Glu Lys Val Cys Phe Leu Gly Val He Phe Leu 1 5 10 15
ATT AGC GCT TGC ACG GTT AAA AAA GAG GGG GTA AAG AAT TTG TCT TAC 216 He Ser Ala Cys Thr Val Lys Lys Glu Gly Val Lys Asn Leu Ser Tyr 20 25 30
AAG CAT GAA AGC TTG CGC GCT TAT GAA AAC GCT AAA GAT TAT GAT CCG 264 Lys His Glu Ser Leu Arg Ala Tyr Glu Asn Ala Lys Asp Tyr Asp Pro 35 40 45
ACA ACC AAA AAA GCC GCC TAT AAA CGC AAT TTT TTT GAA CGC CAT TTC 312 Thr Thr Lys Lys Ala Ala Tyr Lys Arg Asn Phe Phe Glu Arg His Phe 50 55 60
AAA CGC TAC TCC GAT TCG CAA GAT AGC AAC ACA AAA GAT CAG CςA CTA 360 Lys Arg Tyr Ser Asp Ser Gin Asp Ser Asn Thr Lys Asp Gin Pro Leu 65 70 75 80
GAT AAC GGC ATG CGC GAT TCT AGC TCG ATC CAA AGA GCC ACC ATG CGC 408 Asp Asn Gly Met Arg Asp Ser Ser Ser He Gin Arg Ala Thr Met Arg 85 90 95
CCT TAT CAA GTG GGG GGC AAG TGG TAT TAC CCC ACT AAA GTG GAT TTA 456 Pro Tyr Gin Val Gly Gly Lys Trp Tyr Tyr Pro Thr Lys Val Asp Leu 100 105 110
GGC GAA AAA TTT GAT GGC GTT GCG AGT TGG TAT GGC CCT AAC TTC CAT 504 Gly Glu Lys Phe Asp Gly Val Ala Ser Trp Tyr Gly Pro Asn Phe His 115 120 125
GCC AAA AAA ACC AGT AAT GGG GAA ATT TAT AAC ATG TAT GCC CAC ACC 552 Ala Lys Lys Thr Ser Asn Gly Glu He Tyr Asn Met Tyr Ala His Thr 130 135 140 GCC GCG CAC AAA ACT TTA CCC ATG AAC ACC GTG GTG AAA GTC ATC AAT 600 Ala Ala His Lys Thr Leu Pro Met Asn Thr Val Val Lys Val He Asn 145 150 155 160
GTT GAT AAT AAC TTA AGC ACC ATT GTG CGC ATC AAC GAT AGA GGG CCT 648 Val Asp Asn Asn Leu Ser Thr He Val Arg He Asn Asp Arg Gly Pro 165 170 175
TTT GTG AGC GAT CGC ATC ATT GAT TTG TCT AAT GCG GCC GCT AGG GAT 696 Phe Val Ser Asp Arg He He Asp Leu Ser Asn Ala Ala Ala Arg Asp 180 185 190
ATT GAC ATG GTT AAA AAA GGC ACA GCC AGC GTG CGT CTC ATT GTT TTG 744 He Asp Met Val Lys Lys Gly Thr Ala Ser Val Arg Leu He Val Leu 195 200 205
GGC TTT GGT GGG GTT ATC TCC ACG CAA TAC GAA CAA TCC TTT AAC GCC 792 Gly Phe Gly Gly Val He Ser Thr Gin Tyr Glu Gin Ser Phe Asn Ala 210 215 220
AGC TCT TCA AAG ATC TTG CAC AAG GAA TTT AAA GTC GGC GAG AGC GAA 840 Ser Ser Ser Lys He Leu His Lys Glu Phe Lys Val Gly Glu Ser Glu 225 230 235 240
AAA AGC GTG AGC GGA GGG AAA TTT TCT TTG CAA ATG GGG GCT TTT AGA 888 Lys Ser Val Ser Gly Gly Lys Phe Ser Leu Gin Met Gly Ala Phe Arg 245 250 255
AAC CAA ATA GGT GCT CAA ACT TTA GCG GAT AAA TTG CAA GCA GAA AAT 936 Asn Gin He Gly Ala Gin Thr Leu Ala Asp Lys Leu Gin Ala Glu Asn 260 265 270
CCA AAT TAC AGC GTC AAG GTT GCT TTT AAA GAC GAT TTG TAT AAA GTT 984 Pro Asn Tyr Ser Val Lys Val Ala Phe Lys Asp Asp Leu Tyr Lys Val 275 280 285
TTA GTT CAA GGG TTT CAA AGC GAA GAA GAG GCT AGG GAT TTT ATG AAA 1032 Leu Val Gin Gly Phe Gin Ser Glu Glu Glu Ala Arg Asp Phe Met Lys 290 295 300
AAA TAC AAC CAG AAT GCG GTT TTA ACG AGA GAA TGATTAAGTT ATTGCTTTTA 1085 Lys Tyr Asn Gin Asn Ala Val Leu Thr Arg Glu 305 310 315
GATGTGGATG GCACGCTCAC AGACGGATCG TTGTAT 1121
(2) INFORMATION FOR SEQ ID NO: 54:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 315 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:
Met Gly Leu Ala Leu Glu Lys Val Cys Phe Leu Gly Val He Phe Leu
1 5 10 15
He Ser Ala Cys Thr Val Lys Lys Glu Gly Val Lys Asn Leu Ser Tyr
20 25 30
Lys His Glu Ser Leu Arg Ala Tyr Glu Asn Ala Lys Asp Tyr Asp Pro
35 40 45
Thr Thr Lys Lys Ala Ala Tyr Lys Arg Asn Phe Phe Glu Arg His Phe
50 55 60
Lys Arg Tyr Ser Asp Ser Gin Asp Ser Asn Thr Lys Asp Gin Pro Leu 65 70 75 80
Asp Asn Gly Met Arg Asp Ser Ser Ser He Gin Arg Ala Thr Met Arg
85 90 95
Pro Tyr Gin Val Gly Gly Lys Trp Tyr Tyr Pro Thr Lys Val Asp Leu
100 105 110
Gly Glu Lys Phe Asp Gly Val Ala Ser Trp Tyr Gly Pro Asn Phe His
115 120 125
Ala Lys Lys Thr Ser Asn Gly Glu He Tyr Asn Met Tyr Ala His Thr
130 135 140
Ala Ala His Lys Thr Leu Pro Met Asn Thr Val Val Lys Val He Asn 145 150 155 160
Val Asp Asn Asn Leu Ser Thr He Val Arg He Asn Asp Arg Gly Pro
165 170 175
Phe Val Ser Asp Arg He He Asp Leu Ser Asn Ala Ala Ala Arg Asp
180 185 190
He Asp Met Val Lys Lys Gly Thr Ala Ser Val Arg Leu He Val Leu
195 200 205
Gly Phe Gly Gly Val He Ser Thr Gin Tyr Glu Gin Ser Phe Asn Ala
210 215 220
Ser Ser Ser Lys He Leu His Lys Glu Phe Lys Val Gly Glu Ser Glu 225 230 235 240
Lys Ser Val Ser Gly Gly Lys Phe Ser Leu Gin Met Gly Ala Phe Arg
245 250 255
Asn Gin He Gly Ala Gin Thr Leu Ala Asp Lys Leu Gin Ala Glu Asn
260 265 270
Pro Asn Tyr Ser Val Lys Val Ala Phe Lys Asp Asp Leu Tyr Lys Val
275 280 285
Leu Val Gin Gly Phe Gin Ser Glu Glu Glu Ala Arg Asp Phe Met Lys
290 295 300
Lys Tyr Asn Gin Asn Ala Val Leu Thr Arg Glu 305 310 315
(2) INFORMATION FOR SEQ ID NO: 55:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 811 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...761 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55:
TATAATAAGG AAATTCTAAA CGAAAATTAA ACTGAATGAA AGGAGTTTGA ATG AAA 56
Met Lys 1
AAA ATC GTT TTA GTA GCG ATA GCC TTA TTG ATG AGC GCT TGC GCG AGC 104 Lys He Val Leu Val Ala He Ala Leu Leu Met Ser Ala Cys Ala Ser 5 10 15
TAT AAG ATC ACG CCT GAA CAT GTT ACT TCC TAT AAT AAT GGG ATT CAA 152 Tyr Lys He Thr Pro Glu His Val Thr Ser Tyr Asn Asn Gly He Gin 20 25 30
GTG ATG ACT TCC ACG CAA GCC AAA TCT AAA GTC CAG CTA GAA ATC GCT 200 Val Met Thr Ser Thr Gin Ala Lys Ser Lys Val Gin Leu Glu He Ala 35 40 45 50
CAA AGC AAG TTG AAA GGC TTG AAC GAG TCC CCC TTA GTG CTG TAT GTA 248 Gin Ser Lys Leu Lys Gly Leu Asn Glu Ser Pro Leu Val Leu Tyr Val 55 60 65
GCG GCG CAA GTT ATA GAG GGA AGT CCT GTG GTG TTT AGC CGT AAA GCC 296 Ala Ala Gin Val He Glu Gly Ser Pro Val Val Phe Ser Arg Lys Ala 70 75 80
ATT TCA GTG TCT ATC AAC CAA ACG AAT TTA CCG GTC TTA AGC CTG AGA 344 He Ser Val Ser He Asn Gin Thr Asn Leu Pro Val Leu Ser Leu Arg 85 90 95
CAG GTG ATG AAA TCC AGT TTT GAT TTT GAG GGT ATT TTA CAA AGT TTT 392 Gin Val Met Lys Ser Ser Phe Asp Phe Glu Gly He Leu Gin Ser Phe 100 105 110
AAT ATC GCC GTG CCG ACC ACC CCT ATT GAT AAT GTC AAT ATG ATC ACC 440 Asn He Ala Val Pro Thr Thr Pro He Asp Asn Val Asn Met He Thr 115 120 125 130
CCG CCT ATG TTT TAT TAC GGG CAA GGG GGA TTT TTA GCT TAT AAC GGC 488 Pro Pro Met Phe Tyr Tyr Gly Gin Gly Gly Phe Leu Ala Tyr Asn Gly 135 140 145
ATG ATG TAT GGG GGA ATG GGC ATG TAT GGG CCA GGC TTT GGC ATG ATG 536 Met Met Tyr Gly Gly Met Gly Met Tyr Gly Pro Gly Phe Gly Met Met 150 155 160
ATG ATG GAT GAT GTA GAA GAG CAA GAA GTC ATG CAA GAA AGC CGC CAA 584 Met Met Asp Asp Val Glu Glu Gin Glu Val Met Gin Glu Ser Arg Gin 165 170 175
GCT TTA AAA ATC CTA GCG ATC AAT TAC CTT AAA AAC AAC ACC CTT AAT 632 Ala Leu Lys He Leu Ala He Asn Tyr Leu Lys Asn Asn Thr Leu Asn 180 185 190
GTT GAG AGT AAG GCT AAG GGA GGG TTT GTG GTG GTG GAT ACC AAA AAC 680 Val Glu Ser Lys Ala Lys Gly Gly Phe Val Val Val Asp Thr Lys Asn 195 200 205 210
CTT AAA ACC CCG GGT GTG GTG GTG GTT AAA GTC TTT TTA GAA GAT GAA 728 Leu Lys Thr Pro Gly Val Val Val Val Lys Val Phe Leu Glu Asp Glu 215 220 225
ATC CAC ACC TTT AAA ATT GAT ATT TCT AAG ATG TAATCGCCCC CTTTAATAAA 781 He His Thr Phe Lys He Asp He Ser Lys Met 230 235
AGCCTTTGGG CCATCCACCT AAAGGTTTTT 811
(2) INFORMATION FOR SEQ ID NO: 56:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 237 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56:
Met Lys Lys He Val Leu Val Ala He Ala Leu Leu Met Ser Ala Cys
1 5 10 15
Ala Ser Tyr Lys He Thr Pro Glu His Val Thr Ser Tyr Asn Asn Gly
20 25 30
He Gin Val Met Thr Ser Thr Gin Ala Lys Ser Lys Val Gin Leu Glu
35 40 45
He Ala Gin Ser Lys Leu Lys Gly Leu Asn Glu Ser Pro Leu Val Leu
50 55 60
Tyr Val Ala Ala Gin Val He Glu Gly Ser Pro Val Val Phe Ser Arg 65 70 75 80
Lys Ala He Ser Val Ser He Asn Gin Thr Asn Leu Pro Val Leu Ser
85 90 95
Leu Arg Gin Val Met Lys Ser Ser Phe Asp Phe Glu Gly He Leu Gin
100 105 110
Ser Phe Asn He Ala Val Pro Thr Thr Pro He Asp Asn Val Asn Met
115 120 125
He Thr Pro Pro Met Phe Tyr Tyr Gly Gin Gly Gly Phe Leu Ala Tyr
130 135 140
Asn Gly Met Met Tyr Gly Gly Met Gly Met Tyr Gly Pro Gly Phe Gly 145 150 155 160
Met Met Met Met Asp Asp Val Glu Glu Gin Glu Val Met Gin Glu Ser 165 170 175 Arg Gin Ala Leu Lys He Leu Ala He Asn Tyr Leu Lys Asn Asn Thr
180 185 190
Leu Asn Val Glu Ser Lys Ala Lys Gly Gly Phe Val Val Val Asp Thr
195 200 205
Lys Asn Leu Lys Thr Pro Gly Val Val Val Val Lys Val Phe Leu Glu
210 215 220
Asp Glu He His Thr Phe Lys He Asp He Ser Lys Met 225 230 235
(2) INFORMATION FOR SEQ ID NO: 57:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1425 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 97...1371 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:
TAAAAAAAAC TACCCTTAAA AAAATCAATC TAAAATTCTT AAATTAAAAT ATAGCTATAA 60 TACACTAAAA CAATCTCAAG GTTTCAAAAT TTAGCC ATG CGT CTT CTT CTG TTC 114
Met Arg Leu Leu Leu Phe 1 5
AAT CAA AAC GCT TTT TTA TTA GCG TGC ATG TTT GTT TCA AGC GTG TAT 162 Asn Gin Asn Ala Phe Leu Leu Ala Cys Met Phe Val Ser Ser Val Tyr 10 15 20
GTG AAC GCT GTC TTA GAC GCT TAT GCA ATT GAA AAC CCC TAT ATT TCT 210 Val Asn Ala Val Leu Asp Ala Tyr Ala He Glu Asn Pro Tyr He Ser 25 30 35
ATC ACA CTC ACA AGC CTA TTA GCC CCT TTA AGC ATG CTA GCG TTT TTA 258 He Thr Leu Thr Ser Leu Leu Ala Pro Leu Ser Met Leu Ala Phe Leu 40 45 50
AAA ACC CCT AGA AAT AGT GCT TTT GCT TTG GGG TTT TTT GTG GGG GCG 306 Lys Thr Pro Arg Asn Ser Ala Phe Ala Leu Gly Phe Phe Val Gly Ala 55 60 65 70
TTA TTG TTT TAT TGG TGC GCT TTA AGC TTT CGC TAC TCG GAT TTC ACT 354 Leu Leu Phe Tyr Trp Cys Ala Leu Ser Phe Arg Tyr Ser Asp Phe Thr 75 80 85
TAT TTA TTG CCC TTA ATC ATT GTT TTA ATA GCG TTA GTT TAT GGG GTT 402 Tyr Leu Leu Pro Leu He He Val Leu He Ala Leu Val Tyr Gly Val 90 95 100
TTA TTT TAT TTG TTG CTC TAT TTT GAA AAC CCC TAT TTC AGG CTT TTG 450 Leu Phe Tyr Leu Leu Leu Tyr Phe Glu Asn Pro Tyr Phe Arg Leu Leu 105 110 115
AGT TTT TTA GGC TCT AGT TTT ATC CAC CCC TTT GGA TTT GAT TGG TTA 498 Ser Phe Leu Gly Ser Ser Phe He His Pro Phe Gly Phe Asp Trp Leu 120 125 130
GTC CCA GAT AGC TTT TTT TCT TAT AGC GTG TTT AGA GTG GAT AAA TTA 546 Val Pro Asp Ser Phe Phe Ser Tyr Ser Val Phe Arg Val Asp Lys Leu 135 140 145 150
TCG CTA GGG CTT GTT TTT TTG GCT TGC ATT TTT TTG AGC ACT AAA CCA 594 Ser Leu Gly Leu Val Phe Leu Ala Cys He Phe Leu Ser Thr Lys Pro 155 160 165
TTG AAA AAA TAT AGG ATC ATA GGG GTT TTA TTG TTA CTT GGC GCG TTG 642 Leu Lys Lys Tyr Arg He He Gly Val Leu Leu Leu Leu Gly Ala Leu 170 175 180
GAT TTT AAT GGT TTC AAA ACA AGC GAT TTA AAA AAG GTT GGA AAT ATT 690 Asp Phe Asn Gly Phe Lys Thr Ser Asp Leu Lys Lys Val Gly Asn He 185 190 195
GAA TTA GTC TCT ACA AAA ACG CCC CAA GAT TTG AAA TTT GAC TCA AGT 738 Glu Leu Val Ser Thr Lys Thr Pro Gin Asp Leu Lys Phe Asp Ser Ser 200 205 210
TAC CTT AAT GAT ATT GAA AAC AAC ATT CTT AAA GAA ATC AAG CTC GCT 786 Tyr Leu Asn Asp He Glu Asn Asn He Leu Lys Glu He Lys Leu Ala 215 220 225 230
CAA AGC AAG CAA AAA ACC TTG ATT GTT TTT CCA GAA ACC GCC TAC CCC 834 Gin Ser Lys Gin Lys Thr Leu He Val Phe Pro Glu Thr Ala Tyr Pro 235 240 245
ATC GCT TTA GAA AAC TCC CCC TTT AAA GCG AAG CTA GAA GAT TTA AGC 882 He Ala Leu Glu Asn Ser Pro Phe Lys Ala Lys Leu Glu Asp Leu Ser 250 255 260
GAT AAT ATT GCT ATT TTA ATA GGG ACA TTA CGG ACT CAA GGC TAT AAT 930 Asp Asn He Ala He Leu He Gly Thr Leu Arg Thr Gin Gly Tyr Asn 265 270 275
CTT TAT AAC AGC TCG TTT TTA TTT TCT AAA GAA AGC GTT CAG ATC GCT 978 Leu Tyr Asn Ser Ser Phe Leu Phe Ser Lys Glu Ser Val Gin He Ala 280 285 290
GAT AAA GTA ATT TTA GCC CCC TTT GGC GAG ACC ATG CCT TTA CCG GAA 1026 Asp Lys Val He Leu Ala Pro Phe Gly Glu Thr Met Pro Leu Pro Glu 295 300 305 310 TTT CTT CAA AAA CCC CTT GAA AAG CTC TTT TTT GGC GAG AGC ACT TAT 1074 Phe Leu Gin Lys Pro Leu Glu Lys Leu Phe Phe Gly Glu Ser Thr Tyr 315 320 325
TTA TAC CGC AAT GCT CCT CAT TTC AGC GAT TTT ACA TTA GAC GAT TTT 1122 Leu Tyr Arg Asn Ala Pro His Phe Ser Asp Phe Thr Leu Asp Asp Phe 330 335 340
ACT TTT CGC CCC CTG ATT TGC TAT GAA GGC ACT TCC AAA CCC GCT TAT 1170 Thr Phe Arg Pro Leu He Cys Tyr Glu Gly Thr Ser Lys Pro Ala Tyr 345 350 355
TCA AAC AGC CCT TCA AAA ATT TTT ATC GTG ATG AGC AAT AAC GCA TGG 1218 Ser Asn Ser Pro Ser Lys He Phe He Val Met Ser Asn Asn Ala Trp 360 365 370
TTT AGC CCA AGC ATT GAA CCC ACC TTA CAA AGA ACG CTT TTA AAA TAC 1266 Phe Ser Pro Ser He Glu Pro Thr Leu Gin Arg Thr Leu Leu Lys Tyr 375 380 385 390
TAC GCA AGG CGT TAT GAT AAG ATC ATC TTG CAC AGC GCG AAC TTT TCA 1314 Tyr Ala Arg Arg Tyr Asp Lys He He Leu His Ser Ala Asn Phe Ser 395 400 405
ACT TCT TAC ATC TTA AGC CCT AGT TTA TTA GGC GAT ATT CTT TTT AGG 1362 Thr Ser Tyr He Leu Ser Pro Ser Leu Leu Gly Asp He Leu Phe Arg 410 415 420
AAA CGA TCA TGATTAAAGC GATTAATATT TCTCATGCTT TTGAAAAGCC TCTTTATAA 1420 Lys Arg Ser 425
TGGCG 1425
(2) INFORMATION FOR SEQ ID NO: 58:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 425 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:
Met Arg Leu Leu Leu Phe Asn Gin Asn Ala Phe Leu Leu Ala Cys Met
1 5 10 15
Phe Val Ser Ser Val Tyr Val Asn Ala Val Leu Asp Ala Tyr Ala He
20 25 30
Glu Asn Pro Tyr He Ser He Thr Leu Thr Ser Leu Leu Ala Pro Leu
35 40 45
Ser Met Leu Ala Phe Leu Lys Thr Pro Arg Asn Ser Ala Phe Ala Leu 50 55 60 Gly Phe Phe Val Gly Ala Leu Leu Phe Tyr Trp Cys Ala Leu Ser Phe 65 70 75 80
Arg Tyr Ser Asp Phe Thr Tyr Leu Leu Pro Leu He He Val Leu He
85 90 95
Ala Leu Val Tyr Gly Val Leu Phe Tyr Leu Leu Leu Tyr Phe Glu Asn
100 105 110
Pro Tyr Phe Arg Leu Leu Ser Phe Leu Gly Ser Ser Phe He His Pro
115 120 125
Phe Gly Phe Asp Trp Leu Val Pro Asp Ser Phe Phe Ser Tyr Ser Val
130 135 140
Phe Arg Val Asp Lys Leu Ser Leu Gly Leu Val Phe Leu Ala Cys He 145 150 155 160
Phe Leu Ser Thr Lys Pro Leu Lys Lys Tyr Arg He He Gly Val Leu
165 170 175
Leu Leu Leu Gly Ala Leu Asp Phe Asn Gly Phe Lys Thr Ser Asp Leu
180 185 190
Lys Lys Val Gly Asn He Glu Leu Val Ser Thr Lys Thr Pro Gin Asp
195 200 205
Leu Lys Phe Asp Ser Ser Tyr Leu Asn Asp He Glu Asn Asn He Leu
210 215 220
Lys Glu He Lys Leu Ala Gin Ser Lys Gin Lys Thr Leu He Val Phe 225 230 235 240
Pro Glu Thr Ala Tyr Pro He Ala Leu Glu Asn Ser Pro Phe Lys Ala
245 250 255
Lys Leu Glu Asp Leu Ser Asp Asn He Ala He Leu He Gly Thr Leu
260 265 270
Arg Thr Gin Gly Tyr Asn Leu Tyr Asn Ser Ser Phe Leu Phe Ser Lys
275 280 285
Glu Ser Val Gin He Ala Asp Lys Val He Leu Ala Pro Phe Gly Glu
290 295 300
Thr Met Pro Leu Pro Glu Phe Leu Gin Lys Pro Leu Glu Lys Leu Phe 305 310 315 320
Phe Gly Glu Ser Thr Tyr Leu Tyr Arg Asn Ala Pro His Phe Ser Asp
325 330 335
Phe Thr Leu Asp Asp Phe Thr Phe Arg Pro Leu He Cys Tyr Glu Gly
340 345 350
Thr Ser Lys Pro Ala Tyr Ser Asn Ser Pro Ser Lys He Phe He Val
355 360 365
Met Ser Asn Asn Ala Trp Phe Ser Pro Ser He Glu Pro Thr Leu Gin
370 375 380
Arg Thr Leu Leu Lys Tyr Tyr Ala Arg Arg Tyr Asp Lys He He Leu 385 390 395 400
His Ser Ala Asn Phe Ser Thr Ser Tyr He Leu Ser Pro Ser Leu Leu
405 410 415
Gly Asp He Leu Phe Arg Lys Arg Ser 420 425
(2) INFORMATION FOR SEQ ID NO: 59:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 766 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...713 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:
TGTAGTAAGA TACCTAGTTT TCAAATCTAT TAAAAGATAA AGGTTATTAC ATG TTT 56
Met Phe
1
TCA CTT TCT TAT GTT TCC AAG AAA TTT TTA AGC GTT TTA TTA TTG ATT 104 Ser Leu Ser Tyr Val Ser Lys Lys Phe Leu Ser Val Leu Leu Leu He 5 10 15
TCG CTG TTT TTA AGC GCT TGC AAA TCC AAC AAT AAA GAC AAG TTA GAC 152 Ser Leu Phe Leu Ser Ala Cys Lys Ser Asn Asn Lys Asp Lys Leu Asp 20 25 30
GAA AAT CTT TTA AGC TCT GGC TCT CAA AGC TCC AAA GAA TTA AAC GAT 200 Glu Asn Leu Leu Ser Ser Gly Ser Gin Ser Ser Lys Glu Leu Asn Asp 35 40 45 50
GAG CGA GAC AAT ATA GAC AAA AAG AGT TAC GCT GGT TTA GAA GAT GTT 248 Glu Arg Asp Asn He Asp Lys Lys Ser Tyr Ala Gly Leu Glu Asp Val 55 60 65
TTT TCA GAC AAT AAG TCC ATT AGT CCT AAC GAT AAA TAC ATG CTT TTA 296 Phe Ser Asp Asn Lys Ser He Ser Pro Asn Asp Lys Tyr Met Leu Leu 70 75 80
GTT TTT GGC CGT AAT GGT TGC TCC TAT TGC GAA AGG TTT AAA AAA GAT 344 Val Phe Gly Arg Asn Gly Cys Ser Tyr Cys Glu Arg Phe Lys Lys Asp 85 90 95
CTC AAA AAT GTC AAA GAA TTG CGC GAC TAC ATT AAA GAG CAT TTT AGC 392 Leu Lys Asn Val Lys Glu Leu Arg Asp Tyr He Lys Glu His Phe Ser 100 105 110
GCT TAC TAT GTC AAT ATC AGC TAC TCC AAA GAG CAT GAT TTT AAA GTC 440 Ala Tyr Tyr Val Asn He Ser Tyr Ser Lys Glu His Asp Phe Lys Val 115 120 125 130
GGC GAT AAA AAT AAT GAA AAA GAA ATC AAA ATG TCC ACA GAA GAA TTA 488 Gly Asp Lys Asn Asn Glu Lys Glu He Lys Met Ser Thr Glu Glu Leu 135 140 145
GCG CAA ATT TAT GCC GTC CAA TCC ACC CCT ACG ATT GTT TTA TCC GAT 536 Ala Gin He Tyr Ala Val Gin Ser Thr Pro Thr He Val Leu Ser Asp 150 155 160 AAA ACC GGC AAA ACC ATC TAT GAA TTG CCC GGC TAT ATG CCC TCT ACG 584 Lys Thr Gly Lys Thr He Tyr Glu Leu Pro Gly Tyr Met Pro Ser Thr 165 170 175
CAA TTT TTA GCC GTG TTA GAA TTT ATC GGC GAT GGG AAG TAT CAA GAC 632 Gin Phe Leu Ala Val Leu Glu Phe He Gly Asp Gly Lys Tyr Gin Asp 180 185 190
ACA AAA GAC GAT GAG GAT CTC ACT AAA AAA TTA AAG GCT TAC ATC AAG 680 Thr Lys Asp Asp Glu Asp Leu Thr Lys Lys Leu Lys Ala Tyr He Lys 195 200 205 210
TAT AAA ACC AAC CTT TCT AAA AGC AAG TCT AAC TAGGAAAGCC TAATGAAGAA 733 Tyr Lys Thr Asn Leu Ser Lys Ser Lys Ser Asn 215 220
TCTCAAAAGC CTGCTTTCTT TTTTGCTGGC TTC 766
(2) INFORMATION FOR SEQ ID NO: 60:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 221 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:
Met Phe Ser Leu Ser Tyr Val Ser Lys Lys Phe Leu Ser Val Leu Leu
1 5 10 15
Leu He Ser Leu Phe Leu Ser Ala Cys Lys Ser Asn Asn Lys Asp Lys
20 25 30
Leu Asp Glu Asn Leu Leu Ser Ser Gly Ser Gin Ser Ser Lys Glu Leu
35 40 45
Asn Asp Glu Arg Asp Asn He Asp Lys Lys Ser Tyr Ala Gly Leu Glu
50 55 60
Asp Val Phe Ser Asp Asn Lys Ser He Ser Pro Asn Asp Lys Tyr Met 65 70 75 80
Leu Leu Val Phe Gly Arg Asn Gly Cys Ser Tyr Cys Glu Arg Phe Lys
85 90 95
Lys Asp Leu Lys Asn Val Lys Glu Leu Arg Asp Tyr He Lys Glu His
100 105 110
Phe Ser Ala Tyr Tyr Val Asn He Ser Tyr Ser Lys Glu His Asp Phe
115 120 125
Lys Val Gly Asp Lys Asn Asn Glu Lys Glu He Lys Met Ser Thr Glu
130 135 140
Glu Leu Ala Gin He Tyr Ala Val Gin Ser Thr Pro Thr He Val Leu 145 150 155 160
Ser Asp Lys Thr Gly Lys Thr He Tyr Glu Leu Pro Gly Tyr Met Pro
165 170 175
Ser Thr Gin Phe Leu Ala Val Leu Glu Phe He Gly Asp Gly Lys Tyr 180 185 190 Gin Asp Thr Lys Asp Asp Glu Asp Leu Thr Lys Lys Leu Lys Ala Tyr
195 200 205
He Lys Tyr Lys Thr Asn Leu Ser Lys Ser Lys Ser Asn 210 215 220
(2) INFORMATION FOR SEQ ID NO: 61:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 980 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 53...931 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:
TGAATGAAAT CCTAGATTCT AACGCCATTG TATATTATCT CGCTAAAAAT TC ATG AGA 58
Met Arg 1
TTG TTA TTC TTG TTA TTG AGT GCT GCT TTT ATG TTA CTG GCT GAA GAA 106 Leu Leu Phe Leu Leu Leu Ser Ala Ala Phe Met Leu Leu Ala Glu Glu 5 10 15
AAA ATA TCT TTA AAC GAT GAC GCC CCC ATT AAA CTA GTG CAT TGG CAA 154 Lys He Ser Leu Asn Asp Asp Ala Pro He Lys Leu Val His Trp Gin 20 25 30
AAT GCA TTA AAA GAA GTC CAA CCT GAT TCA AAC GCT CCA GCA ACA CCA 202 Asn Ala Leu Lys Glu Val Gin Pro Asp Ser Asn Ala Pro Ala Thr Pro 35 40 45 50
CCT ATA AAA GCC GTG CAA ACC ACG CTC ACT TTT GAA ACG CCT TTT AAC 250 Pro He Lys Ala Val Gin Thr Thr Leu Thr Phe Glu Thr Pro Phe Asn 55 60 65
AAA ACG CCT AAA ATC ATG GAA GTT GAA GGG CAA AAG GTG ATC GTC TTA 298 Lys Thr Pro Lys He Met Glu Val Glu Gly Gin Lys Val He Val Leu 70 75 80
AAA AAC GCT AAA CTG GAT TCT AAA AAA ACC ATG GAT TTT AAA GAA GCC 346 Lys Asn Ala Lys Leu Asp Ser Lys Lys Thr Met Asp Phe Lys Glu Ala 85 90 95
TCT TTG AAT GCT TTA GAA ATG TTT TCC TAC CAA AAT GAC ATC TAC CTC 394 Ser Leu Asn Ala Leu Glu Met Phe Ser Tyr Gin Asn Asp He Tyr Leu 100 105 no TTG TCT AAA AAA GCT AAA GTG GAA TTA GAA ATC CAA GCT TCA AAC AGC 442 Leu Ser Lys Lys Ala Lys Val Glu Leu Glu He Gin Ala Ser Asn Ser 115 120 125 130
AAG GAT AAA AAA CGG CTC CGC TTT CTC TTT TTA CCC AAA GGT TTT CAT 490 Lys Asp Lys Lys Arg Leu Arg Phe Leu Phe Leu Pro Lys Gly Phe His 135 140 145
TTA GCC CCA CCG CCT AAC CTG AAA GAA AAA TCT CAG CAA ACT AAC CTT 538 Leu Ala Pro Pro Pro Asn Leu Lys Glu Lys Ser Gin Gin Thr Asn Leu 150 155 160
GCA CAA AAA GAC ACC AAC GAG CAA CCC CAA AGC CCT TTA AAC ACT CTA 586 Ala Gin Lys Asp Thr Asn Glu Gin Pro Gin Ser Pro Leu Asn Thr Leu 165 170 175
GAG TTA AAA CCC CCA CTA AAT TTA AGC CAT GCT TAT AAG GCG CTA GCG 634 Glu Leu Lys Pro Pro Leu Asn Leu Ser His Ala Tyr Lys Ala Leu Ala 180 185 190
GTT ATT GCT GCC TTA CTC TTA ATA TTG TAT GTC ATC AAA AAA AAA ATT 682 Val He Ala Ala Leu Leu Leu He Leu Tyr Val He Lys Lys Lys He 195 200 205 210
GTT CCC ACA CAA GGG TCT TTT TCT GCA AAA GAT TTT AAG TTA GAA ATT 730 Val Pro Thr Gin Gly Ser Phe Ser Ala Lys Asp Phe Lys Leu Glu He 215 220 225
AGC GTT TTG GGT CGT GTT GAT GCG AAC CAT AAA ATC ATT TCA ATA GAA 778 Ser Val Leu Gly Arg Val Asp Ala Asn His Lys He He Ser He Glu 230 235 240
ACC AAT AAG GAG CGT TAC TTG GTC TTA CTA AGC GAT AAA TAC GGC CTG 826 Thr Asn Lys Glu Arg Tyr Leu Val Leu Leu Ser Asp Lys Tyr Gly Leu 245 250 255
CTT TTA GAC AAA ATA AGC CCA AAA ACA TCT AAA GAA GAA CTG ATT AAA 874 Leu Leu Asp Lys He Ser Pro Lys Thr Ser Lys Glu Glu Leu He Lys 260 265 270
GAA GCT GAA AAT AAT ATA AAG AAT TCA AAA TTA GGA AAT TTA TAT GCC 922 Glu Ala Glu Asn Asn He Lys Asn Ser Lys Leu Gly Asn Leu Tyr Ala 275 280 285 290
GGA AAA TTC TAAACTACAA CCTGCTAAGT TAGGGAAAAA TTTTGACCCT GTGGATCAT 980 Gly Lys Phe
(2) INFORMATION FOR SEQ ID NO : 62
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 293 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:
Met Arg Leu Leu Phe Leu Leu Leu Ser Ala Ala Phe Met Leu Leu Ala
1 5 10 15
Glu Glu Lys He Ser Leu Asn Asp Asp Ala Pro He Lys Leu Val His
20 25 30
Trp Gin Asn Ala Leu Lys Glu Val Gin Pro Asp Ser Asn Ala Pro Ala
35 40 45
Thr Pro Pro He Lys Ala Val Gin Thr Thr Leu Thr Phe Glu Thr Pro
50 55 60
Phe Asn Lys Thr Pro Lys He Met Glu Val Glu Gly Gin Lys Val He 65 70 75 80
Val Leu Lys Asn Ala Lys Leu Asp Ser Lys Lys Thr Met Asp Phe Lys
85 90 95
Glu Ala Ser Leu Asn Ala Leu Glu Met Phe Ser Tyr Gin Asn Asp He
100 105 110
Tyr Leu Leu Ser Lys Lys Ala Lys Val Glu Leu Glu He Gin Ala Ser
115 120 125
Asn Ser Lys Asp Lys Lys Arg Leu Arg Phe Leu Phe Leu Pro Lys Gly
130 135 140
Phe His Leu Ala Pro Pro Pro Asn Leu Lys Glu Lys Ser Gin Gin Thr 145 150 155 160
Asn Leu Ala Gin Lys Asp Thr Asn Glu Gin Pro Gin Ser Pro Leu Asn
165 170 175
Thr Leu Glu Leu Lys Pro Pro Leu Asn Leu Ser His Ala Tyr Lys Ala
180 185 190
Leu Ala Val He Ala Ala Leu Leu Leu He Leu Tyr Val He Lys Lys
195 200 205
Lys He Val Pro Thr Gin Gly Ser Phe Ser Ala Lys Asp Phe Lys Leu
210 215 220
Glu He Ser Val Leu Gly Arg Val Asp Ala Asn His Lys He He Ser 225 230 235 240
He Glu Thr Asn Lys Glu Arg Tyr Leu Val Leu Leu Ser Asp Lys Tyr
245 250 255
Gly Leu Leu Leu Asp Lys He Ser Pro Lys Thr Ser Lys Glu Glu Leu
260 265 270
He Lys Glu Ala Glu Asn Asn He Lys Asn Ser Lys Leu Gly Asn Leu
275 280 285
Tyr Ala Gly Lys Phe 290
(2) INFORMATION FOR SEQ ID NO: 63:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 620 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 70...567 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63:
CTAGCGCGAT CTTTGGTCTC ACACAAGCTA TAACAAGCTT GAGAATCGCA TAATATATTC 60 TTGTTCTAC ATG CTA TCA CCA GCA ACT TTC AAA CAA ATA ACT CTA GCA TTA 111 Met Leu Ser Pro Ala Thr Phe Lys Gin He Thr Leu Ala Leu 1 5 10
ATC GCT TCA AGA CTA ATC GTT GTA ATC CTA TAT GCT TTT ATC TTT ATT 159 He Ala Ser Arg Leu He Val Val He Leu Tyr Ala Phe He Phe He 15 20 25 30
GTT CTC TCT TTT TAT ATG CTC AAT ATC ATC ACT ATT CTT AAT TTT AAA 207 Val Leu Ser Phe Tyr Met Leu Asn He He Thr He Leu Asn Phe Lys 35 40 45
GCG CTT ATT TTG GGG TTT GTT AGT GTT TTT TCA AGC GCA TTG TTT TGT 255 Ala Leu He Leu Gly Phe Val Ser Val Phe Ser Ser Ala Leu Phe Cys 50 55 60
TTT TGC TTG GCA ATT TTT GTA GCT AGA ATT TTT CAA AAC GAA CAA AGC 303 Phe Cys Leu Ala He Phe Val Ala Arg He Phe Gin Asn Glu Gin Ser 65 70 75
ATC TTA GGA TTT TGT AAT ATC ATC AAT CTC TAT GCG CTA ATG TCT TGT 351 He Leu Gly Phe Cys Asn He He Asn Leu Tyr Ala Leu Met Ser Cys 80 85 90
AAT GTT TTT GTT CCT TTA GAA TAC CTA CCT AGT ATT GGT CAA TTA TTT 399 Asn Val Phe Val Pro Leu Glu Tyr Leu Pro Ser He Gly Gin Leu Phe 95 100 105 110
ATC AAA ACA TCT ATT TTT TAC TAC CTT AAT CAA CTT CTA ATC AAA GCT 447 He Lys Thr Ser He Phe Tyr Tyr Leu Asn Gin Leu Leu He Lys Ala 115 120 125
TTT CAA GGG ATT GAT ACT ATA CTG GTT TTA GCA ACT TCA ACA TTT TTC 495 Phe Gin Gly He Asp Thr He Leu Val Leu Ala Thr Ser Thr Phe Phe 130 135 140
ATT ATT GGT GGC ATT ATT TTA TTT TTA CTA AGC GCT AAT CGC ATG TTA 543 He He Gly Gly He He Leu Phe Leu Leu Ser Ala Asn Arg Met Leu 145 150 155
CTA ACA CCA AAA GAA CGC ATG CGT TAAAGGCTTA GTCCCACCAT TGATTTATTT 597 Leu Thr Pro Lys Glu Arg Met Arg 160 165
AATGGCTCAA AAAAGGGGTA AGC 620 (2) INFORMATION FOR SEQ ID NO: 64:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 166 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:
Met Leu Ser Pro Ala Thr Phe Lys Gin He Thr Leu Ala Leu He Ala
1 5 10 15
Ser Arg Leu He Val Val He Leu Tyr Ala Phe He Phe He Val Leu
20 25 30
Ser Phe Tyr Met Leu Asn He He Thr He Leu Asn Phe Lys Ala Leu
35 40 45
He Leu Gly Phe Val Ser Val Phe Ser Ser Ala Leu Phe Cys Phe Cys
50 55 60
Leu Ala He Phe Val Ala Arg He Phe Gin Asn Glu Gin Ser He Leu 65 70 75 80
Gly Phe Cys Asn He He Asn Leu Tyr Ala Leu Met Ser Cys Asn Val
85 90 95
Phe Val Pro Leu Glu Tyr Leu Pro Ser He Gly Gin Leu Phe He Lys
100 105 110
Thr Ser He Phe Tyr Tyr Leu Asn Gin Leu Leu He Lys Ala Phe Gin
115 120 125
Gly He Asp Thr He Leu Val Leu Ala Thr Ser Thr Phe Phe He He
130 135 140
Gly Gly He He Leu Phe Leu Leu Ser Ala Asn Arg Met Leu Leu Thr 145 150 155 160
Pro Lys Glu Arg Met Arg 165
(2) INFORMATION FOR SEQ ID NO: 65:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1405 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 50...1366 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: ATTGATTTCT TTTGGGAAGT GGATTTGAAA AACTTAAAAA GCCATTAAC ATG CAA GTT 58
Met Gin Val 1
AAA GAA AAC AAA CAA CTC TGC TTA ATT TCA TTA GGT TGC TCT AAA AAT 106 Lys Glu Asn Lys Gin Leu Cys Leu He Ser Leu Gly Cys Ser Lys Asn 5 10 15
TTG GTG GAT TCA GAG GTG ATG TTA GGC AAG CTT TAT AAT TAC ACG CTC 154 Leu Val Asp Ser Glu Val Met Leu Gly Lys Leu Tyr Asn Tyr Thr Leu 20 25 30 35
ACT AAT GAC GCT AAG AGC GCT GAT GTG ATT TTG ATC AAC ACT TGC GGG 202 Thr Asn Asp Ala Lys Ser Ala Asp Val He Leu He Asn Thr Cys Gly 40 45 50
TTT ATT GAA AGC GCT AAA CAA GAG AGT ATC CAA ACC ATT CTC AAC GCC 250 Phe He Glu Ser Ala Lys Gin Glu Ser He Gin Thr He Leu Asn Ala 55 60 65
GCC AAA GAC AAA AAA GAG GGA GCG ATT TTG ATT GCG AGC GGG TGC TTG 298 Ala Lys Asp Lys Lys Glu Gly Ala He Leu He Ala Ser Gly Cys Leu 70 75 80
AGC GAA CGC TAT AAA GAT GAA ATC AAA GAA TTG ATC CCT GAA GTG GAT 346 Ser Glu Arg Tyr Lys Asp Glu He Lys Glu Leu He Pro Glu Val Asp 85 90 95
ATT TTT ACC GGC GTG GGG GAT TAT GAC AAG ATC GAT ATA ATG ATT GCT 394 He Phe Thr Gly Val Gly Asp Tyr Asp Lys He Asp He Met He Ala 100 105 110 115
AAA AAA CAA AAC CAG TTC AGC GAG CAA GTG TTT TTA AGC GAG CAT TAC 442 Lys Lys Gin Asn Gin Phe Ser Glu Gin Val Phe Leu Ser Glu His Tyr 120 125 130
AAC GCA CGC ATC ATC ACG GGA TCG AGC GTG CAT GCG TAT GTG AAA ATT 490 Asn Ala Arg He He Thr Gly Ser Ser Val His Ala Tyr Val Lys He 135 140 145
TCT GAG GGT TGC AAT CAA AAA TGT TCT TTT TGC GCT ATC CCT AGC TTT 538 Ser Glu Gly Cys Asn Gin Lys Cys Ser Phe Cys Ala He Pro Ser Phe 150 155 160
AAG GGG AAA TTG CAA AGC AGG GAA TTG GAC TCC ATT TTA AAA GAA GTG 586 Lys Gly Lys Leu Gin Ser Arg Glu Leu Asp Ser He Leu Lys Glu Val 165 170 175
GAA AAT CTC GCG CTT AAA GGC TAT ACG GAT ATG ACT TTT ATC GCT CAA 634 Glu Asn Leu Ala Leu Lys Gly Tyr Thr Asp Met Thr Phe He Ala Gin 180 185 190 195
GAC TCT AGC TCC TTT TTA TAC GAT AAG GGG CAA AAA GAC GGC TTG ATC 682 Asp Ser Ser Ser Phe Leu Tyr Asp Lys Gly Gin Lys Asp Gly Leu He 200 205 210 CAG CTC ATT AGA GCG ATT GAT AAA CAG CAA GCC TTA AAG AGC GCG CGT 730 Gin Leu He Arg Ala He Asp Lys Gin Gin Ala Leu Lys Ser Ala Arg 215 220 225
ATT TTA TAT CTC TAC CCC TCT AGC ACC ACG CTA GAG CTT ATT GGC GCG 778 He Leu Tyr Leu Tyr Pro Ser Ser Thr Thr Leu Glu Leu He Gly Ala 230 235 240
ATT GAA AGT TCG CCC ATT TTT CAA AAT TAT TTT GAC ATG CCC ATC CAG 826 He Glu Ser Ser Pro He Phe Gin Asn Tyr Phe Asp Met Pro He Gin 245 250 255
CAC ATC AGC GAC TCC ATG CTC AAA AAG ATG CGG CGC AAC TCT AGC CAA 874 His He Ser Asp Ser Met Leu Lys Lys Met Arg Arg Asn Ser Ser Gin 260 265 270 275
GCG CAC CAT TTA AAG CTT TTA GAT GCC ATG AAG CAG GTT AAA GAA AGC 922 Ala His His Leu Lys Leu Leu Asp Ala Met Lys Gin Val Lys Glu Ser 280 285 290
TTT ATC AGA AGC ACG ATC ATT GTA GGG CAT CCG GAA GAA AAT GAG AGC 970 Phe He Arg Ser Thr He He Val Gly His Pro Glu Glu Asn Glu Ser 295 300 305
GAA TTT GAA GAA TTG AGC GCG TTT TTA GAC GAG TTC CAG TTT GAT AGA 1018 Glu Phe Glu Glu Leu Ser Ala Phe Leu Asp Glu Phe Gin Phe Asp Arg 310 315 320
TTG AAT ATT TTT GCT TTC AGC GCT GAA GAA AAC ACG CAT GCC TAT TCT 1066 Leu Asn He Phe Ala Phe Ser Ala Glu Glu Asn Thr His Ala Tyr Ser 325 330 335
TTA GAA AAA GTG CCT AAA AAA ACC ATC AAC GCT CGC ATC AAA GCC TTG 1114 Leu Glu Lys Val Pro Lys Lys Thr He Asn Ala Arg He Lys Ala Leu 340 345 350 355
AAT AAA ATC GCT TTA AAG CAC CAA AAC CAT TCC TTT AAG GCT TTG TTG 1162 Asn Lys He Ala Leu Lys His Gin Asn His Ser Phe Lys Ala Leu Leu 360 365 370
AAT AAG CCC ATT AAG GCG TTA GTG GAA AAT AAA GAG GGC GAG TAT TTT 1210 Asn Lys Pro He Lys Ala Leu Val Glu Asn Lys Glu Gly Glu Tyr Phe 375 380 385
TAC AAA GCA AGG GAT CTC AGA TGG GCG CCT GAA GTG GAT GGG GAA ATC 1258 Tyr Lys Ala Arg Asp Leu Arg Trp Ala Pro Glu Val Asp Gly Glu He 390 395 400
TTG ATC AAT GAT AGC GAA CTA ACC ACC CCC TTA AAA CCC GGG CAT TAT 1306 Leu He Asn Asp Ser Glu Leu Thr Thr Pro Leu Lys Pro Gly His Tyr 405 410 415
ACG ATT GCA CCT AGC GAA TTT AAA GAT AAT ATC CTA CTC GCT AAG GTT 1354 Thr He Ala Pro Ser Glu Phe Lys Asp Asn He Leu Leu Ala Lys Val 420 425 430 435 TTA AGC CCT TTT TAAAAGTTAG CCATAAGGCT AAAAGCACGG CTAAAGCGT 1405
Leu Ser Pro Phe
(2) INFORMATION FOR SEQ ID NO: 66:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 439 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:
Met Gin Val Lys Glu Asn Lys Gin Leu Cys Leu He Ser Leu Gly Cys
1 5 10 15
Ser Lys Asn Leu Val Asp Ser Glu Val Met Leu Gly Lys Leu Tyr Asn
20 25 30
Tyr Thr Leu Thr Asn Asp Ala Lys Ser Ala Asp Val He Leu He Asn
35 40 45
Thr Cys Gly Phe He Glu Ser Ala Lys Gin Glu Ser He Gin Thr He
50 55 60
Leu Asn Ala Ala Lys Asp Lys Lys Glu Gly Ala He Leu He Ala Ser 65 70 75 80
Gly Cys Leu Ser Glu Arg Tyr Lys Asp Glu He Lys Glu Leu He Pro
85 90 95
Glu Val Asp He Phe Thr Gly Val Gly Asp Tyr Asp Lys He Asp He
100 105 110
Met He Ala Lys Lys Gin Asn Gin Phe Ser Glu Gin Val Phe Leu Ser
115 120 125
Glu His Tyr Asn Ala Arg He He Thr Gly Ser Ser Val His Ala Tyr
130 135 140
Val Lys He Ser Glu Gly Cys Asn Gin Lys Cys Ser Phe Cys Ala He 145 150 155 160
Pro Ser Phe Lys Gly Lys Leu Gin Ser Arg Glu Leu Asp Ser He Leu
165 170 175
Lys Glu Val Glu Asn Leu Ala Leu Lys Gly Tyr Thr Asp Met Thr Phe
180 185 190
He Ala Gin Asp Ser Ser Ser Phe Leu Tyr Asp Lys Gly Gin Lys Asp
195 200 205
Gly Leu He Gin Leu He Arg Ala He Asp Lys Gin Gin Ala Leu Lys
210 215 220
Ser Ala Arg He Leu Tyr Leu Tyr Pro Ser Ser Thr Thr Leu Glu Leu 225 230 235 240
He Gly Ala He Glu Ser Ser Pro He Phe Gin Asn Tyr Phe Asp Met
245 250 255
Pro He Gin His He Ser Asp Ser Met Leu Lys Lys Met Arg Arg Asn
260 265 270
Ser Ser Gin Ala His His Leu Lys Leu Leu Asp Ala Met Lys Gin Val
275 280 285
Lys Glu Ser Phe He Arg Ser Thr He He Val Gly His Pro Glu Glu 290 295 300
Asn Glu Ser Glu Phe Glu Glu Leu Ser Ala Phe Leu Asp Glu Phe Gin 305 310 315 320
Phe Asp Arg Leu Asn He Phe Ala Phe Ser Ala Glu Glu Asn Thr His
325 330 335
Ala Tyr Ser Leu Glu Lys Val Pro Lys Lys Thr He Asn Ala Arg He
340 345 350
Lys Ala Leu Asn Lys He Ala Leu Lys His Gin Asn His Ser Phe Lys
355 360 365
Ala Leu Leu Asn Lys Pro He Lys Ala Leu Val Glu Asn Lys Glu Gly
370 375 380
Glu Tyr Phe Tyr Lys Ala Arg Asp Leu Arg Trp Ala Pro Glu Val Asp 385 390 395 400
Gly Glu He Leu He Asn Asp Ser Glu Leu Thr Thr Pro Leu Lys Pro
405 410 415
Gly His Tyr Thr He Ala Pro Ser Glu Phe Lys Asp Asn He Leu Leu
420 425 430
Ala Lys Val Leu Ser Pro Phe 435
(2) INFORMATION FOR SEQ ID NO: 67:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1420 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 200...1366 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:
TGCGCGTTAT RTGGTTTTGA TTGAAAAGTT AGGCATTAAA GACAGATAAT TTTTATCGGT 60
TTTAGGGTGT GGCGAGTTTG TGTTGAAAGA GCGTTTGAAG GCCTTTTTTA GTGCGGACTC 120
TGTTTTCACT TTAATTTTTG CCCTTTTCTT TCTCACTTCG TTTAAAAAAC CTTTAACTCA 180
AGTCTTGTTG ATTGTTTTA ATG GTT TTT TTG TTT TTT AGG TGT TAT TTC CAA 232
Met Val Phe Leu Phe Phe Arg Cys Tyr Phe Gin 1 5 10
GCG TCT TTG AAA GAA ACT TTC GCA ATT AAT CAT TTA AAA ACA ATG TCT 280 Ala Ser Leu Lys Glu Thr Phe Ala He Asn His Leu Lys Thr Met Ser 15 20 25
TTT AAA TGG CTC ACT CTG GCT TTT TTG GGC GTG TTT TTA AGC ATC TTC 328 Phe Lys Trp Leu Thr Leu Ala Phe Leu Gly Val Phe Leu Ser He Phe 30 35 40 CCT AAC ATG TTT AAC ATG CAT GAT AGC CAA ACT TTC CGC TAC AAT TTA 376 Pro Asn Met Phe Asn Met His Asp Ser Gin Thr Phe Arg Tyr Asn Leu 45 50 55
TTC GCT CTA AAC ATG TCC TTA ACT TAT GCT TGC GGG GCG TTA TGC TTG 424 Phe Ala Leu Asn Met Ser Leu Thr Tyr Ala Cys Gly Ala Leu Cys Leu 60 65 70 75
CTT TTT GCC AGT TGC TTA AGA ATC AAA TTG AAT CAA AAA ATC CTT TTT 472 Leu Phe Ala Ser Cys Leu Arg He Lys Leu Asn Gin Lys He Leu Phe 80 85 90
TAC AGC ATG GCT GTT GCA AAT TTC ATC AAC GGC TTG CTC TCA TTG GTG 520 Tyr Ser Met Ala Val Ala Asn Phe He Asn Gly Leu Leu Ser Leu Val 95 100 105
CAA AAA ATT TAT TTT AAC ATG CCC AGA GCG CAA GGG TTT AGC ACG GTT 568 Gin Lys He Tyr Phe Asn Met Pro Arg Ala Gin Gly Phe Ser Thr Val 110 115 120
AAG GAG TAT GTG GTT TTA GTG AGC GTG TCC ATT TTA GGC TGT TAT ATT 616 Lys Glu Tyr Val Val Leu Val Ser Val Ser He Leu Gly Cys Tyr He 125 130 135
TAT GCG CTT TAT TCG CAC AAT CAA AAA GAA AAA CTT TTT TTC ACG CTT 664 Tyr Ala Leu Tyr Ser His Asn Gin Lys Glu Lys Leu Phe Phe Thr Leu 140 145 150 155
TCT GTT TTT GTG GGG TTT TTA GTC GTT ATT TTA AGC GCC ACA AGG AGC 712 Ser Val Phe Val Gly Phe Leu Val Val He Leu Ser Ala Thr Arg Ser 160 165 170
GCG ACA ATC GCT TTT GTT ATT ACT TTT TTA ATC CTT TCT TGC TTT ATT 760 Ala Thr He Ala Phe Val He Thr Phe Leu He Leu Ser Cys Phe He 175 180 185
TTA TAC GCC AAA AAA TCG CTC AAA CCA TTG GGT TAT ATG GTG GTC GTG 808 Leu Tyr Ala Lys Lys Ser Leu Lys Pro Leu Gly Tyr Met Val Val Val 190 195 200
AGT CTT ATT TTG AGC GCT TTG TAT GTG GGG AGT AAC GCT TTA GAA AAA 856 Ser Leu He Leu Ser Ala Leu Tyr Val Gly Ser Asn Ala Leu Glu Lys 205 210 215
AAG GGG GCA ATA GAG CAA TCT AGA GTT CAA AAT CAA AGC TTT GAA GAA 904 Lys Gly Ala He Glu Gin Ser Arg Val Gin Asn Gin Ser Phe Glu Glu 220 225 230 235
GAT CTG AAA CGC TAC GCT AAA AAG GAC GCT GAT AGC AGT ATC GGA TGG 952 Asp Leu Lys Arg Tyr Ala Lys Lys Asp Ala Asp Ser Ser He Gly Trp 240 245 250
CGT TTG GAG CGT TGG AAA GAA GCC CTA ACG GTT TTG CGT TTA AGG CCC 1000 Arg Leu Glu Arg Trp Lys Glu Ala Leu Thr Val Leu Arg Leu Arg Pro 255 260 265 TTT TTT GGT ATG GCC GCT AGC GAG AAA TGC CAG AGG TTA GAA GAG ATT 1048 Phe Phe Gly Met Ala Ala Ser Glu Lys Cys Gin Arg Leu Glu Glu He 270 275 280
TTA TCC TTA TCA AAG TCT TAT AGG GCC AAA GAT TTG ATT CTC TGT TAT 1096 Leu Ser Leu Ser Lys Ser Tyr Arg Ala Lys Asp Leu He Leu Cys Tyr 285 290 295
GAA AGA TAC GAC AAT CAA ATC ATT CAC ATT TTA GCC ACT AGG GGG ATC 1144 Glu Arg Tyr Asp Asn Gin He He His He Leu Ala Thr Arg Gly He 300 305 310 315
ATA GGC TTT TTG ATC TGG CTC TTT TTT TTA TTA GTT ATT GTA AAG ATT 1192 He Gly Phe Leu He Trp Leu Phe Phe Leu Leu Val He Val Lys He 320 325 330
TTT TGG AGC GGG ATA AAG CAA AAC TCT TTA ATA TCG TTT TTT ATA CTA 1240 Phe Trp Ser Gly He Lys Gin Asn Ser Leu He Ser Phe Phe He Leu 335 340 345
ATG ACA CTC GCC TTT TAC CTC ATT TTT GGC ATT GGG TTT GAC CCC TTT 1288 Met Thr Leu Ala Phe Tyr Leu He Phe Gly He Gly Phe Asp Pro Phe 350 355 360
GAT TTC TTC ATT ACG GGA AGT TTT TTT GTA GGA ATG ATC ATG ATG GCT 1336 Asp Phe Phe He Thr Gly Ser Phe Phe Val Gly Met He Met Met Ala 365 370 375
GTT TTT TTA AAA AAA GAT AAA AGT GCT TTT TAGCATCAAG GGGTTTGACA TTA 1389 Val Phe Leu Lys Lys Asp Lys Ser Ala Phe 380 385
GTCAAGCGGT AGTTTCTTGT GATTCGTTCT T 1420
(2) INFORMATION FOR SEQ ID NO: 68:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 389 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68:
Met Val Phe Leu Phe Phe Arg Cys Tyr Phe Gin Ala Ser Leu Lys Glu
1 5 10 15
Thr Phe Ala He Asn His Leu Lys Thr Met Ser Phe Lys Trp Leu Thr
20 25 30
Leu Ala Phe Leu Gly Val Phe Leu Ser He Phe Pro Asn Met Phe Asn
35 40 45
Met His Asp Ser Gin Thr Phe Arg Tyr Asn Leu Phe Ala Leu Asn Met 50 55 60 Ser Leu Thr Tyr Ala Cys Gly Ala Leu Cys Leu Leu Phe Ala Ser Cys 65 70 75 80
Leu Arg He Lys Leu Asn Gin Lys He Leu Phe Tyr Ser Met Ala Val
85 90 95
Ala Asn Phe He Asn Gly Leu Leu Ser Leu Val Gin Lys He Tyr Phe
100 105 110
Asn Met Pro Arg Ala Gin Gly Phe Ser Thr Val Lys Glu Tyr Val Val
115 120 125
Leu Val Ser Val Ser He Leu Gly Cys Tyr He Tyr Ala Leu Tyr Ser
130 135 140
His Asn Gin Lys Glu Lys Leu Phe Phe Thr Leu Ser Val Phe Val Gly 145 150 155 160
Phe Leu Val Val He Leu Ser Ala Thr Arg Ser Ala Thr He Ala Phe
165 170 175
Val He Thr Phe Leu He Leu Ser Cys Phe He Leu Tyr Ala Lys Lys
180 185 190
Ser Leu Lys Pro Leu Gly Tyr Met Val Val Val Ser Leu He Leu Ser
195 200 205
Ala Leu Tyr Val Gly Ser Asn Ala Leu Glu Lys Lys Gly Ala He Glu
210 215 220
Gin Ser Arg Val Gin Asn Gin Ser Phe Glu Glu Asp Leu Lys Arg Tyr 225 230 235 240
Ala Lys Lys Asp Ala Asp Ser Ser He Gly Trp Arg Leu Glu Arg Trp
245 250 255
Lys Glu Ala Leu Thr Val Leu Arg Leu Arg Pro Phe Phe Gly Met Ala
260 265 270
Ala Ser Glu Lys Cys Gin Arg Leu Glu Glu He Leu Ser Leu Ser Lys
275 280 285
Ser Tyr Arg Ala Lys Asp Leu He Leu Cys Tyr Glu Arg Tyr Asp Asn
290 295 300
Gin He He His He Leu Ala Thr Arg Gly He He Gly Phe Leu He 305 310 315 320
Trp Leu Phe Phe Leu Leu Val He Val Lys He Phe Trp Ser Gly He
325 330 335
Lys Gin Asn Ser Leu He Ser Phe Phe He Leu Met Thr Leu Ala Phe
340 345 350
Tyr Leu He Phe Gly He Gly Phe Asp Pro Phe Asp Phe Phe He Thr
355 360 365
Gly Ser Phe Phe Val Gly Met He Met Met Ala Val Phe Leu Lys Lys
370 375 380
Asp Lys Ser Ala Phe 385
(2) INFORMATION FOR SEQ ID NO: 69:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1252 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence (B) LOCATION: 89...1198 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:
AAAAAGGAAA TAGCACGATG AAACCTAAAG GGGATATTAG CGTTAATATG CTAATATAGT 60 AGAACATTAT GACTACAAAA AGGGTGAT ATG CTG ATC TCC ATA GCG TTT TTA 112
Met Leu He Ser He Ala Phe Leu 1 5
TTG GTT TTA TAT CTT TTG AAT TAT AGT TCT TTC AGG ATG TTG AAA TCG 160 Leu Val Leu Tyr Leu Leu Asn Tyr Ser Ser Phe Arg Met Leu Lys Ser 10 15 20
TTT TTA ACC TTA AAG AAA ATC TCT CAA TAC GCT TAT TTA TGG TTT TTT 208 Phe Leu Thr Leu Lys Lys He Ser Gin Tyr Ala Tyr Leu Trp Phe Phe 25 30 35 40
ATC CTT TTG AGC ATA GGC GAG GCG GCT TTT GTT TTT TAT AGA AAT ATT 256 He Leu Leu Ser He Gly Glu Ala Ala Phe Val Phe Tyr Arg Asn He 45 50 55
ATG CCT AGC CAT TTG TTT GTT TTG ACT TCA GCG TGT TCG TTT GTG TCT 304 Met Pro Ser His Leu Phe Val Leu Thr Ser Ala Cys Ser Phe Val Ser 60 65 70
TTT ATT ATT TTT ATC CTT TCT TTA AGT TTT TAC GGG TTT TCC TAT TCC 352 Phe He He Phe He Leu Ser Leu Ser Phe Tyr Gly Phe Ser Tyr Ser 75 80 85
ATA GAA AAA ATA GAT TTT TTG CAT TCA AGG CGT AAA AGT TTA AAA AAC 400 He Glu Lys He Asp Phe Leu His Ser Arg Arg Lys Ser Leu Lys Asn 90 95 100
TTT TTA AAA TTG GGG TTT TAT CTG GCG TTA TTA GGG TAT TTT TGG CGT 448 Phe Leu Lys Leu Gly Phe Tyr Leu Ala Leu Leu Gly Tyr Phe Trp Arg 105 110 115 120
GGG TTT TAT GAA GGG TTG GCC CGC CCT AAA ATC AAA GAA ACC CCT ATT 496 Gly Phe Tyr Glu Gly Leu Ala Arg Pro Lys He Lys Glu Thr Pro He 125 130 135
TAT TTG GAT AAG CTG GAT AAA GAA TTA AAG ATT ATT TTA CTC ACA GAC 544 Tyr Leu Asp Lys Leu Asp Lys Glu Leu Lys He He Leu Leu Thr Asp 140 145 150
ATG CAT GTG GGG AGT TTG TTG CAA AAA GAT TTT GTT GAT TAC ATT GTA 592 Met His Val Gly Ser Leu Leu Gin Lys Asp Phe Val Asp Tyr He Val 155 160 165
GAA GAA GTC AAT CAA AAA GAA GTG GAT ATG GTG CTG ATT GGG GGG GAT 640 Glu Glu Val Asn Gin Lys Glu Val Asp Met Val Leu He Gly Gly Asp 170 175 180 TTA GTG GAT GAA AGC ATT GAA AAA GTC AAA TCT TTT TTA CTG CCT TTA 688 Leu Val Asp Glu Ser He Glu Lys Val Lys Ser Phe Leu Leu Pro Leu 185 190 195 200
AAC AAC CTT AAA AGC ACG CAT GGC ACT TTT TAT GTG CCA GGA AAT CAT 736 Asn Asn Leu Lys Ser Thr His Gly Thr Phe Tyr Val Pro Gly Asn His 205 210 215
GAG TAT TAT CAT GGC ATA GAG CCG ATT TTA TCG TTT CTT GAC ACG CTT 784 Glu Tyr Tyr His Gly He Glu Pro He Leu Ser Phe Leu Asp Thr Leu 220 225 230
AAT TTG ACG ATT TTA GGG AAT GAG TGC GTG CAT TTA GGG GGG ATC AAT 832 Asn Leu Thr He Leu Gly Asn Glu Cys Val His Leu Gly Gly He Asn 235 240 245
TTG TGC GGC GTG TAT GAT TAT TTC GCA AGG AAG CGT CAA AAT TTT GCC 880 Leu Cys Gly Val Tyr Asp Tyr Phe Ala Arg Lys Arg Gin Asn Phe Ala 250 255 260
CCT GAT ATT GAC AAA GCT TTA AAA AAG CGC AAT GAG AGT AAG CCC ACG 928 Pro Asp He Asp Lys Ala Leu Lys Lys Arg Asn Glu Ser Lys Pro Thr 265 270 275 280
ATC CTT TTG GCC CAC CAA CCT AAA CAA ATT AGA AGC CTC AAA GAA AGC 976 He Leu Leu Ala His Gin Pro Lys Gin He Arg Ser Leu Lys Glu Ser 285 290 295
CAC TCT GTA GAT TTA GTC CTT TCA GGG CAT ACC CAT GCA GGG CAA ATC 1024 His Ser Val Asp Leu Val Leu Ser Gly His Thr His Ala Gly Gin He 300 305 310
TTT CCC TTT AGC CTT TTA GTC AAG TTG GCG CAA ACC TAT TTA CAT GGT 1072 Phe Pro Phe Ser Leu Leu Val Lys Leu Ala Gin Thr Tyr Leu His Gly 315 320 325
TTA TAC AAG CAC AGC CCC ACC ACT CAA ATT TAT GTG AGC AGT GGG GCA 1120 Leu Tyr Lys His Ser Pro Thr Thr Gin He Tyr Val Ser Ser Gly Ala 330 335 340
GGG TAT TGG GGG ATT CCT TTA AGG TTT TTA GCC CCT AGC GAG ATC GCA 1168 Gly Tyr Trp Gly He Pro Leu Arg Phe Leu Ala Pro Ser Glu He Ala 345 350 355 360
TAC CTT AGG CTT TTA CCT AAA AAT CAA GCT TAGTTAAACA AAATCTTAAA ATC 1221 Tyr Leu Arg Leu Leu Pro Lys Asn Gin Ala 365 370
TTAATCGTAA TCAAGCGGTT AAAAATAAGA A 1252
(2) INFORMATION FOR SEQ ID NO: 70:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 370 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70:
Met Leu He Ser He Ala Phe Leu Leu Val Leu Tyr Leu Leu Asn Tyr
1 5 10 15
Ser Ser Phe Arg Met Leu Lys Ser Phe Leu Thr Leu Lys Lys He Ser
20 25 30
Gin Tyr Ala Tyr Leu Trp Phe Phe He Leu Leu Ser He Gly Glu Ala
35 40 45
Ala Phe Val Phe Tyr Arg Asn He Met Pro Ser His Leu Phe Val Leu
50 55 60
Thr Ser Ala Cys Ser Phe Val Ser Phe He He Phe He Leu Ser Leu 65 70 75 80
Ser Phe Tyr Gly Phe Ser Tyr Ser He Glu Lys He Asp Phe Leu His
85 90 95
Ser Arg Arg Lys Ser Leu Lys Asn Phe Leu Lys Leu Gly Phe Tyr Leu
100 105 110
Ala Leu Leu Gly Tyr Phe Trp Arg Gly Phe Tyr Glu Gly Leu Ala Arg
115 120 125
Pro Lys He Lys Glu Thr Pro He Tyr Leu Asp Lys Leu Asp Lys Glu
130 135 140
Leu Lys He He Leu Leu Thr Asp Met His Val Gly Ser Leu Leu Gin 145 150 155 160
Lys Asp Phe Val Asp Tyr He Val Glu Glu Val Asn Gin Lys Glu Val
165 170 175
Asp Met Val Leu He Gly Gly Asp Leu Val Asp Glu Ser He Glu Lys
180 185 190
Val Lys Ser Phe Leu Leu Pro Leu Asn Asn Leu Lys Ser Thr His Gly
195 200 205
Thr Phe Tyr Val Pro Gly Asn His Glu Tyr Tyr His Gly He Glu Pro
210 215 220
He Leu Ser Phe Leu Asp Thr Leu Asn Leu Thr He Leu Gly Asn Glu 225 230 235 240
Cys Val His Leu Gly Gly He Asn Leu Cys Gly Val Tyr Asp Tyr Phe
245 250 255
Ala Arg Lys Arg Gin Asn Phe Ala Pro Asp He Asp Lys Ala Leu Lys
260 265 270
Lys Arg Asn Glu Ser Lys Pro Thr He Leu Leu Ala His Gin Pro Lys
275 280 285
Gin He Arg Ser Leu Lys Glu Ser His Ser Val Asp Leu Val Leu Ser
290 295 300
Gly His Thr His Ala Gly Gin He Phe Pro Phe Ser Leu Leu Val Lys 305 310 315 320
Leu Ala Gin Thr Tyr Leu His Gly Leu Tyr Lys His Ser Pro Thr Thr
325 330 335
Gin He Tyr Val Ser Ser Gly Ala Gly Tyr Trp Gly He Pro Leu Arg
340 345 350
Phe Leu Ala Pro Ser Glu He Ala Tyr Leu Arg Leu Leu Pro Lys Asn
355 360 365
Gin Ala 370
(2) INFORMATION FOR SEQ ID NO: 71:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 431 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 103...381 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:
AGTCTTTGCC CGCTATCAAA AAAGAGATTT TTTGCAATAT TTTCATAAGC ACCGAAGAAA 60 GCTTGCTTTA GCGAGTTTGT GGGTGAAACG TACGCCTTGC TC ATG ATT TTT GTC 114
Met He Phe Val 1
AAT AAA TAT CTC TAT GGG ATT AAA AGC GTT GTG CCT TTG GCG GTT GGT 162 Asn Lys Tyr Leu Tyr Gly He Lys Ser Val Val Pro Leu Ala Val Gly 5 10 15 20
TTT AGC AAA TAC CCT TTA AAA AAG TTT TTA TGG CTT AAT GTT TTT TCC 210 Phe Ser Lys Tyr Pro Leu Lys Lys Phe Leu Trp Leu Asn Val Phe Ser 25 30 35
AGT TTT TTG TGG GCG CTC ATC GTG GGG AGC GTT TCT TTT CAA GCG AGC 258 Ser Phe Leu Trp Ala Leu He Val Gly Ser Val Ser Phe Gin Ala Ser 40 45 50
GAT TGG GTG AAA ACG CTG TAT GAA AGG CTT TCT CAT TAC ACT TCG TTT 306 Asp Trp Val Lys Thr Leu Tyr Glu Arg Leu Ser His Tyr Thr Ser Phe 55 60 65
TTT ATC ATA AGT TTT GTT CTT ATA GCG CTT TTA ATA TGG TTT TTA TTG 354 Phe He He Ser Phe Val Leu He Ala Leu Leu He Trp Phe Leu Leu 70 75 80
AAA CGA TAT TCG CGC AAA ATG GGT TTT TAAGCAAGAT GTTTAATTAA ATGCGCT 408 Lys Arg Tyr Ser Arg Lys Met Gly Phe 85 90
AGACTACGCC CACAAGCATT CGC 431
(2) INFORMATION FOR SEQ ID NO: 72: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 93 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72:
Met He Phe Val Asn Lys Tyr Leu Tyr Gly He Lys Ser Val Val Pro
1 5 10 15
Leu Ala Val Gly Phe Ser Lys Tyr Pro Leu Lys Lys Phe Leu Trp Leu
20 25 30
Asn Val Phe Ser Ser Phe Leu Trp Ala Leu He Val Gly Ser Val Ser
35 40 45
Phe Gin Ala Ser Asp Trp Val Lys Thr Leu Tyr Glu Arg Leu Ser His
50 55 60
Tyr Thr Ser Phe Phe He He Ser Phe Val Leu He Ala Leu Leu He 65 70 75 80
Trp Phe Leu Leu Lys Arg Tyr Ser Arg Lys Met Gly Phe 85 90
(2) INFORMATION FOR SEQ ID NO: 73:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1281 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 70...1227 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73:
TAGCATCAAT ACCCCTTAAA TAAAAGATAT AATGCTGTAT TATAAGCTAG TTTTAATTAC 60 AATTTTCAA ATG TTA AGG AAA AAC ATT TTA GCT TAC TAT GGG GCG AAT TTT 111 Met Leu Arg Lys Asn He Leu Ala Tyr Tyr Gly Ala Asn Phe 1 5 10
CTC TTA ATC ATC GCT CAA AGC TTA CCC CAT GCG ATT TTA ACC CCC TTG 159 Leu Leu He He Ala Gin Ser Leu Pro His Ala He Leu Thr Pro Leu 15 20 25 30
TTG CTT TCT AAA GGG CTT AGT TTG AGT GAA ATC TTG CTC GTG CAA ACC 207 Leu Leu Ser Lys Gly Leu Ser Leu Ser Glu He Leu Leu Val Gin Thr 35 40 45
TTT TTT AGC TTT TGC GTG CTA GTG GCT GAA TAC CCA AGC GGC GTT TTA 255 Phe Phe Ser Phe Cys Val Leu Val Ala Glu Tyr Pro Ser Gly Val Leu 50 55 60
GCG GAT TTG ATG AGC CGA AAA AAT TTA TTC CTG GTT TCT AAT GCC TTT 303 Ala Asp Leu Met Ser Arg Lys Asn Leu Phe Leu Val Ser Asn Ala Phe 65 70 75
TTA ATC GCT AGT TTT TCG TTT GTG CTG TTT TTT GAT AGC TTT ATT TTC 351 Leu He Ala Ser Phe Ser Phe Val Leu Phe Phe Asp Ser Phe He Phe 80 85 90
ATG CTT TTA GCG TGG GGG TTG TAT GGT TTG TAT AGC GCA TGC TCT AGC 399 Met Leu Leu Ala Trp Gly Leu Tyr Gly Leu Tyr Ser Ala Cys Ser Ser 95 100 105 110
GGC ACG ATT GAA GCT TCA CTC ATC ACA GAC ATT AAG GAA AAC AAA AAA 447 Gly Thr He Glu Ala Ser Leu He Thr Asp He Lys Glu Asn Lys Lys 115 120 125
GAT TTA TCC AAG TTT TTA GCC AAA AAC AAT CAA ATT ACT TAT TTA GGC 495 Asp Leu Ser Lys Phe Leu Ala Lys Asn Asn Gin He Thr Tyr Leu Gly 130 135 140
ATG ATT ATA GGG AGT TCT TTG GGA TCG TTT TTG TAT CTC AAA GTC CAT 543 Met He He Gly Ser Ser Leu Gly Ser Phe Leu Tyr Leu Lys Val His 145 150 155
GCG ATG CTG TAT ATT GTG GGG ATT TTT TTA ATC ATG CTC TGT GTG CTA 591 Ala Met Leu Tyr He Val Gly He Phe Leu He Met Leu Cys Val Leu 160 165 170
ACG ATC ATT TTT TAT TTT AAA GAG AAA GAA GGG GAT TTT AAA AGC CAA 639 Thr He He Phe Tyr Phe Lys Glu Lys Glu Gly Asp Phe Lys Ser Gin 175 180 185 190
AAA AGC CTG AAA CTC CTT AAA GAG CAA GTC AAA GGC AGT CTT AAA GAG 687 Lys Ser Leu Lys Leu Leu Lys Glu Gin Val Lys Gly Ser Leu Lys Glu 195 200 205
CTT AAA GAT AAC CCC AAA CTT AAA ATT CTG TTA GTG GGG CAT TTG ATT 735 Leu Lys Asp Asn Pro Lys Leu Lys He Leu Leu Val Gly His Leu He 210 215 220
ACG CCC GTC TTT TTT ATG AGC CAT TTT CAA ATG TGG CAA GCG TAT TTT 783 Thr Pro Val Phe Phe Met Ser His Phe Gin Met Trp Gin Ala Tyr Phe 225 230 235
TTA AAA CAA GGC GTT AAA GAG CAA TAC CTT TTT GTG TTT TAT ATC GCT 831 Leu Lys Gin Gly Val Lys Glu Gin Tyr Leu Phe Val Phe Tyr He Ala 240 245 250
TTT CAA GTG ATT TCT ATT CTC ATT CAT TTT TTA AAA GCC TCT AGT TAT 879 Phe Gin Val He Ser He Leu He His Phe Leu Lys Ala Ser Ser Tyr 255 260 265 270
AGC CAA AAA ATC GCC TTG AGT TCG CTT GTG GTG TTG TTA GGC GTT AGC 927 Ser Gin Lys He Ala Leu Ser Ser Leu Val Val Leu Leu Gly Val Ser 275 280 285
CCC TTA TTG CTT AGC AAT ATC CCT TAT TGT TTC ATA GGG GTG TAT GCG 975 Pro Leu Leu Leu Ser Asn He Pro Tyr Cys Phe He Gly Val Tyr Ala 290 295 300
CTC ATG GTG GCG TTT TTC ACT TAC ATG AGC TAT TGC TTA AAC TAT CAA 1023 Leu Met Val Ala Phe Phe Thr Tyr Met Ser Tyr Cys Leu Asn Tyr Gin 305 310 315
TTC TCC AAA TTC GTT TCT AAA AAC AAC ATT TCC TCG CTC TCA TCG CTT 1071 Phe Ser Lys Phe Val Ser Lys Asn Asn He Ser Ser Leu Ser Ser Leu 320 325 330
TTA TCA AGC TGT GTG CGC GTG GTC TCT GTG CTA ATC TTA TCG CTC AGC 1119 Leu Ser Ser Cys Val Arg Val Val Ser Val Leu He Leu Ser Leu Ser 335 340 345 350
AGT CTG GAA CTG CGT TAC TTC TCA CCC CTA ACT ATC ATA ACC ATG CAT 1167 Ser Leu Glu Leu Arg Tyr Phe Ser Pro Leu Thr He He Thr Met His 355 360 365
TTT GCC TTG ACG CTT ATC ATC CTC TTT TTC TTT TTG TAT AAG GCT AAG 1215 Phe Ala Leu Thr Leu He He Leu Phe Phe Phe Leu Tyr Lys Ala Lys 370 375 380
CCG TTT GAT GAG TGAGCGGCTT TAAGAGTGCA ACCTTTTAGC GATTTCTATA GCAAC 1272 Pro Phe Asp Glu 385
ATCATAGCC 1281
(2) INFORMATION FOR SEQ ID NO : 7 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 386 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 74 :
Met Leu Arg Lys Asn He Leu Ala Tyr Tyr Gly Ala Asn Phe Leu Leu
1 5 10 15
He He Ala Gin Ser Leu Pro His Ala He Leu Thr Pro Leu Leu Leu
20 25 30
Ser Lys Gly Leu Ser Leu Ser Glu He Leu Leu Val Gin Thr Phe Phe 35 40 45
Ser Phe Cys Val Leu Val Ala Glu Tyr Pro Ser Gly Val Leu Ala Asp
50 55 60
Leu Met Ser Arg Lys Asn Leu Phe Leu Val Ser Asn Ala Phe Leu He 65 70 75 80
Ala Ser Phe Ser Phe Val Leu Phe Phe Asp Ser Phe He Phe Met Leu
85 90 95
Leu Ala Trp Gly Leu Tyr Gly Leu Tyr Ser Ala Cys Ser Ser Gly Thr
100 105 110
He Glu Ala Ser Leu He Thr Asp He Lys Glu Asn Lys Lys Asp Leu
115 120 125
Ser Lys Phe Leu Ala Lys Asn Asn Gin He Thr Tyr Leu Gly Met He
130 135 140
He Gly Ser Ser Leu Gly Ser Phe Leu Tyr Leu Lys Val His Ala Met 145 150 155 160
Leu Tyr He Val Gly He Phe Leu He Met Leu Cys Val Leu Thr He
165 170 175
He Phe Tyr Phe Lys Glu Lys Glu Gly Asp Phe Lys Ser Gin Lys Ser
180 185 190
Leu Lys Leu Leu Lys Glu Gin Val Lys Gly Ser Leu Lys Glu Leu Lys
195 200 205
Asp Asn Pro Lys Leu Lys He Leu Leu Val Gly His Leu He Thr Pro
210 215 220
Val Phe Phe Met Ser His Phe Gin Met Trp Gin Ala Tyr Phe Leu Lys 225 230 235 240
Gin Gly Val Lys Glu Gin Tyr Leu Phe Val Phe Tyr He Ala Phe Gin
245 250 255
Val He Ser He Leu He His Phe Leu Lys Ala Ser Ser Tyr Ser Gin
260 265 270
Lys He Ala Leu Ser Ser Leu Val Val Leu Leu Gly Val Ser Pro Leu
275 280 285
Leu Leu Ser Asn He Pro Tyr Cys Phe He Gly Val Tyr Ala Leu Met
290 295 300
Val Ala Phe Phe Thr Tyr Met Ser Tyr Cys Leu Asn Tyr Gin Phe Ser 305 310 315 320
Lys Phe Val Ser Lys Asn Asn He Ser Ser Leu Ser Ser Leu Leu Ser
325 330 335
Ser Cys Val Arg Val Val Ser Val Leu He Leu Ser Leu Ser Ser Leu
340 345 350
Glu Leu Arg Tyr Phe Ser Pro Leu Thr He He Thr Met His Phe Ala
355 360 365
Leu Thr Leu He He Leu Phe Phe Phe Leu Tyr Lys Ala Lys Pro Phe
370 375 380
Asp Glu 385
(2) INFORMATION FOR SEQ ID NO: 75:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2218 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA ( ix) FEATURE :
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 77...2167 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:
GAAGTTTATG AGCCGTTTTG CCACTATTCA AAGAAATTTT GGATTATAAT AAAAAAACTG 60 GCTGAAATTA ACAACA ATG ATT AAA CAA TCA TTA AAT GGA GAG GAC ATG CAA 112
Met He Lys Gin Ser Leu Asn Gly Glu Asp Met Gin 1 5 10
AAA AGT TTA GTT TCT TTG GCT TGG GTT TTT GTA GCT ATT TTA GGG GCG 160 Lys Ser Leu Val Ser Leu Ala Trp Val Phe Val Ala He Leu Gly Ala 15 20 25
ATC TGT TTA GGG GTG TTA GCC TTA CAC AAG GGT GAG AGC ATT AAC ACG 208 He Cys Leu Gly Val Leu Ala Leu His Lys Gly Glu Ser He Asn Thr 30 35 40
CTA TGG CTT GTA GTA GCG AGC GCT TGT ATT TAT AGC ATA GGC TAT CGT 256 Leu Trp Leu Val Val Ala Ser Ala Cys He Tyr Ser He Gly Tyr Arg 45 50 55 60
TTT TAT AGC CAT TTT ATC GCT TAT AAG GTG TTA AAG CTA GAT GAT AGC 304 Phe Tyr Ser His Phe He Ala Tyr Lys Val Leu Lys Leu Asp Asp Ser 65 70 75
AGA GCC ACG CCC GCA TGC GTA AGG AAT GAT GGC AAG GAT TTT GTG CCA 352 Arg Ala Thr Pro Ala Cys Val Arg Asn Asp Gly Lys Asp Phe Val Pro 80 85 90
ACC GAT AAA GCG ATC ACT TTT GGG CAC CAT TTC GCC GCT ATT GCT GGG 400 Thr Asp Lys Ala He Thr Phe Gly His His Phe Ala Ala He Ala Gly 95 100 105
GCT GGC CCT TTA GTA GGC CCG ATA CTA GCC GCT CAA ATG GGT TAC TTG 448 Ala Gly Pro Leu Val Gly Pro He Leu Ala Ala Gin Met Gly Tyr Leu 110 115 120
CCC TCT ATC TTA TGG ATT TTG ATA GGC TCG GTT TTA GGG GGT TGC GTG 496 Pro Ser He Leu Trp He Leu He Gly Ser Val Leu Gly Gly Cys Val 125 130 135 140
CAT GAT TTT GTG GTG CTT TTT GCT TCT ATT AGG CGC GAT GGC AAG TCT 544 His Asp Phe Val Val Leu Phe Ala Ser He Arg Arg Asp Gly Lys Ser 145 150 155
TTA GGC GAA ATG ATC AAA CTT GAA ATG GGC CAA TTT GTA GGC ATG ATC 592 Leu Gly Glu Met He Lys Leu Glu Met Gly Gin Phe Val Gly Met He 160 165 170 GCA AGT CTG GGC ATT TTA GGG ATC ATG CTC ATT ATC ATT GCG ATT TTA 640 Ala Ser Leu Gly He Leu Gly He Met Leu He He He Ala He Leu 175 180 185
GCG ATG GTG GTG GTG AAG GCT TTA GCG CAT TCG CCT TGG GGC TTT TTT 688 Ala Met Val Val Val Lys Ala Leu Ala His Ser Pro Trp Gly Phe Phe 190 195 200
ACG ATC GCA ATG ACT ATT CCC ATT GCG ATT CTT ATG GGG CTT TAC ATG 736 Thr He Ala Met Thr He Pro He Ala He Leu Met Gly Leu Tyr Met 205 210 215 220
CGG TTT TTC AGG CCA CAC AAG ATT TTA GAG GTT TCT GTT ATT GGC TTT 784 Arg Phe Phe Arg Pro His Lys He Leu Glu Val Ser Val He Gly Phe 225 230 235
ATC CTA TTG ATT ATA GCG ATT TAT GCG GGT AAA TAC GTT TCT TTA GAT 832 He Leu Leu He He Ala He Tyr Ala Gly Lys Tyr Val Ser Leu Asp 240 245 250
CCT AAA TTA GCG TCA ATA TTC ACT TTT GAG GCC AGT TCT TTA GCG TGG 880 Pro Lys Leu Ala Ser He Phe Thr Phe Glu Ala Ser Ser Leu Ala Trp 255 260 265
ATG ATC ATG GGC TAT GGG TTT GTG GCT TCT ATT TTA CCG GTA TGG TTT 928 Met He Met Gly Tyr Gly Phe Val Ala Ser He Leu Pro Val Trp Phe 270 275 280
TTA CTC GCT CCA CGA GAT TAT CTA AGC ACT TTT TTA AAA ATT GGC GTT 976 Leu Leu Ala Pro Arg Asp Tyr Leu Ser Thr Phe Leu Lys He Gly Val 285 290 295 300
ATA GGG GTG TTG GTT GTG GCC ATT ATT TTT GTC GCT CCG CCT TTA CAA 1024 He Gly Val Leu Val Val Ala He He Phe Val Ala Pro Pro Leu Gin 305 310 315
ATC CCT AAA ATC ACG CCC TTT GTA GAT GGC AGT GGG CCT GTG TTT GCA 1072 He Pro Lys He Thr Pro Phe Val Asp Gly Ser Gly Pro Val Phe Ala 320 325 330
GGA AGC GTG TTC CCT TTC TTG TTT ATC ACG GTG GCT TGC GGG ACG ATT 1120 Gly Ser Val Phe Pro Phe Leu Phe He Thr Val Ala Cys Gly Thr He 335 340 345
AGC GGA TTC CAT GCT TTA ATT TCT TCA GGC ACG ACC CCT AAA ATG CTC 1168 Ser Gly Phe His Ala Leu He Ser Ser Gly Thr Thr Pro Lys Met Leu 350 355 360
GCT AAA GAA AGC GAC GCA AGG CTA GTG GGC TAT GGC TCT ATG GTG ATG 1216 Ala Lys Glu Ser Asp Ala Arg Leu Val Gly Tyr Gly Ser Met Val Met 365 370 375 380
GAG AGC GTT GTG GCT CTT ATG GCG TTG GTG TGC GCA GGG ATC TTG CAC 1264 Glu Ser Val Val Ala Leu Met Ala Leu Val Cys Ala Gly He Leu His 385 390 395 CCA GGG CTT TAT TTC GCT ATC AAT TCG CCA GAA GTG AGC ATC GGT AAA 1312 Pro Gly Leu Tyr Phe Ala He Asn Ser Pro Glu Val Ser He Gly Lys 400 405 410
GAT ATA GCT GAT GCG GCT TCA GTG ATT AGC TCA TGG GGG TTT AAT ATC 1360 Asp He Ala Asp Ala Ala Ser Val He Ser Ser Trp Gly Phe Asn He 415 420 425
AGC GCT GAA GAA ATT CGT GAG ATG ACT AAA AAC ATC GGC GAA AGC TCC 1408 Ser Ala Glu Glu He Arg Glu Met Thr Lys Asn He Gly Glu Ser Ser 430 435 440
ATT TTG AGC CGC ACC GGT GGG GCG CCC ACT TTT GCG ATC GGT TTA GCG 1456 He Leu Ser Arg Thr Gly Gly Ala Pro Thr Phe Ala He Gly Leu Ala 445 450 455 460
ATG ATT GTG TAT CAC ATT TTA GGG GAT CCA AGC GTG ATG GCG TTT TGG 1504 Met He Val Tyr His He Leu Gly Asp Pro Ser Val Met Ala Phe Trp 465 470 475
TAT CAT TTT GCG ATT TTG TTT GAA GCT TTG TTC ATT TTA ACC GCT GTG 1552 Tyr His Phe Ala He Leu Phe Glu Ala Leu Phe He Leu Thr Ala Val 480 485 490
GAT GCT GGC ACA CGA ACC GCT CGT TTC ATG ATT CAA GAT TTG CTC GGT 1600 Asp Ala Gly Thr Arg Thr Ala Arg Phe Met He Gin Asp Leu Leu Gly 495 500 505
AAT GTT TAT AAG CCT TTG GGC GAT CTT AGC TCT TAT AAG GCT GGG ATT 1648 Asn Val Tyr Lys Pro Leu Gly Asp Leu Ser Ser Tyr Lys Ala Gly He 510 515 520
TTT GCC ACT CTT TTG TGC GTG GCA GGG TGG GGG TAT TTC TTG TAT CAA 1696 Phe Ala Thr Leu Leu Cys Val Ala Gly Trp Gly Tyr Phe Leu Tyr Gin 525 530 535 540
GGC ACG ATT GAT CCT AAA GGG GGG ATT TAT ACG CTA TGG CCT TTA TTT 1744 Gly Thr He Asp Pro Lys Gly Gly He Tyr Thr Leu Trp Pro Leu Phe 545 550 555
GGC GTG AGC AAT CAG ATG TTA GCG GGC ATG GCG TTG TTG TTG GTC ACG 1792 Gly Val Ser Asn Gin Met Leu Ala Gly Met Ala Leu Leu Leu Val Thr 560 565 570
GTG GTG TTG TTT AAA ATG GGG CGT TTT AAG GGG GCG ATG ATA AGC GCC 1840 Val Val Leu Phe Lys Met Gly Arg Phe Lys Gly Ala Met He Ser Ala 575 580 585
TTA CCG GCA GTT TTG ATT TTA TCC ATC ACT TTT TAT AGC GGT ATT TTA 1888 Leu Pro Ala Val Leu He Leu Ser He Thr Phe Tyr Ser Gly He Leu 590 595 600
AAG GTG GTG CCA AAG AGC GAT AAC AGC GTG CTG AAT AAT GTT TCC CAT 1936 Lys Val Val Pro Lys Ser Asp Asn Ser Val Leu Asn Asn Val Ser His 605 610 615 620 GTG GCG CAA ATG CAA ATC ATC AAA GAA AAA ATG GCT ACC ACT ACC GAT 1984 Val Ala Gin Met Gin He He Lys Glu Lys Met Ala Thr Thr Thr Asp 625 630 635
GAA AAA GCG CTC AAA ACG CTC CAA AAA TCC TTT TTT AAC CAC GCT ATT 2032 Glu Lys Ala Leu Lys Thr Leu Gin Lys Ser Phe Phe Asn His Ala He 640 645 650
GAT GCG ATT TTG TGC GTG TTT TTC ATG CTT GTG GCG CTA TTG GTT TTA 2080 Asp Ala He Leu Cys Val Phe Phe Met Leu Val Ala Leu Leu Val Leu 655 660 665
ATC GTG AGC GTT AGG ATT TGC TCA AAC GCT TAT TTT AAA AAC AAA ATT 2128 He Val Ser Val Arg He Cys Ser Asn Ala Tyr Phe Lys Asn Lys He 670 675 680
TAC CCA CCG CTG GCT GAA ACG CCC TAC ATC AAA GCC TCT TGAATAAAAA AG 2179 Tyr Pro Pro Leu Ala Glu Thr Pro Tyr He Lys Ala Ser 685 690 695
GGGTTTTAAC CCCCTTTAAA TCCATAGAAA AAAGTTTGA 2218
(2) INFORMATION FOR SEQ ID NO: 76:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 697 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:
Met He Lys Gin Ser Leu Asn Gly Glu Asp Met Gin Lys Ser Leu Val
1 5 10 15
Ser Leu Ala Trp Val Phe Val Ala He Leu Gly Ala He Cys Leu Gly
20 25 30
Val Leu Ala Leu His Lys Gly Glu Ser He Asn Thr Leu Trp Leu Val
35 40 45
Val Ala Ser Ala Cys He Tyr Ser He Gly Tyr Arg Phe Tyr Ser His
50 55 60
Phe He Ala Tyr Lys Val Leu Lys Leu Asp Asp Ser Arg Ala Thr Pro 65 70 75 80
Ala Cys Val Arg Asn Asp Gly Lys Asp Phe Val Pro Thr Asp Lys Ala
85 90 95
He Thr Phe Gly His His Phe Ala Ala He Ala Gly Ala Gly Pro Leu
100 105 110
Val Gly Pro He Leu Ala Ala Gin Met Gly Tyr Leu Pro Ser He Leu
115 120 125
Trp He Leu He Gly Ser Val Leu Gly Gly Cys Val His Asp Phe Val
130 135 140
Val Leu Phe Ala Ser He Arg Arg Asp Gly Lys Ser Leu Gly Glu Met 145 150 155 160 He Lys Leu Glu Met Gly Gin Phe Val Gly Met He Ala Ser Leu Gly
165 170 175
He Leu Gly He Met Leu He He He Ala He Leu Ala Met Val Val
180 185 190
Val Lys Ala Leu Ala His Ser Pro Trp Gly Phe Phe Thr He Ala Met
195 200 205
Thr He Pro He Ala He Leu Met Gly Leu Tyr Met Arg Phe Phe Arg
210 215 220
Pro His Lys He Leu Glu Val Ser Val He Gly Phe He Leu Leu He 225 230 235 240
He Ala He Tyr Ala Gly Lys Tyr Val Ser Leu Asp Pro Lys Leu Ala
245 250 255
Ser He Phe Thr Phe Glu Ala Ser Ser Leu Ala Trp Met He Met Gly
260 265 270
Tyr Gly Phe Val Ala Ser He Leu Pro Val Trp Phe Leu Leu Ala Pro
275 280 285
Arg Asp Tyr Leu Ser Thr Phe Leu Lys He Gly Val He Gly Val Leu
290 295 300
Val Val Ala He He Phe Val Ala Pro Pro Leu Gin He Pro Lys He 305 310 315 320
Thr Pro Phe Val Asp Gly Ser Gly Pro Val Phe Ala Gly Ser Val Phe
325 330 335
Pro Phe Leu Phe He Thr Val Ala Cys Gly Thr He Ser Gly Phe His
340 345 350
Ala Leu He Ser Ser Gly Thr Thr Pro Lys Met Leu Ala Lys Glu Ser
355 360 365
Asp Ala Arg Leu Val Gly Tyr Gly Ser Met Val Met Glu Ser Val Val
370 375 380
Ala Leu Met Ala Leu Val Cys Ala Gly He Leu His Pro Gly Leu Tyr 385 390 395 400
Phe Ala He Asn Ser Pro Glu Val Ser He Gly Lys Asp He Ala Asp
405 410 415
Ala Ala Ser Val He Ser Ser Trp Gly Phe Asn He Ser Ala Glu Glu
420 425 430
He Arg Glu Met Thr Lys Asn He Gly Glu Ser Ser He Leu Ser Arg
435 440 445
Thr Gly Gly Ala Pro Thr Phe Ala He Gly Leu Ala Met He Val Tyr
450 455 460
His He Leu Gly Asp Pro Ser Val Met Ala Phe Trp Tyr His Phe Ala 465 470 475 480
He Leu Phe Glu Ala Leu Phe He Leu Thr Ala Val Asp Ala Gly Thr
485 490 495
Arg Thr Ala Arg Phe Met He Gin Asp Leu Leu Gly Asn Val Tyr Lys
500 505 510
Pro Leu Gly Asp Leu Ser Ser Tyr Lys Ala Gly He Phe Ala Thr Leu
515 520 525
Leu Cys Val Ala Gly Trp Gly Tyr Phe Leu Tyr Gin Gly Thr He Asp
530 535 540
Pro Lys Gly Gly He Tyr Thr Leu Trp Pro Leu Phe Gly Val Ser Asn 545 550 555 560
Gin Met Leu Ala Gly Met Ala Leu Leu Leu Val Thr Val Val Leu Phe
565 570 575
Lys Met Gly Arg Phe Lys Gly Ala Met He Ser Ala Leu Pro Ala Val
580 585 590
Leu He Leu Ser He Thr Phe Tyr Ser Gly He Leu Lys Val Val Pro 595 600 605
Lys Ser Asp Asn Ser Val Leu Asn Asn Val Ser His Val Ala Gin Met
610 615 620
Gin He He Lys Glu Lys Met Ala Thr Thr Thr Asp Glu Lys Ala Leu 625 630 635 640
Lys Thr Leu Gin Lys Ser Phe Phe Asn His Ala He Asp Ala He Leu
645 650 655
Cys Val Phe Phe Met Leu Val Ala Leu Leu Val Leu He Val Ser Val
660 665 670
Arg He Cys Ser Asn Ala Tyr Phe Lys Asn Lys He Tyr Pro Pro Leu
675 680 685
Ala Glu Thr Pro Tyr He Lys Ala Ser 690 695
(2) INFORMATION FOR SEQ ID NO: 77:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 911 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 121...861 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:
TAAGGGGCTT TGCATTTTTT ACTCATTTCA TGCCTCTTTT TCTTTATTTA GACAGATTAT 60 TATCTTAAAA TAATTGTAAT ATCATTATTA TTATATCAAC TCAATAAAAA AGGAGAAGGT 120 ATG CAA AAA ACT TCT AAC ACT CTG GCG CTG GGG AGT TTG ACA GCG CTA 168 Met Gin Lys Thr Ser Asn Thr Leu Ala Leu Gly Ser Leu Thr Ala Leu 1 5 10 15
TTC TTT CTA ATG GGT TTT ATC ACG GTT TTA AAC GAT ATT TTA ATC CCA 216 Phe Phe Leu Met Gly Phe He Thr Val Leu Asn Asp He Leu He Pro 20 25 30
CAC TTA AAG CCC ATT TTT GAC TTG ACC TAT TTT GAA GCT TCA CTC ATT 264 His Leu Lys Pro He Phe Asp Leu Thr Tyr Phe Glu Ala Ser Leu He 35 40 45
CAA TTT TGC TTT TTT GGG GCG TAT TTC ATC ATG GGA GGA GTT TTT GGG 312 Gin Phe Cys Phe Phe Gly Ala Tyr Phe He Met Gly Gly Val Phe Gly 50 55 60
AAT GTG ATC AGT AAA ATC GGC TAC CCT TTT GGC GTG GTG CTT GGT TTT 360 Asn Val He Ser Lys He Gly Tyr Pro Phe Gly Val Val Leu Gly Phe 65 70 75 80 GTG ATC ACA GCG ACG GGG TGC GCG TTG TTT TAT CCG GCG GCG CAT TTT 408 Val He Thr Ala Thr Gly Cys Ala Leu Phe Tyr Pro Ala Ala His Phe 85 90 95
GGA TCC TAT GGG TTT TTT TTA GGA GCG TTG TTT ATT TTA GCG AGC GGG 456 Gly Ser Tyr Gly Phe Phe Leu Gly Ala Leu Phe He Leu Ala Ser Gly 100 105 110
ATT GTG TGC TTG CAA ACC GCT GGT AAT CCC TTT GTA ACC TTG CTT TCT 504 He Val Cys Leu Gin Thr Ala Gly Asn Pro Phe Val Thr Leu Leu Ser 115 120 125
AAA GGT AAA GAA GCC AGA AAT TTG GTT TTA GTC CAG GCG TTC AAT TCG 552 Lys Gly Lys Glu Ala Arg Asn Leu Val Leu Val Gin Ala Phe Asn Ser 130 135 140
CTT GGC ACA ACT TTA GGG CCT ATT TTT GGG AGC TTG TTG ATT TTT AGC 600 Leu Gly Thr Thr Leu Gly Pro He Phe Gly Ser Leu Leu He Phe Ser 145 150 155 160
ACG ACT AAA ATG GGC GAT AAT GCA AGT TTG ATA GAT AAA TTA GCG GAC 648 Thr Thr Lys Met Gly Asp Asn Ala Ser Leu He Asp Lys Leu Ala Asp 165 170 175
GCT AAA AGC GTT CAA ATG CCT TAT TTG GGC TTG GCG GTG TTT TCG CTT 696 Ala Lys Ser Val Gin Met Pro Tyr Leu Gly Leu Ala Val Phe Ser Leu 180 185 190
CTT TTA GCG CTC ATC ATG TAT CTT TTG AAA TTG CCT GAT GTG GAA AAA 744 Leu Leu Ala Leu He Met Tyr Leu Leu Lys Leu Pro Asp Val Glu Lys 195 200 205
GAA ATG CCC AAA GAG ACG ACT CAA AAA AGC TTG TTT TCG CAC AAA CAC 792 Glu Met Pro Lys Glu Thr Thr Gin Lys Ser Leu Phe Ser His Lys His 210 215 220
TTT GTT TTT GGG GCT TGG GGA TCT TTT TTT ATG TGG GGG GAG AAN TGG 840 Phe Val Phe Gly Ala Trp Gly Ser Phe Phe Met Trp Gly Glu Xaa Trp 225 230 235 240
CGA TTG GCT CAT TCT TGG TGC TAAGCTTTGA AAAGCTTTTG AATTTAGACT CTCA 895 Arg Leu Ala His Ser Trp Cys 245
ATCAAGCGCG CATTAC 911
(2) INFORMATION FOR SEQ ID NO: 78:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 247 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78:
Met Gin Lys Thr Ser Asn Thr Leu Ala Leu Gly Ser Leu Thr Ala Leu
1 5 10 15
Phe Phe Leu Met Gly Phe He Thr Val Leu Asn Asp He Leu He Pro
20 25 30
His Leu Lys Pro He Phe Asp Leu Thr Tyr Phe Glu Ala Ser Leu He
35 40 45
Gin Phe Cys Phe Phe Gly Ala Tyr Phe He Met Gly Gly Val Phe Gly
50 55 60
Asn Val He Ser Lys He Gly Tyr Pro Phe Gly Val Val Leu Gly Phe 65 70 75 80
Val He Thr Ala Thr Gly Cys Ala Leu Phe Tyr Pro Ala Ala His Phe
85 90 95
Gly Ser Tyr Gly Phe Phe Leu Gly Ala Leu Phe He Leu Ala Ser Gly
100 105 110
He Val Cys Leu Gin Thr Ala Gly Asn Pro Phe Val Thr Leu Leu Ser
115 120 125
Lys Gly Lys Glu Ala Arg Asn Leu Val Leu Val Gin Ala Phe Asn Ser
130 135 140
Leu Gly Thr Thr Leu Gly Pro He Phe Gly Ser Leu Leu He Phe Ser 145 150 155 160
Thr Thr Lys Met Gly Asp Asn Ala Ser Leu He Asp Lys Leu Ala Asp
165 170 175
Ala Lys Ser Val Gin Met Pro Tyr Leu Gly Leu Ala Val Phe Ser Leu
180 185 190
Leu Leu Ala Leu He Met Tyr Leu Leu Lys Leu Pro Asp Val Glu Lys
195 200 205
Glu Met Pro Lys Glu Thr Thr Gin Lys Ser Leu Phe Ser His Lys His
210 215 220
Phe Val Phe Gly Ala Trp Gly Ser Phe Phe Met Trp Gly Glu Xaa Trp 225 230 235 240
Arg Leu Ala His Ser Trp Cys 245
(2) INFORMATION FOR SEQ ID NO: 79:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3084 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...3027 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:
GAATTAATGC ATTAAATAAC TCAAAATTTT TGATCAAAGG CTTGAAAT ATG TCA AAA 57
Met Ser Lys 1
AAA ATT CCC CTA AAA AAC CGC TTG AGA GCT GAT TTT ACA AAA ACC CCA 105 Lys He Pro Leu Lys Asn Arg Leu Arg Ala Asp Phe Thr Lys Thr Pro 5 10 15
ACA GAT TTA GAA GTC CCT AAT TTA TTA TTA TTA CAA CGA GAC AGC TAT 153 Thr Asp Leu Glu Val Pro Asn Leu Leu Leu Leu Gin Arg Asp Ser Tyr 20 25 30 35
GAT TCT TTC TTG TAT TCT AAA GAG GGT AAA GAG AGC GGG ATT GAA AAG 201 Asp Ser Phe Leu Tyr Ser Lys Glu Gly Lys Glu Ser Gly He Glu Lys 40 45 50
GTT TTT AAA TCC ATT TTC CCT ATC CAA GAT GAG CAT AAC CGC ATC ACT 249 Val Phe Lys Ser He Phe Pro He Gin Asp Glu His Asn Arg He Thr 55 60 65
TTA GAA TAC GCG GGT TGC GAA TTT GGC AAG TCT AAA TAC ACC GTT AGA 297 Leu Glu Tyr Ala Gly Cys Glu Phe Gly Lys Ser Lys Tyr Thr Val Arg 70 75 80
GAA GCG ATG GAG AGG GGC ATT ACC TAC TCT ATC CCT CTC AAA ATT AAG 345 Glu Ala Met Glu Arg Gly He Thr Tyr Ser He Pro Leu Lys He Lys 85 90 95
GTG CGC TTG ATC TTG TGG GAA AAA GAT ACC AAG AGT GGC GAA AAG AAC 393 Val Arg Leu He Leu Trp Glu Lys Asp Thr Lys Ser Gly Glu Lys Asn 100 105 110 115
GGC ATT AAG GAT ATT AAA GAA CAA AGC ATT TTC ATT CGT GAG ATC CCT 441 Gly He Lys Asp He Lys Glu Gin Ser He Phe He Arg Glu He Pro 120 125 130
TTG ATG ACA GAA CGC ACT TCA TTT ATT ATT AAT GGG GTG GAG CGC GTG 489 Leu Met Thr Glu Arg Thr Ser Phe He He Asn Gly Val Glu Arg Val 135 140 145
GTG GTC AAT CAA CTC CAC AGA AGC CCC GGT GTG ATT TTC AAA GAA GAA 537 Val Val Asn Gin Leu His Arg Ser Pro Gly Val He Phe Lys Glu Glu 150 155 160
GAG TCT AGC ACT TCT TTA AAC AAG CTC ATT TAC ACA GGG CAA ATC ATT 585 Glu Ser Ser Thr Ser Leu Asn Lys Leu He Tyr Thr Gly Gin He He 165 170 175
CCT GAT AGG GGT TCG TGG TTG TAT TTT GAA TAC GAT TCT AAA GAT GTT 633 Pro Asp Arg Gly Ser Trp Leu Tyr Phe Glu Tyr Asp Ser Lys Asp Val 180 185 190 195
TTA TAC GCT CGT ATC AAT AAA CGC CGT AAA GTG CCT GTT ACC ATT TTA 681 Leu Tyr Ala Arg He Asn Lys Arg Arg Lys Val Pro Val Thr He Leu 200 205 210
TTC AGG GCG ATG GAT TAT CAA AAA CAA GAC ATC ATC AAA ATG TTC TAC 729 Phe Arg Ala Met Asp Tyr Gin Lys Gin Asp He He Lys Met Phe Tyr 215 220 225
CCG CTT GTT AAA GTG CGT TAT GAA AAC GAT AAA TAT TTG ATC CCG TTT 777 Pro Leu Val Lys Val Arg Tyr Glu Asn Asp Lys Tyr Leu He Pro Phe 230 235 240
GCT TCA TTA GAC GCC AAT CAA AGA ATG GAA TTT GAC TTG AAA GAT CCT 825 Ala Ser Leu Asp Ala Asn Gin Arg Met Glu Phe Asp Leu Lys Asp Pro 245 250 255
CAA GGC AAG GTT ATT CTT TTA GCG GGT AAA AAG CTC ACT TCA AGA AAG 873 Gin Gly Lys Val He Leu Leu Ala Gly Lys Lys Leu Thr Ser Arg Lys 260 265 270 275
ATT AAA GAG CTT AAA GAA AAC CAT TTA GAA TGG GTG GAA TAC CCT ATG 921 He Lys Glu Leu Lys Glu Asn His Leu Glu Trp Val Glu Tyr Pro Met 280 285 290
GAT ATT TTA CTC AAT CGC CAT TTA GCT GAG CCT GTT ATG GTA GGG AAA 969 Asp He Leu Leu Asn Arg His Leu Ala Glu Pro Val Met Val Gly Lys 295 300 305
GAA GTC TTA TTG GAC ATG CTC ACT CAG CTA GAT AAA AAC AAA TTA GAA 1017 Glu Val Leu Leu Asp Met Leu Thr Gin Leu Asp Lys Asn Lys Leu Glu 310 315 320
AAA ATC CAC GAT TTA GGC GTG CAA GAA TTT GTG ATC ATC AAC GAT CTG 1065 Lys He His Asp Leu Gly Val Gin Glu Phe Val He He Asn Asp Leu 325 330 335
GCG TTA GGG CAT GAC GCT TCC ATT ATC CAA TCT TTT TCA GCC GAT TCT 1113 Ala Leu Gly His Asp Ala Ser He He Gin Ser Phe Ser Ala Asp Ser 340 345 350 355
GAG TCT TTG AAA TTA CTC AAG CAA ACC GAA AAA ATT GAT GAT GAA AAC 1161 Glu Ser Leu Lys Leu Leu Lys Gin Thr Glu Lys He Asp Asp Glu Asn 360 365 370
GCT CTA GCG GCG ATT CGT ATC CAT AAG GTT ATG AAA CCA GGC GAT CCC 1209 Ala Leu Ala Ala He Arg He His Lys Val Met Lys Pro Gly Asp Pro 375 380 385
GTT ACG ACT GAA GTG GCT AAG CAG TTT GTC AAA AAA CTT TTC TTT GAT 1257 Val Thr Thr Glu Val Ala Lys Gin Phe Val Lys Lys Leu Phe Phe Asp 390 395 400
CCA GAA CGC TAT GAT TTG ACC ATG GTG GGC CGC ATG AAA ATG AAT CAC 1305 Pro Glu Arg Tyr Asp Leu Thr Met Val Gly Arg Met Lys Met Asn His 405 410 415 AAG TTA GGC TTG CAT GTG CCT GAT TAC ATT ACG ACT TTA ACG CAT GAA 1353 Lys Leu Gly Leu His Val Pro Asp Tyr He Thr Thr Leu Thr His Glu 420 425 430 435
GAT ATT ATC ACC ACC GTT AAA TAC CTC ATG AAG ATC AAA AAC AAT CAA 1401 Asp He He Thr Thr Val Lys Tyr Leu Met Lys He Lys Asn Asn Gin 440 445 450
GGC AAG ATT GAT GAC AGG GAC CAC TTG GGC AAT CGT AGG ATT AGG GCG 1449 Gly Lys He Asp Asp Arg Asp His Leu Gly Asn Arg Arg He Arg Ala 455 460 465
GTA GGG GAA TTG TTG GCC AAT GAA TTG CAT TCA GGT TTA GTG AAA ATG 1497 Val Gly Glu Leu Leu Ala Asn Glu Leu His Ser Gly Leu Val Lys Met 470 475 480
CAA AAG ACC ATT AAA GAC AAG CTC ACT ACC ATG AGC GGG GCT TTT GAT 1545 Gin Lys Thr He Lys Asp Lys Leu Thr Thr Met Ser Gly Ala Phe Asp 485 490 495
TCG CTC ATG CCC CAT GAC TTG GTC AAT TCT AAA ATG ATC ACA AGC ACC 1593 Ser Leu Met Pro His Asp Leu Val Asn Ser Lys Met He Thr Ser Thr 500 505 510 515
ATC ATG GAA TTT TTC ATG GGC GGT CAG CTC TCG CAA TTT ATG GAT CAA 1641 He Met Glu Phe Phe Met Gly Gly Gin Leu Ser Gin Phe Met Asp Gin 520 525 530
ACG AAT CCC TTG AGT GAG GTT ACG CAC AAG CGC CGC CTT TCA GCG CTC 1689 Thr Asn Pro Leu Ser Glu Val Thr His Lys Arg Arg Leu Ser Ala Leu 535 540 545
GGC GAA GGG GGG TTG GTG AAA GAC AGA GTG GGG TTT GAA GCC AGG GAT 1737 Gly Glu Gly Gly Leu Val Lys Asp Arg Val Gly Phe Glu Ala Arg Asp 550 555 560
GTG CAC CCC ACG CAT TAT GGC CGA ATT TGT CCC ATT GAG ACC CCA GAA 1785 Val His Pro Thr His Tyr Gly Arg He Cys Pro He Glu Thr Pro Glu 565 570 575
GGT CAA AAT ATC GGT CTG ATC AAC ACC CTT TCC ACT TTC ACA AGA GTG 1833 Gly Gin Asn He Gly Leu He Asn Thr Leu Ser Thr Phe Thr Arg Val 580 585 590 595
AAT GAT TTA GGC TTT ATT GAA GCC CCT TAT AAA AAG GTT GTG GAT GGC 1881 Asn Asp Leu Gly Phe He Glu Ala Pro Tyr Lys Lys Val Val Asp Gly 600 605 610
AAG GTC GTG GGT GAG ACG ATT TAT TTG ACC GCT ATT CAA GAA GAC AGC 1929 Lys Val Val Gly Glu Thr He Tyr Leu Thr Ala He Gin Glu Asp Ser 615 620 625
CAC ATC ATC GCT CCC GCA AGC ACC CCC ATT GAT GAA GAG GGG AAT ATT 1977 His He He Ala Pro Ala Ser Thr Pro He Asp Glu Glu Gly Asn He 630 635 640 TTG GGC GAT TTG ATT GAA ACG CGC GTG GAA GGC GAG ATC GTT TTA AAC 2025 Leu Gly Asp Leu He Glu Thr Arg Val Glu Gly Glu He Val Leu Asn 645 650 655
GAA AAA AGC AAA GTA ACC TTA ATG GAT TTA AGC TCT AGC ATG CTA GTG 2073 Glu Lys Ser Lys Val Thr Leu Met Asp Leu Ser Ser Ser Met Leu Val 660 665 670 675
GGG GTA GCC GCA TCG CTC ATT CCT TTC TTA GAG CAT GAT GAC GCC AAC 2121 Gly Val Ala Ala Ser Leu He Pro Phe Leu Glu His Asp Asp Ala Asn 680 685 690
CGT GCC TTA ATG GGG ACT AAC ATG CAG CGC CAA GCG GTG CCC TTA TTA 2169 Arg Ala Leu Met Gly Thr Asn Met Gin Arg Gin Ala Val Pro Leu Leu 695 700 705
AGA AGC GAC GCT CCC ATT GTA GGC ACG GGG ATT GAA AAA ATT ATT GCT 2217 Arg Ser Asp Ala Pro He Val Gly Thr Gly He Glu Lys He He Ala 710 715 720
AGG GAT TCT TGG GGA GCG ATC AAA GCC AAT CGC GCA GGC GTT GTA GAA 2265 Arg Asp Ser Trp Gly Ala He Lys Ala Asn Arg Ala Gly Val Val Glu 725 730 735
AAA ATT GAT TCT AAA AAT ATT TAT ATT TTA GGC GAA AGC AAA GAA GAA 2313 Lys He Asp Ser Lys Asn He Tyr He Leu Gly Glu Ser Lys Glu Glu 740 745 750 755
GCC TAT ATT GAT GCG TAT TCT TTG CAA AAA AAC TTG CGC ACC AAC CAA 2361 Ala Tyr He Asp Ala Tyr Ser Leu Gin Lys Asn Leu Arg Thr Asn Gin 760 765 770
AAC ACC AGT TTC AAT CAA GTC CCT ATC GTT AAA GTG GGC GAT AAA GTG 2409 Asn Thr Ser Phe Asn Gin Val Pro He Val Lys Val Gly Asp Lys Val 775 780 785
GGA GCC GGG CAA ATC ATC GCT GAT GGC CCT AGC ATG GAT AGA GGC GAG 2457 Gly Ala Gly Gin He He Ala Asp Gly Pro Ser Met Asp Arg Gly Glu 790 795 800
TTG GCG TTA GGG AAA AAT GTG CGC GTG GCG TTC ATG CCT TGG AAT GGC 2505 Leu Ala Leu Gly Lys Asn Val Arg Val Ala Phe Met Pro Trp Asn Gly 805 810 815
TAT AAC TTT GAA GAC GCG ATC GTG GTG AGT GAG TGC ATC ACT AAA GAT 2553 Tyr Asn Phe Glu Asp Ala He Val Val Ser Glu Cys He Thr Lys Asp 820 825 830 835
GAT ATT TTC ACT TCC ACC CAC ATT TAT GAA AAA GAA GTG GAT GCT AGG 2601 Asp He Phe Thr Ser Thr His He Tyr Glu Lys Glu Val Asp Ala Arg 840 845 850
GAG CTT AAG CAT GGT GTG GAA GAA TTT ACC GCT GAT ATT CCT GAT GTG 2649 Glu Leu Lys His Gly Val Glu Glu Phe Thr Ala Asp He Pro Asp Val 855 860 865 AAA GAA GAA GCG CTC GCT CAT CTT GAT GAA AGC GGG ATC GTT AAA GTC 2697 Lys Glu Glu Ala Leu Ala His Leu Asp Glu Ser Gly He Val Lys Val 870 875 880
GGT ACT TAT GTG AGC GCT GGC ATG ATT TTG GTG GGC AAA ACT TCT CCT 2745 Gly Thr Tyr Val Ser Ala Gly Met He Leu Val Gly Lys Thr Ser Pro 885 890 895
AAA GGC GAG ATT AAA AGC ACG CCT GAA GAG CGG CTT TTA AGG GCT ATT 2793 Lys Gly Glu He Lys Ser Thr Pro Glu Glu Arg Leu Leu Arg Ala He 900 905 910 915
TTT GGG GAT AAA GCC GGG CAT GTG GTC AAT AAG AGT TTG TAT TGC CCT 2841 Phe Gly Asp Lys Ala Gly His Val Val Asn Lys Ser Leu Tyr Cys Pro 920 925 930
CCC AGT TTG GAA GGC ACG GTG ATT GAT GTG AAA GTC TTC ACT AAA AAA 2889 Pro Ser Leu Glu Gly Thr Val He Asp Val Lys Val Phe Thr Lys Lys 935 940 945
GGC TAT GAG AAA GAC GCG CGA GTT TTG AGC GCG TAT GAA GAA GAA AAA 2937 Gly Tyr Glu Lys Asp Ala Arg Val Leu Ser Ala Tyr Glu Glu Glu Lys 950 955 960
GCC AAG CTT GAT ATG GAG CAT TTT GAT CGC TTG ACC ATG CTC AAT AGA 2985 Ala Lys Leu Asp Met Glu His Phe Asp Arg Leu Thr Met Leu Asn Arg 965 970 975
GAA GAA TTG TTG CGC GTT ACT CGC TCC TTT CTC AAG CGA TTT TAGAAGAGC 3036 Glu Glu Leu Leu Arg Val Thr Arg Ser Phe Leu Lys Arg Phe 980 985 990
CTTTCAGCCA TAACGGCAAG GATTATAAAG AAGGCGATCA AATCCCTA 3084
(2) INFORMATION FOR SEQ ID NO: 80:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 993 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:
Met Ser Lys Lys He Pro Leu Lys Asn Arg Leu Arg Ala Asp Phe Thr
1 5 10 15
Lys Thr Pro Thr Asp Leu Glu Val Pro Asn Leu Leu Leu Leu Gin Arg
20 25 30
Asp Ser Tyr Asp Ser Phe Leu Tyr Ser Lys Glu Gly Lys Glu Ser Gly
35 40 45
He Glu Lys Val Phe Lys Ser He Phe Pro He Gin Asp Glu His Asn 50 55 60 Arg He Thr Leu Glu Tyr Ala Gly Cys Glu Phe Gly Lys Ser Lys Tyr 65 70 75 80
Thr Val Arg Glu Ala Met Glu Arg Gly He Thr Tyr Ser He Pro Leu
85 90 95
Lys He Lys Val Arg Leu He Leu Trp Glu Lys Asp Thr Lys Ser Gly
100 105 110
Glu Lys Asn Gly He Lys Asp He Lys Glu Gin Ser He Phe He Arg
115 120 125
Glu He Pro Leu Met Thr Glu Arg Thr Ser Phe He He Asn Gly Val
130 135 140
Glu Arg Val Val Val Asn Gin Leu His Arg Ser Pro Gly Val He Phe 145 150 155 160
Lys Glu Glu Glu Ser Ser Thr Ser Leu Asn Lys Leu He Tyr Thr Gly
165 170 175
Gin He He Pro Asp Arg Gly Ser Trp Leu Tyr Phe Glu Tyr Asp Ser
180 185 190
Lys Asp Val Leu Tyr Ala Arg He Asn Lys Arg Arg Lys Val Pro Val
195 200 205
Thr He Leu Phe Arg Ala Met Asp Tyr Gin Lys Gin Asp He He Lys
210 215 220
Met Phe Tyr Pro Leu Val Lys Val Arg Tyr Glu Asn Asp Lys Tyr Leu 225 230 235 240
He Pro Phe Ala Ser Leu Asp Ala Asn Gin Arg Met Glu Phe Asp Leu
245 250 255
Lys Asp Pro Gin Gly Lys Val He Leu Leu Ala Gly Lys Lys Leu Thr
260 265 270
Ser Arg Lys He Lys Glu Leu Lys Glu Asn His Leu Glu Trp Val Glu
275 280 285
Tyr Pro Met Asp He Leu Leu Asn Arg His Leu Ala Glu Pro Val Met
290 295 300
Val Gly Lys Glu Val Leu Leu Asp Met Leu Thr Gin Leu Asp Lys Asn 305 310 315 320
Lys Leu Glu Lys He His Asp Leu Gly Val Gin Glu Phe Val He He
325 330 335
Asn Asp Leu Ala Leu Gly His Asp Ala Ser He He Gin Ser Phe Ser
340 345 350
Ala Asp Ser Glu Ser Leu Lys Leu Leu Lys Gin Thr Glu Lys He Asp
355 360 365
Asp Glu Asn Ala Leu Ala Ala He Arg He His Lys Val Met Lys Pro
370 375 380
Gly Asp Pro Val Thr Thr Glu Val Ala Lys Gin Phe Val Lys Lys Leu 385 390 395 400
Phe Phe Asp Pro Glu Arg Tyr Asp Leu Thr Met Val Gly Arg Met Lys
405 410 415
Met Asn His Lys Leu Gly Leu His Val Pro Asp Tyr He Thr Thr Leu
420 425 430
Thr His Glu Asp He He Thr Thr Val Lys Tyr Leu Met Lys He Lys
435 440 445
Asn Asn Gin Gly Lys He Asp Asp Arg Asp His Leu Gly Asn Arg Arg
450 455 460
He Arg Ala Val Gly Glu Leu Leu Ala Asn Glu Leu His Ser Gly Leu 465 470 475 480
Val Lys Met Gin Lys Thr He Lys Asp Lys Leu Thr Thr Met Ser Gly
485 490 495
Ala Phe Asp Ser Leu Met Pro His Asp Leu Val Asn Ser Lys Met He 500 505 510
Thr Ser Thr He Met Glu Phe Phe Met Gly Gly Gin Leu Ser Gin Phe
515 520 525
Met Asp Gin Thr Asn Pro Leu Ser Glu Val Thr His Lys Arg Arg Leu
530 535 540
Ser Ala Leu Gly Glu Gly Gly Leu Val Lys Asp Arg Val Gly Phe Glu 545 550 555 560
Ala Arg Asp Val His Pro Thr His Tyr Gly Arg He Cys Pro He Glu
565 570 575
Thr Pro Glu Gly Gin Asn He Gly Leu He Asn Thr Leu Ser Thr Phe
580 585 590
Thr Arg Val Asn Asp Leu Gly Phe He Glu Ala Pro Tyr Lys Lys Val
595 600 605
Val Asp Gly Lys Val Val Gly Glu Thr He Tyr Leu Thr Ala He Gin
610 615 620
Glu Asp Ser His He He Ala Pro Ala Ser Thr Pro He Asp Glu Glu 625 630 635 640
Gly Asn He Leu Gly Asp Leu He Glu Thr Arg Val Glu Gly Glu He
645 650 655
Val Leu Asn Glu Lys Ser Lys Val Thr Leu Met Asp Leu Ser Ser Ser
660 665 670
Met Leu Val Gly Val Ala Ala Ser Leu He Pro Phe Leu Glu His Asp
675 680 685
Asp Ala Asn Arg Ala Leu Met Gly Thr Asn Met Gin Arg Gin Ala Val
690 695 700
Pro Leu Leu Arg Ser Asp Ala Pro He Val Gly Thr Gly He Glu Lys 705 710 715 720
He He Ala Arg Asp Ser Trp Gly Ala He Lys Ala Asn Arg Ala Gly
725 730 735
Val Val Glu Lys He Asp Ser Lys Asn He Tyr He Leu Gly Glu Ser
740 745 750
Lys Glu Glu Ala Tyr He Asp Ala Tyr Ser Leu Gin Lys Asn Leu Arg
755 760 765
Thr Asn Gin Asn Thr Ser Phe Asn Gin Val Pro He Val Lys Val Gly
770 775 780
Asp Lys Val Gly Ala Gly Gin He He Ala Asp Gly Pro Ser Met Asp 785 790 795 800
Arg Gly Glu Leu Ala Leu Gly Lys Asn Val Arg Val Ala Phe Met Pro
805 810 815
Trp Asn Gly Tyr Asn Phe Glu Asp Ala He Val Val Ser Glu Cys He
820 825 830
Thr Lys Asp Asp He Phe Thr Ser Thr His He Tyr Glu Lys Glu Val
835 840 845
Asp Ala Arg Glu Leu Lys His Gly Val Glu Glu Phe Thr Ala Asp He
850 855 860
Pro Asp Val Lys Glu Glu Ala Leu Ala His Leu Asp Glu Ser Gly He 865 870 875 880
Val Lys Val Gly Thr Tyr Val Ser Ala Gly Met He Leu Val Gly Lys
885 890 895
Thr Ser Pro Lys Gly Glu He Lys Ser Thr Pro Glu Glu Arg Leu Leu
900 905 910
Arg Ala He Phe Gly Asp Lys Ala Gly His Val Val Asn Lys Ser Leu
915 920 925
Tyr Cys Pro Pro Ser Leu Glu Gly Thr Val He Asp Val Lys Val Phe 930 935 940 Thr Lys Lys Gly Tyr Glu Lys Asp Ala Arg Val Leu Ser Ala Tyr Glu 945 950 955 960
Glu Glu Lys Ala Lys Leu Asp Met Glu His Phe Asp Arg Leu Thr Met
965 970 975
Leu Asn Arg Glu Glu Leu Leu Arg Val Thr Arg Ser Phe Leu Lys Arg
980 985 990
Phe
(2) INFORMATION FOR SEQ ID NO: 81:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 581 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...525 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:
AGGTTAAAGT TTAAGACAAA CCAAAGAGTT TGTCTTGTTT GTTTTTGA ATG CAC TCT 57
Met His Ser
1
CCA AAT TTA GAA AAA GAA GAA ACC GAA ATC ATA GAA ACA CTC CTT ATG 105 Pro Asn Leu Glu Lys Glu Glu Thr Glu He He Glu Thr Leu Leu Met 5 10 15
CGT GAA AAA ATG CGT TTA TGC CCC TTG TAT TGG CGC ATC TTA GCG TTT 153 Arg Glu Lys Met Arg Leu Cys Pro Leu Tyr Trp Arg He Leu Ala Phe 20 25 30 35
TTA ACC GAT GGT TTG TTA GTG GCG TTT TTA TTG AGC GAT CTT TTA GAC 201 Leu Thr Asp Gly Leu Leu Val Ala Phe Leu Leu Ser Asp Leu Leu Asp 40 45 50
GCA TGC GAT TTC TTG CAT TCT TTA TAT TGG CTA GCT AAC CCT ATT TAT 249 Ala Cys Asp Phe Leu His Ser Leu Tyr Trp Leu Ala Asn Pro He Tyr 55 60 65
CAC AGC GCA TTT GTT GCG ATG GGT TTT ATC ATC TTG TAT GGC GTT TAT 297 His Ser Ala Phe Val Ala Met Gly Phe He He Leu Tyr Gly Val Tyr 70 75 80
GAA ATC TTT TTT GTG TGT TTG TGC AAG ATG AGC TTG GCT AAA CTG GTT 345 Glu He Phe Phe Val Cys Leu Cys Lys Met Ser Leu Ala Lys Leu Val 85 90 95
TTT AGG ATT AAG ATT ATT GAT ATT TAT TTG GCA GAT TGC CCC AGT AGG 393 Phe Arg He Lys He He Asp He Tyr Leu Ala Asp Cys Pro Ser Arg 100 105 110 115
GCT ATT TTA TTG AAG CGT TTA GGG TTA AAG ATC GTG GTT TTT CTA TGC 441 Ala He Leu Leu Lys Arg Leu Gly Leu Lys He Val Val Phe Leu Cys 120 125 130
CCC TTT TTA TGG TTT GTT GCG TTT AAA AAC CCC TAT CAT AGG GCG TGG 489 Pro Phe Leu Trp Phe Val Ala Phe Lys Asn Pro Tyr His Arg Ala Trp 135 140 145
CAT GAA GAA AAA AGC AAA AGT CTT TTG GTA TTG TTT TAATCATGAT TTATTG 541 His Glu Glu Lys Ser Lys Ser Leu Leu Val Leu Phe 150 155
GTTGTATTTG GCGGTCTTTT TTTTGTTGAG CGCATTAGAC 581
(2) INFORMATION FOR SEQ ID NO: 82:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 159 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:
Met His Ser Pro Asn Leu Glu Lys Glu Glu Thr Glu He He Glu Thr
1 5 10 15
Leu Leu Met Arg Glu Lys Met Arg Leu Cys Pro Leu Tyr Trp Arg He
20 25 30
Leu Ala Phe Leu Thr Asp Gly Leu Leu Val Ala Phe Leu Leu Ser Asp
35 40 45
Leu Leu Asp Ala Cys Asp Phe Leu His Ser Leu Tyr Trp Leu Ala Asn
50 55 60
Pro He Tyr His Ser Ala Phe Val Ala Met Gly Phe He He Leu Tyr 65 70 75 80
Gly Val Tyr Glu He Phe Phe Val Cys Leu Cys Lys Met Ser Leu Ala
85 90 95
Lys Leu Val Phe Arg He Lys He He Asp He Tyr Leu Ala Asp Cys
100 105 110
Pro Ser Arg Ala He Leu Leu Lys Arg Leu Gly Leu Lys He Val Val
115 120 125
Phe Leu Cys Pro Phe Leu Trp Phe Val Ala Phe Lys Asn Pro Tyr His
130 135 140
Arg Ala Trp His Glu Glu Lys Ser Lys Ser Leu Leu Val Leu Phe 145 150 155
(2) INFORMATION FOR SEQ ID NO: 83: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 901 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 67...852 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:
GAATTTAAGC GGCAGAGGGG ATAAGGATTT AAGCACCGTT TATAACGCTT TAAAAGGAGG 60 TTTAAA ATG AGG TAT CAA AAC ATG TTT GAA ACC TTA AAA AAA CAC GAA 108 Met Arg Tyr Gin Asn Met Phe Glu Thr Leu Lys Lys His Glu 1 5 10
AAA ATG GCG TTT ATC CCG TTT GTA ACC TTG GGC GAT CCT AAT TAT GAA 156 Lys Met Ala Phe He Pro Phe Val Thr Leu Gly Asp Pro Asn Tyr Glu 15 20 25 30
TTG AGT TTT GAA ATC ATT AAA ACC CTA ATT ATT AGC GGG GTG AGC GCT 204 Leu Ser Phe Glu He He Lys Thr Leu He He Ser Gly Val Ser Ala 35 40 45
TTA GAA TTG GGT CTT GCT TTT TCT GAT CCT GTG GCG GAT GGC ATT ACC 252 Leu Glu Leu Gly Leu Ala Phe Ser Asp Pro Val Ala Asp Gly He Thr 50 55 60
ATA CAA GCG AGC CAT TTA AGG GCG TTA AAA CAC GCT AGC ATG GCT AAA 300 He Gin Ala Ser His Leu Arg Ala Leu Lys His Ala Ser Met Ala Lys 65 70 75
AAT TTC CAG CTT TTA AAA AAG ATT AGA GAT TAC AAC CAC AAT ATT CCC 348 Asn Phe Gin Leu Leu Lys Lys He Arg Asp Tyr Asn His Asn He Pro 80 85 90
ATA GGG CTT TTA GCG TAT GCG AAT TTA ATT TTT TCT TAT GGC GTT GAT 396 He Gly Leu Leu Ala Tyr Ala Asn Leu He Phe Ser Tyr Gly Val Asp 95 100 105 110
GGC TTT TAC GCT CAA GCT AAA GAA TGC GGT ATA GAT AGC GTT TTA ATA 444 Gly Phe Tyr Ala Gin Ala Lys Glu Cys Gly He Asp Ser Val Leu He 115 120 125
GCG GAC ATG CCC CTA ATA GAA AAA GAA TTA GTC ATC AAA TCC GCT CAA 492 Ala Asp Met Pro Leu He Glu Lys Glu Leu Val He Lys Ser Ala Gin 130 135 140 AAA CAC CAA ATC AAG CAA ATC TTT ATC GCC AGC CCC AAT GCG AGC AGT 540 Lys His Gin He Lys Gin He Phe He Ala Ser Pro Asn Ala Ser Ser 145 150 155
AAA GAT TTA GAA CAA GTC GCT ACG CAT TCG CAA GGC TAT ATC TAC GCT 588 Lys Asp Leu Glu Gin Val Ala Thr His Ser Gin Gly Tyr He Tyr Ala 160 165 170
TTA GCC AGG AGT GGG GTT ACA GGG GCG AGC CGT ATT TTA GAG AAT GAT 636 Leu Ala Arg Ser Gly Val Thr Gly Ala Ser Arg He Leu Glu Asn Asp 175 180 185 190
TCG AGT GCT ATT ATT AAA ACC TTA AAA GCT TTT AGC CCT ACC CCA GCC 684 Ser Ser Ala He He Lys Thr Leu Lys Ala Phe Ser Pro Thr Pro Ala 195 200 205
TTA CTG GGC TTT GGC ATT TCC AAA AAA GAA CAC ATC ACA AAC GCT AAA 732 Leu Leu Gly Phe Gly He Ser Lys Lys Glu His He Thr Asn Ala Lys 210 215 220
GGC ATG GGT GCT GAT GGC GTG ATT TGC GGA TCA GCG TTA GTC AAA ATC 780 Gly Met Gly Ala Asp Gly Val He Cys Gly Ser Ala Leu Val Lys He 225 230 235
ATA GAA GAA AAT TTA AAC AAT GAA AAC GCC ATG CTG GAA AAA ATT AAA 828 He Glu Glu Asn Leu Asn Asn Glu Asn Ala Met Leu Glu Lys He Lys 240 245 250
GGG TTT ATA GGA GGA ATG ATT TTT TAAGGCTTTT AGGCTTTGTT GCGTTAAAAA 882 Gly Phe He Gly Gly Met He Phe 255 260
TTAAAGATCA CAGATTAAC 901
(2) INFORMATION FOR SEQ ID NO: 84:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 262 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:
Met Arg Tyr Gin Asn Met Phe Glu Thr Leu Lys Lys His Glu Lys Met
1 5 10 15
Ala Phe He Pro Phe Val Thr Leu Gly Asp Pro Asn Tyr Glu Leu Ser
20 25 30
Phe Glu He He Lys Thr Leu He He Ser Gly Val Ser Ala Leu Glu
35 40 45
Leu Gly Leu Ala Phe Ser Asp Pro Val Ala Asp Gly He Thr He Gin 50 55 60 Ala Ser His Leu Arg Ala Leu Lys His Ala Ser Met Ala Lys Asn Phe 65 70 75 80
Gin Leu Leu Lys Lys He Arg Asp Tyr Asn His Asn He Pro He Gly
85 90 95
Leu Leu Ala Tyr Ala Asn Leu He Phe Ser Tyr Gly Val Asp Gly Phe
100 105 110
Tyr Ala Gin Ala Lys Glu Cys Gly He Asp Ser Val Leu He Ala Asp
115 120 125
Met Pro Leu He Glu Lys Glu Leu Val He Lys Ser Ala Gin Lys His
130 135 140
Gin He Lys Gin He Phe He Ala Ser Pro Asn Ala Ser Ser Lys Asp 145 150 155 160
Leu Glu Gin Val Ala Thr His Ser Gin Gly Tyr He Tyr Ala Leu Ala
165 170 175
Arg Ser Gly Val Thr Gly Ala Ser Arg He Leu Glu Asn Asp Ser Ser
180 185 190
Ala He He Lys Thr Leu Lys Ala Phe Ser Pro Thr Pro Ala Leu Leu
195 200 205
Gly Phe Gly He Ser Lys Lys Glu His He Thr Asn Ala Lys Gly Met
210 215 220
Gly Ala Asp Gly Val He Cys Gly Ser Ala Leu Val Lys He He Glu 225 230 235 240
Glu Asn Leu Asn Asn Glu Asn Ala Met Leu Glu Lys He Lys Gly Phe
245 250 255
He Gly Gly Met He Phe 260
(2) INFORMATION FOR SEQ ID NO: 85:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1081 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...954 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:
AAGTAATGCC CCTGTTGTAT CAGCTTGATT TAAGAGGAAT AAGTTATT ATG AAT AAA 57
Met Asn Lys 1
GCT ATT GCT AGT AAG ATA CTC ATC ACT TTG GGT TTT TTA TTT CTC TAC 105 Ala He Ala Ser Lys He Leu He Thr Leu Gly Phe Leu Phe Leu Tyr 5 10 15
AGA GTC TTA GCT TAT ATC CCC ATT CCT GGC GTA GAT TTA GCA GCG ATC 153 Arg Val Leu Ala Tyr He Pro He Pro Gly Val Asp Leu Ala Ala He 20 25 30 35
AAG GCT TTT TTT GAC AGC AAT TCC AAC AAC GCT TTG GGG TTG TTT AAT 201 Lys Ala Phe Phe Asp Ser Asn Ser Asn Asn Ala Leu Gly Leu Phe Asn 40 45 50
ATG TTT AGC GGG AAT GCG GTT TCT CGC TTG AGC ATC ATC TCG TTG GGT 249 Met Phe Ser Gly Asn Ala Val Ser Arg Leu Ser He He Ser Leu Gly 55 60 65
ATC ATG CCC TAT ATC ACT TCT TCA ATT ATC ATG GAG CTT TTG AGC GCG 297 He Met Pro Tyr He Thr Ser Ser He He Met Glu Leu Leu Ser Ala 70 75 80
ACT TTC CCT AAC CTG GCT AAA ATG AAA AAA GAG CGG GAT GGC ATG CAA 345 Thr Phe Pro Asn Leu Ala Lys Met Lys Lys Glu Arg Asp Gly Met Gin 85 90 95
AAA TAC ATG CAA ATC GTG CGT TAT TTG ACC ATT TTA ATC ACC CTA ATC 393 Lys Tyr Met Gin He Val Arg Tyr Leu Thr He Leu He Thr Leu He 100 105 110 115
CAA GCG GTG AGC GTT TCA GTA GGC TTA AGG AGC ATT AGT GGA GGA GCC 441 Gin Ala Val Ser Val Ser Val Gly Leu Arg Ser He Ser Gly Gly Ala 120 125 130
AAT GGG GCG ATC ATG ATT GAT ATG CAA GTT TTT ATG ATC GTT TCA GCG 489 Asn Gly Ala He Met He Asp Met Gin Val Phe Met He Val Ser Ala 135 140 145
TTT TCT ATG CTT ACA GGA ACG ATG CTA CTC ATG TGG ATA GGG GAG CAA 537 Phe Ser Met Leu Thr Gly Thr Met Leu Leu Met Trp He Gly Glu Gin 150 155 160
ATC ACG CAA AGG GGC GTG GGG AAT GGG ATC AGT CTC ATT ATT TTT GCC 585 He Thr Gin Arg Gly Val Gly Asn Gly He Ser Leu He He Phe Ala 165 170 175
GGG ATT GTT TCA GGG ATC CCA TCA GCT ATT TCA GGC ACA TTC AAT TTG 633 Gly He Val Ser Gly He Pro Ser Ala He Ser Gly Thr Phe Asn Leu 180 185 190 195
GTC AAT ACG GGC GTT ATT AAT ATC TTA ATG CTC ATT GGT ATT GTG CTG 681 Val Asn Thr Gly Val He Asn He Leu Met Leu He Gly He Val Leu 200 205 210
ATT GTT TTA GCG ACT ATT TTT GCG ATT ATC TAT GTG GAA TTA GCT GAG 729 He Val Leu Ala Thr He Phe Ala He He Tyr Val Glu Leu Ala Glu 215 220 225
CGC AGG ATC CCT ATT TCT TAT GCG CGT AAA GTG GTG ATG CAA AAC CAA 777 Arg Arg He Pro He Ser Tyr Ala Arg Lys Val Val Met Gin Asn Gin 230 235 240 AAC AAG CGC ATC ATG AAT TAC ATT CCT ATT AAG TTG AAT TTA AGT GGG 825 Asn Lys Arg He Met Asn Tyr He Pro He Lys Leu Asn Leu Ser Gly 245 250 255
GTG ATC CCC CCT ATT TTC GCT TCA GCT TTG CTC GTG TTC CCT TCT ACG 873 Val He Pro Pro He Phe Ala Ser Ala Leu Leu Val Phe Pro Ser Thr 260 265 270 275
ATT TTG CAG CAA GCC ACA AGC AAC AAA ACC TTG CAA GCG GTT GCG NAT 921 He Leu Gin Gin Ala Thr Ser Asn Lys Thr Leu Gin Ala Val Ala Xaa 280 285 290
TTT TTA AGC CCG CAA GGT ATG CGT ATA ATA TTT TGATGTTCTT GCTCATCATC 974 Phe Leu Ser Pro Gin Gly Met Arg He He Phe 295 300
TTTTTTGCTT ACTTTTATTC TTCTATTGTG TTCAATTCTA AGGATATTGC GGATAATTTG 1034 AGGCGTAATG GCGGGTATAT TCCAGGGCTT AGGCCTGGAG AGGGGAC 1081
(2) INFORMATION FOR SEQ ID NO: 86:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 302 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:
Met Asn Lys Ala He Ala Ser Lys He Leu He Thr Leu Gly Phe Leu
1 5 10 15
Phe Leu Tyr Arg Val Leu Ala Tyr He Pro He Pro Gly Val Asp Leu
20 25 30
Ala Ala He Lys Ala Phe Phe Asp Ser Asn Ser Asn Asn Ala Leu Gly
35 40 45
Leu Phe Asn Met Phe Ser Gly Asn Ala Val Ser Arg Leu Ser He He
50 55 60
Ser Leu Gly He Met Pro Tyr He Thr Ser Ser He He Met Glu Leu 65 70 75 80
Leu Ser Ala Thr Phe Pro Asn Leu Ala Lys Met Lys Lys Glu Arg Asp
85 90 95
Gly Met Gin Lys Tyr Met Gin He Val Arg Tyr Leu Thr He Leu He
100 105 110
Thr Leu He Gin Ala Val Ser Val Ser Val Gly Leu Arg Ser He Ser
115 120 125
Gly Gly Ala Asn Gly Ala He Met He Asp Met Gin Val Phe Met He
130 135 140
Val Ser Ala Phe Ser Met Leu Thr Gly Thr Met Leu Leu Met Trp He 145 150 155 160
Gly Glu Gin He Thr Gin Arg Gly Val Gly Asn Gly He Ser Leu He
165 170 175
He Phe Ala Gly He Val Ser Gly He Pro Ser Ala He Ser Gly Thr 180 185 190
Phe Asn Leu Val Asn Thr Gly Val He Asn He Leu Met Leu He Gly
195 200 205
He Val Leu He Val Leu Ala Thr He Phe Ala He He Tyr Val Glu
210 215 220
Leu Ala Glu Arg Arg He Pro He Ser Tyr Ala Arg Lys Val Val Met 225 230 235 240
Gin Asn Gin Asn Lys Arg He Met Asn Tyr He Pro He Lys Leu Asn
245 250 255
Leu Ser Gly Val He Pro Pro He Phe Ala Ser Ala Leu Leu Val Phe
260 265 270
Pro Ser Thr He Leu Gin Gin Ala Thr Ser Asn Lys Thr Leu Gin Ala
275 280 285
Val Ala Xaa Phe Leu Ser Pro Gin Gly Met Arg He He Phe 290 295 300
(2) INFORMATION FOR SEQ ID NO: 87:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 423 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 109...363 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:
AATGGGGCAA TTCAAGGCGA TAGAAGCTTG AATGAGGGCT TCTTCTAAGG TTTTGGCTTT 60 GATTTCAATA AAATTTTGCA TCAATGTTCC TTTTTGTTTT GCGCATGC ATG CGT TTT 117
Met Arg Phe
1
TTA TTC TCT AAG ACT TTA TTG ATG ATG AGT TGT TGC AAC ACC GAA AGG 165 Leu Phe Ser Lys Thr Leu Leu Met Met Ser Cys Cys Asn Thr Glu Arg 5 10 15
ATG TTG TTC GTG GTC CAA TAC AAG ACT AAC CCT GCC GGG AAA GTG ATT 213 Met Leu Phe Val Val Gin Tyr Lys Thr Asn Pro Ala Gly Lys Val He 20 25 30 35
AAA AAG ATT GTG AAT AAT AGG GGT AAG AGT TTA AAA ATC TTT GCT TGC 261 Lys Lys He Val Asn Asn Arg Gly Lys Ser Leu Lys He Phe Ala Cys 40 45 50
ATG GGA TCG GTC ATG GTG TTT GGC GTA ACG CTT TGG TGC CAA TAC ATA 309 Met Gly Ser Val Met Val Phe Gly Val Thr Leu Trp Cys Gin Tyr He 55 60 65 GAC GCT CCC ATA AGA AGC GGT AAA ATA AAA TAC GGA TCC ATG ATG GAT 357 Asp Ala Pro He Arg Ser Gly Lys He Lys Tyr Gly Ser Met Met Asp 70 75 80
AAA TCA TGAATCCATA AGATCCACTC TGAGCTTTTC AATTCCACAG CGTTATAAAG CA 415 Lys Ser 85
CTCTATAA 423
(2) INFORMATION FOR SEQ ID NO: 88:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88:
Met Arg Phe Leu Phe Ser Lys Thr Leu Leu Met Met Ser Cys Cys Asn
1 5 10 15
Thr Glu Arg Met Leu Phe Val Val Gin Tyr Lys Thr Asn Pro Ala Gly
20 25 30
Lys Val He Lys Lys He Val Asn Asn Arg Gly Lys Ser Leu Lys He
35 40 45
Phe Ala Cys Met Gly Ser Val Met Val Phe Gly Val Thr Leu Trp Cys
50 55 60
Gin Tyr He Asp Ala Pro He Arg Ser Gly Lys He Lys Tyr Gly Ser 65 70 75 80
Met Met Asp Lys Ser 85
(2) INFORMATION FOR SEQ ID NO: 89:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 740 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 59...688 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: TTTAAAATTA GAAACAGATG TATCTGTTTT AAATTTTGAA TAGGGAGTTT CTATCATT 58
ATG TTA TTG AAA ACA AAA TTA AAA ATT ATA AGC TCG GTG ATT TTG AGC 106 Met Leu Leu Lys Thr Lys Leu Lys He He Ser Ser Val He Leu Ser 1 5 10 15
GCT TTA TTG TGG GTG GGT TGC TCA AGC GAA ATG GCA ACT TAT CAA AAC 154 Ala Leu Leu Trp Val Gly Cys Ser Ser Glu Met Ala Thr Tyr Gin Asn 20 25 30
GTG AAT GAT GCC ACT AAA AAT ACG ACT GCA AGC ATT AAT AGC ACG GAT 202 Val Asn Asp Ala Thr Lys Asn Thr Thr Ala Ser He Asn Ser Thr Asp 35 40 45
TTA TTG CTA ACC GCT AAC GCG ATG TTA GAT TCC ATG TTT AGC GAC CCT 250 Leu Leu Leu Thr Ala Asn Ala Met Leu Asp Ser Met Phe Ser Asp Pro 50 55 60
AAT TTT GAG CAA CTC AAG GGC AAG CAT TTG ATT GAA GTT TCA GAT GTG 298 Asn Phe Glu Gin Leu Lys Gly Lys His Leu He Glu Val Ser Asp Val 65 70 75 80
ATT AAC GAC ACC ACG CAG CCC AAT TTG GAC ATG AAT CTT TTG ACG ACT 346 He Asn Asp Thr Thr Gin Pro Asn Leu Asp Met Asn Leu Leu Thr Thr 85 90 95
GAA ATC GCG CGG CAG TTG CGG TTG CGA TCT AAT GGG AGG TTC AAT ATC 394 Glu He Ala Arg Gin Leu Arg Leu Arg Ser Asn Gly Arg Phe Asn He 100 105 110
ACA AGG GCG AGC GGA GGG AGT GGC ATT GCA GCC GAT AGC AGA ATG GTG 442 Thr Arg Ala Ser Gly Gly Ser Gly He Ala Ala Asp Ser Arg Met Val 115 120 125
AAA CAG CGC GAA AAA GAA CGA GAG AGC GAA GAG TAT AAT CAA GAC ACC 490 Lys Gin Arg Glu Lys Glu Arg Glu Ser Glu Glu Tyr Asn Gin Asp Thr 130 135 140
ACT GTA GAA AAA GGC ACT TTA AAA GCC GCT GAT TTA TCT TTA AGT GGT 538 Thr Val Glu Lys Gly Thr Leu Lys Ala Ala Asp Leu Ser Leu Ser Gly 145 150 155 160
AAA GTA TCT AGT ATC GCA GCC TCT ATT AGT AGT TCT AGG CAG CGC TTG 586 Lys Val Ser Ser He Ala Ala Ser He Ser Ser Ser Arg Gin Arg Leu 165 170 175
GAC TAT GAC TTC ACC CTA AGC CTT ACC AAC AGG AAA ACG GGT GAA GAG 634 Asp Tyr Asp Phe Thr Leu Ser Leu Thr Asn Arg Lys Thr Gly Glu Glu 180 185 190
GTA TGG AGC GAT GTT AAG CCT ATT GTG AAG AAC GCT AGC AAT AAG CGT 682 Val Trp Ser Asp Val Lys Pro He Val Lys Asn Ala Ser Asn Lys Arg 195 200 205 ATG TTT TAAATTTATA TTTGAAAGGA TGAACAATGA AAAATCAAGT TAAAAAAATT TT 740 Met Phe 210
(2) INFORMATION FOR SEQ ID NO:90:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 210 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90:
Met Leu Leu Lys Thr Lys Leu Lys He He Ser Ser Val He Leu Ser
1 5 10 15
Ala Leu Leu Trp Val Gly Cys Ser Ser Glu Met Ala Thr Tyr Gin Asn
20 25 30
Val Asn Asp Ala Thr Lys Asn Thr Thr Ala Ser He Asn Ser Thr Asp
35 40 45
Leu Leu Leu Thr Ala Asn Ala Met Leu Asp Ser Met Phe Ser Asp Pro
50 55 60
Asn Phe Glu Gin Leu Lys Gly Lys His Leu He Glu Val Ser Asp Val 65 70 75 80
He Asn Asp Thr Thr Gin Pro Asn Leu Asp Met Asn Leu Leu Thr Thr
85 90 95
Glu He Ala Arg Gin Leu Arg Leu Arg Ser Asn Gly Arg Phe Asn He
100 105 110
Thr Arg Ala Ser Gly Gly Ser Gly He Ala Ala Asp Ser Arg Met Val
115 120 125
Lys Gin Arg Glu Lys Glu Arg Glu Ser Glu Glu Tyr Asn Gin Asp Thr
130 135 140
Thr Val Glu Lys Gly Thr Leu Lys Ala Ala Asp Leu Ser Leu Ser Gly 145 150 155 160
Lys Val Ser Ser He Ala Ala Ser He Ser Ser Ser Arg Gin Arg Leu
165 170 175
Asp Tyr Asp Phe Thr Leu Ser Leu Thr Asn Arg Lys Thr Gly Glu Glu
180 185 190
Val Trp Ser Asp Val Lys Pro He Val Lys Asn Ala Ser Asn Lys Arg
195 200 205
Met Phe 210
(2) INFORMATION FOR SEQ ID NO: 91:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1269 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 84...1214 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:
TTATCATTGT GTTAAAATAG TCGTTTTAAC AAACAAAATT TTGTTAATAG ATTTTACCTA 60 ATCTGAGAGA GAATTATATT TTA ATG AAG ACA GAG AAA CAA AAA TTT TTA GAG 113
Met Lys Thr Glu Lys Gin Lys Phe Leu Glu 1 5 10
ATG CGT AAA GAT GGG GCG AAC TCT GTG CTG ATT TTA AGA GGG GAT TGG 161 Met Arg Lys Asp Gly Ala Asn Ser Val Leu He Leu Arg Gly Asp Trp 15 20 25
GAT TTT AAA ACG AGC GTG TTT CGT TTA GAT GAG TTG AAA AAA AAT TTA 209 Asp Phe Lys Thr Ser Val Phe Arg Leu Asp Glu Leu Lys Lys Asn Leu 30 35 40
TTA GAT CAT CAA GGG CCT TTA AAA ATG GAT TTT TCA GGG TGC CAA AAA 257 Leu Asp His Gin Gly Pro Leu Lys Met Asp Phe Ser Gly Cys Gin Lys 45 50 55
GTG GAT TTT GTT TTT GGC ATG TTT TTA TTT GAT TTA GTT AAG GAG CGT 305 Val Asp Phe Val Phe Gly Met Phe Leu Phe Asp Leu Val Lys Glu Arg 60 65 70
TCT TTA AAC ATT GAA TTG TGT AAC GTG AGT GAG AAT AAC GCA TGC GCT 353 Ser Leu Asn He Glu Leu Cys Asn Val Ser Glu Asn Asn Ala Cyε, Ala 75 80 85 90
TTG AAA GTG GTT AAA GAC TGG CTT GAA AAA GAA GAG GAT TTA GAG TCT 401 Leu Lys Val Val Lys Asp Trp Leu Glu Lys Glu Glu Asp Leu Glu Ser 95 100 105
AAA AAA GCG GGC AAA CAC TAC GAA CTT TTG ATC ACT AAA TTG GGT AAG 449 Lys Lys Ala Gly Lys His Tyr Glu Leu Leu He Thr Lys Leu Gly Lys 110 115 120
AGT ATC GTA GAG ACT TAT AAT ACC TTT TTA AAC GCA TTC AAT TTT TGC 497 Ser He Val Glu Thr Tyr Asn Thr Phe Leu Asn Ala Phe Asn Phe Cys 125 130 135
GGC ATG ATT TTA TTC TAC TTC ATT AAA AGC GTT TTC AAC CCC AAA CGC 545 Gly Met He Leu Phe Tyr Phe He Lys Ser Val Phe Asn Pro Lys Arg 140 145 150
TTT TGT ATC ACT CCT TTG CTC TAT CAT ATC AAT GAA TCC GGG TTT AAG 593 Phe Cys He Thr Pro Leu Leu Tyr His He Asn Glu Ser Gly Phe Lys 155 160 165 170
GTT TTG CCA GTG AGT ATT TTA ACG GTG TTT ATC GTG GGG TTT GCC GTT 641 Val Leu Pro Val Ser He Leu Thr Val Phe He Val Gly Phe Ala Val 175 180 185
GCT TTA CAA GGG GCT TTA CAA TTA CAA GAC ATG GGC GCG CCT TTA ATG 689 Ala Leu Gin Gly Ala Leu Gin Leu Gin Asp Met Gly Ala Pro Leu Met 190 195 200
TCG GTG GAA ATG ACG GCT AAA CTC GCT TTA AGA GAA ATC GGC CCT TTT 737 Ser Val Glu Met Thr Ala Lys Leu Ala Leu Arg Glu He Gly Pro Phe 205 210 215
ATT TTA ACC CTT GTG GTG GCC GGG AGG AGC GCG AGC AGT TTT ACC GCG 785 He Leu Thr Leu Val Val Ala Gly Arg Ser Ala Ser Ser Phe Thr Ala 220 225 230
CAA ATT GGG GTG ATG AAG ATC ACT GAG GAA TTA GAC GCG ATG AAA ACC 833 Gin He Gly Val Met Lys He Thr Glu Glu Leu Asp Ala Met Lys Thr 235 240 245 250
ATG GGC TTT AAC CCT TTT GAA TTT TTA GTG TTG CCT AGG GTG TTA GCC 881 Met Gly Phe Asn Pro Phe Glu Phe Leu Val Leu Pro Arg Val Leu Ala 255 260 265
TTA GTG ATT GTT TTG CCT TTA TTG GTG TTT ATT GCC GAT GCG TTC GCC 929 Leu Val He Val Leu Pro Leu Leu Val Phe He Ala Asp Ala Phe Ala 270 275 280
ATT CTT GGG GGC ATG TTT GCG ATT AAA TAC CAA TTG GAT TTA GGC TTC 977 He Leu Gly Gly Met Phe Ala He Lys Tyr Gin Leu Asp Leu Gly Phe 285 290 295
CCG AGC TAT ATT GAC AGA TTC CAT GAC ACA GTG GGT TGG AAC CAT TTT 1025 Pro Ser Tyr He Asp Arg Phe His Asp Thr Val Gly Trp Asn His Phe 300 305 310
TTG GTA GGG ATT GTC AAA GCC CCT TTT TGG GGG TTT GCG ATT GCG ATG 1073 Leu Val Gly He Val Lys Ala Pro Phe Trp Gly Phe Ala He Ala Met 315 320 325 330
GTA GGG TGC ATG CGC GGG TTT GAA GTC AAG GGG GAT ACT GAG AGC ATT 1121 Val Gly Cys Met Arg Gly Phe Glu Val Lys Gly Asp Thr Glu Ser He 335 340 345
GGG CGC TTG ACC ACT ATT AGC GTC GTG AAC GCT TTG TTT TGG ATC ATT 1169 Gly Arg Leu Thr Thr He Ser Val Val Asn Ala Leu Phe Trp He He 350 355 360
TTC TTA GAC GCT ATT TTT TCT ATC ATC TTT TCT AAG TTG AAC ATA TAATG 1219 Phe Leu Asp Ala He Phe Ser He He Phe Ser Lys Leu Asn He 365 370 375
AACGCTACTA ACAATCAAGT CTTAATTGAA GTGAAGGATC TCCATAGCGC 1269 (2) INFORMATION FOR SEQ ID NO : 92 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 377 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:
Met Lys Thr Glu Lys Gin Lys Phe Leu Glu Met Arg Lys Asp Gly Ala
1 5 10 15
Asn Ser Val Leu He Leu Arg Gly Asp Trp Asp Phe Lys Thr Ser Val
20 25 30
Phe Arg Leu Asp Glu Leu Lys Lys Asn Leu Leu Asp His Gin Gly Pro
35 40 45
Leu Lys Met Asp Phe Ser Gly Cys Gin Lys Val Asp Phe Val Phe Gly
50 55 60
Met Phe Leu Phe Asp Leu Val Lys Glu Arg Ser Leu Asn He Glu Leu 65 70 75 80
Cys Asn Val Ser Glu Asn Asn Ala Cys Ala Leu Lys Val Val Lys Asp
85 90 95
Trp Leu Glu Lys Glu Glu Asp Leu Glu Ser Lys Lys Ala Gly Lys His
100 105 110
Tyr Glu Leu Leu He Thr Lys Leu Gly Lys Ser He Val Glu Thr Tyr
115 120 125
Asn Thr Phe Leu Asn Ala Phe Asn Phe Cys Gly Met He Leu Phe Tyr
130 135 140
Phe He Lys Ser Val Phe Asn Pro Lys Arg Phe Cys He Thr Pro Leu 145 150 155 160
Leu Tyr His He Asn Glu Ser Gly Phe Lys Val Leu Pro Val Ser He
165 170 175
Leu Thr Val Phe He Val Gly Phe Ala Val Ala Leu Gin Gly Ala Leu
180 185 190
Gin Leu Gin Asp Met Gly Ala Pro Leu Met Ser Val Glu Met Thr Ala
195 200 205
Lys Leu Ala Leu Arg Glu He Gly Pro Phe He Leu Thr Leu Val Val
210 215 220
Ala Gly Arg Ser Ala Ser Ser Phe Thr Ala Gin He Gly Val Met Lys 225 230 235 240
He Thr Glu Glu Leu Asp Ala Met Lys Thr Met Gly Phe Asn Pro Phe
245 250 255
Glu Phe Leu Val Leu Pro Arg Val Leu Ala Leu Val He Val Leu Pro
260 265 270
Leu Leu Val Phe He Ala Asp Ala Phe Ala He Leu Gly Gly Met Phe
275 280 285
Ala He Lys Tyr Gin Leu Asp Leu Gly Phe Pro Ser Tyr He Asp Arg
290 295 300
Phe His Asp Thr Val Gly Trp Asn His Phe Leu Val Gly He Val Lys 305 310 315 320
Ala Pro Phe Trp Gly Phe Ala He Ala Met Val Gly Cys Met Arg Gly 325 330 335 Phe Glu Val Lys Gly Asp Thr Glu Ser He Gly Arg Leu Thr Thr He
340 345 350
Ser Val Val Asn Ala Leu Phe Trp He He Phe Leu Asp Ala He Phe
355 360 365
Ser He He Phe Ser Lys Leu Asn He
370 375
(2) INFORMATION FOR SEQ ID NO: 93:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 557 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 60...503 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: CACAAGAGAA AATTTGCAAG CGTTTTTACA ACAAAATAAG ATAAATTAGG GAGAGTGGT 59
ATG GGA TTT TTG AAT GGG TAT TTT TTA TGG GTT AAG GCT TTC CAT GTG 107 Met Gly Phe Leu Asn Gly Tyr Phe Leu Trp Val Lys Ala Phe His Val 1 5 10 15
ATA GCG GTC ATT TCG TGG ATG GCA GCG TTG TTT TAT TTG CCG CGC CTT 155 He Ala Val He Ser Trp Met Ala Ala Leu Phe Tyr Leu Pro Arg Leu 20 25 30
TTT GTC TAT CAT GCA GAA AAC GCG CAT AAA AAA GAG TTT GTA GGA GTG 203 Phe Val Tyr His Ala Glu Asn Ala His Lys Lys Glu Phe Val Gly Val 35 40 45
GTT CAA ATC CAA GAA AAA AAG CTT TAT TCC TTT ATC GCT TCA CCG GCT 251 Val Gin He Gin Glu Lys Lys Leu Tyr Ser Phe He Ala Ser Pro Ala 50 55 60
ATG GGT TTT ACG CTT ATT ACA GGG ATT TTA ATG CTG TTG ATA GAG CCT 299 Met Gly Phe Thr Leu He Thr Gly He Leu Met Leu Leu He Glu Pro 65 70 75 80
ACG CTC TTT AAA AGT GGG GGT TGG TTG CAT GCT AAA TTG GCT TTA GTG 347 Thr Leu Phe Lys Ser Gly Gly Trp Leu His Ala Lys Leu Ala Leu Val 85 90 95
GTT TTA CTT TTA GCC TAT CAT TTT TAT TGC AAA AAA TGC ATG CGC GAG 395 Val Leu Leu Leu Ala Tyr His Phe Tyr Cys Lys Lys Cys Met Arg Glu 100 105 110
CTG GAA AAA GAC CCC ACA AGG AGA AAC GCA AGG TTT TAT CGC GTG TTT 443 Leu Glu Lys Asp Pro Thr Arg Arg Asn Ala Arg Phe Tyr Arg Val Phe 115 120 125
AAT GAG GCG CCA ACG ATT TTA ATG ATC CTC ATT GTG ATT TTA GTG GTT 491 Asn Glu Ala Pro Thr He Leu Met He Leu He Val He Leu Val Val 130 135 140
GTC AAG CCT TTT TAAAGACAAG CCATGAAAAA AGAAAAGTCA TGAAAAAAGA AAAGCA 549
Val Lys Pro Phe
145
TCTCAAGC 557
(2) INFORMATION FOR SEQ ID NO : 94 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 148 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 94 :
Met Gly Phe Leu Asn Gly Tyr Phe Leu Trp Val Lys Ala Phe His Val
1 5 10 15
He Ala Val He Ser Trp Met Ala Ala Leu Phe Tyr Leu Pro Arg Leu
20 25 30
Phe Val Tyr His Ala Glu Asn Ala His Lys Lys Glu Phe Val Gly Val
35 40 45
Val Gin He Gin Glu Lys Lys Leu Tyr Ser Phe He Ala Ser Pro Ala
50 55 60
Met Gly Phe Thr Leu He Thr Gly He Leu Met Leu Leu He Glu Pro 65 70 75 80
Thr Leu Phe Lys Ser Gly Gly Trp Leu His Ala Lys Leu Ala Leu Val
85 90 95
Val Leu Leu Leu Ala Tyr His Phe Tyr Cys Lys Lys Cys Met Arg Glu
100 105 110
Leu Glu Lys Asp Pro Thr Arg Arg Asn Ala Arg Phe Tyr Arg Val Phe
115 120 125
Asn Glu Ala Pro Thr He Leu Met He Leu He Val He Leu Val Val
130 135 140
Val Lys Pro Phe 145
(2) INFORMATION FOR SEQ ID NO: 95:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1671 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 50...1624 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:
CAAAATTATC TGGTGCTAAG ACTTTGAAAC AACGCCAAAT AACAACTGA ATG AAA CTT 58
Met Lys Leu 1
TTT AAC GCT CGT TTA ATC GTT TTT ATT GGC GCG CTT CTT TTA GGG GTA 106 Phe Asn Ala Arg Leu He Val Phe He Gly Ala Leu Leu Leu Gly Val 5 10 15
GGG TTT TCT GTG CCT TCT TTA CTA GAA ACT AAA GGC CCT AAA ATC ACT 154 Gly Phe Ser Val Pro Ser Leu Leu Glu Thr Lys Gly Pro Lys He Thr 20 25 30 35
TTA GGT TTG GAT TTA AGG GGG GGG TTG AAC ATG CTT TTA GGG GTA CAA 202 Leu Gly Leu Asp Leu Arg Gly Gly Leu Asn Met Leu Leu Gly Val Gin 40 45 50
ACC GAT GAG GCT TTA AAA AAC AAG TAT TTA AGC TTG GCG TCC GCT TTA 250 Thr Asp Glu Ala Leu Lys Asn Lys Tyr Leu Ser Leu Ala Ser Ala Leu 55 60 65
GAA TAC AAC GCT AAA AAG CAA AAT ATC TTG CTT AAA GAT ATT AAA TCC 298 Glu Tyr Asn Ala Lys Lys Gin Asn He Leu Leu Lys Asp He Lys Ser 70 75 80
AAT TTA GAA GGG ATC AGT TTT GAG CTT TTA GAT GAA GAT GAA GCG AAA 346 Asn Leu Glu Gly He Ser Phe Glu Leu Leu Asp Glu Asp Glu Ala Lys 85 90 95
AAA TTA GAC GCG CTT TTA TTG GAA TTG CAA GGC CAT AGC CAG TTT GAA 394 Lys Leu Asp Ala Leu Leu Leu Glu Leu Gin Gly His Ser Gin Phe Glu 100 105 110 115
ATC AAA AAG GAA GCG GGG TTT TAT AGC GTG AAT CTC ACC CCT TTA GAG 442 He Lys Lys Glu Ala Gly Phe Tyr Ser Val Asn Leu Thr Pro Leu Glu 120 125 130
CAA GAA GAA TTG CGT AAA AAC ACG ATT TTG CAA GTG ATA GGG ATC ATT 490 Gin Glu Glu Leu Arg Lys Asn Thr He Leu Gin Val He Gly He He 135 140 145 CGT AAC CGC TTG GAT CAA TTT GGT TTG GCA GAG CCT GTA GTC ATT CAG 538 Arg Asn Arg Leu Asp Gin Phe Gly Leu Ala Glu Pro Val Val He Gin 150 155 160
CAA GGT AAA GAA GAA ATT TCG GTG CAA TTG CCT GGC ATT AAG ACT TTA 586 Gin Gly Lys Glu Glu He Ser Val Gin Leu Pro Gly He Lys Thr Leu 165 170 175
GAA GAA GAA CGG CGC GCT AAA GAC TTG ATT TCA AGA TCC GCT CAT TTG 634 Glu Glu Glu Arg Arg Ala Lys Asp Leu He Ser Arg Ser Ala His Leu 180 185 190 195
CAG ATG ATG GCG GTG GAT GAA GAA CAC AAT AAA GAT GCG ATG AAA ATG 682 Gin Met Met Ala Val Asp Glu Glu His Asn Lys Asp Ala Met Lys Met 200 205 210
ACG GAT TTA GAG GCT CAA AAA TTA GGC AGC GTG TTG TTG TCT GAT GTG 730 Thr Asp Leu Glu Ala Gin Lys Leu Gly Ser Val Leu Leu Ser Asp Val 215 220 225
GAA ATG GGG GGT AAA ATC TTG CTC AAA GCG ATC CCC ATT TTA GAT GGC 778 Glu Met Gly Gly Lys He Leu Leu Lys Ala He Pro He Leu Asp Gly 230 235 240
GAA ATG CTT ACA GAT GCG AAA GTG GTG TAT GAC CAA AAC AAC CAG CCG 826 Glu Met Leu Thr Asp Ala Lys Val Val Tyr Asp Gin Asn Asn Gin Pro 245 250 255
GTG GTG AGC TTC ACG CTG GAT GCG CAA GGG GCT AAG ATT TTT GGG GAT 874 Val Val Ser Phe Thr Leu Asp Ala Gin Gly Ala Lys He Phe Gly Asp 260 265 270 275
TTC TCA GGT GCG AAT GTG GGC AAA CGC ATG GCG ATT GTT TTA GAC AAT 922 Phe Ser Gly Ala Asn Val Gly Lys Arg Met Ala He Val Leu Asp Asn 280 285 290
AAG GTC TAT TCA GCC CCG GTG ATT AGG GAG CGT ATC GGT GGG GGG AGC 970 Lys Val Tyr Ser Ala Pro Val He Arg Glu Arg He Gly Gly Gly Ser 295 300 305
GGG CAG ATT AGC GGG AAT TTT AGC GTG GCT CAA GCG AGC GAT TTA GCG 1018 Gly Gin He Ser Gly Asn Phe Ser Val Ala Gin Ala Ser Asp Leu Ala 310 315 320
ATC GCT TTA AGG AGT GGG GCG ATG AGC GCT CCC ATT CAG GTT TTA GAA 1066 He Ala Leu Arg Ser Gly Ala Met Ser Ala Pro He Gin Val Leu Glu 325 330 335
AAA AGA ATT ATA GGC CCA AGT TTA GGG AAA GAC AGC GTT AAA ACT TCC 1114 Lys Arg He He Gly Pro Ser Leu Gly Lys Asp Ser Val Lys Thr Ser 340 345 350 355
ATT ATC GCT CTA GTT GGG GGC TTT ATT TTA GTG ATG GGC TTT ATG GTG 1162 He He Ala Leu Val Gly Gly Phe He Leu Val Met Gly Phe Met Val 360 365 370 CTT TAT TAC TCT ATG GCG GGG GTG ATC GCT TGT TTG GCG TTA GTG GTC 1210 Leu Tyr Tyr Ser Met Ala Gly Val He Ala Cys Leu Ala Leu Val Val 375 380 385
AAT CTT TTT TTG ATT GTG GCG GTC ATG GCG ATT TTT GGA GCG ACG CTG 1258 Asn Leu Phe Leu He Val Ala Val Met Ala He Phe Gly Ala Thr Leu 390 395 400
ACT TTA CCG GGA ATG GCG GGG ATT GTT TTA ACC GTG GGG ATT GCC GTG 1306 Thr Leu Pro Gly Met Ala Gly He Val Leu Thr Val Gly He Ala Val 405 410 415
GAT GCT AAT ATC ATC ATC AAC GAG CGC ATT AGA GAA GTC TTA AGA GAG 1354 Asp Ala Asn He He He Asn Glu Arg He Arg Glu Val Leu Arg Glu 420 425 430 435
AAT GAG GGC ATC GCT AAA GCG ATC CAT TTA GGC TAT ATC AAT GCG AGC 1402 Asn Glu Gly He Ala Lys Ala He His Leu Gly Tyr He Asn Ala Ser 440 445 450
CGG GCG ATT TTT GAT TCT AAT ATC ACT TCT TTG ATC GCT TCA GTG TTA 1450 Arg Ala He Phe Asp Ser Asn He Thr Ser Leu He Ala Ser Val Leu 455 460 465
TTA TAC GCT TAT GGC ACA GGA GCG ATT AAA GGC TTT GCC CTA ACT ACA 1498 Leu Tyr Ala Tyr Gly Thr Gly Ala He Lys Gly Phe Ala Leu Thr Thr 470 475 480
GGC ATT GGG ATT TTA GCC TCT ATT ATC ACC GCT ATT GTT GGC ACG CAA 1546 Gly He Gly He Leu Ala Ser He He Thr Ala He Val Gly Thr Gin 485 490 495
GGG ATT TAT CAA GCC CTT TTA CCT AAA CTC ACT CAA ACA AAA AGC CTT 1594 Gly He Tyr Gin Ala Leu Leu Pro Lys Leu Thr Gin Thr Lys Ser Leu 500 505 510 515
TAC TTT TGG TTT GGC GTG AAT AAA AGA GCT TAGGAGGTTT TATGGAATTA TTC 1647 Tyr Phe Trp Phe Gly Val Asn Lys Arg Ala 520 525
AAACGAACTA GAATCTTAAG CTTC 1671
(2) INFORMATION FOR SEQ ID NO: 96:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 525 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: Met Lys Leu Phe Asn Ala Arg Leu He Val Phe He Gly Ala Leu Leu
1 5 10 15
Leu Gly Val Gly Phe Ser Val Pro Ser Leu Leu Glu Thr Lys Gly Pro
20 25 30
Lys He Thr Leu Gly Leu Asp Leu Arg Gly Gly Leu Asn Met Leu Leu
35 40 45
Gly Val Gin Thr Asp Glu Ala Leu Lys Asn Lys Tyr Leu Ser Leu Ala
50 55 60
Ser Ala Leu Glu Tyr Asn Ala Lys Lys Gin Asn He Leu Leu Lys Asp 65 70 75 80
He Lys Ser Asn Leu Glu Gly He Ser Phe Glu Leu Leu Asp Glu Asp
85 90 95
Glu Ala Lys Lys Leu Asp Ala Leu Leu Leu Glu Leu Gin Gly His Ser
100 105 110
Gin Phe Glu He Lys Lys Glu Ala Gly Phe Tyr Ser Val Asn Leu Thr
115 120 125
Pro Leu Glu Gin Glu Glu Leu Arg Lys Asn Thr He Leu Gin Val He
130 135 140
Gly He He Arg Asn Arg Leu Asp Gin Phe Gly Leu Ala Glu Pro Val 145 150 155 160
Val He Gin Gin Gly Lys Glu Glu He Ser Val Gin Leu Pro Gly He
165 170 175
Lys Thr Leu Glu Glu Glu Arg Arg Ala Lys Asp Leu He Ser Arg Ser
180 185 190
Ala His Leu Gin Met Met Ala Val Asp Glu Glu His Asn Lys Asp Ala
195 200 205
Met Lys Met Thr Asp Leu Glu Ala Gin Lys Leu Gly Ser Val Leu Leu
210 215 220
Ser Asp Val Glu Met Gly Gly Lys He Leu Leu Lys Ala He Pro He 225 230 235 240
Leu Asp Gly Glu Met Leu Thr Asp Ala Lys Val Val Tyr Asp Gin Asn
245 250 255
Asn Gin Pro Val Val Ser Phe Thr Leu Asp Ala Gin Gly Ala Lys He
260 265 270
Phe Gly Asp Phe Ser Gly Ala Asn Val Gly Lys Arg Met Ala He Val
275 280 285
Leu Asp Asn Lys Val Tyr Ser Ala Pro Val He Arg Glu Arg He Gly
290 295 300
Gly Gly Ser Gly Gin He Ser Gly Asn Phe Ser Val Ala Gin Ala Ser 305 310 315 320
Asp Leu Ala He Ala Leu Arg Ser Gly Ala Met Ser Ala Pro He Gin
325 330 335
Val Leu Glu Lys Arg He He Gly Pro Ser Leu Gly Lys Asp Ser Val
340 345 350
Lys Thr Ser He He Ala Leu Val Gly Gly Phe He Leu Val Met Gly
355 360 365
Phe Met Val Leu Tyr Tyr Ser Met Ala Gly Val He Ala Cys Leu Ala
370 375 380
Leu Val Val Asn Leu Phe Leu He Val Ala Val Met Ala He Phe Gly 385 390 395 400
Ala Thr Leu Thr Leu Pro Gly Met Ala Gly He Val Leu Thr Val Gly
405 410 415
He Ala Val Asp Ala Asn He He He Asn Glu Arg He Arg Glu Val
420 425 430
Leu Arg Glu Asn Glu Gly He Ala Lys Ala He His Leu Gly Tyr He 435 440 445
Asn Ala Ser Arg Ala He Phe Asp Ser Asn He Thr Ser Leu He Ala
450 455 460
Ser Val Leu Leu Tyr Ala Tyr Gly Thr Gly Ala He Lys Gly Phe Ala 465 470 475 480
Leu Thr Thr Gly He Gly He Leu Ala Ser He He Thr Ala He Val
485 490 495
Gly Thr Gin Gly He Tyr Gin Ala Leu Leu Pro Lys Leu Thr Gin Thr
500 505 510
Lys Ser Leu Tyr Phe Trp Phe Gly Val Asn Lys Arg Ala 515 520 525
(2) INFORMATION FOR SEQ ID NO: 97:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 706 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 64...654 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:
CAGGGGGCAA GGGGGCTGTT AGGGAAGCGA TTGATTATCT TTTAACATTA GAAGGCTTGC 60
AAG ATG AAG CGC TCA AGC TTT ACC TCT AAT AGC GTT TTA AAC TTT TTT 108
Met Lys Arg Ser Ser Phe Thr Ser Asn Ser Val Leu Asn Phe Phe 1 5 10 15
GTA GTT TTG TCT TTC ATT ACG ATA GGA TTA GTG TTT TTC TTT TTG CGT 156 Val Val Leu Ser Phe He Thr He Gly Leu Val Phe Phe Phe Leu Arg 20 25 30
TCC CAA CCC ACT AGC GTA GTT TCT AAA GAA AAT ATC CCT AAA ATT GAA 204 Ser Gin Pro Thr Ser Val Val Ser Lys Glu Asn He Pro Lys He Glu 35 40 45
TTA GAA AAT TTT AAA GCG TTT CAA ATC AAC GAT AAA ATC CTT GAT CTG 252 Leu Glu Asn Phe Lys Ala Phe Gin He Asn Asp Lys He Leu Asp Leu 50 55 60
TCC ATA GAG GGC AAA AAA GCC CTA CAA TAC GAT GAT CAT GAA ATC TTT 300 Ser He Glu Gly Lys Lys Ala Leu Gin Tyr Asp Asp His Glu He Phe 65 70 75
TTT GAT TCC AAA ATC AAG CGC TAT GAT GAA GAC ACC ATT GAA AGC GTT 348 Phe Asp Ser Lys He Lys Arg Tyr Asp Glu Asp Thr He Glu Ser Val 80 85 90 95
GAG TCT CCT AAG GCC AAA CGG CAG CAG GAT TTG TAT TTC TTC CCT AAT 396 Glu Ser Pro Lys Ala Lys Arg Gin Gin Asp Leu Tyr Phe Phe Pro Asn 100 105 110
GGG GTT ACT TAT AAA AGA AGC GAT GAT TCC AGT TTT TGG AGT GAA ACA 444 Gly Val Thr Tyr Lys Arg Ser Asp Asp Ser Ser Phe Trp Ser Glu Thr 115 120 125
GGG ATT TAT AAC CAT AAG GAG CAA AAT TTT AAA GGC AAG GGC CGT TTC 492 Gly He Tyr Asn His Lys Glu Gin Asn Phe Lys Gly Lys Gly Arg Phe 130 135 " 140
ATT CTC ACT TCA AAG GAC AGC AAG ATT GAA GGG CTT GAC ATT TCT TAT 540 He Leu Thr Ser Lys Asp Ser Lys He Glu Gly Leu Asp He Ser Tyr 145 150 155
TCG CAT GCA TTA GCT ATT ATT GAA GCT CAA AGC ATT CAA GCG CAT TTA 588 Ser His Ala Leu Ala He He Glu Ala Gin Ser He Gin Ala His Leu 160 165 170 175
TTC TTA GAT GAA ATC AAA CAA AGC CAA AAA GAA AAG AAA AAA TTC CCC 636 Phe Leu Asp Glu He Lys Gin Ser Gin Lys Glu Lys Lys Lys Phe Pro 180 185 190
ACT TTC AAA GGA GGT TTT TAATGCGTTG GTGGTGTTTT TTGGTGTGTT GTTTTGGT 692 Thr Phe Lys Gly Gly Phe 195
ATTTTAAGCG TGAT 706
(2) INFORMATION FOR SEQ ID NO: 98:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 197 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:
Met Lys Arg Ser Ser Phe Thr Ser Asn Ser Val Leu Asn Phe Phe Val
1 5 10 15
Val Leu Ser Phe He Thr He Gly Leu Val Phe Phe Phe Leu Arg Ser
20 25 30
Gin Pro Thr Ser Val Val Ser Lys Glu Asn He Pro Lys He Glu Leu
35 40 45
Glu Asn Phe Lys Ala Phe Gin He Asn Asp Lys He Leu Asp Leu Ser
50 55 60
He Glu Gly Lys Lys Ala Leu Gin Tyr Asp Asp His Glu He Phe Phe 65 70 75 80 Asp Ser Lys He Lys Arg Tyr Asp Glu Asp Thr He Glu Ser Val Glu
85 90 95
Ser Pro Lys Ala Lys Arg Gin Gin Asp Leu Tyr Phe Phe Pro Asn Gly
100 105 110
Val Thr Tyr Lys Arg Ser Asp Asp Ser Ser Phe Trp Ser Glu Thr Gly
115 120 125
He Tyr Asn His Lys Glu Gin Asn Phe Lys Gly Lys Gly Arg Phe He
130 135 140
Leu Thr Ser Lys Asp Ser Lys He Glu Gly Leu Asp He Ser Tyr Ser 145 150 155 160
His Ala Leu Ala He He Glu Ala Gin Ser He Gin Ala His Leu Phe
165 170 175
Leu Asp Glu He Lys Gin Ser Gin Lys Glu Lys Lys Lys Phe Pro Thr
180 185 190
Phe Lys Gly Gly Phe 195
(2) INFORMATION FOR SEQ ID NO: 99:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1010 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 130...957 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99:
AAGCGAGCAA GAATGAATTA AAAATTTTTG GTTGGCACTA CATCATAGAA ACAGGCAGGA 60 TTTATAATTA TAATTTTGAA AGCCATTTTT TTGAGCCGAT TGGAGAAACC ATTAAACAAA 120 GGAAAAGTC ATG AAA ACT TCT AAA ACA AAA ACC CCT AAA TCC GTT TTA ATC 171 Met Lys Thr Ser Lys Thr Lys Thr Pro Lys Ser Val Leu He 1 5 10
GCT GGG CCA TGC GTC ATT GAG AGC TTA GAA AAT CTA AGA AGT ATC GCC 219 Ala Gly Pro Cys Val He Glu Ser Leu Glu Asn Leu Arg Ser He Ala 15 20 25 30
ACT AAA TTG CAA CCC CTA GCC AAC AAC GAG CGG TTG GAT TTT TAT TTT 267 Thr Lys Leu Gin Pro Leu Ala Asn Asn Glu Arg Leu Asp Phe Tyr Phe 35 40 45
AAA GCG AGT TTT GAT AAG GCG AAC CGC ACG AGT TTA GAG AGT TAC AGA 315 Lys Ala Ser Phe Asp Lys Ala Asn Arg Thr Ser Leu Glu Ser Tyr Arg 50 55 60 GGG CCT GGT TTA GAA AAA GGC CTA GAA ATG TTA CAA ACG ATC AAA GAG 363 Gly Pro Gly Leu Glu Lys Gly Leu Glu Met Leu Gin Thr He Lys Glu 65 70 75
GAA TTT GGT TAT AAA ATC TTA ACC GAT GTG CAT GAG AGT TAT CAA GCA 411 Glu Phe Gly Tyr Lys He Leu Thr Asp Val His Glu Ser Tyr Gin Ala 80 85 90
AGC GTG GCA GCC AAA GTG GCG GAT ATT TTA CAA ATC CCG GCG TTT TTG 459 Ser Val Ala Ala Lys Val Ala Asp He Leu Gin He Pro Ala Phe Leu 95 100 105 110
TGC CGC CAA ACG GAT CTG ATT GTA GAA GTG AGC CAG ACT AAC GCT ATT 507 Cys Arg Gin Thr Asp Leu He Val Glu Val Ser Gin Thr Asn Ala He 115 120 125
GTC AAT ATC AAA AAA GGG CAA TTC ATG AAC CCA AAA GAC ATG CAA TAT 555 Val Asn He Lys Lys Gly Gin Phe Met Asn Pro Lys Asp Met Gin Tyr 130 135 140
TCT GTT CTA AAG GCC CTT AAA ACG AGA GAT AAA AGC ATT CAA AGC CCC 603 Ser Val Leu Lys Ala Leu Lys Thr Arg Asp Lys Ser He Gin Ser Pro 145 150 155
ACT TAT GAA ACA GCG TTA AAA AAT GGC GTG TGG CTG TGT GAA AGG GGG 651 Thr Tyr Glu Thr Ala Leu Lys Asn Gly Val Trp Leu Cys Glu Arg Gly 160 165 170
AGC AGC TTT GGG TAT GGG AAT TTA GTG GTG GAT ATG CGC TCT TTA AAA 699 Ser Ser Phe Gly Tyr Gly Asn Leu Val Val Asp Met Arg Ser Leu Lys 175 180 185 190
ATC ATG CGA GAA TTT GCC CCT GTG ATT TTT GAC GCT ACC CAT AGC GTG 747 He Met Arg Glu Phe Ala Pro Val He Phe Asp Ala Thr His Ser Val 195 200 205
CAA ATG CCA GGG GGA GCG AAC GGG AAA AGT TCA GGA GAC AGC TCT TTT 795 Gin Met Pro Gly Gly Ala Asn Gly Lys Ser Ser Gly Asp Ser Ser Phe 210 215 220
GCC CCT ATT TTA GCG AGA GCT GCG GCG GCG GTG GGG ATT GAT GGG TTG 843 Ala Pro He Leu Ala Arg Ala Ala Ala Ala Val Gly He Asp Gly Leu 225 230 235
TTT GCT GAA ACG CAT GTT GAT CCT AAA AAC GCC CTA AGC GAT GGA GCA 891 Phe Ala Glu Thr His Val Asp Pro Lys Asn Ala Leu Ser Asp Gly Ala 240 245 250
AAC ATG CTA AAA CCT GAC GAG CTA GAA CAA TTA GTA ACC GAC ATG TTA 939 Asn Met Leu Lys Pro Asp Glu Leu Glu Gin Leu Val Thr Asp Met Leu 255 260 265 270
AAA ATC CAA AAT TTA TTT TAAAGGAATT TCATGCAAAT CATAGAAGGG AAATTGCA 995 Lys He Gin Asn Leu Phe 275 ATTACAAGGG AATGA 1010
(2) INFORMATION FOR SEQ ID NO: 100:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 276 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:100:
Met Lys Thr Ser Lys Thr Lys Thr Pro Lys Ser Val Leu He Ala Gly
1 5 10 15
Pro Cys Val He Glu Ser Leu Glu Asn Leu Arg Ser He Ala Thr Lys
20 25 30
Leu Gin Pro Leu Ala Asn Asn Glu Arg Leu Asp Phe Tyr Phe Lys Ala
35 40 45
Ser Phe Asp Lys Ala Asn Arg Thr Ser Leu Glu Ser Tyr Arg Gly Pro
50 55 60
Gly Leu Glu Lys Gly Leu Glu Met Leu Gin Thr He Lys Glu Glu Phe 65 70 75 80
Gly Tyr Lys He Leu Thr Asp Val His Glu Ser Tyr Gin Ala Ser Val
85 90 95
Ala Ala Lys Val Ala Asp He Leu Gin He Pro Ala Phe Leu Cys Arg
100 105 110
Gin Thr Asp Leu He Val Glu Val Ser Gin Thr Asn Ala He Val Asn
115 120 125
He Lys Lys Gly Gin Phe Met Asn Pro Lys Asp Met Gin Tyr Ser Val
130 135 140
Leu Lys Ala Leu Lys Thr Arg Asp Lys Ser He Gin Ser Pro Thr Tyr 145 150 155 160
Glu Thr Ala Leu Lys Asn Gly Val Trp Leu Cys Glu Arg Gly Ser Ser
165 170 175
Phe Gly Tyr Gly Asn Leu Val Val Asp Met Arg Ser Leu Lys He Met
180 185 190
Arg Glu Phe Ala Pro Val He Phe Asp Ala Thr His Ser Val Gin Met
195 200 205
Pro Gly Gly Ala Asn Gly Lys Ser Ser Gly Asp Ser Ser Phe Ala Pro
210 215 220
He Leu Ala Arg Ala Ala Ala Ala Val Gly He Asp Gly Leu Phe Ala 225 230 235 240
Glu Thr His Val Asp Pro Lys Asn Ala Leu Ser Asp Gly Ala Asn Met
245 250 255
Leu Lys Pro Asp Glu Leu Glu Gin Leu Val Thr Asp Met Leu Lys He
260 265 270
Gin Asn Leu Phe 275
(2) INFORMATION FOR SEQ ID NO: 101:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 240 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 59...196 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: AACAATTCTT TTTTAAGCAA AAACAAAACA AAATTAAGGC ATAATCACTC TTTTTAAA 58
ATG AAA GGT CGC GTA GCT CAG TTG GTA GAG CAC TAC CTT GAC. ATG GTA 106 Met Lys Gly Arg Val Ala Gin Leu Val Glu His Tyr Leu Asp Met Val 1 5 10 15
GTG GCC GCT GGT TCA AGT CCA GTC GTG GCC ACC ATT ATC ACT CCA ATT 154 Val Ala Ala Gly Ser Ser Pro Val Val Ala Thr He He Thr Pro He 20 25 30
TTA ATT CTC ATT TTT TTG CGA GTT TTT GAT CTT TAT AAA TTC TAAAGGGGTA 206 Leu He Leu He Phe Leu Arg Val Phe Asp Leu Tyr Lys Phe 35 40 45
TTAAACGCAC TTCTAATAAC GATTTTATAG CGCT 240
(2) INFORMATION FOR SEQ ID NO: 102:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 46 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:102:
Met Lys Gly Arg Val Ala Gin Leu Val Glu His Tyr Leu Asp Met Val
1 5 10 15
Val Ala Ala Gly Ser Ser Pro Val Val Ala Thr He He Thr Pro He
20 25 30
Leu He Leu He Phe Leu Arg Val Phe Asp Leu Tyr Lys Phe 35 40 45
(2 ) INFORMATION FOR SEQ ID NO : 103 : (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1382 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 91...1329 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103:
ACCACCCCTT AATCTCAAAA AACCCCAATC ATAAAAAGCT TTATGCTACA ATGAAAGCTC 60 TTTAACACGA TAAAAGGGCG GTTTAATAGC ATG GCA CAA GAA AAA GCA GTT CCA 114
Met Ala Gin Glu Lys Ala Val Pro 1 5
AGA GAT CCT AAA AAA CTC AAT GCG TTT GAT TTG CGT TGG ATG GTG TCC 162 Arg Asp Pro Lys Lys Leu Asn Ala Phe Asp Leu Arg Trp Met Val Ser 10 15 20
TTA TTT GGC ACG GCG GTG GGG GCT GGG ATT TTA TTT TTG CCT ATT AGA 210 Leu Phe Gly Thr Ala Val Gly Ala Gly He Leu Phe Leu Pro He Arg 25 30 35 40
GCC GGT GGG CAT GGG GTA TGG GCT ATT GTG GTA ATG AGC GCG ATC ATT 258 Ala Gly Gly His Gly Val Trp Ala He Val Val Met Ser Ala He He 45 50 55
TTC CCT TTA ACT TAT CTA GGG CAT AGA GCT TTA GCT TAT TTC ATA GGA 306 Phe Pro Leu Thr Tyr Leu Gly His Arg Ala Leu Ala Tyr Phe He Gly 60 65 70
TCT AAA GAC AAA GAA GAC ATT ACC ATG GTC GTT CGC TCT CAT TTT GGC 354 Ser Lys Asp Lys Glu Asp He Thr Met Val Val Arg Ser His Phe Gly 75 80 85
GCT CAA TGG GGT TTT CTT ATC ACT TTG CTT TAT TTC TTA GCG ATT TAT 402 Ala Gin Trp Gly Phe Leu He Thr Leu Leu Tyr Phe Leu Ala He Tyr 90 95 100
CCT ATT TGC TTG GTT TAT GGG GTG GGT ATC ACT AAC GTG TTT GAT CAT 450 Pro He Cys Leu Val Tyr Gly Val Gly He Thr Asn Val Phe Asp His 105 110 115 120
TTT TTC ACT AAC CAG TTG CAT TTA GCG CCT TTT CAT CGG GGA TTA TTG 498 Phe Phe Thr Asn Gin Leu His Leu Ala Pro Phe His Arg Gly Leu Leu 125 130 135
GCT GTA GCG TTA GTT TCT TTA ATG ATG TTG GTG ATG GTT TTT AAC GCT 546 Ala Val Ala Leu Val Ser Leu Met Met Leu Val Met Val Phe Asn Ala 140 145 150
ACG ATT GTT ACG CGC ATT TGT AAC GCT TTA GTG TAT CCT TTA TGC TTG 594 Thr He Val Thr Arg He Cys Asn Ala Leu Val Tyr Pro Leu Cys Leu 155 160 165
ATT TTA TTG CTT TTT TCT TTG TAT CTT ATC CCT TAT TGG CAA GGC GCT 642 He Leu Leu Leu Phe Ser Leu Tyr Leu He Pro Tyr Trp Gin Gly Ala 170 175 180
AAT CTT TTT GTG GTG CCG AGT TTT AAA GAA TTT GTG TTA GCG ATT TGG 690 Asn Leu Phe Val Val Pro Ser Phe Lys Glu Phe Val Leu Ala He Trp 185 190 195 200
CTA ACC TTA CCG GTG CTT GTG TTT GCA TTC GAC CAT AGC CCC ATC ATT 738 Leu Thr Leu Pro Val Leu Val Phe Ala Phe Asp His Ser Pro He He 205 210 215
TCA ACC TTC ACT CAA AAT GTG GGA AAA GAA TAC GGC GTT TTC AAA GAA 786 Ser Thr Phe Thr Gin Asn Val Gly Lys Glu Tyr Gly Val Phe Lys Glu 220 225 230
TAC AAA CTC AAT CAA ATT GAA TTA GGG ACA TCG CTG ATG CTT TTA GGG 834 Tyr Lys Leu Asn Gin He Glu Leu Gly Thr Ser Leu Met Leu Leu Gly 235 240 245
TTT GTG ATG TTT TTT GTG TTT TCG TGC GTC ATG TGC TTG AAT GCT GAT 882 Phe Val Met Phe Phe Val Phe Ser Cys Val Met Cys Leu Asn Ala Asp 250 255 260
GAT TTT GTG AAA GCA AGG GAA CAA AAT ATC CCC ATT TTA AGC TAT TTG 930 Asp Phe Val Lys Ala Arg Glu Gin Asn He Pro He Leu Ser Tyr Leu 265 270 275 280
GCT AAC ACT TTA AAC AAC CCT TTA ATC AAT TAT GCG GGG CCT GTG GTG 978 Ala Asn Thr Leu Asn Asn Pro Leu He Asn Tyr Ala Gly Pro Val Val 285 290 295
GCT TTT TTA GCG ATT TTT TCA TCT TTT TTT GGG CAT TAT TAT GGG GCT 1026 Ala Phe Leu Ala He Phe Ser Ser Phe Phe Gly His Tyr Tyr Gly Ala 300 305 310
AAG GAG GGT TTA GAA GGC ATT ATT ATT CAA AGC TTA AAA TTG AAA AAA 1074 Lys Glu Gly Leu Glu Gly He He He Gin Ser Leu Lys Leu Lys Lys 315 320 325
GCT TCT AAA CCC TTG AGC GTT AGC GTA ACG ATT TTT TTA TGG CTG ACT 1122 Ala Ser Lys Pro Leu Ser Val Ser Val Thr He Phe Leu Trp Leu Thr 330 335 340
ATC ACG CTT GTG GCT TAT ATT AAC CCC AAT ATC TTG GAT TTT ATT GAA 1170 He Thr Leu Val Ala Tyr He Asn Pro Asn He Leu Asp Phe He Glu 345 350 355 360 AAT TTA GGC GGC CCC ATT ATC GCG CTC ATT CTG TTT GTG ATG CCC ATG 1218 Asn Leu Gly Gly Pro He He Ala Leu He Leu Phe Val Met Pro Met 365 370 375
ATA GCT TTT TAT AGT GTT TCT AGT TTG AAG CGT TTT AGA AAT TTC AAA 1266 He Ala Phe Tyr Ser Val Ser Ser Leu Lys Arg Phe Arg Asn Phe Lys 380 385 390
GTG GAT ATT TTT GTG TTT GTC TTT GGG AGC TTG ACG GCT TTG AGC GTG 1314 Val Asp He Phe Val Phe Val Phe Gly Ser Leu Thr Ala Leu Ser Val 395 400 405
TTT TTA GGA CTA TTT TAATGGCTAG TTTTTCTATT TTATCTATTT TTAAAATCGG C 1370 Phe Leu Gly Leu Phe 410
GTGGGGCCTA GC 1382
(2) INFORMATION FOR SEQ ID NO: 104:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 413 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104:
Met Ala Gin Glu Lys Ala Val Pro Arg Asp Pro Lys Lys Leu Asn Ala
1 5 10 15
Phe Asp Leu Arg Trp Met Val Ser Leu Phe Gly Thr Ala Val Gly Ala
20 25 30
Gly He Leu Phe Leu Pro He Arg Ala Gly Gly His Gly Val Trp Ala
35 40 45
He Val Val Met Ser Ala He He Phe Pro Leu Thr Tyr Leu Gly His
50 55 60
Arg Ala Leu Ala Tyr Phe He Gly Ser Lys Asp Lys Glu Asp He Thr 65 70 75 80
Met Val Val Arg Ser His Phe Gly Ala Gin Trp Gly Phe Leu He Thr
85 90 95
Leu Leu Tyr Phe Leu Ala He Tyr Pro He Cys Leu Val Tyr Gly Val
100 105 110
Gly He Thr Asn Val Phe Asp His Phe Phe Thr Asn Gin Leu His Leu
115 120 125
Ala Pro Phe His Arg Gly Leu Leu Ala Val Ala Leu Val Ser Leu Met
130 135 140
Met Leu Val Met Val Phe Asn Ala Thr He Val Thr Arg He Cys Asn 145 150 155 160
Ala Leu Val Tyr Pro Leu Cys Leu He Leu Leu Leu Phe Ser Leu Tyr
165 170 175
Leu He Pro Tyr Trp Gin Gly Ala Asn Leu Phe Val Val Pro Ser Phe 180 185 190 Lys Glu Phe Val Leu Ala He Trp Leu Thr Leu Pro Val Leu Val Phe
195 200 205
Ala Phe Asp His Ser Pro He He Ser Thr Phe Thr Gin Asn Val Gly
210 215 220
Lys Glu Tyr Gly Val Phe Lys Glu Tyr Lys Leu Asn Gin He Glu Leu 225 230 235 240
Gly Thr Ser Leu Met Leu Leu Gly Phe Val Met Phe Phe Val Phe Ser
245 250 255
Cys Val Met Cys Leu Asn Ala Asp Asp Phe Val Lys Ala Arg Glu Gin
260 265 270
Asn He Pro He Leu Ser Tyr Leu Ala Asn Thr Leu Asn Asn Pro Leu
275 280 285
He Asn Tyr Ala Gly Pro Val Val Ala Phe Leu Ala He Phe Ser Ser
290 295 300
Phe Phe Gly His Tyr Tyr Gly Ala Lys Glu Gly Leu Glu Gly He He 305 310 315 320
He Gin Ser Leu Lys Leu Lys Lys Ala Ser Lys Pro Leu Ser Val Ser
325 330 335
Val Thr He Phe Leu Trp Leu Thr He Thr Leu Val Ala Tyr He Asn
340 345 350
Pro Asn He Leu Asp Phe He Glu Asn Leu Gly Gly Pro He He Ala
355 360 365
Leu He Leu Phe Val Met Pro Met He Ala Phe Tyr Ser Val Ser Ser
370 375 380
Leu Lys Arg Phe Arg Asn Phe Lys Val Asp He Phe Val Phe Val Phe 385 390 395 400
Gly Ser Leu Thr Ala Leu Ser Val Phe Leu Gly Leu Phe 405 410
(2) INFORMATION FOR SEQ ID NO: 105;
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 875 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 63...827 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105:
TTTGCCACCT TATTGCAAAC CCTAATGCTA ACGCACTACT TTTTTATCTT TAAAGAGAAA 60 GA ATG CTA GAT TTT ATT CAA GAG CTT AGC ACC CCC CAT GTT AGG GAT 107 Met Leu Asp Phe He Gin Glu Leu Ser Thr Pro His Val Arg Asp 1 5 10 15
TTT TTC TTG TTG TTT TTA AGG GTT AGC GGC GTG CTG TCT TTC TTC CCT 155 Phe Phe Leu Leu Phe Leu Arg Val Ser Gly Val Leu Ser Phe Phe Pro 20 25 30
TTT TTT GAA AAC CAT TTA GTG CCT TTG TCG GTG CGT GGG GCT TTG AGT 203 Phe Phe Glu Asn His Leu Val Pro Leu Ser Val Arg Gly Ala Leu Ser 35 40 45
TTG TAT GTG AGC GCG ATT TTT TAC CCC ACT TTA GAA TTT TCA AAC GCC 251 Leu Tyr Val Ser Ala He Phe Tyr Pro Thr Leu Glu Phe Ser Asn Ala 50 55 60
GCT TAC ACG CCA GAG GGT TTT ATC ATT GCT TGC TTG TGC GAA TTG TTT 299 Ala Tyr Thr Pro Glu Gly Phe He He Ala Cys Leu Cys Glu Leu Phe 65 70 75
TTA GGG GTG TGC GCG TCT GTC TTT TTA CAA ATC GTC TTT GCA AGC TTA 347 Leu Gly Val Cys Ala Ser Val Phe Leu Gin He Val Phe Ala Ser Leu 80 85 90 95
GTG TTT GCA ACC GAT AGC ATC AGC TTT TCT ATG GGG CTT ACG ATG GCG 395 Val Phe Ala Thr Asp Ser He Ser Phe Ser Met Gly Leu Thr Met Ala 100 105 110
AGC GCG TAT GAT CCT ATT TCA GGA TCG CAA AAA CCC ATT GTG GGG CAA 443 Ser Ala Tyr Asp Pro He Ser Gly Ser Gin Lys Pro He Val Gly Gin 115 120 125
GCC CTT TTA TTG TTA GCG ATT TTA ATT TTA TTG GAT TTA TCG TTC CAC 491 Ala Leu Leu Leu Leu Ala He Leu He Leu Leu Asp Leu Ser Phe His 130 135 140
CAT CAA ATC ATT TTG TTT GTG GAT CAC AGC TTA AAA GCC GTC CCT TTA 539 His Gin He He Leu Phe Val Asp His Ser Leu Lys Ala Val Pro Leu 145 150 155
GGG CAA TTT GTC TTT GAG CCA GCG TTG GCT AAA AAC ATC GTT AAA GCC 587 Gly Gin Phe Val Phe Glu Pro Ala Leu Ala Lys Asn He Val Lys Ala 160 165 170 175
TTT TCG CAC CTC TTT GTC ATA GGG TTT TCT ATG GCG TTC CCT ATT TTA 635 Phe Ser His Leu Phe Val He Gly Phe Ser Met Ala Phe Pro He Leu 180 185 190
TGC TTG GTG TTA TTG AGC GAT ATT ATT TTT GGC ATG ATC ATG AAA ACC 683 Cys Leu Val Leu Leu Ser Asp He He Phe Gly Met He Met Lys Thr 195 200 205
CAC CCT CAG TTC AAC CTG CTC GCT ATT GGG TTT CCG GTT AAA ATT GCG 731 His Pro Gin Phe Asn Leu Leu Ala He Gly Phe Pro Val Lys He Ala 210 215 220
ATC GGG TTT GTG GGC ATT ATC TTA ATC GCT TCG GCT ATC ATG GGG CGT 779 He Gly Phe Val Gly He He Leu He Ala Ser Ala He Met Gly Arg 225 230 235 TTT AAA GAA GAA ATC AGC CTG GCC TTT AGC GCC ATT AGC AAA ATC TTT T 828 Phe Lys Glu Glu He Ser Leu Ala Phe Ser Ala He Ser Lys He Phe 240 245 250 255
AAAGGATAAA CATGATTAGT TTTAAAGAAG CTCTAAAAAT CCATTCT 875
(2) INFORMATION FOR SEQ ID NO: 106:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 255 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:
Met Leu Asp Phe He Gin Glu Leu Ser Thr Pro His Val Arg Asp Phe
1 5 10 15
Phe Leu Leu Phe Leu Arg Val Ser Gly Val Leu Ser Phe Phe Pro Phe
20 25 30
Phe Glu Asn His Leu Val Pro Leu Ser Val Arg Gly Ala Leu Ser Leu
35 40 45
Tyr Val Ser Ala He Phe Tyr Pro Thr Leu Glu Phe Ser Asn Ala Ala
50 55 60
Tyr Thr Pro Glu Gly Phe He He Ala Cys Leu Cys Glu Leu Phe Leu 65 70 75 80
Gly Val Cys Ala Ser Val Phe Leu Gin He Val Phe Ala Ser Leu Val
85 90 95
Phe Ala Thr Asp Ser He Ser Phe Ser Met Gly Leu Thr Met Ala Ser
100 105 110
Ala Tyr Asp Pro He Ser Gly Ser Gin Lys Pro He Val Gly Gin Ala
115 120 125
Leu Leu Leu Leu Ala He Leu He Leu Leu Asp Leu Ser Phe His His
130 135 140
Gin He He Leu Phe Val Asp His Ser Leu Lys Ala Val Pro Leu Gly 145 150 155 160
Gin Phe Val Phe Glu Pro Ala Leu Ala Lys Asn He Val Lys Ala Phe
165 170 175
Ser His Leu Phe Val He Gly Phe Ser Met Ala Phe Pro He Leu Cys
180 185 190
Leu Val Leu Leu Ser Asp He He Phe Gly Met He Met Lys Thr His
195 200 205
Pro Gin Phe Asn Leu Leu Ala He Gly Phe Pro Val Lys He Ala He
210 215 220
Gly Phe Val Gly He He Leu He Ala Ser Ala He Met Gly Arg Phe 225 230 235 240
Lys Glu Glu He Ser Leu Ala Phe Ser Ala He Ser Lys He Phe 245 250 255
(2) INFORMATION FOR SEQ ID NO: 107: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1160 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 373...1110 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107:
GAGATATTAA AAAGAGATAT TAAAATGGCT TTTAAGCTTC TATGAAGCCC CCCCCCCTCC 60
TCCTTTTGCC CTTTATTTCG TGGGCAAATC GCCCACAGGA CAAGCGGCCA CACCCACGCT 120
TGGGCGGGTG ATTTTTTCTA CATTTTCTTG AGCTTTTTTG GGGTCATTCA CCCAAGTTTT 180
ATAGAACTCA AAAGGGCATT CCGCCACACC CTTATCCCCA TCGCCACTAG CAAAAACCCC 240
ACTATAACCT CTGTGCAAGA GTTTGAAGAG TTGGTTTTGG CTTTGGTCGT ATTTTCTCGC 300
ATCACGGATT TGTTGCACAG AAAGTTGAGC GTATTGAAGA CCATTTTCTT CTTCCCCGCA 360
TTCGCCCAAA GT ATG CCC GTC AAA ACC AAT AAT GCT AGA ATG CCC AAA ATA 411 Met Pro Val Lys Thr Asn Asn Ala Arg Met Pro Lys He 1 5 10
GGA ATA CAC CCC ATC AAA ACC GGT CGC ATT CGC TAC CGC TAC ATA ACA 459 Gly He His Pro He Lys Thr Gly Arg He Arg Tyr Arg Tyr He Thr 15 20 25
TTG ATT GGC CCA CGC CAT AGC TTT TAC TAT TGC AAT TTG TTG CTC CTT 507 Leu He Gly Pro Arg His Ser Phe Tyr Tyr Cys Asn Leu Leu Leu Leu 30 35 40 45
AGC CGG ATA CAT GTA ACC TTG ACA GCG CAC AAT GAG TTC TGC CCC ACG 555 Ser Arg He His Val Thr Leu Thr Ala His Asn Glu Phe Cys Pro Thr 50 55 60
CAT CGC GCA ATC GCG CCA AAT TTC AGG GTA GTT TCC ATC ATC GCA AAT 603 His Arg Ala He Ala Pro Asn Phe Arg Val Val Ser He He Ala Asn 65 70 75
AAT CAA AGA AAC TTT CAA GCC CTT AGG CCC ATC AAC CAC ATA AGT TTT 651 Asn Gin Arg Asn Phe Gin Ala Leu Arg Pro He Asn His He Ser Phe 80 85 90
ATC CCC AGG ATA CCA ACA TTC AAT AGG GCA CCA AGG CAA GAT TTT GCG 699 He Pro Arg He Pro Thr Phe Asn Arg Ala Pro Arg Gin Asp Phe Ala 95 100 105
GTA TTT TTG CAC GAT CTC ACC CTT ATC ATT GAC AAG AAT CAA AGT GTT 747 Val Phe Leu His Asp Leu Thr Leu He He Asp Lys Asn Gin Ser Val 110 115 120 125 ATA GGG ATT CTT TTT GGC TTG CTC GTG TTT TTC CCC TGT CAA AGA GAA 795 He Gly He Leu Phe Gly Leu Leu Val Phe Phe Pro Cys Gin Arg Glu 130 135 140
CAC TCC CCA AAC CTT GTT TTT CTT ACA AGC TTC AGC AAA GAT CGC GGT 843 His Ser Pro Asn Leu Val Phe Leu Thr Ser Phe Ser Lys Asp Arg Gly 145 150 155
TTC TTC TCC AGG AAC GCT TGC GGC TGT ATC AAA CAT TTC TTG TCT GTC 891 Phe Phe Ser Arg Asn Ala Cys Gly Cys He Lys His Phe Leu Ser Val 160 165 170
ATA CAT AAT CCC ATG CGT GCT GTA TTC AGG GAA AAT AAT CAG ATC CAA 939 He His Asn Pro Met Arg Ala Val Phe Arg Glu Asn Asn Gin He Gin 175 180 185
CCC AGG CAA ACC CTG TTT GAC CCC ACC AAT CAC CTT AGC GAT ATT GCG 987 Pro Arg Gin Thr Leu Phe Asp Pro Thr Asn His Leu Ser Asp He Ala 190 195 200 205
ACA ATT TTC CAA CAC CTC ATT CTT AGT GTG GAG TCT AGG CAT CTT ATA 1035 Thr He Phe Gin His Leu He Leu Ser Val Glu Ser Arg His Leu He 210 215 220
ATT AAC TAC CGC TAC ACC CAC AGT ATC TGG GCT GCT ACT AAT ATC TCC 1083 He Asn Tyr Arg Tyr Thr His Ser He Trp Ala Ala Thr Asn He Ser 225 230 235
ATG TCT CAT ATT ATG TTC CTT GTT TTT TGATGAGAGT TCCTACAAAC CCTCTAC 1137 Met Ser His He Met Phe Leu Val Phe 240 245
TTGAATTTAT AAAATAATTG TGT 1160
(2) INFORMATION FOR SEQ ID NO: 108:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 246 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:
Met Pro Val Lys Thr Asn Asn Ala Arg Met Pro Lys He Gly He His
1 5 10 15
Pro He Lys Thr Gly Arg He Arg Tyr Arg Tyr He Thr Leu He Gly
20 25 30
Pro Arg His Ser Phe Tyr Tyr Cys Asn Leu Leu Leu Leu Ser Arg He
35 40 45
His Val Thr Leu Thr Ala His Asn Glu Phe Cys Pro Thr His Arg Ala 50 55 60
He Ala Pro Asn Phe Arg Val Val Ser He He Ala Asn Asn Gin Arg 65 70 75 80
Asn Phe Gin Ala Leu Arg Pro He Asn His He Ser Phe He Pro Arg
85 90 95
He Pro Thr Phe Asn Arg Ala Pro Arg Gin Asp Phe Ala Val Phe Leu
100 105 110
His Asp Leu Thr Leu He He Asp Lys Asn Gin Ser Val He Gly He
115 120 125
Leu Phe Gly Leu Leu Val Phe Phe Pro Cys Gin Arg Glu His Ser Pro
130 135 140
Asn Leu Val Phe Leu Thr Ser Phe Ser Lys Asp Arg Gly Phe Phe Ser 145 150 155 160
Arg Asn Ala Cys Gly Cys He Lys His Phe Leu Ser Val He His Asn
165 170 175
Pro Met Arg Ala Val Phe Arg Glu Asn Asn Gin He Gin Pro Arg Gin
180 185 190
Thr Leu Phe Asp Pro Thr Asn His Leu Ser Asp He Ala Thr He Phe
195 200 205
Gin His Leu He Leu Ser Val Glu Ser Arg His Leu He He Asn Tyr
210 215 220
Arg Tyr Thr His Ser He Trp Ala Ala Thr Asn He Ser Met Ser His 225 230 235 240
He Met Phe Leu Val Phe 245
(2) INFORMATION FOR SEQ ID NO: 109:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1661 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 79...1611 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109:
GGCTTTATAA AAAATGTTAG AAACCCTTAC AAAACAAGCT AATATATTCT ATTCAATTTG 60
CCTCAAGGAC AAACAAAC ATG AAA AAA CTT CTT TAT ACC ATA CTC GCG CTT 111
Met Lys Lys Leu Leu Tyr Thr He Leu Ala Leu 1 5 10
CTT TTA ATC GGC CTT TTA ACA ATC TAT CTC ATC CTT TTT ACA GAA TGG 159 Leu Leu He Gly Leu Leu Thr He Tyr Leu He Leu Phe Thr Glu Trp 15 20 25 GGG AAT AAG ATC ATC GCT TCG TAT ATA GAG AAA AAA ATC AAC CCG AAC 207 Gly Asn Lys He He Ala Ser Tyr He Glu Lys Lys He Asn Pro Asn 30 35 40
GAG CAC TAC TTG AGC GTT AAA ACC TTT AAA TTG AGA TTC AAC TCT TTG 255 Glu His Tyr Leu Ser Val Lys Thr Phe Lys Leu Arg Phe Asn Ser Leu 45 50 55
GAT TTT AAA GCT CAA GCC AAC GAT GAT TCC ACG CTC ATT CTT AAG GGG 303 Asp Phe Lys Ala Gin Ala Asn Asp Asp Ser Thr Leu He Leu Lys Gly 60 65 70 75
GAT TTT TCA CTT TTA AAG CAA AGC GTA AAT TTG AAT TAC CAT ATA GAT 351 Asp Phe Ser Leu Leu Lys Gin Ser Val Asn Leu Asn Tyr His He Asp 80 85 90
ATT AAA GAT TTA CGC TCT TTC AAA GAA TGG ATA CCC TAC CCT TTA AGG 399 He Lys Asp Leu Arg Ser Phe Lys Glu Trp He Pro Tyr Pro Leu Arg 95 100 105
GGG GCT GTT ATC ACT TCT GGG AAT ATT AAA GGG CAT AGA AAA GCC CTT 447 Gly Ala Val He Thr Ser Gly Asn He Lys Gly His Arg Lys Ala Leu 110 115 120
ATG ATT CAA GGC GTC TCT AAT GTG GCT CAA TCC CAC ACT GCC TAC AAT 495 Met He Gin Gly Val Ser Asn Val Ala Gin Ser His Thr Ala Tyr Asn 125 130 135
GCC CTT TTA GAT GAT TTC AAG CTT TCT CGC TTA AAT TTG AAC GCA CAA 543 Ala Leu Leu Asp Asp Phe Lys Leu Ser Arg Leu Asn Leu Asn Ala Gin 140 145 150 155
GAC GCC AAT TTA GAA GAT TTG CTT TAT TTA ATC AAT CGC CCC GCT TAT 591 Asp Ala Asn Leu Glu Asp Leu Leu Tyr Leu He Asn Arg Pro Ala Tyr 160 165 170
GCG AAC GCA AAA GTG TCC TTA CAG GCG GAT TTT AAC TCT CTA AAG CCT 639 Ala Asn Ala Lys Val Ser Leu Gin Ala Asp Phe Asn Ser Leu Lys Pro 175 180 185
TTA GAG GGG CAT TTG ATC CTA ACA GCT AAT AAC GCT TTA ATC AAT AAC 687 Leu Glu Gly His Leu He Leu Thr Ala Asn Asn Ala Leu He Asn Asn 190 195 200
GCC CTA ATC AAT CAA ATT TTT CAT TTA AAC CTT AAA GAC ACG CTT GTT 735 Ala Leu He Asn Gin He Phe His Leu Asn Leu Lys Asp Thr Leu Val 205 210 215
TTC AGC CTC TCG CAT TCA AGC GAC TTT AAA GGA AAC AAA GCC ATC AGC 783 Phe Ser Leu Ser His Ser Ser Asp Phe Lys Gly Asn Lys Ala He Ser 220 225 230 235
GAT ACC ACC CTG ACT AGC CCT TTA GCC AAT TTC AAA GCC CTA AAA AGC 831 Asp Thr Thr Leu Thr Ser Pro Leu Ala Asn Phe Lys Ala Leu Lys Ser 240 245 250 GAA TAC CTT TTC TCT ATT TTA AAA CTC AAC GCC CCC TAC ACT TTA GAA 879 Glu Tyr Leu Phe Ser He Leu Lys Leu Asn Ala Pro Tyr Thr Leu Glu 255 260 265
ATC CCC AAT CTA GCC AAA CTC TAT AAC ATT ACC AAC CAC CCC TTA AAA 927 He Pro Asn Leu Ala Lys Leu Tyr Asn He Thr Asn His Pro Leu Lys 270 275 280
GGG AGC TTG ACT TTA AAA GGC GCT ATA GAA CAA AGC CCC AAA CTT TTA 975 Gly Ser Leu Thr Leu Lys Gly Ala He Glu Gin Ser Pro Lys Leu Leu 285 290 295
AAA GTC AGC GGC CAT TCA AAT TTA CTA GAC GGC GCG CTG GAT TTC ACG 1023 Lys Val Ser Gly His Ser Asn Leu Leu Asp Gly Ala Leu Asp Phe Thr 300 305 310 315
CTT TTA AAT AAA GAT TTG AAA GGG CGT TTT TCC AAT ATT TCC ACT TTA 1071 Leu Leu Asn Lys Asp Leu Lys Gly Arg Phe Ser Asn He Ser Thr Leu 320 325 330
AAA GCT TTA GAT TTA TTC CAT TAC CCT AAG TTT TTC CAA TCC GTT GCA 1119 Lys Ala Leu Asp Leu Phe His Tyr Pro Lys Phe Phe Gin Ser Val Ala 335 340 345
GAC GCT AAT TTG GAT TAT GAT CTT ATC GCT AAG CAA GGC GTA TTG AAA 1167 Asp Ala Asn Leu Asp Tyr Asp Leu He Ala Lys Gin Gly Val Leu Lys 350 355 360
GCC CGC CTA AAA AAC GCA AGA TTC CTC AAA AAT GCA TTC AGC GAT TTT 1215 Ala Arg Leu Lys Asn Ala Arg Phe Leu Lys Asn Ala Phe Ser Asp Phe 365 370 375
CTC TAC TCC ATT TCT AAA TTT GAT ATT ACA AAA GAA ATT TAT AAC GAT 1263 Leu Tyr Ser He Ser Lys Phe Asp He Thr Lys Glu He Tyr Asn Asp 380 385 390 395
GCC AAT CTG GTA AGC CAA ATC AAC CAG CAA CGC CTG CTC TCT GAT CTG 1311 Ala Asn Leu Val Ser Gin He Asn Gin Gin Arg Leu Leu Ser Asp Leu 400 405 410
AGT TTA AAA AGC CCC AAA ACC CAA TTG AAA ATC CAT AAC GGT TTG TTG 1359 Ser Leu Lys Ser Pro Lys Thr Gin Leu Lys He His Asn Gly Leu Leu 415 420 425
GAT TTA AAC ACC AAA CAA ATG AAC ATG CTC ATG GAT GCG GAA ATT TTA 1407 Asp Leu Asn Thr Lys Gin Met Asn Met Leu Met Asp Ala Glu He Leu 430 435 440
AAA TTC ATT TTT AAA ATG AAA CTT CAA GGC AAC ATG CAC CAG CCA AAA 1455 Lys Phe He Phe Lys Met Lys Leu Gin Gly Asn Met His Gin Pro Lys 445 450 455
TTT TCT CTC ATT TTA AAC GAA AAA GCC ATT CAG CAA AAC TTG CAA CAA 1503 Phe Ser Leu He Leu Asn Glu Lys Ala He Gin Gin Asn Leu Gin Gin 460 465 470 475 GGC TTG AAA GAA ATC TTA AAA AAC GAC ACC CTT AAA AAA GGT TTA GAT 1551 Gly Leu Lys Glu He Leu Lys Asn Asp Thr Leu Lys Lys Gly Leu Asp 480 485 490
CAT TTG CTT AAA GAT GAT AAG CTC AAA GAA AAG CTT GAA AAA GGG CTT 1599 His Leu Leu Lys Asp Asp Lys Leu Lys Glu Lys Leu Glu Lys Gly Leu 495 500 505
AAG GGG CTT TTT TAAAATTTTA AAGGATAGAA ATGGCGCACA TTTTAGTTAG CGGGG 1656 Lys Gly Leu Phe 510
CGACT 1661
(2) INFORMATION FOR SEQ ID NO: 110:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 511 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:
Met Lys Lys Leu Leu Tyr Thr He Leu Ala Leu Leu Leu He Gly Leu
1 5 10 15
Leu Thr He Tyr Leu He Leu Phe Thr Glu Trp Gly Asn Lys He He
20 25 30
Ala Ser Tyr He Glu Lys Lys He Asn Pro Asn Glu His Tyr Leu Ser
35 40 45
Val Lys Thr Phe Lys Leu Arg Phe Asn Ser Leu Asp Phe Lys Ala Gin
50 55 60
Ala Asn Asp Asp Ser Thr Leu He Leu Lys Gly Asp Phe Ser Leu Leu 65 70 75 80
Lys Gin Ser Val Asn Leu Asn Tyr His He Asp He Lys Asp Leu Arg
85 90 95
Ser Phe Lys Glu Trp He Pro Tyr Pro Leu Arg Gly Ala Val He Thr
100 105 110
Ser Gly Asn He Lys Gly His Arg Lys Ala Leu Met He Gin Gly Val
115 120 125
Ser Asn Val Ala Gin Ser His Thr Ala Tyr Asn Ala Leu Leu Asp Asp
130 135 140
Phe Lys Leu Ser Arg Leu Asn Leu Asn Ala Gin Asp Ala Asn Leu Glu 145 150 155 160
Asp Leu Leu Tyr Leu He Asn Arg Pro Ala Tyr Ala Asn Ala Lys Val
165 170 175
Ser Leu Gin Ala Asp Phe Asn Ser Leu Lys Pro Leu Glu Gly His Leu
180 185 190
He Leu Thr Ala Asn Asn Ala Leu He Asn Asn Ala Leu He Asn Gin
195 200 205
He Phe His Leu Asn Leu Lys Asp Thr Leu Val Phe Ser Leu Ser His 210 215 220 Ser Ser Asp Phe Lys Gly Asn Lys Ala He Ser Asp Thr Thr Leu Thr 225 230 235 240
Ser Pro Leu Ala Asn Phe Lys Ala Leu Lys Ser Glu Tyr Leu Phe Ser
245 250 255
He Leu Lys Leu Asn Ala Pro Tyr Thr Leu Glu He Pro Asn Leu Ala
260 265 270
Lys Leu Tyr Asn He Thr Asn His Pro Leu Lys Gly Ser Leu Thr Leu
275 280 285
Lys Gly Ala He Glu Gin Ser Pro Lys Leu Leu Lys Val Ser Gly His
290 295 300
Ser Asn Leu Leu Asp Gly Ala Leu Asp Phe Thr Leu Leu Asn Lys Asp 305 310 315 320
Leu Lys Gly Arg Phe Ser Asn He Ser Thr Leu Lys Ala Leu Asp Leu
325 330 335
Phe His Tyr Pro Lys Phe Phe Gin Ser Val Ala Asp Ala Asn Leu Asp
340 345 350
Tyr Asp Leu He Ala Lys Gin Gly Val Leu Lys Ala Arg Leu Lys Asn
355 360 365
Ala Arg Phe Leu Lys Asn Ala Phe Ser Asp Phe Leu Tyr Ser He Ser
370 375 380
Lys Phe Asp He Thr Lys Glu He Tyr Asn Asp Ala Asn Leu Val Ser 385 390 395 400
Gin He Asn Gin Gin Arg Leu Leu Ser Asp Leu Ser Leu Lys Ser Pro
405 410 415
Lys Thr Gin Leu Lys He His Asn Gly Leu Leu Asp Leu Asn Thr Lys
420 425 430
Gin Met Asn Met Leu Met Asp Ala Glu He Leu Lys Phe He Phe Lys
435 440 445
Met Lys Leu Gin Gly Asn Met His Gin Pro Lys Phe Ser Leu He Leu
450 455 460
Asn Glu Lys Ala He Gin Gin Asn Leu Gin Gin Gly Leu Lys Glu He 465 470 475 480
Leu Lys Asn Asp Thr Leu Lys Lys Gly Leu Asp His Leu Leu Lys Asp
485 490 495
Asp Lys Leu Lys Glu Lys Leu Glu Lys Gly Leu Lys Gly Leu Phe 500 505 510
(2) INFORMATION FOR SEQ ID NO: 111:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 397 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 53...352 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:
CTAATTCTGT CTATTACACC AACAATCAAT CTCAAAACAA AGGACATGAAAG ATG AAA 58
Met Lys 1
ACA AAA CAT AAA GGA ATA AGA ATG TTT AAG CAA ATT CGT AGA ATG ATG 106 Thr Lys His Lys Gly He Arg Met Phe Lys Gin He Arg Arg Met Met 5 10 15
AGT TTG GCA ATA TTA ATG CCT AGT TTT TTA TTG GCG GCA CCA GAT TAC 154 Ser Leu Ala He Leu Met Pro Ser Phe Leu Leu Ala Ala Pro Asp Tyr 20 25 30
AAA CAA AAA TTC ACT CAA ATA TTG GAT TTC ATA AGC AAT GAC TTT ATC 202 Lys Gin Lys Phe Thr Gin He Leu Asp Phe He Ser Asn Asp Phe He 35 40 45 50
AAG GCT ATT GGT GGT CTA ATC ATT GTT GGG ACT TGC ATT TAC GCC TAT 250 Lys Ala He Gly Gly Leu He He Val Gly Thr Cys He Tyr Ala Tyr 55 60 65
AAA AAT TGG GAC AGG CTT GGA GAA ATT GGT TGG AAA TGC GTT GGG ATT 298 Lys Asn Trp Asp Arg Leu Gly Glu He Gly Trp Lys Cys Val Gly He 70 75 80
ATC ATT ATA ACC GCT GCT ATT TCT AAT GCT AAA ACT TTA AGT CAA TGG 346 He He He Thr Ala Ala He Ser Asn Ala Lys Thr Leu Ser Gin Trp 85 90 95
TTA TTT TAGATGGCAT TGCATATTGT TTGTGTTGAA AGTATCAACA TTAGA 397
Leu Phe 100
(2) INFORMATION FOR SEQ ID NO : 112 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 100 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112:
Met Lys Thr Lys His Lys Gly He Arg Met Phe Lys Gin He Arg Arg
1 5 10 15
Met Met Ser Leu Ala He Leu Met Pro Ser Phe Leu Leu Ala Ala Pro
20 25 30
Asp Tyr Lys Gin Lys Phe Thr Gin He Leu Asp Phe He Ser Asn Asp
35 40 45
Phe He Lys Ala He Gly Gly Leu He He Val Gly Thr Cys He Tyr 50 55 60
Ala Tyr Lys Asn Trp Asp Arg Leu Gly Glu He Gly Trp Lys Cys Val 65 70 75 80
Gly He He He He Thr Ala Ala He Ser Asn Ala Lys Thr Leu Ser
85 90 95
Gin Trp Leu Phe 100
(2) INFORMATION FOR SEQ ID NO: 113:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 367 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...318 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113:
CTTGCCAATC CCTTCAATAT CCACCGAATT GATGCCATGC TCAATTAAAA A ATG ATC 57
Met He 1
CAA AGC GAC GCT GTC TTT AAG ATA AAT TTC TGT CTT GCC CTT CTT GTA 105 Gin Ser Asp Ala Val Phe Lys He Asn Phe Cys Leu Ala Leu Leu Val 5 10 15
TTT GTA AAG AGG GGC TTG AGC GAT ATA AAC ATG CCC TTG TTC AAT CAG 153 Phe Val Lys Arg Gly Leu Ser Asp He Asn Met Pro Leu Phe Asn Gin 20 25 30
CGG GCG CAA ATA ACG ATA GAA AAA AGT CAT CAG CAA GGT TTG GAT ATG 201 Arg Ala Gin He Thr He Glu Lys Ser His Gin Gin Gly Leu Asp Met 35 40 45 50
GCT CCC ATC CAC ATC AGC ATC GGT CAT GAT AAT GAT TTT ATG ATA GCG 249 Ala Pro He His He Ser He Gly His Asp Asn Asp Phe Met He Ala 55 60 65
CAA TCT TTC TAT ATC AAA ACT CTC TTG AAT GCC GCA CCC AAA AGC CGT 297 Gin Ser Phe Tyr He Lys Thr Leu Leu Asn Ala Ala Pro Lys Ser Arg 70 75 80
GAT CAT GTT TTT AAT TTC TTC TGATTTTAGG ATTTTTGATA AATGGCTTTT TTCC 352 Asp His Val Phe Asn Phe Phe 85 ACATTTAAAA TCTTA 367
(2) INFORMATION FOR SEQ ID NO: 114:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 89 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:
Met He Gin Ser Asp Ala Val Phe Lys He Asn Phe Cys Leu Ala Leu
1 5 10 15
Leu Val Phe Val Lys Arg Gly Leu Ser Asp He Asn Met Pro Leu Phe
20 25 30
Asn Gin Arg Ala Gin He Thr He Glu Lys Ser His Gin Gin Gly Leu
35 40 45
Asp Met Ala Pro He His He Ser He Gly His Asp Asn Asp Phe Met
50 55 60
He Ala Gin Ser Phe Tyr He Lys Thr Leu Leu Asn Ala Ala Pro Lys 65 70 75 80
Ser Arg Asp His Val Phe Asn Phe Phe 85
(2) INFORMATION FOR SEQ ID NO: 115:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 386 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 54...344 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 115 :
TAAAAAGCAC TTTAGAGAGA TTTACGAAAG TGTTTTGAAG CGAAGAATGT CTG ATG 56
Met 1
ATT ATC TTT GGA AAA GAT TAC CTA TCT ACA GAC TTG CAA AAT AGC GCT 104 He He Phe Gly Lys Asp Tyr Leu Ser Thr Asp Leu Gin Asn Ser Ala 5 10 15 AAA GAT ATT CTT CTC ATC GCT TCG CAA ATT CTC AAA GAA AGA CTT TTT 152 Lys Asp He Leu Leu He Ala Ser Gin He Leu Lys Glu Arg Leu Phe 20 25 30
GCC CAC AAA AAT GAG ATT TTC TTT TGC CCT AGA AAT AGC TAC ATT CAA 200 Ala His Lys Asn Glu He Phe Phe Cys Pro Arg Asn Ser Tyr He Gin 35 40 45
GCG TTT AGA ATC TAT CAA GAA AGA AAG ATT ACC ATA AGT TTT CAC GGT 248 Ala Phe Arg He Tyr Gin Glu Arg Lys He Thr He Ser Phe His Gly 50 55 60 65
GGA ATA AAT AAT AAT ATC TGC CTT CTC GCC TTG AAA GGC ATC CAC AGT 296 Gly He Asn Asn Asn He Cys Leu Leu Ala Leu Lys Gly He His Ser 70 75 80
GTC TAT TTT GAG CTC ATC AAA ATT CTT GAA GCC GTA TTT TTC CAC TTC T 345 Val Tyr Phe Glu Leu He Lys He Leu Glu Ala Val Phe Phe His Phe 85 90 95
GATCGCAAGC ATCTTTTTTG GGCATTATAA GGTGTGATAA T 386
(2) INFORMATION FOR SEQ ID NO: 116:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 97 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116:
Met He He Phe Gly Lys Asp Tyr Leu Ser Thr Asp Leu Gin Asn Ser
1 5 10 15
Ala Lys Asp He Leu Leu He Ala Ser Gin He Leu Lys Glu Arg Leu
20 25 30
Phe Ala His Lys Asn Glu He Phe Phe Cys Pro Arg Asn Ser Tyr He
35 40 45
Gin Ala Phe Arg He Tyr Gin Glu Arg Lys He Thr He Ser Phe His
50 55 60
Gly Gly He Asn Asn Asn He Cys Leu Leu Ala Leu Lys Gly He His 65 70 75 80
Ser Val Tyr Phe Glu Leu He Lys He Leu Glu Ala Val Phe Phe His
85 90 95
Phe
(2) INFORMATION FOR SEQ ID NO: 117:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 569 base pairs
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 55...516 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117:
GCGGTTTGGA TCTTCATTAA AAATTCGTTG CTCACCCCTG GGTTATAAGC TTGA GCT 57
Ala 1
TCA GAA GTG GCC CCC TCA GAG GTT TTG TTG GAT TCT TCT TGC TTG TCT 105 Ser Glu Val Ala Pro Ser Glu Val Leu Leu Asp Ser Ser Cys Leu Ser 5 10 15
TTT TCT TTG ACT ATA TCC TTA GTT GTT ACT TGT TTA GGA GCG CTT TTT 153 Phe Ser Leu Thr He Ser Leu Val Val Thr Cys Leu Gly Ala Leu Phe 20 25 30
TCT TTA GCT TCC TCT TTA GCT TCT TCT TTT TTG GGC TCT TCT TTA GGC 201 Ser Leu Ala Ser Ser Leu Ala Ser Ser Phe Leu Gly Ser Ser Leu Gly 35 40 45
TCT TCT TTT TTA ACC TCT TCA ACT TTA GGC TCA GGC TTA GGC TCG GGT 249 Ser Ser Phe Leu Thr Ser Ser Thr Leu Gly Ser Gly Leu Gly Ser Gly 50 55 60 65
TTT GGT TCA GGT TTG GGT TCA GGC TTA GGT TTT GGT TTT GGC TTT GGC 297 Phe Gly Ser Gly Leu Gly Ser Gly Leu Gly Phe Gly Phe Gly Phe Gly 70 75 80
TTG GGT TTA GGC TTA GGT TTA GGC TTT GTA ACC TCC TTT TTG GGT TCT 345 Leu Gly Leu Gly Leu Gly Leu Gly Phe Val Thr Ser Phe Leu Gly Ser 85 90 95
TCT TTT TTT GGC TCT TCT TTC TTG GGT TTT TCT TTA GGC TCT TCT TTG 393 Ser Phe Phe Gly Ser Ser Phe Leu Gly Phe Ser Leu Gly Ser Ser Leu 100 105 110
GGT TTA GCC GAC TCA GCA TTA GTC TTT GTA TTG GAA TTA GTG TTG ATG 441 Gly Leu Ala Asp Ser Ala Leu Val Phe Val Leu Glu Leu Val Leu Met 115 120 125
CTG GCT AAA CTC ATG GTA ACC TTA GTG GTC CCG GCT TGC GCT AAA GGC 489 Leu Ala Lys Leu Met Val Thr Leu Val Val Pro Ala Cys Ala Lys Gly 130 135 140 145 TCT GGG GCG TCT TCG CGC AGT AAA AAA TAGCCAAACC CTATAGCGTA TAGGGCA 543 Ser Gly Ala Ser Ser Arg Ser Lys Lys 150
AAAGAGATTA AAAAGCTAAC ACTCGT 569
(2) INFORMATION FOR SEQ ID NO: 118:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 154 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118:
Ala Ser Glu Val Ala Pro Ser Glu Val Leu Leu Asp Ser Ser Cys Leu
1 5 10 15
Ser Phe Ser Leu Thr He Ser Leu Val Val Thr Cys Leu Gly Ala Leu
20 25 30
Phe Ser Leu Ala Ser Ser Leu Ala Ser Ser Phe Leu Gly Ser Ser Leu
35 40 45
Gly Ser Ser Phe Leu Thr Ser Ser Thr Leu Gly Ser Gly Leu Gly Ser
50 55 60
Gly Phe Gly Ser Gly Leu Gly Ser Gly Leu Gly Phe Gly Phe Gly Phe 65 70 75 80
Gly Leu Gly Leu Gly Leu Gly Leu Gly Phe Val Thr Ser Phe Leu Gly
85 90 95
Ser Ser Phe Phe Gly Ser Ser Phe Leu Gly Phe Ser Leu Gly Ser Ser
100 105 110
Leu Gly Leu Ala Asp Ser Ala Leu Val Phe Val Leu Glu Leu Val Leu
115 120 125
Met Leu Ala Lys Leu Met Val Thr Leu Val Val Pro Ala Cys Ala Lys
130 135 140
Gly Ser Gly Ala Ser Ser Arg Ser Lys Lys 145 150
(2) INFORMATION FOR SEQ ID NO: 119:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 359 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 77...310 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:
CCCCACAAAT CCTAGCGATA GTGAAATGCC CTAATTCATG GACAAAGATT AAAAACGCCA 60 GCATCAAAAC CGCTAC AAT GAA CAT CAT ACC CCT GCA GGC TCT TTG GTG TTA 112
Asn Glu His His Thr Pro Ala Gly Ser Leu Val Leu 1 5 10
GGA TCT TTT ATC ATC GGC TCT TTT AAA GGC GTG GGT GCT ATA GGG GGC 160 Gly Ser Phe He He Gly Ser Phe Lys Gly Val Gly Ala He Gly Gly 15 20 25
GTG GGT GCT GTG GTT TTT GGG ATT TCT TTA TTT TCT TTT GGG GGT TTT 208 Val Gly Ala Val Val Phe Gly He Ser Leu Phe Ser Phe Gly Gly Phe 30 35 40
TGC CAC AAC TCT GTC AAA GCC GCC GCT TTT TTA GGA TCC ATT TTG GCT 256 Cys His Asn Ser Val Lys Ala Ala Ala Phe Leu Gly Ser He Leu Ala 45 50 55 60
AAA ATT TTA CCG AGT TCT TGG GGT TTT AGC GCC ATT AAA ATT TCT AAT 304 Lys He Leu Pro Ser Ser Trp Gly Phe Ser Ala He Lys He Ser Asn 65 70 75
GCG TTT TGAGTGGGTA AATTTTCTAA AATCAGAGCC GATTTAGAAT CTTTCATTT 359 Ala Phe
(2) INFORMATION FOR SEQ ID NO:120:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 78 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:120:
Asn Glu His His Thr Pro Ala Gly Ser Leu Val Leu Gly Ser Phe He
1 5 10 15
He Gly Ser Phe Lys Gly Val Gly Ala He Gly Gly Val Gly Ala Val
20 25 30
Val Phe Gly He Ser Leu Phe Ser Phe Gly Gly Phe Cys His Asn Ser
35 40 45
Val Lys Ala Ala Ala Phe Leu Gly Ser He Leu Ala Lys He Leu Pro
50 55 60
Ser Ser Trp Gly Phe Ser Ala He Lys He Ser Asn Ala Phe 65 70 75
(2) INFORMATION FOR SEQ ID NO: 121: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1051 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...998 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121:
GGGTTTTGTG AATGACGACT AAAAGAGTGA ATACTGCCAC AAACAAGATA ATG ACA 56
Met Thr 1
TTA AAT ACT TTC TTG GAT ACA TGT TTT CTT TTA TTC ATC AGT ATT CTT 104 Leu Asn Thr Phe Leu Asp Thr Cys Phe Leu Leu Phe He Ser He Leu 5 10 15
TTT TAT TTA AGT ATA CCA ATT TAT CCT AAC AAA GTG GTG GTT GTC CCG 152 Phe Tyr Leu Ser He Pro He Tyr Pro Asn Lys Val Val Val Val Pro 20 25 30
CAA GGT TCG CTC AAA AAA GTG TTT TTT TCT TTA AAA GAG CAA GGC GTG 200 Gin Gly Ser Leu Lys Lys Val Phe Phe Ser Leu Lys Glu Gin Gly Val 35 40 45 50
GAT ATG AAC GCT TTG GAT TTG CTT TTT TTA CGC CTG ATG GGC ATG CCT 248 Asp Met Asn Ala Leu Asp Leu Leu Phe Leu Arg Leu Met Gly Met Pro 55 60 65
AAA AAA GGT TAT ATT GAT ATG GGC GAT GGG GCT TTA AGG AAG GGG GAT 296 Lys Lys Gly Tyr He Asp Met Gly Asp Gly Ala Leu Arg Lys Gly Asp 70 75 80
TTT TTA GTC CGT TTG ATT AAG GCA AAA GCG GCA CAA AAA AGT GCG ACT 344 Phe Leu Val Arg Leu He Lys Ala Lys Ala Ala Gin Lys Ser Ala Thr 85 90 95
CTA ATC CCT GGG GAA AGC CGC TAT TTT TTC ACG CAA ATT TTG AGC GAG 392 Leu He Pro Gly Glu Ser Arg Tyr Phe Phe Thr Gin He Leu Ser Glu 100 105 110
ACT TAC CAA CTA GAA ACA AGC GAT CTC AAT CAG GCT TAT GAA AGC ATC 440 Thr Tyr Gin Leu Glu Thr Ser Asp Leu Asn Gin Ala Tyr Glu Ser He 115 120 125 130
GCT CCA CGA TTG AAT GGC GAA GTG ATA GAA GAT GGG GTG ATA TGG CCA 488 Ala Pro Arg Leu Asn Gly Glu Val He Glu Asp Gly Val He Trp Pro 135 140 145
GAC ACT TAT CAT TTG CCT TTA GGG GAG GAC GCT TTT AAA ATC ATG CAA 536 Asp Thr Tyr His Leu Pro Leu Gly Glu Asp Ala Phe Lys He Met Gin 150 155 160
ACT TTG ATT GGT CAA TCC ATG AAA AAA CAC GAA GCC TTA AGC AAA CAA 584 Thr Leu He Gly Gin Ser Met Lys Lys His Glu Ala Leu Ser Lys Gin 165 170 175
TGG CTT GGA TAC TAC CAT AAA GAA GAG TGG TTT GAA AAA ATC ATT CTC 632 Trp Leu Gly Tyr Tyr His Lys Glu Glu Trp Phe Glu Lys He He Leu 180 185 190
GCT TCT ATT GTG CAA AAA GAA GCC GCT AAT GTT GAA GAA ATG CCC TTG 680 Ala Ser He Val Gin Lys Glu Ala Ala Asn Val Glu Glu Met Pro Leu 195 200 205 210
ATT GCG AGC GTG ATT TTT AAC CGC TTG AAA AAA GGC ATG CCT TTA CAA 728 He Ala Ser Val He Phe Asn Arg Leu Lys Lys Gly Met Pro Leu Gin 215 220 225
ATG GAT GGG GCT TTG AAT TAT CAG GAA TTT TCA CAC GCT AAA GTA ACC 776 Met Asp Gly Ala Leu Asn Tyr Gin Glu Phe Ser His Ala Lys Val Thr 230 235 240
AAA GAG CGC ATT AAA ACC GAT AAC ACC CCC TAC AAT ACC TAT AAA TTT 824 Lys Glu Arg He Lys Thr Asp Asn Thr Pro Tyr Asn Thr Tyr Lys Phe 245 250 255
AAG GGT TTG CCT AAA AAT CCT GTA GGG AGC GTG AGC CTA GAA GCG ATT 872 Lys Gly Leu Pro Lys Asn Pro Val Gly Ser Val Ser Leu Glu Ala He 260 265 270
AGA GCC GTG ATC TTC CCT AAA AAA ACG GAT TTC TTG TAT TTT GTG AAA 920 Arg Ala Val He Phe Pro Lys Lys Thr Asp Phe Leu Tyr Phe Val Lys 275 280 285 290
ATG CCG GAT AAA AAA CAT GCT TTC AGC GCG ACT TAT AAA GAG CAT TTA 968 Met Pro Asp Lys Lys His Ala Phe Ser Ala Thr Tyr Lys Glu His Leu 295 300 305
AAA AAC ATT AAT CTT TCT AAT AAT CAT TTT TAAGATTAAG GTAAATGGGG CGT 1021 Lys Asn He Asn Leu Ser Asn Asn His Phe 310 315
TTTTTCTTTT GAATTGAGTA AAAAGTGTTT 1051
(2) INFORMATION FOR SEQ ID NO: 122:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 316 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122:
Met Thr Leu Asn Thr Phe Leu Asp Thr Cys Phe Leu Leu Phe He Ser
1 5 10 15
He Leu Phe Tyr Leu Ser He Pro He Tyr Pro Asn Lys Val Val Val
20 25 30
Val Pro Gin Gly Ser Leu Lys Lys Val Phe Phe Ser Leu Lys Glu Gin
35 40 45
Gly Val Asp Met Asn Ala Leu Asp Leu Leu Phe Leu Arg Leu Met Gly
50 55 60
Met Pro Lys Lys Gly Tyr He Asp Met Gly Asp Gly Ala Leu Arg Lys 65 70 75 80
Gly Asp Phe Leu Val Arg Leu He Lys Ala Lys Ala Ala Gin Lys Ser
85 90 95
Ala Thr Leu He Pro Gly Glu Ser Arg Tyr Phe Phe Thr Gin He Leu
100 105 110
Ser Glu Thr Tyr Gin Leu Glu Thr Ser Asp Leu Asn Gin Ala Tyr Glu
115 120 125
Ser He Ala Pro Arg Leu Asn Gly Glu Val He Glu Asp Gly Val He
130 135 140
Trp Pro Asp Thr Tyr His Leu Pro Leu Gly Glu Asp Ala Phe Lys He 145 150 155 160
Met Gin Thr Leu He Gly Gin Ser Met Lys Lys His Glu Ala Leu Ser
165 170 175
Lys Gin Trp Leu Gly Tyr Tyr His Lys Glu Glu Trp Phe Glu Lys He
180 185 190
He Leu Ala Ser He Val Gin Lys Glu Ala Ala Asn Val Glu Glu Met
195 200 205
Pro Leu He Ala Ser Val He Phe Asn Arg Leu Lys Lys Gly Met Pro
210 215 220
Leu Gin Met Asp Gly Ala Leu Asn Tyr Gin Glu Phe Ser His Ala Lys 225 230 235 240
Val Thr Lys Glu Arg He Lys Thr Asp Asn Thr Pro Tyr Asn Thr Tyr
245 250 255
Lys Phe Lys Gly Leu Pro Lys Asn Pro Val Gly Ser Val Ser Leu Glu
260 265 270
Ala He Arg Ala Val He Phe Pro Lys Lys Thr Asp Phe Leu Tyr Phe
275 280 285
Val Lys Met Pro Asp Lys Lys His Ala Phe Ser Ala Thr Tyr Lys Glu
290 295 300
His Leu Lys Asn He Asn Leu Ser Asn Asn His Phe 305 310 315
(2) INFORMATION FOR SEQ ID NO: 123:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 637 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...584 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:123:
GAAAGTTCGG GGGCGGATTC TATGATTAAT GGCTATGGTT ATACCAAAGA ATG AGT 56
Met Ser
1
CAA AAA ATC CTA ATT CTA GGT ATT GGC AAT ATC CTT TTT GGC GAT GAA 104 Gin Lys He Leu He Leu Gly He Gly Asn He Leu Phe Gly Asp Glu 5 10 15
GGG ATT GGG GTG CAT TTA GCC CAC TAC CTC AAA AAA AAT TTT TCT TTT 152 Gly He Gly Val His Leu Ala His Tyr Leu Lys Lys Asn Phe Ser Phe 20 25 30
TTC CCT AGC GTG GAT ATT ATA GAT GGG GGG ACA ATG GCC CAG CAG CTC 200 Phe Pro Ser Val Asp He He Asp Gly Gly Thr Met Ala Gin Gin Leu 35 40 45 50
ATT CCT TTA ATC ACT TCG TAT GAA AAG GTT TTG ATT TTG GAT TGC GTG 248 He Pro Leu He Thr Ser Tyr Glu Lys Val Leu He Leu Asp Cys Val 55 60 65
AGC GCT GAA GGC GTT GAG ATA GGA TCA GTC TAT GCT TTT GAT TTT AAG 296 Ser Ala Glu Gly Val Glu He Gly Ser Val Tyr Ala Phe Asp Phe Lys 70 75 80
GAC GCT CCT AAA GAA ATC ACA TGG GCT GGG AGC GCT CAT GAA GTG GAA 344 Asp Ala Pro Lys Glu He Thr Trp Ala Gly Ser Ala His Glu Val Glu 85 90 95
ATG CTA CAC ACT TTA AGG CTC ACG GAG TTT TTA GGG GAT TTG CCT AAA 392 Met Leu His Thr Leu Arg Leu Thr Glu Phe Leu Gly Asp Leu Pro Lys 100 105 110
ACT TTT ATC GTG GGG CTT GTG CCT TTT GTG ATA GGG AGC GAG ACC ACT 440 Thr Phe He Val Gly Leu Val Pro Phe Val He Gly Ser Glu Thr Thr 115 120 125 130
TTC AAG CTT TCA AGC AAA ATT TTA AAC GCT TTA GAA ACC GCC TTA AAA 488 Phe Lys Leu Ser Ser Lys He Leu Asn Ala Leu Glu Thr Ala Leu Lys 135 140 145
GCC ATA GAA ACC CAA CTC AAC GCA TGG GGG GTT AAA ATG CAA CGC ACC 536 Ala He Glu Thr Gin Leu Asn Ala Trp Gly Val Lys Met Gin Arg Thr 150 155 160 GAT CAT ATC GCT TTA GAA TGT ATC GCT GAA CTT TCT TAT AAG GGT TTT T 585 Asp His He Ala Leu Glu Cys He Ala Glu Leu Ser Tyr Lys Gly Phe 165 170 175
GAATTGGTTT TTGTTTTTCT TTTTAAATGC GTTAATGAAG AAACAAGCCT GA 637
(2) INFORMATION FOR SEQ ID NO: 124:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 178 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124:
Met Ser Gin Lys He Leu He Leu Gly He Gly Asn He Leu Phe Gly
1 5 10 15
Asp Glu Gly He Gly Val His Leu Ala His Tyr Leu Lys Lys Asn Phe
20 25 30
Ser Phe Phe Pro Ser Val Asp He He Asp Gly Gly Thr Met Ala Gin
35 40 45
Gin Leu He Pro Leu He Thr Ser Tyr Glu Lys Val Leu He Leu Asp
50 55 60
Cys Val Ser Ala Glu Gly Val Glu He Gly Ser Val Tyr Ala Phe Asp 65 70 75 80
Phe Lys Asp Ala Pro Lys Glu He Thr Trp Ala Gly Ser Ala His Glu
85 90 95
Val Glu Met Leu His Thr Leu Arg Leu Thr Glu Phe Leu Gly Asp Leu
100 105 110
Pro Lys Thr Phe He Val Gly Leu Val Pro Phe Val He Gly Ser Glu
115 120 125
Thr Thr Phe Lys Leu Ser Ser Lys He Leu Asn Ala Leu Glu Thr Ala
130 135 140
Leu Lys Ala He Glu Thr Gin Leu Asn Ala Trp Gly Val Lys Met Gin 145 150 155 160
Arg Thr Asp His He Ala Leu Glu Cys He Ala Glu Leu Ser Tyr Lys
165 170 175
Gly Phe
(2) INFORMATION FOR SEQ ID NO: 125:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 214 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...161 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:
GCATTGCTAA TTTGGGAATA CTTGTTTATG CCAGTGAAAT AGGAGCGGCT ATG ATG 56
Met Met 1
TGG CGT AGT CTC ARG GTG GCT TTT ACG ATC ACT GAT ATT AGT AAA ACC 104 Trp Arg Ser Leu Xaa Val Ala Phe Thr He Thr Asp He Ser Lys Thr 5 10 15
TTT CAA TCC CAG CCT AAG CAC CAT CAA ATC GGC ACT TTA GAA TTG AAT 152 Phe Gin Ser Gin Pro Lys His His Gin He Gly Thr Leu Glu Leu Asn 20 25 30
TTC GCC TTT TGATTTAATA TCAGTTTAAT ATTTTTCTTC CTATATGATA TTTATATGA 210
Phe Ala Phe
35
TATT 214
(2) INFORMATION FOR SEQ ID NO: 126:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 37 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:
Met Met Trp Arg Ser Leu Xaa Val Ala Phe Thr He Thr Asp He Ser
1 5 10 15
Lys Thr Phe Gin Ser Gin Pro Lys His His Gin He Gly Thr Leu Glu
20 25 30
Leu Asn Phe Ala Phe 35
(2) INFORMATION FOR SEQ ID NO: 127:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1576 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1523 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:
CTGTATTCGC TTCTGTGGAT TACTACCCTC AAAGAAAAGA AAGCCACAGA ATG AAC 56
Met Asn
1
ACC ACC ATC TTA GAA GCT TAT GCG GCT GAG CCA AGC AGG CAA ACC CTC 104 Thr Thr He Leu Glu Ala Tyr Ala Ala Glu Pro Ser Arg Gin Thr Leu 5 10 15
TCT AAA GTC AGC AAC CGA TTC AAA GAG CAT GGC GCT AAA TTT GAT CTT 152 Ser Lys Val Ser Asn Arg Phe Lys Glu His Gly Ala Lys Phe Asp Leu 20 25 30
CGT GTG ATG GCA ACG CAT GGA GGC ACC ATT AGT TGG AAA GCT AAA GAA 200 Arg Val Met Ala Thr His Gly Gly Thr He Ser Trp Lys Ala Lys Glu 35 40 45 50
CTC GCT AGG ACT ATT GTG AGC GGC CCT ATT GGA GGC GTG ATT GGA TCT 248 Leu Ala Arg Thr He Val Ser Gly Pro He Gly Gly Val He Gly Ser 55 60 65
AAA TTG CTA GGC GAA ACG CTT GGT TAT GAC AAT ATT GCA TGC AGT GAT 296 Lys Leu Leu Gly Glu Thr Leu Gly Tyr Asp Asn He Ala Cys Ser Asp 70 75 80
ATT GGK GGC ACG AGC TTT GAT ATG GCG CTT ATC GTT AAG AGC AAT TTT 344 He Xaa Gly Thr Ser Phe Asp Met Ala Leu He Val Lys Ser Asn Phe 85 90 95
AAC ATC GCT TCT GAC CCT GAT ATG GCA CGC CTT GTT TTA TCT CTA CCG 392 Asn He Ala Ser Asp Pro Asp Met Ala Arg Leu Val Leu Ser Leu Pro 100 105 110
CTT GTG GCT ATG GAT TCT GTT GGC GCA GGT GCT GGG AGT TTT GTG CGC 440 Leu Val Ala Met Asp Ser Val Gly Ala Gly Ala Gly Ser Phe Val Arg 115 120 125 130
ATT GAT CCA CAC AGC CGA TCT GTC AAA CTA GGG CCT GAC AGC GCG GGG 488 He Asp Pro His Ser Arg Ser Val Lys Leu Gly Pro Asp Ser Ala Gly 135 140 145
TAT AGA GTT GGC ACT TGT TGG AAA GAC AGC GGG TTA GAC ACG GTT TCA 536 Tyr Arg Val Gly Thr Cys Trp Lys Asp Ser Gly Leu Asp Thr Val Ser 150 155 160 GTA ACC GAT TGC CAT ATT GTT TTA GGC TAT TTG AAC CCG GAT AAT TTC 584 Val Thr Asp Cys His He Val Leu Gly Tyr Leu Asn Pro Asp Asn Phe 165 170 175
TTA GGC GGT TTG ATC AAA TTA GAT GTG GAT AGG GCT AAA AAA CAC ATT 632 Leu Gly Gly Leu He Lys Leu Asp Val Asp Arg Ala Lys Lys His He 180 185 190
AAA GAA CAA ATC GCT GAT CCG CTA GGC ATT AGC GTA GAA GAT GCG GCT 680 Lys Glu Gin He Ala Asp Pro Leu Gly He Ser Val Glu Asp Ala Ala 195 200 205 210
GCT GGT GTG ATT GAA TTG CTT GAT TTG GAG CTT AAA GAA TAC TTG CGA 728 Ala Gly Val He Glu Leu Leu Asp Leu Glu Leu Lys Glu Tyr Leu Arg 215 220 225
TCC AAC ATT AGC GCT AAA GGG TAT AGC CCA TCT GAT TTT GTG TGC TTT 776 Ser Asn He Ser Ala Lys Gly Tyr Ser Pro Ser Asp Phe Val Cys Phe 230 235 240
TCA TAT GGT GGC GCA GGA CCT GTG CAT ACC TAT GGC TAT ACA GAA GGA 824 Ser Tyr Gly Gly Ala Gly Pro Val His Thr Tyr Gly Tyr Thr Glu Gly 245 250 255
TTA GGG TTT AAG GAT GTG GTA GTG CCT GCG TGG GCG GCT GGA TTT AGC 872 Leu Gly Phe Lys Asp Val Val Val Pro Ala Trp Ala Ala Gly Phe Ser 260 265 270
GCT TTT GGT TGT GCT TGC GCT GAT TTT GAA TAC AGA TAC GAC AAG AGC 920 Ala Phe Gly Cys Ala Cys Ala Asp Phe Glu Tyr Arg Tyr Asp Lys Ser 275 280 285 290
GTG GAT ATT GCC ATT CCG CAG TAT TCT TCA GAC AAG TCA AAA ATA GAC 968 Val Asp He Ala He Pro Gin Tyr Ser Ser Asp Lys Ser Lys He Asp 295 300 305
GCA TGC AAA ATC ATT CAA GAC GCA TGG GAT GAA TTG ACT TTG AAA GTG 1016 Ala Cys Lys He He Gin Asp Ala Trp Asp Glu Leu Thr Leu Lys Val 310 315 320
ATT GAA GAG TTC AAG ATC AAT GGA TTT TCT CAA AAA GAT GTG ATC TTA 1064 He Glu Glu Phe Lys He Asn Gly Phe Ser Gin Lys Asp Val He Leu 325 330 335
AGA CCT GGA TAC AGG ATG CAG TAT ATG GGG CAA TTG AAT GAT TTA GAG 1112 Arg Pro Gly Tyr Arg Met Gin Tyr Met Gly Gin Leu Asn Asp Leu Glu 340 345 350
ATC ACT TCT CCT GTG TCA AAA GCT GCA AGC GTG GCT GAT TGG GAA GAG 1160 He Thr Ser Pro Val Ser Lys Ala Ala Ser Val Ala Asp Trp Glu Glu 355 360 365 370
ATT GTC AAA GAA TAT GAA AAA ACC TAC GCT CGC GTT TAT TCT GAA TCA 1208 He Val Lys Glu Tyr Glu Lys Thr Tyr Ala Arg Val Tyr Ser Glu Ser 375 380 385 GCG TGT TCT CCA GAG CTT GGT TTT AGC GTG ACT GGC GTG ATC ATG CGT 1256 Ala Cys Ser Pro Glu Leu Gly Phe Ser Val Thr Gly Val He Met Arg 390 395 400
GGT GTT GTG GCT ACG CAA AAA CCT GTG ATT CCG GTT GAA AAA GAG CAT 1304 Gly Val Val Ala Thr Gin Lys Pro Val He Pro Val Glu Lys Glu His 405 410 415
GGT GCT ACG CCC CCA AAA GAA GCC AAA ATA GGC GTT AGA AAA TTC TAT 1352 Gly Ala Thr Pro Pro Lys Glu Ala Lys He Gly Val Arg Lys Phe Tyr 420 425 430
CGG CAT AAA AAA TGG GTG GAT GCA GAT GTG TGG CAA ATG GAA AAA TTA 1400 Arg His Lys Lys Trp Val Asp Ala Asp Val Trp Gin Met Glu Lys Leu 435 440 445 450
CTG CCT GGA AAT GAA GTC ATA GGA CCT GCG ATC GTG GAA TCA GAT GCG 1448 Leu Pro Gly Asn Glu Val He Gly Pro Ala He Val Glu Ser Asp Ala 455 460 465
ACC ACT TTC GTG ATA CCC AAA GGC TTT GCG ACA AGA CTA GAC AAA CAC 1496 Thr Thr Phe Val He Pro Lys Gly Phe Ala Thr Arg Leu Asp Lys His 470 475 480
CGA TTG TTC CAC TTG AAA GAA ATT AAA TAAAGGAGTT CAAAATGGCA AATTTAT 1550 Arg Leu Phe His Leu Lys Glu He Lys 485 490
TGAAAAACGG CAAAACTTTA AAACAA 1576
(2) INFORMATION FOR SEQ ID NO: 128:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 491 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128:
Met Asn Thr Thr He Leu Glu Ala Tyr Ala Ala Glu Pro Ser Arg Gin
1 5 10 15
Thr Leu Ser Lys Val Ser Asn Arg Phe Lys Glu His Gly Ala Lys Phe
20 25 30
Asp Leu Arg Val Met Ala Thr His Gly Gly Thr He Ser Trp Lys Ala
35 40 45
Lys Glu Leu Ala Arg Thr He Val Ser Gly Pro He Gly Gly Val He
50 55 60
Gly Ser Lys Leu Leu Gly Glu Thr Leu Gly Tyr Asp Asn He Ala Cys 65 70 75 80
Ser Asp He Xaa Gly Thr Ser Phe Asp Met Ala Leu He Val Lys Ser 85 90 95 Asn Phe Asn He Ala Ser Asp Pro Asp Met Ala Arg Leu Val Leu Ser
100 105 110
Leu Pro Leu Val Ala Met Asp Ser Val Gly Ala Gly Ala Gly Ser Phe
115 120 125
Val Arg He Asp Pro His Ser Arg Ser Val Lys Leu Gly Pro Asp Ser
130 135 140
Ala Gly Tyr Arg Val Gly Thr Cys Trp Lys Asp Ser Gly Leu Asp Thr 145 150 155 160
Val Ser Val Thr Asp Cys His He Val Leu Gly Tyr Leu Asn Pro Asp
165 170 175
Asn Phe Leu Gly Gly Leu He Lys Leu Asp Val Asp Arg Ala Lys Lys
180 185 190
His He Lys Glu Gin He Ala Asp Pro Leu Gly He Ser Val Glu Asp
195 200 205
Ala Ala Ala Gly Val He Glu Leu Leu Asp Leu Glu Leu Lys Glu Tyr
210 215 220
Leu Arg Ser Asn He Ser Ala Lys Gly Tyr Ser Pro Ser Asp Phe Val 225 230 235 240
Cys Phe Ser Tyr Gly Gly Ala Gly Pro Val His Thr Tyr Gly Tyr Thr
245 250 255
Glu Gly Leu Gly Phe Lys Asp Val Val Val Pro Ala Trp Ala Ala Gly
260 265 270
Phe Ser Ala Phe Gly Cys Ala Cys Ala Asp Phe Glu Tyr Arg Tyr Asp
275 280 285
Lys Ser Val Asp He Ala He Pro Gin Tyr Ser Ser Asp Lys Ser Lys
290 295 300
He Asp Ala Cys Lys He He Gin Asp Ala Trp Asp Glu Leu Thr Leu 305 310 315 320
Lys Val He Glu Glu Phe Lys He Asn Gly Phe Ser Gin Lys Asp Val
325 330 335
He Leu Arg Pro Gly Tyr Arg Met Gin Tyr Met Gly Gin Leu Asn Asp
340 345 350
Leu Glu He Thr Ser Pro Val Ser Lys Ala Ala Ser Val Ala Asp Trp
355 360 365
Glu Glu He Val Lys Glu Tyr Glu Lys Thr Tyr Ala Arg Val Tyr Ser
370 375 380
Glu Ser Ala Cys Ser Pro Glu Leu Gly Phe Ser Val Thr Gly Val He 385 390 395 400
Met Arg Gly Val Val Ala Thr Gin Lys Pro Val He Pro Val Glu Lys
405 410 415
Glu His Gly Ala Thr Pro Pro Lys Glu Ala Lys He Gly Val Arg Lys
420 425 430
Phe Tyr Arg His Lys Lys Trp Val Asp Ala Asp Val Trp Gin Met Glu
435 440 445
Lys Leu Leu Pro Gly Asn Glu Val He Gly Pro Ala He Val Glu Ser
450 455 460
Asp Ala Thr Thr Phe Val He Pro Lys Gly Phe Ala Thr Arg Leu Asp 465 470 475 480
Lys His Arg Leu Phe His Leu Lys Glu He Lys 485 490
(2) INFORMATION FOR SEQ ID NO: 129:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 303 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...261 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:
GTAAGCTAGA GTTTATGCAA AGGGAGATGA GTAGCCTAGA AGCTAAGCAT T ATG ATT 57
Met He
1
CAG TTA AAA TCA AAT TTG GAT TGG TAC GCA GAT TAT TTG AAT TTT TTA 105 Gin Leu Lys Ser Asn Leu Asp Trp Tyr Ala Asp Tyr Leu Asn Phe Leu 5 10 15
GAT CGC TTT GGG GAA AAA ATG GAA GAA TCC AAA GAG CGA AAA CAA CTC 153 Asp Arg Phe Gly Glu Lys Met Glu Glu Ser Lys Glu Arg Lys Gin Leu 20 25 30
CTG ATC GCT TCC CTT GCA CCT CTT GCG GGC TTT GCT GCA AGA ATA TCG 201 Leu He Ala Ser Leu Ala Pro Leu Ala Gly Phe Ala Ala Arg He Ser 35 40 45 50
CCG GGA TTA TTG AGC TTA TTG GGT TTG ATG CTG GCA ATG GGG TGT GCA 249 Pro Gly Leu Leu Ser Leu Leu Gly Leu Met Leu Ala Met Gly Cys Ala 55 60 65
AAT TTT TGG ATT TAGAAACCAA TCTGTGCAAG ATTTATGAAT CGCGCCCGTT AA 303 Asn Phe Trp He 70
(2) INFORMATION FOR SEQ ID NO: 130:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 70 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:
Met He Gin Leu Lys Ser Asn Leu Asp Trp Tyr Ala Asp Tyr Leu Asn 1 5 10 15
Phe Leu Asp Arg Phe Gly Glu Lys Met Glu Glu Ser Lys Glu Arg Lys
20 25 30
Gin Leu Leu He Ala Ser Leu Ala Pro Leu Ala Gly Phe Ala Ala Arg
35 40 45
He Ser Pro Gly Leu Leu Ser Leu Leu Gly Leu Met Leu Ala Met Gly
50 55 60
Cys Ala Asn Phe Trp He 65 70
(2) INFORMATION FOR SEQ ID NO: 131:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 826 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...773 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131:
TGGCTTAATT GTTAAGCCGG CTAGAAAAAG AGCGTTATTT GCGCCATATC ATG CTA 56
Met Leu 1
GAA GAT GTG GGC GAA GAG GGT CAA TTG AAG CTT TTA AAA TCT AGC GTT 104 Glu Asp Val Gly Glu Glu Gly Gin Leu Lys Leu Leu Lys Ser Ser Val 5 10 15
TTA GTC ATT GGG GCT GGG GGT CTT GGA TCG GCG GTT TTG ATG TAT TTG 152 Leu Val He Gly Ala Gly Gly Leu Gly Ser Ala Val Leu Met Tyr Leu 20 25 30
TGT GCC GCT GGG ATA GGA AAA ATC GGT ATT GTA GAT TTT GAT GTA GTA 200 Cys Ala Ala Gly He Gly Lys He Gly He Val Asp Phe Asp Val Val 35 40 45 50
GAT ATG AGT AAT TTG CAA CGC CAA ATC ATC CAT TCA CAG GAT TTT TTA 248 Asp Met Ser Asn Leu Gin Arg Gin He He His Ser Gin Asp Phe Leu 55 60 65
AAC CAA TCT AAA GCC TCT AGC GCG AAA GCG CGC TTA AAA CAA CTC AAT 296 Asn Gin Ser Lys Ala Ser Ser Ala Lys Ala Arg Leu Lys Gin Leu Asn 70 75 80
GCG GGT ATT GAA ATA GAG GCT TTT GAA GAA CGC TTT AAG GCT CAT AAC 344 Ala Gly He Glu He Glu Ala Phe Glu Glu Arg Phe Lys Ala His Asn 85 90 95
GCT CTT TCT CTC ATA GAG CCT TAT GAT TTT ATC ATA GAC GCC ACG GAC 392 Ala Leu Ser Leu He Glu Pro Tyr Asp Phe He He Asp Ala Thr Asp 100 105 110
AAT TTT AAC GCT AAA TTT TTG ATC AAT GAC GCT TGC GTG TTA GCC CAA 440 Asn Phe Asn Ala Lys Phe Leu He Asn Asp Ala Cys Val Leu Ala Gin 115 120 125 130
AAA CCC TAT TCG CAT GCC GGG GTT TTA GAA TAC AGG GGG CAA AGC ATG 488 Lys Pro Tyr Ser His Ala Gly Val Leu Glu Tyr Arg Gly Gin Ser Met 135 140 145
AGC GTT TTA CCC CAT AGC GCA TGC TTA GCG TGC GTT TTT GAT AAG CCC 536 Ser Val Leu Pro His Ser Ala Cys Leu Ala Cys Val Phe Asp Lys Pro 150 155 160
CCT AAA AAG GGA TTA AAT CCC ATT TCA GGG CTT TTT GGG GTC TTA CCC 584 Pro Lys Lys Gly Leu Asn Pro He Ser Gly Leu Phe Gly Val Leu Pro 165 170 175
GGA GTT TTA GGG TGT ATC CAA GCG AGC GAA TGC CTT AAA TAT TTT TTA 632 Gly Val Leu Gly Cys He Gin Ala Ser Glu Cys Leu Lys Tyr Phe Leu 180 185 190
GGG TTT GAA ACT TTA CTT ATA AAT ACT TTA CTT ATA GCC GAT ATT AAA 680 Gly Phe Glu Thr Leu Leu He Asn Thr Leu Leu He Ala Asp He Lys 195 200 205 210
ACG ATG GAT TTT AAA AAA ATT CAA GCA CCC AAA AAC CCT GAA TGT AGG 728 Thr Met Asp Phe Lys Lys He Gin Ala Pro Lys Asn Pro Glu Cys Arg 215 220 225
GTT TGT GGC ACG CAT AAA ATC ACG CAT TTA CAG GAT TAT GAA ATT TAGAT 778 Val Cys Gly Thr His Lys He Thr His Leu Gin Asp Tyr Glu He 230 235 240
TAAGGGGTAA GTTTTGGATT TATCAACCAT ATTAGGCTTG GTATTGGC 826
(2) INFORMATION FOR SEQ ID NO: 132:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 241 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132:
Met Leu Glu Asp Val Gly Glu Glu Gly Gin Leu Lys Leu Leu Lys Ser 1 5 10 15 Ser Val Leu Val He Gly Ala Gly Gly Leu Gly Ser Ala Val Leu Met
20 25 30
Tyr Leu Cys Ala Ala Gly He Gly Lys He Gly He Val Asp Phe Asp
35 40 45
Val Val Asp Met Ser Asn Leu Gin Arg Gin He He His Ser Gin Asp
50 55 60
Phe Leu Asn Gin Ser Lys Ala Ser Ser Ala Lys Ala Arg Leu Lys Gin 65 70 75 80
Leu Asn Ala Gly He Glu He Glu Ala Phe Glu Glu Arg Phe Lys Ala
85 90 95
His Asn Ala Leu Ser Leu He Glu Pro Tyr Asp Phe He He Asp Ala
100 105 110
Thr Asp Asn Phe Asn Ala Lys Phe Leu He Asn Asp Ala Cys Val Leu
115 120 125
Ala Gin Lys Pro Tyr Ser His Ala Gly Val Leu Glu Tyr Arg Gly Gin
130 135 140
Ser Met Ser Val Leu Pro His Ser Ala Cys Leu Ala Cys Val Phe Asp 145 150 155 160
Lys Pro Pro Lys Lys Gly Leu Asn Pro He Ser Gly Leu Phe Gly Val
165 170 175
Leu Pro Gly Val Leu Gly Cys He Gin Ala Ser Glu Cys Leu Lys Tyr
180 185 190
Phe Leu Gly Phe Glu Thr Leu Leu He Asn Thr Leu Leu He Ala Asp
195 200 205
He Lys Thr Met Asp Phe Lys Lys He Gin Ala Pro Lys Asn Pro Glu
210 215 220
Cys Arg Val Cys Gly Thr His Lys He Thr His Leu Gin Asp Tyr Glu 225 230 235 240
He
(2) INFORMATION FOR SEQ ID NO: 133:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 547 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...494 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:133:
AAAATTCATT CTATTTTAGA TAATGAGTTC AATCCCCACA AACAGCAAGA ATG AAT 56
Met Asn
1 CGC ATG AAT AAA AAT TAT CTT TTA ATC TTT TTG TTG TTA GCG AGT CTT 104 Arg Met Asn Lys Asn Tyr Leu Leu He Phe Leu Leu Leu Ala Ser Leu 5 10 15
GTT GCT AGA GAG AAG GAC GCT TCT TCA AAC CTT TTT GAT TTG ATT GAT 152 Val Ala Arg Glu Lys Asp Ala Ser Ser Asn Leu Phe Asp Leu He Asp 20 25 30
AAG GGG ATC AAC AGA GAA CAA GAA TTA AAA GAG CAG GAG CAA AAA ACG 200 Lys Gly He Asn Arg Glu Gin Glu Leu Lys Glu Gin Glu Gin Lys Thr 35 40 45 50
CGC TTA AAA CTG GCT CAA AGC CCT TTA GTA GCG TTA GAG ATT GTC CCC 248 Arg Leu Lys Leu Ala Gin Ser Pro Leu Val Ala Leu Glu He Val Pro 55 60 65
CAA GAA ACG CCC TAT TTA GAA TGG CAA GGG GCT AGG GAG TCG TAT TAT 296 Gin Glu Thr Pro Tyr Leu Glu Trp Gin Gly Ala Arg Glu Ser Tyr Tyr 70 75 80
TTA AAG GTG AGC GCT GTA GTG GAG AGC GTG GTT ATC TTA AAA ATT GAC 344 Leu Lys Val Ser Ala Val Val Glu Ser Val Val He Leu Lys He Asp 85 90 95
ATC AAT CAA GGG CGT TCT TGC TCG CTC TAC CCC ACG CCT AAA AGC GTT 392 He Asn Gin Gly Arg Ser Cys Ser Leu Tyr Pro Thr Pro Lys Ser Val 100 105 ,110
TCT TTA GTG AGG AAT CAA AGC GTA GCC TAT GAA ATT TTA TGC GAA AAC 440 Ser Leu Val Arg Asn Gin Ser Val Ala Tyr Glu He Leu Cys Glu Asn 115 120 125 130
CAA CCC CTA TGG ATA GAA GTA AGC ACC AAT TTA GGC AAA CGC ACC TTT 488 Gin Pro Leu Trp He Glu Val Ser Thr Asn Leu Gly Lys Arg Thr Phe 135 140 145
CAG TTT TAACCTGCAA CCAACATTAA AGAATGCCTT TAGCATTTTA AAACCCCTTT AT 546 Gin Phe
C 547
(2) INFORMATION FOR SEQ ID NO: 134:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 148 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134:
Met Asn Arg Met Asn Lys Asn Tyr Leu Leu He Phe Leu Leu Leu Ala 1 5 10 15
Ser Leu Val Ala Arg Glu Lys Asp Ala Ser Ser Asn Leu Phe Asp Leu
20 25 30
He Asp Lys Gly He Asn Arg Glu Gin Glu Leu Lys Glu Gin Glu Gin
35 40 45
Lys Thr Arg Leu Lys Leu Ala Gin Ser Pro Leu Val Ala Leu Glu He
50 55 60
Val Pro Gin Glu Thr Pro Tyr Leu Glu Trp Gin Gly Ala Arg Glu Ser 65 70 75 80
Tyr Tyr Leu Lys Val Ser Ala Val Val Glu Ser Val Val He Leu Lys
85 90 95
He Asp He Asn Gin Gly Arg Ser Cys Ser Leu Tyr Pro Thr Pro Lys
100 105 110
Ser Val Ser Leu Val Arg Asn Gin Ser Val Ala Tyr Glu He Leu Cys
115 120 125
Glu Asn Gin Pro Leu Trp He Glu Val Ser Thr Asn Leu Gly Lys Arg
130 135 140
Thr Phe Gin Phe 145
(2) INFORMATION FOR SEQ ID NO: 135:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1684 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA ( ix) FEATURE :
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1631 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:
CCTTTCTTTA TTCTCATTGC ATCTTGTGGG ATCATTTTTT TTTTTTATTA ATG CTA 56
Met Leu 1
GCT TCC ATC ATC TCA ATT TTA AGG GTT TTT GTT TTG TTA TTC AAC ACG 104 Ala Ser He He Ser He Leu Arg Val Phe Val Leu Leu Phe Asn Thr 5 10 15
CCG TTA TTC ATC TTT GCT TTT TTG CCT GTT GGT TTT TTA GGG TAT TTT 152 Pro Leu Phe He Phe Ala Phe Leu Pro Val Gly Phe Leu Gly Tyr Phe 20 25 30
ATC TTG CAA GCT TAT GCT AAA AAT CCC CTG TTC CCT AAA CTA TGG CTA 200 He Leu Gin Ala Tyr Ala Lys Asn Pro Leu Phe Pro Lys Leu Trp Leu 35 40 45 50 GTA TTG GCT AGT TTG TTT TTT TAT GCT TTT TGG AAT GTG AAG TAT TTG 248 Val Leu Ala Ser Leu Phe Phe Tyr Ala Phe Trp Asn Val Lys Tyr Leu 55 60 65
CCC TTA TTG GTT GGC TCT ATT GTT TTT AAT TAT TTT GTG GCT TTG AAA 296 Pro Leu Leu Val Gly Ser He Val Phe Asn Tyr Phe Val Ala Leu Lys 70 75 80
ATC CAT CAA ACC CAG CCA AAT GCA TAT AAA AGA TTA TGG CTT ATT TTG 344 He His Gin Thr Gin Pro Asn Ala Tyr Lys Arg Leu Trp Leu He Leu 85 90 95
GGC TTG ATC GCT AAT GTT TCA CTT TTA GGA TTT TTC AAA TAC ACT GAT 392 Gly Leu He Ala Asn Val Ser Leu Leu Gly Phe Phe Lys Tyr Thr Asp 100 105 110
TTT TTC TTA ACC AAT TTC AAT CTA ATA TGG AAG AGC CAT TTT GAA ACC 440 Phe Phe Leu Thr Asn Phe Asn Leu He Trp Lys Ser His Phe Glu Thr 115 120 125 130
TTG CAT TTA ATC TTG CCT TTA GCG ATC AGC TTT TTC ACT TTG CAA CAA 488 Leu His Leu He Leu Pro Leu Ala He Ser Phe Phe Thr Leu Gin Gin 135 140 145
ATC GCT TAC TTG ATG GAC ACT TAT AAG CAA AAT CAA ATC ATG CAG CCC 536 He Ala Tyr Leu Met Asp Thr Tyr Lys Gin Asn Gin He Met Gin Pro 150 155 160
AAA ATG AGA GAG AGA GTG AGT GAA AAC GCT CCT ATT TTA TTA AAT CCT 584 Lys Met Arg Glu Arg Val Ser Glu Asn Ala Pro He Leu Leu Asn Pro 165 170 175
CCC ACT TCA TTT TTT TCA CTT TCG CAT TTT TTA GAT TAC GCT TTA TTT 632 Pro Thr Ser Phe Phe Ser Leu Ser His Phe Leu Asp Tyr Ala Leu Phe 180 185 190
GTG AGT TTC TTC CCT CAA CTC ATT GCA GGG CCT ATT GTG CAT CAT AGC 680 Val Ser Phe Phe Pro Gin Leu He Ala Gly Pro He Val His His Ser 195 200 205 210
GAG ATG ATG CCT CAA TTT AAA GAT AAA AAC AAT CAA TAT TTG AAT TAC 728 Glu Met Met Pro Gin Phe Lys Asp Lys Asn Asn Gin Tyr Leu Asn Tyr 215 220 225
AGA AAT ATC GCT TTA GGC TTG TTT ATC TTT TCT ATC GGT TTG TTT AAA 776 Arg Asn He Ala Leu Gly Leu Phe He Phe Ser He Gly Leu Phe Lys 230 235 240
AAG GTC GTG ATT GCA GAT AAT ACC GCT CAT TTT GCT GAT TTT GGA TTT 824 Lys Val Val He Ala Asp Asn Thr Ala His Phe Ala Asp Phe Gly Phe 245 250 255
GAT AAG GCG ACT AGC TTA AGT TTT ATT CAA GCA TGG ATG ACT TCT TTA 872 Asp Lys Ala Thr Ser Leu Ser Phe He Gin Ala Trp Met Thr Ser Leu 260 265 270 TCT TAT TCG TTC CAG CTG TAT TTT GAT TTT AGC GGT TAT TGC GAT ATG 920 Ser Tyr Ser Phe Gin Leu Tyr Phe Asp Phe Ser Gly Tyr Cys Asp Met 275 280 285 290
GCT ATA GGC ATT GGC CTC TTT TTT AAC ATC AAA CTC CCT ATC AAT TTT 968 Ala He Gly He Gly Leu Phe Phe Asn He Lys Leu Pro He Asn Phe 295 300 305
AAT AGC CCC TAT AAG GCT TTG AAT ATC CAA GAT TTT TGG AGG AGG TGG 1016 Asn Ser Pro Tyr Lys Ala Leu Asn He Gin Asp Phe Trp Arg Arg Trp 310 315 320
CAT ATC ACT TTG AGC CGC TTC TTA AAA GAG TAT TTG TAT ATC CCT TTA 1064 His He Thr Leu Ser Arg Phe Leu Lys Glu Tyr Leu Tyr He Pro Leu 325 330 335
GGG GGT AAT AGG GTG AAA GAA TTA ATC GTG TAT AGG AAT TTA ATT TTA 1112 Gly Gly Asn Arg Val Lys Glu Leu He Val Tyr Arg Asn Leu He Leu 340 345 350
GTG TTT TTG ATT GGG GGG TTT TGG CAT GGG GCT GGT TGG ACT TTT ATC 1160 Val Phe Leu He Gly Gly Phe Trp His Gly Ala Gly Trp Thr Phe He 355 360 365 370
ATT TGG GGG CTA TTG CAT GGG ATT GCT TTG AGC GTT CAT AGA GCG TAT 1208 He Trp Gly Leu Leu His Gly He Ala Leu Ser Val His Arg Ala Tyr 375 380 385
TCT CAT GCC ACT AGA AAA TTC CAT TTC ACT ATG CCA AAG ATT TTA GCA 1256 Ser His Ala Thr Arg Lys Phe His Phe Thr Met Pro Lys He Leu Ala 390 395 400
TGG CTC ATC ACT TTT AAT TTT ATC AAT CTC GCA TGG GTG TTT TTT AGA 1304 Trp Leu He Thr Phe Asn Phe He Asn Leu Ala Trp Val Phe Phe Arg 405 410 415
GCC AAA AAT TTA GAA AGC GCT TTG AAG GTT TTA AAG GGG ATG GTT GGT 1352 Ala Lys Asn Leu Glu Ser Ala Leu Lys Val Leu Lys Gly Met Val Gly 420 425 430
TTG AAT GGT GTT TCG CTT TGT CAT CTT TCA AAA GAG GCA TCA GAG TTT 1400 Leu Asn Gly Val Ser Leu Cys His Leu Ser Lys Glu Ala Ser Glu Phe 435 440 445 450
TTA AAT CGT GTC AAT GAT AAC ATG ATC ATG CAC ACC ATA ATG TAT GCA 1448 Leu Asn Arg Val Asn Asp Asn Met He Met His Thr He Met Tyr Ala 455 460 465
TCC CCC ACA TTT AAA ATG TGT GTT TTG ATG ATA ATC ATC TCT TTT TGT 1496 Ser Pro Thr Phe Lys Met Cys Val Leu Met He He He Ser Phe Cys 470 475 480
TTA AAA AAT AGT TCC CAT TTA TAC CAA TCC AAT CAA ATG GAT TGG ATT 1544 Leu Lys Asn Ser Ser His Leu Tyr Gin Ser Asn Gin Met Asp Trp He 485 490 495 AAA ACA ACA AGC GCT TGT TTG TTG CTC TCT ATA GGT TTT TTA TTT ATT 1592 Lys Thr Thr Ser Ala Cys Leu Leu Leu Ser He Gly Phe Leu Phe He 500 505 510
TTT GCC AGT TCT CAA TCG GTA TTT TTG TAT TTT AAT TTT TAGGACACTG CT 1643 Phe Ala Ser Ser Gin Ser Val Phe Leu Tyr Phe Asn Phe 515 520 525
ATGGAATTTT ATAAAAAACA AACTTTAATC ATTGTTTCTT T 1684
(2) INFORMATION FOR SEQ ID NO-.136:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 527 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136:
Met Leu Ala Ser He He Ser He Leu Arg Val Phe Val Leu Leu Phe
1 5 10 15
Asn Thr Pro Leu Phe He Phe Ala Phe Leu Pro Val Gly Phe Leu Gly
20 25 30
Tyr Phe He Leu Gin Ala Tyr Ala Lys Asn Pro Leu Phe Pro Lys Leu
35 40 45
Trp Leu Val Leu Ala Ser Leu Phe Phe Tyr Ala Phe Trp Asn Val Lys
50 55 60
Tyr Leu Pro Leu Leu Val Gly Ser He Val Phe Asn Tyr Phe Val Ala 65 70 75 80
Leu Lys He His Gin Thr Gin Pro Asn Ala Tyr Lys Arg Leu Trp Leu
85 90 95
He Leu Gly Leu He Ala Asn Val Ser Leu Leu Gly Phe Phe Lys Tyr
100 105 110
Thr Asp Phe Phe Leu Thr Asn Phe Asn Leu He Trp Lys Ser His Phe
115 120 125
Glu Thr Leu His Leu He Leu Pro Leu Ala He Ser Phe Phe Thr Leu
130 135 140
Gin Gin He Ala Tyr Leu Met Asp Thr Tyr Lys Gin Asn Gin He Met 145 150 155 160
Gin Pro Lys Met Arg Glu Arg Val Ser Glu Asn Ala Pro He Leu Leu
165 • 170 175
Asn Pro Pro Thr Ser Phe Phe Ser Leu Ser His Phe Leu Asp Tyr Ala
180 185 190
Leu Phe Val Ser Phe Phe Pro Gin Leu He Ala Gly Pro He Val His
195 200 205
His Ser Glu Met Met Pro Gin Phe Lys Asp Lys Asn Asn Gin Tyr Leu
210 215 220
Asn Tyr Arg Asn He Ala Leu Gly Leu Phe He Phe Ser He Gly Leu 225 230 235 240
Phe Lys Lys Val Val He Ala Asp Asn Thr Ala His Phe Ala Asp Phe 245 250 255 Gly Phe Asp Lys Ala Thr Ser Leu Ser Phe He Gin Ala Trp Met Thr
260 265 270
Ser Leu Ser Tyr Ser Phe Gin Leu Tyr Phe Asp Phe Ser Gly Tyr Cys
275 280 285
Asp Met Ala He Gly He Gly Leu Phe Phe Asn He Lys Leu Pro He
290 295 300
Asn Phe Asn Ser Pro Tyr Lys Ala Leu Asn He Gin Asp Phe Trp Arg 305 310 315 320
Arg Trp His He Thr Leu Ser Arg Phe Leu Lys Glu Tyr Leu Tyr He
325 330 335
Pro Leu Gly Gly Asn Arg Val Lys Glu Leu He Val Tyr Arg Asn Leu
340 345 350
He Leu Val Phe Leu He Gly Gly Phe Trp His Gly Ala Gly Trp Thr
355 360 365
Phe He He Trp Gly Leu Leu His Gly He Ala Leu Ser Val His Arg
370 375 380
Ala Tyr Ser His Ala Thr Arg Lys Phe His Phe Thr Met Pro Lys He 385 390 395 400
Leu Ala Trp Leu He Thr Phe Asn Phe He Asn Leu Ala Trp Val Phe
405 410 415
Phe Arg Ala Lys Asn Leu Glu Ser Ala Leu Lys Val Leu Lys Gly Met
420 425 430
Val Gly Leu Asn Gly Val Ser Leu Cys His Leu Ser Lys Glu Ala Ser
435 440 445
Glu Phe Leu Asn Arg Val Asn Asp Asn Met He Met His Thr He Met
450 455 460
Tyr Ala Ser Pro Thr Phe Lys Met Cys Val Leu Met He He He Ser 465 470 475 480
Phe Cys Leu Lys Asn Ser Ser His Leu Tyr Gin Ser Asn Gin Met Asp
485 490 495
Trp He Lys Thr Thr Ser Ala Cys Leu Leu Leu Ser He Gly Phe Leu
500 505 510
Phe He Phe Ala Ser Ser Gin Ser Val Phe Leu Tyr Phe Asn Phe 515 520 525
(2) INFORMATION FOR SEQ ID NO: 137:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3973 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic RNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...3920 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: AAAGTCGCAC CCTTTGTGCA AAAATCGTTT TACAAGAAGA AAGGAAAAAA ATG GAA 56
Met Glu 1
ATA CAA CAA ACA CAC CGC AAA ATC AAT CGC CCT TTG GTT TCT CTC GCT 104 He Gin Gin Thr His Arg Lys He Asn Arg Pro Leu Val Ser Leu Ala 5 10 15
TTA GTA GGA GCG TTA GTC AGC ATC ACA CCG CAA CAA AGT CAT GCC GCC 152 Leu Val Gly Ala Leu Val Ser He Thr Pro Gin Gin Ser His Ala Ala 20 25 30
TTT TTC ACA ACC GTG ATC ATT CCA GCC ATT GTT GGG GGG ATT GCT ACA 200 Phe Phe Thr Thr Val He He Pro Ala He Val Gly Gly He Ala Thr 35 40 45 50
GGC GCT GCT GTA GGA ACG GTC TCA GGG CTT CTT GGC TGG GGG CTA AAA 248 Gly Ala Ala Val Gly Thr Val Ser Gly Leu Leu Gly Trp Gly Leu Lys 55 60 65
CAA GCC GAA GAA GCC AAT AAA ACC CCA GAT AAA CCC GAT AAA GTT TGG 296 Gin Ala Glu Glu Ala Asn Lys Thr Pro Asp Lys Pro Asp Lys Val Trp 70 75 80
CGC ATT CAA GCA GGA AAA GGC TTT AAT GAA TTC CCT AAC AAG GAA TAC 344 Arg He Gin Ala Gly Lys Gly Phe Asn Glu Phe Pro Asn Lys Glu Tyr 85 90 95
GAC TTA TAC AGA TCC CTA CTA TCT AGT AAG ATT GAT GGA GGC TGG GAT 392 Asp Leu Tyr Arg Ser Leu Leu Ser Ser Lys He Asp Gly Gly Trp Asp 100 105 110
TGG GGG AAT GCC GCT ACG CAT TAT TGG GTC AAA GGC GGG CAA TGG AAC 440 Trp Gly Asn Ala Ala Thr His Tyr Trp Val Lys Gly Gly Gin Trp Asn 115 120 125 130
AAG CTT GAA GTG GAT ATG AAA GAC GCT GTA GGG ACT TAT AAT CTC TCA 488 Lys Leu Glu Val Asp Met Lys Asp Ala Val Gly Thr Tyr Asn Leu Ser 135 140 145
GGG CTA AGA AAC TTT ACT GGT GGG GAT TTA GAT GTC AAT ATG CAA AAA 536 Gly Leu Arg Asn Phe Thr Gly Gly Asp Leu Asp Val Asn Met Gin Lys 150 155 160
GCC ACT TTG CGC TTG GGC CAA TTC AAT GGC AAT TCT TTC ACA AGC TAT 584 Ala Thr Leu Arg Leu Gly Gin Phe Asn Gly Asn Ser Phe Thr Ser Tyr 165 170 175
AAG GAT AGC GCT GAT CGC ACC ACG AGA GTG GAT TTC AAC GCT AAA AAT 632 Lys Asp Ser Ala Asp Arg Thr Thr Arg Val Asp Phe Asn Ala Lys Asn 180 185 190
ATC TTA ATT GAT AAT TTT TTA GAA ATC AAT AAT CGT GTG GGT TCT GGA 680 He Leu He Asp Asn Phe Leu Glu He Asn Asn Arg Val Gly Ser Gly 195 200 205 210 GCC GGG AGG AAA GCC AGC TCT ACG GTT TTA ACT TTG CAA GCT TCA GAA 728 Ala Gly Arg Lys Ala Ser Ser Thr Val Leu Thr Leu Gin Ala Ser Glu 215 220 225
GGG ATT ACT AGC AGT AAA AAT GCG GAA ATT TCT CTT TAT GAT GGC GCC 776 Gly He Thr Ser Ser Lys Asn Ala Glu He Ser Leu Tyr Asp Gly Ala 230 235 240
ACG CTC AAT TTG GCT TCA AAC AGC GTT AAA TTA ATG GGT AAT GTG TGG 824 Thr Leu Asn Leu Ala Ser Asn Ser Val Lys Leu Met Gly Asn Val Trp 245 250 255
ATG GGC CGT TTG CAA TAT GTG GGA GCG TAT TTG GCC CCT TCA TAC AGC 872 Met Gly Arg Leu Gin Tyr Val Gly Ala Tyr Leu Ala Pro Ser Tyr Ser 260 265 270
ACG ATA AAC ACT TCA AAA GTG ACA GGG GAA GTG AAT TTT AAC CAT CTC 920 Thr He Asn Thr Ser Lys Val Thr Gly Glu Val Asn Phe Asn His Leu 275 280 285 290
ACT GTG GGC GAT CAC AAC GCC GCT CAA GCA GGC ATT ATC GCT AGT AAC 968 Thr Val Gly Asp His Asn Ala Ala Gin Ala Gly He He Ala Ser Asn 295 300 305
AAG ACT CAT ATT GGC ACA CTG GAT TTG TGG CAA AGC GCG GGA CTA AAC 1016 Lys Thr His He Gly Thr Leu Asp Leu Trp Gin Ser Ala Gly Leu Asn 310 315 320
ATT ATC GCC CCT CCA GAA GGC GGT TAT AAG GAT AAA CCT AAG GAT AAA 1064 He He Ala Pro Pro Glu Gly Gly Tyr Lys Asp Lys Pro Lys Asp Lys 325 330 335
CCT AGT AAC ACC ACG CAA AAT AAT GCT AAC AAC AAC CAA CAA AAC AGC 1112 Pro Ser Asn Thr Thr Gin Asn Asn Ala Asn Asn Asn Gin Gin Asn Ser 340 345 350
GCT CAA AAC AAT AGT AAC ACT CAG GTT ATT AAC CCA CCC AAT AGC GCG 1160 Ala Gin Asn Asn Ser Asn Thr Gin Val He Asn Pro Pro Asn Ser Ala 355 360 365 370
CAA AAA ACA GAA ATT CAA CCC ACG CAA GTC ATT GAT GGG CCT TTT GCT 1208 Gin Lys Thr Glu He Gin Pro Thr Gin Val He Asp Gly Pro Phe Ala 375 380 385
GGT GGC AAA GAC ACG GTT GTC AAT ATT GAT CGC ATC AAC ACT AAC GCT 1256 Gly Gly Lys Asp Thr Val Val Asn He Asp Arg He Asn Thr Asn Ala 390 395 400
GAT GGC ACG ATT AAA GTG GGA GGG TAT AAA GCT TCT CTT ACC ACC AAT 1304 Asp Gly Thr He Lys Val Gly Gly Tyr Lys Ala Ser Leu Thr Thr Asn 405 410 415
GCG GCT CAT TTG CAT ATC GGC AAA GGC GGT ATC AAT CTG TCC AAT CAA 1352 Ala Ala His Leu His He Gly Lys Gly Gly He Asn Leu Ser Asn Gin 420 425 430 GCG AGC GGG CGC ACC CTT TTA GTG GAA AAT CTA ACC GGG AAT ATC ACC 1400 Ala Ser Gly Arg Thr Leu Leu Val Glu Asn Leu Thr Gly Asn He Thr 435 440 445 450
GTT GAT GGG CCT TTA AGA GTG AAT AAT CAA GTG GGT GGT TAT GCT TTG 1448 Val Asp Gly Pro Leu Arg Val Asn Asn Gin Val Gly Gly Tyr Ala Leu 455 460 465
GCA GGA TCA AGC GCG AAT TTT GAG TTT AAG GCT GGT ACG GAT ACC AAA 1496 Ala Gly Ser Ser Ala Asn Phe Glu Phe Lys Ala Gly Thr Asp Thr Lys 470 475 480
AAC GGC ACA GCC ACT TTT AAT AAC GAT ATT AGT TTG GGA AGA TTT GTG 1544 Asn Gly Thr Ala Thr Phe Asn Asn Asp He Ser Leu Gly Arg Phe Val 485 490 495
AAT TTA AAA GTG GAT GCT CAT ACA GCT AAT TTT AAA GGT ATT GAT ACT 1592 Asn Leu Lys Val Asp Ala His Thr Ala Asn Phe Lys Gly He Asp Thr 500 505 510
GGT AAT GGT GGT TTC AAC ACC TTA GAT TTT AGT GGC GTT ACA GGT AAG 1640 Gly Asn Gly Gly Phe Asn Thr Leu Asp Phe Ser Gly Val Thr Gly Lys 515 520 525 530
GTC AAT ATC AAC AAG CTC ATT ACG GCT TCC ACT AAT GTG GCC GTT AAA 1688 Val Asn He Asn Lys Leu He Thr Ala Ser Thr Asn Val Ala Val Lys 535 540 545
AAC TTC AAC ATT AAT GAA TTG GTT GTT AAG ACC AAT GGG GTG AGT GTG 1736 Asn Phe Asn He Asn Glu Leu Val Val Lys Thr Asn Gly Val Ser Val 550 555 560
GGG GAA TAC ACT CAT TTT AGC GAA GAT ATA GGC AGT CAA TCG CGC ATC 1784 Gly Glu Tyr Thr His Phe Ser Glu Asp He Gly Ser Gin Ser Arg He 565 570 575
AAT ACC GTG CGT TTG GAA ACT GGC ACT AGG TCA ATC TTT TCT GGG GGT 1832 Asn Thr Val Arg Leu Glu Thr Gly Thr Arg Ser He Phe Ser Gly Gly 580 585 590
GTC AAA TTT AAA AGC GGT GAA AAA CTG GTT ATA GAT GAG TTT TAC TAT 1880 Val Lys Phe Lys Ser Gly Glu Lys Leu Val He Asp Glu Phe Tyr Tyr 595 600 605 610
AGC CCT TGG AAT TAT TTT GAC GCT AGG AAT ATT AAA AAT GTT GAA ATC 1928 Ser Pro Trp Asn Tyr Phe Asp Ala Arg Asn He Lys Asn Val Glu He 615 620 625
ACC AGA AAA TTC GCT TCT TCA ACC CCA GAA AAC CCT TGG GGC ACA TCA 1976 Thr Arg Lys Phe Ala Ser Ser Thr Pro Glu Asn Pro Trp Gly Thr Ser 630 635 640
AAG CTT ATG TTT AAT AAT CTA ACC CTG GGT CAA AAT GCG GTC ATG GAC 2024 Lys Leu Met Phe Asn Asn Leu Thr Leu Gly Gin Asn Ala Val Met Asp 645 650 655 TAT AGT CAA TTT TCA AAT TTA ACC ATT CAG GGG GAT TTC ATC AAC AAT 2072 Tyr Ser Gin Phe Ser Asn Leu Thr He Gin Gly Asp Phe He Asn Asn 660 665 670
CAA GGC ACT ATC AAT TAT TTG GTC CGA GGC GGG CAA GTA GCC ACC TTG 2120 Gin Gly Thr He Asn Tyr Leu Val Arg Gly Gly Gin Val Ala Thr Leu 675 680 685 690
AAT GTA GGC AAT GCG GCA GCT ATG TTC TTT AGT AAT AAT GTG GAT AGC 2168 Asn Val Gly Asn Ala Ala Ala Met Phe Phe Ser Asn Asn Val Asp Ser 695 700 705
GCG ACT GGG TTT TAC CAA CCG CTC ATG AAG ATT AAC AGC GCT CAA GAT 2216 Ala Thr Gly Phe Tyr Gin Pro Leu Met Lys He Asn Ser Ala Gin Asp 710 715 720
CTC ATT AAA AAT AAA GAA CAT GTC TTA TTG AAA GCG AAA ATC ATC GGT 2264 Leu He Lys Asn Lys Glu His Val Leu Leu Lys Ala Lys He He Gly 725 730 735
TAT GGC AAT GTT TCT TTA GGC ACT AAC AGC ATT AGT AAT GTT AAT CTA 2312 Tyr Gly Asn Val Ser Leu Gly Thr Asn Ser He Ser Asn Val Asn Leu 740 745 750
ATA GAG CAA TTC AAA GAG CGC CTA GCC CTT TAC AAC AAC AAT AAC CGC 2360 He Glu Gin Phe Lys Glu Arg Leu Ala Leu Tyr Asn Asn Asn Asn Arg 755 760 765 770
ATG GAT ATT TGT GTG GTG CGA AAT ACT GAT GAC ATT AAA GCA TGC GGG 2408 Met Asp He Cys Val Val Arg Asn Thr Asp Asp He Lys Ala Cys Gly 775 780 785
ACG GCT ATC GGC AAT CAA AGC ATG GTG AAT AAC CCC GAC AAT TAC AAG 2456 Thr Ala He Gly Asn Gin Ser Met Val Asn Asn Pro Asp Asn Tyr Lys 790 795 800
TAT CTT ATC GGT AAA GCA TGG AAG AAC ATA GGG ATC AGC AAA ACA GCT 2504 Tyr Leu He Gly Lys Ala Trp Lys Asn He Gly He Ser Lys Thr Ala 805 810 815
AAT GGC TCT AAA ATT TCG GTG TAT TAT TTA GGC AAT TCT ACG CCT ACT 2552 Asn Gly Ser Lys He Ser Val Tyr Tyr Leu Gly Asn Ser Thr Pro Thr 820 825 830
GAG AAA GGT GGC AAT ACC ACA AAT TTA CCT ACA AAC ACC ACT AGC AAT 2600 Glu Lys Gly Gly Asn Thr Thr Asn Leu Pro Thr Asn Thr Thr Ser Asn 835 840 845 850
GTG CGT TCT GCC AAC AAC GCC CTT GCG CAA AAC GCT CCT TTC GCT CAA 2648 Val Arg Ser Ala Asn Asn Ala Leu Ala Gin Asn Ala Pro Phe Ala Gin 855 860 865
CCT AGC GCC ACT CCT AAT TTA GTC GCT ATC AAT CAG CAT GAT TTT GGC 2696 Pro Ser Ala Thr Pro Asn Leu Val Ala He Asn Gin His Asp Phe Gly 870 875 880
ACC ATT GAA AGC GTG TTT GAA TTG GCT AAC CGC TCT AAA GAT ATT GAC 2744 Thr He Glu Ser Val Phe Glu Leu Ala Asn Arg Ser Lys Asp He Asp 885 890 895
ACG CTT TAT GCT AAC TCA GGC GCG CAA GGC AGG GAT CTC TTA CAA ACC 2792 Thr Leu Tyr Ala Asn Ser Gly Ala Gin Gly Arg Asp Leu Leu Gin Thr 900 905 910
TTA TTG ATT GAT AGC CAT GAT GCG GGT TAT GCC AGA CAA ATG ATT GAT 2840 Leu Leu He Asp Ser His Asp Ala Gly Tyr Ala Arg Gin Met He Asp 915 920 925 930
AAC ACA AGC ACC GGT GAA ATC ACC AAG CAA TTG AAT GCG GCC ACT ACC 2888 Asn Thr Ser Thr Gly Glu He Thr Lys Gin Leu Asn Ala Ala Thr Thr 935 940 945
ACT TTA AAC AAC ATA GCC AGT TTA GAG CAT AAG ACA AGC AGC TTA CAA 2936 Thr Leu Asn Asn He Ala Ser Leu Glu His Lys Thr Ser Ser Leu Gin 950 955 960
ACT TTG AGC TTG AGT AAT GCG ATG ATC TTA AAT TCT CGT TTA GTC AAT 2984 Thr Leu Ser Leu Ser Asn Ala Met He Leu Asn Ser Arg Leu Val Asn 965 970 975
CTC TCC AGA AGG CAC ACC AAT AAT ATT GAC TCA TTC GCC CAA CGC TTA 3032 Leu Ser Arg Arg His Thr Asn Asn He Asp Ser Phe Ala Gin Arg Leu 980 985 990
CAA GCT TTA AAA GAC CAA AAA TTC GCT TCT TTA GAA AGC GCG GCG GAA 3080 Gin Ala Leu Lys Asp Gin Lys Phe Ala Ser Leu Glu Ser Ala Ala Glu 995 1000 1005 1010
GTG TTG TAT CAA TTT GCC CCT AAA TAT GAA AAA CCT ACC AAT GTT TGG 3128 Val Leu Tyr Gin Phe Ala Pro Lys Tyr Glu Lys Pro Thr Asn Val Trp 1015 1020 1025
GCT AAC GCT ATT GGG GGA ACG AGC TTG AAT AAT GGC GGC AAC GCT TCA 3176 Ala Asn Ala He Gly Gly Thr Ser Leu Asn Asn Gly Gly Asn Ala Ser 1030 1035 1040
TTG TAT GGC ACA AGT GCG GGC GTA GAT GCC TAC CTT AAT GGG GAA GTG 3224 Leu Tyr Gly Thr Ser Ala Gly Val Asp Ala Tyr Leu Asn Gly Glu Val 1045 1050 1055
GAA GCC ATT GTG GGC GGT TTT GGA AGC TAT GGT TAT AGC TCT TTT AAT 3272 Glu Ala He Val Gly Gly Phe Gly Ser Tyr Gly Tyr Ser Ser Phe Asn 1060 1065 1070
AAT CAA GCG AAC TCT CTT AAC TCT GGA GCC AAT AAC ACT AAT TTT GGC 3320 Asn Gin Ala Asn Ser Leu Asn Ser Gly Ala Asn Asn Thr Asn Phe Gly 1075 1080 1085 1090
GTG TAT AGC CGT ATC TTT GCT AAC CAG CAT GAA TTT GAT TTT GAA GCT 3368 Val Tyr Ser Arg He Phe Ala Asn Gin His Glu Phe Asp Phe Glu Ala 1095 1100 1105
CAA GGG GCG CTA GGG AGT GAT CAA TCA AGC TTG AAT TTC AAA AGC GCT 3416 Gin Gly Ala Leu Gly Ser Asp Gin Ser Ser Leu Asn Phe Lys Ser Ala 1110 1115 1120
CTA TTG CGA GAT TTG AAT CAA AGC TAT AAT TAC TTA GCC TAT AGC GCT 3464 Leu Leu Arg Asp Leu Asn Gin Ser Tyr Asn Tyr Leu Ala Tyr Ser Ala 1125 1130 1135
GCA ACA AGA GCG AGC TAT GGT TAT GAC TTT GCA TTT TTT AGG AAC GCT 3512 Ala Thr Arg Ala Ser Tyr Gly Tyr Asp Phe Ala Phe Phe Arg Asn Ala 1140 1145 1150
TTG GTG TTA AAA CCA AGC GTG GGC GTG AGC TAT AAC CAT TTA GGT TCA 3560 Leu Val Leu Lys Pro Ser Val Gly Val Ser Tyr Asn His Leu Gly Ser 1155 1160 1165 1170
ACC AAC TTT AAA AGC AAT AGC AAT CAA GTG GCT TTG AAA AAT GGC TCT 3608 Thr Asn Phe Lys Ser Asn Ser Asn Gin Val Ala Leu Lys Asn Gly Ser 1175 1180 1185
AGC AGT CAG CAT TTA TTC AAC GCT AGC GCT AAT GTG GAA GCG CGC TAT 3656 Ser Ser Gin His Leu Phe Asn Ala Ser Ala Asn Val Glu Ala Arg Tyr 1190 1195 1200
TAT TAT GGG GAC ACT TCA TAC TTC TAT ATG AAC GCT GGA GTT TTA CAA 3704 Tyr Tyr Gly Asp Thr Ser Tyr Phe Tyr Met Asn Ala Gly Val Leu Gin 1205 1210 1215
GAA TTT GCT AAC TTT GGT TCT AGC AAT GCG GTA TCT TTA AAC ACC TTT 3752 Glu Phe Ala Asn Phe Gly Ser Ser Asn Ala Val Ser Leu Asn Thr Phe 1220 1225 1230
AAA GTG AAT GCC GCA CAC AAT CCT TTA AGT ACC CAT GCC AGA GTG ATG 3800 Lys Val Asn Ala Ala His Asn Pro Leu Ser Thr His Ala Arg Val Met 1235 1240 1245 1250
ATG GGT GGG GAA TTA AAA TTA GCT AAA GAA GTG TTT TTG AAT TTG GGC 3848 Met Gly Gly Glu Leu Lys Leu Ala Lys Glu Val Phe Leu Asn Leu Gly 1255 1260 1265
TTT GTT TAT TTG CAC AAT TTG ATT TCC AAT ATA GGC CAT TTC GCT TCC 3896 Phe Val Tyr Leu His Asn Leu He Ser Asn He Gly His Phe Ala Ser 1270 1275 1280
AAT TTA GGA ATG AGG TAT AGT TTC TAAATACCGC TCTTAAACCC ATGCTCAAAG 3950 Asn Leu Gly Met Arg Tyr Ser Phe 1285 1290
CATGGGTTTG AAATCTTACA AAA 3973
(2) INFORMATION FOR SEQ ID NO: 138: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1290 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:
Met Glu He Gin Gin Thr His Arg Lys He Asn Arg Pro Leu Val Ser
1 5 10 15
Leu Ala Leu Val Gly Ala Leu Val Ser He Thr Pro Gin Gin Ser His
20 25 30
Ala Ala Phe Phe Thr Thr Val He He Pro Ala He Val Gly Gly He
35 40 45
Ala Thr Gly Ala Ala Val Gly Thr Val Ser Gly Leu Leu Gly Trp Gly
50 55 60
Leu Lys Gin Ala Glu Glu Ala Asn Lys Thr Pro Asp Lys Pro Asp Lys 65 70 75 80
Val Trp Arg He Gin Ala Gly Lys Gly Phe Asn Glu Phe Pro Asn Lys
85 90 95
Glu Tyr Asp Leu Tyr Arg Ser Leu Leu Ser Ser Lys He Asp Gly Gly
100 105 110
Trp Asp Trp Gly Asn Ala Ala Thr His Tyr Trp Val Lys Gly Gly Gin
115 120 125
Trp Asn Lys Leu Glu Val Asp Met Lys Asp Ala Val Gly Thr Tyr Asn
130 135 140
Leu Ser Gly Leu Arg Asn Phe Thr Gly Gly Asp Leu Asp Val Asn Met 145 150 155 160
Gin Lys Ala Thr Leu Arg Leu Gly Gin Phe Asn Gly Asn Ser Phe Thr
165 170 175
Ser Tyr Lys Asp Ser Ala Asp Arg Thr Thr Arg Val Asp Phe Asn Ala
180 185 190
Lys Asn He Leu He Asp Asn Phe Leu Glu He Asn Asn Arg Val Gly
195 200 205
Ser Gly Ala Gly Arg Lys Ala Ser Ser Thr Val Leu Thr Leu Gin Ala
210 215 220
Ser Glu Gly He Thr Ser Ser Lys Asn Ala Glu He Ser Leu Tyr Asp 225 230 235 240
Gly Ala Thr Leu Asn Leu Ala Ser Asn Ser Val Lys Leu Met Gly Asn
245 250 255
Val Trp Met Gly Arg Leu Gin Tyr Val Gly Ala Tyr Leu Ala Pro Ser
260 265 270
Tyr Ser Thr He Asn Thr Ser Lys Val Thr Gly Glu Val Asn Phe Asn
275 280 285
His Leu Thr Val Gly Asp His Asn Ala Ala Gin Ala Gly He He Ala
290 295 300
Ser Asn Lys Thr His He Gly Thr Leu Asp Leu Trp Gin Ser Ala Gly 305 310 315 320
Leu Asn He He Ala Pro Pro Glu Gly Gly Tyr Lys Asp Lys Pro Lys
325 330 335
Asp Lys Pro Ser Asn Thr Thr Gin Asn Asn Ala Asn Asn Asn Gin Gin 340 345 350 Asn Ser Ala Gin Asn Asn Ser Asn Thr Gin Val He Asn Pro Pro Asn
355 360 365
Ser Ala Gin Lys Thr Glu He Gin Pro Thr Gin Val He Asp Gly Pro
370 375 380
Phe Ala Gly Gly Lys Asp Thr Val Val Asn He Asp Arg He Asn Thr 385 390 395 400
Asn Ala Asp Gly Thr He Lys Val Gly Gly Tyr Lys Ala Ser Leu Thr
405 410 415
Thr Asn Ala Ala His Leu His He Gly Lys Gly Gly He Asn Leu Ser
420 425 430
Asn Gin Ala Ser Gly Arg Thr Leu Leu Val Glu Asn Leu Thr Gly Asn
435 440 445
He Thr Val Asp Gly Pro Leu Arg Val Asn Asn Gin Val Gly Gly Tyr
450 455 460
Ala Leu Ala Gly Ser Ser Ala Asn Phe Glu Phe Lys Ala Gly Thr Asp 465 470 475 480
Thr Lys Asn Gly Thr Ala Thr Phe Asn Asn Asp He Ser Leu Gly Arg
485 490 495
Phe Val Asn Leu Lys Val Asp Ala His Thr Ala Asn Phe Lys Gly He
500 505 510
Asp Thr Gly Asn Gly Gly Phe Asn Thr Leu Asp Phe Ser Gly Val Thr
515 520 525
Gly Lys Val Asn He Asn Lys Leu He Thr Ala Ser Thr Asn Val Ala
530 535 540
Val Lys Asn Phe Asn He Asn Glu Leu Val Val Lys Thr Asn Gly Val 545 550 555 560
Ser Val Gly Glu Tyr Thr His Phe Ser Glu Asp He Gly Ser Gin Ser
565 570 575
Arg He Asn Thr Val Arg Leu Glu Thr Gly Thr Arg Ser He Phe Ser
580 585 590
Gly Gly Val Lys Phe Lys Ser Gly Glu Lys Leu Val He Asp Glu Phe
595 600 605
Tyr Tyr Ser Pro Trp Asn Tyr Phe Asp Ala Arg Asn He Lys Asn Val
610 615 620
Glu He Thr Arg Lys Phe Ala Ser Ser Thr Pro Glu Asn Pro Trp Gly 625 630 635 640
Thr Ser Lys Leu Met Phe Asn Asn Leu Thr Leu Gly Gin Asn Ala Val
645 650 655
Met Asp Tyr Ser Gin Phe Ser Asn Leu Thr He Gin Gly Asp Phe He
660 665 670
Asn Asn Gin Gly Thr He Asn Tyr Leu Val Arg Gly Gly Gin Val Ala
675 680 685
Thr Leu Asn Val Gly Asn Ala Ala Ala Met Phe Phe Ser Asn Asn Val
690 695 700
Asp Ser Ala Thr Gly Phe Tyr Gin Pro Leu Met Lys He Asn Ser Ala 705 710 715 720
Gin Asp Leu He Lys Asn Lys Glu His Val Leu Leu Lys Ala Lys He
725 730 735
He Gly Tyr Gly Asn Val Ser Leu Gly Thr Asn Ser He Ser Asn Val
740 745 750
Asn Leu He Glu Gin Phe Lys Glu Arg Leu Ala Leu Tyr Asn Asn Asn
755 760 765
Asn Arg Met Asp He Cys Val Val Arg Asn Thr Asp Asp He Lys Ala
770 775 780
Cys Gly Thr Ala He Gly Asn Gin Ser Met Val Asn Asn Pro Asp Asn 785 790 795 800
Tyr Lys Tyr Leu He Gly Lys Ala Trp Lys Asn He Gly He Ser Lys
805 810 815
Thr Ala Asn Gly Ser Lys He Ser Val Tyr Tyr Leu Gly Asn Ser Thr
820 825 830
Pro Thr Glu Lys Gly Gly Asn Thr Thr Asn Leu Pro Thr Asn Thr Thr
835 840 845
Ser Asn Val Arg Ser Ala Asn Asn Ala Leu Ala Gin Asn Ala Pro Phe
850 855 860
Ala Gin Pro Ser Ala Thr Pro Asn Leu Val Ala He Asn Gin His Asp 865 870 875 880
Phe Gly Thr He Glu Ser Val Phe Glu Leu Ala Asn Arg Ser Lys Asp
885 890 895
He Asp Thr Leu Tyr Ala Asn Ser Gly Ala Gin Gly Arg Asp Leu Leu
900 905 910
Gin Thr Leu Leu He Asp Ser His Asp Ala Gly Tyr Ala Arg Gin Met
915 920 925
He Asp Asn Thr Ser Thr Gly Glu He Thr Lys Gin Leu Asn Ala Ala
930 935 940
Thr Thr Thr Leu Asn Asn He Ala Ser Leu Glu His Lys Thr Ser Ser 945 950 955 960
Leu Gin Thr Leu Ser Leu Ser Asn Ala Met He Leu Asn Ser Arg Leu
965 970 975
Val Asn Leu Ser Arg Arg His Thr Asn Asn He Asp Ser Phe Ala Gin
980 985 990
Arg Leu Gin Ala Leu Lys Asp Gin Lys Phe Ala Ser Leu Glu Ser Ala
995 1000 1005
Ala Glu Val Leu Tyr Gin Phe Ala Pro Lys Tyr Glu Lys Pro Thr Asn
1010 1015 1020
Val Trp Ala Asn Ala He Gly Gly Thr Ser Leu Asn Asn Gly Gly Asn 025 1030 1035 1040
Ala Ser Leu Tyr Gly Thr Ser Ala Gly Val Asp Ala Tyr Leu Asn Gly
1045 1050 1055
Glu Val Glu Ala He Val Gly Gly Phe Gly Ser Tyr Gly Tyr Ser Ser
1060 1065 1070
Phe Asn Asn Gin Ala Asn Ser Leu Asn Ser Gly Ala Asn Asn Thr Asn
1075 1080 1085
Phe Gly Val Tyr Ser Arg He Phe Ala Asn Gin His Glu Phe Asp Phe
1090 1095 1100
Glu Ala Gin Gly Ala Leu Gly Ser Asp Gin Ser Ser Leu Asn Phe Lys 105 1110 1115 1120
Ser Ala Leu Leu Arg Asp Leu Asn Gin Ser Tyr Asn Tyr Leu Ala Tyr
1125 1130 1135
Ser Ala Ala Thr Arg Ala Ser Tyr Gly Tyr Asp Phe Ala Phe Phe Arg
1140 1145 1150
Asn Ala Leu Val Leu Lys Pro Ser Val Gly Val Ser Tyr Asn His Leu
1155 1160 1165
Gly Ser Thr Asn Phe Lys Ser Asn Ser Asn Gin Val Ala Leu Lys Asn
1170 1175 1180
Gly Ser Ser Ser Gin His Leu Phe Asn Ala Ser Ala Asn Val Glu Ala 185 1190 1195 1200
Arg Tyr Tyr Tyr Gly Asp Thr Ser Tyr Phe Tyr Met Asn Ala Gly Val
1205 1210 1215
Leu Gin Glu Phe Ala Asn Phe Gly Ser Ser Asn Ala Val Ser Leu Asn 1220 1225 1230 Thr Phe Lys Val Asn Ala Ala His Asn Pro Leu Ser Thr His Ala Arg
1235 1240 1245
Val Met Met Gly Gly Glu Leu Lys Leu Ala Lys Glu Val Phe Leu Asn
1250 1255 1260
Leu Gly Phe Val Tyr Leu His Asn Leu He Ser Asn He Gly His Phe 265 1270 1275 1280
Ala Ser Asn Leu Gly Met Arg Tyr Ser Phe 1285 1290
(2) INFORMATION FOR SEQ ID NO: 139:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1335 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 55...1284 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:
TTAGTAGAAA TTGAAGCGAT AGCCATTAAG TAATTTATTA AAGGGACTAT CAGC ATG 57
Met 1
AAA AAA GAG GTC GTG GTC ATA GGC GGT GGG ATT GTA GGG CTT TCT TGT 105 Lys Lys Glu Val Val Val He Gly Gly Gly He Val Gly Leu Ser Cys 5 10 15
GCG TAT TCT ATG CAC AAG TTA GGG CAT AAG GTC TGC GTG ATA GAA AAA 153 Ala Tyr Ser Met His Lys Leu Gly His Lys Val Cys Val He Glu Lys 20 25 30
AAC GAT GGC GCA AAC GGC ACT TCT TTT GGG AAT GCT GGG CTT ATT TCT 201 Asn Asp Gly Ala Asn Gly Thr Ser Phe Gly Asn Ala Gly Leu He Ser 35 40 45
GCG TTT AAA AAA GCC CCA CTC TCA TGC CCT GGT GTG GTG TTA GAC ACC 249 Ala Phe Lys Lys Ala Pro Leu Ser Cys Pro Gly Val Val Leu Asp Thr 50 55 60 65
CTG AAG CTC ATG CTC AAA AAC CAA GCC CCT TTA AAA TTC CAT TTC GGG 297 Leu Lys Leu Met Leu Lys Asn Gin Ala Pro Leu Lys Phe His Phe Gly 70 75 80
CTT AAT TTA AAG CTC TAT CAA TGG ATT TTA AAA TTT GTA AAA AGC GCG 345 Leu Asn Leu Lys Leu Tyr Gin Trp He Leu Lys Phe Val Lys Ser Ala 85 90 95
AAC GCC AAA TCC ACG CAC CGC ACC ATG GCG TTG TTT GAA CGC TAC GGG 393 Asn Ala Lys Ser Thr His Arg Thr Met Ala Leu Phe Glu Arg Tyr Gly 100 105 110
TGG CTG AGT ATT GAT ATG TAT CAT CAA ATG CTA AAA GAC GGC ATG GAC 441 Trp Leu Ser He Asp Met Tyr His Gin Met Leu Lys Asp Gly Met Asp 115 120 125
TTT TGG TAT AAA GAA GAT GGG CTT TTA ATG ATC TAC ACT CTA GAA GAA 489 Phe Trp Tyr Lys Glu Asp Gly Leu Leu Met He Tyr Thr Leu Glu Glu 130 135 140 145
AGT TTT GAA AAA AAG CTT AAA ACT TGC GAT AAC AGC GGC GCT TAT AAA 537 Ser Phe Glu Lys Lys Leu Lys Thr Cys Asp Asn Ser Gly Ala Tyr Lys 150 155 160
ATC CTT AGC GCT AAA GAG ACC AAA GAA TAC ATG CCC GTT GTT AAT GAC 585 He Leu Ser Ala Lys Glu Thr Lys Glu Tyr Met Pro Val Val Asn Asp 165 170 175
AAT ATC TGC GGG AGC GTG CTT TTA ACC GAA AAC GCG CAT GTG GAT CCG 633 Asn He Cys Gly Ser Val Leu Leu Thr Glu Asn Ala His Val Asp Pro 180 185 190
GGC GAA GTG ATG CAC TCT TTG CAA GAA TAT TTA CAA AAT GTT GGC GTG 681 Gly Glu Val Met His Ser Leu Gin Glu Tyr Leu Gin Asn Val Gly Val 195 200 205
GAG TTC CTT TAT AAT GAA GAA GTG ATC GAT TTT GAG TTT AAA AAT AAC 729 Glu Phe Leu Tyr Asn Glu Glu Val He Asp Phe Glu Phe Lys Asn Asn 210 215 220 225
CTC ATT GAG GGC GTT ATC ACG CAC AAG GAA AAA ATC CAA GCA GAA ACA 777 Leu He Glu Gly Val He Thr His Lys Glu Lys He Gin Ala Glu Thr 230 235 240
ATC ATT CTA GCC ACT GGG GCT AAC CCC ACT CTC ATT AAA AAA ACC AAG 825 He He Leu Ala Thr Gly Ala Asn Pro Thr Leu He Lys Lys Thr Lys 245 250 255
AAC GAT TTT TTA ATG ATG GGG GCT AAA GGA TAT AGC ATC ACC TTT AAA 873 Asn Asp Phe Leu Met Met Gly Ala Lys Gly Tyr Ser He Thr Phe Lys 260 265 270
ATG CCT GAA GAA TTA AAA CCC AAA ACC TCT TCT TTA TTT GCG GAT ATT 921 Met Pro Glu Glu Leu Lys Pro Lys Thr Ser Ser Leu Phe Ala Asp He 275 280 285
TTC ATG GCG ATG ACC CCA CGA AGA GAC ACT GTA AGG ATC ACT TCT AAA 969 Phe Met Ala Met Thr Pro Arg Arg Asp Thr Val Arg He Thr Ser Lys 290 295 300 305
TTA GAA TTA AAC ACC AAC AAC GCT CTC ATT GAT AAA GAG CAA ATC GCT 1017 Leu Glu Leu Asn Thr Asn Asn Ala Leu He Asp Lys Glu Gin He Ala 310 315 320
AAC ATG AAA AAG AAT TTA GCC GCT TTC ACG CAG CCT TTT GAA ATG AAA 1065 Asn Met Lys Lys Asn Leu Ala Ala Phe Thr Gin Pro Phe Glu Met Lys 325 330 335
GAC GCC ATA GAG TGG TGC GGT TTC AGA CCC TTA ACC CCT AAT GAT ATT 1113 Asp Ala He Glu Trp Cys Gly Phe Arg Pro Leu Thr Pro Asn Asp He 340 345 350
CCT TAT TTG GGC TAT GAC AAA CGC TAT AAA AAC TTA ATC CAT GCG ACA 1161 Pro Tyr Leu Gly Tyr Asp Lys Arg Tyr Lys Asn Leu He His Ala Thr 355 360 365
GGG CTA GGG TGG CTT GGC ATC ACT TTT GGC CCA GCC ATT GGT AAA ATC 1209 Gly Leu Gly Trp Leu Gly He Thr Phe Gly Pro Ala He Gly Lys He 370 375 380 385
ATC' GCC AAT TTG AGC CAA GAC GGA GCG AAT GAA AAA AAT GCC GAT ATT 1257 He Ala Asn Leu Ser Gin Asp Gly Ala Asn Glu Lys Asn Ala Asp He 390 395 400
ATG CTT TTT TCT GCA TTT TTT AGG GAT TAAGGAATTT CTTTTTTAAA CCCTAGT 1311 Met Leu Phe Ser Ala Phe Phe Arg Asp 405 410
TTATTAAGGA GTTTTTATGG AAAC 1335
(2) INFORMATION FOR SEQ ID NO: 140:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 410 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140:
Met Lys Lys Glu Val Val Val He Gly Gly Gly He Val Gly Leu Ser
1 5 10 15
Cys Ala Tyr Ser Met His Lys Leu Gly His Lys Val Cys Val He Glu
20 25 30
Lys Asn Asp Gly Ala Asn Gly Thr Ser Phe Gly Asn Ala Gly Leu He
35 40 45
Ser Ala Phe Lys Lys Ala Pro Leu Ser Cys Pro Gly Val Val Leu Asp
50 55 60
Thr Leu Lys Leu Met Leu Lys Asn Gin Ala Pro Leu Lys Phe His Phe 65 70 75 80
Gly Leu Asn Leu Lys Leu Tyr Gin Trp He Leu Lys Phe Val Lys Ser
85 90 95
Ala Asn Ala Lys Ser Thr His Arg Thr Met Ala Leu Phe Glu Arg Tyr 100 105 110
Gly Trp Leu Ser He Asp Met Tyr His Gin Met Leu Lys Asp Gly Met
115 120 125
Asp Phe Trp Tyr Lys Glu Asp Gly Leu Leu Met He Tyr Thr Leu Glu
130 135 140
Glu Ser Phe Glu Lys Lys Leu Lys Thr Cys Asp Asn Ser Gly Ala Tyr 145 150 155 160
Lys He Leu Ser Ala Lys Glu Thr Lys Glu Tyr Met Pro Val Val Asn
165 170 175
Asp Asn He Cys Gly Ser Val Leu Leu Thr Glu Asn Ala His Val Asp
180 185 190
Pro Gly Glu Val Met His Ser Leu Gin Glu Tyr Leu Gin Asn Val Gly
195 200 205
Val Glu Phe Leu Tyr Asn Glu Glu Val He Asp Phe Glu Phe Lys Asn
210 215 220
Asn Leu He Glu Gly Val He Thr His Lys Glu Lys He Gin Ala Glu 225 230 235 240
Thr He He Leu Ala Thr Gly Ala Asn Pro Thr Leu He Lys Lys Thr
245 250 255
Lys Asn Asp Phe Leu Met Met Gly Ala Lys Gly Tyr Ser He Thr Phe
260 265 270
Lys Met Pro Glu Glu Leu Lys Pro Lys Thr Ser Ser Leu Phe Ala Asp
275 280 285
He Phe Met Ala Met Thr Pro Arg Arg Asp Thr Val Arg He Thr Ser
290 295 300
Lys Leu Glu Leu Asn Thr Asn Asn Ala Leu He Asp Lys Glu Gin He 305 310 315 320
Ala Asn Met Lys Lys Asn Leu Ala Ala Phe Thr Gin Pro Phe Glu Met
325 330 335
Lys Asp Ala He Glu Trp Cys Gly Phe Arg Pro Leu Thr Pro Asn Asp
340 345 350
He Pro Tyr Leu Gly Tyr Asp Lys Arg Tyr Lys Asn Leu He His Ala
355 360 365
Thr Gly Leu Gly Trp Leu Gly He Thr Phe Gly Pro Ala He Gly Lys
370 375 380
He He Ala Asn Leu Ser Gin Asp Gly Ala Asn Glu Lys Asn Ala Asp 385 390 395 400
He Met Leu Phe Ser Ala Phe Phe Arg Asp 405 410
(2) INFORMATION FOR SEQ ID NO: 141:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1579 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1526 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:
AAAAACGCTA TAAGATAGTC AAATACATTC AATAAATGCA AGGGGAAATC ATG GAA 56
Met Glu
1
CAT AAA GAA ATC GTT ATA GGG GTT GAT CTA GGC TCT AGA AAG ATT TGC 104 His Lys Glu He Val He Gly Val Asp Leu Gly Ser Arg Lys He Cys 5 10 15
GCG ATA GTG GCT GAA TTT AAA GAA GGG ATT TTG CGC ATC ATT GGC ACG 152 Ala He Val Ala Glu Phe Lys Glu Gly He Leu Arg He He Gly Thr 20 25 30
GCC CAT CAA GAC TCC AAA GAA ATC AAT TCA AAA GCC ATT AAA AGA GGG 200 Ala His Gin Asp Ser Lys Glu He Asn Ser Lys Ala He Lys Arg Gly 35 40 45 50
CGT ATC AAT AGC CTT GCT CAC GCT TCC AAC GCC ATT AAA GAA GTG ATT 248 Arg He Asn Ser Leu Ala His Ala Ser Asn Ala He Lys Glu Val He 55 60 65
AAT AGC GCT AAA AAA ATG GCA GGT TTG AAC GCT GAT GAA GAC AGA AAT 296 Asn Ser Ala Lys Lys Met Ala Gly Leu Asn Ala Asp Glu Asp Arg Asn 70 75 80
AAC CCC ATG CCC CAT TTT GGG GAA TAC CAC CCT AAA ACT AAG GCG ATT 344 Asn Pro Met Pro His Phe Gly Glu Tyr His Pro Lys Thr Lys Ala He 85 90 95
GTT TCT TTT TCT GGG GCT TAT ACT GAA AGC ATT AGA GAT GTT ACC GGT 392 Val Ser Phe Ser Gly Ala Tyr Thr Glu Ser He Arg Asp Val Thr Gly 100 105 110
GTA GCG AGC ACC AAA GAT AAT GTG GTA ACC ATT GAT GAA ATC AAT CGC 440 Val Ala Ser Thr Lys Asp Asn Val Val Thr He Asp Glu He Asn Arg 115 120 125 130
GCT ATC AAT AGT GCA TGC GCT AAA GCA GGC TTA GAT AAC GAC AAA CAT 488 Ala He Asn Ser Ala Cys Ala Lys Ala Gly Leu Asp Asn Asp Lys His 135 140 145
ATT TTG CAT GCT CTC CCC TAT CGC TTC ACT TTA GAC AAA CAA GAA GTG 536 He Leu His Ala Leu Pro Tyr Arg Phe Thr Leu Asp Lys Gin Glu Val 150 155 160
AAT GAC CCT TTA GGG ATG AGC GGG ACT CGC TTG GAA GTC TTT ATC CAC 584 Asn Asp Pro Leu Gly Met Ser Gly Thr Arg Leu Glu Val Phe He His 165 170 175
ATT GTC TAT ACA GAA AAA AAC AAC ATT GAA AAT TTA GAA AAA ATC ATG 632 He Val Tyr Thr Glu Lys Asn Asn He Glu Asn Leu Glu Lys He Met 180 185 190 ATC CAA TCT GGG GTA GAG ATT GAA AAC ATC GTG ATC AAT TCT TAT GCA 680 He Gin Ser Gly Val Glu He Glu Asn He Val He Asn Ser Tyr Ala 195 200 205 210
GCC TCG ATT GCC ACC TTA TCT AAT GAT GAA AGG GAA TTG GGC GTG GCT 728 Ala Ser He Ala Thr Leu Ser Asn Asp Glu Arg Glu Leu Gly Val Ala 215 220 225
TGC GTG GAT ATG GGC GGA GAG ACA TGC AAC CTT ACG ATT TAT AGC GGC 776 Cys Val Asp Met Gly Gly Glu Thr Cys Asn Leu Thr He Tyr Ser Gly 230 235 240
AAT TCC ATA CGC TAT AAC AAA TAT TTG CCC GTA GGC TCT CAC CAT TTA 824 Asn Ser He Arg Tyr Asn Lys Tyr Leu Pro Val Gly Ser His His Leu 245 250 255
ACC ACG GAT TTA TCG CAC ATG CTC AAC ACC CCA TTC CCT TAC GCT GAA 872 Thr Thr Asp Leu Ser His Met Leu Asn Thr Pro Phe Pro Tyr Ala Glu 260 265 270
GAA GTT AAG ATC AAA TAC GGG GAT CTT TCT TTT GAA GGC GGC GAA GAA 920 Glu Val Lys He Lys Tyr Gly Asp Leu Ser Phe Glu Gly Gly Glu Glu 275 280 285 290
ACG CCC TCT CAA AAT GTC CAA ATC CCT ACC ACC GGC TCG GAT GGC CAT 968 Thr Pro Ser Gin Asn Val Gin He Pro Thr Thr Gly Ser Asp Gly His 295 300 305
GAA AGC CAT ATT GTG CCG CTT AGT GAA ATC CAA ACT ATC ATG AGA GAA 1016 Glu Ser His He Val Pro Leu Ser Glu He Gin Thr He Met Arg Glu 310 315 320
AGG GCT TTA GAA ACT TTT AAA ATC ATC CAC AGG AGC ATT CAA GAT AGC 1064 Arg Ala Leu Glu Thr Phe Lys He He His Arg Ser He Gin Asp Ser 325 330 335
GGC TTA GAA GAG CAT TTG GGC GGA GGC GTT GTG TTA ACC GGT GGG ATG 1112 Gly Leu Glu Glu His Leu Gly Gly Gly Val Val Leu Thr Gly Gly Met 340 345 350
GCT TTA ATG AAA GGG ATC AAA GAA TTA GCC AGA ACC CAT TTC ACT AAT 1160 Ala Leu Met Lys Gly He Lys Glu Leu Ala Arg Thr His Phe Thr Asn 355 360 365 370
TAC CCG GTG CGT TTG GCA GCC CCT GTG GAA AAA TAC AAT ATC ATG GGC 1208 Tyr Pro Val Arg Leu Ala Ala Pro Val Glu Lys Tyr Asn He Met Gly 375 380 385
ATG TTT GAA GAT TTG AAA GAC CCT CGC TTT TCA GTC GTA GTT GGC TTG 1256 Met Phe Glu Asp Leu Lys Asp Pro Arg Phe Ser Val Val Val Gly Leu 390 395 400
ATT TTA TAC AAA GCA GGG GGG CAT ACC AAT TAT GAA AGA GAC TCT AAA 1304 He Leu Tyr Lys Ala Gly Gly His Thr Asn Tyr Glu Arg Asp Ser Lys 405 410 415 GGG GTT ATC CGC TAC CAT GAA AGC GAT GAT TAC ACA AGA ACA GCC CAT 1352 Gly Val He Arg Tyr His Glu Ser Asp Asp Tyr Thr Arg Thr Ala His 420 425 430
CAA TCA AGC CCT ACC CCC CAT ATC CAT TCA TCG CCC ACA GAA AGG AAT 1400 Gin Ser Ser Pro Thr Pro His He His Ser Ser Pro Thr Glu Arg Asn 435 440 445 450
TTG AGC GAT TTA AAA GCC CCT AGT GCT CCT TTA AAC ACC GCT AAA AAC 1448 Leu Ser Asp Leu Lys Ala Pro Ser Ala Pro Leu Asn Thr Ala Lys Asn 455 460 465
GAT GAC TTT TTA CCT ATA AAA CCC ACC GAA CAA AAA GGT TTT TTT AAA 1496 Asp Asp Phe Leu Pro He Lys Pro Thr Glu Gin Lys Gly Phe Phe Lys 470 475 480
AGT TTC CTT GAT AAG ATT TCT AAA TTC TTT TAAGATACAG CCATTTCTTT ATG 1549 Ser Phe Leu Asp Lys He Ser Lys Phe Phe 485 490
CGATAAAAAC GCCTTGATGG TTATCAAAAG 1579
(2) INFORMATION FOR SEQ ID NO: 142:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 492 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:142:
Met Glu His Lys Glu He Val He Gly Val Asp Leu Gly Ser Arg Lys
1 5 10 15
He Cys Ala He Val Ala Glu Phe Lys Glu Gly He Leu Arg He He
20 25 30
Gly Thr Ala His Gin Asp Ser Lys Glu He Asn Ser Lys Ala He Lys
35 40 45
Arg Gly Arg He Asn Ser Leu Ala His Ala Ser Asn Ala He Lys Glu
50 55 60
Val He Asn Ser Ala Lys Lys Met Ala Gly Leu Asn Ala Asp Glu Asp 65 70 75 80
Arg Asn Asn Pro Met Pro His Phe Gly Glu Tyr His Pro Lys Thr Lys
85 90 95
Ala He Val Ser Phe Ser Gly Ala Tyr Thr Glu Ser He Arg Asp Val
100 105 HO
Thr Gly Val Ala Ser Thr Lys Asp Asn Val Val Thr He Asp Glu He
115 120 125
Asn Arg Ala He Asn Ser Ala Cys Ala Lys Ala Gly Leu Asp Asn Asp
130 135 140
Lys His He Leu His Ala Leu Pro Tyr Arg Phe Thr Leu Asp Lys Gin 145 150 155 160 Glu Val Asn Asp Pro Leu Gly Met Ser Gly Thr Arg Leu Glu Val Phe
165 170 175
He His He Val Tyr Thr Glu Lys Asn Asn He Glu Asn Leu Glu Lys
180 185 190
He Met He Gin Ser Gly Val Glu He Glu Asn He Val He Asn Ser
195 200 205
Tyr Ala Ala Ser He Ala Thr Leu Ser Asn Asp Glu Arg Glu Leu Gly
210 215 220
Val Ala Cys Val Asp Met Gly Gly Glu Thr Cys Asn Leu Thr He Tyr 225 230 235 240
Ser Gly Asn Ser He Arg Tyr Asn Lys Tyr Leu Pro Val Gly Ser His
245 250 255
His Leu Thr Thr Asp Leu Ser His Met Leu Asn Thr Pro Phe Pro Tyr
260 265 270
Ala Glu Glu Val Lys He Lys Tyr Gly Asp Leu Ser Phe Glu Gly Gly
275 280 285
Glu Glu Thr Pro Ser Gin Asn Val Gin He Pro Thr Thr Gly Ser Asp
290 295 300
Gly His Glu Ser His He Val Pro Leu Ser Glu He Gin Thr He Met 305 310 315 320
Arg Glu Arg Ala Leu Glu Thr Phe Lys He He His Arg Ser He Gin
325 330 335
Asp Ser Gly Leu Glu Glu His Leu Gly Gly Gly Val Val Leu Thr Gly
340 345 350
Gly Met Ala Leu Met Lys Gly He Lys Glu Leu Ala Arg Thr His Phe
355 360 365
Thr Asn Tyr Pro Val Arg Leu Ala Ala Pro Val Glu Lys Tyr Asn He
370 375 380
Met Gly Met Phe Glu Asp Leu Lys Asp Pro Arg Phe Ser Val Val Val 385 390 395 400
Gly Leu He Leu Tyr Lys Ala Gly Gly His Thr Asn Tyr Glu Arg Asp
405 410 415
Ser Lys Gly Val He Arg Tyr His Glu Ser Asp Asp Tyr Thr Arg Thr
420 425 430
Ala His Gin Ser Ser Pro Thr Pro His He His Ser Ser Pro Thr Glu
435 440 445
Arg Asn Leu Ser Asp Leu Lys Ala Pro Ser Ala Pro Leu Asn Thr Ala
450 455 460
Lys Asn Asp Asp Phe Leu Pro He Lys Pro Thr Glu Gin Lys Gly Phe 465 470 475 480
Phe Lys Ser Phe Leu Asp Lys He Ser Lys Phe Phe 485 490
(2) INFORMATION FOR SEQ ID NO: 143:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1987 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence (B) LOCATION: 51...1934 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143:
AGCGCTTGAA ATTTTTAGCC ATTTATGACA CGAATTTAGA CGAATTTTAC ATG ATA 56
Met He 1
AGA GTG GCA GGG CTT AAA CAA CTC TAT GAG CAT AAA ATC GCC TCT AAA 104 Arg Val Ala Gly Leu Lys Gin Leu Tyr Glu His Lys He Ala Ser Lys 5 10 15
GGC ATT GAT GGC GCA AGC CCT GAA GAA CAA TTA GAA AAA ATC AAG CAT 152 Gly He Asp Gly Ala Ser Pro Glu Glu Gin Leu Glu Lys He Lys His 20 25 30
TAT TTA GCG CAT GAA ATT GAA GAA AGG GAG TTA GAA TTC CAA AAA ATC 200 Tyr Leu Ala His Glu He Glu Glu Arg Glu Leu Glu Phe Gin Lys He 35 40 45 50
CAA GCC CTA CTC TTT AAA AAA GGG CTT TGT ATC ACC CCC TAT AAT GAA 248 Gin Ala Leu Leu Phe Lys Lys Gly Leu Cys He Thr Pro Tyr Asn Glu 55 60 65
TTG AAT TTA GAG CAA AAA GCG AAG GCT AAA ACC TAT TTT AAA GAG CAG 296 Leu Asn Leu Glu Gin Lys Ala Lys Ala Lys Thr Tyr Phe Lys Glu Gin 70 75 80
CTT TAC GCG TTA GTT TTG CCT TTT AAA TTG GAT TCT TCA CAC ACT TTC 344 Leu Tyr Ala Leu Val Leu Pro Phe Lys Leu Asp Ser Ser His Thr Phe 85 90 95
CCG CCT TTA GCG AAT TTG ACT TTC GCG CTT TTT GCC CGC ATC AAA GAC 392 Pro Pro Leu Ala Asn Leu Thr Phe Ala Leu Phe Ala Arg He Lys Asp 100 105 110
AAA GAA ACC CAA ATT ATC TCC TAT GCG CTC ATC AAA CTC CCC TCT TTT 440 Lys Glu Thr Gin He He Ser Tyr Ala Leu He Lys Leu Pro Ser Phe 115 120 125 130
ATC TTC CGT TTT GTA GAG CTA GAA AAA GGC TTG TTT GTG TTA GCT GAA 488 He Phe Arg Phe Val Glu Leu Glu Lys Gly Leu Phe Val Leu Ala Glu 135 140 145
GAA ATC GTG GAA GCG CAT TTA GAA GAA TTG TTT TTA GAG CAT GAG ATT 536 Glu He Val Glu Ala His Leu Glu Glu Leu Phe Leu Glu His Glu He 150 155 160
TTA GAT TGC ATG GCG TTT AGG GTA ACT TGC GAT GCG GAT ATT GCT ATC 584 Leu Asp Cys Met Ala Phe Arg Val Thr Cys Asp Ala Asp He Ala He 165 170 175 ACT GAA GAT GAA GCG CAT GAT TAT GCA GAT TTG ATG AGT AAG AGT TTG 632 Thr Glu Asp Glu Ala His Asp Tyr Ala Asp Leu Met Ser Lys Ser Leu 180 185 190
AGG AAA CGC AAT CAA GGC GAA ATC GTG CGC TTG CAA ACC CAA AAA GGG 680 Arg Lys Arg Asn Gin Gly Glu He Val Arg Leu Gin Thr Gin Lys Gly 195 200 205 210
AGT CAA GAG CTT TTA AAA ACC CTC TTA GCG TCT TTA AGG AGT TTT CAA 728 Ser Gin Glu Leu Leu Lys Thr Leu Leu Ala Ser Leu Arg Ser Phe Gin 215 220 225
ACC CAC TCT TAC AAA AAG CAC AAA CTC ACC GGC ATG CAT ATC TAT AAA 776 Thr His Ser Tyr Lys Lys His Lys Leu Thr Gly Met His He Tyr Lys 230 235 240
AGC GCG ATC ATG CTC AAT TTA GGG GAT TTG TGG GAA TTA GTC AAT CAT 824 Ser Ala He Met Leu Asn Leu Gly Asp Leu Trp Glu Leu Val Asn His 245 250 255
AGC GAT TTT AAA GCG CTC AAA TCG CCC AAT TTC ACA CCC AAA ATC CAC 872 Ser Asp Phe Lys Ala Leu Lys Ser Pro Asn Phe Thr Pro Lys He His 260 265 270
CCT CAT TTC AAT GAA AAC GAT CTT TTC AAA TCT ATA GAA AAA CAG GAT 920 Pro His Phe Asn Glu Asn Asp Leu Phe Lys Ser He Glu Lys Gin Asp 275 280 285 290
CTG TTG CTG TTT CAT CCT TAT GAA AGT TTT GAG CCT GTG ATT GAT TTA 968 Leu Leu Leu Phe His Pro Tyr Glu Ser Phe Glu Pro Val He Asp Leu 295 300 305
ATA GAG CAA GCC GCT AGC GAT CCA GCC ACC CTT TCT ATC AAA ATG ACG 1016 He Glu Gin Ala Ala Ser Asp Pro Ala Thr Leu Ser He Lys Met Thr 310 315 320
CTT TAT CGT GTG GGC AAG CAT TCC CCC ATT GTC AAA GCT TTG ATT GAA 1064 Leu Tyr Arg Val Gly Lys His Ser Pro He Val Lys Ala Leu He Glu 325 330 335
GCG GCG AGC AAG ATT CAA GTG AGC GTT TTA GTG GAA TTA AAA GCG CGC 1112 Ala Ala Ser Lys He Gin Val Ser Val Leu Val Glu Leu Lys Ala Arg 340 345 350
TTT GAT GAA GAG AGC AAT CTG CAC TGG GCA AAA GCT TTA GAA AGG GCG 1160 Phe Asp Glu Glu Ser Asn Leu His Trp Ala Lys Ala Leu Glu Arg Ala 355 360 365 370
GGC GCG TTA GTC GTT TAT GGC GTT TTC AAA CTC AAA GTG CAT GCT AAA 1208 Gly Ala Leu Val Val Tyr Gly Val Phe Lys Leu Lys Val His Ala Lys 375 380 385
ATG CTA TTG ATC ACT AAA AAA ACA GAC AAC CAA TTA CGC CAT TTC ACC 1256 Met Leu Leu He Thr Lys Lys Thr Asp Asn Gin Leu Arg His Phe Thr 390 395 400 CAT TTA AGC ACG GGC AAT TAC AAC CCT TTG AGC GCT AAA GTC TAT ACC 1304 His Leu Ser Thr Gly Asn Tyr Asn Pro Leu Ser Ala Lys Val Tyr Thr 405 410 415
GAT GTG AGT TTT TTT AGC GCT AAA AAT GAA ATC GCT AAC GAC ATT ATC 1352 Asp Val Ser Phe Phe Ser Ala Lys Asn Glu He Ala Asn Asp He He 420 425 430
AAG CTT TTC CAT TCC TTG CTC ACT AGC AGC GCG ACT AAT AGC GCA TTA 1400 Lys Leu Phe His Ser Leu Leu Thr Ser Ser Ala Thr Asn Ser Ala Leu 435 440 445 450
GAA ACG CTT TTT ATG GCA CCC AAA CAA ATC AAG CCT AAA ATC ATT GAA 1448 Glu Thr Leu Phe Met Ala Pro Lys Gin He Lys Pro Lys He He Glu 455 460 465
CTC ATT CAA AAT GAA ATG AAT CAC CAA CAA GAA GGC TAT ATC ATT TTA 1496 Leu He Gin Asn Glu Met Asn His Gin Gin Glu Gly Tyr He He Leu 470 475 480
AAA GCC AAC GCC CTA GTG GAT AGC GAA ATC ATT GAA TGG CTC TAT CAA 1544 Lys Ala Asn Ala Leu Val Asp Ser Glu He He Glu Trp Leu Tyr Gin 485 490 495
GCC TCT CAA AAA GGG GTT AAA ATT GAT CTC ATT ATT AGA GGG ATT TGC 1592 Ala Ser Gin Lys Gly Val Lys He Asp Leu He He Arg Gly He Cys 500 505 510
TGT TTA AAG CCC CAA GTC AAG GGC TTG AGC GAA AAT ATC AGG GTG TAT 1640 Cys Leu Lys Pro Gin Val Lys Gly Leu Ser Glu Asn He Arg Val Tyr 515 520 525 530
TCT ATC GTG GGG AAA TAT TTA GAA CAT GCA CGC ATT TAT TAT TTT AAA 1688 Ser He Val Gly Lys Tyr Leu Glu His Ala Arg He Tyr Tyr Phe Lys 535 540 545
CAT GAA AAT ATT TAT TTT TCT AGC GCG GAT TTA ATG CCC AGG AAT TTA 1736 His Glu Asn He Tyr Phe Ser Ser Ala Asp Leu Met Pro Arg Asn Leu 550 555 560
GAA AGG CGC GTG GAA TTG CTC ATT CCA GCC ACA AAC CCA AAG ATC GCT 1784 Glu Arg Arg Val Glu Leu Leu He Pro Ala Thr Asn Pro Lys He Ala 565 570 575
CAT AAA TTG TTG CAT ATT TTA GAA ATC CAA CTC AAA GAC ACC TTA AAA 1832 His Lys Leu Leu His He Leu Glu He Gin Leu Lys Asp Thr Leu Lys 580 585 590
CGC TAC GAG TTA AAT TCT AAA GGC CGT TAC ATT AAA GTT TCA AAC CCT 1880 Arg Tyr Glu Leu Asn Ser Lys Gly Arg Tyr He Lys Val Ser Asn Pro 595 600 605 610
AAC GAT CCT TTA AAT TCG CAG GAT TAT TTT GAA AAA CAA GCC CTT AAA 1928 Asn Asp Pro Leu Asn Ser Gin Asp Tyr Phe Glu Lys Gin Ala Leu Lys 615 620 625 ACC TTT TAAGGGTTAT CGTTCAAATC ATAAAAGATA AGGATTTAAA TGCTTTATTC AT 1986 Thr Phe
1987
(2) INFORMATION FOR SEQ ID NO:144:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 628 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144:
Met He Arg Val Ala Gly Leu Lys Gin Leu Tyr Glu His Lys He Ala
1 5 10 15
Ser Lys Gly He Asp Gly Ala Ser Pro Glu Glu Gin Leu Glu Lys He
20 25 30
Lys His Tyr Leu Ala His Glu He Glu Glu Arg Glu Leu Glu Phe Gin
35 40 45
Lys He Gin Ala Leu Leu Phe Lys Lys Gly Leu Cys He Thr Pro Tyr
50 55 60
Asn Glu Leu Asn Leu Glu Gin Lys Ala Lys Ala Lys Thr Tyr Phe Lys 65 70 75 80
Glu Gin Leu Tyr Ala Leu Val Leu Pro Phe Lys Leu Asp Ser Ser His
85 90 95
Thr Phe Pro Pro Leu Ala Asn Leu Thr Phe Ala Leu Phe Ala Arg He
100 105 110
Lys Asp Lys Glu Thr Gin He He Ser Tyr Ala Leu He Lys Leu Pro
115 120 125
Ser Phe He Phe Arg Phe Val Glu Leu Glu Lys Gly Leu Phe Val Leu
130 135 140
Ala Glu Glu He Val Glu Ala His Leu Glu Glu Leu Phe Leu Glu His 145 150 155 160
Glu He Leu Asp Cys Met Ala Phe Arg Val Thr Cys Asp Ala Asp He
165 170 175
Ala He Thr Glu Asp Glu Ala His Asp Tyr Ala Asp Leu Met Ser Lys
180 185 190
Ser Leu Arg Lys Arg Asn Gin Gly Glu He Val Arg Leu Gin Thr Gin
195 200 205
Lys Gly Ser Gin Glu Leu Leu Lys Thr Leu Leu Ala Ser Leu Arg Ser
210 215 220
Phe Gin Thr His Ser Tyr Lys Lys His Lys Leu Thr Gly Met His He 225 230 235 240
Tyr Lys Ser Ala He Met Leu Asn Leu Gly Asp Leu Trp Glu Leu Val
245 250 255
Asn His Ser Asp Phe Lys Ala Leu Lys Ser Pro Asn Phe Thr Pro Lys
260 265 270
He His Pro His Phe Asn Glu Asn Asp Leu Phe Lys Ser He Glu Lys 275 280 285 Gin Asp Leu Leu Leu Phe His Pro Tyr Glu Ser Phe Glu Pro Val He
290 295 300
Asp Leu He Glu Gin Ala Ala Ser Asp Pro Ala Thr Leu Ser He Lys 305 310 315 320
Met Thr Leu Tyr Arg Val Gly Lys His Ser Pro He Val Lys Ala Leu
325 330 335
He Glu Ala Ala Ser Lys He Gin Val Ser Val Leu Val Glu Leu Lys
340 345 350
Ala Arg Phe Asp Glu Glu Ser Asn Leu His Trp Ala Lys Ala Leu Glu
355 360 365
Arg Ala Gly Ala Leu Val Val Tyr Gly Val Phe Lys Leu Lys Val His
370 375 380
Ala Lys Met Leu Leu He Thr Lys Lys Thr Asp Asn Gin Leu Arg His 385 390 395 400
Phe Thr His Leu Ser Thr Gly Asn Tyr Asn Pro Leu Ser Ala Lys Val
405 410 415
Tyr Thr Asp Val Ser Phe Phe Ser Ala Lys Asn Glu He Ala Asn Asp
420 425 430
He He Lys Leu Phe His Ser Leu Leu Thr Ser Ser Ala Thr Asn Ser
435 440 445
Ala Leu Glu Thr Leu Phe Met Ala Pro Lys Gin He Lys Pro Lys He
450 455 460
He Glu Leu He Gin Asn Glu Met Asn His Gin Gin Glu Gly Tyr He 465 470 475 480
He Leu Lys Ala Asn Ala Leu Val Asp Ser Glu He He Glu Trp Leu
485 490 495
Tyr Gin Ala Ser Gin Lys Gly Val Lys He Asp Leu He He Arg Gly
500 505 510
He Cys Cys Leu Lys Pro Gin Val Lys Gly Leu Ser Glu Asn He Arg
515 520 525
Val Tyr Ser He Val Gly Lys Tyr Leu Glu His Ala Arg He Tyr Tyr
530 535 540
Phe Lys His Glu Asn He Tyr Phe Ser Ser Ala Asp Leu Met Pro Arg 545 550 555 560
Asn Leu Glu Arg Arg Val Glu Leu Leu He Pro Ala Thr Asn Pro Lys
565 570 575
He Ala His Lys Leu Leu His He Leu Glu He Gin Leu Lys Asp Thr
580 585 590
Leu Lys Arg Tyr Glu Leu Asn Ser Lys Gly Arg Tyr He Lys Val Ser
595 600 605
Asn Pro Asn Asp Pro Leu Asn Ser Gin Asp Tyr Phe Glu Lys Gin Ala
610 615 620
Leu Lys Thr Phe 625
(2) INFORMATION FOR SEQ ID NO: 145:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 616 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...563 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:145:
TATAATATAG ATTTTATTTT AGCTAAAAAT GGCATGGGTT TTAGCAAGGA ATG GGC 56
Met Gly
1
TTG AAA AAT CTC TCA ACA CTT CTG GTG TTT TTA TTC TTT TGT TTA GGG 104 Leu Lys Asn Leu Ser Thr Leu Leu Val Phe Leu Phe Phe Cys Leu Gly 5 10 15
TGT GTG AGC AAT TTT AAT GAA GAC ACT TAC ACG CTA GAC TTA GTT TTA 152 Cys Val Ser Asn Phe Asn Glu Asp Thr Tyr Thr Leu Asp Leu Val Leu 20 25 30
GAA AAA AAG ATC CAA GCC AGC AGG AAA GGT GAA ATC ACC CAA GAT AAT 200 Glu Lys Lys He Gin Ala Ser Arg Lys Gly Glu He Thr Gin Asp Asn 35 40 45 50
GTG CCT ATC ATC ACG GCT ATC GCT ACG CAT TTA AAC GAT GTG GAT AGC 248 Val Pro He He Thr Ala He Ala Thr His Leu Asn Asp Val Asp Ser 55 60 65
GGC ACT TAC TAT GAC CAT GAG TAT TTT TTA GTG GAG ATT TTC ACG CAA 296 Gly Thr Tyr Tyr Asp His Glu Tyr Phe Leu Val Glu He Phe Thr Gin 70 75 80
AAT AAC GAC TGG ATA GAT GAT GGC TAT ATT TCT TAT GAA CTT TTT GGC 344 Asn Asn Asp Trp He Asp Asp Gly Tyr He Ser Tyr Glu Leu Phe Gly 85 90 95
ACA AAA CCT ATA GGC TCA GAG CCT TTA TGG GTG CGA GAA ATC ACA AAA 392 Thr Lys Pro He Gly Ser Glu Pro Leu Trp Val Arg Glu He Thr Lys 100 105 110
GAT GAA TTT GAT GGC ATT TTA GAA ACC ACG AAC AGG TGG AGC AGA GCT 440 Asp Glu Phe Asp Gly He Leu Glu Thr Thr Asn Arg Trp Ser Arg Ala 115 120 125 130
TTT TTG CTC GCT TTT AAC AAA TTG GAT TAT TTA GCG GTT CAA GAA GCC 488 Phe Leu Leu Ala Phe Asn Lys Leu Asp Tyr Leu Ala Val Gin Glu Ala 135 140 145
AAA CTA GAG CTT GAT GCC TAT AGT TTG GGC AAG ATT GTT TTT AAT TTC 536 Lys Leu Glu Leu Asp Ala Tyr Ser Leu Gly Lys He Val Phe Asn Phe 150 155 160
GCT TAT CAA GTC CCC CTA CCT CAA TTT TAATGCGCTT AGATTACGCC TTATTCA 590 Ala Tyr Gin Val Pro Leu Pro Gin Phe 165 170
GTCAGCATTT AGTAAATAGC AGAGAA 616
(2) INFORMATION FOR SEQ ID NO: 146:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 171 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:146:
Met Gly Leu Lys Asn Leu Ser Thr Leu Leu Val Phe Leu Phe Phe Cys
1 5 10 15
Leu Gly Cys Val Ser Asn Phe Asn Glu Asp Thr Tyr Thr Leu Asp Leu
20 25 30
Val Leu Glu Lys Lys He Gin Ala Ser Arg Lys Gly Glu He Thr Gin
35 40 45
Asp Asn Val Pro He He Thr Ala He Ala Thr His Leu Asn Asp Val
50 55 60
Asp Ser Gly Thr Tyr Tyr Asp His Glu Tyr Phe Leu Val Glu He Phe 65 70 75 80
Thr Gin Asn Asn Asp Trp He Asp Asp Gly Tyr He Ser Tyr Glu Leu
85 90 95
Phe Gly Thr Lys Pro He Gly Ser Glu Pro Leu Trp Val Arg Glu He
100 105 110
Thr Lys Asp Glu Phe Asp Gly He Leu Glu Thr Thr Asn Arg Trp Ser
115 120 125
Arg Ala Phe Leu Leu Ala Phe Asn Lys Leu Asp Tyr Leu Ala Val Gin
130 135 140
Glu Ala Lys Leu Glu Leu Asp Ala Tyr Ser Leu Gly Lys He Val Phe 145 150 155 160
Asn Phe Ala Tyr Gin Val Pro Leu Pro Gin Phe 165 170
(2) INFORMATION FOR SEQ ID NO: 147:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2341 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 966...2291 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147:
ATTGTGAATT AGGAGTGAGC GTGAATAGTA ATGGCAATAA AGACAAACAA CAGCAGAATG 60
TAAGCAGTGG GATTTCTCAA ATCTCATTAA AAAAGGTGGC AACTTTTGAT GAAAATGGGG 120
CGAGTTTTGA GAATTTAAAT TCTATCAACT TTATTTATGG GGCTAATGGG AGCGGTAAGA 180
CAACCACTTC TAGTTTTTTA AAAAATCTAG CTGAAAATGG GATTGAAGAC AAGTTTGCTA 240
ATAGTAAAAT AGCATGGTAT AACAATGAGA GTTTAAAGAT TGAAGTTTAT AACAAGCAAT 300
TTAAAGAAGA GCAATTGAGA AACTCTCAAG TTAAAGGCAT TTTTACGCTC GGTAAAAAAA 360
CGAACGAGAA TTTAGAAAAA ATTGAAAGCA AGAAAGAATC AATAAACAAA GAGAATGAAA 420
AGAAAATAAA AAATGAAGCA AGCTTGCAAG TTTTAACACA AAAAAAGGAA AAGGAAGAAA 480
AGGATTTTGC TGATAGGTGT TGGGAAAAAC TTTATAAGAA AAATGAAGAG GATTTTAAAG 540
AAACGCTAGA AGGCTTTAAG CGTAAAGAGA AGTTTAAAGA AAAAATCCTT AAGGAATTTG 600
AAAACGATAA ATACAATCAA AGCGAAATAG TAGGGTTAGA AAAATTAAAG GAAAAAATTG 660
AGATTGTTTT TGGTGAAAAC CAAACAGAAT TGGCACTATT GGAATGCAAT TTAACAGATT 720
TTGATTTTAT TGAAAATCAT TCTATTTGGG AACAAAAAAT TGTAGGGAGT GGTGATGCAG 780
CCATTGCAGA TTTAATAAAA AGATTAAGCA ATGAAGATTG GGTAGCTCAA GGTAGAGAAT 840
ATATAAAAGA TAATAGTATA TGCCCTTTCT GTCAAAAAGA AACCATTACC GAAGAATTTA 900
AAAAACAACT AGAATCTTAT TTTGATACAA GTTATCAAGA ATCTATTGAA ACGATCAAGG 960
AAAAG ATG GAA GAC TAC GCA AGC AGA ACC GCT GGA GCA CTG GAG CGA CTT 1010 Met Glu Asp Tyr Ala Ser Arg Thr Ala Gly Ala Leu Glu Arg Leu 1 5 10 15
GAT AAG ATT GTT GAA ACA GAA CAG AAG AAT CAA CAA ACT AAA TTG GAC 1058 Asp Lys He Val Glu Thr Glu Gin Lys Asn Gin Gin Thr Lys Leu Asp 20 25 30
ACA GAA AAT TTG AAA ATA ATT ATT GAA ACT TTG AGA AGT AAA ATC AAT 1106 Thr Glu Asn Leu Lys He He He Glu Thr Leu Arg Ser Lys He Asn 35 40 45
GGG AAT CAG CAA AAG ATG CTT GAT AAA AGT AAA GAA ATG AGC AGA AAT 1154 Gly Asn Gin Gin Lys Met Leu Asp Lys Ser Lys Glu Met Ser Arg Asn 50 55 60
TTT AAG CTT GAT AGC ACT AAA AAC GAG ATA GAC GCA ATT AAA GAT TTG 1202 Phe Lys Leu Asp Ser Thr Lys Asn Glu He Asp Ala He Lys Asp Leu 65 70 75
ATT AAA AAG GCT AAT GAG CAA ATA GCC AAT TAT AAT GAG ATG ATA AAG 1250 He Lys Lys Ala Asn Glu Gin He Ala Asn Tyr Asn Glu Met He Lys 80 85 90 95
GAT ATT GAA AAA CAG AAA AAG AGT TGT AAG GAA CAA ACT TGG AAA TTT 1298 Asp He Glu Lys Gin Lys Lys Ser Cys Lys Glu Gin Thr Trp Lys Phe 100 105 110
CTA GTC AAT GAA TTT AAA AGT GAT ATA CAA GAA TAT AAT AAA AAG TAT 1346 Leu Val Asn Glu Phe Lys Ser Asp He Gin Glu Tyr Asn Lys Lys Tyr 115 120 125
TGC GGT TTG GAG AAA GGA ATA AAC AAT TTA GAG AAA GCA ATT AGT GAA 1394 Cys Gly Leu Glu Lys Gly He Asn Asn Leu Glu Lys Ala He Ser Glu 130 135 140 AAT CAA GAA GAG GTA AAG AAA TTA GAA AAT GAA ATT AAG GAA TTA GAA 1442 Asn Gin Glu Glu Val Lys Lys Leu Glu Asn Glu He Lys Glu Leu Glu 145 150 155
AAA ACT ATG GTA AGC ATA AAG CCC ATT GTC AAT GAA ATC AAT ACG CTT 1490 Lys Thr Met Val Ser He Lys Pro He Val Asn Glu He Asn Thr Leu 160 165 170 175
TTA AAA GGG TAT GGA TTC GCG AAT TTT AGT TTG GCA TGC ACT GAA GAT 1538 Leu Lys Gly Tyr Gly Phe Ala Asn Phe Ser Leu Ala Cys Thr Glu Asp 180 185 190
GAA AAA TTT TAT CGT ATT CAA AGA GAA GAT GGT CAA TTA GTA GGA GAA 1586 Glu Lys Phe Tyr Arg He Gin Arg Glu Asp Gly Gin Leu Val Gly Glu 195 200 205
ACA CTG AGC GAG GGT GAA GTT ACT TTC ATC ACT TTC TTA TAT TAT TAT 1634 Thr Leu Ser Glu Gly Glu Val Thr Phe He Thr Phe Leu Tyr Tyr Tyr 210 215 220
CAT TTA GCA AAA GGC TCT TTA GAA GAG AAC GAT ATA TCA AAA AAT AAG 1682 His Leu Ala Lys Gly Ser Leu Glu Glu Asn Asp He Ser Lys Asn Lys 225 230 235
GTT TTA GTG ATT GAT GAC CCC ATT TCA AGT TTG GAT AGC AAT ATA TTG 1730 Val Leu Val He Asp Asp Pro He Ser Ser Leu Asp Ser Asn He Leu 240 245 250 255
TTT ATA GTG AGT GTT TTA GTT AAA GAT CTT ATG AAA GAA GCC ATG GAA 1778 Phe He Val Ser Val Leu Val Lys Asp Leu Met Lys Glu Ala Met Glu 260 265 270
GAA AAA ACA AAC ATC AAG CAA GTT ATT ATA CTA ACC CAC AAC ACA TAT 1826 Glu Lys Thr Asn He Lys Gin Val He He Leu Thr His Asn Thr Tyr 275 280 285
TTT TAC AAG GAA ATT ACA TTA GAA TGT GAT TTA AAA CGC TAT CAA GGG 1874 Phe Tyr Lys Glu He Thr Leu Glu Cys Asp Leu Lys Arg Tyr Gin Gly 290 295 300
AAA TAT TCT TTT TGG ATA ATT AAA AAG GAT AAT AAT GTT TCA AAA ATT 1922 Lys Tyr Ser Phe Trp He He Lys Lys Asp Asn Asn Val Ser Lys He 305 310 315
AAA GAT TAT AAA GAA AAT CCC ATT AAA AAT TCC TAT GAA TTG CTA TGG 1970 Lys Asp Tyr Lys Glu Asn Pro He Lys Asn Ser Tyr Glu Leu Leu Trp 320 325 330 335
CAA GAA GTA AAA CAA GCA AAA GAA AAT AAT GCT TCT TGG GTA TCT TTA 2018 Gin Glu Val Lys Gin Ala Lys Glu Asn Asn Ala Ser Trp Val Ser Leu 340 345 350
CAA AAT GTT ATG CGA AGA ATT ATT GAG TAT TAC TTT AGG ATT TTA GGC 2066 Gin Asn Val Met Arg Arg He He Glu Tyr Tyr Phe Arg He Leu Gly 355 360 365 GGT TTT AAA CAT AAT GAT AGC TTG AGT GAA TGT TTT GAA AAT ATT GAA 2114 Gly Phe Lys His Asn Asp Ser Leu Ser Glu Cys Phe Glu Asn He Glu 370 375 380
GAA AAA CGA GTG TGT AAT TCT TTC ATT TCA TGG TTT AAT GAT GGC TCT 2162 Glu Lys Arg Val Cys Asn Ser Phe He Ser Trp Phe Asn Asp Gly Ser 385 390 395
CAT GGG ATT TCA GAT GAT TTG TTT ATG CAA AGT CAA GAT ACA AGT ATT 2210 His Gly He Ser Asp Asp Leu Phe Met Gin Ser Gin Asp Thr Ser He 400 405 410 415
GAG ACA TAT TTA AAA GTT TTT GAA AAA ATA TTT AAA GAA ACC GGT CAT 2258 Glu Thr Tyr Leu Lys Val Phe Glu Lys He Phe Lys Glu Thr Gly His 420 425 430
GAA GCT CAT TAT AAA ATG ATG ATG AGA ATG AAG TAATTGAATT AAAAACAAGG 2311 Glu Ala His Tyr Lys Met Met Met Arg Met Lys 435 440
AATAACATGC GAATCGTATT TATGGGAACG 2341
(2) INFORMATION FOR SEQ ID NO: 148:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 442 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148:
Met Glu Asp Tyr Ala Ser Arg Thr Ala Gly Ala Leu Glu Arg Leu Asp
1 5 10 15
Lys He Val Glu Thr Glu Gin Lys Asn Gin Gin Thr Lys Leu Asp Thr
20 25 30
Glu Asn Leu Lys He He He Glu Thr Leu Arg Ser Lys He Asn Gly
35 40 45
Asn Gin Gin Lys Met Leu Asp Lys Ser Lys Glu Met Ser Arg Asn Phe
50 55 60
Lys Leu Asp Ser Thr Lys Asn Glu He Asp Ala He Lys Asp Leu He 65 70 75 80
Lys Lys Ala Asn Glu Gin He Ala Asn Tyr Asn Glu Met He Lys Asp
85 90 95
He Glu Lys Gin Lys Lys Ser Cys Lys Glu Gin Thr Trp Lys Phe Leu
100 105 110
Val Asn Glu Phe Lys Ser Asp He Gin Glu Tyr Asn Lys Lys Tyr Cys
115 120 125
Gly Leu Glu Lys Gly He Asn Asn Leu Glu Lys Ala He Ser Glu Asn
130 135 140
Gin Glu Glu Val Lys Lys Leu Glu Asn Glu He Lys Glu Leu Glu Lys 145 150 155 160 Thr Met Val Ser He Lys Pro He Val Asn Glu He Asn Thr Leu Leu
165 170 175
Lys Gly Tyr Gly Phe Ala Asn Phe Ser Leu Ala Cys Thr Glu Asp Glu
180 185 190
Lys Phe Tyr Arg He Gin Arg Glu Asp Gly Gin Leu Val Gly Glu Thr
195 200 205
Leu Ser Glu Gly Glu Val Thr Phe He Thr Phe Leu Tyr Tyr Tyr His
210 215 220
Leu Ala Lys Gly Ser Leu Glu Glu Asn Asp He Ser Lys Asn Lys Val 225 230 235 240
Leu Val He Asp Asp Pro He Ser Ser Leu Asp Ser Asn He Leu Phe
245 250 255
He Val Ser Val Leu Val Lys Asp Leu Met Lys Glu Ala Met Glu Glu
260 265 270
Lys Thr Asn He Lys Gin Val He He Leu Thr His Asn Thr Tyr Phe
275 280 285
Tyr Lys Glu He Thr Leu Glu Cys Asp Leu Lys Arg Tyr Gin Gly Lys
290 295 300
Tyr Ser Phe Trp He He Lys Lys Asp Asn Asn Val Ser Lys He Lys 305 310 315 320
Asp Tyr Lys Glu Asn Pro He Lys Asn Ser Tyr Glu Leu Leu Trp Gin
325 330 335
Glu Val Lys Gin Ala Lys Glu Asn Asn Ala Ser Trp Val Ser Leu Gin
340 345 350
Asn Val Met Arg Arg He He Glu Tyr Tyr Phe Arg He Leu Gly Gly
355 360 365
Phe Lys His Asn Asp Ser Leu Ser Glu Cys Phe Glu Asn He Glu Glu
370 375 380
Lys Arg Val Cys Asn Ser Phe He Ser Trp Phe Asn Asp Gly Ser His 385 390 395 400
Gly He Ser Asp Asp Leu Phe Met Gin Ser Gin Asp Thr Ser He Glu
405 410 415
Thr Tyr Leu Lys Val Phe Glu Lys He Phe Lys Glu Thr Gly His Glu
420 425 430
Ala His Tyr Lys Met Met Met Arg Met Lys 435 440
(2) INFORMATION FOR SEQ ID NO: 149:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3793 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...3740 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149:
TTATGTGGGC TACAACATAG GCTTTTGATT AAACAAAATA AGGGAAAAAT ATG ATA 56
Met He 1
AAA AAA GCT AGA AAA TTC ATA CCA TTC TTT TTA ATT GGC TCC CTC TTA 104 Lys Lys Ala Arg Lys Phe He Pro Phe Phe Leu He Gly Ser Leu Leu 5 10 15
GCT GAA GAC AAT GGC TGG TAT ATG TCT GTA GGC TAT CAA ATC GGT GGC 152 Ala Glu Asp Asn Gly Trp Tyr Met Ser Val Gly Tyr Gin He Gly Gly 20 25 30
ACG CAA CAA TTC ATC AAT AAC AAA CAA CTT TTA GAA AAT CAA AAT ATC 200 Thr Gin Gin Phe He Asn Asn Lys Gin Leu Leu Glu Asn Gin Asn He 35 40 45 50
ATC AAC AGC GTA ACC CAA AGC GCG ATC AAC ATT GCA GGG CCT ACT ACC 248 He Asn Ser Val Thr Gin Ser Ala He Asn He Ala Gly Pro Thr Thr 55 60 65
GGC CTT ATC ACT TTA AGC TCT CAA ACC GTC ATT GAC GCT TTA GGC TAT 296 Gly Leu He Thr Leu Ser Ser Gin Thr Val He Asp Ala Leu Gly Tyr 70 75 80
GGC GTG AGT AAC ACT GTT GGC AAC CAA TTA GAG GGC ATT TCT AAT ATC 344 Gly Val Ser Asn Thr Val Gly Asn Gin Leu Glu Gly He Ser Asn He 85 90 95
TTG AAT CAA ATT GGC AAA AGA AAA GAC TTT TAT TCT AGC CGT CAA ATC 392 Leu Asn Gin He Gly Lys Arg Lys Asp Phe Tyr Ser Ser Arg Gin He 100 105 110
TCT AGC ATT TCC CAA CAA ATC ATA GGG CTT AAA GGA AGC TCT GAT CCC 440 Ser Ser He Ser Gin Gin He He Gly Leu Lys Gly Ser Ser Asp Pro 115 120 125 130
TTA AAA GCC CAT TCT TCA CAG ATC ACA GCC AAA CTC CTT TCC AAC ACC 488 Leu Lys Ala His Ser Ser Gin He Thr Ala Lys Leu Leu Ser Asn Thr 135 140 145
CAA AGC GCG TTT GAT CAG GGC ATC GCG CTA AGC ACT AAC ATC ATT AGC 536 Gin Ser Ala Phe Asp Gin Gly He Ala Leu Ser Thr Asn He He Ser 150 155 160
TCT ATC AAT AGC CTA AAC CCT AGC AAC AAC ACC CAA GAG GTT AAA AAA 584 Ser He Asn Ser Leu Asn Pro Ser Asn Asn Thr Gin Glu Val Lys Lys 165 170 175
CAG CTC CAA AAC ACC GCG CAA TCC ATG ACA GAA TTG TTG CAA CAA ATT 632 Gin Leu Gin Asn Thr Ala Gin Ser Met Thr Glu Leu Leu Gin Gin He 180 185 190
GAA CAC AGC ATC ACT AAA ACC ACT AGC ACC ACT TAC GCG CAA TCC TTA 680 Glu His Ser He Thr Lys Thr Thr Ser Thr Thr Tyr Ala Gin Ser Leu 195 200 205 210
CTC TCC AAT CTA ACC GAT GCG GTG AAT GCC TCT AGC AAT AAT ACC GCT 728 Leu Ser Asn Leu Thr Asp Ala Val Asn Ala Ser Ser Asn Asn Thr Ala 215 220 225
TAT GTG AGC GCT CTT GTT AAC GCT TTA AAC ACT TTA GGG GTA GGG GTT 776 Tyr Val Ser Ala Leu Val Asn Ala Leu Asn Thr Leu Gly Val Gly Val 230 235 240
TTC CCC ACC ACA ACC ACA ACG CAT GTG GTG TTA AAC CCA CCG GGA CAA 824 Phe Pro Thr Thr Thr Thr Thr His Val Val Leu Asn Pro Pro Gly Gin 245 250 255
GTC GTA TTC TAT CCA ACC AAT TCC ATT TTA GGC TCT ACT TCT TCA AAC 872 Val Val Phe Tyr Pro Thr Asn Ser He Leu Gly Ser Thr Ser Ser Asn 260 265 270
AGC AAT AAC CAA CAA CAA TAC AAC AAC ACC CTT TTA ATG AAC ACC TTA 920 Ser Asn Asn Gin Gin Gin Tyr Asn Asn Thr Leu Leu Met Asn Thr Leu 275 280 285 290
CAA GGG ACA TTA AGC GCT AAT ACT CAA AAT AAC CCC AAT GGT TGC GCC 968 Gin Gly Thr Leu Ser Ala Asn Thr Gin Asn Asn Pro Asn Gly Cys Ala 295 300 305
AAT CAA GTC CAG TGT TTG GAG CAA TTC ATC CAA AAT TTA GCC CCT TTA 1016 Asn Gin Val Gin Cys Leu Glu Gin Phe He Gin Asn Leu Ala Pro Leu 310 315 320
GCC GCA ACC CCC ACT TCA AAC AAC CAG GCC AAC CAG CAA GTC CAA GCC 1064 Ala Ala Thr Pro Thr Ser Asn Asn Gin Ala Asn Gin Gin Val Gin Ala 325 330 335
ATC GCT CAA AAG CTT CAA AGC GTT GCT ATC AAC ACT TTA GAC AAC AAT 1112 He Ala Gin Lys Leu Gin Ser Val Ala He Asn Thr Leu Asp Asn Asn 340 345 350
GCG ATC AAC AAC ACC ACC TAT AAT TTA AAC AAT TTG CAC AAC GCT TTG 1160 Ala He Asn Asn Thr Thr Tyr Asn Leu Asn Asn Leu His Asn Ala Leu 355 360 365 370
AAT TTC CAA GCC TAT GAA AGC ACG ATA GAA CAA TAC AAT AAC GCT TTA 1208 Asn Phe Gin Ala Tyr Glu Ser Thr He Glu Gin Tyr Asn Asn Ala Leu 375 380 385
AAA CAA ATT TCT TGG ATC AGT TTT ACT GAG CCT AAA AAC TTA CTC AAA 1256 Lys Gin He Ser Trp He Ser Phe Thr Glu Pro Lys Asn Leu Leu Lys 390 395 400
AAC ACT TCC AAT AAC TAC CAA ATC GGC ACC GTT ACC AAC GCT CAA GGG 1304 Asn Thr Ser Asn Asn Tyr Gin He Gly Thr Val Thr Asn Ala Gin Gly 405 410 415 CAA AAT ATC AGC GCC TAT GAT TGC ATG ACT GCT ACC GGA AGC CTT TCT 1352 Gin Asn He Ser Ala Tyr Asp Cys Met Thr Ala Thr Gly Ser Leu Ser 420 425 430
AGC AAT GCT TCT AGC GGG ATT TCA TGC TCA GCC ACA AGC TCC ACA AGT 1400 Ser Asn Ala Ser Ser Gly He Ser Cys Ser Ala Thr Ser Ser Thr Ser 435 440 445 450
TCC ACA AAT AGC TTT GAC AAT TCT TTA GTC GCT ACC TCC AAA GTC CAA 1448 Ser Thr Asn Ser Phe Asp Asn Ser Leu Val Ala Thr Ser Lys Val Gin 455 460 465
ACC ATC AAC GGC AAA GAG CAG ATC GGC GTG AAT TCT TTT AAC CTT GTC 1496 Thr He Asn Gly Lys Glu Gin He Gly Val Asn Ser Phe Asn Leu Val 470 475 480
TCT CAA GTG TGG AGC GTT TAT AAT TCT TTA AAA ACT TCA GAA GAA AAT 1544 Ser Gin Val Trp Ser Val Tyr Asn Ser Leu Lys Thr Ser Glu Glu Asn 485 490 495
TTG CAA AAA AAC GCC AAT ATT TTA TGC GCT AAT GGG ACG CAA TCT GGG 1592 Leu Gin Lys Asn Ala Asn He Leu Cys Ala Asn Gly Thr Gin Ser Gly 500 505 510
ACA AGC TCA TGC AAT AGC TCT TCA GGG GGT TTG AGC ATC AGC GGG AAC 1640 Thr Ser Ser Cys Asn Ser Ser Ser Gly Gly Leu Ser He Ser Gly Asn 515 520 525 530
GCC CAA TTG CAA AAT ATT TTA AGC CCT ACT AGT GGG ACT ACC ACT AAT 1688 Ala Gin Leu Gin Asn He Leu Ser Pro Thr Ser Gly Thr Thr Thr Asn 535 540 545
ACT CAA GCT AAA AGC AAC GCT CCC AAA CTA AAA GCG ATG GTG GTG GTG 1736 Thr Gin Ala Lys Ser Asn Ala Pro Lys Leu Lys Ala Met Val Val Val 550 555 560
AAT AAT GAA GAA GAA GCT AAA ACG GCC AAT TTA GCC CAA AGC AGC GGG 1784 Asn Asn Glu Glu Glu Ala Lys Thr Ala Asn Leu Ala Gin Ser Ser Gly 565 570 575
ACA ACC ACA CAA TCT CCT AAC AGC ACG GTG ATG GGA GCT TTA AAC ACC 1832 Thr Thr Thr Gin Ser Pro Asn Ser Thr Val Met Gly Ala Leu Asn Thr 580 585 590
GTG TTG CAA AAT GTC AGC AAT TTC CAA CAA AGC ATT CAA AAC GCT TTT 1880 Val Leu Gin Asn Val Ser Asn Phe Gin Gin Ser He Gin Asn Ala Phe 595 600 605 610
CAA AAC CAA GAA AGT AAT ATC CAA GCT TGG GCG AAT GCG ATT TAT AAC 1928 Gin Asn Gin Glu Ser Asn He Gin Ala Trp Ala Asn Ala He Tyr Asn 615 620 625
ACT AAT GGG AGT CAG TCG CAA GAG ATG ACA CCT AAC AAT AAC CAA GAT 1976 Thr Asn Gly Ser Gin Ser Gin Glu Met Thr Pro Asn Asn Asn Gin Asp 630 635 640 TTA CGC ATC CAA TTG AGG GCG AAT TTT TAC CAG CTC ATC AAT ACC ATT 2024 Leu Arg He Gin Leu Arg Ala Asn Phe Tyr Gin Leu He Asn Thr He 645 650 655
AAC CAG CAA GTG CCT ACA GAC ATG AAT GCT TTA ATT AAT CAA AGC CAA 2072 Asn Gin Gin Val Pro Thr Asp Met Asn Ala Leu He Asn Gin Ser Gin 660 665 670
CAA ACC CAA CAA ACA AGC GGA TCA GCA AGC AAT AAT AAC GCA TGC GCG 2120 Gin Thr Gin Gin Thr Ser Gly Ser Ala Ser Asn Asn Asn Ala Cys Ala 675 680 685 690
AGT GGA ATG AGT GGG AGT AAT GGT AAC TGG TGC TAT CAG CAA TGG TCC 2168 Ser Gly Met Ser Gly Ser Asn Gly Asn Trp Cys Tyr Gin Gin Trp Ser 695 700 705
GAT TCT AAG GCT TAT TAC AGC GGG TTG CAA AGC GCT TTA GGG TAT CAA 2216 Asp Ser Lys Ala Tyr Tyr Ser Gly Leu Gin Ser Ala Leu Gly Tyr Gin 710 715 720
ACG CAA GCG ACA ACT CAA AGC GGG AGC AAT GGT GGG AAC AGC ATC ACC 2264 Thr Gin Ala Thr Thr Gin Ser Gly Ser Asn Gly Gly Asn Ser He Thr 725 730 735
TAC AAT GTC CAA CAA ATC ACG CTC ACT AGT AAT GGT TTG CTC AAC CAA 2312 Tyr Asn Val Gin Gin He Thr Leu Thr Ser Asn Gly Leu Leu Asn Gin 740 745 750
ATC ATC ACA AAT CTT AAG AGC GTT AAT GGA GGC AAT GGC GCG AGT GGT 2360 He He Thr Asn Leu Lys Ser Val Asn Gly Gly Asn Gly Ala Ser Gly 755 760 765 770
ACA GGC AGT GGG AAT GGC ACC AGT CAA ATC AAC ACA GCC TAC CAG ATG 2408 Thr Gly Ser Gly Asn Gly Thr Ser Gin He Asn Thr Ala Tyr Gin Met 775 780 785
CTC ACA GAC GCC AGC GAT GGG AAA TTA GGG ACT TAT AGT AGT AGT AGT 2456 Leu Thr Asp Ala Ser Asp Gly Lys Leu Gly Thr Tyr Ser Ser Ser Ser 790 795 800
GGC AGT AAT AAC GGC TAT ACG CCA TGC AAT AGC ACC AAT GGG AGC AAT 2504 Gly Ser Asn Asn Gly Tyr Thr Pro Cys Asn Ser Thr Asn Gly Ser Asn 805 810 815
AAA ACG AGT GGG AAC AAT TGT TAT GAA CCC AAC AAA CAA CAA AAC GCC 2552 Lys Thr Ser Gly Asn Asn Cys Tyr Glu Pro Asn Lys Gin Gin Asn Ala 820 825 830
ACC ACC GCA ACC GCC ACA ACC GAC AGC AAT TTA CAA AAA GTC TAT AAT 2600 Thr Thr Ala Thr Ala Thr Thr Asp Ser Asn Leu Gin Lys Val Tyr Asn 835 840 845 850
GAC GCC CAA AAA ATA GCC AAC ATT ATC GCC AGC TCT GGG AAC AAT AAA 2648 Asp Ala Gin Lys He Ala Asn He He Ala Ser Ser Gly Asn Asn Lys 855 860 865 GGC GTT GAA AAC GGC TTA AAA CAA TTC TTT GAA GCG TTA AAA AAT AAT 2696 Gly Val Glu Asn Gly Leu Lys Gin Phe Phe Glu Ala Leu Lys Asn Asn 870 875 880
AGC AGC AGT CTC AGT AAT TTA TGT GGT AAT GGT AGT AGC GGT AGT AGT 2744 Ser Ser Ser Leu Ser Asn Leu Cys Gly Asn Gly Ser Ser Gly Ser Ser 885 890 895
GGC ACT ACT TGC TCC GGT TGG CTT ATC AAC CTT TTA GGG GCA ATC CCC 2792 Gly Thr Thr Cys Ser Gly Trp Leu He Asn Leu Leu Gly Ala He Pro 900 905 910
ACC AAT GGA GTG AGC GAT ACG AAT AAT TTA ATT AAT CTG CTC ACT GAA 2840 Thr Asn Gly Val Ser Asp Thr Asn Asn Leu He Asn Leu Leu Thr Glu 915 920 925 930
TTC ATT AAA ACC GCC GGG TTT ATC CAA AAT AAT GAT AGT AGT GTA TCT 2888 Phe He Lys Thr Ala Gly Phe He Gin Asn Asn Asp Ser Ser Val Ser 935 940 945
ACT AGT CTT ACA AGC GCT TTT CAA GCC ATT ACG AGC GCT ATT TCT CAA 2936 Thr Ser Leu Thr Ser Ala Phe Gin Ala He Thr Ser Ala He Ser Gin 950 955 960
GGG TTT CAA GCC TTA CAA AAC GAT ATT AGC CCT AAT GCG ATT TTA ACC 2984 Gly Phe Gin Ala Leu Gin Asn Asp He Ser Pro Asn Ala He Leu Thr 965 970 975
TTG CTC CAA GAG ATT ACT TCT AAC ACC ACC ACC ATT CAG TCA TTC TCG 3032 Leu Leu Gin Glu He Thr Ser Asn Thr Thr Thr He Gin Ser Phe Ser 980 985 990
CAA ACC TTA CGG CAG CTT TTA GGG GAT AAA ACA TTC TTT ATG GCG CAA 3080 Gin Thr Leu Arg Gin Leu Leu Gly Asp Lys Thr Phe Phe Met Ala Gin 995 1000 1005 1010
CAA AAG CTC ATT GAT GCG ATG ATT AAC GCC AGA AAT CAG GTT CAA AAC 3128 Gin Lys Leu He Asp Ala Met He Asn Ala Arg Asn Gin Val Gin Asn 1015 1020 1025
GCG CAA AAT CAA GCC AAT AAC TAC GGC TCT CAA CCC GTT TTA AGC CAG 3176 Ala Gin Asn Gin Ala Asn Asn Tyr Gly Ser Gin Pro Val Leu Ser Gin 1030 1035 1040
TAT GCG GCC GCT AAA AGC ACC CAA CAT GGC ATG AGC AAT GGT TTA GGG 3224 Tyr Ala Ala Ala Lys Ser Thr Gin His Gly Met Ser Asn Gly Leu Gly 1045 1050 1055
GTT GGT TTG GGC TAT AAA TAC TTC TTT GGT AAA GCG AGA AAA TTA GGC 3272 Val Gly Leu Gly Tyr Lys Tyr Phe Phe Gly Lys Ala Arg Lys Leu Gly 1060 1065 1070
CTT AGG CAT TAT TTT TTC TTT GAT TAC GGC TTT AGT GAA ATA GGC CTA 3320 Leu Arg His Tyr Phe Phe Phe Asp Tyr Gly Phe Ser Glu He Gly Leu 1075 1080 1085 1090 GCC AAT CAA AGC GTG AAA GCG AAT ATC TTT GCT TAT GGG GTA GGC ACG 3368 Ala Asn Gin Ser Val Lys Ala Asn He Phe Ala Tyr Gly Val Gly Thr 1095 1100 1105
GAT TTT TTA TGG AAC TTA TTC AGG AGG ACT TAC AAC ACT AAA GCG TTG 3416 Asp Phe Leu Trp Asn Leu Phe Arg Arg Thr Tyr Asn Thr Lys Ala Leu 1110 1115 1120
AAT TTT GGG CTA TTT GCT GGG GTC CAA CTG GGC GGC GCA ACC TGG CTT 3464 Asn Phe Gly Leu Phe Ala Gly Val Gin Leu Gly Gly Ala Thr Trp Leu 1125 1130 1135
AGC TCC TTA AGG CAA CAA ATC ATT GAC AAC TGG GGG AGT GCT AAT GAC 3512 Ser Ser Leu Arg Gin Gin He He Asp Asn Trp Gly Ser Ala Asn Asp 1140 1145 1150
ATC CAT TCA ACG AAT TTT CAA GTG GCG CTG AAT TTT GGG GTG CGC ACC 3560 He His Ser Thr Asn Phe Gin Val Ala Leu Asn Phe Gly Val Arg Thr 1155 1160 1165 1170
AAC TTC GCG GAG TTT AAG CGT TTT GCT AAG AAA TTC CAC AAT CAA GGG 3608 Asn Phe Ala Glu Phe Lys Arg Phe Ala Lys Lys Phe His Asn Gin Gly 1175 1180 1185
GTC ATC AGC CAA AAG AGC GTG GAA TTT GGG ATC AAA GTG CCT CTC ATC 3656 Val He Ser Gin Lys Ser Val Glu Phe Gly He Lys Val Pro Leu He 1190 1195 1200
AAT CAA GCG TAT TTG AAT AGC GCT GGA GCT GAT GTG AGT TAC AGG AGG 3704 Asn Gin Ala Tyr Leu Asn Ser Ala Gly Ala Asp Val Ser Tyr Arg Arg 1205 1210 1215
CTT TAT ACT TTT TAT ATC AAT TAC ATC ATG GGG TTT TAAAAAAGGG TGTGTC 3756 Leu Tyr Thr Phe Tyr He Asn Tyr He Met Gly Phe 1220 1225 1230
ATGGAAATCT TACAATTCAT CGGCTATGGG AATATGG 3793
(2) INFORMATION FOR SEQ ID NO: 150:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1230 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:150:
Met He Lys Lys Ala Arg Lys Phe He Pro Phe Phe Leu He Gly Ser
1 5 10 15
Leu Leu Ala Glu Asp Asn Gly Trp Tyr Met Ser Val Gly Tyr Gin He 20 25 30
Gly Gly Thr Gin Gin Phe He Asn Asn Lys Gin Leu Leu Glu Asn Gin
35 40 45
Asn He He Asn Ser Val Thr Gin Ser Ala He Asn He Ala Gly Pro
50 55 60
Thr Thr Gly Leu He Thr Leu Ser Ser Gin Thr Val He Asp Ala Leu 65 70 75 80
Gly Tyr Gly Val Ser Asn Thr Val Gly Asn Gin Leu Glu Gly He Ser
85 90 95
Asn He Leu Asn Gin He Gly Lys Arg Lys Asp Phe Tyr Ser Ser Arg
100 105 110
Gin He Ser Ser He Ser Gin Gin He He Gly Leu Lys Gly Ser Ser
115 120 125
Asp Pro Leu Lys Ala His Ser Ser Gin He Thr Ala Lys Leu Leu Ser
130 135 140
Asn Thr Gin Ser Ala Phe Asp Gin Gly He Ala Leu Ser Thr Asn He 145 150 155 160
He Ser Ser He Asn Ser Leu Asn Pro Ser Asn Asn Thr Gin Glu Val
165 170 175
Lys Lys Gin Leu Gin Asn Thr Ala Gin Ser Met Thr Glu Leu Leu Gin
180 185 190
Gin He Glu His Ser He Thr Lys Thr Thr Ser Thr Thr Tyr Ala Gin
195 200 205
Ser Leu Leu Ser Asn Leu Thr Asp Ala Val Asn Ala Ser Ser Asn Asn
210 215 220
Thr Ala Tyr Val Ser Ala Leu Val Asn Ala Leu Asn Thr Leu Gly Val 225 230 235 240
Gly Val Phe Pro Thr Thr Thr Thr Thr His Val Val Leu Asn Pro Pro
245 250 255
Gly Gin Val Val Phe Tyr Pro Thr Asn Ser He Leu Gly Ser Thr Ser
260 265 270
Ser Asn Ser Asn Asn Gin Gin Gin Tyr Asn Asn Thr Leu Leu Met Asn
275 280 285
Thr Leu Gin Gly Thr Leu Ser Ala Asn Thr Gin Asn Asn Pro Asn Gly
290 295 300
Cys Ala Asn Gin Val Gin Cys Leu Glu Gin Phe He Gin Asn Leu Ala 305 310 315 320
Pro Leu Ala Ala Thr Pro Thr Ser Asn Asn Gin Ala Asn Gin Gin Val
325 330 335
Gin Ala He Ala Gin Lys Leu Gin Ser Val Ala He Asn Thr Leu Asp
340 345 350
Asn Asn Ala He Asn Asn Thr Thr Tyr Asn Leu Asn Asn Leu His Asn
355 360 365
Ala Leu Asn Phe Gin Ala Tyr Glu Ser Thr He Glu Gin Tyr Asn Asn
370 375 380
Ala Leu Lys Gin He Ser Trp He Ser Phe Thr Glu Pro Lys Asn Leu 385 390 395 400
Leu Lys Asn Thr Ser Asn Asn Tyr Gin He Gly Thr Val Thr Asn Ala
405 410 415
Gin Gly Gin Asn He Ser Ala Tyr Asp Cys Met Thr Ala Thr Gly Ser
420 425 430
Leu Ser Ser Asn Ala Ser Ser Gly He Ser Cys Ser Ala Thr Ser Ser
435 440 445
Thr Ser Ser Thr Asn Ser Phe Asp Asn Ser Leu Val Ala Thr Ser Lys 450 455 460 Val Gin Thr He Asn Gly Lys Glu Gin He Gly Val Asn Ser Phe Asn 465 470 475 480
Leu Val Ser Gin Val Trp Ser Val Tyr Asn Ser Leu Lys Thr Ser Glu
485 490 495
Glu Asn Leu Gin Lys Asn Ala Asn He Leu Cys Ala Asn Gly Thr Gin
500 505 510
Ser Gly Thr Ser Ser Cys Asn Ser Ser Ser Gly Gly Leu Ser He Ser
515 520 525
Gly Asn Ala Gin Leu Gin Asn He Leu Ser Pro Thr Ser Gly Thr Thr
530 535 540
Thr Asn Thr Gin Ala Lys Ser Asn Ala Pro Lys Leu Lys Ala Met Val 545 550 555 560
Val Val Asn Asn Glu Glu Glu Ala Lys Thr Ala Asn Leu Ala Gin Ser
565 570 575
Ser Gly Thr Thr Thr Gin Ser Pro Asn Ser Thr Val Met Gly Ala Leu
580 585 590
Asn Thr Val Leu Gin Asn Val Ser Asn Phe Gin Gin Ser He Gin Asn
595 600 605
Ala Phe Gin Asn Gin Glu Ser Asn He Gin Ala Trp Ala Asn Ala He
610 615 620
Tyr Asn Thr Asn Gly Ser Gin Ser Gin Glu Met Thr Pro Asn Asn Asn 625 630 635 640
Gin Asp Leu Arg He Gin Leu Arg Ala Asn Phe Tyr Gin Leu He Asn 645 650 655
' Thr He Asn Gin Gin Val Pro Thr Asp Met Asn Ala Leu He Asn Gin 660 665 670
Ser Gin Gin Thr Gin Gin Thr Ser Gly Ser Ala Ser Asn Asn Asn Ala
675 680 685
Cys Ala Ser Gly Met Ser Gly Ser Asn Gly Asn Trp Cys Tyr Gin Gin
690 695 700
Trp Ser Asp Ser Lys Ala Tyr Tyr Ser Gly Leu Gin Ser Ala Leu Gly 705 710 715 720
Tyr Gin Thr Gin Ala Thr Thr Gin Ser Gly Ser Asn Gly Gly Asn Ser
725 730 735
He Thr Tyr Asn Val Gin Gin He Thr Leu Thr Ser Asn Gly Leu Leu
740 745 750
Asn Gin He He Thr Asn Leu Lys Ser Val Asn Gly Gly Asn Gly Ala
755 760 765
Ser Gly Thr Gly Ser Gly Asn Gly Thr Ser Gin He Asn Thr Ala Tyr
770 775 780
Gin Met Leu Thr Asp Ala Ser Asp Gly Lys Leu Gly Thr Tyr Ser Ser 785 790 795 800
Ser Ser Gly Ser Asn Asn Gly Tyr Thr Pro Cys Asn Ser Thr Asn Gly
805 810 815
Ser Asn Lys Thr Ser Gly Asn Asn Cys Tyr Glu Pro Asn Lys Gin Gin
820 825 830
Asn Ala Thr Thr Ala Thr Ala Thr Thr Asp Ser Asn Leu Gin Lys Val
835 840 845
Tyr Asn Asp Ala Gin Lys He Ala Asn He He Ala Ser Ser Gly Asn
850 855 860
Asn Lys Gly Val Glu Asn Gly Leu Lys Gin Phe Phe Glu Ala Leu Lys 865 870 875 880
Asn Asn Ser Ser Ser Leu Ser Asn Leu Cys Gly Asn Gly Ser Ser Gly
885 890 895
Ser Ser Gly Thr Thr Cys Ser Gly Trp Leu He Asn Leu Leu Gly Ala 900 905 910
He Pro Thr Asn Gly Val Ser Asp Thr Asn Asn Leu He Asn Leu Leu
915 920 925
Thr Glu Phe He Lys Thr Ala Gly Phe He Gin Asn Asn Asp Ser Ser
930 935 940
Val Ser Thr Ser Leu Thr Ser Ala Phe Gin Ala He Thr Ser Ala He 945 950 955 960
Ser Gin Gly Phe Gin Ala Leu Gin Asn Asp He Ser Pro Asn Ala He
965 970 975
Leu Thr Leu Leu Gin Glu He Thr Ser Asn Thr Thr Thr He Gin Ser
980 985 990
Phe Ser Gin Thr Leu Arg Gin Leu Leu Gly Asp Lys Thr Phe Phe Met
995 1000 1005
Ala Gin Gin Lys Leu He Asp Ala Met He Asn Ala Arg Asn Gin Val
1010 1015 1020
Gin Asn Ala Gin Asn Gin Ala Asn Asn Tyr Gly Ser Gin Pro Val Leu 025 1030 1035 1040
Ser Gin Tyr Ala Ala Ala Lys Ser Thr Gin His Gly Met Ser Asn Gly
1045 1050 1055
Leu Gly Val Gly Leu Gly Tyr Lys Tyr Phe Phe Gly Lys Ala Arg Lys
1060 1065 1070
Leu Gly Leu Arg His Tyr Phe Phe Phe Asp Tyr Gly Phe Ser Glu He
1075 1080 1085
Gly Leu Ala Asn Gin Ser Val Lys Ala Asn He Phe Ala Tyr Gly Val
1090 1095 1100
Gly Thr Asp Phe Leu Trp Asn Leu Phe Arg Arg Thr Tyr Asn Thr Lys 105 1110 1115 1120
Ala Leu Asn Phe Gly Leu Phe Ala Gly Val Gin Leu Gly Gly Ala Thr
1125 1130 1135
Trp Leu Ser Ser Leu Arg Gin Gin He He Asp Asn Trp Gly Ser Ala
1140 1145 1150
Asn Asp He His Ser Thr Asn Phe Gin Val Ala Leu Asn Phe Gly Val
1155 1160 1165
Arg Thr Asn Phe Ala Glu Phe Lys Arg Phe Ala Lys Lys Phe His Asn
1170 1175 1180
Gin Gly Val He Ser Gin Lys Ser Val Glu Phe Gly He Lys Val Pro 185 1190 1195 1200
Leu He Asn Gin Ala Tyr Leu Asn Ser Ala Gly Ala Asp Val Ser Tyr
1205 1210 1215
Arg Arg Leu Tyr Thr Phe Tyr He Asn Tyr He Met Gly Phe 1220 1225 1230
(2) INFORMATION FOR SEQ ID NO: 151:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1259 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 48...1226 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151:
TAAGGATAAA ATCAAGCGAT TAGCCCGAAT TTTAAGAGAG TATTAAG ATG AAT AAA 56
Met Asn Lys
1
AAA GCG TAT TTT GGG GAG TTT GGA GGG AGT TTT GTT TCG GAG TTG TTA 104 Lys Ala Tyr Phe Gly Glu Phe Gly Gly Ser Phe Val Ser Glu Leu Leu 5 10 15
GTG CCT GCA TTA AGA GAA TTA GAA CAG GCG TTT GAT GCG TGT TTG AAA 152 Val Pro Ala Leu Arg Glu Leu Glu Gin Ala Phe Asp Ala Cys Leu Lys 20 25 30 35
GAT GAA AAA TTC CAA AAA GAA TAT TTT CGT CTT TTA AAG GAT TTT GTG 200 Asp Glu Lys Phe Gin Lys Glu Tyr Phe Arg Leu Leu Lys Asp Phe Val 40 45 50
GGC CGT CCT AGC CCT TTA ACC TTG TGT CAA AAT ATC GTT TCT AAC CCT 248 Gly Arg Pro Ser Pro Leu Thr Leu Cys Gin Asn He Val Ser Asn Pro 55 60 65
AAA GTC AAG CTT TAT TTA AAA CGA GAG GAT TTA ATC CAT GGC GGG GCG 296 Lys Val Lys Leu Tyr Leu Lys Arg Glu Asp Leu He His Gly Gly Ala 70 75 80
CAT AAG ACT AAT CAA GCC TTA GGG CAA GCC CTT TTA GCG AAA AAA ATG 344 His Lys Thr Asn Gin Ala Leu Gly Gin Ala Leu Leu Ala Lys Lys Met 85 90 95
GGT AAA ACA AGG ATC ATC GCT GAA ACA GGC GCC GGT CAG CAT GGC GTG 392 Gly Lys Thr Arg He He Ala Glu Thr Gly Ala Gly Gin His Gly Val 100 105 110 115
GCG ACG GCT ATC GCT TGC GCA TTA TTG AAC TTA AAA TGC GTG GTT TTT 440 Ala Thr Ala He Ala Cys Ala Leu Leu Asn Leu Lys Cys Val Val Phe 120 125 130
ATG GGA TCT AAA GAC ATC AAG CGC CAG GAA ATG AAT GTT TTT AGA ATG 488 Met Gly Ser Lys Asp He Lys Arg Gin Glu Met Asn Val Phe Arg Met 135 140 145
CAC TTA TTA GGC GCT GAA GTG AGA GAG GTT AAT TCA GGG AGC GCG ACG 536 His Leu Leu Gly Ala Glu Val Arg Glu Val Asn Ser Gly Ser Ala Thr 150 155 160
CTT AAA GAC GCT GTG AAT GAA GCC TTA AGA GAT TGG GCG AGC AGT TAC 584 Leu Lys Asp Ala Val Asn Glu Ala Leu Arg Asp Trp Ala Ser Ser Tyr 165 170 175 AAG GAC ACG CAT TAT TTG CTA GGC ACA GCC GCC GGG CCA CAC CCT TAC 632 Lys Asp Thr His Tyr Leu Leu Gly Thr Ala Ala Gly Pro His Pro Tyr 180 185 190 195
CCC ACA ATG GTT AAA ACC TTT CAA AAA ATG ATA GGC GAT GAG GTT AAA 680 Pro Thr Met Val Lys Thr Phe Gin Lys Met He Gly Asp Glu Val Lys 200 205 210
AGC CAG ATT TTA GAA AAA GAA AAC CGC TTG CCT GAT TAT GTG ATC GCA 728 Ser Gin He Leu Glu Lys Glu Asn Arg Leu Pro Asp Tyr Val He Ala 215 220 225
TGC GTT GGA GGG GGG TCT AAC GCT ATA GGG ATA TTC AGC GCA TTT TTA 776 Cys Val Gly Gly Gly Ser Asn Ala He Gly He Phe Ser Ala Phe Leu 230 235 240
AAC GAC AAA GAA GTT AAA CTC ATA GGC GTA GAG CCG GCG GGT TTA GGG 824 Asn Asp Lys Glu Val Lys Leu He Gly Val Glu Pro Ala Gly Leu Gly 245 250 255
CTA GAA ACC AAT AAG CAT GGG GCG ACT TTG AAT AAG GGG CGT GTG GGG 872 Leu Glu Thr Asn Lys His Gly Ala Thr Leu Asn Lys Gly Arg Val Gly 260 265 270 275
ATT TTG CAT GGG AAT AAA ACC TAT CTT TTA CAA GAT GAT GAA GGC CAG 920 He Leu His Gly Asn Lys Thr Tyr Leu Leu Gin Asp Asp Glu Gly Gin 280 285 290
ATT GCA GAA AGC CAT AGC ATT AGC GCC GGG CTT GAT TAT CCA GGA GTG 968 He Ala Glu Ser His Ser He Ser Ala Gly Leu Asp Tyr Pro Gly Val 295 300 305
GGG CCA GAA CAC AGC TAT TTA AAA GAA AGT GGG CGT GCG GTT TAT GAA 1016 Gly Pro Glu His Ser Tyr Leu Lys Glu Ser Gly Arg Ala Val Tyr Glu 310 315 320
AGC GCA AGC GAT GCT GAA GCG CTA GAA GCC TTC AAG TTG TTG TGC CAA 1064 Ser Ala Ser Asp Ala Glu Ala Leu Glu Ala Phe Lys Leu Leu Cys Gin 325 330 335
AAA GAA GGC ATT ATC CCA GCG CTA GAA AGC TCA CAC GCC TTA GCG TAT 1112 Lys Glu Gly He He Pro Ala Leu Glu Ser Ser His Ala Leu Ala Tyr 340 345 350 355
GCC TTA AAG CTC GCT CAA AAA TGC GAA GAA GAA AGC ATC ATC GTA GTG 1160 Ala Leu Lys Leu Ala Gin Lys Cys Glu Glu Glu Ser He He Val Val 360 365 370
AAT TTA AGC GGC AGA GGG GAT AAG GAT TTA AGC ACC GTT TAT AAC GCT 1208 Asn Leu Ser Gly Arg Gly Asp Lys Asp Leu Ser Thr Val Tyr Asn Ala 375 380 385
TTA AAA GGA GGT TTA AAA TGAGGTATCA AAACATGTTT GAAACCTTAA AAA 1259 Leu Lys Gly Gly Leu Lys 390 (2) INFORMATION FOR SEQ ID NO: 152:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152:
Met Asn Lys Lys Ala Tyr Phe Gly Glu Phe Gly Gly Ser Phe Val Ser
1 5 10 15
Glu Leu Leu Val Pro Ala Leu Arg Glu Leu Glu Gin Ala Phe Asp Ala
20 25 30
Cys Leu Lys Asp Glu Lys Phe Gin Lys Glu Tyr Phe Arg Leu Leu Lys
35 40 45
Asp Phe Val Gly Arg Pro Ser Pro Leu Thr Leu Cys Gin Asn He Val
50 55 60
Ser Asn Pro Lys Val Lys Leu Tyr Leu Lys Arg Glu Asp Leu He His 65 70 75 80
Gly Gly Ala His Lys Thr Asn Gin Ala Leu Gly Gin Ala Leu Leu Ala
85 90 95
Lys Lys Met Gly Lys Thr Arg He He Ala Glu Thr Gly Ala Gly Gin
100 105 110
His Gly Val Ala Thr Ala He Ala Cys Ala Leu Leu Asn Leu Lys Cys
115 120 125
Val Val Phe Met Gly Ser Lys Asp He Lys Arg Gin Glu Met Asn Val
130 135 140
Phe Arg Met His Leu Leu Gly Ala Glu Val Arg Glu Val Asn Ser Gly 145 150 155 160
Ser Ala Thr Leu Lys Asp Ala Val Asn Glu Ala Leu Arg Asp Trp Ala
165 170 175
Ser Ser Tyr Lys Asp Thr His Tyr Leu Leu Gly Thr Ala Ala Gly Pro
180 185 190
His Pro Tyr Pro Thr Met Val Lys Thr Phe Gin Lys Met He Gly Asp
195 200 205
Glu Val Lys Ser Gin He Leu Glu Lys Glu Asn Arg Leu Pro Asp Tyr
210 215 220
Val He Ala Cys Val Gly Gly Gly Ser Asn Ala He Gly He Phe Ser 225 230 235 240
Ala Phe Leu Asn Asp Lys Glu Val Lys Leu He Gly Val Glu Pro Ala
245 250 255
Gly Leu Gly Leu Glu Thr Asn Lys His Gly Ala Thr Leu Asn Lys Gly
260 265 270
Arg Val Gly He Leu His Gly Asn Lys Thr Tyr Leu Leu Gin Asp Asp
275 280 285
Glu Gly Gin He Ala Glu Ser His Ser He Ser Ala Gly Leu Asp Tyr
290 295 300
Pro Gly Val Gly Pro Glu His Ser Tyr Leu Lys Glu Ser Gly Arg Ala 305 310 315 320
Val Tyr Glu Ser Ala Ser Asp Ala Glu Ala Leu Glu Ala Phe Lys Leu 325 330 335
Leu Cys Gin Lys Glu Gly He He Pro Ala Leu Glu Ser Ser His Ala
340 345 350
Leu Ala Tyr Ala Leu Lys Leu Ala Gin Lys Cys Glu Glu Glu Ser He
355 360 365
He Val Val Asn Leu Ser Gly Arg Gly Asp Lys Asp Leu Ser Thr Val
370 375 380
Tyr Asn Ala Leu Lys Gly Gly Leu Lys 385 390
(2) INFORMATION FOR SEQ ID NO:153:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 601 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 197...547 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:153:
TGGTGATGCA AAACCAAAAC AAGCGCATCA TGAATTACAT TCCTATTAAG TTGAATTTAA 60
GTGGGGTGAT CCCCCCTATT TTCGCTTCAG CTTTGCTCGT GTTCCCTTCT ACGATTTTGC 120
AGCAAGCCAC AAGCAACAAA ACCTTGCAAG CGGTTGCGNA TTTTTTAAGC CCGCAAGGTA 180
TGCGTATAAT ATTTTG ATG TTC TTG CTC ATC ATC TTT TTT GCT TAC TTT TAT 232 Met Phe Leu Leu He He Phe Phe Ala Tyr Phe Tyr 1 5 10
TCT TCT ATT GTG TTC AAT TCT AAG GAT ATT GCG GAT AAT TTG AGG CGT 280 Ser Ser He Val Phe Asn Ser Lys Asp He Ala Asp Asn Leu Arg Arg 15 20 25
AAT GGC GGG TAT ATT CCA GGG CTT AGG CCT GGA GAG GGG ACT TCA TCG 328 Asn Gly Gly Tyr He Pro Gly Leu Arg Pro Gly Glu Gly Thr Ser Ser 30 35 40
TTT TTA AAT TCT GTA GCG AGT AAG CTC ACT TTG TGG GGT TCA TTG TAT 376 Phe Leu Asn Ser Val Ala Ser Lys Leu Thr Leu Trp Gly Ser Leu Tyr 45 50 55 60
TTA GCG CTC ATT TCT ACC GTG CCT TGG ATT TTG GTT AAG GCT ATG GGC 424 Leu Ala Leu He Ser Thr Val Pro Trp He Leu Val Lys Ala Met Gly 65 70 75
GTG CCT TTT TAC TTT GGA GGC ACA GCG GTG CTG ATT GTG GTT CAA GTC 472 Val Pro Phe Tyr Phe Gly Gly Thr Ala Val Leu He Val Val Gin Val 80 85 90
GCT ATT GAC ACC ATG AAA AAG ATT GAA GCG CAA ATT TAT ATG AGC AAG 520 Ala He Asp Thr Met Lys Lys He Glu Ala Gin He Tyr Met Ser Lys 95 100 105
TAT AAA ACT TTA AGC GCG GTA GGC TTT TAATGGCAAT CTCTATTAAA AGCCCAA 574 Tyr Lys Thr Leu Ser Ala Val Gly Phe 110 115
AAGAAATCAA AGCTCTAAGA AAAGCCG 601
(2) INFORMATION FOR SEQ ID NO: 154:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 117 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 154 :
Met Phe Leu Leu He He Phe Phe Ala Tyr Phe Tyr Ser Ser He Val
1 5 10 15
Phe Asn Ser Lys Asp He Ala Asp Asn Leu Arg Arg Asn Gly Gly Tyr
20 25 30
He Pro Gly Leu Arg Pro Gly Glu Gly Thr Ser Ser Phe Leu Asn Ser
35 40 45
Val Ala Ser Lys Leu Thr Leu Trp Gly Ser Leu Tyr Leu Ala Leu He
50 55 60
Ser Thr Val Pro Trp He Leu Val Lys Ala Met Gly Val Pro Phe Tyr 65 70 75 80
Phe Gly Gly Thr Ala Val Leu He Val Val Gin Val Ala He Asp Thr
85 90 95
Met Lys Lys He Glu Ala Gin He Tyr Met Ser Lys Tyr Lys Thr Leu
100 105 110
Ser Ala Val Gly Phe 115
(2) INFORMATION FOR SEQ ID NO: 155:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 725 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 64...675 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155:
GAAAACAGGA TAACGCATGA AACATGTGAG TAGGGATTTT GATACCGGTT GGGTTGCGTA 60 TCA ATG ACT CTA GGC ATT GAT GAA GCG GGT AGG GGG TGT TTG GCC GGT 108 Met Thr Leu Gly He Asp Glu Ala Gly Arg Gly Cys Leu Ala Gly 1 5 10 15
TCG CTT TTT GTG GCT GGG GTG GCG TGT AAT GAA AAA ACA GCC TTA GAA 156 Ser Leu Phe Val Ala Gly Val Ala Cys Asn Glu Lys Thr Ala Leu Glu 20 25 30
TTT CTA AAA ATG GGT TTA AAA GAC AGC AAG AAG CTC AGC CTA AAA AAG 204 Phe Leu Lys Met Gly Leu Lys Asp Ser Lys Lys Leu Ser Leu Lys Lys 35 40 45
CGC TTT TTC TTA GAA TAT AAG ATC AAA ACG CAT GGT GAG GTG GGG TTT 252 Arg Phe Phe Leu Glu Tyr Lys He Lys Thr His Gly Glu Val Gly Phe 50 55 60
TTC GTG GTT AAA AAA AGC GCA AAT GAA ATT GAT AGC TTG GGC TTA GGG 300 Phe Val Val Lys Lys Ser Ala Asn Glu He Asp Ser Leu Gly Leu Gly 65 70 75
GCG TGT TTG AAA CTC GCT GTG CAA GAA ATT TTA GAA AAT GGT TGC TCT 348 Ala Cys Leu Lys Leu Ala Val Gin Glu He Leu Glu Asn Gly Cys Ser 80 85 90 95
TTA GTT GAT GAA ATA AAA ATA GAC GGC AAC ACG GCG TTT GGC TTG AAC 396 Leu Val Asp Glu He Lys He Asp Gly Asn Thr Ala Phe Gly Leu Asn 100 105 110
AAA CGC TAC CCC CAT ATA CAA ACC ATC ATC AAG GGC GAT GAA ACA ATC 444 Lys Arg Tyr Pro His He Gin Thr He He Lys Gly Asp Glu Thr He 115 120 125
GCT CAA ATC GCT ATG GCG TCT GTT TTG GCG AAA GCT TTT AAG GAC AGA 492 Ala Gin He Ala Met Ala Ser Val Leu Ala Lys Ala Phe Lys Asp Arg 130 135 140
GAA ATG CTA GAG TTG CAC GCT TTG TTT AAG GAA TAC GGC TGG GAT AAG 540 Glu Met Leu Glu Leu His Ala Leu Phe Lys Glu Tyr Gly Trp Asp Lys 145 150 155
AAT TGC GGG TAT GGG ACT AAA CAA CAT ATA GAA GCG ATC ATT AAG CTA 588 Asn Cys Gly Tyr Gly Thr Lys Gin His He Glu Ala He He Lys Leu 160 165 170 175
GGG GCT ACG CCT TTT CAT CGG CAT AGC TTC ACG CTT AAA AAC CGC ATC 636 Gly Ala Thr Pro Phe His Arg His Ser Phe Thr Leu Lys Asn Arg He 180 185 190 TTA AAT CCC AAA CTC TTA GAG GTG GAA CAA CGC CTT ATT TAAAAGGGCG CT 687 Leu Asn Pro Lys Leu Leu Glu Val Glu Gin Arg Leu He 195 200
GAGATGGGTA GCGCTCGCTG AAGAAAGGTC GATGCGTT 725
(2) INFORMATION FOR SEQ ID NO: 156:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 204 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156:
Met Thr Leu Gly He Asp Glu Ala Gly Arg Gly Cys Leu Ala Gly Ser
1 5 10 15
Leu Phe Val Ala Gly Val Ala Cys Asn Glu Lys Thr Ala Leu Glu Phe
20 25 30
Leu Lys Met Gly Leu Lys Asp Ser Lys Lys Leu Ser Leu Lys Lys Arg
35 40 45
Phe Phe Leu Glu Tyr Lys He Lys Thr His Gly Glu Val Gly Phe Phe
50 55 60
Val Val Lys Lys Ser Ala Asn Glu He Asp Ser Leu Gly Leu Gly Ala 65 70 75 80
Cys Leu Lys Leu Ala Val Gin Glu He Leu Glu Asn Gly Cys Ser Leu
85 90 95
Val Asp Glu He Lys He Asp Gly Asn Thr Ala Phe Gly Leu Asn Lys
100 105 110
Arg Tyr Pro His He Gin Thr He He Lys Gly Asp Glu Thr He Ala
115 120 125
Gin He Ala Met Ala Ser Val Leu Ala Lys Ala Phe Lys Asp Arg Glu
130 135 140
Met Leu Glu Leu His Ala Leu Phe Lys Glu Tyr Gly Trp Asp Lys Asn 145 150 155 160
Cys Gly Tyr Gly Thr Lys Gin His He Glu Ala He He Lys Leu Gly
165 170 175
Ala Thr Pro Phe His Arg His Ser Phe Thr Leu Lys Asn Arg He Leu
180 185 190
Asn Pro Lys Leu Leu Glu Val Glu Gin Arg Leu He 195 200
(2) INFORMATION FOR SEQ ID NO: 157:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2821 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA ( ix) FEATURE :
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 58...2769 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157:
GATTAATCAG TGGAAGAATA CAAAGACACC CTAAACTTAA ACACAACCAC CTTTTCT ATG 60
Met 1
AAG GGG AAT TTG AGC GTT AAT GAG CCT AAA ACT TAC GCC AAA TGG CAA 108 Lys Gly Asn Leu Ser Val Asn Glu Pro Lys Thr Tyr Ala Lys Trp Gin 5 10 15
GAG CAA CAA GCG TTT AAA CGC ATG CAA GCT AGG AAA GAC AAC CAT GGG 156 Glu Gin Gin Ala Phe Lys Arg Met Gin Ala Arg Lys Asp Asn His Gly 20 25 30
GAT TTC ACT TTG CAT GAC GGG CCG CCT TAT GCG AAC GGG CAT TTG CAT 204 Asp Phe Thr Leu His Asp Gly Pro Pro Tyr Ala Asn Gly His Leu His 35 40 45
TTG GGG CAT GCC TTA AAT AAA ATT TTA AAA GAC ATT GTC GTT AAA AGA 252 Leu Gly His Ala Leu Asn Lys He Leu Lys Asp He Val Val Lys Arg 50 55 60 65
GAA TAT TTT AAG GGG AAG AAA ATC TAT TAC ACG CCC GGT TGG GAT TGC 300 Glu Tyr Phe Lys Gly Lys Lys He Tyr Tyr Thr Pro Gly Trp Asp Cys 70 75 80
CAT GGT TTG CCC ATT GAG CAG CAA ATT TTA GAG CGA TTA GAA AAA GAA 348 His Gly Leu Pro He Glu Gin Gin He Leu Glu Arg Leu Glu Lys Glu 85 90 95
AAA ACA AGC CTA GAA AAC CCC ACG CTG TTT AGA GAA AAG TGC CGA GAT 396 Lys Thr Ser Leu Glu Asn Pro Thr Leu Phe Arg Glu Lys Cys Arg Asp 100 105 110
CAT GCG AAG AAA TTT TTA GAA ATC CAA AAG AAT GAA TTT TTG CAA TTG 444 His Ala Lys Lys Phe Leu Glu He Gin Lys Asn Glu Phe Leu Gin Leu 115 120 125
GGT GTT TTG GGG GAT TTT GAA GAT CCT TAT AAA ACC ATG GAT TTT AAA 492 Gly Val Leu Gly Asp Phe Glu Asp Pro Tyr Lys Thr Met Asp Phe Lys 130 135 140 145
TTT GAA GCG AGC ATT TAT AGA GCC TTA GTG GAA GTG GCT AAA AAA GGG 540 Phe Glu Ala Ser He Tyr Arg Ala Leu Val Glu Val Ala Lys Lys Gly 150 155 160
CTT TTG AAA GAG CGC CAC AAG CCT ATT TAT TGG AGT TAT GCA TGC GAG 588 Leu Leu Lys Glu Arg His Lys Pro He Tyr Trp Ser Tyr Ala Cys Glu 165 170 175
AGC GCT TTA GCG GAA GCT GAA GTG GAA TAC AAA ATG AAA AAA TCG CCC 636 Ser Ala Leu Ala Glu Ala Glu Val Glu Tyr Lys Met Lys Lys Ser Pro 180 185 190
TCC ATT TTC GTG GCG TTT GGT TTG AAA AAG GAG AGT TTA GAA AAA TTA 684 Ser He Phe Val Ala Phe Gly Leu Lys Lys Glu Ser Leu Glu Lys Leu 195 200 205
AAA GTC AAA AAA GCG AGC TTG GTG ATT TGG ACG ACC ACG CCT TGG ACT 732 Lys Val Lys Lys Ala Ser Leu Val He Trp Thr Thr Thr Pro Trp Thr 210 215 220 225
TTG TAT GCG AAT GTA GCG ATC GCT TTG AAA AAA GAC GCT GTT TAT GCG 780 Leu Tyr Ala Asn Val Ala He Ala Leu Lys Lys Asp Ala Val Tyr Ala 230 235 240
CTC ACC CAA AAA GGC TAT TTA GTC GCT AAA GCC TTG CAT GAA AAA TTA 828 Leu Thr Gin Lys Gly Tyr Leu Val Ala Lys Ala Leu His Glu Lys Leu 245 250 255
GCC GCT TTA GGG GTG GTG GAT AAT GAG ATC ACA CAT GAA TTC AAT TCC 876 Ala Ala Leu Gly Val Val Asp Asn Glu He Thr His Glu Phe Asn Ser 260 265 270
AAT GAT TTA GAA TAT TTA GTG GCT ACA AAC CCG CTC AAT CAA AGG GAT 924 Asn Asp Leu Glu Tyr Leu Val Ala Thr Asn Pro Leu Asn Gin Arg Asp 275 280 285
TCG CTG GTG GCT TTA GGA GAG CAT GTC GGT TTA GAA GAT GGC ACA GGA 972 Ser Leu Val Ala Leu Gly Glu His Val Gly Leu Glu Asp Gly Thr Gly 290 295 300 305
GCC GTG CAT ACC GCA CCT GGG CAT GGT GAA GAG GAC TAT TAT TTA GGC 1020 Ala Val His Thr Ala Pro Gly His Gly Glu Glu Asp Tyr Tyr Leu Gly 310 315 320
TTA AGA TAT AAT TTA GAA GTG TTA ATG TCT GTA GAT GAG AAA GGT TGC 1068 Leu Arg Tyr Asn Leu Glu Val Leu Met Ser Val Asp Glu Lys Gly Cys 325 330 335
TAT GAT GAG GGC ATT ATC CAT AAC CAA CTA TTA GAT GAA AGC TAT CTG 1116 Tyr Asp Glu Gly He He His Asn Gin Leu Leu Asp Glu Ser Tyr Leu 340 345 350
GGC GAG CAT GTT TTT AAG GCT CAA AAA CGC ATT ATA GAG CAA TTG GGC 1164 Gly Glu His Val Phe Lys Ala Gin Lys Arg He He Glu Gin Leu Gly 355 360 365
GAT TCT TTA TTG CTA GAG CAA GAG ATT GAG CAT TCT TAT CCG CAT TGC 1212 Asp Ser Leu Leu Leu Glu Gin Glu He Glu His Ser Tyr Pro His Cys 370 375 380 385 TGG AGG ACG CAC AAG CCT GTG ATT TAC AGA GCG ACT ACG CAA TGG TTT 1260 Trp Arg Thr His Lys Pro Val He Tyr Arg Ala Thr Thr Gin Trp Phe 390 395 400
ATT TTA ATG GAT GAG CCT TTT ATC CAA AAT GAT GGC TCT CAA AAA ACC 1308 He Leu Met Asp Glu Pro Phe He Gin Asn Asp Gly Ser Gin Lys Thr 405 410 415
TTA AGA GAA GTG GCT TTA GAT GCG ATT GAA AAG GTG GAA TTT GTG CCA 1356 Leu Arg Glu Val Ala Leu Asp Ala He Glu Lys Val Glu Phe Val Pro 420 425 430
AGC AGC GGG AAA AAC CGC CTA AAA ACC ATG ATA GAA AAC CGC CCT GAT 1404 Ser Ser Gly Lys Asn Arg Leu Lys Thr Met He Glu Asn Arg Pro Asp 435 440 445
TGG TGC TTG AGC CGG CAA AGA AAA TGG GGC GTG CCA CTG GCC TTT TTC 1452 Trp Cys Leu Ser Arg Gin Arg Lys Trp Gly Val Pro Leu Ala Phe Phe 450 455 460 465
ATA GAC AAA CGC ACG AAT AAG CCT TGT TTT GAA AGC GAA GTT TTA GAG 1500 He Asp Lys Arg Thr Asn Lys Pro Cys Phe Glu Ser Glu Val Leu Glu 470 475 480
CAT GTG GCC AAT CTT TTT GAG AAA AAA GGC TGT GAT GTG TGG TGG GAG 1548 His Val Ala Asn Leu Phe Glu Lys Lys Gly Cys Asp Val Trp Trp Glu 485 490 495
TAT AGC GTG AAA GAT TTA TTG CCC CCT AGC TAT CAA GAG GAC GCC AAG 1596 Tyr Ser Val Lys Asp Leu Leu Pro Pro Ser Tyr Gin Glu Asp Ala Lys 500 505 510
CAT TAT GAG AAA ATC ATG CAC ATT TTA GAC GTG TGG TTT GAT AGT GGT 1644 His Tyr Glu Lys He Met His He Leu Asp Val Trp Phe Asp Ser Gly 515 520 525
AGC ACC TTT AAG GCG GTT TTA GAA GAC TAT CAT GGA GAA AAA GGG CAA 1692 Ser Thr Phe Lys Ala Val Leu Glu Asp Tyr His Gly Glu Lys Gly Gin 530 535 540 545
AGC CCT AGC GAT GTG ATC TTA GAA GGG AGC GAT CAG CAT AGG GGG TGG 1740 Ser Pro Ser Asp Val He Leu Glu Gly Ser Asp Gin His Arg Gly Trp 550 555 560
TTT CAA AGC TCG CTT CTA ATC GGT TGT GTT TTA AAC AAC CAA GCC CCT 1788 Phe Gin Ser Ser Leu Leu He Gly Cys Val Leu Asn Asn Gin Ala Pro 565 570 575
TTT AAA AAG GTC ATT ACG CAT GGC TTT ATC GTA GAT GAA AAG GGC GAA 1836 Phe Lys Lys Val He Thr His Gly Phe He Val Asp Glu Lys Gly Glu 580 585 590
AAA ATG AGT AAA TCT AAG GGC AAT GTG GTG TCT TTG GAC AAG CTG CTC 1884 Lys Met Ser Lys Ser Lys Gly Asn Val Val Ser Leu Asp Lys Leu Leu 595 600 605 AAA ACG CAT GGG AGC GAT GTG GTG CGT TTG TGG GTA GCG TTT AAT GAC 1932 Lys Thr His Gly Ser Asp Val Val Arg Leu Trp Val Ala Phe Asn Asp 610 615 620 625
TAT CAA AAC GAT TTG AGA GTC TCT CAA ACC TTT TTC ACT CAA ACA GAA 1980 Tyr Gin Asn Asp Leu Arg Val Ser Gin Thr Phe Phe Thr Gin Thr Glu 630 635 640
CAA CAT TAT AAA AAA TTC CGC AAC ACC CTG AAA TTC TTA CTC GCT AAT 2028 Gin His Tyr Lys Lys Phe Arg Asn Thr Leu Lys Phe Leu Leu Ala Asn 645 650 655
TTT AGC GAT ATG GAT CTC AAG AAT TTA GAA CGC CCC CAT AAC TTC AGC 2076 Phe Ser Asp Met Asp Leu Lys Asn Leu Glu Arg Pro His Asn Phe Ser 660 665 670
CCT TTA GAT CAT TTT ATG TTA GAG ACT TTA GAA ACC ATA AGC GCT GGA 2124 Pro Leu Asp His Phe Met Leu Glu Thr Leu Glu Thr He Ser Ala Gly 675 680 685
GTC AAT AGC GCG TTT GAA GAG CAT GAT TTT GTG AAA GGC TTG AAT ATT 2172 Val Asn Ser Ala Phe Glu Glu His Asp Phe Val Lys Gly Leu Asn He 690 695 700 705
TTA ATG GCG TTT GTT ACC AAT GAA TTG AGC GGG ATT TAT TTA GAC GCT 2220 Leu Met Ala Phe Val Thr Asn Glu Leu Ser Gly He Tyr Leu Asp Ala 710 715 720
TGC AAG GAT AGC TTG TAT TGC GAT AGC AAA AAC AAT GAA AAA CGC CAA 2268 Cys Lys Asp Ser Leu Tyr Cys Asp Ser Lys Asn Asn Glu Lys Arg Gin 725 730 735
GCC ATT CAA ATG GTT TTA CTC GCT ACA GCT AGT AAG TTG TGC TAC TTT 2316 Ala He Gin Met Val Leu Leu Ala Thr Ala Ser Lys Leu Cys Tyr Phe 740 745 750
TTA GCC CCG ATT TTA ACG CAC ACG ATT GAA GAA GTT TTA GAG CAT AGC 2364 Leu Ala Pro He Leu Thr His Thr He Glu Glu Val Leu Glu His Ser 755 760 765
CAA GCG CTT CGC ATT TTT TTA CAA GCC AAA GAT GTG TTT GAT TTA AAA 2412 Gin Ala Leu Arg He Phe Leu Gin Ala Lys Asp Val Phe Asp Leu Lys 770 775 780 785
GAC ATT AGC GTT TCA GAA AAA CTC CAC CTC AAA GAG TTT AAA AAA CCA 2460 Asp He Ser Val Ser Glu Lys Leu His Leu Lys Glu Phe Lys Lys Pro 790 795 800
GAA AAT TTT GAA GCC GTT TTA GCC TTG CGT TCT GCC TTT AAT GAA GAG 2508 Glu Asn Phe Glu Ala Val Leu Ala Leu Arg Ser Ala Phe Asn Glu Glu 805 810 815
TTA GAC CGA TTG AAA AAA GAA GGC GTC ATT AAA AAT TCG TTA GAG TGC 2556 Leu Asp Arg Leu Lys Lys Glu Gly Val He Lys Asn Ser Leu Glu Cys 820 825 830 GCT ATT GAA GTA AAA GAA AAA GCG TTG GAT GAA AAT TTA GTA GAA GAG 2604 Ala He Glu Val Lys Glu Lys Ala Leu Asp Glu Asn Leu Val Glu Glu 835 840 845
TTG CTG ATG GTA AGC TTT GTG GGG ATT GCA AAA GAA AAA TTG AGT GAA 2652 Leu Leu Met Val Ser Phe Val Gly He Ala Lys Glu Lys Leu Ser Glu 850 855 860 865
ACG CCA GCA TTC ACG CTC TTT AAA GCC CCC TTT TAT AAA TGC CCC AGG 2700 Thr Pro Ala Phe Thr Leu Phe Lys Ala Pro Phe Tyr Lys Cys Pro Arg 870 875 880
TGT TGG CGT TTT AAA AGC GAG CTA GAA AAC ACC CCT TGC AAG CGT TGC 2748 Cys Trp Arg Phe Lys Ser Glu Leu Glu Asn Thr Pro Cys Lys Arg Cys 885 890 895
GAA CAG GTT TTA AAA GAG CGA TGATAAAAGG ATAGGGCTTT TGAAAACTTT ACAA 2803 Glu Gin Val Leu Lys Glu Arg 900
ACCCATAGAG TTTTACAA 2821
(2) INFORMATION FOR SEQ ID NO: 158:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 904 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:158:
Met Lys Gly Asn Leu Ser Val Asn Glu Pro Lys Thr Tyr Ala Lys Trp
1 5 10 15
Gin Glu Gin Gin Ala Phe Lys Arg Met Gin Ala Arg Lys Asp Asn His
20 25 30
Gly Asp Phe Thr Leu His Asp Gly Pro Pro Tyr Ala Asn Gly His Leu
35 40 45
His Leu Gly His Ala Leu Asn Lys He Leu Lys Asp He Val Val Lys
50 55 60
Arg Glu Tyr Phe Lys Gly Lys Lys He Tyr Tyr Thr Pro Gly Trp Asp 65 70 75 80
Cys His Gly Leu Pro He Glu Gin Gin He Leu Glu Arg Leu Glu Lys
85 90 95
Glu Lys Thr Ser Leu Glu Asn Pro Thr Leu Phe Arg Glu Lys Cys Arg
100 105 110
Asp His Ala Lys Lys Phe Leu Glu He Gin Lys Asn Glu Phe Leu Gin
115 120 125
Leu Gly Val Leu Gly Asp Phe Glu Asp Pro Tyr Lys Thr Met Asp Phe
130 135 140
Lys Phe Glu Ala Ser He Tyr Arg Ala Leu Val Glu Val Ala Lys Lys 145 150 155 160 Gly Leu Leu Lys Glu Arg His Lys Pro He Tyr Trp Ser Tyr Ala Cys
165 170 175
Glu Ser Ala Leu Ala Glu Ala Glu Val Glu Tyr Lys Met Lys Lys Ser
180 185 190
Pro Ser He Phe Val Ala Phe Gly Leu Lys Lys Glu Ser Leu Glu Lys
195 200 205
Leu Lys Val Lys Lys Ala Ser Leu Val He Trp Thr Thr Thr Pro Trp
210 215 220
Thr Leu Tyr Ala Asn Val Ala He Ala Leu Lys Lys Asp Ala Val Tyr 225 230 235 240
Ala Leu Thr Gin Lys Gly Tyr Leu Val Ala Lys Ala Leu His Glu Lys
245 250 255
Leu Ala Ala Leu Gly Val Val Asp Asn Glu He Thr His Glu Phe Asn
260 265 270
Ser Asn Asp Leu Glu Tyr Leu Val Ala Thr Asn Pro Leu Asn Gin Arg
275 280 285
Asp Ser Leu Val Ala Leu Gly Glu His Val Gly Leu Glu Asp Gly Thr
290 295 300
Gly Ala Val His Thr Ala Pro Gly His Gly Glu Glu Asp Tyr Tyr Leu 305 310 315 320
Gly Leu Arg Tyr Asn Leu Glu Val Leu Met Ser Val Asp Glu Lys Gly
325 330 335
Cys Tyr Asp Glu Gly He He His Asn Gin Leu Leu Asp Glu Ser Tyr
340 345 350
Leu Gly Glu His Val Phe Lys Ala Gin Lys Arg He He Glu Gin Leu
355 360 365
Gly Asp Ser Leu Leu Leu Glu Gin Glu He Glu His Ser Tyr Pro His
370 375 380
Cys Trp Arg Thr His Lys Pro Val He Tyr Arg Ala Thr Thr Gin Trp 385 390 395 400
Phe He Leu Met Asp Glu Pro Phe He Gin Asn Asp Gly Ser Gin Lys
405 410 415
Thr Leu Arg Glu Val Ala Leu Asp Ala He Glu Lys Val Glu Phe Val
420 425 430
Pro Ser Ser Gly Lys Asn Arg Leu Lys Thr Met He Glu Asn Arg Pro
435 440 445
Asp Trp Cys Leu Ser Arg Gin Arg Lys Trp Gly Val Pro Leu Ala Phe
450 455 460
Phe He Asp Lys Arg Thr Asn Lys Pro Cys Phe Glu Ser Glu Val Leu 465 470 475 480
Glu His Val Ala Asn Leu Phe Glu Lys Lys Gly Cys Asp Val Trp Trp
485 490 495
Glu Tyr Ser Val Lys Asp Leu Leu Pro Pro Ser Tyr Gin Glu Asp Ala
500 505 510
Lys His Tyr Glu Lys He Met His He Leu Asp Val Trp Phe Asp Ser
515 520 525
Gly Ser Thr Phe Lys Ala Val Leu Glu Asp Tyr His Gly Glu Lys Gly
530 535 540
Gin Ser Pro Ser Asp Val He Leu Glu Gly Ser Asp Gin His Arg Gly 545 550 555 560
Trp Phe Gin Ser Ser Leu Leu He Gly Cys Val Leu Asn Asn Gin Ala
565 570 575
Pro Phe Lys Lys Val He Thr His Gly Phe He Val Asp Glu Lys Gly
580 585 590
Glu Lys Met Ser Lys Ser Lys Gly Asn Val Val Ser Leu Asp Lys Leu 595 600 605
Leu Lys Thr His Gly Ser Asp Val Val Arg Leu Trp Val Ala Phe Asn
610 615 620
Asp Tyr Gin Asn Asp Leu Arg Val Ser Gin Thr Phe Phe Thr Gin Thr 625 630 635 640
Glu Gin His Tyr Lys Lys Phe Arg Asn Thr Leu Lys Phe Leu Leu Ala
645 650 655
Asn Phe Ser Asp Met Asp Leu Lys Asn Leu Glu Arg Pro His Asn Phe
660 665 670
Ser Pro Leu Asp His Phe Met Leu Glu Thr Leu Glu Thr He Ser Ala
675 680 685
Gly Val Asn Ser Ala Phe Glu Glu His Asp Phe Val Lys Gly Leu Asn
690 695 700
He Leu Met Ala Phe Val Thr Asn Glu Leu Ser Gly He Tyr Leu Asp 705 710 715 720
Ala Cys Lys Asp Ser Leu Tyr Cys Asp Ser Lys Asn Asn Glu Lys Arg
725 730 735
Gin Ala He Gin Met Val Leu Leu Ala Thr Ala Ser Lys Leu Cys Tyr
740 745 750
Phe Leu Ala Pro He Leu Thr His Thr He Glu Glu Val Leu Glu His
755 760 765
Ser Gin Ala Leu Arg He Phe Leu Gin Ala Lys Asp Val Phe Asp Leu
770 775 780
Lys Asp He Ser Val Ser Glu Lys Leu His Leu Lys Glu Phe Lys Lys 785 790 795 800
Pro Glu Asn Phe Glu Ala Val Leu Ala Leu Arg Ser Ala Phe Asn Glu
805 810 815
Glu Leu Asp Arg Leu Lys Lys Glu Gly Val He Lys Asn Ser Leu Glu
820 825 830
Cys Ala He Glu Val Lys Glu Lys Ala Leu Asp Glu Asn Leu Val Glu
835 840 845
Glu Leu Leu Met Val Ser Phe Val Gly He Ala Lys Glu Lys Leu Ser
850 855 860
Glu Thr Pro Ala Phe Thr Leu Phe Lys Ala Pro Phe Tyr Lys Cys Pro 865 870 875 880
Arg Cys Trp Arg Phe Lys Ser Glu Leu Glu Asn Thr Pro Cys Lys Arg
885 890 895
Cys Glu Gin Val Leu Lys Glu Arg 900
(2) INFORMATION FOR SEQ ID NO: 159:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 339 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 70...288 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159:
TGTAGAATGA AATCCTAGCC AGTGAGCTAG AATTTAAATT TTTAATCAAA GGAGTCATCA 60 TGGCACACC ATG AAG AAC AAC ACG GCG GGC ACC ACC ACC ACC ATC ACC ACA 111 Met Lys Asn Asn Thr Ala Gly Thr Thr Thr Thr He Thr Thr 1 5 10
CAC ACC ACC ACC ACT ATC ATG GCG GTG AAC ACC ACC ATC ACC ACC ACA 159 His Thr Thr Thr Thr He Met Ala Val Asn Thr Thr He Thr Thr Thr 15 20 25 30
GCT CTC ATC ATG AAG AAG GTT GTT GCA GCA CTA GCG ACA GTC ATC ATC 207 Ala Leu He Met Lys Lys Val Val Ala Ala Leu Ala Thr Val He He 35 40 45
AAG AAG AAG GTT GCT GCC ACG GGC ATC ACG AGT AAT ATC GGT GTG GCT 255 Lys Lys Lys Val Ala Ala Thr Gly He Thr Ser Asn He Gly Val Ala 50 55 60
AGG GGC AAC TTG ACT AGG GTT GTC TCT GGC TTT TGACTTTAAA ATACAATCAT 308 Arg Gly Asn Leu Thr Arg Val Val Ser Gly Phe 65 70
TCCATTCTAA CCCATTCTGA TCAAACCCGT T 339
(2) INFORMATION FOR SEQ ID NO: 160:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 73 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160:
Met Lys Asn Asn Thr Ala Gly Thr Thr Thr Thr He Thr Thr His Thr
1 5 10 15
Thr Thr Thr He Met Ala Val Asn Thr Thr He Thr Thr Thr Ala Leu
20 25 30
He Met Lys Lys Val Val Ala Ala Leu Ala Thr Val He He Lys Lys
35 40 45
Lys Val Ala Ala Thr Gly He Thr Ser Asn He Gly Val Ala Arg Gly
50 55 60
Asn Leu Thr Arg Val Val Ser Gly Phe 65 70
(2) INFORMATION FOR SEQ ID NO: 161:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 787 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...734 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161:
GTTTTCTACT TATGATTTTG TGGAAGAATA TTGCAAATTA AAGGAAATGC ATG CTT 56
Met Leu 1
GAA AAA GTG TTT CAA GAA ATT ACC AAT AAA AGA AAG TTT TTT GCA AGT 104 Glu Lys Val Phe Gin Glu He Thr Asn Lys Arg Lys Phe Phe Ala Ser 5 10 15
TCT AGC ACA GGG GAG CAG TTT GAA AAC CAA TTT AGG AAT GAA TTA AAA 152 Ser Ser Thr Gly Glu Gin Phe Glu Asn Gin Phe Arg Asn Glu Leu Lys 20 25 30
AAA CAC TTT AGC GAA ATC AAT GGC GAT TTA ACA GAA GAA TTA AGC CAT 200 Lys His Phe Ser Glu He Asn Gly Asp Leu Thr Glu Glu Leu Ser His 35 40 45 50
ATT GAA GAA AAG CCT AAT AAA GAA ATC AAA ACC ACT TTT AAC CAA CTC 248 He Glu Glu Lys Pro Asn Lys Glu He Lys Thr Thr Phe Asn Gin Leu 55 60 65
AAA AAG CAA GTT TTA GAA AAA AAT CAC CCG CAC ACC CTT AAA AAC CCT 296 Lys Lys Gin Val Leu Glu Lys Asn His Pro His Thr Leu Lys Asn Pro 70 75 80
TTT TCA AAC CTT ACA AGC CAT TTT TTA TAC CAG CCT TTT GGC TCA CAA 344 Phe Ser Asn Leu Thr Ser His Phe Leu Tyr Gin Pro Phe Gly Ser Gin 85 90 95
AAT TAC CCT GAT TTT TTG GTT TTT ATT TTT GAC TAT GTG GTG GGG ATT 392 Asn Tyr Pro Asp Phe Leu Val Phe He Phe Asp Tyr Val Val Gly He 100 105 110
GAA ATC AAG TTT TCT AAA AAC GAT AAG GGT GAA AAA AAT CTT CAA ACA 440 Glu He Lys Phe Ser Lys Asn Asp Lys Gly Glu Lys Asn Leu Gin Thr 115 120 125 130
TCT CGC CCC ATG TGG AAT TCA AAC CTG CCT AAA CCC AAT GCG ATT TAT 488 Ser Arg Pro Met Trp Asn Ser Asn Leu Pro Lys Pro Asn Ala He Tyr 135 140 145 GTG TAT GGA GTC GCT AAT GCA AAC ATC ACT TTT TTT AAA GGC TCA GAT 536
Val Tyr Gly Val Ala Asn Ala Asn He Thr Phe Phe Lys Gly Ser Asp 150 155 160
ATT TTG AGT TAT GAA ACC AGA GAG GTC TTG CTC AAG TAT TTT GAT ATT 584
He Leu Ser Tyr Glu Thr Arg Glu Val Leu Leu Lys Tyr Phe Asp He 165 170 175
TTA GAT AAA GAT GAA AGA AGT TTG AAA AAC GCC TTA AAG GAT TTA GAA 632
Leu Asp Lys Asp Glu Arg Ser Leu Lys Asn Ala Leu Lys Asp Leu Glu 180 185 190
AAC CCT TTT GGG TTT GCC CCC TAC ATC AGA AAA GCT TAT GAG CAT AAA 680
Asn Pro Phe Gly Phe Ala Pro Tyr He Arg Lys Ala Tyr Glu His Lys
195 200 205 210
AGG AAT TTT CTA ACC ACC ACC AGA TTG AAA GCT TCT TTT CGC CCA ACC 728
Arg Asn Phe Leu Thr Thr Thr Arg Leu Lys Ala Ser Phe Arg Pro Thr
215 220 225
ACA TTT TAAGAGAGCG GAATGTCTTG GAATTTTTGA AAACGCTCAC TCATTAGCGT AT 786 Thr Phe
787
(2) INFORMATION FOR SEQ ID NO : 162 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 228 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162:
Met Leu Glu Lys Val Phe Gin Glu He Thr Asn Lys Arg Lys Phe Phe
1 5 10 15
Ala Ser Ser Ser Thr Gly Glu Gin Phe Glu Asn Gin Phe Arg Asn Glu
20 25 30
Leu Lys Lys His Phe Ser Glu He Asn Gly Asp Leu Thr Glu Glu Leu
35 40 45
Ser His He Glu Glu Lys Pro Asn Lys Glu He Lys Thr Thr Phe Asn
50 55 60
Gin Leu Lys Lys Gin Val Leu Glu Lys Asn His Pro His Thr Leu Lys 65 70 75 80
Asn Pro Phe Ser Asn Leu Thr Ser His Phe Leu Tyr Gin Pro Phe Gly
85 90 95
Ser Gin Asn Tyr Pro Asp Phe Leu Val Phe He Phe Asp Tyr Val Val
100 105 110
Gly He Glu He Lys Phe Ser Lys Asn Asp Lys Gly Glu Lys Asn Leu 115 120 125 Gin Thr Ser Arg Pro Met Trp Asn Ser Asn Leu Pro Lys Pro Asn Ala
130 135 140
He Tyr Val Tyr Gly Val Ala Asn Ala Asn He Thr Phe Phe Lys Gly 145 150 155 160
Ser Asp He Leu Ser Tyr Glu Thr Arg Glu Val Leu Leu Lys Tyr Phe
165 170 175
Asp He Leu Asp Lys Asp Glu Arg Ser Leu Lys Asn Ala Leu Lys Asp
180 185 190
Leu Glu Asn Pro Phe Gly Phe Ala Pro Tyr He Arg Lys Ala Tyr Glu
195 200 205
His Lys Arg Asn Phe Leu Thr Thr Thr Arg Leu Lys Ala Ser Phe Arg
210 215 220
Pro Thr Thr Phe 225
(2) INFORMATION FOR SEQ ID NO:163:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 540 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 53...493 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:163:
CCAAACCCTT TTGAAACACT TGCTCACTAA CCCATTATAA GCCGCAAAAA CC ATG CTC 58
Met Leu
1
TCT TTA AAA CAA GAT TCC TTT TTT TTC TTA TGT TTA GGA ATC CTG GGG 106 Ser Leu Lys Gin Asp Ser Phe Phe Phe Leu Cys Leu Gly He Leu Gly 5 10 15
TTT TAT TTT TAT AGC CTT TTG AGG GAT TTA ATG CCT TTT TTA CCC CCA 154 Phe Tyr Phe Tyr Ser Leu Leu Arg Asp Leu Met Pro Phe Leu Pro Pro 20 25 30
ATG ATT GGG TTT TTA TTC TTG TTT TAT GCG AAA AAA TAC GAT CAT TTT 202 Met He Gly Phe Leu Phe Leu Phe Tyr Ala Lys Lys Tyr Asp His Phe 35 40 45 50
TTA CCC AGT TTG AGC GTG TTT GGT TGT TTG TTT TGG TTT GAG AGC ATG 250 Leu Pro Ser Leu Ser Val Phe Gly Cys Leu Phe Trp Phe Glu Ser Met 55 60 65
CAT TTA AAG ACT TTA GGC GTT TTA GCT TTA TTG TTT TTA ATC TAC CAT 298 His Leu Lys Thr Leu Gly Val Leu Ala Leu Leu Phe Leu He Tyr His 70 75 80
CAA ATC GCC TAT AAA AAC TCT TTA AAG CTT TTT AAT GAC GGC TTT TTA 346 Gin He Ala Tyr Lys Asn Ser Leu Lys Leu Phe Asn Asp Gly Phe Leu 85 90 95
TTC AAA ACT TTG CAT GTT TTT TTG GTT TAT TAC CTT TAT TTA TCG CGC 394 Phe Lys Thr Leu His Val Phe Leu Val Tyr Tyr Leu Tyr Leu Ser Arg 100 105 110
TTT TTT TCG ATG TCT TTG AGT TTG AAA ATA CTC GGC TTT CTC GCT CTT 442 Phe Phe Ser Met Ser Leu Ser Leu Lys He Leu Gly Phe Leu Ala Leu 115 120 125 130
TTT GCT TTA ATA GAA AGC GCT TTG TGG GGT TTG TAT GAA AAA TCT TCG 490 Phe Ala Leu He Glu Ser Ala Leu Trp Gly Leu Tyr Glu Lys Ser Ser 135 140 145
CTA TAAGCTTTTG CTCTTTGTTT TTATAGGGTT TTGGGGGTTA CTAGCCT 540
Leu
(2) INFORMATION FOR SEQ ID NO: 164:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 147 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164:
Met Leu Ser Leu Lys Gin Asp Ser Phe Phe Phe Leu Cys Leu Gly He
1 5 10 15
Leu Gly Phe Tyr Phe Tyr Ser Leu Leu Arg Asp Leu Met Pro Phe Leu
20 25 30
Pro Pro Met He Gly Phe Leu Phe Leu Phe Tyr Ala Lys Lys Tyr Asp
35 40 45
His Phe Leu Pro Ser Leu Ser Val Phe Gly Cys Leu Phe Trp Phe Glu
50 55 60
Ser Met His Leu Lys Thr Leu Gly Val Leu Ala Leu Leu Phe Leu He 65 70 75 80
Tyr His Gin He Ala Tyr Lys Asn Ser Leu Lys Leu Phe Asn Asp Gly
85 90 95
Phe Leu Phe Lys Thr Leu His Val Phe Leu Val Tyr Tyr Leu Tyr Leu
100 105 110
Ser Arg Phe Phe Ser Met Ser Leu Ser Leu Lys He Leu Gly Phe Leu
115 120 125
Ala Leu Phe Ala Leu He Glu Ser Ala Leu Trp Gly Leu Tyr Glu Lys
130 135 140
Ser Ser Leu 145
(2) INFORMATION FOR SEQ ID NO: 165:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1888 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1835 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:165:
CACTAAAGTC AATCCAAGCG CAAGTTGGAT GAAAAAATAA GAAGGAAGTT ATG AAA 56
Met Lys 1
AAG TCA TTC AAA AAA TTA GGC TTT GTC TCT TTA GCG GCT AGT GGC GTG 104 Lys Ser Phe Lys Lys Leu Gly Phe Val Ser Leu Ala Ala Ser Gly Val 5 10 15
CTT TTA GGG AGC ATG AAC GCT ACC GAT TTA GAA ACC TAC GCA GCA TTG 152 Leu Leu Gly Ser Met Asn Ala Thr Asp Leu Glu Thr Tyr Ala Ala Leu 20 25 30
CAA AAA TCA TCG CAT GTT TTT GGT AAT TAT GCT GAA AAG GAT AAG GAT 200 Gin Lys Ser Ser His Val Phe Gly Asn Tyr Ala Glu Lys Asp Lys Asp 35 40 45 50
AGT AAA TTA ACA AGC GAT TCA CCA ACG CAA CAA CAA GAT CAA AAA GTA 248 Ser Lys Leu Thr Ser Asp Ser Pro Thr Gin Gin Gin Asp Gin Lys Val 55 60 65
GCC CAA AAC ACC GCT TCA AAC GAC AGC CAA GAA GCG ACA ACA CTT GAA 296 Ala Gin Asn Thr Ala Ser Asn Asp Ser Gin Glu Ala Thr Thr Leu Glu 70 75 80
AAC ACC GCT TCT ACT GAC AAC ACA ACC GCC ACA ACT GAT GAA ACT TAT 344 Asn Thr Ala Ser Thr Asp Asn Thr Thr Ala Thr Thr Asp Glu Thr Tyr 85 90 95
ACA AAA AGC ACT GAC ACT ACT GTA GCT GGT GCG GCT CAA AAA GTA GAA 392 Thr Lys Ser Thr Asp Thr Thr Val Ala Gly Ala Ala Gin Lys Val Glu 100 105 110
ACC GAT AAC ACA GCC GTT CAA AGC GCT GAA CAA ACT TTA AAA ACA GAT 440 Thr Asp Asn Thr Ala Val Gin Ser Ala Glu Gin Thr Leu Lys Thr Asp 115 120 125 130
GTA GCT AAA GTT CAA GCT GAT GCT AGT GCT AAA GAT TTT GAT GAA ACC 488 Val Ala Lys Val Gin Ala Asp Ala Ser Ala Lys Asp Phe Asp Glu Thr 135 140 145
ACT TTT CAA GCC GAT CAA GCA GCA GAG CAA ACC GCT GAA AAA GCT TTA 536 Thr Phe Gin Ala Asp Gin Ala Ala Glu Gin Thr Ala Glu Lys Ala Leu 150 155 160
CAA CAG GCT GAG AGC AAA CTC AAC ACC GAT CAA CAG ACT TTA AAC ACA 584 Gin Gin Ala Glu Ser Lys Leu Asn Thr Asp Gin Gin Thr Leu Asn Thr 165 170 175
GCG TTA CAA GAT CAG ACG AAA ACA CCA ACC CCA TCA ACC CCA CCA ACT 632 Ala Leu Gin Asp Gin Thr Lys Thr Pro Thr Pro Ser Thr Pro Pro Thr 180 185 190
AAA GAG GAA CCA AAA CAC ACC GCT TCA AGC GGC ACA CCA CCA GCT CCA 680 Lys Glu Glu Pro Lys His Thr Ala Ser Ser Gly Thr Pro Pro Ala Pro 195 200 205 210
GAA AGC CCA CCA GCT AAA AAA GAT GAA ACA AGT GGC ACA CCA AGT GCT 728 Glu Ser Pro Pro Ala Lys Lys Asp Glu Thr Ser Gly Thr Pro Ser Ala 215 220 225
AGT GGG AGT TCT GTG GCA AGC CAG CTA ACC AAA GAT ACC ACT ATG GTT 776 Ser Gly Ser Ser Val Ala Ser Gin Leu Thr Lys Asp Thr Thr Met Val 230 235 240
AAT AAT CTT AAG AGT GTG AGC GTG AGC GCG ATG AAC ACC ACT TTA AGT 824 Asn Asn Leu Lys Ser Val Ser Val Ser Ala Met Asn Thr Thr Leu Ser 245 250 255
GGA GTA GAA ACC ATG TCT CAA CAA ACT GCA ACG ATT GGC AAC CTT TTG 872 Gly Val Glu Thr Met Ser Gin Gin Thr Ala Thr He Gly Asn Leu Leu 260 265 270
AAT AGT AGC ACC GAT TTA AGC AGT GTG ATT CCC AAC GCT CAA GGG CTA 920 Asn Ser Ser Thr Asp Leu Ser Ser Val He Pro Asn Ala Gin Gly Leu 275 280 285 290
AAC AGC GCG TTT AGC ACA TTA GAA AGC GCT CAA AAC ACT CTA AAA GGC 968 Asn Ser Ala Phe Ser Thr Leu Glu Ser Ala Gin Asn Thr Leu Lys Gly 295 300 305
TAT TTA AAT TCT TCT AGC GCG ACG ATT GGG CAA TTG ACA AAC GGA TCT 1016 Tyr Leu Asn Ser Ser Ser Ala Thr He Gly Gin Leu Thr Asn Gly Ser 310 315 320
AAT GCG GTT GTG GGC GCG TTA GAT AAA GCT ATC AAT CAA GTG GAT ATG 1064 Asn Ala Val Val Gly Ala Leu Asp Lys Ala He Asn Gin Val Asp Met 325 330 335
GCT TTG GCC GAT CTT AGT GCA GCT GAT ACG CAA AAA ACG CAA GCC GTT 1112 Ala Leu Ala Asp Leu Ser Ala Ala Asp Thr Gin Lys Thr Gin Ala Val 340 345 350
ACG CTT GCA ACT GCT AGT GAT AGT CCA ACG ACA ACG ACA GAT GCC ATC 1160 Thr Leu Ala Thr Ala Ser Asp Ser Pro Thr Thr Thr Thr Asp Ala He 355 360 365 370
AAT TTC TTA AAC GCG CTA AAA AGC AAT CTA ATG GCT CAA AAA GAC GCT 1208 Asn Phe Leu Asn Ala Leu Lys Ser Asn Leu Met Ala Gin Lys Asp Ala 375 380 385
TTT TTG AAT GTG CAT AAA AAC ATT CAA ACC GCT GTC GCT CAA GCC CAG 1256 Phe Leu Asn Val His Lys Asn He Gin Thr Ala Val Ala Gin Ala Gin 390 395 400
GAA ACC TAC ACG CCA AGC GTG ATC AAC ACC AAT AAT TAC GGG CAA ATG 1304 Glu Thr Tyr Thr Pro Ser Val He Asn Thr Asn Asn Tyr Gly Gin Met 405 410 415
TAT GGG GTA GAT GCG ATG GCA GGG TAT AAG TGG TTC TTT GGC AAA ACC 1352 Tyr Gly Val Asp Ala Met Ala Gly Tyr Lys Trp Phe Phe Gly Lys Thr 420 425 430
AAA CGC TTT GGC TTT AGG TCT TAT GGA TAC TAC AGC TAT AAC CAT GCG 1400 Lys Arg Phe Gly Phe Arg Ser Tyr Gly Tyr Tyr Ser Tyr Asn His Ala 435 440 445 450
AAT TTA AGC TTT GTG GGG AGC CAG CTT GGA ATC ATG GAG GGC GCG TCT 1448 Asn Leu Ser Phe Val Gly Ser Gin Leu Gly He Met Glu Gly Ala Ser 455 460 465
CAA GTG AAT AAC TTC ACT TAT GGC GTG GGC TTT GAT GTG CTC TAT AAC 1496 Gin Val Asn Asn Phe Thr Tyr Gly Val Gly Phe Asp Val Leu Tyr Asn 470 475 480
TTC TAT GAA AGC AAA GAG GGC TAT AAC ACA GCA GGG TTG TTC TTA GGC 1544 Phe Tyr Glu Ser Lys Glu Gly Tyr Asn Thr Ala Gly Leu Phe Leu Gly 485 490 495
TTT GGG TTA GGA GGG GAT TCG TTT ATC GTT CAA GGA GAG AGC TAC TTG 1592 Phe Gly Leu Gly Gly Asp Ser Phe He Val Gin Gly Glu Ser Tyr Leu 500 505 510
AAA TCT CAA ATG CAC ATT TGC AAC AAC ACC GCC GGC TGT TCA GCG AGC 1640 Lys Ser Gin Met His He Cys Asn Asn Thr Ala Gly Cys Ser Ala Ser 515 520 525 530
ATG AAC ACA AGC TAC TTC CAA ATG CCT GTT GAA TTT GGT TTT AGG AGC 1688 Met Asn Thr Ser Tyr Phe Gin Met Pro Val Glu Phe Gly Phe Arg Ser 535 540 545
AAT TTC TCT AAA CAC AGC GGG ATT GAA GTG GGC TTT AAA TTG CCT TTA 1736 Asn Phe Ser Lys His Ser Gly He Glu Val Gly Phe Lys Leu Pro Leu 550 555 560 TTC ACC AAC CAA TTC TAT AAA GAA AGG GGC GTA GAT GGA TCG GTA GAT 1784 Phe Thr Asn Gin Phe Tyr Lys Glu Arg Gly Val Asp Gly Ser Val Asp 565 570 575
GTG TTC TAT AAA AGG AAT TTC TCT ATT TAT TTT AAC TAC ATG ATC AAC 1832 Val Phe Tyr Lys Arg Asn Phe Ser He Tyr Phe Asn Tyr Met He Asn 580 585 590
TTC TAAGCCTTTC TATTCTTTCC AATAGAGGGT TTTCTCTCTG TTGGTTTCTT TTT 188£
Phe
595
(2) INFORMATION FOR SEQ ID NO: 166:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 595 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:
Met Lys Lys Ser Phe Lys Lys Leu Gly Phe Val Ser Leu Ala Ala Ser
1 5 10 15
Gly Val Leu Leu Gly Ser Met Asn Ala Thr Asp Leu Glu Thr Tyr Ala
20 25 30
Ala Leu Gin Lys Ser Ser His Val Phe Gly Asn Tyr Ala Glu Lys Asp
35 40 45
Lys Asp Ser Lys Leu Thr Ser Asp Ser Pro Thr Gin Gin Gin Asp Gin
50 55 60
Lys Val Ala Gin Asn Thr Ala Ser Asn Asp Ser Gin Glu Ala Thr Thr 65 70 75 80
Leu Glu Asn Thr Ala Ser Thr Asp Asn Thr Thr Ala Thr Thr Asp Glu
85 90 95
Thr Tyr Thr Lys Ser Thr Asp Thr Thr Val Ala Gly Ala Ala Gin Lys
100 105 110
Val Glu Thr Asp Asn Thr Ala Val Gin Ser Ala Glu Gin Thr Leu Lys
115 120 125
Thr Asp Val Ala Lys Val Gin Ala Asp Ala Ser Ala Lys Asp Phe Asp
130 135 140
Glu Thr Thr Phe Gin Ala Asp Gin Ala Ala Glu Gin Thr Ala Glu Lys 145 150 155 160
Ala Leu Gin Gin Ala Glu Ser Lys Leu Asn Thr Asp Gin Gin Thr Leu
165 170 175
Asn Thr Ala Leu Gin Asp Gin Thr Lys Thr Pro Thr Pro Ser Thr Pro
180 185 190
Pro Thr Lys Glu Glu Pro Lys His Thr Ala Ser Ser Gly Thr Pro Pro
195 200 205
Ala Pro Glu Ser Pro Pro Ala Lys Lys Asp Glu Thr Ser Gly Thr Pro
210 215 220
Ser Ala Ser Gly Ser Ser Val Ala Ser Gin Leu Thr Lys Asp Thr Thr 225 230 235 240 Met Val Asn Asn Leu Lys Ser Val Ser Val Ser Ala Met Asn Thr Thr
245 250 255
Leu Ser Gly Val Glu Thr Met Ser Gin Gin Thr Ala Thr He Gly Asn
260 265 270
Leu Leu Asn Ser Ser Thr Asp Leu Ser Ser Val He Pro Asn Ala Gin
275 280 285
Gly Leu Asn Ser Ala Phe Ser Thr Leu Glu Ser Ala Gin Asn Thr Leu
290 295 300
Lys Gly Tyr Leu Asn Ser Ser Ser Ala Thr He Gly Gin Leu Thr Asn 305 310 315 320
Gly Ser Asn Ala Val Val Gly Ala Leu Asp Lys Ala He Asn Gin Val
325 330 335
Asp Met Ala Leu Ala Asp Leu Ser Ala Ala Asp Thr Gin Lys Thr Gin
340 345 350
Ala Val Thr Leu Ala Thr Ala Ser Asp Ser Pro Thr Thr Thr Thr Asp
355 360 365
Ala He Asn Phe Leu Asn Ala Leu Lys Ser Asn Leu Met Ala Gin Lys
370 375 380
Asp Ala Phe Leu Asn Val His Lys Asn He Gin Thr Ala Val Ala Gin 385 390 395 400
Ala Gin Glu Thr Tyr Thr Pro Ser Val He Asn Thr Asn Asn Tyr Gly
405 410 415
Gin Met Tyr Gly Val Asp Ala Met Ala Gly Tyr Lys Trp Phe Phe Gly
420 425 430
Lys Thr Lys Arg Phe Gly Phe Arg Ser Tyr Gly Tyr Tyr Ser Tyr Asn
435 440 445
His Ala Asn Leu Ser Phe Val Gly Ser Gin Leu Gly He Met Glu Gly
450 455 460
Ala Ser Gin Val Asn Asn Phe Thr Tyr Gly Val Gly Phe Asp Val Leu 465 470 475 480
Tyr Asn Phe Tyr Glu Ser Lys Glu Gly Tyr Asn Thr Ala Gly Leu Phe
485 490 495
Leu Gly Phe Gly Leu Gly Gly Asp Ser Phe He Val Gin Gly Glu Ser
500 505 510
Tyr Leu Lys Ser Gin Met His He Cys Asn Asn Thr Ala Gly Cys Ser
515 520 525
Ala Ser Met Asn Thr Ser Tyr Phe Gin Met Pro Val Glu Phe Gly Phe
530 535 540
Arg Ser Asn Phe Ser Lys His Ser Gly He Glu Val Gly Phe Lys Leu 545 550 555 560
Pro Leu Phe Thr Asn Gin Phe Tyr Lys Glu Arg Gly Val Asp Gly Ser
565 570 575
Val Asp Val Phe Tyr Lys Arg Asn Phe Ser He Tyr Phe Asn Tyr Met
580 585 590
He Asn Phe 595
(2) INFORMATION FOR SEQ ID NO: 167:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1470 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 46...1416 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167:
TTTAAGAAAC TATTTGCGCA TTTGATGTTA AGGTTTCTCT AAAGC ATG CGT TAT TTT 57
Met Arg Tyr Phe 1
CTT GTA GTT TTC TTG TTT TTG TTT GTG GGT TGC ACA AAA AAG GAT TTC 105 Leu Val Val Phe Leu Phe Leu Phe Val Gly Cys Thr Lys Lys Asp Phe 5 10 15 20
ACG CTC AAA GAT TTA TCC TTG CCC CAA GAG GCT TCA AGC TAT CTT GCA 153 Thr Leu Lys Asp Leu Ser Leu Pro Gin Glu Ala Ser Ser Tyr Leu Ala 25 30 35
AGC TCT CAA AAT GGC AGT AAC AAC AAC CAA AGC ATT GAC CCC CAA GCG 201 Ser Ser Gin Asn Gly Ser Asn Asn Asn Gin Ser He Asp Pro Gin Ala 40 45 50
TTA AGA GAA AAT CTG AAA GAG AGC TAT CTC AAA GCG TGG TAT TCC CCA 249 Leu Arg Glu Asn Leu Lys Glu Ser Tyr Leu Lys Ala Trp Tyr Ser Pro 55 60 65
TGG CTA GAT ATG AAA GTC AAA AGC AAT AAA AAA GAA GTG TTT TGG ATC 297 Trp Leu Asp Met Lys Val Lys Ser Asn Lys Lys Glu Val Phe Trp He 70 75 80
CTT AAG GAG ATG AAT AAA TCC ACC GGT TAT GGC GAA GAT CTA AAA CCC 345 Leu Lys Glu Met Asn Lys Ser Thr Gly Tyr Gly Glu Asp Leu Lys Pro 85 90 95 100
AAC GCA AAA GCT TTC AAT GAC GCA CTC ATT AAG AGC ATG GAT ATT GAG 393 Asn Ala Lys Ala Phe Asn Asp Ala Leu He Lys Ser Met Asp He Glu 105 110 115
CAT TAC CCT AGC GTT AAG ATT AGG GCT GTT GTA GCG CGA GAT AGC GAT 441 His Tyr Pro Ser Val Lys He Arg Ala Val Val Ala Arg Asp Ser Asp 120 125 130
GTG AGG GCT GTG CCT ACT AAC AAA CCT TAT TAT CTT TCT CAA AAA GGC 489 Val Arg Ala Val Pro Thr Asn Lys Pro Tyr Tyr Leu Ser Gin Lys Gly 135 140 145
TAT CCT TTT GAT AGG TAT CAA AAT TCG CTG ATT TTT CAA GGC ACG CCG 537 Tyr Pro Phe Asp Arg Tyr Gin Asn Ser Leu He Phe Gin Gly Thr Pro 150 155 160 GTT TTA ATC ACG CAT TTT AAT CTA GAT AAA ACT TAT GCC CAC ATT CAA 585 Val Leu He Thr His Phe Asn Leu Asp Lys Thr Tyr Ala His He Gin 165 170 175 180
AGC AGT TTT GTT TAT GGC TGG ATC AAA GTT AGC GAT TTA GTC TAC ATG 633 Ser Ser Phe Val Tyr Gly Trp He Lys Val Ser Asp Leu Val Tyr Met 185 190 195
CAC GAT AAA GAC ATA GAG CTT TTA ACC CAT CTT AAA GAT TAT GTC ATG 681 His Asp Lys Asp He Glu Leu Leu Thr His Leu Lys Asp Tyr Val Met 200 205 210
CCT ATA AAA GAT AAA ATC CCC CTT TAT ACA GAC TAT GGG GAT TTT TAC 729 Pro He Lys Asp Lys He Pro Leu Tyr Thr Asp Tyr Gly Asp Phe Tyr 215 220 225
ACC AAC GCC AGA GTG GGC GAA TTG TTC GCT CTC ATC CCC CAA AGT CAA 777 Thr Asn Ala Arg Val Gly Glu Leu Phe Ala Leu He Pro Gin Ser Gin 230 235 240
AAA ACA CCT CAA AAA CCC CAA AAA AAG GAA TTG AAA GCC TAT GGT TTT 825 Lys Thr Pro Gin Lys Pro Gin Lys Lys Glu Leu Lys Ala Tyr Gly Phe 245 250 255 260
TTG AGA GAC GCT AAG GGT TAT GCA GCT TTA CAA AGC GTG ATC TTA GAA 873 Leu Arg Asp Ala Lys Gly Tyr Ala Ala Leu Gin Ser Val He Leu Glu 265 270 275
GAA AAG GAT TTT TTT GTT TTC CCT AAG GCT TTT AAC AGC GAG AAC ATG 921 Glu Lys Asp Phe Phe Val Phe Pro Lys Ala Phe Asn Ser Glu Asn Met 280 285 290
GCG TAT TTT ATA GAC ACC ATG TTA GGG CAA AAA TAC GGC TGG GGC GGG 969 Ala Tyr Phe He Asp Thr Met Leu Gly Gin Lys Tyr Gly Trp Gly Gly 295 300 305
CTA TTG GGT AAT AGG GAT TGC TCG GCT TTC ACC AGA GAT AGT TTT GCT 1017 Leu Leu Gly Asn Arg Asp Cys Ser Ala Phe Thr Arg Asp Ser Phe Ala 310 315 320
AAT TTT GGT ATT TTG CTC CCC AGA AAT TCC TAT GCG CAA AGC CGT TAT 1065 Asn Phe Gly He Leu Leu Pro Arg Asn Ser Tyr Ala Gin Ser Arg Tyr 325 330 335 340
GCG AAC AAT TAT GTG GAT TTA AGC TCT ATG AAA GCC AAA GAA AAA GAA 1113 Ala Asn Asn Tyr Val Asp Leu Ser Ser Met Lys Ala Lys Glu Lys Glu 345 350 355
GAC TAC ATC CTT AAA AAC GCC ACG CCT TTT GGA ACG CTC ATC TAT TTA 1161 Asp Tyr He Leu Lys Asn Ala Thr Pro Phe Gly Thr Leu He Tyr Leu 360 365 370
AAA GGG CAT ATC ATG CTT TAT TTA GGC GCA CAC AAC CAT CAA GCG ATA 1209 Lys Gly His He Met Leu Tyr Leu Gly Ala His Asn His Gin Ala He 375 380 385 GTC GCT CAC AGC ATT TGG TCG GTG CAA ACC CAA AAG CAT TTT AAA ACC 1257 Val Ala His Ser He Trp Ser Val Gin Thr Gin Lys His Phe Lys Thr 390 395 400
TTG AGC CAT AAA ATA GGA GGC GTG GTG ATC ACT TCG TTA TGG TTA GCT 1305 Leu Ser His Lys He Gly Gly Val Val He Thr Ser Leu Trp Leu Ala 405 410 415 420
GAA GAG CAT AAT GGG GCG TTT TCT AAA AAG AAA TTA TTG ATT GAT AGG 1353 Glu Glu His Asn Gly Ala Phe Ser Lys Lys Lys Leu Leu He Asp Arg 425 430 435
GTG CTT GGA ATG AGC GAT TTG AAA GAT TTT GTC AAT AAA ACT TCA AGC 1401 Val Leu Gly Met Ser Asp Leu Lys Asp Phe Val Asn Lys Thr Ser Ser 440 445 450
CCT TTA AAT GCG AAT TGATTTTCTT ATATTATGAT TACGATTTAT CAATTTAAAA C 1457 Pro Leu Asn Ala Asn 455
ATTTGGAGAA AGA 1470
(2) INFORMATION FOR SEQ ID NO: 168:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 457 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:
Met Arg Tyr Phe Leu Val Val Phe Leu Phe Leu Phe Val Gly Cys Thr
1 5 10 15
Lys Lys Asp Phe Thr Leu Lys Asp Leu Ser Leu Pro Gin Glu Ala Ser
20 25 30
Ser Tyr Leu Ala Ser Ser Gin Asn Gly Ser Asn Asn Asn Gin Ser He
35 40 45
Asp Pro Gin Ala Leu Arg Glu Asn Leu Lys Glu Ser Tyr Leu Lys Ala
50 55 60
Trp Tyr Ser Pro Trp Leu Asp Met Lys Val Lys Ser Asn Lys Lys Glu 65 70 75 80
Val Phe Trp He Leu Lys Glu Met Asn Lys Ser Thr Gly Tyr Gly Glu
85 90 95
Asp Leu Lys Pro Asn Ala Lys Ala Phe Asn Asp Ala Leu He Lys Ser
100 105 110
Met Asp He Glu His Tyr Pro Ser Val Lys He Arg Ala Val Val Ala
115 120 125
Arg Asp Ser Asp Val Arg Ala Val Pro Thr Asn Lys Pro Tyr Tyr Leu
130 135 140
Ser Gin Lys Gly Tyr Pro Phe Asp Arg Tyr Gin Asn Ser Leu He Phe 145 150 155 160 Gin Gly Thr Pro Val Leu He Thr His Phe Asn Leu Asp Lys Thr Tyr
165 170 175
Ala His He Gin Ser Ser Phe Val Tyr Gly Trp He Lys Val Ser Asp
180 185 190
Leu Val Tyr Met His Asp Lys Asp He Glu Leu Leu Thr His Leu Lys
195 200 205
Asp Tyr Val Met Pro He Lys Asp Lys He Pro Leu Tyr Thr Asp Tyr
210 215 220
Gly Asp Phe Tyr Thr Asn Ala Arg Val Gly Glu Leu Phe Ala Leu He 225 230 235 240
Pro Gin Ser Gin Lys Thr Pro Gin Lys Pro Gin Lys Lys Glu Leu Lys
245 250 255
Ala Tyr Gly Phe Leu Arg Asp Ala Lys Gly Tyr Ala Ala Leu Gin Ser
260 265 270
Val He Leu Glu Glu Lys Asp Phe Phe Val Phe Pro Lys Ala Phe Asn
275 280 285
Ser Glu Asn Met Ala Tyr Phe He Asp Thr Met Leu Gly Gin Lys Tyr
290 295 300
Gly Trp Gly Gly Leu Leu Gly Asn Arg Asp Cys Ser Ala Phe Thr Arg 305 310 315 320
Asp Ser Phe Ala Asn Phe Gly He Leu Leu Pro Arg Asn Ser Tyr Ala
325 330 335
Gin Ser Arg Tyr Ala Asn Asn Tyr Val Asp Leu Ser Ser Met Lys Ala
340 345 350
Lys Glu Lys Glu Asp Tyr He Leu Lys Asn Ala Thr Pro Phe Gly Thr
355 360 365
Leu He Tyr Leu Lys Gly His He Met Leu Tyr Leu Gly Ala His Asn
370 375 380
His Gin Ala He Val Ala His Ser He Trp Ser Val Gin Thr Gin Lys 385 390 395 400
His Phe Lys Thr Leu Ser His Lys He Gly Gly Val Val He Thr Ser
405 410 415
Leu Trp Leu Ala Glu Glu His Asn Gly Ala Phe Ser Lys Lys Lys Leu
420 425 430
Leu He Asp Arg Val Leu Gly Met Ser Asp Leu Lys Asp Phe Val Asn
435 440 445
Lys Thr Ser Ser Pro Leu Asn Ala Asn 450 455
(2) INFORMATION FOR SEQ ID NO: 169:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 235 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...182 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169:
CTTTTACTTT ATAATTATCG TTGGCATTTT AATATTCAAA GGAGCTTGAA ATG AGA 56
Met Arg 1
ATT TCT CTT TTA GCT GTA ATT TTA GCG TTA TTG TTT GTG GCT TGC CAC 104 He Ser Leu Leu Ala Val He Leu Ala Leu Leu Phe Val Ala Cys His 5 10 15
GAA ACT AAA AAA CAA ATC TTA CAA AAC GAA GCC GAT AGC ACC CCT TCA 152 Glu Thr Lys Lys Gin He Leu Gin Asn Glu Ala Asp Ser Thr Pro Ser 20 25 30
GAA AAA ACC ATT TGG CAA CCT GAA CAA AAA TAAAAATTGT AAAAATACTC AAA 205 Glu Lys Thr He Trp Gin Pro Glu Gin Lys 35 40
GGCATTTTTT AAAATAAACG CAATAAAAAA 235
(2) INFORMATION FOR SEQ ID NO: 170:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 44 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170:
Met Arg He Ser Leu Leu Ala Val He Leu Ala Leu Leu Phe Val Ala
1 5 10 15
Cys His Glu Thr Lys Lys Gin He Leu Gin Asn Glu Ala Asp Ser Thr
20 25 30
Pro Ser Glu Lys Thr He Trp Gin Pro Glu Gin Lys 35 40
(2) INFORMATION FOR SEQ ID NO: 171:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1351 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1298 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171:
CCAGCGATGC GTCCTGTTAA AAACGATTTT AATGTGGAGA GTGAAGAATA ATG GCG 56
Met Ala 1
TAT TTT TTA GAA CAA ACG GAT AGT GAA ATT TTT GAG CTT ATC TTT GAA 104 Tyr Phe Leu Glu Gin Thr Asp Ser Glu He Phe Glu Leu He Phe Glu 5 10 15
GAA TAC AAG CGG CAA AAT GAG CAT TTA GAA ATG ATA GCG AGC GAG AAT 152 Glu Tyr Lys Arg Gin Asn Glu His Leu Glu Met He Ala Ser Glu Asn 20 25 30
TAC ACT TTT GCA AGC GTT ATG GAG GCT ATG GGG AGT GTT TTA ACG AAT 200 Tyr Thr Phe Ala Ser Val Met Glu Ala Met Gly Ser Val Leu Thr Asn 35 40 45 50
AAA TAC GCT GAA GGC TAC CCT AAC AAG CGC TAT TAT GGA GGC TGT GAA 248 Lys Tyr Ala Glu Gly Tyr Pro Asn Lys Arg Tyr Tyr Gly Gly Cys Glu 55 60 65
GTG GTG GAT AAA ATA GAA AGC CTA GCC ATA GAA AGG GCT AAA AAG CTT 296 Val Val Asp Lys He Glu Ser Leu Ala He Glu Arg Ala Lys Lys Leu 70 75 80
TTT AAT TGC CAG TTC GCT AAC GTG CAA GCG CAT TCA GGC TCA CAA GCC 344 Phe Asn Cys Gin Phe Ala Asn Val Gin Ala His Ser Gly Ser Gin Ala 85 90 95
AAT AAC GCT GTC TAT CAC GCT CTT TTA AAG CCT TAT GAC AAG ATT TTA 392 Asn Asn Ala Val Tyr His Ala Leu Leu Lys Pro Tyr Asp Lys He Leu 100 105 110
GGC ATG GAT TTA AGC TGT GGA GGG CAT TTA ACG CAT GGC GCT AAA GTG 440 Gly Met Asp Leu Ser Cys Gly Gly His Leu Thr His Gly Ala Lys Val 115 120 125 130
AGT TTA ACC GGC AAG CAT TAT CAG AGC TTT TCT TAT GGC GTG AAT TTG 488 Ser Leu Thr Gly Lys His Tyr Gin Ser Phe Ser Tyr Gly Val Asn Leu 135 140 145
GAT GGC TAT ATT GAT TAT GAA GAG GCG CTA AAA ATC GCT CAA AGC GTT 536 Asp Gly Tyr He Asp Tyr Glu Glu Ala Leu Lys He Ala Gin Ser Val 150 155 160
AAG CCA GAA ATC ATC GTG TGC GGG TTT TCA GCC TAT CCA AGG GAG ATT 584 Lys Pro Glu He He Val Cys Gly Phe Ser Ala Tyr Pro Arg Glu He 165 170 175
GAT TTT AAG AAA TTT AGA GAA ATC GCT GAT GAA GTG GGG GCG TTA CTA 632 Asp Phe Lys Lys Phe Arg Glu He Ala Asp Glu Val Gly Ala Leu Leu 180 185 190
TTA GGC GAT ATA GCC CAT GTG GCA GGG CTT GTG GTA ACC GGT GAG CAT 680 Leu Gly Asp He Ala His Val Ala Gly Leu Val Val Thr Gly Glu His 195 200 205 210
GCC CAT CCT TTC CCG CAT TGC CAT GTG GTT TCA AGC ACC ACT CAT AAG 728 Ala His Pro Phe Pro His Cys His Val Val Ser Ser Thr Thr His Lys 215 220 225
ACC TTA AGA GGG CCT AGA GGG GGG ATT ATT TTA ACT AAT GAT GAA GAG 776 Thr Leu Arg Gly Pro Arg Gly Gly He He Leu Thr Asn Asp Glu Glu 230 235 240
ATA GCG GCT AAG ATT GAC AAA GCG ATT TTT CCA GGA ACT CAA GGC GGG 824 He Ala Ala Lys He Asp Lys Ala He Phe Pro Gly Thr Gin Gly Gly 245 250 255
CCT TTG ATG CAT GTG ATT GCT GCT AAA GCG GTG GGT TTT AAA GAG AAT 872 Pro Leu Met His Val He Ala Ala Lys Ala Val Gly Phe Lys Glu Asn 260 265 270
CTA AAA CCA GAA TTT AAA GCT TAT GCA CAA TTA GTG AAA TCT AAC ATG 920 Leu Lys Pro Glu Phe Lys Ala Tyr Ala Gin Leu Val Lys Ser Asn Met 275 280 285 290
CAA GTT TTG GCT AAA GCG TTA AAA GAA AAA AAC CAT AAG TTA GTG AGT 968 Gin Val Leu Ala Lys Ala Leu Lys Glu Lys Asn His Lys Leu Val Ser 295 300 305
GGT GGC ACT TCT AAC CAT TTG CTT TTA ATG GAT TTT TTA GAT AAG CCT 1016 Gly Gly Thr Ser Asn His Leu Leu Leu Met Asp Phe Leu Asp Lys Pro 310 315 320
TAT AGC GGG AAA GAC GCT GAT ATT GCA TTA GGG AAT GCC GGA ATC ACC 1064 Tyr Ser Gly Lys Asp Ala Asp He Ala Leu Gly Asn Ala Gly He Thr 325 330 335
GTG AAT AAA AAC ACC ATT CCT GGT GAA ACG CGC AGC CCT TTT GTA ACG 1112 Val Asn Lys Asn Thr He Pro Gly Glu Thr Arg Ser Pro Phe Val Thr 340 345 350
AGC GGG ATA AGG ATT GGC TCA GCG GCA TTG AGC GCA AGG GGC ATG GGA 1160 Ser Gly He Arg He Gly Ser Ala Ala Leu Ser Ala Arg Gly Met Gly 355 360 365 370
GCT AAG GAA TTT GAA ATC ATA GGG AAT AAA ATA TCA GAT ATT TTG AAT 1208 Ala Lys Glu Phe Glu He He Gly Asn Lys He Ser Asp He Leu Asn 375 380 385
GAT ATT AAT AAT GTT AGT TTG CAA TTG CAT GTG AAA GAA GAA TTG AAA 1256 Asp He Asn Asn Val Ser Leu Gin Leu His Val Lys Glu Glu Leu Lys 390 395 400 GCC ATG GTC AAT CAA TTC CCT GTG TAC CAC CAA CCT ATT TTT TAAGGGAGT 1307 Ala Met Val Asn Gin Phe Pro Val Tyr His Gin Pro He Phe 405 410 415
CAAGATGACA GAAATGGAAT TAAAGCTCAT TAAGATAGAC ACAA 1351
(2) INFORMATION FOR SEQ ID NO: 172:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 416 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:
Met Ala Tyr Phe Leu Glu Gin Thr Asp Ser Glu He Phe Glu Leu He
1 5 10 15
Phe Glu Glu Tyr Lys Arg Gin Asn Glu His Leu Glu Met He Ala Ser
20 25 30
Glu Asn Tyr Thr Phe Ala Ser Val Met Glu Ala Met Gly Ser Val Leu
35 40 45
Thr Asn Lys Tyr Ala Glu Gly Tyr Pro Asn Lys Arg Tyr Tyr Gly Gly
50 55 60
Cys Glu Val Val Asp Lys He Glu Ser Leu Ala He Glu Arg Ala Lys 65 70 75 80
Lys Leu Phe Asn Cys Gin Phe Ala Asn Val Gin Ala His Ser Gly Ser
85 90 95
Gin Ala Asn Asn Ala Val Tyr His Ala Leu Leu Lys Pro Tyr Asp Lys
100 105 110
He Leu Gly Met Asp Leu Ser Cys Gly Gly His Leu Thr His Gly Ala
115 120 125
Lys Val Ser Leu Thr Gly Lys His Tyr Gin Ser Phe Ser Tyr Gly Val
130 135 140
Asn Leu Asp Gly Tyr He Asp Tyr Glu Glu Ala Leu Lys He Ala Gin 145 150 155 160
Ser Val Lys Pro Glu He He Val Cys Gly Phe Ser Ala Tyr Pro Arg
165 170 175
Glu He Asp Phe Lys Lys Phe Arg Glu He Ala Asp Glu Val Gly Ala
180 185 190
Leu Leu Leu Gly Asp He Ala His Val Ala Gly Leu Val Val Thr Gly
195 200 205
Glu His Ala His Pro Phe Pro His Cys His Val Val Ser Ser Thr Thr
210 215 220
His Lys Thr Leu Arg Gly Pro Arg Gly Gly He He Leu Thr Asn Asp 225 230 235 240
Glu Glu He Ala Ala Lys He Asp Lys Ala He Phe Pro Gly Thr Gin
245 250 255
Gly Gly Pro Leu Met His Val He Ala Ala Lys Ala Val Gly Phe Lys
260 265 270
Glu Asn Leu Lys Pro Glu Phe Lys Ala Tyr Ala Gin Leu Val Lys Ser 275 280 285 Asn Met Gin Val Leu Ala Lys Ala Leu Lys Glu Lys Asn His Lys Leu
290 295 300
Val Ser Gly Gly Thr Ser Asn His Leu Leu Leu Met Asp Phe Leu Asp 305 310 315 320
Lys Pro Tyr Ser Gly Lys Asp Ala Asp He Ala Leu Gly Asn Ala Gly
325 330 335
He Thr Val Asn Lys Asn Thr He Pro Gly Glu Thr Arg Ser Pro Phe
340 345 350
Val Thr Ser Gly He Arg He Gly Ser Ala Ala Leu Ser Ala Arg Gly
355 360 365
Met Gly Ala Lys Glu Phe Glu He He Gly Asn Lys He Ser Asp He
370 375 380
Leu Asn Asp He Asn Asn Val Ser Leu Gin Leu His Val Lys Glu Glu 385 390 395 400
Leu Lys Ala Met Val Asn Gin Phe Pro Val Tyr His Gin Pro He Phe 405 410 415
(2) INFORMATION FOR SEQ ID NO: 173:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1513 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1460 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:173:
TGAAAAACCA CTCCTTTAAA AAAACGATCG CTCTTTCCTT ACTAGCGAGC ATG TCT 56
Met Ser 1
TTG TGC AGG GCT GAA GAA GAT GGG GCG TTT TTT GTC ATA GAT TAC CAG 104 Leu Cys Arg Ala Glu Glu Asp Gly Ala Phe Phe Val He Asp Tyr Gin 5 10 15
ACG AGT TTG GCC AGA CAG GAA TTG AAA AAT CCA GGC TTC ACC CAA GCG 152 Thr Ser Leu Ala Arg Gin Glu Leu Lys Asn Pro Gly Phe Thr Gin Ala 20 25 30
CAA GAA TTA AGG CAG TTG ATC AGA GAT GGG GCT GTG AGG TTG CAA ACT 200 Gin Glu Leu Arg Gin Leu He Arg Asp Gly Ala Val Arg Leu Gin Thr 35 40 45 50
TCT GCC ATT CCC TTA TCC TAC TAC TTG GAT ATT TTA GGG AAT AAA ACA 248 Ser Ala He Pro Leu Ser Tyr Tyr Leu Asp He Leu Gly Asn Lys Thr 55 60 65
GCA ACT CTT TTG CGT GAA AGC CTG AAA AAC AAT GCA CAA CCA TCA CAA 296 Ala Thr Leu Leu Arg Glu Ser Leu Lys Asn Asn Ala Gin Pro Ser Gin 70 75 80
CCA AAC GCG CAA CCA CCG CAA CAA AAC GGA CCA TCA AAC CAA GCC TTA 344 Pro Asn Ala Gin Pro Pro Gin Gin Asn Gly Pro Ser Asn Gin Ala Leu 85 90 95
GCC AAT TTA GAG CAA TCT CTA GGG ATT TTA GGA AAA CTA TTG GAT CTA 392 Ala Asn Leu Glu Gin Ser Leu Gly He Leu Gly Lys Leu Leu Asp Leu 100 105 110
TCC CAA CAA TAC GCT AGT CAG GGT GTC ATT AAG CCT TTG GTG GTG GAT 440 Ser Gin Gin Tyr Ala Ser Gin Gly Val He Lys Pro Leu Val Val Asp 115 120 125 130
GTG GGG AAA GAA CAA ATC GGT ATC ACG GAT AGC ATG CTC TTG GTG GCT 488 Val Gly Lys Glu Gin He Gly He Thr Asp Ser Met Leu Leu Val Ala 135 140 145
CAA AAC ATC GTT TTA GCT TTA GGG CAA GTG GAT TTG AGC AAA ATC CAA 536 Gin Asn He Val Leu Ala Leu Gly Gin Val Asp Leu Ser Lys He Gin 150 155 160
CAA AAC AAT AAC GAA CAG CTA TAC GAA AAT ATT ATG AAA GTC ATG CTT 584 Gin Asn Asn Asn Glu Gin Leu Tyr Glu Asn He Met Lys Val Met Leu 165 170 175
TTA GGC GCG GGC GGG ACT AAT GGG GCG TAT AAT GGC GTG AGT GTG GGC 632 Leu Gly Ala Gly Gly Thr Asn Gly Ala Tyr Asn Gly Val Ser Val Gly 180 185 190
GAC ATT GCC ACG GGC ATG CAA AAT TTT TCT TCG CAA ACG GGC TTG ATA 680 Asp He Ala Thr Gly Met Gin Asn Phe Ser Ser Gin Thr Gly Leu He 195 200 205 210
GGG GCT AAT TCT ACG GTT AGC GAG CTG AAT GCT TTG ATT AAG AGC GGG 728 Gly Ala Asn Ser Thr Val Ser Glu Leu Asn Ala Leu He Lys Ser Gly 215 220 225
ATT TCT TTG GAT CGT GAG ACT TTG GGG TTA GGG AGT TTT ATT GAA AAA 776 He Ser Leu Asp Arg Glu Thr Leu Gly Leu Gly Ser Phe He Glu Lys 230 235 240
AAT ATC TGT AGC GGT GCA TCG TCT TGT TTT AGT GGG AAT CAG CTT ATC 824 Asn He Cys Ser Gly Ala Ser Ser Cys Phe Ser Gly Asn Gin Leu He 245 250 255
TAT AAG AAA GGG CTA GAC AGA ACC ATA AAC ATC ATT AAT ACG GTA TTA 872 Tyr Lys Lys Gly Leu Asp Arg Thr He Asn He He Asn Thr Val Leu 260 265 270
GGT CAG TTT GAA TCT TCG GCT AGT TCT CTT TAT AAG ATT TCT TAT ATC 920 Gly Gin Phe Glu Ser Ser Ala Ser Ser Leu Tyr Lys He Ser Tyr He 275 280 285 290
CCT AAC CTC TTT TCG CTC AAG GAT TAC CAG TCA GCG AGC ATG AAC GGC 968 Pro Asn Leu Phe Ser Leu Lys Asp Tyr Gin Ser Ala Ser Met Asn Gly 295 300 305
TTT GGG GCT AAG ATG GGC TAT AAA CAA TTT TTC ACC CAT AAG AAA AAT 1016 Phe Gly Ala Lys Met Gly Tyr Lys Gin Phe Phe Thr His Lys Lys Asn 310 315 320
GTT GGC TTA AGG TAT TAC GGG TTT TTG GAT TAT GGC TAT GCG AAC TTT 1064 Val Gly Leu Arg Tyr Tyr Gly Phe Leu Asp Tyr Gly Tyr Ala Asn Phe 325 330 335
GGC GAT ACG AAT TTA AAA GTG GGG GCG AAT CTT GTT ACT TAT GGG GTA 1112 Gly Asp Thr Asn Leu Lys Val Gly Ala Asn Leu Val Thr Tyr Gly Val 340 345 350
GGA ACG GAT TTT TTA TAC AAT GTG TAT GAA CGC TCT AGA AGG AGG GAA 1160 Gly Thr Asp Phe Leu Tyr Asn Val Tyr Glu Arg Ser Arg Arg Arg Glu 355 360 365 370
AGG ACT ACG ATC GGT CTT TTC TTT GGC GCT CAA ATT GCA GGG CAA ACT 1208 Arg Thr Thr He Gly Leu Phe Phe Gly Ala Gin He Ala Gly Gin Thr 375 380 385
TGG AGC ACT AAT GTA ACG AAC TTA TTG AGC GGG CAA AGG CCT GAT GTC 1256 Trp Ser Thr Asn Val Thr Asn Leu Leu Ser Gly Gin Arg Pro Asp Val 390 395 400
AAG TCC AGT TCG TTC CAA TTC TTG TTT GAT TTG GGC GTG CGC ACC AAC 1304 Lys Ser Ser Ser Phe Gin Phe Leu Phe Asp Leu Gly Val Arg Thr Asn 405 410 415
TTT GCA AAA ACC AAT TTC AAT AAG CAC AGG CTA GAC CAA GGG ATA GAA 1352 Phe Ala Lys Thr Asn Phe Asn Lys His Arg Leu Asp Gin Gly He Glu 420 425 430
TTT GGG GTG AAA ATC CCT GTT ATC GCT CAT AAA TAT TTT GCA ACC CAA 1400 Phe Gly Val Lys He Pro Val He Ala His Lys Tyr Phe Ala Thr Gin 435 440 445 450
GGC TCA AGC GCG AGC TAT ATG AGG AAT TTT AGC TTC TAT GTG GGC TAT 1448 Gly Ser Ser Ala Ser Tyr Met Arg Asn Phe Ser Phe Tyr Val Gly Tyr 455 460 465
TCA GTC GGT TTT TAAGGAAGGC TCTTGATGAA AAATACCAAT ACAAAAGAGA TAAAG 1505 Ser Val Gly Phe 470
AATACAAG 1513
(2) INFORMATION FOR SEQ ID NO: 174: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 470 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174:
Met Ser Leu Cys Arg Ala Glu Glu Asp Gly Ala Phe Phe Val He Asp
1 5 10 15
Tyr Gin Thr Ser Leu Ala Arg Gin Glu Leu Lys Asn Pro Gly Phe Thr
20 25 30
Gin Ala Gin Glu Leu Arg Gin Leu He Arg Asp Gly Ala Val Arg Leu
35 40 45
Gin Thr Ser Ala He Pro Leu Ser Tyr Tyr Leu Asp He Leu Gly Asn
50 55 60
Lys Thr Ala Thr Leu Leu Arg Glu Ser Leu Lys Asn Asn Ala Gin Pro 65 70 75 80
Ser Gin Pro Asn Ala Gin Pro Pro Gin Gin Asn Gly Pro Ser Asn Gin
85 90 95
Ala Leu Ala Asn Leu Glu Gin Ser Leu Gly He Leu Gly Lys Leu Leu
100 105 110
Asp Leu Ser Gin Gin Tyr Ala Ser Gin Gly Val He Lys Pro Leu Val
115 120 125
Val Asp Val Gly Lys Glu Gin He Gly He Thr Asp Ser Met Leu Leu
130 135 140
Val Ala Gin Asn He Val Leu Ala Leu Gly Gin Val Asp Leu Ser Lys 145 150 155 160
He Gin Gin Asn Asn Asn Glu Gin Leu Tyr Glu Asn He Met Lys Val
165 170 175
Met Leu Leu Gly Ala Gly Gly Thr Asn Gly Ala Tyr Asn Gly Val Ser
180 185 190
Val Gly Asp He Ala Thr Gly Met Gin Asn Phe Ser Ser Gin Thr Gly
195 200 205
Leu He Gly Ala Asn Ser Thr Val Ser Glu Leu Asn Ala Leu He Lys
210 215 220
Ser Gly He Ser Leu Asp Arg Glu Thr Leu Gly Leu Gly Ser Phe He 225 230 235 240
Glu Lys Asn He Cys Ser Gly Ala Ser Ser Cys Phe Ser Gly Asn Gin
245 250 255
Leu He Tyr Lys Lys Gly Leu Asp Arg Thr He Asn He He Asn Thr
260 265 270
Val Leu Gly Gin Phe Glu Ser Ser Ala Ser Ser Leu Tyr Lys He Ser
275 280 285
Tyr He Pro Asn Leu Phe Ser Leu Lys Asp Tyr Gin Ser Ala Ser Met
290 295 300
Asn Gly Phe Gly Ala Lys Met Gly Tyr Lys Gin Phe Phe Thr His Lys 305 310 315 320
Lys Asn Val Gly Leu Arg Tyr Tyr Gly Phe Leu Asp Tyr Gly Tyr Ala
325 330 335
Asn Phe Gly Asp Thr Asn Leu Lys Val Gly Ala Asn Leu Val Thr Tyr 340 345 350 Gly Val Gly Thr Asp Phe Leu Tyr Asn Val Tyr Glu Arg Ser Arg Arg
355 360 365
Arg Glu Arg Thr Thr He Gly Leu Phe Phe Gly Ala Gin He Ala Gly
370 375 380
Gin Thr Trp Ser Thr Asn Val Thr Asn Leu Leu Ser Gly Gin Arg Pro 385 390 395 400
Asp Val Lys Ser Ser Ser Phe Gin Phe Leu Phe Asp Leu Gly Val Arg
405 410 415
Thr Asn Phe Ala Lys Thr Asn Phe Asn Lys His Arg Leu Asp Gin Gly
420 425 430
He Glu Phe Gly Val Lys He Pro Val He Ala His Lys Tyr Phe Ala
435 440 445
Thr Gin Gly Ser Ser Ala Ser Tyr Met Arg Asn Phe Ser Phe Tyr Val
450 455 460
Gly Tyr Ser Val Gly Phe 465 470
(2) INFORMATION FOR SEQ ID NO:175
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 505 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...452 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175:
AAGACCCTAA ATACACACAA AATAAAACAA AATAAATCCA ATCAAATCCC ATG TGC 56
Met Cys
1
CAA ATC CAA TGC TTG CTT ATT TTA CTT TCT ATC AAT ATA GTT AGC GCG 104 Gin He Gin Cys Leu Leu He Leu Leu Ser He Asn He Val Ser Ala 5 10 15
ATC ATC GTT TAT TTT TTC CAA GCA TTT CAA GGG GTT TTG AAT TTT GAA 152 He He Val Tyr Phe Phe Gin Ala Phe Gin Gly Val Leu Asn Phe Glu 20 25 30
GGG GGT TTT TTA GGG TTT TTT ATC GTG GCG TTG TCT TCG TAT TAC GGC 200 Gly Gly Phe Leu Gly Phe Phe He Val Ala Leu Ser Ser Tyr Tyr Gly 35 40 45 50
GTT AAA AAG CGT TTG GAT TTA AGG AAA CAA AAT TCA ATA GAA AAA GAA 248 Val Lys Lys Arg Leu Asp Leu Arg Lys Gin Asn Ser He Glu Lys Glu 55 60 65
GAA AAG CAA AAA TTC CAA AAA TTC GCC CTG GGC TTG GAA ATG TCT TTC 296 Glu Lys Gin Lys Phe Gin Lys Phe Ala Leu Gly Leu Glu Met Ser Phe 70 75 80
AAT GTG TGG CGT TTA GGA GGG TAT GGG GTT TTA CTA GGC ATT TTA GGA 344 Asn Val Trp Arg Leu Gly Gly Tyr Gly Val Leu Leu Gly He Leu Gly 85 90 95
ACG CTT TTA TTC TTG CAT CTT TTT AAC GGG TTA ATC TTT CTT ATT GGC 392 Thr Leu Leu Phe Leu His Leu Phe Asn Gly Leu He Phe Leu He Gly 100 105 110
GTG TTT GTG AGC TCG CTC TCT AGC GCG TTA TTA CGA TTT TTG AAT AAT 440 Val Phe Val Ser Ser Leu Ser Ser Ala Leu Leu Arg Phe Leu Asn Asn 115 120 125 130
AAT GGT AAG TTT TGACACAAAC TCACATGGAT TTTAACCCCT TTAATCCTCT TTTAA 497 Asn Gly Lys Phe
TTTTTAAT 505
(2) INFORMATION FOR SEQ ID NO: 176:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 134 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:
Met Cys Gin He Gin Cys Leu Leu He Leu Leu Ser He Asn He Val
1 5 10 15
Ser Ala He He Val Tyr Phe Phe Gin Ala Phe Gin Gly Val Leu Asn
20 25 30
Phe Glu Gly Gly Phe Leu Gly Phe Phe He Val Ala Leu Ser Ser Tyr
35 40 45
Tyr Gly Val Lys Lys Arg Leu Asp Leu Arg Lys Gin Asn Ser He Glu
50 55 60
Lys Glu Glu Lys Gin Lys Phe Gin Lys Phe Ala Leu Gly Leu Glu Met 65 70 75 80
Ser Phe Asn Val Trp Arg Leu Gly Gly Tyr Gly Val Leu Leu Gly He
85 90 95
Leu Gly Thr Leu Leu Phe Leu His Leu Phe Asn Gly Leu He Phe Leu
100 105 110
He Gly Val Phe Val Ser Ser Leu Ser Ser Ala Leu Leu Arg Phe Leu
115 120 125
Asn Asn Asn Gly Lys Phe 130
(2) INFORMATION FOR SEQ ID NO: 177:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 511 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...458 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177:
TTTTTGCACT ATCGTTGTTT GCGCTGTGGT GTTTGGCACG CTTGAAAAAA ATG CTC 56
Met Leu 1
AAG AGT ACC ATC AAA GAA GAT TAT TTG ATG CTG ATG TCT AGA GAA GTG 104 Lys Ser Thr He Lys Glu Asp Tyr Leu Met Leu Met Ser Arg Glu Val 5 10 15
AGT GCT TTT GTG GGG ACT CTT TTC TTC ATT GGC TTG AGT TGC TAT GCG 152 Ser Ala Phe Val Gly Thr Leu Phe Phe He Gly Leu Ser Cys Tyr Ala 20 25 30
ATC TAT CAT GGC AAC ATG CCC GAT TAT TTG AGA CCG GCT TTG ATA GAC 200 He Tyr His Gly Asn Met Pro Asp Tyr Leu Arg Pro Ala Leu He Asp 35 40 45 50
ACT ATT AAG GCA GCG AGT GAT TCC ATC TAT TCC AGC TGC GAC TAC ATG 248 Thr He Lys Ala Ala Ser Asp Ser He Tyr Ser Ser Cys Asp Tyr Met 55 60 65
GAT TAT TTT TTG AAG GCT AGA AAG ATG TTA GAG GGG TTT GCT TGG TGG 296 Asp Tyr Phe Leu Lys Ala Arg Lys Met Leu Glu Gly Phe Ala Trp Trp 70 75 80
AGC ATG TTC AAA GCG GAG AGC ATG GGC TTA AAT AAG GGG TTT ATG GTT 344 Ser Met Phe Lys Ala Glu Ser Met Gly Leu Asn Lys Gly Phe Met Val 85 90 95
GCG GGC TGG GTA GCG TTT ATC ATC TAT AAC GCT CTT AGC GGG ATA GCC 392 Ala Gly Trp Val Ala Phe He He Tyr Asn Ala Leu Ser Gly He Ala 100 105 no
ATC AGC AGG CTG AGC GCT CAA ATC ATT TAT TGG TTA TCA AAA TAT TTT 440 He Ser Arg Leu Ser Ala Gin He He Tyr Trp Leu Ser Lys Tyr Phe 115 120 125 130
AGG AGT GAG TAT GGA AAA TGATGTTAAA GAAGATCTAG AGCAAGCAAG ACCAAAGT 496 Arg Ser Glu Tyr Gly Lys 135
TAGAGCCAGA AAAGC 511
(2) INFORMATION FOR SEQ ID NO: 178:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 136 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:178:
Met Leu Lys Ser Thr He Lys Glu Asp Tyr Leu Met Leu Met Ser Arg
1 5 10 15
Glu Val Ser Ala Phe Val Gly Thr Leu Phe Phe He Gly Leu Ser Cys
20 25 30
Tyr Ala He Tyr His Gly Asn Met Pro Asp Tyr Leu Arg Pro Ala Leu
35 40 45
He Asp Thr He Lys Ala Ala Ser Asp Ser He Tyr Ser Ser Cys Asp
50 55 60
Tyr Met Asp Tyr Phe Leu Lys Ala Arg Lys Met Leu Glu Gly Phe Ala 65 70 75 80
Trp Trp Ser Met Phe Lys Ala Glu Ser Met Gly Leu Asn Lys Gly Phe
85 90 95
Met Val Ala Gly Trp Val Ala Phe He He Tyr Asn Ala Leu Ser Gly
100 105 110
He Ala He Ser Arg Leu Ser Ala Gin He He Tyr Trp Leu Ser Lys
115 120 125
Tyr Phe Arg Ser Glu Tyr Gly Lys 130 135
(2) INFORMATION FOR SEQ ID NO: 179:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2203 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...2150 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:
GAGTTACACA CTCTTTGAGA ACAAAACGCC AAACCATTTA GGAAATTACC ATG CTA 56
Met Leu
1
AGA TTC GTT AGT AAA ACG ATT TGC TTG TCT TTA ATC GGC TTG TTC AAC 104 Arg Phe Val Ser Lys Thr He Cys Leu Ser Leu He Gly Leu Phe Asn 5 10 15
CCT TTA GAA GCC TTT CAA AAA CAC CAA AAA GAC GGC TTT TTT ATA GAA 152 Pro Leu Glu Ala Phe Gin Lys His Gin Lys Asp Gly Phe Phe He Glu 20 25 30
GCT GGG TTT GAA ACT GGG TTA TTA GAA GGA ACG CAA ACT AAA GAA GAA 200 Ala Gly Phe Glu Thr Gly Leu Leu Glu Gly Thr Gin Thr Lys Glu Glu 35 40 45 50
GTC ATA ACC ACC CAA AAA ATC TAT GAA AAC CCC CTA ACC CAC CCA CAA 248 Val He Thr Thr Gin Lys He Tyr Glu Asn Pro Leu Thr His Pro Gin 55 60 65
ACT AAA GAA CAG CCT AAA GAA CAA AAT AAA AGC GAT ACG GCC ACC CCA 296 Thr Lys Glu Gin Pro Lys Glu Gin Asn Lys Ser Asp Thr Ala Thr Pro 70 75 80
CAA AGC GCT TAC GGA AAA TAC TAC ATA CCC CAA AGC ACC ATT TTA AAA 344 Gin Ser Ala Tyr Gly Lys Tyr Tyr He Pro Gin Ser Thr He Leu Lys 85 90 95
AAT GCA ACG GCT TTA TTC ACC ACG GAC AAG ATA GAA AAT GGC TTA ACT 392 Asn Ala Thr Ala Leu Phe Thr Thr Asp Lys He Glu Asn Gly Leu Thr 100 105 110
TTT TAT TCT CAA AAC CCT GTG TAT GCG AAT ATG GTT AAT GGG AGC GTA 440 Phe Tyr Ser Gin Asn Pro Val Tyr Ala Asn Met Val Asn Gly Ser Val 115 120 125 130
ACC ATA CAA AAC TTT CTG CCT TAT AAT TTA AAC AAT GTT GAA CTG AGT 488 Thr He Gin Asn Phe Leu Pro Tyr Asn Leu Asn Asn Val Glu Leu Ser 135 140 145
TTT AAA GAC GCT CAA GGC AAG GTG GTC AAT TTA GGC GTG ATA GAG ACC 536 Phe Lys Asp Ala Gin Gly Lys Val Val Asn Leu Gly Val He Glu Thr 150 155 160
ATC CCT AAA CAA TCT CAA ATT ACC TTG CCT GCA AGC TTG TTT AAT GAT 584 He Pro Lys Gin Ser Gin He Thr Leu Pro Ala Ser Leu Phe Asn Asp 165 170 175
TCA GAA TTT GAA CAA GCT GAT AGC TTT AAT TAC CAA CAA CTT CAA GCC 632 Ser Glu Phe Glu Gin Ala Asp Ser Phe Asn Tyr Gin Gin Leu Gin Ala 180 185 190
ACT GCC ACA CAA TTT TCT GAC GCT AAC ACG CAA AGT TTG TTT CAA AAG 680 Thr Ala Thr Gin Phe Ser Asp Ala Asn Thr Gin Ser Leu Phe Gin Lys 195 200 205 210
CTC AGC AAG ATC ACA ACC AAT GTA ACA ATG AGT TAT GAA AAC GCC GAT 728 Leu Ser Lys He Thr Thr Asn Val Thr Met Ser Tyr Glu Asn Ala Asp 215 220 225
ACC AAC AAT TTT AAA GGT AAT TGC CAT GAT TGT GTG TCA GAT TTC ACC 776 Thr Asn Asn Phe Lys Gly Asn Cys His Asp Cys Val Ser Asp Phe Thr 230 235 240
CCA CAA ACC GCA GAA GAA TTG ACC AAT TTA ATG CTA GAT ATG ATT GCG 824 Pro Gin Thr Ala Glu Glu Leu Thr Asn Leu Met Leu Asp Met He Ala 245 250 255
GTG TTT GAC TCT AAA TCG TGG GAA GAA GCC GTT TTA AAC GCT CCT TTC 872 Val Phe Asp Ser Lys Ser Trp Glu Glu Ala Val Leu Asn Ala Pro Phe 260 265 270
CAA TTT TCT AAC AGC TCA TCA GAG TGC GGC TCT GAC TTT CCT AAG TGC 920 Gin Phe Ser Asn Ser Ser Ser Glu Cys Gly Ser Asp Phe Pro Lys Cys 275 280 285 290
GTG AAT CCT TTC AAT AAC GGG CGT GTC GCT CCC ATC TAT GAA AAA TAC 968 Val Asn Pro Phe Asn Asn Gly Arg Val Ala Pro He Tyr Glu Lys Tyr 295 300 305
GTG CTA ACC CCA CAA TCC GTT ATA GAT GCG TTT AGA AGA ACG ATC AAT 1016 Val Leu Thr Pro Gin Ser Val He Asp Ala Phe Arg Arg Thr He Asn 310 315 320
CTT GAA GTG AAT ATC CTA AAA TCA GGG TTT GTA GGG CTA GGG TAT GAA 1064 Leu Glu Val Asn He Leu Lys Ser Gly Phe Val Gly Leu Gly Tyr Glu 325 330 335
CTT GAT GAT AAT GAT GGT AAT CTG GGG ATA GAA GCT TCT GCC TTA AAT 1112 Leu Asp Asp Asn Asp Gly Asn Leu Gly He Glu Ala Ser Ala Leu Asn 340 345 350
CCT GAA AAA TTG TTT GGT AAA ACT TTG AAC AAA GTT GAT ATT GTG GAA 1160 Pro Glu Lys Leu Phe Gly Lys Thr Leu Asn Lys Val Asp He Val Glu 355 360 365 370
TTA AGA GAC ATT ATC CAT GAA TTT AGC CAC ACT AAA GGC TAT ACG CAT 1208 Leu Arg Asp He He His Glu Phe Ser His Thr Lys Gly Tyr Thr His 375 380 385
AAT GGG AAC ATG ACT TAT CAA AGA GTG CGC TTG TGT CAA GAA AAC GGC 1256 Asn Gly Asn Met Thr Tyr Gin Arg Val Arg Leu Cys Gin Glu Asn Gly 390 395 400 GGA GCC ATA CAA GAA TGT GAG GGT GGG AAA GAA GAG TTA GTC AAT GGA 1304 Gly Ala He Gin Glu Cys Glu Gly Gly Lys Glu Glu Leu Val Asn Gly 405 410 415
AAA GAA GAA CTA AAA TTT ACA AAT GGG AAA GAA GTG AAA GAT CAG GAT 1352 Lys Glu Glu Leu Lys Phe Thr Asn Gly Lys Glu Val Lys Asp Gin Asp 420 425 430
GGT TAC ACC TAT GAT GTA TGT TCT TTT TAT AAG GAC AAC CAC CAA GTC 1400 Gly Tyr Thr Tyr Asp Val Cys Ser Phe Tyr Lys Asp Asn His Gin Val 435 440 445 450
TAT ACA GCG AGC AAT TAC CCC AAT TCC ATT TAT ACG AAT TGC GCT CAA 1448 Tyr Thr Ala Ser Asn Tyr Pro Asn Ser He Tyr Thr Asn Cys Ala Gin 455 460 465
GTC CCT GCT GGG CTT ATA GGG GTT ACC ACC GCT GTC TGG CAA CAG CTC 1496 Val Pro Ala Gly Leu He Gly Val Thr Thr Ala Val Trp Gin Gin Leu 470 475 480
ATC AAT CAA AAC GCT CTG CCC ATT AAT TTC GCT AAT CTA AAT AGC CCA 1544 He Asn Gin Asn Ala Leu Pro He Asn Phe Ala Asn Leu Asn Ser Pro 485 490 495
ACC AAC CAC TTA AAC GCC GGG TTG AAC GCA CAA AAT TTT GCA ACC TCT 1592 Thr Asn His Leu Asn Ala Gly Leu Asn Ala Gin Asn Phe Ala Thr Ser 500 505 510
ATA GTC AGC GCG ATC GCG CAA AAT TTT TCC ACC ACT TCC ACC ACC ACT 1640 He Val Ser Ala He Ala Gin Asn Phe Ser Thr Thr Ser Thr Thr Thr 515 520 525 530
TAC CGC TCT TCA AGT AAG AAT TTT AGA AGC CCT ATT TTA GGG GTT AAT 1688 Tyr Arg Ser Ser Ser Lys Asn Phe Arg Ser Pro He Leu Gly Val Asn 535 540 545
GTT AAA ATA GGC TAC CAA CAT TAT TTC AAT GAC TAC ATA GGG TTA. GCC 1736 Val Lys He Gly Tyr Gin His Tyr Phe Asn Asp Tyr He Gly Leu Ala 550 555 560
TAT TAC GGC ATT ATC AAA TAC AAT TAC GCC AAA ACT AAC GAT GAA AAA 1784 Tyr Tyr Gly He He Lys Tyr Asn Tyr Ala Lys Thr Asn Asp Glu Lys 565 570 575
ATC CAG CAA TTA AGC TAT GGT GGG GGA ATG GAT GTG TTG TTT GAT TTC 1832 He Gin Gin Leu Ser Tyr Gly Gly Gly Met Asp Val Leu Phe Asp Phe 580 585 590
ATC ACC ACT TAC GCT AAC AAA AAG CAA GAC AAC CCA ACT AAA AAA GTT 1880 He Thr Thr Tyr Ala Asn Lys Lys Gin Asp Asn Pro Thr Lys Lys Val 595 600 605 610
TTT GCT TCC TCT TTT GGG GTG TTT GGG GGG TTA AGG GGC TTA TAC AAT 1928 Phe Ala Ser Ser Phe Gly Val Phe Gly Gly Leu Arg Gly Leu Tyr Asn 615 620 625 AGC TAT TAT GTC TTC AAC CAA GTC AAA GGA AGC GGT AAT TTA GAT ATA 1976 Ser Tyr Tyr Val Phe Asn Gin Val Lys Gly Ser Gly Asn Leu Asp He 630 635 640
GTT ACT GGG TTT AAT TAC CGC TAC AAG CAT TCT AAA TAT TCT GTA GGC 2024 Val Thr Gly Phe Asn Tyr Arg Tyr Lys His Ser Lys Tyr Ser Val Gly 645 650 655
ATT AGC GTT CCT TTA ATC CAA AGC GGT ATT AAA ATC GCT TCT AAT AAT 2072 He Ser Val Pro Leu He Gin Ser Gly He Lys He Ala Ser Asn Asn 660 665 670
GGC ATC TAT GCG AAC TCC GTT GTT TTG AAT GAA GGG GGC AGT CAT TTT 2120 Gly He Tyr Ala Asn Ser Val Val Leu Asn Glu Gly Gly Ser His Phe 675 680 685 690
AAA GTG TTT TTT AAT TAC GGG TGG ATT TTT TAGGATTTAA AATCCCCAAT AAC 2173 Lys Val Phe Phe Asn Tyr Gly Trp He Phe 695 700
CCCCTAAACT TGTGCGATAC TCGCTACAAA 2203
(2) INFORMATION FOR SEQ ID NO: 180:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 700 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180:
Met Leu Arg Phe Val Ser Lys Thr He Cys Leu Ser Leu He Gly Leu
1 5 10 15
Phe Asn Pro Leu Glu Ala Phe Gin Lys His Gin Lys Asp Gly Phe Phe
20 25 30
He Glu Ala Gly Phe Glu Thr Gly Leu Leu Glu Gly Thr Gin Thr Lys
35 40 45
Glu Glu Val He Thr Thr Gin Lys He Tyr Glu Asn Pro Leu Thr His
50 55 60
Pro Gin Thr Lys Glu Gin Pro Lys Glu Gin Asn Lys Ser Asp Thr Ala 65 70 75 80
Thr Pro Gin Ser Ala Tyr Gly Lys Tyr Tyr He Pro Gin Ser Thr He
85 90 95
Leu Lys Asn Ala Thr Ala Leu Phe Thr Thr Asp Lys He Glu Asn Gly
100 105 110
Leu Thr Phe Tyr Ser Gin Asn Pro Val Tyr Ala Asn Met Val Asn Gly
115 120 125
Ser Val Thr He Gin Asn Phe Leu Pro Tyr Asn Leu Asn Asn Val Glu
130 135 140
Leu Ser Phe Lys Asp Ala Gin Gly Lys Val Val Asn Leu Gly Val He 145 150 155 160 Glu Thr He Pro Lys Gin Ser Gin He Thr Leu Pro Ala Ser Leu Phe
165 170 175
Asn Asp Ser Glu Phe Glu Gin Ala Asp Ser Phe Asn Tyr Gin Gin Leu
180 185 190
Gin Ala Thr Ala Thr Gin Phe Ser Asp Ala Asn Thr Gin Ser Leu Phe
195 200 205
Gin Lys Leu Ser Lys He Thr Thr Asn Val Thr Met Ser Tyr Glu Asn
210 215 220
Ala Asp Thr Asn Asn Phe Lys Gly Asn Cys His Asp Cys Val Ser Asp 225 230 235 240
Phe Thr Pro Gin Thr Ala Glu Glu Leu Thr Asn Leu Met Leu Asp Met
245 250 255
He Ala Val Phe Asp Ser Lys Ser Trp Glu Glu Ala Val Leu Asn Ala
260 265 270
Pro Phe Gin Phe Ser Asn Ser Ser Ser Glu Cys Gly Ser Asp Phe Pro
275 280 285
Lys Cys Val Asn Pro Phe Asn Asn Gly Arg Val Ala Pro He Tyr Glu
290 295 300
Lys Tyr Val Leu Thr Pro Gin Ser Val He Asp Ala Phe Arg Arg Thr 305 310 315 320
He Asn Leu Glu Val Asn He Leu Lys Ser Gly Phe Val Gly Leu Gly
325 330 335
Tyr Glu Leu Asp Asp Asn Asp Gly Asn Leu Gly He Glu Ala Ser Ala
340 345 350
Leu Asn Pro Glu Lys Leu Phe Gly Lys Thr Leu Asn Lys Val Asp He
355 360 365
Val Glu Leu Arg Asp He He His Glu Phe Ser His Thr Lys Gly Tyr
370 375 380
Thr His Asn Gly Asn Met Thr Tyr Gin Arg Val Arg Leu Cys Gin Glu 385 390 395 400
Asn Gly Gly Ala He Gin Glu Cys Glu Gly Gly Lys Glu Glu Leu Val
405 410 415
Asn Gly Lys Glu Glu Leu Lys Phe Thr Asn Gly Lys Glu Val Lys Asp
420 425 430
Gin Asp Gly Tyr Thr Tyr Asp Val Cys Ser Phe Tyr Lys Asp Asn His
435 440 445
Gin Val Tyr Thr Ala Ser Asn Tyr Pro Asn Ser He Tyr Thr Asn Cys
450 455 460
Ala Gin Val Pro Ala Gly Leu He Gly Val Thr Thr Ala Val Trp Gin 465 470 475 480
Gin Leu He Asn Gin Asn Ala Leu Pro He Asn Phe Ala Asn Leu Asn
485 490 495
Ser Pro Thr Asn His Leu Asn Ala Gly Leu Asn Ala Gin Asn Phe Ala
500 505 510
Thr Ser He Val Ser Ala He Ala Gin Asn Phe Ser Thr Thr Ser Thr
515 520 525
Thr Thr Tyr Arg Ser Ser Ser Lys Asn Phe Arg Ser Pro He Leu Gly
530 535 540
Val Asn Val Lys He Gly Tyr Gin His Tyr Phe Asn Asp Tyr He Gly 545 550 555 560
Leu Ala Tyr Tyr Gly He He Lys Tyr Asn Tyr Ala Lys Thr Asn Asp
565 570 575
Glu Lys He Gin Gin Leu Ser Tyr Gly Gly Gly Met Asp Val Leu Phe
580 585 590
Asp Phe He Thr Thr Tyr Ala Asn Lys Lys Gin Asp Asn Pro Thr Lys 595 600 605
Lys Val Phe Ala Ser Ser Phe Gly Val Phe Gly Gly Leu Arg Gly Leu
610 615 620
Tyr Asn Ser Tyr Tyr Val Phe Asn Gin Val Lys Gly Ser Gly Asn Leu 625 630 635 640
Asp He Val Thr Gly Phe Asn Tyr Arg Tyr Lys His Ser Lys Tyr Ser
645 650 655
Val Gly He Ser Val Pro Leu He Gin Ser Gly He Lys He Ala Ser
660 665 670
Asn Asn Gly He Tyr Ala Asn Ser Val Val Leu Asn Glu Gly Gly Ser
675 680 685
His Phe Lys Val Phe Phe Asn Tyr Gly Trp He Phe 690 695 700
(2) INFORMATION FOR SEQ ID NO: 181:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 397 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...344 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:
TCGTTTTGGA AAAAGATATA GCCCATGCGC GTTTCAAGGG TAATGAAAGC ATG GTG 56
Met Val
1
TAT GAA GAA AAT TTT GTG CAT GCC GGG TTT GTG CTT ATT GCG TGC AAT 104 Tyr Glu Glu Asn Phe Val His Ala Gly Phe Val Leu He Ala Cys Asn 5 10 15
TAT GCG GCC TTG TGC GCG TTG AAT AAA AGA CAC AGC GTG GTG GTT TCT 152 Tyr Ala Ala Leu Cys Ala Leu Asn Lys Arg His Ser Val Val Val Ser 20 25 30
AAT AAC ATC AAT TTT TAT GCC CCC CTA GAA TTG AAT CAA GAA GCA CTC 200 Asn Asn He Asn Phe Tyr Ala Pro Leu Glu Leu Asn Gin Glu Ala Leu 35 40 45 50
ATT AAA GCG CAA GTG ATT CAA GAT GGC GTG AAA AAA GCT GAA ATA AAA 248 He Lys Ala Gin Val He Gin Asp Gly Val Lys Lys Ala Glu He Lys 55 60 65
ATA GAG GCG TTT GTG TTA GAC ATT CAG GTT TTA GAG GGA ATG ATA GAA 296 He Glu Ala Phe Val Leu Asp He Gin Val Leu Glu Gly Met He Glu 70 75 80
ATT GTG GTG TTT GAT AAA AAG CCT TTT AAA TTC AAT TTT AAA GAA GAG T 345 He Val Val Phe Asp Lys Lys Pro Phe Lys Phe Asn Phe Lys Glu Glu 85 90 95
AGTTAAATGG TTATTGTTTT AGTCGTGGAT AGTTTTAAAG ACACCAGTAA TG 397
(2) INFORMATION FOR SEQ ID NO: 182:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 98 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:182:
Met Val Tyr Glu Glu Asn Phe Val His Ala Gly Phe Val Leu He Ala
1 5 10 15
Cys Asn Tyr Ala Ala Leu Cys Ala Leu Asn Lys Arg His Ser Val Val
20 25 30
Val Ser Asn Asn He Asn Phe Tyr Ala Pro Leu Glu Leu Asn Gin Glu
35 40 45
Ala Leu He Lys Ala Gin Val He Gin Asp Gly Val Lys Lys Ala Glu
50 55 60
He Lys He Glu Ala Phe Val Leu Asp He Gin Val Leu Glu Gly Met 65 70 75 80
He Glu He Val Val Phe Asp Lys Lys Pro Phe Lys Phe Asn Phe Lys
85 90 95
Glu Glu
(2) INFORMATION FOR SEQ ID NO:183:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1261 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1208 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183:
ATGATAGTAA GGAAATAAGA GTGGAATGCA AGAATCACCC TATTGAAAAG ATG GCA 56
Met Ala 1
GAA AAA TTA GAG GAA ACT AAT CCT GAA TGG TTT GAA AAA TGG AGG GAA 104 Glu Lys Leu Glu Glu Thr Asn Pro Glu Trp Phe Glu Lys Trp Arg Glu 5 10 15
AAA CAA TAC ACC CAA ACT GGC GAA TCT AAG CCA TCA AAA CGA ATC AAA 152 Lys Gin Tyr Thr Gin Thr Gly Glu Ser Lys Pro Ser Lys Arg He Lys 20 25 30
GTT TTT AAA AAC TTT ACG GCA TTT GAT GAC AGA TTG TAT ACA ATT GAA 200 Val Phe Lys Asn Phe Thr Ala Phe Asp Asp Arg Leu Tyr Thr He Glu 35 40 45 50
TGT AAT TTA AAA AAT CTG GAT ACC CAT CAA AAA AAG TTT GAA ATT TGT 248 Cys Asn Leu Lys Asn Leu Asp Thr His Gin Lys Lys Phe Glu He Cys 55 60 65
GGG GCT CTG TAT GAC ATT TAT GAA CAA ATT TTT GAT GAA ACA CCA AGC 296 Gly Ala Leu Tyr Asp He Tyr Glu Gin He Phe Asp Glu Thr Pro Ser 70 75 80
TTG AAA GGG CGC GAT TTA GAA ACA TAC AAA GCA CAA GAT TTG TCA AAG 344 Leu Lys Gly Arg Asp Leu Glu Thr Tyr Lys Ala Gin Asp Leu Ser Lys 85 90 95
AAA TTC ATG CAT TTA GGT TTT GAA CAG ATC TCA AAA GAT TTA AAC GAC 392 Lys Phe Met His Leu Gly Phe Glu Gin He Ser Lys Asp Leu Asn Asp 100 105 110
TCT AGA TTG AAC GCT TTA TTG TGC TAT GAG GAA AAA GTC ATG CAA GCT 440 Ser Arg Leu Asn Ala Leu Leu Cys Tyr Glu Glu Lys Val Met Gin Ala 115 120 125 130
TTG GCT AAA AAA TAC CCT AGT TTT TTA CAA GAT TTG CAT GAT ATA AAA 488 Leu Ala Lys Lys Tyr Pro Ser Phe Leu Gin Asp Leu His Asp He Lys 135 140 145
AAA TAC AGG AAT AAA GAT AAA CAC GGC GAG AAA CCA CAA GAT GGG TCT 536 Lys Tyr Arg Asn Lys Asp Lys His Gly Glu Lys Pro Gin Asp Gly Ser 150 155 160
TCT TTA ACG AGA GTG GAA TTA GAA AGA TAC AGA GAT GGA ATT TAT TTT 584 Ser Leu Thr Arg Val Glu Leu Glu Arg Tyr Arg Asp Gly He Tyr Phe 165 170 175
CTA GTA GAA AAT CTT TTA AAA AAC CCC TTG ATT AAA GAG AGA GAA AAT 632 Leu Val Glu Asn Leu Leu Lys Asn Pro Leu He Lys Glu Arg Glu Asn 180 185 190
GCT CAA GAA GAA AAA CAT TAT AAG AAA AAT GCA GAG ATT GAC GAC CGA 680 Ala Gin Glu Glu Lys His Tyr Lys Lys Asn Ala Glu He Asp Asp Arg 195 200 205 210
TCC CAG CTA TCA AAC TTA AAC GCA CCC AAA CCC TTA TTT GAA TGT TTT 728 Ser Gin Leu Ser Asn Leu Asn Ala Pro Lys Pro Leu Phe Glu Cys Phe 215 220 225
GTA GGA GTT AAT CTG GCC AAA GCC AAA TAT TAT TCT AAA AAA GAA GAA 776 Val Gly Val Asn Leu Ala Lys Ala Lys Tyr Tyr Ser Lys Lys Glu Glu 230 235 240
AGA GAA AAA GAA AAG ATG ATC TTG AAT TTT TGT AAG ATA TTT GAA ATT 824 Arg Glu Lys Glu Lys Met He Leu Asn Phe Cys Lys He Phe Glu He 245 250 255
ATT CTT TTT GAA GCT ATC CAA AAA CAA CCA AAG CCT GAT TTT AAA AAT 872 He Leu Phe Glu Ala He Gin Lys Gin Pro Lys Pro Asp Phe Lys Asn 260 265 270
AAA GAC GAG CTT TTA GGG GAT TAT CCT AAT CTT AAA AAT TTA GAT TCT 920 Lys Asp Glu Leu Leu Gly Asp Tyr Pro Asn Leu Lys Asn Leu Asp Ser 275 280 285 290
TTA AGA GAA GTG AGG GAA GAC TTT TTG AAA AGA GCG TTT AAG AAT GAT 968 Leu Arg Glu Val Arg Glu Asp Phe Leu Lys Arg Ala Phe Lys Asn Asp 295 300 305
GAA GCG AGT TTG GGA GCG TAT GTG TTA GTG TTG CTT AGC TGT AAG TAT 1016 Glu Ala Ser Leu Gly Ala Tyr Val Leu Val Leu Leu Ser Cys Lys Tyr 310 315 320
TTT GAG AGC GTG TTT GAA AAA GTT CAA GAA TGG CTA GAT TTT ATC GCT 1064 Phe Glu Ser Val Phe Glu Lys Val Gin Glu Trp Leu Asp Phe He Ala 325 330 335
AGG CTT ATT GCT TTG AGA GGC CAT GTG CAC AAG ATA ACT AAA GAA CTT 1112 Arg Leu He Ala Leu Arg Gly His Val His Lys He Thr Lys Glu Leu 340 345 350
GAA AGA TTA GAA GAA GAG GAT TTA GAA AAA TTG GAA AAA CAA GCA CTA 1160 Glu Arg Leu Glu Glu Glu Asp Leu Glu Lys Leu Glu Lys Gin Ala Leu 355 360 365 370
GAA TAT TTT AAT AAA ATA GCA AAT AAA ATA TAT CTA AAG GAG AAA CGA T 1209 Glu Tyr Phe Asn Lys He Ala Asn Lys He Tyr Leu Lys Glu Lys Arg 375 380 385
GAGCGGGAAT GAAGAATTGG AGCTAAGAGC CAGAGAAACT GAGTTGGATA AA 1261
(2) INFORMATION FOR SEQ ID NO: 184:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 386 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:184:
Met Ala Glu Lys Leu Glu Glu Thr Asn Pro Glu Trp Phe Glu Lys Trp
1 5 10 15
Arg Glu Lys Gin Tyr Thr Gin Thr Gly Glu Ser Lys Pro Ser Lys Arg
20 25 30
He Lys Val Phe Lys Asn Phe Thr Ala Phe Asp Asp Arg Leu Tyr Thr
35 40 45
He Glu Cys Asn Leu Lys Asn Leu Asp Thr His Gin Lys Lys Phe Glu
50 55 60
He Cys Gly Ala Leu Tyr Asp He Tyr Glu Gin He Phe Asp Glu Thr 65 70 75 80
Pro Ser Leu Lys Gly Arg Asp Leu Glu Thr Tyr Lys Ala Gin Asp Leu
85 90 95
Ser Lys Lys Phe Met His Leu Gly Phe Glu Gin He Ser Lys Asp Leu
100 105 110
Asn Asp Ser Arg Leu Asn Ala Leu Leu Cys Tyr Glu Glu Lys Val Met
115 120 125
Gin Ala Leu Ala Lys Lys Tyr Pro Ser Phe Leu Gin Asp Leu His Asp
130 135 140
He Lys Lys Tyr Arg Asn Lys Asp Lys His Gly Glu Lys Pro Gin Asp 145 150 155 160
Gly Ser Ser Leu Thr Arg Val Glu Leu Glu Arg Tyr Arg Asp Gly He
165 170 175
Tyr Phe Leu Val Glu Asn Leu Leu Lys Asn Pro Leu He Lys Glu Arg
180 185 190
Glu Asn Ala Gin Glu Glu Lys His Tyr Lys Lys Asn Ala Glu He Asp
195 200 205
Asp Arg Ser Gin Leu Ser Asn Leu Asn Ala Pro Lys Pro Leu Phe Glu
210 215 220
Cys Phe Val Gly Val Asn Leu Ala Lys Ala Lys Tyr Tyr Ser Lys Lys 225 230 235 240
Glu Glu Arg Glu Lys Glu Lys Met He Leu Asn Phe Cys Lys He Phe
245 250 255
Glu He He Leu Phe Glu Ala He Gin Lys Gin Pro Lys Pro Asp Phe
260 265 270
Lys Asn Lys Asp Glu Leu Leu Gly Asp Tyr Pro Asn Leu Lys Asn Leu
275 280 285
Asp Ser Leu Arg Glu Val Arg Glu Asp Phe Leu Lys Arg Ala Phe Lys
290 295 300
Asn Asp Glu Ala Ser Leu Gly Ala Tyr Val Leu Val Leu Leu Ser Cys 305 310 315 320
Lys Tyr Phe Glu Ser Val Phe Glu Lys Val Gin Glu Trp Leu Asp Phe
325 330 335
He Ala Arg Leu He Ala Leu Arg Gly His Val His Lys He Thr Lys
340 345 350
Glu Leu Glu Arg Leu Glu Glu Glu Asp Leu Glu Lys Leu Glu Lys Gin
355 360 365
Ala Leu Glu Tyr Phe Asn Lys He Ala Asn Lys He Tyr Leu Lys Glu 370 375 380 Lys Arg 385
(2) INFORMATION FOR SEQ ID NO: 185:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 412 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...359 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185:
CATTTAATGC TAAGTCTAAT AAGATTGCCC TAGATAGACA TTACGCCAAA ATG TTT 56
Met Phe
1
TTG CAA GTT GTA GCA AGA ACT CTA AGA AAG AAT GTC AAT ATA TTA GAA 104 Leu Gin Val Val Ala Arg Thr Leu Arg Lys Asn Val Asn He Leu Glu 5 10 15
GAG CAA GGT TTT ATT GAA GTC ATT AAA GGA AAA CAA AGA TAC TTG TAT 152 Glu Gin Gly Phe He Glu Val He Lys Gly Lys Gin Arg Tyr Leu Tyr 20 25 30
GTG TAT CTT AAA GAT TAC AGA GAA TTA GAG GGC TAT AAC TCC GTA GGA 200 Val Tyr Leu Lys Asp Tyr Arg Glu Leu Glu Gly Tyr Asn Ser Val Gly 35 40 45 50
GCT AAT CAA AAG AAC AAT ATC CCA TCG CCT TTT TTC TTA CAG ATT ATG 248 Ala Asn Gin Lys Asn Asn He Pro Ser Pro Phe Phe Leu Gin He Met 55 60 65
CGT TTC TTA GAA AAG TTT GCC AAA GAA ATT GAG AGA GTA AAA ATA ACA 296 Arg Phe Leu Glu Lys Phe Ala Lys Glu He Glu Arg Val Lys He Thr 70 75 80
ACA AAG AAT GTG TTA TGC ATA TTC CTA GCC AAG AGC TTA TGC AAA GAG 344 Thr Lys Asn Val Leu Cys He Phe Leu Ala Lys Ser Leu Cys Lys Glu 85 90 95
TTA ATA ATG TTG TTT TAAAATTCAC GCCTATTTCT AATCCTAATA CCACTTACAC T 400 Leu He Met Leu Phe 100 TTATCCTACA AG 412
(2) INFORMATION FOR SEQ ID NO: 186:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 103 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186:
Met Phe Leu Gin Val Val Ala Arg Thr Leu Arg Lys Asn Val Asn He
1 5 10 15
Leu Glu Glu Gin Gly Phe He Glu Val He Lys Gly Lys Gin Arg Tyr
20 25 30
Leu Tyr Val Tyr Leu Lys Asp Tyr Arg Glu Leu Glu Gly Tyr Asn Ser
35 40 45
Val Gly Ala Asn Gin Lys Asn Asn He Pro Ser Pro Phe Phe Leu Gin
50 55 60
He Met Arg Phe Leu Glu Lys Phe Ala Lys Glu He Glu Arg Val Lys 65 70 75 80
He Thr Thr Lys Asn Val Leu Cys He Phe Leu Ala Lys Ser Leu Cys
85 90 95
Lys Glu Leu He Met Leu Phe 100
(2) INFORMATION FOR SEQ ID NO: 187:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1204 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1151 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:187:
TTCTATTAAA ATTAGTGTAT GATTGAGATT ATTTTTGATT AGGATCAACC ATG CAA 56
Met Gin 1
AAA GCC TTA TTA CAT TCA TCA TTC TTT TTA CCT TTA TTT TTA TCT TTT 104 Lys Ala Leu Leu His Ser Ser Phe Phe Leu Pro Leu Phe Leu Ser Phe 5 10 15
TGT ATC GCT GAA GAA AAT GGG GCG TAT GCG AGC GTG GGT TTT GAA TAT 152 Cys He Ala Glu Glu Asn Gly Ala Tyr Ala Ser Val Gly Phe Glu Tyr 20 25 30
TCC ATT AGT CAT GCC GTT GAA CAC AAT AAC CCC TTT TTA AAT CAA GAA 200 Ser He Ser His Ala Val Glu His Asn Asn Pro Phe Leu Asn Gin Glu 35 40 45 50
CGC ATC CAA ATC ATT TCT AAC GCT CAA AAT AAA ATC TAT AAA CTC CAT 248 Arg He Gin He He Ser Asn Ala Gin Asn Lys He Tyr Lys Leu His 55 60 65
CAA GTT AAA AAT GAA ATC ACA AGC ATG CCT AAA ACC TTT GCA TAT ATC 296 Gin Val Lys Asn Glu He Thr Ser Met Pro Lys Thr Phe Ala Tyr He 70 75 80
AAC AAC GCT TTA AAA AAC AAC TCC AAA TTA ACC CCC ACT GAA ATG CAA 344 Asn Asn Ala Leu Lys Asn Asn Ser Lys Leu Thr Pro Thr Glu Met Gin 85 90 95
GCC GAA CAA TAC TAC CTC CAA TCC ACC TTT CAA AAC ATT GAA AAA ATA 392 Ala Glu Gin Tyr Tyr Leu Gin Ser Thr Phe Gin Asn He Glu Lys He 100 105 110
GTA ATG CTT AGC GGT GGC GTT TCA TCT AAC CCA CAA TTA GTC CAA GCG 440 Val Met Leu Ser Gly Gly Val Ser Ser Asn Pro Gin Leu Val Gin Ala 115 120 125 130
TTG GAA AAA ATG CAA GAA CCC ATT ACT AAC CCT TTA GAA TTT GAA GAA 488 Leu Glu Lys Met Gin Glu Pro He Thr Asn Pro Leu Glu Phe Glu Glu 135 140 145
AAC TTA AGA AAT TTA GAA GTG CAA TTT GCT CAA TCT CAA AAC CGC ATG 536 Asn Leu Arg Asn Leu Glu Val Gin Phe Ala Gin Ser Gin Asn Arg Met 150 155 160
CTT TCT TCT TTA TCT TCT CAA ATC GCT GCC ATT TCA AAT TCC TTA AAC 584 Leu Ser Ser Leu Ser Ser Gin He Ala Ala He Ser Asn Ser Leu Asn 165 170 175
GCG CTT GAT CCT AAC TCT TAT TCT AAA AAC ATT TCA AGC ATG TAT GGG 632 Ala Leu Asp Pro Asn Ser Tyr Ser Lys Asn He Ser Ser Met Tyr Gly 180 185 190
GTG AGT TTG AGC GTA GGT TAT AAG CAT TTC TTT ACC AAG AAA AAA AAT 680 Val Ser Leu Ser Val Gly Tyr Lys His Phe Phe Thr Lys Lys Lys Asn 195 200 205 210
CAA GGG TTG CGC TAT TAC TTG TTT TAT GAC TAT GGT TAC ACT AAT TTT 728 Gin Gly Leu Arg Tyr Tyr Leu Phe Tyr Asp Tyr Gly Tyr Thr Asn Phe 215 220 225 GGT TTT GTG GGC AAT GGC TTT GAT GGT TTA GGC AAA ATG AAT AAC CAT 776 Gly Phe Val Gly Asn Gly Phe Asp Gly Leu Gly Lys Met Asn Asn His 230 235 240
CTC TAT GGG CTT GGG ATA GAC TAT CTT TAT AAT TTC ATT GAT AAT GCA 824 Leu Tyr Gly Leu Gly He Asp Tyr Leu Tyr Asn Phe He Asp Asn Ala 245 250 255
AAA AAA CAC TCT AGC GTA GGT TTT TAT CTG GGT TTT GCT TTA GCG GGG 872 Lys Lys His Ser Ser Val Gly Phe Tyr Leu Gly Phe Ala Leu Ala Gly 260 265 270
AGT TCG TGG GTA GGG AGT GGT TTG AGC ATG TGG GTG AGC CAA ACG GAT 920 Ser Ser Trp Val Gly Ser Gly Leu Ser Met Trp Val Ser Gin Thr Asp 275 280 285 290
TTT ATC AAC AAT TAC TTG ACG GGC TAT CAA GCT AAA ATG CAC ACG AGT 968 Phe He Asn Asn Tyr Leu Thr Gly Tyr Gin Ala Lys Met His Thr Ser 295 300 305
TTT TTC CAG ATC CCT TTG AAT TTT GGG GTT CGT GTG AAT GTC AAT AGG 1016 Phe Phe Gin He Pro Leu Asn Phe Gly Val Arg Val Asn Val Asn Arg 310 315 320
CAT AAT GGC TTT GAA ATG GGC TTG AAA ATC CCT TTA GCG ATG AAT TCC 1064 His Asn Gly Phe Glu Met Gly Leu Lys He Pro Leu Ala Met Asn Ser 325 330 335
TTT TAT GAA ACG CAT GGC AAA GGG CTA AAC ACT TCC CTC TTT TTC AAA 1112 Phe Tyr Glu Thr His Gly Lys Gly Leu Asn Thr Ser Leu Phe Phe Lys 340 345 350
CGC CTT GTC ATG TTT AAC GTG AGT TAC GTT TAT AGT TTT TAGGGGGGTA AA 1163 Arg Leu Val Met Phe Asn Val Ser Tyr Val Tyr Ser Phe 355 360 365
TGCCTTCAAA CGCTCTTTTG ATTGAAGAAA TCACTCATTT A 1204
(2) INFORMATION FOR SEQ ID NO: 188:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 367 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:188:
Met Gin Lys Ala Leu Leu His Ser Ser Phe Phe Leu Pro Leu Phe Leu
1 5 10 15
Ser Phe Cys He Ala Glu Glu Asn Gly Ala Tyr Ala Ser Val Gly Phe 20 25 30 Glu Tyr Ser He Ser His Ala Val Glu His Asn Asn Pro Phe Leu Asn
35 40 45
Gin Glu Arg He Gin He He Ser Asn Ala Gin Asn Lys He Tyr Lys
50 55 60
Leu His Gin Val Lys Asn Glu He Thr Ser Met Pro Lys Thr Phe Ala 65 70 75 80
Tyr He Asn Asn Ala Leu Lys Asn Asn Ser Lys Leu Thr Pro Thr Glu
85 90 95
Met Gin Ala Glu Gin Tyr Tyr Leu Gin Ser Thr Phe Gin Asn He Glu
100 105 110
Lys He Val Met Leu Ser Gly Gly Val Ser Ser Asn Pro Gin Leu Val
115 120 125
Gin Ala Leu Glu Lys Met Gin Glu Pro He Thr Asn Pro Leu Glu Phe
130 135 140
Glu Glu Asn Leu Arg Asn Leu Glu Val Gin Phe Ala Gin Ser Gin Asn 145 150 155 160
Arg Met Leu Ser Ser Leu Ser Ser Gin He Ala Ala He Ser Asn Ser
165 170 175
Leu Asn Ala Leu Asp Pro Asn Ser Tyr Ser Lys Asn He Ser Ser Met
180 185 190
Tyr Gly Val Ser Leu Ser Val Gly Tyr Lys His Phe Phe Thr Lys Lys
195 200 205
Lys Asn Gin Gly Leu Arg Tyr Tyr Leu Phe Tyr Asp Tyr Gly Tyr Thr
210 215 220
Asn Phe Gly Phe Val Gly Asn Gly Phe Asp Gly Leu Gly Lys Met Asn 225 230 235 240
Asn His Leu Tyr Gly Leu Gly He Asp Tyr Leu Tyr Asn Phe He Asp
245 250 255
Asn Ala Lys Lys His Ser Ser Val Gly Phe Tyr Leu Gly Phe Ala Leu
260 265 270
Ala Gly Ser Ser Trp Val Gly Ser Gly Leu Ser Met Trp Val Ser Gin
275 280 285
Thr Asp Phe He Asn Asn Tyr Leu Thr Gly Tyr Gin Ala Lys Met His
290 295 300
Thr Ser Phe Phe Gin He Pro Leu Asn Phe Gly Val Arg Val Asn Val 305 310 315 320
Asn Arg His Asn Gly Phe Glu Met Gly Leu Lys He Pro Leu Ala Met
325 330 335
Asn Ser Phe Tyr Glu Thr His Gly Lys Gly Leu Asn Thr Ser Leu Phe
340 345 350
Phe Lys Arg Leu Val Met Phe Asn Val Ser Tyr Val Tyr Ser Phe 355 360 365
(2) INFORMATION FOR SEQ ID NO: 189:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1687 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence (B) LOCATION: 51...1634 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189:
TTATTTTTAC AGAGTAATTT ATCTATTCTC AGGTAAAGTA AGGAAGAGGA ATG AAA 56
Met Lys 1
TTA AAG AAA CGA AAA GTT GCG GCT GCA TTG CTA AAG CGT TTT ACC TTG 104 Leu Lys Lys Arg Lys Val Ala Ala Ala Leu Leu Lys Arg Phe Thr Leu 5 10 15
CCA CTA TTG TTC ACT ACG GGT TCA TTA GGG GCG GTT ACT TAT GAA GTG 152 Pro Leu Leu Phe Thr Thr Gly Ser Leu Gly Ala Val Thr Tyr Glu Val 20 25 30
CAT GGA GAT TTT ATC AAT TTT GCT AAA GTG GGT TTT AAC CAT TCG CCC 200 His Gly Asp Phe He Asn Phe Ala Lys Val Gly Phe Asn His Ser Pro 35 40 45 50
ATT AAT CCT GTT AAA GGT ATC TAT CCC ACA GAA ACT TTT GTT AAC CTT 248 He Asn Pro Val Lys Gly He Tyr Pro Thr Glu Thr Phe Val Asn Leu 55 60 65
ACG GGT AAG CTA GAG GGG TCT GTG CAT TTA GGT AGG GGA TGG ACC GTG 296 Thr Gly Lys Leu Glu Gly Ser Val His Leu Gly Arg Gly Trp Thr Val 70 75 80
AAT TTA GGC GGT GTT TTG GGC GGA CAG GCT TAT GAT GGC ACT AAG TAT 344 Asn Leu Gly Gly Val Leu Gly Gly Gin Ala Tyr Asp Gly Thr Lys Tyr 85 90 95
GAT AGG TGG GCG AAG GAT TTT ACC CCC CCA AGC TAT TGG GAT AAA ACT 392 Asp Arg Trp Ala Lys Asp Phe Thr Pro Pro Ser Tyr Trp Asp Lys Thr 100 105 110
TCT TGC GGT ACT GAT TCT ATG AGT CTT TGT ATG AAT GCC ACT AAA ATG 440 Ser Cys Gly Thr Asp Ser Met Ser Leu Cys Met Asn Ala Thr Lys Met 115 120 125 130
TGG CAG CAA TCA GGG CCA GGT GGC GTC ATT AAC CCT AGA GGT ATT GGT 488 Trp Gin Gin Ser Gly Pro Gly Gly Val He Asn Pro Arg Gly He Gly 135 140 145
TGG GAA TAC ATG GGT GAG TGG AAC GGC TTG TTC CCT AAC TAC TAT CCG 536 Trp Glu Tyr Met Gly Glu Trp Asn Gly Leu Phe Pro Asn Tyr Tyr Pro 150 155 160
GCT AAC GCC TAC TTG CCT GGT GGC TCA AGG CGC TAT CAA GTC TAT AAA 584 Ala Asn Ala Tyr Leu Pro Gly Gly Ser Arg Arg Tyr Gin Val Tyr Lys 165 170 175 GCA AAT TTG ACC TAT GAT AGC GAC AGG GTC CAT ATG GTA ATG GGG CGT 632 Ala Asn Leu Thr Tyr Asp Ser Asp Arg Val His Met Val Met Gly Arg 180 185 190
TTT GAC ATT ACC GAG CAG GAG CAA ATG GAT TGG ATT TAC CAA TTG TTC 680 Phe Asp He Thr Glu Gin Glu Gin Met Asp Trp He Tyr Gin Leu Phe 195 200 205 210
CAA GGG TTT TAT GGG ACT TTC AAG CTC ACT AAG AAT ATG AAA TTC TTG 728 Gin Gly Phe Tyr Gly Thr Phe Lys Leu Thr Lys Asn Met Lys Phe Leu 215 220 225
CTC TTT AGT GGT TGG GGT CGT GGT ATC GCT GAT GGT CAG TGG TTG TTC 776 Leu Phe Ser Gly Trp Gly Arg Gly He Ala Asp Gly Gin Trp Leu Phe 230 235 240
CCT ATC TAT CGT GAA AAG CCT TGG GGG GTT CAT AAA GCG GGT ATT ATT 824 Pro He Tyr Arg Glu Lys Pro Trp Gly Val His Lys Ala Gly He He 245 250 255
TAT CGC CCT ACA AAG AAT TTG ATG ATC CAC CCT TAT GTG TAT CTT ATC 872 Tyr Arg Pro Thr Lys Asn Leu Met He His Pro Tyr Val Tyr Leu He 260 265 270
CCA ATG GTA GGC ACA TTG CCT GGT GCT AAA ATA GAA TAC GAT ACC AAT 920 Pro Met Val Gly Thr Leu Pro Gly Ala Lys He Glu Tyr Asp Thr Asn 275 280 285 290
CCT GAA TTT AGC GGT AGG GGC ATT AGG AAC AGA ACG ACT TTC TAT GCG 968 Pro Glu Phe Ser Gly Arg Gly He Arg Asn Arg Thr Thr Phe Tyr Ala 295 300 305
TTG TAT GAC TAT CGT TGG AAT AAC GCT GAA TAC GGT CGT TAC GCG CCC 1016 Leu Tyr Asp Tyr Arg Trp Asn Asn Ala Glu Tyr Gly Arg Tyr Ala Pro 310 315 320
GCT CGT TAT AAC ACT TGG GAT CCG TTC TTG GAT AAT GGT AAG TGG CGT 1064 Ala Arg Tyr Asn Thr Trp Asp Pro Phe Leu Asp Asn Gly Lys Trp Arg 325 330 335
GGC TTG CAA GGT CCT GGT GGT GCG ACG CTC CTT TTA CGC CAC CAT ATA 1112 Gly Leu Gin Gly Pro Gly Gly Ala Thr Leu Leu Leu Arg His His He 340 345 350
GAT ATT AAC AAC TAC TTT GTG GTT GGT GGT GCT TAT CTC AAC ATT GGT 1160 Asp He Asn Asn Tyr Phe Val Val Gly Gly Ala Tyr Leu Asn He Gly 355 360 365 370
AAC CCT AAC ATG AAC TTA GGT ACT TGG GGT AAC CCT GTG GCT GTT GAT 1208 Asn Pro Asn Met Asn Leu Gly Thr Trp Gly Asn Pro Val Ala Val Asp 375 380 385
GGT ATC GAA CAA TGG GTC GGT AGT ATC TAT AGC TTA GGG TTT GCG GGG 1256 Gly He Glu Gin Trp Val Gly Ser He Tyr Ser Leu Gly Phe Ala Gly 390 395 400 ATT GAC AAC ATT ACC GAT GCT GAC GCG TTC ACC GAG TAT GTT AAA GGT 1304 He Asp Asn He Thr Asp Ala Asp Ala Phe Thr Glu Tyr Val Lys Gly 405 410 415
GGA GGC AAG CAT GGT AAG TTT AGT TGG AGC GTT TAT CAG CGC TTC ACT 1352 Gly Gly Lys His Gly Lys Phe Ser Trp Ser Val Tyr Gin Arg Phe Thr 420 425 430
ACC GCT CCA AGG GCT TTG GAA TAT GGT ATC GGT ATG TAT CTA GAC TAT 1400 Thr Ala Pro Arg Ala Leu Glu Tyr Gly He Gly Met Tyr Leu Asp Tyr 435 440 445 450
CAG TTC AGC AAG CAT GTT AAA GCG GGT CTC AAA CTC GTA TGG TTA GAG 1448 Gin Phe Ser Lys His Val Lys Ala Gly Leu Lys Leu Val Trp Leu Glu 455 460 465
TTC CAA ATT CGT GCG GGT TAC AAC CCT GGA ACC GGT TTC CTT GGG CCA 1496 Phe Gin He Arg Ala Gly Tyr Asn Pro Gly Thr Gly Phe Leu Gly Pro 470 475 480
AAC GGT CAG CCG CTT AAC TTG AAT ACT GGT TTG TTT GAG TCT TCA GCG 1544 Asn Gly Gin Pro Leu Asn Leu Asn Thr Gly Leu Phe Glu Ser Ser Ala 485 490 495
TTC GCT CAA GGC CCT CAA AAC ATG GGC GGT ATC GCA AAA AGC ATC ACT 1592 Phe Ala Gin Gly Pro Gin Asn Met Gly Gly He Ala Lys Ser He Thr 500 505 510
CAA GAC AGA AGC CAT TTG ATG ACA CAC ATT AGT TAT AGT TTC TAAGAGAGT 1643 Gin Asp Arg Ser His Leu Met Thr His He Ser Tyr Ser Phe 515 520 525
TCTCCCCCTA TCTCTTAGAT ATGCCTTTTT GTATTTTTAT TTTA 1687
(2) INFORMATION FOR SEQ ID NO: 190:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 528 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190:
Met Lys Leu Lys Lys Arg Lys Val Ala Ala Ala Leu Leu Lys Arg Phe
1 5 10 15
Thr Leu Pro Leu Leu Phe Thr Thr Gly Ser Leu Gly Ala Val Thr Tyr
20 25 30
Glu Val His Gly Asp Phe He Asn Phe Ala Lys Val Gly Phe Asn His
35 40 45
Ser Pro He Asn Pro Val Lys Gly He Tyr Pro Thr Glu Thr Phe Val 50 55 60 Asn Leu Thr Gly Lys Leu Glu Gly Ser Val His Leu Gly Arg Gly Trp 65 70 75 80
Thr Val Asn Leu Gly Gly Val Leu Gly Gly Gin Ala Tyr Asp Gly Thr
85 90 95
Lys Tyr Asp Arg Trp Ala Lys Asp Phe Thr Pro Pro Ser Tyr Trp Asp
100 105 110
Lys Thr Ser Cys Gly Thr Asp Ser Met Ser Leu Cys Met Asn Ala Thr
115 120 125
Lys Met Trp Gin Gin Ser Gly Pro Gly Gly Val He Asn Pro Arg Gly
130 135 140
He Gly Trp Glu Tyr Met Gly Glu Trp Asn Gly Leu Phe Pro Asn Tyr 145 150 155 160
Tyr Pro Ala Asn Ala Tyr Leu Pro Gly Gly Ser Arg Arg Tyr Gin Val
165 170 175
Tyr Lys Ala Asn Leu Thr Tyr Asp Ser Asp Arg Val His Met Val Met
180 185 190
Gly Arg Phe Asp He Thr Glu Gin Glu Gin Met Asp Trp He Tyr Gin
195 200 205
Leu Phe Gin Gly Phe Tyr Gly Thr Phe Lys Leu Thr Lys Asn Met Lys
210 215 220
Phe Leu Leu Phe Ser Gly Trp Gly Arg Gly He Ala Asp Gly Gin Trp 225 230 235 240
Leu Phe Pro He Tyr Arg Glu Lys Pro Trp Gly Val His Lys Ala Gly
245 250 255
He He Tyr Arg Pro Thr Lys Asn Leu Met He His Pro Tyr Val Tyr
260 265 270
Leu He Pro Met Val Gly Thr Leu Pro Gly Ala Lys He Glu Tyr Asp
275 280 285
Thr Asn Pro Glu Phe Ser Gly Arg Gly He Arg Asn Arg Thr Thr Phe
290 295 300
Tyr Ala Leu Tyr Asp Tyr Arg Trp Asn Asn Ala Glu Tyr Gly Arg Tyr 305 310 315 320
Ala Pro Ala Arg Tyr Asn Thr Trp Asp Pro Phe Leu Asp Asn Gly Lys
325 330 335
Trp Arg Gly Leu Gin Gly Pro Gly Gly Ala Thr Leu Leu Leu Arg His
340 345 350
His He Asp He Asn Asn Tyr Phe Val Val Gly Gly Ala Tyr Leu Asn
355 360 365
He Gly Asn Pro Asn Met Asn Leu Gly Thr Trp Gly Asn Pro Val Ala
370 375 380
Val Asp Gly He Glu Gin Trp Val Gly Ser He Tyr Ser Leu Gly Phe 385 390 395 400
Ala Gly He Asp Asn He Thr Asp Ala Asp Ala Phe Thr Glu Tyr Val
405 410 415
Lys Gly Gly Gly Lys His Gly Lys Phe Ser Trp Ser Val Tyr Gin Arg
420 425 430
Phe Thr Thr Ala Pro Arg Ala Leu Glu Tyr Gly He Gly Met Tyr Leu
435 440 445
Asp Tyr Gin Phe Ser Lys His Val Lys Ala Gly Leu Lys Leu Val Trp
450 455 460
Leu Glu Phe Gin He Arg Ala Gly Tyr Asn Pro Gly Thr Gly Phe Leu 465 470 475 480
Gly Pro Asn Gly Gin Pro Leu Asn Leu Asn Thr Gly Leu Phe Glu Ser
485 490 495
Ser Ala Phe Ala Gin Gly Pro Gin Asn Met Gly Gly He Ala Lys Ser 500 505 510
He Thr Gin Asp Arg Ser His Leu Met Thr His He Ser Tyr Ser Phe 515 520 525
(2) INFORMATION FOR SEQ ID NO: 191:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 412 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...359 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191:
TTTTGTCTGA TTTGTTGCTA CCAAAACCAT TACCAACCAA AGCAGATCCC ATG TTT 56
Met Phe
1
TTG ATA CTA TCG AAT CCA TTC TTC AAC ACT TCT GCC ATA AAA TTC TTG 104 Leu He Leu Ser Asn Pro Phe Phe Asn Thr Ser Ala He Lys Phe Leu 5 10 15
ATA TTG TCC ATA GGC AAG TTG AAT TTT TTC CCT AAT ATT TCA TTA AGT 152 He Leu Ser He Gly Lys Leu Asn Phe Phe Pro Asn He Ser Leu Ser 20 25 30
CCC ATC ATT AAC ATC AGG AAG AAC AAA AAA TTT AAT ATC ATA GAA AAC 200 Pro He He Asn He Arg Lys Asn Lys Lys Phe Asn He He Glu Asn 35 40 45 50
AAA TCA CTG GAT AAA CCT GTA AAA AGA TTT GTT CCG CCA CCC AAC AAA 248 Lys Ser Leu Asp Lys Pro Val Lys Arg Phe Val Pro Pro Pro Asn Lys 55 60 65
GAA GCT AAA ATT TTT CCC ATG ATC AGT CCT TTT ATT TTT GGT TGT GTA 296 Glu Ala Lys He Phe Pro Met He Ser Pro Phe He Phe Gly Cys Val 70 75 80
AGT TCT TGC TTG TTC GGA TCT CTA ATG CGT GTT TTA GTA GGA AGC ATT 344 Ser Ser Cys Leu Phe Gly Ser Leu Met Arg Val Leu Val Gly Ser He 85 90 95
TCA CAA TGG CAT ACC TAAAGCTACT AAGAAAATTC TTGAATCTAT TGGTAAGATT A 400 Ser Gin Trp His Thr 100
CTCATGAAAT CA 412 (2) INFORMATION FOR SEQ ID NO: 192:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 103 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192:
Met Phe Leu He Leu Ser Asn Pro Phe Phe Asn Thr Ser Ala He Lys
1 5 10 15
Phe Leu He Leu Ser He Gly Lys Leu Asn Phe Phe Pro Asn He Ser
20 25 30
Leu Ser Pro He He Asn He Arg Lys Asn Lys Lys Phe Asn He He
35 40 45
Glu Asn Lys Ser Leu Asp Lys Pro Val Lys Arg Phe Val Pro Pro Pro
50 55 60
Asn Lys Glu Ala Lys He Phe Pro Met He Ser Pro Phe He Phe Gly 65 70 75 80
Cys Val Ser Ser Cys Leu Phe Gly Ser Leu Met Arg Val Leu Val Gly
85 90 95
Ser He Ser Gin Trp His Thr 100
(2) INFORMATION FOR SEQ ID NO: 193:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 447 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 67...405 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 193 :
TCCAATCCGT CTAATATCTC TTTATTTTCG CTCAATTCTT TAACCATAAC GGGTTTTTTA 60 GCGCTT GTG GGG GTT ACT GGG CTA AAG TTT GGA GCG TTT TGC ACT TCT 108 Val Gly Val Thr Gly Leu Lys Phe Gly Ala Phe Cys Thr Ser 1 5 10
TTT TCT TCT TTT TTT AGA TTT TCC TTT ATC ATT TCT TCT ATC CTT CCT 156 Phe Ser Ser Phe Phe Arg Phe Ser Phe He He Ser Ser He Leu Pro 15 20 25 30 TCT ATC ATT TCT TCT TGC GTG TTT TCT TGT GGG TTT TCT TCT TTT TTA 204 Ser He He Ser Ser Cys Val Phe Ser Cys Gly Phe Ser Ser Phe Leu 35 40 45
GGG TGG TTG GGG GTT TTT TGG TTT TCT GTT TTG TTG TCA TTT TCT ATT 252 Gly Trp Leu Gly Val Phe Trp Phe Ser Val Leu Leu Ser Phe Ser He 50 55 60
ATG GGT GCA AGT GTG GGC ATG ATA GGT TTG GGC GTG GTG GGC GTA AGA 300 Met Gly Ala Ser Val Gly Met He Gly Leu Gly Val Val Gly Val Arg 65 70 75
GTT TCT TTT GTA GGC GTG GGT TCT CTT TCT TTA GTT TCT TGT TTA ATT 348 Val Ser Phe Val Gly Val Gly Ser Leu Ser Leu Val Ser Cys Leu He 80 85 90
TCT TTT AAA GGG GGG TTA GTG GGG TTA GTC AAA TCA TCA AAT CGG TTT 396 Ser Phe Lys Gly Gly Leu Val Gly Leu Val Lys Ser Ser Asn Arg Phe 95 100 105 110
CTT TTA GGG TAAAT GGTGTAATGG GTAGGGGGGT GGGAGGAAAT TTGGACT 447
Leu Leu Gly
(2) INFORMATION FOR SEQ ID NO: 194:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 113 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194:
Val Gly Val Thr Gly Leu Lys Phe Gly Ala Phe Cys Thr Ser Phe Ser
1 5 10 15
Ser Phe Phe Arg Phe Ser Phe He He Ser Ser He Leu Pro Ser He
20 25 30
He Ser Ser Cys Val Phe Ser Cys Gly Phe Ser Ser Phe Leu Gly Trp
35 40 45
Leu Gly Val Phe Trp Phe Ser Val Leu Leu Ser Phe Ser He Met Gly
50 55 60
Ala Ser Val Gly Met He Gly Leu Gly Val Val Gly Val Arg Val Ser 65 70 75 80
Phe Val Gly Val Gly Ser Leu Ser Leu Val Ser Cys Leu He Ser Phe
85 90 95
Lys Gly Gly Leu Val Gly Leu Val Lys Ser Ser Asn Arg Phe Leu Leu
100 105 110
Gly
(2) INFORMATION FOR SEQ ID NO: 195: (i) SEQUENCE CHARACTERISTICS :
(A) LENGTH: 1180 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1127 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195:
CCAAGAAAGA GTATAATAGC GCATAAGAAT TTAACTGATG AAGAGGTTTA ATG CTA 56
Met Leu 1
GAA AAT AGA GTT AAG ACC AAG CAA ATT TTT ATC GGT GGC GTG GCC ATA 104 Glu Asn Arg Val Lys Thr Lys Gin He Phe He Gly Gly Val Ala He 5 10 15
GGG GGT GAT GCT CCC ATA AGC ACG CAA AGC ATG ACC TTT AGC AAA ACC 152 Gly Gly Asp Ala Pro He Ser Thr Gin Ser Met Thr Phe Ser Lys Thr 20 25 30
GCT GAT ATT GAA AGC ACT AAA AAT CAA ATT GAC AGA CTC AAA CTC GCC 200 Ala Asp He Glu Ser Thr Lys Asn Gin He Asp Arg Leu Lys Leu Ala 35 40 45 50
GGG GCC GAT TTA GTG AGG GTG GCG GTG AGT AAT GAA AAG GAC GCT CTA 248 Gly Ala Asp Leu Val Arg Val Ala Val Ser Asn Glu Lys Asp Ala Leu 55 60 65
GCC TTA AAA GAA TTG AAA AAA GTG TCC CCT TTG CCT TTA ATC GCT GAT 296 Ala Leu Lys Glu Leu Lys Lys Val Ser Pro Leu Pro Leu He Ala Asp 70 75 80
ATT CAT TTC CAT TAT AAA TTC GCT CTC ATT GCC GCT CAA AGC GTG GAT 344 He His Phe His Tyr Lys Phe Ala Leu He Ala Ala Gin Ser Val Asp 85 90 95
GCG ATC AGG ATT AAC CCC GGA AAC ATC GGC TCT AAA GAG AAG ATC AAA 392 Ala He Arg He Asn Pro Gly Asn He Gly Ser Lys Glu Lys He Lys 100 105 110
GCG GTG GTT GAT GCT TGT AAA GAA AAA AAC ATT CCT ATA AGA ATT GGC 440 Ala Val Val Asp Ala Cys Lys Glu Lys Asn He Pro He Arg He Gly 115 120 125 130
GTG AAT GCT GGG AGT TTA GAA AAG CAG TTT GAT CAA AAA TAC GGA CCC 488 Val Asn Ala Gly Ser Leu Glu Lys Gin Phe Asp Gin Lys Tyr Gly Pro 135 140 145
ACC CCA AAA GGC ATG GTA GAA AGC GCT TTG TAT AAC GCC AAA CTT TTA 536 Thr Pro Lys Gly Met Val Glu Ser Ala Leu Tyr Asn Ala Lys Leu Leu 150 155 160
GAA GAT TTG GAT TTT ACC AAT TTT AAG ATT TCT TTA AAA GCG AGC GAT 584 Glu Asp Leu Asp Phe Thr Asn Phe Lys He Ser Leu Lys Ala Ser Asp 165 170 175
GTG ATT CGC ACC ATA GAA GCT TAC AGG ATG CTT CGC CCT CTT GTG ATC 632 Val He Arg Thr He Glu Ala Tyr Arg Met Leu Arg Pro Leu Val He 180 185 190
TAT CCT TTC CAT TTG GGG GTT ACG GAG GCG GGG AAT CTT TTT AGC TCC 680 Tyr Pro Phe His Leu Gly Val Thr Glu Ala Gly Asn Leu Phe Ser Ser 195 200 205 210
AGT ATC AAA TCC GCT ATG GCT TTA GGG GGG CTT TTA ATG GAG GGC ATT 728 Ser He Lys Ser Ala Met Ala Leu Gly Gly Leu Leu Met Glu Gly He 215 220 225
GGG GAT ACG ATG CGC GTA TCC ATC ACA GGG GAA TTA GAA AAT GAA ATC 776 Gly Asp Thr Met Arg Val Ser He Thr Gly Glu Leu Glu Asn Glu He 230 235 240
AAA GTG GCC AGA GCA ATT TTA CGC CAT AGC GGG CGG TTG AAA GAA GGG 824 Lys Val Ala Arg Ala He Leu Arg His Ser Gly Arg Leu Lys Glu Gly 245 250 255
ATT AAT TGG ATT TCT TGC CCC ACT TGC GGG CGC ATT GAA GCC AAT TTA 872 He Asn Trp He Ser Cys Pro Thr Cys Gly Arg He Glu Ala Asn Leu 260 265 270
GTG GAT ATG GCG ATC AAG GTA GAA AAA CGC TTA AGC CAC ATC AAA ACC 920 Val Asp Met Ala He Lys Val Glu Lys Arg Leu Ser His He Lys Thr 275 280 285 290
CCT TTA GAC ATT AGC GTG ATG GGT TGC GTG GTG AAT GCT TTG GGT GAA 968 Pro Leu Asp He Ser Val Met Gly Cys Val Val Asn Ala Leu Gly Glu 295 300 305
GCC AAG CAT GCA GAC ATG GCG ATC GCT TTT GGG AAT CGC AGC GGT TTG 1016 Ala Lys His Ala Asp Met Ala He Ala Phe Gly Asn Arg Ser Gly Leu 310 315 320
ATC ATT AAA GAG GGT AAA GTC ATT CAC AAA CTG GCT GAA AAG GAT TTA 1064 He He Lys Glu Gly Lys Val He His Lys Leu Ala Glu Lys Asp Leu 325 330 335
TTT GAA ACT TTT GTG ATA GAA GTG GAA AAT TTA GCT AAA GAA AGA GAA 1112 Phe Glu Thr Phe Val He Glu Val Glu Asn Leu Ala Lys Glu Arg Glu 340 345 350
AAA AGT TTA AAG GAT TAGGCATGAT CAATAAGTTT AAAAATTTTG TGAGCAACTA C 1168 Lys Ser Leu Lys Asp 355
CAGCAATCTA AC 1180
(2) INFORMATION FOR SEQ ID NO: 196:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 359 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196:
Met Leu Glu Asn Arg Val Lys Thr Lys Gin He Phe He Gly Gly Val
1 5 10 15
Ala He Gly Gly Asp Ala Pro He Ser Thr Gin Ser Met Thr Phe Ser
20 25 30
Lys Thr Ala Asp He Glu Ser Thr Lys Asn Gin He Asp Arg Leu Lys
35 40 45
Leu Ala Gly Ala Asp Leu Val Arg Val Ala Val Ser Asn Glu Lys Asp
50 55 60
Ala Leu Ala Leu Lys Glu Leu Lys Lys Val Ser Pro Leu Pro Leu He 65 70 75 80
Ala Asp He His Phe His Tyr Lys Phe Ala Leu He Ala Ala Gin Ser
85 90 95
Val Asp Ala He Arg He Asn Pro Gly Asn He Gly Ser Lys Glu Lys
100 105 110
He Lys Ala Val Val Asp Ala Cys Lys Glu Lys Asn He Pro He Arg
115 120 125
He Gly Val Asn Ala Gly Ser Leu Glu Lys Gin Phe Asp Gin Lys Tyr
130 135 140
Gly Pro Thr Pro Lys Gly Met Val Glu Ser Ala Leu Tyr Asn Ala Lys 145 150 155 160
Leu Leu Glu Asp Leu Asp Phe Thr Asn Phe Lys He Ser Leu Lys Ala
165 170 175
Ser Asp Val He Arg Thr He Glu Ala Tyr Arg Met Leu Arg Pro Leu
180 185 190
Val He Tyr Pro Phe His Leu Gly Val Thr Glu Ala Gly Asn Leu Phe
195 200 205
Ser Ser Ser He Lys Ser Ala Met Ala Leu Gly Gly Leu Leu Met Glu
210 215 220
Gly He Gly Asp Thr Met Arg Val Ser He Thr Gly Glu Leu Glu Asn 225 230 235 240
Glu He Lys Val Ala Arg Ala He Leu Arg His Ser Gly Arg Leu Lys
245 250 255
Glu Gly He Asn Trp He Ser Cys Pro Thr Cys Gly Arg He Glu Ala
260 265 270
Asn Leu Val Asp Met Ala He Lys Val Glu Lys Arg Leu Ser His He
275 280 285
Lys Thr Pro Leu Asp He Ser Val Met Gly Cys Val Val Asn Ala Leu 290 295 300
Gly Glu Ala Lys His Ala Asp Met Ala He Ala Phe Gly Asn Arg Ser 305 310 315 320
Gly Leu He He Lys Glu Gly Lys Val He His Lys Leu Ala Glu Lys
325 330 335
Asp Leu Phe Glu Thr Phe Val He Glu Val Glu Asn Leu Ala Lys Glu
340 345 350
Arg Glu Lys Ser Leu Lys Asp 355
(2) INFORMATION FOR SEQ ID NO: 197
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1399 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1346 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197:
GCCTATGAAA TCTTAAAGCG TTATCCGGCT AAAGCAAAGG TATAAATAAC ATG AAA 56
Met Lys
1
AAA TTT TTA ATC ACT TTA TTA TTA GGA GTT TTT ATG GGG TTA CAA GCG 104 Lys Phe Leu He Thr Leu Leu Leu Gly Val Phe Met Gly Leu Gin Ala 5 10 15
AGC GCT TTG ACA CAC CAA GAA ATC AAT CAA GCT AAA GTC CCT GTG ATT 152 Ser Ala Leu Thr His Gin Glu He Asn Gin Ala Lys Val Pro Val He 20 25 30
TAT GAA GAA AAC CAT TTG TTG CCT ATG GGG TTT ATC CAT TTA GCC TTT 200 Tyr Glu Glu Asn His Leu Leu Pro Met Gly Phe He His Leu Ala Phe 35 40 45 50
AGG GGG GGT GGG AGC TTA AGC GAT AAA AAC CAG TTG GGT TTG GCG AAA 248 Arg Gly Gly Gly Ser Leu Ser Asp Lys Asn Gin Leu Gly Leu Ala Lys 55 60 65
TTA TTC GCG CAA GTT TTA AAC GAA GGC ACT AAA GAG CTT GGT GCG GTG 296 Leu Phe Ala Gin Val Leu Asn Glu Gly Thr Lys Glu Leu Gly Ala Val 70 75 80 GGG TTT GCG CAA CTT TTA GAG CAA AAA GCG ATC AGT TTG AAT GTG GAT 344 Gly Phe Ala Gin Leu Leu Glu Gin Lys Ala He Ser Leu Asn Val Asp 85 90 95
ACC AGC ACA GAA GAT TTG CAA ATC ACT TTA GAA TTT TTA AAA GAA TAC 392 Thr Ser Thr Glu Asp Leu Gin He Thr Leu Glu Phe Leu Lys Glu Tyr 100 105 110
GAA GAT GAA GCC ATT ACG CGC TTA AAA GAG CTT TTA AAA TCC CCT AAT 440 Glu Asp Glu Ala He Thr Arg Leu Lys Glu Leu Leu Lys Ser Pro Asn 115 120 125 130
TTC ACG CAA AAC GCT TTA GAA AAA GTC AAA ACC CAA ATG TTA GCC GCA 488 Phe Thr Gin Asn Ala Leu Glu Lys Val Lys Thr Gin Met Leu Ala Ala 135 140 145
CTT TTA CAA AAA GAA AGC GAT TTT GAC TAT TTG GCT AAA TTG ACT TTA 536 Leu Leu Gin Lys Glu Ser Asp Phe Asp Tyr Leu Ala Lys Leu Thr Leu 150 155 160
AAG CAA GAG CTT TTT GCT AAC ACC CCT TTA GCT AAC GCA GCC TTA GGC 584 Lys Gin Glu Leu Phe Ala Asn Thr Pro Leu Ala Asn Ala Ala Leu Gly 165 170 175
ACT AAA GAG AGC ATT CAA AAA ATC AAG CTA GAC GAT TTG AAA CAG CAA 632 Thr Lys Glu Ser He Gin Lys He Lys Leu Asp Asp Leu Lys Gin Gin 180 185 190
TTT GCT AAG GTC TTT GAA CTC AAT AAG CTC GTG GTG GTG CTT GGG GGC 680 Phe Ala Lys Val Phe Glu Leu Asn Lys Leu Val Val Val Leu Gly Gly 195 200 205 210
GAT TTG AAA ATC GAT CAA ACC CTT AAG CGT TTG AAT AAC GCC CTT AAT 728 Asp Leu Lys He Asp Gin Thr Leu Lys Arg Leu Asn Asn Ala Leu Asn 215 220 225
TTC TTG CCA CAA GGT AAA GCG TAT GAA GAG CCT TAT TTT GAA ACG AGC 776 Phe Leu Pro Gin Gly Lys Ala Tyr Glu Glu Pro Tyr Phe Glu Thr Ser 230 235 240
GAT AAA AAA AGC GAA AAA GTC CTC TAT AAA GAC ACT GAG CAG GCT TTC 824 Asp Lys Lys Ser Glu Lys Val Leu Tyr Lys Asp Thr Glu Gin Ala Phe 245 250 255
GTG TAT TTT GGT GCG CCC TTT AAA ATC AAG GAT TTA AAA CAG GAT TTA 872 Val Tyr Phe Gly Ala Pro Phe Lys He Lys Asp Leu Lys Gin Asp Leu 260 265 270
GCG AAA TCT AAA GTC ATG ATG TTT GTG CTT GGT GGG GGG TTT GGC TCT 920 Ala Lys Ser Lys Val Met Met Phe Val Leu Gly Gly Gly Phe Gly Ser 275 280 285 290
CGT TTA ATG GAA AAA ATC AGG GTT CAA GAG GGA TTA GCT TAT AGC GTG 968 Arg Leu Met Glu Lys He Arg Val Gin Glu Gly Leu Ala Tyr Ser Val 295 300 305 TAT ATC CGC TCC AAT TTT TCT AAA GTG GCG CAT TTT GCG AGC GGG TAT 1016 Tyr He Arg Ser Asn Phe Ser Lys Val Ala His Phe Ala Ser Gly Tyr 310 315 320
TTG CAA ACC AAG CTC AGC ACT CAA ACT AAA AGC GTT GCC TTA GTT AAA 1064 Leu Gin Thr Lys Leu Ser Thr Gin Thr Lys Ser Val Ala Leu Val Lys 325 330 335
AAA ATC GTT AAG GAA TTT ATA GAA AAA GGC ATG ACG CAA CAA GAA TTA 1112 Lys He Val Lys Glu Phe He Glu Lys Gly Met Thr Gin Gin Glu Leu 340 345 350
GAC GAC GCT AAA AAG TTT TTA CTA GGC TCT GAG CCT TTA AGG AAT GAA 1160 Asp Asp Ala Lys Lys Phe Leu Leu Gly Ser Glu Pro Leu Arg Asn Glu 355 360 365 370
ACG ATC TCT AGC CGC TTG AAC ACC ACT TAC AAT TAT TTT TAT TTA GGT 1208 Thr He Ser Ser Arg Leu Asn Thr Thr Tyr Asn Tyr Phe Tyr Leu Gly 375 380 385
TTG CCT TTA AAT TTT AAC CAA ACG CTG CTC AAT CAA ATC CAA AAA ATG 1256 Leu Pro Leu Asn Phe Asn Gin Thr Leu Leu Asn Gin He Gin Lys Met 390 395 400
AGT TTG AAA GAA ATC AAT GAT TTC ATT AAA GCC CAC ACC GAA ATC AAC 1304 Ser Leu Lys Glu He Asn Asp Phe He Lys Ala His Thr Glu He Asn 405 410 415
GAC TTG ACT TTT GCT ATT GTG AGC AAT AAA AAG AAG GAC AAA TGATGCCAT 1355 Asp Leu Thr Phe Ala He Val Ser Asn Lys Lys Lys Asp Lys 420 425 430
TTGAAGCTGT AATCGGGCTA GAAGTCCATG TCCAACTCAA CACC 1399
(2) INFORMATION FOR SEQ ID NO: 198:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 432 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198:
Met Lys Lys Phe Leu He Thr Leu Leu Leu Gly Val Phe Met Gly Leu
1 5 10 15
Gin Ala Ser Ala Leu Thr His Gin Glu He Asn Gin Ala Lys Val Pro
20 25 30
Val He Tyr Glu Glu Asn His Leu Leu Pro Met Gly Phe He His Leu
35 40 45
Ala Phe Arg Gly Gly Gly Ser Leu Ser Asp Lys Asn Gin Leu Gly Leu 50 55 60 Ala Lys Leu Phe Ala Gin Val Leu Asn Glu Gly Thr Lys Glu Leu Gly 65 70 75 80
Ala Val Gly Phe Ala Gin Leu Leu Glu Gin Lys Ala He Ser Leu Asn
85 90 95
Val Asp Thr Ser Thr Glu Asp Leu Gin He Thr Leu Glu Phe Leu Lys
100 105 110
Glu Tyr Glu Asp Glu Ala He Thr Arg Leu Lys Glu Leu Leu Lys Ser
115 120 125
Pro Asn Phe Thr Gin Asn Ala Leu Glu Lys Val Lys Thr Gin Met Leu
130 135 140
Ala Ala Leu Leu Gin Lys Glu Ser Asp Phe Asp Tyr Leu Ala Lys Leu 145 150 155 160
Thr Leu Lys Gin Glu Leu Phe Ala Asn Thr Pro Leu Ala Asn Ala Ala
165 170 175
Leu Gly Thr Lys Glu Ser He Gin Lys He Lys Leu Asp Asp Leu Lys
180 185 190
Gin Gin Phe Ala Lys Val Phe Glu Leu Asn Lys Leu Val Val Val Leu
195 200 205
Gly Gly Asp Leu Lys He Asp Gin Thr Leu Lys Arg Leu Asn Asn Ala
210 215 220
Leu Asn Phe Leu Pro Gin Gly Lys Ala Tyr Glu Glu Pro Tyr Phe Glu 225 230 235 240
Thr Ser Asp Lys Lys Ser Glu Lys Val Leu Tyr Lys Asp Thr Glu Gin
245 250 255
Ala Phe Val Tyr Phe Gly Ala Pro Phe Lys He Lys Asp Leu Lys Gin
260 265 270
Asp Leu Ala Lys Ser Lys Val Met Met Phe Val Leu Gly Gly Gly Phe
275 280 285
Gly Ser Arg Leu Met Glu Lys He Arg Val Gin Glu Gly Leu Ala Tyr
290 295 300
Ser Val Tyr He Arg Ser Asn Phe Ser Lys Val Ala His Phe Ala Ser 305 310 315 320
Gly Tyr Leu Gin Thr Lys Leu Ser Thr Gin Thr Lys Ser Val Ala Leu
325 330 335
Val Lys Lys He Val Lys Glu Phe He Glu Lys Gly Met Thr Gin Gin
340 345 350
Glu Leu Asp Asp Ala Lys Lys Phe Leu Leu Gly Ser Glu Pro Leu Arg
355 360 365
Asn Glu Thr He Ser Ser Arg Leu Asn Thr Thr Tyr Asn Tyr Phe Tyr
370 375 380
Leu Gly Leu Pro Leu Asn Phe Asn Gin Thr Leu Leu Asn Gin He Gin 385 390 395 400
Lys Met Ser Leu Lys Glu He Asn Asp Phe He Lys Ala His Thr Glu
405 410 415
He Asn Asp Leu Thr Phe Ala He Val Ser Asn Lys Lys Lys Asp Lys 420 425 430
(2) INFORMATION FOR SEQ ID NO: 199:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 574 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...521 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199:
GAAGAGCCAG AAAATTTAGA AACCTCTTCG GCACAAAATT TGTTTGAGTG ATG CGT 56
Met Arg 1
TTC TTT TCA TTC TTT TAT TTT TTA TTT TAT TTT TTA GGG GTT TCT TTG 104 Phe Phe Ser Phe Phe Tyr Phe Leu Phe Tyr Phe Leu Gly Val Ser Leu 5 10 15
CAA GCT CTC AGC CCC CTA GAA GAT CAA GAA TTT TTA ATT TCG TAC CGC 152 Gin Ala Leu Ser Pro Leu Glu Asp Gin Glu Phe Leu He Ser Tyr Arg 20 25 30
TTG AAA ATC GTT GAT TCT AGA GTG ATG GGC GAA GAG TAT TCT GTC TCT 200 Leu Lys He Val Asp Ser Arg Val Met Gly Glu Glu Tyr Ser Val Ser 35 40 45 50
AAA CCT ATC GTT AGC CGC ATT AAA ACA GCC CCC TAT GTT TTA GAC TAT 248 Lys Pro He Val Ser Arg He Lys Thr Ala Pro Tyr Val Leu Asp Tyr 55 60 65
CAT TGC TCC ATC ATC ACT CGT AAC TTA CCC AAT TTG AAA AAC CCC TTG 296 His Cys Ser He He Thr Arg Asn Leu Pro Asn Leu Lys Asn Pro Leu 70 75 80
CTC CCA ATA AAG TTA GAA CGC TTC CTT TTA GAA ATC GCG TTA AAA AAA 344 Leu Pro He Lys Leu Glu Arg Phe Leu Leu Glu He Ala Leu Lys Lys 85 90 95
GAA AAA GAG CGG GTC ATA GAC TGC ATT TTA AAA AGC CAG GTC GCT ATC 392 Glu Lys Glu Arg Val He Asp Cys He Leu Lys Ser Gin Val Ala He 100 105 110
ACG CAT TAT GAT CAT AGC TAT AAA AAC GGC ACC ACT ACC ACA AGC ATT 440 Thr His Tyr Asp His Ser Tyr Lys Asn Gly Thr Thr Thr Thr Ser He 115 120 125 130
CTT GCC CTC AAA GCC TTA AGC GTT AGA GCG AGT TTA GTG GGA GAT GCG 488 Leu Ala Leu Lys Ala Leu Ser Val Arg Ala Ser Leu Val Gly Asp Ala 135 140 145
CTG TTT TTA GAT ATT TTT AGA AAG GAA GAA GAA TGAAAATCGC CATTGTAGAA 541 Leu Phe Leu Asp He Phe Arg Lys Glu Glu Glu 150 155 GATGATATTA ACATGCGTAA AAGCCTGGAG CTT 574
(2) INFORMATION FOR SEQ ID NO: 200:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200:
Met Arg Phe Phe Ser Phe Phe Tyr Phe Leu Phe Tyr Phe Leu Gly Val
1 5 10 15
Ser Leu Gin Ala Leu Ser Pro Leu Glu Asp Gin Glu Phe Leu He Ser
20 25 30
Tyr Arg Leu Lys He Val Asp Ser Arg Val Met Gly Glu Glu Tyr Ser
35 40 45
Val Ser Lys Pro He Val Ser Arg He Lys Thr Ala Pro Tyr Val Leu
50 55 60
Asp Tyr His Cys Ser He He Thr Arg Asn Leu Pro Asn Leu Lys Asn 65 70 75 80
Pro Leu Leu Pro He Lys Leu Glu Arg Phe Leu Leu Glu He Ala Leu
85 90 95
Lys Lys Glu Lys Glu Arg Val He Asp Cys He Leu Lys Ser Gin Val
100 105 110
Ala He Thr His Tyr Asp His Ser Tyr Lys Asn Gly Thr Thr Thr Thr
115 120 125
Ser He Leu Ala Leu Lys Ala Leu Ser Val Arg Ala Ser Leu Val Gly
130 135 140
Asp Ala Leu Phe Leu Asp He Phe Arg Lys Glu Glu Glu 145 150 155
(2) INFORMATION FOR SEQ ID NO: 201:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1003 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...950 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: ATCATGAGAA AACGCTTCAC TCCACTTTTG TTATTCAGGA AATAATACAG ATG AGA 56
Met Arg 1
AAA ACG ATT TCA GCG TTG TTT TTA TCA GCG TGC ATA GGG TTA TCG TCT 104 Lys Thr He Ser Ala Leu Phe Leu Ser Ala Cys He Gly Leu Ser Ser 5 10 15
GTT TAT GCA GAT AAC GCT TTG ATT TTG CAA ACC GAT TTT AGT CTA AAA 152 Val Tyr Ala Asp Asn Ala Leu He Leu Gin Thr Asp Phe Ser Leu Lys 20 25 30
GAT GGG GCC GTC TCG GCG ATG AAA GGC GTC GCT TTC AGC GTT GAT TCC 200 Asp Gly Ala Val Ser Ala Met Lys Gly Val Ala Phe Ser Val Asp Ser 35 40 45 50
CAT CTT AAA ATC TTT GAT TTA ACG CAC GAA ATC CCC CCG TAT AAC ATC 248 His Leu Lys He Phe Asp Leu Thr His Glu He Pro Pro Tyr Asn He 55 60 65
TGG GAA GGC GCT TAC CGC TTG TAT CAG ACC GCC AGT TAT TGG CCA AAA 296 Trp Glu Gly Ala Tyr Arg Leu Tyr Gin Thr Ala Ser Tyr Trp Pro Lys 70 75 80
GGT TCG GTA TTT GTG AGC GTA GTT GAT CCG GGC GTA GGC ACT AAG CGT 344 Gly Ser Val Phe Val Ser Val Val Asp Pro Gly Val Gly Thr Lys Arg 85 90 95
AAA TCG GTG GTA CTA AAA ACT AAA AAC GGC CAG TAT TTC GTC TCG CCG 392 Lys Ser Val Val Leu Lys Thr Lys Asn Gly Gin Tyr Phe Val Ser Pro 100 105 110
GAT AAC GGC ACG CTG ACT TTG GTG GCA CAA ACT TTG GGG ATT GAT AGC 440 Asp Asn Gly Thr Leu Thr Leu Val Ala Gin Thr Leu Gly He Asp Ser 115 120 125 130
GTG CGT GAA ATT GAT GAA AAA GCT AAC CGC TTG AAA GGT TCT GAA AAA 488 Val Arg Glu He Asp Glu Lys Ala Asn Arg Leu Lys Gly Ser Glu Lys 135 140 145
TCC TAT ACT TTC CAT GGT CGT GAT GTG TAT GCT TAC ACC GGT GCA CGC 536 Ser Tyr Thr Phe His Gly Arg Asp Val Tyr Ala Tyr Thr Gly Ala Arg 150 155 160
TTG GCT TCT GGG GCG ATC ACA TTC GAG CAG GTC GGG CCA GAG CTT CCC 584 Leu Ala Ser Gly Ala He Thr Phe Glu Gin Val Gly Pro Glu Leu Pro 165 170 175
CCA AAA GTC GTT GAA ATT CCT TAC CAA AAA GCG AAA GCC ACA AAA GGG 632 Pro Lys Val Val Glu He Pro Tyr Gin Lys Ala Lys Ala Thr Lys Gly 180 185 190
GAA GTG AAA GGT AAT ATC CCG ATT TTG GAT ATT CAA TAT GGC AAT GTT 680 Glu Val Lys Gly Asn He Pro He Leu Asp He Gin Tyr Gly Asn Val 195 200 205 210 TGG AGC AAC ATC AGC GAT AAA TTA CTC AAT CAA GCA AAA ATC AAA CTC 728 Trp Ser Asn He Ser Asp Lys Leu Leu Asn Gin Ala Lys He Lys Leu 215 220 225
AAT GAC ACG CTG TGT GTA ACG ATT TTT AAA GGT TCT AAG AAA CAA TAC 776 Asn Asp Thr Leu Cys Val Thr He Phe Lys Gly Ser Lys Lys Gin Tyr 230 235 240
GAA GGG AAA ATG CCG TAT GTC GCA AGC TTT GGC GAT GTG CCA GAA GGC 824 Glu Gly Lys Met Pro Tyr Val Ala Ser Phe Gly Asp Val Pro Glu Gly 245 250 255
CAG CCG TTA GTT TAT TTA AAC AGC TTG TTA AAT GTT TCC GTG GCG CTG 872 Gin Pro Leu Val Tyr Leu Asn Ser Leu Leu Asn Val Ser Val Ala Leu 260 265 270
AAT AGG GAT AAT TTC GCG CAA AAA TAT CAA ATC AAA TCC GGT GCT GAC 920 Asn Arg Asp Asn Phe Ala Gin Lys Tyr Gin He Lys Ser Gly Ala Asp 275 280 285 290
TGG AAT ATT GAT ATA AAG AAG TGC GCT AAG TAAAGCGCTG TTTAGAAAAT TAA 973 Trp Asn He Asp He Lys Lys Cys Ala Lys 295 300
GGGGCGTGAA ACGCCCTAAC CGCTAAAGAT 1003
(2) INFORMATION FOR SEQ ID NO:202:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 300 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202:
Met Arg Lys Thr He Ser Ala Leu Phe Leu Ser Ala Cys He Gly Leu
1 5 10 15
Ser Ser Val Tyr Ala Asp Asn Ala Leu He Leu Gin Thr Asp Phe Ser
20 25 30
Leu Lys Asp Gly Ala Val Ser Ala Met Lys Gly Val Ala Phe Ser Val
35 40 45
Asp Ser His Leu Lys He Phe Asp Leu Thr His Glu He Pro Pro Tyr
50 55 60
Asn He Trp Glu Gly Ala Tyr Arg Leu Tyr Gin Thr Ala Ser Tyr Trp 65 70 75 80
Pro Lys Gly Ser Val Phe Val Ser Val Val Asp Pro Gly Val Gly Thr
85 90 95
Lys Arg Lys Ser Val Val Leu Lys Thr Lys Asn Gly Gin Tyr Phe Val
100 105 110
Ser Pro Asp Asn Gly Thr Leu Thr Leu Val Ala Gin Thr Leu Gly He 115 120 125 Asp Ser Val Arg Glu He Asp Glu Lys Ala Asn Arg Leu Lys Gly Ser
130 135 140
Glu Lys Ser Tyr Thr Phe His Gly Arg Asp Val Tyr Ala Tyr Thr Gly 145 150 155 160
Ala Arg Leu Ala Ser Gly Ala He Thr Phe Glu Gin Val Gly Pro Glu
165 170 175
Leu Pro Pro Lys Val Val Glu He Pro Tyr Gin Lys Ala Lys Ala Thr
180 185 190
Lys Gly Glu Val Lys Gly Asn He Pro He Leu Asp He Gin Tyr Gly
195 200 205
Asn Val Trp Ser Asn He Ser Asp Lys Leu Leu Asn Gin Ala Lys He
210 215 220
Lys Leu Asn Asp Thr Leu Cys Val Thr He Phe Lys Gly Ser Lys Lys 225 230 235 240
Gin Tyr Glu Gly Lys Met Pro Tyr Val Ala Ser Phe Gly Asp Val Pro
245 250 255
Glu Gly Gin Pro Leu Val Tyr Leu Asn Ser Leu Leu Asn Val Ser Val
260 265 270
Ala Leu Asn Arg Asp Asn Phe Ala Gin Lys Tyr Gin He Lys Ser Gly
275 280 285
Ala Asp Trp Asn He Asp He Lys Lys Cys Ala Lys 290 295 300
(2) INFORMATION FOR SEQ ID NO:203:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1213 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1160 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:203:
ATTATTTTTA ATCTTGCATG AAATCTTAAA TATAGAATTA GTCCCTTTGG ATG GGA 56
Met Gly 1
TTT TCN CTC GCG CTA GGC TAT TTG TGT TTG TTT ATA TTC GTT TTA AGC 104 Phe Xaa Leu Ala Leu Gly Tyr Leu Cys Leu Phe He Phe Val Leu Ser 5 10 15
GCT TCT TTA ATC TCT GAA AAA GCC TTA TCC AAG CAG TAT TTG CAA ACC 152 Ala Ser Leu He Ser Glu Lys Ala Leu Ser Lys Gin Tyr Leu Gin Thr 20 25 30 GCT AAA GAT AAA ATC ACC TCT TTA AAG AAT TTA AAA GTC ATC GCC ATT 200 Ala Lys Asp Lys He Thr Ser Leu Lys Asn Leu Lys Val He Ala He 35 40 45 50
ACC GGA AGC TTT GGG AAA ACC AGC ACC AAA AAT TTC TTG CTT CAA ATC 248 Thr Gly Ser Phe Gly Lys Thr Ser Thr Lys Asn Phe Leu Leu Gin He 55 60 65
TTA CAA ACC ACA TTC AAC GCG CAT GCA AGC CCC AAA AGC GTC AAT ACC 296 Leu Gin Thr Thr Phe Asn Ala His Ala Ser Pro Lys Ser Val Asn Thr 70 75 80
CTT TTA GGG CTT GCG AAT GAT ATT AAT CAG AAT TTA GAC GAT AGG AGT 344 Leu Leu Gly Leu Ala Asn Asp He Asn Gin Asn Leu Asp Asp Arg Ser 85 90 95
GAA ATC TAT ATC GCT GAA GCC GGG GCA AGG AAT AAG GGC GAT ATT AAA 392 Glu He Tyr He Ala Glu Ala Gly Ala Arg Asn Lys Gly Asp He Lys 100 105 110
GAA ATC ACC TGT CTC ATT GAA CCG CAC CTT GTT GTG GTT GCA GAA GTG 440 Glu He Thr Cys Leu He Glu Pro His Leu Val Val Val Ala Glu Val 115 120 125 130
GGC GAA CAG CAT TTA GAA TAC TTT AAA ACT TTA GAA AAT ATT TGC GAG 488 Gly Glu Gin His Leu Glu Tyr Phe Lys Thr Leu Glu Asn He Cys Glu 135 140 145
ACT AAA GCG GAA TTA TTG GAT TCC AAA CGC TTA GAA AAA GCC TTT TGT 536 Thr Lys Ala Glu Leu Leu Asp Ser Lys Arg Leu Glu Lys Ala Phe Cys 150 155 160
TAC TCG GTG GAA AAG ATC AAG CCC TAT GCC CCT AAA GAT AGC CCT TTA 584 Tyr Ser Val Glu Lys He Lys Pro Tyr Ala Pro Lys Asp Ser Pro Leu 165 170 175
ATA GAC TAT TCT AGC CTG GTT AAA AAC ATC CAA TCC ACT TTA AAA GGC 632 He Asp Tyr Ser Ser Leu Val Lys Asn He Gin Ser Thr Leu Lys Gly 180 185 190
ACT TCT TTT GAA ATG CTT ATA GGT AGC GTT TGG GAA AGA TTT GAA ACA 680 Thr Ser Phe Glu Met Leu He Gly Ser Val Trp Glu Arg Phe Glu Thr 195 200 205 210
AAG GTT CTA GGG GAG TTT AGC GCT TAT AAT ATC GCT TCA GCC ATT TTA 728 Lys Val Leu Gly Glu Phe Ser Ala Tyr Asn He Ala Ser Ala He Leu 215 220 225
ATC GCT AAG CAT TTA GGC TTA GAG ACC GAA AGG ATC AAA CGG CTT GTT 776 He Ala Lys His Leu Gly Leu Glu Thr Glu Arg He Lys Arg Leu Val 230 235 240
TTA GAA CTC AAC CCT ATT GCT CAT CGT TTG CAA CTT TTG GAA GTG AAT 824 Leu Glu Leu Asn Pro He Ala His Arg Leu Gin Leu Leu Glu Val Asn 245 250 255 CAA AAA ATC ATC ATA GAC GAT AGC TTT AAT GGG AAT TTA AAG GGC ATG 872 Gin Lys He He He Asp Asp Ser Phe Asn Gly Asn Leu Lys Gly Met 260 265 270
TTA GAG GGC ATT CGT TTA GCG AGT TTG CAC AAA GGG CGT AAA GTC ATT 920 Leu Glu Gly He Arg Leu Ala Ser Leu His Lys Gly Arg Lys Val He 275 280 285 290
GTA ACA CCG GGC TTA GTG GAA AGC AAT ACA GAA AGT AAT GAG GCT TTA 968 Val Thr Pro Gly Leu Val Glu Ser Asn Thr Glu Ser Asn Glu Ala Leu 295 300 305
GCG CAA AAA ATA GAC GGG GTT TTT GAT GTC GCT ATC ATC ACA GGG GAG 1016 Ala Gin Lys He Asp Gly Val Phe Asp Val Ala He He Thr Gly Glu 310 315 320
TTG AAT TCC AAA ACG ATT GCT TCA CAA TTG AAA ACC CCC CAA AAA ATC 1064 Leu Asn Ser Lys Thr He Ala Ser Gin Leu Lys Thr Pro Gin Lys He 325 330 335
TTA CTC AAG GAT AAG GCG CAA TTG GAA AAT ATC TTA CAA GCC ACC ACG 1112 Leu Leu Lys Asp Lys Ala Gin Leu Glu Asn He Leu Gin Ala Thr Thr 340 345 350
ATT CAA GGC GAT TTG ATT TTA TTC GCT AAT GAC GCC CCT AAT TAC ATT T 1161 He Gin Gly Asp Leu He Leu Phe Ala Asn Asp Ala Pro Asn Tyr He 355 360 365 370
AGGAAATGAA CATGCAACAT TTATACGCTC CTTGGCGCGA AAGTTATTTG AA 1213
(2) INFORMATION FOR SEQ ID NO: 204:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 370 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204:
Met Gly Phe Xaa Leu Ala Leu Gly Tyr Leu Cys Leu Phe He Phe Val
1 5 10 15
Leu Ser Ala Ser Leu He Ser Glu Lys Ala Leu Ser Lys Gin Tyr Leu
20 25 30
Gin Thr Ala Lys Asp Lys He Thr Ser Leu Lys Asn Leu Lys Val He
35 40 45
Ala He Thr Gly Ser Phe Gly Lys Thr Ser Thr Lys Asn Phe Leu Leu
50 55 60
Gin He Leu Gin Thr Thr Phe Asn Ala His Ala Ser Pro Lys Ser Val 65 70 75 80
Asn Thr Leu Leu Gly Leu Ala Asn Asp He Asn Gin Asn Leu Asp Asp 85 90 95 Arg Ser Glu He Tyr He Ala Glu Ala Gly Ala Arg Asn Lys Gly Asp
100 105 110
He Lys Glu He Thr Cys Leu He Glu Pro His Leu Val Val Val Ala
115 120 125
Glu Val Gly Glu Gin His Leu Glu Tyr Phe Lys Thr Leu Glu Asn He
130 135 140
Cys Glu Thr Lys Ala Glu Leu Leu Asp Ser Lys Arg Leu Glu Lys Ala 145 150 155 160
Phe Cys Tyr Ser Val Glu Lys He Lys Pro Tyr Ala Pro Lys Asp Ser
165 170 175
Pro Leu He Asp Tyr Ser Ser Leu Val Lys Asn He Gin Ser Thr Leu
180 185 190
Lys Gly Thr Ser Phe Glu Met Leu He Gly Ser Val Trp Glu Arg Phe
195 200 205
Glu Thr Lys Val Leu Gly Glu Phe Ser Ala Tyr Asn He Ala Ser Ala
210 215 220
He Leu He Ala Lys His Leu Gly Leu Glu Thr Glu Arg He Lys Arg 225 230 235 240
Leu Val Leu Glu Leu Asn Pro He Ala His Arg Leu Gin Leu Leu Glu
245 250 255
Val Asn Gin Lys He He He Asp Asp Ser Phe Asn Gly Asn Leu Lys
260 265 270
Gly Met Leu Glu Gly He Arg Leu Ala Ser Leu His Lys Gly Arg Lys
275 280 285
Val He Val Thr Pro Gly Leu Val Glu Ser Asn Thr Glu Ser Asn Glu
290 295 300
Ala Leu Ala Gin Lys He Asp Gly Val Phe Asp Val Ala He He Thr 305 310 315 320
Gly Glu Leu Asn Ser Lys Thr He Ala Ser Gin Leu Lys Thr Pro Gin
325 330 335
Lys He Leu Leu Lys Asp Lys Ala Gin Leu Glu Asn He Leu Gin Ala
340 345 350
Thr Thr He Gin Gly Asp Leu He Leu Phe Ala Asn Asp Ala Pro Asn
355 360 365
Tyr He 370
(2) INFORMATION FOR SEQ ID NO:205:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1303 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1250 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205:
CCTTGTGGTC TCATTTGTTT CTGTTTTACT TGTAGCTTGG AGGACTAGGC ATG TAT 56
Met Tyr 1
AAA TTA GGG GTG TTT TTG TTA GCC ACC TTA CTA TCA GCT AAC ACG CAA 104 Lys Leu Gly Val Phe Leu Leu Ala Thr Leu Leu Ser Ala Asn Thr Gin 5 10 15
AAA GTG AGC GAT ATT GCT AAA GAT ATC CAA CAT AAA GAA ACC CTT TTG 152 Lys Val Ser Asp He Ala Lys Asp He Gin His Lys Glu Thr Leu Leu 20 25 30
AAA AAA ACC CAT GAA GAA AAA AAC CAA CTA AAC AGC CGT TTG AGT TCT 200 Lys Lys Thr His Glu Glu Lys Asn Gin Leu Asn Ser Arg Leu Ser Ser 35 40 45 50
TTA GGC GAA GCG ATC CGC TCT AAA GAG CTT CAA AAG GCT GAG ATG GAG 248 Leu Gly Glu Ala He Arg Ser Lys Glu Leu Gin Lys Ala Glu Met Glu 55 60 65
CGC CAA ATG ATC GCT TTA AAA AAG AGT CTT GAA AAA AAT CGT AAC GAA 296 Arg Gin Met He Ala Leu Lys Lys Ser Leu Glu Lys Asn Arg Asn Glu 70 75 80
AGT TTG GCG CAA GAA AAA GTC CTA ACC AAC TAC CGC AAG TCT TTA GAT 344 Ser Leu Ala Gin Glu Lys Val Leu Thr Asn Tyr Arg Lys Ser Leu Asp 85 90 95
CAT TTG CAA AAA AAG CGA TCA TTT TTA CAA AAG AGG GTG TTT GAT ACG 392 His Leu Gin Lys Lys Arg Ser Phe Leu Gin Lys Arg Val Phe Asp Thr 100 105 110
CTT TTA CAG GAT TTC CTT TTT TCA CAA GCC CTA AAG GGG CAG AAT TTA 440 Leu Leu Gin Asp Phe Leu Phe Ser Gin Ala Leu Lys Gly Gin Asn Leu 115 120 125 130
GCC TCT TCT AAT GAT GTT GTT TTG CAA GTG GCG TTT GAA AAC TTG CAC 488 Ala Ser Ser Asn Asp Val Val Leu Gin Val Ala Phe Glu Asn Leu His 135 140 145
CAA AGC ACT CTG TCT AAA ATG TCG CAA CTG AGC CAA GAA GAA AAG GAA 536 Gin Ser Thr Leu Ser Lys Met Ser Gin Leu Ser Gin Glu Glu Lys Glu 150 155 160
CTC AAT ACG CAA GCT TTA AAA GTC AAA AAC AGC ATT CAA AAA ATC TCA 584 Leu Asn Thr Gin Ala Leu Lys Val Lys Asn Ser He Gin Lys He Ser 165 170 175
TCC ATC ATA GAT GAG CAA AAA ACT CGT GAA GTA ACC TTA AAA TCC TTG 632 Ser He He Asp Glu Gin Lys Thr Arg Glu Val Thr Leu Lys Ser Leu 180 185 190
AAA ACC GAA CAA GAT AAG CTC ATT TTG AGC ATG CAA AAA GAT TAT GCG 680 Lys Thr Glu Gin Asp Lys Leu He Leu Ser Met Gin Lys Asp Tyr Ala 195 200 205 210
ATC TAC AAC CAA CGC CTA ACC CTT TTA GAA AAA GAG CGC CAG AAT TTA 728 He Tyr Asn Gin Arg Leu Thr Leu Leu Glu Lys Glu Arg Gin Asn Leu 215 220 225
AAC GCT CTT TTA AAA CGC TTG AAT ATC ATC AAA CAA AAC AGA GAA AAT 776 Asn Ala Leu Leu Lys Arg Leu Asn He He Lys Gin Asn Arg Glu Asn 230 235 240
GAA GAA AAA GTC AGT TTG AAA AAA TCT TCT CAA GCC TTA GAA GTC AAA 824 Glu Glu Lys Val Ser Leu Lys Lys Ser Ser Gin Ala Leu Glu Val Lys 245 250 255
CAA GTG GCT AGC TCT TAT CAA AAT ATC AAC ACC ACG AGC TAT AAC GGA 872 Gin Val Ala Ser Ser Tyr Gin Asn He Asn Thr Thr Ser Tyr Asn Gly 260 265 270
CCA AAA ACG ATC GCT CCC TTG AAC GAT TAT GAA GTG GTG CAA AAA TTT 920 Pro Lys Thr He Ala Pro Leu Asn Asp Tyr Glu Val Val Gin Lys Phe 275 280 285 290
GGC CCC TAT ATT GAC CCG GTT TAT AAT TTA AAA ATT TTT AGC GAG TCT 968 Gly Pro Tyr He Asp Pro Val Tyr Asn Leu Lys He Phe Ser Glu Ser 295 300 305
ATT ACG CTC GTG TCA AAA ACC CCA AAC GCT TTG GTG CGT AAT GTT TTA 1016 He Thr Leu Val Ser Lys Thr Pro Asn Ala Leu Val Arg Asn Val Leu 310 315 320
GAC GGG AAA ATC GTG TTC GCT AAA GAA ATC AAC ATG CTT AAA AAA GTC 1064 Asp Gly Lys He Val Phe Ala Lys Glu He Asn Met Leu Lys Lys Val 325 330 335
GTT ATC ATT GAG CAT AAA AAT GGG ATC CGC ACG ATT TAT TCT CAA TTG 1112 Val He He Glu His Lys Asn Gly He Arg Thr He Tyr Ser Gin Leu 340 345 350
GAT AAA ATC GCT CCC ACC ATT AAA AGC GGC ATG CGG ATC CAA AAA GGC 1160 Asp Lys He Ala Pro Thr He Lys Ser Gly Met Arg He Gin Lys Gly 355 360 365 370
TAT GTT TTA GGG CGC ATT GAT CAA CGC TTG GGC TTT GAA GTT ACC ATG 1208 Tyr Val Leu Gly Arg He Asp Gin Arg Leu Gly Phe Glu Val Thr Met 375 380 385
AGA GAA AAA CAC ATC AAC CCC TTA GAA CTC ATC GCA CGC AAT TAAACAAAT 1259 Arg Glu Lys His He Asn Pro Leu Glu Leu He Ala Arg Asn 390 395 400
CGTTTTTATT GCCGATATTG GCTAAAGAAT TTATGCAAAC AAAT 1303
(2) INFORMATION FOR SEQ ID NO: 206: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 400 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206:
Met Tyr Lys Leu Gly Val Phe Leu Leu Ala Thr Leu Leu Ser Ala Asn
1 5 10 15
Thr Gin Lys Val Ser Asp He Ala Lys Asp He Gin His Lys Glu Thr
20 25 30
Leu Leu Lys Lys Thr His Glu Glu Lys Asn Gin Leu Asn Ser Arg Leu
35 40 45
Ser Ser Leu Gly Glu Ala He Arg Ser Lys Glu Leu Gin Lys Ala Glu
50 55 60
Met Glu Arg Gin Met He Ala Leu Lys Lys Ser Leu Glu Lys Asn Arg 65 70 75 80
Asn Glu Ser Leu Ala Gin Glu Lys Val Leu Thr Asn Tyr Arg Lys Ser
85 90 95
Leu Asp His Leu Gin Lys Lys Arg Ser Phe Leu Gin Lys Arg Val Phe
100 105 110
Asp Thr Leu Leu Gin Asp Phe Leu Phe Ser Gin Ala Leu Lys Gly Gin
115 120 125
Asn Leu Ala Ser Ser Asn Asp Val Val Leu Gin Val Ala Phe Glu Asn
130 135 140
Leu His Gin Ser Thr Leu Ser Lys Met Ser Gin Leu Ser Gin Glu Glu 145 150 155 160
Lys Glu Leu Asn Thr Gin Ala Leu Lys Val Lys Asn Ser He Gin Lys
165 170 175
He Ser Ser He He Asp Glu Gin Lys Thr Arg Glu Val Thr Leu Lys
180 185 190
Ser Leu Lys Thr Glu Gin Asp Lys Leu He Leu Ser Met Gin Lys Asp
195 200 205
Tyr Ala He Tyr Asn Gin Arg Leu Thr Leu Leu Glu Lys Glu Arg Gin
210 215 220
Asn Leu Asn Ala Leu Leu Lys Arg Leu Asn He He Lys Gin Asn Arg 225 230 235 240
Glu Asn Glu Glu Lys Val Ser Leu Lys Lys Ser Ser Gin Ala Leu Glu
245 250 255
Val Lys Gin Val Ala Ser Ser Tyr Gin Asn He Asn Thr Thr Ser Tyr
260 265 270
Asn Gly Pro Lys Thr He Ala Pro Leu Asn Asp Tyr Glu Val Val Gin
275 280 285
Lys Phe Gly Pro Tyr He Asp Pro Val Tyr Asn Leu Lys He Phe Ser
290 295 300
Glu Ser He Thr Leu Val Ser Lys Thr Pro Asn Ala Leu Val Arg Asn 305 310 315 320
Val Leu Asp Gly Lys He Val Phe Ala Lys Glu He Asn Met Leu Lys
325 330 335
Lys Val Val He He Glu His Lys Asn Gly He Arg Thr He Tyr Ser 340 345 350 Gin Leu Asp Lys He Ala Pro Thr He Lys Ser Gly Met Arg He Gin
355 360 365
Lys Gly Tyr Val Leu Gly Arg He Asp Gin Arg Leu Gly Phe Glu Val
370 375 380
Thr Met Arg Glu Lys His He Asn Pro Leu Glu Leu He Ala Arg Asn 385 390 395 400
(2) INFORMATION FOR SEQ ID NO: 207:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 361 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...308 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207:
AGGATTTAAG ATTGCGTTTG AAGTTAGCTA GCGTATGCTT GGGCGTTTTG ATG AGT 56
Met Ser
1
GGT TGT GCG TCT TCT TCG CCA ACT GGC ACT CTT ATC ACT ATG GTA ACG 104 Gly Cys Ala Ser Ser Ser Pro Thr Gly Thr Leu He Thr Met Val Thr 5 10 15
ATG CCA GTT TCT GGG AAT GAT GCA CAA TAC TCC AAA GAA GGG CGT GCG 152 Met Pro Val Ser Gly Asn Asp Ala Gin Tyr Ser Lys Glu Gly Arg Ala 20 25 30
AGT TGT TGG AGT GTT TTT AGT CTT GTG GCT GCC GGT AAT TGT TCG GTA 200 Ser Cys Trp Ser Val Phe Ser Leu Val Ala Ala Gly Asn Cys Ser Val 35 40 45 50
GAA AAA GCG GCT AAA AGT GGC GGT GTT ACC AAG ATT AAA ATG GTG AGC 248 Glu Lys Ala Ala Lys Ser Gly Gly Val Thr Lys He Lys Met Val Ser 55 60 65
CGT GAG ACA AAC AAC TTT TTA GGT ATT GTT GGC AAA TAC ACC ACG ATC 296 Arg Glu Thr Asn Asn Phe Leu Gly He Val Gly Lys Tyr Thr Thr He 70 75 80
GTT CAA GGC GAC TAGTTTTAAT ATTTAGAGAG CGTAGTTGAA TCGTCTTTCG TTCCA 353 Val Gin Gly Asp 85 CTCTAGGC 361
(2) INFORMATION FOR SEQ ID NO: 208:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 86 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:208:
Met Ser Gly Cys Ala Ser Ser Ser Pro Thr Gly Thr Leu He Thr Met
1 5 10 15
Val Thr Met Pro Val Ser Gly Asn Asp Ala Gin Tyr Ser Lys Glu Gly
20 25 30
Arg Ala Ser Cys Trp Ser Val Phe Ser Leu Val Ala Ala Gly Asn Cys
35 40 45
Ser Val Glu Lys Ala Ala Lys Ser Gly Gly Val Thr Lys He Lys Met
50 55 60
Val Ser Arg Glu Thr Asn Asn Phe Leu Gly He Val Gly Lys Tyr Thr 65 70 75 80
Thr He Val Gin Gly Asp 85
(2) INFORMATION FOR SEQ ID NO: 209:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1564 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1511 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:
TTTAAACCTG AATTTCATAT TTTTGATTTT TTAAAGGGAT TAGAGTTCTT ATG ATT 56
Met He 1
GAA TGG ATG CAA AAT CAT AGA AAA TAT TTA GTG GTT ACA ATA TGG ATA 104 Glu Trp Met Gin Asn His Arg Lys Tyr Leu Val Val Thr He Trp He 5 10 15 AGC ACG ATC GCT TTT ATT GCC GCT GGG ATG ATA GGC TGG GGG CAA TAC 152 Ser Thr He Ala Phe He Ala Ala Gly Met He Gly Trp Gly Gin Tyr 20 25 30
AGC TTT TCT TTA GAT AGC GAT AGC GCT GCC AAA GTG GGA CAG ATT AAG 200 Ser Phe Ser Leu Asp Ser Asp Ser Ala Ala Lys Val Gly Gin He Lys 35 40 45 50
ATT TCT CAA GAA GAA TTA GCC CAA GAA TAC CGC CGC CTT AAA GAC GCA 248 He Ser Gin Glu Glu Leu Ala Gin Glu Tyr Arg Arg Leu Lys Asp Ala 55 60 65
TAT GCT GAG TCT ATC CCT GAT TTT AAA GAA CTC ACC AAA GAT CAA ATC 296 Tyr Ala Glu Ser He Pro Asp Phe Lys Glu Leu Thr Lys Asp Gin He 70 75 80
AAA GCC ATG CAT TTA GAA AAA AGC GCT TTA GAT TCG CTC ATC AAT CAA 344 Lys Ala Met His Leu Glu Lys Ser Ala Leu Asp Ser Leu He Asn Gin 85 90 95
GCC TTA TTG AGA AAT CTC GCT TTA GAT TTA GGG CTT GGC GCT ACA AAG 392 Ala Leu Leu Arg Asn Leu Ala Leu Asp Leu Gly Leu Gly Ala Thr Lys 100 105 110
CAA GAA GTG GCG AAA GAG ATC AGA AAA ACG AGC GTT TTC CAA AAA GAT 440 Gin Glu Val Ala Lys Glu He Arg Lys Thr Ser Val Phe Gin Lys Asp 115 120 125 130
GGC GTT TTT GAT GAA GAA TTG TAT AAA AAT ATC TTA AAG CAA AGC CAT 488 Gly Val Phe Asp Glu Glu Leu Tyr Lys Asn He Leu Lys Gin Ser His 135 140 145
TAC CGC CCC AAA CAT TTT GAA GAA AGC GTT GAA AGG CTT TTA ATC CTT 536 Tyr Arg Pro Lys His Phe Glu Glu Ser Val Glu Arg Leu Leu He Leu 150 155 160
CAA AAA ATC AGC ACT CTA TTC CCC AAA ACC ACT ACC CCT TTG GAG CAA 584 Gin Lys He Ser Thr Leu Phe Pro Lys Thr Thr Thr Pro Leu Glu Gin 165 170 175
TCC AGC CTA TCG CTT TGG GCA AAA TTG CAA GAC AAA TTA GAC ATT CTT 632 Ser Ser Leu Ser Leu Trp Ala Lys Leu Gin Asp Lys Leu Asp He Leu 180 185 190
ATC CTA AAC CCT AGT GAT GTT AAA ATC TCT CTT AAT GAA GAA GAG ATG 680 He Leu Asn Pro Ser Asp Val Lys He Ser Leu Asn Glu Glu Glu Met 195 200 205 210
AAA AAA TAT TAC GAG TCC CAT AAA AAG GAT TTT AAA AAG CCC ACG AGC 728 Lys Lys Tyr Tyr Glu Ser His Lys Lys Asp Phe Lys Lys Pro Thr Ser 215 220 225
TTT AAA ACA CGC TCT TTA TAT TTT GAC GCT AGT TTG GAA AAA CCT GAT 776 Phe Lys Thr Arg Ser Leu Tyr Phe Asp Ala Ser Leu Glu Lys Pro Asp 230 235 240 TTG AAG GAG TTG GAG GAA TAC TAC CAT AAA AAC AAG GTG TCT TAT TTG 824 Leu Lys Glu Leu Glu Glu Tyr Tyr His Lys Asn Lys Val Ser Tyr Leu 245 250 255
GAC AAA GAG GGG AAA TTG CAG GAT TTT AAA AGC GTT CAA GAG CAA GTC 872 Asp Lys Glu Gly Lys Leu Gin Asp Phe Lys Ser Val Gin Glu Gin Val 260 265 270
AAG CAT GAT TTA AGC ATG CAA AAA GCG AAT GAA AAA GCC TTA AGG AGC 920 Lys His Asp Leu Ser Met Gin Lys Ala Asn Glu Lys Ala Leu Arg Ser 275 280 285 290
TAT ATC GCT CTA AAA AAA GCG AAC GCG CAA AAC TAC ACC ACA CAA GAT 968 Tyr He Ala Leu Lys Lys Ala Asn Ala Gin Asn Tyr Thr Thr Gin Asp 295 300 305
TTT GAA GAG AAC AAC TCC CCC TAT ACT GCT GAA ATC ACG CAA AAA CTC 1016 Phe Glu Glu Asn Asn Ser Pro Tyr Thr Ala Glu He Thr Gin Lys Leu 310 315 320
ACC GCT CTC AAA CCC CTT GAA ATC CTA AAG CCA GAG CCT TTT AAA GAT 1064 Thr Ala Leu Lys Pro Leu Glu He Leu Lys Pro Glu Pro Phe Lys Asp 325 330 335
GGT TTT ATT GTG GTG CAA CTC ATC TCT CAA ATT AAA GAC GAA TTG CAA 1112 Gly Phe He Val Val Gin Leu He Ser Gin He Lys Asp Glu Leu Gin 340 345 350
AAT TTT AAT GAA GCT AAA AGC GCT CTT AAA ACC CGC CTA ACT CAA GAA 1160 Asn Phe Asn Glu Ala Lys Ser Ala Leu Lys Thr Arg Leu Thr Gin Glu 355 360 365 370
AAA ACC CTT ATG GCG TTG CAA ACT TTA GCC AAA GAA AAG CTT AAG GAT 1208 Lys Thr Leu Met Ala Leu Gin Thr Leu Ala Lys Glu Lys Leu Lys Asp 375 380 385
TTT AAG GGC AAA AGC GTG GGC TAT GTA AGC CCT AAT TTT GGA GGC ACT 1256 Phe Lys Gly Lys Ser Val Gly Tyr Val Ser Pro Asn Phe Gly Gly Thr 390 395 400
ATT AGT GAG CTT AAC CAA GAA GAA AGT GCT AAG TTT ATC AAC GCT CTT 1304 He Ser Glu Leu Asn Gin Glu Glu Ser Ala Lys Phe He Asn Ala Leu 405 410 415
TTT AAC CGC CAG GAA AAA AAG GGG TTT ATC GCT ATT AAT AAT AAA GTG 1352 Phe Asn Arg Gin Glu Lys Lys Gly Phe He Ala He Asn Asn Lys Val 420 425 430
GTG CTC TAT CAA ATC ACA GAA CAA AAT TTC AAC CAC TCA TTT AGT GCA 1400 Val Leu Tyr Gin He Thr Glu Gin Asn Phe Asn His Ser Phe Ser Ala 435 440 445 450
GAA GAA AGC CAG TAT ATG CAG CGT TTA GTC AAT AAC ACT AAA ACG GAT 1448 Glu Glu Ser Gin Tyr Met Gin Arg Leu Val Asn Asn Thr Lys Thr Asp 455 460 465 TTT TTT GAT AAA GCG TTG ATA GAA GAA TTG AAA AAA CGC TAT AAG ATA 1496 Phe Phe Asp Lys Ala Leu He Glu Glu Leu Lys Lys Arg Tyr Lys He 470 475 480
GTC AAA TAC ATT CAA TAAATGCAAG GGGAAATCAT GGAACATAAA GAAATCGTTA T 1552 Val Lys Tyr He Gin 485
AGGGGTTGAT CT 1564
(2) INFORMATION FOR SEQ ID NO: 210:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 487 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210:
Met He Glu Trp Met Gin Asn His Arg Lys Tyr Leu Val Val Thr He
1 5 10 15
Trp He Ser Thr He Ala Phe He Ala Ala Gly Met He Gly Trp Gly
20 25 30
Gin Tyr Ser Phe Ser Leu Asp Ser Asp Ser Ala Ala Lys Val Gly Gin
35 40 45
He Lys He Ser Gin Glu Glu Leu Ala Gin Glu Tyr Arg Arg Leu Lys
50 55 60
Asp Ala Tyr Ala Glu Ser He Pro Asp Phe Lys Glu Leu Thr Lys Asp 65 70 75 80
Gin He Lys Ala Met His Leu Glu Lys Ser Ala Leu Asp Ser Leu He
85 90 95
Asn Gin Ala Leu Leu Arg Asn Leu Ala Leu Asp Leu Gly Leu Gly Ala
100 105 110
Thr Lys Gin Glu Val Ala Lys Glu He Arg Lys Thr Ser Val Phe Gin
115 120 125
Lys Asp Gly Val Phe Asp Glu Glu Leu Tyr Lys Asn He Leu Lys Gin
130 135 140
Ser His Tyr Arg Pro Lys His Phe Glu Glu Ser Val Glu Arg Leu Leu 145 150 155 160
He Leu Gin Lys He Ser Thr Leu Phe Pro Lys Thr Thr Thr Pro Leu
165 170 175
Glu Gin Ser Ser Leu Ser Leu Trp Ala Lys Leu Gin Asp Lys Leu Asp
180 185 190
He Leu He Leu Asn Pro Ser Asp Val Lys He Ser Leu Asn Glu Glu
195 200 205
Glu Met Lys Lys Tyr Tyr Glu Ser His Lys Lys Asp Phe Lys Lys Pro
210 215 220
Thr Ser Phe Lys Thr Arg Ser Leu Tyr Phe Asp Ala Ser Leu Glu Lys 225 230 235 240
Pro Asp Leu Lys Glu Leu Glu Glu Tyr Tyr His Lys Asn Lys Val Ser 245 250 255 Tyr Leu Asp Lys Glu Gly Lys Leu Gin Asp Phe Lys Ser Val Gin Glu
260 265 270
Gin Val Lys His Asp Leu Ser Met Gin Lys Ala Asn Glu Lys Ala Leu
275 280 285
Arg Ser Tyr He Ala Leu Lys Lys Ala Asn Ala Gin Asn Tyr Thr Thr
290 295 300
Gin Asp Phe Glu Glu Asn Asn Ser Pro Tyr Thr Ala Glu He Thr Gin 305 310 315 320
Lys Leu Thr Ala Leu Lys Pro Leu Glu He Leu Lys Pro Glu Pro Phe
325 330 335
Lys Asp Gly Phe He Val Val Gin Leu He Ser Gin He Lys Asp Glu
340 345 350
Leu Gin Asn Phe Asn Glu Ala Lys Ser Ala Leu Lys Thr Arg Leu Thr
355 360 365
Gin Glu Lys Thr Leu Met Ala Leu Gin Thr Leu Ala Lys Glu Lys Leu
370 375 380
Lys Asp Phe Lys Gly Lys Ser Val Gly Tyr Val Ser Pro Asn Phe Gly 385 390 395 400
Gly Thr He Ser Glu Leu Asn Gin Glu Glu Ser Ala Lys Phe He Asn
405 410 415
Ala Leu Phe Asn Arg Gin Glu Lys Lys Gly Phe He Ala He Asn Asn
420 425 430
Lys Val Val Leu Tyr Gin He Thr Glu Gin Asn Phe Asn His Ser Phe
435 440 445
Ser Ala Glu Glu Ser Gin Tyr Met Gin Arg Leu Val Asn Asn Thr Lys
450 455 460
Thr Asp Phe Phe Asp Lys Ala Leu He Glu Glu Leu Lys Lys Arg Tyr 465 470 475 480
Lys He Val Lys Tyr He Gin 485
(2) INFORMATION FOR SEQ ID NO: 211:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1435 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1382 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211:
CAAAAAGATG GATTTTTGAG CGTCAAAGAG GCTATAGGAG CGGATTTAAG ATG AAA 56
Met Lys 1 CAT TTT TCT GTT AAA AGA CTT TTA GGG CTT AGT TCT GTC TTG TTA GTC 104 His Phe Ser Val Lys Arg Leu Leu Gly Leu Ser Ser Val Leu Leu Val 5 10 15
ACT TTA GGA GCG AGC ATG CAC GCA CAA TCT TAC TTA CCC AAA CAT GAG 152 Thr Leu Gly Ala Ser Met His Ala Gin Ser Tyr Leu Pro Lys His Glu 20 25 30
AGC GTT ACC TTA AAA AAC GGG TTG CAA GTC GTG AGC GTC CCC CTA GAA 200 Ser Val Thr Leu Lys Asn Gly Leu Gin Val Val Ser Val Pro Leu Glu 35 40 45 50
AAT AAA ACC GGG GTT ATA GAA GTG GAT GTG CTT TAT AAA GTC GGC TCT 248 Asn Lys Thr Gly Val He Glu Val Asp Val Leu Tyr Lys Val Gly Ser 55 60 65
AGA AAC GAA ACC ATG GGA AAG AGC GGG ATC GCT CAC ATG TTA GAG CAT 296 Arg Asn Glu Thr Met Gly Lys Ser Gly He Ala His Met Leu Glu His 70 75 80
TTG AAT TTT AAA AGC ACC AAA AAC CTT AAA GCC GGC GAA TTT GAT AAA 344 Leu Asn Phe Lys Ser Thr Lys Asn Leu Lys Ala Gly Glu Phe Asp Lys 85 90 95
ATC GTT AAG CGT TTT GGG GGC GTG AGT AAC GCT TCT ACG AGT TTT GAT 392 He Val Lys Arg Phe Gly Gly Val Ser Asn Ala Ser Thr Ser Phe Asp 100 105 110
ATT ACG CGC TAC TTC ATT AAA ACC AGT CAG GCT AAC TTG GAT AAG TCT 440 He Thr Arg Tyr Phe He Lys Thr Ser Gin Ala Asn Leu Asp Lys Ser 115 120 125 130
TTA GAA TTG TTC GCT GAA ACC ATG GGT TCA TTG AAT TTA AAA GAA GAT 488 Leu Glu Leu Phe Ala Glu Thr Met Gly Ser Leu Asn Leu Lys Glu Asp 135 140 145
GAG TTT TTG CCT GAG CGT CAA GTG GTC GCT GAA GAA AGG CGA TGG CGC 536 Glu Phe Leu Pro Glu Arg Gin Val Val Ala Glu Glu Arg Arg Trp Arg 150 155 160
ACT GAT AAT TCC CCT ATC GGC ATG CTT TAT TTC CGC TTT TTT AAC ACC 584 Thr Asp Asn Ser Pro He Gly Met Leu Tyr Phe Arg Phe Phe Asn Thr 165 170 175
GCT TAT GTC TAT CAC CCC TAC CAT TGG ACG CCC ATT GGT TTT ATG GAT 632 Ala Tyr Val Tyr His Pro Tyr His Trp Thr Pro He Gly Phe Met Asp 180 185 190
GAT ATT CAA AAT TGG ACT TTA AAA GAC ATT AAA AAA TTC CAT TCG CTC 680 Asp He Gin Asn Trp Thr Leu Lys Asp He Lys Lys Phe His Ser Leu 195 200 205 210
TAT TAT CAG CCT AAA AAC GCT ATC GTT TTG GTG GTA GGC GAT GTC AAT 728 Tyr Tyr Gin Pro Lys Asn Ala He Val Leu Val Val Gly Asp Val Asn 215 220 225 TCC CAA AAG GTT TTT GAA TTG AGT AAA AAG CAT TTT GAA TCC TTA AAA 776 Ser Gin Lys Val Phe Glu Leu Ser Lys Lys His Phe Glu Ser Leu Lys 230 235 240
AAC CTT GAT GAA AAA GCT ATC CCC ACC CCT TAC ATG AAA GAG CCT AAG 824 Asn Leu Asp Glu Lys Ala He Pro Thr Pro Tyr Met Lys Glu Pro Lys 245 250 255
CAA GAT GGA GCC AGA ACG GCA GTC GTG CAT AAA GAT GGG GTC CAT TTA 872 Gin Asp Gly Ala Arg Thr Ala Val Val His Lys Asp Gly Val His Leu 260 265 270
GAA TGG GTG GCC CTT GGG TAT AAA GTG CCT GCT TTC AAG CAT AAA GAT 920 Glu Trp Val Ala Leu Gly Tyr Lys Val Pro Ala Phe Lys His Lys Asp 275 280 285 290
CAA GTC GCC TTA GAC GCA CTA AGT AGG CTT TTA GGC GAA GGC AAA AGC 968 Gin Val Ala Leu Asp Ala Leu Ser Arg Leu Leu Gly Glu Gly Lys Ser 295 300 305
TCG TGG TTG CAA AGC GAA TTA GTG GAT AAA AAA CGC TTG GCT TCT CAA 1016 Ser Trp Leu Gin Ser Glu Leu Val Asp Lys Lys Arg Leu Ala Ser Gin 310 315 320
GCT TTC TCG CAC AAC ATG CAA TTA CAA GAT GAA AGC GTG TTT TTA TTC 1064 Ala Phe Ser His Asn Met Gin Leu Gin Asp Glu Ser Val Phe Leu Phe 325 330 335
ATT GCG GGG GGT AAT CCT AAT GTC AAA GCC GAA GCC TTA CAA AAA GAA 1112 He Ala Gly Gly Asn Pro Asn Val Lys Ala Glu Ala Leu Gin Lys Glu 340 345 350
ATC GTA GCG CTT TTA GAA AAG CTG AAA AAA GGC GAA ATC ACT CAA GCG 1160 He Val Ala Leu Leu Glu Lys Leu Lys Lys Gly Glu He Thr Gin Ala 355 360 365 370
GAA TTA GAC AAG CTC AAA ATC AAT CAA AAA GCT GAC TTT ATT TCT AAT 1208 Glu Leu Asp Lys Leu Lys He Asn Gin Lys Ala Asp Phe He Ser Asn 375 380 385
TTA GAA AGT TCT AGC GAT GTT GCG GGG CTT TTT GCG GAC TAT TTA GTG 1256 Leu Glu Ser Ser Ser Asp Val Ala Gly Leu Phe Ala Asp Tyr Leu Val 390 395 400
CAA AAC GAT ATT CAA GGC TTG ACG GAT TAC CAG CGA CAA TTT TTG GAT 1304 Gin Asn Asp He Gin Gly Leu Thr Asp Tyr Gin Arg Gin Phe Leu Asp 405 410 415
TTA AAA GTG AGC GAT TTG GTG CGT GTG GCC AAT GAA TAT TTT AAA GAC 1352 Leu Lys Val Ser Asp Leu Val Arg Val Ala Asn Glu Tyr Phe Lys Asp 420 425 430
ACC CAA TCA ACC ACC GTG TTT TTG AAA CCT TAAAAGAGCC TTATAACATG CAA 1405 Thr Gin Ser Thr Thr Val Phe Leu Lys Pro 435 440 TTTCATTCAT CTAGCGCGTT GATTACGCCT 1435
(2) INFORMATION FOR SEQ ID NO: 212:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 444 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE : internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212:
Met Lys His Phe Ser Val Lys Arg Leu Leu Gly Leu Ser Ser Val Leu
1 5 10 15
Leu Val Thr Leu Gly Ala Ser Met His Ala Gin Ser Tyr Leu Pro Lys
20 25 30
His Glu Ser Val Thr Leu Lys Asn Gly Leu Gin Val Val Ser Val Pro
35 40 45
Leu Glu Asn Lys Thr Gly Val He Glu Val Asp Val Leu Tyr Lys Val
50 55 60
Gly Ser Arg Asn Glu Thr Met Gly Lys Ser Gly He Ala His Met Leu 65 70 75 80
Glu His Leu Asn Phe Lys Ser Thr Lys Asn Leu Lys Ala Gly Glu Phe
85 90 95
Asp Lys He Val Lys Arg Phe Gly Gly Val Ser Asn Ala Ser Thr Ser
100 105 110
Phe Asp He Thr Arg Tyr Phe He Lys Thr Ser Gin Ala Asn Leu Asp
115 120 125
Lys Ser Leu Glu Leu Phe Ala Glu Thr Met Gly Ser Leu Asn Leu Lys
130 135 140
Glu Asp Glu Phe Leu Pro Glu Arg Gin Val Val Ala Glu Glu Arg Arg 145 150 155 160
Trp Arg Thr Asp Asn Ser Pro He Gly Met Leu Tyr Phe Arg Phe Phe
165 170 175
Asn Thr Ala Tyr Val Tyr His Pro Tyr His Trp Thr Pro He Gly Phe
180 185 190
Met Asp Asp He Gin Asn Trp Thr Leu Lys Asp He Lys Lys Phe His
195 200 205
Ser Leu Tyr Tyr Gin Pro Lys Asn Ala He Val Leu Val Val Gly Asp
210 215 220
Val Asn Ser Gin Lys Val Phe Glu Leu Ser Lys Lys His Phe Glu Ser 225 230 235 240
Leu Lys Asn Leu Asp Glu Lys Ala He Pro Thr Pro Tyr Met Lys Glu
245 250 255
Pro Lys Gin Asp Gly Ala Arg Thr Ala Val Val His Lys Asp Gly Val
260 265 270
His Leu Glu Trp Val Ala Leu Gly Tyr Lys Val Pro Ala Phe Lys His
275 280 285
Lys Asp Gin Val Ala Leu Asp Ala Leu Ser Arg Leu Leu Gly Glu Gly
290 295 300
Lys Ser Ser Trp Leu Gin Ser Glu Leu Val Asp Lys Lys Arg Leu Ala 305 310 315 320 Ser Gin Ala Phe Ser His Asn Met Gin Leu Gin Asp Glu Ser Val Phe
325 330 335
Leu Phe He Ala Gly Gly Asn Pro Asn Val Lys Ala Glu Ala Leu Gin
340 345 350
Lys Glu He Val Ala Leu Leu Glu Lys Leu Lys Lys Gly Glu He Thr
355 360 365
Gin Ala Glu Leu Asp Lys Leu Lys He Asn Gin Lys Ala Asp Phe He
370 375 380
Ser Asn Leu Glu Ser Ser Ser Asp Val Ala Gly Leu Phe Ala Asp Tyr 385 390 395 400
Leu Val Gin Asn Asp He Gin Gly Leu Thr Asp Tyr Gin Arg Gin Phe
405 410 415
Leu Asp Leu Lys Val Ser Asp Leu Val Arg Val Ala Asn Glu Tyr Phe
420 425 430
Lys Asp Thr Gin Ser Thr Thr Val Phe Leu Lys Pro 435 440
(2) INFORMATION FOR SEQ ID NO: 213:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 250 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...197 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:213:
ATATAATCTT TTCTTAATTT TGAAGTTTAG CAAATTTTAA GGAAGTAACC ATG ATG 56
Met Met 1
AAA AAA ACC CTT TTT ATC TCT TTG GCT TTA GCG TTA AGC TTG AAT GCG 104 Lys Lys Thr Leu Phe He Ser Leu Ala Leu Ala Leu Ser Leu Asn Ala 5 10 15
GGC AAT ATC CAA ATC CAG AGC ATG CCC AAA GTT AAA GAG CGA GTG AGT 152 Gly Asn He Gin He Gin Ser Met Pro Lys Val Lys Glu Arg Val Ser 20 25 30
GTC CCC TCT AAA GAC GAT ACG GAT CTA TTC TTA CCA CGA TTC TAT TAAGG 202 Val Pro Ser Lys Asp Asp Thr Asp Leu Phe Leu Pro Arg Phe Tyr 35 40 45
ACTCTATTAA GGCGGTGGTG AATATCTCCA CTGAAAAGAA GATTAAAA 250 (2) INFORMATION FOR SEQ ID NO: 214:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 49 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214:
Met Met Lys Lys Thr Leu Phe He Ser Leu Ala Leu Ala Leu Ser Leu
1 5 10 15
Asn Ala Gly Asn He Gin He Gin Ser Met Pro Lys Val Lys Glu Arg
20 25 30
Val Ser Val Pro Ser Lys Asp Asp Thr Asp Leu Phe Leu Pro Arg Phe
35 40 45
Tyr
(2) INFORMATION FOR SEQ ID NO: 215:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 328 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...275 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215:
TTAAGATTGC GGTTATACTG AAAAAAACAA TATGAAATCA AGGAGCTTGT ATG CAA 56
Met Gin 1
CAG CGT CAT TTA GGC CCT TTA AAA GTG GGT GCA TTA GCT CTA GGG TGC 104 Gin Arg His Leu Gly Pro Leu Lys Val Gly Ala Leu Ala Leu Gly Cys 5 10 15
ATG GGC ATG ACT TAT GGG TAT GGG GAA GTC CAT GAT AAA AAG CAG ATG 152 Met Gly Met Thr Tyr Gly Tyr Gly Glu Val His Asp Lys Lys Gin Met 20 25 30
GTT AAA CTT ATC CAT AAG GCT TTG GAA TTG GGT ATT AAC TTT TTT GAC 200 Val Lys Leu He His Lys Ala Leu Glu Leu Gly He Asn Phe Phe Asp 35 40 45 50
ACT GCA GAG GCT TAT GGG GAA GAT AAT GAA AAG CTT TTA GGC GAA CGA 248 Thr Ala Glu Ala Tyr Gly Glu Asp Asn Glu Lys Leu Leu Gly Glu Arg 55 60 65
TCA AGC CTT TTA AAG ACA AGG TTG TGG TAGCGAGCAA GTTTGGGATT TACTACG 302 Ser Ser Leu Leu Lys Thr Arg Leu Trp 70 75
CAGATCCTAA TGACAAATAC GCAACC 328
(2) INFORMATION FOR SEQ ID NO: 216:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 75 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal'
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216:
Met Gin Gin Arg His Leu Gly Pro Leu Lys Val Gly Ala Leu Ala Leu
1 5 10 15
Gly Cys Met Gly Met Thr Tyr Gly Tyr Gly Glu Val His Asp Lys Lys
20 25 30
Gin Met Val Lys Leu He His Lys Ala Leu Glu Leu Gly He Asn Phe
35 40 45
Phe Asp Thr Ala Glu Ala Tyr Gly Glu Asp Asn Glu Lys Leu Leu Gly
50 55 60
Glu Arg Ser Ser Leu Leu Lys Thr Arg Leu Trp 65 70 75
(2) INFORMATION FOR SEQ ID NO: 217:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 649 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...596 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:217:
CGAATTGCTG TATAGTTAGC GTTTTTAATT CAAAATGAAG TGAGGAAACA ATG AAA 56
Met Lys 1
AAA GCG TTA ATA TCC ACC CTT TTT GGT GTT AGT TTG GCG TTT GCA AAA 104 Lys Ala Leu He Ser Thr Leu Phe Gly Val Ser Leu Ala Phe Ala Lys 5 10 15
CCT TAT ACG ATT GAT AAG GCA AAC TCT AGC GTG TGG TTT GAG GTC AAA 152 Pro Tyr Thr He Asp Lys Ala Asn Ser Ser Val Trp Phe Glu Val Lys 20 25 30
CAC TTC ACG TTC AAT GAA ACA AGA GGC GCG TTT GAT AAT TTT GAT GGC 200 His Phe Thr Phe Asn Glu Thr Arg Gly Ala Phe Asp Asn Phe Asp Gly 35 40 45 50
AAA ATT GAT CTA GAG CCC AAC ACT AAA ATG CTC AGC GTT TTT GAA GGC 248 Lys He Asp Leu Glu Pro Asn Thr Lys Met Leu Ser Val Phe Glu Gly 55 60 65
AAT ATT GAT GTG AAA AGC GTC AAT ACT AGG GAT AGA AAA AGA GAT AAC 296 Asn He Asp Val Lys Ser Val Asn Thr Arg Asp Arg Lys Arg Asp Asn 70 75 80
CAC TTG AAA ACA GCG GAC TTT TTT GAT GTG GTA AAA TAC CCC AAA GGG 344 His Leu Lys Thr Ala Asp Phe Phe Asp Val Val Lys Tyr Pro Lys Gly 85 90 95
AGC TTT AAA ATG ACC AAA TAC GAA GAT GGT AAA ATC TAT GGG GAT TTG 392 Ser Phe Lys Met Thr Lys Tyr Glu Asp Gly Lys He Tyr Gly Asp Leu 100 105 110
ACT CTT CGT GGC GTA ACC AAG CCT GTC GTA TTG GAA GCC AAA ATC CAA 440 Thr Leu Arg Gly Val Thr Lys Pro Val Val Leu Glu Ala Lys He Gin 115 120 125 130
GCC CCC TTA CAA AAC CCC ATG AAT AAA AAA GAA TTC ATG GTG TTA CAA 488 Ala Pro Leu Gin Asn Pro Met Asn Lys Lys Glu Phe Met Val Leu Gin 135 140 145
GCT GAA GGC AAA ATC AAC CGC AAG GAT TTT GGT ATC GGT AAA ACC TTT 536 Ala Glu Gly Lys He Asn Arg Lys Asp Phe Gly He Gly Lys Thr Phe 150 155 160
AGC GAT GCT GTC GTT GGA GAT GAG GTA AAG ATT GAG CTC AAA CTA GAA 584 Ser Asp Ala Val Val Gly Asp Glu Val Lys He Glu Leu Lys Leu Glu 165 170 175
GCT TAC GCC CAA TAATCGTTTT GCAAGAGATA GATATCTTCT TCTCTTGCGT TTTTC 641 Ala Tyr Ala Gin 180
TAACAGCA 649 (2) INFORMATION FOR SEQ ID NO:218:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 182 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218:
Met Lys Lys Ala Leu He Ser Thr Leu Phe Gly Val Ser Leu Ala Phe
1 5 10 15
Ala Lys Pro Tyr Thr He Asp Lys Ala Asn Ser Ser Val Trp Phe Glu
20 25 30
Val Lys His Phe Thr Phe Asn Glu Thr Arg Gly Ala Phe Asp Asn Phe
35 40 45
Asp Gly Lys He Asp Leu Glu Pro Asn Thr Lys Met Leu Ser Val Phe
50 55 60
Glu Gly Asn He Asp Val Lys Ser Val Asn Thr Arg Asp Arg Lys Arg 65 70 75 80
Asp Asn His Leu Lys Thr Ala Asp Phe Phe Asp Val Val Lys Tyr Pro
85 90 95
Lys Gly Ser Phe Lys Met Thr Lys Tyr Glu Asp Gly Lys He Tyr Gly
100 105 110
Asp Leu Thr Leu Arg Gly Val Thr Lys Pro Val Val Leu Glu Ala Lys
115 120 125
He Gin Ala Pro Leu Gin Asn Pro Met Asn Lys Lys Glu Phe Met Val
130 135 140
Leu Gin Ala Glu Gly Lys He Asn Arg Lys Asp Phe Gly He Gly Lys 145 150 155 160
Thr Phe Ser Asp Ala Val Val Gly Asp Glu Val Lys He Glu Leu Lys
165 170 175
Leu Glu Ala Tyr Ala Gin 180
(2) INFORMATION FOR SEQ ID NO: 219:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 478 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...425 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219:
TCCGTTCGCA ACAAGAATTT TCTTGTTATC TTAATGTAAA GGTCAAAACG ATG AAA 56
Met Lys 1
AAG TTA GCC GCT TTA TTT TTA GTA AGC GTG TTG GGG GTT ATG GGT TTA 104 Lys Leu Ala Ala Leu Phe Leu Val Ser Val Leu Gly Val Met Gly Leu 5 10 15
AAC GCA TGG GAG CAA ACC CTA AAA GCT AAT GAC TTG GAA GTG AAA ATC 152 Asn Ala Trp Glu Gin Thr Leu Lys Ala Asn Asp Leu Glu Val Lys He 20 25 30
AAA TCC GTG GGT AAC CCC ATT AAA GGC GAT AAC ACT TTC ATT CTC AGC 200 Lys Ser Val Gly Asn Pro He Lys Gly Asp Asn Thr Phe He Leu Ser 35 40 45 50
CCC ACT TTA AAA GGT AAG GCT TTA GAA AAA GCT ATC GTT AGG GTG CAG 248 Pro Thr Leu Lys Gly Lys Ala Leu Glu Lys Ala He Val Arg Val Gin 55 60 65
TTT ATG ATG CCT GAA ATG CCC GGC ATG CCA GCG ATG AAA GAA ATG GCG 296 Phe Met Met Pro Glu Met Pro Gly Met Pro Ala Met Lys Glu Met Ala 70 75 80
CAA GTG AGT GAA AAA AAC GGC CTT TAT GAA GCT AAA ACC AAT CTT TCT 344 Gin Val Ser Glu Lys Asn Gly Leu Tyr Glu Ala Lys Thr Asn Leu Ser 85 90 95
ATG AAC GGG ACA TGG CAG GTT AGG GTG GAT ATT AAA TCT AAA GAG GGT 392 Met Asn Gly Thr Trp Gin Val Arg Val Asp He Lys Ser Lys Glu Gly 100 105 110
CAG GTT TAT CGC GCT AAA ACA AGC CTG GAT TTA TAAGAGCATG CTATCTTTTA 445 Gin Val Tyr Arg Ala Lys Thr Ser Leu Asp Leu 115 120 125
TAAGCGCGTT TGATAAAAGG GGCGTTTCAA TAC 478
(2) INFORMATION FOR SEQ ID NO: 220:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 125 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220:
Met Lys Lys Leu Ala Ala Leu Phe Leu Val Ser Val Leu Gly Val Met 1 5 10 15 Gly Leu Asn Ala Trp Glu Gin Thr Leu Lys Ala Asn Asp Leu Glu Val
20 25 30
Lys He Lys Ser Val Gly Asn Pro He Lys Gly Asp Asn Thr Phe He
35 40 45
Leu Ser Pro Thr Leu Lys Gly Lys Ala Leu Glu Lys Ala He Val Arg
50 55 60
Val Gin Phe Met Met Pro Glu Met Pro Gly Met Pro Ala Met Lys Glu 65 70 75 80
Met Ala Gin Val Ser Glu Lys Asn Gly Leu Tyr Glu Ala Lys Thr Asn
85 90 95
Leu Ser Met Asn Gly Thr Trp Gin Val Arg Val Asp He Lys Ser Lys
100 105 110
Glu Gly Gin Val Tyr Arg Ala Lys Thr Ser Leu Asp Leu 115 120 125
(2) INFORMATION FOR SEQ ID NO: 221:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1117 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1064 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221:
AGCGCTTTAA ATAGCACTTA TTTGTCTTTA CAAAACCTTA AAGGATTAGA ATG AAA 56
Met Lys 1
CGG ATT TTA TGG TTA GCC TTG ATT TTA TTT TTT AGC CCC TTA TTC GCT 104 Arg He Leu Trp Leu Ala Leu He Leu Phe Phe Ser Pro Leu Phe Ala 5 10 15
AAC GCT CAA AAA ACT CAA GAA ATT AAA AAA ACT AAA GAA GCT AAA AGC 152 Asn Ala Gin Lys Thr Gin Glu He Lys Lys Thr Lys Glu Ala Lys Ser 20 25 30
CAA ACC CGT TTT AAT ATT TCC ACC ACT AAG GTC ATA GAA AAA GAA TTT 200 Gin Thr Arg Phe Asn He Ser Thr Thr Lys Val He Glu Lys Glu Phe 35 40 45 50
TCT CAA AGC CGG CGC TAT TAC GCG CTT TTA GAG CCC AAT GAA GCG CTG 248 Ser Gin Ser Arg Arg Tyr Tyr Ala Leu Leu Glu Pro Asn Glu Ala Leu 55 60 65 ATT TTT TCT CAA ACC CTG CGT TTT GAT GGC TAT GTG GAA AAG CTT TAT 296 He Phe Ser Gin Thr Leu Arg Phe Asp Gly Tyr Val Glu Lys Leu Tyr 70 75 80
GCG AAT AAA ACC TAT ACC CCC ATT AAA AAG GGC GAC AGG TTA TTG AGC 344 Ala Asn Lys Thr Tyr Thr Pro He Lys Lys Gly Asp Arg Leu Leu Ser 85 90 95
GTG TAT TCC CCT GAA TTA GTG AGC GCT CAA AGC GAA TTG CTA TCA TCA 392 Val Tyr Ser Pro Glu Leu Val Ser Ala Gin Ser Glu Leu Leu Ser Ser 100 105 110
TTG AAA TTC AAC CAA CAA GTG GGA GCG ATT AAA GAA AAA TTA AAA CTA 440 Leu Lys Phe Asn Gin Gin Val Gly Ala He Lys Glu Lys Leu Lys Leu 115 120 125 130
TTA GGG TTA GAA AAC TCT AGC ATT GAA AAA ATC ATT AGC AGC CAT AAA 488 Leu Gly Leu Glu Asn Ser Ser He Glu Lys He He Ser Ser His Lys 135 140 145
GTC CAA AAT GAA ATG ACT ATT TAC TCT CAC TTC AAC GGC ATT ATT TTT 536 Val Gin Asn Glu Met Thr He Tyr Ser His Phe Asn Gly He He Phe 150 155 160
AAA AAA AGC CCG GAT CTC AAT GAG GGG AGC TTC ATT AAA AAA GGG CAA 584 Lys Lys Ser Pro Asp Leu Asn Glu Gly Ser Phe He Lys Lys Gly Gin 165 170 175
GAG TTG TTT CAA ATC ATA GAT TTA AGC CAA TTG TGG GCG CTG GTT AAA 632 Glu Leu Phe Gin He He Asp Leu Ser Gin Leu Trp Ala Leu Val Lys 180 185 190
GTC AAT CAA GAG GAT TTA GAA TTT TTA AAA AAC ACG CAT AAA GCG ATC 680 Val Asn Gin Glu Asp Leu Glu Phe Leu Lys Asn Thr His Lys Ala He 195 200 205 210
TTG TTT GTA GAA GGG ATT AAA GGC GAG CAA GAA ATC ACG CTT GAA AAT 728 Leu Phe Val Glu Gly He Lys Gly Glu Gin Glu He Thr Leu Glu Asn 215 220 225
ATC AAC CCC ATC ATC AAC AAA GAA GAT AAA ATG CTA GAA GCG CGC TTC 776 He Asn Pro He He Asn Lys Glu Asp Lys Met Leu Glu Ala Arg Phe 230 235 240
AAT GTG CCT AAT GTT AAA CAG ATT TAT TAC CCT AAC ATG TTC GCT CAA 824 Asn Val Pro Asn Val Lys Gin He Tyr Tyr Pro Asn Met Phe Ala Gin 245 250 255
GTA GAA ATC TTT CAA AAA CCA CAA AAA ATG AAG ATT TTG CCT AAA GAA 872 Val Glu He Phe Gin Lys Pro Gin Lys Met Lys He Leu Pro Lys Glu 260 265 270
GCG GTT TTG ATT AAA GGG GGG AAA GCT ATC GTG TTT AAA AAA GAC GAT 920 Ala Val Leu He Lys Gly Gly Lys Ala He Val Phe Lys Lys Asp Asp 275 280 285 290 TTT GGC TTA AGC CCG TTA GAA ATT AAA GCC GTC CGC TTG AGC GAT GGG 968 Phe Gly Leu Ser Pro Leu Glu He Lys Ala Val Arg Leu Ser Asp Gly 295 300 305
AGT TAT GAG ATT TTA GAG GGT TTA AAG GCG GGC GAA GAA GTC GCT AAT 1016 Ser Tyr Glu He Leu Glu Gly Leu Lys Ala Gly Glu Glu Val Ala Asn 310 315 320
AAC GCT TTA TTC GTG CTA GAC GCT GAC GCT CAA AAC AAT GGG GAT TAT T 1065 Asn Ala Leu Phe Val Leu Asp Ala Asp Ala Gin Asn Asn Gly Asp Tyr 325 330 335
GAATGATAGA AAAGATCATT GATTTAAGCG TTAAAAACAA ACTCCTTACC AC 1117
(2) INFORMATION FOR SEQ ID NO: 222:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 338 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222:
Met Lys Arg He Leu Trp Leu Ala Leu He Leu Phe Phe Ser Pro Leu
1 5 10 15
Phe Ala Asn Ala Gin Lys Thr Gin Glu He Lys Lys Thr Lys Glu Ala
20 25 30
Lys Ser Gin Thr Arg Phe Asn He Ser Thr Thr Lys Val He Glu Lys
35 40 45
Glu Phe Ser Gin Ser Arg Arg Tyr Tyr Ala Leu Leu Glu Pro Asn Glu
50 55 60
Ala Leu He Phe Ser Gin Thr Leu Arg Phe Asp Gly Tyr Val Glu Lys 65 70 75 80
Leu Tyr Ala Asn Lys Thr Tyr Thr Pro He Lys Lys Gly Asp Arg Leu
85 90 95
Leu Ser Val Tyr Ser Pro Glu Leu Val Ser Ala Gin Ser Glu Leu Leu
100 105 110
Ser Ser Leu Lys Phe Asn Gin Gin Val Gly Ala He Lys Glu Lys Leu
115 120 125
Lys Leu Leu Gly Leu Glu Asn Ser Ser He Glu Lys He He Ser Ser
130 135 140
His Lys Val Gin Asn Glu Met Thr He Tyr Ser His Phe Asn Gly He 145 150 155 160
He Phe Lys Lys Ser Pro Asp Leu Asn Glu Gly Ser Phe He Lys Lys
165 170 175
Gly Gin Glu Leu Phe Gin He He Asp Leu Ser Gin Leu Trp Ala Leu
180 185 190
Val Lys Val Asn Gin Glu Asp Leu Glu Phe Leu Lys Asn Thr His Lys
195 200 205
Ala He Leu Phe Val Glu Gly He Lys Gly Glu Gin Glu He Thr Leu 210 215 220 Glu Asn He Asn Pro He He Asn Lys Glu Asp Lys Met Leu Glu Ala 225 230 235 240
Arg Phe Asn Val Pro Asn Val Lys Gin He Tyr Tyr Pro Asn Met Phe
245 250 255
Ala Gin Val Glu He Phe Gin Lys Pro Gin Lys Met Lys He Leu Pro
260 265 270
Lys Glu Ala Val Leu He Lys Gly Gly Lys Ala He Val Phe Lys Lys
275 280 285
Asp Asp Phe Gly Leu Ser Pro Leu Glu He Lys Ala Val Arg Leu Ser
290 295 300
Asp Gly Ser Tyr Glu He Leu Glu Gly Leu Lys Ala Gly Glu Glu Val 305 310 315 320
Ala Asn Asn Ala Leu Phe Val Leu Asp Ala Asp Ala Gin Asn Asn Gly
325 330 335
Asp Tyr
(2) INFORMATION FOR SEQ ID NO: 223:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1249 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1196 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:223:
AAAACTTAGA TAAAATAACA CGATAAAACC ATAGTAATAA AGATAACCCC ATG AGA 56
Met Arg 1
TTT TTT TGC TTT TTC TTA TTT TTT CTA ACC TTT TCA AAC GCA CAG ATA 104 Phe Phe Cys Phe Phe Leu Phe Phe Leu Thr Phe Ser Asn Ala Gin He 5 10 15
ATG ATG ACT TTT GAT TCT CAA ACT AAC GCC AAA CTC TCG CGC TCT AAC 152 Met Met Thr Phe Asp Ser Gin Thr Asn Ala Lys Leu Ser Arg Ser Asn 20 25 30
GAA CAG CTT TCA GAC ATG CTC TAT AAA CTC AAT GAA AGT TTA AGA ATC 200 Glu Gin Leu Ser Asp Met Leu Tyr Lys Leu Asn Glu Ser Leu Arg He 35 40 45 50
TAT CAA AGC GTG CTT TCC AAT AAC CAA GAT CAA CTC AAA GAA ATC AAA 248 Tyr Gin Ser Val Leu Ser Asn Asn Gin Asp Gin Leu Lys Glu He Lys 55 60 65
AAA GCT AAC AGC ACC CTA AAT AGC CAA AGG CGT TTT TTT AAC GCC AGC 296 Lys Ala Asn Ser Thr Leu Asn Ser Gin Arg Arg Phe Phe Asn Ala Ser 70 75 80
CAG ATC CGC CTT ATG GAC ACT GAT GCA CTA TTG AAA CAA AGC GCT TTG 344 Gin He Arg Leu Met Asp Thr Asp Ala Leu Leu Lys Gin Ser Ala Leu 85 90 95
GAA TTA GAA AAA TTA CAA GCT TTA GAA AAA CAC ATA AAA AAG GGC ATG 392 Glu Leu Glu Lys Leu Gin Ala Leu Glu Lys His He Lys Lys Gly Met 100 105 110
GAA CAA GAA CGC TTA ATA GAA GAA TCC CAA ACG CTT TTT TTA CAA GAG 440 Glu Gin Glu Arg Leu He Glu Glu Ser Gin Thr Leu Phe Leu Gin Glu 115 120 125 130
CAT TGC CCT TAT TTG AGC GGC GTT AAG AAT TTA GAA GAG GCT TCA AAC 488 His Cys Pro Tyr Leu Ser Gly Val Lys Asn Leu Glu Glu Ala Ser Asn 135 140 145
GCT TTA GAA GTC CAA GAG CAA AAC AAC GCC CTT TTC TTA CTC AAA GAG 536 Ala Leu Glu Val Gin Glu Gin Asn Asn Ala Leu Phe Leu Leu Lys Glu 150 155 160
CCT AAA CTC GCC CGT TTG CTC TCA CGA TTG GAT TTG ATG AGC GCT TTA 584 Pro Lys Leu Ala Arg Leu Leu Ser Arg Leu Asp Leu Met Ser Ala Leu 165 170 175
AAC GCC TTG TGC GAT CAG GTT TTA GAA AAC CAA GCC CAT AAC CAA CAA 632 Asn Ala Leu Cys Asp Gin Val Leu Glu Asn Gin Ala His Asn Gin Gin 180 185 190
TCC CAT AAC AAA ATT TTA GAA TAC AAC GCT CTT AAA AAC CAT GAT TTT 680 Ser His Asn Lys He Leu Glu Tyr Asn Ala Leu Lys Asn His Asp Phe 195 200 205 210
CAA GCC TAT AAA GCC ATG CGT TTG AAA AAA TTT AAA AAC AAG CTT CAA 728 Gin Ala Tyr Lys Ala Met Arg Leu Lys Lys Phe Lys Asn Lys Leu Gin 215 220 225
AGT CAA ATC CAA GCC CAA GAA GAC GCT CTA AAA ACC TTT TTA CCC TTA 776 Ser Gin He Gin Ala Gin Glu Asp Ala Leu Lys Thr Phe Leu Pro Leu 230 235 240
GAA AAA CGC TTG GAA ACT TTA AAA ACG CAT TTT TTA TGC GAT AAA GAA 824 Glu Lys Arg Leu Glu Thr Leu Lys Thr His Phe Leu Cys Asp Lys Glu 245 250 255
AAC CTA AAA TCA TGC GCT AAA GAA TTG CAC CAA CGC TAC CAA AAC GCC 872 Asn Leu Lys Ser Cys Ala Lys Glu Leu His Gin Arg Tyr Gin Asn Ala 260 265 270
CTT ATA GAG CGA GAT AAA GAA TTA AAA AAC GCT AAA AAT AAT AAA GAA 920 Leu He Glu Arg Asp Lys Glu Leu Lys Asn Ala Lys Asn Asn Lys Glu 275 280 285 290
AAG CAT GCT CTA ATC TTA GCC AAT TAC GAG CAT ACT TTA AAA ACC TTG 968 Lys His Ala Leu He Leu Ala Asn Tyr Glu His Thr Leu Lys Thr Leu 295 300 305
AAT ATA GAA TTT TTA AGC GAA TTA AAT AAG CAA ATG GCG TTT TTG AAT 1016 Asn He Glu Phe Leu Ser Glu Leu Asn Lys Gin Met Ala Phe Leu Asn 310 315 320
GAA ACC ATG GCG TTA AAC GCC CGA GTT TTA GCC CTT TTA GCC AAA CAG 1064 Glu Thr Met Ala Leu Asn Ala Arg Val Leu Ala Leu Leu Ala Lys Gin 325 330 335
CAT GCC AAA ACG CCA AAG CCT TTC AAT TTG AGC GGT GGT TTA AGC GGT 1112 His Ala Lys Thr Pro Lys Pro Phe Asn Leu Ser Gly Gly Leu Ser Gly 340 345 350
GAT TTG AGC GGT GGG AAA GCT CTT ATT AAA AAT ATC CGC TTA GAT CCG 1160 Asp Leu Ser Gly Gly Lys Ala Leu He Lys Asn He Arg Leu Asp Pro 355 360 365 370
CAT GGA TTC CCT AGC TTT AAA AAT TTT AAG CAA GAG TAGGACAATA TTTGAC 1212 His Gly Phe Pro Ser Phe Lys Asn Phe Lys Gin Glu 375 380
AAGCAAAAAC AATTATAGTA AAATAAGAGC ATAACTT 1249
(2) INFORMATION FOR SEQ ID NO:224:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 382 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:224:
Met Arg Phe Phe Cys Phe Phe Leu Phe Phe Leu Thr Phe Ser Asn Ala
1 5 10 15
Gin He Met Met Thr Phe Asp Ser Gin Thr Asn Ala Lys Leu Ser Arg
20 25 30
Ser Asn Glu Gin Leu Ser Asp Met Leu Tyr Lys Leu Asn Glu Ser Leu
35 40 45
Arg He Tyr Gin Ser Val Leu Ser Asn Asn Gin Asp Gin Leu Lys Glu
50 55 60
He Lys Lys Ala Asn Ser Thr Leu Asn Ser Gin Arg Arg Phe Phe Asn 65 70 75 80
Ala Ser Gin He Arg Leu Met Asp Thr Asp Ala Leu Leu Lys Gin Ser
85 90 95
Ala Leu Glu Leu Glu Lys Leu Gin Ala Leu Glu Lys His He Lys Lys 100 105 110
Gly Met Glu Gin Glu Arg Leu He Glu Glu Ser Gin Thr Leu Phe Leu
115 120 125
Gin Glu His Cys Pro Tyr Leu Ser Gly Val Lys Asn Leu Glu Glu Ala
130 135 140
Ser Asn Ala Leu Glu Val Gin Glu Gin Asn Asn Ala Leu Phe Leu Leu 145 150 155 160
Lys Glu Pro Lys Leu Ala Arg Leu Leu Ser Arg Leu Asp Leu Met Ser
165 170 175
Ala Leu Asn Ala Leu Cys Asp Gin Val Leu Glu Asn Gin Ala His Asn
180 185 190
Gin Gin Ser His Asn Lys He Leu Glu Tyr Asn Ala Leu Lys Asn His
195 200 205
Asp Phe Gin Ala Tyr Lys Ala Met Arg Leu Lys Lys Phe Lys Asn Lys
210 215 220
Leu Gin Ser Gin He Gin Ala Gin Glu Asp Ala Leu Lys Thr Phe Leu 225 230 235 240
Pro Leu Glu Lys Arg Leu Glu Thr Leu Lys Thr His Phe Leu Cys Asp
245 250 255
Lys Glu Asn Leu Lys Ser Cys Ala Lys Glu Leu His Gin Arg Tyr Gin
260 265 270
Asn Ala Leu He Glu Arg Asp Lys Glu Leu Lys Asn Ala Lys Asn Asn
275 280 285
Lys Glu Lys His Ala Leu He Leu Ala Asn Tyr Glu His Thr Leu Lys
290 295 300
Thr Leu Asn He Glu Phe Leu Ser Glu Leu Asn Lys Gin Met Ala Phe 305 310 315 320
Leu Asn Glu Thr Met Ala Leu Asn Ala Arg Val Leu Ala Leu Leu Ala
325 330 335
Lys Gin His Ala Lys Thr Pro Lys Pro Phe Asn Leu Ser Gly Gly Leu
340 345 350
Ser Gly Asp Leu Ser Gly Gly Lys Ala Leu He Lys Asn He Arg Leu
355 360 365
Asp Pro His Gly Phe Pro Ser Phe Lys Asn Phe Lys Gin Glu 370 375 380
(2) INFORMATION FOR SEQ ID NO: 225:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 490 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...437 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: TTGTTGAGAA AATCCGATGT TTTGAGCGAA AAATTCAGGA TCATGAAAAA ATG AAA 56
Met Lys 1
AGC ATC AGA AGA GGC GAT GGG CTG AAT GTT GTC CCT TTC ATT GAT ATT 104 Ser He Arg Arg Gly Asp Gly Leu Asn Val Val Pro Phe He Asp He 5 10 15
ATG CTC GTT TTG CTA GCG ATT GTG TTG AGC ATT TCT ACT TTT ATT GCA 152 Met Leu Val Leu Leu Ala He Val Leu Ser He Ser Thr Phe He Ala 20 25 30
CAA GGT AAG ATT AAG GTC AGT CTC CCT AAC GCT AAA AAT GCG GAA AAA 200 Gin Gly Lys He Lys Val Ser Leu Pro Asn Ala Lys Asn Ala Glu Lys 35 40 45 50
TCC CAG CCA AAC GAT CAA AAA GTG GTG GTC ATC TCT GTA GAT GAG CAT 248 Ser Gin Pro Asn Asp Gin Lys Val Val Val He Ser Val Asp Glu His 55 60 65
GAC AAT ATT TTC GTA GAT GAC AAA CCG ATG AAT TTA GAA GCT TTG AGC 296 Asp Asn He Phe Val Asp Asp Lys Pro Met Asn Leu Glu Ala Leu Ser 70 75 80
GCT GTA GTC AAA CAA ACA GAC CCT AAA ACC CTT ATA GAC TTA AAA AGC 344 Ala Val Val Lys Gin Thr Asp Pro Lys Thr Leu He Asp Leu Lys Ser 85 90 95
GAC AAA AGC TCT CGT TTT GAA ACT TTT ATC AGC ATT ATG GAT ATT TTA 392 Asp Lys Ser Ser Arg Phe Glu Thr Phe He Ser He Met Asp He Leu 100 105 110
AAA GAG CAT AAT CAT GAA AAT TTC TCC ATC TCC ACG CAA GCT CAG TAAAG 442 Lys Glu His Asn His Glu Asn Phe Ser He Ser Thr Gin Ala Gin 115 120 125
TTTCAACGAG TGTTAGCTTT TTAATCTCTT TTGCCCTATA CGCTATAG 490
(2) INFORMATION FOR SEQ ID NO: 226:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 129 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226:
Met Lys Ser He Arg Arg Gly Asp Gly Leu Asn Val Val Pro Phe He
1 5 10 15
Asp He Met Leu Val Leu Leu Ala He Val Leu Ser He Ser Thr Phe 20 25 30 He Ala Gin Gly Lys He Lys Val Ser Leu Pro Asn Ala Lys Asn Ala
35 40 45
Glu Lys Ser Gin Pro Asn Asp Gin Lys Val Val Val He Ser Val Asp
50 55 60
Glu His Asp Asn He Phe Val Asp Asp Lys Pro Met Asn Leu Glu Ala 65 70 75 80
Leu Ser Ala Val Val Lys Gin Thr Asp Pro Lys Thr Leu He Asp Leu
85 90 95
Lys Ser Asp Lys Ser Ser Arg Phe Glu Thr Phe He Ser He Met Asp
100 105 110
He Leu Lys Glu His Asn His Glu Asn Phe Ser He Ser Thr Gin Ala
115 120 125
Gin
(2) INFORMATION FOR SEQ ID NO: 227:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 958 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...905 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227:
TCGTTTTGAA ACTTTTATCA GCATTATGGA TATTTTAAAA GAGCATAATC ATG AAA 56
Met Lys 1
ATT TCT CCA TCT CCA CGC AAG CTC AGT AAA GTT TCA ACG AGT GTT AGC 104 He Ser Pro Ser Pro Arg Lys Leu Ser Lys Val Ser Thr Ser Val Ser 5 10 15
TTT TTA ATC TCT TTT GCC CTA TAC GCT ATA GGG TTT GGC TAT TTT TTA 152 Phe Leu He Ser Phe Ala Leu Tyr Ala He Gly Phe Gly Tyr Phe Leu 20 25 30
CTG CGC GAA GAC GCC CCA GAG CCT TTA GCG CAA GCC GGG ACC ACT AAG 200 Leu Arg Glu Asp Ala Pro Glu Pro Leu Ala Gin Ala Gly Thr Thr Lys 35 40 45 50
GTT ACC ATG AGT TTA GCC AGC ATC AAC ACT AAT TCC AAT ACA AAG ACT 248 Val Thr Met Ser Leu Ala Ser He Asn Thr Asn Ser Asn Thr Lys Thr 55 60 65 AAT GCT GAG TCG GCT AAA CCC AAA GAA GAG CCT AAA GAA AAA CCC AAG 296 Asn Ala Glu Ser Ala Lys Pro Lys Glu Glu Pro Lys Glu Lys Pro Lys 70 75 80
AAA GAA GAG CCA AAA AAA GAA GAA CCC AAA AAG GAG GTT ACA AAG CCT 344 Lys Glu Glu Pro Lys Lys Glu Glu Pro Lys Lys Glu Val Thr Lys Pro 85 90 95
AAA CCT AAG CCT AAA CCC AAG CCA AAG CCA AAA CCA AAA CCT AAG CCT 392 Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro 100 105 110
GAA CCC AAA CCT GAA CCA AAA CCC GAG CCT AAG CCT GAG CCT AAA GTT 440 Glu Pro Lys Pro Glu Pro Lys Pro Glu Pro Lys Pro Glu Pro Lys Val 115 120 125 130
GAA GAG GTT AAA AAA GAA GAG CCT AAA GAA GAG CCC AAA AAA GAA GAA 488 Glu Glu Val Lys Lys Glu Glu Pro Lys Glu Glu Pro Lys Lys Glu Glu 135 140 145
GCT AAA GAG GAA GCT AAA GAA AAA AGC GCT CCT AAA CAA GTA ACA ACT 536 Ala Lys Glu Glu Ala Lys Glu Lys Ser Ala Pro Lys Gin Val Thr Thr 150 155 160
AAG GAT ATA GTC AAA GAA AAA GAC AAG CAA GAA GAA TCC AAC AAA ACC 584 Lys Asp He Val Lys Glu Lys Asp Lys Gin Glu Glu Ser Asn Lys Thr 165 170 175
TCT GAG GGG GCC ACT TCT GAA GCT CAA GCT TAT AAC CCA GGG GTG AGC 632 Ser Glu Gly Ala Thr Ser Glu Ala Gin Ala Tyr Asn Pro Gly Val Ser 180 185 190
AAC GAA TTT TTA ATG AAG ATC CAA ACC GCT ATT TCT TCT AAA AAC CGC 680 Asn Glu Phe Leu Met Lys He Gin Thr Ala He Ser Ser Lys Asn Arg 195 200 205 210
TAC CCT AAA ATG GCG CAG ATT AGG GGT ATT GAG GGC GAA GTG TTG GTG 728 Tyr Pro Lys Met Ala Gin He Arg Gly He Glu Gly Glu Val Leu Val 215 220 225
AGC TTT ACG ATC AAT GCT GAT GGG AGC GTT ACG GAC ATT AAA GTG GTC 776 Ser Phe Thr He Asn Ala Asp Gly Ser Val Thr Asp He Lys Val Val 230 235 240
AAA AGC AAC ACC ACA GAT ATT TTA AAC CAT GCG GCT TTA GAA GCC ATT 824 Lys Ser Asn Thr Thr Asp He Leu Asn His Ala Ala Leu Glu Ala He 245 250 255
AAA AGC GCG GCA CAT CTA TTC CCT AAA CCA GAA GAA ACC GTG CAT CTA 872 Lys Ser Ala Ala His Leu Phe Pro Lys Pro Glu Glu Thr Val His Leu 260 265 270
AAA ATC CCT ATC GCT TAT AGC TTG AAA GAA GAC TGATTAGTCT TTCTTTTAGG 925 Lys He Pro He Ala Tyr Ser Leu Lys Glu Asp 275 280 285 GGCGATTCAA GCCTTAAAAG CCGGGTCAAA ATC 958
(2) INFORMATION FOR SEQ ID NO: 228:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 285 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS : single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228:
Met Lys He Ser Pro Ser Pro Arg Lys Leu Ser Lys Val Ser Thr Ser
1 5 10 15
Val Ser Phe Leu He Ser Phe Ala Leu Tyr Ala He Gly Phe Gly Tyr
20 25 30
Phe Leu Leu Arg Glu Asp Ala Pro Glu Pro Leu Ala Gin Ala Gly Thr
35 40 45
Thr Lys Val Thr Met Ser Leu Ala Ser He Asn Thr Asn Ser Asn Thr
50 55 60
Lys Thr Asn Ala Glu Ser Ala Lys Pro Lys Glu Glu *Pro Lys Glu Lys 65 70 75 80
Pro Lys Lys Glu Glu Pro Lys Lys Glu Glu Pro Lys Lys Glu Val Thr
85 90 95
Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro
100 105 110
Lys Pro Glu Pro Lys Pro Glu Pro Lys Pro Glu Pro Lys Pro Glu Pro
115 120 125
Lys Val Glu Glu Val Lys Lys Glu Glu Pro Lys Glu Glu Pro Lys Lys
130 135 140
Glu Glu Ala Lys Glu Glu Ala Lys Glu Lys Ser Ala Pro Lys Gin Val 145 150 155 160
Thr Thr Lys Asp He Val Lys Glu Lys Asp Lys Gin Glu Glu Ser Asn
165 170 175
Lys Thr Ser Glu Gly Ala Thr Ser Glu Ala Gin Ala Tyr Asn Pro Gly
180 185 190
Val Ser Asn Glu Phe Leu Met Lys He Gin Thr Ala He Ser Ser Lys
195 200 205
Asn Arg Tyr Pro Lys Met Ala Gin He Arg Gly He Glu Gly Glu Val
210 215 220
Leu Val Ser Phe Thr He Asn Ala Asp Gly Ser Val Thr Asp He Lys 225 230 235 240
Val Val Lys Ser Asn Thr Thr Asp He Leu Asn His Ala Ala Leu Glu
245 250 255
Ala He Lys Ser Ala Ala His Leu Phe Pro Lys Pro Glu Glu Thr Val
260 265 270
His Leu Lys He Pro He Ala Tyr Ser Leu Lys Glu Asp 275 280 285
(2) INFORMATION FOR SEQ ID NO: 229:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 757 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...704 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229:
TTGATGAAAA AAATAACGCT CCTCTTTTTA AAACTCTTTT AGAGGATGCC ATG AGA 56
Met Arg 1
GTG TCT TCT AAA GAG ATT TTA CTC ATT GTG GGG GGG AGC AGT TTT TAC 104 Val Ser Ser Lys Glu He Leu Leu He Val Gly Gly Ser Ser Phe Tyr 5 10 15
CTC AAA TCC ATT TTA GAA GGT TTG AGC CGC ATG CCA AAA CTG AGC GGT 152 Leu Lys Ser He Leu Glu Gly Leu Ser Arg Met Pro Lys Leu Ser Gly 20 25 30
GAG GAG GTT GTA AAA ATA GAG CGA GAA ATT GCC ACT CTT TCT AAC CCT 200 Glu Glu Val Val Lys He Glu Arg Glu He Ala Thr Leu Ser Asn Pro 35 40 45 50
TAT ATA TTT TTA AAA TCC ATT GAC CCT AAC ATG GCT TTT AAA ATC CAT 248 Tyr He Phe Leu Lys Ser He Asp Pro Asn Met Ala Phe Lys He His 55 60 65
CCA AAC GAC ACT TAC CGC ACC CAT AAG GCT TTA GAA ATC TTT TAT GCC 296 Pro Asn Asp Thr Tyr Arg Thr His Lys Ala Leu Glu He Phe Tyr Ala 70 75 80
ACC TGC ACG CCC CCA AGC GAG TAT TTT AAG GCC AAC CCT AAA AAA CCC 344 Thr Cys Thr Pro Pro Ser Glu Tyr Phe Lys Ala Asn Pro Lys Lys Pro 85 90 95
TTT GAG CAT GCT ATC TCC TTA TTC GCT CTG TCT ATT GAA AAA AGC GCG 392 Phe Glu His Ala He Ser Leu Phe Ala Leu Ser He Glu Lys Ser Ala 100 105 110
CTC CAT AAC AAT ATC AAA CGG CGC ACC AAA AAC ATG CTC CAT TCA GGG 440 Leu His Asn Asn He Lys Arg Arg Thr Lys Asn Met Leu His Ser Gly 115 120 125 130
CTT GTT GAA GAA ATC AAA GCC CTC TAT ACT CAA TAC CCT AAA GAT TCG 488 Leu Val Glu Glu He Lys Ala Leu Tyr Thr Gin Tyr Pro Lys Asp Ser 135 140 145
CAG CCT TTT AAA GCC ATA GGC GTT AAA GAG AGC GTT CTT TTT TTA GAA 536 Gin Pro Phe Lys Ala He Gly Val Lys Glu Ser Val Leu Phe Leu Glu 150 155 160
AAA CGA CTC ACT TTA AAG GAG CTA GAA GAA GCG ATT ACC TCT AAC ACC 584 Lys Arg Leu Thr Leu Lys Glu Leu Glu Glu Ala He Thr Ser Asn Thr 165 170 175
ATG AAA TTA GCC AAG CGC CAA AAC ACT TTC AAT AAA ACC CAA TTC AAT 632 Met Lys Leu Ala Lys Arg Gin Asn Thr Phe Asn Lys Thr Gin Phe Asn 180 185 190
AAC CTT TAT GTG GGG AGC GCT GAA GAA GTT AGG CAT GCG ATT TTA AAA 680 Asn Leu Tyr Val Gly Ser Ala Glu Glu Val Arg His Ala He Leu Lys 195 200 205 210
CAC TCA AAA AGC GGC ATT AAA GGA TAATCTAATG GATACACAAA ACTTACCCGA 734 His Ser Lys Ser Gly He Lys Gly 215
TCAAATTATC CCTATTTTTA TGA 757
(2) INFORMATION FOR SEQ ID NO: 230:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 218 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230:
Met Arg Val Ser Ser Lys Glu He Leu Leu He Val Gly Gly Ser Ser
1 5 10 15
Phe Tyr Leu Lys Ser He Leu Glu Gly Leu Ser Arg Met Pro Lys Leu
20 25 30
Ser Gly Glu Glu Val Val Lys He Glu Arg Glu He Ala Thr Leu Ser
35 40 45
Asn Pro Tyr He Phe Leu Lys Ser He Asp Pro Asn Met Ala Phe Lys
50 55 60
He His Pro Asn Asp Thr Tyr Arg Thr His Lys Ala Leu Glu He Phe 65 70 75 80
Tyr Ala Thr Cys Thr Pro Pro Ser Glu Tyr Phe Lys Ala Asn Pro Lys
85 90 95
Lys Pro Phe Glu His Ala He Ser Leu Phe Ala Leu Ser He Glu Lys
100 105 110
Ser Ala Leu His Asn Asn He Lys Arg Arg Thr Lys Asn Met Leu His
115 120 125
Ser Gly Leu Val Glu Glu He Lys Ala Leu Tyr Thr Gin Tyr Pro Lys 130 135 140 Asp Ser Gin Pro Phe Lys Ala He Gly Val Lys Glu Ser Val Leu Phe 145 150 155 160
Leu Glu Lys Arg Leu Thr Leu Lys Glu Leu Glu Glu Ala He Thr Ser
165 170 175
Asn Thr Met Lys Leu Ala Lys Arg Gin Asn Thr Phe Asn Lys Thr Gin
180 185 190
Phe Asn Asn Leu Tyr Val Gly Ser Ala Glu Glu Val Arg His Ala He
195 200 205
Leu Lys His Ser Lys Ser Gly He Lys Gly 210 215
(2) INFORMATION FOR SEQ ID NO: 231:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 454 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...401 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231:
CTATATGAAC AAAGCCTTAA AAGACTTGAA AAAAGGAATA ACTCATACCT ATG CGA 56
Met Arg 1
AAC AAT AAA ACG CCT TTT TTG AGC GCG ATT TTT ACG GCA TCA ATT AGG 104 Asn Asn Lys Thr Pro Phe Leu Ser Ala He Phe Thr Ala Ser He Arg 5 10 15
GGT TAC CAA CGC TTT TTT TCG GCT TTC ACC CCT TCA AGC TGC CGG TTT 152 Gly Tyr Gin Arg Phe Phe Ser Ala Phe Thr Pro Ser Ser Cys Arg Phe 20 25 30
TAC CCC ACT TGT TCC AAC TAC GCT CTG TGG TTG CTC TGT TTT GAA AGC 200 Tyr Pro Thr Cys Ser Asn Tyr Ala Leu Trp Leu Leu Cys Phe Glu Ser 35 40 45 50
CCT TTG AGC GCT ATG GGT AAG ATC GCT ATA AGG ATA CTC TCA TGC AAC 248 Pro Leu Ser Ala Met Gly Lys He Ala He Arg He Leu Ser Cys Asn 55 60 65
CCT TTT TGC TCT GGG GGC ATT GCT TAC CCT ACT ACT CGC TTG AAA CGC 296 Pro Phe Cys Ser Gly Gly He Ala Tyr Pro Thr Thr Arg Leu Lys Arg 70 75 80 CCA AGC CTG ATC CAA TCT CAT AAA GAT TCT AAT CGC AAT TTT AAA ACC 344 Pro Ser Leu He Gin Ser His Lys Asp Ser Asn Arg Asn Phe Lys Thr 85 90 95
ATC ACT TTT TGG CTC GTT CCC ACA AAA AGC CAC GCA ACT TAC TAC ATC 392 He Thr Phe Trp Leu Val Pro Thr Lys Ser His Ala Thr Tyr Tyr He 100 105 110
ATT AAG GTT TAATCACAAT GGATAAAAAC AACAATAATC TCCGCTTGAT TTTAGCGAT 450
He Lys Val
115
CGCT 454
(2) INFORMATION FOR SEQ ID NO:232:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 117 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232:
Met Arg Asn Asn Lys Thr Pro Phe Leu Ser Ala He Phe Thr Ala Ser
1 5 10 15
He Arg Gly Tyr Gin Arg Phe Phe Ser Ala Phe Thr Pro Ser Ser Cys
20 25 30
Arg Phe Tyr Pro Thr Cys Ser Asn Tyr Ala Leu Trp Leu Leu Cys Phe
35 40 45
Glu Ser Pro Leu Ser Ala Met Gly Lys He Ala He Arg He Leu Ser
50 55 60
Cys Asn Pro Phe Cys Ser Gly Gly He Ala Tyr Pro Thr Thr Arg Leu 65 70 75 80
Lys Arg Pro Ser Leu He Gin Ser His Lys Asp Ser Asn Arg Asn Phe
85 90 95
Lys Thr He Thr Phe Trp Leu Val Pro Thr Lys Ser His Ala Thr Tyr
100 105 110
Tyr He He Lys Val 115
(2) INFORMATION FOR SEQ ID NO: 233:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1153 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1100 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:233:
TTATAATAAG AAAGTTTTTA TTATTTTTAA TGCTATTTTA GGAGTTCATC ATG AAA 56
Met Lys
1
AAA TCC ATT TTA TTG GGC GTT TGC TTG GCT TTT TCT TGC GCT CAT GCC 104 Lys Ser He Leu Leu Gly Val Cys Leu Ala Phe Ser Cys Ala His Ala 5 10 15
CTA AAC GAT TTA GAA TTG ATC AAA AAA GCG AGG GAA AGC CAG CTA GAA 152 Leu Asn Asp Leu Glu Leu He Lys Lys Ala Arg Glu Ser Gin Leu Glu 20 25 30
CCC ATG CCT ATG GGC AAA GCG CTC AAA GAA TAC CAG ATT AAA AAG ACC 200 Pro Met Pro Met Gly Lys Ala Leu Lys Glu Tyr Gin He Lys Lys Thr 35 40 45 50
AGA GAT GTG GGT ATT GGC ACC AAA AAC AGC GAA ATC ATG ACC TCC GCT 248 Arg Asp Val Gly He Gly Thr Lys Asn Ser Glu He Met Thr Ser Ala 55 60 65
CAA GTG GAA TTA GGC AAA ATG CTC TAT TTT GAC CCT AGG ATT TCC ACT 296 Gin Val Glu Leu Gly Lys Met Leu Tyr Phe Asp Pro Arg He Ser Thr 70 75 80
TCC TAC CTC GTG TCT TGC AAC ACA TGC CAT AAT CTG GGC TTA GGC GGG 344 Ser Tyr Leu Val Ser Cys Asn Thr Cys His Asn Leu Gly Leu Gly Gly 85 90 95
GTG GAT TTA GTC CCA AGC GCC ATA GGC TCT CAA TGG AAG AAA AAC CCC 392 Val Asp Leu Val Pro Ser Ala He Gly Ser Gin Trp Lys Lys Asn Pro 100 105 110
CAC CTT TTA AGC TCC CCA ACG GTG TAT AAC TCT GTG TTT AAC GAT GTG 440 His Leu Leu Ser Ser Pro Thr Val Tyr Asn Ser Val Phe Asn Asp Val 115 120 125 130
CAG TTT TGG GAT GGC AGG GTT ACG CAT TTA AAC GAA CAG GCG CAA GGG 488 Gin Phe Trp Asp Gly Arg Val Thr His Leu Asn Glu Gin Ala Gin Gly 135 140 145
CCC ATC CAG TCT TCT TTT GAA ATG GGG GCT GAT CCC AAA GTG GTG GTA 536 Pro He Gin Ser Ser Phe Glu Met Gly Ala Asp Pro Lys Val Val Val 150 155 160
GAA AAA ATC AAT TCC ATG CCA GGC TAT GTC AAG CTC TTT AGA AAA GCC 584 Glu Lys He Asn Ser Met Pro Gly Tyr Val Lys Leu Phe Arg Lys Ala 165 170 175 TAT GGC TCT AAA GTC AAA ATT GAT TTT AAA TTG ATC GCT GAT AGT ATC 632 Tyr Gly Ser Lys Val Lys He Asp Phe Lys Leu He Ala Asp Ser He 180 185 190
GCT ATG TTT GAA GCC ACG CTT ATT ACC CCA AGC CGT TAC GAC GAT TTT 680 Ala Met Phe Glu Ala Thr Leu He Thr Pro Ser Arg Tyr Asp Asp Phe 195 200 205 210
TTA AGA GGC AAT CCT AAA GCG CTC AGC AAA GCC GAA AAA GAG GGG CTG 728 Leu Arg Gly Asn Pro Lys Ala Leu Ser Lys Ala Glu Lys Glu Gly Leu 215 220 225
AAT TTA TTC ATT TCT AAA GGC TGT GTG GCT TGC CAT AAC GGC ATT AAT 776 Asn Leu Phe He Ser Lys Gly Cys Val Ala Cys His Asn Gly He Asn 230 235 240
CTT GGG GGA ACG ATG CAG CCT TTT GGG GTG GTC AAA CCT TAT AAA TTC 824 Leu Gly Gly Thr Met Gin Pro Phe Gly Val Val Lys Pro Tyr Lys Phe 245 250 255
GCT AAT GTG GGC GAT TTC AAA GGC GAT AAA AAC GGG CTT GTG AAA GTG 872 Ala Asn Val Gly Asp Phe Lys Gly Asp Lys Asn Gly Leu Val Lys Val 260 265 270
CCT ACT TTA AGG AAT ATC ACC GAA ACG ATG CCC TAT TTC CAT AAC GGG 920 Pro Thr Leu Arg Asn He Thr Glu Thr Met Pro Tyr Phe His Asn Gly 275 280 285 290
CAA TTC TGG GAT GTT AAG GAT GCG ATT AAA GAA ATG GGC TCT ATC CAG 968 Gin Phe Trp Asp Val Lys Asp Ala He Lys Glu Met Gly Ser He Gin 295 300 305
TTA GGC ATT GAA ATC AGC GAT GAA GAA GCG AAA AAA ATT GAA ACT TTC 1016 Leu Gly He Glu He Ser Asp Glu Glu Ala Lys Lys He Glu Thr Phe 310 315 320
TTT GGA GCC TTA AGG GGT AAA AAA CCT AAA ATA ATC TAC CCA GAA CTC 1064 Phe Gly Ala Leu Arg Gly Lys Lys Pro Lys He He Tyr Pro Glu Leu 325 330 335
CCC ATA ATG ACA GAC AAA ACC CCT AAA CCC TCT TTT TGATTTAAAA AAGTCC 1116 Pro He Met Thr Asp Lys Thr Pro Lys Pro Ser Phe 340 345 350
TTTTAGGGGT CTTTGGCGCT AAATCTAAAA AATACTC 1153
(2) INFORMATION FOR SEQ ID NO: 234:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 350 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:234:
Met Lys Lys Ser He Leu Leu Gly Val Cys Leu Ala Phe Ser Cys Ala
1 5 10 15
His Ala Leu Asn Asp Leu Glu Leu He Lys Lys Ala Arg Glu Ser Gin
20 25 30
Leu Glu Pro Met Pro Met Gly Lys Ala Leu Lys Glu Tyr Gin He Lys
35 40 45
Lys Thr Arg Asp Val Gly He Gly Thr Lys Asn Ser Glu He Met Thr
50 55 60
Ser Ala Gin Val Glu Leu Gly Lys Met Leu Tyr Phe Asp Pro Arg He 65 70 75 80
Ser Thr Ser Tyr Leu Val Ser Cys Asn Thr Cys His Asn Leu Gly Leu
85 90 95
Gly Gly Val Asp Leu Val Pro Ser Ala He Gly Ser Gin Trp Lys Lys
100 105 110
Asn Pro His Leu Leu Ser Ser Pro Thr Val Tyr Asn Ser Val Phe Asn
115 120 125
Asp Val Gin Phe Trp Asp Gly Arg Val Thr His Leu Asn Glu Gin Ala
130 135 140
Gin Gly Pro He Gin Ser Ser Phe Glu Met Gly Ala Asp Pro Lys Val 145 150 155 160
Val Val Glu Lys He Asn Ser Met Pro Gly Tyr Val Lys Leu Phe Arg
165 170 175
Lys Ala Tyr Gly Ser Lys Val Lys He Asp Phe Lys Leu He Ala Asp
180 185 190
Ser He Ala Met Phe Glu Ala Thr Leu He Thr Pro Ser Arg Tyr Asp
195 200 205
Asp Phe Leu Arg Gly Asn Pro Lys Ala Leu Ser Lys Ala Glu Lys Glu
210 215 220
Gly Leu Asn Leu Phe He Ser Lys Gly Cys Val Ala Cys His Asn Gly 225 230 235 240
He Asn Leu Gly Gly Thr Met Gin Pro Phe Gly Val Val Lys Pro Tyr
245 250 255
Lys Phe Ala Asn Val Gly Asp Phe Lys Gly Asp Lys Asn Gly Leu Val
260 265 270
Lys Val Pro Thr Leu Arg Asn He Thr Glu Thr Met Pro Tyr Phe His
275 280 285
Asn Gly Gin Phe Trp Asp Val Lys Asp Ala He Lys Glu Met Gly Ser
290 295 300
He Gin Leu Gly He Glu He Ser Asp Glu Glu Ala Lys Lys He Glu 305 310 315 320
Thr Phe Phe Gly Ala Leu Arg Gly Lys Lys Pro Lys He He Tyr Pro
325 330 335
Glu Leu Pro He Met Thr Asp Lys Thr Pro Lys Pro Ser Phe 340 345 350
(2) INFORMATION FOR SEQ ID NO: 235:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 426 base pairs
(B) TYPE: nucleic acid
( C) STRANDEDNESS : s ingle (D) TOPOLOGY : linear
(ii) MOLECULE ' TYPE: Genomic DNA
(ix) FEATURE :
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...374 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235:
GTTAGAGATT TCTCCCAATT CTCAAGTGGG AGCGAGCGTG AAAATCCGCT ATG AAA 56
Met Lys 1
GCA ATC TTT AGC CTC TTT TTC CTT CTT ATT GTT TTA AAA GCA AAC CCC 104 Ala He Phe Ser Leu Phe Phe Leu Leu He Val Leu Lys Ala Asn Pro 5 10 15
ATA AAC CCT TTA TTA GAG CCG TTA TAT TTC CCC AGT TAC GCG CAA TTT 152 He Asn Pro Leu Leu Glu Pro Leu Tyr Phe Pro Ser Tyr Ala Gin Phe 20 25 30
TTA AAC TTA GCA CCT CAC TTT GTC ATT AAA AAA AAG CGC GCT TAT AGA 200 Leu Asn Leu Ala Pro His Phe Val He Lys Lys Lys Arg Ala Tyr Arg 35 40 45 50
CCC TTT CAA TGG GGG AAT ACC ATT ATC ATC AAA CGC CAT GAT TTA GAA 248 Pro Phe Gin Trp Gly Asn Thr He He He Lys Arg His Asp Leu Glu 55 60 65
GAA CGC CAA AGC AAC CAG CCA AGC GAT ATT TTC CGC CAA AAC GCT GAA 296 Glu Arg Gin Ser Asn Gin Pro Ser Asp He Phe Arg Gin Asn Ala Glu 70 75 80
ATC AAT GTG TCT TCT CAA ACT TTT TTA AAA GGA ATG AGC AAC GCT TCT 344 He Asn Val Ser Ser Gin Thr Phe Leu Lys Gly Met Ser Asn Ala Ser 85 90 95
TCA CGA ACA GTG CTT GAT TCA GCC GCT CAG TAAAATGCTA AAACTTTTTT TAA 397 Ser Arg Thr Val Leu Asp Ser Ala Ala Gin 100 105
TCACATTTTT CTTGGTATTT TCTTAATCC 426
(2) INFORMATION FOR SEQ ID NO: 236:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 108 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236:
Met Lys Ala He Phe Ser Leu Phe Phe Leu Leu He Val Leu Lys Ala
1 5 10 15
Asn Pro He Asn Pro Leu Leu Glu Pro Leu Tyr Phe Pro Ser Tyr Ala
20 25 30
Gin Phe Leu Asn Leu Ala Pro His Phe Val He Lys Lys Lys Arg Ala
35 40 45
Tyr Arg Pro Phe Gin Trp Gly Asn Thr He He He Lys Arg His Asp
50 55 60
Leu Glu Glu Arg Gin Ser Asn Gin Pro Ser Asp He Phe Arg Gin Asn 65 70 75 80
Ala Glu He Asn Val Ser Ser Gin Thr Phe Leu Lys Gly Met Ser Asn
85 90 95
Ala Ser Ser Arg Thr Val Leu Asp Ser Ala Ala Gin 100 105
(2) INFORMATION FOR SEQ ID NO: 237:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 799 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...746 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:237:
AGCGTGAGCC CAATTACGCC ACGCCTATGT CTAGATAGGG GAGGTTGGAA ATG TTT 56
Met Phe
1
AGT TTT TTA GAA AAA AAC CCG TTC TTT TTC ACT CTT GCG TTT ATT TTT 104 Ser Phe Leu Glu Lys Asn Pro Phe Phe Phe Thr Leu Ala Phe He Phe 5 10 15
GTG TTT GCG ATC GCG GGC TTG GTG GAG ATT TTG CCC AAC TTC TTC AAA 152 Val Phe Ala He Ala Gly Leu Val Glu He Leu Pro Asn Phe Phe Lys 20 25 30
TCC GCT CGC CCG ATT GAA GGC TTA CGG CCT TAT ACG GTT TTA GAG ACA 200 Ser Ala Arg Pro He Glu Gly Leu Arg Pro Tyr Thr Val Leu Glu Thr 35 40 45 50 GCG GGG AGG CAA ATT TAT ATC CAA GAA GGT TGC TAT CAT TGC CAT TCC 248 Ala Gly Arg Gin He Tyr He Gin Glu Gly Cys Tyr His Cys His Ser 55 60 65
CAG CTT ATT CGC CCT TTC CAA GCT GAG GTG GAT CGA TAT GGC GCG TAT 296 Gin Leu He Arg Pro Phe Gin Ala Glu Val Asp Arg Tyr Gly Ala Tyr 70 75 80
AGT TTG AGT GGG GAA TAC GCG TAT GAC AGG CCA TTT TTG TGG GGT TCT 344 Ser Leu Ser Gly Glu Tyr Ala Tyr Asp Arg Pro Phe Leu Trp Gly Ser 85 90 95
AAA AGG ATT GGC CCT GAT TTG CAC AGG GTA GGG GAT TAT CGC ACA ACC 392 Lys Arg He Gly Pro Asp Leu His Arg Val Gly Asp Tyr Arg Thr Thr 100 105 110
GAT TGG CAT GAA AAG CAC ATG TTT GAT CCT AAA AGC GTT GTG CCG CAC 440 Asp Trp His Glu Lys His Met Phe Asp Pro Lys Ser Val Val Pro His 115 120 125 130
AGC ATC ATG CCC GCC TAT AAG CAT TTA TTT ACA AAA AAG AGC GAT TTT 488 Ser He Met Pro Ala Tyr Lys His Leu Phe Thr Lys Lys Ser Asp Phe 135 140 145
GAC ACC GCT TAT GCA GAA GCT TTG ACG CAA AAA AAG GTT TTT GGC GTG 536 Asp Thr Ala Tyr Ala Glu Ala Leu Thr Gin Lys Lys Val Phe Gly Val 150 155 160
CCT TAT GAC ACA GAA AAC GGC GTG AAA TTA GGG AGC GTA GAA GAA GCG 584 Pro Tyr Asp Thr Glu Asn Gly Val Lys Leu Gly Ser Val Glu Glu Ala 165 170 175
AAA AAA GCC TAT TTA GAA GAA GCT AAA AAA ATC ACA GCC GAT ATG AAA 632 Lys Lys Ala Tyr Leu Glu Glu Ala Lys Lys He Thr Ala Asp Met Lys 180 185 190
GAC AAG AGG GTG CTA GAA GCG ATT GAG AGA GGT GAA GTG TTA GAA ATT 680 Asp Lys Arg Val Leu Glu Ala He Glu Arg Gly Glu Val Leu Glu He 195 200 205 210
GTG GCT TTG ATC GCT TAT TTG AAT AGC TTG GGT AAT TCC AGG ATC AAC 728 Val Ala Leu He Ala Tyr Leu Asn Ser Leu Gly Asn Ser Arg He Asn 215 220 225
GCC AAT CAA AAC GCT AAA TAAGGGGTGA ATGATGGATT TAGAAAGTTT GAGAGGTT 784 Ala Asn Gin Asn Ala Lys 230
TTGCGTATGC GTTTT 799
(2) INFORMATION FOR SEQ ID NO: 238:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 232 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:238:
Met Phe Ser Phe Leu Glu Lys Asn Pro Phe Phe Phe Thr Leu Ala Phe
1 5 10 15
He Phe Val Phe Ala He Ala Gly Leu Val Glu He Leu Pro Asn Phe
20 25 30
Phe Lys Ser Ala Arg Pro He Glu Gly Leu Arg Pro Tyr Thr Val Leu
35 40 45
Glu Thr Ala Gly Arg Gin He Tyr He Gin Glu Gly Cys Tyr His Cys
50 55 60
His Ser Gin Leu He Arg Pro Phe Gin Ala Glu Val Asp Arg Tyr Gly 65 70 75 80
Ala Tyr Ser Leu Ser Gly Glu Tyr Ala Tyr Asp Arg Pro Phe Leu Trp
85 90 95
Gly Ser Lys Arg He Gly Pro Asp Leu His Arg Val Gly Asp Tyr Arg
100 105 110
Thr Thr Asp Trp His Glu Lys His Met Phe Asp Pro Lys Ser Val Val
115 120 125
Pro His Ser He Met Pro Ala Tyr Lys His Leu Phe Thr Lys Lys Ser
130 135 140
Asp Phe Asp Thr Ala Tyr Ala Glu Ala Leu Thr Gin Lys Lys Val Phe 145 150 155 160
Gly Val Pro Tyr Asp Thr Glu Asn Gly Val Lys Leu Gly Ser Val Glu
165 170 175
Glu Ala Lys Lys Ala Tyr Leu Glu Glu Ala Lys Lys He Thr Ala Asp
180 185 190
Met Lys Asp Lys Arg Val Leu Glu Ala He Glu Arg Gly Glu Val Leu
195 200 205
Glu He Val Ala Leu He Ala Tyr Leu Asn Ser Leu Gly Asn Ser Arg
210 215 220
He Asn Ala Asn Gin Asn Ala Lys 225 230
(2) INFORMATION FOR SEQ ID NO: 239:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 322 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...269 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239:
CTTGGGTAAT TCCAGGATCA ACGCCAATCA AAACGCTAAA TAAGGGGTGA ATG ATG 56
Met Met
1
GAT TTA GAA AGT TTG AGA GGT TTT GCG TAT GCG TTT TTT ACC ATT CTT 104 Asp Leu Glu Ser Leu Arg Gly Phe Ala Tyr Ala Phe Phe Thr He Leu 5 10 15
TTT ACG CTC TTT TTG TAT GCC TAT ATT TTT AGC ATG TAT AGA AAG CAA 152 Phe Thr Leu Phe Leu Tyr Ala Tyr He Phe Ser Met Tyr Arg Lys Gin 20 25 30
AAA AAA GGC ATT ATG GAT TAT GAG CGA TAC GGA TAC TTA GCG TTA AAT 200 Lys Lys Gly He Met Asp Tyr Glu Arg Tyr Gly Tyr Leu Ala Leu Asn 35 40 45 50
GAT GCT TTA GAA GAC GAG TTG ATT GAA CCA CGC CAT AAA AAA GTT CAT 248 Asp Ala Leu Glu Asp Glu Leu He Glu Pro Arg His Lys Lys Val His 55 60 65
GAT AAT GGC ATA AAG GAA AGT TGAAATGGAT TTTTTAAACG ACCATATAAA TGTT 303 Asp Asn Gly He Lys Glu Ser 70
TTTGGCTTGA TTGCAGCGC 322
(2) INFORMATION FOR SEQ ID NO: 240:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 73 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240:
Met Met Asp Leu Glu Ser Leu Arg Gly Phe Ala Tyr Ala Phe Phe Thr
1 5 10 15
He Leu Phe Thr Leu Phe Leu Tyr Ala Tyr He Phe Ser Met Tyr Arg
20 25 30
Lys Gin Lys Lys Gly He Met Asp Tyr Glu Arg Tyr Gly Tyr Leu Ala
35 40 45
Leu Asn Asp Ala Leu Glu Asp Glu Leu He Glu Pro Arg His Lys Lys
50 55 60
Val His Asp Asn Gly He Lys Glu Ser 65 70
(2) INFORMATION FOR SEQ ID NO: 241: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1021 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...968 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241:
TTACTGATTT TTCTTTGTGT GAGCTTTGGC TTAGTTTTGT AAGGAATGAG ATG ATA 56
Met He 1
AAG AGT TGG ACT AAA AAG TGG TTT TTG ATT TTA TTT TTA ATG GCA AGT 104 Lys Ser Trp Thr Lys Lys Trp Phe Leu He Leu Phe Leu Met Ala Ser 5 10 15
TGT TCC AGT TAT TTG GTG GCT ACA ACC GGT GAG AAA TAT TTT AAA ATG 152 Cys Ser Ser Tyr Leu Val Ala Thr Thr Gly Glu Lys Tyr Phe Lys Met 20 25 30
GCT ACT CAA GCC TTT AAG AGA GGG GAC TAC CAT AAA GCG GTG GCT TTT 200 Ala Thr Gin Ala Phe Lys Arg Gly Asp Tyr His Lys Ala Val Ala Phe 35 40 45 50
TAT AAG AGG AGC TGT AAT TTA AGG GTG GGG GTT GGT TGC ACG AGT TTA 248 Tyr Lys Arg Ser Cys Asn Leu Arg Val Gly Val Gly Cys Thr Ser Leu 55 60 65
GGC TCT ATG TAT GAA GAT GGC GAT GGC GTG GAT CAG AAT ATT ACA AAA 296 Gly Ser Met Tyr Glu Asp Gly Asp Gly Val Asp Gin Asn He Thr Lys 70 75 80
GCC GTT TTT TAT TAC AGA AGA GGG TGT AAT TTA AGG AAT CAT CTC GCT 344 Ala Val Phe Tyr Tyr Arg Arg Gly Cys Asn Leu Arg Asn His Leu Ala 85 90 95
TGC GCG AGT CTA GGC TCT ATG TAT GAA GAT GGC GAT GGT GTG CAA AAA 392 Cys Ala Ser Leu Gly Ser Met Tyr Glu Asp Gly Asp Gly Val Gin Lys 100 105 110
AAC CTT CCA AAG GCT ATC TAT TAT TAC AGG AGA GGG TGC CAC TTA AAG 440 Asn Leu Pro Lys Ala He Tyr Tyr Tyr Arg Arg Gly Cys His Leu Lys 115 120 125 130
GGT GGG GTG AGC TGT GGG AGT TTA GGT TTT ATG TAT TTT AAT GGC ACG 488 Gly Gly Val Ser Cys Gly Ser Leu Gly Phe Met Tyr Phe Asn Gly Thr 135 140 145
GGC GTT AAG CAA AAT TAT GCC AAA GCC CTT TTT CTT TCT AAA TAC GCT 536 Gly Val Lys Gin Asn Tyr Ala Lys Ala Leu Phe Leu Ser Lys Tyr Ala 150 155 160
TGC AGT TTG AAT TAC GGC ATT AGT TGT AAC TTT GTA GGG TAT ATG TAT 584 Cys Ser Leu Asn Tyr Gly He Ser Cys Asn Phe Val Gly Tyr Met Tyr 165 170 175
AGG AAC GCC AAA GGC GTA CAG AAG GAT TTG AAA AAA GCC CTT GCG AAT 632 Arg Asn Ala Lys Gly Val Gin Lys Asp Leu Lys Lys Ala Leu Ala Asn 180 185 190
TTT AAA AGA GGG TGC CAT TTG AAA GAC GGA GCG AGT TGT GTG AGC TTG 680 Phe Lys Arg Gly Cys His Leu Lys Asp Gly Ala Ser Cys Val Ser Leu 195 200 205 210
GGA TAC ATG TAT GAA GTC GGT ATG GAT GTC AAA CAA AAT GGA GAG CAA 728 Gly Tyr Met Tyr Glu Val Gly Met Asp Val Lys Gin Asn Gly Glu Gin 215 220 225
GCC TTG AAT CTT TAT AAA AAG GGT TGT TAT TTA AAA AGG GGG AGC GGT 776 Ala Leu Asn Leu Tyr Lys Lys Gly Cys Tyr Leu Lys Arg Gly Ser Gly 230 235 240
TGT CAT AAT GTG GCG GTG ATG TAT TAC ACC GGT AAG GGC GTT CCA AAG 824 Cys His Asn Val Ala Val Met Tyr Tyr Thr Gly Lys Gly Val Pro Lys 245 250 255
GAT TTA GAT AAA GCC ATT TCG TAT TAT AAG AAA GGT TGC ACT CTA GGC 872 Asp Leu Asp Lys Ala He Ser Tyr Tyr Lys Lys Gly Cys Thr Leu Gly 260 265 270
TTT AGT GGT AGC TGT AAA GTG TTA GAA GAA GTG ATT GGC AAG AAG TCT 920 Phe Ser Gly Ser Cys Lys Val Leu Glu Glu Val He Gly Lys Lys Ser 275 280 285 290
GAT GAT TTG CAA GAT GAC GCG CAA AAC GAC ACG CAA GAT GAT ATG CAA T 969 Asp Asp Leu Gin Asp Asp Ala Gin Asn Asp Thr Gin Asp Asp Met Gin 295 300 305
AAGTTAAAGC TTATGGACTA ATGATTAAAA CTCATCTTAT AGAAATCTTT CT 1021
(2) INFORMATION FOR SEQ ID NO: 242:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 306 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:242:
Met He Lys Ser Trp Thr Lys Lys Trp Phe Leu He Leu Phe Leu Met
1 5 10 15
Ala Ser Cys Ser Ser Tyr Leu Val Ala Thr Thr Gly Glu Lys Tyr Phe
20 25 30
Lys Met Ala Thr Gin Ala Phe Lys Arg Gly Asp Tyr His Lys Ala Val
35 40 45
Ala Phe Tyr Lys Arg Ser Cys Asn Leu Arg Val Gly Val Gly Cys Thr
50 55 60
Ser Leu Gly Ser Met Tyr Glu Asp Gly Asp Gly Val Asp Gin Asn He 65 70 75 80
Thr Lys Ala Val Phe Tyr Tyr Arg Arg Gly Cys Asn Leu Arg Asn His
85 90 95
Leu Ala Cys Ala Ser Leu Gly Ser Met Tyr Glu Asp Gly Asp Gly Val
100 105 110
Gin Lys Asn Leu Pro Lys Ala He Tyr Tyr Tyr Arg Arg Gly Cys His
115 120 125
Leu Lys Gly Gly Val Ser Cys Gly Ser Leu Gly Phe Met Tyr Phe Asn
130 135 140
Gly Thr Gly Val Lys Gin Asn Tyr Ala Lys Ala Leu Phe Leu Ser Lys 145 150 155 160
Tyr Ala Cys Ser Leu Asn Tyr Gly He Ser Cys Asn Phe Val Gly Tyr
165 170 175
Met Tyr Arg Asn Ala Lys Gly Val Gin Lys Asp Leu Lys Lys Ala Leu
180 185 190
Ala Asn Phe Lys Arg Gly Cys His Leu Lys Asp Gly Ala Ser Cys Val
195 200 205
Ser Leu Gly Tyr Met Tyr Glu Val Gly Met Asp Val Lys Gin Asn Gly
210 215 220
Glu Gin Ala Leu Asn Leu Tyr Lys Lys Gly Cys Tyr Leu Lys Arg Gly 225 230 235 240
Ser Gly Cys His Asn Val Ala Val Met Tyr Tyr Thr Gly Lys Gly Val
245 250 255
Pro Lys Asp Leu Asp Lys Ala He Ser Tyr Tyr Lys Lys Gly Cys Thr
260 265 270
Leu Gly Phe Ser Gly Ser Cys Lys Val Leu Glu Glu Val He Gly Lys
275 280 285
Lys Ser Asp Asp Leu Gin Asp Asp Ala Gin Asn Asp Thr Gin Asp Asp
290 295 300
Met Gin 305
(2) INFORMATION FOR SEQ ID NO:243:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1000 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence (B) LOCATION: 51...947 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243:
CTATAATGTG AATTTAATGA TGAAAATTAG TTTAGAGTGG AGAACACACA ATG AAA 56
Met Lys
1
AAA AAT ATC TTA AAT TTA GCG TTA GTG GGT GCG TTG AGC ACG TCG TTT 104 Lys Asn He Leu Asn Leu Ala Leu Val Gly Ala Leu Ser Thr Ser Phe 5 10 15
TTG ATG GCT AAG CCG GCT CAT AAC GCA AAT AAC GCT ACG CAT AAC ACG 152 Leu Met Ala Lys Pro Ala His Asn Ala Asn Asn Ala Thr His Asn Thr 20 25 30
AAA AAA ACG ACT GAT TCT TCA GCA GGC GTG TTA GCG ACA GTG GAT GGC 200 Lys Lys Thr Thr Asp Ser Ser Ala Gly Val Leu Ala Thr Val Asp Gly 35 40 45 50
AGA CCT ATC ACT AAA AGC GAT TTT GAC ATG ATT AAG CAA CGA AAT CCT 248 Arg Pro He Thr Lys Ser Asp Phe Asp Met He Lys Gin Arg Asn Pro 55 60 65
AAT TTT GAT TTT GAC AAG CTT AAA GAG AAA GAA AAA GAA GCC TTG ATT 296 Asn Phe Asp Phe Asp Lys Leu Lys Glu Lys Glu Lys Glu Ala Leu He 70 75 80
GAT CAA GCT ATT CGC ACC GCC CTT GTA GAA AAT GAA GCT AAA ACC GAG 344 Asp Gin Ala He Arg Thr Ala Leu Val Glu Asn Glu Ala Lys Thr Glu 85 90 95
AAA TTG GAC AGC ACT CCA GAA TTT AAA GCG ATG ATG GAA GCG GTT AAA 392 Lys Leu Asp Ser Thr Pro Glu Phe Lys Ala Met Met Glu Ala Val Lys 100 105 110
AAA CAG GCT TTA GTG GAA TTT TGG GCT AAA AAA CAG GCT GAA GAA GTG 440 Lys Gin Ala Leu Val Glu Phe Trp Ala Lys Lys Gin Ala Glu Glu Val 115 120 125 130
AAA AAA GTC CAA ATC CCA GAA AAA GAA ATG CAA GAT TTT TAC AAC GCT 488 Lys Lys Val Gin He Pro Glu Lys Glu Met Gin Asp Phe Tyr Asn Ala 135 140 145
AAC AAA GAT CAG CTT TTT GTC AAG CAA GAA GCC CAT GCT AGG CAT ATT 536 Asn Lys Asp Gin Leu Phe Val Lys Gin Glu Ala His Ala Arg His He 150 155 160
TTA GTG AAA ACC GAA GAT GAG GCT AAA CGG ATT ATT TCT GAG ATT GAC 584 Leu Val Lys Thr Glu Asp Glu Ala Lys Arg He He Ser Glu He Asp 165 170 175 AAA CAG CCA AAG GCT AAA AAA GAA GCT AAA TTC ATT GAG TTA GCC AAT 632 Lys Gin Pro Lys Ala Lys Lys Glu Ala Lys Phe He Glu Leu Ala Asn 180 185 190
CGG GAT ACG ATT GAT CCT AAC AGC AAG AAC GCG CAA AAT GGC GGT GAT 680 Arg Asp Thr He Asp Pro Asn Ser Lys Asn Ala Gin Asn Gly Gly Asp 195 200 205 210
TTG GGG AAA TTC CAA AAG AAC CAA ATG GCT CCG GAT TTT TCT AAA GCC 728 Leu Gly Lys Phe Gin Lys Asn Gin Met Ala Pro Asp Phe Ser Lys Ala 215 220 225
GCT TTC GCT TTA ACT CCT GGG GAT TAC ACT AAA ACC CCT GTT AAA ACA 776 Ala Phe Ala Leu Thr Pro Gly Asp Tyr Thr Lys Thr Pro Val Lys Thr 230 235 240
GAG TTT GGT TAT CAT ATT ATC TAT TTG ATT TCT AAA GAT AGC CCT GTA 824 Glu Phe Gly Tyr His He He Tyr Leu He Ser Lys Asp Ser Pro Val 245 250 255
ACT TAT ACT TAT GAA CAG GCT AAA CCT ACC ATT AAG GGG ATG TTA CAA 872 Thr Tyr Thr Tyr Glu Gin Ala Lys Pro Thr He Lys Gly Met Leu Gin 260 265 270
GAA AAG CTT TTC CAA GAA CGC ATG AAT CAA CGC ATT GAG GAA CTA AGA 920 Glu Lys Leu Phe Gin Glu Arg Met Asn Gin Arg He Glu Glu Leu Arg 275 280 285 290
AAG CAC GCT AAA ATT GTT ATC AAC AAG TAATTGATGA GGTGTTATCA TGTTAGT 974 Lys His Ala Lys He Val He Asn Lys 295
TAAAGGCAAT GAAATTTTAT TGAAAG 1000
(2) INFORMATION FOR SEQ ID N0:244:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 299 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244:
Met Lys Lys Asn He Leu Asn Leu Ala Leu Val Gly Ala Leu Ser Thr
1 5 10 15
Ser Phe Leu Met Ala Lys Pro Ala His Asn Ala Asn Asn Ala Thr His
20 25 30
Asn Thr Lys Lys Thr Thr Asp Ser Ser Ala Gly Val Leu Ala Thr Val
35 40 45
Asp Gly Arg Pro He Thr Lys Ser Asp Phe Asp Met He Lys Gin Arg 50 55 60 Asn Pro Asn Phe Asp Phe Asp Lys Leu Lys Glu Lys Glu Lys Glu Ala 65 70 75 80
Leu He Asp Gin Ala He Arg Thr Ala Leu Val Glu Asn Glu Ala Lys
85 90 95
Thr Glu Lys Leu Asp Ser Thr Pro Glu Phe Lys Ala Met Met Glu Ala
100 105 110
Val Lys Lys Gin Ala Leu Val Glu Phe Trp Ala Lys Lys Gin Ala Glu
115 120 125
Glu Val Lys Lys Val Gin He Pro Glu Lys Glu Met Gin Asp Phe Tyr
130 135 140
Asn Ala Asn Lys Asp Gin Leu Phe Val Lys Gin Glu Ala His Ala Arg 145 150 155 160
His He Leu Val Lys Thr Glu Asp Glu Ala Lys Arg He He Ser Glu
165 170 175
He Asp Lys Gin Pro Lys Ala Lys Lys Glu Ala Lys Phe He Glu Leu
180 185 190
Ala Asn Arg Asp Thr He Asp Pro Asn Ser Lys Asn Ala Gin Asn Gly
195 200 205
Gly Asp Leu Gly Lys Phe Gin Lys Asn Gin Met Ala Pro Asp Phe Ser
210 215 220
Lys Ala Ala Phe Ala Leu Thr Pro Gly Asp Tyr Thr Lys Thr Pro Val 225 230 235 240
Lys Thr Glu Phe Gly Tyr His He He Tyr Leu He Ser Lys Asp Ser
245 250 255
Pro Val Thr Tyr Thr Tyr Glu Gin Ala Lys Pro Thr He Lys Gly Met
260 265 270
Leu Gin Glu Lys Leu Phe Gin Glu Arg Met Asn Gin Arg He Glu Glu
275 280 285
Leu Arg Lys His Ala Lys He Val He Asn Lys 290 295
(2) INFORMATION FOR SEQ ID NO: 245:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 376 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...323 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245:
CAGCGTTGGT GTATTTTGGA GGAAGTTAGG AAAATATTGA AGGAGTATTG ATG AAA 56
Met Lys
1 AAG GTT GTT TTT TTA TTG TTA GTT ATA CTA GGG GGT TTA GAA GCG CAA 104 Lys Val Val Phe Leu Leu Leu Val He Leu Gly Gly Leu Glu Ala Gin 5 10 15
AGT ACT TAT TGC AGT GAT CAT TGC GAA GGC ACG CCA GAT AGC CGT ATC 152 Ser Thr Tyr Cys Ser Asp His Cys Glu Gly Thr Pro Asp Ser Arg He 20 25 30
CCT CCT ATG GGG TTT CAT TTC AGT TTT GTG CAT TCA GTG AAA TAT TAC 200 Pro Pro Met Gly Phe His Phe Ser Phe Val His Ser Val Lys Tyr Tyr 35 40 45 50
TTG CAA GAT CCG CAA GAG CGC GAT CAC AAG CTT GAA AAA TGC CAT CAA 248 Leu Gin Asp Pro Gin Glu Arg Asp His Lys Leu Glu Lys Cys His Gin 55 60 65
GCC TTT GAT TCG ACT CTT AAG GTT AAT TTT ATT ACG AAT CTT TTA AAA 296 Ala Phe Asp Ser Thr Leu Lys Val Asn Phe He Thr Asn Leu Leu Lys 70 75 80
AGG ATT GCA AGC ATG CGC AAA TGG CTT TAGAGCAAGC CCAAAAAGGG ACTCCAT 350 Arg He Ala Ser Met Arg Lys Trp Leu 85 90
AAAAGGGGTT TCTTTAGGGA TTTTAT 376
(2) INFORMATION FOR SEQ ID NO: 246:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 91 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246:
Met Lys Lys Val Val Phe Leu Leu Leu Val He Leu Gly Gly Leu Glu
1 5 10 15
Ala Gin Ser Thr Tyr Cys Ser Asp His Cys Glu Gly Thr Pro Asp Ser
20 25 30
Arg He Pro Pro Met Gly Phe His Phe Ser Phe Val His Ser Val Lys
35 40 45
Tyr Tyr Leu Gin Asp Pro Gin Glu Arg Asp His Lys Leu Glu Lys Cys
50 55 60
His Gin Ala Phe Asp Ser Thr Leu Lys Val Asn Phe He Thr Asn Leu
65 70 75 80
Leu Lys Arg He Ala Ser Met Arg Lys Trp Leu 85 90
(2) INFORMATION FOR SEQ ID NO: 247:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1180 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1127 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:247:
TTTTTAGTTA TAATGGCGGA CACCATTAAA ATTAAAACAA AGGTTATTCA ATG AAG 56
Met Lys
1
GTA TTA TCT TAT TTG AAA AAT TTT TAT CTT TTT TTA GCG ATA GGA GCA 104 Val Leu Ser Tyr Leu Lys Asn Phe Tyr Leu Phe Leu Ala He Gly Ala 5 10 15
ATT ATG CAA GCG AGT GAA AAC ATG GGA TCT CAA CAC CAA AAA ACC GAT 152 He Met Gin Ala Ser Glu Asn Met Gly Ser Gin His Gin Lys Thr Asp 20 25 30
GAA AGA GTG ATT TAC TTG GCT GGG GGG TGT TTT TGG GGG CTA GAG GCG 200 Glu Arg Val He Tyr Leu Ala Gly Gly Cys Phe Trp Gly Leu Glu Ala 35 40 45 50
TAT ATG GAG AGG ATT TAT GGC GTC ATA GAC GCA AGC TCT GGT TAC GCT 248 Tyr Met Glu Arg He Tyr Gly Val He Asp Ala Ser Ser Gly Tyr Ala 55 60 65
AAC GGC AAG ACT TCA AGC ACG AAT TAT GAG AAA TTG CAT GAA AGT GAT 296 Asn Gly Lys Thr Ser Ser Thr Asn Tyr Glu Lys Leu His Glu Ser Asp 70 75 80
CAT GCT GAA AGC GTG AAA GTC ATT TAT GAT CCT AAA AAA ATC AGT TTG 344 His Ala Glu Ser Val Lys Val He Tyr Asp Pro Lys Lys He Ser Leu 85 90 95
GAC AAA TTG TTG CGT TAC TAT TTT AAG GTG GTT GAT CCG GTG AGC GTG 392 Asp Lys Leu Leu Arg Tyr Tyr Phe Lys Val Val Asp Pro Val Ser Val 100 105 110
AAC AAG CAG GGT AAT GAT GTG GGC AGG CAG TAT CGC ACG GGG ATT TAT 440 Asn Lys Gin Gly Asn Asp Val Gly Arg Gin Tyr Arg Thr Gly He Tyr 115 120 125 130
TAT GTC AAT AGC GCG GAT AAA GAA GTG ATA GAT CAT GCC TTA AAA GCG 488 Tyr Val Asn Ser Ala Asp Lys Glu Val He Asp His Ala Leu Lys Ala 135 140 145
TTA CAG AAA GAA GTG AAA GGT AAA ATC GCT ATT GAA GTA GAG CCT TTA 536 Leu Gin Lys Glu Val Lys Gly Lys He Ala He Glu Val Glu Pro Leu 150 155 160
AAA AAT TAT GTG AGG GCT GAA GAG TAT CAT CAG GAT TAT TTG AAG AAA 584 Lys Asn Tyr Val Arg Ala Glu Glu Tyr His Gin Asp Tyr Leu Lys Lys 165 170 175
CAC CCT AGT GGT TAT TGC CAT ATT GAT TTG AAA AAG GCG GAT GAA GTG 632 His Pro Ser Gly Tyr Cys His He Asp Leu Lys Lys Ala Asp Glu Val 180 185 190
ATT GTG GAT GAC GAT AAA TAC ACC AAA CCT AGC GAT GAA GTT TTA AAG 680 He Val Asp Asp Asp Lys Tyr Thr Lys Pro Ser Asp Glu Val Leu Lys 195 200 205 210
AAA AAA CTC ACC AAA CTC CAG TAT GAG GTT ACG CAA AAC AAA CAC ACT 728 Lys Lys Leu Thr Lys Leu Gin Tyr Glu Val Thr Gin Asn Lys His Thr 215 220 225
GAG AAA CCC TTT GAA AAC GAG TAT TAC AAC AAA GAA GAA GAG GGC ATT 776 Glu Lys Pro Phe Glu Asn Glu Tyr Tyr Asn Lys Glu Glu Glu Gly He 230 235 240
TAT GTG GAT ATT ACC ACA GGC GAG CCG TTA TTT TCT TCA GCG GAT AAA 824 Tyr Val Asp He Thr Thr Gly Glu Pro Leu Phe Ser Ser Ala Asp Lys 245 250 255
TAC GAC TCC GGT TGC GGG TGG CCA AGC TTT TCT AAG CCT ATC AAT AAA 872 Tyr Asp Ser Gly Cys Gly Trp Pro Ser Phe Ser Lys Pro He Asn Lys 260 265 270
GAT GTG GTG AAA TAC GAA GAC GAT GAG AGC CTT AAT AGG AAA CGC ATT 920 Asp Val Val Lys Tyr Glu Asp Asp Glu Ser Leu Asn Arg Lys Arg He 275 280 285 290
GAA GTG TTG AGC CGT ATT GGT AAG GCG CAT TTA GGG CAT GTG TTT AAC 968 Glu Val Leu Ser Arg He Gly Lys Ala His Leu Gly His Val Phe Asn 295 300 305
GAT GGG CCT AAA GAA TTA GGG GGC TTA AGG TAT TGC ATC AAC AGC GCG 1016 Asp Gly Pro Lys Glu Leu Gly Gly Leu Arg Tyr Cys He Asn Ser Ala 310 315 320
GCT TTA AGG TTT ATC CCC TTA AAA GAC ATG GAA AAA GAG GGT TAT GGC 1064 Ala Leu Arg Phe He Pro Leu Lys Asp Met Glu Lys Glu Gly Tyr Gly 325 330 335
GAG TTT ATC CCT TAT ATC AAA AAG GGT GAA TTG AAA AAA TAC ATC AAT 1112 Glu Phe He Pro Tyr He Lys Lys Gly Glu Leu Lys Lys Tyr He Asn 340 345 350
GAT AAA AAG TCG CAT TAAGGGGTAA TGACTAAGCC CCCTAAGGGG GGTTAAAATG A 1168 Asp Lys Lys Ser His 355
GGGGTTTAAG CG 1180
(2) INFORMATION FOR SEQ ID NO: 248:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 359 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248:
Met Lys Val Leu Ser Tyr Leu Lys Asn Phe Tyr Leu Phe Leu Ala He
1 5 10 15
Gly Ala He Met Gin Ala Ser Glu Asn Met Gly Ser Gin His Gin Lys
20 25 30
Thr Asp Glu Arg Val He Tyr Leu Ala Gly Gly Cys Phe Trp Gly Leu
35 40 45
Glu Ala Tyr Met Glu Arg He Tyr Gly Val He Asp Ala Ser Ser Gly
50 55 60
Tyr Ala Asn Gly Lys Thr Ser Ser Thr Asn Tyr Glu Lys Leu His Glu 65 70 75 80
Ser Asp His Ala Glu Ser Val Lys Val He Tyr Asp Pro Lys Lys He
85 90 95
Ser Leu Asp Lys Leu Leu Arg Tyr Tyr Phe Lys Val Val Asp Pro Val
100 105 110
Ser Val Asn Lys Gin Gly Asn Asp Val Gly Arg Gin Tyr Arg Thr Gly
115 120 125
He Tyr Tyr Val Asn Ser Ala Asp Lys Glu Val He Asp His Ala Leu
130 135 140
Lys Ala Leu Gin Lys Glu Val Lys Gly Lys He Ala He Glu Val Glu 145 150 155 160
Pro Leu Lys Asn Tyr Val Arg Ala Glu Glu Tyr His Gin Asp Tyr Leu
165 170 175
Lys Lys His Pro Ser Gly Tyr Cys His He Asp Leu Lys Lys Ala Asp
180 185 190
Glu Val He Val Asp Asp Asp Lys Tyr Thr Lys Pro Ser Asp Glu Val
195 200 205
Leu Lys Lys Lys Leu Thr Lys Leu Gin Tyr Glu Val Thr Gin Asn Lys
210 215 220
His Thr Glu Lys Pro Phe Glu Asn Glu Tyr Tyr Asn Lys Glu Glu Glu 225 230 235 240
Gly He Tyr Val Asp He Thr Thr Gly Glu Pro Leu Phe Ser Ser Ala
245 250 255
Asp Lys Tyr Asp Ser Gly Cys Gly Trp Pro Ser Phe Ser Lys Pro He
260 265 270
Asn Lys Asp Val Val Lys Tyr Glu Asp Asp Glu Ser Leu Asn Arg Lys
275 280 285
Arg He Glu Val Leu Ser Arg He Gly Lys Ala His Leu Gly His Val 290 295 300
Phe Asn Asp Gly Pro Lys Glu Leu Gly Gly Leu Arg Tyr Cys He Asn 305 310 315 320
Ser Ala Ala Leu Arg Phe He Pro Leu Lys Asp Met Glu Lys Glu Gly
325 330 335
Tyr Gly Glu Phe He Pro Tyr He Lys Lys Gly Glu Leu Lys Lys Tyr
340 345 350
He Asn Asp Lys Lys Ser His 355
(2) INFORMATION FOR SEQ ID NO: 249:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 898 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...845 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249:
TTATTTCTTA TTATTGTAAG GATTTAGGCT ATTGAACTTT AGGAGTTTTA ATG ATA 56
Met He
1
TTA AGA GCG AGT GTG TTG AGC GCG TTA CTT CTT GTA GGC TTA GGG GCA 104 Leu Arg Ala Ser Val Leu Ser Ala Leu Leu Leu Val Gly Leu Gly Ala 5 10 15
GCC CCT AAA CAT TCA GTT TCA GCT AAT GAC AAA CGG ATG CAG GAT AAT 152 Ala Pro Lys His Ser Val Ser Ala Asn Asp Lys Arg Met Gin Asp Asn 20 25 30
TTA GTG AGC GTG ATT GAA AAA CAG ACC AAT AAA AAG GTG CGT ATT TTA 200 Leu Val Ser Val He Glu Lys Gin Thr Asn Lys Lys Val Arg He Leu 35 40 45 50
GAA ATC AAA CCT TTA AAA TCT AGC CAG GAT TTA AAA ATG GTC GTT ATT 248 Glu He Lys Pro Leu Lys Ser Ser Gin Asp Leu Lys Met Val Val He 55 60 65
GAA GAT CCG GAC ACT AAA TAC AAT ATC CCG CTT GTG GTG AGT AAG GAT 296 Glu Asp Pro Asp Thr Lys Tyr Asn He Pro Leu Val Val Ser Lys Asp 70 75 80
GGT AAT TTA ATC ATA GGG CTT AGC AAC ATA TTC TTT AGC AAT AAA AGC 344 Gly Asn Leu He He Gly Leu Ser Asn He Phe Phe Ser Asn Lys Ser 85 90 95
GAT GAT GTG CAA TTA GTT GCA GAA ACC AAT CAA AAA GTT CAA GCT CTT 392 Asp Asp Val Gin Leu Val Ala Glu Thr Asn Gin Lys Val Gin Ala Leu 100 105 110
AAC GCC ACC CAA CAA AAT AGC GCG AAA TTG AAC GCT ATT TTT AAT GAA 440 Asn Ala Thr Gin Gin Asn Ser Ala Lys Leu Asn Ala He Phe Asn Glu 115 120 125 130
ATA CCG GCT GAT TAT GCG ATA GAG TTG CCC TCT ACT AAC GCT GCA AAT 488 He Pro Ala Asp Tyr Ala He Glu Leu Pro Ser Thr Asn Ala Ala Asn 135 140 145
AAG GAT AAA ATC CTT TAT ATT GTC TCT GAT CCC ATG TGC CCA CAT TGC 536 Lys Asp Lys He Leu Tyr He Val Ser Asp Pro Met Cys Pro His Cys 150 155 160
CAA AAA GAG CTC ACT AAA CTT AGG GAT CAT TTA AAA GAA AAC ACC GTG 584 Gin Lys Glu Leu Thr Lys Leu Arg Asp His Leu Lys Glu Asn Thr Val 165 170 175
AGA ATG GTC GTG GTG GGG TGG CTT GGG GTC AAT TCA GCT AAA AAA GCG 632 Arg Met Val Val Val Gly Trp Leu Gly Val Asn Ser Ala Lys Lys Ala 180 185 190
GCT TTA ATC CAA GAA GAA ATG GCG AAA GCT AGG GCT AGG GGA GCG AGC 680 Ala Leu He Gin Glu Glu Met Ala Lys Ala Arg Ala Arg Gly Ala Ser 195 200 205 210
GTG GAA GAT AAG ATC TCT ATT CTT GAA AAG ATT TAT TCC ACC CAA TAC 728 Val Glu Asp Lys He Ser He Leu Glu Lys He Tyr Ser Thr Gin Tyr 215 220 225
GAT ATT AAC GCT CAA AAA GAG CCT GAA GAT TTA CGC ACT AAA GTG GAA 776 Asp He Asn Ala Gin Lys Glu Pro Glu Asp Leu Arg Thr Lys Val Glu 230 235 240
AAT ACC ACT AAA AAG ATT TTT GAA TCT GGC GTG ATT AAG GGT GTG CCT 824 Asn Thr Thr Lys Lys He Phe Glu Ser Gly Val He Lys Gly Val Pro 245 250 255
TTC TTA TAC CAT TAT AAG GCA TGATATAAGG TTGCTCTCAT GAAAAAACCC TATA 879 Phe Leu Tyr His Tyr Lys Ala 260 265
GGAAGATTTC TGATTATGC 898
(2) INFORMATION FOR SEQ ID NO: 250:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 265 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250:
Met He Leu Arg Ala Ser Val Leu Ser Ala Leu Leu Leu Val Gly Leu
1 5 10 15
Gly Ala Ala Pro Lys His Ser Val Ser Ala Asn Asp Lys Arg Met Gin
20 25 30
Asp Asn Leu Val Ser Val He Glu Lys Gin Thr Asn Lys Lys Val Arg
35 40 45
He Leu Glu He Lys Pro Leu Lys Ser Ser Gin Asp Leu Lys Met Val
50 55 60
Val He Glu Asp Pro Asp Thr Lys Tyr Asn He Pro Leu Val Val Ser 65 70 75 80
Lys Asp Gly Asn Leu He He Gly Leu Ser Asn He Phe Phe Ser Asn
85 90 95
Lys Ser Asp Asp Val Gin Leu Val Ala Glu Thr Asn Gin Lys Val Gin
100 105 110
Ala Leu Asn Ala Thr Gin Gin Asn Ser Ala Lys Leu Asn Ala He Phe
115 120 125
Asn Glu He Pro Ala Asp Tyr Ala He Glu Leu Pro Ser Thr Asn Ala
130 135 140
Ala Asn Lys Asp Lys He Leu Tyr He Val Ser Asp Pro Met Cys Pro 145 150 155 160
His Cys Gin Lys Glu Leu Thr Lys Leu Arg Asp His Leu Lys Glu Asn
165 170 175
Thr Val Arg Met Val Val Val Gly Trp Leu Gly Val Asn Ser Ala Lys
180 185 190
Lys Ala Ala Leu He Gin Glu Glu Met Ala Lys Ala Arg Ala Arg Gly
195 200 205
Ala Ser Val Glu Asp Lys He Ser He Leu Glu Lys He Tyr Ser Thr
210 215 220
Gin Tyr Asp He Asn Ala Gin Lys Glu Pro Glu Asp Leu Arg Thr Lys 225 230 235 240
Val Glu Asn Thr Thr Lys Lys He Phe Glu Ser Gly Val He Lys Gly
245 250 255
Val Pro Phe Leu Tyr His Tyr Lys Ala 260 265
(2) INFORMATION FOR SEQ ID NO: 251:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 760 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...707 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:251:
TCTTTTAGAC GAAAACGCCA TGATTTTACA CTGGCAAAAA GAGGGCTTGC ATG CGT 56
Met Arg 1
AAA ATC TTG TTA TTG GGT CTG ATT TTA CAA GCG CTC TTC AGC GAA GAA 104 Lys He Leu Leu Leu Gly Leu He Leu Gin Ala Leu Phe Ser Glu Glu 5 10 15
GCC GCG CAA GAA TTG TTG CAA TGC TCT GCG ATT TTT GAA TCT AAA AAA 152 Ala Ala Gin Glu Leu Leu Gin Cys Ser Ala He Phe Glu Ser Lys Lys 20 25 30
GCC GAA TTG AAA GAC GAT TTG CGC CGA TTG AGT GAA AAA GAG CAG TCT 200 Ala Glu Leu Lys Asp Asp Leu Arg Arg Leu Ser Glu Lys Glu Gin Ser 35 40 45 50
TTA AGG ATC TTG CAA ACC GAA AAC GCC CGC CTT TTA GAT GAA AAA ACC 248 Leu Arg He Leu Gin Thr Glu Asn Ala Arg Leu Leu Asp Glu Lys Thr 55 60 65
GAT CTG TTG AAC CAA AAA GAA AAA GAA GTG GAA GAA AAA CTG AAA AAT 296 Asp Leu Leu Asn Gin Lys Glu Lys Glu Val Glu Glu Lys Leu Lys Asn 70 75 80
TTA GCC GCT AAA GAA GAA GCC TTT AAA ACC TTA CAA ACG GAA GAA AAA 344 Leu Ala Ala Lys Glu Glu Ala Phe Lys Thr Leu Gin Thr Glu Glu Lys 85 90 95
AAA CGC CTT AAA AAT TTG ATA GAA GAA AAC GAA GGC ATT TTA AGA GAA 392 Lys Arg Leu Lys Asn Leu He Glu Glu Asn Glu Gly He Leu Arg Glu 100 105 110
ATC AAG CAG GCT AAA GAC AGC AAG ATT GGC GAG ACT TAT TCT AAA ATG 440 He Lys Gin Ala Lys Asp Ser Lys He Gly Glu Thr Tyr Ser Lys Met 115 120 125 130
AAA GAT TCT AAA TCG GCT CTG ATT TTA GAA AAT TTA CCC ACT CAA AAC 488 Lys Asp Ser Lys Ser Ala Leu He Leu Glu Asn Leu Pro Thr Gin Asn 135 140 145
GCA TTA GAA ATT TTA ATG GCG CTA AAA CCC CAA GAA CTC GGT AAA ATT 536 Ala Leu Glu He Leu Met Ala Leu Lys Pro Gin Glu Leu Gly Lys He 150 155 160
TTA GCC AAA ATG GAT CCT AAA AAA GCG GCG GCT TTG ACA GAG TTG TGG 584 Leu Ala Lys Met Asp Pro Lys Lys Ala Ala Ala Leu Thr Glu Leu Trp 165 170 175 CAA AAA CCC CCA AAA GAA AAT AAA GAA ATC CCA AAA ACC ACA GCA CCC 632 Gin Lys Pro Pro Lys Glu Asn Lys Glu He Pro Lys Thr Thr Ala Pro 180 185 190
ACG CCC CCT ATA GCA CCC ACG CCT TTA AAA GAG CCG ATG ATA AAA GAT 680 Thr Pro Pro He Ala Pro Thr Pro Leu Lys Glu Pro Met He Lys Asp 195 200 205 210
CCT AAC ACC AAA GAG CCT GCA GGG GTA TGATGTTCAT TGTAGCGGTT TTGATGC 734 Pro Asn Thr Lys Glu Pro Ala Gly Val 215
TGGCGTTTTT AATCTTTGTC CATGAA 760
(2) INFORMATION FOR SEQ ID NO:252:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 219 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:252:
Met Arg Lys He Leu Leu Leu Gly Leu He Leu Gin Ala Leu Phe Ser
1 5 10 15
Glu Glu Ala Ala Gin Glu Leu Leu Gin Cys Ser Ala He Phe Glu Ser
20 25 30
Lys Lys Ala Glu Leu Lys Asp Asp Leu Arg Arg Leu Ser Glu Lys Glu
35 40 45
Gin Ser Leu Arg He Leu Gin Thr Glu Asn Ala Arg Leu Leu Asp Glu
50 55 60
Lys Thr Asp Leu Leu Asn Gin Lys Glu Lys Glu Val Glu Glu Lys Leu 65 70 75 80
Lys Asn Leu Ala Ala Lys Glu Glu Ala Phe Lys Thr Leu Gin Thr Glu
85 90 95
Glu Lys Lys Arg Leu Lys Asn Leu He Glu Glu Asn Glu Gly He Leu
100 105 110
Arg Glu He Lys Gin Ala Lys Asp Ser Lys He Gly Glu Thr Tyr Ser
115 120 125
Lys Met Lys Asp Ser Lys Ser Ala Leu He Leu Glu Asn Leu Pro Thr
130 135 140
Gin Asn Ala Leu Glu He Leu Met Ala Leu Lys Pro Gin Glu Leu Gly 145 150 155 160
Lys He Leu Ala Lys Met Asp Pro Lys Lys Ala Ala Ala Leu Thr Glu
165 170 175
Leu Trp Gin Lys Pro Pro Lys Glu Asn Lys Glu He Pro Lys Thr Thr
180 185 190
Ala Pro Thr Pro Pro He Ala Pro Thr Pro Leu Lys Glu Pro Met He
195 200 205
Lys Asp Pro Asn Thr Lys Glu Pro Ala Gly Val 210 215 (2) INFORMATION FOR SEQ ID NO:253:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1393 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1340 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:253:
CTAAAGTGCG CTAAAATTCA CTTCAGTGAT ACAAAAAAGG AAATAAAATA ATG AAT 56
Met Asn 1
ATT CAA ATA AAG AAA AGG TTT TTA GCA AAT TTG TTG CTT TTT AGC CTG 104 He Gin He Lys Lys Arg Phe Leu Ala Asn Leu Leu Leu Phe Ser Leu 5 10 15
TTT TGC CTT AAG GCT GAA ACC CTT TCA GAA GAT CAT CAA ATC CTG TTG X52 Phe Cys Leu Lys Ala Glu Thr Leu Ser Glu Asp His Gin He Leu Leu 20 25 30
AGT TCA GAC GCT TTC CAT AGA GGG GAT TTT GCT GCC GCT CAA AAA GGC 200 Ser Ser Asp Ala Phe His Arg Gly Asp Phe Ala Ala Ala Gin Lys Gly 35 40 45 50
TAT ATG AAT CTC TAT AAG CAA ACC AAT AAG GTG GTG TAT GCT AAA GAA 248 Tyr Met Asn Leu Tyr Lys Gin Thr Asn Lys Val Val Tyr Ala Lys Glu 55 60 65
GCG GCC ATT TCA GCG GCG AGC TTA GGG GAT ATT AAA ACC GCT ATG CAT 296 Ala Ala He Ser Ala Ala Ser Leu Gly Asp He Lys Thr Ala Met His 70 75 80
TTA GCC ATG CTC TAT CAA AAA ATC ACC AAT AAT CGT AAT GAT GTT TCT 344 Leu Ala Met Leu Tyr Gin Lys He Thr Asn Asn Arg Asn Asp Val Ser 85 90 95
ATC AAT AAG ATT TTA GTG GAT GGC TAT GCG CAA ATG GGG CAG ATT GAT 392 He Asn Lys He Leu Val Asp Gly Tyr Ala Gin Met Gly Gin He Asp 100 105 110
AAG GCG ATT GAA TTG CTG CAC AAA ATC CGT AAA GAA GAA AAG ACC ATA 440 Lys Ala He Glu Leu Leu His Lys He Arg Lys Glu Glu Lys Thr He 115 120 125 130 GCC ACA GAC AAT GTG TTA GGG ACT TTG TAT TTG ACT CAA AAG CGT TTG 488 Ala Thr Asp Asn Val Leu Gly Thr Leu Tyr Leu Thr Gin Lys Arg Leu 135 140 145
GAT AAG GCT TTC CCA TTG TTG AAT AAG TTT TAT AAC CAA GTG CAT GAT 536 Asp Lys Ala Phe Pro Leu Leu Asn Lys Phe Tyr Asn Gin Val His Asp 150 155 160
GAA GAC AGC CTA GAA AAA CTC ATT ACG ATC TAT TTT TTG CAA AAT CGT 584 Glu Asp Ser Leu Glu Lys Leu He Thr He Tyr Phe Leu Gin Asn Arg 165 170 175
AAA AAA GAG GGC TTG GAT TTG TTG CAA TCT CAT ATA GAC AGG TAT GGT 632 Lys Lys Glu Gly Leu Asp Leu Leu Gin Ser His He Asp Arg Tyr Gly 180 185 190
TGC TCA GAG CAA TTG TGC CAA AAA GCG CTC AAC ACT TTC ACG CAA TTT 680 Cys Ser Glu Gin Leu Cys Gin Lys Ala Leu Asn Thr Phe Thr Gin Phe 195 200 205 210
AAC GAG CTT GAT TTG GCT AAA ACG ACT TTT GCT CGT TTG TAT GAA AAA 728 Asn Glu Leu Asp Leu Ala Lys Thr Thr Phe Ala Arg Leu Tyr Glu Lys 215 220 225
AAC CCT ATT GTT CAA AAT GCT CAG TTT TAC ATA GGG GTA TTA ATC TTG 776 Asn Pro He Val Gin Asn Ala Gin Phe Tyr He Gly Val Leu He Leu 230 235 240
TTA AAA GAG TTT GAT AAG GCC CAG AAA ATC GCA GAA TTA TTC CCT TTT 824 Leu Lys Glu Phe Asp Lys Ala Gin Lys He Ala Glu Leu Phe Pro Phe 245 250 255
GAC AGG CGT TTG TTG TTA GAC TTA TAC ACC GCA CAA AAA AAA TTC GAT 872 Asp Arg Arg Leu Leu Leu Asp Leu Tyr Thr Ala Gin Lys Lys Phe Asp 260 265 270
CAA GCT TCC AAA CAA GCT TCT TTG ATC TAT CAA GAA AAA AAA GAC CCT 920 Gin Ala Ser Lys Gin Ala Ser Leu He Tyr Gin Glu Lys Lys Asp Pro 275 280 285 290
AAA TTC TTA GGA TTA GAG GCC ATT TAT CAT TAT GAA AGC TTG AGT GCG 968 Lys Phe Leu Gly Leu Glu Ala He Tyr His Tyr Glu Ser Leu Ser Ala 295 300 305
AAT AAG AAA AAG CTC ACC AAA GAA GAG ATG TTG CCT ATC ATT CAA AAA 1016 Asn Lys Lys Lys Leu Thr Lys Glu Glu Met Leu Pro He He Gin Lys 310 315 320
TTA GAG CAA GCC ACC AAA GAG CGC CAA GCA TGG CTC GCT AAA ACC AAA 1064 Leu Glu Gin Ala Thr Lys Glu Arg Gin Ala Trp Leu Ala Lys Thr Lys 325 330 335
GAT AAA GAA GAC GCG CAA GAC GCT TTC TTT TAT AAT TTT TTA GGG TAT 1112 Asp Lys Glu Asp Ala Gin Asp Ala Phe Phe Tyr Asn Phe Leu Gly Tyr 340 345 350 TCC TTA ATA GAT TAT GAC ATG GAT ATT AAA AGG GGC ATG GAT TTT GTG 1160 Ser Leu He Asp Tyr Asp Met Asp He Lys Arg Gly Met Asp Phe Val 355 360 365 370
AGG AAA GCC TTA GCG TTG GAT TCT GGA TCA GTG CTT TAT TTG GAT TCT 1208 Arg Lys Ala Leu Ala Leu Asp Ser Gly Ser Val Leu Tyr Leu Asp Ser 375 380 385
TTA GCA TGG GGT TAT TAC AAA TTA GGG AAT TGT TTG GAA GCT AAA AAA 1256 Leu Ala Trp Gly Tyr Tyr Lys Leu Gly Asn Cys Leu Glu Ala Lys Lys 390 395 400
ATC TTT TCT AGC ATC GCT AAA GAG TCT ATC CAA GCC GAA CCT GAA TTG 1304 He Phe Ser Ser He Ala Lys Glu Ser He Gin Ala Glu Pro Glu Leu 405 410 415
AAA GAA CAC AAT AAA ATC ATT CAA GAA TGC AAG AAA TAGGGATTTT AGAAAA 1356 Lys Glu His Asn Lys He He Gin Glu Cys Lys Lys 420 425 430
TTTACAAAAA AGCTTAGCCT TAAAAGAGGG CATGCTT 1393
(2) INFORMATION FOR SEQ ID NO:254:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 430 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254:
Met Asn He Gin He Lys Lys Arg Phe Leu Ala Asn Leu Leu Leu Phe
1 5 10 15
Ser Leu Phe Cys Leu Lys Ala Glu Thr Leu Ser Glu Asp His Gin He
20 25 30
Leu Leu Ser Ser Asp Ala Phe His Arg Gly Asp Phe Ala Ala Ala Gin
35 40 45
Lys Gly Tyr Met Asn Leu Tyr Lys Gin Thr Asn Lys Val Val Tyr Ala
50 55 60
Lys Glu Ala Ala He Ser Ala Ala Ser Leu Gly Asp He Lys Thr Ala 65 70 75 80
Met His Leu Ala Met Leu Tyr Gin Lys He Thr Asn Asn Arg Asn Asp
85 90 95
Val Ser He Asn Lys He Leu Val Asp Gly Tyr Ala Gin Met Gly Gin
100 105 110
He Asp Lys Ala He Glu Leu Leu His Lys He Arg Lys Glu Glu Lys
115 120 125
Thr He Ala Thr Asp Asn Val Leu Gly Thr Leu Tyr Leu Thr Gin Lys
130 135 140
Arg Leu Asp Lys Ala Phe Pro Leu Leu Asn Lys Phe Tyr Asn Gin Val 145 150 155 160 His Asp Glu Asp Ser Leu Glu Lys Leu He Thr He Tyr Phe Leu Gin
165 170 175
Asn Arg Lys Lys Glu Gly Leu Asp Leu Leu Gin Ser His He Asp Arg
180 185 190
Tyr Gly Cys Ser Glu Gin Leu Cys Gin Lys Ala Leu Asn Thr Phe Thr
195 200 205
Gin Phe Asn Glu Leu Asp Leu Ala Lys Thr Thr Phe Ala Arg Leu Tyr
210 215 220
Glu Lys Asn Pro He Val Gin Asn Ala Gin Phe Tyr He Gly Val Leu 225 230 235 240
He Leu Leu Lys Glu Phe Asp Lys Ala Gin Lys He Ala Glu Leu Phe
245 250 255
Pro Phe Asp Arg Arg Leu Leu Leu Asp Leu Tyr Thr Ala Gin Lys Lys
260 265 270
Phe Asp Gin Ala Ser Lys Gin Ala Ser Leu He Tyr Gin Glu Lys Lys
275 280 285
Asp Pro Lys Phe Leu Gly Leu Glu Ala He Tyr His Tyr Glu Ser Leu
290 295 300
Ser Ala Asn Lys Lys Lys Leu Thr Lys Glu Glu Met Leu Pro He He 305 310 315 320
Gin Lys Leu Glu Gin Ala Thr Lys Glu Arg Gin Ala Trp Leu Ala Lys
325 330 335
Thr Lys Asp Lys Glu Asp Ala Gin Asp Ala Phe Phe Tyr Asn Phe Leu
340 345 350
Gly Tyr Ser Leu He Asp Tyr Asp Met Asp He Lys Arg Gly Met Asp
355 360 365
Phe Val Arg Lys Ala Leu Ala Leu Asp Ser Gly Ser Val Leu Tyr Leu
370 375 380
Asp Ser Leu Ala Trp Gly Tyr Tyr Lys Leu Gly Asn Cys Leu Glu Ala 385 390 395 400
Lys Lys He Phe Ser Ser He Ala Lys Glu Ser He Gin Ala Glu Pro
405 410 415
Glu Leu Lys Glu His Asn Lys He He Gin Glu Cys Lys Lys 420 425 430
(2) INFORMATION FOR SEQ ID NO:255:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1090 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1037 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 TTACCCTAAA ACGCTATTTT TAAAATAATC CATTAAAATA AAGGCGAGGA ATG AAA 56
Met Lys 1
AGA TTT GTT TTG TTT TTA TTG TTC ATG TGC GTT TGC GTT CAA GCT TAC 104 Arg Phe Val Leu Phe Leu Leu Phe Met Cys Val Cys Val Gin Ala Tyr 5 10 15
GCC GAG CAA GAT TAC TTT TTT AGG GAT TTT AAA TCT AGA GAT TTG CCC 152 Ala Glu Gin Asp Tyr Phe Phe Arg Asp Phe Lys Ser Arg Asp Leu Pro 20 25 30
CAA AAA CTC CAT CTT GAT AAA AAG CTC TCC CAA ACA ATA CAG CCA TGC 200 Gin Lys Leu His Leu Asp Lys Lys Leu Ser Gin Thr He Gin Pro Cys 35 40 45 50
ATG CAA CTT AAC GCA TCA AAA CAC TAC ACT TCT ACC GGG GTT AGA GAG 248 Met Gin Leu Asn Ala Ser Lys His Tyr Thr Ser Thr Gly Val Arg Glu 55 60 65
CCT GAT AAA TGC ACA AAG AGT TTT AAA AAA TCC GCT CTC ATG TCC TAT 296 Pro Asp Lys Cys Thr Lys Ser Phe Lys Lys Ser Ala Leu Met Ser Tyr 70 75 80
GAC TTA GCG CTA GGT TAT TTG GTG AGT AAG AAT AAG CAA TAC GGC TTA 344 Asp Leu Ala Leu Gly Tyr Leu Val Ser Lys Asn Lys Gin Tyr Gly Leu 85 90 95
AAG GCT ATA GAA ATT TTA AAC GCT TGG GCT AAA GAG CTT CAA AGC GTG 392 Lys Ala He Glu He Leu Asn Ala Trp Ala Lys Glu Leu Gin Ser Val 100 105 110
GAT ACT TAT CAG AGC GAG GAT AAT ATC AAT TTT TAC ATG CCT TAT ATG 440 Asp Thr Tyr Gin Ser Glu Asp Asn He Asn Phe Tyr Met Pro Tyr Met 115 120 125 130
AAC ATG GCT TAT TGG TTT GTC AAA AAG GCG TTT CCT AGC CCA GAA TAT 488 Asn Met Ala Tyr Trp Phe Val Lys Lys Ala Phe Pro Ser Pro Glu Tyr 135 140 145
GAA GAT TTC ATT AAG CGG ATG CGC CAG TAT TCT CAA TCA GCT CTT AAC 536 Glu Asp Phe He Lys Arg Met Arg Gin Tyr Ser Gin Ser Ala Leu Asn 150 155 160
ACT AAC CAT GGG GCG TGG GGC ATT CTT TTT GAT GTG AGT TCT GCG CTA 584 Thr Asn His Gly Ala Trp Gly He Leu Phe Asp Val Ser Ser Ala Leu 165 170 175
GCG TTA GAC GAT AAT GCC CTT TTG CAC AAT AGC GCT AAT CGG TGG CAG 632 Ala Leu Asp Asp Asn Ala Leu Leu His Asn Ser Ala Asn Arg Trp Gin 180 185 190
GAG TGG GTG TTT AAA GCC ATA GAT GAG AAT GGG GTT ATT GNT AGC GCG 680 Glu Trp Val Phe Lys Ala He Asp Glu Asn Gly Val He Xaa Ser Ala 195 200 205 210 ATC ACT AGG AGC GAT ACG AGC GAT TAT CAT GGC GGC CCT ACA AAG GGC 728 He Thr Arg Ser Asp Thr Ser Asp Tyr His Gly Gly Pro Thr Lys Gly 215 220 225
ATT AAG GGG ATA GCT TAT ACC AAT TTC GCG CTT CTT GCG CTA ACC ATA 776 He Lys Gly He Ala Tyr Thr Asn Phe Ala Leu Leu Ala Leu Thr He 230 235 240
TCA GGC GAA TTG CTT TTT GAG AAC GGG TAT GAT TTG TGG GGT AGT GGA 824 Ser Gly Glu Leu Leu Phe Glu Asn Gly Tyr Asp Leu Trp Gly Ser Gly 245 250 255
GCT GGG AAA AGG CTC TCT GTG GCG TAT AAC AAA GTT GCA ACA TGG ATT 872 Ala Gly Lys Arg Leu Ser Val Ala Tyr Asn Lys Val Ala Thr Trp He 260 265 270
TTA AAC CCT GAA ACT TTC CCT TAT TTC CAG CCT AAC CTT ATC GGG GTG 920 Leu Asn Pro Glu Thr Phe Pro Tyr Phe Gin Pro Asn Leu He Gly Val 275 280 285 290
CAT AAC AAC GCC TAT TTC ATT ATT TTA GCC AAG CAT TAT TCT AGC CCT 968 His Asn Asn Ala Tyr Phe He He Leu Ala Lys His Tyr Ser Ser Pro 295 300 305
AGT GCA AAT GAG CTT TTA AAG CAA GGC GAT TTA CAC GAA GAT GGT TTC 1016 Ser Ala Asn Glu Leu Leu Lys Gin Gly Asp Leu His Glu Asp Gly Phe 310 315 320
AGG CTG AAA CTC CGA TCG CCA TGAATTTTTC TGTATCCAAG GTTAGCCTTA AGGA 1071 Arg Leu Lys Leu Arg Ser Pro 325
TGGCCATGCG CTTTAACCT 1090
(2) INFORMATION FOR SEQ ID NO: 256:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 329 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256:
Met Lys Arg Phe Val Leu Phe Leu Leu Phe Met Cys Val Cys Val Gin
1 5 10 15
Ala Tyr Ala Glu Gin Asp Tyr Phe Phe Arg Asp Phe Lys Ser Arg Asp
20 25 30
Leu Pro Gin Lys Leu His Leu Asp Lys Lys Leu Ser Gin Thr He Gin
35 40 45
Pro Cys Met Gin Leu Asn Ala Ser Lys His Tyr Thr Ser Thr Gly Val 50 55 60 Arg Glu Pro Asp Lys Cys Thr Lys Ser Phe Lys Lys Ser Ala Leu Met 65 70 75 80
Ser Tyr Asp Leu Ala Leu Gly Tyr Leu Val Ser Lys Asn Lys Gin Tyr
85 90 95
Gly Leu Lys Ala He Glu He Leu Asn Ala Trp Ala Lys Glu Leu Gin
100 105 110
Ser Val Asp Thr Tyr Gin Ser Glu Asp Asn He Asn Phe Tyr Met Pro
115 120 125
Tyr Met Asn Met Ala Tyr Trp Phe Val Lys Lys Ala Phe Pro Ser Pro
130 135 140
Glu Tyr Glu Asp Phe He Lys Arg Met Arg Gin Tyr Ser Gin Ser Ala 145 150 155 160
Leu Asn Thr Asn His Gly Ala Trp Gly He Leu Phe Asp Val Ser Ser
165 170 175
Ala Leu Ala Leu Asp Asp Asn Ala Leu Leu His Asn Ser Ala Asn Arg
180 185 190
Trp Gin Glu Trp Val Phe Lys Ala He Asp Glu Asn Gly Val He Xaa
195 200 205
Ser Ala He Thr Arg Ser Asp Thr Ser Asp Tyr His Gly Gly Pro Thr
210 215 220
Lys Gly He Lys Gly He Ala Tyr Thr Asn Phe Ala Leu Leu Ala Leu 225 230 235 240
Thr He Ser Gly Glu Leu Leu Phe Glu Asn Gly Tyr Asp Leu Trp Gly
245 250 255
Ser Gly Ala Gly Lys Arg Leu Ser Val Ala Tyr Asn Lys Val Ala Thr
260 265 270
Trp He Leu Asn Pro Glu Thr Phe Pro Tyr Phe Gin Pro Asn Leu He
275 280 285
Gly Val His Asn Asn Ala Tyr Phe He He Leu Ala Lys His Tyr Ser
290 295 300
Ser Pro Ser Ala Asn Glu Leu Leu Lys Gin Gly Asp Leu His Glu Asp 305 310 315 320
Gly Phe Arg Leu Lys Leu Arg Ser Pro 325
(2) INFORMATION FOR SEQ ID NO: 257:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 373 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...320 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: TTAACGATCG CAAAAGCCGA TGAAAGTTTT GATGAAATCA TAAAAGGTGT GTG AAT 56 Val Asn
1
TTT TTG AAA AAG CCA AAG TAT TAT AAA TTC ATA GAG GGG GCG AAT TAT 104 Phe Leu Lys Lys Pro Lys Tyr Tyr Lys Phe He Glu Gly Ala Asn Tyr 5 10 15
TTG AGC TTG GGG CTT TCT ATG GTG GTA GCG ATC CTT ATG GGC GTG GCT 152 Leu Ser Leu Gly Leu Ser Met Val Val Ala He Leu Met Gly Val Ala 20 25 30
ATA GGC TAT GGG CTT AAA AAA CTC ACT CAT ATT TCG TGG CTT TTT TGG 200 He Gly Tyr Gly Leu Lys Lys Leu Thr His He Ser Trp Leu Phe Trp 35 40 45 50
CTT GGG GTT ATT TGG GGC GTC TTA GCG AGC TTT CTC AAT GTC TAT AAA 248 Leu Gly Val He Trp Gly Val Leu Ala Ser Phe Leu Asn Val Tyr Lys 55 60 65
GCT TAT AAA AAC ATG CAA AAA GAC TAT GAA GAA CTA GCC AAA GAC CCT 296 Ala Tyr Lys Asn Met Gin Lys Asp Tyr Glu Glu Leu Ala Lys Asp Pro 70 75 80
AAA TAC ACA CAA AAT AAA ACA AAA TAAATCCAAT CAAATCCCAT GTGCCAAATC 350 Lys Tyr Thr Gin Asn Lys Thr Lys 85 90
CAATGCTTGC TTATTTTACT TTC 373
(2) INFORMATION FOR SEQ ID NO: 258:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 90 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258:
Val Asn Phe Leu Lys Lys Pro Lys Tyr Tyr Lys Phe He Glu Gly Ala
1 5 10 15
Asn Tyr Leu Ser Leu Gly Leu Ser Met Val Val Ala He Leu Met Gly
20 25 30
Val Ala He Gly Tyr Gly Leu Lys Lys Leu Thr His He Ser Trp Leu
35 40 45
Phe Trp Leu Gly Val He Trp Gly Val Leu Ala Ser Phe Leu Asn Val
50 55 60
Tyr Lys Ala Tyr Lys Asn Met Gin Lys Asp Tyr Glu Glu Leu Ala Lys 65 70 75 80
Asp Pro Lys Tyr Thr Gin Asn Lys Thr Lys 85 90 (2) INFORMATION FOR SEQ ID NO: 259:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 643 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...590 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:259:
TTGGTTGGTT GTTTTTATCA TAGAGTGTAA TTTAAAATAA GGATCATTTG ATG TTA 56
Met Leu 1
AAC AAG TTT AAA AAA ATC GTT GGC GTT AGT GTG TTA GTG GGC TGT TTA 104 Asn Lys Phe Lys Lys He Val Gly Val Ser Val Leu Val Gly Cys Leu 5 10 15
GGG GTT TTG CAA GCT AAA AAC AGC TTA TTT GTC TTA CCT TAT GAG CAA 152 Gly Val Leu Gin Ala Lys Asn Ser Leu Phe Val Leu Pro Tyr Glu Gin 20 25 30
AAA GAC GCT CTC AAT TCT TTA GTT TCT GGC ATT AGT AAC GCC AGA GAG 200 Lys Asp Ala Leu Asn Ser Leu Val Ser Gly He Ser Asn Ala Arg Glu 35 40 45 50
AGC GTG AAA ATC GCT ATC TAT AGT TTC ACG CAC AGA GAT ATT GCA AGA 248 Ser Val Lys He Ala He Tyr Ser Phe Thr His Arg Asp He Ala Arg 55 60 65
GCG ATT AAA AGC GTA GCG AGT AGG GGG ATT AAG GTG CAA ATC ATT TAT 296 Ala He Lys Ser Val Ala Ser Arg Gly He Lys Val Gin He He Tyr 70 75 80
GAT TAT GAA AGC AAT CAT CAT AAC AAG CAA TCC ACT ATT GGC TAT CTG 344 Asp Tyr Glu Ser Asn His His Asn Lys Gin Ser Thr He Gly Tyr Leu 85 90 95
GAC AAA TAC CCT AAC ACG AAA GTG TGC TTA TTG AAA GGG CTT AAG GCT 392 Asp Lys Tyr Pro Asn Thr Lys Val Cys Leu Leu Lys Gly Leu Lys Ala 100 105 110
AAA AAC GGG AAT TAT TAC GGC ATC ATG CAC CAA AAA GTA GCG ATC ATT 440 Lys Asn Gly Asn Tyr Tyr Gly He Met His Gin Lys Val Ala He He 115 120 125 130 GAT GAT AAG ATC GTG TTT TTA GGC TCA GCG AAT TGG AGC AAA AAC GCT 488
Asp Asp Lys He Val Phe Leu Gly Ser Ala Asn Trp Ser Lys Asn Ala 135 140 145
TTT GAA AAC AAT TAT GAA GTG CTT TTA AAA ACC GAT GAC ACA GAA ACG 536
Phe Glu Asn Asn Tyr Glu Val Leu Leu Lys Thr Asp Asp Thr Glu Thr
150 155 160
ATC CTC AAA GCC AAG AGC TAT TAC CAA AAG ATG TTA GGG AGT TGC GTT 584
He Leu Lys Ala Lys Ser Tyr Tyr Gin Lys Met Leu Gly Ser Cys Val 165 170 175
GGG TTT TAAAAGCCCT TTAGAAGTGG TAATTATACC CCACATAAAA GGCAAAGACC CT 642 Gly Phe 180
643
(2) INFORMATION FOR SEQ ID NO: 260:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 180 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260:
Met Leu Asn Lys Phe Lys Lys He Val Gly Val Ser Val Leu Val Gly
1 5 10 15
Cys Leu Gly Val Leu Gin Ala Lys Asn Ser Leu Phe Val Leu Pro Tyr
20 25 30
Glu Gin Lys Asp Ala Leu Asn Ser Leu Val Ser Gly He Ser Asn Ala
35 40 45
Arg Glu Ser Val Lys He Ala He Tyr Ser Phe Thr His Arg Asp He
50 55 60
Ala Arg Ala He Lys Ser Val Ala Ser Arg Gly He Lys Val Gin He 65 70 75 80
He Tyr Asp Tyr Glu Ser Asn His His Asn Lys Gin Ser Thr He Gly
85 90 95
Tyr Leu Asp Lys Tyr Pro Asn Thr Lys Val Cys Leu Leu Lys Gly Leu
100 105 110
Lys Ala Lys Asn Gly Asn Tyr Tyr Gly He Met His Gin Lys Val Ala
115 120 125
He He Asp Asp Lys He Val Phe Leu Gly Ser Ala Asn Trp Ser Lys
130 135 140
Asn Ala Phe Glu Asn Asn Tyr Glu Val Leu Leu Lys Thr Asp Asp Thr 145 150 155 160
Glu Thr He Leu Lys Ala Lys Ser Tyr Tyr Gin Lys Met Leu Gly Ser
165 170 175
Cys Val Gly Phe 180 (2) INFORMATION FOR SEQ ID NO: 261:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 814 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...761 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261:
TATAAGAGAG TATAATTCAA GGCTTAAAAT AACTCAAGTA AGGCTAGTGG ATG AAA 56
Met Lys
1
AAA GCG CTT TAT TTA GGG GCT GTT GCG TTT AGC GTT GCA TTC AGC ATG 104 Lys Ala Leu Tyr Leu Gly Ala Val Ala Phe Ser Val Ala Phe Ser Met 5 10 15
GCA TCA GCC AAT GAG CCA AAA ATT GAT TTT AAC CCT CCC AAT TAT GTA 152 Ala Ser Ala Asn Glu Pro Lys He Asp Phe Asn Pro Pro Asn Tyr Val 20 25 30
GAA GAA ACC CCC TCT AAA GAA TTT ATC CCT GAA TTG AAC AAG TTA GGG 200 Glu Glu Thr Pro Ser Lys Glu Phe He Pro Glu Leu Asn Lys Leu Gly 35 40 45 50
AGT TTG TTT GGG CAG GGT GAG CGC CCC TTG TTT GCG GAC AGG AGG GCG 248 Ser Leu Phe Gly Gin Gly Glu Arg Pro Leu Phe Ala Asp Arg Arg Ala 55 60 65
ATG AAG CCT AAC GAT TTG ATC ACA ATC ATT GTT TCT GAA AAA GCG AGC 296 Met Lys Pro Asn Asp Leu He Thr He He Val Ser Glu Lys Ala Ser 70 75 80
GCG AAT TAT TCC AGC TCT AAA GAT TAT AAA AGC GCT TCA GGG GGT AAT 344 Ala Asn Tyr Ser Ser Ser Lys Asp Tyr Lys Ser Ala Ser Gly Gly Asn 85 90 95
TCC ACG CCC CCA AGA CTC ACT TAT AAC GGG CTA GAT GAA AGA AAG AAA 392 Ser Thr Pro Pro Arg Leu Thr Tyr Asn Gly Leu Asp Glu Arg Lys Lys 100 105 110
AAA GAA GCG GAG TAT TTA GAC GAT AAG AAT AAT TAC AAT TTC ACC AAA 440 Lys Glu Ala Glu Tyr Leu Asp Asp Lys Asn Asn Tyr Asn Phe Thr Lys 115 120 125 130 TCC AGC AAT AAC ACG AAT TTT AAA GGC GGT GGC TCG CAA AAA AAG AGC 488 Ser Ser Asn Asn Thr Asn Phe Lys Gly Gly Gly Ser Gin Lys Lys Ser 135 140 145
GAA GAT TTA GAG ATT GTG TTG AGC GCT CGA ATC ATT AAG GTG CTA GAA 536 Glu Asp Leu Glu He Val Leu Ser Ala Arg He He Lys Val Leu Glu 150 155 160
AAC GGG AAT TAT TTC ATC TAT GGG AAT AAG GAA GTG CTA GTG GAT GGG 584 Asn Gly Asn Tyr Phe He Tyr Gly Asn Lys Glu Val Leu Val Asp Gly 165 170 175
GAA AAG CAA ATC CTT AAG GTG AGT GGG GTG ATC CGC CCT TAT GAT ATT 632 Glu Lys Gin He Leu Lys Val Ser Gly Val He Arg Pro Tyr Asp He 180 185 190
GAA AGG AAT AAC ACC ATC CAA TCC AAG TTT TTA GCC GAC GCT AAG ATT 680 Glu Arg Asn Asn Thr He Gin Ser Lys Phe Leu Ala Asp Ala Lys He 195 200 205 210
GAA TAC ACG AAT TTA GGG CAT TTG AGC GAT TCC AAT AAG AAG AAA TTC 728 Glu Tyr Thr Asn Leu Gly His Leu Ser Asp Ser Asn Lys Lys Lys Phe 215 220 225
GCT GCT GAT GCG ATG GAA ACC CAA ATG CCT TAT TAAAAAGAGC AAAGCCTAGC 781 Ala Ala Asp Ala Met Glu Thr Gin Met Pro Tyr 230 235
ATGAGAGCGA TCGCTATTGT TTTAGCCAGA AGT 814
(2) INFORMATION FOR SEQ ID NO: 262:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 237 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262:
Met Lys Lys Ala Leu Tyr Leu Gly Ala Val Ala Phe Ser Val Ala Phe
1 5 10 15
Ser Met Ala Ser Ala Asn Glu Pro Lys He Asp Phe Asn Pro Pro Asn
20 25 30
Tyr Val Glu Glu Thr Pro Ser Lys Glu Phe He Pro Glu Leu Asn Lys
35 40 45
Leu Gly Ser Leu Phe Gly Gin Gly Glu Arg Pro Leu Phe Ala Asp Arg
50 55 60
Arg Ala Met Lys Pro Asn Asp Leu He Thr He He Val Ser Glu Lys 65 70 75 80
Ala Ser Ala Asn Tyr Ser Ser Ser Lys Asp Tyr Lys Ser Ala Ser Gly 85 90 95 Gly Asn Ser Thr Pro Pro Arg Leu Thr Tyr Asn Gly Leu Asp Glu Arg
100 105 110
Lys Lys Lys Glu Ala Glu Tyr Leu Asp Asp Lys Asn Asn Tyr Asn Phe
115 120 125
Thr Lys Ser Ser Asn Asn Thr Asn Phe Lys Gly Gly Gly Ser Gin Lys
130 135 140
Lys Ser Glu Asp Leu Glu He Val Leu Ser Ala Arg He He Lys Val 145 150 155 160
Leu Glu Asn Gly Asn Tyr Phe He Tyr Gly Asn Lys Glu Val Leu Val
165 170 175
Asp Gly Glu Lys Gin He Leu Lys Val Ser Gly Val He Arg Pro Tyr
180 185 190
Asp He Glu Arg Asn Asn Thr He Gin Ser Lys Phe Leu Ala Asp Ala
195 200 205
Lys He Glu Tyr Thr Asn Leu Gly His Leu Ser Asp Ser Asn Lys Lys
210 215 220
Lys Phe Ala Ala Asp Ala Met Glu Thr Gin Met Pro Tyr 225 230 235
(2) INFORMATION FOR SEQ ID NO:263:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 850 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...797 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:263:
TTGGGTAAGA TTAGGAATTG ATTTTAAAGA AAAAGAAAGA AAGGAATTTA ATG AAA 56
Met Lys
1
AAA GGT AGT TTG GCA ATC GTT TTA GGA TCG CTA TTA GCG AGT GGG GCG 104 Lys Gly Ser Leu Ala He Val Leu Gly Ser Leu Leu Ala Ser Gly Ala 5 10 15
TTT TAT ACG GCT CTA GCT GAT GGA ATG CCT GCA AAA CAG CAG CAC AAT 152 Phe Tyr Thr Ala Leu Ala Asp Gly Met Pro Ala Lys Gin Gin His Asn 20 25 30
AAT ACG GGC GAG TCA GTG GAG TTG CAT TTC CAC TAT CCT ATT AAA GGC 200 Asn Thr Gly Glu Ser Val Glu Leu His Phe His Tyr Pro He Lys Gly 35 40 45 50 AAG CAA GAG CCT AAA AAC AGC CAT TTA GTC GTT TTG ATC GAA CCT AAA 248 Lys Gin Glu Pro Lys Asn Ser His Leu Val Val Leu He Glu Pro Lys 55 60 65
ATA GAG ATC AAT AAA GTT ATC CCT GAA AGT TAT CAA AAA GAG TTT GAG 296 He Glu He Asn Lys Val He Pro Glu Ser Tyr Gin Lys Glu Phe Glu 70 75 80
AAG TCT TTG TTT CTC CAG TTG AGT AGT TTT TTA GAG AGA AAA GGC TAT 344 Lys Ser Leu Phe Leu Gin Leu Ser Ser Phe Leu Glu Arg Lys Gly Tyr 85 90 95
AGC GTT TCG CAA TTT AAA GAT GCT AGC GAA ATC CCT CAA GAC ATC AAA 392 Ser Val Ser Gin Phe Lys Asp Ala Ser Glu He Pro Gin Asp He Lys 100 105 110
GAA AAA GCG TTG CTC GTT TTA CGC ATG GAT GGG AAT GTG GCT ATC TTG 440 Glu Lys Ala Leu Leu Val Leu Arg Met Asp Gly Asn Val Ala He Leu 115 120 125 130
GAA GAT ATT GTA GAA GAG AGC GAT GCG CTT AGC GAA GAA AAA GTG ATA 488 Glu Asp He Val Glu Glu Ser Asp Ala Leu Ser Glu Glu Lys Val He 135 140 145
GAC ATG TCT TCA GGG TAT TTG AAC TTG AAT TTT GTT GAG CCA AAA AGT 536 Asp Met Ser Ser Gly Tyr Leu Asn Leu Asn Phe Val Glu Pro Lys Ser 150 155 160
GAA GAT ATT ATC CAT AGT TTT GGT ATT GAT GTT TCA AAG ATT AAG GCT 584 Glu Asp He He His Ser Phe Gly He Asp Val Ser Lys He Lys Ala 165 170 175
GTG ATT GAA AGA GTG GAA TTG CGG CGC ACC AAT TCT GGA GGT TTT GTC 632 Val He Glu Arg Val Glu Leu Arg Arg Thr Asn Ser Gly Gly Phe Val 180 185 190
CCC AAA ACT TTT GTG CAT AGG ATT AAG GAA ACC GAT CAT GAT CAA GCC 680 Pro Lys Thr Phe Val His Arg He Lys Glu Thr Asp His Asp Gin Ala 195 200 205 210
ATT AGA AAA ATC ATG AAT CAA GCC TAT CAC AAA GTG ATG GTG CAT ATT 728 He Arg Lys He Met Asn Gin Ala Tyr His Lys Val Met Val His He 215 220 225
ACC AAA GAG TTA AGC AAA AAA CAC ATG GAA CAT TAT GAA AAA GTT TCT 776 Thr Lys Glu Leu Ser Lys Lys His Met Glu His Tyr Glu Lys Val Ser 230 235 240
AGT GAA ATG AAA AAA CGA AAG TAGTTTTTAA GAAACGAAAA GCTTAAAAAT CATT 831 Ser Glu Met Lys Lys Arg Lys 245
GAGAGCTATT TTTAAAAAA 850
(2) INFORMATION FOR SEQ ID NO:264: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 249 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:264:
Met Lys Lys Gly Ser Leu Ala He Val Leu Gly Ser Leu Leu Ala Ser
1 5 10 15
Gly Ala Phe Tyr Thr Ala Leu Ala Asp Gly Met Pro Ala Lys Gin Gin
20 25 30
His Asn Asn Thr Gly Glu Ser Val Glu Leu His Phe His Tyr Pro He
35 40 45
Lys Gly Lys Gin Glu Pro Lys Asn Ser His Leu Val Val Leu He Glu
50 55 60
Pro Lys He Glu He Asn Lys Val He Pro Glu Ser Tyr Gin Lys Glu 65 70 75 80
Phe Glu Lys Ser Leu Phe Leu Gin Leu Ser Ser Phe Leu Glu Arg Lys
85 90 95
Gly Tyr Ser Val Ser Gin Phe Lys Asp Ala Ser Glu He Pro Gin Asp
100 105 110
He Lys Glu Lys Ala Leu Leu Val Leu Arg Met Asp Gly Asn Val Ala
115 120 125
He Leu Glu Asp He Val Glu Glu Ser Asp Ala Leu Ser Glu Glu Lys
130 135 140
Val He Asp Met Ser Ser Gly Tyr Leu Asn Leu Asn Phe Val Glu Pro 145 150 155 160
Lys Ser Glu Asp He He His Ser Phe Gly He Asp Val Ser Lys He
165 170 175
Lys Ala Val He Glu Arg Val Glu Leu Arg Arg Thr Asn Ser Gly Gly
180 185 190
Phe Val Pro Lys Thr Phe Val His Arg He Lys Glu Thr Asp His Asp
195 200 205
Gin Ala He Arg Lys He Met Asn Gin Ala Tyr His Lys Val Met Val
210 215 220
His He Thr Lys Glu Leu Ser Lys Lys His Met Glu His Tyr Glu Lys 225 230 235 240
Val Ser Ser Glu Met Lys Lys Arg Lys 245
(2) INFORMATION FOR SEQ ID NO: 265:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 841 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...788 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265:
TACAAATAGG TATAATCACC AATTCCAATC ATTTAATCAA AGGGAGTTCT ATG AAA 56
Met Lys
1
AAT ACT TTC AAA GCG TTT GCC TTT TTA ATT GTA TTT TTT TCA AGC GCT 104 Asn Thr Phe Lys Ala Phe Ala Phe Leu He Val Phe Phe Ser Ser Ala 5 10 15
TTA TTA GCG CAG GAT TTA AAA ATC GCT GCT GCT GCT AAT CTT ACA CGC 152 Leu Leu Ala Gin Asp Leu Lys He Ala Ala Ala Ala Asn Leu Thr Arg 20 25 30
GCT TTA AAA GCC CTT GTT AAA GAA TTT CAA AAA GAA CAC CCC AAA GAC 200 Ala Leu Lys Ala Leu Val Lys Glu Phe Gin Lys Glu His Pro Lys Asp 35 40 45 50
ACT GTT AAT ATT AGC TTT AAT TCT TCA GGC AAA CTC TAC GCT CAA ATC 248 Thr Val Asn He Ser Phe Asn Ser Ser Gly Lys Leu Tyr Ala Gin He 55 60 65
ATT CAA AAC GCC CCT TTT GAT TTA TTC ATT TCA GCA GAT ATG ATT AGA 296 He Gin Asn Ala Pro Phe Asp Leu Phe He Ser Ala Asp Met He Arg 70 75 80
CCT AAA AAG CTT TAT GAT AAA AAA ATA ACC CCT TTT AAA GAA GAA GTC 344 Pro Lys Lys Leu Tyr Asp Lys Lys He Thr Pro Phe Lys Glu Glu Val 85 90 95
TAT GCT AAA GGC GTG TTG GTT TTA TGG AGT GAA GAT CTA AAA ATG GAT 392 Tyr Ala Lys Gly Val Leu Val Leu Trp Ser Glu Asp Leu Lys Met Asp 100 105 110
TCT TTA GAA ATT CTT AAA AAT CCT AAA ATC AAG CGT ATC GCT ATG GCT 440 Ser Leu Glu He Leu Lys Asn Pro Lys He Lys Arg He Ala Met Ala 115 120 125 130
AAT CCT AAA CTA GCC CCT TAT GGA AAA GCC AGC ATG GAA GTC TTA GAG 488 Asn Pro Lys Leu Ala Pro Tyr Gly Lys Ala Ser Met Glu Val Leu Glu 135 140 145
AAT TTA AAA CTC ACT CCC AGT CTT AAA TCT AAA ATC GTT TAT GGC GCT 536 Asn Leu Lys Leu Thr Pro Ser Leu Lys Ser Lys He Val Tyr Gly Ala 150 155 160
TCT ATT TCT CAA GCC CAT CAA TTT GTC GCT ACT AAA AAC GCT CAA ATA 584 Ser He Ser Gin Ala His Gin Phe Val Ala Thr Lys Asn Ala Gin He 165 170 175
GGC TTT GGA GCG TTA TCC TTG ATG GAT AAA AAA GAT AAA AAC CTC TCT 632 Gly Phe Gly Ala Leu Ser Leu Met Asp Lys Lys Asp Lys Asn Leu Ser 180 185 190
TAT TTC ATC ATT GAT AAA GCC CTT TAT AAC CCT ATT GAA CAA GCC TTG 680 Tyr Phe He He Asp Lys Ala Leu Tyr Asn Pro He Glu Gin Ala Leu 195 200 205 210
ATT ATC ACT AAA AAT GGG GCT AAC AAC CCT TTA GCC AAA GTC TTT AAA 728 He He Thr Lys Asn Gly Ala Asn Asn Pro Leu Ala Lys Val Phe Lys 215 220 225
GAT TTT TTA TTC AGC CCT AAA GCC AGA GCT ATT TTT AAA GAA TAC GGC 776 Asp Phe Leu Phe Ser Pro Lys Ala Arg Ala He Phe Lys Glu Tyr Gly 230 235 240
TAT ATT GTG GAT TAAAACGCAT AAAAAAGGCG AGCAATGGAT CATGAGTTTT TGATT 833 Tyr He Val Asp 245
ACCATGCG 841
(2) INFORMATION FOR SEQ ID NO: 266:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 246 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:266:
Met Lys Asn Thr Phe Lys Ala Phe Ala Phe Leu He Val Phe Phe Ser
1 5 10 15
Ser Ala Leu Leu Ala Gin Asp Leu Lys He Ala Ala Ala Ala Asn Leu
20 25 30
Thr Arg Ala Leu Lys Ala Leu Val Lys Glu Phe Gin Lys Glu His Pro
35 40 45
Lys Asp Thr Val Asn He Ser Phe Asn Ser Ser Gly Lys Leu Tyr Ala
50 55 60
Gin He He Gin Asn Ala Pro Phe Asp Leu Phe He Ser Ala Asp Met 65 70 75 80
He Arg Pro Lys Lys Leu Tyr Asp Lys Lys He Thr Pro Phe Lys Glu
85 90 95
Glu Val Tyr Ala Lys Gly Val Leu Val Leu Trp Ser Glu Asp Leu Lys
100 105 110
Met Asp Ser Leu Glu He Leu Lys Asn Pro Lys He Lys Arg He Ala
115 120 125
Met Ala Asn Pro Lys Leu Ala Pro Tyr Gly Lys Ala Ser Met Glu Val 130 135 140 Leu Glu Asn Leu Lys Leu Thr Pro Ser Leu Lys Ser Lys He Val Tyr 145 150 155 160
Gly Ala Ser He Ser Gin Ala His Gin Phe Val Ala Thr Lys Asn Ala
165 170 175
Gin He Gly Phe Gly Ala Leu Ser Leu Met Asp Lys Lys Asp Lys Asn
180 185 190
Leu Ser Tyr Phe He He Asp Lys Ala Leu Tyr Asn Pro He Glu Gin
195 200 205
Ala Leu He He Thr Lys Asn Gly Ala Asn Asn Pro Leu Ala Lys Val
210 215 220
Phe Lys Asp Phe Leu Phe Ser Pro Lys Ala Arg Ala He Phe Lys Glu 225 230 235 240
Tyr Gly Tyr He Val Asp 245
(2) INFORMATION FOR SEQ ID NO: 267:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1459 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1406 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:267:
AGCCATTTTA TGGCATTTAA AAAAGTTTAA AAATGTTTAA AAGGAATTTT ATG TTA 56
Met Leu
1
AGG CTT TTG ATA GGA CTT CTT CTA ATG AGT TTT ATA AGC TTG CAA TCA 104 Arg Leu Leu He Gly Leu Leu Leu Met Ser Phe He Ser Leu Gin Ser 5 10 15
GCC TCT TGG CAA GAA CCC TTA AGA GTG AGT ATA GAA TTT GTG GAT TTG 152 Ala Ser Trp Gin Glu Pro Leu Arg Val Ser He Glu Phe Val Asp Leu 20 25 30
CCT AAA AAA ATC ATT CGT TTT CCG GCT CAT GAT TTG CAA GTG GGG GAG 200 Pro Lys Lys He He Arg Phe Pro Ala His Asp Leu Gin Val Gly Glu 35 40 45 50
TTT GGT TTT GTC GTT ACT AAA CTT TCA GAT TAT GAA ATC GTT AAT TCT 248 Phe Gly Phe Val Val Thr Lys Leu Ser Asp Tyr Glu He Val Asn Ser 55 60 65 GAA GTG GTC ATT ATT GCC GTT GAA AAT GGC GTC GCA ACG GCT AAA TTC 296 Glu Val Val He He Ala Val Glu Asn Gly Val Ala Thr Ala Lys Phe 70 75 80
AGA GCG TTT GAG TCT ATG AAA CAA AGG CAT TTA CCC ACT CCA AGA ATG 344 Arg Ala Phe Glu Ser Met Lys Gin Arg His Leu Pro Thr Pro Arg Met 85 90 95
GTC GCT AGA AAG GGT GAT TTA GTC TAT TTT AGG CAA TTC AAC AAC CAA 392 Val Ala Arg Lys Gly Asp Leu Val Tyr Phe Arg Gin Phe Asn Asn Gin 100 105 110
GCG TTT TTA ATC GCT CCT AAT GAT GAA CTC TAT GAG CAA ATC AGA GCG 440 Ala Phe Leu He Ala Pro Asn Asp Glu Leu Tyr Glu Gin He Arg Ala 115 120 125 130
ACT AAC ACC GAT ATT AAT TTT ATT AGT TCT GAT TTG TTG GTT ACT TTT 488 Thr Asn Thr Asp He Asn Phe He Ser Ser Asp Leu Leu Val Thr Phe 135 140 145
TTG AAT GGG TTT GAC CCA AAA ATC GCT AAT TTA AGG AAA GCG TGC AAC 536 Leu Asn Gly Phe Asp Pro Lys He Ala Asn Leu Arg Lys Ala Cys Asn 150 155 160
GTT TAT AGC GTG GGG GTG ATT TAT ATT GTA ACC ACC AAC ACG CTC AAT 584 Val Tyr Ser Val Gly Val He Tyr He Val Thr Thr Asn Thr Leu Asn 165 170 175
ATT TTA AGT TGT GAG AGT TTT GAA ATT TTA GAA AAA AGA GAG CTG GAT 632 He Leu Ser Cys Glu Ser Phe Glu He Leu Glu Lys Arg Glu Leu Asp 180 185 190
ACA AGC GGC GTT ACT AAA ACT TCC ACG CCG TTT TTT TCT AGG GTT GAG 680 Thr Ser Gly Val Thr Lys Thr Ser Thr Pro Phe Phe Ser Arg Val Glu 195 200 205 210
GGT ATT GAT GCA GGC ACG CTA GGG AAA CTT TTT TCA GGC AGT CAG TCT 728 Gly He Asp Ala Gly Thr Leu Gly Lys Leu Phe Ser Gly Ser Gin Ser 215 220 225
AAA AAT TAC TTC GCT TAC TAT GAC GCT TTA GTG AAG AAA GAA AAA CGC 776 Lys Asn Tyr Phe Ala Tyr Tyr Asp Ala Leu Val Lys Lys Glu Lys Arg 230 235 240
AAA GAA GTG AGG ATT AAA AAG AGG GAA GAA AAG ATT GAT TCT AGA GAA 824 Lys Glu Val Arg He Lys Lys Arg Glu Glu Lys He Asp Ser Arg Glu 245 250 255
ATT AAA CGA GAA ATC AAG CAA GAG GCC ATT AAA GAG CCT AAA AAA GCC 872 He Lys Arg Glu He Lys Gin Glu Ala He Lys Glu Pro Lys Lys Ala 260 265 270
AAT CAA GGC ACA CAA AAC GCT CCT ACT TTA GAA GAG AAA AAC TAC CAA 920 Asn Gin Gly Thr Gin Asn Ala Pro Thr Leu Glu Glu Lys Asn Tyr Gin 275 280 285 290 AAA GCA GAG CGC AAA CTT GAT GCT AAA GAA GAA AGG CGT TAT TTG AGA 968 Lys Ala Glu Arg Lys Leu Asp Ala Lys Glu Glu Arg Arg Tyr Leu Arg 295 300 305
GAT GAA AGG AAA AAA GCC AAA GCC ACC AAA AAG GCT ATG GAA TTT GAA 1016 Asp Glu Arg Lys Lys Ala Lys Ala Thr Lys Lys Ala Met Glu Phe Glu 310 315 320
GAA AGA GAA AAA GAG CAT GAT GAA AGG GAC GAA CAA GAG ACT GAA GGA 1064 Glu Arg Glu Lys Glu His Asp Glu Arg Asp Glu Gin Glu Thr Glu Gly 325 330 335
AGA AGA AAA GCT TTA GAA ATG GAT AAA GGC GAT AAA AAA GAA GAA AGA 1112 Arg Arg Lys Ala Leu Glu Met Asp Lys Gly Asp Lys Lys Glu Glu Arg 340 345 350
GTC AAA CCC AAA GAA AAT GAG CGA GAA ATC AAG CAA GAA GCC ATT AAA 1160 Val Lys Pro Lys Glu Asn Glu Arg Glu He Lys Gin Glu Ala He Lys 355 360 365 370
GAG CCA AGT GAT GGA AAT AAC GCC ACC CAA CAA GGC GAA AAA CAA AAC 1208 Glu Pro Ser Asp Gly Asn Asn Ala Thr Gin Gin Gly Glu Lys Gin Asn 375 380 385
GCT CCT AAA GAG AAC AAC GCT CAA AAA GAA GAG AAT AAA CCA AAT TCT 1256 Ala Pro Lys Glu Asn Asn Ala Gin Lys Glu Glu Asn Lys Pro Asn Ser 390 395 400
AAA GAA GAA AAA CGC CGC TTG AAA GAA GAA AAG AAA AAA GCC AAA GCC 1304 Lys Glu Glu Lys Arg Arg Leu Lys Glu Glu Lys Lys Lys Ala Lys Ala 405 410 415
GAA CAA AGA GCG AGA GAA TTT GAA CAA AGA GCG AGA GAG CAT CAA GAA 1352 Glu Gin Arg Ala Arg Glu Phe Glu Gin Arg Ala Arg Glu His Gin Glu 420 425 430
AGA GAT GAA AAA GAG CTT GAA GAG CGA AGA AAG GCG CTA GAA GCG GGT 1400 Arg Asp Glu Lys Glu Leu Glu Glu Arg Arg Lys Ala Leu Glu Ala Gly 435 440 445 450
AAA AAA TAACATGTTA GACCAACAAC ACATCCAATA CTTTAAAAAC CTAGTAGGGG GA 1458 Lys Lys
G 1459
(2) INFORMATION FOR SEQ ID NO: 268:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 452 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268:
Met Leu Arg Leu Leu He Gly Leu Leu Leu Met Ser Phe He Ser Leu
1 5 10 15
Gin Ser Ala Ser Trp Gin Glu Pro Leu Arg Val Ser He Glu Phe Val
20 25 30
Asp Leu Pro Lys Lys He He Arg Phe Pro Ala His Asp Leu Gin Val
35 40 45
Gly Glu Phe Gly Phe Val Val Thr Lys Leu Ser Asp Tyr Glu He Val
50 55 60
Asn Ser Glu Val Val He He Ala Val Glu Asn Gly Val Ala Thr Ala 65 70 75 80
Lys Phe Arg Ala Phe Glu Ser Met Lys Gin Arg His Leu Pro Thr Pro
85 90 95
Arg Met Val Ala Arg Lys Gly Asp Leu Val Tyr Phe Arg Gin Phe Asn
100 105 110
Asn Gin Ala Phe Leu He Ala Pro Asn Asp Glu Leu Tyr Glu Gin He
115 120 125
Arg Ala Thr Asn Thr Asp He Asn Phe He Ser Ser Asp Leu Leu Val
130 135 140
Thr Phe Leu Asn Gly Phe Asp Pro Lys He Ala Asn Leu Arg Lys Ala 145 150 155 160
Cys Asn Val Tyr Ser Val Gly Val He Tyr He Val Thr Thr Asn Thr
165 170 175
Leu Asn He Leu Ser Cys Glu Ser Phe Glu He Leu Glu Lys Arg Glu
180 185 190
Leu Asp Thr Ser Gly Val Thr Lys Thr Ser Thr Pro Phe Phe Ser Arg
195 200 205
Val Glu Gly He Asp Ala Gly Thr Leu Gly Lys Leu Phe Ser Gly Ser
210 215 220
Gin Ser Lys Asn Tyr Phe Ala Tyr Tyr Asp Ala Leu Val Lys Lys Glu 225 230 235 240
Lys Arg Lys Glu Val Arg He Lys Lys Arg Glu Glu Lys He Asp Ser
245 250 255
Arg Glu He Lys Arg Glu He Lys Gin Glu Ala He Lys Glu Pro Lys
260 265 270
Lys Ala Asn Gin Gly Thr Gin Asn Ala Pro Thr Leu Glu Glu Lys Asn
275 280 285
Tyr Gin Lys Ala Glu Arg Lys Leu Asp Ala Lys Glu Glu Arg Arg Tyr
290 295 300
Leu Arg Asp Glu Arg Lys Lys Ala Lys Ala Thr Lys Lys Ala Met Glu 305 310 315 320
Phe Glu Glu Arg Glu Lys Glu His Asp Glu Arg Asp Glu Gin Glu Thr
325 330 335
Glu Gly Arg Arg Lys Ala Leu Glu Met Asp Lys Gly Asp Lys Lys Glu
340 345 350
Glu Arg Val Lys Pro Lys Glu Asn Glu Arg Glu He Lys Gin Glu Ala
355 360 365
He Lys Glu Pro Ser Asp Gly Asn Asn Ala Thr Gin Gin Gly Glu Lys
370 375 380
Gin Asn Ala Pro Lys Glu Asn Asn Ala Gin Lys Glu Glu Asn Lys Pro 385 390 395 400
Asn Ser Lys Glu Glu Lys Arg Arg Leu Lys Glu Glu Lys Lys Lys Ala
405 410 415
Lys Ala Glu Gin Arg Ala Arg Glu Phe Glu Gin Arg Ala Arg Glu His 420 425 430
Gin Glu Arg Asp Glu Lys Glu Leu Glu Glu Arg Arg Lys Ala Leu Glu
435 440 445
Ala Gly Lys Lys 450
(2) INFORMATION FOR SEQ ID NO: 269:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 995 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 74...943 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:269:
GGCCTATGAC GATTGTCTCG CTTTTAGAAA ACACTCTAAT CGCTTTTGAA AAACAACAAA 60 GGAAGGGATT TTA ATG AAA TTT TTA CGC TCT GTT TAT GCA TTT TGC TCC 109 Met Lys Phe Leu Arg Ser Val Tyr Ala Phe Cys Ser 1 5 10
AGT TGG GTA GGG ACG ATT GTT ATT GTG CTG TTG GTT ATC TTT TTT ATC 157 Ser Trp Val Gly Thr He Val He Val Leu Leu Val He Phe Phe He 15 20 25
GCG CAA GCC TTT ATC ATT CCC TCT CGC TCT ATG GTT GGC ACG CTC TAT 205 Ala Gin Ala Phe He He Pro Ser Arg Ser Met Val Gly Thr Leu Tyr 30 35 40
GAG GGC GAC ATG CTC TTT GTC AAA AAG TTT TCT TAC GGC ATA CCC ATT 253 Glu Gly Asp Met Leu Phe Val Lys Lys Phe Ser Tyr Gly He Pro He 45 50 55 60
CCT AAA ATC CCA TGG ATT GAG CTT CCT GTT ATG CCT GAT TTT AAA AAT 301 Pro Lys He Pro Trp He Glu Leu Pro Val Met Pro Asp Phe Lys Asn 65 70 75
AAC GGA CAT TTG ATA GAG GGG GAT CGC CCT AAG CGT GGC GAA GTG GTG 349 Asn Gly His Leu He Glu Gly Asp Arg Pro Lys Arg Gly Glu Val Val 80 85 90
GTG TTT ATC CCT CCC CAT GAA AAA AAG TCT TAC TAT GTT AAA AGG AAT 397 Val Phe He Pro Pro His Glu Lys Lys Ser Tyr Tyr Val Lys Arg Asn 95 100 105 TTT GCC ATT GGA GGC GAT GAG GTG TTG TTC ACT AAT GAG GGT TTT TAT 445 Phe Ala He Gly Gly Asp Glu Val Leu Phe Thr Asn Glu Gly Phe Tyr 110 115 120
TTG CAC CCT TTT GAG AGC GAC ACG GAC AAA AAT TAC ATC GCT AAA CAT 493 Leu His Pro Phe Glu Ser Asp Thr Asp Lys Asn Tyr He Ala Lys His 125 130 135 140
TAC CCT AAC GCC ATG ACA AAA GAA TTT ATG GGT AAA ATT TTT GTT TTA 541 Tyr Pro Asn Ala Met Thr Lys Glu Phe Met Gly Lys He Phe Val Leu 145 150 155
AAC CCT TAT AAA AAT GAG CAT CCG GGT ATC CAT TAC CAA AAA GAC AAT 589 Asn Pro Tyr Lys Asn Glu His Pro Gly He His Tyr Gin Lys Asp Asn 160 165 170
GAA ACC TTC CAC TTA ATG GAG CAA TTA GCC ACT CAA GGC GCA GAA GCT 637 Glu Thr Phe His Leu Met Glu Gin Leu Ala Thr Gin Gly Ala Glu Ala 175 180 185
AAT ATC AGC ATG CAA CTC ATT CAA ATG GAG GGC GAA AAG GTG TTT TAT 685 Asn He Ser Met Gin Leu He Gin Met Glu Gly Glu Lys Val Phe Tyr 190 195 200
AAG AAA ATC AAT GAC GAT GAA TTT TTC ATG ATC GGC GAC AAC AGA GAC 733 Lys Lys He Asn Asp Asp Glu Phe Phe Met He Gly Asp Asn Arg Asp 205 210 215 220
AAT TCT AGC GAC TCG CGC TTT TGG GGG AGT GTG GCT TAT AAA AAC ATC 781 Asn Ser Ser Asp Ser Arg Phe Trp Gly Ser Val Ala Tyr Lys Asn He 225 230 235
GTG GGT TCG CCA TGG TTT GTT TAT TTC AGT TTG AGT TTA AAA AAT AGC 829 Val Gly Ser Pro Trp Phe Val Tyr Phe Ser Leu Ser Leu Lys Asn Ser 240 245 250
CTA GAA ATG GAT GCA GAA AAT AAC CCT AAA AAA CGC TAT CTG GTG CGT 877 Leu Glu Met Asp Ala Glu Asn Asn Pro Lys Lys Arg Tyr Leu Val Arg 255 260 265
TGG GAA CGC ATG TTT AAA AGC GTT GGA GGC TTA GAA AAA ATC ATT AAA 925 Trp Glu Arg Met Phe Lys Ser Val Gly Gly Leu Glu Lys He He Lys 270 275 280
AAA GAA AAC GCA ACG CAT TAAGGTTTTT TGTGCAATTT TTTGATTTCT CTTTAGAA 981 Lys Glu Asn Ala Thr His 285 290
AGTTTTATTA CCAC 995
(2) INFORMATION FOR SEQ ID NO: 270:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 290 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270:
Met Lys Phe Leu Arg Ser Val Tyr Ala Phe Cys Ser Ser Trp Val Gly
1 5 10 15
Thr He Val He Val Leu Leu Val He Phe Phe He Ala Gin Ala Phe
20 25 30
He He Pro Ser Arg Ser Met Val Gly Thr Leu Tyr Glu Gly Asp Met
35 40 45
Leu Phe Val Lys Lys Phe Ser Tyr Gly He Pro He Pro Lys He Pro
50 55 60
Trp He Glu Leu Pro Val Met Pro Asp Phe Lys Asn Asn Gly His Leu 65 70 75 80
He Glu Gly Asp Arg Pro Lys Arg Gly Glu Val Val Val Phe He Pro
85 90 95
Pro His Glu Lys Lys Ser Tyr Tyr Val Lys Arg Asn Phe Ala He Gly
100 105 110
Gly Asp Glu Val Leu Phe Thr Asn Glu Gly Phe Tyr Leu His Pro Phe
115 120 125
Glu Ser Asp Thr Asp Lys Asn Tyr He Ala Lys His Tyr Pro Asn Ala
130 135 140
Met Thr Lys Glu Phe Met Gly Lys He Phe Val Leu Asn Pro Tyr Lys 145 150 155 160
Asn Glu His Pro Gly He His Tyr Gin Lys Asp Asn Glu Thr Phe His
165 170 175
Leu Met Glu Gin Leu Ala Thr Gin Gly Ala Glu Ala Asn He Ser Met
180 185 190
Gin Leu He Gin Met Glu Gly Glu Lys Val Phe Tyr Lys Lys He Asn
195 200 205
Asp Asp Glu Phe Phe Met He Gly Asp Asn Arg Asp Asn Ser Ser Asp
210 215 220
Ser Arg Phe Trp Gly Ser Val Ala Tyr Lys Asn He Val Gly Ser Pro 225 230 235 240
Trp Phe Val Tyr Phe Ser Leu Ser Leu Lys Asn Ser Leu Glu Met Asp
245 250 255
Ala Glu Asn Asn Pro Lys Lys Arg Tyr Leu Val Arg Trp Glu Arg Met
260 265 270
Phe Lys Ser Val Gly Gly Leu Glu Lys He He Lys Lys Glu Asn Ala
275 280 285
Thr His 290
(2) INFORMATION FOR SEQ ID NO: 271:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2473 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 548...2419 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271:
GATACAATTC CAAATTTAAA AAACAAACGA TTTAATTCAA ATTTAAGGAA AAATTTTGAT 60
TACAGTGGTT AAACGAAACG GGCGCATTGA GCCTTTGGAC ATTACCAAAA TCCAAAAATA 120
CACTAAGGAC GCTACGGACA ATTTAGAGGG CGTGAGCCAA AGTGAGCTGG AAGTGGATGC 180
GAGGTTGCAA TTCAGGGACA AGATCACTAC TGAAGAAATC CAACAAACTT TGATTAAAAC 240
CGCTGTGGAT AAGATAGATA TTGACACGCC TAATTGGAGT TTTGTCGCCT CAAGGCTTTT 300
TTTGTATGAT TTATACCATA AAGTAAGTGG TTTTACAGGG TATAGGCATT TGAAAGAGTA 360
TTTTGAAAAC GCTGAAGAAA AGGGCCGCAT CCTTAAGGGC TTTAAGGAAA AATTTGATCT 420
AGAGTTTTTA AATAGCCAGA TCAAGCCTGA AAGGGATTTC CAATTCAATT ATTTAGGGAT 480
TAAAACCTTG TATGATCGCT ATTTGTTAAA AGACGCTAAC AACAACCCTA TTGAATTGCC 540
CCAACAC ATG TTT ATG AGC ATT GCG ATG TTT TTA GCA CAA AAC GAA CAA 589 Met Phe Met Ser He Ala Met Phe Leu Ala Gin Asn Glu Gin 1 5 10
GAA CCC AAT AAA ATC GCC TTA GAA TTT TAT GAA GTT TTG AGC AAG TTT 637 Glu Pro Asn Lys He Ala Leu Glu Phe Tyr Glu Val Leu Ser Lys Phe 15 20 25 30
GAA GCG ATG TGC GCG ACC CCC ACT CTA GCG AAC GCC CGC ACC ACC AAA 685 Glu Ala Met Cys Ala Thr Pro Thr Leu Ala Asn Ala Arg Thr Thr Lys 35 40 45
CAC CAG CTC AGC TCA TGC TAT ATT GGC AGC ACG CCG GAT AAT ATT GAG 733 His Gin Leu Ser Ser Cys Tyr He Gly Ser Thr Pro Asp Asn He Glu 50 55 60
GGG ATT TTT GAC AGC TAT AAG GAA ATG GCG CTG TTG TCC AAA TAC GGC 781 Gly He Phe Asp Ser Tyr Lys Glu Met Ala Leu Leu Ser Lys Tyr Gly 65 70 75
GGA GGG ATT GGC TGG GAT TTT TCT TTG GTG CGC TCT ATT GGG AGT TAT 829 Gly Gly He Gly Trp Asp Phe Ser Leu Val Arg Ser He Gly Ser Tyr 80 85 90
ATT GAT GGG CAT AAA AAT GCG AGC GCT GGC ACG ATC CCT TTT TTA AAA 877 He Asp Gly His Lys Asn Ala Ser Ala Gly Thr He Pro Phe Leu Lys 95 100 105 110
ATC GCT AAC GAT GTG GCG ATT GCG GTG GAT CAA TTA GGC ACA CGA AAG 925 He Ala Asn Asp Val Ala He Ala Val Asp Gin Leu Gly Thr Arg Lys 115 120 125
GGC GCG ATT GCG GTG TAT TTG GAA ATT TGG CAC ATT GAT GTG ATG GAG 973 Gly Ala He Ala Val Tyr Leu Glu He Trp His He Asp Val Met Glu 130 135 140
TTC ATT GAT TTA AGG AAA AAT AGC GGC GAT GAA AGG CGA AGA GCG CAT 1021 Phe He Asp Leu Arg Lys Asn Ser Gly Asp Glu Arg Arg Arg Ala His 145 150 155
GAT TTA TTC CCG GCT CTT TGG GTG TGC GAT TTG TTT TTG AAA AGG GTT 1069 Asp Leu Phe Pro Ala Leu Trp Val Cys Asp Leu Phe Leu Lys Arg Val 160 165 170
TTA GAA GAT GCG ATG TGG ACT TTA TTT GAC CCT TAT GAG TGT AAG GAT 1117 Leu Glu Asp Ala Met Trp Thr Leu Phe Asp Pro Tyr Glu Cys Lys Asp 175 180 185 190
TTG ACT GAG CTT TAT GGG CAG GAT TTT GAA AAA CGC TAT TTA GAG TAT 1165 Leu Thr Glu Leu Tyr Gly Gin Asp Phe Glu Lys Arg Tyr Leu Glu Tyr 195 200 205
GAA AAA GAT CCC AAG ATC ATT AAG GAA TAC ATT AAC GCT AAA GAT TTA 1213 Glu Lys Asp Pro Lys He He Lys Glu Tyr He Asn Ala Lys Asp Leu 210 215 220
TGG AAA AAA ATC TTA ATG AAT TAT TTT GAA GCC GGT TTG CCT TTC TTA 1261 Trp Lys Lys He Leu Met Asn Tyr Phe Glu Ala Gly Leu Pro Phe Leu 225 230 235
GCC TTT AAA GAT AAC GCC AAT CGG TGC AAC CCA AAC GCT CAT GCA GGA 1309 Ala Phe Lys Asp Asn Ala Asn Arg Cys Asn Pro Asn Ala His Ala Gly 240 245 250
ATC ATT CGA TCC AGC AAT CTA TGC ACG GAG ATT TTC CAA AAT ACC GCG 1357 He He Arg Ser Ser Asn Leu Cys Thr Glu He Phe Gin Asn Thr Ala 255 260 265 270
CCT AAC CAC TAC TAC ATG CAA ATA GAA TAC ACC GAC GGC ACC ATA GAG 1405 Pro Asn His Tyr Tyr Met Gin He Glu Tyr Thr Asp Gly Thr He Glu 275 280 285
TTT TTT GAA GAA AAA GAG TTG GTA ACG ACA GAT AGT AAT ATC ACT AAA 1453 Phe Phe Glu Glu Lys Glu Leu Val Thr Thr Asp Ser Asn He Thr Lys 290 295 300
TGC GCT AAC AAG CTC ACT AGC ACC GAT ATT CTA AAG GGC AAG CCA ATC 1501 Cys Ala Asn Lys Leu Thr Ser Thr Asp He Leu Lys Gly Lys Pro He 305 310 315
TAT ATC GCT ACT AAA GTC GCT AAA GAC GGG CAA ACG GCG GTG TGC AAT 1549 Tyr He Ala Thr Lys Val Ala Lys Asp Gly Gin Thr Ala Val Cys Asn 320 325 330
CTG GCG AGC ATC AAT TTA AGC AAA ATC AAC ACT GAA GAA GAC ATT AAA 1597 Leu Ala Ser He Asn Leu Ser Lys He Asn Thr Glu Glu Asp He Lys 335 340 345 350
AGG GTT GTG CCG ATC ATG GTC AGG CTT TTA GAC AAT GTG ATT GAT TTG 1645 Arg Val Val Pro He Met Val Arg Leu Leu Asp Asn Val He Asp Leu 355 360 365
AAT TTC TAC CCT AAC CGC AAA GTC AAA GCC ACT AAT TTA CAA AAT AGG 1693 Asn Phe Tyr Pro Asn Arg Lys Val Lys Ala Thr Asn Leu Gin Asn Arg 370 375 380
GCC ATA GGG TTA GGG GTT ATG GGT GAA GCG CAA ATG CTC GCA GAA CAC 1741 Ala He Gly Leu Gly Val Met Gly Glu Ala Gin Met Leu Ala Glu His 385 390 395
CAA ATC GCT TGG GGG TCT AAA GAG CAT TTA GAA AAA ATT GAC GCT TTA 1789 Gin He Ala Trp Gly Ser Lys Glu His Leu Glu Lys He Asp Ala Leu 400 405 410
ATG GAG CAA ATC AGC TAC CAT GCG ATT GAC ACG AGC GCG AAT TTA GCG 1837 Met Glu Gin He Ser Tyr His Ala He Asp Thr Ser Ala Asn Leu Ala 415 420 425 430
AAA GAA AAA GGG GTT TAT AAG GAT TTT GAA AAT TCA GAA TGG AGT AAG 1885 Lys Glu Lys Gly Val Tyr Lys Asp Phe Glu Asn Ser Glu Trp Ser Lys 435 440 445
GGG ATT TTC CCC ATT GAT AAA GCC AAT AAT GAA GCC TTA AAG CTC ACC 1933 Gly He Phe Pro He Asp Lys Ala Asn Asn Glu Ala Leu Lys Leu Thr 450 455 460
GAA AAA GGG CTT TTT AAT CAC GCT TGC GAT TGG CAA GGT TTG AGG GAA 1981 Glu Lys Gly Leu Phe Asn His Ala Cys Asp Trp Gin Gly Leu Arg Glu 465 470 475
AAA GTC AAA GCC AAT GGC ATG CGT AAT GGC TAT TTA ATG GCG ATC GCT 2029 Lys Val Lys Ala Asn Gly Met Arg Asn Gly Tyr Leu Met Ala He Ala 480 485 490
CCC ACA AGC TCC ATT TCT ATT TTA GTA GGC ACA ACC CAA ACG ATT GAA 2077 Pro Thr Ser Ser He Ser He Leu Val Gly Thr Thr Gin Thr He Glu 495 500 505 510
CCC ATT TAT AAG AAA AAA TGG TTT GAA GAA AAT TTG AGC GGG CTT ATT 2125 Pro He Tyr Lys Lys Lys Trp Phe Glu Glu Asn Leu Ser Gly Leu He 515 520 525
CCT GTT GTG GTG CCT AAT TTG AAT GTA GAA ACC TGG AAT TTT TAC ACA 2173 Pro Val Val Val Pro Asn Leu Asn Val Glu Thr Trp Asn Phe Tyr Thr 530 535 540
TCA GCC TAT GAT ATT GAC GCT AAA GAT TTG ATT AAA GCA GCG GCC GTG 2221 Ser Ala Tyr Asp He Asp Ala Lys Asp Leu He Lys Ala Ala Ala Val 545 550 555
CGC CAA AAG TGG ATT GAT CAA GGC CAA AGC CTT AAT GTG TTT TTA CGC 2269 Arg Gin Lys Trp He Asp Gin Gly Gin Ser Leu Asn Val Phe Leu Arg 560 565 570 ATA GAA AAC GCC AGC GGT AAA ACC TTG CAT GAC ATC TAC ACG CTC GCT 2317 He Glu Asn Ala Ser Gly Lys Thr Leu His Asp He Tyr Thr Leu Ala 575 580 585 590
TGG AAA TTA GGA CTC AAA TCC ACT TAT TAT TTG CGC AGC GAA AGC CCT 2365 Trp Lys Leu Gly Leu Lys Ser Thr Tyr Tyr Leu Arg Ser Glu Ser Pro 595 600 605
AGC ATA GAT GAA AAA AGC GTG TTG GAT CGA TCG GTG GAG TGT TTT AAT 2413 Ser He Asp Glu Lys Ser Val Leu Asp Arg Ser Val Glu Cys Phe Asn 610 615 620
TGC CAA TAATATAAGC TTAAATAAGC TAATCTTTGC TAAAATGAGA TTTAAAATTA TT 2471 Cys Gin
TA 2473
(2) INFORMATION FOR SEQ ID NO:272:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 624 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:272:
Met Phe Met Ser He Ala Met Phe Leu Ala Gin Asn Glu Gin Glu Pro
1 5 10 15
Asn Lys He Ala Leu Glu Phe Tyr Glu Val Leu Ser Lys Phe Glu Ala
20 25 30
Met Cys Ala Thr Pro Thr Leu Ala Asn Ala Arg Thr Thr Lys His Gin
35 40 45
Leu Ser Ser Cys Tyr He Gly Ser Thr Pro Asp Asn He Glu Gly He
50 55 60
Phe Asp Ser Tyr Lys Glu Met Ala Leu Leu Ser Lys Tyr Gly Gly Gly 65 70 75 80
He Gly Trp Asp Phe Ser Leu Val Arg Ser He Gly Ser Tyr He Asp
85 90 95
Gly His Lys Asn Ala Ser Ala Gly Thr He Pro Phe Leu Lys He Ala
100 105 110
Asn Asp Val Ala He Ala Val Asp Gin Leu Gly Thr Arg Lys Gly Ala
115 120 125
He Ala Val Tyr Leu Glu He Trp His He Asp Val Met Glu Phe He
130 135 140
Asp Leu Arg Lys Asn Ser Gly Asp Glu Arg Arg Arg Ala His Asp Leu 145 150 155 160
Phe Pro Ala Leu Trp Val Cys Asp Leu Phe Leu Lys Arg Val Leu Glu
165 170 175
Asp Ala Met Trp Thr Leu Phe Asp Pro Tyr Glu Cys Lys Asp Leu Thr 180 185 190 Glu Leu Tyr Gly Gin Asp Phe Glu Lys Arg Tyr Leu Glu Tyr Glu Lys
195 200 205
Asp Pro Lys He He Lys Glu Tyr He Asn Ala Lys Asp Leu Trp Lys
210 215 220
Lys He Leu Met Asn Tyr Phe Glu Ala Gly Leu Pro Phe Leu Ala Phe 225 230 235 240
Lys Asp Asn Ala Asn Arg Cys Asn Pro Asn Ala His Ala Gly He He
245 250 255
Arg Ser Ser Asn Leu Cys Thr Glu He Phe Gin Asn Thr Ala Pro Asn
260 265 270
His Tyr Tyr Met Gin He Glu Tyr Thr Asp Gly Thr He Glu Phe Phe
275 280 285
Glu Glu Lys Glu Leu Val Thr Thr Asp Ser Asn He Thr Lys Cys Ala
290 295 300
Asn Lys Leu Thr Ser Thr Asp He Leu Lys Gly Lys Pro He Tyr He 305 310 315 320
Ala Thr Lys Val Ala Lys Asp Gly Gin Thr Ala Val Cys Asn Leu Ala
325 330 335
Ser He Asn Leu Ser Lys He Asn Thr Glu Glu Asp He Lys Arg Val
340 345 350
Val Pro He Met Val Arg Leu Leu Asp Asn Val He Asp Leu Asn Phe
355 360 365
Tyr Pro Asn Arg Lys Val Lys Ala Thr Asn Leu Gin Asn Arg Ala He
370 375 380
Gly Leu Gly Val Met Gly Glu Ala Gin Met Leu Ala Glu His Gin He 385 390 395 400
Ala Trp Gly Ser Lys Glu His Leu Glu Lys He Asp Ala Leu Met Glu
405 410 415
Gin He Ser Tyr His Ala He Asp Thr Ser Ala Asn Leu Ala Lys Glu
420 425 430
Lys Gly Val Tyr Lys Asp Phe Glu Asn Ser Glu Trp Ser Lys Gly He
435 440 445
Phe Pro He Asp Lys Ala Asn Asn Glu Ala Leu Lys Leu Thr Glu Lys
450 455 460
Gly Leu Phe Asn His Ala Cys Asp Trp Gin Gly Leu Arg Glu Lys Val 465 470 475 480
Lys Ala Asn Gly Met Arg Asn Gly Tyr Leu Met Ala He Ala Pro Thr
485 490 495
Ser Ser He Ser He Leu Val Gly Thr Thr Gin Thr He Glu Pro He
500 505 510
Tyr Lys Lys Lys Trp Phe Glu Glu Asn Leu Ser Gly Leu He Pro Val
515 520 525
Val Val Pro Asn Leu Asn Val Glu Thr Trp Asn Phe Tyr Thr Ser Ala
530 535 540
Tyr Asp He Asp Ala Lys Asp Leu He Lys Ala Ala Ala Val Arg Gin 545 550 555 560
Lys Trp He Asp Gin Gly Gin Ser Leu Asn Val Phe Leu Arg He Glu
565 570 575
Asn Ala Ser Gly Lys Thr Leu His Asp He Tyr Thr Leu Ala Trp Lys
580 585 590
Leu Gly Leu Lys Ser Thr Tyr Tyr Leu Arg Ser Glu Ser Pro Ser He
595 600 605
Asp Glu Lys Ser Val Leu Asp Arg Ser Val Glu Cys Phe Asn Cys Gin 610 615 620 (2) INFORMATION FOR SEQ ID NO: 273:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1440 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 56...1390 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:273:
GCAAAATTCT AGCCTTAAAT CTTTGATGAA ACGAAGTCAA ATTATAAGAT AAGGC ATG 58
Met 1
TTA AAA TTC CCT AAA ATG AGT TTA AGG ATT TTA ATG CTT TCT GTC ATC 106 Leu Lys Phe Pro Lys Met Ser Leu Arg He Leu Met Leu Ser Val He 5 10 15
ATA CTG GCC GCT GGT AAA GGC ACT CGC ATG CGT TCT AGC CTG CCT AAA 154 He Leu Ala Ala Gly Lys Gly Thr Arg Met Arg Ser Ser Leu Pro Lys 20 25 30
ACT TTA CAC ACC ATT TGT GGG GAG CCT ATG TTG TTT TAC ATT TTA GAA 202 Thr Leu His Thr He Cys Gly Glu Pro Met Leu Phe Tyr He Leu Glu 35 40 45
ACG GCT TTT TCA ATC AGC GAT GAT GTG CAT CTT ATC TTA CAC CAC CAA 250 Thr Ala Phe Ser He Ser Asp Asp Val His Leu He Leu His His Gin -50 55 60 65
CAA GAA CGC ATT AAA GAA GCG GTG TTG GAG CGT TTT AAG GGC GTC ATT 298 Gin Glu Arg He Lys Glu Ala Val Leu Glu Arg Phe Lys Gly Val He 70 75 80
TTT CAC ACT CAA ATT GTG GAA AAA TAT TCA GGG ACA GGT GGG GCT ATC 346 Phe His Thr Gin He Val Glu Lys Tyr Ser Gly Thr Gly Gly Ala He 85 90 95
ATG CAA AAA GAT AAA ACG CCT ATT TCT ACG AAA CAT GAG CGG GTT TTG 394 Met Gin Lys Asp Lys Thr Pro He Ser Thr Lys His Glu Arg Val Leu 100 105 110
ATT TTG AAT GCG GAC ATG CCT TTA ATC ACT AAA GAC GCT CTC GCC CCC 442 He Leu Asn Ala Asp Met Pro Leu He Thr Lys Asp Ala Leu Ala Pro 115 120 125 TTA TTA GAA AGC AAG AAT AAC GCT ATA GGC TTA CTC CAT TTA GCT GAC 490 Leu Leu Glu Ser Lys Asn Asn Ala He Gly Leu Leu His Leu Ala Asp 130 135 140 145
CCT AAA GGT TAT GGG CGC GTT GTT TTA GAA AAC CAT CAG GTT AAA AAG 538 Pro Lys Gly Tyr Gly Arg Val Val Leu Glu Asn His Gin Val Lys Lys 150 155 160
ATT GTA GAA GAA AAG GAC GCT AAT GAT GAA GAA AAA GAA ATT AAA AGC 586 He Val Glu Glu Lys Asp Ala Asn Asp Glu Glu Lys Glu He Lys Ser 165 170 175
GTG AAT GCT GGC GTG TAT GGG TTT GAA AGG GAT TTT TTA GAA AAA TAC 634 Val Asn Ala Gly Val Tyr Gly Phe Glu Arg Asp Phe Leu Glu Lys Tyr 180 185 190
TTA CCC AAG CTC CAT GAC CAA AAC GCC CAA AAA GAA TAC TAC CTC ACG 682 Leu Pro Lys Leu His Asp Gin Asn Ala Gin Lys Glu Tyr Tyr Leu Thr 195 200 205
GAT TTA ATC GCT CTA GGG ATC AAT GAA AAC GAA ACA ATT GAC GCT ATT 730 Asp Leu He Ala Leu Gly He Asn Glu Asn Glu Thr He Asp Ala He 210 215 220 225
TTC TTA AAA GAA GAG TGT TTT TTA GGG GTG AAT AGC CAA ACA GAA AGG 778 Phe Leu Lys Glu Glu Cys Phe Leu Gly Val Asn Ser Gin Thr Glu Arg 230 235 240
GCG AAA GCT GAA GAA ATC ATG CTA GAA AGA CTG CGC AAA AAC GCC ATG 826 Ala Lys Ala Glu Glu He Met Leu Glu Arg Leu Arg Lys Asn Ala Met 245 250 255
GAC TTG GGG GTA GTG ATG CAA TTG CCT AAT AGC ATT TAT TTA GAA AAA 874 Asp Leu Gly Val Val Met Gin Leu Pro Asn Ser He Tyr Leu Glu Lys 260 265 270
GGC GTG AGT TTT AAG GGG GAG TGC GTT TTA GAG CAA GGG GTG CGT TTG 922 Gly Val Ser Phe Lys Gly Glu Cys Val Leu Glu Gin Gly Val Arg Leu 275 280 285
ATT GGG AAT TGT TTG ATA GAA AAC GCG CAT ATT AAG GCT TAT AGC GTG 970 He Gly Asn Cys Leu He Glu Asn Ala His He Lys Ala Tyr Ser Val 290 295 300 305
ATA GAA GAG AGC CAG ATT GTT AAT AGC AGT GTG GGG CCG TTT GCC CAT 1018 He Glu Glu Ser Gin He Val Asn Ser Ser Val Gly Pro Phe Ala His 310 315 320
GCG CGC CCT AAA AGC GTG ATT TGT AAT AGC CAT GTG GGG AAT TTT GTA 1066 Ala Arg Pro Lys Ser Val He Cys Asn Ser His Val Gly Asn Phe Val 325 330 335
GAG ACT AAA AAC GCT AAA CTT CAA GGC ACT AAA GCA GGG CAT TTG AGC 1114 Glu Thr Lys Asn Ala Lys Leu Gin Gly Thr Lys Ala Gly His Leu Ser 340 345 350 TAT TTA GGG GAT TGT GAG ATA GGG AAA AAC ACA AAT GTA GGG GCT GGC 1162 Tyr Leu Gly Asp Cys Glu He Gly Lys Asn Thr Asn Val Gly Ala Gly 355 360 365
GTG ATC ACT TGC AAT TAC GAT GGT AAA AAG AAA CAC CAA ACA ATC ATC 1210 Val He Thr Cys Asn Tyr Asp Gly Lys Lys Lys His Gin Thr He He 370 375 380 385
GGT GAA AAT GTC TTT ATA GGG AGC GAT AGC CAG CTA GTC GCC CCC ATA 1258 Gly Glu Asn Val Phe He Gly Ser Asp Ser Gin Leu Val Ala Pro He 390 395 400
AAT ATC GGC TCT AAT GTC TTA ATC GGC AGC GGC ACC ACT ATC ACT AAA 1306 Asn He Gly Ser Asn Val Leu He Gly Ser Gly Thr Thr He Thr Lys 405 410 415
GAC ATT CCT AGC GGT TCG TTG AGC CTT TCA CGC GCC CCT CAA ACC AAC 1354 Asp He Pro Ser Gly Ser Leu Ser Leu Ser Arg Ala Pro Gin Thr Asn 420 425 430
ATT GAA AAC GGG TAT TTT AAG TTT TTT AAG AAA CCT TAATTTGTTT GAATAA 1406 He Glu Asn Gly Tyr Phe Lys Phe Phe Lys Lys Pro 435 440 445
TGAAAAATCC TAAAATATTA ATCATTTACT TTAA 1440
(2) INFORMATION FOR SEQ ID NO: 274:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 445 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:274:
Met Leu Lys Phe Pro Lys Met Ser Leu Arg He Leu Met Leu Ser Val
1 5 10 15
He He Leu Ala Ala Gly Lys Gly Thr Arg Met Arg Ser Ser Leu Pro
20 25 30
Lys Thr Leu His Thr He Cys Gly Glu Pro Met Leu Phe Tyr He Leu
35 40 45
Glu Thr Ala Phe Ser He Ser Asp Asp Val His Leu He Leu His His
50 55 60
Gin Gin Glu Arg He Lys Glu Ala Val Leu Glu Arg Phe Lys Gly Val 65 70 75 80
He Phe His Thr Gin He Val Glu Lys Tyr Ser Gly Thr Gly Gly Ala
85 90 95
He Met Gin Lys Asp Lys Thr Pro He Ser Thr Lys His Glu Arg Val
100 105 110
Leu He Leu Asn Ala Asp Met Pro Leu He Thr Lys Asp Ala Leu Ala 115 120 125 Pro Leu Leu Glu Ser Lys Asn Asn Ala He Gly Leu Leu His Leu Ala
130 135 140
Asp Pro Lys Gly Tyr Gly Arg Val Val Leu Glu Asn His Gin Val Lys 145 150 155 160
Lys He Val Glu Glu Lys Asp Ala Asn Asp Glu Glu Lys Glu He Lys
165 170 175
Ser Val Asn Ala Gly Val Tyr Gly Phe Glu Arg Asp Phe Leu Glu Lys
180 185 190
Tyr Leu Pro Lys Leu His Asp Gin Asn Ala Gin Lys Glu Tyr Tyr Leu
195 200 205
Thr Asp Leu He Ala Leu Gly He Asn Glu Asn Glu Thr He Asp Ala
210 215 220
He Phe Leu Lys Glu Glu Cys Phe Leu Gly Val Asn Ser Gin Thr Glu 225 230 235 240
Arg Ala Lys Ala Glu Glu He Met Leu Glu Arg Leu Arg Lys Asn Ala
245 250 255
Met Asp Leu Gly Val Val Met Gin Leu Pro Asn Ser He Tyr Leu Glu
260 265 270
Lys Gly Val Ser Phe Lys Gly Glu Cys Val Leu Glu Gin Gly Val Arg
275 280 285
Leu He Gly Asn Cys Leu He Glu Asn Ala His He Lys Ala Tyr Ser
290 295 300
Val He Glu Glu Ser Gin He Val Asn Ser Ser Val Gly Pro Phe Ala 305 310 315 320
His Ala Arg Pro Lys Ser Val He Cys Asn Ser His Val Gly Asn Phe
325 330 335
Val Glu Thr Lys Asn Ala Lys Leu Gin Gly Thr Lys Ala Gly His Leu
340 345 350
Ser Tyr Leu Gly Asp Cys Glu He Gly Lys Asn Thr Asn Val Gly Ala
355 360 365
Gly Val He Thr Cys Asn Tyr Asp Gly Lys Lys Lys His Gin Thr He
370 375 380
He Gly Glu Asn Val Phe He Gly Ser Asp Ser Gin Leu Val Ala Pro 385 390 395 400
He Asn He Gly Ser Asn Val Leu He Gly Ser Gly Thr Thr He Thr
405 410 415
Lys Asp He Pro Ser Gly Ser Leu Ser Leu Ser Arg Ala Pro Gin Thr
420 425 430
Asn He Glu Asn Gly Tyr Phe Lys Phe Phe Lys Lys Pro 435 440 445
(2) INFORMATION FOR SEQ ID NO:275:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 771 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 227...715 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:275:
GCAAAGATAA AAAACAAACG GGTTTTGGTG AAATTTTCTG GGGAAGCGTT AGCTGGNGGA 60
CAACCAGTTT GGGATTGACA TTCATGTGTT AGATCACATC GCTAAAGAGA TCAAAAGTTT 120
AGTGGAAAAC GATATTGAAG TGGGTATTGT GATTGGTGGA GGCAATATTA TTAGGGGGGT 180
TAGCGCGGCT CAAGGGGGGA TTATTAGGCG CACCAGTGGG GATTAT ATG GGC ATG 235
Met Gly Met 1
TTA GCC ACC GTG ATT AAT GCG GTA GCG ATG CAA GAA GCT TTA GAG CAT 283 Leu Ala Thr Val He Asn Ala Val Ala Met Gin Glu Ala Leu Glu His 5 10 15
ATC GGC TTA GAC ACA AGG GTG CAG AGC GCG ATT GAA ATC AAA GAG ATT 331 He Gly Leu Asp Thr Arg Val Gin Ser Ala He Glu He Lys Glu He 20 25 30 35
TGT GAA AGT TAC ATT TAC AGA AAA GCG ATC AGG CAT TTA GAA AAG GGT 379 Cys Glu Ser Tyr He Tyr Arg Lys Ala He Arg His Leu Glu Lys Gly 40 45 50
AGG GTG GTG ATT TTT GGC GCA GGC ACG GGA AAC CCG TTT TTC ACT ACG 427 Arg Val Val He Phe Gly Ala Gly Thr Gly Asn Pro Phe Phe Thr Thr 55 60 65
GAT ACG GCT GCC ACT TTA AGA GCG ATT GAA ATT GGA TCG GAT TTA ATC 475 Asp Thr Ala Ala Thr Leu Arg Ala He Glu He Gly Ser Asp Leu He 70 75 80
ATT AAA GCG ACT AAA GTG GAT GGC ATT TAC GAC AAA GAT CCT AAC AAG 523 He Lys Ala Thr Lys Val Asp Gly He Tyr Asp Lys Asp Pro Asn Lys 85 90 95
TTT AAA GAC GCT AAA AAA TTA GAC ACT TTA AGC TAT AAC GAT GCC TTG 571 Phe Lys Asp Ala Lys Lys Leu Asp Thr Leu Ser Tyr Asn Asp Ala Leu 100 105 110 115
ATA GGG GAT ATT GAA GTG ATG GAC GAT ACC GCT ATT TCT TTA GCT AAA 619 He Gly Asp He Glu Val Met Asp Asp Thr Ala He Ser Leu Ala Lys 120 125 130
GAC AAT AAG CTC CCC ATT GTG GTG TGT AAC ATG TTC AAA AAA GGG AAT 667 Asp Asn Lys Leu Pro He Val Val Cys Asn Met Phe Lys Lys Gly Asn 135 140 145
TTA TTG CAA GTG ATC AAG CAC CAA CAA GGC GTA TTT TCT ATG GTA AAA T 716 Leu Leu Gin Val He Lys His Gin Gin Gly Val Phe Ser Met Val Lys 150 155 160
AAGCCCTTTA ACATTGGATA GAACTCAAAA TAAAAGGATC AGTTTGAAAA AAGAG 771 (2) INFORMATION FOR SEQ ID NO: 276:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 163 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:276:
Met Gly Met Leu Ala Thr Val He Asn Ala Val Ala Met Gin Glu Ala
1 5 10 15
Leu Glu His He Gly Leu Asp Thr Arg Val Gin Ser Ala He Glu He
20 25 30
Lys Glu He Cys Glu Ser Tyr He Tyr Arg Lys Ala He Arg His Leu
35 40 45
Glu Lys Gly Arg Val Val He Phe Gly Ala Gly Thr Gly Asn Pro Phe
50 55 60
Phe Thr Thr Asp Thr Ala Ala Thr Leu Arg Ala He Glu He Gly Ser 65 70 75 80
Asp Leu He He Lys Ala Thr Lys Val Asp Gly He Tyr Asp Lys Asp
85 90 95
Pro Asn Lys Phe Lys Asp Ala Lys Lys Leu Asp Thr Leu Ser Tyr Asn
100 105 110
Asp Ala Leu He Gly Asp He Glu Val Met Asp Asp Thr Ala He Ser
115 120 125
Leu Ala Lys Asp Asn Lys Leu Pro He Val Val Cys Asn Met Phe Lys
130 135 140
Lys Gly Asn Leu Leu Gin Val He Lys His Gin Gin Gly Val Phe Ser 145 150 155 160
Met Val Lys
(2) INFORMATION FOR SEQ ID NO:277:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 659 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 56...607 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:277: TCATGTAAAA TAAAGGGTTT TATTAAAGAT GAGAGATTGT TTTAAGGTTT GAATA ATG 58
Met 1
AGA GCT TTT TTA AAG ATT TTA ATG GTT TTG ATT TTT ATG AGC GTT GCT 106 Arg Ala Phe Leu Lys He Leu Met Val Leu He Phe Met Ser Val Ala 5 10 15
TAT GCT AAA AAT CCT TCA ACG CTT TCT AAA GAA GAA GAG GTT TTG CAG 154 Tyr Ala Lys Asn Pro Ser Thr Leu Ser Lys Glu Glu Glu Val Leu Gin 20 25 30
CAT TTG CAA AGT TTT AGC GCG CAT TTC AAG CAG GTT TTA AAA AAT GAA 202 His Leu Gin Ser Phe Ser Ala His Phe Lys Gin Val Leu Lys Asn Glu 35 40 45
AAA CCT TTA GTT TAT TAC GGG GTT TTA AAG GCT AAA GCC CCT AAT TGG 250 Lys Pro Leu Val Tyr Tyr Gly Val Leu Lys Ala Lys Ala Pro Asn Trp 50 55 60 65
GCT TTA TGG GTT TAT GAA AAG CCT TTA AAA AAA GAA ATT TAC ATG AAC 298 Ala Leu Trp Val Tyr Glu Lys Pro Leu Lys Lys Glu He Tyr Met Asn 70 75 80
GAT AAA GAA GTG GTA ATT TAT GAG CCT AAT TTG TTT CAA GCG ACC ATC 346 Asp Lys Glu Val Val He Tyr Glu Pro Asn Leu Phe Gin Ala Thr He 85 90 95
ACG CCC TTA AAA GAC AAG ACG GAT TTT TTC ACC ATT CTC AAG CGT TTA 394 Thr Pro Leu Lys Asp Lys Thr Asp Phe Phe Thr He Leu Lys Arg Leu 100 105 110
AAA AAG CAA GAT GAC GGA TCT TTT AAA ACG ACT ATC AAC AAA ACC ACT 442 Lys Lys Gin Asp Asp Gly Ser Phe Lys Thr Thr He Asn Lys Thr Thr 115 120 125
TAT CGT TTG GTT TTT AAA GAC GGC AAG CCT TTT TCA TTG GAA TTT .AAA 490 Tyr Arg Leu Val Phe Lys Asp Gly Lys Pro Phe Ser Leu Glu Phe Lys 130 135 140 145
GAT GGA ATG AAC AAT CTT GTA ACG ATC ACT TTT TCT CAA GCA GAA ATC 538 Asp Gly Met Asn Asn Leu Val Thr He Thr Phe Ser Gin Ala Glu He 150 155 160
AAC CCC ACC ATT GCT AAT GAA ATC TTT GTT TTT AAG CCT AAA GAT GAA 586 Asn Pro Thr He Ala Asn Glu He Phe Val Phe Lys Pro Lys Asp Glu 165 170 175
AAC ATT GAT ATT GTG CGC CAA TGATTTTTAA TGATTCATTG CATCTTGTTA GCAA 641 Asn He Asp He Val Arg Gin 180
AAGTTAGCTA AAATAGAC 659
(2) INFORMATION FOR SEQ ID NO: 278: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 184 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278:
Met Arg Ala Phe Leu Lys He Leu Met Val Leu He Phe Met Ser Val
1 5 10 15
Ala Tyr Ala Lys Asn Pro Ser Thr Leu Ser Lys Glu Glu Glu Val Leu
20 25 30
Gin His Leu Gin Ser Phe Ser Ala His Phe Lys Gin Val Leu Lys Asn
35 40 45
Glu Lys Pro Leu Val Tyr Tyr Gly Val Leu Lys Ala Lys Ala Pro Asn
50 55 60
Trp Ala Leu Trp Val Tyr Glu Lys Pro Leu Lys Lys Glu He Tyr Met 65 70 75 80
Asn Asp Lys Glu Val Val He Tyr Glu Pro Asn Leu Phe Gin Ala Thr
85 90 95
He Thr Pro Leu Lys Asp Lys Thr Asp Phe Phe Thr He Leu Lys Arg
100 105 110
Leu Lys Lys Gin Asp Asp Gly Ser Phe Lys Thr Thr He Asn Lys Thr
115 120 125
Thr Tyr Arg Leu Val Phe Lys Asp Gly Lys Pro Phe Ser Leu Glu Phe
130 135 140
Lys Asp Gly Met Asn Asn Leu Val Thr He Thr Phe Ser Gin Ala Glu 145 150 155 160
He Asn Pro Thr He Ala Asn Glu He Phe Val Phe Lys Pro Lys Asp
165 170 175
Glu Asn He Asp He Val Arg Gin 180
(2) INFORMATION FOR SEQ ID NO: 279:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3035 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 729...2981 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: GAAACGATCG CAGAAAGCAA TGAAAGCACG GTAGTAGCGG AATTTCATAG CAGTAATGAA 60
AAAAAAGCGC TTATGAGAGC GAAGCAGAGC TAGAAAGGGC GTTTATTAAG CTTTTAGAAA 120
AACAAGGCTA TGAATTTAAA AAAATCCACA AAGAAGAAGA ATTAAAAGAC AATTTAAAAG 180
AGCAGTTAGA AAAGCTTAAT GATCATTCTT TCACGCCTAA AGAATGGGAC ACTCTTTATT 240
CTCAATTCAT CGCTAATAAA AACGATGACT ATAAGGCTAA AACGAAAAAG ATCCAAGAAG 300
ATCCGATTTT TAATCTCACG CTAGAGAACG GGAAAACCAA AAACATTAAA ATCATTGATA 360
AGAAAAATAT CCATAGAAAC GCCTTGCAAG TGATCCACCA ATACAGCAAT AAAGGGGGGA 420
AGTATCAAAA CCGCTATGAT GTGAGTATCC TTGTGAATGG CTTGCCTTTA GTGCATGTGG 480
AATTGAAAAA AAGAGGCGTG GCGATCAGGG AGGCGTTCAA CCAGATCAAG CGCTATAAAA 540
GGGATAGTTT TAGCGCTGAA GACGGGCTTT TTGATTTTGT GCAGATTTTT GTCATCAGTA 600
ACGGCACGAG CTCTAAATAC TATTCAAACA CCACAAGAAT AGCCCAGCTG GAAAAAAACC 660
ATAAAGCCGA TACTTTTGAA TTCACGAATT ATTGGGCGGA TAGCAAGAAT CACAATATTG 720
AGGATTTA ATG GAT TTT GCT AAG GCG TTT TTT GCA AAG CGC AGC CTT TTG 770 Met Asp Phe Ala Lys Ala Phe Phe Ala Lys Arg Ser Leu Leu 1 5 10
AAC GTT TTA ACG TGC TAT TGC GTT TTC ACA AGC GAA GAG GTT TTA TTG 818 Asn Val Leu Thr Cys Tyr Cys Val Phe Thr Ser Glu Glu Val Leu Leu 15 20 25 30
GTG ATG CGG CCT TAT CAA ATC GTG GCG GCC GAA AGG ATT TTG GAA AAG 866 Val Met Arg Pro Tyr Gin He Val Ala Ala Glu Arg He Leu Glu Lys 35 40 45
ATC AAA ACC GCG CAA AAT AGT AAA ACG AAA AAT CAA AGC AAA GGC TAT 914 He Lys Thr Ala Gin Asn Ser Lys Thr Lys Asn Gin Ser Lys Gly Tyr 50 55 60
ATC TGG CAC ACG ACA GGG AGC GGT AAA ACC CTA ACG AGC TTT AAA AGC 962 He Trp His Thr Thr Gly Ser Gly Lys Thr Leu Thr Ser Phe Lys Ser 65 70 75
GCA ACG TTG GCT AAA GAA TTA GAG AGC GTT TCA AAA GTC TTG TTC GTG 1010 Ala Thr Leu Ala Lys Glu Leu Glu Ser Val Ser Lys Val Leu Phe Val 80 85 90
GTG GAC AGG AAG GAT TTG GAC TAT CAA ACC ATG AAA GAA TAC GAT AAA 1058 Val Asp Arg Lys Asp Leu Asp Tyr Gin Thr Met Lys Glu Tyr Asp Lys 95 100 105 110
TTC CAA AAA GAT TGC GCT AAT TCC AAC ACA AGC ACT AAG ATT TTA AAA 1106 Phe Gin Lys Asp Cys Ala Asn Ser Asn Thr Ser Thr Lys He Leu Lys 115 120 125
GAA CAG CTT GAA GAT TCT AAC GCT AAA ATC ATT ATC ACC ACG ATC CAA 1154 Glu Gin Leu Glu Asp Ser Asn Ala Lys He He He Thr Thr He Gin 130 135 140
AAA TTA GAC AAA TTC GTT AAA TCC CAT AAA GGG CAT GCG ATT TTT AAT 1202 Lys Leu Asp Lys Phe Val Lys Ser His Lys Gly His Ala He Phe Asn 145 150 155
GAA GAA GTT GTG ATG ATT TTT GAT GAA TGC CAC AGG AGT CAG TTA GGC 1250 Glu Glu Val Val Met He Phe Asp Glu Cys His Arg Ser Gin Leu Gly 160 165 170 TCT ATG CAT CAA GCC ATC ACT AAA GCG TTT AAA AAA TAC CAC CTT TTT 1298 Ser Met His Gin Ala He Thr Lys Ala Phe Lys Lys Tyr His Leu Phe 175 180 185 190
GGC TTT ACT GGC ACG CCC ATT TTT GCA GCT AAT TGC GAT AAA AAC AAC 1346 Gly Phe Thr Gly Thr Pro He Phe Ala Ala Asn Cys Asp Lys Asn Asn 195 200 205
CCT TTA GGC ACG ACA GAG CAA AAG TTT GGG AAA TGC CTC CAC CAA TAC 1394 Pro Leu Gly Thr Thr Glu Gin Lys Phe Gly Lys Cys Leu His Gin Tyr 210 215 220
ACC ATT ATT GAT GCG ATC AGG GAT AAA AAC GTT TTG CCC TTT AGA GTG 1442 Thr He He Asp Ala He Arg Asp Lys Asn Val Leu Pro Phe Arg Val 225 230 235
GAA TAC CAC AAC ACC ATT AAA GCT AAA GAG GAC ATT AAG GAT AAT AAG 1490 Glu Tyr His Asn Thr He Lys Ala Lys Glu Asp He Lys Asp Asn Lys 240 245 250
GTT AGA GCG GTT GAT GAA AAA AAC GCC CTT TTG GAT ACT AGG AGG ATC 1538 Val Arg Ala Val Asp Glu Lys Asn Ala Leu Leu Asp Thr Arg Arg He 255 260 265 270
AAA GAA ATC ACT AAA TGC ATT TTA GAG CGT TTC AAT CAA GCC ACT AAA 1586 Lys Glu He Thr Lys Cys He Leu Glu Arg Phe Asn Gin Ala Thr Lys 275 280 285
AAT AAA AAA TTC AAT TCC ATT CTG GCA TGC TCT AGC ATA GAA GCG CTG 1634 Asn Lys Lys Phe Asn Ser He Leu Ala Cys Ser Ser He Glu Ala Leu 290 295 300
AAA AAA TAC TAC CAA GCC TTT AAA GAA GAA AAA CAC GAT CTT AAA ATC 1682 Lys Lys Tyr Tyr Gin Ala Phe Lys Glu Glu Lys His Asp Leu Lys He 305 310 315
GCT GCC ATT TTT AGC TAT AGC GCT AAT GAG GAA ATT GAC ACG CTA GAA 1730 Ala Ala He Phe Ser Tyr Ser Ala Asn Glu Glu He Asp Thr Leu Glu 320 325 330
GAT GAA AAC AAT GAA AGC GCT TGC CGG CTA GAC AAA AGC TCA AGG GAT 1778 Asp Glu Asn Asn Glu Ser Ala Cys Arg Leu Asp Lys Ser Ser Arg Asp 335 340 345 350
TTT TTA GAG GGC GCG ATT GCG GAT TAT AAT GGG ATG TTT GGC GTT TCT 1826 Phe Leu Glu Gly Ala He Ala Asp Tyr Asn Gly Met Phe Gly Val Ser 355 360 365
TTT GAC ACT TCG GAT CAA AAA TTC CAA AGT TAT TAC AAG GAT CTT TCT 1874 Phe Asp Thr Ser Asp Gin Lys Phe Gin Ser Tyr Tyr Lys Asp Leu Ser 370 375 380
CAA AAA ATG AAA GAG CGT AAA ATC GAT CTT TTA ATG GTG GTG AAC ATG 1922 Gin Lys Met Lys Glu Arg Lys He Asp Leu Leu Met Val Val Asn Met 385 390 395 TTT TTG ACC GGG TTT GAC GCT ACA AGG CTC AAC ACC CTT TGG GTG GAT 1970 Phe Leu Thr Gly Phe Asp Ala Thr Arg Leu Asn Thr Leu Trp Val Asp 400 405 410
AAA AAT CTC AAA TAC CAT GGG CTA ATT CAA GCT TTT TCA CGC GCA AAC 2018 Lys Asn Leu Lys Tyr His Gly Leu He Gin Ala Phe Ser Arg Ala Asn 415 420 425 430
CGC ATT TTA GAT AGC GTT AAA ACG CAT GGG AAT ATC GTG TGT TTT AGG 2066 Arg He Leu Asp Ser Val Lys Thr His Gly Asn He Val Cys Phe Arg 435 440 445
GAT TTA GAA CAG GAT TTG AAT GAC GCT CTC ATG CTT TTT GGC AAC AAG 2114 Asp Leu Glu Gin Asp Leu Asn Asp Ala Leu Met Leu Phe Gly Asn Lys 450 455 460
GAC GCT CAA TCT ATT GCG CTG TTA AGA AAA TAT GAA GAT TAT TTG AAA 2162 Asp Ala Gin Ser He Ala Leu Leu Arg Lys Tyr Glu Asp Tyr Leu Lys 465 470 475
GGC TAC ACG GAT AAC AAC AAA GAA TAC GAG GGC TAT GAG GGT TTG ATT 2210 Gly Tyr Thr Asp Asn Asn Lys Glu Tyr Glu Gly Tyr Glu Gly Leu He 480 485 490
AAA AGG CTT TTA ACC GAA TTC CCA TTA AAA GAG CCA ATC GTT TCA GAA 2258 Lys Arg Leu Leu Thr Glu Phe Pro Leu Lys Glu Pro He Val Ser Glu 495 500 505 510
AGC CAG AAA AAG GAT TTT ATT AAG CTT TTT GGC AAG ATT TTG AAA TTA 2306 Ser Gin Lys Lys Asp Phe He Lys Leu Phe Gly Lys He Leu Lys Leu 515 520 525
GAA AAT ATT TTA AAC AGC TTT GAA AAT TTC AAA AAA GAC GAT TAC ATC 2354 Glu Asn He Leu Asn Ser Phe Glu Asn Phe Lys Lys Asp Asp Tyr He 530 535 540
AAT CCC AGG GAT TTT CAA GAC TAT CAA AGC AAA TAC CTT GAT TTT TAC 2402 Asn Pro Arg Asp Phe Gin Asp Tyr Gin Ser Lys Tyr Leu Asp Phe Tyr 545 550 555
GAT GCA ATG AGA TCA GAA AAA GGG AAG GAT AAA GAA GAG ATT AAT GAT 2450 Asp Ala Met Arg Ser Glu Lys Gly Lys Asp Lys Glu Glu He Asn Asp 560 565 570
GAT TTG ATT TTT GAA ATT GAA CTC ATC AAA CAA GTG GAA GTC AAT ATT 2498 Asp Leu He Phe Glu He Glu Leu He Lys Gin Val Glu Val Asn He 575 580 585 590
GAC TAT ATT TTG AAT TTG ATT GAA GAG TTC GCT AAA GAG CAT GGG GTG 2546 Asp Tyr He Leu Asn Leu He Glu Glu Phe Ala Lys Glu His Gly Val 595 600 605
GAA ATC CAA GGC GTT AAA ACC AAA ATA GAG CCA ATC ATC AAC TCC AGC 2594 Glu He Gin Gly Val Lys Thr Lys He Glu Pro He He Asn Ser Ser 610 615 620 ATA GAG TTA AGG AAT AAA AAA GAT TTG ATC ATG GAT TTC ATT GAC AAA 2642 He Glu Leu Arg Asn Lys Lys Asp Leu He Met Asp Phe He Asp Lys 625 630 635
TAC AAC AAA GAC CAA GAA GTC CAT GCG CAT TTT CAA GAT TAT ATC CAC 2690 Tyr Asn Lys Asp Gin Glu Val His Ala His Phe Gin Asp Tyr He His 640 645 650
CAA AAA AGA GAA GAG GAA TTC CAA AAT ATC ATA GAA GAA AAC CGC TTG 2738 Gin Lys Arg Glu Glu Glu Phe Gin Asn He He Glu Glu Asn Arg Leu 655 660 665 670
AAT GAA GAA AAA GCC TAT TCG TTC ATG CAG CAT GCC TTT AAA GGG GGC 2786 Asn Glu Glu Lys Ala Tyr Ser Phe Met Gin His Ala Phe Lys Gly Gly 675 680 685
GAA ATC AGT TTT AGT GGG ACG GAA TTC CCT AAA ATC ATT GAA GAA AAA 2834 Glu He Ser Phe Ser Gly Thr Glu Phe Pro Lys He He Glu Glu Lys 690 695 700
CCC TCC ATG TTT GGT AAA AAT TCG CGC TAT CAA GAG GTG AAA GAA AAA 2882 Pro Ser Met Phe Gly Lys Asn Ser Arg Tyr Gin Glu Val Lys Glu Lys 705 710 715
GTC GCT GCA AGC CTT TCT CGT TTT TTC CAC CGC TTT TGT GAT CTC ACT 2930 Val Ala Ala Ser Leu Ser Arg Phe Phe His Arg Phe Cys Asp Leu Thr 720 725 730
AGC GCT ATA TTT AAG AAA AAT GAG GTT AAA AAA GAT GAG GTT AAT GAA 2978 Ser Ala He Phe Lys Lys Asn Glu Val Lys Lys Asp Glu Val Asn Glu 735 740 745 750
AAA TAGTTCATGA ACGCTTTTGC ATTAAGGCTC AAAAAAAGCG CCGTTTAATG GATT 3035 Lys
(2) INFORMATION FOR SEQ ID NO: 280:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 751 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280:
Met Asp Phe Ala Lys Ala Phe Phe Ala Lys Arg Ser Leu Leu Asn Val
1 5 10 15
Leu Thr Cys Tyr Cys Val Phe Thr Ser Glu Glu Val Leu Leu Val Met
20 25 30
Arg Pro Tyr Gin He Val Ala Ala Glu Arg He Leu Glu Lys He Lys 35 40 45
Thr Ala Gin Asn Ser Lys Thr Lys Asn Gin Ser Lys Gly Tyr He Trp
50 55 60
His Thr Thr Gly Ser Gly Lys Thr Leu Thr Ser Phe Lys Ser Ala Thr 65 70 75 80
Leu Ala Lys Glu Leu Glu Ser Val Ser Lys Val Leu Phe Val Val Asp
85 90 95
Arg Lys Asp Leu Asp Tyr Gin Thr Met Lys Glu Tyr Asp Lys Phe Gin
100 105 110
Lys Asp Cys Ala Asn Ser Asn Thr Ser Thr Lys He Leu Lys Glu Gin
115 120 125
Leu Glu Asp Ser Asn Ala Lys He He He Thr Thr He Gin Lys Leu
130 135 140
Asp Lys Phe Val Lys Ser His Lys Gly His Ala He Phe Asn Glu Glu 145 150 155 160
Val Val Met He Phe Asp Glu Cys His Arg Ser Gin Leu Gly Ser Met
165 170 175
His Gin Ala He Thr Lys Ala Phe Lys Lys Tyr His Leu Phe Gly Phe
180 185 190
Thr Gly Thr Pro He Phe Ala Ala Asn Cys Asp Lys Asn Asn Pro Leu
195 200 205
Gly Thr Thr Glu Gin Lys Phe Gly Lys Cys Leu His Gin Tyr Thr He
210 215 220
He Asp Ala He Arg Asp Lys Asn Val Leu Pro Phe Arg Val Glu Tyr 225 230 235 240
His Asn Thr He Lys Ala Lys Glu Asp He Lys Asp Asn Lys Val Arg
245 250 255
Ala Val Asp Glu Lys Asn Ala Leu Leu Asp Thr Arg Arg He Lys Glu
260 265 270
He Thr Lys Cys He Leu Glu Arg Phe Asn Gin Ala Thr Lys Asn Lys
275 280 285
Lys Phe Asn Ser He Leu Ala Cys Ser Ser He Glu Ala Leu Lys Lys
290 295 300
Tyr Tyr Gin Ala Phe Lys Glu Glu Lys His Asp Leu Lys He Ala Ala 305 310 315 320
He Phe Ser Tyr Ser Ala Asn Glu Glu He Asp Thr Leu Glu Asp Glu
325 330 335
Asn Asn Glu Ser Ala Cys Arg Leu Asp Lys Ser Ser Arg Asp Phe Leu
340 345 350
Glu Gly Ala He Ala Asp Tyr Asn Gly Met Phe Gly Val Ser Phe Asp
355 360 365
Thr Ser Asp Gin Lys Phe Gin Ser Tyr Tyr Lys Asp Leu Ser Gin Lys
370 375 380
Met Lys Glu Arg Lys He Asp Leu Leu Met Val Val Asn Met Phe Leu 385 390 395 400
Thr Gly Phe Asp Ala Thr Arg Leu Asn Thr Leu Trp Val Asp Lys Asn
405 410 415
Leu Lys Tyr His Gly Leu He Gin Ala Phe Ser Arg Ala Asn Arg He
420 425 430
Leu Asp Ser Val Lys Thr His Gly Asn He Val Cys Phe Arg Asp Leu
435 440 445
Glu Gin Asp Leu Asn Asp Ala Leu Met Leu Phe Gly Asn Lys Asp Ala
450 455 460
Gin Ser He Ala Leu Leu Arg Lys Tyr Glu Asp Tyr Leu Lys Gly Tyr 465 470 475 480 Thr Asp Asn Asn Lys Glu Tyr Glu Gly Tyr Glu Gly Leu He Lys Arg
485 490 495
Leu Leu Thr Glu Phe Pro Leu Lys Glu Pro He Val Ser Glu Ser Gin
500 505 510
Lys Lys Asp Phe He Lys Leu Phe Gly Lys He Leu Lys Leu Glu Asn
515 520 525
He Leu Asn Ser Phe Glu Asn Phe Lys Lys Asp Asp Tyr He Asn Pro
530 535 540
Arg Asp Phe Gin Asp Tyr Gin Ser Lys Tyr Leu Asp Phe Tyr Asp Ala 545 550 555 560
Met Arg Ser Glu Lys Gly Lys Asp Lys Glu Glu He Asn Asp Asp Leu
565 570 575
He Phe Glu He Glu Leu He Lys Gin Val Glu Val Asn He Asp Tyr
580 585 590
He Leu Asn Leu He Glu Glu Phe Ala Lys Glu His Gly Val Glu He
595 600 605
Gin Gly Val Lys Thr Lys He Glu Pro He He Asn Ser Ser He Glu
610 615 620
Leu Arg Asn Lys Lys Asp Leu He Met Asp Phe He Asp Lys Tyr Asn 625 630 635 640
Lys Asp Gin Glu Val His Ala His Phe Gin Asp Tyr He His Gin Lys
645 650 655
Arg Glu Glu Glu Phe Gin Asn He He Glu Glu Asn Arg Leu Asn Glu
660 665 670
Glu Lys Ala Tyr Ser Phe Met Gin His Ala Phe Lys Gly Gly Glu He
675 680 685
Ser Phe Ser Gly Thr Glu Phe Pro Lys He He Glu Glu Lys Pro Ser
690 695 700
Met Phe Gly Lys Asn Ser Arg Tyr Gin Glu Val Lys Glu Lys Val Ala 705 710 715 720
Ala Ser Leu Ser Arg Phe Phe His Arg Phe Cys Asp Leu Thr Ser Ala
725 730 735
He Phe Lys Lys Asn Glu Val Lys Lys Asp Glu Val Asn Glu Lys 740 745 750
(2) INFORMATION FOR SEQ ID NO: 281:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 850 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 68...799 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: ACCACTATTG TAGAAAAATA ACAAGAGGGT TTGCAAAAAC TCTCATTAAA AACAAGGAGC 60 AAAAAAG ATG AAA AAG GCG GGC TTT CTT TTT TTA GCG GTA ATG GCT ATC 109 Met Lys Lys Ala Gly Phe Leu Phe Leu Ala Val Met Ala He 1 5 10
GTT GTT ATG AGT TTA AAC GCT AAA GAT CCG AAT GTG TTG CGT AAG ATT 157 Val Val Met Ser Leu Asn Ala Lys Asp Pro Asn Val Leu Arg Lys He 15 20 25 30
GTT TTT GAG AAA TGT CTG CCT AAT TAT GAG AAA AAT CAG AAT CCT TCG 205 Val Phe Glu Lys Cys Leu Pro Asn Tyr Glu Lys Asn Gin Asn Pro Ser 35 40 45
CCA TGC ATA GAA GTC AAA CCC GAT GCC GGC TAT GTG GTT TTA AAA GAT 253 Pro Cys He Glu Val Lys Pro Asp Ala Gly Tyr Val Val Leu Lys Asp 50 55 60
ATT AAC GGC CCG TTG CAA TAT TTG TTG ATG CCA ACA ACT CAC ATT AGC 301 He Asn Gly Pro Leu Gin Tyr Leu Leu Met Pro Thr Thr His He Ser 65 70 75
GGT ATT GAA AGC CCT TTG TTA CTT GAT CCT TCT ACG CCT AAC TTT TTT 349 Gly He Glu Ser Pro Leu Leu Leu Asp Pro Ser Thr Pro Asn Phe Phe 80 85 90
TAT TTA TCC TGG CAA GCG CGT GAT TTT ATG AGT AAA AAA TAC GGC CAA 397 Tyr Leu Ser Trp Gin Ala Arg Asp Phe Met Ser Lys Lys Tyr Gly Gin 95 100 105 110
CCC ATT CCT GAT TAT GCG ATT TCT TTG ACG ATT AAC TCT AGC AAA GGG 445 Pro He Pro Asp Tyr Ala He Ser Leu Thr He Asn Ser Ser Lys Gly 115 120 125
CGA TCG CAA AAC CAT TTT CAT ATC CAT ATC TCT TGC ATT AGT CTT GAA 493 Arg Ser Gin Asn His Phe His He His He Ser Cys He Ser Leu Glu 130 135 140
GCA CGC AAA CAG CTG GAT AAT AAC CTA AAA AAA ATC AAC AGC CGT TGG 541 Ala Arg Lys Gin Leu Asp Asn Asn Leu Lys Lys He Asn Ser Arg Trp 145 150 155
TCG CCA TTA CCG GGC GGT TTG AAT GGG CAT AAA TAC TTG GCG CGT CGG 589 Ser Pro Leu Pro Gly Gly Leu Asn Gly His Lys Tyr Leu Ala Arg Arg 160 165 170
GTA ACA GAG AGC GAG TTA GTG CAA AAA AGC CCG TTT GTC ATG CTT AAT 637 Val Thr Glu Ser Glu Leu Val Gin Lys Ser Pro Phe Val Met Leu Asn 175 180 185 190
AAA GAA GTG CCT AAT GCG TAC AAA CGC ATG GGG GAC TAT GGC TTA GCG 685 Lys Glu Val Pro Asn Ala Tyr Lys Arg Met Gly Asp Tyr Gly Leu Ala 195 200 205
GTG GTG CAA CAA AGC GAT AAC TCC TTT GTC TTA TTA GCG ACA CAA TTT 733 Val Val Gin Gin Ser Asp Asn Ser Phe Val Leu Leu Ala Thr Gin Phe 210 215 220
AAC CCA TTG ACT TTA AAT CGC GCT TCA GCC GAA GAG ATT CAA GAT CAT 781 Asn Pro Leu Thr Leu Asn Arg Ala Ser Ala Glu Glu He Gin Asp His 225 230 235
GAA TGC GCG ATT TTG CAC TAAAGCGAGT TAGATTCTTA AGCTTGAGCG ATAACCTT 837 Glu Cys Ala He Leu His 240
TAAAAAGCGT TAT 850
(2) INFORMATION FOR SEQ ID NO:282:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 244 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.282:
Met Lys Lys Ala Gly Phe Leu Phe Leu Ala Val Met Ala He Val Val
1 5 10 15
Met Ser Leu Asn Ala Lys Asp Pro Asn Val Leu Arg Lys He Val Phe
20 25 30
Glu Lys Cys Leu Pro Asn Tyr Glu Lys Asn Gin Asn Pro Ser Pro Cys
35 40 45
He Glu Val Lys Pro Asp Ala Gly Tyr Val Val Leu Lys Asp He Asn
50 55 60
Gly Pro Leu Gin Tyr Leu Leu Met Pro Thr Thr His He Ser Gly He 65 70 75 80
Glu Ser Pro Leu Leu Leu Asp Pro Ser Thr Pro Asn Phe Phe Tyr Leu
85 90 95
Ser Trp Gin Ala Arg Asp Phe Met Ser Lys Lys Tyr Gly Gin Pro He
100 105 110
Pro Asp Tyr Ala He Ser Leu Thr He Asn Ser Ser Lys Gly Arg Ser
115 120 125
Gin Asn His Phe His He His He Ser Cys He Ser Leu Glu Ala Arg
130 135 140
Lys Gin Leu Asp Asn Asn Leu Lys Lys He Asn Ser Arg Trp Ser Pro 145 150 155 160
Leu Pro Gly Gly Leu Asn Gly His Lys Tyr Leu Ala Arg Arg Val Thr
165 170 175
Glu Ser Glu Leu Val Gin Lys Ser Pro Phe Val Met Leu Asn Lys Glu
180 185 190
Val Pro Asn Ala Tyr Lys Arg Met Gly Asp Tyr Gly Leu Ala Val Val
195 200 205
Gin Gin Ser Asp Asn Ser Phe Val Leu Leu Ala Thr Gin Phe Asn Pro
210 215 220
Leu Thr Leu Asn Arg Ala Ser Ala Glu Glu He Gin Asp His Glu Cys 225 230 235 240 Ala He Leu His
(2) INFORMATION FOR SEQ ID NO: 283:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 981 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 57...929 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:283:
GTCCTATTTT TTCATTCATT CAACGAATTT AAAAATTACA ATAAAGAGTT ATAGTT ATG 59
Met 1
AAA CGA AGG GAT TTT ATT AAA ACG ACT ACT TTA GGC GCT ACA GGT GCT 107 Lys Arg Arg Asp Phe He Lys Thr Thr Thr Leu Gly Ala Thr Gly Ala 5 10 15
GTT TTA GGA GCA CAG ATT TTG CAG GCA GAA GAA AGT AAA GGG AGT GTT 155 Val Leu Gly Ala Gin He Leu Gin Ala Glu Glu Ser Lys Gly Ser Val 20 25 30
GCA AAA TAT AAA ATA GAA GCT CAA TAC AGC ATT GAT TTT GAT TCT GCA 203 Ala Lys Tyr Lys He Glu Ala Gin Tyr Ser He Asp Phe Asp Ser Ala 35 40 45
GAA CAC ACT TCA CTT TTC ATT CCC ATG CCG AGT GTT GTA GCG AGC AAT 251 Glu His Thr Ser Leu Phe He Pro Met Pro Ser Val Val Ala Ser Asn 50 55 60 65
GTG CAT TTA CAA GGC AAT CAT GCT AGC TAT AAA AGC ATG CTC AAT TTT 299 Val His Leu Gin Gly Asn His Ala Ser Tyr Lys Ser Met Leu Asn Phe 70 75 80
GGA GTG CCT TAT TTG CAA GTG GAT TTT TTA AAA AGC ACT CAA AAA AAG 347 Gly Val Pro Tyr Leu Gin Val Asp Phe Leu Lys Ser Thr Gin Lys Lys 85 90 95
CAA GTC CAT TTG TCT TAT GAG ATC GCT AGC TAT CAA TTG AAT GAG CGT 395 Gin Val His Leu Ser Tyr Glu He Ala Ser Tyr Gin Leu Asn Glu Arg 100 105 110 TTG TTT GAA ACG AGC GAT TTT GTA GCA ATG GGG CGT TAT GAA AGA GAC 443 Leu Phe Glu Thr Ser Asp Phe Val Ala Met Gly Arg Tyr Glu Arg Asp 115 120 125
GAT GCG AGC GTG GCT AAC ATT GCC AAC CAG CTT AAG GGA ACA ACC CCT 491 Asp Ala Ser Val Ala Asn He Ala Asn Gin Leu Lys Gly Thr Thr Pro 130 135 140 145
AAA GAA AGC GTT CGC AAT TTT TAT GCG TTC ATC AAG CAT GAG ATG CCT 539 Lys Glu Ser Val Arg Asn Phe Tyr Ala Phe He Lys His Glu Met Pro 150 155 160
AAG AGA CAG AAG GCT TTA GAG GGT AAA GAA AAT TTA CCT AAG CGT GAG 587 Lys Arg Gin Lys Ala Leu Glu Gly Lys Glu Asn Leu Pro Lys Arg Glu 165 170 175
AGT TTG CCC TGG TTT GCA ACC ATT TCA AAA GAG AGC ATG TTT GTG TCC 635 Ser Leu Pro Trp Phe Ala Thr He Ser Lys Glu Ser Met Phe Val Ser 180 185 190
TTA TGC CAT GCG TGC GGG ATT AAA AGC GCT GAA GTG CAA GGC TTG AAA 683 Leu Cys His Ala Cys Gly He Lys Ser Ala Glu Val Gin Gly Leu Lys 195 200 205
CTG GGT CAA AAC AGC GTG GTG AAA AAC GCT CCT AGA GTG GAA GTG TAT 731 Leu Gly Gin Asn Ser Val Val Lys Asn Ala Pro Arg Val Glu Val Tyr 210 215 220 225
TTG AAA GAT TCA TTT CTA GCG TTT GAT TTT CAA AAT AAT CAC AAG GAA 779 Leu Lys Asp Ser Phe Leu Ala Phe Asp Phe Gin Asn Asn His Lys Glu 230 235 240
GTT TTT ATC CCG TTG AAT CGT CAT AAA GAC ATG CAG TTA GAT TCT GCC 827 Val Phe He Pro Leu Asn Arg His Lys Asp Met Gin Leu Asp Ser Ala 245 250 255
TTA TTG GCG ACT TTT GGC GAT GCT TTT GCC CTT GTG GAT GGT AGG GAT 875 Leu Leu Ala Thr Phe Gly Asp Ala Phe Ala Leu Val Asp Gly Arg Asp 260 265 270
TTA GGC AAT TAC GAG AGC AAA CTT TTT GAA AAA AGA GTG TCC TAT ACG 923 Leu Gly Asn Tyr Glu Ser Lys Leu Phe Glu Lys Arg Val Ser Tyr Thr 275 280 285
ATT GTC TAAAGGCATG AAATCTAGGA ATATTCCTTG ATAGCGGGCT TTCCTTTTTA GG 981
He Val
290
(2) INFORMATION FOR SEQ ID NO: 284:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 291 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:284:
Met Lys Arg Arg Asp Phe He Lys Thr Thr Thr Leu Gly Ala Thr Gly
1 5 10 15
Ala Val Leu Gly Ala Gin He Leu Gin Ala Glu Glu Ser Lys Gly Ser
20 25 30
Val Ala Lys Tyr Lys He Glu Ala Gin Tyr Ser He Asp Phe Asp Ser
35 40 45
Ala Glu His Thr Ser Leu Phe He Pro Met Pro Ser Val Val Ala Ser
50 55 60
Asn Val His Leu Gin Gly Asn His Ala Ser Tyr Lys Ser Met Leu Asn 65 70 75 80
Phe Gly Val Pro Tyr Leu Gin Val Asp Phe Leu Lys Ser Thr Gin Lys
85 90 95
Lys Gin Val His Leu Ser Tyr Glu He Ala Ser Tyr Gin Leu Asn Glu
100 105 110
Arg Leu Phe Glu Thr Ser Asp Phe Val Ala Met Gly Arg Tyr Glu Arg
115 120 125
Asp Asp Ala Ser Val Ala Asn He Ala Asn Gin Leu Lys Gly Thr Thr
130 135 140
Pro Lys Glu Ser Val Arg Asn Phe Tyr Ala Phe He Lys His Glu Met 145 150 155 160
Pro Lys Arg Gin Lys Ala Leu Glu Gly Lys Glu Asn Leu Pro Lys Arg
165 170 175
Glu Ser Leu Pro Trp Phe Ala Thr He Ser Lys Glu Ser Met Phe Val
180 185 190
Ser Leu Cys His Ala Cys Gly He Lys Ser Ala Glu Val Gin Gly Leu
195 200 205
Lys Leu Gly Gin Asn Ser Val Val Lys Asn Ala Pro Arg Val Glu Val
210 215 220
Tyr Leu Lys Asp Ser Phe Leu Ala Phe Asp Phe Gin Asn Asn His Lys 225 230 235 240
Glu Val Phe He Pro Leu Asn Arg His Lys Asp Met Gin Leu Asp Ser
245 250 255
Ala Leu Leu Ala Thr Phe Gly Asp Ala Phe Ala Leu Val Asp Gly Arg
260 265 270
Asp Leu Gly Asn Tyr Glu Ser Lys Leu Phe Glu Lys Arg Val Ser Tyr
275 280 285
Thr He Val 290
(2) INFORMATION FOR SEQ ID NO: 285:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 686 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA ( ix) FEATURE :
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 82...633 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285:
TGTGATTAAA CAAAATCAAA AACTTTTTAA CTATAATCAA ACCTAAATTA AAGTTCAAGG 60 AGTGGCATTT TGTTTAAAAG A ATG GTT TTA ATC GCT CTT TTA GGG GTG TTT 111
Met Val Leu He Ala Leu Leu Gly Val Phe 1 5 10
TCA AGC GTT TCA TTA AGC GCT AAG AGT CTT TTA AGA GAT GAT GGG ATT 159 Ser Ser Val Ser Leu Ser Ala Lys Ser Leu Leu Arg Asp Asp Gly He 15 20 25
TTA GTC TCT GAT TTA AAG GGC ATG AAA TCA GAA CTA TCT GAT GCT CCT 207 Leu Val Ser Asp Leu Lys Gly Met Lys Ser Glu Leu Ser Asp Ala Pro 30 35 40
GCT TGG GTT TTT GAA GAC GCT AAA GCC CCC TAC GAA GAA ATG GGC GTG 255 Ala Trp Val Phe Glu Asp Ala Lys Ala Pro Tyr Glu Glu Met Gly Val 45 50 55
GCG TAT ATC CCT GTT AAT AAT AAA TAT TTA GGG ATT GAG CAA GCG ACC 303 Ala Tyr He Pro Val Asn Asn Lys Tyr Leu Gly He Glu Gin Ala Thr 60 65 70
TTA AAC GCT AAA TTG AGT CTG ATC GTG GTT TTT CAT GAA ATC ATG ATG 351 Leu Asn Ala Lys Leu Ser Leu He Val Val Phe His Glu He Met Met 75 80 85 90
AAG TAT AAA AAA CGC TTC ATG GAG CAA TTC CAT GAG TCC GAG CAG ACG 399 Lys Tyr Lys Lys Arg Phe Met Glu Gin Phe His Glu Ser Glu Gin Thr 95 100 105
ACT ACG AAT ATC AGT TAC GCT ATC TAT AAT TAT CTA GCG ACT AAG ATC 447 Thr Thr Asn He Ser Tyr Ala He Tyr Asn Tyr Leu Ala Thr Lys He 110 115 120
CAG GTA TCC AAC ACC TAC ACG AAT TTA AAA TCG GAG GTG GCG GTG GTG 495 Gin Val Ser Asn Thr Tyr Thr Asn Leu Lys Ser Glu Val Ala Val Val 125 130 135
AAA ATC AAG CTA GTG GGT TGT CAG ATT GAG CAA ATC AAA AGG TAT TTA 543 Lys He Lys Leu Val Gly Cys Gin He Glu Gin He Lys Arg Tyr Leu 140 145 150
AAA GCG AGC GTT GAA AAC CTT AAC GAT AAT GAA ATC GCT TAC ATC GCT 591 Lys A'la Ser Val Glu Asn Leu Asn Asp Asn Glu He Ala Tyr He Ala 155 160 165 170 AAG GTC GCT CAA AAA GAA TTT GGT AGC GTT TGT GCG TTA AGG TAGTTTTAT 642 Lys Val Ala Gin Lys Glu Phe Gly Ser Val Cys Ala Leu Arg 175 180
AGCATTCTAG CGAGCATGTT TAAGGCATGC TCTACGCTTT TATT 686
(2) INFORMATION FOR SEQ ID NO: 286:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 184 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286:
Met Val Leu He Ala Leu Leu Gly Val Phe Ser Ser Val Ser Leu Ser
1 5 10 15
Ala Lys Ser Leu Leu Arg Asp Asp Gly He Leu Val Ser Asp Leu Lys
20 25 30
Gly Met Lys Ser Glu Leu Ser Asp Ala Pro Ala Trp Val Phe Glu Asp
35 40 45
Ala Lys Ala Pro Tyr Glu Glu Met Gly Val Ala Tyr He Pro Val Asn
50 55 60
Asn Lys Tyr Leu Gly He Glu Gin Ala Thr Leu Asn Ala Lys Leu Ser 65 70 75 80
Leu He Val Val Phe His Glu He Met Met Lys Tyr Lys Lys Arg Phe
85 90 95
Met Glu Gin Phe His Glu Ser Glu Gin Thr Thr Thr Asn He Ser Tyr
100 105 110
Ala He Tyr Asn Tyr Leu Ala Thr Lys He Gin Val Ser Asn Thr Tyr
115 120 125
Thr Asn Leu Lys Ser Glu Val Ala Val Val Lys He Lys Leu Val Gly
130 135 140
Cys Gin He Glu Gin He Lys Arg Tyr Leu Lys Ala Ser Val Glu Asn 145 150 155 160
Leu Asn Asp Asn Glu He Ala Tyr He Ala Lys Val Ala Gin Lys Glu
165 170 175
Phe Gly Ser Val Cys Ala Leu Arg 180
(2) INFORMATION FOR SEQ ID NO: 287:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 310 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 112...252 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287:
ATGCCTGCCA TTTCATAGCC TAAATTTTCT TTAGAGCCGA ATTGATAAGC GGCTTTTAAG 60 ACTTCTTTTT GCTTAGCGTT AAAATCTTTA ATATTGTCGC AATTGGTCAT C ATG ACT 117
Met Thr 1
TTA GTA ACG GGC GAT TTG GGC TTG TTT TTA ACC CCT TTA GCG GGC TTA 165 Leu Val Thr Gly Asp Leu Gly Leu Phe Leu Thr Pro Leu Ala Gly Leu 5 10 15
GGC TCT GTT TTA GTG GGG CTT TCT GTT GCG GCT AAA CTT AAA GAT GCA 213 Gly Ser Val Leu Val Gly Leu Ser Val Ala Ala Lys Leu Lys Asp Ala 20 25 30
CTT AAG GCT GTG CCT AGC CAT AAG GCT TTA AAG ATG GTG TGAGTGAGTG GG 264 Leu Lys Ala Val Pro Ser His Lys Ala Leu Lys Met Val 35 40 45
GTTAAATGTT TCAAAACGCC TACCTTTTGT ATTAAGAAAT AAACTA 310
(2) INFORMATION FOR SEQ ID NO: 288:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 47 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288:
Met Thr Leu Val Thr Gly Asp Leu Gly Leu Phe Leu Thr Pro Leu Ala
1 5 10 15
Gly Leu Gly Ser Val Leu Val Gly Leu Ser Val Ala Ala Lys Leu Lys
20 25 30
Asp Ala Leu Lys Ala Val Pro Ser His Lys Ala Leu Lys Met Val 35 40 45
(2) INFORMATION FOR SEQ ID NO: 289:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 631 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 145...579 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289:
GCGTTCAATA GAATGCTTTA GTTAGGAAGC TCCTTGCTTT AGCAAGGNGT GGTTTCACTG 60 AAAGAGAGTA AGAAATTTGA AGAAAGGGTT TATCTTTTTT TAGATGAATT TGTGCGTTTT 120 GGTAAATTGC CTTTTTTATT AGAA ATG CCA GCA TTA AGT AGG AGC TAT GGT 171
Met Pro Ala Leu Ser Arg Ser Tyr Gly 1 5
GTG GTT TTA ATT TTT ATC ACG CAA TCC AAC GCT CTT ATT GAA AAA TAT 219 Val Val Leu He Phe He Thr Gin Ser Asn Ala Leu He Glu Lys Tyr 10 15 20 25
TAC GGC AGA GAA GAT GCA AGA ATT GTT AAT AGC ACC GTG GCT TAC AAA 267 Tyr Gly Arg Glu Asp Ala Arg He Val Asn Ser Thr Val Ala Tyr Lys 30 35 40
ATA ATT TTC AAA ATG GAT GAT TTA GAA TAC GCT AAA CAG GTG AGC GAA 315 He He Phe Lys Met Asp Asp Leu Glu Tyr Ala Lys Gin Val Ser Glu 45 50 55
GAA GTC GGT AAG ATG ACT AGA AAA ACA CGA AGC CAC TCT ACA GAA AAA 363 Glu Val Gly Lys Met Thr Arg Lys Thr Arg Ser His Ser Thr Glu Lys 60 65 70
GGA CAA CTC ATT ACC GGA GGG ACT TCT AGT ATA GGT AAA GAG GCG TGG 411 Gly Gin Leu He Thr Gly Gly Thr Ser Ser He Gly Lys Glu Ala Trp 75 80 85
GAC TTA TTG AGC GCG CAA GAT ATT ATG AAT ATT GAT AAA GAT GAA GTG 459 Asp Leu Leu Ser Ala Gin Asp He Met Asn He Asp Lys Asp Glu Val 90 95 100 105
ATC GTT TTA GTA AGC GGT CAT AAG GCT AAA CCC TTA AAA TTA AAA GCG 507 He Val Leu Val Ser Gly His Lys Ala Lys Pro Leu Lys Leu Lys Ala 110 115 120
AAT TAT TAT TTC AAA AAC AAA GAA TTA CTC TCT CGT ATT AAC TGG GAA 555 Asn Tyr Tyr Phe Lys Asn Lys Glu Leu Leu Ser Arg He Asn Trp Glu 125 130 135
GTC AAG CCC AAT GAA GAA GTG TTT TGATGGATTA AAAAAGTTTG CATGAGTATT 609 Val Lys Pro Asn Glu Glu Val Phe 140 145
TTTTAATTGC TTTTTTAAAA AT 631 (2) INFORMATION FOR SEQ ID NO: 290:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 145 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290:
Met Pro Ala Leu Ser Arg Ser Tyr Gly Val Val Leu He Phe He Thr
1 5 10 15
Gin Ser Asn Ala Leu He Glu Lys Tyr Tyr Gly Arg Glu Asp Ala Arg
20 25 30
He Val Asn Ser Thr Val Ala Tyr Lys He He Phe Lys Met Asp Asp
35 40 45
Leu Glu Tyr Ala Lys Gin Val Ser Glu Glu Val Gly Lys Met Thr Arg
50 55 60
Lys Thr Arg Ser His Ser Thr Glu Lys Gly Gin Leu He Thr Gly Gly 65 70 75 80
Thr Ser Ser He Gly Lys Glu Ala Trp Asp Leu Leu Ser Ala Gin Asp
85 90 95
He Met Asn He Asp Lys Asp Glu Val He Val Leu Val Ser Gly His
100 105 110
Lys Ala Lys Pro Leu Lys Leu Lys Ala Asn Tyr Tyr Phe Lys Asn Lys
115 120 125
Glu Leu Leu Ser Arg He Asn Trp Glu Val Lys Pro Asn Glu Glu Val
130 135 140
Phe 145
(2) INFORMATION FOR SEQ ID NO: 291:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 290 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 106...237 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: TAAGGGCTTA CGCATAAAAT CGCATCCGCG CCGATTTTTT GAGCGAACTT TGCTAAAGAA 60 AGGGACTCGC TCGTGGCGTT ACTGCCCACG CCGGCTAACA CTTTC ATG CGC GAA TTT 117
Met Arg Glu Phe 1
GAG GGC GTT TTA GTG TTT TTG CAA GTT TCT ATG GCG ATT TCA ATG CAA 165 Glu Gly Val Leu Val Phe Leu Gin Val Ser Met Ala He Ser Met Gin 5 10 15 20
CGC ATG TGC TCT TTG TGG GTG AGC GTG GCG GAT TCT CCT GTC GTG CCA 213 Arg Met Cys Ser Leu Trp Val Ser Val Ala Asp Ser Pro Val Val Pro 25 30 35
ACA GGC ACG CAT GCG TCC ATG CCC TGAAAAATTT GGCGCTTGAT CAAGGTTTCA 267 Thr Gly Thr His Ala Ser Met Pro 40
TAAGCGGCCT CATCAACGCT CAA 290
(2) INFORMATION FOR SEQ ID NO: 292:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 44 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292:
Met Arg Glu Phe Glu Gly Val Leu Val Phe Leu Gin Val Ser Met Ala
1 5 10 15
He Ser Met Gin Arg Met Cys Ser Leu Trp Val Ser Val Ala Asp Ser
20 25 30
Pro Val Val Pro Thr Gly Thr His Ala Ser Met Pro 35 40
(2) INFORMATION FOR SEQ ID NO:293:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 421 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 58...369 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:293:
AAAGAAGGTT TGAAAAGCTT TTTTAAAAGG CTTTTGAAGT ATTTGGGGTA GGCTTGA ATG 60
Met 1
AAA GTG CAA AAT TTT ATC CAT TTT TCT GTT GTG GTA GGG TTT TTT TTG 108 Lys Val Gin Asn Phe He His Phe Ser Val Val Val Gly Phe Phe Leu 5 10 15
GGG TTA GTG TTT TCG GTG TTG AAA TTC AAT GAG CCA GAG AGC ATT TTA 156 Gly Leu Val Phe Ser Val Leu Lys Phe Asn Glu Pro Glu Ser He Leu 20 25 30
TTA TGG ACG GTG TTA TCC ACG CTT GGG GGG TAC TTG ATT GCG TTG TTG 204 Leu Trp Thr Val Leu Ser Thr Leu Gly Gly Tyr Leu He Ala Leu Leu 35 40 45
TTT GCG TCT ATT TTT ATC GCT TGC ACG GAT TTG GAT ATT TGT CTT TTT 252 Phe Ala Ser He Phe He Ala Cys Thr Asp Leu Asp He Cys Leu Phe 50 55 60 65
GAC AAA AAA GGC ACT GAA GAG AGT TTG CTT CGT TTC AAC CAT GAG TTT 300 Asp Lys Lys Gly Thr Glu Glu Ser Leu Leu Arg Phe Asn His Glu Phe 70 75 80
AAA AAC AGA GAA AAA GAA GTG GCT AGT ATT TTA GAA TAC ATT AGA AGT 348 Lys Asn Arg Glu Lys Glu Val Ala Ser He Leu Glu Tyr He Arg Ser 85 90 95
TAT GAT TTT GAT GAT GGA AAA TAGAATGCCC AAAGGAATTC AAAAAACTGA AACA 403 Tyr Asp Phe Asp Asp Gly Lys 100
AGCGAAAAAA ATATAGAA 421
(2) INFORMATION FOR SEQ ID NO: 294:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 104 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294:
Met Lys Val Gin Asn Phe He His Phe Ser Val Val Val Gly Phe Phe
1 5 10 15
Leu Gly Leu Val Phe Ser Val Leu Lys Phe Asn Glu Pro Glu Ser He
20 25 30
Leu Leu Trp Thr Val Leu Ser Thr Leu Gly Gly Tyr Leu He Ala Leu 35 40 45 Leu Phe Ala Ser He Phe He Ala Cys Thr Asp Leu Asp He Cys Leu
50 55 60
Phe Asp Lys Lys Gly Thr Glu Glu Ser Leu Leu Arg Phe Asn His Glu 65 70 75 80
Phe Lys Asn Arg Glu Lys Glu Val Ala Ser He Leu Glu Tyr He Arg
85 90 95
Ser Tyr Asp Phe Asp Asp Gly Lys 100
(2) INFORMATION FOR SEQ ID NO: 295:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 670 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...617 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:295:
GTTTAGAAAA GCGTTTGAAC GCTATAAAAA ATGCAGAGTG GCTTTAAGGC ATG AAA 56
Met Lys 1
AAG ATT GCA TTT TTT ATT TTT GTC ATT TTG TTT TCG GTA GGG ATT TAT 104 Lys He Ala Phe Phe He Phe Val He Leu Phe Ser Val Gly He Tyr 5 10 15
TTA ATT TGG CAT GTT TTA TTG GAA AAA GCC CTA GAA TTG AAA TTA GCA 152 Leu He Trp His Val Leu Leu Glu Lys Ala Leu Glu Leu Lys Leu Ala 20 25 30
ACC TCA GCT AAT GAT TTG CTT TTA AAA TTG TTG GCA ATT CTT GGC GTT 200 Thr Ser Ala Asn Asp Leu Leu Leu Lys Leu Leu Ala He Leu Gly Val 35 40 45 50
TTT TCA ATG TTA GTG CTT TTT CAA GGC ATT ATT TCT TCG TAT AAG AAG 248 Phe Ser Met Leu Val Leu Phe Gin Gly He He Ser Ser Tyr Lys Lys 55 60 65
CGC CAA CTC AAA CGC ATT TTA CAA AAA ATA GAC GCC ATG AAC GGC TTT 296 Arg Gin Leu Lys Arg He Leu Gin Lys He Asp Ala Met Asn Gly Phe 70 75 80
GAA TTT GAA GAA TAT TCC AAA ATC TTT TTC ACT TCA AAG GGT TTT GAA 344 Glu Phe Glu Glu Tyr Ser Lys He Phe Phe Thr Ser Lys Gly Phe Glu 85 90 95
GTG AGC ATC ACG CAA AAA AGC GGC GAT TAT GGA GCG GAT TTG ATT ATA 392 Val Ser He Thr Gin Lys Ser Gly Asp Tyr Gly Ala Asp Leu He He 100 105 110
GAA AAA GAC GGC ATC AAG TGG GCG GTT CAA GTC AAA CGC TAC TCG CAT 440 Glu Lys Asp Gly He Lys Trp Ala Val Gin Val Lys Arg Tyr Ser His 115 120 125 130
AAA GTT TCG CCC AAA GCC ATT CAA GAG GTG GTC TCT TCT AAA GCT TAT 488 Lys Val Ser Pro Lys Ala He Gin Glu Val Val Ser Ser Lys Ala Tyr 135 140 145
TAC GCT TGC GAA AAA GCT TGC GTG ATC ACC AAC AGC TAT TTC ACG CAA 536 Tyr Ala Cys Glu Lys Ala Cys Val He Thr Asn Ser Tyr Phe Thr Gin 150 155 160
GCC GCT CAA AAA CTG GCT CAA GCT AAC GAA GTG CTC TTG ATT GAC AGA 584 Ala Ala Gin Lys Leu Ala Gin Ala Asn Glu Val Leu Leu He Asp Arg 165 170 175
GAC GAA TGG GTC AGG TTT TTG AAC GAA AAG AGA TGAACCGATC CCATCAGATC 637 Asp Glu Trp Val Arg Phe Leu Asn Glu Lys Arg 180 185
GTTTGTTCTC AAGTTCTTTT AAAATTTTGT CGT 670
(2) INFORMATION FOR SEQ ID NO: 296:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 189 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296:
Met Lys Lys He Ala Phe Phe He Phe Val He Leu Phe Ser Val Gly
1 5 10 15
He Tyr Leu He Trp His Val Leu Leu Glu Lys Ala Leu Glu Leu Lys
20 25 30
Leu Ala Thr Ser Ala Asn Asp Leu Leu Leu Lys Leu Leu Ala He Leu
35 40 45
Gly Val Phe Ser Met Leu Val Leu Phe Gin Gly He He Ser Ser Tyr
50 55 60
Lys Lys Arg Gin Leu Lys Arg He Leu Gin Lys He Asp Ala Met Asn 65 70 75 80
Gly Phe Glu Phe Glu Glu Tyr Ser Lys He Phe Phe Thr Ser Lys Gly
85 90 95
Phe Glu Val Ser He Thr Gin Lys Ser Gly Asp Tyr Gly Ala Asp Leu 100 105 110 He He Glu Lys Asp Gly He Lys Trp Ala Val Gin Val Lys Arg Tyr
115 120 125
Ser His Lys Val Ser Pro Lys Ala He Gin Glu Val Val Ser Ser Lys
130 135 140
Ala Tyr Tyr Ala Cys Glu Lys Ala Cys Val He Thr Asn Ser Tyr Phe 145 150 155 160
Thr Gin Ala Ala Gin Lys Leu Ala Gin Ala Asn Glu Val Leu Leu He
165 170 175
Asp Arg Asp Glu Trp Val Arg Phe Leu Asn Glu Lys Arg 180 185
(2) INFORMATION FOR SEQ ID NO: 297:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 600 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 125...538 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297:
CAAAGAGTGG GATAAAATCA CAGAAATTTG TAAGAGAGCG CTAGCTTTAA GATAACAAAA 60
AGATCATGGC ATTTTGTATT TGCTTAATAA CACTATAATA AAATTTTTAA TAAGGAGATA 120
CATC ATG TTA GAA AAT GTC AAA AAG TCC TTT TTT AGG GTT TTG TGC TTG 169 Met Leu Glu Asn Val Lys Lys Ser Phe Phe Arg Val Leu Cys Leu 1 5 10 15
GGT GCG TTG TGT TTA GGG GGG CTA ATG GCA GAG CAA GAC CCT AAA GAG 217 Gly Ala Leu Cys Leu Gly Gly Leu Met Ala Glu Gin Asp Pro Lys Glu 20 25 30
CTT GTG GGT TTG GGG GCA AAG AGC TAC AAA GAG AAA GAT TTC ACT CAA 265 Leu Val Gly Leu Gly Ala Lys Ser Tyr Lys Glu Lys Asp Phe Thr Gin 35 40 45
GCG AAG AAA TAT TTT GAG AAA GCG TGC GAT TTG AAA GAA AAT AGC GGG 313 Ala Lys Lys Tyr Phe Glu Lys Ala Cys Asp Leu Lys Glu Asn Ser Gly 50 55 60
TGT TTT AAT TTA GGG GTG CTT TAT TAT CAA GGG CAA GGG GTG GAA AAG 361 Cys Phe Asn Leu Gly Val Leu Tyr Tyr Gin Gly Gin Gly Val Glu Lys 65 70 75
AAC TTG AAA AAA GCC GCC TCC TTT TAC GCT AAA GCT TGC GAT TTG AAT 409 Asn Leu Lys Lys Ala Ala Ser Phe Tyr Ala Lys Ala Cys Asp Leu Asn 80 85 90 95
TAC AGC AAT GGG TGT CAT TTG CTA GGG AAT TTA TAT TAC AGC GGG CAA 457 Tyr Ser Asn Gly Cys His Leu Leu Gly Asn Leu Tyr Tyr Ser Gly Gin 100 105 110
GGC GTG TCC CAA AAC ACC AAT AAA GCC CTA CAA TAC TAC TCT AAA GCG 505 Gly Val Ser Gin Asn Thr Asn Lys Ala Leu Gin Tyr Tyr Ser Lys Ala 115 120 125
TGC GAT TTG AAA TAC GCT GAA GGG TGC GCG ACT TAGGGGGGAT TTATCATGAT 558 Cys Asp Leu Lys Tyr Ala Glu Gly Cys Ala Thr 130 135
GGTAAAGTGG TAACTAGGGA TTTTAAAAAA GCGGTGGAAT AT 600
(2) INFORMATION FOR SEQ ID NO: 298:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 138 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298:
Met Leu Glu Asn Val Lys Lys Ser Phe Phe Arg Val Leu Cys Leu Gly
1 5 10 15
Ala Leu Cys Leu Gly Gly Leu Met Ala Glu Gin Asp Pro Lys Glu Leu
20 25 30
Val Gly Leu Gly Ala Lys Ser Tyr Lys Glu Lys Asp Phe Thr Gin Ala
35 40 45
Lys Lys Tyr Phe Glu Lys Ala Cys Asp Leu Lys Glu Asn Ser Gly Cys
50 55 60
Phe Asn Leu Gly Val Leu Tyr Tyr Gin Gly Gin Gly Val Glu Lys Asn 65 70 75 80
Leu Lys Lys Ala Ala Ser Phe Tyr Ala Lys Ala Cys Asp Leu Asn Tyr
85 90 95
Ser Asn Gly Cys His Leu Leu Gly Asn Leu Tyr Tyr Ser Gly Gin Gly
100 105 110
Val Ser Gin Asn Thr Asn Lys Ala Leu Gin Tyr Tyr Ser Lys Ala Cys
115 120 125
Asp Leu Lys Tyr Ala Glu Gly Cys Ala Thr 130 135
(2) INFORMATION FOR SEQ ID NO: 299:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 879 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 59...826 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: TTTTAAGATT GGTAGCCATT GGCATTATGT TTGATCTTAT TAAAGCAGAG GAGTAACA 58
ATG GGA TAC GCA AGC AAA TTA GCC TTG AAG ATT TGT TTG GCA AGT TTA 106 Met Gly Tyr Ala Ser Lys Leu Ala Leu Lys He Cys Leu Ala Ser Leu 1 5 10 15
TGT TTA TTT AGC GCT CTT GGT GCA GAA CAC CTT GAA CAA AAA AGG AAC 154 Cys Leu Phe Ser Ala Leu Gly Ala Glu His Leu Glu Gin Lys Arg Asn 20 25 30
TAT ATT TAT AAM GGG GAG GAA GCC TAT AAT AAT AAG GAA TAT GAG CGG 202 Tyr He Tyr Xaa Gly Glu Glu Ala Tyr Asn Asn Lys Glu Tyr Glu Arg 35 40 45
GCG GCT TCT TTT TAT AAG AGC GCG ATT AAA AAT GGC GAG CCG CTT GCT 250 Ala Ala Ser Phe Tyr Lys Ser Ala He Lys Asn Gly Glu Pro Leu Ala 50 55 60
TAT GTT CTT TTA GGG ATC ATG TAT GAA AAT GGT AGG GGT GTG CCT AAA 298 Tyr Val Leu Leu Gly He Met Tyr Glu Asn Gly Arg Gly Val Pro Lys 65 70 75 80
GAT GAA AAG AAA GCG GCT GAA TAT TTT CAA AAA GCG GTT GAT AAC GAT 346 Asp Glu Lys Lys Ala Ala Glu Tyr Phe Gin Lys Ala Val Asp Asn Asp 85 90 95
ATA CCT AGA GGG TAT AAC AAT TTA GGC GTG ATG TAT AAA GAG GGT AGA 394 He Pro Arg Gly Tyr Asn Asn Leu Gly Val Met Tyr Lys Glu Gly Arg 100 105 110
GGT GTG CCT AAA GAT GAA AAG AAA GCC GTG GAG TAT TTT AGA ATA GCT 442 Gly Val Pro Lys Asp Glu Lys Lys Ala Val Glu Tyr Phe Arg He Ala 115 120 125
ACC GAG AAG GGC TAT ACT AAC GCC TAT ATA AAC TTA GGC ATC ATG TAT 490 Thr Glu Lys Gly Tyr Thr Asn Ala Tyr He Asn Leu Gly He Met Tyr 130 135 140
ATG GAG GGT AGG GGA GTT CCA AGC AAC TAT GTG AAA GCG ACA GAG TGC 538 Met Glu Gly Arg Gly Val Pro Ser Asn Tyr Val Lys Ala Thr Glu Cys 145 150 155 160 TTT AGA AAA GCG ATG CAT AAG GGT AAT GTA GAA GCT TAT ATC CTT TTA 586 Phe Arg Lys Ala Met His Lys Gly Asn Val Glu Ala Tyr He Leu Leu 165 170 175
GGG GAT ATT TAT TAT AGT GGG AAT GAT CAA TTG GGT ATT GAG CCA GAC 634 Gly Asp He Tyr Tyr Ser Gly Asn Asp Gin Leu Gly He Glu Pro Asp 180 185 190
AAA GAT AAG GCG ATT GTC TAT TAT AAA ATG GCG GCT GAT ATG AGC TCT 682 Lys Asp Lys Ala He Val Tyr Tyr Lys Met Ala Ala Asp Met Ser Ser 195 200 205
TCT AGA GCT TAT GAA GGG TTA GCA GAG TCT TAT CAG TAT GGG TTA GGC 730 Ser Arg Ala Tyr Glu Gly Leu Ala Glu Ser Tyr Gin Tyr Gly Leu Gly 210 215 220
GTG GAA AAA GAT AAG AAA AAG GCT GAA GAA TAC ATG CAA AAA GCA TGC 778 Val Glu Lys Asp Lys Lys Lys Ala Glu Glu Tyr Met Gin Lys Ala Cys 225 230 235 240
GAT TTT GAC ATT GAT AAA AAT TGT AAG AAA AAG AAC ACT TCA AGC CGA 826 Asp Phe Asp He Asp Lys Asn Cys Lys Lys Lys Asn Thr Ser Ser Arg 245 250 255
TAACTCTCAA ACTTGGGCTT GATTAGGATT TTTGTTTTAT TTTAAGTAGC ATG 879
(2) INFORMATION FOR SEQ ID NO: 300:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 256 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300:
Met Gly Tyr Ala Ser Lys Leu Ala Leu Lys He Cys Leu Ala Ser Leu
1 5 10 15
Cys Leu Phe Ser Ala Leu Gly Ala Glu His Leu Glu Gin Lys Arg Asn
20 25 30
Tyr He Tyr Xaa Gly Glu Glu Ala Tyr Asn Asn Lys Glu Tyr Glu Arg
35 40 45
Ala Ala Ser Phe Tyr Lys Ser Ala He Lys Asn Gly Glu Pro Leu Ala
50 55 60
Tyr Val Leu Leu Gly He Met Tyr Glu Asn Gly Arg Gly Val Pro Lys 65 70 75 80
Asp Glu Lys Lys Ala Ala Glu Tyr Phe Gin Lys Ala Val Asp Asn Asp
85 90 95
He Pro Arg Gly Tyr Asn Asn Leu Gly Val Met Tyr Lys Glu Gly Arg
100 105 110
Gly Val Pro Lys Asp Glu Lys Lys Ala Val Glu Tyr Phe Arg He Ala 115 120 125 Thr Glu Lys Gly Tyr Thr Asn Ala Tyr He Asn Leu Gly He Met Tyr
130 135 140
Met Glu Gly Arg Gly Val Pro Ser Asn Tyr Val Lys Ala Thr Glu Cys 145 150 155 160
Phe Arg Lys Ala Met His Lys Gly Asn Val Glu Ala Tyr He Leu Leu
165 170 175
Gly Asp He Tyr Tyr Ser Gly Asn Asp Gin Leu Gly He Glu Pro Asp
180 185 190
Lys Asp Lys Ala He Val Tyr Tyr Lys Met Ala Ala Asp Met Ser Ser
195 200 205
Ser Arg Ala Tyr Glu Gly Leu Ala Glu Ser Tyr Gin Tyr Gly Leu Gly
210 215 220
Val Glu Lys Asp Lys Lys Lys Ala Glu Glu Tyr Met Gin Lys Ala Cys 225 230 235 240
Asp Phe Asp He Asp Lys Asn Cys Lys Lys Lys Asn Thr Ser Ser Arg 245 250 255
(2) INFORMATION FOR SEQ ID NO: 301:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 319 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS : single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 66...269 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301:
TGGGGGTTGT GGTTGCTCAT GTTCGCATGG GTAGTAAGGT ATAGGAGTAT TTAAAAGGCA 60
AGGTC ATG AAT AGT TCT AAT CTC AAA AAT TGG CTA TTC CCT ACC ATT TGC 110
Met Asn Ser Ser Asn Leu Lys Asn Trp Leu Phe Pro Thr He Cys
1 5 10 15
TTT TTT TTA TTT TGT TAT ATT TTA ATT TTT TTA ATG TTC TTT ATG TTT 158 Phe Phe Leu Phe Cys Tyr He Leu He Phe Leu Met Phe Phe Met Phe 20 25 30
AAA AGT TTG CAA TCG CAA TCG TTT GGC TCT GTG GCA GAA ACC GGA AAA 206 Lys Ser Leu Gin Ser Gin Ser Phe Gly Ser Val Ala Glu Thr Gly Lys 35 40 45
AAA CCC ATC ACC ACC ACC AAG AAA TTT GGT AAG GAA TTG CAA AAA CAG 254 Lys Pro He Thr Thr Thr Lys Lys Phe Gly Lys Glu Leu Gin Lys Gin 50 55 60
ATT TCA AAA ATC CAT TAACTTTTTT TCTTTTTTGC CGATACTTGC TGTAATGGAA T 310 He Ser Lys He His 65
GAATATCAA 319
(2) INFORMATION FOR SEQ ID NO: 302:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 68 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302:
Met Asn Ser Ser Asn Leu Lys Asn Trp Leu Phe Pro Thr He Cys Phe
1 5 10 15
Phe Leu Phe Cys Tyr He Leu He Phe Leu Met Phe Phe Met Phe Lys
20 25 30
Ser Leu Gin Ser Gin Ser Phe Gly Ser Val Ala Glu Thr Gly Lys Lys
35 40 45
Pro He Thr Thr Thr Lys Lys Phe Gly Lys Glu Leu Gin Lys Gin He
50 55 60
Ser Lys He His 65
(2) INFORMATION FOR SEQ ID NO:303:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1112 base, pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 66...1058 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303:
AAAAACTAGA GAGTGTTATA AAGAAAACAG AAGAGTGGAT GTCAAATTAA TGAAGTAATT 60
CTAGG ATG AAA AGG CTT TTT TTT ATC CCT TTT ATC GCT CCC TTT TTT CTC 110
Met Lys Arg Leu Phe Phe He Pro Phe He Ala Pro Phe Phe Leu
1 5 10 15
AAT GGG GAG CCT TCA GCG TTT GAT TTG CAA AGT GGG GCT ACC AAA AAA 158 Asn Gly Glu Pro Ser Ala Phe Asp Leu Gin Ser Gly Ala Thr Lys Lys 20 25 30
GAA CTC AAG CAG TTG CAA ATC AAT AGT AAG AAT TTT TCT AAT ATT TTG 206 Glu Leu Lys Gin Leu Gin He Asn Ser Lys Asn Phe Ser Asn He Leu 35 40 45
ACC AAA ATC CAT TCG CAA GTA GAG GCT AAC ACT CAA GCT CAA GAG GGT 254 Thr Lys He His Ser Gin Val Glu Ala Asn Thr Gin Ala Gin Glu Gly 50 55 60
TTG AGA AGC GTT TAT GAG GGG CAG GCT AAT AAG ATT AAA GAT CTC AAT 302 Leu Arg Ser Val Tyr Glu Gly Gin Ala Asn Lys He Lys Asp Leu Asn 65 70 75
AAC GCT ATC CTT TCC CAA GAA GAA TCC TTA CGA GCC TTA AAA GCT TCG 350 Asn Ala He Leu Ser Gin Glu Glu Ser Leu Arg Ala Leu Lys Ala Ser 80 85 90 95
CAA GAA GTG CAG GCT AAC ACG CTT AAG CAG CAA TCG CAA ACT TTA GAG 398 Gin Glu Val Gin Ala Asn Thr Leu Lys Gin Gin Ser Gin Thr Leu Glu 100 105 110
GAT TTG AGG AAT GAG ATT CAC GCT AAC CAG CAA GCT ATC CAG CAG TTA 446 Asp Leu Arg Asn Glu He His Ala Asn Gin Gin Ala He Gin Gin Leu 115 120 125
GAC AAG CAA AAT AAA GAG ATG AGT GAA TTA TTG ACC AAG TTA AGC CAG 494 Asp Lys Gin Asn Lys Glu Met Ser Glu Leu Leu Thr Lys Leu Ser Gin 130 135 140
GAT TTG GTT TCA CAA ATC GCC TTA ATC CAA AAA GCT CTC AAA GAA CAA 542 Asp Leu Val Ser Gin He Ala Leu He Gin Lys Ala Leu Lys Glu Gin 145 150 155
GAG GAA AAA GCT GAA AAG CCG CTC AAA TCA AAC GCT CCG GCT AAT AAA 590 Glu Glu Lys Ala Glu Lys Pro Leu Lys Ser Asn Ala Pro Ala Asn Lys 160 165 170 175
ACC CCC TCT TTG AAA GCC GAA TCC CCA AAA AAT CAA GAG GGA AAA ACT 638 Thr Pro Ser Leu Lys Ala Glu Ser Pro Lys Asn Gin Glu Gly Lys Thr 180 185 190
CAA GAA AAG GCG AAA ATT GAG TTT GAT AAA GAC TTG TCT AAG CAA AAA 686 Gin Glu Lys Ala Lys He Glu Phe Asp Lys Asp Leu Ser Lys Gin Lys 195 200 205
GAG ATC TTT CAA GAA GCT CTG TCT TTT TTT AAA AAT AAA TCC TAT GCA 734 Glu He Phe Gin Glu Ala Leu Ser Phe Phe Lys Asn Lys Ser Tyr Ala 210 215 220
GAA GCC AAA GAG CGT TTG TTG TGG TTA GAA GCC AAT AGT TAC AGA CTT 782 Glu Ala Lys Glu Arg Leu Leu Trp Leu Glu Ala Asn Ser Tyr Arg Leu 225 230 235 TAT TAT GTG CGT TAT GTT CTT GGA GAA GTG GCT TAT GGG GAA AAG AGA 830 Tyr Tyr Val Arg Tyr Val Leu Gly Glu Val Ala Tyr Gly Glu Lys Arg 240 245 250 255
TAC AGA GAA GCG ATC AAG TAT TAC AAA GAG AGC GCT CTT TTA AAC AAA 878 Tyr Arg Glu Ala He Lys Tyr Tyr Lys Glu Ser Ala Leu Leu Asn Lys 260 265 270
AAA GCG TCT TAC ATG CCT GTG CTT TTG TGG CAT ACG GCA TGG TCG TTT 926 Lys Ala Ser Tyr Met Pro Val Leu Leu Trp His Thr Ala Trp Ser Phe 275 280 285
AAA AAA ATC AAA GAC GAT CAA AAC TAT TAT AAA TTT TTA AAC ACT TTG 974 Lys Lys He Lys Asp Asp Gin Asn Tyr Tyr Lys Phe Leu Asn Thr Leu 290 295 300
CAA CAC TTG TAT CCT TCA AGC GAA CAA GCT AAA ATG GCG CAA AAA ATC 1022 Gin His Leu Tyr Pro Ser Ser Glu Gin Ala Lys Met Ala Gin Lys He 305 310 315
TTA GAA AAC AAG GAG AAA CAC CAC CAT GCA AAA CCA TGATTTAGAG TCAATC 1074 Leu Glu Asn Lys Glu Lys His His His Ala Lys Pro 320 325 330
AAACAAGCCG CTTTGATTGA ATATGAAGTG AGAGAACA 1112
(2) INFORMATION FOR SEQ ID NO: 304:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 331 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:304:
Met Lys Arg Leu Phe Phe He Pro Phe He Ala Pro Phe Phe Leu Asn
1 5 10 15
Gly Glu Pro Ser Ala Phe Asp Leu Gin Ser Gly Ala Thr Lys Lys Glu
20 25 30
Leu Lys Gin Leu Gin He Asn Ser Lys Asn Phe Ser Asn He Leu Thr
35 40 45
Lys He His Ser Gin Val Glu Ala Asn Thr Gin Ala Gin Glu Gly Leu
50 55 60
Arg Ser Val Tyr Glu Gly Gin Ala Asn Lys He Lys Asp Leu Asn Asn 65 70 75 80
Ala He Leu Ser Gin Glu Glu Ser Leu Arg Ala Leu Lys Ala Ser Gin
85 90 95
Glu Val Gin Ala Asn Thr Leu Lys Gin Gin Ser Gin Thr Leu Glu Asp
100 105 110
Leu Arg Asn Glu He His Ala Asn Gin Gin Ala He Gin Gin Leu Asp 115 120 125 Lys Gin Asn Lys Glu Met Ser Glu Leu Leu Thr Lys Leu Ser Gin Asp
130 135 140
Leu Val Ser Gin He Ala Leu He Gin Lys Ala Leu Lys Glu Gin Glu 145 150 155 160
Glu Lys Ala Glu Lys Pro Leu Lys Ser Asn Ala Pro Ala Asn Lys Thr
165 170 175
Pro Ser Leu Lys Ala Glu Ser Pro Lys Asn Gin Glu Gly Lys Thr Gin
180 185 190
Glu Lys Ala Lys He Glu Phe Asp Lys Asp Leu Ser Lys Gin Lys Glu
195 200 205
He Phe Gin Glu Ala Leu Ser Phe Phe Lys Asn Lys Ser Tyr Ala Glu
210 215 220
Ala Lys Glu Arg Leu Leu Trp Leu Glu Ala Asn Ser Tyr Arg Leu Tyr 225 230 235 240
Tyr Val Arg Tyr Val Leu Gly Glu Val Ala Tyr Gly Glu Lys Arg Tyr
245 250 255
Arg Glu Ala He Lys Tyr Tyr Lys Glu Ser Ala Leu Leu Asn Lys Lys
260 265 270
Ala Ser Tyr Met Pro Val Leu Leu Trp His Thr Ala Trp Ser Phe Lys
275 280 285
Lys He Lys Asp Asp Gin Asn Tyr Tyr Lys Phe Leu Asn Thr Leu Gin
290 295 300
His Leu Tyr Pro Ser Ser Glu Gin Ala Lys Met Ala Gin Lys He Leu 305 310 315 320
Glu Asn Lys Glu Lys His His His Ala Lys Pro 325 330
(2) INFORMATION FOR SEQ ID NO: 305:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 531 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...483 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305:
AACAAATGCG AGTTTCAAAT ATTTTGTAGG ATTTTAGGAA AGAAATAGGT T ATG AAT 57
Met Asn 1
ATA TCG GTT AAC CCC TAT TTA ATG GCG GTC GTT TTT GTG GTG TTT GTG 105 He Ser Val Asn Pro Tyr Leu Met Ala Val Val Phe Val Val Phe Val 5 10 15 TTA TTG TTA TGG GCG ATG AAT GTT TGG GTG TAT AGG CCT TTG TTG GCT 153 Leu Leu Leu Trp Ala Met Asn Val Trp Val Tyr Arg Pro Leu Leu Ala 20 25 30
TTT ATG GAT AAC AGA CAG GCA GAG ATA AAG GAT AGC TTG GCT AAA ATC 201 Phe Met Asp Asn Arg Gin Ala Glu He Lys Asp Ser Leu Ala Lys He 35 40 45 50
AAA ACG GAT AAT GCC CAA AGT GTG GAG ATT GGC CAT CAA ATT GAG GCT 249 Lys Thr Asp Asn Ala Gin Ser Val Glu He Gly His Gin He Glu Ala 55 60 65
CTT CTT AAA GAA GCG GCT GAA AAA CGC AGA GAA ATA ATA GCA GAA GCG 297 Leu Leu Lys Glu Ala Ala Glu Lys Arg Arg Glu He He Ala Glu Ala 70 75 80
ATT CAA AAA GCC ACA GAG TCC TAT GAC GCT GTG ATC AAG CAA AAA GAG 345 He Gin Lys Ala Thr Glu Ser Tyr Asp Ala Val He Lys Gin Lys Glu 85 90 95
AAC GAA CTC AAT CAA GAG TTT GAA GCG TTT GCG AAG CAA TTA CAA AAT 393 Asn Glu Leu Asn Gin Glu Phe Glu Ala Phe Ala Lys Gin Leu Gin Asn 100 105 110
GAA AAG CAA GCG CTA AAA GAG CAG TTG CAA GCG CAA ATG CCG GTA TTT 441 Glu Lys Gin Ala Leu Lys Glu Gin Leu Gin Ala Gin Met Pro Val Phe 115 120 125 130
GAA GAC GAG TTA AAC AAG CGT GTG GCT ATG GGT TTA GGG AGT TGATGAATG 492 Glu Asp Glu Leu Asn Lys Arg Val Ala Met Gly Leu Gly Ser 135 140
TTTGTAGTTA AAATGGTGTT AGGGTTTTTG ATCCTTTTA 531
(2) INFORMATION FOR SEQ ID NO: 306:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 144 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306:
Met Asn He Ser Val Asn Pro Tyr Leu Met Ala Val Val Phe Val Val
1 5 10 15
Phe Val Leu Leu Leu Trp Ala Met Asn Val Trp Val Tyr Arg Pro Leu
20 25 30
Leu Ala Phe Met Asp Asn Arg Gin Ala Glu He Lys Asp Ser Leu Ala
35 40 45
Lys He Lys Thr Asp Asn Ala Gin Ser Val Glu He Gly His Gin He 50 55 60 Glu Ala Leu Leu Lys Glu Ala Ala Glu Lys Arg Arg Glu He He Ala 65 70 75 80
Glu Ala He Gin Lys Ala Thr Glu Ser Tyr Asp Ala Val He Lys Gin
85 90 95
Lys Glu Asn Glu Leu Asn Gin Glu Phe Glu Ala Phe Ala Lys Gin Leu
100 105 110
Gin Asn Glu Lys Gin Ala Leu Lys Glu Gin Leu Gin Ala Gin Met Pro
115 120 125
Val Phe Glu Asp Glu Leu Asn Lys Arg Val Ala Met Gly Leu Gly Ser 130 135 140
(2) INFORMATION FOR SEQ ID NO: 307:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5832 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 387...5777 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307:
AAGCTTGATA TGGAGCATTT TGATCGCTTG ACCATGCTCA ATAGAGAAGA ATTGTTGCGC 60
GTTACTCGCT CCTTTCTCAA GCGATTTTAG AAGAGCCTTT CAGCCATAAC GGCAAGGATT 120
ATAAAGAAGG CGATCAAATC CCTAAAGAAG AAATCGCTTC AATCAACCGC TTCACTTTGG 180
CTAGTTTGGT CAAAAAGTAT TCTAAAGAAG TGCAAAACCA CTATGAAATC ACTAAAAACA 240
ATTTCTTAGA GCAAAAGAAA GTTTTGGGCG AAGAGCATGA AGAAAAGCTT TCTATTTTAG 300
AAAAAGATGA TATTTTGCCT AATGGCGTGA TCAAAAAAGT CAAGCTCTAT ATCGCTACAA 360
AACGAAAGCT TAAAGTGGGC GATAAA ATG GCA GGA AGG CAT GGG AAT AAA GGG 413
Met Ala Gly Arg His Gly Asn Lys Gly 1 5
ATT GTG TCT AAT ATC GTG CCG GTT GCG GAT ATG CCT TAT ACC GCT GAT 461 He Val Ser Asn He Val Pro Val Ala Asp Met Pro Tyr Thr Ala Asp 10 15 20 25
GGC GAG CCT GTA GAT ATT GTT TTA AAC CCT TTA GGC GTG CCA AGC CGC 509 Gly Glu Pro Val Asp He Val Leu Asn Pro Leu Gly Val Pro Ser Arg 30 35 40
ATG AAT ATC GGG CAG ATT TTA GAA ATG CAT TTA GGC TTA GTG GGG AAA 557 Met Asn He Gly Gin He Leu Glu Met His Leu Gly Leu Val Gly Lys 45 50 55
GAA TTT GGG AAG CAA ATC GCT CGC ATG CTA GAG GAT AAA ACC AAA GAT 605 Glu Phe Gly Lys Gin He Ala Arg Met Leu Glu Asp Lys Thr Lys Asp 60 65 70
TTT GCC AAA GAA TTG CGT GCT AAA ATG CTA GAA AWC GCT AAC GCT ATT 653 Phe Ala Lys Glu Leu Arg Ala Lys Met Leu Glu Xaa Ala Asn Ala He 75 80 85
AAT GAA AAA GAC CCC TTG ACA ATC CAT GCG CTT GAG AAT TGT TCT GAT 701 Asn Glu Lys Asp Pro Leu Thr He His Ala Leu Glu Asn Cys Ser Asp 90 95 100 105
GAA GAG CTT TTG GAA TAC GCA AAA GAT TGG AGC AAG GGC GTT AAG ATG 749 Glu Glu Leu Leu Glu Tyr Ala Lys Asp Trp Ser Lys Gly Val Lys Met 110 115 120
GCT ATC CCT GTG TTT GAA GGC ATC TCG CAA GAA AAA TTT TAT AAG CTA 797 Ala He Pro Val Phe Glu Gly He Ser Gin Glu Lys Phe Tyr Lys Leu 125 130 135
TTT GAA TTA GCT AAG ATC GCT ATG GAT GGC AAA ATG GAT CTG TAT GAC 845 Phe Glu Leu Ala Lys He Ala Met Asp Gly Lys Met Asp Leu Tyr Asp 140 145 150
GGA CGC ACA GGC GAG AAA ATG AGG GAG CGC GTG AAT GTG GGC TAC ATG 893 Gly Arg Thr Gly Glu Lys Met Arg Glu Arg Val Asn Val Gly Tyr Met 155 160 165
TAT ATG ATC AAA CTC CAC CAT TTA GTG GAT GAA AAA GTC CAT GCC AGA 941 Tyr Met He Lys Leu His His Leu Val Asp Glu Lys Val His Ala Arg 170 175 180 185
AGC ACA GGC CCT TAT AGC TTA GTA ACG CAC CAG CCC GTG GGG GGT AAA 989 Ser Thr Gly Pro Tyr Ser Leu Val Thr His Gin Pro Val Gly Gly Lys 190 195 200
GCG CTC TTT GGG GGT CAA AGG TTT GGG GAA ATG GAA GTG TGG GCC TTG 1037 Ala Leu Phe Gly Gly Gin Arg Phe Gly Glu Met Glu Val Trp Ala Leu 205 210 215
GAA GCT TAT GGC GCA GCG CAC ACT CTA AAA GAA ATG CTC ACC ATT AAA 1085 Glu Ala Tyr Gly Ala Ala His Thr Leu Lys Glu Met Leu Thr He Lys 220 225 230
TCC GAT GAT ATT AGA GGC AGA GAG AAC GCT TAT AGG GCT ATC GCT AAA 1133 Ser Asp Asp He Arg Gly Arg Glu Asn Ala Tyr Arg Ala He Ala Lys 235 240 245
GGT GAG CAA GTG GGC GAG AGT GAA ATC CCT GAG ACT TTC TAT GTT TTG 1181 Gly Glu Gin Val Gly Glu Ser Glu He Pro Glu Thr Phe Tyr Val Leu 250 255 260 265
ACT AAA GAA TTG CAA TCG CTC GCT TTG GAT ATT AAT ATT TTT GGG GAC 1229 Thr Lys Glu Leu Gin Ser Leu Ala Leu Asp He Asn He Phe Gly Asp 270 275 280
GAT GTG GAT GAG GAT GGA GCA CCT AAA CCC ATT GTC ATT AAA GAA GAT 1277 Asp Val Asp Glu Asp Gly Ala Pro Lys Pro He Val He Lys Glu Asp 285 290 295
GAC AGG CCT AAA GAC TTT AGC TCT TTC CAG CTC ACA CTA GCT AGC CCT 1325 Asp Arg Pro Lys Asp Phe Ser Ser Phe Gin Leu Thr Leu Ala Ser Pro 300 305 310
GAA AAA ATC CAT TCT TGG AGT TAT GGG GAA GTT AAA AAG CCA GAA ACG 1373 Glu Lys He His Ser Trp Ser Tyr Gly Glu Val Lys Lys Pro Glu Thr 315 320 325
ATC AAT TAT CGC ACC CTA AAA CCT GAA CGA GAC GGC TTG TTT TGC ATG 1421 He Asn Tyr Arg Thr Leu Lys Pro Glu Arg Asp Gly Leu Phe Cys Met 330 335 340 345
AAA ATC TTT GGC CCC ACT AAA GAT TAT GAA TGC TTG TGC GGC AAA TAC 1469 Lys He Phe Gly Pro Thr Lys Asp Tyr Glu Cys Leu Cys Gly Lys Tyr 350 355 360
AAA AAG CCT CGC TTC AAA GAC ATT GGC ACA TGC GAA AAA TGC GGC GTG 1517 Lys Lys Pro Arg Phe Lys Asp He Gly Thr Cys Glu Lys Cys Gly Val 365 370 375
GCG ATC ACG CAC TCC AAA GTC AGG CGT TTT AGA ATG GGG CAT ATT GAA 1565 Ala He Thr His Ser Lys Val Arg Arg Phe Arg Met Gly His He Glu 380 385 390
TTG GCC ACT CCT GTA GCG CAT ATC TGG TAT GTT AAT TCC TTG CCT AGC 1613 Leu Ala Thr Pro Val Ala His He Trp Tyr Val Asn Ser Leu Pro Ser 395 400 405
CGT ATC GGC ACG CTT TTA GGC GTT AAG ATG AAA GAC TTA GAG CGC GTG 1661 Arg He Gly Thr Leu Leu Gly Val Lys Met Lys Asp Leu Glu Arg Val 410 415 420 425
TTG TAT TAT GAA GCT TAT ATC GTT AAA GAA CCA GGC GAA GCC GCT TAT 1709 Leu Tyr Tyr Glu Ala Tyr He Val Lys Glu Pro Gly Glu Ala Ala Tyr 430 435 440
GAC AAT GAA GGC ACT AAG CTT GTG ATG AAA TAC GAT ATT TTG AAT GAA 1757 Asp Asn Glu Gly Thr Lys Leu Val Met Lys Tyr Asp He Leu Asn Glu 445 450 455
GAG CAG TAT CAA AAT ATC TCA CGA AGA TAC GAA GAC AGG GGC TTT GTA 1805 Glu Gin Tyr Gin Asn He Ser Arg Arg Tyr Glu Asp Arg Gly Phe Val 460 465 470
GCG CAA ATG GGC GGT GAA GCG ATC AAG GAT TTG TTA GAA GAA ATT GAT 1853 Ala Gin Met Gly Gly Glu Ala He Lys Asp Leu Leu Glu Glu He Asp 475 480 485
TTG ATC ACC TTA TTG CAG AGT TTG AAA GAA GAA GTG AAA GAC ACC AAT 1901 Leu He Thr Leu Leu Gin Ser Leu Lys Glu Glu Val Lys Asp Thr Asn 490 495 500 505 TCT GAT GCG AAA AAG AAA AAA CTC ATT AAG CGT TTG AAA GTG GTA GAA 1949 Ser Asp Ala Lys Lys Lys Lys Leu He Lys Arg Leu Lys Val Val Glu 510 515 520
AGC TTT TTA AAT TCT GGT AAT AGG CCT GAA TGG ATG ATG CTC ACG GTT 1997 Ser Phe Leu Asn Ser Gly Asn Arg Pro Glu Trp Met Met Leu Thr Val 525 530 535
TTA CCG GTA TTG CCA CCG GAT TTA AGG CCT TTA GTC GCG CTA GAT GGC 2045 Leu Pro Val Leu Pro Pro Asp Leu Arg Pro Leu Val Ala Leu Asp Gly 540 545 550
GGG AAG TTT GCA GTC AGC GAT GTG AAT GAA TTG TAT CGT CGT GTC ATC 2093 Gly Lys Phe Ala Val Ser Asp Val Asn Glu Leu Tyr Arg Arg Val He 555 560 565
AAT CGT AAC CAA CGC TTG AAA CGC TTA ATG GAG CTT GGA GCG CCA GAA 2141 Asn Arg Asn Gin Arg Leu Lys Arg Leu Met Glu Leu Gly Ala Pro Glu 570 575 580 585
ATC ATT GTG CGC AAT GAA AAA AGG ATG TTG CAA GAA GCC GTG GAT GTG 2189 He He Val Arg Asn Glu Lys Arg Met Leu Gin Glu Ala Val Asp Val 590 595 600
CTT TTT GAT AAC GGC CGC AGC ACT AAT GCG GTT AAA GGG GCT AAC AAA 2237 Leu Phe Asp Asn Gly Arg Ser Thr Asn Ala Val Lys Gly Ala Asn Lys 605 610 615
CGC CCT TTA AAA TCG CTC AGT GAA ATC ATT AAA GGC AAG CAG GGG CGT 2285 Arg Pro Leu Lys Ser Leu Ser Glu He He Lys Gly Lys Gin Gly Arg 620 625 630
TTC AGG CAA AAC CTT TTA GGT AAG CGC GTG GAT TTT TCA GGC AGA AGC 2333 Phe Arg Gin Asn Leu Leu Gly Lys Arg Val Asp Phe Ser Gly Arg Ser 635 640 645
GTG ATT GTG GTT GGG CCT AAT CTC AAA ATG GAT GAA TGC GGG TTG CCT 2381 Val He Val Val Gly Pro Asn Leu Lys Met Asp Glu Cys Gly Leu Pro 650 655 660 665
AAA AAC ATG GCG TTA GAA CTC TTC AAA CCG CAT TTG TTA TCC AAG CTT 2429 Lys Asn Met Ala Leu Glu Leu Phe Lys Pro His Leu Leu Ser Lys Leu 670 675 680
GAA GAG AGA GGC TAT GCC ACC ACG CTC AAA CAG GCT AAA CGC ATG ATT 2477 Glu Glu Arg Gly Tyr Ala Thr Thr Leu Lys Gin Ala Lys Arg Met He 685 690 695
GAG CAA AAA AGC AAT GAA GTA TGG GAG TGC TTG CAA GAA ATC ACA GAG 2525 Glu Gin Lys Ser Asn Glu Val Trp Glu Cys Leu Gin Glu He Thr Glu 700 705 710
GGG TAT CCG GTG CTA CTC AAC CGC GCT CCT ACC TTG CAC AAG CAA TCC 2573 Gly Tyr Pro Val Leu Leu Asn Arg Ala Pro Thr Leu His Lys Gin Ser 715 720 725 ATT CAA GCG TTC CAT CCA AAG CTG ATT GAC GGC AAA GCG ATC CAA TTG 2621 He Gin Ala Phe His Pro Lys Leu He Asp Gly Lys Ala He Gin Leu 730 735 740 745
CAC CCG TTA GTG TGT TCA GCG TTC AAC GCC GAT TTT GAC GGG GAC CAA 2669 His Pro Leu Val Cys Ser Ala Phe Asn Ala Asp Phe Asp Gly Asp Gin 750 755 760
ATG GCG GTG CAT GTG CCT TTA AGC CAG GAA GCG ATC GCT GAA TGC AAG 2717 Met Ala Val His Val Pro Leu Ser Gin Glu Ala He Ala Glu Cys Lys 765 770 775
GTG CTG ATG CTA AGC TCT ATG AAT ATC CTT TTG CCT GCT AGC GGT AAG 2765 Val Leu Met Leu Ser Ser Met Asn He Leu Leu Pro Ala Ser Gly Lys 780 785 790
GCC GTA GCC ATT CCT AGC CAA GAT ATG GTT TTA GGG CTT TAT TAT CTT 2813 Ala Val Ala He Pro Ser Gin Asp Met Val Leu Gly Leu Tyr Tyr Leu 795 800 805
TCT TTA GAA AAG AGC GGG GTC AAG GGC GAG CAT AAG CTT TTT TCT AGC 2861 Ser Leu Glu Lys Ser Gly Val Lys Gly Glu His Lys Leu Phe Ser Ser 810 815 820 825
GTG AAT GAA ATC ATC ACC GCC ATT GAC ACG AAA GAA TTA GAC ATC CAC 2909 Val Asn Glu He He Thr Ala He Asp Thr Lys Glu Leu Asp He His 830 835 840
GCA AAG ATT AGG GTT TTA GAT CAA GGG AAT ATT ATC GCT ACG AGT GCA 2957 Ala Lys He Arg Val Leu Asp Gin Gly Asn He He Ala Thr Ser Ala 845 850 855
GGG CGC ATG ATC ATT AAG TCC ATT TTG CCT GAT TTT ATC CCT ACG GAT 3005 Gly Arg Met He He Lys Ser He Leu Pro Asp Phe He Pro Thr Asp 860 865 870
TTG TGG AAC AGA CCC ATG AAG AAA AAA GAT ATT GGC GTG CTT GTG GAT 3053 Leu Trp Asn Arg Pro Met Lys Lys Lys Asp He Gly Val Leu Val Asp 875 880 885
TAT GTG CAT AAA GTT GGC GGT ATC GGT ATT ACT GCA ACC TTT TTG GAT 3101 Tyr Val His Lys Val Gly Gly He Gly He Thr Ala Thr Phe Leu Asp 890 895 900 905
AAT TTA AAA ACG CTT GGC TTT AGG TAT GCG ACT AAG GCT GGT ATT TCT 3149 Asn Leu Lys Thr Leu Gly Phe Arg Tyr Ala Thr Lys Ala Gly He Ser 910 915 920
ATC TCT ATG GAG GAT ATT ATC ACG CCA AAA GAC AAG CAA AAA ATG GTG 3197 He Ser Met Glu Asp He He Thr Pro Lys Asp Lys Gin Lys Met Val 925 930 935
GAA AAA GCC AAA GTA GAG GTT AAA AAA ATC CAG CAA CAA TAC GAT CAA 3245 Glu Lys Ala Lys Val Glu Val Lys Lys He Gin Gin Gin Tyr Asp Gin 940 945 950 GGG CTG CTC ACT GAC CAA GAG CGT TAC AAT AAG ATC ATT GAC ACT TGG 3293 Gly Leu Leu Thr Asp Gin Glu Arg Tyr Asn Lys He He Asp Thr Trp 955 960 965
ACT GAA GTC AAT GAC AAA ATG AGT AAA GAA ATG ATG ACC GCT ATC GCG 3341 Thr Glu Val Asn Asp Lys Met Ser Lys Glu Met Met Thr Ala He Ala 970 975 980 985
CAA GAT AAA GAG GGC TTT AAC TCT ATT TAT ATG ATG GCA GAT AGC GGC 3389 Gin Asp Lys Glu Gly Phe Asn Ser He Tyr Met Met Ala Asp Ser Gly 990 995 1000
GCA AGG GGT AGC GCG GCG CAA ATC CGT CAG CTT TCA GCG ATG AGG GGT 3437 Ala Arg Gly Ser Ala Ala Gin He Arg Gin Leu Ser Ala Met Arg Gly 1005 1010 1015
CTT ATG ACA AAG CCG GAC GGC AGT ATC ATT GAA ACG CCC ATT ATT TCT 3485 Leu Met Thr Lys Pro Asp Gly Ser He He Glu Thr Pro He He Ser 1020 1025 1030
AAC TTT AAA GAG GGG TTG AAT GTC TTA GAA TAC TTC AAT TCC ACG CAT 3533 Asn Phe Lys Glu Gly Leu Asn Val Leu Glu Tyr Phe Asn Ser Thr His 1035 1040 1045
GGC GCT AGA AAG GGC TTA GCG GAT ACA GCG CTA AAA ACA GCC AAT GCG 3581 Gly Ala Arg Lys Gly Leu Ala Asp Thr Ala Leu Lys Thr Ala Asn Ala 1050 1055 1060 1065
GGG TAT TTG ACC AGA AAG CTC ATT GAT GTT TCG CAA AAT GTC AAG GTG 3629 Gly Tyr Leu Thr Arg Lys Leu He Asp Val Ser Gin Asn Val Lys Val 1070 1075 1080
GTG TCT GAT GAT TGC GGC ACG CAT GAA GGG ATT GAA ATC ACG GAT ATT 3677 Val Ser Asp Asp Cys Gly Thr His Glu Gly He Glu He Thr Asp He 1085 1090 1095
GCG GTG GGG AGT GAG CTG ATT GAA CCT TTA GAA GAG CGT ATT TTT GGG 3725 Ala Val Gly Ser Glu Leu He Glu Pro Leu Glu Glu Arg He Phe Gly 1100 1105 1110
CGC GTT TTA TTA GAA GAT GTG ATC GAT CCC ATT ACG AAT GAA ATC TTG 3773 Arg Val Leu Leu Glu Asp Val He Asp Pro He Thr Asn Glu He Leu 1115 1120 1125
CTT TAT GCG GAC ACT TTG ATT GAT GAA GAG GGT GCT AAA AAG GTG GTT 3821 Leu Tyr Ala Asp Thr Leu He Asp Glu Glu Gly Ala Lys Lys Val Val 1130 1135 1140 1145
GAA GCC GGG ATT AAA TCC ATT ACG ATC CGC ACC CCA GTA ACT TGT AAA 3869 Glu Ala Gly He Lys Ser He Thr He Arg Thr Pro Val Thr Cys Lys 1150 1155 1160
GCG CCA AAG GGC GTG TGC GCG AAA TGC TAT GGC TTG AAT TTG GGC GAA 3917 Ala Pro Lys Gly Val Cys Ala Lys Cys Tyr Gly Leu Asn Leu Gly Glu 1165 1170 1175
GGC AAG ATG AGT TAT CCG GGT GAA GCG GTG GGC GTG GTA GCC GCG CAA 3965 Gly Lys Met Ser Tyr Pro Gly Glu Ala Val Gly Val Val Ala Ala Gin 1180 1185 1190
TCT ATT GGG GAG CCT GGA ACG CAG CTC ACT TTA AGG ACT TTC CAT GTG 4013 Ser He Gly Glu Pro Gly Thr Gin Leu Thr Leu Arg Thr Phe His Val 1195 1200 1205
GGC GGG ACA GCG AGC AGG AGT CAG GAT GAG CGC GAA ATC GTA GCG AGC 4061 Gly Gly Thr Ala Ser Arg Ser Gin Asp Glu Arg Glu He Val Ala Ser 1210 1215 1220 1225
AAA GAA GGT TTT GTG CGT TTT TAC AAC CTT AGG ACT TAC ACG AAT AAA 4109 Lys Glu Gly Phe Val Arg Phe Tyr Asn Leu Arg Thr Tyr Thr Asn Lys 1230 1235 1240
GAG GGT AAA AAC ATT ATC GCT AAC CGC CGT AAC GCT TCT ATT TTA GTG 4157 Glu Gly Lys Asn He He Ala Asn Arg Arg Asn Ala Ser He Leu Val 1245 1250 1255
GTA GAG CCT AAG ATT AAA GCG CCT TTT GAT GGG GAA TTA CGC ATT GAA 4205 Val Glu Pro Lys He Lys Ala Pro Phe Asp Gly Glu Leu Arg He Glu 1260 1265 1270
ACG GTT TAT GAA GAA GTC GTT GTG AGC GTG AAA AAT GGC GAT CAA GAA 4253 Thr Val Tyr Glu Glu Val Val Val Ser Val Lys Asn Gly Asp Gin Glu 1275 1280 1285
GCT AAA TTT GTT TTA AGG AGA AGC GAT ATT GTC AAG CCA AGC GAA TTA 4301 Ala Lys Phe Val Leu Arg Arg Ser Asp He Val Lys Pro Ser Glu Leu 1290 1295 1300 1305
GCC GGC GTT GGC GGT AAG ATT GAG GGG AAA GTG TAT TTG CCT TAT GCT 4349 Ala Gly Val Gly Gly Lys He Glu Gly Lys Val Tyr Leu Pro Tyr Ala 1310 1315 1320
AGT GGG CAT AAG GTG CAT AAG GGG GGA AGT ATC GCT GAT ATT ATC CAA 4397 Ser Gly His Lys Val His Lys Gly Gly Ser He Ala Asp He He Gin 1325 1330 1335
GAG GGC TGG AAT GTG CCT AAT CGC ATC CCT TAT GCG AGC GAA TTG CTA 4445 Glu Gly Trp Asn Val Pro Asn Arg He Pro Tyr Ala Ser Glu Leu Leu 1340 1345 1350
GTC AAG GAT AAT GAC CCT ATT GCG CAA GAT GTG TAT GCC AAA GAA AAA 4493 Val Lys Asp Asn Asp Pro He Ala Gin Asp Val Tyr Ala Lys Glu Lys 1355 1360 1365
GGC GTA ATC AAA TAC TAT GTT TTA GAG GCT AAC CAT TTA GAG CGC ACC 4541 Gly Val He Lys Tyr Tyr Val Leu Glu Ala Asn His Leu Glu Arg Thr 1370 1375 1380 1385
CAT GGG ATC AAA AAG GGC GAT ATG GTG AGT GAA AAA GGC TTG TTT GCG 4589 His Gly He Lys Lys Gly Asp Met Val Ser Glu Lys Gly Leu Phe Ala 1390 1395 1400
GTG ATA GCT GAT GAT AAT GGT AGG GAA GCC GCT CGC CAT TAT ATC GCT 4637 Val He Ala Asp Asp Asn Gly Arg Glu Ala Ala Arg His Tyr He Ala 1405 1410 1415
AGG GGT TCT GAG ATC TTG ATT GAT GAT AAT AGT GAA GTG AGC ACT AAT 4685 Arg Gly Ser Glu He Leu He Asp Asp Asn Ser Glu Val Ser Thr Asn 1420 1425 1430
AGC GTG ATT TCT AAA CCC ACG ACT AAC ACT TTC AAA ACG ATT GCC ACA 4733 Ser Val He Ser Lys Pro Thr Thr Asn Thr Phe Lys Thr He Ala Thr 1435 1440 1445
TGG GAT CCT TAC AAC ACC CCT ATC ATT GCG GAC TTT AAA GGT AAG GTG 4781 Trp Asp Pro Tyr Asn Thr Pro He He Ala Asp Phe Lys Gly Lys Val 1450 1455 1460 1465
GGT TTT GTG GAT GTT ATC GCA GGG GTT ACG GTC GCT GAA AAA GAA GAC 4829 Gly Phe Val Asp Val He Ala Gly Val Thr Val Ala Glu Lys Glu Asp 1470 1475 1480
GAA AAT ACC GGT ATC ACA AGC TTA GTG GTG AAT GAT TAC ATT CCA AGC 4877 Glu Asn Thr Gly He Thr Ser Leu Val Val Asn Asp Tyr He Pro Ser 1485 1490 1495
GGA TAC AAA CCA AGC TTG TTT TTA GAG GGG GCT AAT GGC GAA GAG ATG 4925 Gly Tyr Lys Pro Ser Leu Phe Leu Glu Gly Ala Asn Gly Glu Glu Met 1500 1505 1510
CGT TAT TTC CTA GAG CCA AAA ACC TCT ATC GCC ATT AGC GAT GGC TCT 4973 Arg Tyr Phe Leu Glu Pro Lys Thr Ser He Ala He Ser Asp Gly Ser 1515 1520 1525
AGC GTG GAG CAA GCT GAA GTG TTA GCG AAA ATC CCT AAA GCG ACC GTT 5021 Ser Val Glu Gin Ala Glu Val Leu Ala Lys He Pro Lys Ala Thr Val 1530 1535 1540 1545
AAA TCT AGG GAT ATT ACC GGG GGT CTC CCA AGG GTT TCG GAA CTC TTT 5069 Lys Ser Arg Asp He Thr Gly Gly Leu Pro Arg Val Ser Glu Leu Phe 1550 1555 1560
GAA GCG AGA AAA CCC AAG CCT AAA GAT GTG GCG ATC CTT TCT GAA GTT 5117 Glu Ala Arg Lys Pro Lys Pro Lys Asp Val Ala He Leu Ser Glu Val 1565 1570 1575
GAT GGG ATT GTG AGT TTT GGC AAA CCC ATT CGC AAT AAA GAA CAC ATC 5165 Asp Gly He Val Ser Phe Gly Lys Pro He Arg Asn Lys Glu His He 1580 1585 1590
ATC GTA ACT TCT AAA GAT GGC CGT TCC ATG GAT TAT TTT GTG GAT AAA 5213 He Val Thr Ser Lys Asp Gly Arg Ser Met Asp Tyr Phe Val Asp Lys 1595 1600 1605 GGC AAG CAA ATT TTA GTG CAT GCC GAT GAA TTT GTG CAT GCG GGA GAA 5261 Gly Lys Gin He Leu Val His Ala Asp Glu Phe Val His Ala Gly Glu 1610 1615 1620 1625
GCG ATG ACG GAC GGA GTA ATT TCA AGC CAT GAT ATT TTA AGG ATC AGT 5309 Ala Met Thr Asp Gly Val He Ser Ser His Asp He Leu Arg He Ser 1630 1635 1640
GGC GAA AAA GAG CTT TAT AAA TAC ATT GTG AGC GAA GTC CAG CAA GTG 5357 Gly Glu Lys Glu Leu Tyr Lys Tyr He Val Ser Glu Val Gin Gin Val 1645 1650 1655
TAT CGC AGG CAG GGG GTG AGC ATT GCG GAC AAG CAC ATT GAA ATC ATT 5405 Tyr Arg Arg Gin Gly Val Ser He Ala Asp Lys His He Glu He He 1660 1665 1670
GTT TCT CAA ATG CTA AGA CAG GTG CGT ATT TTA GAC AGC GGG GAT AGC 5453 Val Ser Gin Met Leu Arg Gin Val Arg He Leu Asp Ser Gly Asp Ser 1675 1680 1685
AAG TTT ATT GAA GGG GAT TTA GTC AGT AAA AAA CTT TTC AAA GAA GAA 5501 Lys Phe He Glu Gly Asp Leu Val Ser Lys Lys Leu Phe Lys Glu Glu 1690 1695 1700 1705
AAC GCT CGT GTG ATC GCT TTA AAA GGC GAG CCA GCG ATT GCT GAA CCG 5549 Asn Ala Arg Val He Ala Leu Lys Gly Glu Pro Ala He Ala Glu Pro 1710 1715 1720
GTG CTT TTA GGG ATC ACT AGA GCG GCT ATT GGG AGC GAT AGC ATC ATC 5597 Val Leu Leu Gly He Thr Arg Ala Ala He Gly Ser Asp Ser He He 1725 1730 1735
TCA GCG GCC TCT TTC CAA GAA ACG ACT AAA GTT TTA ACA GAA GCC AGT 5645 Ser Ala Ala Ser Phe Gin Glu Thr Thr Lys Val Leu Thr Glu Ala Ser 1740 1745 1750
ATC GCT ATG AAA AAA GAC TTT TTA GAG GAT TTG AAA GAG AAT GTG GTG 5693 He Ala Met Lys Lys Asp Phe Leu Glu Asp Leu Lys Glu Asn Val Val 1755 1760 1765
TTG GGG AGG ATG ATC CCT GTG GGA ACA GGC ATG TAT AAG AAT AAA AAA 5741 Leu Gly Arg Met He Pro Val Gly Thr Gly Met Tyr Lys Asn Lys Lys 1770 1775 1780 1785
ATC GTG TTA AGA GCG CTT GAG GAT AAC TCT AAA TTT TGATATGAAA AATCGG 5793 He Val Leu Arg Ala Leu Glu Asp Asn Ser Lys Phe 1790 1795
TTAAGATTTT TAAAAGAAAA ATTAGGGTAA AATGGGGGA 5832
(2) INFORMATION FOR SEQ ID NO: 308:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1797 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308:
Met Ala Gly Arg His Gly Asn Lys Gly He Val Ser Asn He Val Pro
1 5 10 15
Val Ala Asp Met Pro Tyr Thr Ala Asp Gly Glu Pro Val Asp He Val
20 25 30
Leu Asn Pro Leu Gly Val Pro Ser Arg Met Asn He Gly Gin He Leu
35 40 45
Glu Met His Leu Gly Leu Val Gly Lys Glu Phe Gly Lys Gin He Ala
50 55 60
Arg Met Leu Glu Asp Lys Thr Lys Asp Phe Ala Lys Glu Leu Arg Ala 65 70 75 80
Lys Met Leu Glu Xaa Ala Asn Ala He Asn Glu Lys Asp Pro Leu Thr
85 90 95
He His Ala Leu Glu Asn Cys Ser Asp Glu Glu Leu Leu Glu Tyr Ala
100 105 110
Lys Asp Trp Ser Lys Gly Val Lys Met Ala He Pro Val Phe Glu Gly
115 120 125
He Ser Gin Glu Lys Phe Tyr Lys Leu Phe Glu Leu Ala Lys He Ala
130 135 140
Met Asp Gly Lys Met Asp Leu Tyr Asp Gly Arg Thr Gly Glu Lys Met 145 150 155 160
Arg Glu Arg Val Asn Val Gly Tyr Met Tyr Met He Lys Leu His His
165 170 175
Leu Val Asp Glu Lys Val His Ala Arg Ser Thr Gly Pro Tyr Ser Leu
180 185 190
Val Thr His Gin Pro Val Gly Gly Lys Ala Leu Phe Gly Gly Gin Arg
195 200 205
Phe Gly Glu Met Glu Val Trp Ala Leu Glu Ala Tyr Gly Ala Ala His
210 215 220
Thr Leu Lys Glu Met Leu Thr He Lys Ser Asp Asp He Arg Gly Arg 225 230 235 240
Glu Asn Ala Tyr Arg Ala He Ala Lys Gly Glu Gin Val Gly Glu Ser
245 250 255
Glu He Pro Glu Thr Phe Tyr Val Leu Thr Lys Glu Leu Gin Ser Leu
260 265 270
Ala Leu Asp He Asn He Phe Gly Asp Asp Val Asp Glu Asp Gly Ala
275 280 285
Pro Lys Pro He Val He Lys Glu Asp Asp Arg Pro Lys Asp Phe Ser
290 295 300
Ser Phe Gin Leu Thr Leu Ala Ser Pro Glu Lys He His Ser Trp Ser 305 310 315 320
Tyr Gly Glu Val Lys Lys Pro Glu Thr He Asn Tyr Arg Thr Leu Lys
325 330 335
Pro Glu Arg Asp Gly Leu Phe Cys Met Lys He Phe Gly Pro Thr Lys
340 345 350
Asp Tyr Glu Cys Leu Cys Gly Lys Tyr Lys Lys Pro Arg Phe Lys Asp
355 360 365
He Gly Thr Cys Glu Lys Cys Gly Val Ala He Thr His Ser Lys Val 370 375 380
Arg Arg Phe Arg Met Gly His He Glu Leu Ala Thr Pro Val Ala His 385 390 395 400
He Trp Tyr Val Asn Ser Leu Pro Ser Arg He Gly Thr Leu Leu Gly
405 410 415
Val Lys Met Lys Asp Leu Glu Arg Val Leu Tyr Tyr Glu Ala Tyr He
420 425 430
Val Lys Glu Pro Gly Glu Ala Ala Tyr Asp Asn Glu Gly Thr Lys Leu
435 440 445
Val Met Lys Tyr Asp He Leu Asn Glu Glu Gin Tyr Gin Asn He Ser
450 455 460
Arg Arg Tyr Glu Asp Arg Gly Phe Val Ala Gin Met Gly Gly Glu Ala 465 470 475 480
He Lys Asp Leu Leu Glu Glu He Asp Leu He Thr Leu Leu Gin Ser
485 490 495
Leu Lys Glu Glu Val Lys Asp Thr Asn Ser Asp Ala Lys Lys Lys Lys
500 505 510
Leu He Lys Arg Leu Lys Val Val Glu Ser Phe Leu Asn Ser Gly Asn
515 520 525
Arg Pro Glu Trp Met Met Leu Thr Val Leu Pro Val Leu Pro Pro Asp
530 535 540
Leu Arg Pro Leu Val Ala Leu Asp Gly Gly Lys Phe Ala Val Ser Asp 545 550 555 560
Val Asn Glu Leu Tyr Arg Arg Val He Asn Arg Asn Gin Arg Leu Lys
565 570 575
Arg Leu Met Glu Leu Gly Ala Pro Glu He He Val Arg Asn Glu Lys
580 585 590
Arg Met Leu Gin Glu Ala Val Asp Val Leu Phe Asp Asn Gly Arg Ser
595 600 605
Thr Asn Ala Val Lys Gly Ala Asn Lys Arg Pro Leu Lys Ser Leu Ser
610 615 620
Glu He He Lys Gly Lys Gin Gly Arg Phe Arg Gin Asn Leu Leu Gly 625 630 635 640
Lys Arg Val Asp Phe Ser Gly Arg Ser Val He Val Val Gly Pro Asn
645 650 655
Leu Lys Met Asp Glu Cys Gly Leu Pro Lys Asn Met Ala Leu Glu Leu
660 665 670
Phe Lys Pro His Leu Leu Ser Lys Leu Glu Glu Arg Gly Tyr Ala Thr
675 680 685
Thr Leu Lys Gin Ala Lys Arg Met He Glu Gin Lys Ser Asn Glu Val
690 695 700
Trp Glu Cys Leu Gin Glu He Thr Glu Gly Tyr Pro Val Leu Leu Asn 705 710 715 720
Arg Ala Pro Thr Leu His Lys Gin Ser He Gin Ala Phe His Pro Lys
725 730 735
Leu He Asp Gly Lys Ala He Gin Leu His Pro Leu Val Cys Ser Ala
740 745 750
Phe Asn Ala Asp Phe Asp Gly Asp Gin Met Ala Val His Val Pro Leu
755 760 765
Ser Gin Glu Ala He Ala Glu Cys Lys Val Leu Met Leu Ser Ser Met
770 775 780
Asn He Leu Leu Pro Ala Ser Gly Lys Ala Val Ala He Pro Ser Gin 785 790 795 800
Asp Met Val Leu Gly Leu Tyr Tyr Leu Ser Leu Glu Lys Ser Gly Val 805 810 815 Lys Gly Glu His Lys Leu Phe Ser Ser Val Asn Glu He He Thr Ala
820 825 830
He Asp Thr Lys Glu Leu Asp He His Ala Lys He Arg Val Leu Asp
835 840 845
Gin Gly Asn He He Ala Thr Ser Ala Gly Arg Met He He Lys Ser
850 855 860
He Leu Pro Asp Phe He Pro Thr Asp Leu Trp Asn Arg Pro Met Lys 865 870 875 880
Lys Lys Asp He Gly Val Leu Val Asp Tyr Val His Lys Val Gly Gly
885 890 895
He Gly He Thr Ala Thr Phe Leu Asp Asn Leu Lys Thr Leu Gly Phe
900 905 910
Arg Tyr Ala Thr Lys Ala Gly He Ser He Ser Met Glu Asp He He
915 920 925
Thr Pro Lys Asp Lys Gin Lys Met Val Glu Lys Ala Lys Val Glu Val
930 935 940
Lys Lys He Gin Gin Gin Tyr Asp Gin Gly Leu Leu Thr Asp Gin Glu 945 950 955 960
Arg Tyr Asn Lys He He Asp Thr Trp Thr Glu Val Asn Asp Lys Met
965 970 975
Ser Lys Glu Met Met Thr Ala He Ala Gin Asp Lys Glu Gly Phe Asn
980 985 990
Ser He Tyr Met Met Ala Asp Ser Gly Ala Arg Gly Ser Ala Ala Gin
995 1000 1005
He Arg Gin Leu Ser Ala Met Arg Gly Leu Met Thr Lys Pro Asp Gly
1010 1015 1020
Ser He He Glu Thr Pro He He Ser Asn Phe Lys Glu Gly Leu Asn 025 1030 1035 1040
Val Leu Glu Tyr Phe Asn Ser Thr His Gly Ala Arg Lys Gly Leu Ala
1045 1050 1055
Asp Thr Ala Leu Lys Thr Ala Asn Ala Gly Tyr Leu Thr Arg Lys Leu
1060 1065 1070
He Asp Val Ser Gin Asn Val Lys Val Val Ser Asp Asp Cys Gly Thr
1075 1080 1085
His Glu Gly He Glu He Thr Asp He Ala Val Gly Ser Glu Leu He
1090 1095 1100
Glu Pro Leu Glu Glu Arg He Phe Gly Arg Val Leu Leu Glu Asp Val 105 1110 1115 1120
He Asp Pro He Thr Asn Glu He Leu Leu Tyr Ala Asp Thr Leu He
1125 1130 1135
Asp Glu Glu Gly Ala Lys Lys Val Val Glu Ala Gly He Lys Ser He
1140 1145 1150
Thr He Arg Thr Pro Val Thr Cys Lys Ala Pro Lys Gly Val Cys Ala
1155 1160 1165
Lys Cys Tyr Gly Leu Asn Leu Gly Glu Gly Lys Met Ser Tyr Pro Gly
1170 1175 1180
Glu Ala Val Gly Val Val Ala Ala Gin Ser He Gly Glu Pro Gly Thr 185 1190 1195 1200
Gin Leu Thr Leu Arg Thr Phe His Val Gly Gly Thr Ala Ser Arg Ser
1205 1210 1215
Gin Asp Glu Arg Glu He Val Ala Ser Lys Glu Gly Phe Val Arg Phe
1220 1225 1230
Tyr Asn Leu Arg Thr Tyr Thr Asn Lys Glu Gly Lys Asn He He Ala
1235 1240 1245
Asn Arg Arg Asn Ala Ser He Leu Val Val Glu Pro Lys He Lys Ala 1250 1255 1260
Pro Phe Asp Gly Glu Leu Arg He Glu Thr Val Tyr Glu Glu Val Val 265 1270 1275 1280
Val Ser Val Lys Asn Gly Asp Gin Glu Ala Lys Phe Val Leu Arg Arg
1285 1290 1295
Ser Asp He Val Lys Pro Ser Glu Leu Ala Gly Val Gly Gly Lys He
1300 1305 1310
Glu Gly Lys Val Tyr Leu Pro Tyr Ala Ser Gly His Lys Val His Lys
1315 1320 1325
Gly Gly Ser He Ala Asp He He Gin Glu Gly Trp Asn Val Pro Asn
1330 1335 1340
Arg He Pro Tyr Ala Ser Glu Leu Leu Val Lys Asp Asn Asp Pro He 345 1350 1355 1360
Ala Gin Asp Val Tyr Ala Lys Glu Lys Gly Val He Lys Tyr Tyr Val
1365 1370 1375
Leu Glu Ala Asn His Leu Glu Arg Thr His Gly He Lys Lys Gly Asp
1380 1385 1390
Met Val Ser Glu Lys Gly Leu Phe Ala Val He Ala Asp Asp Asn Gly
1395 1400 1405
Arg Glu Ala Ala Arg His Tyr He Ala Arg Gly Ser Glu He Leu He
1410 1415 1420
Asp Asp Asn Ser Glu Val Ser Thr Asn Ser Val He Ser Lys Pro Thr 425 1430 1435 1440
Thr Asn Thr Phe Lys Thr He Ala Thr Trp Asp Pro Tyr Asn Thr Pro
1445 1450 1455
He He Ala Asp Phe Lys Gly Lys Val Gly Phe Val Asp Val He Ala
1460 1465 1470
Gly Val Thr Val Ala Glu Lys Glu Asp Glu Asn Thr Gly He Thr Ser
1475 1480 1485
Leu Val Val Asn Asp Tyr He Pro Ser Gly Tyr Lys Pro Ser Leu Phe
1490 1495 1500
Leu Glu Gly Ala Asn Gly Glu Glu Met Arg Tyr Phe Leu Glu Pro Lys 505 1510 1515 1520
Thr Ser He Ala He Ser Asp Gly Ser Ser Val Glu Gin Ala Glu Val
1525 1530 1535
Leu Ala Lys He Pro Lys Ala Thr Val Lys Ser Arg Asp He Thr Gly
1540 1545 1550
Gly Leu Pro Arg Val Ser Glu Leu Phe Glu Ala Arg Lys Pro Lys Pro
1555 1560 1565
Lys Asp Val Ala He Leu Ser Glu Val Asp Gly He Val Ser Phe Gly
1570 1575 1580
Lys Pro He Arg Asn Lys Glu His He He Val Thr Ser Lys Asp Gly 585 1590 1595 1600
Arg Ser Met Asp Tyr Phe Val Asp Lys Gly Lys Gin He Leu Val His
1605 1610 1615
Ala Asp Glu Phe Val His Ala Gly Glu Ala Met Thr Asp Gly Val He
1620 1625 1630
Ser Ser His Asp He Leu Arg He Ser Gly Glu Lys Glu Leu Tyr Lys
1635 1640 1645
Tyr He Val Ser Glu Val Gin Gin Val Tyr Arg Arg Gin Gly Val Ser
1650 1655 1660
He Ala Asp Lys His He Glu He He Val Ser Gin Met Leu Arg Gin 665 1670 1675 1680
Val Arg He Leu Asp Ser Gly Asp Ser Lys Phe He Glu Gly Asp Leu 1685 1690 1695 " Val Ser Lys Lys Leu Phe Lys Glu Glu Asn Ala Arg Val He Ala Leu
1700 1705 1710
Lys Gly Glu Pro Ala He Ala Glu Pro Val Leu Leu Gly He Thr Arg
1715 1720 1725
Ala Ala He Gly Ser Asp Ser He He Ser Ala Ala Ser Phe Gin Glu
1730 1735 1740
Thr Thr Lys Val Leu Thr Glu Ala Ser He Ala Met Lys Lys Asp Phe 745 1750 1755 1760
Leu Glu Asp Leu Lys Glu Asn Val Val Leu Gly Arg Met He Pro Val
1765 1770 1775
Gly Thr Gly Met Tyr Lys Asn Lys Lys He Val Leu Arg Ala Leu Glu
1780 1785 1790
Asp Asn Ser Lys Phe 1795
(2) INFORMATION FOR SEQ ID NO: 309:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 690 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 65...640 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309:
TAAGCGATTT GCTCTGTGTG GTGATTGACC CTAGGATTGA TTTTGAAAAG CGTTGAGGGT 60 AGGA ATG AAA ACT GAG ATG AAA TCT TCT TTA AAA CTT TTT ATG CGG CCT 109 Met Lys Thr Glu Met Lys Ser Ser Leu Lys Leu Phe Met Arg Pro 1 5 10 15
TTG TTG GTG GTT TTA GCG TTC ATG TTG TTG TAT GCT TTA GTG CAT GCT 157 Leu Leu Val Val Leu Ala Phe Met Leu Leu Tyr Ala Leu Val His Ala 20 25 30
GCG CTT GGT TTT TAT GTA AAA AAA GAC AGC GCT CCA ATA AGC CCA AAT 205 Ala Leu Gly Phe Tyr Val Lys Lys Asp Ser Ala Pro He Ser Pro Asn 35 40 45
GTA GAA AAA ACC GAG ACA GAG CGT CAA AAC GGC GTG CTT TCG CCC AAA 253 Val Glu Lys Thr Glu Thr Glu Arg Gin Asn Gly Val Leu Ser Pro Lys 50 55 60
CAA GAA GAA GCC AAC GCA ACC ACA ACT GCC ACA GAA GAA AGC CCC ACC 301 Gin Glu Glu Ala Asn Ala Thr Thr Thr Ala Thr Glu Glu Ser Pro Thr 65 70 75 AAA GAC ACA GCG CCG CCT TTA GAC ACA GCC GCG CAA AAA CAA GAA ACT 349 Lys Asp Thr Ala Pro Pro Leu Asp Thr Ala Ala Gin Lys Gin Glu Thr 80 85 90 95
AAA CAA GAG CAA GAA AAA GAA AAC GAG CCT AAA CAA GAT AGC GTC CCG 397 Lys Gin Glu Gin Glu Lys Glu Asn Glu Pro Lys Gin Asp Ser Val Pro 100 105 110
CCC GTT CAA AAC AAT CAA AAA ACC CCT ACA ACC CCC TTA ATG GGA AAA 445 Pro Val Gin Asn Asn Gin Lys Thr Pro Thr Thr Pro Leu Met Gly Lys 115 120 125
AAA CCT TTA GAG TAT AAA GTC GCA GTC AGT GGC GTG AAT GTG CGC GCT 493 Lys Pro Leu Glu Tyr Lys Val Ala Val Ser Gly Val Asn Val Arg Ala 130 135 140
TTT CCC AGC ACA AAA GGT AAA ATC TTG GGA TTG CTT TTA AAA AAT AAA 541 Phe Pro Ser Thr Lys Gly Lys He Leu Gly Leu Leu Leu Lys Asn Lys 145 150 155
AGC GTG AAA GTT TTA GAA ATC CAA AAC GAT TGG GCT GAA ATT GAA TTT 589 Ser Val Lys Val Leu Glu He Gin Asn Asp Trp Ala Glu He Glu Phe 160 165 170 175
TCT CAC GAA ACA AAG GGC TAT GTG TTT TTA AAA CTT TTA AAA AAG GCT 637 Ser His Glu Thr Lys Gly Tyr Val Phe Leu Lys Leu Leu Lys Lys Ala 180 185 190
GAA TGAAAGAATA ATGAAATTAA AATCTTTTGG GGTTTTTGGA AATCCCATTA 690
Glu
(2) INFORMATION FOR SEQ ID NO: 310:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 192 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310:
Met Lys Thr Glu Met Lys Ser Ser Leu Lys Leu Phe Met Arg Pro Leu
1 5 10 15
Leu Val Val Leu Ala Phe Met Leu Leu Tyr Ala Leu Val His Ala Ala
20 25 30
Leu Gly Phe Tyr Val Lys Lys Asp Ser Ala Pro He Ser Pro Asn Val
35 40 45
Glu Lys Thr Glu Thr Glu Arg Gin Asn Gly Val Leu Ser Pro Lys Gin
50 55 60
Glu Glu Ala Asn Ala Thr Thr Thr Ala Thr Glu Glu Ser Pro Thr Lys 65 70 75 80
Asp Thr Ala Pro Pro Leu Asp Thr Ala Ala Gin Lys Gin Glu Thr Lys
85 90 95
Gin Glu Gin Glu Lys Glu Asn Glu Pro Lys Gin Asp Ser Val Pro Pro
100 105 110
Val Gin Asn Asn Gin Lys Thr Pro Thr Thr Pro Leu Met Gly Lys Lys
115 120 125
Pro Leu Glu Tyr Lys Val Ala Val Ser Gly Val Asn Val Arg Ala Phe
130 135 140
Pro Ser Thr Lys Gly Lys He Leu Gly Leu Leu Leu Lys Asn Lys Ser 145 150 155 160
Val Lys Val Leu Glu He Gin Asn Asp Trp Ala Glu He Glu Phe Ser
165 170 175
His Glu Thr Lys Gly Tyr Val Phe Leu Lys Leu Leu Lys Lys Ala Glu 180 185 190
(2) INFORMATION FOR SEQ ID NO: 311:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1550 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 66...1502 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311:
TTATGACTTT TACTAAACCT TTTTTTAAGC TATAATCCAA AAATCTAAAA TAAAAAGGAA 60
TAAGC ATG AAA AAA TCC CTT TGT CTG TCT TTC TTT CTG ACT TTC TCT AAC 110
Met Lys Lys Ser Leu Cys Leu Ser Phe Phe Leu Thr Phe Ser Asn
1 5 10 15
CCT CTT CAA GCC CTT GTG ATC GAG CTT TTA GAA GAA ATC AAA ACT TCG 158 Pro Leu Gin Ala Leu Val He Glu Leu Leu Glu Glu He Lys Thr Ser 20 25 30
CCG CAT AAA GGC ACT TTT AAG GCT AAA GTC CTT GAT TCT AAA AAA CCA 206 Pro His Lys Gly Thr Phe Lys Ala Lys Val Leu Asp Ser Lys Lys Pro 35 40 45
AGA CAA GTT TTA GGC GTT TAT AAT ATC TCC CCA CAC AAA AAA CTC ACG 254 Arg Gin Val Leu Gly Val Tyr Asn He Ser Pro His Lys Lys Leu Thr 50 55 60
CTC ACT ATC ACC CAC ATA TCC ACT GCA ATC GTC TAT CAA CCC CTT GAT 302 Leu Thr He Thr His He Ser Thr Ala He Val Tyr Gin Pro Leu Asp 65 70 75
GAA AAA CTT TCT TTA GAA ACA ACC TTA AAC CCT AAC CGC CCT ACT ATC 350 Glu Lys Leu Ser Leu Glu Thr Thr Leu Asn Pro Asn Arg Pro Thr He 80 85 90 95
CCT AGA AAC ACC CAG ATT GTT TTT TCT TCA AAA GAA TTG AAA GAG TCG 398 Pro Arg Asn Thr Gin He Val Phe Ser Ser Lys Glu Leu Lys Glu Ser 100 105 110
CAC CCG CAC CAA ATG CCT TCT TTA AAC GCG CCC ATG CAA AAA CCA CAA 446 His Pro His Gin Met Pro Ser Leu Asn Ala Pro Met Gin Lys Pro Gin 115 120 125
AAC AAA CCC CAT TCA TCG CAA CAA CCT TCT CAA AAC TTT TCT TAC CCA 494 Asn Lys Pro His Ser Ser Gin Gin Pro Ser Gin Asn Phe Ser Tyr Pro 130 135 140
GAG CCC AAA CTA GGC TCT AAA AAC TCT AAA AAC AGC CTT TTA CAG CCT 542 Glu Pro Lys Leu Gly Ser Lys Asn Ser Lys Asn Ser Leu Leu Gin Pro 145 150 155
TTA GCA ATT CCT AGC AAA ATA AGT CCC ACT AAC GAA ACT CAA ACG CCA 590 Leu Ala He Pro Ser Lys He Ser Pro Thr Asn Glu Thr Gin Thr Pro 160 165 170 175
ACA AAC GAC ACT AAA CCC CCT TTA AAG CAT TCT TCA GAA GAT CAA GAA 638 Thr Asn Asp Thr Lys Pro Pro Leu Lys His Ser Ser Glu Asp Gin Glu 180 185 190
AGC AAC CTC TTT ATA ACG CCA CCC ACT GAA AAA ACG CTC CCT AAC AAC 686 Ser Asn Leu Phe He Thr Pro Pro Thr Glu Lys Thr Leu Pro Asn Asn 195 200 205
ACC TCT AAC GCT GAT ATT AGT GAA AAC AAT GAA AGC AAT GAG AAT AAA 734 Thr Ser Asn Ala Asp He Ser Glu Asn Asn Glu Ser Asn Glu Asn Lys 210 215 220
GAT AAT GTG GAA AAA CAA GCC ATT AGA GAT GCT AAT ATT AAA GAA TTT 782 Asp Asn Val Glu Lys Gin Ala He Arg Asp Ala Asn He Lys Glu Phe 225 230 235
GCA TGC GGG AAG TGG GTC TAT GAC GAT GAA AAT TTA CAA GCC TAC CGC 830 Ala Cys Gly Lys Trp Val Tyr Asp Asp Glu Asn Leu Gin Ala Tyr Arg 240 245 250 255
CCA AGC ATT TTA AAA CGC GTT GAT GAA GAC AAA CAA ACT GCA ACA GAT 878 Pro Ser He Leu Lys Arg Val Asp Glu Asp Lys Gin Thr Ala Thr Asp 260 265 270
ATT ACC CCT TGC GAT TAC AGC ACC GCT GAA AAT AAA AGC GGT AAA ATC 926 He Thr Pro Cys Asp Tyr Ser Thr Ala Glu Asn Lys Ser Gly Lys He 275 280 285
ATT ACC CCC TAT ACT AAA ATC TCC GTT CAT AAA ACA GAG CCT TTA GAA 974 He Thr Pro Tyr Thr Lys He Ser Val His Lys Thr Glu Pro Leu Glu 290 295 300
GAG CCA CAA ACT TTT GAA GCT AAA AAT AAT TTC GCC ATT CTT CAA GCC 1022 Glu Pro Gin Thr Phe Glu Ala Lys Asn Asn Phe Ala He Leu Gin Ala 305 310 315
AGA AGC TCT ACA GAA AAA TGC AAA AGG GCT AGA GCA AGA AAA GAC GGC 1070 Arg Ser Ser Thr Glu Lys Cys Lys Arg Ala Arg Ala Arg Lys Asp Gly 320 325 330 335
ACG ACT AGG CAA TGC TAT CTA ATA GAA GAG CCT TTA AAA CAA GCA TGG 1118 Thr Thr Arg Gin Cys Tyr Leu He Glu Glu Pro Leu Lys Gin Ala Trp 340 345 350
GAG AGT GAG TAT GAA ATC ACC ACG CAA TTA GTG AAA GCC ATT TAT GAG 1166 Glu Ser Glu Tyr Glu He Thr Thr Gin Leu Val Lys Ala He Tyr Glu 355 360 365
CGC CCC AAA CAA GAC GAT CAA GTA GAG CCG ACT TTT TAT GAA ACC AGC 1214 Arg Pro Lys Gin Asp Asp Gin Val Glu Pro Thr Phe Tyr Glu Thr Ser 370 375 380
GAA TTG GCT TAT TCT TCC ACA CGA AAA AGC GAA ATA ACG CAC AAT GAA 1262 Glu Leu Ala Tyr Ser Ser Thr Arg Lys Ser Glu He Thr His Asn Glu 385 390 395
TTG AAT TTG AAT GAA AAA TTC ATG GAA TTT GTG GAA GTG TAT GAG GGG 1310 Leu Asn Leu Asn Glu Lys Phe Met Glu Phe Val Glu Val Tyr Glu Gly 400 405 410 415
CAT TAT TTA AAC GAT ATA ATT AAA GAG AGC AGT GAA TAT AAA GAA TGG 1358 His Tyr Leu Asn Asp He He Lys Glu Ser Ser Glu Tyr Lys Glu Trp 420 425 430
GTT AAA AAC CAT GTG CGC TTT AAA GAA GGG GTG TGC ATG GCT TTA GAA 1406 Val Lys Asn His Val Arg Phe Lys Glu Gly Val Cys Met Ala Leu Glu 435 440 445
ATA GAA GAA CAG CCA CGA GCT AAA AGC ACG CCT TTG AGT ATT GAA AAC 1454 He Glu Glu Gin Pro Arg Ala Lys Ser Thr Pro Leu Ser He Glu Asn 450 455 460
TCT CGT GTG GTA TGT GTC AAA AAG GGG AAT TAT TTA TTC AAC GAA GTT T 1503 Ser Arg Val Val Cys Val Lys Lys Gly Asn Tyr Leu Phe Asn Glu Val 465 470 475
AAGATGGTGG CTTGAGGCGG AATCGAACCA CCGACACGAA GATTTTC 1550
(2) INFORMATION FOR SEQ ID NO: 312:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 479 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312:
Met Lys Lys Ser Leu Cys Leu Ser Phe Phe Leu Thr Phe Ser Asn Pro
1 5 10 15
Leu Gin Ala Leu Val He Glu Leu Leu Glu Glu He Lys Thr Ser Pro
20 25 30
His Lys Gly Thr Phe Lys Ala Lys Val Leu Asp Ser Lys Lys Pro Arg
35 40 45
Gin Val Leu Gly Val Tyr Asn He Ser Pro His Lys Lys Leu Thr Leu
50 55 60
Thr He Thr His He Ser Thr Ala He Val Tyr Gin Pro Leu Asp Glu 65 70 75 80
Lys Leu Ser Leu Glu Thr Thr Leu Asn Pro Asn Arg Pro Thr He Pro
85 90 95
Arg Asn Thr Gin He Val Phe Ser Ser Lys Glu Leu Lys Glu Ser His
100 105 110
Pro His Gin Met Pro Ser Leu Asn Ala Pro Met Gin Lys Pro Gin Asn
115 120 125
Lys Pro His Ser Ser Gin Gin Pro Ser Gin Asn Phe Ser Tyr Pro Glu
130 135 140
Pro Lys Leu Gly Ser Lys Asn Ser Lys Asn Ser Leu Leu Gin Pro Leu 145 150 155 160
Ala He Pro Ser Lys He Ser Pro Thr Asn Glu Thr Gin Thr Pro Thr
165 170 175
Asn Asp Thr Lys Pro Pro Leu Lys His Ser Ser Glu Asp Gin Glu Ser
180 185 190
Asn Leu Phe He Thr Pro Pro Thr Glu Lys Thr Leu Pro Asn Asn Thr
195 200 205
Ser Asn Ala Asp He Ser Glu Asn Asn Glu Ser Asn Glu Asn Lys Asp
210 215 220
Asn Val Glu Lys Gin Ala He Arg Asp Ala Asn He Lys Glu Phe Ala 225 230 235 240
Cys Gly Lys Trp Val Tyr Asp Asp Glu Asn Leu Gin Ala Tyr Arg Pro
245 250 255
Ser He Leu Lys Arg Val Asp Glu Asp Lys Gin Thr Ala Thr Asp He
260 265 270
Thr Pro Cys Asp Tyr Ser Thr Ala Glu Asn Lys Ser Gly Lys He He
275 280 285
Thr Pro Tyr Thr Lys He Ser Val His Lys Thr Glu Pro Leu Glu Glu
290 295 300
Pro Gin Thr Phe Glu Ala Lys Asn Asn Phe Ala He Leu Gin Ala Arg 305 310 315 320
Ser Ser Thr Glu Lys Cys Lys Arg Ala Arg Ala Arg Lys Asp Gly Thr
325 330 335
Thr Arg Gin Cys Tyr Leu He Glu Glu Pro Leu Lys Gin Ala Trp Glu
340 345 350
Ser Glu Tyr Glu He Thr Thr Gin Leu Val Lys Ala He Tyr Glu Arg
355 360 365
Pro Lys Gin Asp Asp Gin Val Glu Pro Thr Phe Tyr Glu Thr Ser Glu 370 375 380 Leu Ala Tyr Ser Ser Thr Arg Lys Ser Glu He Thr His Asn Glu Leu 385 390 395 400
Asn Leu Asn Glu Lys Phe Met Glu Phe Val Glu Val Tyr Glu Gly His
405 410 415
Tyr Leu Asn Asp He He Lys Glu Ser Ser Glu Tyr Lys Glu Trp Val
420 425 430
Lys Asn His Val Arg Phe Lys Glu Gly Val Cys Met Ala Leu Glu He
435 440 445
Glu Glu Gin Pro Arg Ala Lys Ser Thr Pro Leu Ser He Glu Asn Ser
450 455 460
Arg Val Val Cys Val Lys Lys Gly Asn Tyr Leu Phe Asn Glu Val 465 470 475
(2) INFORMATION FOR SEQ ID NO:313:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 620 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 68...568 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:313:
TGTAAAATAG GGATTTGCTA GGCCTTTAGT CGTTAAAAGG TAATTATCAT TAAGGAGTTT 60 TTTAATC ATG GCA GAT ATT CAA AGG CGT GAT TTT TTA GGA ATG AGC CTT 109 Met Ala Asp He Gin Arg Arg Asp Phe Leu Gly Met Ser Leu 1 5 10
GCT AGT GTT ACA GCT ATA GGG GCT ATA GCG AGT CTG GTA GCG ATG AAA 157 Ala Ser Val Thr Ala He Gly Ala He Ala Ser Leu Val Ala Met Lys 15 20 25 30
AAG ACT TGG GAT CCG CTT CCA AGC GTT GTT TCA GCC GGT TTT ACG ACC 205 Lys Thr Trp Asp Pro Leu Pro Ser Val Val Ser Ala Gly Phe Thr Thr 35 40 45
ATA GAT GTG GCG AAT ATG CAA GAA GGG CAG TTT TCC ACC GTG GAA TGG 253 He Asp Val Ala Asn Met Gin Glu Gly Gin Phe Ser Thr Val Glu Trp 50 55 60
CGT GGG AAA CCG GTC TAT ATC CTC AAG CGT TCT AAA AAA GAG GGC TTT 301 Arg Gly Lys Pro Val Tyr He Leu Lys Arg Ser Lys Lys Glu Gly Phe 65 70 75
AAT GAA AAG CGC GAT TTT AAA GTT GGC GAG AGC GTT TTT ACC ACA GCC 349 Asn Glu Lys Arg Asp Phe Lys Val Gly Glu Ser Val Phe Thr Thr Ala 80 85 90
ATT CAA ATT TGC ACG CAT TTA GGG TGT ATC CCC ACT TAT CAA GAT GAA 397 He Gin He Cys Thr His Leu Gly Cys He Pro Thr Tyr Gin Asp Glu 95 100 105 110
GAA AAA GGC TTT TTA TGC CCA TGC CAT GGG GGG CGT TTC ACT TCT GAT 445 Glu Lys Gly Phe Leu Cys Pro Cys His Gly Gly Arg Phe Thr Ser Asp 115 120 125
GGC GTG AAT ATT GCC GGC ACT CCC CCT CCA CGC CCT TTT GAT ATC CCG 493 Gly Val Asn He Ala Gly Thr Pro Pro Pro Arg Pro Phe Asp He Pro 130 135 140
CCT TTT AAA ATT GAA GGC ACT AAG ATC ACT TTT GGT GAA GCC GGG GCT 541 Pro Phe Lys He Glu Gly Thr Lys He Thr Phe Gly Glu Ala Gly Ala 145 150 155
GAA TAC AAG AAA ATG ATG GCT AAA GCG TAAGGAGAGT TTAATGGCAG AGATAAA 595 Glu Tyr Lys Lys Met Met Ala Lys Ala 160 165
AAAAGCGAAA AATTTAGGCG AATGG 620
(2) INFORMATION FOR SEQ ID NO: 314:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 167 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314:
Met Ala Asp He Gin Arg Arg Asp Phe Leu Gly Met Ser Leu Ala Ser
1 5 10 15
Val Thr Ala He Gly Ala He Ala Ser Leu Val Ala Met Lys Lys Thr
20 25 30
Trp Asp Pro Leu Pro Ser Val Val Ser Ala Gly Phe Thr Thr He Asp
35 40 45
Val Ala Asn Met Gin Glu Gly Gin Phe Ser Thr Val Glu Trp Arg Gly
50 55 60
Lys Pro Val Tyr He Leu Lys Arg Ser Lys Lys Glu Gly Phe Asn Glu 65 70 75 80
Lys Arg Asp Phe Lys Val Gly Glu Ser Val Phe Thr Thr Ala He Gin
85 90 95
He Cys Thr His Leu Gly Cys He Pro Thr Tyr Gin Asp Glu Glu Lys
100 105 110
Gly Phe Leu Cys Pro Cys His Gly Gly Arg Phe Thr Ser Asp Gly Val
115 120 125
Asn He Ala Gly Thr Pro Pro Pro Arg Pro Phe Asp He Pro Pro Phe 130 135 140
Lys He Glu Gly Thr Lys He Thr Phe Gly Glu Ala Gly Ala Glu Tyr 145 150 155 160
Lys Lys Met Met Ala Lys Ala 165
(2) INFORMATION FOR SEQ ID NO: 315:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1221 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...1167 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315:
TTATAATCAA AGCTATTTTA AAAGCTGAAT AGCTATAGTT ATTAGGATGC G ATG TCA 57
Met Ser 1
AAA AGA ATG AAG TGT TTT AGT CAA AAA TGG TTG GTT TTT TTT GTT ACC 105 Lys Arg Met Lys Cys Phe Ser Gin Lys Trp Leu Val Phe Phe Val Thr 5 10 15
CTT TTA TTG GCT TCT TTA GGC CAT GCG AAA ATG GCT TTT GAA TCC GAT 153 Leu Leu Leu Ala Ser Leu Gly His Ala Lys Met Ala Phe Glu Ser Asp 20 25 30
ATT GAC ACC AAA GCG CTA GAG GCT TTT GGG GTT AAT GCG GGC TTT TTA 201 He Asp Thr Lys Ala Leu Glu Ala Phe Gly Val Asn Ala Gly Phe Leu 35 40 45 50
TCC CAA ATG CCC AAC GCT TTA AAA AAA ATG AAT AAA GAA GAA GAA TGG 249 Ser Gin Met Pro Asn Ala Leu Lys Lys Met Asn Lys Glu Glu Glu Trp 55 60 65
AAG AGA CTT GTC AAA AGA TTT GAT GTG AAT TAC CAG TTC ATC CCC ATC 297 Lys Arg Leu Val Lys Arg Phe Asp Val Asn Tyr Gin Phe He Pro He 70 75 80
ATT AAA AAC ATG CTC ATA GAA GCG AGC GTG CCG CAA GAA TTT TTA TTT 345 He Lys Asn Met Leu He Glu Ala Ser Val Pro Gin Glu Phe Leu Phe 85 90 95
TTA GCC ATG GCC GAG TCT AAA TTT TCA TCA AGG GCT TAT AGC AGG AAA 393 Leu Ala Met Ala Glu Ser Lys Phe Ser Ser Arg Ala Tyr Ser Arg Lys 100 105 110
AAA GCG GTA GGG ATT TGG CAA TTC ATG CCA AGC ACG GCT AAA GAA TTA 441 Lys Ala Val Gly He Trp Gin Phe Met Pro Ser Thr Ala Lys Glu Leu 115 120 125 130
GGG CTT AAG GTC AAT CAT TAC ATT GAT GAA AGA AGA GAT CCC ATT AAA 489 Gly Leu Lys Val Asn His Tyr He Asp Glu Arg Arg Asp Pro He Lys 135 140 145
AGC ACT CAA GCG GCG ATC ACT TAT TTG AAA CGG CTC TAC AAG CAA ACC 537 Ser Thr Gin Ala Ala He Thr Tyr Leu Lys Arg Leu Tyr Lys Gin Thr 150 155 160
GGA GAG TGG TAT TTG GTC GCT ATG GCG TAT AAT TAC GGC TTA CGC AAG 585 Gly Glu Trp Tyr Leu Val Ala Met Ala Tyr Asn Tyr Gly Leu Arg Lys 165 170 175
GTT CAA AAC GCT ATT AAA GCC GCC GGC ACT TCG GAC ATT AAA ATT TTG 633 Val Gin Asn Ala He Lys Ala Ala Gly Thr Ser Asp He Lys He Leu 180 185 190
TTG GAT GAA GAT AAG AAA TAC CTC CCT AAA GAA ACA CGA GAG TAT ATC 681 Leu Asp Glu Asp Lys Lys Tyr Leu Pro Lys Glu Thr Arg Glu Tyr He 195 200 205 210
CGC TCC ATT CTA AGC CTA GCG TTA AAA TTC AAC AGC CTA GAC AAC CTC 729 Arg Ser He Leu Ser Leu Ala Leu Lys Phe Asn Ser Leu Asp Asn Leu 215 220 225
AAA GAT AAA GAA TAT CTG CTC AAT CGT GGG GCG AGG GTG AGT TTA GTG 777 Lys Asp Lys Glu Tyr Leu Leu Asn Arg Gly Ala Arg Val Ser Leu Val 230 235 240
GGC GTC CCG TTT AAA AGG CGT GCT TCT TTA GTC CAA GTA GCC AAA AAT 825 Gly Val Pro Phe Lys Arg Arg Ala Ser Leu Val Gin Val Ala Lys Asn 245 250 255
TTG AAT TTG AGT TTG GAA ACC TTA AAA TCC TAC AAC CAC CAA TTC CGT 873 Leu Asn Leu Ser Leu Glu Thr Leu Lys Ser Tyr Asn His Gin Phe Arg 260 265 270
TAT AAC ATT CTG CCT TCT AAA GAC CCC ACT TAT ACC ATT TAT ATC CCT 921 Tyr Asn He Leu Pro Ser Lys Asp Pro Thr Tyr Thr He Tyr He Pro 275 280 285 290
TAT GAA AAA CTC GCT CTT TTC AAA CAA CGC CAG ATC AAA CAA AAT AAA 969 Tyr Glu Lys Leu Ala Leu Phe Lys Gin Arg Gin He Lys Gin Asn Lys 295 300 305
AAC ATT CAA GCC AGT TCA AAA AGC CCT TTT ATC ACC CAT GTG GTC TTA 1017 Asn He Gin Ala Ser Ser Lys Ser Pro Phe He Thr His Val Val Leu 310 315 320 CCT AAA GAA ACC CTA TCT TCT ATC GCT AAA CGC TAT CAA GTC AGT ATT 1065 Pro Lys Glu Thr Leu Ser Ser He Ala Lys Arg Tyr Gin Val Ser He 325 330 335
TCC AAT ATC CAA TTA GCC AAT GAT CTC AAA GAT TCT AAT ATT TTT ATC 1113 Ser Asn He Gin Leu Ala Asn Asp Leu Lys Asp Ser Asn He Phe He 340 345 350
CAC CAG CGT TTA ATC ATC CCC ACC AAC AAA AAA TTA CTC GCT ACA AGG 1161 His Gin Arg Leu He He Pro Thr Asn Lys Lys Leu Leu Ala Thr Arg 355 360 365 370
GAA TTT TAATGGGTTT GGCGTTGGAA AAAGTTTGTT TTTTAGGCGT TATTTTTTTG AT 1219 Glu Phe
TA 1221
(2) INFORMATION FOR SEQ ID NO: 316:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 372 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316:
Met Ser Lys Arg Met Lys Cys Phe Ser Gin Lys Trp Leu Val Phe Phe
1 5 10 15
Val Thr Leu Leu Leu Ala Ser Leu Gly His Ala Lys Met Ala Phe Glu
20 25 30
Ser Asp He Asp Thr Lys Ala Leu Glu Ala Phe Gly Val Asn Ala Gly
35 40 45
Phe Leu Ser Gin Met Pro Asn Ala Leu Lys Lys Met Asn Lys Glu Glu
50 55 60
Glu Trp Lys Arg Leu Val Lys Arg Phe Asp Val Asn Tyr Gin Phe He 65 70 75 80
Pro He He Lys Asn Met Leu He Glu Ala Ser Val Pro Gin Glu Phe
85 90 95
Leu Phe Leu Ala Met Ala Glu Ser Lys Phe Ser Ser Arg Ala Tyr Ser
100 105 110
Arg Lys Lys Ala Val Gly He Trp Gin Phe Met Pro Ser Thr Ala Lys
115 120 125
Glu Leu Gly Leu Lys Val Asn His Tyr He Asp Glu Arg Arg Asp Pro
130 135 140
He Lys Ser Thr Gin Ala Ala He Thr Tyr Leu Lys Arg Leu Tyr Lys 145 150 155 160
Gin Thr Gly Glu Trp Tyr Leu Val Ala Met Ala Tyr Asn Tyr Gly Leu
165 170 175
Arg Lys Val Gin Asn Ala He Lys Ala Ala Gly Thr Ser Asp He Lys 180 185 190 He Leu Leu Asp Glu Asp Lys Lys Tyr Leu Pro Lys Glu Thr Arg Glu
195 200 205
Tyr He Arg Ser He Leu Ser Leu Ala Leu Lys Phe Asn Ser Leu Asp
210 215 220
Asn Leu Lys Asp Lys Glu Tyr Leu Leu Asn Arg Gly Ala Arg Val Ser 225 230 235 240
Leu Val Gly Val Pro Phe Lys Arg Arg Ala Ser Leu Val Gin Val Ala
245 250 255
Lys Asn Leu Asn Leu Ser Leu Glu Thr Leu Lys Ser Tyr Asn His Gin
260 265 270
Phe Arg Tyr Asn He Leu Pro Ser Lys Asp Pro Thr Tyr Thr He Tyr
275 280 285
He Pro Tyr Glu Lys Leu Ala Leu Phe Lys Gin Arg Gin He Lys Gin
290 295 300
Asn Lys Asn He Gin Ala Ser Ser Lys Ser Pro Phe He Thr His Val 305 310 315 320
Val Leu Pro Lys Glu Thr Leu Ser Ser He Ala Lys Arg Tyr Gin Val
325 330 335
Ser He Ser Asn He Gin Leu Ala Asn Asp Leu Lys Asp Ser Asn He
340 345 350
Phe He His Gin Arg Leu He He Pro Thr Asn Lys Lys Leu Leu Ala
355 360 365
Thr Arg Glu Phe 370
(2) INFORMATION FOR SEQ ID NO: 317:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 561 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 46...510 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:317:
AAATGAGCTA AAATGAGCGT TTCATTTGAC AAATAAAGGG ATTGA ATG GCT TTT AAG 57
Met Ala Phe Lys 1
GTG GTG CAA ATT TGC GGA GGG CTT GGG AAT CAA ATG TTT CAA TAC GCT 105 Val Val Gin He Cys Gly Gly Leu Gly Asn Gin Met Phe Gin Tyr Ala 5 10 15 20
TTC GCT AAA AGT TTG CAA AAA CAC TCT AAT ACG CCT GTG CTG TTA GAT 153 Phe Ala Lys Ser Leu Gin Lys His Ser Asn Thr Pro Val Leu Leu Asp 25 30 35
ATC ACT TCT TTT GAT TGG AGC GAT AGG AAA ATG CAA TTA GAA CTT TTC 201 He Thr Ser Phe Asp Trp Ser Asp Arg Lys Met Gin Leu Glu Leu Phe 40 45 50
CCT ATT GAT TTG CCC TAT GCG AGC GCG AAA GAA ATC GCT ATA GCT AAA 249 Pro He Asp Leu Pro Tyr Ala Ser Ala Lys Glu He Ala He Ala Lys 55 60 65
ATG CAA CAC CTC CCC AAG CTA GTA AGA GAC GCG CTC AAA TGC ATG GGA 297 Met Gin His Leu Pro Lys Leu Val Arg Asp Ala Leu Lys Cys Met Gly 70 75 80
TTT GAT AGG GTG AGT CAA GAA ATC GTT TTT GAA TAC GAG CCT AAA TTG 345 Phe Asp Arg Val Ser Gin Glu He Val Phe Glu Tyr Glu Pro Lys Leu 85 90 95 100
CTA AAG CCA AGC CGC TTG ACT TAT TTT TTT GGC TAT TTC CAA GAT CCA 393 Leu Lys Pro Ser Arg Leu Thr Tyr Phe Phe Gly Tyr Phe Gin Asp Pro 105 110 115
CGA TAC TTT GAT GCT ATA TCC CCT TTA ATC AAG CAA ACC TTC ACT CTA 441 Arg Tyr Phe Asp Ala He Ser Pro Leu He Lys Gin Thr Phe Thr Leu 120 125 130
CCC CCC CCC CCC CCG AAA ATA ATA AGA ATA ATA ATA AAA AAG AGG AAG 489 Pro Pro Pro Pro Pro Lys He He Arg He He He Lys Lys Arg Lys 135 140 145
AAT ATC AGT GCA AGC TTT CTT TGATTTTAGC CGCTAAAAAC AGCGTGTTTG TGCA 544 Asn He Ser Ala Ser Phe Leu 150 155
TATAAGAAGA GGGGATT 561
(2) INFORMATION FOR SEQ ID NO: 318:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 155 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:318:
Met Ala Phe Lys Val Val Gin He Cys Gly Gly Leu Gly Asn Gin Met
1 5 10 15
Phe Gin Tyr Ala Phe Ala Lys Ser Leu Gin Lys His Ser Asn Thr Pro
20 25 30
Val Leu Leu Asp He Thr Ser Phe Asp Trp Ser Asp Arg Lys Met Gin 35 40 45 Leu Glu Leu Phe Pro He Asp Leu Pro Tyr Ala Ser Ala Lys Glu He
50 55 60
Ala He Ala Lys Met Gin His Leu Pro Lys Leu Val Arg Asp Ala Leu 65 70 75 80
Lys Cys Met Gly Phe Asp Arg Val Ser Gin Glu He Val Phe Glu Tyr
85 90 95
Glu Pro Lys Leu Leu Lys Pro Ser Arg Leu Thr Tyr Phe Phe Gly Tyr
100 105 110
Phe Gin Asp Pro Arg Tyr Phe Asp Ala He Ser Pro Leu He Lys Gin
115 120 125
Thr Phe Thr Leu Pro Pro Pro Pro Pro Lys He He Arg He He He
130 135 140
Lys Lys Arg Lys Asn He Ser Ala Ser Phe Leu 145 150 155
(2) INFORMATION FOR SEQ ID NO: 319:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1251 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 86...1201 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319:
GGTTATTATA GTATAATATT GTCAAAAAAT AAATTCAACT TTGGATTAAA TTGATTAAAA 60 ACCTATTTTA GGGAAACCGC TTAAA ATG AGT ATT ATT ATT CCT ATT GTC ATC 112
Met Ser He He He Pro He Val He 1 5
GCT TTT GAT AAT CAC TAT GCC ATG CCG GCT GGC GTG AGC TTG TAT TCC 160 Ala Phe Asp Asn His Tyr Ala Met Pro Ala Gly Val Ser Leu Tyr Ser 10 15 20 25
ATG CTA GCT TGC GCT AAA ACA GAA CAC CCC CAA TCA CAA AAT GAT AGT 208 Met Leu Ala Cys Ala Lys Thr Glu His Pro Gin Ser Gin Asn Asp Ser 30 35 40
GAA AAA CTT TTT TAT AAG ATC CAC TGC CTG GTG GAT AAC TTA AGC CTT 256 Glu Lys Leu Phe Tyr Lys He His Cys Leu Val Asp Asn Leu Ser Leu 45 50 55
GAA AAC CAG AGC AAA CTA AAA GAG ACT CTA GCC CCC TTT AGC GCT TTT 304 Glu Asn Gin Ser Lys Leu Lys Glu Thr Leu Ala Pro Phe Ser Ala Phe 60 65 70 TCG AGC CTA GAA TTT TTA GAC ATT TCA ACC CCC AAT CTT CAC GCC ACT 352 Ser Ser Leu Glu Phe Leu Asp He Ser Thr Pro Asn Leu His Ala Thr 75 80 85
CCA ATA GAA CCC TCT GCG ATT GAT AAA ATC AAT GAA GCT TTT TTG CAA 400 Pro He Glu Pro Ser Ala He Asp Lys He Asn Glu Ala Phe Leu Gin 90 95 100 105
CTC AAT ATT TAC GCT AAG ACT CGC TTT TCT AAA ATG GTC ATG TGC CGC 448 Leu Asn He Tyr Ala Lys Thr Arg Phe Ser Lys Met Val Met Cys Arg 110 115 120
TTG TTT TTG GCT TCT TTA TTC CCA CAA TAC GAC AAA ATC ATC ATG TTT 496 Leu Phe Leu Ala Ser Leu Phe Pro Gin Tyr Asp Lys He He Met Phe 125 130 135
GAT GCA GAC ACT TTG TTT TTA AAC GAT GTG AGC GAG AGC TTT TTC ATC 544 Asp Ala Asp Thr Leu Phe Leu Asn Asp Val Ser Glu Ser Phe Phe He 140 145 150
CCA CTA GAT GGC TAT TAT TTT GGA GCG GCT AAA GAT TTT GCT TCC GAT 592 Pro Leu Asp Gly Tyr Tyr Phe Gly Ala Ala Lys Asp Phe Ala Ser Asp 155 160 165
AAA AGC CCT AAA CAT TTT CAA ATA GTG CGA GAA AAA GAC CCT CGT CAA 640 Lys Ser Pro Lys His Phe Gin He Val Arg Glu Lys Asp Pro Arg Gin 170 175 180 185
GCC TTT TCC CTT TAT GAG CAT TAC CTT AAT GAA AGC GAT ATG CAA ATC 688 Ala Phe Ser Leu Tyr Glu His Tyr Leu Asn Glu Ser Asp Met Gin He 190 195 200
ATC TAT GAA AGC AAT TAT AAC GCC GGG TTT TTA GTC GTG AAT TTA AAG 736 He Tyr Glu Ser Asn Tyr Asn Ala Gly Phe Leu Val Val Asn Leu Lys 205 210 215
CTG TGG CGT GCT GAT CAT TTA GAA GAG CGC TTA CTC AAT TTA ACC CAT 784 Leu Trp Arg Ala Asp His Leu Glu Glu Arg Leu Leu Asn Leu Thr His 220 225 230
CAA AAA GGC CAG TGC GTG TTT TAC CCT GAA CAG GAC CTT TTA ACG CTC 832 Gin Lys Gly Gin Cys Val Phe Tyr Pro Glu Gin Asp Leu Leu Thr Leu 235 240 245
GCA TGC TAT CAA AAA GTT TTA ATC TTG CCT TAT ATT TAT AAC ACC CAC 880 Ala Cys Tyr Gin Lys Val Leu He Leu Pro Tyr He Tyr Asn Thr His 250 255 260 265
CCT TTC ATG GCC AAT CAA AAA CGC TTC ATC CCT GAC AAA AAA GAA ATC 928 Pro Phe Met Ala Asn Gin Lys Arg Phe He Pro Asp Lys Lys Glu He 270 275 280
GTC ATG CTG CAT TTT TAT TTT GTA GGA AAA CCT TGG GTT TTA CCT ACT 976 Val Met Leu His Phe Tyr Phe Val Gly Lys Pro Trp Val Leu Pro Thr 285 290 295 TTT TCA TAT TCT AAA GAA TGG CAT GAG ACT CTT TTA AAA ACC CCT TTT 1024 Phe Ser Tyr Ser Lys Glu Trp His Glu Thr Leu Leu Lys Thr Pro Phe 300 305 310
TAT GCT GAA TAT TCC GTG AAA TTC CTT AAA CAA ATG ACA GAA TGT TTA 1072 Tyr Ala Glu Tyr Ser Val Lys Phe Leu Lys Gin Met Thr Glu Cys Leu 315 320 325
AGC CTT AAA GAC AAA CAA AAA ACC TTT GAA TTT CTT GCC CCC CTA CTC 1120 Ser Leu Lys Asp Lys Gin Lys Thr Phe Glu Phe Leu Ala Pro Leu Leu 330 335 340 345
AAT AAA AAA ACC CTT TTA GAA TAC GTC TTT TTT AGA TTG AAT AGG ATT 1168 Asn Lys Lys Thr Leu Leu Glu Tyr Val Phe Phe Arg Leu Asn Arg He 350 355 360
TTC AAA CGC TTA AAA GAA AAA TTT TTT AAC TCT TAGCGTTCTC GTTTGGGCAA 1221 Phe Lys Arg Leu Lys Glu Lys Phe Phe Asn Ser 365 370
CACGCTATAG GCGAATTTGA CATAAATCGC 1251
(2) INFORMATION FOR SEQ ID NO: 320:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 372 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320:
Met Ser He He He Pro He Val He Ala Phe Asp Asn His Tyr Ala
1 5 10 15
Met Pro Ala Gly Val Ser Leu Tyr Ser Met Leu Ala Cys Ala Lys Thr
20 25 30
Glu His Pro Gin Ser Gin Asn Asp Ser Glu Lys Leu Phe Tyr Lys He
35 40 45
His Cys Leu Val Asp Asn Leu Ser Leu Glu Asn Gin Ser Lys Leu Lys
50 55 60
Glu Thr Leu Ala Pro Phe Ser Ala Phe Ser Ser Leu Glu Phe Leu Asp 65 70 75 80
He Ser Thr Pro Asn Leu His Ala Thr Pro He Glu Pro Ser Ala He
85 90 95
Asp Lys He Asn Glu Ala Phe Leu Gin Leu Asn He Tyr Ala Lys Thr
100 105 110
Arg Phe Ser Lys Met Val Met Cys Arg Leu Phe Leu Ala Ser Leu Phe
115 120 125
Pro Gin Tyr Asp Lys He He Met Phe Asp Ala Asp Thr Leu Phe Leu
130 135 140
Asn Asp Val Ser Glu Ser Phe Phe He Pro Leu Asp Gly Tyr Tyr Phe 145 150 155 160 Gly Ala Ala Lys Asp Phe Ala Ser Asp Lys Ser Pro Lys His Phe Gin
165 170 175
He Val Arg Glu Lys Asp Pro Arg Gin Ala Phe Ser Leu Tyr Glu His
180 185 190
Tyr Leu Asn Glu Ser Asp Met Gin He He Tyr Glu Ser Asn Tyr Asn
195 200 205
Ala Gly Phe Leu Val Val Asn Leu Lys Leu Trp Arg Ala Asp His Leu
210 215 220
Glu Glu Arg Leu Leu Asn Leu Thr His Gin Lys Gly Gin Cys Val Phe 225 230 235 240
Tyr Pro Glu Gin Asp Leu Leu Thr Leu Ala Cys Tyr Gin Lys Val Leu
245 250 255
He Leu Pro Tyr He Tyr Asn Thr His Pro Phe Met Ala Asn Gin Lys
260 265 270
Arg Phe He Pro Asp Lys Lys Glu He Val Met Leu His Phe Tyr Phe
275 280 285
Val Gly Lys Pro Trp Val Leu Pro Thr Phe Ser Tyr Ser Lys Glu Trp
290 295 300
His Glu Thr Leu Leu Lys Thr Pro Phe Tyr Ala Glu Tyr Ser Val Lys 305 310 315 320
Phe Leu Lys Gin Met Thr Glu Cys Leu Ser Leu Lys Asp Lys Gin Lys
325 330 335
Thr Phe Glu Phe Leu Ala Pro Leu Leu Asn Lys Lys Thr Leu Leu Glu
340 345 350
Tyr Val Phe Phe Arg Leu Asn Arg He Phe Lys Arg Leu Lys Glu Lys
355 360 365
Phe Phe Asn Ser 370
(2) INFORMATION FOR SEQ ID NO: 321:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2241 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...2193 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321:
CAAGAAGCCA TAGAAGCTGA TGGGAAATTC CACAAAGAAT AAGGGTAGAA A ATG AAA 57
Met Lys 1
ATA ACA TAT TGT GAT GCG CTA ATT ATT GGA GGC GGA CTA GCT GGG TTA 105 He Thr Tyr Cys Asp Ala Leu He He Gly Gly Gly Leu Ala Gly Leu 5 10 15
AGG GCT AGT ATC GCA TGC AAA CAA AAG GGT TTA AAC ACC ATC GTT TTA 153 Arg Ala Ser He Ala Cys Lys Gin Lys Gly Leu Asn Thr He Val Leu 20 25 30
AGC CTA GTG CCT GTC AGG CGT TCG CAC TCT GCA GCC GCT CAA GGG GGC 201 Ser Leu Val Pro Val Arg Arg Ser His Ser Ala Ala Ala Gin Gly Gly 35 40 45 50
ATG CAA GCG AGC CTT GCG AAC GCT AAA AAA AGC GAG GGC GAT AAT GAA 249 Met Gin Ala Ser Leu Ala Asn Ala Lys Lys Ser Glu Gly Asp Asn Glu 55 60 65
GAT TTA CAC TTT TTA GAC ACG GTT AAG GGG AGC GAT TGG GGG TGC GAT 297 Asp Leu His Phe Leu Asp Thr Val Lys Gly Ser Asp Trp Gly Cys Asp 70 75 80
CAG CAA GTG GCT AGG ATG TTT GTA ACC ACT GCT CCT AAA GCC ATT AGG 345 Gin Gin Val Ala Arg Met Phe Val Thr Thr Ala Pro Lys Ala He Arg 85 90 95
GAA TTG GCC AGT TGG GGG GTG CCT TGG ACT AGG ATT AAA AAG GGC GAT 393 Glu Leu Ala Ser Trp Gly Val Pro Trp Thr Arg He Lys Lys Gly Asp 100 105 110
AGG CCT GCG GTC GTC AAT GGT GAG CAT GTA ACT ATC ACT GAA AGA GAC 441 Arg Pro Ala Val Val Asn Gly Glu His Val Thr He Thr Glu Arg Asp 115 120 125 130
GAC AGG CAT GGT TAT ATC TTA AGC CGT GAT TTT GGC GGC ACT AAA AAA 489 Asp Arg His Gly Tyr He Leu Ser Arg Asp Phe Gly Gly Thr Lys Lys 135 140 145
TGG CGC ACA TGC TTT ACG GCT GAT GCC ACA GGG CAT ACC ATG CTT TAT 537 Trp Arg Thr Cys Phe Thr Ala Asp Ala Thr Gly His Thr Met Leu Tyr 150 155 160
GCG GTC GCT AAT GAA GCC TTA CAC CAC AAA GTG GAT ATT CAA GAC AGA 585 Ala Val Ala Asn Glu Ala Leu His His Lys Val Asp He Gin Asp Arg 165 170 175
AAG GAC ATG CTC GCT TTC ATT CAT CAT GAT AAT AAA TGC TAT GGG GCG 633 Lys Asp Met Leu Ala Phe He His His Asp Asn Lys Cys Tyr Gly Ala 180 185 190
GTG GTA AGG GAT TTG ATC ACA GGC GAA ATT TCA GCG TAT GTT TCT AAA 681 Val Val Arg Asp Leu He Thr Gly Glu He Ser Ala Tyr Val Ser Lys 195 200 205 210
GGC ACG CTT TTA GCT ACC GGA GGT TAT GGG CGC GTG TAT AAA CAC ACC 729 Gly Thr Leu Leu Ala Thr Gly Gly Tyr Gly Arg Val Tyr Lys His Thr 215 220 225
ACT AAC GCT GTG ATT TGC GAT GGA GCC GGG GCT GCA AGC GCC TTA GAA 777 Thr Asn Ala Val He Cys Asp Gly Ala Gly Ala Ala Ser Ala Leu Glu 230 235 240
ACC GGC GTG GCT AAA TTG GGC AAC ATG GAA GCG GTG CAA TTC CAC CCT 825 Thr Gly Val Ala Lys Leu Gly Asn Met Glu Ala Val Gin Phe His Pro 245 250 255
ACC GCT TTA GTG CCA AGC GGG ATT TTA ATG ACC GAA GGT TGC AGG GGC 873 Thr Ala Leu Val Pro Ser Gly He Leu Met Thr Glu Gly Cys Arg Gly 260 265 270
GAT GGC GGT GTT TTA AGA GAC AAG TTT GGC AGA CGC TTC ATG CCC GCT 921 Asp Gly Gly Val Leu Arg Asp Lys Phe Gly Arg Arg Phe Met Pro Ala 275 280 285 290
TAT GAG CCG GAG AAA AAA GAG CTT GCA AGC AGA GAT GTG GTC TCA AGG 969 Tyr Glu Pro Glu Lys Lys Glu Leu Ala Ser Arg Asp Val Val Ser Arg 295 300 305
CGG ATT TTA GAG CAT ATC CAA AAA GGC TAT GGA GCC AAA TCG CCT TAT 1017 Arg He Leu Glu His He Gin Lys Gly Tyr Gly Ala Lys Ser Pro Tyr 310 315 320
GGG GAT CAT GTG TGG CTG GAT ATT GCT ATT TTA GGG CGT AAC CAT GTG 1065 Gly Asp His Val Trp Leu Asp He Ala He Leu Gly Arg Asn His Val 325 330 335
GAA AAA AAC TTA AGG GAT GTG CGC GAT ATA GCC ATG ACT TTT GCG GGC 1113 Glu Lys Asn Leu Arg Asp Val Arg Asp He Ala Met Thr Phe Ala Gly 340 345 350
ATT GAT CCG GCT GAT AGC AAG GAA CAA ACC AAA GAC AAC ATG CAA GGA 1161 He Asp Pro Ala Asp Ser Lys Glu Gin Thr Lys Asp Asn Met Gin Gly 355 360 365 370
GTG CCC GCA AAT GAG CCT GAA TAC GGG CAA GCG ATG GCC AAG CAA AAA 1209 Val Pro Ala Asn Glu Pro Glu Tyr Gly Gin Ala Met Ala Lys Gin Lys 375 380 385
GGC TGG ATC CCC ATA AAA CCC ATG CAA CAC TAT TCT ATG GGT GGG GTT 1257 Gly Trp He Pro He Lys Pro Met Gin His Tyr Ser Met Gly Gly Val 390 395 400
AGG ACA AAC CCT AAA GGC GAA ACC CAT TTA AAA GGC TTG TTT TGC GCG 1305 Arg Thr Asn Pro Lys Gly Glu Thr His Leu Lys Gly Leu Phe Cys Ala 405 410 415
GGT GAA GCG GCA TGC TGG GAT TTG CAT GGG TTT AAC CGC TTG GGG GGT 1353 Gly Glu Ala Ala Cys Trp Asp Leu His Gly Phe Asn Arg Leu Gly Gly 420 425 430
AAT TCT GTG AGT GAA GCG GTG GTC GCT GGC ATG ATC ATT GGG GAT TAT 1401 Asn Ser Val Ser Glu Ala Val Val Ala Gly Met He He Gly Asp Tyr 435 440 445 450 TTT GCC TCG CAT TGT TTA GAA GCG CAA ATT GAA ATC AAC ACG CAA AAA 1449 Phe Ala Ser His Cys Leu Glu Ala Gin He Glu He Asn Thr Gin Lys 455 460 465
GTT GAA GCT TTC ATT AAA GAA AGC CAA GAC TAT ATG CAT TTT TTA TTG 1497 Val Glu Ala Phe He Lys Glu Ser Gin Asp Tyr Met His Phe Leu Leu 470 475 480
CAT AAT GAA GGC AAA GAA GAT GTG TAT GAA ATT AGA GAG CGC ATG AAA 1545 His Asn Glu Gly Lys Glu Asp Val Tyr Glu He Arg Glu Arg Met Lys 485 490 495
GAA GTC ATG GAT GAA AAA GTG GGC GTT TTT AGA GAA GGC AAA AGG CTA 1593 Glu Val Met Asp Glu Lys Val Gly Val Phe Arg Glu Gly Lys Arg Leu 500 505 510
GAA GAA GCC CTT AAA GAA TTG CAA GAG CTT TAT GCA CGC TCC AAA AAC 1641 Glu Glu Ala Leu Lys Glu Leu Gin Glu Leu Tyr Ala Arg Ser Lys Asn 515 520 525 530
ATT TGC GTG AAA AAC AAG GTT TTA CAC AAT AAC CCT GAA TTA GAA GAC 1689 He Cys Val Lys Asn Lys Val Leu His Asn Asn Pro Glu Leu Glu Asp 535 540 545
GCT TAC CGC ACC AAA AAA ATG CTC AAA CTC GCG CTT TGT ATC ACT CAA 1737 Ala Tyr Arg Thr Lys Lys Met Leu Lys Leu Ala Leu Cys He Thr Gin 550 555 560
GGA GCG TTA CTG CGC ACT GAA AGC AGA GGG GCT CAC ACA AGG ATT GAC 1785 Gly Ala Leu Leu Arg Thr Glu Ser Arg Gly Ala His Thr Arg He Asp 565 570 575
TAC CCT AAA AGA GAC GAT GAA AAA TGG CTT AAT CGG ACT CTA GCG AGC 1833 Tyr Pro Lys Arg Asp Asp Glu Lys Trp Leu Asn Arg Thr Leu Ala Ser 580 585 590
TGG CCT AGC GCT GAG CAA GAC ATG CCC ACG ATT GAA TAC GAA GAA TTA 1881 Trp Pro Ser Ala Glu Gin Asp Met Pro Thr He Glu Tyr Glu Glu Leu 595 600 605 610
GAT GTG ATG AAA ATG GAA ATC AGC CCT GAT TTT AGG GGC TAT GGC AAA 1929 Asp Val Met Lys Met Glu He Ser Pro Asp Phe Arg Gly Tyr Gly Lys 615 620 625
AAG GGT AAT TTC ATC CCC CAC CCC AAA AAA GAA GAG CGC GAC GCT GAG 1977 Lys Gly Asn Phe He Pro His Pro Lys Lys Glu Glu Arg Asp Ala Glu 630 635 640
ATT TTG AAA ACG ATT TTA GAA CTA GAA AAG CTT GGA AAA GAC AGA ATA 2025 He Leu Lys Thr He Leu Glu Leu Glu Lys Leu Gly Lys Asp Arg He 645 650 655
GAA GTC CAA CAT GCG CTC ATG CCT TTT GAA TTG CAA GAA AAA TAC AAG 2073 Glu Val Gin His Ala Leu Met Pro Phe Glu Leu Gin Glu Lys Tyr Lys 660 665 670 GCT AGG AAT ATG CGT TTA GAA GAT GAG GAA GTC AGG GCT AGG GGG GAA 2121 Ala Arg Asn Met Arg Leu Glu Asp Glu Glu Val Arg Ala Arg Gly Glu 675 680 685 690
CAT TTG TAT TCT TTC AAT GTC CAT GAG TTA TTG GAC CAA CAC AAC GCT 2169 His Leu Tyr Ser Phe Asn Val His Glu Leu Leu Asp Gin His Asn Ala 695 700 705
AAC CTA AAA GGA GAA CAC CAT GAG TGATAATGAA CGAACGATTG TAGTTAGAGT 2223 Asn Leu Lys Gly Glu His His Glu 710
GCTAAAATTT GACCCTCA 2241
(2) INFORMATION FOR SEQ ID NO: 322:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 714 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:322:
Met Lys He Thr Tyr Cys Asp Ala Leu He He Gly Gly Gly Leu Ala
1 5 10 15
Gly Leu Arg Ala Ser He Ala Cys Lys Gin Lys Gly Leu Asn Thr He
20 25 30
Val Leu Ser Leu Val Pro Val Arg Arg Ser His Ser Ala Ala Ala Gin
35 40 45
Gly Gly Met Gin Ala Ser Leu Ala Asn Ala Lys Lys Ser Glu Gly Asp
50 55 60
Asn Glu Asp Leu His Phe Leu Asp Thr Val Lys Gly Ser Asp Trp Gly 65 70 75 80
Cys Asp Gin Gin Val Ala Arg Met Phe Val Thr Thr Ala Pro Lys Ala
85 90 95
He Arg Glu Leu Ala Ser Trp Gly Val Pro Trp Thr Arg He Lys Lys
100 105 110
Gly Asp Arg Pro Ala Val Val Asn Gly Glu His Val Thr He Thr Glu
115 120 125
Arg Asp Asp Arg His Gly Tyr He Leu Ser Arg Asp Phe Gly Gly Thr
130 135 140
Lys Lys Trp Arg Thr Cys Phe Thr Ala Asp Ala Thr Gly His Thr Met 145 150 155 160
Leu Tyr Ala Val Ala Asn Glu Ala Leu His His Lys Val Asp He Gin
165 170 175
Asp Arg Lys Asp Met Leu Ala Phe He His His Asp Asn Lys Cys Tyr
180 185 190
Gly Ala Val Val Arg Asp Leu He Thr Gly Glu He Ser Ala Tyr Val
195 200 205
Ser Lys Gly Thr Leu Leu Ala Thr Gly Gly Tyr Gly Arg Val Tyr Lys 210 215 220 His Thr Thr Asn Ala Val He Cys Asp Gly Ala Gly Ala Ala Ser Ala 225 230 235 240
Leu Glu Thr Gly Val Ala Lys Leu Gly Asn Met Glu Ala Val Gin Phe
245 250 255
His Pro Thr Ala Leu Val Pro Ser Gly He Leu Met Thr Glu Gly Cys
260 265 270
Arg Gly Asp Gly Gly Val Leu Arg Asp Lys Phe Gly Arg Arg Phe Met
275 280 285
Pro Ala Tyr Glu Pro Glu Lys Lys Glu Leu Ala Ser Arg Asp Val Val
290 295 300
Ser Arg Arg He Leu Glu His He Gin Lys Gly Tyr Gly Ala Lys Ser 305 310 315 320
Pro Tyr Gly Asp His Val Trp Leu Asp He Ala He Leu Gly Arg Asn
325 330 335
His Val Glu Lys Asn Leu Arg Asp Val Arg Asp He Ala Met Thr Phe
340 345 350
Ala Gly He Asp Pro Ala Asp Ser Lys Glu Gin Thr Lys Asp Asn Met
355 360 365
Gin Gly Val Pro Ala Asn Glu Pro Glu Tyr Gly Gin Ala Met Ala Lys
370 375 380
Gin Lys Gly Trp He Pro He Lys Pro Met Gin His Tyr Ser Met Gly 385 390 395 400
Gly Val Arg Thr Asn Pro Lys Gly Glu Thr His Leu Lys Gly Leu Phe
405 410 415
Cys Ala Gly Glu Ala Ala Cys Trp Asp Leu His Gly Phe Asn Arg Leu
420 425 430
Gly Gly Asn Ser Val Ser Glu Ala Val Val Ala Gly Met He He Gly
435 440 445
Asp Tyr Phe Ala Ser His Cys Leu Glu Ala Gin He Glu He Asn Thr
450 455 460
Gin Lys Val Glu Ala Phe He Lys Glu Ser Gin Asp Tyr Met His Phe 465 470 475 480
Leu Leu His Asn Glu Gly Lys Glu Asp Val Tyr Glu He Arg Glu Arg
485 490 495
Met Lys Glu Val Met Asp Glu Lys Val Gly Val Phe Arg Glu Gly Lys
500 505 510
Arg Leu Glu Glu Ala Leu Lys Glu Leu Gin Glu Leu Tyr Ala Arg Ser
515 520 525
Lys Asn He Cys Val Lys Asn Lys Val Leu His Asn Asn Pro Glu Leu
530 535 540
Glu Asp Ala Tyr Arg Thr Lys Lys Met Leu Lys Leu Ala Leu Cys He 545 550 555 560
Thr Gin Gly Ala Leu Leu Arg Thr Glu Ser Arg Gly Ala His Thr Arg
565 570 575
He Asp Tyr Pro Lys Arg Asp Asp Glu Lys Trp Leu Asn Arg Thr Leu
580 585 590
Ala Ser Trp Pro Ser Ala Glu Gin Asp Met Pro Thr He Glu Tyr Glu
595 600 605
Glu Leu Asp Val Met Lys Met Glu He Ser Pro Asp Phe Arg Gly Tyr
610 615 620
Gly Lys Lys Gly Asn Phe He Pro His Pro Lys Lys Glu Glu Arg Asp 625 630 635 640
Ala Glu He Leu Lys Thr He Leu Glu Leu Glu Lys Leu Gly Lys Asp
645 650 655
Arg He Glu Val Gin His Ala Leu Met Pro Phe Glu Leu Gin Glu Lys 660 665 670
Tyr Lys Ala Arg Asn Met Arg Leu Glu Asp Glu Glu Val Arg Ala Arg
675 680 685
Gly Glu His Leu Tyr Ser Phe Asn Val His Glu Leu Leu Asp Gin His
690 695 700
Asn Ala Asn Leu Lys Gly Glu His His Glu 705 710
(2) INFORMATION FOR SEQ ID NO: 323:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 496 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 77...445 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323:
ATTTAGTTCA AGAGCTTTTA GAAGAATTTT TGCAAAGCGG GGCTAAAGAG ATTTTAGAAA 60 AGGCGCAGTT GTTTTA ATG CGT TTG TTT ATC GCG CTA GTT TTG TTT TGG TGG 112
Met Arg Leu Phe He Ala Leu Val Leu Phe Trp Trp 1 5 10
TGG TTA AGC TTG AAC GCT AAA GAA GCG GAT TTT ATC TCT GAT TTA GAA 160 Trp Leu Ser Leu Asn Ala Lys Glu Ala Asp Phe He Ser Asp Leu Glu 15 20 25
TAC GGG ATG GCT CTT TAT AAA AAC CCT AGG GGT GTT GCG TGC GCG AAA 208 Tyr Gly Met Ala Leu Tyr Lys Asn Pro Arg Gly Val Ala Cys Ala Lys 30 35 40
TGC CAT GGC ATT AAA GGC GAA CAA CAA GAA ATC ACC TTT TAT TAT GAA 256 Cys His Gly He Lys Gly Glu Gin Gin Glu He Thr Phe Tyr Tyr Glu 45 50 55 60
AAA GGC GAG AAA AAA ATC CTC TAC GCC CCT AAA ATC AAC CAT TTG GAT 304 Lys Gly Glu Lys Lys He Leu Tyr Ala Pro Lys He Asn His Leu Asp 65 70 75
TTT AAA ACC TTT AAA GAC GCC TTG AGT TTA GGC AAA GGC ATG ATG CCT 352 Phe Lys Thr Phe Lys Asp Ala Leu Ser Leu Gly Lys Gly Met Met Pro 80 85 90
AAA TAC AAT CTC AAT TTA GAA GAA ATC CAA GCG ATT TAT CTT TAT ATC 400 Lys Tyr Asn Leu Asn Leu Glu Glu He Gin Ala He Tyr Leu Tyr He 95 100 105
ATC TCT TTA GAG CAT AAA GAA GAG CGT AAG GAT TCT CCT AAG CCT TAATC 450 He Ser Leu Glu His Lys Glu Glu Arg Lys Asp Ser Pro Lys Pro 110 115 120
AAAGCGCTTG ATTTATGCTA AAATGGAGCG TTGCATTTTT GTTTTG 496
(2) INFORMATION FOR SEQ ID NO: 324:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 123 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324:
Met Arg Leu Phe He Ala Leu Val Leu Phe Trp Trp Trp Leu Ser Leu
1 5 10 15
Asn Ala Lys Glu Ala Asp Phe He Ser Asp Leu Glu Tyr Gly Met Ala
20 25 30
Leu Tyr Lys Asn Pro Arg Gly Val Ala Cys Ala Lys Cys His Gly He
35 40 45
Lys Gly Glu Gin Gin Glu He Thr Phe Tyr Tyr Glu Lys Gly Glu Lys
50 55 60
Lys He Leu Tyr Ala Pro Lys He Asn His Leu Asp Phe Lys Thr Phe 65 70 75 80
Lys Asp Ala Leu Ser Leu Gly Lys Gly Met Met Pro Lys Tyr Asn Leu
85 90 95
Asn Leu Glu Glu He Gin Ala He Tyr Leu Tyr He He Ser Leu Glu
100 105 110
His Lys Glu Glu Arg Lys Asp Ser Pro Lys Pro 115 120
(2) INFORMATION FOR SEQ ID NO: 325:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 521 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 72...464 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:325:
GTTGCAACAA AAAATAGAGA GCAGGAAAAC AGACATTGTG ATCCAATCCA TGGCGAATAT 60 TCTCAGCGGG A ATG AAT GAG CTT ATC CGC TAT GGC TTG ATA TTT CTC TTT 110 Met Asn Glu Leu He Arg Tyr Gly Leu He Phe Leu Phe 1 5 10
TTT TTA AAG GCG TTT GGG CTT GAT TAT GGG ATA GAT AAA ACG CTA GAA 158 Phe Leu Lys Ala Phe Gly Leu Asp Tyr Gly He Asp Lys Thr Leu Glu 15 20 25
TTA AAA AAA GAT GAA GTG TTT AAA GCG ATC ATC AAA GAC ACT TCA AAT 206 Leu Lys Lys Asp Glu Val Phe Lys Ala He He Lys Asp Thr Ser Asn 30 35 40 45
GAA CAA ACC AAA GAA ATC ACG CTC TAT TGG ACG CTA TAT GCA AAT AAA 254 Glu Gin Thr Lys Glu He Thr Leu Tyr Trp Thr Leu Tyr Ala Asn Lys 50 55 60
GGT TTA GTC ATC AAC ATG CGT TTT AAC CAT TTC CCT TAC CAG TTT ATT 302 Gly Leu Val He Asn Met Arg Phe Asn His Phe Pro Tyr Gin Phe He 65 70 75
TTA TAC ACC GAT CAT GCG AGA AAC ACC TAT AAT CTC AAA GTT TTT GAA 350 Leu Tyr Thr Asp His Ala Arg Asn Thr Tyr Asn Leu Lys Val Phe Glu 80 85 90
GAA AAA TTT TCT TCT AAC AGC ACT CTG TCG CTT GTG TTT AAA GAT TTT 398 Glu Lys Phe Ser Ser Asn Ser Thr Leu Ser Leu Val Phe Lys Asp Phe 95 100 105
AAA GAA GAT AAA GCC GCT TTA AGG CTT TTA GCC CTT ATG CCC CTT GTT 446 Lys Glu Asp Lys Ala Ala Leu Arg Leu Leu Ala Leu Met Pro Leu Val 110 115 120 125
TTT TCT CCT AAA GAG CCT TAAGGAATTT GCATGCAAGA AAAACAACTT AAAACCAT 502 Phe Ser Pro Lys Glu Pro 130
TCAAAATAAG ATCGCTTCC 521
(2) INFORMATION FOR SEQ ID NO: 326:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 131 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326:
Met Asn Glu Leu He Arg Tyr Gly Leu He Phe Leu Phe Phe Leu Lys 1 5 10 15
Ala Phe Gly Leu Asp Tyr Gly He Asp Lys Thr Leu Glu Leu Lys Lys
20 25 30
Asp Glu Val Phe Lys Ala He He Lys Asp Thr Ser Asn Glu Gin Thr
35 40 45
Lys Glu He Thr Leu Tyr Trp Thr Leu Tyr Ala Asn Lys Gly Leu Val
50 55 60
He Asn Met Arg Phe Asn His Phe Pro Tyr Gin Phe He Leu Tyr Thr 65 70 75 80
Asp His Ala Arg Asn Thr Tyr Asn Leu Lys Val Phe Glu Glu Lys Phe
85 90 95
Ser Ser Asn Ser Thr Leu Ser Leu Val Phe Lys Asp Phe Lys Glu Asp
100 105 110
Lys Ala Ala Leu Arg Leu Leu Ala Leu Met Pro Leu Val Phe Ser Pro
115 120 125
Lys Glu Pro 130
(2) INFORMATION FOR SEQ ID NO: 327:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 269 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 55...222 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:327:
GCTCCCTCTA AAAGGGTTTT TAAACTATCT TGAGATTTAC CCAATTTATA GGTG ATG 57
Met
1
CTT TCA AAA CTC CCA TTT ACT GGT GTT TTA GCC TTA GTT TTA AAG GCT 105 Leu Ser Lys Leu Pro Phe Thr Gly Val Leu Ala Leu Val Leu Lys Ala 5 10 15
GTC CAT GTT AGC TTA GCC GAA GAT AAA TCC AAA TTC ACC GCT TGC AAA 153 Val His Val Ser Leu Ala Glu Asp Lys Ser Lys Phe Thr Ala Cys Lys 20 25 30
AAC CCT GCT AGT AAA ACC GAT ACC AAA ACC ATT TTT TTC ATT CAT TAT 201 Asn Pro Ala Ser Lys Thr Asp Thr Lys Thr He Phe Phe He His Tyr 35 40 45
CCT TTA ATG TGG TCT TAT CAA TAACGCTTAT TATTTTAGTG TAAATAAGCA CGCT 256 Pro Leu Met Trp Ser Tyr Gin 50 55
TACAACTAAA ACG 269
(2) INFORMATION FOR SEQ ID NO: 328:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 56 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328:
Met Leu Ser Lys Leu Pro Phe Thr Gly Val Leu Ala Leu Val Leu Lys
1 5 10 15
Ala Val His Val Ser Leu Ala Glu Asp Lys Ser Lys Phe Thr Ala Cys
20 25 30
Lys Asn Pro Ala Ser Lys Thr Asp Thr Lys Thr He Phe Phe He His
35 40 45
Tyr Pro Leu Met Trp Ser Tyr Gin 50 55
(2) INFORMATION FOR SEQ ID NO: 329:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 671 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 66...611 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329:
AATGACTTTA GGGGATATTC TTAAAGAAAA ACTCTAAAGA GTGATTTTAA AAGCATGAGA 60 ATGGC ATG AGA TTT AAG GGT GTT GTT GCT TTT ATT TCC CTA GCT GTC GCT 110 Met Arg Phe Lys Gly Val Val Ala Phe He Ser Leu Ala Val Ala 1 5 10 15
CTT GGC GTT TTA GCC TAT TTG TTT TTA AGC GTT AAA AAA GAA ATG CCC 158 Leu Gly Val Leu Ala Tyr Leu Phe Leu Ser Val Lys Lys Glu Met Pro 20 25 30 GCT ACT TCT CAT GCG ATC TCT CAA ACA CAT GCG ATC TCT CAA ACC AAT 206 Ala Thr Ser His Ala He Ser Gin Thr His Ala He Ser Gin Thr Asn 35 40 45
GAA GGC CTC TCT CAA ACA GAT GCA AAA AGC CAT GAC ATC GAT CTA GAA 254 Glu Gly Leu Ser Gin Thr Asp Ala Lys Ser His Asp He Asp Leu Glu 50 55 60
GAA AAT AGC CCC ACT GAA ACC TCT CAT AAT GAA AAA GCC TCC CAT AAC 302 Glu Asn Ser Pro Thr Glu Thr Ser His Asn Glu Lys Ala Ser His Asn 65 70 75
GAA GAA GAT CAC AAT AAC GCC CTT TCT CAA AAT CTT GAT GCG CAA GAA 350 Glu Glu Asp His Asn Asn Ala Leu Ser Gin Asn Leu Asp Ala Gin Glu 80 85 90 95
TCT ATC AAT TAC CCC GTT GTG GAA CAT TAT TCT GAA ATC CCT TTT GAA 398 Ser He Asn Tyr Pro Val Val Glu His Tyr Ser Glu He Pro Phe Glu 100 105 110
GAA AAA AAA AGG GAA TAT TCA AAG CTT ATC ATT AAG GAT TTA AAG GAC 446 Glu Lys Lys Arg Glu Tyr Ser Lys Leu He He Lys Asp Leu Lys Asp 115 120 125
TAT CAA TGG TGG TGC TTA AAA GAA ATC CTC AAA AAA GAA CAG ATT GAT 494 Tyr Gin Trp Trp Cys Leu Lys Glu He Leu Lys Lys Glu Gin He Asp 130 135 140
TAC GCT TAC GAT AAC ACC AAA AAC CAA CCT AAC CTC ATC ATC TAT TTA 542 Tyr Ala Tyr Asp Asn Thr Lys Asn Gin Pro Asn Leu He He Tyr Leu 145 150 155
GAT GAA AAT AAA AAA GAA CGC TTG CTG GCT GAT TTA GAC TAT TAT AAA 590 Asp Glu Asn Lys Lys Glu Arg Leu Leu Ala Asp Leu Asp Tyr Tyr Lys 160 165 170 175
ATA CGC TAT CAT GCT GTT TTT TAAATTCAAA GGATAAAAAT GTATCAAGTA GCCA 645 He Arg Tyr His Ala Val Phe 180
TTTGCGACCC CATCCATGCT AAAGGC 671
(2) INFORMATION FOR SEQ ID NO: 330:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 182 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: Met Arg Phe Lys Gly Val Val Ala Phe He Ser Leu Ala Val Ala Leu
1 5 10 15
Gly Val Leu Ala Tyr Leu Phe Leu Ser Val Lys Lys Glu Met Pro Ala
20 25 30
Thr Ser His Ala He Ser Gin Thr His Ala He Ser Gin Thr Asn Glu
35 40 45
Gly Leu Ser Gin Thr Asp Ala Lys Ser His Asp He Asp Leu Glu Glu
50 55 60
Asn Ser Pro Thr Glu Thr Ser His Asn Glu Lys Ala Ser His Asn Glu 65 70 75 80
Glu Asp His Asn Asn Ala Leu Ser Gin Asn Leu Asp Ala Gin Glu Ser
85 90 95
He Asn Tyr Pro Val Val Glu His Tyr Ser Glu He Pro Phe Glu Glu
100 105 110
Lys Lys Arg Glu Tyr Ser Lys Leu He He Lys Asp Leu Lys Asp Tyr
115 120 125
Gin Trp Trp Cys Leu Lys Glu He Leu Lys Lys Glu Gin He Asp Tyr
130 135 140
Ala Tyr Asp Asn Thr Lys Asn Gin Pro Asn Leu He He Tyr Leu Asp 145 150 155 160
Glu Asn Lys Lys Glu Arg Leu Leu Ala Asp Leu Asp Tyr Tyr Lys He
165 170 175
Arg Tyr His Ala Val Phe 180
(2) INFORMATION FOR SEQ ID NO: 331:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 341 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 86...295 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331:
ACCCACAAAA CTAAAACCCA CTAACACAAT TAACCCTAAC AACACATAAA GATTGCCCAA 60 AGACGCGCAC AACACGCTCG CAACA ATG GTT GCA AAA ACA AAC ACA ATC CCC 112
Met Val Ala Lys Thr Asn Thr He Pro 1 5
CCC ATC GTA GGG GTA TCT TTT TTA TTC TGG TGG CTT GGC ACG AAG CTA 160 Pro He Val Gly Val Ser Phe Leu Phe Trp Trp Leu Gly Thr Lys Leu 10 15 20 25
GAA ATG GGC TGG TTA GCC TTT TTA GCC TTG GCC CAT AGA ATG AAT TTA 208 Glu Met Gly Trp Leu Ala Phe Leu Ala Leu Ala His Arg Met Asn Leu 30 35 40
GGC ATT AAA AAA AGC GTG AGA AAA AAA GCT ATG AAA AAC CCT AAC CCT 256 Gly He Lys Lys Ser Val Arg Lys Lys Ala Met Lys Asn Pro Asn Pro 45 50 55
GCT CTA AAA GTC AAA TAC TGG AAA AGA TTG ATA TTG AAA TAGCCATATA GT 307 Ala Leu Lys Val Lys Tyr Trp Lys Arg Leu He Leu Lys 60 65 70
AAAGAATAGA GCATAAAATC CCCTAAAATC GCCA 341
(2) INFORMATION FOR SEQ ID NO: 332:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 70 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:332:
Met Val Ala Lys Thr Asn Thr He Pro Pro He Val Gly Val Ser Phe
1 5 10 15
Leu Phe Trp Trp Leu Gly Thr Lys Leu Glu Met Gly Trp Leu Ala Phe
20 25 30
Leu Ala Leu Ala His Arg Met Asn Leu Gly He Lys Lys Ser Val Arg
35 40 45
Lys Lys Ala Met Lys Asn Pro Asn Pro Ala Leu Lys Val Lys Tyr Trp
50 55 60
Lys Arg Leu He Leu Lys 65 70
(2) INFORMATION FOR SEQ ID NO: 333
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2481 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 58...2430 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333:
TTTTTTTTTT TTTTTGATTT TTATTTTTTA AATTTTTAGA TTAAGGAGAG TTGTTGG ATG 60
Met
1
TTT TTA AGA GTA TAC CCA AAG CTT AGA TAC GCT TTA TGT TTC CCC CTA 108 Phe Leu Arg Val Tyr Pro Lys Leu Arg Tyr Ala Leu Cys Phe Pro Leu 5 10 15
CTC GCT GAG ACT TGC TAT AGC GAA GAG CGG ACT TTA AAT AAG GTT ACC 156 Leu Ala Glu Thr Cys Tyr Ser Glu Glu Arg Thr Leu Asn Lys Val Thr 20 25 30
ACC CAA GCT AAA AGG ATT TTC ACT TAC AAC AAT GAG TTT AAA GTA ACT 204 Thr Gin Ala Lys Arg He Phe Thr Tyr Asn Asn Glu Phe Lys Val Thr 35 40 45
TCT AAA GAA CTA GAT CAA CGC CAA AGC AAT GAA GTC AAG GAC TTG TTT 252 Ser Lys Glu Leu Asp Gin Arg Gin Ser Asn Glu Val Lys Asp Leu Phe 50 55 60 65
AGG ACT AAC CCT GAT GTG AAT GTG GGC GGA GGG AGC GTG ATG GGG CAG 300 Arg Thr Asn Pro Asp Val Asn Val Gly Gly Gly Ser Val Met Gly Gin 70 75 80
AAA ATC TAT GTG AGA GGC GTT GAA GAC AGG CTT TTA AGG GTT ACA GTG 348 Lys He Tyr Val Arg Gly Val Glu Asp Arg Leu Leu Arg Val Thr Val 85 90 95
GAT GGG GCT GCA CAA AAT GGC AAT ATC TAC CAC CAC CAA GGC AAC ACC 396 Asp Gly Ala Ala Gin Asn Gly Asn He Tyr His His Gin Gly Asn Thr 100 105 110
GTG ATT GAC CCT GGC ATG CTC AAA AGC GTG GAA GTT ACC AAA GGC GCG 444 Val He Asp Pro Gly Met Leu Lys Ser Val Glu Val Thr Lys Gly Ala 115 120 125
GCG AAT GCG AGC GCG GGG CCA GGA GCG ATT GCG GGA GTG ATT AAA ATG 492 Ala Asn Ala Ser Ala Gly Pro Gly Ala He Ala Gly Val He Lys Met 130 135 140 145
GAG ACT AAA GGA GCG GCT GAT TTT ATC CCT AGG GGG AAA AAT TAT GCT 540 Glu Thr Lys Gly Ala Ala Asp Phe He Pro Arg Gly Lys Asn Tyr Ala 150 155 160
GCC AGT GGG GCG GTG AGT TTT TAT ACC AAT TTT GGC GAT CGA GAG ACT 588 Ala Ser Gly Ala Val Ser Phe Tyr Thr Asn Phe Gly Asp Arg Glu Thr 165 170 175
TTC AGA TCG GCT TAT CAA AAC GCG CAT TTT GAT ATT ATC GCT TAC TAC 636 Phe Arg Ser Ala Tyr Gin Asn Ala His Phe Asp He He Ala Tyr Tyr 180 185 190
ACG CAC CAA AAC ATC TTC TAT TAT AGA AGC GGC GCT ACA GCG ATG AAA 684 Thr His Gin Asn He Phe Tyr Tyr Arg Ser Gly Ala Thr Ala Met Lys 195 200 205
AAC CTT TTC AAT CCC ACA CAA GCC GAT AAA GAG CCA GGA ACT CCT AGC 732 Asn Leu Phe Asn Pro Thr Gin Ala Asp Lys Glu Pro Gly Thr Pro Ser 210 215 220 225
GAG CAA AAC AAC GCT TTG ATT AAA ATG AAT GGT TAT TTG AGC GAC AGA 780 Glu Gin Asn Asn Ala Leu He Lys Met Asn Gly Tyr Leu Ser Asp Arg 230 235 240
GAC ACG CTC ACT TTC AGC TGG AAC ATG ACA CGA GAT AAC GCT ACA CGC 828 Asp Thr Leu Thr Phe Ser Trp Asn Met Thr Arg Asp Asn Ala Thr Arg 245 250 255
CCT TTA AGG AGT AAC GCT ATA GGG TTA GCC TAT CCT TGT GAA GCC CCC 876 Pro Leu Arg Ser Asn Ala He Gly Leu Ala Tyr Pro Cys Glu Ala Pro 260 265 270
TTT AGT CCT GAT AGT TCT CAA GGG TGT CCT AAT GTG TTA GAT AGT TTC 924 Phe Ser Pro Asp Ser Ser Gin Gly Cys Pro Asn Val Leu Asp Ser Phe 275 280 285
ACA AGA TAC ATG TAT CAC TCT ATT AAT AGT GCC AAC AAT CTT TCC TTA 972 Thr Arg Tyr Met Tyr His Ser He Asn Ser Ala Asn Asn Leu Ser Leu 290 295 300 305
CAA TAC AAA AGG GAA GCG GGA AAT TCT TTT GGC GAC CCA CGA TTA GAT 1020 Gin Tyr Lys Arg Glu Ala Gly Asn Ser Phe Gly Asp Pro Arg Leu Asp 310 315 320
TTT ACC CTT TAT ACA AGC ATC AGG AAC GCT CAG TTT GAT CCC CTA TTT 1068 Phe Thr Leu Tyr Thr Ser He Arg Asn Ala Gin Phe Asp Pro Leu Phe 325 330 335
GAT CCT AAT GGC GTT TAT GCT AAA TTC CCC ACT TCT TTA GCG AGC GCA 1116 Asp Pro Asn Gly Val Tyr Ala Lys Phe Pro Thr Ser Leu Ala Ser Ala 340 345 350
TGG GAA AAA GAA AAT TAC CCA TGC GTT GAA GGC GCT TAT TGC ACC CCA 1164 Trp Glu Lys Glu Asn Tyr Pro Cys Val Glu Gly Ala Tyr Cys Thr Pro 355 360 365
AGC TTT TCA GAT GTG GAT AAA CCA AGC TCA CAG CCT AGG AAT TTG TTT 1212 Ser Phe Ser Asp Val Asp Lys Pro Ser Ser Gin Pro Arg Asn Leu Phe 370 375 380 385
TTA AAC AAC ACC GGC TTA AAC CTT AAA GTC GCG CAT GTG ATT GAT GAA 1260 Leu Asn Asn Thr Gly Leu Asn Leu Lys Val Ala His Val He Asp Glu 390 395 400
GCC ACA GAC AGC CTT TTT GAA TAC GGA TTC AAC TAC CAA AAT TTG AGC 1308 Ala Thr Asp Ser Leu Phe Glu Tyr Gly Phe Asn Tyr Gin Asn Leu Ser 405 410 415 GTT TTT GAC GCT CGC ATC CCT AAA TCA GAA TTA TAC AGG CCT AAT CAA 1356 Val Phe Asp Ala Arg He Pro Lys Ser Glu Leu Tyr Arg Pro Asn Gin 420 425 430
GTT TAT ACT GAT GAT AAA GGA CAA AAA CAA ATC GCT TGC TCT CTT GTG 1404 Val Tyr Thr Asp Asp Lys Gly Gin Lys Gin He Ala Cys Ser Leu Val 435 440 445
AAT AAT AAC CCC AAT GAC CCC ACT CTG TGC CAA AGA GGG AAA GCG AAC 1452 Asn Asn Asn Pro Asn Asp Pro Thr Leu Cys Gin Arg Gly Lys Ala Asn 450 455 460 465
GGG AAT ATT TAT GGA GGC TAC GTG CAA GCG AAT TAC TCG CCT CAT AAA 1500 Gly Asn He Tyr Gly Gly Tyr Val Gin Ala Asn Tyr Ser Pro His Lys 470 475 480
ATC ATC ACT TTT GGA GCC GGG GTA AGG TGG GAC GCT TAC ACG CTT TAT 1548 He He Thr Phe Gly Ala Gly Val Arg Trp Asp Ala Tyr Thr Leu Tyr 485 490 495
GAT AAA GAC TGG AAC CAC CGC TAC ACT CAA GGC TTT AGC CCT AGC GCG 1596 Asp Lys Asp Trp Asn His Arg Tyr Thr Gin Gly Phe Ser Pro Ser Ala 500 505 510
GCT CTT GTG CTA AGC CCC ATT GAG CCT TTA TCT TTA AAA ATC ACT TAT 1644 Ala Leu Val Leu Ser Pro He Glu Pro Leu Ser Leu Lys He Thr Tyr 515 520 525
TCT CAA GTT ACA AGA GGG GTG ATG CCA GGA GAT GGC GTG TAC ATG CGT 1692 Ser Gin Val Thr Arg Gly Val Met Pro Gly Asp Gly Val Tyr Met Arg 530 535 540 545
CAA AAC GAT TTA CGA TAC GCC AAA AAC ATC AAG CCT GAA GTG GGC TCT 1740 Gin Asn Asp Leu Arg Tyr Ala Lys Asn He Lys Pro Glu Val Gly Ser 550 555 560
AAC GCT GAA TTT AAT ATT GAT TAT TCA AGC CAG TAT TTT AGC GGG AGG 1788 Asn Ala Glu Phe Asn He Asp Tyr Ser Ser Gin Tyr Phe Ser Gly Arg 565 570 575
GCT GCG GCG TTT TAT CAG GCT TTG GAT AAT TTC ATC TCA CAA TAC GCA 1836 Ala Ala Ala Phe Tyr Gin Ala Leu Asp Asn Phe He Ser Gin Tyr Ala 580 585 590
CAA AAT TTG ATT GTA ACC AAT TTG AGT CAA GCG ATT CGT ATT TAT GGC 1884 Gin Asn Leu He Val Thr Asn Leu Ser Gin Ala He Arg He Tyr Gly 595 600 605
TAT GAA GTG GGT GGG ACT TTC AGA TAC AAG GGC GTG AGT TTG AAT GTA 1932 Tyr Glu Val Gly Gly Thr Phe Arg Tyr Lys Gly Val Ser Leu Asn Val 610 615 620 625
GGG GTC TCG CGC ACC TGG CCC ACC ACT AGG GGG TAT TTA ATG GCG GAT 1980 Gly Val Ser Arg Thr Trp Pro Thr Thr Arg Gly Tyr Leu Met Ala Asp 630 635 640 AGC TAT GAG CTT GCC GCA AGC ACC GGT AAT GTT TTT ATC ATC AAA TTG 2028 Ser Tyr Glu Leu Ala Ala Ser Thr Gly Asn Val Phe He He Lys Leu 645 650 655
GAT TAC ACC ATC CCC AAA ACA GGG ATC AAT CTT GCA TGG CTT AGC CGC 2076 Asp Tyr Thr He Pro Lys Thr Gly He Asn Leu Ala Trp Leu Ser Arg 660 665 670
TTT GTT ACC GGT TTA GAT TAT TGC GGG TTT GAT ATT TAC TTG CCT GAT 2124 Phe Val Thr Gly Leu Asp Tyr Cys Gly Phe Asp He Tyr Leu Pro Asp 675 680 685
TAT GGG ACG GCT GAG AAA CCC AAA ACC CCT ACC GAT TTA GCC AAA TGC 2172 Tyr Gly Thr Ala Glu Lys Pro Lys Thr Pro Thr Asp Leu Ala Lys Cys 690 695 700 705
GGA TCT CAA TTA GGG TTA GTG CAT ATG CAT AAA CCG GGC TAT GGC GTG 2220 Gly Ser Gin Leu Gly Leu Val His Met His Lys Pro Gly Tyr Gly Val 710 715 720
AGT AAT TTT TAT ATC AAT TGG AGT CCT AAA ACC AAA AGC CGC TGG AAG 2268 Ser Asn Phe Tyr He Asn Trp Ser Pro Lys Thr Lys Ser Arg Trp Lys 725 730 735
GGT TTG TTG CTT TCA GCC GTG TTT AAT AAT GTT TTC AAC AAA TTC TAT 2316 Gly Leu Leu Leu Ser Ala Val Phe Asn Asn Val Phe Asn Lys Phe Tyr 740 745 750
GTG GAT CAA ACA AGC CCT TAT GTC ATG AGC CCG GAT ATG CCA GGC ACT 2364 Val Asp Gin Thr Ser Pro Tyr Val Met Ser Pro Asp Met Pro Gly Thr 755 760 765
GAC GCT GTT AAA AGA GCG ATC GCT GAG CCT GGG TTT AAC GCG CGT TTT 2412 Asp Ala Val Lys Arg Ala He Ala Glu Pro Gly Phe Asn Ala Arg Phe 770 775 780 785
GAA GTG GCT TAC AAA TGG TAGTTAATGG AGCTTTAAGC GTTGCGCATG CGTGATAG 2468 Glu Val Ala Tyr Lys Trp 790
CAACGGCTAT CGC 2481
(2) INFORMATION FOR SEQ ID NO: 334:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 791 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:334: Met Phe Leu Arg Val Tyr Pro Lys Leu Arg Tyr Ala Leu Cys Phe Pro
1 5 10 15
Leu Leu Ala Glu Thr Cys Tyr Ser Glu Glu Arg Thr Leu Asn Lys Val
20 25 30
Thr Thr Gin Ala Lys Arg He Phe Thr Tyr Asn Asn Glu Phe Lys Val
35 40 45
Thr Ser Lys Glu Leu Asp Gin Arg Gin Ser Asn Glu Val Lys Asp Leu
50 55 60
Phe Arg Thr Asn Pro Asp Val Asn Val Gly Gly Gly Ser Val Met Gly 65 70 75 80
Gin Lys He Tyr Val Arg Gly Val Glu Asp Arg Leu Leu Arg Val Thr
85 90 95
Val Asp Gly Ala Ala Gin Asn Gly Asn He Tyr His His Gin Gly Asn
100 105 110
Thr Val He Asp Pro Gly Met Leu Lys Ser Val Glu Val Thr Lys Gly
115 120 125
Ala Ala Asn Ala Ser Ala Gly Pro Gly Ala He Ala Gly Val He Lys
130 135 140
Met Glu Thr Lys Gly Ala Ala Asp Phe He Pro Arg Gly Lys Asn Tyr 145 150 155 160
Ala Ala Ser Gly Ala Val Ser Phe Tyr Thr Asn Phe Gly Asp Arg Glu
165 170 175
Thr Phe Arg Ser Ala Tyr Gin Asn Ala His Phe Asp He He Ala Tyr
180 185 190
Tyr Thr His Gin Asn He Phe Tyr Tyr Arg Ser Gly Ala Thr Ala Met
195 200 205
Lys Asn Leu Phe Asn Pro Thr Gin Ala Asp Lys Glu Pro Gly Thr Pro
210 215 220
Ser Glu Gin Asn Asn Ala Leu He Lys Met Asn Gly Tyr Leu Ser Asp 225 230 235 240
Arg Asp Thr Leu Thr Phe Ser Trp Asn Met Thr Arg Asp Asn Ala Thr
245 250 255
Arg Pro Leu Arg Ser Asn Ala He Gly Leu Ala Tyr Pro Cys Glu Ala
260 265 270
Pro Phe Ser Pro Asp Ser Ser Gin Gly Cys Pro Asn Val Leu Asp Ser
275 280 285
Phe Thr Arg Tyr Met Tyr His Ser He Asn Ser Ala Asn Asn Leu Ser
290 295 300
Leu Gin Tyr Lys Arg Glu Ala Gly Asn Ser Phe Gly Asp Pro Arg Leu 305 310 315 320
Asp Phe Thr Leu Tyr Thr Ser He Arg Asn Ala Gin Phe Asp Pro Leu
325 330 335
Phe Asp Pro Asn Gly Val Tyr Ala Lys Phe Pro Thr Ser Leu Ala Ser
340 345 350
Ala Trp Glu Lys Glu Asn Tyr Pro Cys Val Glu Gly Ala Tyr Cys Thr
355 360 365
Pro Ser Phe Ser Asp Val Asp Lys Pro Ser Ser Gin Pro Arg Asn Leu
370 375 380
Phe Leu Asn Asn Thr Gly Leu Asn Leu Lys Val Ala His Val He Asp 385 390 395 400
Glu Ala Thr Asp Ser Leu Phe Glu Tyr Gly Phe Asn Tyr Gin Asn Leu
405 410 415
Ser Val Phe Asp Ala Arg He Pro Lys Ser Glu Leu Tyr Arg Pro Asn
420 425 430
Gin Val Tyr Thr Asp Asp Lys Gly Gin Lys Gin He Ala Cys Ser Leu 435 440 445
Val Asn Asn Asn Pro Asn Asp Pro Thr Leu Cys Gin Arg Gly Lys Ala
450 455 460
Asn Gly Asn He Tyr Gly Gly Tyr Val Gin Ala Asn Tyr Ser Pro His 465 470 475 480
Lys He He Thr Phe Gly Ala Gly Val Arg Trp Asp Ala Tyr Thr Leu
485 490 495
Tyr Asp Lys Asp Trp Asn His Arg Tyr Thr Gin Gly Phe Ser Pro Ser
500 505 510
Ala Ala Leu Val Leu Ser Pro He Glu Pro Leu Ser Leu Lys He Thr
515 520 525
Tyr Ser Gin Val Thr Arg Gly Val Met Pro Gly Asp Gly Val Tyr Met
530 535 540
Arg Gin Asn Asp Leu Arg Tyr Ala Lys Asn He Lys Pro Glu Val Gly 545 550 555 560
Ser Asn Ala Glu Phe Asn He Asp Tyr Ser Ser Gin Tyr Phe Ser Gly
565 570 575
Arg Ala Ala Ala Phe Tyr Gin Ala Leu Asp Asn Phe He Ser Gin Tyr
580 585 590
Ala Gin Asn Leu He Val Thr Asn Leu Ser Gin Ala He Arg He Tyr
595 600 605
Gly Tyr Glu Val Gly Gly Thr Phe Arg Tyr Lys Gly Val Ser Leu Asn
610 615 620
Val Gly Val Ser Arg Thr Trp Pro Thr Thr Arg Gly Tyr Leu Met Ala 625 630 635 640
Asp Ser Tyr Glu Leu Ala Ala Ser Thr Gly Asn Val Phe He He Lys
645 650 655
Leu Asp Tyr Thr He Pro Lys Thr Gly He Asn Leu Ala Trp Leu Ser
660 665 670
Arg Phe Val Thr Gly Leu Asp Tyr Cys Gly Phe Asp He Tyr Leu Pro
675 680 685
Asp Tyr Gly Thr Ala Glu Lys Pro Lys Thr Pro Thr Asp Leu Ala Lys
690 695 700
Cys Gly Ser Gin Leu Gly Leu Val His Met His Lys Pro Gly Tyr Gly 705 710 715 720
Val Ser Asn Phe Tyr He Asn Trp Ser Pro Lys Thr Lys Ser Arg Trp
725 730 735
Lys Gly Leu Leu Leu Ser Ala Val Phe Asn Asn Val Phe Asn Lys Phe
740 745 750
Tyr Val Asp Gin Thr Ser Pro Tyr Val Met Ser Pro Asp Met Pro Gly
755 760 765
Thr Asp Ala Val Lys Arg Ala He Ala Glu Pro Gly Phe Asn Ala Arg
770 775 780
Phe Glu Val Ala Tyr Lys Trp 785 790
(2) INFORMATION FOR SEQ ID NO: 335:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 477 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 120...428 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:335:
GTCGGTCGGG TAATGTTCAA ATTCACAAAT GAGTCTGAAG ACAAAGAAGT CTTGATCTAG 60 AAGCCGAAAT TCTCATACCG CACTTAGAAT TGCGTCAAAA ACAAATTGAT GCGCTGTTG 119
GTG CAC GAT ATT ACC AAG CTA TGT TAC ACC AAA CCA CTA GGG TGT GTT 167 Val His Asp He Thr Lys Leu Cys Tyr Thr Lys Pro Leu Gly Cys Val 1 5 10 15
GTG CTG TTC AGC AAG GAT ACT GAT CTT GTG CCT GTG TTA GAA TCC GCT 215 Val Leu Phe Ser Lys Asp Thr Asp Leu Val Pro Val Leu Glu Ser Ala 20 25 30
TGG GAG AAA GGC TTT GAA GTC TTC ATT GCT AAC ATT CAA GAA TGC CCC 263 Trp Glu Lys Gly Phe Glu Val Phe He Ala Asn He Gin Glu Cys Pro 35 40 45
AAT TCT GTC CCT TCA GAC TTG AAG AAG TCT TGC AAT GTG AGG GAA CGC 311 Asn Ser Val Pro Ser Asp Leu Lys Lys Ser Cys Asn Val Arg Glu Arg 50 55 60
AGT GTC GCT GAA ATT GTA GAT AAC TTG CCC AAA AAT CAG CAC ACT CCC 359 Ser Val Ala Glu He Val Asp Asn Leu Pro Lys Asn Gin His Thr Pro 65 70 75 80
AAG AAA AAG AAC TTT TCC ACC AAC GAG CCT TTT AAC AAC CCA TTT AAA 407 Lys Lys Lys Asn Phe Ser Thr Asn Glu Pro Phe Asn Asn Pro Phe Lys 85 90 95
GAC CAA CTC TTT AAG AAG AAC TAACACGATC CCCACACCAA GGGGACAAAA AAGCA 463 Asp Gin Leu Phe Lys Lys Asn 100
CCCATTTTAA AAGG 477
(2) INFORMATION FOR SEQ ID NO: 336:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 103 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336:
Val His Asp He Thr Lys Leu Cys Tyr Thr Lys Pro Leu Gly Cys Val
1 5 10 15
Val Leu Phe Ser Lys Asp Thr Asp Leu Val Pro Val Leu Glu Ser Ala
20 25 30
Trp Glu Lys Gly Phe Glu Val Phe He Ala Asn He Gin Glu Cys Pro
35 40 45
Asn Ser Val Pro Ser Asp Leu Lys Lys Ser Cys Asn Val Arg Glu Arg
50 55 60
Ser Val Ala Glu He Val Asp Asn Leu Pro Lys Asn Gin His Thr Pro 65 70 75 80
Lys Lys Lys Asn Phe Ser Thr Asn Glu Pro Phe Asn Asn Pro Phe Lys
85 90 95
Asp Gin Leu Phe Lys Lys Asn 100
(2) INFORMATION FOR SEQ ID NO: 337:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 685 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 220...624 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:337:
TCTTTTGAAA TTGCCTGATG TGGAAAAAGA AATGCCCAAA GAGACGACTC AAAAAAGCTT 60
GTTTTCGCAC AAACACTTTG TTTTTGGGGC TTGGGGATCT TTTTTTATGT GGGGGGAGAA 120
NTGGCGATTG GCTCATTCTT GGTGCTAAGC TTTGAAAAGC TTTTGAATTT AGACTCTCAA 180
TCAAGCGCGC ATTACTTGGT GTATTATTGG GGAGGCGCG ATG GTG GGC CGT TTC 234
Met Val Gly Arg Phe 1 5
TTA GGC AGT GTG TTG ATG AAT AAA ATT GCC CCT AAT AAA TAC TTG GCT 282 Leu Gly Ser Val Leu Met Asn Lys He Ala Pro Asn Lys Tyr Leu Ala 10 15 20
TTC AAC GCC TTA AGC TCT ATT GTT CTC ATC GCT TTA GCC ATT ATC ATT 330 Phe Asn Ala Leu Ser Ser He Val Leu He Ala Leu Ala He He He 25 30 35
GGA GGC AAG ATC GCT TTA TTC GCT CTG ACT TTT GTG GGC TTT TTC AAC 378 Gly Gly Lys He Ala Leu Phe Ala Leu Thr Phe Val Gly Phe Phe Asn 40 45 50 TCT ATC ATG TTC CCT ACC ATC TTT TCT TTG GCT ACG CTC AAT TTA GGG 426 Ser He Met Phe Pro Thr He Phe Ser Leu Ala Thr Leu Asn Leu Gly 55 60 65
CAT CTC ACT TCT AAA GCT TCT GGG GTG ATT AGC ATG GCG ATT GTG GGA 474 His Leu Thr Ser Lys Ala Ser Gly Val He Ser Met Ala He Val Gly 70 75 80 85
GGG GCG TTA ATC CCC CCC ATT CAA GGT GCG GTT ACA GAC ATG CTA ACA 522 Gly Ala Leu He Pro Pro He Gin Gly Ala Val Thr Asp Met Leu Thr 90 95 100
GCA ACC GAA TCA AAT TTG CTC TAC GCT TAT GGT GTG CCG TTG TTG TGC 570 Ala Thr Glu Ser Asn Leu Leu Tyr Ala Tyr Gly Val Pro Leu Leu Cys 105 110 115
TAT TTT TAT ATT CTC TTC TTT GCG CTT AAA GGG TAT AAG CAA GAA GAA 618 Tyr Phe Tyr He Leu Phe Phe Ala Leu Lys Gly Tyr Lys Gin Glu Glu 120 125 130
AAC TCC TAAAAAAAGG GGGGGTTTCT TTCTTCTTTC CTTTCTTTTA TCTTGTTTTA AA 676 Asn Ser 135
AATCAGTAA 685
(2) INFORMATION FOR SEQ ID NO: 338:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 135 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338:
Met Val Gly Arg Phe Leu Gly Ser Val Leu Met Asn Lys He Ala Pro
1 5 10 15
Asn Lys Tyr Leu Ala Phe Asn Ala Leu Ser Ser He Val Leu He Ala
20 25 30
Leu Ala He He He Gly Gly Lys He Ala Leu Phe Ala Leu Thr Phe
35 40 45
Val Gly Phe Phe Asn Ser He Met Phe Pro Thr He Phe Ser Leu Ala
50 55 60
Thr Leu Asn Leu Gly His Leu Thr Ser Lys Ala Ser Gly Val He Ser 65 70 75 80
Met Ala He Val Gly Gly Ala Leu He Pro Pro He Gin Gly Ala Val
85 90 95
Thr Asp Met Leu Thr Ala Thr Glu Ser Asn Leu Leu Tyr Ala Tyr Gly
100 105 110
Val Pro Leu Leu Cys Tyr Phe Tyr He Leu Phe Phe Ala Leu Lys Gly 115 120 125 Tyr Lys Gin Glu Glu Asn Ser 130 135
(2) INFORMATION FOR SEQ ID NO: 339:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 809 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 58...765 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339:
ACCGATCACT AAAACCAATG TAACTTACCG CTCTTTACAG CGTAAGTGAG AAAAGGA ATG 60
Met 1
CAT TTG AAT ACG GAT TTT AGC CAT ATC ACC GAT ATA GAG GGC ATG CGT 108 His Leu Asn Thr Asp Phe Ser His He Thr Asp He Glu Gly Met Arg 5 10 15
TTT ATC AAT GAA GAA GAC GCT TTG AAC AAA TTG ATT AAT GAA ATC CAC 156 Phe He Asn Glu Glu Asp Ala Leu Asn Lys Leu He Asn Glu He His 20 25 30
ACG CGC CAC ATT GAT TTA AAA GAT TCC ATC ATG CTC GCT TTG AGT TTT 204 Thr Arg His He Asp Leu Lys Asp Ser He Met Leu Ala Leu Ser Phe 35 40 45
AAC GCT CTG TAT TTA GCT CAC GCT TTA GCG CAA AAA TTT GGA GCG ACT 252 Asn Ala Leu Tyr Leu Ala His Ala Leu Ala Gin Lys Phe Gly Ala Thr 50 55 60 65
TAT GAT ATA CTT TTT TTA GAA CCT ATC CTA GCC CCT TTA AAC TCA AAA 300 Tyr Asp He Leu Phe Leu Glu Pro He Leu Ala Pro Leu Asn Ser Lys 70 75 80
TGC GAG ATC GCT TTA GTG AGT GAG AGC ATG GAT ATA GTG ATG AAT GAA 348 Cys Glu He Ala Leu Val Ser Glu Ser Met Asp He Val Met Asn Glu 85 90 95
AGT TTG ATC AAT TCC TTT GAC ATC ACT TTA GAC TAT GTT TAT GGG GAA 396 Ser Leu He Asn Ser Phe Asp He Thr Leu Asp Tyr Val Tyr Gly Glu 100 105 110 GCC AAG CGA GCT TAT GAA GAA GAC ATT TTG TCT CAC ATC TAT CAG TAT 444 Ala Lys Arg Ala Tyr Glu Glu Asp He Leu Ser His He Tyr Gin Tyr 115 120 125
CGC AAA GGC AAT GCG ATC AAA AGC TTA AAA GAT AAA AAT ATT TTT ATC 492 Arg Lys Gly Asn Ala He Lys Ser Leu Lys Asp Lys Asn He Phe He 130 135 140 145
GTA GAT AGG GGG ATT GAA ACC GGG TTT AGA GCA GGG TTA GGC GTG CAA 540 Val Asp Arg Gly He Glu Thr Gly Phe Arg Ala Gly Leu Gly Val Gin 150 155 160
ACT TGC TTG AAA AAA GAA TGC CAA GAC ATT TAT ATT TTA ACC CCC ATT 588 Thr Cys Leu Lys Lys Glu Cys Gin Asp He Tyr He Leu Thr Pro He 165 170 175
GTC GCG CAA AAT GTC GCT CAA GGC TTA GAA AGT TTG TGC GAT GGG GTG 636 Val Ala Gin Asn Val Ala Gin Gly Leu Glu Ser Leu Cys Asp Gly Val 180 185 190
ATT AGT GTG TAT CGC CCT GAA TGT TTT GTC TCT GTG GAG CAT CAT TAT 684 He Ser Val Tyr Arg Pro Glu Cys Phe Val Ser Val Glu His His Tyr 195 200 205
AAA GAA CTC AAG CGA TTA AGC AAT GAA GAA GTT GAA AAA TAC TTG GGC 732 Lys Glu Leu Lys Arg Leu Ser Asn Glu Glu Val Glu Lys Tyr Leu Gly 210 215 220 225
GCT AAC AAC ATG CCT AAT TTA AAA AAG GAA CAT TAAATATGGA TTTTATCACC 785 Ala Asn Asn Met Pro Asn Leu Lys Lys Glu His 230 235
ATCAATTCTA GTAACAAAAC CGAA 809
(2) INFORMATION FOR SEQ ID NO: 340:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 236 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340:
Met His Leu Asn Thr Asp Phe Ser His He Thr Asp He Glu Gly Met
1 5 10 15
Arg Phe He Asn Glu Glu Asp Ala Leu Asn Lys Leu He Asn Glu He
20 25 30
His Thr Arg His He Asp Leu Lys Asp Ser He Met Leu Ala Leu Ser
35 40 45
Phe Asn Ala Leu Tyr Leu Ala His Ala Leu Ala Gin Lys Phe Gly Ala 50 55 60
Thr Tyr Asp He Leu Phe Leu Glu Pro He Leu Ala Pro Leu Asn Ser 65 70 75 80
Lys Cys Glu He Ala Leu Val Ser Glu Ser Met Asp He Val Met Asn
85 90 95
Glu Ser Leu He Asn Ser Phe Asp He Thr Leu Asp Tyr Val Tyr Gly
100 105 110
Glu Ala Lys Arg Ala Tyr Glu Glu Asp He Leu Ser His He Tyr Gin
115 120 125
Tyr Arg Lys Gly Asn Ala He Lys Ser Leu Lys Asp Lys Asn He Phe
130 135 140
He Val Asp Arg Gly He Glu Thr Gly Phe Arg Ala Gly Leu Gly Val 145 150 155 160
Gin Thr Cys Leu Lys Lys Glu Cys Gin Asp He Tyr He Leu Thr Pro
165 170 175
He Val Ala Gin Asn Val Ala Gin Gly Leu Glu Ser Leu Cys Asp Gly
180 185 190
Val He Ser Val Tyr Arg Pro Glu Cys Phe Val Ser Val Glu His His
195 200 205
Tyr Lys Glu Leu Lys Arg Leu Ser Asn Glu Glu Val Glu Lys Tyr Leu
210 215 220
Gly Ala Asn Asn Met Pro Asn Leu Lys Lys Glu His 225 230 235
(2) INFORMATION FOR SEQ ID NO: 341:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 325 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 70...285 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341:
TAACACAAGC CACCATGAGC ATACTATCGC CATAGTTGGC AATAAAGCAG TGATTCTTAC 60 GGAGCGTTA ATG GCA AGA GAT GAT GTT ATA GAA GTG GAT GGG AAA GTG ATT 111 Met Ala Arg Asp Asp Val He Glu Val Asp Gly Lys Val He 1 5 10
GAG GCG TTG CCT AAC GCT ACT TTT AAG GTG GAG TTA GAC AAT AAG CAT 159 Glu Ala Leu Pro Asn Ala Thr Phe Lys Val Glu Leu Asp Asn Lys His 15 20 25 30
GTG GTG TTG TGC CGT ATT TCT GGA AAG ATG CGC ATG CAC TAT ATT AGG 207 Val Val Leu Cys Arg He Ser Gly Lys Met Arg Met His Tyr He Arg 35 40 45
ATT GCT TTA GGC GAT AGG GTT AAG CTA GAG CTT ACG CCC TAT AGC TTA 255 He Ala Leu Gly Asp Arg Val Lys Leu Glu Leu Thr Pro Tyr Ser Leu 50 55 60
GAC AAA GGT CGG ATA ACT TTT AGA TAT AAA TGAATTTAAG GGTTATTTCA ATG 308 Asp Lys Gly Arg He Thr Phe Arg Tyr Lys 65 70
AAAATATGTT AATATAA 325
(2) INFORMATION FOR SEQ ID NO: 342:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 72 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342:
Met Ala Arg Asp Asp Val He Glu Val Asp Gly Lys Val He Glu Ala
1 5 10 15
Leu Pro Asn Ala Thr Phe Lys Val Glu Leu Asp Asn Lys His Val Val
20 25 30
Leu Cys Arg He Ser Gly Lys Met Arg Met His Tyr He Arg He Ala
35 40 45
Leu Gly Asp Arg Val Lys Leu Glu Leu Thr Pro Tyr Ser Leu Asp Lys
50 55 60
Gly Arg He Thr Phe Arg Tyr Lys 65 70
(2) INFORMATION FOR SEQ ID NO: 343:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 360 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...309 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:343: ATCGCTCAAA TTTCAACGAC CATGCTTGTT AAAAAAAACT AAAGGAATGT T ATG CAA 57
Met Gin 1
GAT GAA TTA TTT GAA ACC GAA AAA ATC CCC CCA AAA AAC ACT AAA AAT 105 Asp Glu Leu Phe Glu Thr Glu Lys He Pro Pro Lys Asn Thr Lys Asn 5 10 15
ACT AAA AAC GCC CCT AAA AAA AGT TTT GAA GAG CAT GTT CAT TCC CTA 153 Thr Lys Asn Ala Pro Lys Lys Ser Phe Glu Glu His Val His Ser Leu 20 25 30
GAG CGA GCC ATA GAT CGC TTG AAT GAT CCC AAT TTA TCC TTA AAA GAC 201 Glu Arg Ala He Asp Arg Leu Asn Asp Pro Asn Leu Ser Leu Lys Asp 35 40 45 50
GGG ATG GAT TTG TAT AAA ACG GCC ATG CAA GAG TTG TTT TTG GCT CAA 249 Gly Met Asp Leu Tyr Lys Thr Ala Met Gin Glu Leu Phe Leu Ala Gin 55 60 65
AAG CTT TTA GAA AAC GCT TAT TTA GAG CAT GAA AAA CTC CAA ACG CCA 297 Lys Leu Leu Glu Asn Ala Tyr Leu Glu His Glu Lys Leu Gin Thr Pro 70 75 80
GAC CAA AAG GCT TAAAGCATGC GAGTGTTTGC TTTGCAATTA GAATCTTTTA AAGAA 354 Asp Gin Lys Ala 85
AATCTC 360
(2) INFORMATION FOR SEQ ID NO: 344:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 86 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:344:
Met Gin Asp Glu Leu Phe Glu Thr Glu Lys He Pro Pro Lys Asn Thr
1 5 10 15
Lys Asn Thr Lys Asn Ala Pro Lys Lys Ser Phe Glu Glu His Val His
20 25 30
Ser Leu Glu Arg Ala He Asp Arg Leu Asn Asp Pro Asn Leu Ser Leu
35 40 45
Lys Asp Gly Met Asp Leu Tyr Lys Thr Ala Met Gin Glu Leu Phe Leu
50 55 60
Ala Gin Lys Leu Leu Glu Asn Ala Tyr Leu Glu His Glu Lys Leu Gin 65 70 75 80
Thr Pro Asp Gin Lys Ala 85 (2) INFORMATION FOR SEQ ID NO: 345:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 841 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 46...795 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:345:
ATGACTAAAA AGATATAATT CATTCAAAAT TAAACAAGGA TTACA ATG AAA CTG ATT 57
Met Lys Leu He
1
TCA TGG AAT GTG AAC GGG TTA AGG GCT TGC ATG ACT AAG GGC TTT ATG 105 Ser Trp Asn Val Asn Gly Leu Arg Ala Cys Met Thr Lys Gly Phe Met 5 10 15 20
GAT TTT TTC AAT AGC GTT GAT GCG GAT GTT TTT TGC ATT CAA GAA TCT 153 Asp Phe Phe Asn Ser Val Asp Ala Asp Val Phe Cys He Gin Glu Ser 25 30 35
AAA ATG CAG CAA GAA CAA AAC ACC TTT GAA TTT AAA GGG TAT TTT GAT 201 Lys Met Gin Gin Glu Gin Asn Thr Phe Glu Phe Lys Gly Tyr Phe Asp 40 45 50
TTT TGG AAT TGC GCG ATT AAA AAG GGC TAT TCT GGG GTG GTA ACT TTC 249 Phe Trp Asn Cys Ala He Lys Lys Gly Tyr Ser Gly Val Val Thr Phe 55 60 65
ACT AAA AAA GAG CCT TTA AGC GTG AGC TAT GGT ATT AAT ATG GAA GAG 297 Thr Lys Lys Glu Pro Leu Ser Val Ser Tyr Gly He Asn Met Glu Glu 70 75 80
CAT GAC AAA GAA GGG CGC GTA ATA ACT TGC GAA TTT GAG TCG TTT TAT 345 His Asp Lys Glu Gly Arg Val He Thr Cys Glu' Phe Glu Ser Phe Tyr 85 90 95 100
TTG GTG AAT GTT TAT ACC CCT AAT TCC CAA CAA GCC CTA TCC AGG CTT 393 Leu Val Asn Val Tyr Thr Pro Asn Ser Gin Gin Ala Leu Ser Arg Leu 105 110 115
AGT TAT CGC ATG AGT TGG GAA GTG GAG TTT AAG AAA TTT TTA AAA GCT 441 Ser Tyr Arg Met Ser Trp Glu Val Glu Phe Lys Lys Phe Leu Lys Ala 120 125 130 TTA GAG TTG AAA AAA CCG GTC ATT GTG TGT GGG GAT TTG AAT GTG GCT 489 Leu Glu Leu Lys Lys Pro Val He Val Cys Gly Asp Leu Asn Val Ala 135 140 145
CAC AAT GAA ATT GAT TTA GAA AAC CCC AAA ACC AAC CGA AAA AAT GCC 537 His Asn Glu He Asp Leu Glu Asn Pro Lys Thr Asn Arg Lys Asn Ala 150 155 160
GGC TTT AGC GAT GAA GAG AGA GAA AAA TTC AGC GAG CTT TTG AAC GCC 585 Gly Phe Ser Asp Glu Glu Arg Glu Lys Phe Ser Glu Leu Leu Asn Ala 165 170 175 180
GGT TTT ATT GAC ACT TTC CGT TAT TTT TAC CCT AAC AAA GAA AAG GCT 633 Gly Phe He Asp Thr Phe Arg Tyr Phe Tyr Pro Asn Lys Glu Lys Ala 185 190 195
TAC ACC TGG TGG AGT TAC ATG CAA CAA GCA AGG GAT AAA AAC ATT GGT 681 Tyr Thr Trp Trp Ser Tyr Met Gin Gin Ala Arg Asp Lys Asn He Gly 200 205 210
TGG CGC ATT GAT TAT TTT TTA TGC TCT AAC CCT TTA AAA ACG CGC TTA 729 Trp Arg He Asp Tyr Phe Leu Cys Ser Asn Pro Leu Lys Thr Arg Leu 215 220 225
AAA GAC GCT TTA ATC TAT AAA GAT ATT TTA GGG AGC GAT CAT TGC CCG 777 Lys Asp Ala Leu He Tyr Lys Asp He Leu Gly Ser Asp His Cys Pro 230 235 240
GTA GGG TTG GAA TTA GTT TAAAGGTAGA AAGTGTGCGA AATAAAGACA GAAAAAAG 833 Val Gly Leu Glu Leu Val 245 250
CCTTACAA 841
(2) INFORMATION FOR SEQ ID NO: 346:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 250 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346:
Met Lys Leu He Ser Trp Asn Val Asn Gly Leu Arg Ala Cys Met Thr
1 5 10 15
Lys Gly Phe Met Asp Phe Phe Asn Ser Val Asp Ala Asp Val Phe Cys
20 25 30
He Gin Glu Ser Lys Met Gin Gin Glu Gin Asn Thr Phe Glu Phe Lys
35 40 45
Gly Tyr Phe Asp Phe Trp Asn Cys Ala He Lys Lys Gly Tyr Ser Gly 50 55 60 Val Val Thr Phe Thr Lys Lys Glu Pro Leu Ser Val Ser Tyr Gly He 65 70 75 80
Asn Met Glu Glu His Asp Lys Glu Gly Arg Val He Thr Cys Glu Phe
85 90 95
Glu Ser Phe Tyr Leu Val Asn Val Tyr Thr Pro Asn Ser Gin Gin Ala
100 105 110
Leu Ser Arg Leu Ser Tyr Arg Met Ser Trp Glu Val Glu Phe Lys Lys
115 120 125
Phe Leu Lys Ala Leu Glu Leu Lys Lys Pro Val He Val Cys Gly Asp
130 135 140
Leu Asn Val Ala His Asn Glu He Asp Leu Glu Asn Pro Lys Thr Asn 145 150 155 160
Arg Lys Asn Ala Gly Phe Ser Asp Glu Glu Arg Glu Lys Phe Ser Glu
165 170 175
Leu Leu Asn Ala Gly Phe He Asp Thr Phe Arg Tyr Phe Tyr Pro Asn
180 185 190
Lys Glu Lys Ala Tyr Thr Trp Trp Ser Tyr Met Gin Gin Ala Arg Asp
195 200 205
Lys Asn He Gly Trp Arg He Asp Tyr Phe Leu Cys Ser Asn Pro Leu
210 215 220
Lys Thr Arg Leu Lys Asp Ala Leu He Tyr Lys Asp He Leu Gly Ser 225 230 235 240
Asp His Cys Pro Val Gly Leu Glu Leu Val 245 250
(2) INFORMATION FOR SEQ ID NO: 347:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 618 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 62...571 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347:
AGAATGCGAG CGTTAAAAAA GAAATTTATG TGCCTAATAA GCTTGTTAAT TTTGTTATCG 60 C ATG AGG GCT TTA CTT TTT TTT ATT TTG TTA CTT TGG TTC AAG GGT TGT 109 Met Arg Ala Leu Leu Phe Phe He Leu Leu Leu Trp Phe Lys Gly Cys 1 5 10 15
GGG TAT AAG CCT ATT GCA GCT TAC GCT CAA AAC GCT TTA GGC GAT AGC 157 Gly Tyr Lys Pro He Ala Ala Tyr Ala Gin Asn Ala Leu Gly Asp Ser 20 25 30
GTA TAC GTG AAA CTC ATT GTG AAT TTG CCT AAC CCT GAA AAC TCT GTA 205 Val Tyr Val Lys Leu He Val Asn Leu Pro Asn Pro Glu Asn Ser Val 35 40 45
GAG TTT AAG GAT TTG ATG AAT CGT TTA GTC GTG CAA CGC TTC CAA AGC 253 Glu Phe Lys Asp Leu Met Asn Arg Leu Val Val Gin Arg Phe Gin Ser 50 55 60
CGC TTA GCG AGT GAA AAG GAT GCG GAT TCT ATC ATT ATT ATA GAA ATC 301 Arg Leu Ala Ser Glu Lys Asp Ala Asp Ser He He He He Glu He 65 70 75 80
ACG AAT GTA ACC GAT ACG AGT ATC ACG CAA AAT AAA GAA GGC TTC ACG 349 Thr Asn Val Thr Asp Thr Ser He Thr Gin Asn Lys Glu Gly Phe Thr 85 90 95
ACT TTC TAT CGC GCA ACC GTG TCT GTG AAT TAC ACC TAC GAT AAT AAA 397 Thr Phe Tyr Arg Ala Thr Val Ser Val Asn Tyr Thr Tyr Asp Asn Lys 100 105 110
AGA GGC ACA CAA AAG ACT TTT CAA GAT AGC GGG TAT TAC AAT TAC GCT 445 Arg Gly Thr Gin Lys Thr Phe Gin Asp Ser Gly Tyr Tyr Asn Tyr Ala 115 120 125
GTG AAT TTG CAA GAC CCC CTT AAT ACC TAC CAG AAC CGC TAT TAT GCT 493 Val Asn Leu Gin Asp Pro Leu Asn Thr Tyr Gin Asn Arg Tyr Tyr Ala 130 135 140
ATC AAT CAG GCT GTG GAA CAG ACT TTG ACT AAA TTT GTG GCT CAA ATC 541 He Asn Gin Ala Val Glu Gin Thr Leu Thr Lys Phe Val Ala Gin He 145 150 155 160
GCT TAT GAG GGG AAA TTC AAT AAT GAA AAA TAGCCCTTTG AATGGATTGA ATG 594 Ala Tyr Glu Gly Lys Phe Asn Asn Glu Lys 165 170
GACTAAAGGC GTTTTTAGAA ACAA 618
(2) INFORMATION FOR SEQ ID NO: 348:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 170 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:348:
Met Arg Ala Leu Leu Phe Phe He Leu Leu Leu Trp Phe Lys Gly Cys
1 5 10 15
Gly Tyr Lys Pro He Ala Ala Tyr Ala Gin Asn Ala Leu Gly Asp Ser
20 25 30
Val Tyr Val Lys Leu He Val Asn Leu Pro Asn Pro Glu Asn Ser Val 35 40 45
Glu Phe Lys Asp Leu Met Asn Arg Leu Val Val Gin Arg Phe Gin Ser
50 55 60
Arg Leu Ala Ser Glu Lys Asp Ala Asp Ser He He He He Glu He 65 70 75 80
Thr Asn Val Thr Asp Thr Ser He Thr Gin Asn Lys Glu Gly Phe Thr
85 90 95
Thr Phe Tyr Arg Ala Thr Val Ser Val Asn Tyr Thr Tyr Asp Asn Lys
100 105 110
Arg Gly Thr Gin Lys Thr Phe Gin Asp Ser Gly Tyr Tyr Asn Tyr Ala
115 120 125
Val Asn Leu Gin Asp Pro Leu Asn Thr Tyr Gin Asn Arg Tyr Tyr Ala
130 135 140
He Asn Gin Ala Val Glu Gin Thr Leu Thr Lys Phe Val Ala Gin He 145 150 155 160
Ala Tyr Glu Gly Lys Phe Asn Asn Glu Lys 165 170
(2) INFORMATION FOR SEQ ID NO: 349:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1277 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 61...1224 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349:
AATACCCATA AAATACCTTA AGAGAACGCC TATTCAAAAA CCAAAAATAA GGAAATCCTA 60 ATG ACT ACA GAC AGA AAT TTG TTT TTT TGC GCT TCG CTA TTG ATT TTT 108 Met Thr Thr Asp Arg Asn Leu Phe Phe Cys Ala Ser Leu Leu He Phe 1 5 10 15
TTG GGG GTA TTG ATG AGC TAT TCG CTC TCA ACT TAC ACC ACA GTG GTG 156 Leu Gly Val Leu Met Ser Tyr Ser Leu Ser Thr Tyr Thr Thr Val Val 20 25 30
CTG TAT CAT TAT GGG GAG TTC CAT TTT TTC ATA CGC CAG CTT GTG AGC 204 Leu Tyr His Tyr Gly Glu Phe His Phe Phe He Arg Gin Leu Val Ser 35 40 45
GCG ATC ATA GGG ATT GTT ATC ATG TGG GGG TTG TCT AGG GTT GAT CCT 252 Ala He He Gly He Val He Met Trp Gly Leu Ser Arg Val Asp Pro 50 55 60 AGC AAG TGG TTT AGC CGT TTG GGG TTT TTT CTT CTT TTT GTC CCA CCA 300 Ser Lys Trp Phe Ser Arg Leu Gly Phe Phe Leu Leu Phe Val Pro Pro 65 70 75 80
TTA CTC ATT ATT GGC ATG TTT TTT TTG CCA GAA AGC CTT TCT AGC AGT 348 Leu Leu He He Gly Met Phe Phe Leu Pro Glu Ser Leu Ser Ser Ser 85 90 95
GCT GGG GGG GCG AAG CGA TGG ATT CGT TTG GGG TTT TTT TCT CTA GCG 396 Ala Gly Gly Ala Lys Arg Trp He Arg Leu Gly Phe Phe Ser Leu Ala 100 105 110
CCT TTG GAG TTT TTG AAG ATT GGT TTC ACC TTT TTT CTT GCG TGG AGT 444 Pro Leu Glu Phe Leu Lys He Gly Phe Thr Phe Phe Leu Ala Trp Ser 115 120 125
TTG TCT CGC ACT TTT GTG GCA AAA GAA AAG GCT AAT GTT AAA GAA GAA 492 Leu Ser Arg Thr Phe Val Ala Lys Glu Lys Ala Asn Val Lys Glu Glu 130 135 140
CTC ATC ACT TTT GTG CCT TAT TCA GTG GTG TTT GTA GCC TTA GCG ATT 540 Leu He Thr Phe Val Pro Tyr Ser Val Val Phe Val Ala Leu Ala He 145 150 155 160
GGG GTG GGG GTT TTG CAA AAC GAT TTG GGG CAG ATT GTT CTT TTG GGG 588 Gly Val Gly Val Leu Gin Asn Asp Leu Gly Gin He Val Leu Leu Gly 165 170 175
GCG GTT TTA GCG GTG TTG TTG GTT TTT TCT GGG GGG AGC GTG CAT TTG 636 Ala Val Leu Ala Val Leu Leu Val Phe Ser Gly Gly Ser Val His Leu 180 185 190
TTT GGC TTG ATT ATT TCA GGG GCG TTT GCG ATC AGC GTT TTA GCG ATT 684 Phe Gly Leu He He Ser Gly Ala Phe Ala He Ser Val Leu Ala He 195 200 205
GTT ACA AGC GAG CAT AGG ATT TTG CGC CTG AAA TTG TGG TGG TCT AAT 732 Val Thr Ser Glu His Arg He Leu Arg Leu Lys Leu Trp Trp Ser Asn 210 215 220
TTG CAA AAT TCG CTT TTC ACG CTC TTG CCG GAT AGA TTA GCG AAC GCT 780 Leu Gin Asn Ser Leu Phe Thr Leu Leu Pro Asp Arg Leu Ala Asn Ala 225 230 235 240
CTT AGA ATA AGC GAC TTG CCC GAA TCC TAT CAG GTC TTT CAT GCA GGC 828 Leu Arg He Ser Asp Leu Pro Glu Ser Tyr Gin Val Phe His Ala Gly 245 250 255
AAT GCC ATG CAT AAT GGG GGG TTG TTT GGG CAA GGG CTT GGG CTT GGG 876 Asn Ala Met His Asn Gly Gly Leu Phe Gly Gin Gly Leu Gly Leu Gly 260 265 270
CAA ATC AAG CTT GGG TTT TTG AGC GAA GTG CAT ACG GAC ATG GTC TTA 924 Gin He Lys Leu Gly Phe Leu Ser Glu Val His Thr Asp Met Val Leu 275 280 285 GCT GGG ATC GCC GAA GAA TGG GGG TTT TTG GGG CTA TGC GTT TGT TTT 972 Ala Gly He Ala Glu Glu Trp Gly Phe Leu Gly Leu Cys Val Cys Phe 290 295 300
ATT TTG TTT TCT GTT TTG ATT GTT TTG ATT TTT AGG ATC GCT AAC CGC 1020 He Leu Phe Ser Val Leu He Val Leu He Phe Arg He Ala Asn Arg 305 310 315 320
TTG AAA GAG CCA AAA TAT TCG CTA TTT TGC GTG GGC GTG GTG CTG CTT 1068 Leu Lys Glu Pro Lys Tyr Ser Leu Phe Cys Val Gly Val Val Leu Leu 325 330 335
ATT AGT TTT TCT TTG GTG ATC AAC GCC TTT GGG GTG GGC GGG ATT CTT 1116 He Ser Phe Ser Leu Val He Asn Ala Phe Gly Val Gly Gly He Leu 340 345 350
CCG GTT AAA GGT CTA GCG GTG CCG TTT TTG AGC TAT GGA GGG AGT TCG 1164 Pro Val Lys Gly Leu Ala Val Pro Phe Leu Ser Tyr Gly Gly Ser Ser 355 360 365
CTT CTA GCG AAT TGT ATC GCT ATA GGG CTT GTT CTA AGC CTA GCG CGA 1212 Leu Leu Ala Asn Cys He Ala He Gly Leu Val Leu Ser Leu Ala Arg 370 375 380
TAC ACG AAA GGC TAAAAACATC AACCCCTTTT TAAAAATTAA TGCCATAAAA AGGGC 1269
Tyr Thr Lys Gly
385
TCAACCTC 1277
(2) INFORMATION FOR SEQ ID NO: 350:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 388 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350:
Met Thr Thr Asp Arg Asn Leu Phe Phe Cys Ala Ser Leu Leu He Phe
1 5 10 15
Leu Gly Val Leu Met Ser Tyr Ser Leu Ser Thr Tyr Thr Thr Val Val
20 25 30
Leu Tyr His Tyr Gly Glu Phe His Phe Phe He Arg Gin Leu Val Ser
35 40 45
Ala He He Gly He Val He Met Trp Gly Leu Ser Arg Val Asp Pro
50 55 60
Ser Lys Trp Phe Ser Arg Leu Gly Phe Phe Leu Leu Phe Val Pro Pro 65 70 75 80
Leu Leu He He Gly Met Phe Phe Leu Pro Glu Ser Leu Ser Ser Ser 85 90 95 Ala Gly Gly Ala Lys Arg Trp He Arg Leu Gly Phe Phe Ser Leu Ala
100 105 110
Pro Leu Glu Phe Leu Lys He Gly Phe Thr Phe Phe Leu Ala Trp Ser
115 120 125
Leu Ser Arg Thr Phe Val Ala Lys Glu Lys Ala Asn Val Lys Glu Glu
130 135 140
Leu He Thr Phe Val Pro Tyr Ser Val Val Phe Val Ala Leu Ala He 145 150 155 160
Gly Val Gly Val Leu Gin Asn Asp Leu Gly Gin He Val Leu Leu Gly
165 170 175
Ala Val Leu Ala Val Leu Leu Val Phe Ser Gly Gly Ser Val His Leu
180 185 190
Phe Gly Leu He He Ser Gly Ala Phe Ala He Ser Val Leu Ala He
195 200 205
Val Thr Ser Glu His Arg He Leu Arg Leu Lys Leu Trp Trp Ser Asn
210 215 220
Leu Gin Asn Ser Leu Phe Thr Leu Leu Pro Asp Arg Leu Ala Asn Ala 225 230 235 240
Leu Arg He Ser Asp Leu Pro Glu Ser Tyr Gin Val Phe His Ala Gly
245 250 255
Asn Ala Met His Asn Gly Gly Leu Phe Gly Gin Gly Leu Gly Leu Gly
260 265 270
Gin He Lys Leu Gly Phe Leu Ser Glu Val His Thr Asp Met Val Leu
275 280 285
Ala Gly He Ala Glu Glu Trp Gly Phe Leu Gly Leu Cys Val Cys Phe
290 295 300
He Leu Phe Ser Val Leu He Val Leu He Phe Arg He Ala Asn Arg 305 310 315 320
Leu Lys Glu Pro Lys Tyr Ser Leu Phe Cys Val Gly Val Val Leu Leu
325 330 335
He Ser Phe Ser Leu Val He Asn Ala Phe Gly Val Gly Gly He Leu
340 345 350
Pro Val Lys Gly Leu Ala Val Pro Phe Leu Ser Tyr Gly Gly Ser Ser
355 360 365
Leu Leu Ala Asn Cys He Ala He Gly Leu Val Leu Ser Leu Ala Arg
370 375 380
Tyr Thr Lys Gly 385
(2) INFORMATION FOR SEQ ID NO: 351:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 961 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...908 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351:
AGAACAGTAT CTATTTTTGT TGCGGTTGTA TATTTAATTA GGAGTTTGGT GTG AAA 56
Val Lys
1
CGG ATT TTA TTT TTT TTA GTA GCT ACG ACT TTT TTG TTG AGA GCA GAA 104 Arg He Leu Phe Phe Leu Val Ala Thr Thr Phe Leu Leu Arg Ala Glu 5 10 15
ACG GAT TCT GCC ACT ATT AAC ACT ACA GTT GAT CCC AAT GTT ATG TTT 152 Thr Asp Ser Ala Thr He Asn Thr Thr Val Asp Pro Asn Val Met Phe 20 25 30
TCT GAA AGC TCC ACA GGG AAT GTG AAA AAA GAC CGC AAG AGG GTT TTA 200 Ser Glu Ser Ser Thr Gly Asn Val Lys Lys Asp Arg Lys Arg Val Leu 35 40 45 50
AAG AGC ATG GTT AAT TTG GAA AAA GAG CGC GTG AAG AAT TTT AAC CGG 248 Lys Ser Met Val Asn Leu Glu Lys Glu Arg Val Lys Asn Phe Asn Arg 55 60 65
TAT TCT GAA ACC AAG ATG AGT AAG GGC GAC TTA TCC GCT TTT GGA GCT 296 Tyr Ser Glu Thr Lys Met Ser Lys Gly Asp Leu Ser Ala Phe Gly Ala 70 75 80
TTC TTT AAG GGG AGT TTG GAA AGT TGT GTG GAT CAA AAG ATT TGT TAT 344 Phe Phe Lys Gly Ser Leu Glu Ser Cys Val Asp Gin Lys He Cys Tyr 85 90 95
TAT GAG CAT AAA GAT GGC AAG GTT TCT TTT GTG GTG AAT GAC AGG GAG 392 Tyr Glu His Lys Asp Gly Lys Val Ser Phe Val Val Asn Asp Arg Glu 100 105 110
AAG TTT TAT AAA CAT GTG CTT AAA GAC TTA GGG ACA GAG CTT TCG CTC 440 Lys Phe Tyr Lys His Val Leu Lys Asp Leu Gly Thr Glu Leu Ser Leu 115 120 125 130
CCT TTG TTT AAC TGG CTT TAC AAA GGC TCG GAT TTT GGG GCT TTG CAT 488 Pro Leu Phe Asn Trp Leu Tyr Lys Gly Ser Asp Phe Gly Ala Leu His 135 140 145
GAG CAG TTT GGG GAT ATG TAT GAT GGG TAT ATC AAA TAC TTG ATC AGT 536 Glu Gin Phe Gly Asp Met Tyr Asp Gly Tyr He Lys Tyr Leu He Ser 150 155 160
ATG GTT AGA ATA AGC CAA AAA GAA AAG GCT AGA AAA GTG GAT GCA ATC 584 Met Val Arg He Ser Gin Lys Glu Lys Ala Arg Lys Val Asp Ala He 165 170 175
GTT CTT AAG AAA ATG GAA GAA CAA GCT GAG AAA GAC ACT AAG GCA GCG 632 Val Leu Lys Lys Met Glu Glu Gin Ala Glu Lys Asp Thr Lys Ala Ala 180 185 190 TTT CAA AAG AGG AGC AGT GGG GAG CTT GAA AGC CAT ACT GAT AGC CCT 680 Phe Gin Lys Arg Ser Ser Gly Glu Leu Glu Ser His Thr Asp Ser Pro 195 200 205 210
GAA TTT ATA AGC TCT TCT AAG AGG ACA CAG AAC GCT TCT AAT TCG GAT 728 Glu Phe He Ser Ser Ser Lys Arg Thr Gin Asn Ala Ser Asn Ser Asp 215 220 225
CTC AAT TCT ATG ACC AAT GCT AAC GCG CTC AAA GAA ACA GCT TCA AAA 776 Leu Asn Ser Met Thr Asn Ala Asn Ala Leu Lys Glu Thr Ala Ser Lys 230 235 240
GAG CCA GAG GCT TCT TCA AAA AAA GAG AAA AAG TCT AAG AAA AAA CGT 824 Glu Pro Glu Ala Ser Ser Lys Lys Glu Lys Lys Ser Lys Lys Lys Arg 245 250 255
CGC CTT TCA AAG AAA GAA AAA CAA CAA CAA GCC TTG CAA CAA GAG TTT 872 Arg Leu Ser Lys Lys Glu Lys Gin Gin Gin Ala Leu Gin Gin Glu Phe 260 265 270
GAA AAA CAA ATT AGC GAC TCT AGT AAG TCT GAA AAA TAGTAATAAT AGTTAA 924 Glu Lys Gin He Ser Asp Ser Ser Lys Ser Glu Lys 275 280 285
GCTTACCTTT TTAGGGGGCT TTCAATAAAT CTCTTAA 961
(2) INFORMATION FOR SEQ ID NO: 352:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 286 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:352:
Val Lys Arg He Leu Phe Phe Leu Val Ala Thr Thr Phe Leu Leu Arg
1 5 10 15
Ala Glu Thr Asp Ser Ala Thr He Asn Thr Thr Val Asp Pro Asn Val
20 25 30
Met Phe Ser Glu Ser Ser Thr Gly Asn Val Lys Lys Asp Arg Lys Arg
35 40 45
Val Leu Lys Ser Met Val Asn Leu Glu Lys Glu Arg Val Lys Asn Phe
50 55 60
Asn Arg Tyr Ser Glu Thr Lys Met Ser Lys Gly Asp Leu Ser Ala Phe 65 70 75 80
Gly Ala Phe Phe Lys Gly Ser Leu Glu Ser Cys Val Asp Gin Lys He
85 90 95
Cys Tyr Tyr Glu His Lys Asp Gly Lys Val Ser Phe Val Val Asn Asp
100 105 110
Arg Glu Lys Phe Tyr Lys His Val Leu Lys Asp Leu Gly Thr Glu Leu 115 120 125
Ser Leu Pro Leu Phe Asn Trp Leu Tyr Lys Gly Ser Asp Phe Gly Ala
130 135 140
Leu His Glu Gin Phe Gly Asp Met Tyr Asp Gly Tyr He Lys Tyr Leu 145 150 155 160
He Ser Met Val Arg He Ser Gin Lys Glu Lys Ala Arg Lys Val Asp
165 170 175
Ala He Val Leu Lys Lys Met Glu Glu Gin Ala Glu Lys Asp Thr Lys
180 185 190
Ala Ala Phe Gin Lys Arg Ser Ser Gly Glu Leu Glu Ser His Thr Asp
195 200 205
Ser Pro Glu Phe He Ser Ser Ser Lys Arg Thr Gin Asn Ala Ser Asn
210 215 220
Ser Asp Leu Asn Ser Met Thr Asn Ala Asn Ala Leu Lys Glu Thr Ala 225 230 235 240
Ser Lys Glu Pro Glu Ala Ser Ser Lys Lys Glu Lys Lys Ser Lys Lys
245 250 255
Lys Arg Arg Leu Ser Lys Lys Glu Lys Gin Gin Gin Ala Leu Gin Gin
260 265 270
Glu Phe Glu Lys Gin He Ser Asp Ser Ser Lys Ser Glu Lys 275 280 285
(2) INFORMATION FOR SEQ ID NO: 353:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1555 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1499 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:353:
CTTTTTAGCC TCAAGACTTG GGCTTTAACA TTAAAGAATT ATTTTAAGGA ATG ATC 56
Met He
1
ATG GAA AAA TAC CAT AGC GAC CAA GAA TAC GAA GAA ATC ATC ACC GAC 104 Met Glu Lys Tyr His Ser Asp Gin Glu Tyr Glu Glu He He Thr Asp 5 10 15
CAA TTA GGC GAT ATG CAA TTA AGG GAA AAT TTG CGT TCT GCA ATG GAT 152 Gin Leu Gly Asp Met Gin Leu Arg Glu Asn Leu Arg Ser Ala Met Asp 20 25 30
ACC TTA AGG GCT AAT CGT AAG AAT CTC CTT AAA AAT CGT TAC AGC GAA 200 Thr Leu Arg Ala Asn Arg Lys Asn Leu Leu Lys Asn Arg Tyr Ser Glu 35 40 45 50
TGG GAA AAT TTA AGG GAA TTA GGC AAA GAA GTC AAG CTT AAA ATC TTA 248 Trp Glu Asn Leu Arg Glu Leu Gly Lys Glu Val Lys Leu Lys He Leu 55 60 65
TCC AGG CTT GAT GAA TAT TTG GAA TTG TTT GAA AAA AAC GCC ACT CAA 296 Ser Arg Leu Asp Glu Tyr Leu Glu Leu Phe Glu Lys Asn Ala Thr Gin 70 75 80
AAC GGC TTT AAA ATC CAT TAC GCT AAA GAC GGC GAT GAA GCT AAT GAA 344 Asn Gly Phe Lys He His Tyr Ala Lys Asp Gly Asp Glu Ala Asn Glu 85 90 95
ATC ATT TAC AAC CTC GCT AAA GAA AAG AAT ATC AAG CGC ATT TTA AAG 392 He He Tyr Asn Leu Ala Lys Glu Lys Asn He Lys Arg He Leu Lys 100 105 110
CAA AAA TCC ATG GCG AGC GAA GAA ATT GGC TTG AAC CAT TAC TTG AAA 440 Gin Lys Ser Met Ala Ser Glu Glu He Gly Leu Asn His Tyr Leu Lys 115 120 125 130
GAA AAG GGC ATT CAA GCA CAA GAA ACG GAT TTG GGC GAA TTG ATT ATC 488 Glu Lys Gly He Gin Ala Gin Glu Thr Asp Leu Gly Glu Leu He He 135 140 145
CAA CTC ATC AAT GAA CAC CCT GTG CAT ATT GTC GTG CCA GCT ATC CAT 536 Gin Leu He Asn Glu His Pro Val His He Val Val Pro Ala He His 150 155 160
AAA AAC CGC AAG CAA ATC GGT AAG ATT TTT GAA GAA AAA CTC AAC GCC 584 Lys Asn Arg Lys Gin He Gly Lys He Phe Glu Glu Lys Leu Asn Ala 165 170 175
GCT TAT GAA GAA GAG CCT GAA AAG CTT AAT GCG ATC GCC AGA AAA CAC 632 Ala Tyr Glu Glu Glu Pro Glu Lys Leu Asn Ala He Ala Arg Lys His 180 185 190
ATG CGC AAA GAA TTT GAA AGC TTT AAA ATG GGG ATT AGT GGG GTT AAT 680 Met Arg Lys Glu Phe Glu Ser Phe Lys Met Gly He Ser Gly Val Asn 195 200 205 210
TTT GCT ATC GCT AAC GAA GGA GCG ATC TGG TTA GTG GAA AAT GAA GGC 728 Phe Ala He Ala Asn Glu Gly Ala He Trp Leu Val Glu Asn Glu Gly 215 220 225
AAT GGC AGA ATG AGC ACC ACT GCA TGC GAT GTG CAT GTC GCA ATT TGT 776 Asn Gly Arg Met Ser Thr Thr Ala Cys Asp Val His Val Ala He Cys 230 235 240
GGG ATT GAA AAA TTA GTA GAA AGC TTT GAT GAT GCG GCG ATT TTA AAC 824 Gly He Glu Lys Leu Val Glu Ser Phe Asp Asp Ala Ala He Leu Asn 245 250 255 AAT TTG CTC GCC CCA AGC GCT GTG GGT GTG CCT ATC ACT TGC TAT CAA 872 Asn Leu Leu Ala Pro Ser Ala Val Gly Val Pro He Thr Cys Tyr Gin 260 265 270
AAC ATT ATC ACA GGC CCT AGA AAA GAG GGC GAT TTA GAC GGC CCT AAA 920 Asn He He Thr Gly Pro Arg Lys Glu Gly Asp Leu Asp Gly Pro Lys 275 280 285 290
GAA GCC CAC ATC ATT TTA TTA GAC AAC AAC CGC TCT AAT ATT TTG GCT 968 Glu Ala His He He Leu Leu Asp Asn Asn Arg Ser Asn He Leu Ala 295 300 305
GAT GAA AAG TAT TAT CGC GCT CTT TCA TGC ATC CGT TGC GGG ACT TGT 1016 Asp Glu Lys Tyr Tyr Arg Ala Leu Ser Cys He Arg Cys Gly Thr Cys 310 315 320
TTG AAC CAC TGC CCT GTG TAT GAT AAA ATC GGT GGG CAT GCC TAT CTT 1064 Leu Asn His Cys Pro Val Tyr Asp Lys He Gly Gly His Ala Tyr Leu 325 330 335
TCT ACT TAT CCT GGC CCT ATA GGC GTG GTG GTA TCC CCC CAA CTC TTT 1112 Ser Thr Tyr Pro Gly Pro He Gly Val Val Val Ser Pro Gin Leu Phe 340 345 350
GGC TTG AAT AAT TAC GGG CAT ATC CCT AAT TTG TGC AGT CTT TGC GGG 1160 Gly Leu Asn Asn Tyr Gly His He Pro Asn Leu Cys Ser Leu Cys Gly 355 360 365 370
CGT TGC ACT GAA GTA TGC CCC GTA GAA ATC CCT TTA GCC GAA CTC ATT 1208 Arg Cys Thr Glu Val Cys Pro Val Glu He Pro Leu Ala Glu Leu He 375 380 385
AGG GAT TTA CGA TCC GAT AAA GTG GGC GAG GGC AGG GGT GTA ATT AAG 1256 Arg Asp Leu Arg Ser Asp Lys Val Gly Glu Gly Arg Gly Val He Lys 390 395 400
GGG GCT AAA AGC ACC CAA CAC AGC GGG ATG GAA AAA TTC TCT ATG .AAA 1304 Gly Ala Lys Ser Thr Gin His Ser Gly Met Glu Lys Phe Ser Met Lys 405 410 415
ATG TTT GCC AAA ATG GCA AGC GAT GGG GCT AAG TGG CGT TTC CAA TTG 1352 Met Phe Ala Lys Met Ala Ser Asp Gly Ala Lys Trp Arg Phe Gin Leu 420 425 430
AAA ATG GCT CAA TTT TTC TCG CCT TTA GGC AAG CTT TTA GCT CCC ATA 1400 Lys Met Ala Gin Phe Phe Ser Pro Leu Gly Lys Leu Leu Ala Pro He 435 440 445 450
CTG CCT TTA GTC AAA GAG TGG GCG AGC GTT AGG ACC TTA CCC AAT ATG 1448 Leu Pro Leu Val Lys Glu Trp Ala Ser Val Arg Thr Leu Pro Asn Met 455 460 465
GAC ACG AGC TTG CAT GCA AAA GTC CAG CAC TTA GAA GGG GTG ATT TAT 1496 Asp Thr Ser Leu His Ala Lys Val Gin His Leu Glu Gly Val He Tyr 470 475 480 GAG TAAAGAGCTT ATTTTAAAGC GCATTAAAGA AGCCAGAGCC AAGCATGCCA TTCAGG 1555 Glu
(2) INFORMATION FOR SEQ ID NO: 354:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 483 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:354:
Met He Met Glu Lys Tyr His Ser Asp Gin Glu Tyr Glu Glu He He
1 5 10 15
Thr Asp Gin Leu Gly Asp Met Gin Leu Arg Glu Asn Leu Arg Ser Ala
20 25 30
Met Asp Thr Leu Arg Ala Asn Arg Lys Asn Leu Leu Lys Asn Arg Tyr
35 40 45
Ser Glu Trp Glu Asn Leu Arg Glu Leu Gly Lys Glu Val Lys Leu Lys
50 55 60
He Leu Ser Arg Leu Asp Glu Tyr Leu Glu Leu Phe Glu Lys Asn Ala 65 70 75 80
Thr Gin Asn Gly Phe Lys He His Tyr Ala Lys Asp Gly Asp Glu Ala
85 90 95
Asn Glu He He Tyr Asn Leu Ala Lys Glu Lys Asn He Lys Arg He
100 105 110
Leu Lys Gin Lys Ser Met Ala Ser Glu Glu He Gly Leu Asn His Tyr
115 120 125
Leu Lys Glu Lys Gly He Gin Ala Gin Glu Thr Asp Leu Gly Glu Leu
130 135 140
He He Gin Leu He Asn Glu His Pro Val His He Val Val Pro Ala 145 150 155 160
He His Lys Asn Arg Lys Gin He Gly Lys He Phe Glu Glu Lys Leu
165 170 175
Asn Ala Ala Tyr Glu Glu Glu Pro Glu Lys Leu Asn Ala He Ala Arg
180 185 190
Lys His Met Arg Lys Glu Phe Glu Ser Phe Lys Met Gly He Ser Gly
195 200 205
Val Asn Phe Ala He Ala Asn Glu Gly Ala He Trp Leu Val Glu Asn
210 215 220
Glu Gly Asn Gly Arg Met Ser Thr Thr Ala Cys Asp Val His Val Ala 225 230 235 240
He Cys Gly He Glu Lys Leu Val Glu Ser Phe Asp Asp Ala Ala He
245 250 255
Leu Asn Asn Leu Leu Ala Pro Ser Ala Val Gly Val Pro He Thr Cys
260 265 270
Tyr Gin Asn He He Thr Gly Pro Arg Lys Glu Gly Asp Leu Asp Gly
275 280 285
Pro Lys Glu Ala His He He Leu Leu Asp Asn Asn Arg Ser Asn He 290 295 300 Leu Ala Asp Glu Lys Tyr Tyr Arg Ala Leu Ser Cys He Arg Cys Gly 305 310 315 320
Thr Cys Leu Asn His Cys Pro Val Tyr Asp Lys He Gly Gly His Ala
325 330 335
Tyr Leu Ser Thr Tyr Pro Gly Pro He Gly Val Val Val Ser Pro Gin
340 345 350
Leu Phe Gly Leu Asn Asn Tyr Gly His He Pro Asn Leu Cys Ser Leu
355 360 365
Cys Gly Arg Cys Thr Glu Val Cys Pro Val Glu He Pro Leu Ala Glu
370 375 380
Leu He Arg Asp Leu Arg Ser Asp Lys Val Gly Glu Gly Arg Gly Val 385 390 395 400
He Lys Gly Ala Lys Ser Thr Gin His Ser Gly Met Glu Lys Phe Ser
405 410 415
Met Lys Met Phe Ala Lys Met Ala Ser Asp Gly Ala Lys Trp Arg Phe
420 425 430
Gin Leu Lys Met Ala Gin Phe Phe Ser Pro Leu Gly Lys Leu Leu Ala
435 440 445
Pro He Leu Pro Leu Val Lys Glu Trp Ala Ser Val Arg Thr Leu Pro
450 455 460
Asn Met Asp Thr Ser Leu His Ala Lys Val Gin His Leu Glu Gly Val 465 470 475 480
He Tyr Glu
(2) INFORMATION FOR SEQ ID NO: 355:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1630 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 294...1577 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:355:
GTATCGCCAC TTTGCGTAAC AGGATGTTGA AAGTGGGTAA AAGCCGCTAA TTTCTTTTTA 60
GTGGGTCGTT TTTGAAAATC TTTTTAGTCT TTTTAAGCGT CTTTTTTTTT AATGGGTGTT 120
TTGGGTTAGT CTATAAGACT CCCATTTCAA GCTCTCCTAT CTCTTATGAT CCCTACACTA 180
CCCCCATTGG GAGCTTGTAT GCTGAAAAAT TAAAAGAAAA CCCTAACCAT AGCGCGGCCA 240
TTCTTTTAGA AGATGGCTTT GACGCTCTGT TGCATAGAGT GGGACTTATT AGA ATG 296
Met 1
AGC CAA AAA AGC ATT GAC ATG CAA ACT TAT ATC TAT AAA AAC GAC CTT 344 Ser Gin Lys Ser He Asp Met Gin Thr Tyr He Tyr Lys Asn Asp Leu 5 10 15
TCT TCT CAA GTG ATT GCT AAA GAA CTT TTA AAT GCG GCC AAT CGT GGG 392 Ser Ser Gin Val He Ala Lys Glu Leu Leu Asn Ala Ala Asn Arg Gly 20 25 30
GTA AAA GTG CGC ATC CTT TTA GAC GAT AAC GGA TTG GAT TCG GAT TTT 440 Val Lys Val Arg He Leu Leu Asp Asp Asn Gly Leu Asp Ser Asp Phe 35 40 45
TCA GAT ATT ATG CTC TTA AAT TTC CAT AAA AAC ATT GAG GTG AAA ATT 488 Ser Asp He Met Leu Leu Asn Phe His Lys Asn He Glu Val Lys He 50 55 60 65
TTT AAC CCC TAC TAT ATC CGC AAT AAA GGC TTG CGT TAT TTT GAA ATG 536 Phe Asn Pro Tyr Tyr He Arg Asn Lys Gly Leu Arg Tyr Phe Glu Met 70 75 80
CTT GCG GAT TAT GAG CGC ATT AAA AAA CGC ATG CAC AAC AAG CTT TTC 584 Leu Ala Asp Tyr Glu Arg He Lys Lys Arg Met His Asn Lys Leu Phe 85 90 95
ATC GTG GAT AAT TTC GCT GTC ATT ATA GGG GGG CGC AAT ATT GGG GAC 632 He Val Asp Asn Phe Ala Val He He Gly Gly Arg Asn He Gly Asp 100 105 110
AAT TAT TTT GAT AAC GAT TTA GAC ACG AAT TTT TTA GAT TTA GAC GCT 680 Asn Tyr Phe Asp Asn Asp Leu Asp Thr Asn Phe Leu Asp Leu Asp Ala 115 120 125
TTG TTT TTT GGG GGG GTT GCT TCA AAA GCC AAA GAA AGC TTT GAA CGC 728 Leu Phe Phe Gly Gly Val Ala Ser Lys Ala Lys Glu Ser Phe Glu Arg 130 135 140 145
TAT TGG AGA TTC CAC CGC TCT ATC CCT GTT TCA TTA CTA AGA ACC CAT 776 Tyr Trp Arg Phe His Arg Ser He Pro Val Ser Leu Leu Arg Thr His 150 155 160
AAA AGA CTC AAA AAC AAC GCT AAA GAA ATC GCT AAA CTC CAT GAA AAA 824 Lys Arg Leu Lys Asn Asn Ala Lys Glu He Ala Lys Leu His Glu Lys 165 170 175
ATC CCT ATC AGC GCT GAA GAC AAA AAC CAG TTT GAA AAA AAA GTC AAT 872 He Pro He Ser Ala Glu Asp Lys Asn Gin Phe Glu Lys Lys Val Asn 180 185 190
GAT TTT ATA GAT CGT TTC CAA AAA TAC CAA TAC CCC ATT TAT TAT GGG 920 Asp Phe He Asp Arg Phe Gin Lys Tyr Gin Tyr Pro He Tyr Tyr Gly 195 200 205
AAT GCC ATT TTT TTA GCC GAT TCA CCC AAA AAA ATT GAC ACG CCC TTG 968 Asn Ala He Phe Leu Ala Asp Ser Pro Lys Lys He Asp Thr Pro Leu 210 215 220 225
TAT TCG CCT ATC AAA ATC GCT TTT GAG AAA GCC CTT AAA AAC GCT AAG 1016 Tyr Ser Pro He Lys He Ala Phe Glu Lys Ala Leu Lys Asn Ala Lys 230 235 240
GAC TCC GTT TTT ATC GCT TCA TCG TAT TTT ATT CCA GGC AAA AAG ATG 1064 Asp Ser Val Phe He Ala Ser Ser Tyr Phe He Pro Gly Lys Lys Met 245 250 255
ATG AAA ATC TTT AAA AAT CAA ATT TCT AAG GGG ATT GAA TTG AAC ATC 1112 Met Lys He Phe Lys Asn Gin He Ser Lys Gly He Glu Leu Asn He 260 265 270
CTT ACC AAT TCC CTT TCA TCT ACT GAT GCG ATA GTG GTC TAT GGG GCA 1160 Leu Thr Asn Ser Leu Ser Ser Thr Asp Ala He Val Val Tyr Gly Ala 275 280 285
TGG GAA AGG TAT CGC AAC CAA TTA GTG CGA ATG GGC GCG AAT GTC TAT 1208 Trp Glu Arg Tyr Arg Asn Gin Leu Val Arg Met Gly Ala Asn Val Tyr 290 295 300 305
GAA ATA CGA AAC GAT TTT TTC AAC CGC CAG ATT AAA GGG CGC TTT AGC 1256 Glu He Arg Asn Asp Phe Phe Asn Arg Gin He Lys Gly Arg Phe Ser 310 315 320
ACC AAA CAT TCC TTG CAT GGC AAG ACG ATT GTT TTT GAT GAC AAT TTA 1304 Thr Lys His Ser Leu His Gly Lys Thr He Val Phe Asp Asp Asn Leu 325 330 335
ACG CTT CTA GGG AGT TTC AAT ATT GAT CCG CGC TCT GCA TAC ATC AAC 1352 Thr Leu Leu Gly Ser Phe Asn He Asp Pro Arg Ser Ala Tyr He Asn 340 345 350
ACT GAA AGC GCG GTT TTG TTT GAC AAC CCG TCT TTT GCT AAA AGG GTG 1400 Thr Glu Ser Ala Val Leu Phe Asp Asn Pro Ser Phe Ala Lys Arg Val 355 360 365
CGT TTG TCG CTT AAA GAT CAT GCC CAA CAA TCA TGG CAT TTG GTG GTG 1448 Arg Leu Ser Leu Lys Asp His Ala Gin Gin Ser Trp His Leu Val Val 370 375 380 385
TAT CGG CAT AGA GTG ATT TGG GAA GCG GTG GAA GAA GGC ATT TTA ATC 1496 Tyr Arg His Arg Val He Trp Glu Ala Val Glu Glu Gly He Leu He 390 395 400
CAT GAA AAA ACT TCG CCT GAC ACT TCC TTC TTT TTG CGC TTG ATT AAA 1544 His Glu Lys Thr Ser Pro Asp Thr Ser Phe Phe Leu Arg Leu He Lys 405 410 415
GAA TGG TCT AAA GTC CTT CCT GAA AGA GAG CTT TAAAACTTTT AATGCGCTTT 1597 Glu Trp Ser Lys Val Leu Pro Glu Arg Glu Leu 420 425
ATTTTGCGAA AAAGCGATGT TATTGGTAAC GGC 1630
(2) INFORMATION FOR SEQ ID NO: 356: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 428 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:356:
Met Ser Gin Lys Ser He Asp Met Gin Thr Tyr He Tyr Lys Asn Asp
1 5 10 15
Leu Ser Ser Gin Val He Ala Lys Glu Leu Leu Asn Ala Ala Asn Arg
20 25 30
Gly Val Lys Val Arg He Leu Leu Asp Asp Asn Gly Leu Asp Ser Asp
35 40 45
Phe Ser Asp He Met Leu Leu Asn Phe His Lys Asn He Glu Val Lys
50 55 60
He Phe Asn Pro Tyr Tyr He Arg Asn Lys Gly Leu Arg Tyr Phe Glu 65 70 75 80
Met Leu Ala Asp Tyr Glu Arg He Lys Lys Arg Met His Asn Lys Leu
85 90 95
Phe He Val Asp Asn Phe Ala Val He He Gly Gly Arg Asn He Gly
100 105 110
Asp Asn Tyr Phe Asp Asn Asp Leu Asp Thr Asn Phe Leu Asp Leu Asp
115 120 125
Ala Leu Phe Phe Gly Gly Val Ala Ser Lys Ala Lys Glu Ser Phe Glu
130 135 140
Arg Tyr Trp Arg Phe His Arg Ser He Pro Val Ser Leu Leu Arg Thr 145 150 155 160
His Lys Arg Leu Lys Asn Asn Ala Lys Glu He Ala Lys Leu His Glu
165 170 175
Lys He Pro He Ser Ala Glu Asp Lys Asn Gin Phe Glu Lys Lys Val
180 185 190
Asn Asp Phe He Asp Arg Phe Gin Lys Tyr Gin Tyr Pro He Tyr Tyr
195 200 205
Gly Asn Ala He Phe Leu Ala Asp Ser Pro Lys Lys He Asp Thr Pro
210 215 220
Leu Tyr Ser Pro He Lys He Ala Phe Glu Lys Ala Leu Lys Asn Ala 225 230 235 240
Lys Asp Ser Val Phe He Ala Ser Ser Tyr Phe He Pro Gly Lys Lys
245 250 255
Met Met Lys He Phe Lys Asn Gin He Ser Lys Gly He Glu Leu Asn
260 265 270
He Leu Thr Asn Ser Leu Ser Ser Thr Asp Ala He Val Val Tyr Gly
275 280 285
Ala Trp Glu Arg Tyr Arg Asn Gin Leu Val Arg Met Gly Ala Asn Val
290 295 300
Tyr Glu He Arg Asn Asp Phe Phe Asn Arg Gin He Lys Gly Arg Phe 305 310 315 320
Ser Thr Lys His Ser Leu His Gly Lys Thr He Val Phe Asp Asp Asn
325 330 335
Leu Thr Leu Leu Gly Ser Phe Asn He Asp Pro Arg Ser Ala Tyr He 340 345 350 Asn Thr Glu Ser Ala Val Leu Phe Asp Asn Pro Ser Phe Ala Lys Arg
355 360 365
Val Arg Leu Ser Leu Lys Asp His Ala Gin Gin Ser Trp His Leu Val
370 375 380
Val Tyr Arg His Arg Val He Trp Glu Ala Val Glu Glu Gly He Leu 385 390 395 400
He His Glu Lys Thr Ser Pro Asp Thr Ser Phe Phe Leu Arg Leu He
405 410 415
Lys Glu Trp Ser Lys Val Leu Pro Glu Arg Glu Leu 420 425
(2) INFORMATION FOR SEQ ID NO: 357:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 550 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 79...510 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:357:
AACTCTTTTG GTGTAGGATA GCGATCAAGG TTTTTATGAA AATAAAAGCC TAAAACAATT 60 TTAAAAAAAG GACTTTTG ATG AAA ACA TTT GAA ATT CTA AAA CAT TTG CAA 111
Met Lys Thr Phe Glu He Leu Lys His Leu Gin 1 5 10
GCG GAT GCG ATC GTG TTA TTT ATG AAA GTG CAT AAC TTC CAT TGG AAT 159 Ala Asp Ala He Val Leu Phe Met Lys Val His Asn Phe His Trp Asn 15 20 25
GTG AAA GGC ACC GAT TTT TTC AAT GTG CAT AAA GCC ACT GAA GAA ATT 207 Val Lys Gly Thr Asp Phe Phe Asn Val His Lys Ala Thr Glu Glu He 30 35 40
TAT GAA GAG TTT GCG GAC ATG TTT GAC GAT CTC GCT GAA AGG ATC GTT 255 Tyr Glu Glu Phe Ala Asp Met Phe Asp Asp Leu Ala Glu Arg He Val 45 50 55
CAA TTA GGG CAT CAC CCC TTA GTC ACT TTA TCC GAA GCG ATC AAA CTC 303 Gin Leu Gly His His Pro Leu Val Thr Leu Ser Glu Ala He Lys Leu 60 65 70 75
ACT CGT GTT AAA GAA GAA ACT AAA ACG AGC TTC CAC TCT AAA GAC ATC 351 Thr Arg Val Lys Glu Glu Thr Lys Thr Ser Phe His Ser Lys Asp He 80 85 90 TTT AAA GAA ATT CTA GAG GAC TAC AAA TAT CTA GAA AAA GAA TTT AAA 399 Phe Lys Glu He Leu Glu Asp Tyr Lys Tyr Leu Glu Lys Glu Phe Lys 95 100 105
GAG CTC TCT AAC ACC GCT GAA AAA GAA GGC GAT AAA GTT ACC GTA ACT 447 Glu Leu Ser Asn Thr Ala Glu Lys Glu Gly Asp Lys Val Thr Val Thr 110 115 120
TAT GCG GAT GAT CAA TTA GCC AAG TTG CAA AAA TCC ATT TGG ATG CTG 495 Tyr Ala Asp Asp Gin Leu Ala Lys Leu Gin Lys Ser He Trp Met Leu 125 130 135
CAA GCC CAT TTG GCT TAAGCGACCA AAAAGAAGCC AGCATGAGAG ATTACAGCGA 550
Gin Ala His Leu Ala
140
(2) INFORMATION FOR SEQ ID NO: 358:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 144 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358:
Met Lys Thr Phe Glu He Leu Lys His Leu Gin Ala Asp Ala He Val
1 5 10 15
Leu Phe Met Lys Val His Asn Phe His Trp Asn Val Lys Gly Thr Asp
20 25 30
Phe Phe Asn Val His Lys Ala Thr Glu Glu He Tyr Glu Glu Phe Ala
35 40 45
Asp Met Phe Asp Asp Leu Ala Glu Arg He Val Gin Leu Gly His His
50 55 60
Pro Leu Val Thr Leu Ser Glu Ala He Lys Leu Thr Arg Val Lys Glu 65 70 75 80
Glu Thr Lys Thr Ser Phe His Ser Lys Asp He Phe Lys Glu He Leu
85 90 95
Glu Asp Tyr Lys Tyr Leu Glu Lys Glu Phe Lys Glu Leu Ser Asn Thr
100 105 110
Ala Glu Lys Glu Gly Asp Lys Val Thr Val Thr Tyr Ala Asp Asp Gin
115 120 125
Leu Ala Lys Leu Gin Lys Ser He Trp Met Leu Gin Ala His Leu Ala 130 135 140
(2) INFORMATION FOR SEQ ID NO: 359:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 376 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...323 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359:
CTTTTTTCTC TTCCACTTTT TCCACTTTTT TAAGAGCCTT GTGTTTATGA TTG GGC 56
Leu Gly 1
TTT TTG GGT TCA GGT TTG GGT TTT GGT TTA GGC TCA GGC TTA GGC TTT 104 Phe Leu Gly Ser Gly Leu Gly Phe Gly Leu Gly Ser Gly Leu Gly Phe 5 10 15
TCT ATA GGT TTT GGC GGG GTT GGC GGG GTT GGC GGA GTT GGG GGT GTG 152 Ser He Gly Phe Gly Gly Val Gly Gly Val Gly Gly Val Gly Gly Val 20 25 30
GGA GGC GTT GGA GGT TTT TGG GGG CCA GCC AGC GTG GGT TTA GGA GCG 200 Gly Gly Val Gly Gly Phe Trp Gly Pro Ala Ser Val Gly Leu Gly Ala 35 40 45 50
CCC TGG GTG TTT TTA CTG GGA TCT TGC GAA TGG CCT CTT TTT AAA ACC 248 Pro Trp Val Phe Leu Leu Gly Ser Cys Glu Trp Pro Leu Phe Lys Thr 55 60 65
AAT AAA TTT TCA GGA TTT AAT TTA ACA AGC TTG GGT TTT GAA GGA AAA 296 Asn Lys Phe Ser Gly Phe Asn Leu Thr Ser Leu Gly Phe Glu Gly Lys 70 75 80
AAA TCT TCT CTG TGT TCA AAT AAA AAA TAAATCAACC AGTGTAACAA TACAGAC 350 Lys Ser Ser Leu Cys Ser Asn Lys Lys 85 90
AGAATGAGAG AAAAGAAAAA ATTCCT 376
(2) INFORMATION FOR SEQ ID NO: 360:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 91 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360:
Leu Gly Phe Leu Gly Ser Gly Leu Gly Phe Gly Leu Gly Ser Gly Leu 1 5 10 15
Gly Phe Ser He Gly Phe Gly Gly Val Gly Gly Val Gly Gly Val Gly
20 25 30
Gly Val Gly Gly Val Gly Gly Phe Trp Gly Pro Ala Ser Val Gly Leu
35 40 45
Gly Ala Pro Trp Val Phe Leu Leu Gly Ser Cys Glu Trp Pro Leu Phe
50 55 60
Lys Thr Asn Lys Phe Ser Gly Phe Asn Leu Thr Ser Leu Gly Phe Glu 65 70 75 80
Gly Lys Lys Ser Ser Leu Cys Ser Asn Lys Lys 85 90
(2) INFORMATION FOR SEQ ID NO: 361:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2890 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 458...2836 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361:
TCAAAATTTA AGTTAATTTT AATAATTATT TTTATAGTAT GCATCGGTTT GAATTAAATG 60
AGAAAGGTTA TCACAATGAA TGGTTATTTG AGAGTAAAAA CCTCTTATTT TTTAGCGTTG 120
AACGCTTTGA CTTTTTTGTC TTTTAACTCT TTGGTGGGCG CGAAAGAACA GCATCACACT 180
TTGCAAAAAG TGACAACCAC TGAGCAAAAA TTCAATCCAA GCGCGCCGCT TTCATGGCAA 240
AGCGAAGAGA TGCGTAATTC CACAAGCTCT CGCACGGTGA TTTCCAACAA GGAACTCAAA 300
AAAACGGGGA ATTTGAATAT TGAAAACGCC TTGCAAAACG TGCCAGGGAT TCAAATCAGA 360
GACGCTACAG GCACAGGCGT GCTGCCTAAA ATTTCGGTGC TCAAAATTTA AGTTAATTTT 420
AATAATTATT TTTATAGTAT GCATCGGTTT GAATTAA ATG AGA AAG GTT ATC ACA 475
Met Arg Lys Val He Thr 1 5
ATG AAT GGT TAT TTG AGA GTA AAA ACC TCT TAT TTT TTA GCG TTG AAC 523 Met Asn Gly Tyr Leu Arg Val Lys Thr Ser Tyr Phe Leu Ala Leu Asn 10 15 20
GCT TTG ACT TTT TTG TCT TTT AAC TCT TTG GTG GGC GCG AAA GAA CAG 571 Ala Leu Thr Phe Leu Ser Phe Asn Ser Leu Val Gly Ala Lys Glu Gin 25 30 35
CAT CAC ACT TTG CAA AAA GTG ACA ACC ACT GAG CAA AAA TTC AAT CCA 619 His His Thr Leu Gin Lys Val Thr Thr Thr Glu Gin Lys Phe Asn Pro 40 45 50
AGC GCG CCG CTT TCA TGG CAA AGC GAA GAG ATG CGT AAT TCC ACA AGC 667 Ser Ala Pro Leu Ser Trp Gin Ser Glu Glu Met Arg Asn Ser Thr Ser 55 60 65 70
TCT CGC ACG GTG ATT TCC AAC AAG GAA CTC AAA AAA ACG GGG AAT TTG 715 Ser Arg Thr Val He Ser Asn Lys Glu Leu Lys Lys Thr Gly Asn Leu 75 80 85
AAT ATT GAA AAC GCC TTG CAA AAC GTG CCA GGG ATT CAA ATC AGA GAC 763 Asn He Glu Asn Ala Leu Gin Asn Val Pro Gly He Gin He Arg Asp 90 95 100
GCT ACA GGC ACA GGC GTG CTG CCT AAA ATT TCG GTG CGC GGT TTT GGT 811 Ala Thr Gly Thr Gly Val Leu Pro Lys He Ser Val Arg Gly Phe Gly 105 110 115
GGG GGC GGT AAC GGG CAT AGC AAT ACC AAC ATG ATT TTA GTC AAT GGT 859 Gly Gly Gly Asn Gly His Ser Asn Thr Asn Met He Leu Val Asn Gly 120 125 130
ATC CCC ATT TAT GGC GCG CCG TAT TCC AAT ATT GAA CTG GCG ATT TTC 907 He Pro He Tyr Gly Ala Pro Tyr Ser Asn He Glu Leu Ala He Phe 135 140 145 150
CCT GTA ACT TTC CAG TCA GTG GAT AGG ATT GAT GTG ATT AAA GGG GGC 955 Pro Val Thr Phe Gin Ser Val Asp Arg He Asp Val He Lys Gly Gly 155 160 165
ACG AGC GTG CAA TAC GGC CCT AAT ACT TTT GGA GGC GTG GTG AAT ATC 1003 Thr Ser Val Gin Tyr Gly Pro Asn Thr Phe Gly Gly Val Val Asn He 170 175 180
ATC ACT AAA GAA ATC CCT AAA GAG TGG GAA AAT CAA GCG GCT GAA AGG 1051 He Thr Lys Glu He Pro Lys Glu Trp Glu Asn Gin Ala Ala Glu Arg 185 190 195
ATC ACT TTT TGG GGG CGA TCC TCT AAT GGG AAT TTT GTA GAT CCC AAA 1099 He Thr Phe Trp Gly Arg Ser Ser Asn Gly Asn Phe Val Asp Pro Lys 200 205 210
GAA AAA GGC AAG CCT TTA GCC CAA ACT TTA GGA AAC CAA ATG CTG TTT 1147 Glu Lys Gly Lys Pro Leu Ala Gin Thr Leu Gly Asn Gin Met Leu Phe 215 220 225 230
AAC ACT TAC GGG CGA ACG GCT GGA ATG TTG GGT AAG CAT GTA GGA ATT 1195 Asn Thr Tyr Gly Arg Thr Ala Gly Met Leu Gly Lys His Val Gly He 235 240 245
AGC GCT CAA GGC AAT TGG ATT AAC GGG CAA GGT TTC AGG CAA AAC AGC 1243 Ser Ala Gin Gly Asn Trp He Asn Gly Gin Gly Phe Arg Gin Asn Ser 250 255 260
CCC ACA AAG GTG CAA AAC TAC TTG CTT GAT GCG GTT TAT AAG ATT AAT 1291 Pro Thr Lys Val Gin Asn Tyr Leu Leu Asp Ala Val Tyr Lys He Asn 265 270 275 GCG ACC AAT ACT TTT AAA GCT TAT TAC CAA TAT TAT CAA TAC AAC TCT 1339 Ala Thr Asn Thr Phe Lys Ala Tyr Tyr Gin Tyr Tyr Gin Tyr Asn Ser 280 285 290
TAC CAT CCA GGC ACT TTG AGT GCA CAA GAT TAT GCT TAT AAC CGC TTC 1387 Tyr His Pro Gly Thr Leu Ser Ala Gin Asp Tyr Ala Tyr Asn Arg Phe 295 300 305 310
ATT AAT GAG CGC CCT GAC AAT CAA GAT GGA GGG CGA GCC AAG CGC TTT 1435 He Asn Glu Arg Pro Asp Asn Gin Asp Gly Gly Arg Ala Lys Arg Phe 315 320 325
GGG ATC GTG TAT CAA AAT TAT TTT GGC GAT CCG GAT AGG AAA GTG GGG 1483 Gly He Val Tyr Gin Asn Tyr Phe Gly Asp Pro Asp Arg Lys Val Gly 330 335 340
GGA GAT TTT AAA TTC ACT TAT TTC ACG CAT GAC ATG AGT AGG GAT TTT 1531 Gly Asp Phe Lys Phe Thr Tyr Phe Thr His Asp Met Ser Arg Asp Phe 345 350 355
GGG TTT TCC AAC CAA TAC CAA AGC GTG TAT ATG AGC AGT CAA AAC AAG 1579 Gly Phe Ser Asn Gin Tyr Gin Ser Val Tyr Met Ser Ser Gin Asn Lys 360 365 370
ATT TTA CCT TTT AAA GGC AAG GGA AAA ATT AGC GCG ACT AAC CCT AAT 1627 He Leu Pro Phe Lys Gly Lys Gly Lys He Ser Ala Thr Asn Pro Asn 375 380 385 390
TGC GGT TTG TAT TCT TAT AGC GAC ACG AAC AGC CCT TGT TGG CAA TTT 1675 Cys Gly Leu Tyr Ser Tyr Ser Asp Thr Asn Ser Pro Cys Trp Gin Phe 395 400 405
TTT GAC AAT ATC CGC CGA TCC GTG GTG AAT GCC TTT GAG CCA AAA CTC 1723 Phe Asp Asn He Arg Arg Ser Val Val Asn Ala Phe Glu Pro Lys Leu 410 415 420
AAT CTT ATC GTC AAT ACC GGT AAA GTC AAA CAA ACT TTT AAT ATG GGA 1771 Asn Leu He Val Asn Thr Gly Lys Val Lys Gin Thr Phe Asn Met Gly 425 430 435
ATG CGC TTT TTA ACT GAA GAT TTA TAC CGC CGA TCC ACC ACC AGG AAA 1819 Met Arg Phe Leu Thr Glu Asp Leu Tyr Arg Arg Ser Thr Thr Arg Lys 440 445 450
AAC CCT AGC ATG CCT AAT AAT GGG AGT GGT TTT GAT GCA GGA ACT TCA 1867 Asn Pro Ser Met Pro Asn Asn Gly Ser Gly Phe Asp Ala Gly Thr Ser 455 460 465 470
CTC AAT AAT TTC AAC AAT TAT ACC GCT GTG TAT GCC AGC GAT GAG ATC 1915 Leu Asn Asn Phe Asn Asn Tyr Thr Ala Val Tyr Ala Ser Asp Glu He 475 480 485
AAT TTC AAT AAC GGC ATG CTA ACG ATC ACG CCG GGC TTG AGA TAC ACT 1963 Asn Phe Asn Asn Gly Met Leu Thr He Thr Pro Gly Leu Arg Tyr Thr 490 495 500 TTT TTA AAT TAC GAA AAA AAA GAC GCT CCT CCT TTT AAA GCA GGC CAA 2011 Phe Leu Asn Tyr Glu Lys Lys Asp Ala Pro Pro Phe Lys Ala Gly Gin 505 510 515
ACA GGA AAA ACC ATT AAA GAT CGT TAT AAC CAA TGG AAT CCA GCA GTG 2059 Thr Gly Lys Thr He Lys Asp Arg Tyr Asn Gin Trp Asn Pro Ala Val 520 525 530
AAT GTC GGC TAT AAA CCC ATT AAA GAA TTG TTG TTT TAT TTC AAT TAC 2107 Asn Val Gly Tyr Lys Pro He Lys Glu Leu Leu Phe Tyr Phe Asn Tyr 535 540 545 550
CAA AGA AGC TAC ATT CCG CCT CAA TTC AGC AAT ATC GGT AGT TTT GTA 2155 Gin Arg Ser Tyr He Pro Pro Gin Phe Ser Asn He Gly Ser Phe Val 555 560 565
GGC ACA AGC ACG GAT TAT TTT CAA ATC TTT AAT GTC ATG GAA GGC GGC 2203 Gly Thr Ser Thr Asp Tyr Phe Gin He Phe Asn Val Met Glu Gly Gly 570 575 580
TCA AGA TAT TAT TTT AAC AAC CAA GTG AGT TTT AAC GCG AAT TAT TTT 2251 Ser Arg Tyr Tyr Phe Asn Asn Gin Val Ser Phe Asn Ala Asn Tyr Phe 585 590 595
GTG ATT TTT GCG AAT AAC TAT TTT ACC GGG CGC TAT GGG GAT AAT AAA 2299 Val He Phe Ala Asn Asn Tyr Phe Thr Gly Arg Tyr Gly Asp Asn Lys 600 605 610
GAG CCG GTC AAT GCG AGA TCG CAA GGC GTG GAG CTA GAG TTG TAT TAC 2347 Glu Pro Val Asn Ala Arg Ser Gin Gly Val Glu Leu Glu Leu Tyr Tyr 615 620 625 630
ACG CCG ATT AGA GGG CTT AAT TTC CAT GCG GCT TAC ACT TTC ATA GAT 2395 Thr Pro He Arg Gly Leu Asn Phe His Ala Ala Tyr Thr Phe He Asp 635 640 645
GCC AAT ATC ACA AGC CAC ACG ATG GTT ACT AAC CCC GCT AAT CCT AAA 2443 Ala Asn He Thr Ser His Thr Met Val Thr Asn Pro Ala Asn Pro Lys 650 655 660
GGG CCT AAA AAA GAT ATT TTT GGC AAA AAG CTC CCT TTT GTA AGC CCG 2491 Gly Pro Lys Lys Asp He Phe Gly Lys Lys Leu Pro Phe Val Ser Pro 665 670 675
CAC CAA TTC ATT TTA GAC GCG AGC TAC ACT TAC GCT AAA ACC ACG ATT 2539 His Gin Phe He Leu Asp Ala Ser Tyr Thr Tyr Ala Lys Thr Thr He 680 685 690
GGG TTG AGT TCT TTC TTT TAT AGC CGA ACT TAT AGC GAT GTG TTA AAC 2587 Gly Leu Ser Ser Phe Phe Tyr Ser Arg Thr Tyr Ser Asp Val Leu Asn 695 700 705 710
ACC GTG CCT TTT ATT CAA TAC GCG CCC ACG ATC AAA AAT GGT GCT ATC 2635 Thr Val Pro Phe He Gin Tyr Ala Pro Thr He Lys Asn Gly Ala He 715 720 725 ACT ACC AAA ACA GCG GGC ATG ACG CCA TGG TAT TGG GTG TGG AAT TTG 2683
Thr Thr Lys Thr Ala Gly Met Thr Pro Trp Tyr Trp Val Trp Asn Leu 730 735 740
CAA ATT TCT ACC ACT TTT TGG GAA CGC AAA AAG CAA AGC GTT AAT GCG 2731
Gin He Ser Thr Thr Phe Trp Glu Arg Lys Lys Gin Ser Val Asn Ala 745 750 755
AGC TTG CAA ATC AAT AAC ATT TTT AAC ATG AAA TAT TGG TTT AGC GGG 2779
Ser Leu Gin He Asn Asn He Phe Asn Met Lys Tyr Trp Phe Ser Gly
760 765 770
ATA GGC ACT AGC CTA ACG GGA AAG AAG CCG CGC CTC CTA GGA GCA TCA 2827
He Gly Thr Ser Leu Thr Gly Lys Lys Pro Arg Leu Leu Gly Ala Ser 775 780 785 790
CAG CGT ATG TGAGCTATCA TTTTTAATTT TAGGGTTGTA ATGTTTTGAG AAGTTGGGC 2885 Gin Arg Met
GTAAA 2890
(2) INFORMATION FOR SEQ ID NO: 362:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 793 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362:
Met Arg Lys Val He Thr Met Asn Gly Tyr Leu Arg Val Lys Thr Ser
1 5 10 15
Tyr Phe Leu Ala Leu Asn Ala Leu Thr Phe Leu Ser Phe Asn Ser Leu
20 25 30
Val Gly Ala Lys Glu Gin His His Thr Leu Gin Lys Val Thr Thr Thr
35 40 45
Glu Gin Lys Phe Asn Pro Ser Ala Pro Leu Ser Trp Gin Ser Glu Glu
50 55 60
Met Arg Asn Ser Thr Ser Ser Arg Thr Val He Ser Asn Lys Glu Leu 65 70 75 80
Lys Lys Thr Gly Asn Leu Asn He Glu Asn Ala Leu Gin Asn Val Pro
85 90 95
Gly He Gin He Arg Asp Ala Thr Gly Thr Gly Val Leu Pro Lys He
100 105 110
Ser Val Arg Gly Phe Gly Gly Gly Gly Asn Gly His Ser Asn Thr Asn
115 120 125
Met He Leu Val Asn Gly He Pro He Tyr Gly Ala Pro Tyr Ser Asn
130 135 140
He Glu Leu Ala He Phe Pro Val Thr Phe Gin Ser Val Asp Arg He 145 150 155 160 Asp Val He Lys Gly Gly Thr Ser Val Gin Tyr Gly Pro Asn Thr Phe
165 170 175
Gly Gly Val Val Asn He He Thr Lys Glu He Pro Lys Glu Trp Glu
180 185 190
Asn Gin Ala Ala Glu Arg He Thr Phe Trp Gly Arg Ser Ser Asn Gly
195 200 205
Asn Phe Val Asp Pro Lys Glu Lys Gly Lys Pro Leu Ala Gin Thr Leu
210 215 220
Gly Asn Gin Met Leu Phe Asn Thr Tyr Gly Arg Thr Ala Gly Met Leu 225 230 235 240
Gly Lys His Val Gly He Ser Ala Gin Gly Asn Trp He Asn Gly Gin
245 250 255
Gly Phe Arg Gin Asn Ser Pro Thr Lys Val Gin Asn Tyr Leu Leu Asp
260 265 270
Ala Val Tyr Lys He Asn Ala Thr Asn Thr Phe Lys Ala Tyr Tyr Gin
275 280 285
Tyr Tyr Gin Tyr Asn Ser Tyr His Pro Gly Thr Leu Ser Ala Gin Asp
290 295 300
Tyr Ala Tyr Asn Arg Phe He Asn Glu Arg Pro Asp Asn Gin Asp Gly 305 310 315 320
Gly Arg Ala Lys Arg Phe Gly He Val Tyr Gin Asn Tyr Phe Gly Asp
325 330 335
Pro Asp Arg Lys Val Gly Gly Asp Phe Lys Phe Thr Tyr Phe Thr His
340 345 350
Asp Met Ser Arg Asp Phe Gly Phe Ser Asn Gin Tyr Gin Ser Val Tyr
355 360 365
Met Ser Ser Gin Asn Lys He Leu Pro Phe Lys Gly Lys Gly Lys He
370 375 380
Ser Ala Thr Asn Pro Asn Cys Gly Leu Tyr Ser Tyr Ser Asp Thr Asn 385 390 395 400
Ser Pro Cys Trp Gin Phe Phe Asp Asn He Arg Arg Ser Val Val Asn
405 410 415
Ala Phe Glu Pro Lys Leu Asn Leu He Val Asn Thr Gly Lys Val Lys
420 425 430
Gin Thr Phe Asn Met Gly Met Arg Phe Leu Thr Glu Asp Leu Tyr Arg
435 440 445
Arg Ser Thr Thr Arg Lys Asn Pro Ser Met Pro Asn Asn Gly Ser Gly
450 455 460
Phe Asp Ala Gly Thr Ser Leu Asn Asn Phe Asn Asn Tyr Thr Ala Val 465 470 475 480
Tyr Ala Ser Asp Glu He Asn Phe Asn Asn Gly Met Leu Thr He Thr
485 490 495
Pro Gly Leu Arg Tyr Thr Phe Leu Asn Tyr Glu Lys Lys Asp Ala Pro
500 505 510
Pro Phe Lys Ala Gly Gin Thr Gly Lys Thr He Lys Asp Arg Tyr Asn
515 520 525
Gin Trp Asn Pro Ala Val Asn Val Gly Tyr Lys Pro He Lys Glu Leu
530 535 540
Leu Phe Tyr Phe Asn Tyr Gin Arg Ser Tyr He Pro Pro Gin Phe Ser 545 550 555 560
Asn He Gly Ser Phe Val Gly Thr Ser Thr Asp Tyr Phe Gin He Phe
565 570 575
Asn Val Met Glu Gly Gly Ser Arg Tyr Tyr Phe Asn Asn Gin Val Ser
580 585 590
Phe Asn Ala Asn Tyr Phe Val He Phe Ala Asn Asn Tyr Phe Thr Gly 595 600 605
Arg Tyr Gly Asp Asn Lys Glu Pro Val Asn Ala Arg Ser Gin Gly Val
610 615 620
Glu Leu Glu Leu Tyr Tyr Thr Pro He Arg Gly Leu Asn Phe His Ala 625 630 635 640
Ala Tyr Thr Phe He Asp Ala Asn He Thr Ser His Thr Met Val Thr
645 650 655
Asn Pro Ala Asn Pro Lys Gly Pro Lys Lys Asp He Phe Gly Lys Lys
660 665 670
Leu Pro Phe Val Ser Pro His Gin Phe He Leu Asp Ala Ser Tyr Thr
675 680 685
Tyr Ala Lys Thr Thr He Gly Leu Ser Ser Phe Phe Tyr Ser Arg Thr
690 695 700
Tyr Ser Asp Val Leu Asn Thr Val Pro Phe He Gin Tyr Ala Pro Thr 705 710 715 720
He Lys Asn Gly Ala He Thr Thr Lys Thr Ala Gly Met Thr Pro Trp
725 730 735
Tyr Trp Val Trp Asn Leu Gin He Ser Thr Thr Phe Trp Glu Arg Lys
740 745 750
Lys Gin Ser Val Asn Ala Ser Leu Gin He Asn Asn He Phe Asn Met
755 760 765
Lys Tyr Trp Phe Ser Gly He Gly Thr Ser Leu Thr Gly Lys Lys Pro
770 775 780
Arg Leu Leu Gly Ala Ser Gin Arg Met 785 790
(2) INFORMATION FOR SEQ ID NO:363:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 406 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...353 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.363:
TCGTCTTTTG TCGGTTCAAA AAAATCTCTC GACTTAAAAA AACCTATTAA ATC AAG 56
He Lys 1
ATA ATC TAT CAA ATC ATC AAG TTT TTT CGT TCT AAG AAT TTT ATT TTG 104 He He Tyr Gin He He Lys Phe Phe Arg Ser Lys Asn Phe He Leu 5 10 15
TTT TTT AGA ATA GCA ACG ATA AAG TTC TTC TTT TAT CTC ATT TGG GAA 152 Phe Phe Arg He Ala Thr He Lys Phe Phe Phe Tyr Leu He Trp Glu 20 25 30
TTT TTG AAT GTT ATA GAC AAT TTC ACT ATC TCT TTG ATT TTG TTT ATA 200 Phe Leu Asn Val He Asp Asn Phe Thr He Ser Leu He Leu Phe He 35 40 45 50
TTT TTT AGC CCC ATA CCA AAG AAA TAT TTG ATA AAA AAT AAG AAA AAT 248 Phe Phe Ser Pro He Pro Lys Lys Tyr Leu He Lys Asn Lys Lys Asn 55 60 65
AGC GTA AAA GAA AAA GAA AAT AAA GAA AAA AGA AAG AGA AAA AGA AAG 296 Ser Val Lys Glu Lys Glu Asn Lys Glu Lys Arg Lys Arg Lys Arg Lys 70 75 80
GAT TTT GTT TTG GGT GTA TTG GAA AAT AGA CTC AAA AAT CAA TTG AAA 344 Asp Phe Val Leu Gly Val Leu Glu Asn Arg Leu Lys Asn Gin Leu Lys 85 90 95
AAC CCC TTT TAGATTAAAA ATAAAAACAA TAAGCGAAAC GACAAAAGCA AGCAGAAAA 402 Asn Pro Phe 100
GAAG 406
(2) INFORMATION FOR SEQ ID NO: 364:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 101 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:364:
He Lys He He Tyr Gin He He Lys Phe Phe Arg Ser Lys Asn Phe
1 5 10 15
He Leu Phe Phe Arg He Ala Thr He Lys Phe Phe Phe Tyr Leu He
20 25 30
Trp Glu Phe Leu Asn Val He Asp Asn Phe Thr He Ser Leu He Leu
35 40 45
Phe He Phe Phe Ser Pro He Pro Lys Lys Tyr Leu He Lys Asn Lys
50 55 60
Lys Asn Ser Val Lys Glu Lys Glu Asn Lys Glu Lys Arg Lys Arg Lys 65 70 75 80
Arg Lys Asp Phe Val Leu Gly Val Leu Glu Asn Arg Leu Lys Asn Gin
85 90 95
Leu Lys Asn Pro Phe 100
(2) INFORMATION FOR SEQ ID NO: 365:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1143 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 89...1087 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365:
GATTTTGTGA AAAATAGTTT CATTTTTACT GCTTGTATTT CCTTAATGGT GTTATAATCG 60 CTCCATAAAT CATACAAAAA GGATCGTT ATG TTA GTT ACT CGC TTT AAA AAA 112
Met Leu Val Thr Arg Phe Lys Lys 1 5
GCT TTA ATC TCT TAT TCT TTA GGC GCG CTT CTT GTT TCA TCG TTA TTG 160 Ala Leu He Ser Tyr Ser Leu Gly Ala Leu Leu Val Ser Ser Leu Leu 10 15 20
GGC GTG GCT AGT GCT TCC AAT CAA GAA ATC CAA GTC AAA GAT TAT TTT 208 Gly Val Ala Ser Ala Ser Asn Gin Glu He Gin Val Lys Asp Tyr Phe 25 30 35 40
GGG GAT CAA GCC ATC AAG CTT CCT GTT TCT AAA ATA ATC TAC TTG GGT 256 Gly Asp Gin Ala He Lys Leu Pro Val Ser Lys He He Tyr Leu Gly 45 50 55
AGC TTT GCA GAA GTG CCT GCT ATG TTC CAT ACT TGG GAT AGG GTC GTG 304 Ser Phe Ala Glu Val Pro Ala Met Phe His Thr Trp Asp Arg Val Val 60 65 70
GGA ATT TCG GAT TAC GCT TTT AAA TCT GAT ATT GTT AAA GCT ACT CTC 352 Gly He Ser Asp Tyr Ala Phe Lys Ser Asp He Val Lys Ala Thr Leu 75 80 85
AAA GAT CCT AAA CGC ATT AAA TCC ATG AGC AGT GAT CAT GTG GCG GCG 400 Lys Asp Pro Lys Arg He Lys Ser Met Ser Ser Asp His Val Ala Ala 90 95 100
TTG AAT GTG GAG CTT TTA AAA AAG CTT GGC CCC GAT CTT GTG GTA ACC 448 Leu Asn Val Glu Leu Leu Lys Lys Leu Gly Pro Asp Leu Val Val Thr 105 110 115 120
TTT GTG GGC AAC CCT AAA GCG GTA GAG CAT GCG AAA AAA TTT GGT ATA 496 Phe Val Gly Asn Pro Lys Ala Val Glu His Ala Lys Lys Phe Gly He 125 130 135
TTA TTT CTT TCT TTC CAA GAA AAA ACC ATT GCA GAA GTC ATG GAA GAT 544 Leu Phe Leu Ser Phe Gin Glu Lys Thr He Ala Glu Val Met Glu Asp 140 145 150
ATT GAC GCT CAA GCT AAA GCC TTA GAA ATT GAT GCT TCT AAA AAA CTG 592 He Asp Ala Gin Ala Lys Ala Leu Glu He Asp Ala Ser Lys Lys Leu 155 160 165
GCC AAA ATG CAA GAA ACT TTG GAT TTT ATT GCT GAG CGT TTG AAA GGT 640 Ala Lys Met Gin Glu Thr Leu Asp Phe He Ala Glu Arg Leu Lys Gly 170 175 180
GTC AAA AAG AAA AAA GGG GTG GAG CTT TTC CAT AAG GCC AAT AAG ATC 688 Val Lys Lys Lys Lys Gly Val Glu Leu Phe His Lys Ala Asn Lys He 185 190 195 200
AGC GGC CAT CAA GCC CTT GAT TCA GAC ATT TTA GAA AAA GGA GGC ATA 736 Ser Gly His Gin Ala Leu Asp Ser Asp He Leu Glu Lys Gly Gly He 205 210 215
GAC AAT TTT GGC TTG AAA TAT GTC AAA TTT GGG CGT GCT GAC ATT AGC 784 Asp Asn Phe Gly Leu Lys Tyr Val Lys Phe Gly Arg Ala Asp He Ser 220 225 230
GTG GAA AAA ATC GTT AAA GAA AAC CCT GAG ATT ATC TTT ATT TGG TGG 832 Val Glu Lys He Val Lys Glu Asn Pro Glu He He Phe He Trp Trp 235 240 245
ATA AGC CCA CTC ACG CCT GAA GAT GTG TTA AAC AAC CCC AAA TTT GCT 880 He Ser Pro Leu Thr Pro Glu Asp Val Leu Asn Asn Pro Lys Phe Ala 250 255 260
ACC ATC AAA GCC ATT AAA AAC AAG CAG GTT TAT AAA CTC CCC ACA ATG 928 Thr He Lys Ala He Lys Asn Lys Gin Val Tyr Lys Leu Pro Thr Met 265 270 275 280
GAT ATT GGC GGG CCT AGA GCC CCA CTC ATA AGT CTT TTT ATC GCT CTA 976 Asp He Gly Gly Pro Arg Ala Pro Leu He Ser Leu Phe He Ala Leu 285 290 295
AAA GCC CAC CCT GAA GCC TTT AAG GGC GTG GAT ATT AAT GCG ATT GTT 1024 Lys Ala His Pro Glu Ala Phe Lys Gly Val Asp He Asn Ala He Val 300 305 310
AAA GAC TAC TAT AAA GTG GTT TTT GAT TTG AAT GAT GCA GAG GTT GAA 1072 Lys Asp Tyr Tyr Lys Val Val Phe Asp Leu Asn Asp Ala Glu Val Glu 315 320 325
CCC TTT TTA TGG CAT TAATTTTTAA AAAAGGGCTG ATATTTTTAG CCCTTTGTGT A 1128 Pro Phe Leu Trp His 330
TCGCGCTAGG ATTAG 1143
(2) INFORMATION FOR SEQ ID NO: 366:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 333 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366:
Met Leu Val Thr Arg Phe Lys Lys Ala Leu He Ser Tyr Ser Leu Gly
1 5 10 15
Ala Leu Leu Val Ser Ser Leu Leu Gly Val Ala Ser Ala Ser Asn Gin
20 25 30
Glu He Gin Val Lys Asp Tyr Phe Gly Asp Gin Ala He Lys Leu Pro
35 40 45
Val Ser Lys He He Tyr Leu Gly Ser Phe Ala Glu Val Pro Ala Met
50 55 60
Phe His Thr Trp Asp Arg Val Val Gly He Ser Asp Tyr Ala Phe Lys 65 70 75 80
Ser Asp He Val Lys Ala Thr Leu Lys Asp Pro Lys Arg He Lys Ser
85 90 95
Met Ser Ser Asp His Val Ala Ala Leu Asn Val Glu Leu Leu Lys Lys
100 105 110
Leu Gly Pro Asp Leu Val Val Thr Phe Val Gly Asn Pro Lys Ala Val
115 120 125
Glu His Ala Lys Lys Phe Gly He Leu Phe Leu Ser Phe Gin Glu Lys
130 135 140
Thr He Ala Glu Val Met Glu Asp He Asp Ala Gin Ala Lys Ala Leu 145 150 155 160
Glu He Asp Ala Ser Lys Lys Leu Ala Lys Met Gin Glu Thr Leu Asp
165 170 175
Phe He Ala Glu Arg Leu Lys Gly Val Lys Lys Lys Lys Gly Val Glu
180 185 190
Leu Phe His Lys Ala Asn Lys He Ser Gly His Gin Ala Leu Asp Ser
195 200 205
Asp He Leu Glu Lys Gly Gly He Asp Asn Phe Gly Leu Lys Tyr Val
210 215 220
Lys Phe Gly Arg Ala Asp He Ser Val Glu Lys He Val Lys Glu Asn 225 230 235 240
Pro Glu He He Phe He Trp Trp He Ser Pro Leu Thr Pro Glu Asp
245 250 255
Val Leu Asn Asn Pro Lys Phe Ala Thr He Lys Ala He Lys Asn Lys
260 265 270
Gin Val Tyr Lys Leu Pro Thr Met Asp He Gly Gly Pro Arg Ala Pro
275 280 285
Leu He Ser Leu Phe He Ala Leu Lys Ala His Pro Glu Ala Phe Lys
290 295 300
Gly Val Asp He Asn Ala He Val Lys Asp Tyr Tyr Lys Val Val Phe 305 310 315 320
Asp Leu Asn Asp Ala Glu Val Glu Pro Phe Leu Trp His 325 330
(2) INFORMATION FOR SEQ ID NO: 367: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 898 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...845 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367:
GGGGGGTTAT AGTAAAAACA TGCAAGTAAT TTAAAGTTAA TTTAAGATAA TTA GGC 56
Leu Gly 1
ACA ATA GCC ACA AAA AGT TTA AGG CTG TAT TTG AAA ACT CTA TTT AGT 104 Thr He Ala Thr Lys Ser Leu Arg Leu Tyr Leu Lys Thr Leu Phe Ser 5 10 15
ATT TAT CTC TTT TTA TCG TTG AAC CCA CTC TTT TTA GAA GCT AAT GAA 152 He Tyr Leu Phe Leu Ser Leu Asn Pro Leu Phe Leu Glu Ala Asn Glu 20 25 30
ATC ACT TGG TCT AAA TTC TTG GAA AAT TTT AAA AAC AAG AAT GAT GAT 200 He Thr Trp Ser Lys Phe Leu Glu Asn Phe Lys Asn Lys Asn Asp Asp 35 40 45 50
GAC AAA CCT AAA CCC CTA ACT ATT GAT AAA AAC AAT GAA AAA CAG CAA 248 Asp Lys Pro Lys Pro Leu Thr He Asp Lys Asn Asn Glu Lys Gin Gin 55 60 65
ATC TTA GAC AAA AAC CAG CAA ATC TTA AAA AGG GCT TTG GAA AAA AGC 296 He Leu Asp Lys Asn Gin Gin He Leu Lys Arg Ala Leu Glu Lys Ser 70 75 80
CTT AAA TTC TTT TTC ATT TTT GGA TAC AAC TAT TCG CAA GCC ACT TTT 344 Leu Lys Phe Phe Phe He Phe Gly Tyr Asn Tyr Ser Gin Ala Thr Phe 85 90 95
TCA ACT TCT AAC CAA ACC TTG ACT TTT GTA GCC AAT AGC ATA GGG TTT 392 Ser Thr Ser Asn Gin Thr Leu Thr Phe Val Ala Asn Ser He Gly Phe 100 105 110
AAC ACC GCT ACC GGT TTA GAG CAT TTT TTA AGA AAC CAC CCT AAA GTC 440 Asn Thr Ala Thr Gly Leu Glu His Phe Leu Arg Asn His Pro Lys Val 115 120 125 130
GGT TTT AGA ATC TTT AGC GTC TAT AAC TAT TTC CAT TCT GTT TCC CTC 488 Gly Phe Arg He Phe Ser Val Tyr Asn Tyr Phe His Ser Val Ser Leu 135 140 145
TCC CAG CCT CAA ACC TTA ATG GTG CAA AAT TAT GGG GGC GCG TTA GAT 536 Ser Gin Pro Gin Thr Leu Met Val Gin Asn Tyr Gly Gly Ala Leu Asp 150 155 160
TTT TCT TGG ATT TTT GTA GAT AAA AAT ATT TAT CGC TTT AGG AGT TAT 584 Phe Ser Trp He Phe Val Asp Lys Asn He Tyr Arg Phe Arg Ser Tyr 165 170 175
TTA GGG ATC GCT TTA GAA CAA GGG GTG TTG TTA GTG GAT ACG ATT AAA 632 Leu Gly He Ala Leu Glu Gin Gly Val Leu Leu Val Asp Thr He Lys 180 185 190
CCA GGT GCT ATC ACA ACG ATT ATC CCA AGA ACC AAA AAA ACC TTT TTT 680 Pro Gly Ala He Thr Thr He He Pro Arg Thr Lys Lys Thr Phe Phe 195 200 205 210
CAA GCC CCT TTG CGT TTT GGT TTT ATC GTG GAT TTT ATC GGC TAT TTG 728 Gin Ala Pro Leu Arg Phe Gly Phe He Val Asp Phe He Gly Tyr Leu 215 220 225
TCT TTG CAA TTA GGG ATT GAA ATG CCT TTA GTG AGG AAT GTT TTT TAC 776 Ser Leu Gin Leu Gly He Glu Met Pro Leu Val Arg Asn Val Phe Tyr 230 235 240
ACC TAC AAC AAC CAT CAA GAA AGA TTC AAA CCA CGA TTT AAC GCT AAT 824 Thr Tyr Asn Asn His Gin Glu Arg Phe Lys Pro Arg Phe Asn Ala Asn 245 250 255
CTT TCT TTA ATC GTT TCG TTT TAGCCCCCCT TTTCCCCTTT TAAATAAGCC CATG 879 Leu Ser Leu He Val Ser Phe 260 265
ATTTTCCTAG GGTATTTTA 89E
(2) INFORMATION FOR SEQ ID NO: 368:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 265 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368:
Leu Gly Thr He Ala Thr Lys Ser Leu Arg Leu Tyr Leu Lys Thr Leu
1 5 10 15
Phe Ser He Tyr Leu Phe Leu Ser Leu Asn Pro Leu Phe Leu Glu Ala
20 25 30
Asn Glu He Thr Trp Ser Lys Phe Leu Glu Asn Phe Lys Asn Lys Asn 35 40 45 Asp Asp Asp Lys Pro Lys Pro Leu Thr He Asp Lys Asn Asn Glu Lys
50 55 60
Gin Gin He Leu Asp Lys Asn Gin Gin He Leu Lys Arg Ala Leu Glu 65 70 75 80
Lys Ser Leu Lys Phe Phe Phe He Phe Gly Tyr Asn Tyr Ser Gin Ala
85 90 95
Thr Phe Ser Thr Ser Asn Gin Thr Leu Thr Phe Val Ala Asn Ser He
100 105 110
Gly Phe Asn Thr Ala Thr Gly Leu Glu His Phe Leu Arg Asn His Pro
115 120 125
Lys Val Gly Phe Arg He Phe Ser Val Tyr Asn Tyr Phe His Ser Val
130 135 140
Ser Leu Ser Gin Pro Gin Thr Leu Met Val Gin Asn Tyr Gly Gly Ala 145 150 155 160
Leu Asp Phe Ser Trp He Phe Val Asp Lys Asn He Tyr Arg Phe Arg
165 170 175
Ser Tyr Leu Gly He Ala Leu Glu Gin Gly Val Leu Leu Val Asp Thr
180 185 190
He Lys Pro Gly Ala He Thr Thr He He Pro Arg Thr Lys Lys Thr
195 200 205
Phe Phe Gin Ala Pro Leu Arg Phe Gly Phe He Val Asp Phe He Gly
210 215 220
Tyr Leu Ser Leu Gin Leu Gly He Glu Met Pro Leu Val Arg Asn Val 225 230 235 240
Phe Tyr Thr Tyr Asn Asn His Gin Glu Arg Phe Lys Pro Arg Phe Asn
245 250 255
Ala Asn Leu Ser Leu He Val Ser Phe 260 265
(2) INFORMATION FOR SEQ ID NO: 369:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 742 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...689 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369:
GCACTTCACG CATTCCATAA ACGATATTGA TTATTTGCTA GACAGCTTGA AAA AAG 56
Lys Lys 1
CAG TTA AAA AAT TGC GTT AAG CTA AAA CTA TTT TTA AGG AAA AAT TTG 104 Gin Leu Lys Asn Cys Val Lys Leu Lys Leu Phe Leu Arg Lys Asn Leu 5 10 15 GAT ATT TTA GAT TTG AAC AAA GCG CAA GCG GTG CAA CAA AAT GAA CAA 152 Asp He Leu Asp Leu Asn Lys Ala Gin Ala Val Gin Gin Asn Glu Gin 20 25 30
GAG GTA GAG GAT AAA GAG CGA GAG TCT AAA GAG CCG GTG GTT TTA GAA 200 Glu Val Glu Asp Lys Glu Arg Glu Ser Lys Glu Pro Val Val Leu Glu 35 40 45 50
GAT TTG AGC GCT TTA GCG TGG CTT GAA TTA GAA GAG TTT AGC CGC CTT 248 Asp Leu Ser Ala Leu Ala Trp Leu Glu Leu Glu Glu Phe Ser Arg Leu 55 60 65
TCA GGG CTT CCT AAA GAA AGG ATT TTG GAA TTA GTG AAT CTT GGT AAA 296 Ser Gly Leu Pro Lys Glu Arg He Leu Glu Leu Val Asn Leu Gly Lys 70 75 80
ATC AAG AGC AAA ATA AGC AGC AAC AAG CTT TTA ATT GAT GCG AGC AGC 344 He Lys Ser Lys He Ser Ser Asn Lys Leu Leu He Asp Ala Ser Ser 85 90 95
GGG ACA AAC GCT TTA ATC AAA AAG GTA GAA AAT AGT TTG ATT TCT ATG 392 Gly Thr Asn Ala Leu He Lys Lys Val Glu Asn Ser Leu He Ser Met 100 105 110
GAT ATG AAC GGG CGT TCT TTA GAA CCT GTG TTT GTG GAA AAG ACC ATT 440 Asp Met Asn Gly Arg Ser Leu Glu Pro Val Phe Val Glu Lys Thr He 115 120 125 130
AAC ACG ATT TTA AAC TTG CAT GAT AAG GTC ATT GGC GCT AAA GAT GAA 488 Asn Thr He Leu Asn Leu His Asp Lys Val He Gly Ala Lys Asp Glu 135 140 145
ACG ATT TCA GCC TTT AAA AAT GAA AAC ATG TTT TTA AAA GAC GCT TTA 536 Thr He Ser Ala Phe Lys Asn Glu Asn Met Phe Leu Lys Asp Ala Leu 150 155 160
ATC TCT ATG CAA GAA GTC TAT GAA GAA GAT AAA AAA ACC ATT GAT CTT 584 He Ser Met Gin Glu Val Tyr Glu Glu Asp Lys Lys Thr He Asp Leu 165 170 175
TTG CGC GAT GAA CTC AAT CAA GCG AGA GAA GAA ATT GAA TTT ATG AAG 632 Leu Arg Asp Glu Leu Asn Gin Ala Arg Glu Glu He Glu Phe Met Lys 180 185 190
AGG AAA TAC CGC TTG ATG TGG GGG AAA GTC GCT GAC ATG AGC AGC GTG 680 Arg Lys Tyr Arg Leu Met Trp Gly Lys Val Ala Asp Met Ser Ser Val 195 200 205 210
AAT AAA AAG TAGTTTTAAA TTAACGCCCA TGCTGAGGGC TTATTAGCGG TAATTTTAG 738 Asn Lys Lys
GTGA 742
(2) INFORMATION FOR SEQ ID NO: 370: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 213 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370:
Lys Lys Gin Leu Lys Asn Cys Val Lys Leu Lys Leu Phe Leu Arg Lys
1 5 10 15
Asn Leu Asp He Leu Asp Leu Asn Lys Ala Gin Ala Val Gin Gin Asn
20 25 30
Glu Gin Glu Val Glu Asp Lys Glu Arg Glu Ser Lys Glu Pro Val Val
35 40 45
Leu Glu Asp Leu Ser Ala Leu Ala Trp Leu Glu Leu Glu Glu Phe Ser
50 55 60
Arg Leu Ser Gly Leu Pro Lys Glu Arg He Leu Glu Leu Val Asn Leu 65 70 75 80
Gly Lys He Lys Ser Lys He Ser Ser Asn Lys Leu Leu He Asp Ala
85 90 95
Ser Ser Gly Thr Asn Ala Leu He Lys Lys Val Glu Asn Ser Leu He
100 105 110
Ser Met Asp Met Asn Gly Arg Ser Leu Glu Pro Val Phe Val Glu Lys
115 120 125
Thr He Asn Thr He Leu Asn Leu His Asp Lys Val He Gly Ala Lys
130 135 140
Asp Glu Thr He Ser Ala Phe Lys Asn Glu Asn Met Phe Leu Lys Asp 145 150 155 160
Ala Leu He Ser Met Gin Glu Val Tyr Glu Glu Asp Lys Lys Thr He
165 170 175
Asp Leu Leu Arg Asp Glu Leu Asn Gin Ala Arg Glu Glu He Glu Phe
180 185 190
Met Lys Arg Lys Tyr Arg Leu Met Trp Gly Lys Val Ala Asp Met Ser
195 200 205
Ser Val Asn Lys Lys 210
(2) INFORMATION FOR SEQ ID NO: 371:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1004 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 56...931 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371:
CATTAAACGC ATGATTTTTG CTACAATAAT AGGATTTTAA TTATATAAAG GACAA ATG 58
Met 1
GGC ATG CCA AAT AGG GGC GTT GTT TTA TTA GAC GGG CAA GCG CTA GCT 106 Gly Met Pro Asn Arg Gly Val Val Leu Leu Asp Gly Gin Ala Leu Ala 5 10 15
GAT AAT ATA GAA AAA GAT TTG AAA CAT AAA ATC CAA ATA ATA ACC GCA 154 Asp Asn He Glu Lys Asp Leu Lys His Lys He Gin He He Thr Ala 20 25 30
CAA ACG CAT AAA CGC CCC AAA CTA GCC GTG ATT TTA GTG GGG AAA GAT 202 Gin Thr His Lys Arg Pro Lys Leu Ala Val He Leu Val Gly Lys Asp 35 40 45
CCC GCT AGT ATC ACT TAT GTC AAT ATG AAG ATC AAA GCA TGC GAA AGG 250 Pro Ala Ser He Thr Tyr Val Asn Met Lys He Lys Ala Cys Glu Arg 50 55 60 65
GTG GGC ATG GAT TTT GAC TTA AAA ACC CTC CAA GAA AAT ATT ACT GAA 298 Val Gly Met Asp Phe Asp Leu Lys Thr Leu Gin Glu Asn He Thr Glu 70 75 80
GCC AAA TTG CTA TCC TTG ATT AAA GAT TAC AAT ACC GAT CAA AAC ATT 346 Ala Lys Leu Leu Ser Leu He Lys Asp Tyr Asn Thr Asp Gin Asn He 85 90 95
TCA GGC GTT TTA GTC CAG CTC CCT TTG CCC AGA CAC ATT GAT ACT AAA 394 Ser Gly Val Leu Val Gin Leu Pro Leu Pro Arg His He Asp Thr Lys 100 105 110
ATG ATT TTA GAA GCC ATT GAC CCA AAC AAA GAT GTG GAT GGT TTC CAC 442 Met He Leu Glu Ala He Asp Pro Asn Lys Asp Val Asp Gly Phe His 115 120 125
CCC CTT AAT ATC GGT AAG CTC TGC ACT CAA AAA GAA TCG TTT CTG CCA 490 Pro Leu Asn He Gly Lys Leu Cys Thr Gin Lys Glu Ser Phe Leu Pro 130 135 140 145
GCC ACC CCT ATG GGC GTG ATG CGG CTT TTA GAG CAT TAC CAT ATT GAA 538 Ala Thr Pro Met Gly Val Met Arg Leu Leu Glu His Tyr His He Glu 150 155 160
ATC AAG GGT AAG GAT GTG GCG ATT ATT GGA GCG AGC AAT ATC ATT GGC 586 He Lys Gly Lys Asp Val Ala He He Gly Ala Ser Asn He He Gly 165 170 175
AAA CCT TTA AGC ATG CTC ATG CTA AAC GCT GGG GCT AGC GTG AGC GTG 634 Lys Pro Leu Ser Met Leu Met Leu Asn Ala Gly Ala Ser Val Ser Val 180 185 190
TGC CAT ATT TTG ACT AAA GAC ATT AGT TTT TAC ACC CAA AAC GCT GAT 682 Cys His He Leu Thr Lys Asp He Ser Phe Tyr Thr Gin Asn Ala Asp 195 200 205
ATT GTC TGC GTG GGC GTG GGT AAA CCT GAT TTG ATT AAA GCG AGC ATG 730 He Val Cys Val Gly Val Gly Lys Pro Asp Leu He Lys Ala Ser Met 210 215 220 225
TTA AAA AAA GGG GCT GTA GTG GTG GAT ATT GGG ATC AAT CAT TTG AAC 778 Leu Lys Lys Gly Ala Val Val Val Asp He Gly He Asn His Leu Asn 230 235 240
GAT GGG CGT ATC GTG GGC GAT GTG GAT TTT AAC AAC GTG CAA AAA GTC 826 Asp Gly Arg He Val Gly Asp Val Asp Phe Asn Asn Val Gin Lys Val 245 250 255
GCC GGT TTT ATC ACC CCT GTG CCT AAA GGC GTG GGG CCT ATG ACG ATT 874 Ala Gly Phe He Thr Pro Val Pro Lys Gly Val Gly Pro Met Thr He 260 265 270
GTC TCG CTT TTA GAA AAC ACT CTA ATC GCT TTT GAA AAA CAA CAA AGG 922 Val Ser Leu Leu Glu Asn Thr Leu He Ala Phe Glu Lys Gin Gin Arg 275 280 285
AAG GGA TTT TAATGAAATT TTTACGCTCT GTTTATGCAT TTTGCTCCAG TTGGGTAGG 980
Lys Gly Phe
290
GACGATTGTT ATTGTGCTGT TGGT 1004
(2) INFORMATION FOR SEQ ID NO: 372:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 292 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:372:
Met Gly Met Pro Asn Arg Gly Val Val Leu Leu Asp Gly Gin Ala Leu
1 5 10 15
Ala Asp Asn He Glu Lys Asp Leu Lys His Lys He Gin He He Thr
20 25 30
Ala Gin Thr His Lys Arg Pro Lys Leu Ala Val He Leu Val Gly Lys
35 40 45
Asp Pro Ala Ser He Thr Tyr Val Asn Met Lys He Lys Ala Cys Glu
50 55 60
Arg Val Gly Met Asp Phe Asp Leu Lys Thr Leu Gin Glu Asn He Thr 65 70 75 80
Glu Ala Lys Leu Leu Ser Leu He Lys Asp Tyr Asn Thr Asp Gin Asn
85 90 95
He Ser Gly Val Leu Val Gin Leu Pro Leu Pro Arg His He Asp Thr
100 105 110
Lys Met He Leu Glu Ala He Asp Pro Asn Lys Asp Val Asp Gly Phe
115 120 125
His Pro Leu Asn He Gly Lys Leu Cys Thr Gin Lys Glu Ser Phe Leu
130 135 140
Pro Ala Thr Pro Met Gly Val Met Arg Leu Leu Glu His Tyr His He 145 150 155 160
Glu He Lys Gly Lys Asp Val Ala He He Gly Ala Ser Asn He He
165 170 175
Gly Lys Pro Leu Ser Met Leu Met Leu Asn Ala Gly Ala Ser Val Ser
180 185 190
Val Cys His He Leu Thr Lys Asp He Ser Phe Tyr Thr Gin Asn Ala
195 200 205
Asp He Val Cys Val Gly Val Gly Lys Pro Asp Leu He Lys Ala Ser
210 215 220
Met Leu Lys Lys Gly Ala Val Val Val Asp He Gly He Asn His Leu 225 230 235 240
Asn Asp Gly Arg He Val Gly Asp Val Asp Phe Asn Asn Val Gin Lys
245 250 255
Val Ala Gly Phe He Thr Pro Val Pro Lys Gly Val Gly Pro Met Thr
260 265 270
He Val Ser Leu Leu Glu Asn Thr Leu He Ala Phe Glu Lys Gin Gin
275 280 285
Arg Lys Gly Phe 290
(2) INFORMATION FOR SEQ ID NO:373:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2162 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 66...2099 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:373:
CCCTTGCTTC TTTTTGTCTT TTTTAAGACT TTATCTCTTG TTAAAAAAAG GTTGTATTAA 60
CGCTT ATG AAA TCC CTA TCT AAT GCC CTT TTT TCG CTC TTT TTA AAA GGT 110
Met Lys Ser Leu Ser Asn Ala Leu Phe Ser Leu Phe Leu Lys Gly
1 5 10 15 TTT TAT TTC ACC TTT TTT ATG AGC TTG TTG TTT GTG TTT AAT CGT ATC 158 Phe Tyr Phe Thr Phe Phe Met Ser Leu Leu Phe Val Phe Asn Arg He 20 25 30
GGC TTT ATC CTT TAT ACT GGC TAT TAT AAG CAT GCT TTA AAA AAC CCT 206 Gly Phe He Leu Tyr Thr Gly Tyr Tyr Lys His Ala Leu Lys Asn Pro 35 40 45
GTT TTT GAT GAA ATC ATC AAA ACC CTA TTC AAT GGA GCC AGA TAT GAT 254 Val Phe Asp Glu He He Lys Thr Leu Phe Asn Gly Ala Arg Tyr Asp 50 55 60
AAT CGT GTG GTC TCA AGC TTA GCG ATT CTT TTT ATC ATC ATC GGG TTA 302 Asn Arg Val Val Ser Ser Leu Ala He Leu Phe He He He Gly Leu 65 70 75
TTG GGG TTA TTT ATC CCT AAA CAC CAA ACC AAA ATG CTT AAT ATT GTG 350 Leu Gly Leu Phe He Pro Lys His Gin Thr Lys Met Leu Asn He Val 80 85 90 95
GCG TAT TTT TCT ATC GCT ATT ATC CTG TTT TTA AAC ATT GCA AAC ATT 398 Ala Tyr Phe Ser He Ala He He Leu Phe Leu Asn He Ala Asn He 100 105 110
GTT TAT TAT GGT ATT TAT GGG AAT GTG TTT GAT GAA AAT TTA TTG GAA 446 Val Tyr Tyr Gly He Tyr Gly Asn Val Phe Asp Glu Asn Leu Leu Glu 115 120 125
TTT TTG CAT GAA GAC ACG CTC ACG ATT TTA AAA ATG AGC GGG GAA TAC 494 Phe Leu His Glu Asp Thr Leu Thr He Leu Lys Met Ser Gly Glu Tyr 130 135 140
CCT ATT TTT TCT AGT TTT TCA CTC TTT TTA ATC CTT AGC GTT TTA ACC 542 Pro He Phe Ser Ser Phe Ser Leu Phe Leu He Leu Ser Val Leu Thr 145 150 155
TCT TTT ATC TAT TTC AAA CTC CAA AAC GAC CTT TTT AAA CCC AAA AAT 590 Ser Phe He Tyr Phe Lys Leu Gin Asn Asp Leu Phe Lys Pro Lys Asn 160 165 170 175
GCT TAT CAA GCC GCC CAC ACC AAA CCC CTT AAA ACT TTC ATT TTA TTT 638 Ala Tyr Gin Ala Ala His Thr Lys Pro Leu Lys Thr Phe He Leu Phe 180 185 190
GCG CTT TTT TCC CTC ACA CAA ATG TTT TAC ATT AAC GCG CAA TTG AGT 686 Ala Leu Phe Ser Leu Thr Gin Met Phe Tyr He Asn Ala Gin Leu Ser 195 200 205
TTT GTG GGC GCG TCT TTA GAT CTC AGC ATA GAG CCA GCC AAA GAT CCT 734 Phe Val Gly Ala Ser Leu Asp Leu Ser He Glu Pro Ala Lys Asp Pro 210 215 220
TTT TTA ATG AAA ATT ACC CCC GGA GCG TTT CGC AAC CTT TAT CTT TTA 782 Phe Leu Met Lys He Thr Pro Gly Ala Phe Arg Asn Leu Tyr Leu Leu 225 230 235 GCA CGC AAT TAC AGA CAA AGC CAT AAC CTT AAA TTC AGC GAT TTT GCT 830 Ala Arg Asn Tyr Arg Gin Ser His Asn Leu Lys Phe Ser Asp Phe Ala 240 245 250 255
AAA GAA ACG CCT TTA GAA GTG GCG AAA AAT TAT TTC CAT CTT AAA GAG 878 Lys Glu Thr Pro Leu Glu Val Ala Lys Asn Tyr Phe His Leu Lys Glu 260 265 270
AAC CCT TCA AAC AAC CTC TAT GAG TTG CTA ACT CAG ACA AGC CGC AAC 926 Asn Pro Ser Asn Asn Leu Tyr Glu Leu Leu Thr Gin Thr Ser Arg Asn 275 280 285
AAT TCC AAT CAA ACC ATT CAA CAT GTT TTT TAT ATC GTT TCA GAG TCT 974 Asn Ser Asn Gin Thr He Gin His Val Phe Tyr He Val Ser Glu Ser 290 295 300
TTG AGT TCA TGG CAT TTT GAT CCA AAA TTT GAC GCT ATA GGG CTA ACG 1022 Leu Ser Ser Trp His Phe Asp Pro Lys Phe Asp Ala He Gly Leu Thr 305 310 315
AGC GCT TTA CAA GAC TTG GTT AAA AAA GAG CAT GCC CAC ATG CTT TCT 1070 Ser Ala Leu Gin Asp Leu Val Lys Lys Glu His Ala His Met Leu Ser 320 325 330 335
GCT TTT ATT GAA AGC GCC CCA CGG ACC GTT AAA AGC CTA GAT GTC CAA 1118 Ala Phe He Glu Ser Ala Pro Arg Thr Val Lys Ser Leu Asp Val Gin 340 345 350
ATC ACA GGC TTA CCC TAT ATC AAT GAT AAT AAC TTA GTC AAT TCA GGG 1166 He Thr Gly Leu Pro Tyr He Asn Asp Asn Asn Leu Val Asn Ser Gly 355 360 365
GTG ATC CTC CCT AGC TTT CCT ATG GCG ATT GGC AAT ATC ACA AAA ACT 1214 Val He Leu Pro Ser Phe Pro Met Ala He Gly Asn He Thr Lys Thr 370 375 380
CTG GGT TAT AAA AAC AAC TTT TAT TAT GGG GGT AGC GGG ATT TGG AAC 1262 Leu Gly Tyr Lys Asn Asn Phe Tyr Tyr Gly Gly Ser Gly He Trp Asn 385 390 395
AAA CTC ACT AGT TTC ACC AAA AAA CAA GGT TTT CAC GCC CTT TAT TTC 1310 Lys Leu Thr Ser Phe Thr Lys Lys Gin Gly Phe His Ala Leu Tyr Phe 400 405 410 415
AAT AAC CAT CTC TTA GAA TTT GCC CAA AAC AAG CCC TAC CCT AAA CCC 1358 Asn Asn His Leu Leu Glu Phe Ala Gin Asn Lys Pro Tyr Pro Lys Pro 420 425 430
ATA GAG AGC AAC TGG GGA GTG CAT GAT AAT ATT TTA TTT GAC TAT ATT 1406 He Glu Ser Asn Trp Gly Val His Asp Asn He Leu Phe Asp Tyr He 435 440 445
TTA GAA AAC ACC AAC CCC CAT GAA AAA ACT TTC AGC ATG GTC ATG ACT 1454 Leu Glu Asn Thr Asn Pro His Glu Lys Thr Phe Ser Met Val Met Thr 450 455 460 TTA AGC AAC CAT GCG ATC AAA AAC GTG AAT CTC AAA GCC TTT GGC GTG 1502 Leu Ser Asn His Ala He Lys Asn Val Asn Leu Lys Ala Phe Gly Val 465 470 475
CCT TTA GAA AAA ATC CAA CAA TTT GTG GAA AAA ACC CCC AAA TCA GAA 1550 Pro Leu Glu Lys He Gin Gin Phe Val Glu Lys Thr Pro Lys Ser Glu 480 485 490 495
AAT TTA CCG GAC GCT AAT TCT TTA GGG CAT ATT TAC TGG TAT GAC AAA 1598 Asn Leu Pro Asp Ala Asn Ser Leu Gly His He Tyr Trp Tyr Asp Lys 500 505 510
GTA ATC GTC AGT TTC ATC AAA AAA GCC AGC CAA AAA TTC CCT AAC TCG 1646 Val He Val Ser Phe He Lys Lys Ala Ser Gin Lys Phe Pro Asn Ser 515 520 525
CTT TTT ATC ATC ACA GGG GAT CAT TTT GAC AGG AGC TAT GAA TAC GCT 1694 Leu Phe He He Thr Gly Asp His Phe Asp Arg Ser Tyr Glu Tyr Ala 530 535 540
AAA AAC GAT TTG TAT ATC ATT AAA TCC GTG CCG CTT ATT TTA TAT GCC 1742 Lys Asn Asp Leu Tyr He He Lys Ser Val Pro Leu He Leu Tyr Ala 545 550 555
CCT ACT TTA AAG CCT AAA AAA ATC AGT CAG GTC GGA TCG CAT TTA GAC 1790 Pro Thr Leu Lys Pro Lys Lys He Ser Gin Val Gly Ser His Leu Asp 560 565 570 575
ATC GCC CCT ACG ATT ATT GAA TTA GTC GCC CCT AAA GGC TTT CAA TTC 1838 He Ala Pro Thr He He Glu Leu Val Ala Pro Lys Gly Phe Gin Phe 580 585 590
GTG AGT TTT GGG AAG CCT TTA TTT TCT AAC AAT ACA ACA AAC CCC CCA 1886 Val Ser Phe Gly Lys Pro Leu Phe Ser Asn Asn Thr Thr Asn Pro Pro 595 600 605
AGC CAC CCC AAT TAC GCG CTA GGC TAT GAA GCG ATC GCT ACC AAA GAT 1934 Ser His Pro Asn Tyr Ala Leu Gly Tyr Glu Ala He Ala Thr Lys Asp 610 615 620
TAT TTT TAT AAC CCA AGT TTG GGG TTA AGG TAT TTG AAC GAA AGC CCT 1982 Tyr Phe Tyr Asn Pro Ser Leu Gly Leu Arg Tyr Leu Asn Glu Ser Pro 625 630 635
AAA GAG CCA AAG GAT AAA CAA AAC GAC AAA ATA GAA GCT TCT AAG TTT 2030 Lys Glu Pro Lys Asp Lys Gin Asn Asp Lys He Glu Ala Ser Lys Phe 640 645 650 655
TAT CAG CAA TTA GAA TCT TTG AAA GCC CTT AGT TAT TAC TTG CTC TAT 2078 Tyr Gin Gin Leu Glu Ser Leu Lys Ala Leu Ser Tyr Tyr Leu Leu Tyr 660 665 670
CAT GGG GCT AAT CTT AAA GAT TGACAAACTA GGTTTTTATT CCATTAAACG CATG 2133 His Gly Ala Asn Leu Lys Asp 675 ATTTTTGCTA CAATAATAGG ATTTTAATT 2162
(2) INFORMATION FOR SEQ ID NO: 374:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 678 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:374:
Met Lys Ser Leu Ser Asn Ala Leu Phe Ser Leu Phe Leu Lys Gly Phe
1 5 10 15
Tyr Phe Thr Phe Phe Met Ser Leu Leu Phe Val Phe Asn Arg He Gly
20 25 30
Phe He Leu Tyr Thr Gly Tyr Tyr Lys His Ala Leu Lys Asn Pro Val
35 40 45
Phe Asp Glu He He Lys Thr Leu Phe Asn Gly Ala Arg Tyr Asp Asn
50 55 60
Arg Val Val Ser Ser Leu Ala He Leu Phe He He He Gly Leu Leu 65 70 75 80
Gly Leu Phe He Pro Lys His Gin Thr Lys Met Leu Asn He Val Ala
85 90 95
Tyr Phe Ser He Ala He He Leu Phe Leu Asn He Ala Asn He Val
100 105 110
Tyr Tyr Gly He Tyr Gly Asn Val Phe Asp Glu Asn Leu Leu Glu Phe
115 120 125
Leu His Glu Asp Thr Leu Thr He Leu Lys Met Ser Gly Glu Tyr Pro
130 135 140
He Phe Ser Ser Phe Ser Leu Phe Leu He Leu Ser Val Leu Thr Ser 145 150 155 160
Phe He Tyr Phe Lys Leu Gin Asn Asp Leu Phe Lys Pro Lys Asn Ala
165 170 175
Tyr Gin Ala Ala His Thr Lys Pro Leu Lys Thr Phe He Leu Phe Ala
180 185 190
Leu Phe Ser Leu Thr Gin Met Phe Tyr He Asn Ala Gin Leu Ser Phe
195 200 205
Val Gly Ala Ser Leu Asp Leu Ser He Glu Pro Ala Lys Asp Pro Phe
210 215 220
Leu Met Lys He Thr Pro Gly Ala Phe Arg Asn Leu Tyr Leu Leu Ala 225 230 235 240
Arg Asn Tyr Arg Gin Ser His Asn Leu Lys Phe Ser Asp Phe Ala Lys
245 250 255
Glu Thr Pro Leu Glu Val Ala Lys Asn Tyr Phe His Leu Lys Glu Asn
260 265 270
Pro Ser Asn Asn Leu Tyr Glu Leu Leu Thr Gin Thr Ser Arg Asn Asn
275 280 285
Ser Asn Gin Thr He Gin His Val Phe Tyr He Val Ser Glu Ser Leu
290 295 300
Ser Ser Trp His Phe Asp Pro Lys Phe Asp Ala He Gly Leu Thr Ser 305 310 315 320 Ala Leu Gin Asp Leu Val Lys Lys Glu His Ala His Met Leu Ser Ala
325 330 335
Phe He Glu Ser Ala Pro Arg Thr Val Lys Ser Leu Asp Val Gin He
340 345 350
Thr Gly Leu Pro Tyr He Asn Asp Asn Asn Leu Val Asn Ser Gly Val
355 360 365
He Leu Pro Ser Phe Pro Met Ala He Gly Asn He Thr Lys Thr Leu
370 375 380
Gly Tyr Lys Asn Asn Phe Tyr Tyr Gly Gly Ser Gly He Trp Asn Lys 385 390 395 400
Leu Thr Ser Phe Thr Lys Lys Gin Gly Phe His Ala Leu Tyr Phe Asn
405 410 415
Asn His Leu Leu Glu Phe Ala Gin Asn Lys Pro Tyr Pro Lys Pro He
420 425 430
Glu Ser Asn Trp Gly Val His Asp Asn He Leu Phe Asp Tyr He Leu
435 440 445
Glu Asn Thr Asn Pro His Glu Lys Thr Phe Ser Met Val Met Thr Leu
450 455 460
Ser Asn His Ala He Lys Asn Val Asn Leu Lys Ala Phe Gly Val Pro 465 470 475 480
Leu Glu Lys He Gin Gin Phe Val Glu Lys Thr Pro Lys Ser Glu Asn
485 490 495
Leu Pro Asp Ala Asn Ser Leu Gly His He Tyr Trp Tyr Asp Lys Val
500 505 510
He Val Ser Phe He Lys Lys Ala Ser Gin Lys Phe Pro Asn Ser Leu
515 520 525
Phe He He Thr Gly Asp His Phe Asp Arg Ser Tyr Glu Tyr Ala Lys
530 535 540
Asn Asp Leu Tyr He He Lys Ser Val Pro Leu He Leu Tyr Ala Pro 545 550 555 560
Thr Leu Lys Pro Lys Lys He Ser Gin Val Gly Ser His Leu Asp He
565 570 575
Ala Pro Thr He He Glu Leu Val Ala Pro Lys Gly Phe Gin Phe Val
580 585 590
Ser Phe Gly Lys Pro Leu Phe Ser Asn Asn Thr Thr Asn Pro Pro Ser
595 600 605
His Pro Asn Tyr Ala Leu Gly Tyr Glu Ala He Ala Thr Lys Asp Tyr
610 615 620
Phe Tyr Asn Pro Ser Leu Gly Leu Arg Tyr Leu Asn Glu Ser Pro Lys 625 630 635 640
Glu Pro Lys Asp Lys Gin Asn Asp Lys He Glu Ala Ser Lys Phe Tyr
645 650 655
Gin Gin Leu Glu Ser Leu Lys Ala Leu Ser Tyr Tyr Leu Leu Tyr His
660 665 670
Gly Ala Asn Leu Lys Asp 675
(2) INFORMATION FOR SEQ ID NO: 375:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1573 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1520 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375:
TAGCCCACAG CTCTAGCGTT TTGCCTAACA TGAATCTATA AGGGGGATGC ATG GAT 56
Met Asp 1
AAA GAA ACC CGA TTT TAC AAC CTT TTT TCT TTG GCA ATT TTA GGG ATT 104 Lys Glu Thr Arg Phe Tyr Asn Leu Phe Ser Leu Ala He Leu Gly He 5 10 15
TTG ATC TTT CCT GTG GGT TTG GCG AAT TTT TAT TTT GGC TAT GTT TTG 152 Leu He Phe Pro Val Gly Leu Ala Asn Phe Tyr Phe Gly Tyr Val Leu 20 25 30
AAA GAT TCG CCT TGT ATT TTT TGC TGG GCG CAA CGC ATC AAC ATG ATT 200 Lys Asp Ser Pro Cys He Phe Cys Trp Ala Gin Arg He Asn Met He 35 40 45 50
TTA ATA GGG GCT GTG GCG CTT TTG GTG GTG CGT TTT GGG TTT AAG CCT 248 Leu He Gly Ala Val Ala Leu Leu Val Val Arg Phe Gly Phe Lys Pro 55 60 65
AAA TAC ATT GCC TTG CTG TTG CTT ATG GCT AGT AGC GGG TTA TAT GAG 296 Lys Tyr He Ala Leu Leu Leu Leu Met Ala Ser Ser Gly Leu Tyr Glu 70 75 80
AGC TTT TAT CAT ACC GGT AGC CAT GCT TTA GAA GAT GTG GGG CAG GGA 344 Ser Phe Tyr His Thr Gly Ser His Ala Leu Glu Asp Val Gly Gin Gly 85 90 95
TTC GCG CTC GCT ATT TTG GGC TTG CAC ACG CAG TTT TGG GCG CTT TTT 392 Phe Ala Leu Ala He Leu Gly Leu His Thr Gin Phe Trp Ala Leu Phe 100 105 110
GTC TTT TTT AGC GTG GTG GTG CTT TTA GCG GTT TTG CTC TTT TTT GCC 440 Val Phe Phe Ser Val Val Val Leu Leu Ala Val Leu Leu Phe Phe Ala 115 120 125 130
CCT AAT GCC CAA CCT TTC AAA GAT CAT TCG TTA AAC GCG CTC CAA AAA 488 Pro Asn Ala Gin Pro Phe Lys Asp His Ser Leu Asn Ala Leu Gin Lys 135 140 145
ATC GCT TTT TAT GTT TTC TTT ATG GTG GTT GGT TCT AAC GCC GTG CAA 536 He Ala Phe Tyr Val Phe Phe Met Val Val Gly Ser Asn Ala Val Gin 150 155 160 GCG TTT ATT TCT ACC GGG CCT TTC CCT TAC ATA GGG CAA AGC GAT CCG 584 Ala Phe He Ser Thr Gly Pro Phe Pro Tyr He Gly Gin Ser Asp Pro 165 170 175
GTG CGT TTT TCG TGG AAT TTG AAA GAA TCG GTC TGG TCT ATG GAG AAT 632 Val Arg Phe Ser Trp Asn Leu Lys Glu Ser Val Trp Ser Met Glu Asn 180 185 190
TGG GAT CAT TTG AAA TTC CCA AGA AGC GTT TTG GGC AGA AGG GAT GTG 680 Trp Asp His Leu Lys Phe Pro Arg Ser Val Leu Gly Arg Arg Asp Val 195 200 205 210
GGC GAG CCT TTG AAA TTG AGC GCT TTG CCT AAA GAT AAC GAT TAT GAG 728 Gly Glu Pro Leu Lys Leu Ser Ala Leu Pro Lys Asp Asn Asp Tyr Glu 215 220 225
CGT TCG CCT TTA GAA ATT ACA AAA ACT CTA AAG ATT GGA AAA AAA GAA 776 Arg Ser Pro Leu Glu He Thr Lys Thr Leu Lys He Gly Lys Lys Glu 230 235 240
GAG CTT TTT TTA AAA TTG AAT GGA GCG ATC ACG GAT TTG AGT TTC AAT 824 Glu Leu Phe Leu Lys Leu Asn Gly Ala He Thr Asp Leu Ser Phe Asn 245 250 255
GAA GAC AAG GCG ATT CTT ACC ACA GAA AAC CAA GGG CTT TAT CTT GTA 872 Glu Asp Lys Ala He Leu Thr Thr Glu Asn Gin Gly Leu Tyr Leu Val 260 265 270
AGT AAC GAT TTG AAA ACC ATT CAT AGC CAT ATG GTG TTG GAT AGC TAT 920 Ser Asn Asp Leu Lys Thr He His Ser His Met Val Leu Asp Ser Tyr 275 280 285 290
TAT AGC GCG ACG GTG GGG TCG TTC GTG GGG GCG GAT TTC AAC GAA GAT 968 Tyr Ser Ala Thr Val Gly Ser Phe Val Gly Ala Asp Phe Asn Glu Asp 295 300 305
GAA AAC ATT GTG ATC ATG GGC AAT AAT AAA ACG AGC GTG GAA ATC ACT 1016 Glu Asn He Val He Met Gly Asn Asn Lys Thr Ser Val Glu He Thr 310 315 320
CCT AAC AAA AAC GCT AAC ATG CTT AAA AAC TTC CCT TAT TTT TTA GAA 1064 Pro Asn Lys Asn Ala Asn Met Leu Lys Asn Phe Pro Tyr Phe Leu Glu 325 330 335
GGG GTC AAC TCT TTT GAC GAA GTG GAA CGC AGC CGC TTG AAA ACT TCT 1112 Gly Val Asn Ser Phe Asp Glu Val Glu Arg Ser Arg Leu Lys Thr Ser 340 345 350
AGG GCG AAA AAC TAT TAT GTT AGC GTT GCA AGA AGA GGG GCT AAA TTC 1160 Arg Ala Lys Asn Tyr Tyr Val Ser Val Ala Arg Arg Gly Ala Lys Phe 355 360 365 370
ACT TAT TTG ATC AGC GCT CCT AAC AAG CGT TAT AAG GAT TTG ATT ATT 1208 Thr Tyr Leu He Ser Ala Pro Asn Lys Arg Tyr Lys Asp Leu He He 375 380 385 ATT TCC ATG CGT AAT AGC GAT AAA CAG GTG CAT GGG GAG TTT TTA CTG 1256 He Ser Met Arg Asn Ser Asp Lys Gin Val His Gly Glu Phe Leu Leu 390 395 400
GAA TTA GGC AAT GCC AAA CTT AAA GAA AAA AGG GGA TTG GGC GAG TTA 1304 Glu Leu Gly Asn Ala Lys Leu Lys Glu Lys Arg Gly Leu Gly Glu Leu 405 410 415
GTC ATT AGC ACT TTG GCT TTA AAG GAT AAT AAA CTT TAT GCG TTC AGT 1352 Val He Ser Thr Leu Ala Leu Lys Asp Asn Lys Leu Tyr Ala Phe Ser 420 425 430
AAG GAA TTT AAC ACG CTT TTA GTC ATA GAC CCT ACA AAA GAA GAG ATT 1400 Lys Glu Phe Asn Thr Leu Leu Val He Asp Pro Thr Lys Glu Glu He 435 440 445 450
CTT GAA GTT TAT GGC TTG CCT AAA GAG ATT AAA AAT ATC AGT GCT GGA 1448 Leu Glu Val Tyr Gly Leu Pro Lys Glu He Lys Asn He Ser Ala Gly 455 460 465
GGG TTT AGA AAC GAT GAG CTT GTC CTT GTG AGC TAT GAG AAT AAT AAA 1496 Gly Phe Arg Asn Asp Glu Leu Val Leu Val Ser Tyr Glu Asn Asn Lys 470 475 480
AAT ATT CTC TAC ACC CTT AAT TTT TAAACTCTTT TAAAGCTACT TTTTTCTAAT 1550 Asn He Leu Tyr Thr Leu Asn Phe 485 490
ATATTAACGC ATTAGAAGAT GGT 1573
(2) INFORMATION FOR SEQ ID NO: 376:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 490 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:376:
Met Asp Lys Glu Thr Arg Phe Tyr Asn Leu Phe Ser Leu Ala He Leu
1 5 10 15
Gly He Leu He Phe Pro Val Gly Leu Ala Asn Phe Tyr Phe Gly Tyr
20 25 30
Val Leu Lys Asp Ser Pro Cys He Phe Cys Trp Ala Gin Arg He Asn
35 40 45
Met He Leu He Gly Ala Val Ala Leu Leu Val Val Arg Phe Gly Phe
50 55 60
Lys Pro Lys Tyr He Ala Leu Leu Leu Leu Met Ala Ser Ser Gly Leu 65 70 75 80
Tyr Glu Ser Phe Tyr His Thr Gly Ser His Ala Leu Glu Asp Val Gly 85 90 95 Gin Gly Phe Ala Leu Ala He Leu Gly Leu His Thr Gin Phe Trp Ala
100 105 110
Leu Phe Val Phe Phe Ser Val Val Val Leu Leu Ala Val Leu Leu Phe
115 120 125
Phe Ala Pro Asn Ala Gin Pro Phe Lys Asp His Ser Leu Asn Ala Leu
130 135 140
Gin Lys He Ala Phe Tyr Val Phe Phe Met Val Val Gly Ser Asn Ala 145 150 155 160
Val Gin Ala Phe He Ser Thr Gly Pro Phe Pro Tyr He Gly Gin Ser
165 170 175
Asp Pro Val Arg Phe Ser Trp Asn Leu Lys Glu Ser Val Trp Ser Met
180 185 190
Glu Asn Trp Asp His Leu Lys Phe Pro Arg Ser Val Leu Gly Arg Arg
195 200 205
Asp Val Gly Glu Pro Leu Lys Leu Ser Ala Leu Pro Lys Asp Asn Asp
210 215 220
Tyr Glu Arg Ser Pro Leu Glu He Thr Lys Thr Leu Lys He Gly Lys 225 230 235 240
Lys Glu Glu Leu Phe Leu Lys Leu Asn Gly Ala He Thr Asp Leu Ser
245 250 255
Phe Asn Glu Asp Lys Ala He Leu Thr Thr Glu Asn Gin Gly Leu Tyr
260 265 270
Leu Val Ser Asn Asp Leu Lys Thr He His Ser His Met Val Leu Asp
275 280 285
Ser Tyr Tyr Ser Ala Thr Val Gly Ser Phe Val Gly Ala Asp Phe Asn
290 295 300
Glu Asp Glu Asn He Val He Met Gly Asn Asn Lys Thr Ser Val Glu 305 310 315 320
He Thr Pro Asn Lys Asn Ala Asn Met Leu Lys Asn Phe Pro Tyr Phe
325 330 335
Leu Glu Gly Val Asn Ser Phe Asp Glu Val Glu Arg Ser Arg Leu Lys
340 345 350
Thr Ser Arg Ala Lys Asn Tyr Tyr Val Ser Val Ala Arg Arg Gly Ala
355 360 365
Lys Phe Thr Tyr Leu He Ser Ala Pro Asn Lys Arg Tyr Lys Asp Leu
370 375 380
He He He Ser Met Arg Asn Ser Asp Lys Gin Val His Gly Glu Phe 385 390 395 400
Leu Leu Glu Leu Gly Asn Ala Lys Leu Lys Glu Lys Arg Gly Leu Gly
405 410 415
Glu Leu Val He Ser Thr Leu Ala Leu Lys Asp Asn Lys Leu Tyr Ala
420 425 430
Phe Ser Lys Glu Phe Asn Thr Leu Leu Val He Asp Pro Thr Lys Glu
435 440 445
Glu He Leu Glu Val Tyr Gly Leu Pro Lys Glu He Lys Asn He Ser
450 455 460
Ala Gly Gly Phe Arg Asn Asp Glu Leu Val Leu Val Ser Tyr Glu Asn 465 470 475 480
Asn Lys Asn He Leu Tyr Thr Leu Asn Phe 485 490
(2) INFORMATION FOR SEQ ID NO: 377:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 679 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...626 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377:
TCTTAAAGAT TTATGTTACA CTCTGTGAAA TCAAAATCAA AGGGGATAGC GTG TTA 56
Val Leu
1
GAA AAA TCT TTT TTA AAA AGC AAG CAA TTA TTT TTA TGC GGA CTG GGT 104 Glu Lys Ser Phe Leu Lys Ser Lys Gin Leu Phe Leu Cys Gly Leu Gly 5 10 15
GTT TTG ATG CTG CAG GCT TGC ACT TGC CCA AAC ACT TCA CAA AGG AAT 152 Val Leu Met Leu Gin Ala Cys Thr Cys Pro Asn Thr Ser Gin Arg Asn 20 25 30
TCT TTC TTG CAA GAT GTG CCT TAT TGG ATG TTG CAA AAT CGC AGT GAG 200 Ser Phe Leu Gin Asp Val Pro Tyr Trp Met Leu Gin Asn Arg Ser Glu 35 40 45 50
TAT ATC ACG CAA GGG GTG GAT AGC TCG CAC ATT GTA GAT GGT AAG AAA 248 Tyr He Thr Gin Gly Val Asp Ser Ser His He Val Asp Gly Lys Lys 55 60 65
ACT GAA GAG ATA GAA AAA ATC GCT ACC AAA AGA GCG ACA ATA AGA GTG 296 Thr Glu Glu He Glu Lys He Ala Thr Lys Arg Ala Thr He Arg Val 70 75 80
GCA CAA AAT ATT GTG CAT AAA CTT AAA GAA GCT TAC CTT TCC AAA ACC 344 Ala Gin Asn He Val His Lys Leu Lys Glu Ala Tyr Leu Ser Lys Thr 85 90 95
AAT CGC ATC AAG CAA AAG ATC ACT AAT GAA ATG TTT ATC CAA ATG ACA 392 Asn Arg He Lys Gin Lys He Thr Asn Glu Met Phe He Gin Met Thr 100 105 110
CAG CCC ATT TAT GAC AGC TTG ATG AAT GTG GAT CGT TTA GGG ATT TAT 440 Gin Pro He Tyr Asp Ser Leu Met Asn Val Asp Arg Leu Gly He Tyr 115 120 125 130
ATC AAT CCT AAC AAT GAG GAA GTG TTT GCG TTA GTG CGC GCG CGT GGT 488 He Asn Pro Asn Asn Glu Glu Val Phe Ala Leu Val Arg Ala Arg Gly 135 140 145 TTT GAT AAG GAC GCT TTG AGC GAA GGG TTG CAT AAA ATG TCC TTA GAC 536 Phe Asp Lys Asp Ala Leu Ser Glu Gly Leu His Lys Met Ser Leu Asp 150 155 160
AAT CAA GCG GTG AGT ATC CTT GTG GCT AAA GTG GAA GAA ATC TTT AAA 584 Asn Gin Ala Val Ser He Leu Val Ala Lys Val Glu Glu He Phe Lys 165 170 175
GAT TCT GTC AAT TAC GGA GAT GTT AAA GTC CCT ATA GCC ATG TAGGCTTAG 635 Asp Ser Val Asn Tyr Gly Asp Val Lys Val Pro He Ala Met 180 185 190
AACAACAAGC GTTCCTCGCT ATCGTCTGTT CTTTTGGGGG TGGG 679
(2) INFORMATION FOR SEQ ID NO: 378:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 192 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378:
Val Leu Glu Lys Ser Phe Leu Lys Ser Lys Gin Leu Phe Leu Cys Gly
1 5 10 15
Leu Gly Val Leu Met Leu Gin Ala Cys Thr Cys Pro Asn Thr Ser Gin
20 25 30
Arg Asn Ser Phe Leu Gin Asp Val Pro Tyr Trp Met Leu Gin Asn Arg
35 40 45
Ser Glu Tyr He Thr Gin Gly Val Asp Ser Ser His He Val Asp Gly
50 55 60
Lys Lys Thr Glu Glu He Glu Lys He Ala Thr Lys Arg Ala Thr He 65 70 75 80
Arg Val Ala Gin Asn He Val His Lys Leu Lys Glu Ala Tyr Leu Ser
85 90 95
Lys Thr Asn Arg He Lys Gin Lys He Thr Asn Glu Met Phe He Gin
100 105 110
Met Thr Gin Pro He Tyr Asp Ser Leu Met Asn Val Asp Arg Leu Gly
115 120 125
He Tyr He Asn Pro Asn Asn Glu Glu Val Phe Ala Leu Val Arg Ala
130 135 140
Arg Gly Phe Asp Lys Asp Ala Leu Ser Glu Gly Leu His Lys Met Ser 145 150 155 160
Leu Asp Asn Gin Ala Val Ser He Leu Val Ala Lys Val Glu Glu He
165 170 175
Phe Lys Asp Ser Val Asn Tyr Gly Asp Val Lys Val Pro He Ala Met 180 185 190
(2) INFORMATION FOR SEQ ID NO: 379:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 2386 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...2333 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379:
TGGGGCTTTT TGGCGATGGG TTTATCCAAA CGAAATAATA GGATATTTTG ATG TAT 56
Met Tyr 1
AAA ACA GCG ATT AAT CGT CCT ATT ACG ACC TTA ATG TTT GCT TTG GCG 104 Lys Thr Ala He Asn Arg Pro He Thr Thr Leu Met Phe Ala Leu Ala 5 10 15
ATT GTC TTT TTT GGG ACT ATG GGG TTT AAA AAA TTG AGC GTG GCG CTT 152 He Val Phe Phe Gly Thr Met Gly Phe Lys Lys Leu Ser Val Ala Leu 20 25 30
TTC CCT AAA ATT GAT TTG CCT ACG GTG GTG GTT ACT ACG ACT TAT CCT 200 Phe Pro Lys He Asp Leu Pro Thr Val Val Val Thr Thr Thr Tyr Pro 35 40 45 50
GGG GCT AGC GCT GAA ATC ATA GAG AGT AAG GTA ACC GAT AAG ATT GAA 248 Gly Ala Ser Ala Glu He He Glu Ser Lys Val Thr Asp Lys He Glu 55 60 65
GAA GCG GTG ATG GGG ATT GAT GGG ATC AAA AAG GTT ACT TCC ACG AGT 296 Glu Ala Val Met Gly He Asp Gly He Lys Lys Val Thr Ser Thr Ser 70 75 80
TCT AAA AAT GTG AGT ATC GTC GTC ATT GAA TTT GAG TTA GAA AAA CCT 344 Ser Lys Asn Val Ser He Val Val He Glu Phe Glu Leu Glu Lys Pro 85 90 95
AAT GAA GAA GCC TTA AAC GAT GTG GTG AAT AAA ATT TCT TCG GTG CGT 392 Asn Glu Glu Ala Leu Asn Asp Val Val Asn Lys He Ser Ser Val Arg 100 105 110
TTT GAT GAC TCT AAC ATT AAA AAA CCC TCT ATC AAT AAA TTT GAT ACC 440 Phe Asp Asp Ser Asn He Lys Lys Pro Ser He Asn Lys Phe Asp Thr 115 120 125 130
GAC AGC CAA GCC ATT ATT TCA TTG TTT GTG AGC AGT TCA AGC GTG CCG 488 Asp Ser Gin Ala He He Ser Leu Phe Val Ser Ser Ser Ser Val Pro 135 140 145
GCT ACA ACC CTT AAT GAC TAC GCT AAA AAC ACC ATC AAA CCC ATG CTC 536 Ala Thr Thr Leu Asn Asp Tyr Ala Lys Asn Thr He Lys Pro Met Leu 150 155 160
CAA AAA ATC AAT GGG GTA GGG GGC GTG CAG CTC AAC GGC TTT AGG GAG 584 Gin Lys He Asn Gly Val Gly Gly Val Gin Leu Asn Gly Phe Arg Glu 165 170 175
CGC CAG ATT AGG ATT TAT GCA AAT CCC ACT TTG ATG AAT AAA TAC AAC 632 Arg Gin He Arg He Tyr Ala Asn Pro Thr Leu Met Asn Lys Tyr Asn 180 185 190
CTG ACT TAT GCG GAT CTT TTC AGC ACG CTT AAA GCG GAG AAT GTG GAA 680 Leu Thr Tyr Ala Asp Leu Phe Ser Thr Leu Lys Ala Glu Asn Val Glu 195 200 205 210
ATT GAT GGG GGG CGC ATT GTC AAT AGC CAA AGG GAA TTT TCT ATT TTA 728 He Asp Gly Gly Arg He Val Asn Ser Gin Arg Glu Phe Ser He Leu 215 220 225
ATC AAT GCG AAT AGT TAT AGC GTT GCG GAT GTG GAA AAG ATT CAA GTG 776 He Asn Ala Asn Ser Tyr Ser Val Ala Asp Val Glu Lys He Gin Val 230 235 240
GGT AAT CAT GTG CGT CTT GGC GAT ATT GCA AAA ATT GAA ATC GGT TTG 824 Gly Asn His Val Arg Leu Gly Asp He Ala Lys He Glu He Gly Leu 245 250 255
GAA GAA GAC AAC ACT TTT GCG AGC TTT AAA GAC AAA CCC GGT GTG ATT 872 Glu Glu Asp Asn Thr Phe Ala Ser Phe Lys Asp Lys Pro Gly Val He 260 265 270
TTA GAA ATC CAA AAG ATT GCC GGA GCG AAT GAA ATT GAA ATC GTA GAT 920 Leu Glu He Gin Lys He Ala Gly Ala Asn Glu He Glu He Val Asp 275 280 285 290
AGG GTG TAT GAA GCT TTA AAG CGC ATT CAA GCC ATT AGC CCT AAC TAT 968 Arg Val Tyr Glu Ala Leu Lys Arg He Gin Ala He Ser Pro Asn Tyr 295 300 305
GAA ATC AGA CCC TTT TTA GAC ACC ACG GGC TAT ATC CGC ACC TCT ATT 1016 Glu He Arg Pro Phe Leu Asp Thr Thr Gly Tyr He Arg Thr Ser He 310 315 320
GAA GAC GTG AAA TTT GAT CTA GTT TTA GGG GCG ATT TTA GCG GTT TTA 1064 Glu Asp Val Lys Phe Asp Leu Val Leu Gly Ala He Leu Ala Val Leu 325 330 335
GTG GTG TTT GCG TTC TTG CGT AAC GGC ACG ATC ACC CTC GTT TCA GCG 1112 Val Val Phe Ala Phe Leu Arg Asn Gly Thr He Thr Leu Val Ser Ala 340 345 350
ATC TCT ATC CCT ATT TCT ATC ATG GGG ACT TTT GCG CTC ATC CAA TGG 1160 He Ser He Pro He Ser He Met Gly Thr Phe Ala Leu He Gin Trp 355 360 365 370
ATG GGC TTT TCA TTA AAC ATG CTC ACC ATG GTG GCT TTA ACG TTG GCG 1208 Met Gly Phe Ser Leu Asn Met Leu Thr Met Val Ala Leu Thr Leu Ala 375 380 385
ATA GGG ATT ATC ATT GAT GAT GCG ATC GTG GTG ATT GAA AAC ATC CAT 1256 He Gly He He He Asp Asp Ala He Val Val He Glu Asn He His 390 395 400
AAA AAG CTA GAA ATG GGC ATG AGT AAA CGA AAA GCG AGC TAT GAG GGG 1304 Lys Lys Leu Glu Met Gly Met Ser Lys Arg Lys Ala Ser Tyr Glu Gly 405 410 415
GTG AGA GAA ATT GGC TTT GCT CTA GTG GCG ATT TCA GCG ATG CTG CTC 1352 Val Arg Glu He Gly Phe Ala Leu Val Ala He Ser Ala Met Leu Leu 420 425 430
TCT GTT TTT GTG CCT ATA GGG AAC ATG AAA GGC ATT ATT GGG CGT TTT 1400 Ser Val Phe Val Pro He Gly Asn Met Lys Gly He He Gly Arg Phe 435 440 445 450
TTT CAA AGT TTT GGG ATC ACG GTG GCT TTA GCG ATC GCT CTA TCG TAT 1448 Phe Gin Ser Phe Gly He Thr Val Ala Leu Ala He Ala Leu Ser Tyr 455 460 465
GTG GTG GTC GTT ACG ATT ATC CCC ATG GTA AGC TCA GTC GTG GTC AAT 1496 Val Val Val Val Thr He He Pro Met Val Ser Ser Val Val Val Asn 470 475 480
CCC AGG CAT TCT CGT TTT TAT GTG TGG AGT GAG CCT TTT TTT AAG GCT 1544 Pro Arg His Ser Arg Phe Tyr Val Trp Ser Glu Pro Phe Phe Lys Ala 485 490 495
TTA GAG TCT CGT TAT ACC AAG TTG CTC CAA TGG GTA TTA AAC CAC AAG 1592 Leu Glu Ser Arg Tyr Thr Lys Leu Leu Gin Trp Val Leu Asn His Lys 500 505 510
ATC ATT ATC TCT ATA GCG GTG GTT TTG GTG TTT GTG GGA TCG CTT TTT 1640 He He He Ser He Ala Val Val Leu Val Phe Val Gly Ser Leu Phe 515 520 525 530
GTG GCT TCT AAG ATT GGT ATG GAG TTC ATG CTG AAA GAA GAT AGG GGG 1688 Val Ala Ser Lys He Gly Met Glu Phe Met Leu Lys Glu Asp Arg Gly 535 540 545
AGG TTT TTG GTG TGG CTT AAG GCT AAA CCG GGC GTG AGC ATA GAT TAC 1736 Arg Phe Leu Val Trp Leu Lys Ala Lys Pro Gly Val Ser He Asp Tyr 550 555 560
ATG ACA CAA AAG AGT AAG ATC TTT CAA AAA GCG ATT GAA AAA CAT GCT 1784 Met Thr Gin Lys Ser Lys He Phe Gin Lys Ala He Glu Lys His Ala 565 570 575 GAA GTG GAA TTC ACC ACC TTG CAA GTG GGT TAT GGC ACC ACA CAA AAC 1832 Glu Val Glu Phe Thr Thr Leu Gin Val Gly Tyr Gly Thr Thr Gin Asn 580 585 590
CCT TTT AAG GCT AAG ATT TTT GTG CAA CTC AAG CCT TTA AAA GAG CGT 1880 Pro Phe Lys Ala Lys He Phe Val Gin Leu Lys Pro Leu Lys Glu Arg 595 600 605 610
AAA AAA GAG CAT CAA TTG GGG CAA TTT GAG TTG ATG AGC GTT TTA AGG 1928 Lys Lys Glu His Gin Leu Gly Gin Phe Glu Leu Met Ser Val Leu Arg 615 620 625
AAA GAG TTG AGA AGC TTG CCT GAA GCT AAA GGT TTA GAT ACT ATT AAT 1976 Lys Glu Leu Arg Ser Leu Pro Glu Ala Lys Gly Leu Asp Thr He Asn 630 635 640
CTT TCT GAA GTT ACT CTT ATA GGG GGC GGT GGG GAT AGT TCG CCC TTC 2024 Leu Ser Glu Val Thr Leu He Gly Gly Gly Gly Asp Ser Ser Pro Phe 645 650 655
CAA ACC TTT GTG TTT TCC CAT TCT CAA GAA GCG GTG GAT AAA AGC GTG 2072 Gin Thr Phe Val Phe Ser His Ser Gin Glu Ala Val Asp Lys Ser Val 660 665 670
GAG AAT TTG AAA AAA TTC TTA TTA GAA AGC CCT GAA TTA AAA GGC AAG 2120 Glu Asn Leu Lys Lys Phe Leu Leu Glu Ser Pro Glu Leu Lys Gly Lys 675 680 685 690
GTT GAA AGC TAT CAT ACA AGC ACG AGC GAA TCG CAA CCG CAA TTG CAA 2168 Val Glu Ser Tyr His Thr Ser Thr Ser Glu Ser Gin Pro Gin Leu Gin 695 700 705
CTC AAA ATC TTA AGA CAA AAC GCT AAC AAA TAC GGC GTG AGC GCT CAA 2216 Leu Lys He Leu Arg Gin Asn Ala Asn Lys Tyr Gly Val Ser Ala Gin 710 715 720
ACC ATT GGA TCA GTG GTG AGC TCT GCT TTT TCT GGG ACT TCT CAA GCG 2264 Thr He Gly Ser Val Val Ser Ser Ala Phe Ser Gly Thr Ser Gin Ala 725 730 735
AGC GTG TTC AAA GAA GAT GGC AAA GAA TAC GAC ATG ATC TTA GAG TGC 2312 Ser Val Phe Lys Glu Asp Gly Lys Glu Tyr Asp Met He Leu Glu Cys 740 745 750
CTG ATG ACA AGC GCG TTT CTG TAGAAGACAT CAAACGCTTG CAAGTGCGTA ACAA 2367 Leu Met Thr Ser Ala Phe Leu 755 760
ATACGATAAA TTGATGTTT 2386
(2) INFORMATION FOR SEQ ID NO: 380:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 761 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380:
Met Tyr Lys Thr Ala He Asn Arg Pro He Thr Thr Leu Met Phe Ala
1 5 10 15
Leu Ala He Val Phe Phe Gly Thr Met Gly Phe Lys Lys Leu Ser Val
20 25 30
Ala Leu Phe Pro Lys He Asp Leu Pro Thr Val Val Val Thr Thr Thr
35 40 45
Tyr Pro Gly Ala Ser Ala Glu He He Glu Ser Lys Val Thr Asp Lys
50 55 60
He Glu Glu Ala Val Met Gly He Asp Gly He Lys Lys Val Thr Ser 65 70 75 80
Thr Ser Ser Lys Asn Val Ser He Val Val He Glu Phe Glu Leu Glu
85 90 95
Lys Pro Asn Glu Glu Ala Leu Asn Asp Val Val Asn Lys He Ser Ser
100 105 110
Val Arg Phe Asp Asp Ser Asn He Lys Lys Pro Ser He Asn Lys Phe
115 120 125
Asp Thr Asp Ser Gin Ala He He Ser Leu Phe Val Ser Ser Ser Ser
130 135 140
Val Pro Ala Thr Thr Leu Asn Asp Tyr Ala Lys Asn Thr He Lys Pro 145 150 155 160
Met Leu Gin Lys He Asn Gly Val Gly Gly Val Gin Leu Asn Gly Phe
165 170 175
Arg Glu Arg Gin He Arg He Tyr Ala Asn Pro Thr Leu Met Asn Lys
180 185 190
Tyr Asn Leu Thr Tyr Ala Asp Leu Phe Ser Thr Leu Lys Ala Glu Asn
195 200 205
Val Glu He Asp Gly Gly Arg He Val Asn Ser Gin Arg Glu Phe Ser
210 215 220
He Leu He Asn Ala Asn Ser Tyr Ser Val Ala Asp Val Glu Lys He 225 230 235 240
Gin Val Gly Asn His Val Arg Leu Gly Asp He Ala Lys He Glu He
245 250 255
Gly Leu Glu Glu Asp Asn Thr Phe Ala Ser Phe Lys Asp Lys Pro Gly
260 265 270
Val He Leu Glu He Gin Lys He Ala Gly Ala Asn Glu He Glu He
275 280 285
Val Asp Arg Val Tyr Glu Ala Leu Lys Arg He Gin Ala He Ser Pro
290 295 300
Asn Tyr Glu He Arg Pro Phe Leu Asp Thr Thr Gly Tyr He Arg Thr 305 310 315 320
Ser He Glu Asp Val Lys Phe Asp Leu Val Leu Gly Ala He Leu Ala
325 330 335
Val Leu Val Val Phe Ala Phe Leu Arg Asn Gly Thr He Thr Leu Val
340 345 350
Ser Ala He Ser He Pro He Ser He Met Gly Thr Phe Ala Leu He
355 360 365
Gin Trp Met Gly Phe Ser Leu Asn Met Leu Thr Met Val Ala Leu Thr 370 375 380
Leu Ala He Gly He He He Asp Asp Ala He Val Val He Glu Asn 385 390 395 400
He His Lys Lys Leu Glu Met Gly Met Ser Lys Arg Lys Ala Ser Tyr
405 410 415
Glu Gly Val Arg Glu He Gly Phe Ala Leu Val Ala He Ser Ala Met
420 425 430
Leu Leu Ser Val Phe Val Pro He Gly Asn Met Lys Gly He He Gly
435 440 445
Arg Phe Phe Gin Ser Phe Gly He Thr Val Ala Leu Ala He Ala Leu
450 455 460
Ser Tyr Val Val Val Val Thr He He Pro Met Val Ser Ser Val Val 465 470 475 480
Val Asn Pro Arg His Ser Arg Phe Tyr Val Trp Ser Glu Pro Phe Phe
485 490 495
Lys Ala Leu Glu Ser Arg Tyr Thr Lys Leu Leu Gin Trp Val Leu Asn
500 505 510
His Lys He He He Ser He Ala Val Val Leu Val Phe Val Gly Ser
515 520 525
Leu Phe Val Ala Ser Lys He Gly Met Glu Phe Met Leu Lys Glu Asp
530 535 540
Arg Gly Arg Phe Leu Val Trp Leu Lys Ala Lys Pro Gly Val Ser He 545 550 555 560
Asp Tyr Met Thr Gin Lys Ser Lys He Phe Gin Lys Ala He Glu Lys
565 570 575
His Ala Glu Val Glu Phe Thr Thr Leu Gin Val Gly Tyr Gly Thr Thr
580 585 590
Gin Asn Pro Phe Lys Ala Lys He Phe Val Gin Leu Lys Pro Leu Lys
595 600 605
Glu Arg Lys Lys Glu His Gin Leu Gly Gin Phe Glu Leu Met Ser Val
610 615 620
Leu Arg Lys Glu Leu Arg Ser Leu Pro Glu Ala Lys Gly Leu Asp Thr 625 630 635 640
He Asn Leu Ser Glu Val Thr Leu He Gly Gly Gly Gly Asp Ser Ser
645 650 655
Pro Phe Gin Thr Phe Val Phe Ser His Ser Gin Glu Ala Val Asp Lys
660 665 670
Ser Val Glu Asn Leu Lys Lys Phe Leu Leu Glu Ser Pro Glu Leu Lys
675 680 685
Gly Lys Val Glu Ser Tyr His Thr Ser Thr Ser Glu Ser Gin Pro Gin
690 695 700
Leu Gin Leu Lys He Leu Arg Gin Asn Ala Asn Lys Tyr Gly Val Ser 705 710 715 720
Ala Gin Thr He Gly Ser Val Val Ser Ser Ala Phe Ser Gly Thr Ser
725 730 735
Gin Ala Ser Val Phe Lys Glu Asp Gly Lys Glu Tyr Asp Met He Leu
740 745 750
Glu Cys Leu Met Thr Ser Ala Phe Leu 755 760
(2) INFORMATION FOR SEQ ID NO: 381:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6025 base pairs
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...5972 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381:
TTACACGCTT TTAAGCGGAA ATAGTATCAA ATACAATAAC CAAGCTTTAG CGG GAC 56
Arg Asp 1
AAT GCT TTT TCA AAA AAT TTA TGG AAT TTA ATC CAT TAT GGT GGC GAA 104 Asn Ala Phe Ser Lys Asn Leu Trp Asn Leu He His Tyr Gly Gly Glu 5 10 15
CAA GGG ACT CTA TTA AGA GCG GAT AAC AAC ACC TTT TTT GTG CAA TTC 152 Gin Gly Thr Leu Leu Arg Ala Asp Asn Asn Thr Phe Phe Val Gin Phe 20 25 30
ACC CAA AGC AAC GGC CAA AAA TTT GTT TTT GAA GAA ACT TTT AAT CCG 200 Thr Gin Ser Asn Gly Gin Lys Phe Val Phe Glu Glu Thr Phe Asn Pro 35 40 45 50
GGC TCT ATC ACC TAT AAA TAT TTC ACT ATC CAT TCT TCG CTT TTC CAC 248 Gly Ser He Thr Tyr Lys Tyr Phe Thr He His Ser Ser Leu Phe His 55 60 65
ACA GAC GCT GAT TCT AAG GAT ATT TGG AGT CAA GTG AGG AAG CAA TTT 296 Thr Asp Ala Asp Ser Lys Asp He Trp Ser Gin Val Arg Lys Gin Phe 70 75 80
GAT TTC ATT CCA GGA AAA ACC CCT GTG TGT GTT GGC GTG TGC TAT ATC 344 Asp Phe He Pro Gly Lys Thr Pro Val Cys Val Gly Val Cys Tyr He 85 90 95
GCG CCT TAT AAA AAT CAA GAC CTT ATT GGC TCT AGC GCT TTT GCG TGG 392 Ala Pro Tyr Lys Asn Gin Asp Leu He Gly Ser Ser Ala Phe Ala Trp 100 105 110
TCG CTG AAC TTT GGG GCC ACG GTG GTA GGG ACT TTG CTT TTA GGG AGC 440 Ser Leu Asn Phe Gly Ala Thr Val Val Gly Thr Leu Leu Leu Gly Ser 115 120 125 130
GCT CAA GAA AAA GCC AAT AAT AAT GGC GGA TCG ATC TGG TTT GGT AAG 488 Ala Gin Glu Lys Ala Asn Asn Asn Gly Gly Ser He Trp Phe Gly Lys 135 140 145 AAT AAT TTG CTG TAT TTG CAT GGC AAT TTC AAC GCG ACT AAT ATC TTT 536 Asn Asn Leu Leu Tyr Leu His Gly Asn Phe Asn Ala Thr Asn He Phe 150 155 160
TTA ACG AAT AAT TTT AAT GTC GGC AAC CCT AAC GCT GGC GGT GGG GCG 584 Leu Thr Asn Asn Phe Asn Val Gly Asn Pro Asn Ala Gly Gly Gly Ala 165 170 175
ACG ATT AAT TTT AAC GCT GAT GAA ACC TTG AAC GCT GAC GGG TTA AAT 632 Thr He Asn Phe Asn Ala Asp Glu Thr Leu Asn Ala Asp Gly Leu Asn 180 185 190
TAC ACG AAT TTC CAA ACC GTG GCT TTG GGC TTA CAA ACC AGT GCG AGC 680 Tyr Thr Asn Phe Gin Thr Val Ala Leu Gly Leu Gin Thr Ser Ala Ser 195 200 205 210
CAG CAT TCA TGG GCG AAT TTT AAT TCC AAG CTT TCT ATG GAG ATT AAA 728 Gin His Ser Trp Ala Asn Phe Asn Ser Lys Leu Ser Met Glu He Lys 215 220 225
AAT TCT AAC TTT AGG GAT TTC ACA TGG GGA GGC TTT AAT TTT AAT TCA 776 Asn Ser Asn Phe Arg Asp Phe Thr Trp Gly Gly Phe Asn Phe Asn Ser 230 235 240
GGG CGT ATC ACT TTT GAA AAC ACC ACT TTT AGC GGC TGG ACC AAT ATT 824 Gly Arg He Thr Phe Glu Asn Thr Thr Phe Ser Gly Trp Thr Asn He 245 250 255
AAC GGA GCG ACT GAG AGC GGC TCA TCG TAT GTG AAT ATG GTT GCG AAT 872 Asn Gly Ala Thr Glu Ser Gly Ser Ser Tyr Val Asn Met Val Ala Asn 260 265 270
ACG GAT TTG ATA TTT TCT AAT TCC ATT TTA GGA GGG GGC ATT CGC TAT 920 Thr Asp Leu He Phe Ser Asn Ser He Leu Gly Gly Gly He Arg Tyr 275 280 285 290
GAT TTG AAA GCT AAT AAC ATT ATT TTC AAT AAC TCT CAA ATG GTT ATT 968 Asp Leu Lys Ala Asn Asn He He Phe Asn Asn Ser Gin Met Val He 295 300 305
GAT GTG TCT AAG AAT GTG AAT CAG TCA TCA TTG AAT GGG AAT GTT ACT 1016 Asp Val Ser Lys Asn Val Asn Gin Ser Ser Leu Asn Gly Asn Val Thr 310 315 320
TTC AAT AAT TCC AGG CTT TCA GTC AAG CCC AAT GCG GCT ATT AAT ATT 1064 Phe Asn Asn Ser Arg Leu Ser Val Lys Pro Asn Ala Ala He Asn He 325 330 335
GGG GAT AGC CAA ACC CAA ACG GCT TTA GAA AAC GCT TCA AGC CTT TCT 1112 Gly Asp Ser Gin Thr Gin Thr Ala Leu Glu Asn Ala Ser Ser Leu Ser 340 345 350
TTT TAC AAC AAC AGC GTG GCG AAT TTT AAC GGC ACA ACC GCT TTT AAC 1160 Phe Tyr Asn Asn Ser Val Ala Asn Phe Asn Gly Thr Thr Ala Phe Asn 355 360 365 370 GGG GTG TCT TAT TTG AAT TTG AAC CCT AAC GCT CAA GTA AGC TTC AAT 1208 Gly Val Ser Tyr Leu Asn Leu Asn Pro Asn Ala Gin Val Ser Phe Asn 375 380 385
CAA GTA AAT TTC AAT AAC GCT AAT GTA ACT TTT TAT GGC ATT CCT TTA 1256 Gin Val Asn Phe Asn Asn Ala Asn Val Thr Phe Tyr Gly He Pro Leu 390 395 400
TTT GGT AAA ACG CCT GAT TTT GGC AAC TCT GCA CGC CTT ATC AAT TTC 1304 Phe Gly Lys Thr Pro Asp Phe Gly Asn Ser Ala Arg Leu He Asn Phe 405 410 415
AAA GGG AAT ACG AAT TTT AAT CAA GCC ACG CTC AAT TTA AGG GCT AAA 1352 Lys Gly Asn Thr Asn Phe Asn Gin Ala Thr Leu Asn Leu Arg Ala Lys 420 425 430
AAT ATC CAT ATC AAT TTC CAA GGC GTT TCT ACT TTT AAA CAA AAC TCT 1400 Asn He His He Asn Phe Gin Gly Val Ser Thr Phe Lys Gin Asn Ser 435 440 445 450
ACG ATG AAT TTA GCT GAA AGT TCC CAA GCG AGC TTT AAC GCT CTT AAA 1448 Thr Met Asn Leu Ala Glu Ser Ser Gin Ala Ser Phe Asn Ala Leu Lys 455 460 465
GTG GAA GGG GAA ACG AAT TTC AAT CTC AAT AAC TCA AGC TTG TTG AAT 1496 Val Glu Gly Glu Thr Asn Phe Asn Leu Asn Asn Ser Ser Leu Leu Asn 470 475 480
TTC AAT GGC AAT AGC GTT TTC AAC GCT CCT GTG AGT TTT TAT GCT AAT 1544 Phe Asn Gly Asn Ser Val Phe Asn Ala Pro Val Ser Phe Tyr Ala Asn 485 490 495
CAT TCT CAA ATT TCT TTC ACT AAA TTA GCG ACT TTT AAT TCT GAC GCT 1592 His Ser Gin He Ser Phe Thr Lys Leu Ala Thr Phe Asn Ser Asp Ala 500 505 510
TCT TTT GAT TTA AGC AAC AAC AGC ACC CTG AAT TTT CAA AGC GTT CTT 1640 Ser Phe Asp Leu Ser Asn Asn Ser Thr Leu Asn Phe Gin Ser Val Leu 515 520 525 530
TTA AAT GGT GCT CTA AAC CTT TTA GGC AAT GGC AGT AAC AAT CTA GCG 1688 Leu Asn Gly Ala Leu Asn Leu Leu Gly Asn Gly Ser Asn Asn Leu Ala 535 540 545
ATC AAC GCT AAA GGG AAT TTT AGT TTT GGG TCT AAA GGG ATT TTG AAT 1736 He Asn Ala Lys Gly Asn Phe Ser Phe Gly Ser Lys Gly He Leu Asn 550 555 560
CTG TCT TAT ATG AAT CTA TTT GGG GGG GAT AAA AAA ACT TCC GTT TAT 1784 Leu Ser Tyr Met Asn Leu Phe Gly Gly Asp Lys Lys Thr Ser Val Tyr 565 570 575
GAT GTG TTG CAA GCC CAA AAT ATT GAT GGC TTA ATG GGG AAT AAC GGC 1832 Asp Val Leu Gin Ala Gin Asn He Asp Gly Leu Met Gly Asn Asn Gly 580 585 590 TAT GAG AAG ATC CGT TTT TAT GGC ATA CAG ATT GAC AAG GCT GAT TAC 1880 Tyr Glu Lys He Arg Phe Tyr Gly He Gin He Asp Lys Ala Asp Tyr 595 600 605 610
TCG TTT GAT AAC GGC GTT CAT TCT TGG AGA TTC ACT AAC CCG CTC AAT 1928 Ser Phe Asp Asn Gly Val His Ser Trp Arg Phe Thr Asn Pro Leu Asn 615 620 625
ACG ACT GAA ACG ATT ACA GAA ACC TTG CAT AAC AAC CGC TTG AAA GTG 1976 Thr Thr Glu Thr He Thr Glu Thr Leu His Asn Asn Arg Leu Lys Val 630 635 640
CAG ATC TCT CAA AAC GGC GTT TCT AAT AAT AAG ATG TTC AAT CTC GCT 2024 Gin He Ser Gin Asn Gly Val Ser Asn Asn Lys Met Phe Asn Leu Ala 645 650 655
CCT AGC TTG TAT GAT TAC CAA AAA AAC CCT TAT AAT GAA ACC GAG AAT 2072 Pro Ser Leu Tyr Asp Tyr Gin Lys Asn Pro Tyr Asn Glu Thr Glu Asn 660 665 670
TCC TAT AAT TAC ACA AGC GAT AAG GTT GGC ACT TAT TAT TTA ACG AGC 2120 Ser Tyr Asn Tyr Thr Ser Asp Lys Val Gly Thr Tyr Tyr Leu Thr Ser 675 680 685 690
AAT ATC AAA GGC TTT AAT CAA AAC AAT AAA ACA CCC GGG ACT TAT AAC 2168 Asn He Lys Gly Phe Asn Gin Asn Asn Lys Thr Pro Gly Thr Tyr Asn 695 700 705
GCG CAA AAC CAA CCC TTA CAA GCC TTA CAC ATT TAC AAT CAG GCT ATC 2216 Ala Gin Asn Gin Pro Leu Gin Ala Leu His He Tyr Asn Gin Ala He 710 715 720
ACT AAG CAA GAT TTG AAC ATG ATC GCC AGT TTG GGT AAG GAG TTT TTG 2264 Thr Lys Gin Asp Leu Asn Met He Ala Ser Leu Gly Lys Glu Phe Leu 725 730 735
CCT AAA ATA GCC AAT CTT TTA TCT TCA GGG GCT TTG GAT AAT CTC AAT 2312 Pro Lys He Ala Asn Leu Leu Ser Ser Gly Ala Leu Asp Asn Leu Asn 740 745 750
AGC CCG AAT AGT TTT GAA ACT CTT TTT GGT ATC TTT GAA AAG TAT GGT 2360 Ser Pro Asn Ser Phe Glu Thr Leu Phe Gly He Phe Glu Lys Tyr Gly 755 760 765 770
ATC ACT TTA AAC CAA GAA AAT TGG AAG AGC TTA TTA AAG ATT ATC AAT 2408 He Thr Leu Asn Gin Glu Asn Trp Lys Ser Leu Leu Lys He He Asn 775 780 785
AAT TTT TCC AAC ACA ACT AAT TAT GAT TTC TCT CAA GGC AAT CTC GTT 2456 Asn Phe Ser Asn Thr Thr Asn Tyr Asp Phe Ser Gin Gly Asn Leu Val 790 795 800
GTA GGA GCG ATC AAA GAG GGG CAA ACG AAC ACT AAA AGC GTG GTG TGG 2504 Val Gly Ala He Lys Glu Gly Gin Thr Asn Thr Lys Ser Val Val Trp 805 810 815 TTT GGA GGC GAA GGC TAT AAA GAG CCA TGT GCG GTT GGG GAT AAC ACT 2552 Phe Gly Gly Glu Gly Tyr Lys Glu Pro Cys Ala Val Gly Asp Asn Thr 820 825 830
TGC CAG ATG TTC AGA CAG ACT AAT TTA GGG CAA TTG CTC CAT TCT AGT 2600 Cys Gin Met Phe Arg Gin Thr Asn Leu Gly Gin Leu Leu His Ser Ser 835 840 845 850
ACG CCT TAT TTA GGC TAC ATT AAC GCT AAT TTT AGG GCT AAA AAC ATT 2648 Thr Pro Tyr Leu Gly Tyr He Asn Ala Asn Phe Arg Ala Lys Asn He 855 860 865
TAC ATT ACC GGA ACC ATC GGC AGC GGG AAC GCT TGG GGG AGT GGA GGG 2696 Tyr He Thr Gly Thr He Gly Ser Gly Asn Ala Trp Gly Ser Gly Gly 870 875 880
AGT GCG AAT GTG TCT TTT GAA AGC GGC ACT AAT TTA GTG CTT AAT CAA 2744 Ser Ala Asn Val Ser Phe Glu Ser Gly Thr Asn Leu Val Leu Asn Gin 885 890 895
GCT AAG ATT GAC GCT CAA GGG ACC GAT AAA ATC TTT TCT TAC TTG GGG 2792 Ala Lys He Asp Ala Gin Gly Thr Asp Lys He Phe Ser Tyr Leu Gly 900 905 910
CAA GGG GGT ATT GAA AAG CTT TTT GGA GAA AAA GGT TTA GGG AAT GCG 2840 Gin Gly Gly He Glu Lys Leu Phe Gly Glu Lys Gly Leu Gly Asn Ala 915 920 925 930
CTT TCT AAT ATC ATT TAT GAA GAG AGC TTG AAT GAT AAC GCT ATC CCT 2888 Leu Ser Asn He He Tyr Glu Glu Ser Leu Asn Asp Asn Ala He Pro 935 940 945
AAA GAT TTA GCC AAC ATG ATC CCT AAA GAT TTT GGA TCT AAG ACT TTA 2936 Lys Asp Leu Ala Asn Met He Pro Lys Asp Phe Gly Ser Lys Thr Leu 950 955 960
AGC TCA TTG CTT AGC CCT ACT GAA GTG AAT AAC CTC TTA GGC GTG AGC 2984 Ser Ser Leu Leu Ser Pro Thr Glu Val Asn Asn Leu Leu Gly Val Ser 965 970 975
GCA TTC AAA AAC GCG ATC ATG GAA ATT TTA AAT TCT AAA ACG GTG GGC 3032 Ala Phe Lys Asn Ala He Met Glu He Leu Asn Ser Lys Thr Val Gly 980 985 990
GAT GTT TTT GGT GAA AAC GGG CTT TTA AAC GCG CTA GAT CCT ACG GAA 3080 Asp Val Phe Gly Glu Asn Gly Leu Leu Asn Ala Leu Asp Pro Thr Glu 995 1000 1005 1010
AGA AAA AAA ATT GAT CAA ATG CTT TTA GAG CAA ATC CAA GCC CAT TCT 3128 Arg Lys Lys He Asp Gin Met Leu Leu Glu Gin He Gin Ala His Ser 1015 1020 1025
TCA GGG TTT GAA AAA TTC ATC GTG AAA ACT TTA GGG ATT GAA AAT GTA 3176 Ser Gly Phe Glu Lys Phe He Val Lys Thr Leu Gly He Glu Asn Val 1030 1035 1040 GAG AAT TTC ATC AAT AAC TGG TAT GGC AAG CAA AGC TTG AGT TCT TTT 3224 Glu Asn Phe He Asn Asn Trp Tyr Gly Lys Gin Ser Leu Ser Ser Phe 1045 1050 1055
GCC AAT AAT TTT GTG CCT GGA GGC TTG AAT CAA GCC CTT GAT AAA ATA 3272 Ala Asn Asn Phe Val Pro Gly Gly Leu Asn Gin Ala Leu Asp Lys He 1060 1065 1070
GGC TCT AGC TCT GAT GCC AAA GAC TTA CAG AAC TTC TTG GAT AAA ACG 3320 Gly Ser Ser Ser Asp Ala Lys Asp Leu Gin Asn Phe Leu Asp Lys Thr 1075 1080 1085 1090
ACT TTT GGG GAT ATT TTA AAT CAA ATG ATT GAA CAA GCC CCC TTA ATC 3368 Thr Phe Gly Asp He Leu Asn Gin Met He Glu Gin Ala Pro Leu He 1095 1100 1105
AAT AAA CTC ATT TCT TGG CTG GGT CCG CAG GAT TTG AGC GTT TTA GTG 3416 Asn Lys Leu He Ser Trp Leu Gly Pro Gin Asp Leu Ser Val Leu Val 1110 1115 1120
AAT ATC GCT TTA AAT AGC ATC ACT AAC CCT AGT AAA GAG CTG ACT AGC 3464 Asn He Ala Leu Asn Ser He Thr Asn Pro Ser Lys Glu Leu Thr Ser 1125 1130 1135
ACC ATT TCT AGC ATA GGT GAA AAA GCG TTA AAT GAC TTA TTA GGC GAT 3512 Thr He Ser Ser He Gly Glu Lys Ala Leu Asn Asp Leu Leu Gly Asp 1140 1145 1150
GGC GTA GTG AAT AAA ATC ATG AGC AAT CAA GTC TTA GGG CAA ATG ATC 3560 Gly Val Val Asn Lys He Met Ser Asn Gin Val Leu Gly Gin Met He 1155 1160 1165 1170
AAT AAA ATC ATT GCT GAT AAG GGC TTT GGA GGC GTT TAT CAG CAA GGT 3608 Asn Lys He He Ala Asp Lys Gly Phe Gly Gly Val Tyr Gin Gin Gly 1175 1180 1185
TTA GGC TCC ATA CTG CCT CAA TCT TTA CAA GAT GAA TTG AAG AAA TTG 3656 Leu Gly Ser He Leu Pro Gin Ser Leu Gin Asp Glu Leu Lys Lys Leu 1190 1195 1200
GGC ATG GGC TCT TTA CTA GGA TCT AGG GGG TTG CAC AAT CTT TGG CAA 3704 Gly Met Gly Ser Leu Leu Gly Ser Arg Gly Leu His Asn Leu Trp Gin 1205 1210 1215
AGA GGG AAT TTC AAT TTT GTG GCT AAA GAT TAT TTA TTC ACT AAT AAC 3752 Arg Gly Asn Phe Asn Phe Val Ala Lys Asp Tyr Leu Phe Thr Asn Asn 1220 1225 1230
AGC TCG TTT AGT AAC GCC ACA GGG GGG GAA TTG AAT TTT GTG GCG GGC 3800 Ser Ser Phe Ser Asn Ala Thr Gly Gly Glu Leu Asn Phe Val Ala Gly 1235 1240 1245 1250
AAG TCT ATT ATT TTT AAC GGG AAA AAT ACG ATC AAT TTC ACG CAG TAT 3848 Lys Ser He He Phe Asn Gly Lys Asn Thr He Asn Phe Thr Gin Tyr 1255 1260 1265 CAG GGT AAG CTT TCG TTT ATT TCT AAA GAT TTT TCT AAC ATT TCA TTA 3896 Gin Gly Lys Leu Ser Phe He Ser Lys Asp Phe Ser Asn He Ser Leu 1270 1275 1280
GAT ACC TTA AAC GCT ACT AAC GGA TTA ACG CTT AAT GCT CCT AAA AAT 3944 Asp Thr Leu Asn Ala Thr Asn Gly Leu Thr Leu Asn Ala Pro Lys Asn 1285 1290 1295
GAC ATT AGC GTT CAA AAA GGT CAG ATT TGC GTG AAT GTT TTA AAT TGC 3992 Asp He Ser Val Gin Lys Gly Gin He Cys Val Asn Val Leu Asn Cys 1300 1305 1310
ATG GGC GAG AAA AAA GCT CAT TCT TCA AGC GCG ACA GCC CCA ACC AAT 4040 Met Gly Glu Lys Lys Ala His Ser Ser Ser Ala Thr Ala Pro Thr Asn 1315 1320 1325 1330
GAA ACA CTA GAA GCG AAT GCG AAT AAT TTC GCT TTT TTA GGT GCA ATT 4088 Glu Thr Leu Glu Ala Asn Ala Asn Asn Phe Ala Phe Leu Gly Ala He 1335 1340 1345
AAG GCT AAT GGA TTA GTG GAT TTT TCA AAA GTT TTA CAA AAT ACT ACG 4136 Lys Ala Asn Gly Leu Val Asp Phe Ser Lys Val Leu Gin Asn Thr Thr 1350 1355 1360
ATC GGG ACT TTA GAT TTA GGG CCA AAC GCT ACT TTT AAA GCG AAT CAT 4184 He Gly Thr Leu Asp Leu Gly Pro Asn Ala Thr Phe Lys Ala Asn His 1365 1370 1375
TTG ATC GTG AAT AAC GCT TTT AAC AAT AAC TCT AAT TAC AGG GCT GAT 4232 Leu He Val Asn Asn Ala Phe Asn Asn Asn Ser Asn Tyr Arg Ala Asp 1380 1385 1390
ATT AGC GGT AAT CTC AAT GTG GTT AAA GGA GCG GCT CTC AGC ACG AAT 4280 He Ser Gly Asn Leu Asn Val Val Lys Gly Ala Ala Leu Ser Thr Asn 1395 1400 1405 1410
GAA AAT GGT TTG AAT GTG GGG GGC GAT TTC AAG AGC GAA GGG TCA TTA 4328 Glu Asn Gly Leu Asn Val Gly Gly Asp Phe Lys Ser Glu Gly Ser Leu 1415 1420 1425
ATC TTT AAT CTT AAC AAT AAA ACC AAT CAA ACG ATT ATT AAT GTG GCT 4376 He Phe Asn Leu Asn Asn Lys Thr Asn Gin Thr He He Asn Val Ala 1430 1435 1440
GGC AAT TCT ACG ATC ATG TCT TAT AAC AAT CAA GCT TTA ATC CAT TTT 4424 Gly Asn Ser Thr He Met Ser Tyr Asn Asn Gin Ala Leu He His Phe 1445 1450 1455
AAT ACC CAA CTC AAG CAA GGC GCT TAC ACG CTT ATT AAT GCG AAA CGC 4472 Asn Thr Gin Leu Lys Gin Gly Ala Tyr Thr Leu He Asn Ala Lys Arg 1460 1465 1470
ATG CTT TAT GGT TAT GAC AAT CAA ATC ATT CGT GGA GGG AGC TTG AGC 4520 Met Leu Tyr Gly Tyr Asp Asn Gin He He Arg Gly Gly Ser Leu Ser 1475 1480 1485 1490 GAT TAC CTC AAG CTT TAC ACC CTC ATT GAT TTT AAC GGC AAA CGC ATG 4568 Asp Tyr Leu Lys Leu Tyr Thr Leu He Asp Phe Asn Gly Lys Arg Met 1495 1500 1505
CAA TTA AAC GGC GAT TCA CTA AGC TAT GAC AAC CAA CCG GTC AAT ATT 4616 Gin Leu Asn Gly Asp Ser Leu Ser Tyr Asp Asn Gin Pro Val Asn He 1510 1515 1520
AAA GAT GGG GGT CTT GTG GTA AGC TTT AAA GAC AAT CAG GGG CAA ATG 4664 Lys Asp Gly Gly Leu Val Val Ser Phe Lys Asp Asn Gin Gly Gin Met 1525 1530 1535
GTG TAT TCA TCT ATC CTT TAT GAT AAA GTT CAA GTT AGC GTC TCT GAT 4712 Val Tyr Ser Ser He Leu Tyr Asp Lys Val Gin Val Ser Val Ser Asp 1540 1545 1550
AAG CCC ATG GAT ATT CAT GCC CCT AGT TTG GAG TAT TAC ATT AAA TAC 4760 Lys Pro Met Asp He His Ala Pro Ser Leu Glu Tyr Tyr He Lys Tyr 1555 1560 1565 1570
ATT CAA GGC AGT GCT GGT TTG GAT GCG ATC AAA TCT GCA GGC AAT AAT 4808 He Gin Gly Ser Ala Gly Leu Asp Ala He Lys Ser Ala Gly Asn Asn 1575 1580 1585
TCC ATT CTG TGG TTG AAT GAG CTT TTT GTG GCT AAA GGG GGT AAT CCC 4856 Ser He Leu Trp Leu Asn Glu Leu Phe Val Ala Lys Gly Gly Asn Pro 1590 1595 1600
TTG TTC GCT CCT TAT TAT TTG CAA GAC AAT CCC ACT GAA CAC ATT GTT 4904 Leu Phe Ala Pro Tyr Tyr Leu Gin Asp Asn Pro Thr Glu His He Val 1605 1610 1615
ACT TTA ATG AAA GAT ATT ACT AGC GCT TTA GGC ATG CTT TCT AAA CCC 4952 Thr Leu Met Lys Asp He Thr Ser Ala Leu Gly Met Leu Ser Lys Pro 1620 1625 1630
AAT CTT AAA AAC AAT TCC ACC GAT GCT TTA CAG CTC AAC ACT TAC ACG 5000 Asn Leu Lys Asn Asn Ser Thr Asp Ala Leu Gin Leu Asn Thr Tyr Thr 1635 1640 1645 1650
CAA CAA ATG AGC CGT TTA GCC AAG CTT TCT AAT TTC GCT TCC TTT GAT 5048 Gin Gin Met Ser Arg Leu Ala Lys Leu Ser Asn Phe Ala Ser Phe Asp 1655 1660 1665
TCA ACG GAT TTT AGC GAA CGC TTG AGC AGT CTT AAA AAC CAA AGA TTT 5096 Ser Thr Asp Phe Ser Glu Arg Leu Ser Ser Leu Lys Asn Gin Arg Phe 1670 1675 1680
GCT GAT GCA ATC CCT AAT GCG ATG GAT GTG ATT TTA AAA TAC TCT CAA 5144 Ala Asp Ala He Pro Asn Ala Met Asp Val He Leu Lys Tyr Ser Gin 1685 1690 1695
AGG GAT AAA CTA AAA AAC AAC CTT TGG GCG ACC GGC GTT GGG GGC GTG 5192 Arg Asp Lys Leu Lys Asn Asn Leu Trp Ala Thr Gly Val Gly Gly Val 1700 1705 1710 AGC TTT GTG GAA AAT GGC ACA GGA ACG CTC TAT GGT GTC AAT GTG GGC 5240 Ser Phe Val Glu Asn Gly Thr Gly Thr Leu Tyr Gly Val Asn Val Gly 1715 1720 1725 1730
TAT GAC AGA TTC ATT AAG GGT GTG ATT GTT GGA GGG TAT GCG GCT TAT 5288 Tyr Asp Arg Phe He Lys Gly Val He Val Gly Gly Tyr Ala Ala Tyr 1735 1740 1745
GGG TAT AGC GGT TTT TAT GAA CGC ATC ACT AAT TCT AAA TCC GAT AAT 5336 Gly Tyr Ser Gly Phe Tyr Glu Arg He Thr Asn Ser Lys Ser Asp Asn 1750 1755 1760
GTG GAT GTG GGC TTG TAT GCG AGG GCT TTC ATT AAA AAG AGC GAG CTA 5384 Val Asp Val Gly Leu Tyr Ala Arg Ala Phe He Lys Lys Ser Glu Leu 1765 1770 1775
ACC TTT AGC GTC AAT GAA ACT TGG GGG GCT AAT AAA AAC CAA ATC AGC 5432 Thr Phe Ser Val Asn Glu Thr Trp Gly Ala Asn Lys Asn Gin He Ser 1780 1785 1790
TCC AAC GAC ACT CTG CTT TCT ATG ATC AAT CAG TCC TAT AAA TAC AGC 5480 Ser Asn Asp Thr Leu Leu Ser Met He Asn Gin Ser Tyr Lys Tyr Ser 1795 1800 1805 1810
ACA TGG ACG ACG AAC GCA AAA GTT AAT TAC GGG TAT GAT TTC ATG TTT 5528 Thr Trp Thr Thr Asn Ala Lys Val Asn Tyr Gly Tyr Asp Phe Met Phe 1815 1820 1825
AAA AAC AAA AGC ATC ATT TTA AAA CCT CAA ATT GGT TTA AGG TAT TAC 5576 Lys Asn Lys Ser He He Leu Lys Pro Gin He Gly Leu Arg Tyr Tyr 1830 1835 1840
TAT ATC GGC ATG ACC GGT TTA GAA GGG GTG ATG CAT AAT GCG CTC TAT 5624 Tyr He Gly Met Thr Gly Leu Glu Gly Val Met His Asn Ala Leu Tyr 1845 1850 1855
AAC CAG TTT AAA GCG AAC GCC GAT CCG TCT AAA AAA TCC GTT TTA ACG 5672 Asn Gin Phe Lys Ala Asn Ala Asp Pro Ser Lys Lys Ser Val Leu Thr 1860 1865 1870
ATT GAA CTT GCT TTG GAG AAC CGC CAT TAT TTC AAC ACA AAC TCT TAT 5720 He Glu Leu Ala Leu Glu Asn Arg His Tyr Phe Asn Thr Asn Ser Tyr 1875 1880 1885 1890
TTT TAT GCG ATT GGC GGC TTT GGT AGA GAC TTA TTA GTT AAT TCT ATG 5768 Phe Tyr Ala He Gly Gly Phe Gly Arg Asp Leu Leu Val Asn Ser Met 1895 1900 1905
GGG GAT AAA TTG GTG CGT TTT ATT GGT AAC AAC ACT TTG AGC TAC AGG 5816 Gly Asp Lys Leu Val Arg Phe He Gly Asn Asn Thr Leu Ser Tyr Arg 1910 1915 1920
AAA GGC GAG CTT TAT AAC ACT TTT GCG AGC ATC ACT ACA GGC GGG GAA 5864 Lys Gly Glu Leu Tyr Asn Thr Phe Ala Ser He Thr Thr Gly Gly Glu 1925 1930 1935 GTG AGG TTG TTT AAA AGC TTT TAT GCG AAT GCT GGG GTG GGG GCT AGG 5912 Val Arg Leu Phe Lys Ser Phe Tyr Ala Asn Ala Gly Val Gly Ala Arg 1940 1945 1950
TTT GGA TTG GAC TAT AAA ATG ATC AAC ATT ACC GGA AAT ATT GGA ATG 5960 Phe Gly Leu Asp Tyr Lys Met He Asn He Thr Gly Asn He Gly Met 1955 1960 1965 1970
CGT TTA GCG TTT TAAAAGGTGG GGGCTTACCC CTTTTTTGAG CCATTAAATA AATCA 6017 Arg Leu Ala Phe
1
ATGGTGGG 6025
(2) INFORMATION FOR SEQ ID NO: 382:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1974 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:382:
Arg Asp Asn Ala Phe Ser Lys Asn Leu Trp Asn Leu He His Tyr Gly
1 5 10 15
Gly Glu Gin Gly Thr Leu Leu Arg Ala Asp Asn Asn Thr Phe Phe Val
20 25 30
Gin Phe Thr Gin Ser Asn Gly Gin Lys Phe Val Phe Glu Glu Thr Phe
35 40 45
Asn Pro Gly Ser He Thr Tyr Lys Tyr Phe Thr He His Ser Ser Leu
50 55 60
Phe His Thr Asp Ala Asp Ser Lys Asp He Trp Ser Gin Val Arg Lys 65 70 75 80
Gin Phe Asp Phe He Pro Gly Lys Thr Pro Val Cys Val Gly Val Cys
85 90 95
Tyr He Ala Pro Tyr Lys Asn Gin Asp Leu He Gly Ser Ser Ala Phe
100 105 110
Ala Trp Ser Leu Asn Phe Gly Ala Thr Val Val Gly Thr Leu Leu Leu
115 120 125
Gly Ser Ala Gin Glu Lys Ala Asn Asn Asn Gly Gly Ser He Trp Phe
130 135 140
Gly Lys Asn Asn Leu Leu Tyr Leu His Gly Asn Phe Asn Ala Thr Asn 145 150 155 160
He Phe Leu Thr Asn Asn Phe Asn Val Gly Asn Pro Asn Ala Gly Gly
165 170 175
Gly Ala Thr He Asn Phe Asn Ala Asp Glu Thr Leu Asn Ala Asp Gly
180 185 190
Leu Asn Tyr Thr Asn Phe Gin Thr Val Ala Leu Gly Leu Gin Thr Ser
195 200 205
Ala Ser Gin His Ser Trp Ala Asn Phe Asn Ser Lys Leu Ser Met Glu 210 215 220 He Lys Asn Ser Asn Phe Arg Asp Phe Thr Trp Gly Gly Phe Asn Phe 225 230 235 240
Asn Ser Gly Arg He Thr Phe Glu Asn Thr Thr Phe Ser Gly Trp Thr
245 250 255
Asn He Asn Gly Ala Thr Glu Ser Gly Ser Ser Tyr Val Asn Met Val
260 265 270
Ala Asn Thr Asp Leu He Phe Ser Asn Ser He Leu Gly Gly Gly He
275 280 285
Arg Tyr Asp Leu Lys Ala Asn Asn He He Phe Asn Asn Ser Gin Met
290 295 300
Val He Asp Val Ser Lys Asn Val Asn Gin Ser Ser Leu Asn Gly Asn 305 310 315 320
Val Thr Phe Asn Asn Ser Arg Leu Ser Val Lys Pro Asn Ala Ala He
325 330 335
Asn He Gly Asp Ser Gin Thr Gin Thr Ala Leu Glu Asn Ala Ser Ser
340 345 350
Leu Ser Phe Tyr Asn Asn Ser Val Ala Asn Phe Asn Gly Thr Thr Ala
355 360 365
Phe Asn Gly Val Ser Tyr Leu Asn Leu Asn Pro Asn Ala Gin Val Ser
370 375 380
Phe Asn Gin Val Asn Phe Asn Asn Ala Asn Val Thr Phe Tyr Gly He 385 390 395 400
Pro Leu Phe Gly Lys Thr Pro Asp Phe Gly Asn Ser Ala Arg Leu He
405 410 415
Asn Phe Lys Gly Asn Thr Asn Phe Asn Gin Ala Thr Leu Asn Leu Arg
420 425 430
Ala Lys Asn He His He Asn Phe Gin Gly Val Ser Thr Phe Lys Gin
435 440 445
Asn Ser Thr Met Asn Leu Ala Glu Ser Ser Gin Ala Ser Phe Asn Ala
450 455 460
Leu Lys Val Glu Gly Glu Thr Asn Phe Asn Leu Asn Asn Ser Ser Leu 465 470 475 480
Leu Asn Phe Asn Gly Asn Ser Val Phe Asn Ala Pro Val Ser Phe Tyr
485 490 495
Ala Asn His Ser Gin He Ser Phe Thr Lys Leu Ala Thr Phe Asn Ser
500 505 510
Asp Ala Ser Phe Asp Leu Ser Asn Asn Ser Thr Leu Asn Phe Gin Ser
515 520 525
Val Leu Leu Asn Gly Ala Leu Asn Leu Leu Gly Asn Gly Ser Asn Asn
530 535 540
Leu Ala He Asn Ala Lys Gly Asn Phe Ser Phe Gly Ser Lys Gly He 545 550 555 560
Leu Asn Leu Ser Tyr Met Asn Leu Phe Gly Gly Asp Lys Lys Thr Ser
565 570 575
Val Tyr Asp Val Leu Gin Ala Gin Asn He Asp Gly Leu Met Gly Asn
580 585 590
Asn Gly Tyr Glu Lys He Arg Phe Tyr Gly He Gin He Asp Lys Ala
595 600 605
Asp Tyr Ser Phe Asp Asn Gly Val His Ser Trp Arg Phe Thr Asn Pro
610 615 620
Leu Asn Thr Thr Glu Thr He Thr Glu Thr Leu His Asn Asn Arg Leu 625 630 635 640
Lys Val Gin He Ser Gin Asn Gly Val Ser Asn Asn Lys Met Phe Asn
645 650 655
Leu Ala Pro Ser Leu Tyr Asp Tyr Gin Lys Asn Pro Tyr Asn Glu Thr 660 665 670
Glu Asn Ser Tyr Asn Tyr Thr Ser Asp Lys Val Gly Thr Tyr Tyr Leu
675 680 685
Thr Ser Asn He Lys Gly Phe Asn Gin Asn Asn Lys Thr Pro Gly Thr
690 695 700
Tyr Asn Ala Gin Asn Gin Pro Leu Gin Ala Leu His He Tyr Asn Gin 705 710 715 720
Ala He Thr Lys Gin Asp Leu Asn Met He Ala Ser Leu Gly Lys Glu
725 730 735
Phe Leu Pro Lys He Ala Asn Leu Leu Ser Ser Gly Ala Leu Asp Asn
740 745 750
Leu Asn Ser Pro Asn Ser Phe Glu Thr Leu Phe Gly He Phe Glu Lys
755 760 765
Tyr Gly He Thr Leu Asn Gin Glu Asn Trp Lys Ser Leu Leu Lys He
770 775 780
He Asn Asn Phe Ser Asn Thr Thr Asn Tyr Asp Phe Ser Gin Gly Asn 785 790 795 800
Leu Val Val Gly Ala He Lys Glu Gly Gin Thr Asn Thr Lys Ser Val
805 810 815
Val Trp Phe Gly Gly Glu Gly Tyr Lys Glu Pro Cys Ala Val Gly Asp
820 825 830
Asn Thr Cys Gin Met Phe Arg Gin Thr Asn Leu Gly Gin Leu Leu His
835 840 845
Ser Ser Thr Pro Tyr Leu Gly Tyr He Asn Ala Asn Phe Arg Ala Lys
850 855 860
Asn He Tyr He Thr Gly Thr He Gly Ser Gly Asn Ala Trp Gly Ser 865 870 875 880
Gly Gly Ser Ala Asn Val Ser Phe Glu Ser Gly Thr Asn Leu Val Leu
885 890 895
Asn Gin Ala Lys He Asp Ala Gin Gly Thr Asp Lys He Phe Ser Tyr
900 905 910
Leu Gly Gin Gly Gly He Glu Lys Leu Phe Gly Glu Lys Gly Leu Gly
915 920 925
Asn Ala Leu Ser Asn He He Tyr Glu Glu Ser Leu Asn Asp Asn Ala
930 935 940
He Pro Lys Asp Leu Ala Asn Met He Pro Lys Asp Phe Gly Ser Lys 945 950 955 960
Thr Leu Ser Ser Leu Leu Ser Pro Thr Glu Val Asn Asn Leu Leu Gly
965 970 975
Val Ser Ala Phe Lys Asn Ala He Met Glu He Leu Asn Ser Lys Thr
980 985 990
Val Gly Asp Val Phe Gly Glu Asn Gly Leu Leu Asn Ala Leu Asp Pro
995 1000 1005
Thr Glu Arg Lys Lys He Asp Gin Met Leu Leu Glu Gin He Gin Ala
1010 1015 1020
His Ser Ser Gly Phe Glu Lys Phe He Val Lys Thr Leu Gly He Glu 025 1030 1035 1040
Asn Val Glu Asn Phe He Asn Asn Trp Tyr Gly Lys Gin Ser Leu Ser
1045 1050 1055
Ser Phe Ala Asn Asn Phe Val Pro Gly Gly Leu Asn Gin Ala Leu Asp
1060 1065 1070
Lys He Gly Ser Ser Ser Asp Ala Lys Asp Leu Gin Asn Phe Leu Asp
1075 1080 1085
Lys Thr Thr Phe Gly Asp He Leu Asn Gin Met He Glu Gin Ala Pro 1090 1095 1100 Leu He Asn Lys Leu He Ser Trp Leu Gly Pro Gin Asp Leu Ser Val 105 1110 1115 1120
Leu Val Asn He Ala Leu Asn Ser He Thr Asn Pro Ser Lys Glu Leu
1125 1130 1135
Thr Ser Thr He Ser Ser He Gly Glu Lys Ala Leu Asn Asp Leu Leu
1140 1145 1150
Gly Asp Gly Val Val Asn Lys He Met Ser Asn Gin Val Leu Gly Gin
1155 1160 1165
Met He Asn Lys He He Ala Asp Lys Gly Phe Gly Gly Val Tyr Gin
1170 1175 1180
Gin Gly Leu Gly Ser He Leu Pro Gin Ser Leu Gin Asp Glu Leu Lys 185 1190 1195 1200
Lys Leu Gly Met Gly Ser Leu Leu Gly Ser Arg Gly Leu His Asn Leu
1205 1210 1215
Trp Gin Arg Gly Asn Phe Asn Phe Val Ala Lys Asp Tyr Leu Phe Thr
1220 1225 1230
Asn Asn Ser Ser Phe Ser Asn Ala Thr Gly Gly Glu Leu Asn Phe Val
1235 1240 1245
Ala Gly Lys Ser He He Phe Asn Gly Lys Asn Thr He Asn Phe Thr
1250 1255 1260
Gin Tyr Gin Gly Lys Leu Ser Phe He Ser Lys Asp Phe Ser Asn He 265 1270 1275 1280
Ser Leu Asp Thr Leu Asn Ala Thr Asn Gly Leu Thr Leu Asn Ala Pro
1285 1290 1295
Lys Asn Asp He Ser Val Gin Lys Gly Gin He Cys Val Asn Val Leu
1300 1305 1310
Asn Cys Met Gly Glu Lys Lys Ala His Ser Ser Ser Ala Thr Ala Pro
1315 1320 1325
Thr Asn Glu Thr Leu Glu Ala Asn Ala Asn Asn Phe Ala Phe Leu Gly
1330 1335 1340
Ala He Lys Ala Asn Gly Leu Val Asp Phe Ser Lys Val Leu Gin Asn 345 1350 1355 1360
Thr Thr He Gly Thr Leu Asp Leu Gly Pro Asn Ala Thr Phe Lys Ala
1365 1370 1375
Asn His Leu He Val Asn Asn Ala Phe Asn Asn Asn Ser Asn Tyr Arg
1380 1385 1390
Ala Asp He Ser Gly Asn Leu Asn Val Val Lys Gly Ala Ala Leu Ser
1395 1400 1405
Thr Asn Glu Asn Gly Leu Asn Val Gly Gly Asp Phe Lys Ser Glu Gly
1410 1415 1420
Ser Leu He Phe Asn Leu Asn Asn Lys Thr Asn Gin Thr He He Asn 425 1430 1435 1440
Val Ala Gly Asn Ser Thr He Met Ser Tyr Asn Asn Gin Ala Leu He
1445 1450 1455
His Phe Asn Thr Gin Leu Lys Gin Gly Ala Tyr Thr Leu He Asn Ala
1460 1465 1470
Lys Arg Met Leu Tyr Gly Tyr Asp Asn Gin He He Arg Gly Gly Ser
1475 1480 1485
Leu Ser Asp Tyr Leu Lys Leu Tyr Thr Leu He Asp Phe Asn Gly Lys
1490 1495 1500
Arg Met Gin Leu Asn Gly Asp Ser Leu Ser Tyr Asp Asn Gin Pro Val 505 1510 1515 1520
Asn He Lys Asp Gly Gly Leu Val Val Ser Phe Lys Asp Asn Gin Gly
1525 1530 1535
Gin Met Val Tyr Ser Ser He Leu Tyr Asp Lys Val Gin Val Ser Val 1540 1545 1550
Ser Asp Lys Pro Met Asp He His Ala Pro Ser Leu Glu Tyr Tyr He
1555 1560 1565
Lys Tyr He Gin Gly Ser Ala Gly Leu Asp Ala He Lys Ser Ala Gly
1570 1575 1580
Asn Asn Ser He Leu Trp Leu Asn Glu Leu Phe Val Ala Lys Gly Gly 585 1590 1595 1600
Asn Pro Leu Phe Ala Pro Tyr Tyr Leu Gin Asp Asn Pro Thr Glu His
1605 1610 1615
He Val Thr Leu Met Lys Asp He Thr Ser Ala Leu Gly Met Leu Ser
1620 1625 1630
Lys Pro Asn Leu Lys Asn Asn Ser Thr Asp Ala Leu Gin Leu Asn Thr
1635 1640 1645
Tyr Thr Gin Gin Met Ser Arg Leu Ala Lys Leu Ser Asn Phe Ala Ser
1650 1655 1660
Phe Asp Ser Thr Asp Phe Ser Glu Arg Leu Ser Ser Leu Lys Asn Gin 665 1670 1675 1680
Arg Phe Ala Asp Ala He Pro Asn Ala Met Asp Val He Leu Lys Tyr
1685 1690 1695
Ser Gin Arg Asp Lys Leu Lys Asn Asn Leu Trp Ala Thr Gly Val Gly
1700 1705 1710
Gly Val Ser Phe Val Glu Asn Gly Thr Gly Thr Leu Tyr Gly Val Asn
1715 1720 1725
Val Gly Tyr Asp Arg Phe He Lys Gly Val He Val Gly Gly Tyr Ala
1730 1735 1740
Ala Tyr Gly Tyr Ser Gly Phe Tyr Glu Arg He Thr Asn Ser Lys Ser 745 1750 1755 1760
Asp Asn Val Asp Val Gly Leu Tyr Ala Arg Ala Phe He Lys Lys Ser
1765 1770 1775
Glu Leu Thr Phe Ser Val Asn Glu Thr Trp Gly Ala Asn Lys Asn Gin
1780 1785 1790
He Ser Ser Asn Asp Thr Leu Leu Ser Met He Asn Gin Ser Tyr Lys
1795 1800 1805
Tyr Ser Thr Trp Thr Thr Asn Ala Lys Val Asn Tyr Gly Tyr Asp Phe
1810 1815 1820
Met Phe Lys Asn Lys Ser He He Leu Lys Pro Gin He Gly Leu Arg 825 1830 1835 1840
Tyr Tyr Tyr He Gly Met Thr Gly Leu Glu Gly Val Met His Asn Ala
1845 1850 1855
Leu Tyr Asn Gin Phe Lys Ala Asn Ala Asp Pro Ser Lys Lys Ser Val
1860 1865 1870
Leu Thr He Glu Leu Ala Leu Glu Asn Arg His Tyr Phe Asn Thr Asn
1875 1880 1885
Ser Tyr Phe Tyr Ala He Gly Gly Phe Gly Arg Asp Leu Leu Val Asn
1890 1895 1900
Ser Met Gly Asp Lys Leu Val Arg Phe He Gly Asn Asn Thr Leu Ser 905 1910 1915 1920
Tyr Arg Lys Gly Glu Leu Tyr Asn Thr Phe Ala Ser He Thr Thr Gly
1925 1930 1935
Gly Glu Val Arg Leu Phe Lys Ser Phe Tyr Ala Asn Ala Gly Val Gly
1940 1945 1950
Ala Arg Phe Gly Leu Asp Tyr Lys Met He Asn He Thr Gly Asn He
1955 1960 1965
Gly Met Arg Leu Ala Phe 1970 1 (2) INFORMATION FOR SEQ ID NO: 383:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 755 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 124...690 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:383:
TTGCGCTTAT AAATTCCCGG CTTTAGAAAA ACACACCAAA ATTGTAGGAG TCATTAACCA 60
AGTGGGGCGC ASSGGGCGAT CACACCGGTC GCTCTTTTAG AGCCTGTGGA AATTGCTGGA 120
GCT ATG ATT AAT AGA GCG ACC TTA CAC AAT TAT TCT GAA ATT GAA AAA 168 Met He Asn Arg Ala Thr Leu His Asn Tyr Ser Glu He Glu Lys 1 5 10 15
AAG AAT ATC ATG CTC AGT GAT AGG GTC GTT GTC ATT AGA AGC GGC GAT 216 Lys Asn He Met Leu Ser Asp Arg Val Val Val He Arg Ser Gly Asp 20 25 30
GTG ATC CCT AAA ATC ATC AAG CCT TTA GAA TCT TAT AGA GAC GGC TCG 264 Val He Pro Lys He He Lys Pro Leu Glu Ser Tyr Arg Asp Gly Ser 35 40 45
CAA CAT AAA ATT GAA CGC CCC AAG GTT TGC CCT ATA TGT TCG CAT GAG 312 Gin His Lys He Glu Arg Pro Lys Val Cys Pro He Cys Ser His Glu 50 55 60
CTT TTG TGC GAA GAG ATT TTT ACT TAT TGT CAA AAC CTT AAT TGC CCG 360 Leu Leu Cys Glu Glu He Phe Thr Tyr Cys Gin Asn Leu Asn Cys Pro 65 70 75
GCA AGG TTG AAA GAA AGC TTG ATT CAT TTC GCT TCT AAA GAC GCT TTA 408 Ala Arg Leu Lys Glu Ser Leu He His Phe Ala Ser Lys Asp Ala Leu 80 85 90 95
AAC ATT CAA GGC TTG GGC GAT AAA GTC ATA GAG CAA CTT TTT GAA GAA 456 Asn He Gin Gly Leu Gly Asp Lys Val He Glu Gin Leu Phe Glu Glu 100 105 110
AAG CTC ATT TTT AAC GCT CTG GAT TTG TAT GCT TTA AAA TTA GAA GAT 504 Lys Leu He Phe Asn Ala Leu Asp Leu Tyr Ala Leu Lys Leu Glu Asp 115 120 125
TTA ATG CGG CTA GAC AAA TTT AAA ATT AAA AAA GCT CAA AAT CTA TTA 552 Leu Met Arg Leu Asp Lys Phe Lys He Lys Lys Ala Gin Asn Leu Leu 130 135 140
GAC GCT ATT TTA AAA AGC AAA AAC CCT CCC TTA TGG CGT TTG ATT AAC 600 Asp Ala He Leu Lys Ser Lys Asn Pro Pro Leu Trp Arg Leu He Asn 145 150 155
GCT TTA GGG ATT GAG CAT ATT GGT AAG GGA GCG AGT AAA ACG CTG GCC 648 Ala Leu Gly He Glu His He Gly Lys Gly Ala Ser Lys Thr Leu Ala 160 165 170 175
AAA TAC GGC TTA AAT GTG TTA GAA AAA AGC GAA SCG AGT TTT TAGAAATGG 699 Lys Tyr Gly Leu Asn Val Leu Glu Lys Ser Glu Xaa Ser Phe 180 185
AAGGCTTTGG GGTGGAAATG GCGCGCTCTT TAGTCAATTT TTATGCGAGC AATCAA 755
(2) INFORMATION FOR SEQ ID NO: 384:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 189 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384:
Met He Asn Arg Ala Thr Leu His Asn Tyr Ser Glu He Glu Lys Lys
1 5 10 15
Asn He Met Leu Ser Asp Arg Val Val Val He Arg Ser Gly Asp Val
20 25 30
He Pro Lys He He Lys Pro Leu Glu Ser Tyr Arg Asp Gly Ser Gin
35 40 45
His Lys He Glu Arg Pro Lys Val Cys Pro He Cys Ser His Glu Leu
50 55 60
Leu Cys Glu Glu He Phe Thr Tyr Cys Gin Asn Leu Asn Cys Pro Ala 65 70 75 80
Arg Leu Lys Glu Ser Leu He His Phe Ala Ser Lys Asp Ala Leu Asn
85 90 95
He Gin Gly Leu Gly Asp Lys Val He Glu Gin Leu Phe Glu Glu Lys
100 105 110
Leu He Phe Asn Ala Leu Asp Leu Tyr Ala Leu Lys Leu Glu Asp Leu
115 120 125
Met Arg Leu Asp Lys Phe Lys He Lys Lys Ala Gin Asn Leu Leu Asp
130 135 140
Ala He Leu Lys Ser Lys Asn Pro Pro Leu Trp Arg Leu He Asn Ala 145 150 155 160
Leu Gly He Glu His He Gly Lys Gly Ala Ser Lys Thr Leu Ala Lys
165 170 175
Tyr Gly Leu Asn Val Leu Glu Lys Ser Glu Xaa Ser Phe 180 185 (2) INFORMATION FOR SEQ ID NO: 385:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 403 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...350 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385:
AAAGGGCGAT TATCAAGGGG CTTTCAAGCT TTTTTCCCAA TCGTGCGATA ATG GTA 56
Met Val
1
ATG CGG CCG GGT GTT TTG CAA GTG GGG GCG ATG TAT GCT AAT GGG GTA 104 Met Arg Pro Gly Val Leu Gin Val Gly Ala Met Tyr Ala Asn Gly Val 5 10 15
GGG ATC CAA ACC AAC AGA TTA AAA GCC GCT CGC TAT TAT GAA TGG GTT 152 Gly He Gin Thr Asn Arg Leu Lys Ala Ala Arg Tyr Tyr Glu Trp Val 20 25 30
GCA GCG GGG GCG ATG CGA CCG CTT GCG CGA ATC TGG CTC AGA TGT ATG 200 Ala Ala Gly Ala Met Arg Pro Leu Ala Arg He Trp Leu Arg Cys Met 35 40 45 50
AAA ACA AGA AAA ATG CGG ATT CAA ACG ATA AAG AAA ACG CTT TGC AAT 248 Lys Thr Arg Lys Met Arg He Gin Thr He Lys Lys Thr Leu Cys Asn 55 60 65
TGT ATG CGG TGG CTT GTC AAG GGG GGG ATA TGC TCG CAT GCA ATA ATT 296 Cys Met Arg Trp Leu Val Lys Gly Gly He Cys Ser His Ala He He 70 75 80
TGG GGT GGA TGT TTG CTA ACG GAA GTG GGG TCC CAA AAG ATT ATT ACA 344 Trp Gly Gly Cys Leu Leu Thr Glu Val Gly Ser Gin Lys He He Thr 85 90 95
AAG CGA TAAGTTATTA TAAATTTTCA TGCGAGAATG GGAATGATAT GGGGTGTTAT AA 402 Lys Arg 100
T 403
(2) INFORMATION FOR SEQ ID NO: 386: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 100 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386:
Met Val Met Arg Pro Gly Val Leu Gin Val Gly Ala Met Tyr Ala Asn
1 5 10 15
Gly Val Gly He Gin Thr Asn Arg Leu Lys Ala Ala Arg Tyr Tyr Glu
20 25 30
Trp Val Ala Ala Gly Ala Met Arg Pro Leu Ala Arg He Trp Leu Arg
35 40 45
Cys Met Lys Thr Arg Lys Met Arg He Gin Thr He Lys Lys Thr Leu
50 55 60
Cys Asn Cys Met Arg Trp Leu Val Lys Gly Gly He Cys Ser His Ala 65 70 75 80
He He Trp Gly Gly Cys Leu Leu Thr Glu Val Gly Ser Gin Lys He
85 90 95
He Thr Lys Arg 100
(2) INFORMATION FOR SEQ ID NO: 387:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1837 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1784 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387:
CGCATGCGCT CCTTTCTAAA GCGATCAAAA ACAAAGAGTA AGGGATTAAC ATG TCA 56
Met Ser 1
AAA AAA ATC GTA GTC GAT CCT ATC ACT AGG ATT GAG GGG CAT TTA AGG 104 Lys Lys He Val Val Asp Pro He Thr Arg He Glu Gly His Leu Arg 5 10 15
ATT GAA GTG ATC GTA GAT GAT GAT AAC GTG ATC ACT GAT GCG TTT TCT 152 He Glu Val He Val Asp Asp Asp Asn Val He Thr Asp Ala Phe Ser 20 25 30
TCT TCT ACG CTT TTT AGG GGG CTA GAA ACC ATT ATT AAA GGC AGA GAT 200 Ser Ser Thr Leu Phe Arg Gly Leu Glu Thr He He Lys Gly Arg Asp 35 40 45 50
CCA CGA GAT GCA GGC TTC ATC GCT CAA AGG ATT TGC GGG GTA TGC ACT 248 Pro Arg Asp Ala Gly Phe He Ala Gin Arg He Cys Gly Val Cys Thr 55 60 65
TAT TCG CAT TAT AAG GCC GGT ATC ACG GCG GTA GAA AAC GCT CTA GGC 296 Tyr Ser His Tyr Lys Ala Gly He Thr Ala Val Glu Asn Ala Leu Gly 70 75 80
ATC ACT CCC CCA TTA AAC GCG CAA TTG GTG CGA TCT TTG ATG AAC ATG 344 He Thr Pro Pro Leu Asn Ala Gin Leu Val Arg Ser Leu Met Asn Met 85 90 95
GCG CTG CTT TTT CAT GAC CAT GTG GTG CAT TTC TAT ACT TTG CAT GGG 392 Ala Leu Leu Phe His Asp His Val Val His Phe Tyr Thr Leu His Gly 100 105 110
CTT GAT TGG TGC GAT ATC ATG AGC GCT TTA AAA GCC GAT CCC ATT CAA 440 Leu Asp Trp Cys Asp He Met Ser Ala Leu Lys Ala Asp Pro He Gin 115 120 125 130
GCG GCA AAA CTT TCT TTC AAA TAC AGC CCT TAC CCT ATT AAT ACC GGT 488 Ala Ala Lys Leu Ser Phe Lys Tyr Ser Pro Tyr Pro He Asn Thr Gly 135 140 145
GCC GGT GAA TTA AAA GCG GTT CAA AAA CGC TTG AGC GAT TTC GCT AAA 536 Ala Gly Glu Leu Lys Ala Val Gin Lys Arg Leu Ser Asp Phe Ala Lys 150 155 160
AGC GGA TCT TTG GGG CCT TTC AGT AAC GGC TAT TAC GGG CAT AAA ACT 584 Ser Gly Ser Leu Gly Pro Phe Ser Asn Gly Tyr Tyr Gly His Lys Thr 165 170 175
TAT CGT TTA AGT CCG GAG CAA AAT TTA ATC GTC TTA AGC CAC TAC CTC 632 Tyr Arg Leu Ser Pro Glu Gin Asn Leu He Val Leu Ser His Tyr Leu 180 185 190
AAG CTT TTA GAA ATC CAA AGG GAA GCG GCG AAA ATG ACC GCT ATT TTT 680 Lys Leu Leu Glu He Gin Arg Glu Ala Ala Lys Met Thr Ala He Phe 195 200 205 210
GGG GCC AAA CAG CCT CAC CCA CAA AGC CTA ACG GTG GGG GGT GTT ACG 728 Gly Ala Lys Gin Pro His Pro Gin Ser Leu Thr Val Gly Gly Val Thr 215 220 225
AGT GTT ATG GAT ATA TTG GAT CCG ACG AGA TTG GCT GAA TGG AAG AGC 776 Ser Val Met Asp He Leu Asp Pro Thr Arg Leu Ala Glu Trp Lys Ser 230 235 240 AAG TTT GAA GTG GTG GCC AAT TTC ATC AAC CAT GCT TAC TAC CCT GAT 824 Lys Phe Glu Val Val Ala Asn Phe He Asn His Ala Tyr Tyr Pro Asp 245 250 255
TTG GTG ATG GCA GGC GAA ATG TTC GCT AAC GAA CAA TCC GTT ATC AAA 872 Leu Val Met Ala Gly Glu Met Phe Ala Asn Glu Gin Ser Val He Lys 260 265 270
GGC TGT GGC TTA AGG AAT TTT ATC GCT TAT GAA GAA GTG CTG CTT GGG 920 Gly Cys Gly Leu Arg Asn Phe He Ala Tyr Glu Glu Val Leu Leu Gly 275 280 285 290
AGG GAT AAA TAC CTT TTG AGT AGT GGG GTG GTG CTT GAT GGG GAT ATT 968 Arg Asp Lys Tyr Leu Leu Ser Ser Gly Val Val Leu Asp Gly Asp He 295 300 305
TCT AAA TTA CAC CCC ATT GAT GAA AGT TTG ATT AAA GAA GAA GTT ACG 1016 Ser Lys Leu His Pro He Asp Glu Ser Leu He Lys Glu Glu Val Thr 310 315 320
CAT TCT TGG TAT CAA TAC GAA GAC ACT AAA GAA GTG CAA CTC CAC CCT 1064 His Ser Trp Tyr Gin Tyr Glu Asp Thr Lys Glu Val Gin Leu His Pro 325 330 335
TAT GAC GGG CAA ACG AAC CCG CAT TAT ACC GGT TTA AAA GAC GGC GAG 1112 Tyr Asp Gly Gin Thr Asn Pro His Tyr Thr Gly Leu Lys Asp Gly Glu 340 345 350
AGC GTG GGG ATT GAA AAT AAA ATC ATC CCT GCT AAA GTG CTT GAC ACT 1160 Ser Val Gly He Glu Asn Lys He He Pro Ala Lys Val Leu Asp Thr 355 360 365 370
AAA AAT AAA TAT TCT TGG ATA AAA TCG CCC AGA TAC GAT AGT AAG CCC 1208 Lys Asn Lys Tyr Ser Trp He Lys Ser Pro Arg Tyr Asp Ser Lys Pro 375 380 385
ATG GAA GTA GGT CCT TTA AGT TCC GTA GTG GTA GGT TTA GCG GCG AAA 1256 Met Glu Val Gly Pro Leu Ser Ser Val Val Val Gly Leu Ala Ala Lys 390 395 400
AAC CCT TAT GTT ACT GAA GTG GCT ACG AAG TTT TTA AAA GAC ACT AAA 1304 Asn Pro Tyr Val Thr Glu Val Ala Thr Lys Phe Leu Lys Asp Thr Lys 405 410 415
CTG CCT TTA GAG GCG TTG TTT TCA ACG CTT GGG CGA ACA GCT GCA AGG 1352 Leu Pro Leu Glu Ala Leu Phe Ser Thr Leu Gly Arg Thr Ala Ala Arg 420 425 430
TGT ATT GAA GCT AAA ACG ATC GCT GAT AAT GGC CTT TTG GCG TTT GAT 1400 Cys He Glu Ala Lys Thr He Ala Asp Asn Gly Leu Leu Ala Phe Asp 435 440 445 450
GCG TTA GTG GAA AAT CTA AAA AGC GAT CAA AGC ACT TGT GCT CCT TAT 1448 Ala Leu Val Glu Asn Leu Lys Ser Asp Gin Ser Thr Cys Ala Pro Tyr 455 460 465 CAC ATT GAT AAA AAT CAA GAA TAT AAA GGG CGC TAC ATT GGT CAA GTG 1496 His He Asp Lys Asn Gin Glu Tyr Lys Gly Arg Tyr He Gly Gin Val 470 475 480
CCA AGG GGC ATG CTA AGC CAT TGG GTG CGT ATT AAA AAC GGC GTG GTG 1544 Pro Arg Gly Met Leu Ser His Trp Val Arg He Lys Asn Gly Val Val 485 490 495
GAA AAT TAT CAA GCG GTG GTG CCT TCT ACT TGG AAT GCA GGG CCT AGA 1592 Glu Asn Tyr Gin Ala Val Val Pro Ser Thr Trp Asn Ala Gly Pro Arg 500 505 510
GAT TCT CAA AAT CAA AGG GGG GCT TAT GAA ATG AGC TTG ATT GGC ACT 1640 Asp Ser Gin Asn Gin Arg Gly Ala Tyr Glu Met Ser Leu He Gly Thr 515 520 525 530
AAA ATC GCT GAT TTA ACC CAG CCT TTA GAA ATC ATT AGG ACT ATC CAT 1688 Lys He Ala Asp Leu Thr Gin Pro Leu Glu He He Arg Thr He His 535 540 545
TCT TTT GAC CCA TGC ATC GCA TGC TCG GTG CAT GTG ATG GAT TTT AAA 1736 Ser Phe Asp Pro Cys He Ala Cys Ser Val His Val Met Asp Phe Lys 550 555 560
GGG CAG TCT TTA AAC GAG TTT AAA GTA GAG CCT AAT TTC GCT AAA TTC T 1785 Gly Gin Ser Leu Asn Glu Phe Lys Val Glu Pro Asn Phe Ala Lys Phe 565 570 575
AAAAAGGGTT ACGCATGGAT AAAATGAATA AGGTCGTTTT ACACAAAGAA TA 1837
(2) INFORMATION FOR SEQ ID NO: 388:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 578 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388:
Met Ser Lys Lys He Val Val Asp Pro He Thr Arg He Glu Gly His
1 5 10 15
Leu Arg He Glu Val He Val Asp Asp Asp Asn Val He Thr Asp Ala
20 25 30
Phe Ser Ser Ser Thr Leu Phe Arg Gly Leu Glu Thr He He Lys Gly
35 40 45
Arg Asp Pro Arg Asp Ala Gly Phe He Ala Gin Arg He Cys Gly Val
50 55 60
Cys Thr Tyr Ser His Tyr Lys Ala Gly He Thr Ala Val Glu Asn Ala 65 70 75 80
Leu Gly He Thr Pro Pro Leu Asn Ala Gin Leu Val Arg Ser Leu Met 85 90 95 Asn Met Ala Leu Leu Phe His Asp His Val Val His Phe Tyr Thr Leu
100 105 110
His Gly Leu Asp Trp Cys Asp He Met Ser Ala Leu Lys Ala Asp Pro
115 120 125
He Gin Ala Ala Lys Leu Ser Phe Lys Tyr Ser Pro Tyr Pro He Asn
130 135 140
Thr Gly Ala Gly Glu Leu Lys Ala Val Gin Lys Arg Leu Ser Asp Phe 145 150 155 160
Ala Lys Ser Gly Ser Leu Gly Pro Phe Ser Asn Gly Tyr Tyr Gly His
165 170 175
Lys Thr Tyr Arg Leu Ser Pro Glu Gin Asn Leu He Val Leu Ser His
180 185 190
Tyr Leu Lys Leu Leu Glu He Gin Arg Glu Ala Ala Lys Met Thr Ala
195 200 205
He Phe Gly Ala Lys Gin Pro His Pro Gin Ser Leu Thr Val Gly Gly
210 215 220
Val Thr Ser Val Met Asp He Leu Asp Pro Thr Arg Leu Ala Glu Trp 225 230 235 240
Lys Ser Lys Phe Glu Val Val Ala Asn Phe He Asn His Ala Tyr Tyr
245 250 255
Pro Asp Leu Val Met Ala Gly Glu Met Phe Ala Asn Glu Gin Ser Val
260 265 270
He Lys Gly Cys Gly Leu Arg Asn Phe He Ala Tyr Glu Glu Val Leu
275 280 285
Leu Gly Arg Asp Lys Tyr Leu Leu Ser Ser Gly Val Val Leu Asp Gly
290 295 300
Asp He Ser Lys Leu His Pro He Asp Glu Ser Leu He Lys Glu Glu 305 310 315 320
Val Thr His Ser Trp Tyr Gin Tyr Glu Asp Thr Lys Glu Val Gin Leu
325 330 335
His Pro Tyr Asp Gly Gin Thr Asn Pro His Tyr Thr Gly Leu Lys Asp
340 345 350
Gly Glu Ser Val Gly He Glu Asn Lys He He Pro Ala Lys Val Leu
355 360 365
Asp Thr Lys Asn Lys Tyr Ser Trp He Lys Ser Pro Arg Tyr Asp Ser
370 375 380
Lys Pro Met Glu Val Gly Pro Leu Ser Ser Val Val Val Gly Leu Ala 385 390 395 400
Ala Lys Asn Pro Tyr Val Thr Glu Val Ala Thr Lys Phe Leu Lys Asp
405 410 415
Thr Lys Leu Pro Leu Glu Ala Leu Phe Ser Thr Leu Gly Arg Thr Ala
420 425 430
Ala Arg Cys He Glu Ala Lys Thr He Ala Asp Asn Gly Leu Leu Ala
435 440 445
Phe Asp Ala Leu Val Glu Asn Leu Lys Ser Asp Gin Ser Thr Cys Ala
450 455 460
Pro Tyr His He Asp Lys Asn Gin Glu Tyr Lys Gly Arg Tyr He Gly 465 470 475 480
Gin Val Pro Arg Gly Met Leu Ser His Trp Val Arg He Lys Asn Gly
485 490 495
Val Val Glu Asn Tyr Gin Ala Val Val Pro Ser Thr Trp Asn Ala Gly
500 505 510
Pro Arg Asp Ser Gin Asn Gin Arg Gly Ala Tyr Glu Met Ser Leu He
515 520 525
Gly Thr Lys He Ala Asp Leu Thr Gin Pro Leu Glu He He Arg Thr 530 535 540
He His Ser Phe Asp Pro Cys He Ala Cys Ser Val His Val Met Asp 545 550 555 560
Phe Lys Gly Gin Ser Leu Asn Glu Phe Lys Val Glu Pro Asn Phe Ala
565 570 575
Lys Phe
(2) INFORMATION FOR SEQ ID NO: 389:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 720 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 80...613 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389:
ATATGGTGTT TTTCCATTCT ATCAGGTATG AAAGTTCGGG GGCGGATTCT ATGATTAATG 60 GCTATGGTTA TACCAAAGA ATG AGT CAA AAA ATC CTA ATT CTA GGT ATT GGC 112
Met Ser Gin Lys He Leu He Leu Gly He Gly 1 5 10
AAT ATC CTT TTT GGC GAT GAA GGG ATT GGG GTG CAT TTA GCC CAC TAC 160 Asn He Leu Phe Gly Asp Glu Gly He Gly Val His Leu Ala His Tyr 15 20 25
CTC AAA AAA AAT TTT TCT TTT TTC CCT AGC GTG GAT ATT ATA GAT GGG 208 Leu Lys Lys Asn Phe Ser Phe Phe Pro Ser Val Asp He He Asp Gly 30 35 40
GGG ACA ATG GCC CAG CAG CTC ATT CCT TTA ATC ACT TCG TAT GAA AAG 256 Gly Thr Met Ala Gin Gin Leu He Pro Leu He Thr Ser Tyr Glu Lys 45 50 55
GTT TTG ATT TTG GAT TGC GTG AGC GCT GAA GGC GTT GAG ATA GGA TCA 304 Val Leu He Leu Asp Cys Val Ser Ala Glu Gly Val Glu He Gly Ser 60 65 70 75
GTC TAT GCT TTT GAT TTT AAG GAC GCT CCT AAA GAA ATC ACA TGG GCT 352 Val Tyr Ala Phe Asp Phe Lys Asp Ala Pro Lys Glu He Thr Trp Ala 80 85 90
GGG AGC GCT CAT GAA GTG GAA ATG CTA CAC ACT TTA AGG CTC ACG GAG 400 Gly Ser Ala His Glu Val Glu Met Leu His Thr Leu Arg Leu Thr Glu 95 100 105
TTT TTA GGG GAT TTG CCT AAA ACT TTT ATC GTG GGG CTT GTG CCT TTT 448 Phe Leu Gly Asp Leu Pro Lys Thr Phe He Val Gly Leu Val Pro Phe 110 115 120
GTG ATA GGG AGC GAG ACC ACT TTC AAG CTT TCA AGC AAA ATT TTA AAC 496 Val He Gly Ser Glu Thr Thr Phe Lys Leu Ser Ser Lys He Leu Asn 125 130 135
GCT TTA GAA ACC GCC TTA AAA GCC ATA GAA ACC CAA CTC AAC GCA TGG 544 Ala Leu Glu Thr Ala Leu Lys Ala He Glu Thr Gin Leu Asn Ala Trp 140 145 150 155
GGG GTT AAA ATG CAA CGC ACC GAT CAT ATC GCT TTA GAA TGT ATC GCT 592 Gly Val Lys Met Gin Arg Thr Asp His He Ala Leu Glu Cys He Ala 160 165 170
GAA CTT TCT TAT AAG GGT TTT TGAATTGGTT TTTGTTTTTC TTTTTAAATG CGTT 647 Glu Leu Ser Tyr Lys Gly Phe 175
AATGAAGAAA CAAGCCTGAA TTTTACGCCC CTTTTAGAGC GAATGGCATG CAATTTGCAA 707 GCGCGTTTTT ATA 720
(2) INFORMATION FOR SEQ ID NO: 390:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 178 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390:
Met Ser Gin Lys He Leu He Leu Gly He Gly Asn He Leu Phe Gly
1 5 10 15
Asp Glu Gly He Gly Val His Leu Ala His Tyr Leu Lys Lys Asn Phe
20 25 30
Ser Phe Phe Pro Ser Val Asp He He Asp Gly Gly Thr Met Ala Gin
35 40 45
Gin Leu He Pro Leu He Thr Ser Tyr Glu Lys Val Leu He Leu Asp
50 55 60
Cys Val Ser Ala Glu Gly Val Glu He Gly Ser Val Tyr Ala Phe Asp 65 70 75 80
Phe Lys Asp Ala Pro Lys Glu He Thr Trp Ala Gly Ser Ala His Glu
85 90 95
Val Glu Met Leu His Thr Leu Arg Leu Thr Glu Phe Leu Gly Asp Leu
100 105 110
Pro Lys Thr Phe He Val Gly Leu Val Pro Phe Val He Gly Ser Glu
115 120 125
Thr Thr Phe Lys Leu Ser Ser Lys He Leu Asn Ala Leu Glu Thr Ala 130 135 140
Leu Lys Ala He Glu Thr Gin Leu Asn Ala Trp Gly Val Lys Met Gin 145 150 155 160
Arg Thr Asp His He Ala Leu Glu Cys He Ala Glu Leu Ser Tyr Lys
165 170 175
Gly Phe
(2) INFORMATION FOR SEQ ID NO: 391:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 508 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...455 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391:
ATTGAAGCGA GTAACGCTTA TTATAAAAAA CGCTTATAAA TCTTATCAAC ATG GGC 56
Met Gly 1
AAT TTG ACT TAT TAC GCT TAC ATG TAT TTG ATC CTC TTT GTA TGC TTG 104 Asn Leu Thr Tyr Tyr Ala Tyr Met Tyr Leu He Leu Phe Val Cys Leu 5 10 15
CTG CCT GTG TTA TTA ATG GGG CTT GTT TGG AGG CTT ACT CGC CCC CCC 152 Leu Pro Val Leu Leu Met Gly Leu Val Trp Arg Leu Thr Arg Pro Pro 20 25 30
TTA AAG CAA AAT ATT CCT AAT AAA AGC CTC TCT TTA GAA AAT TTA AAC 200 Leu Lys Gin Asn He Pro Asn Lys Ser Leu Ser Leu Glu Asn Leu Asn 35 40 45 50
GAA CAA ATC AAA AAC CTT AAA AGC GTA CCA GCT TTA GAA AAA CTG AAA 248 Glu Gin He Lys Asn Leu Lys Ser Val Pro Ala Leu Glu Lys Leu Lys 55 60 65
AAC GAC TTC AAT GAG CGT TTT AAA ATT TGC CCC AAA GAT AAA GAA ACT 296 Asn Asp Phe Asn Glu Arg Phe Lys He Cys Pro Lys Asp Lys Glu Thr 70 75 80
CTG TGG TTA GAA ACG ATC CAA AAA TTA GTC GCT TCA GAA TTT TTT GAA 344 Leu Trp Leu Glu Thr He Gin Lys Leu Val Ala Ser Glu Phe Phe Glu 85 90 95 TTA GAA GAC GCT ATT AAT TTT GGG CAA GAA TTA GAA AAC GCT AAC CCT 392 Leu Glu Asp Ala He Asn Phe Gly Gin Glu Leu Glu Asn Ala Asn Pro 100 105 110
AAT TAC CAA CAA AAA ATC GCT AAC GCT ACC GGC TTA GCC CTT AAG AAT 440 Asn Tyr Gin Gin Lys He Ala Asn Ala Thr Gly Leu Ala Leu Lys Asn 115 120 125 130
AAA AAA GAA AAA GGA TAGAATTGGA TTTTTTAGAG ATTGTAGGAC AAGTCCCTTT A 496 Lys Lys Glu Lys Gly 135
AAAGGAGAGG TA 508
(2) INFORMATION FOR SEQ ID NO: 392:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 135 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392:
Met Gly Asn Leu Thr Tyr Tyr Ala Tyr Met Tyr Leu He Leu Phe Val
1 5 10 15
Cys Leu Leu Pro Val Leu Leu Met Gly Leu Val Trp Arg Leu Thr Arg
20 25 30
Pro Pro Leu Lys Gin Asn He Pro Asn Lys Ser Leu Ser Leu Glu Asn
35 40 45
Leu Asn Glu Gin He Lys Asn Leu Lys Ser Val Pro Ala Leu Glu Lys
50 55 60
Leu Lys Asn Asp Phe Asn Glu Arg Phe Lys He Cys Pro Lys Asp Lys 65 70 75 80
Glu Thr Leu Trp Leu Glu Thr He Gin Lys Leu Val Ala Ser Glu Phe
85 90 95
Phe Glu Leu Glu Asp Ala He Asn Phe Gly Gin Glu Leu Glu Asn Ala
100 105 110
Asn Pro Asn Tyr Gin Gin Lys He Ala Asn Ala Thr Gly Leu Ala Leu
115 120 125
Lys Asn Lys Lys Glu Lys Gly 130 135
(2) INFORMATION FOR SEQ ID NO: 393:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1183 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA ( ix) FEATURE :
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1130 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:393:
ATCTAAAAAT GCTACAATGT TTATCTTTAA AACGAAAGGG CAATTTAACC ATG GAC 56
Met Asp 1
TTT TTA GAA AAA GTA TTA GAC AAT CAA GTT ACT GAA AGT AAA GAA TTG 104 Phe Leu Glu Lys Val Leu Asp Asn Gin Val Thr Glu Ser Lys Glu Leu 5 10 15
GTC AGG CTT TAT GAT TAT GAT TTA TAC ACG CTA GGG GAA GTA GCG GAT 152 Val Arg Leu Tyr Asp Tyr Asp Leu Tyr Thr Leu Gly Glu Val Ala Asp 20 25 30
CGC ATG CGC CAA AAC ATG CAC CAA AAA ATC GTG TAT TTT AAT GTC AAT 200 Arg Met Arg Gin Asn Met His Gin Lys He Val Tyr Phe Asn Val Asn 35 40 45 50
AGG CAT TTA AAC CCT AGC AAT ATT TGC GCG GAC GCT TGC AAA TTT TGC 248 Arg His Leu Asn Pro Ser Asn He Cys Ala Asp Ala Cys Lys Phe Cys 55 60 65
GCT TTT TCA GCC CAC AGA AAA AAC CCA AAC CCT TAT GAA ATG AGC TTA 296 Ala Phe Ser Ala His Arg Lys Asn Pro Asn Pro Tyr Glu Met Ser Leu 70 75 80
GAA GAA ATC CTA GAA AAG GTT AAA AAC TCC TAC AAC AAG GGG ATT AAA 344 Glu Glu He Leu Glu Lys Val Lys Asn Ser Tyr Asn Lys Gly He Lys 85 90 95
GAA GTC CAT ATC GTG AGC GCT CAT AAC CCT AAT TAC TCC TAT GAA TGG 392 Glu Val His He Val Ser Ala His Asn Pro Asn Tyr Ser Tyr Glu Trp 100 105 110
TAT TTA AAG GTG TTT GAA ACC ATC AAG CAA GAA ATG CCT AAC TTG CAT 440 Tyr Leu Lys Val Phe Glu Thr He Lys Gin Glu Met Pro Asn Leu His 115 120 125 130
TTA AAG GCC ATG ACC GCT GCA GAA GTG CAT TTT TTA AGC GTT AAA TTC 488 Leu Lys Ala Met Thr Ala Ala Glu Val His Phe Leu Ser Val Lys Phe 135 140 145
AAC AAA CCT TTT GAA TTG GTG CTA GAA GAC ATG CTC AAA GCC GGG GTG 536 Asn Lys Pro Phe Glu Leu Val Leu Glu Asp Met Leu Lys Ala Gly Val 150 155 160 GAT TCC ATG CCT GGT GGG GGG GCG GAG ATT TTT GAT GAA GAA ATC AGG 584 Asp Ser Met Pro Gly Gly Gly Ala Glu He Phe Asp Glu Glu He Arg 165 170 175
CGT AAA ATC TGT AAT GGT AAG GTG GGA TCT TCT CGG TGG TTA GAA ATC 632 Arg Lys He Cys Asn Gly Lys Val Gly Ser Ser Arg Trp Leu Glu He 180 185 190
CAT GCT TAT TGG CAC AAA TTA GGC AAA ATG AGT AAC GCT ACC ATG CTT 680 His Ala Tyr Trp His Lys Leu Gly Lys Met Ser Asn Ala Thr Met Leu 195 200 205 210
TTT GGG CAT ATT GAA AAT AAA ATC CAT CGC ATC GAT CAC ATG CTA AGA 728 Phe Gly His He Glu Asn Lys He His Arg He Asp His Met Leu Arg 215 220 225
ATC AAA AAA ATC CAA AGC CCT AAA AAT CAA GTA GAA AAC AAA GAA GGG 776 He Lys Lys He Gin Ser Pro Lys Asn Gin Val Glu Asn Lys Glu Gly 230 235 240
GGT TTT AAC GCT TTT ATC CCC TTG TTG TAT CAA AAA GAA AAC AAT TAT 824 Gly Phe Asn Ala Phe He Pro Leu Leu Tyr Gin Lys Glu Asn Asn Tyr 245 250 255
TTG AAT GTG GAA AAA TCC CCC AGT GCG ATA GAA ATC TTA AAA ACC ATC 872 Leu Asn Val Glu Lys Ser Pro Ser Ala He Glu He Leu Lys Thr He 260 265 270
GCC ATA TCT CGC ATT CTT TTA AAC AAT ATC CCT CAC ATT AAA GCT TAT 920 Ala He Ser Arg He Leu Leu Asn Asn He Pro His He Lys Ala Tyr 275 280 285 290
TGG GCG ACT TTG GGC TTG AAT TTG GCT TTA GTG GCT CAA GAA TTT GGC 968 Trp Ala Thr Leu Gly Leu Asn Leu Ala Leu Val Ala Gin Glu Phe Gly 295 300 305
GCT AAC GAT TTA GAC GGC ACG ATA GAG ATA GAG AGC ATT CAA AGC GCG 1016 Ala Asn Asp Leu Asp Gly Thr He Glu He Glu Ser He Gin Ser Ala 310 315 320
GCA GGC GCA AAG AGC CGG CAT GGT TTA GAA AAA GAA GAT TTG ATA TTT 1064 Ala Gly Ala Lys Ser Arg His Gly Leu Glu Lys Glu Asp Leu He Phe 325 330 335
AAA ATC AAG GAC GCT GGT TTT GTT GCG GTA GAA AGG GAT AGT TTG TAT 1112 Lys He Lys Asp Ala Gly Phe Val Ala Val Glu Arg Asp Ser Leu Tyr 340 345 350
AAT TTT ATA CAG AAA TTT TAATAATTTT TAGCGTTTTT AAGAATGATT AGTTATAA 1168 Asn Phe He Gin Lys Phe 355 360
TAACGCTACT AACAA 1183
(2) INFORMATION FOR SEQ ID NO: 394: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 360 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394:
Met Asp Phe Leu Glu Lys Val Leu Asp Asn Gin Val Thr Glu Ser Lys
1 5 10 15
Glu Leu Val Arg Leu Tyr Asp Tyr Asp Leu Tyr Thr Leu Gly Glu Val
20 25 30
Ala Asp Arg Met Arg Gin Asn Met His Gin Lys He Val Tyr Phe Asn
35 40 45
Val Asn Arg His Leu Asn Pro Ser Asn He Cys Ala Asp Ala Cys Lys
50 55 60
Phe Cys Ala Phe Ser Ala His Arg Lys Asn Pro Asn Pro Tyr Glu Met 65 70 75 80
Ser Leu Glu Glu He Leu Glu Lys Val Lys Asn Ser Tyr Asn Lys Gly
85 90 95
He Lys Glu Val His 'lie Val Ser Ala His Asn Pro Asn Tyr Ser Tyr
100 105 110
Glu Trp Tyr Leu Lys Val Phe Glu Thr He Lys Gin Glu Met Pro Asn
115 120 125
Leu His Leu Lys Ala Met Thr Ala Ala Glu Val His Phe Leu Ser Val
130 135 140
Lys Phe Asn Lys Pro Phe Glu Leu Val Leu Glu Asp Met Leu Lys Ala 145 150 155 160
Gly Val Asp Ser Met Pro Gly Gly Gly Ala Glu He Phe Asp Glu Glu
165 170 175
He Arg Arg Lys He Cys Asn Gly Lys Val Gly Ser Ser Arg Trp Leu
180 185 190
Glu He His Ala Tyr Trp His Lys Leu Gly Lys Met Ser Asn Ala Thr
195 200 205
Met Leu Phe Gly His He Glu Asn Lys He His Arg He Asp His Met
210 215 220
Leu Arg He Lys Lys He Gin Ser Pro Lys Asn Gin Val Glu Asn Lys 225 230 235 240
Glu Gly Gly Phe Asn Ala Phe He Pro Leu Leu Tyr Gin Lys Glu Asn
245 250 255
Asn Tyr Leu Asn Val Glu Lys Ser Pro Ser Ala He Glu He Leu Lys
260 265 270
Thr He Ala He Ser Arg He Leu Leu Asn Asn He Pro His He Lys
275 280 285
Ala Tyr Trp Ala Thr Leu Gly Leu Asn Leu Ala Leu Val Ala Gin Glu
290 295 300
Phe Gly Ala Asn Asp Leu Asp Gly Thr He Glu He Glu Ser He Gin 305 310 315 320
Ser Ala Ala Gly Ala Lys Ser Arg His Gly Leu Glu Lys Glu Asp Leu
325 330 335
He Phe Lys He Lys Asp Ala Gly Phe Val Ala Val Glu Arg Asp Ser 340 345 350 Leu Tyr Asn Phe He Gin Lys Phe 355 360
(2) INFORMATION FOR SEQ ID NO: 395:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 616 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...563 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395:
TAATGAGAAT TAAACGAAAT TGGATACAAT CAGCTTAAAA AGGATATAAA GTG GAA 56
Val Glu
1
AAA TTA CCT AAA AAA CGA GTT TCT AAA ACC AAA TCA CAA AAA CTT ATC 104 Lys Leu Pro Lys Lys Arg Val Ser Lys Thr Lys Ser Gin Lys Leu He 5 10 15
CAT AGC TTA ACC ACC CAA AAA AAC AGA GCC TTT CTC AAA AAA ATC AGC 152 His Ser Leu Thr Thr Gin Lys Asn Arg Ala Phe Leu Lys Lys He Ser 20 25 30
GCT AAT GAA ATG CTT TTA GAA TTA GAA AAA GGG GCG TTT AAA AAA AAT 200 Ala Asn Glu Met Leu Leu Glu Leu Glu Lys Gly Ala Phe Lys Lys Asn 35 40 45 50
GAA GCT TAT TTT ATT TCT GAT GAA GAA GAT AAA AAT TAT GTT TTG GTG 248 Glu Ala Tyr Phe He Ser Asp Glu Glu Asp Lys Asn Tyr Val Leu Val 55 60 65
CCA GAT AAC GTG ATC TCT CTT TTG GCA GAA AAC GCC AGA AAG GCT TTT 296 Pro Asp Asn Val He Ser Leu Leu Ala Glu Asn Ala Arg Lys Ala Phe 70 75 80
GAA GCC AGG CTT AGG GCG GAA TTA GAA AGG GAT ATT ATC ACC CAA GCG 344 Glu Ala Arg Leu Arg Ala Glu Leu Glu Arg Asp He He Thr Gin Ala 85 90 95
CCG ATT GAT TTT GAA GAC GTG CGC GAA GTT TCC TTG CAA CTA TTG GAA 392 Pro He Asp Phe Glu Asp Val Arg Glu Val Ser Leu Gin Leu Leu Glu 100 105 110 AAT TTA CGC CAA AAA GAT GGG AAT TTG CCT AAT ATC AAC ACC TTA AAC 440 Asn Leu Arg Gin Lys Asp Gly Asn Leu Pro Asn He Asn Thr Leu Asn 115 120 125 130
TTT GTC AAA CAA ATC AAA AAA GAA CAC CCT AAT TTA TTC TTT AAT TTT 488 Phe Val Lys Gin He Lys Lys Glu His Pro Asn Leu Phe Phe Asn Phe 135 140 145
GAC AAC ATG TTC AAA CAA CCC CCT TTT AAT GAG AAT AAT TTT GAA AAT 536 Asp Asn Met Phe Lys Gin Pro Pro Phe Asn Glu Asn Asn Phe Glu Asn 150 155 160
TTT GAC AAT AGC GAT GAG GAA AAT TTT TAATGCAAAC CATTGATTTT GAAAAAT 590 Phe Asp Asn Ser Asp Glu Glu Asn Phe 165 170
TTTCACAATA TTCCAAGCCC GGCCCA 616
(2) INFORMATION FOR SEQ ID NO: 396:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 171 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396:
Val Glu Lys Leu Pro Lys Lys Arg Val Ser Lys Thr Lys Ser Gin Lys
1 5 10 15
Leu He His Ser Leu Thr Thr Gin Lys Asn Arg Ala Phe Leu Lys Lys
20 25 30
He Ser Ala Asn Glu Met Leu Leu Glu Leu Glu Lys Gly Ala Phe Lys
35 40 45
Lys Asn Glu Ala Tyr Phe He Ser Asp Glu Glu Asp Lys Asn Tyr Val
50 55 60
Leu Val Pro Asp Asn Val He Ser Leu Leu Ala Glu Asn Ala Arg Lys 65 70 75 80
Ala Phe Glu Ala Arg Leu Arg Ala Glu Leu Glu Arg Asp He He Thr
85 90 95
Gin Ala Pro He Asp Phe Glu Asp Val Arg Glu Val Ser Leu Gin Leu
100 105 110
Leu Glu Asn Leu Arg Gin Lys Asp Gly Asn Leu Pro Asn He Asn Thr
115 120 125
Leu Asn Phe Val Lys Gin He Lys Lys Glu His Pro Asn Leu Phe Phe
130 135 140
Asn Phe Asp Asn Met Phe Lys Gin Pro Pro Phe Asn Glu Asn Asn Phe 145 150 155 160
Glu Asn Phe Asp Asn Ser Asp Glu Glu Asn Phe 165 170
(2) INFORMATION FOR SEQ ID NO: 397: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 952 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...899 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:397:
GAGTTTGACG CGTATTTGCG TGGGGGCGAA AAACATTTCA GTAAAACGCT ATG AAT 56
Met Asn
1
GAA AAT ATT AAT GAA AAT ATT TTT GAA GAA GTA GGG GAC GCT TGC GTT 104 Glu Asn He Asn Glu Asn He Phe Glu Glu Val Gly Asp Ala Cys Val 5 10 15
AAA TGC GCT AAG TGC GTG CCA GGC TGC ACC ATA TAC CGC ATT CAT AAA 152 Lys Cys Ala Lys Cys Val Pro Gly Cys Thr He Tyr Arg He His Lys 20 25 30
GAC GAG GCG ACT TCG CCT AGA GGC TTT TTA GAT TTG ATG CGC TTA AAC 200 Asp Glu Ala Thr Ser Pro Arg Gly Phe Leu Asp Leu Met Arg Leu Asn 35 40 45 50
GCT CAA AAC AAG CTC CAA TTA GAC ACG AAT TTA AAA CAC CTT TTA GAA 248 Ala Gin Asn Lys Leu Gin Leu Asp Thr Asn Leu Lys His Leu Leu Glu 55 60 65
ACT TGC TTT TTA TGC ACC GCT TGC GTG GAA ATT TGC CCT TTT CAT TTG 296 Thr Cys Phe Leu Cys Thr Ala Cys Val Glu He Cys Pro Phe His Leu 70 75 80
CCC ATA GAC ACC TTA ATA GAA AAA GCC AGA GAA AAA ATC GCT CAA AAG 344 Pro He Asp Thr Leu He Glu Lys Ala Arg Glu Lys He Ala Gin Lys 85 90 95
CAT GGC ATC GCT TGG TAT AAA AAA TCC TAT TTT TCC CTT TTA AAA AAC 392 His Gly He Ala Trp Tyr Lys Lys Ser Tyr Phe Ser Leu Leu Lys Asn 100 105 110
CGC AAA AAA ATG GAT AGG GTG TTT TCA ACT GCG CAT TTT TTA GCC CCT 440 Arg Lys Lys Met Asp Arg Val Phe Ser Thr Ala His Phe Leu Ala Pro 115 120 125 130
TGC GTT TTC AAG CAA GTA GGG GAT AGT TTA GAG CCT AGG GCG GTG TTT 488 Cys Val Phe Lys Gin Val Gly Asp Ser Leu Glu Pro Arg Ala Val Phe 135 140 145
AAA GGT TTG TTC AAA CGC TTT AAA AAA AGC GCG CTG CCT CCT TTA AAT 536 Lys Gly Leu Phe Lys Arg Phe Lys Lys Ser Ala Leu Pro Pro Leu Asn 150 155 160
CAA AAA AGT TTT TTA CAA AAG CAT GCA GAA ATG AAG CTT TTA GAA AAC 584 Gin Lys Ser Phe Leu Gin Lys His Ala Glu Met Lys Leu Leu Glu Asn 165 170 175
CCC ATT CAA AAA GTG GCC ATT TTT ATA GGG TGC TTG AGC AAT TAC CAT 632 Pro He Gin Lys Val Ala He Phe He Gly Cys Leu Ser Asn Tyr His 180 185 190
TAC CAG CAA GTG GGG GAA AGC TTG TTG TAT ATT TTA GAA AAA CTC AAC 680 Tyr Gin Gin Val Gly Glu Ser Leu Leu Tyr He Leu Glu Lys Leu Asn 195 200 205 210
ATT CAA GCG ATC ATC CCT AAG CAA GAA TGC TGC TCA GCG CCT GCG TAT 728 He Gin Ala He He Pre Lys Gin Glu Cys Cys Ser Ala Pro Ala Tyr 215 220 225
TTT ACC GGC GAT AAA GAC ACC ACG CTT TTT TTA GTG AAA AAA AAC ATA 776 Phe Thr Gly Asp Lys Asp Thr Thr Leu Phe Leu Val Lys Lys Asn He 230 235 240
GAA TGG TTT GAA AGC TAT TTA GAT AAA GTG GAT GCG ATC ATT GTG CCT 824 Glu Trp Phe Glu Ser Tyr Leu Asp Lys Val Asp Ala He He Val Pro 245 250 255
GAA GCC ACA TGC GCT ACA TGC TCA TCA ACG ATT ATT ACA AGG TGT TTT 872 Glu Ala Thr Cys Ala Thr Cys Ser Ser Thr He He Thr Arg Cys Phe 260 265 270
TGG GCG AAA AAG ATA AGG ATT TGT ATG TGAAGCGCTT GGAAAAAATC ACGCCTA 926 Trp Ala Lys Lys He Arg He Cys Met 275 280
AAATCTATCT GGCGAGCGTG TTTTTA 952
(2) INFORMATION FOR SEQ ID NO: 398:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 283 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:398:
Met Asn Glu Asn He Asn Glu Asn He Phe Glu Glu Val Gly Asp Ala 1 5 10 15
Cys Val Lys Cys Ala Lys Cys Val Pro Gly Cys Thr He Tyr Arg He
20 25 30
His Lys Asp Glu Ala Thr Ser Pro Arg Gly Phe Leu Asp Leu Met Arg
35 40 45
Leu Asn Ala Gin Asn Lys Leu Gin Leu Asp Thr Asn Leu Lys His Leu
50 55 60
Leu Glu Thr Cys Phe Leu Cys Thr Ala Cys Val Glu He Cys Pro Phe 65 70 75 80
His Leu Pro He Asp Thr Leu He Glu Lys Ala Arg Glu Lys He Ala
85 90 95
Gin Lys His Gly He Ala Trp Tyr Lys Lys Ser Tyr Phe Ser Leu Leu
100 105 110
Lys Asn Arg Lys Lys Met Asp Arg Val Phe Ser Thr Ala His Phe Leu
115 120 125
Ala Pro Cys Val Phe Lys Gin Val Gly Asp Ser Leu Glu Pro Arg Ala
130 135 140
Val Phe Lys Gly Leu Phe Lys Arg Phe Lys Lys Ser Ala Leu Pro Pro 145 150 155 160
Leu Asn Gin Lys Ser Phe Leu Gin Lys His Ala Glu Met Lys Leu Leu
165 170 175
Glu Asn Pro He Gin Lys Val Ala He Phe He Gly Cys Leu Ser Asn
180 185 190
Tyr His Tyr Gin Gin Val Gly Glu Ser Leu Leu Tyr He Leu Glu Lys
195 200 205
Leu Asn He Gin Ala He He Pro Lys Gin Glu Cys Cys Ser Ala Pro
210 215 220
Ala Tyr Phe Thr Gly Asp Lys Asp Thr Thr Leu Phe Leu Val Lys Lys 225 230 235 240
Asn He Glu Trp Phe Glu Ser Tyr Leu Asp Lys Val Asp Ala He He
245 250 255
Val Pro Glu Ala Thr Cys Ala Thr Cys Ser Ser Thr He He Thr Arg
260 265 270
Cys Phe Trp Ala Lys Lys He Arg He Cys Met 275 280
(2) INFORMATION FOR SEQ ID NO: 399:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1361 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...1305 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: TATTGATAGC ATGAGTTGTT TTTGGTTTGG AATTTTAAGG AGTAGCTT ATG AAA GAG 57
Met Lys Glu
1
CAA TCA ATG ATT GAT TTT TTA AAA CTT AGA GAT TAT GAC ATT AGA AAA 105 Gin Ser Met He Asp Phe Leu Lys Leu Arg Asp Tyr Asp He Arg Lys 5 10 15
ACA CAA AAT GCG CGA TGG ATA GAT CAA AAA TGC ACC CCT GAT GTG TTG 153 Thr Gin Asn Ala Arg Trp He Asp Gin Lys Cys Thr Pro Asp Val Leu 20 25 30 35
TCT CTT GTT GCT GAT TGT ATT TTA GAG TTT ACG CAA TGT AAT ATT GGA 201 Ser Leu Val Ala Asp Cys He Leu Glu Phe Thr Gin Cys Asn He Gly 40 45 50
AAA TCA TTT TCT ATT AGG GAT ATT TGG GAT AGC CCT TAC ACC AAT GAA 249 Lys Ser Phe Ser He Arg Asp He Trp Asp Ser Pro Tyr Thr Asn Glu 55 60 65
AAT GTT AAA ATG ATT TTT TCT AAA CCT GAT TTA AAT TCT GAC TTT TCC 297 Asn Val Lys Met He Phe Ser Lys Pro Asp Leu Asn Ser Asp Phe Ser 70 75 80
ATG CAT GAA TAC GAT AAG TTT TTT TCT CAG CCT ATT AAA TTA TTA GCC 345 Met His Glu Tyr Asp Lys Phe Phe Ser Gin Pro He Lys Leu Leu Ala 85 90 95
TAT AGC GGT ATT TTA TTT GAA ACA AAA ACT GGC AAT AGA AAT ATT TAT 393 Tyr Ser Gly He Leu Phe Glu Thr Lys Thr Gly Asn Arg Asn He Tyr 100 105 110 115
ACC ATA CAA AAC ATA GAG CTA TTA GAA TAT CTC ATG CAA AGA GAA ACA 441 Thr He Gin Asn He Glu Leu Leu Glu Tyr Leu Met Gin Arg Glu Thr 120 125 130
AAC GCT TTG AAA TTC CTT ATT TTA TAT ATT CAA AAG GTA TTA ATG GAT 489 Asn Ala Leu Lys Phe Leu He Leu Tyr He Gin Lys Val Leu Met Asp 135 140 145
AGT GGG ATT TAT CCT TTA TTT GAC AAC TTT TTA CAA AAA CAA GAC ACA 537 Ser Gly He Tyr Pro Leu Phe Asp Asn Phe Leu Gin Lys Gin Asp Thr 150 155 160
GAA AGT TTT AAG CAA CTA AAA GAT GGT TTC ACT CAT TTT ACT ATC AAT 585 Glu Ser Phe Lys Gin Leu Lys Asp Gly Phe Thr His Phe Thr He Asn 165 170 175
AAC ACA GCA ATC AAT AAC GCT ACG GAA TGT TTT AGG ATT TTT ACT AAA 633 Asn Thr Ala He Asn Asn Ala Thr Glu Cys Phe Arg He Phe Thr Lys 180 185 190 195
ATT ATC AAT CCT TTA GCT TTT TAT TAT GGT AAA AAA GGC ACA AGA AAA 681 He He Asn Pro Leu Ala Phe Tyr Tyr Gly Lys Lys Gly Thr Arg Lys 200 205 210 GGG TAT TTG TCC AAC ACT ATA ATT ACA AAA GAT GAG CTT AAT TAT AAT 729 Gly Tyr Leu Ser Asn Thr He He Thr Lys Asp Glu Leu Asn Tyr Asn 215 220 225
CGT ATC AAT TGG CGA GAT ATA GGA AAA GAT AAA AAT ACC ACC AGA CAA 777 Arg He Asn Trp Arg Asp He Gly Lys Asp Lys Asn Thr Thr Arg Gin 230 235 240
GAA TAC GAT CTT ATA AAC TCT AAA AGG ATT GCT AAT TCT AAC TAT CTT 825 Glu Tyr Asp Leu He Asn Ser Lys Arg He Ala Asn Ser Asn Tyr Leu 245 250 255
ATT TCA AAA GCT AAG AAA GTG GTG AAA CGA TAT AAT GAT AGA TTT AAT 873 He Ser Lys Ala Lys Lys Val Val Lys Arg Tyr Asn Asp Arg Phe Asn 260 265 270 275
AAT TCT CTC TCT GAA GTA AAA CAA GAA AAA GAA GAG TCG CAA GCC ACA 921 Asn Ser Leu Ser Glu Val Lys Gin Glu Lys Glu Glu Ser Gin Ala Thr 280 285 290
CAA ATA CAC CAT ATT TTT CCC ATC CAA GAC TTT CCC ATT ATT GCT AAC 969 Gin He His His He Phe Pro He Gin Asp Phe Pro He He Ala Asn 295 300 305
TAT ATA GAG AAT CTT ATC GCA CTC ACT CCT AAT CAA CAT TTT ATT TAC 1017 Tyr He Glu Asn Leu He Ala Leu Thr Pro Asn Gin His Phe He Tyr 310 315 320
GCC CAC CCT AAT AAT CAA ACC CGC TTG ATT GAT AAA GAT TTT CAA TAT 1065 Ala His Pro Asn Asn Gin Thr Arg Leu He Asp Lys Asp Phe Gin Tyr 325 330 335
ATC TGC TTA TTA GCT AAA ACG ACC ACA ATT CTT AAT GAC ACT CAA GGC 1113 He Cys Leu Leu Ala Lys Thr Thr Thr He Leu Asn Asp Thr Gin Gly 340 345 350 355
GTA TAT GAT TGG AAT GAT TAT ATT GTT GTG TTG AAT ATG GGC CTC AAA 1161 Val Tyr Asp Trp Asn Asp Tyr He Val Val Leu Asn Met Gly Leu Lys 360 365 370
ACA ACT ATC TTT TCT CAA GTC AAG AAC GAA TGG GAA TTA TTA AAA GTA 1209 Thr Thr He Phe Ser Gin Val Lys Asn Glu Trp Glu Leu Leu Lys Val 375 380 385
ATA GAT GCT TTT TAT TTT GAT TTT AAC AAG AGC AAA GAT CCA AGT TGG 1257 He Asp Ala Phe Tyr Phe Asp Phe Asn Lys Ser Lys Asp Pro Ser Trp 390 395 400
TCA TAC TTG CTA GAT AAA AAC GAT TTA AGA GCT TTC AAG CTA AAA TTT T 1306 Ser Tyr Leu Leu Asp Lys Asn Asp Leu Arg Ala Phe Lys Leu Lys Phe 405 410 415
AATAAGTTTT ATTGAAACTG GCTATAAAAA CCCGCTTGAC TTATCTTATC CTTTT 1361
(2) INFORMATION FOR SEQ ID NO: 400: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 419 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400:
Met Lys Glu Gin Ser Met He Asp Phe Leu Lys Leu Arg Asp Tyr Asp
1 5 10 15
He Arg Lys Thr Gin Asn Ala Arg Trp He Asp Gin Lys Cys Thr Pro
20 25 30
Asp Val Leu Ser Leu Val Ala Asp Cys He Leu Glu Phe Thr Gin Cys
35 40 45
Asn He Gly Lys Ser Phe Ser He Arg Asp He Trp Asp Ser Pro Tyr
50 55 60
Thr Asn Glu Asn Val Lys Met He Phe Ser Lys Pro Asp Leu Asn Ser 65 70 75 80
Asp Phe Ser Met His Glu Tyr Asp Lys Phe Phe Ser Gin Pro He Lys
85 90 95
Leu Leu Ala Tyr Ser Gly He Leu Phe Glu Thr Lys Thr Gly Asn Arg
100 105 110
Asn He Tyr Thr He Gin Asn He Glu Leu Leu Glu Tyr Leu Met Gin
115 120 125
Arg Glu Thr Asn Ala Leu Lys Phe Leu He Leu Tyr He Gin Lys Val
130 135 140
Leu Met Asp Ser Gly He Tyr Pro Leu Phe Asp Asn Phe Leu Gin Lys 145 150 155 160
Gin Asp Thr Glu Ser Phe Lys Gin Leu Lys Asp Gly Phe Thr His Phe
165 170 175
Thr He Asn Asn Thr Ala He Asn Asn Ala Thr Glu Cys Phe Arg He
180 185 190
Phe Thr Lys He He Asn Pro Leu Ala Phe Tyr Tyr Gly Lys Lys Gly
195 200 205
Thr Arg Lys Gly Tyr Leu Ser Asn Thr He He Thr Lys Asp Glu Leu
210 215 220
Asn Tyr Asn Arg He Asn Trp Arg Asp He Gly Lys Asp Lys Asn Thr 225 230 235 240
Thr Arg Gin Glu Tyr Asp Leu He Asn Ser Lys Arg He Ala Asn Ser
245 250 255
Asn Tyr Leu He Ser Lys Ala Lys Lys Val Val Lys Arg Tyr Asn Asp
260 265 270
Arg Phe Asn Asn Ser Leu Ser Glu Val Lys Gin Glu Lys Glu Glu Ser
275 280 285
Gin Ala Thr Gin He His His He Phe Pro He Gin Asp Phe Pro He
290 295 300
He Ala Asn Tyr He Glu Asn Leu He Ala Leu Thr Pro Asn Gin His 305 310 315 320
Phe He Tyr Ala His Pro Asn Asn Gin Thr Arg Leu He Asp Lys Asp
325 330 335
Phe Gin Tyr He Cys Leu Leu Ala Lys Thr Thr Thr He Leu Asn Asp 340 345 350 Thr Gin Gly Val Tyr Asp Trp Asn Asp Tyr He Val Val Leu Asn Met
355 360 365
Gly Leu Lys Thr Thr He Phe Ser Gin Val Lys Asn Glu Trp Glu Leu
370 375 380
Leu Lys Val He Asp Ala Phe Tyr Phe Asp Phe Asn Lys Ser Lys Asp 385 390 395 400
Pro Ser Trp Ser Tyr Leu Leu Asp Lys Asn Asp Leu Arg Ala Phe Lys
405 410 415
Leu Lys Phe
(2) INFORMATION FOR SEQ ID NO: 401:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 763 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 53...709 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401:
AAATAAGCTA ATCTTTGCTA AAATGAGATT TAAAATTATT TAAGGAAGAT GA ATG CTT 58
Met Leu 1
TTT GCG ATG ATT GGT TCA GGG GGG TTT ATC GCT CCC AAG CAC TTG CAA 106 Phe Ala Met He Gly Ser Gly Gly Phe He Ala Pro Lys His Leu Gin 5 10 15
GCG ATT AGA GAT ACA GGG CAT TTT TTG GAT TGC TCT TTT GAT ATT CAT 154 Ala He Arg Asp Thr Gly His Phe Leu Asp Cys Ser Phe Asp He His 20 25 30
GAT AGC GTG GGG GTT TTA GAT GAG TAT TTC GCG CAA TCA GAG TTT TTT 202 Asp Ser Val Gly Val Leu Asp Glu Tyr Phe Ala Gin Ser Glu Phe Phe 35 40 45 50
ACG AAT ATT GAA GAT TTT GAA AAG CAT TTA GAG CAA TCT AAG GAT ATG 250 Thr Asn He Glu Asp Phe Glu Lys His Leu Glu Gin Ser Lys Asp Met 55 60 65
GGT AAA GAA ATC AAC TAT TTG AGT GTT TGC ACG CCT ACG CAC ACG CAT 298 Gly Lys Glu He Asn Tyr Leu Ser Val Cys Thr Pro Thr His Thr His 70 75 80 TTT GAT CAC ATC CGT TTC GGG TTA AGA AAC GGC ATG CAT GTG ATT TGT 346 Phe Asp His He Arg Phe Gly Leu Arg Asn Gly Met His Val He Cys 85 90 95
GAA AAA CCC TTA GTT TTA GAC CCT GGC GAA ATA CAA GAA TTG AAA GAT 394 Glu Lys Pro Leu Val Leu Asp Pro Gly Glu He Gin Glu Leu Lys Asp 100 105 110
TTA GAG GTG AAA CAC CAA AAA AGG GTG TTT AGT CTT TTA CCC TTG CGC 442 Leu Glu Val Lys His Gin Lys Arg Val Phe Ser Leu Leu Pro Leu Arg 115 120 125 130
TTG CAT TGC GAC ACG CTG GCT TTG AAA GAA AAA ATT AAG AGC GAA TTA 490 Leu His Cys Asp Thr Leu Ala Leu Lys Glu Lys He Lys Ser Glu Leu 135 140 145
GAC AAA AAC CCT AGC AAG GTG TTT GAC ATC ACG CTC ACT TAT ATC AGC 538 Asp Lys Asn Pro Ser Lys Val Phe Asp He Thr Leu Thr Tyr He Ser 150 155 160
GTT CAA GGG AAA TGG TAT TTT TCT TCA TGG CGA GCG GAT GTG AAT AGG 586 Val Gin Gly Lys Trp Tyr Phe Ser Ser Trp Arg Ala Asp Val Asn Arg 165 170 175
AGC GGA GGG TTA GCC ACT CAA ATG GGG GTG AAT ATT TTT GAC ACT TTA 634 Ser Gly Gly Leu Ala Thr Gin Met Gly Val Asn He Phe Asp Thr Leu 180 185 190
ATC TAT TTG TTT GGA AGC GTT AAA GAC AAG GTT ATC AAT AAA GAA GAG 682 He Tyr Leu Phe Gly Ser Val Lys Asp Lys Val He Asn Lys Glu Glu 195 200 205 210
CCT GAT TGC GTA GGG GGA TAC TCT TTT TAGAGCATGC CAAAATAAGA TGGTTTT 736 Pro Asp Cys Val Gly Gly Tyr Ser Phe 215
TTTCCATCAA TCCAGAACAC ATGGGAG 763
(2) INFORMATION FOR SEQ ID NO: 402:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 219 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402:
Met Leu Phe Ala Met He Gly Ser Gly Gly Phe He Ala Pro Lys His
1 5 10 15
Leu Gin Ala He Arg Asp Thr Gly His Phe Leu Asp Cys Ser Phe Asp 20 25 30 He His Asp Ser Val Gly Val Leu Asp Glu Tyr Phe Ala Gin Ser Glu
35 40 45
Phe Phe Thr Asn He Glu Asp Phe Glu Lys His Leu Glu Gin Ser Lys
50 55 60
Asp Met Gly Lys Glu He Asn Tyr Leu Ser Val Cys Thr Pro Thr His 65 70 75 80
Thr His Phe Asp His He Arg Phe Gly Leu Arg Asn Gly Met His Val
85 90 95
He Cys Glu Lys Pro Leu Val Leu Asp Pro Gly Glu He Gin Glu Leu
100 105 110
Lys Asp Leu Glu Val Lys His Gin Lys Arg Val Phe Ser Leu Leu Pro
115 120 125
Leu Arg Leu His Cys Asp Thr Leu Ala Leu Lys Glu Lys He Lys Ser
130 135 140
Glu Leu Asp Lys Asn Pro Ser Lys Val Phe Asp He Thr Leu Thr Tyr 145 150 155 160
He Ser Val Gin Gly Lys Trp Tyr Phe Ser Ser Trp Arg Ala Asp Val
165 170 175
Asn Arg Ser Gly Gly Leu Ala Thr Gin Met Gly Val Asn He Phe Asp
180 185 190
Thr Leu He Tyr Leu Phe Gly Ser Val Lys Asp Lys Val He Asn Lys
195 200 205
Glu Glu Pro Asp Cys Val Gly Gly Tyr Ser Phe 210 215
(2) INFORMATION FOR SEQ ID NO:403:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1465 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1412 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403:
GACAGAAGCT GAATTTGAGG TGCGCTTATA GCTTGTAAAA GGGGGTGTTT ATG TTT 56
Met Phe 1
TTA TTA AGG CAT TTG ACT TCA GCG TGC GTG TTT TTG GCG TCT AAA TGT 104 Leu Leu Arg His Leu Thr Ser Ala Cys Val Phe Leu Ala Ser Lys Cys 5 10 15
TTG CCG GAC TCC TTT GTC TTG GTC GCT CTT TTA TCG TTT GTC GTG TTT 152 Leu Pro Asp Ser Phe Val Leu Val Ala Leu Leu Ser Phe Val Val Phe 20 25 30
GTT CTT GTT TAT TGC TTG ACA GGG CAA GAC GCT TTT TCT GTC ATT TCT 200 Val Leu Val Tyr Cys Leu Thr Gly Gin Asp Ala Phe Ser Val He Ser 35 40 45 50
AGT TGG GGG AAT GGC GCT TGG ACG CTT TTA GGT TTT TCT ATG CAA ATG 248 Ser Trp Gly Asn Gly Ala Trp Thr Leu Leu Gly Phe Ser Met Gin Met 55 60 65
GCC CTT ATT TTG GTG TTG GGT CAG GCT CTG GCT AAC GCT AAA TTA GTC 296 Ala Leu He Leu Val Leu Gly Gin Ala Leu Ala Asn Ala Lys Leu Val 70 75 80
CAA AAG CTT TTA AAA TAT CTA GCG TCT TTA CCT AAA GGG TAT TAT ACG 344 Gin Lys Leu Leu Lys Tyr Leu Ala Ser Leu Pro Lys Gly Tyr Tyr Thr 85 90 95
GCT TTA TGG TTG GTT ACT TTT TTA TCG TTA ATC GCT AAT TGG ATC AAC 392 Ala Leu Trp Leu Val Thr Phe Leu Ser Leu He Ala Asn Trp He Asn 100 105 110
TGG GGT TTT GGC TTG GTG ATT AGT GCG ATT TTT GCA AAA GAG ATC GCC 440 Trp Gly Phe Gly Leu Val He Ser Ala He Phe Ala Lys Glu He Ala 115 120 125 130
AAA AAT GTT AAG GGG GTG GAT TAC AGG CTG CTC ATT GCT AGC GCT TAT 488 Lys Asn Val Lys Gly Val Asp Tyr Arg Leu Leu He Ala Ser Ala Tyr 135 140 145
TCG GGT TTT GTC ATC TGG CAT GGG GGT TTA TCA GGC TCT ATC CCT TTA 536 Ser Gly Phe Val He Trp His Gly Gly Leu Ser Gly Ser He Pro Leu 150 155 160
AGC GTT GCC ACC CAA AAT GAA AAT CTA TCC AAA ATA AGC GCT GGG GTG 584 Ser Val Ala Thr Gin Asn Glu Asn Leu Ser Lys He Ser Ala Gly Val 165 170 175
ATT GAA AAA GCT ATC CCT ATC AGT CAG ACG ATT TTT TCT TCT TAT AAT 632 He Glu Lys Ala He Pro He Ser Gin Thr He Phe Ser Ser Tyr Asn 180 185 190
TTA ATC ATT ATA GGG ATC ATT CTT GTA GGG TTA CCC TTT TTA ATG GCA 680 Leu He He He Gly He He Leu Val Gly Leu Pro Phe Leu Met Ala 195 200 205 210
ATG ATC CAC CCT AAA AAA GAA GAA ATC GTT GAG ATT GAT TCA AAG CTT 728 Met He His Pro Lys Lys Glu Glu He Val Glu He Asp Ser Lys Leu 215 220 225
TTA AAA GAC GAG TAC AAA GAG ATT GAA CTC ATT AGC CAC CAA CAA GAC 776 Leu Lys Asp Glu Tyr Lys Glu He Glu Leu He Ser His Gin Gin Asp 230 235 240
AAA ACG ATC GCG CAT TTT TTG GAA AAC AGC GCT TTG CTT TCT TAT CTT 824 Lys Thr He Ala His Phe Leu Glu Asn Ser Ala Leu Leu Ser Tyr Leu 245 250 255
TTG GTT TTT TTG GGT TTT GGG TAT CTT GGT GTT TAT TTT TTT AAA GGG 872 Leu Val Phe Leu Gly Phe Gly Tyr Leu Gly Val Tyr Phe Phe Lys Gly 260 265 270
GGA GGG ATT AGT TTA AAC ATT GTC AAT ACG ATT TTC CTT TTT TTA GGG 920 Gly Gly He Ser Leu Asn He Val Asn Thr He Phe Leu Phe Leu Gly 275 280 285 290
ATT TTA CTG CAT AAA ACC CCT TTA GCT TAT GTG AAA GCG ATC GAT CGT 968 He Leu Leu His Lys Thr Pro Leu Ala Tyr Val Lys Ala He Asp Arg 295 300 305
TCC GCT ANG AGC GTG GCT GGG ATT TTA TTG CAA TTC CCT TTT TAC GCT 1016 Ser Ala Xaa Ser Val Ala Gly He Leu Leu Gin Phe Pro Phe Tyr Ala 310 315 320
GGG ATT ATG GGG ATG ATG GCA AGC CAT AGC GTG GGG GGT CAT TCT TTA 1064 Gly He Met Gly Met Met Ala Ser His Ser Val Gly Gly His Ser Leu 325 330 335
GCG CAA ATG CTT TCT TTA GCT TTC ACG CAC ATC GCT AAT GAA AAA ACT 1112 Ala Gin Met Leu Ser Leu Ala Phe Thr His He Ala Asn Glu Lys Thr 340 345 350
TTC GTG CTC ATG ACT TTT TTG AGC GCA GGG ATT GTC AAT ATT TTT ATT 1160 Phe Val Leu Met Thr Phe Leu Ser Ala Gly He Val Asn He Phe He 355 360 365 370
CCG TCT GGC GGA GGG CAA TGG GCG ATT CAA GCT CCT ATC ATG CTT CCG 1208 Pro Ser Gly Gly Gly Gin Trp Ala He Gin Ala Pro He Met Leu Pro 375 380 385
GCT GGG CAA AGC TTA GGG GTG GAT CCG GGA GTG GTT TCT ATG GCT ATC 1256 Ala Gly Gin Ser Leu Gly Val Asp Pro Gly Val Val Ser Met Ala He 390 395 400
GCT TGG GGA GAT GCT TGG ACG AAT ATG ATA CAG CCT TTT TGG GCT TTG 1304 Ala Trp Gly Asp Ala Trp Thr Asn Met He Gin Pro Phe Trp Ala Leu 405 410 415
CCC GCT TTA GCC ATT GCG GGT TTG GGC GCT AAA GAT ATT ATG GGC TAT 1352 Pro Ala Leu Ala He Ala Gly Leu Gly Ala Lys Asp He Met Gly Tyr 420 425 430
TGC GTT TTG ACT TTA ATT TTT GTA GGC TTA GTC GTG TGT GGG GTG TTT 1400 Cys Val Leu Thr Leu He Phe Val Gly Leu Val Val Cys Gly Val Phe 435 440 445 450
TAT TTT TTA GTG TGAGTTTTTT ATGCCTAAAA CCATGCTCTT TTCAATGGGG TAAGG 1457 Tyr Phe Leu Val TTTTCTCT 1465
(2) INFORMATION FOR SEQ ID NO:404:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 454 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404:
Met Phe Leu Leu Arg His Leu Thr Ser Ala Cys Val Phe Leu Ala Ser
1 5 10 15
Lys Cys Leu Pro Asp Ser Phe Val Leu Val Ala Leu Leu Ser Phe Val
20 25 30
Val Phe Val Leu Val Tyr Cys Leu Thr Gly Gin Asp Ala Phe Ser Val
35 40 45
He Ser Ser Trp Gly Asn Gly Ala Trp Thr Leu Leu Gly Phe Ser Met
50 55 60
Gin Met Ala Leu He Leu Val Leu Gly Gin Ala Leu Ala Asn Ala Lys 65 70 75 80
Leu Val Gin Lys Leu Leu Lys Tyr Leu Ala Ser Leu Pro Lys Gly Tyr
85 90 95
Tyr Thr Ala Leu Trp Leu Val Thr Phe Leu Ser Leu He Ala Asn Trp
100 105 110
He Asn Trp Gly Phe Gly Leu Val He Ser Ala He Phe Ala Lys Glu
115 120 125
He Ala Lys Asn Val Lys Gly Val Asp Tyr Arg Leu Leu He Ala Ser
130 135 140
Ala Tyr Ser Gly Phe Val He Trp His Gly Gly Leu Ser Gly Ser He 145 150 155 160
Pro Leu Ser Val Ala Thr Gin Asn Glu Asn Leu Ser Lys He Ser Ala
165 170 175
Gly Val He Glu Lys Ala He Pro He Ser Gin Thr He Phe Ser Ser
180 185 190
Tyr Asn Leu He He He Gly He He Leu Val Gly Leu Pro Phe Leu
195 200 205
Met Ala Met He His Pro Lys Lys Glu Glu He Val Glu He Asp Ser
210 215 220
Lys Leu Leu Lys Asp Glu Tyr Lys Glu He Glu Leu He Ser His Gin 225 230 235 240
Gin Asp Lys Thr He Ala His Phe Leu Glu Asn Ser Ala Leu Leu Ser
245 250 255
Tyr Leu Leu Val Phe Leu Gly Phe Gly Tyr Leu Gly Val Tyr Phe Phe
260 265 270
Lys Gly Gly Gly He Ser Leu Asn He Val Asn Thr He Phe Leu Phe
275 280 285
Leu Gly He Leu Leu His Lys Thr Pro Leu Ala Tyr Val Lys Ala He
290 295 300
Asp Arg Ser Ala Xaa Ser Val Ala Gly He Leu Leu Gin Phe Pro Phe 305 310 315 320 Tyr Ala Gly He Met Gly Met Met Ala Ser His Ser Val Gly Gly His
325 330 335
Ser Leu Ala Gin Met Leu Ser Leu Ala Phe Thr His He Ala Asn Glu
340 345 350
Lys Thr Phe Val Leu Met Thr Phe Leu Ser Ala Gly He Val Asn He
355 360 365
Phe He Pro Ser Gly Gly Gly Gin Trp Ala He Gin Ala Pro He Met
370 375 380
Leu Pro Ala Gly Gin Ser Leu Gly Val Asp Pro Gly Val Val Ser Met 385 390 395 400
Ala He Ala Trp Gly Asp Ala Trp Thr Asn Met He Gin Pro Phe Trp
405 410 415
Ala Leu Pro Ala Leu Ala He Ala Gly Leu Gly Ala Lys Asp He Met
420 425 430
Gly Tyr Cys Val Leu Thr Leu He Phe Val Gly Leu Val Val Cys Gly
435 440 445
Val Phe Tyr Phe Leu Val 450
(2) INFORMATION FOR SEQ ID NO: 405:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1114 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1061 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:405:
TATCACTATA CTTTCAATAA AATAAGGGGT TTTTTTTGAC TAAAAAATTC ATG TCT 56
Met Ser 1
TGG ATG GTG GTT ATT GGG GCT TTA ATT TGC ATG CTT TTA GGG GTG TTT 104 Trp Met Val Val He Gly Ala Leu He Cys Met Leu Leu Gly Val Phe 5 10 15
ATC TTC TTC ACT AGC ATG TCG GTT AAA AAA TTT TTA AGC GCT TAT CTT 152 He Phe Phe Thr Ser Met Ser Val Lys Lys Phe Leu Ser Ala Tyr Leu 20 25 30
AAC GCT TAT TTG GAT CAA CGC CCC CAT ATT AAG GGC ATG GGG ATT GCA 200 Asn Ala Tyr Leu Asp Gin Arg Pro His He Lys Gly Met Gly He Ala 35 40 45 50 GGC ACT CCC TTT GAA TGC GAA GGG TTT TTT AAA ATC GCA TGC GTT TCT 248 Gly Thr Pro Phe Glu Cys Glu Gly Phe Phe Lys He Ala Cys Val Ser 55 60 65
AAA GAG CTC AGT TTT TTA GAC TCT CAA AAC TCC CCT ATT GTG AAT TTT 296 Lys Glu Leu Ser Phe Leu Asp Ser Gin Asn Ser Pro He Val Asn Phe 70 75 80
AAA AAT TTG AGT ATT AAG CTC CGT TCT TTA GAT AAA AGC TCT CTT ACT 344 Lys Asn Leu Ser He Lys Leu Arg Ser Leu Asp Lys Ser Ser Leu Thr 85 90 95
CTT TCT GTC CAT TCT CAA ATC AAA TCC CCT ATT TTA GAA CAA GAT ATG 392 Leu Ser Val His Ser Gin He Lys Ser Pro He Leu Glu Gin Asp Met 100 105 110
CAG CAA AAA ATC AGC CAA ATC CCC CTA AAA GAC TTG AAT GCC TTA TTA 440 Gin Gin Lys He Ser Gin He Pro Leu Lys Asp Leu Asn Ala Leu Leu 115 120 125 130
GAA AAA ATG AAA CCC ACG CGC TTG AAT TGC TCT TTA ACA TTC AAC GCT 488 Glu Lys Met Lys Pro Thr Arg Leu Asn Cys Ser Leu Thr Phe Asn Ala 135 140 145
CTA GAT GAA AAA ACC TTA AAC GAC AAC TTA AAA TGC GAT TTG ACT AAT 536 Leu Asp Glu Lys Thr Leu Asn Asp Asn Leu Lys Cys Asp Leu Thr Asn 150 155 160
GCG GAA AAT ATC CTT GCT TAC ACT TTT TTT CAA GAG GGT TTA ATG GAG 584 Ala Glu Asn He Leu Ala Tyr Thr Phe Phe Gin Glu Gly Leu Met Glu 165 170 175
GCT CAA GAA AAT CTA TCC CTT AAA AAT ATT TTT AAA ACC TTG AGT TCT 632 Ala Gin Glu Asn Leu Ser Leu Lys Asn He Phe Lys Thr Leu Ser Ser 180 185 190
AAA GAC GCT AAA GCC ATA GAA GAG TTG CAA GAC AAA CTG CGT TTT TCA 680 Lys Asp Ala Lys Ala He Glu Glu Leu Gin Asp Lys Leu Arg Phe Ser 195 200 205 210
GCG CCA AAG TTG GGC GTT TCT ATC CAA GCG CAC CAT CTT AAA AAC CTT 728 Ala Pro Lys Leu Gly Val Ser He Gin Ala His His Leu Lys Asn Leu 215 220 225
TTG GAA GCC TTT TAT CAC CAA AAT AAA GAG AGT TTG GGC TTT TTT TCC 776 Leu Glu Ala Phe Tyr His Gin Asn Lys Glu Ser Leu Gly Phe Phe Ser 230 235 240
CCT TAT TTT AGT TTG CGA TCT CAA ACC CCT AGC GTC TCT TAT GAA AGC 824 Pro Tyr Phe Ser Leu Arg Ser Gin Thr Pro Ser Val Ser Tyr Glu Ser 245 250 255
GCG TTA GCT TCT TTA GAA AAC TAT TTT ATG GCT TTG TTC CAA TCC CAT 872 Ala Leu Ala Ser Leu Glu Asn Tyr Phe Met Ala Leu Phe Gin Ser His 260 265 270 TTT AAA GAC GAT ACC GCA CTC CAA CAG AAT TTT AAA GGA TTG TTG CAA 920 Phe Lys Asp Asp Thr Ala Leu Gin Gin Asn Phe Lys Gly Leu Leu Gin 275 280 285 290
GCC TTT GTT TCT ATG GCT AAA GAC AAA CGA TCC CAA ATC GCT CTT AAC 968 Ala Phe Val Ser Met Ala Lys Asp Lys Arg Ser Gin He Ala Leu Asn 295 300 305
GCC CAA GCT AAA GAC AAC GCC AAG CTA ACT TTT AAC GCC TTG TTA GAA 1016 Ala Gin Ala Lys Asp Asn Ala Lys Leu Thr Phe Asn Ala Leu Leu Glu 310 315 320
AGC CTT AGC GTG AAT TTC TTT CAA TCT TAC AAA ATA AGC CAT GAG TGATT 1066 Ser Leu Ser Val Asn Phe Phe Gin Ser Tyr Lys He Ser His Glu 325 330 335
TCGAAGTCCC CCCCAAAGCT AAAGGGTTTA AACGCCTTTT TAAAGCCC 1114
(2) INFORMATION FOR SEQ ID NO: 406:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 337 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406:
Met Ser Trp Met Val Val He Gly Ala Leu He Cys Met Leu Leu Gly
1 5 10 15
Val Phe He Phe Phe Thr Ser Met Ser Val Lys Lys Phe Leu Ser Ala
20 25 30
Tyr Leu Asn Ala Tyr Leu Asp Gin Arg Pro His He Lys Gly Met Gly
35 40 45
He Ala Gly Thr Pro Phe Glu Cys Glu Gly Phe Phe Lys He Ala Cys
50 55 60
Val Ser Lys Glu Leu Ser Phe Leu Asp Ser Gin Asn Ser Pro He Val 65 70 75 80
Asn Phe Lys Asn Leu Ser He Lys Leu Arg Ser Leu Asp Lys Ser Ser
85 90 95
Leu Thr Leu Ser Val His Ser Gin He Lys Ser Pro He Leu Glu Gin
100 105 110
Asp Met Gin Gin Lys He Ser Gin He Pro Leu Lys Asp Leu Asn Ala
115 120 125
Leu Leu Glu Lys Met Lys Pro Thr Arg Leu Asn Cys Ser Leu Thr Phe
130 135 140
Asn Ala Leu Asp Glu Lys Thr Leu Asn Asp Asn Leu Lys Cys Asp Leu 145 150 155 160
Thr Asn Ala Glu Asn He Leu Ala Tyr Thr Phe Phe Gin Glu Gly Leu
165 170 175
Met Glu Ala Gin Glu Asn Leu Ser Leu Lys Asn He Phe Lys Thr Leu 180 185 190 Ser Ser Lys Asp Ala Lys Ala He Glu Glu Leu Gin Asp Lys Leu Arg
195 200 205
Phe Ser Ala Pro Lys Leu Gly Val Ser He Gin Ala His His Leu Lys
210 215 220
Asn Leu Leu Glu Ala Phe Tyr His Gin Asn Lys Glu Ser Leu Gly Phe 225 230 235 240
Phe Ser Pro Tyr Phe Ser Leu Arg Ser Gin Thr Pro Ser Val Ser Tyr
245 250 255
Glu Ser Ala Leu Ala Ser Leu Glu Asn Tyr Phe Met Ala Leu Phe Gin
260 265 270
Ser His Phe Lys Asp Asp Thr Ala Leu Gin Gin Asn Phe Lys Gly Leu
275 280 285
Leu Gin Ala Phe Val Ser Met Ala Lys Asp Lys Arg Ser Gin He Ala
290 295 300
Leu Asn Ala Gin Ala Lys Asp Asn Ala Lys Leu Thr Phe Asn Ala Leu 305 310 315 320
Leu Glu Ser Leu Ser Val Asn Phe Phe Gin Ser Tyr Lys He Ser His
325 330 335
Glu
(2) INFORMATION FOR SEQ ID NO: 407:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 445 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...392 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:407:
AAAATGAGGG TGTTTCAATT CAAGCCATGA GTAAGTTATA TGAGTAAGCC ATG AAT 56
Met Asn
1
ATC AAA ACC CAT TCT TCA AAT GAA AAA GAA CGC TTT GTG CGC ATA GAA 104 He Lys Thr His Ser Ser Asn Glu Lys Glu Arg Phe Val Arg He Glu 5 10 15
GAG GAC GAA AAG AAA GGA TTA TTT GCT GGA ACT GCA AAT GAA AAT TCG 152 Glu Asp Glu Lys Lys Gly Leu Phe Ala Gly Thr Ala Asn Glu Asn Ser 20 25 30
CAC GGC CTT TCT TTA ATG GCT TTA ATA GGG GTA TTG GTT TTT GGG GGC 200 His Gly Leu Ser Leu Met Ala Leu He Gly Val Leu Val Phe Gly Gly 35 40 45 50
GCG TTT TTA GCT CTG TTA GCG CCT AAA ATC TAT TTA AGC AAT AAT ATC 248 Ala Phe Leu Ala Leu Leu Ala Pro Lys He Tyr Leu Ser Asn Asn He 55 60 65
TAT TAT ATT AGC CGT AAA ATC AAC ACC CTA GAA GAT CAA AAA CGC CTG 296 Tyr Tyr He Ser Arg Lys He Asn Thr Leu Glu Asp Gin Lys Arg Leu 70 75 80
CTT TTA GAA GAG CAA CAA ATC CTA AAA AAC GAA TTA GAA AAA GAG CGT 344 Leu Leu Glu Glu Gin Gin He Leu Lys Asn Glu Leu Glu Lys Glu Arg 85 90 95
TTT AAA TAC TAC ATA GAA AAT AGT GAA AAT ATT GGC GAT ATT GCG TTT T 393 Phe Lys Tyr Tyr He Glu Asn Ser Glu Asn He Gly Asp He Ala Phe 100 105 110
AAGTGAAAAA CCCCCTATCC CCTTAAGAGA GCTTAATTAA TGGTCAATCA TT 445
(2) INFORMATION FOR SEQ ID NO: 408:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 114 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408:
Met Asn He Lys Thr His Ser Ser Asn Glu Lys Glu Arg Phe Val Arg
1 5 10 15
He Glu Glu Asp Glu Lys Lys Gly Leu Phe Ala Gly Thr Ala Asn Glu
20 25 30
Asn Ser His Gly Leu Ser Leu Met Ala Leu He Gly Val Leu Val Phe
35 40 45
Gly Gly Ala Phe Leu Ala Leu Leu Ala Pro Lys He Tyr Leu Ser Asn
50 55 60
Asn He Tyr Tyr He Ser Arg Lys He Asn Thr Leu Glu Asp Gin Lys 65 70 75 80
Arg Leu Leu Leu Glu Glu Gin Gin He Leu Lys Asn Glu Leu Glu Lys
85 90 95
Glu Arg Phe Lys Tyr Tyr He Glu Asn Ser Glu Asn He Gly Asp He
100 105 110
Ala Phe
(2) INFORMATION FOR SEQ ID NO: 409:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 218 base pairs
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...165 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409:
AAGGTTTTTA GCGAAATAAG CCCCCACGCC AAACACGCCC ATGCTCATTA G CAC AAT 57
His Asn 1
ATC GCA CAT AAA GCA CAA AGC GCA AAT CAA AAA CAC ATA ATT CCT AGC 105 He Ala His Lys Ala Gin Ser Ala Asn Gin Lys His He He Pro Ser 5 10 15
CAT CCC CCT TTC CAC AAT AAA CAA GGA TTG CGC CCC CAC CGC CGC ACA 153 His Pro Pro Phe His Asn Lys Gin Gly Leu Arg Pro His Arg Arg Thr 20 25 30
CAA AGA AAT CGC TAAACCAAAA CCTTCTATAA AAACCACAAA CATCTTGCTT AATCC 210
Gin Arg Asn Arg
35
TTTCACTC 21Ϊ
(2) INFORMATION FOR SEQ ID NO: 410:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 38 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:410:
His Asn He Ala His Lys Ala Gin Ser Ala Asn Gin Lys His He He
1 5 10 15
Pro Ser His Pro Pro Phe His Asn Lys Gin Gly Leu Arg Pro His Arg
20 25 30
Arg Thr Gin Arg Asn Arg 35
(2) INFORMATION FOR SEQ ID NO: 411: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 967 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...914 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411:
TAGTCAAGTC GCTATTACTG CAAGCCCGGT GAGCGCAGCG GTNGGTGTTT ATG AGC 56
Met Ser 1
GGC ATT TTA GAG CCT TTA GGA GCA AAT TAC TTG ACC CTT TTA ATG GTT 104 Gly He Leu Glu Pro Leu Gly Ala Asn Tyr Leu Thr Leu Leu Met Val 5 10 15
TGG ATC CCT ACG ACT TTT TTA GCA TGC ATG CTC ACG GCA TTT ATT ATG 152 Trp He Pro Thr Thr Phe Leu Ala Cys Met Leu Thr Ala Phe He Met 20 25 30
GGT TTT ACT GAT TTG AAA TTA GAC AGC GAT CCG CAT TAT TTA GAG CGC 200 Gly Phe Thr Asp Leu Lys Leu Asp Ser Asp Pro His Tyr Leu Glu Arg 35 40 45 50
TTG AAA GCG GGC AAA ATC TCG CCC CCT AAA ATC AAA GAA GAA AAA GAA 248 Leu Lys Ala Gly Lys He Ser Pro Pro Lys He Lys Glu Glu Lys Glu 55 60 65
ACC TCA AAA AAC GCG AAA TTA TCG TTA TGG ATT TTT ATC GGT GGG GTT 296 Thr Ser Lys Asn Ala Lys Leu Ser Leu Trp He Phe He Gly Gly Val 70 75 80
GTA GCG ATC GTT TTT TAT GCG AGC GCG ATT TCT AAA AAT ATC GCT TTT 344 Val Ala He Val Phe Tyr Ala Ser Ala He Ser Lys Asn He Ala Phe 85 90 95
GTT AGC CCG GTG GTT TTA GGC AGA GAT CAC GCG ATT GTG TCT TTC ATG 392 Val Ser Pro Val Val Leu Gly Arg Asp His Ala He Val Ser Phe Met 100 105 110
CTA AGC GTG GCG ACT TTA ATT GTG CTT TTT TGC AAA ATT AAC GCT AAT 440 Leu Ser Val Ala Thr Leu He Val Leu Phe Cys Lys He Asn Ala Asn 115 120 125 130
GAA ATC GCT CAT TCA AGC GTG TTT AAA TCC GGC ATG CAA GCG TGC GTG 488 Glu He Ala His Ser Ser Val Phe Lys Ser Gly Met Gin Ala Cys Val 135 140 145
TGC GTG TTG GGC GTG GCG TGG TTG GGC GAT ACT TTT GTG AGC AAT CAT 536 Cys Val Leu Gly Val Ala Trp Leu Gly Asp Thr Phe Val Ser Asn His 150 155 160
ATA GAT GAG ATC AAA CGA TAC GCT TCT TTT TTG ATC GCA GAT TAT CCG 584 He Asp Glu He Lys Arg Tyr Ala Ser Phe Leu He Ala Asp Tyr Pro 165 170 175
TTT TTA TTA GCC GTA GCG CTC TTT TTG GCT TCC ATG CTT TTG TAT TCG 632 Phe Leu Leu Ala Val Ala Leu Phe Leu Ala Ser Met Leu Leu Tyr Ser 180 185 190
CAA GCC GCC ACC TCT AAA GCG CTC ATC CCA AGC GTG ATC ACA GCC TTA 680 Gin Ala Ala Thr Ser Lys Ala Leu He Pro Ser Val He Thr Ala Leu 195 200 205 210
GGC ATT AGC GCT AAT CAT ACG GAG CAT TTG TAT ATT ATC GTG GCT TCG 728 Gly He Ser Ala Asn His Thr Glu His Leu Tyr He He Val Ala Ser 215 220 225
TTT GCG AGC GTT TCG GCG TTG TTT GTG TTA CCC ACT TAC CCC ACT TTA 776 Phe Ala Ser Val Ser Ala Leu Phe Val Leu Pro Thr Tyr Pro Thr Leu 230 235 240
CTA GGA GCG ATC GCT ATG GAT AAC ACC GGC ACC ACT AAA ATG GGC CGT 824 Leu Gly Ala He Ala Met Asp Asn Thr Gly Thr Thr Lys Met Gly Arg 245 250 255
TAT GTG TTT GAT CAT GCG TTT TTG ATC CCT GGG GTT TTA GTC GTG TCT 872 Tyr Val Phe Asp His Ala Phe Leu He Pro Gly Val Leu Val Val Ser 260 265 270
TTG AGC GTA GCG TTA GGG TTT GTT GTC GCG CCG TTA GTT TTG TAGATTTTA 923 Leu Ser Val Ala Leu Gly Phe Val Val Ala Pro Leu Val Leu 275 280 285
TCACCAACGA TAAAAAGCTT GGCGTTGCGA TTTTTCTAAA CCCC 967
(2) INFORMATION FOR SEQ ID NO:412:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 288 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:412:
Met Ser Gly He Leu Glu Pro Leu Gly Ala Asn Tyr Leu Thr Leu Leu 1 5 10 15
Met Val Trp He Pro Thr Thr Phe Leu Ala Cys Met Leu Thr Ala Phe
20 25 30
He Met Gly Phe Thr Asp Leu Lys Leu Asp Ser Asp Pro His Tyr Leu
35 40 45
Glu Arg Leu Lys Ala Gly Lys He Ser Pro Pro Lys He Lys Glu Glu
50 55 60
Lys Glu Thr Ser Lys Asn Ala Lys Leu Ser Leu Trp He Phe He Gly 65 70 75 80
Gly Val Val Ala He Val Phe Tyr Ala Ser Ala He Ser Lys Asn He
85 90 95
Ala Phe Val Ser Pro Val Val Leu Gly Arg Asp His Ala He Val Ser
100 105 110
Phe Met Leu Ser Val Ala Thr Leu He Val Leu Phe Cys Lys He Asn
115 120 125
Ala Asn Glu He Ala His Ser Ser Val Phe Lys Ser Gly Met Gin Ala
130 135 140
Cys Val Cys Val Leu Gly Val Ala Trp Leu Gly Asp Thr Phe Val Ser 145 150 155 160
Asn His He Asp Glu He Lys Arg Tyr Ala Ser Phe Leu He Ala Asp
165 170 175
Tyr Pro Phe Leu Leu Ala Val Ala Leu Phe Leu Ala Ser Met Leu Leu
180 185 190
Tyr Ser Gin Ala Ala Thr Ser Lys Ala Leu He Pro Ser Val He Thr
195 200 205
Ala Leu Gly He Ser Ala Asn His Thr Glu His Leu Tyr He He Val
210 215 220
Ala Ser Phe Ala Ser Val Ser Ala Leu Phe Val Leu Pro Thr Tyr Pro 225 230 235 240
Thr Leu Leu Gly Ala He Ala Met Asp Asn Thr Gly Thr Thr Lys Met
245 250 255
Gly Arg Tyr Val Phe Asp His Ala Phe Leu He Pro Gly Val Leu Val
260 265 270
Val Ser Leu Ser Val Ala Leu Gly Phe Val Val Ala Pro Leu Val Leu 275 280 285
(2) INFORMATION FOR SEQ ID NO-.413:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1237 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1184 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413 AAAAATTCTA TCATTTTTGC GCGGTATTTT CATTTTAACA AGGAGCAAAA ATG CTA 56
Met Leu 1
GAA AAT AGC TCT ATA TGG AGC AAT CCT GCC TTT GTG GCT ATC ATT TGC 104 Glu Asn Ser Ser He Trp Ser Asn Pro Ala Phe Val Ala He He Cys 5 10 15
ATG TGC GTT CTT AGC CTT TTA AGG CTC AAT GTC ATG CTT TCT ATG ATT 152 Met Cys Val Leu Ser Leu Leu Arg Leu Asn Val Met Leu Ser Met He 20 25 30
AGT GCG ACT CTC ATA GCA GGA CTT ATG GGA GGG CTT GGG ATC ACG GAG 200 Ser Ala Thr Leu He Ala Gly Leu Met Gly Gly Leu Gly He Thr Glu 35 40 45 50
AGT TTT AAT GCA ATG ATA GAC GGC ATG AAA GGC AAT TTG AAC ATC GCT 248 Ser Phe Asn Ala Met He Asp Gly Met Lys Gly Asn Leu Asn He Ala 55 60 65
TTA AGC TAC ATC CTT TTA GGG GCT TTA GCG GTA GCG ATC GCT AAA AGC 296 Leu Ser Tyr He Leu Leu Gly Ala Leu Ala Val Ala He Ala Lys Ser 70 75 80
AAT CTC ATT AAA GTC GCT TTG AGT AAA TTA ATA GGT TTA ATG GAT TAC 344 Asn Leu He Lys Val Ala Leu Ser Lys Leu He Gly Leu Met Asp Tyr 85 90 95
AAG CGA TCC ACT TTT TGC TTT TTG ATC GCT TTC ATC GCA TGC TTT TCG 392 Lys Arg Ser Thr Phe Cys Phe Leu He Ala Phe He Ala Cys Phe Ser 100 105 110
CAA AAT TTA GTG CCG GTG CAT ATC GCT TTT ATC CCT ATT TTA ATC CCC 440 Gin Asn Leu Val Pro Val His He Ala Phe He Pro He Leu He Pro 115 120 125 130
CCT CTT TTG CAT TTA ATG AAC CGG CTA GAA TTG GAT AGA AGA GCG GTC 488 Pro Leu Leu His Leu Met Asn Arg Leu Glu Leu Asp Arg Arg Ala Val 135 140 145
GCT TGC GCT TTA ACC TTT GGC TTG CAA GCC CCC TAC TTG GTG CTT CCT 536 Ala Cys Ala Leu Thr Phe Gly Leu Gin Ala Pro Tyr Leu Val Leu Pro 150 155 160
GTA GGG TTT GGC TTG ATT TTT CAA ACC ACC ATT TTA GAG CAA TTA AAA 584 Val Gly Phe Gly Leu He Phe Gin Thr Thr He Leu Glu Gin Leu Lys 165 170 175
GCT AAT GGC GTT AGC ACC ACC ATA GCG CAA ATC ACA GGA GTG ATG TGG 632 Ala Asn Gly Val Ser Thr Thr He Ala Gin He Thr Gly Val Met Trp 180 185 190
ATA GCG GGG TTA GCG ATG GTC GTT GGA CTG CTT GTT GCT GTA TTA ACG 680 He Ala Gly Leu Ala Met Val Val Gly Leu Leu Val Ala Val Leu Thr 195 200 205 210 CTA TAC AAA AAA CCC AGG CAC TAC AAA GAG AAA TCT TTT AAT ATA GAA 728 Leu Tyr Lys Lys Pro Arg His Tyr Lys Glu Lys Ser Phe Asn He Glu 215 220 225
AAT TAC GCC TCG CTT CAA TTA AAC TAC CAT GAC TAC TTG ACT TTT ATA 776 Asn Tyr Ala Ser Leu Gin Leu Asn Tyr His Asp Tyr Leu Thr Phe He 230 235 240
GGG ATT GTC GTA GCG TTT GTG ATC CAA TTA GCC ACC GAT TCG ATG CCC 824 Gly He Val Val Ala Phe Val He Gin Leu Ala Thr Asp Ser Met Pro 245 250 255
TTA GCC GCC TTT TTA GCG TTA GCG ATC ATC TTA TTA GGC CGT GGC ATT 872 Leu Ala Ala Phe Leu Ala Leu Ala He He Leu Leu Gly Arg Gly He 260 265 270
AAG TTT AAA GAA ACA GAC TCG CTT ATG GAT GAT AGC GTG AAA ATG ATG 920 Lys Phe Lys Glu Thr Asp Ser Leu Met Asp Asp Ser Val Lys Met Met 275 280 285 290
GCG TTT ATC GCT TTT GTG ATG TTG GTG GCT AGC GGG TTT GGA GAA GTG 968 Ala Phe He Ala Phe Val Met Leu Val Ala Ser Gly Phe Gly Glu Val 295 300 305
TTG CAA AAA GTG CAT GCG ATA GAG GGC TTA GTG AAT GCG ATT ACA AGC 1016 Leu Gin Lys Val His Ala He Glu Gly Leu Val Asn Ala He Thr Ser 310 315 320
GTA GTC CAA GGG AAG CTT TTA GGG GCT TTT TTA ATG CTT GTT GTA GGG 1064 Val Val Gin Gly Lys Leu Leu Gly Ala Phe Leu Met Leu Val Val Gly 325 330 335
CTT TTT ATC ACT ATG GGG ATA GGG ACT TCT TTT GGC ACT ATT CCT ATC 1112 Leu Phe He Thr Met Gly He Gly Thr Ser Phe Gly Thr He Pro He 340 345 350
ATC GCT GTG TTT TAT GTC CCT TTA TGC GCG AAA TTA GGT TTT AGC GTA 1160 He Ala Val Phe Tyr Val Pro Leu Cys Ala Lys Leu Gly Phe Ser Val 355 360 365 370
GAA TCT ACG ATT TTA CTC ATC GCA TAGCCGCAGC TTTAGGCGAT GCAGGCTCAC 1214 Glu Ser Thr He Leu Leu He Ala 375
CGGCTAGCGA TAGCACCATG GGG 1237
(2) INFORMATION FOR SEQ ID NO: 414:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 378 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 414:
Met Leu Glu Asn Ser Ser He Trp Ser Asn Pro Ala Phe Val Ala He
1 5 10 15
He Cys Met Cys Val Leu Ser Leu Leu Arg Leu Asn Val Met Leu Ser
20 25 30
Met He Ser Ala Thr Leu He Ala Gly Leu Met Gly Gly Leu Gly He
35 40 45
Thr Glu Ser Phe Asn Ala Met He Asp Gly Met Lys Gly Asn Leu Asn
50 55 60
He Ala Leu Ser Tyr He Leu Leu Gly Ala Leu Ala Val Ala He Ala 65 70 75 80
Lys Ser Asn Leu He Lys Val Ala Leu Ser Lys Leu He Gly Leu Met
85 90 95
Asp Tyr Lys Arg Ser Thr Phe Cys Phe Leu He Ala Phe He Ala Cys
100 105 110
Phe Ser Gin Asn Leu Val Pro Val His He Ala Phe He Pro He Leu
115 120 125
He Pro Pro Leu Leu His Leu Met Asn Arg Leu Glu Leu Asp Arg Arg
130 135 140
Ala Val Ala Cys Ala Leu Thr Phe Gly Leu Gin Ala Pro Tyr Leu Val 145 150 155 160
Leu Pro Val Gly Phe Gly Leu He Phe Gin Thr Thr He Leu Glu Gin
165 170 175
Leu Lys Ala Asn Gly Val Ser Thr Thr He Ala Gin He Thr Gly Val
180 185 190
Met Trp He Ala Gly Leu Ala Met Val Val Gly Leu Leu Val Ala Val
195 200 205
Leu Thr Leu Tyr Lys Lys Pro Arg His Tyr Lys Glu Lys Ser Phe Asn
210 215 220
He Glu Asn Tyr Ala Ser Leu Gin Leu Asn Tyr His Asp Tyr Leu Thr 225 230 235 240
Phe He Gly He Val Val Ala Phe Val He Gin Leu Ala Thr Asp Ser
245 250 255
Met Pro Leu Ala Ala Phe Leu Ala Leu Ala He He Leu Leu Gly Arg
260 265 270
Gly He Lys Phe Lys Glu Thr Asp Ser Leu Met Asp Asp Ser Val Lys
275 280 285
Met Met Ala Phe He Ala Phe Val Met Leu Val Ala Ser Gly Phe Gly
290 295 300
Glu Val Leu Gin Lys Val His Ala He Glu Gly Leu Val Asn Ala He 305 310 315 320
Thr Ser Val Val Gin Gly Lys Leu Leu Gly Ala Phe Leu Met Leu Val
325 330 335
Val Gly Leu Phe He Thr Met Gly He Gly Thr Ser Phe Gly Thr He
340 345 350
Pro He He Ala Val Phe Tyr Val Pro Leu Cys Ala Lys Leu Gly Phe
355 360 365
Ser Val Glu Ser Thr He Leu Leu He Ala 370 375
(2) INFORMATION FOR SEQ ID NO: 415: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 945 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 61...636 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:415:
TTAGCCACTA CCACAGGCAC AGGGTGTTCT TCATCAAATT TGAGAGCGAC GGCGTAATGG 60 GTG GGG TTA GTA ACC ACG ACA TTG GCT TTA GGG ATT TCT TGC ATC ATT 108 Val Gly Leu Val Thr Thr Thr Leu Ala Leu Gly He Ser Cys He He 1 5 10 15
TTA TTC GTG GCG TTT TTT AGC ATC ATT TGG CGG ATT TTG GCT TTG ATT 156 Leu Phe Val Ala Phe Phe Ser He He Trp Arg He Leu Ala Leu He 20 25 30
TCT GGG TTC CCT TCT TGC TGT TTG TAT TCG TCC TTA ACT TCT TGT TTA 204 Ser Gly Phe Pro Ser Cys Cys Leu Tyr Ser Ser Leu Thr Ser Cys Leu 35 40 45
GTC ATT TTT AAA GAG TTG GTG TAT TGG CGG CGT TTG ATC GCT AAA TCT 252 Val He Phe Lys Glu Leu Val Tyr Trp Arg Arg Leu He Ala Lys Ser 50 55 60
ATA AAA GCC AAG ACA AAA AAT AAA AAT AAA AGC GAA GAA ATG AGC CAT 300 He Lys Ala Lys Thr Lys Asn Lys Asn Lys Ser Glu Glu Met Ser His 65 70 75 80
AAC GCC TTA TTT TTA AAC CAC AAC AAC TGG CCT TGT AAA TTC AAA AGA 348 Asn Ala Leu Phe Leu Asn His Asn Asn Trp Pro Cys Lys Phe Lys Arg 85 90 95
GCC GCA TGG TTT AAT TCC CCT AAA AAC AAA GAA AAG ATG AAA AAC CCC 396 Ala Ala Trp Phe Asn Ser Pro Lys Asn Lys Glu Lys Met Lys Asn Pro 100 105 110
AGA AAA AAA GCT AAA AAA ACT TTT AAG GTG ATC AAA CTC CCA TCA AGG 444 Arg Lys Lys Ala Lys Lys Thr Phe Lys Val He Lys Leu Pro Ser Arg 115 120 125
AGC TTT TTT AAA GAA AAA AGG TTT TTG ACG CCA TTG ATA GGG TTG ATT 492 Ser Phe Phe Lys Glu Lys Arg Phe Leu Thr Pro Leu He Gly Leu He 130 135 140 TTA GAA AAT TTA GGC TCA ATG ACT TTA GGG GCA AAG AGC CAG CCA AAT 540 Leu Glu Asn Leu Gly Ser Met Thr Leu Gly Ala Lys Ser Gin Pro Asn 145 150 155 160
TGC AAG ACA TTA GAT AAA AAC GCC ACC ACC ACT AAA ATG ATT AAA ATC 588 Cys Lys Thr Leu Asp Lys Asn Ala Thr Thr Thr Lys Met He Lys He 165 170 175
GGT AAA AGC AAT AAA AAA GTG TCT TTA GCC AGT TGG TTA AAC AGC TCT T 637 Gly Lys Ser Asn Lys Lys Val Ser Leu Ala Ser Trp Leu Asn Ser Ser 180 185 190
GAACGCTTTC TTTACTGAAA TCTAGGGAAA AATCTTTCAA CACATGGCGA TACATTTCGC 697
TAAAGCCATC CACCCACCAT ATAAAAAAAA CAAAAATACT AATTAGCCCG GCCAATAACC 757
CCAAAACCCC CACCACTTCC ATGCTCTTAG GCACATTGCC TTCTTCTCTG GCTTTTTGGA 817
TTTTTTTCGC GCTAGGGAGT TCGGTTTTTT CTTCTTCAGC CATTGGCCCT CTTTTTTAAA 877
ATTTGAACGG CTAATTCATA GTCTTTTAGG GTGTTTAGGT TTAAAAATTC TTCTTCTTTG 937
TCAAAATT 945
(2) INFORMATION FOR SEQ ID NO: 416:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 192 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416:
Val Gly Leu Val Thr Thr Thr Leu Ala Leu Gly He Ser Cys He He
1 5 10 15
Leu Phe Val Ala Phe Phe Ser He He Trp Arg He Leu Ala Leu He
20 25 30
Ser Gly Phe Pro Ser Cys Cys Leu Tyr Ser Ser Leu Thr Ser Cys Leu
35 40 45
Val He Phe Lys Glu Leu Val Tyr Trp Arg Arg Leu He Ala Lys Ser
50 55 60
He Lys Ala Lys Thr Lys Asn Lys Asn Lys Ser Glu Glu Met Ser His 65 70 75 80
Asn Ala Leu Phe Leu Asn His Asn Asn Trp Pro Cys Lys Phe Lys Arg
85 90 95
Ala Ala Trp Phe Asn Ser Pro Lys Asn Lys Glu Lys Met Lys Asn Pro
100 105 110
Arg Lys Lys Ala Lys Lys Thr Phe Lys Val He Lys Leu Pro Ser Arg
115 120 125
Ser Phe Phe Lys Glu Lys Arg Phe Leu Thr Pro Leu He Gly Leu He
130 135 140
Leu Glu Asn Leu Gly Ser Met Thr Leu Gly Ala Lys Ser Gin Pro Asn 145 150 155 160
Cys Lys Thr Leu Asp Lys Asn Ala Thr Thr Thr Lys Met He Lys He
165 170 175
Gly Lys Ser Asn Lys Lys Val Ser Leu Ala Ser Trp Leu Asn Ser Ser 180 185 190
(2) INFORMATION FOR SEQ ID NO: 417:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 851 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...783 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417:
ATTGATAGTT TCTTCAGCAA GAATGATTAG GCAATGATTA GGTTGTAG ATG AAT TTT 57
Met Asn Phe 1
TAT CAA AAA ATA TAC ACT CAT AAA GTC GTT TTT TCT TCA TTG TTT TTT 105 Tyr Gin Lys He Tyr Thr His Lys Val Val Phe Ser Ser Leu Phe Phe 5 10 15
TTG TTG TTT TTG TTC AAT GTG GAA ACT TTG TTG CTT TCG CAT TTC AGC 153 Leu Leu Phe Leu Phe Asn Val Glu Thr Leu Leu Leu Ser His Phe Ser 20 25 30 35
GAT GAT TTT TCG CAA TTG TTT TTT TTG TTT GAA AAC CAT GTT TAT GAT 201 Asp Asp Phe Ser Gin Leu Phe Phe Leu Phe Glu Asn His Val Tyr Asp 40 45 50
TTC ATT GTC AAA TTA GAT TAT TTG GGG CTA ATA GGC GTT TCT TTA ATT 249 Phe He Val Lys Leu Asp Tyr Leu Gly Leu He Gly Val Ser Leu He 55 60 65
TAT CTG CTT GTG CTT ATT CTA AAG CCT TTC ACC CTC ACG CGC CAA AAA 297 Tyr Leu Leu Val Leu He Leu Lys Pro Phe Thr Leu Thr Arg Gin Lys 70 75 80
TGC GCT TGC GTA GGG ATA TTA TGC CTT TCT TTC TAC GCT TGG AAT TTT 345 Cys Ala Cys Val Gly He Leu Cys Leu Ser Phe Tyr Ala Trp Asn Phe 85 90 95
CCT GTT AAA GAT TCT TTA ATG GTG CTT TAT CTT TTC TAT TTT GCG CTG 393 Pro Val Lys Asp Ser Leu Met Val Leu Tyr Leu Phe Tyr Phe Ala Leu 100 105 110 115
TTA GCG ACT TTA TTG TGG CGT TTT TTA GGG GCT AGC ATG AAG CAA TCT 441 Leu Ala Thr Leu Leu Trp Arg Phe Leu Gly Ala Ser Met Lys Gin Ser 120 125 130
TTC TTG CCC TCT ATG AAT ATT TGC ATC GTG TGG GTT TTT GCT TCT TCT 489 Phe Leu Pro Ser Met Asn He Cys He Val Trp Val Phe Ala Ser Ser 135 140 145
TTA CAG AGT TTT AGG TTT TTA AGC GTG TCT GAT TGC GTG GAT TTT TCC 537 Leu Gin Ser Phe Arg Phe Leu Ser Val Ser Asp Cys Val Asp Phe Ser 150 155 160
CTT TTT ACA CTC GCG CTT ATT TTA TTG ATA CTG GTT TTA ATC TAT TGC 585 Leu Phe Thr Leu Ala Leu He Leu Leu He Leu Val Leu He Tyr Cys 165 170 175
AAA CGC CTT TTT GGG TTG TAT GAA TAC GCT AAC ACG CTC ATT TTG ATC 633 Lys Arg Leu Phe Gly Leu Tyr Glu Tyr Ala Asn Thr Leu He Leu He 180 185 190 195
GTG GGG CTT AGC GTG GTG GTG CTA TGC TCT AGC ATG TTC ATT CAA ACT 681 Val Gly Leu Ser Val Val Val Leu Cys Ser Ser Met Phe He Gin Thr 200 205 210
AAA GAA TAC TAT GGC ATG CGA TTG GGT TTT TAT TTT TTA GGC CTG TTA 729 Lys Glu Tyr Tyr Gly Met Arg Leu Gly Phe Tyr Phe Leu Gly Leu Leu 215 220 225
GGG TGG CTT TTA GAA TAT GTG CAT AAC ACT TTA AGG CGT TTG GAA CAT 777 Gly Trp Leu Leu Glu Tyr Val His Asn Thr Leu Arg Arg Leu Glu His 230 235 240
CAA ATT TAAAGCTCAA ATAGGAATAG CTAAAGCCTT TTGATTGAGT GTTTTTTTAG GG 835 Gin He 245
CTTAAAAGCG GGTTTA 851
(2) INFORMATION FOR SEQ ID NO: 418:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 245 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:418:
Met Asn Phe Tyr Gin Lys He Tyr Thr His Lys Val Val Phe Ser Ser
1 5 10 15
Leu Phe Phe Leu Leu Phe Leu Phe Asn Val Glu Thr Leu Leu Leu Ser
20 25 30
His Phe Ser Asp Asp Phe Ser Gin Leu Phe Phe Leu Phe Glu Asn His 35 40 45
Val Tyr Asp Phe He Val Lys Leu Asp Tyr Leu Gly Leu He Gly Val
50 55 60
Ser Leu He Tyr Leu Leu Val Leu He Leu Lys Pro Phe Thr Leu Thr 65 70 75 80
Arg Gin Lys Cys Ala Cys Val Gly He Leu Cys Leu Ser Phe Tyr Ala
85 90 95
Trp Asn Phe Pro Val Lys Asp Ser Leu Met Val Leu Tyr Leu Phe Tyr
100 105 110
Phe Ala Leu Leu Ala Thr Leu Leu Trp Arg Phe Leu Gly Ala Ser Met
115 120 125
Lys Gin Ser Phe Leu Pro Ser Met Asn He Cys He Val Trp Val Phe
130 135 140
Ala Ser Ser Leu Gin Ser Phe Arg Phe Leu Ser Val Ser Asp Cys Val 145 150 155 160
Asp Phe Ser Leu Phe Thr Leu Ala Leu He Leu Leu He Leu Val Leu
165 170 175
He Tyr Cys Lys Arg Leu Phe Gly Leu Tyr Glu Tyr Ala Asn Thr Leu
180 185 190
He Leu He Val Gly Leu Ser Val Val Val Leu Cys Ser Ser Met Phe
195 200 205
He Gin Thr Lys Glu Tyr Tyr Gly Met Arg Leu Gly Phe Tyr Phe Leu
210 215 220
Gly Leu Leu Gly Trp Leu Leu Glu Tyr Val His Asn Thr Leu Arg Arg 225 230 235 240
Leu Glu His Gin He 245
(2) INFORMATION FOR SEQ ID NO: 419:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 827 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 95...745 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:419:
CTTTTTACTT TTAGTTAAGT TGTAAGAAAC TTTAGCTACC ATGCGATACA AAAAAGGATT 60 TTAAGTGCGT TTTGGTAAAA TTGATTATTT GAAC ATG CTC CCT TTT GAT GTG TTT 115
Met Leu Pro Phe Asp Val Phe 1 5
ATC AAA TCC TAC CCC ACC CCT TGT TAT TTC AAA CAA TTC TTA CGG CTT 163 He Lys Ser Tyr Pro Thr Pro Cys Tyr Phe Lys Gin Phe Leu Arg Leu 10 15 20
AAA AAA ACC TAC CCC TCC AAA CTC AAT GAG AGT TTT TTA TTC AGG CGT 211 Lys Lys Thr Tyr Pro Ser Lys Leu Asn Glu Ser Phe Leu Phe Arg Arg 25 30 35
ATT GAT GCG GGG TTT ATT TCT TCT ATC GCC GGC TAT CCA TTC GCT CTT 259 He Asp Ala Gly Phe He Ser Ser He Ala Gly Tyr Pro Phe Ala Leu 40 45 50 55
CAT TCC CAT TCT CTA GGC ATT GTC GCT TAT AAG GAA GTT TTA AGC GTG 307 His Ser His Ser Leu Gly He Val Ala Tyr Lys Glu Val Leu Ser Val 60 65 70
CTG GTT GTG GAT ACA AAA AAC GCT TTT GAT AAA GAA AGC GCT TCT TCA 355 Leu Val Val Asp Thr Lys Asn Ala Phe Asp Lys Glu Ser Ala Ser Ser 75 80 85
AAC GCC CTC TCT CAA GCG CTA GGG TTA AAG GGC GAA GTG TTA ATC GGC 403 Asn Ala Leu Ser Gin Ala Leu Gly Leu Lys Gly Glu Val Leu He Gly 90 95 100
AAT AAA GCA CTG CAG TTT TAT TAT TCC AAC CCT AAA AAA GAT TTT ATA 451 Asn Lys Ala Leu Gin Phe Tyr Tyr Ser Asn Pro Lys Lys Asp Phe He 105 110 115
GAT TTA GCC GCT CTT TGG TAT GAA AAA AAA CGC TTG CCG TTT GTT TTT 499 Asp Leu Ala Ala Leu Trp Tyr Glu Lys Lys Arg Leu Pro Phe Val Phe 120 125 130 135
GGG CGT TTG TGT TAT TAC CAA AAC AAG GAT TTT TAC AAG CGC TTG TCT 547 Gly Arg Leu Cys Tyr Tyr Gin Asn Lys Asp Phe Tyr Lys Arg Leu Ser 140 145 150
TTA GCT TTC AAA CAT CAA AAA ACA AAA ATC CCT TAC TAC ATC CTT AAA 595 Leu Ala Phe Lys His Gin Lys Thr Lys He Pro Tyr Tyr He Leu Lys 155 160 165
GAA GCC GCT TTA AAA ACC AAC TTA AAA CGC CAA GAT ATT TTA AAT TAC 643 Glu Ala Ala Leu Lys Thr Asn Leu Lys Arg Gin Asp He Leu Asn Tyr 170 175 180
TTG CAA AAA ATT TAC TAC ACT TTA GGC AAA AAG GAG CAA TTA GGT CTT 691 Leu Gin Lys He Tyr Tyr Thr Leu Gly Lys Lys Glu Gin Leu Gly Leu 185 190 195
AAA GCG TTC TAT CGT GAA TTG TTA TTC AAA CGC ATT CAA AAA CCC AAG 739 Lys Ala Phe Tyr Arg Glu Leu Leu Phe Lys Arg He Gin Lys Pro Lys 200 205 210 215
CGT TTT TAGTGATTCG CTGAGAATGT AGGCTTAAAA TTCAGAAAGG GTGTTTTTAA GC 797 Arg Phe
AAGATTAGGT TACAATCACA AGTTTTATTA 827 (2) INFORMATION FOR SEQ ID NO:420:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 217 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:420:
Met Leu Pro Phe Asp Val Phe He Lys Ser Tyr Pro Thr Pro Cys Tyr
1 5 10 15
Phe Lys Gin Phe Leu Arg Leu Lys Lys Thr Tyr Pro Ser Lys Leu Asn
20 25 30
Glu Ser Phe Leu Phe Arg Arg He Asp Ala Gly Phe He Ser Ser He
35 40 45
Ala Gly Tyr Pro Phe Ala Leu His Ser His Ser Leu Gly He Val Ala
50 55 60
Tyr Lys Glu Val Leu Ser Val Leu Val Val Asp Thr Lys Asn Ala Phe 65 70 75 80
Asp Lys Glu Ser Ala Ser Ser Asn Ala Leu Ser Gin Ala Leu Gly Leu
85 90 95
Lys Gly Glu Val Leu He Gly Asn Lys Ala Leu Gin Phe Tyr Tyr Ser
100 105 110
Asn Pro Lys Lys Asp Phe He Asp Leu Ala Ala Leu Trp Tyr Glu Lys
115 120 125
Lys Arg Leu Pro Phe Val Phe Gly Arg Leu Cys Tyr Tyr Gin Asn Lys
130 135 140
Asp Phe Tyr Lys Arg Leu Ser Leu Ala Phe Lys His Gin Lys Thr Lys 145 150 155 160
He Pro Tyr Tyr He Leu Lys Glu Ala Ala Leu Lys Thr Asn Leu Lys
165 170 175
Arg Gin Asp He Leu Asn Tyr Leu Gin Lys He Tyr Tyr Thr Leu Gly
180 185 190
Lys Lys Glu Gin Leu Gly Leu Lys Ala Phe Tyr Arg Glu Leu Leu Phe
195 200 205
Lys Arg He Gin Lys Pro Lys Arg Phe 210 215
(2) INFORMATION FOR SEQ ID NO: 421:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 736 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 76...663 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421:
GCAACAAGAA ATTTTAGAGA AAATGAGACC CAACGCTCAA AAAATTCAAG CGGGTTAAAC 60 GGCTAAAAAG GAGAG ATG ATG GGA TAC ATT CCT TAT GTA ATA GAG AAT ACC 111 Met Met Gly Tyr He Pro Tyr Val He Glu Asn Thr 1 5 10
GAT CGT GGG GAG CGT AGC TAT GAT ATT TAC TCG CGC CTT TTA AAG GAT 159 Asp Arg Gly Glu Arg Ser Tyr Asp He Tyr Ser Arg Leu Leu Lys Asp 15 20 25
CGC ATT GTT TTA TTG AGC GGT GAA ATT AAT GAC AGC GTG GCG TCT TCT 207 Arg He Val Leu Leu Ser Gly Glu He Asn Asp Ser Val Ala Ser Ser 30 35 40
ATC GTG GCC CAA CTC TTG TTT TTG GAA GCT GAA GAC CCT GAA AAA GAC 255 He Val Ala Gin Leu Leu Phe Leu Glu Ala Glu Asp Pro Glu Lys Asp 45 50 55 60
ATT GGT TTG TAT ATC AAT TCT CCC GGT GGG GTG ATA ACA AGC GGT CTT 303 He Gly Leu Tyr He Asn Ser Pro Gly Gly Val He Thr Ser Gly Leu 65 70 75
AGT ATT TAT GAC ACC ATG AAT TTT ATC CGC CCT GAT GTT TCC ACG ATT 351 Ser He Tyr Asp Thr Met Asn Phe He Arg Pro Asp Val Ser Thr He 80 85 90
TGC ATC GGT CAA GCG GCT TCT ATG GGG GCG TTT TTA CTG AGC TGT GGG 399 Cys He Gly Gin Ala Ala Ser Met Gly Ala Phe Leu Leu Ser Cys Gly 95 100 105
GCT AAG GGC AAG CGC TTT TCG CTA CCC CAT TCA AGG ATT ATG ATC CAC 447 Ala Lys Gly Lys Arg Phe Ser Leu Pro His Ser Arg He Met He His 110 115 120
CAG CCT TTA GGG GGG GCT CAA GGG CAA GCG AGC GAT ATT GAA ATC ATT 495 Gin Pro Leu Gly Gly Ala Gin Gly Gin Ala Ser Asp He Glu He He 125 130 135 140
TCT AAT GAG ATT CTC AGG CTT AAA GGC TTG ATG AAT TCT ATT CTA GCT 543 Ser Asn Glu He Leu Arg Leu Lys Gly Leu Met Asn Ser He Leu Ala 145 150 155
CAA AAC TCA GGG CAG AGT TTG GAG CAA ATC GCT AAA GAC ACG GAC AGG 591 Gin Asn Ser Gly Gin Ser Leu Glu Gin He Ala Lys Asp Thr Asp Arg 160 165 170
GAT TTT TAT ATG AGT GCT AAA GAA GCT AAA GAG TAT GGC TTG ATT GAT 639 Asp Phe Tyr Met Ser Ala Lys Glu Ala Lys Glu Tyr Gly Leu He Asp 175 180 185 AAA GTG TTA CAG AAA AAC GTG AAG TGATTGCATG GCGTTATTAG AGATTATCCA 693 Lys Val Leu Gin Lys Asn Val Lys 190 195
TTACCCTTCT AAAATCTTAA GAACGATTTC TAAAGAGGTC GTT 736
(2) INFORMATION FOR SEQ ID NO:422:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 196 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422:
Met Met Gly Tyr He Pro Tyr Val He Glu Asn Thr Asp Arg Gly Glu
1 5 10 15
Arg Ser Tyr Asp He Tyr Ser Arg Leu Leu Lys Asp Arg He Val Leu
20 25 30
Leu Ser Gly Glu He Asn Asp Ser Val Ala Ser Ser He Val Ala Gin
35 40 45
Leu Leu Phe Leu Glu Ala Glu Asp Pro Glu Lys Asp He Gly Leu Tyr
50 55 60
He Asn Ser Pro Gly Gly Val He Thr Ser Gly Leu Ser He Tyr Asp 65 70 75 80
Thr Met Asn Phe He Arg Pro Asp Val Ser Thr He Cys He Gly Gin
85 90 95
Ala Ala Ser Met Gly Ala Phe Leu Leu Ser Cys Gly Ala Lys Gly Lys
100 105 110
Arg Phe Ser Leu Pro His Ser Arg He Met He His Gin Pro Leu Gly
115 120 125
Gly Ala Gin Gly Gin Ala Ser Asp He Glu He He Ser Asn Glu He
130 135 140
Leu Arg Leu Lys Gly Leu Met Asn Ser He Leu Ala Gin Asn Ser Gly 145 150 155 160
Gin Ser Leu Glu Gin He Ala Lys Asp Thr Asp Arg Asp Phe Tyr Met
165 170 175
Ser Ala Lys Glu Ala Lys Glu Tyr Gly Leu He Asp Lys Val Leu Gin
180 185 190
Lys Asn Val Lys 195
(2) INFORMATION FOR SEQ ID NO: 423:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 904 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA ( ix) FEATURE :
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...852 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:423:
CGC ATA AAA AAA GAA CGC TTG AAC AAA CTG CTT AAA AGG GGG TTT TTA 48
Arg He Lys Lys Glu Arg Leu Asn Lys Leu Leu Lys Arg Gly Phe Leu 1 5 10 15
GCG TTC TTT TTG AGC GTG TAT TTA AGG GCT GAT GAT TTG GTT ACT TAC 96 Ala Phe Phe Leu Ser Val Tyr Leu Arg Ala Asp Asp Leu Val Thr Tyr 20 25 30
ACC ATC ATC AAA GAA AAA GAT CTA GGA TAC CAG CGG TTT TTA GCC AAG 144 Thr He He Lys Glu Lys Asp Leu Gly Tyr Gin Arg Phe Leu Ala Lys 35 40 45
AAG TGT TTA AGG GGT AAA ACC CAC CCT CCG TGT TTT ACT AAG CCT AAA 192 Lys Cys Leu Arg Gly Lys Thr His Pro Pro Cys Phe Thr Lys Pro Lys 50 55 60
AAG CCT AAA AGA AAA CTT TTT AAT ATA GAC AAA AGC TCC CAC TAT TAT 240 Lys Pro Lys Arg Lys Leu Phe Asn He Asp Lys Ser Ser His Tyr Tyr 65 70 75 80
GGC ACA AGC GTG GTG CAA ATG TCA TGG CTA CAG AGT AGG GAA AAA TTT 288 Gly Thr Ser Val Val Gin Met Ser Trp Leu Gin Ser Arg Glu Lys Phe 85 90 95
GAA AAC CAT TCA AAA TAC CGA GAC ATT CCT TTT GCT GAA GTC AGT TTG 336 Glu Asn His Ser Lys Tyr Arg Asp He Pro Phe Ala Glu Val Ser Leu 100 105 110
ATT TAT GGC TAT AAA CAA TTT TTT CCT AAA AAA GAG CGC TAC GGC TTC 384 He Tyr Gly Tyr Lys Gin Phe Phe Pro Lys Lys Glu Arg Tyr Gly Phe 115 120 125
CGT TTT TAT GTC TCT TTG GAT TAC GCT TAT GGG TTT TTT CTT AAA AAT 432 Arg Phe Tyr Val Ser Leu Asp Tyr Ala Tyr Gly Phe Phe Leu Lys Asn 130 135 140
AAG GGC GTG TTG GGC GAT AGT TTG AGG GAG AGT TCG CAA ATC CCT AAA 480 Lys Gly Val Leu Gly Asp Ser Leu Arg Glu Ser Ser Gin He Pro Lys 145 150 155 160
AGC TAT AGA GAA AAA TTG CAA AGA AAA GAG ACT TTT ATT AAC GCT ATT 528 Ser Tyr Arg Glu Lys Leu Gin Arg Lys Glu Thr Phe He Asn Ala He 165 170 175 TTT TAT GGC GCG GGA GCT GAC TTT TTA TAC AAA CGC GCT TTT GGA ACG 576 Phe Tyr Gly Ala Gly Ala Asp Phe Leu Tyr Lys Arg Ala Phe Gly Thr 180 185 190
CTG ATT TTA GGG ATG AAT TTC GTG GGA GAA ACC TGG TTT TAT GAA ACA 624 Leu He Leu Gly Met Asn Phe Val Gly Glu Thr Trp Phe Tyr Glu Thr 195 200 205
AAG ATT TTT AAA AAG TGG GCT AAA GAT CCT TTG AGC GTT TAT CAC CCT 672 Lys He Phe Lys Lys Trp Ala Lys Asp Pro Leu Ser Val Tyr His Pro 210 215 220
TAC ATG TTT CAA GTG ATG TTG AAT GTG GGG TAT CGT TAC CGC TTT TCA 720 Tyr Met Phe Gin Val Met Leu Asn Val Gly Tyr Arg Tyr Arg Phe Ser 225 230 235 240
AGG TAT AAG AAT TGG GCG ATA GAA TTG GGT GCG CGC ATC CCT TTT TTA 768 Arg Tyr Lys Asn Trp Ala He Glu Leu Gly Ala Arg He Pro Phe Leu 245 250 255
ACC AAT GAT TAT TTT AAA ACC CCT TTA TAC ACC CTT CAT TTC AAG CGC 816 Thr Asn Asp Tyr Phe Lys Thr Pro Leu Tyr Thr Leu His Phe Lys Arg 260 265 270
AAT ATT TCT GTC TAT CTC ACT TCA ACT TAT GAC TTT TAGTTTTTTA AATTTT 868 Asn He Ser Val Tyr Leu Thr Ser Thr Tyr Asp Phe 275 280
TGAAAACTAG AATTAAAACC GCTTTTTATA AACTGG 904
(2) INFORMATION FOR SEQ ID NO: 424:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 284 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:424:
Arg He Lys Lys Glu Arg Leu Asn Lys Leu Leu Lys Arg Gly Phe Leu
1 5 10 15
Ala Phe Phe Leu Ser Val Tyr Leu Arg Ala Asp Asp Leu Val Thr Tyr
20 25 30
Thr He He Lys Glu Lys Asp Leu Gly Tyr Gin Arg Phe Leu Ala Lys
35 40 45
Lys Cys Leu Arg Gly Lys Thr His Pro Pro Cys Phe Thr Lys Pro Lys
50 55 60
Lys Pro Lys Arg Lys Leu Phe Asn He Asp Lys Ser Ser His Tyr Tyr 65 70 75 80
Gly Thr Ser Val Val Gin Met Ser Trp Leu Gin Ser Arg Glu Lys Phe 85 90 95 Glu Asn His Ser Lys Tyr Arg Asp He Pro Phe Ala Glu Val Ser Leu
100 105 110
He Tyr Gly Tyr Lys Gin Phe Phe Pro Lys Lys Glu Arg Tyr Gly Phe
115 120 125
Arg Phe Tyr Val Ser Leu Asp Tyr Ala Tyr Gly Phe Phe Leu Lys Asn
130 135 140
Lys Gly Val Leu Gly Asp Ser Leu Arg Glu Ser Ser Gin He Pro Lys 145 150 155 160
Ser Tyr Arg Glu Lys Leu Gin Arg Lys Glu Thr Phe He Asn Ala He
165 170 175
Phe Tyr Gly Ala Gly Ala Asp Phe Leu Tyr Lys Arg Ala Phe Gly Thr
180 185 190
Leu He Leu Gly Met Asn Phe Val Gly Glu Thr Trp Phe Tyr Glu Thr
195 200 205
Lys He Phe Lys Lys Trp Ala Lys Asp Pro Leu Ser Val Tyr His Pro
210 215 220
Tyr Met Phe Gin Val Met Leu Asn Val Gly Tyr Arg Tyr Arg Phe Ser 225 230 235 240
Arg Tyr Lys Asn Trp Ala He Glu Leu Gly Ala Arg He Pro Phe Leu
245 250 255
Thr Asn Asp Tyr Phe Lys Thr Pro Leu Tyr Thr Leu His Phe Lys Arg
260 265 270
Asn He Ser Val Tyr Leu Thr Ser Thr Tyr Asp Phe 275 280
(2) INFORMATION FOR SEQ ID NO: 425:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1172 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 75...1106 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:425:
TAATCGTTAT TAAACATGCT ATATTTCTTT TTTCTATAAA ACTCAATATT ATTGAATAAA 60 ACTAGGGAGT TAGA ATG ATC TTA AAA CGA GTT ACT GAA GCT TTA GAA GCG 110 Met He Leu Lys Arg Val Thr Glu Ala Leu Glu Ala 1 5 10
TAT AAA AAT GGC GAA ATG CTC ATT GTT ATG GAC GAT GAA GAC AGA GAA 158 Tyr Lys Asn Gly Glu Met Leu He Val Met Asp Asp Glu Asp Arg Glu 15 20 25
AAT GAG GGG GAT TTG GTT TTA GCT GGG ATT TTT TCT ACC CCT GAG AAA 206 Asn Glu Gly Asp Leu Val Leu Ala Gly He Phe Ser Thr Pro Glu Lys 30 35 40
ATC AAT TTC ATG GCC ACG CAT GCT AGG GGG TTG ATT TGC GTG TCT TTG 254 He Asn Phe Met Ala Thr His Ala Arg Gly Leu He Cys Val Ser Leu 45 50 55 60
ACC AAA GAT TTA GCG AAA AAA TTT GAA TTA CCC CCT ATG GTT AGC GTG 302 Thr Lys Asp Leu Ala Lys Lys Phe Glu Leu Pro Pro Met Val Ser Val 65 70 75
AAT GAT TCT AAC CAT GAG ACC GCT TTC ACG GTT TCC ATT GAC GCT AAA 350 Asn Asp Ser Asn His Glu Thr Ala Phe Thr Val Ser He Asp Ala Lys 80 85 90
GAA GCC AGA ACC GGG ATT TCT GCT TTT GAA AGG CAT TTA ACG ATT GAA 398 Glu Ala Arg Thr Gly He Ser Ala Phe Glu Arg His Leu Thr He Glu 95 100 105
TTA TTG TGT AAA GAC ACC ACC AAA CCG AGC GAT TTT GTG CGC CCG GGG 446 Leu Leu Cys Lys Asp Thr Thr Lys Pro Ser Asp Phe Val Arg Pro Gly 110 115 120
CAT ATT TTC CCT TTG ATC GCC AAA GAC GGG GGC GTG TTA GCG CGC ACG 494 His He Phe Pro Leu He Ala Lys Asp Gly Gly Val Leu Ala Arg Thr 125 130 135 140
GGC CAT ACT GAA GCG AGC GTG GAT TTG TGC AAA TTA GCT GGA TTA AAG 542 Gly His Thr Glu Ala Ser Val Asp Leu Cys Lys Leu Ala Gly Leu Lys 145 150 155
CCC GTG AGC GTG ATT TGT GAA ATC ATG AAA GAA GAT GGC TCT ATG GCG 590 Pro Val Ser Val He Cys Glu He Met Lys Glu Asp Gly Ser Met Ala 160 165 170
AGA AGG GGG GAT AAA TTT TTG AGC GAT TTC GCC CTC AAA CAT AAC CTT 638 Arg Arg Gly Asp Lys Phe Leu Ser Asp Phe Ala Leu Lys His Asn Leu 175 180 185
AAA ACT CTC TAT GTC TCT GAT TTG ATT AGC TAT CGT TTG GAA AAT GAA 686 Lys Thr Leu Tyr Val Ser Asp Leu He Ser Tyr Arg Leu Glu Asn Glu 190 195 200
AGT TTG CTG AAA ATG TTT TGT CAA GAA GAA AGG GAA TTT TTA AAA CAC 734 Ser Leu Leu Lys Met Phe Cys Gin Glu Glu Arg Glu Phe Leu Lys His 205 210 215 220
CAA ACG CAA TGC TAC ACT TTT TTA GAT CAC CAG CAA AAA AAC CAT TAC 782 Gin Thr Gin Cys Tyr Thr Phe Leu Asp His Gin Gin Lys Asn His Tyr 225 230 235
GCT TTT AAG TTT AAA GGC GCA AAA ACC CAT GAT TTA GCC CCT TTA GTG 830 Ala Phe Lys Phe Lys Gly Ala Lys Thr His Asp Leu Ala Pro Leu Val 240 245 250 CGT TTC CAC CCT ATC AAA GAG GAT TTT GAT TTT TTA ACG ACT GAT GCG 878 Arg Phe His Pro He Lys Glu Asp Phe Asp Phe Leu Thr Thr Asp Ala 255 260 265
TTT GAA GTG TTT TTT AAA GCG TTA GAA TAT TTA AAG CAC GAA GGG GGC 926 Phe Glu Val Phe Phe Lys Ala Leu Glu Tyr Leu Lys His Glu Gly Gly 270 275 280
TAT TTG ATC TTT ATG AAC ACC CAT TCT AAA GAA AAC AAT GTC GTT AAA 974 Tyr Leu He Phe Met Asn Thr His Ser Lys Glu Asn Asn Val Val Lys 285 290 295 300
GAT TTT GGG ATC GGG GCG TTG GTG TTA AAA AAT TTG GGG ATA AAG GAT 1022 Asp Phe Gly He Gly Ala Leu Val Leu Lys Asn Leu Gly He Lys Asp 305 310 315
TTC AGG CTC TTA AGC TCT TGT GAA GAC AGG CAG TAT AAG GCT TTG AGC 1070 Phe Arg Leu Leu Ser Ser Cys Glu Asp Arg Gin Tyr Lys Ala Leu Ser 320 325 330
GGG TTT GGG CTT AAG CTT GTA GAA ACG ATT AGC CTT TAAGAGGCTC GTTAAG 1122 Gly Phe Gly Leu Lys Leu Val Glu Thr He Ser Leu 335 340
TTTTATTGAA TGTGTTGTAA TGTTTTTAAG GTATAATAAA CTCTTTTTAA 1172
(2) INFORMATION FOR SEQ ID NO: 426:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 344 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426:
Met He Leu Lys Arg Val Thr Glu Ala Leu Glu Ala Tyr Lys Asn Gly
1 5 10 15
Glu Met Leu He Val Met Asp Asp Glu Asp Arg Glu Asn Glu Gly Asp
20 25 30
Leu Val Leu Ala Gly He Phe Ser Thr Pro Glu Lys He Asn Phe Met
35 40 45
Ala Thr His Ala Arg Gly Leu He Cys Val Ser Leu Thr Lys Asp Leu
50 55 60
Ala Lys Lys Phe Glu Leu Pro Pro Met Val Ser Val Asn Asp Ser Asn 65 70 75 80
His Glu Thr Ala Phe Thr Val Ser He Asp Ala Lys Glu Ala Arg Thr
85 90 95
Gly He Ser Ala Phe Glu Arg His Leu Thr He Glu Leu Leu Cys Lys
100 105 110
Asp Thr Thr Lys Pro Ser Asp Phe Val Arg Pro Gly His He Phe Pro 115 120 125 Leu He Ala Lys Asp Gly Gly Val Leu Ala Arg Thr Gly His Thr Glu
130 135 140
Ala Ser Val Asp Leu Cys Lys Leu Ala Gly Leu Lys Pro Val Ser Val 145 150 155 160
He Cys Glu He Met Lys Glu Asp Gly Ser Met Ala Arg Arg Gly Asp
165 170 175
Lys Phe Leu Ser Asp Phe Ala Leu Lys His Asn Leu Lys Thr Leu Tyr
180 185 190
Val Ser Asp Leu He Ser Tyr Arg Leu Glu Asn Glu Ser Leu Leu Lys
195 200 205
Met Phe Cys Gin Glu Glu Arg Glu Phe Leu Lys His Gin Thr Gin Cys
210 215 220
Tyr Thr Phe Leu Asp His Gin Gin Lys Asn His Tyr Ala Phe Lys Phe 225 230 235 240
Lys Gly Ala Lys Thr His Asp Leu Ala Pro Leu Val Arg Phe His Pro
245 250 255
He Lys Glu Asp Phe Asp Phe Leu Thr Thr Asp Ala Phe Glu Val Phe
260 265 270
Phe Lys Ala Leu Glu Tyr Leu Lys His Glu Gly Gly Tyr Leu He Phe
275 280 285
Met Asn Thr His Ser Lys Glu Asn Asn Val Val Lys Asp Phe Gly He
290 295 300
Gly Ala Leu Val Leu Lys Asn Leu Gly He Lys Asp Phe Arg Leu Leu 305 310 315 320
Ser Ser Cys Glu Asp Arg Gin Tyr Lys Ala Leu Ser Gly Phe Gly Leu
325 330 335
Lys Leu Val Glu Thr He Ser Leu 340
(2) INFORMATION FOR SEQ ID NO: 427:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 394 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...341 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:427:
GATTTATCCT TGCATAAAAC AATCTCGCTT GGCGATAAAA AACGCTCTAA AAA CTT 56
Lys Leu 1
CAT TTT AAA GCG TTT CAC GCA CTT TTC TAT CCT AGC AAT AGA GAC AAT 104 His Phe Lys Ala Phe His Ala Leu Phe Tyr Pro Ser Asn Arg Asp Asn 5 10 15
ATC TAT GCC AAT CAT TTA AAA TTA TTG GAT AAT GAA ATC AGT GAA AAA 152 He Tyr Ala Asn His Leu Lys Leu Leu Asp Asn Glu He Ser Glu Lys 20 25 30
GAC ATT TTT AAT AAA GCC ATC AAT CAA AAA CGA ATT CAA ATG GCT CTT 200 Asp He Phe Asn Lys Ala He Asn Gin Lys Arg He Gin Met Ala Leu 35 40 45 50
AAT CTC ATC TTT AAG CTT GTT TTT GCC TTT GTT AGT AAC CAC TTC TTC 248 Asn Leu He Phe Lys Leu Val Phe Ala Phe Val Ser Asn His Phe Phe 55 60 65
CAC GCT TTT AGA CGA CAG AAT CTC TAT AAT CGT GTC TTT AAT CGC TGT 296 His Ala Phe Arg Arg Gin Asn Leu Tyr Asn Arg Val Phe Asn Arg Cys 70 75 80
GTC TTT AAC CTT GAC TTC ATT CAA AAG CTT TTC ATT ACT CAA TTC TAACG 346 Val Phe Asn Leu Asp Phe He Gin Lys Leu Phe He Thr Gin Phe 85 90 95
AAATAGAAGC CTTAAGGTAG CGTCTGCCAT TTTGAGAGAC CAGATTCA 394
(2) INFORMATION FOR SEQ ID NO: 428:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 97 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428:
Lys Leu His Phe Lys Ala Phe His Ala Leu Phe Tyr Pro Ser Asn Arg
1 5 10 15
Asp Asn He Tyr Ala Asn His Leu Lys Leu Leu Asp Asn Glu He Ser
20 25 30
Glu Lys Asp He Phe Asn Lys Ala He Asn Gin Lys Arg He Gin Met
35 40 45
Ala Leu Asn Leu He Phe Lys Leu Val Phe Ala Phe Val Ser Asn His
50 55 60
Phe Phe His Ala Phe Arg Arg Gin Asn Leu Tyr Asn Arg Val Phe Asn 65 70 75 80
Arg Cys Val Phe Asn Leu Asp Phe He Gin Lys Leu Phe He Thr Gin
85 90 95
Phe
(2) INFORMATION FOR SEQ ID NO: 429: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH : 360 base pairs
(B) TYPE : nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 82...270 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429:
ACTTAAAGGC ATAAAAACCT TAAGCTTTTT GAGTTTCAAA AGGGTTTCAA GCTTTTTATA 60 AGACTTTTTT TGAATGAGTA A GGA GAA AAT ATT TTG TTC CAT AAA CTG ATC 111
Gly Glu Asn He Leu Phe His Lys Leu He 1 5 10
TTA ACA TGC TTT TTA GCG CTT GTA GCA ATA ACC ATT CAA GCT TGC GGT 159 Leu Thr Cys Phe Leu Ala Leu Val Ala He Thr He Gin Ala Cys Gly 15 20 25
TAT AAA GCC CCT CCA TTC AAT GAA AAA CCC GCT AAA AAA ACT TCA AAC 207 Tyr Lys Ala Pro Pro Phe Asn Glu Lys Pro Ala Lys Lys Thr Ser Asn 30 35 40
AGC TCT AAT TCT TCT ATG CAA ACG CCC ACC AAC AGC ACC ACG CCA GAA 255 Ser Ser Asn Ser Ser Met Gin Thr Pro Thr Asn Ser Thr Thr Pro Glu 45 50 55
TTT TTA AAT CAG CCT TAAAATCACT GCTCTTGTTT AAGGGCTTTG ATTTCTAGGG T 311 Phe Leu Asn Gin Pro 60
TTTTGTGGCT AACTTTTGAN STTCGCTTTC ATCATGCGTT ACCATAATG 360
(2) INFORMATION FOR SEQ ID NO:430:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 63 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430:
Gly Glu Asn He Leu Phe His Lys Leu He Leu Thr Cys Phe Leu Ala 1 5 10 15 Leu Val Ala He Thr He Gin Ala Cys Gly Tyr Lys Ala Pro Pro Phe
20 25 30
Asn Glu Lys Pro Ala Lys Lys Thr Ser Asn Ser Ser Asn Ser Ser Met
35 40 45
Gin Thr Pro Thr Asn Ser Thr Thr Pro Glu Phe Leu Asn Gin Pro 50 55 60
(2) INFORMATION FOR SEQ ID NO: 431:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 445 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...392 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431:
ATTTACAAAG CGTGTTGGAT ACCCCCAAGA TGATTCGTTT GGAAAATTGA ATG CGC 56
Met Arg 1
TTT TTG AAC AAC AAA CAT AGA GAA AAG GGC TTA AAG GCT GAA GAA GAA 104 Phe Leu Asn Asn Lys His Arg Glu Lys Gly Leu Lys Ala Glu Glu Glu 5 10 15
GCT TGC GGA TTT TTA AAA TCG TTA GGT TTT GAA ATG GTG GAG AGG AAC 152 Ala Cys Gly Phe Leu Lys Ser Leu Gly Phe Glu Met Val Glu Arg Asn 20 25 30
TTT TTT TCA CAA TTT GGC GAA ATT GAT ATT ATC GCT TTG AAA AAA GGG 200 Phe Phe Ser Gin Phe Gly Glu He Asp He He Ala Leu Lys Lys Gly 35 40 45 50
GTT TTG CAT TTC ATT GAA GTC AAA AGC GGG GAA AAT TTT GAT CCC ATT 248 Val Leu His Phe He Glu Val Lys Ser Gly Glu Asn Phe Asp Pro He 55 60 65
TAT GCG ATC ACG CCG AGC AAA TTA AAA AAG ATG ATT AAA ACG ATC CGC 296 Tyr Ala He Thr Pro Ser Lys Leu Lys Lys Met He Lys Thr He Arg 70 75 80
TGT TAT TTG TCC CAA AAA GAT CCC AAT AGC GAT TTT TGC ATA GAC GCT 344 Cys Tyr Leu Ser Gin Lys Asp Pro Asn Ser Asp Phe Cys He Asp Ala 85 90 95 CTT ATT GTG AAA AAT GGT AAA TTT GAG CTT TTA GAA AAT ATC ACT TTT T 393 Leu He Val Lys Asn Gly Lys Phe Glu Leu Leu Glu Asn He Thr Phe 100 105 110
AGATTTTTAC AGAAAGTAAA TGCGATTTCA TTAACATTCT TAAGCTAATA TA 445
(2) INFORMATION FOR SEQ ID NO:432:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 114 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:432:
Met Arg Phe Leu Asn Asn Lys His Arg Glu Lys Gly Leu Lys Ala Glu
1 5 10 15
Glu Glu Ala Cys Gly Phe Leu Lys Ser Leu Gly Phe Glu Met Val Glu
20 25 30
Arg Asn Phe Phe Ser Gin Phe Gly Glu He Asp He He Ala Leu Lys
35 40 45
Lys Gly Val Leu His Phe He Glu Val Lys Ser Gly Glu Asn Phe Asp
50 55 60
Pro He Tyr Ala He Thr Pro Ser Lys Leu Lys Lys Met He Lys Thr 65 70 75 80
He Arg Cys Tyr Leu Ser Gin Lys Asp Pro Asn Ser Asp Phe Cys He
85 90 95
Asp Ala Leu He Val Lys Asn Gly Lys Phe Glu Leu Leu Glu Asn He
100 105 110
Thr Phe
(2) INFORMATION FOR SEQ ID NO: 433
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 831 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic RNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 86...763 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:433: CGTTAAAAAG CTTGGTTTAA TGAGTTTTTA AAACTTAATT GCTACAATTA AGGAAATTTT 60 CAATAAGGCT TAGAGAAACT TTTTC ATG GAA CAC AGA GTA TTT ACT ATT GCT 112
Met Glu His Arg Val Phe Thr He Ala 1 5
AAT TTT TTT AGC TCC AAT CAT GAT TTT ATC ACC GGG TTT TTT GTC GTT 160 Asn Phe Phe Ser Ser Asn His Asp Phe He Thr Gly Phe Phe Val Val 10 15 20 25
TTG ACA GCG GTT TTG ATG TTT TTA ATC TCG CTT GGC GCG TCG CGC AAA 208 Leu Thr Ala Val Leu Met Phe Leu He Ser Leu Gly Ala Ser Arg Lys 30 35 40
ATG CAG ATG GTA CCT ATG GGT TTG CAG AAT GTG TAT GAG AGC ATC ATT 256 Met Gin Met Val Pro Met Gly Leu Gin Asn Val Tyr Glu Ser He He 45 50 55
AGC GCG ATT TTG AGC GTG GCT AAG GAT ATT ATA GGC GAA GAA TTA GCC 304 Ser Ala He Leu Ser Val Ala Lys Asp He He Gly Glu Glu Leu Ala 60 65 70
CGC AAA TAC TTC CCC CTA GCT GGC ACG ATC GCT TTG TAT GTC TTT TTT 352 Arg Lys Tyr Phe Pro Leu Ala Gly Thr He Ala Leu Tyr Val Phe Phe 75 80 85
TCT AAC ATG ATA GGC ATC ATT CCT GGC TTT GAA TCC CCT ACG GCT AGC 400 Ser Asn Met He Gly He He Pro Gly Phe Glu Ser Pro Thr Ala Ser 90 95 100 105
TGG AGC TTT ACG CTG GTT TTA GCG CTG ATT GTG TTT TTT TAT TAC CAT 448 Trp Ser Phe Thr Leu Val Leu Ala Leu He Val Phe Phe Tyr Tyr His 110 115 120
TTT GAA GGC ATT AGA GTG CAG GGC TTT TTT AAG TAT TTC GCT CAT TTT 496 Phe Glu Gly He Arg Val Gin Gly Phe Phe Lys Tyr Phe Ala His Phe 125 130 135
GCA GGT CCT GTG AAA TGG CTC GCC CCT TTC ATG TTC CCT ATT GAG ATC 544 Ala Gly Pro Val Lys Trp Leu Ala Pro Phe Met Phe Pro He Glu He 140 145 150
ATC TCG CAT TTT TCT AGG ATC GTG TCT TTA TCG TTT CGT TTG TTT GGG 592 He Ser His Phe Ser Arg He Val Ser Leu Ser Phe Arg Leu Phe Gly 155 160 165
AAT ATC AAG GGC GAT GAC ATG TTC TTG CTC ATC ATG CTT TTA TTA GTG 640 Asn He Lys Gly Asp Asp Met Phe Leu Leu He Met Leu Leu Leu Val 170 175 180 185
CCT TGG GCG GTT CCT GTA GCG CCT TTT ATG GTG TTG TTT TTC ATG GGG 688 Pro Trp Ala Val Pro Val Ala Pro Phe Met Val Leu Phe Phe Met Gly 190 195 200
ATT TTA CAA GCT TTT GTT TTT ATG ATC CTC ACT TAT GTG TAT TTG GCA 736 He Leu Gin Ala Phe Val Phe Met He Leu Thr Tyr Val Tyr Leu Ala 205 210 215
GGG GCT GTT TTA ACC GAT GAA GGG CAT TAAGCAATAA CATTCTTGTT TGGCTTT 790 Gly Ala Val Leu Thr Asp Glu Gly His 220 225
AATATTGTTT TTTAAAACTT TGTTTTATGG TAAAGCTTTT A 831
(2) INFORMATION FOR SEQ ID NO:434:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 226 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434:
Met Glu His Arg Val Phe Thr He Ala Asn Phe Phe Ser Ser Asn His
1 5 10 15
Asp Phe He Thr Gly Phe Phe Val Val Leu Thr Ala Val Leu Met Phe
20 25 30
Leu He Ser Leu Gly Ala Ser Arg Lys Met Gin Met Val Pro Met Gly
35 40 45
Leu Gin Asn Val Tyr Glu Ser He He Ser Ala He Leu Ser Val Ala
50 55 60
Lys Asp He He Gly Glu Glu Leu Ala Arg Lys Tyr Phe Pro Leu Ala 65 70 75 80
Gly Thr He Ala Leu Tyr Val Phe Phe Ser Asn Met He Gly He He
85 90 95
Pro Gly Phe Glu Ser Pro Thr Ala Ser Trp Ser Phe Thr Leu Val Leu
100 105 110
Ala Leu He Val Phe Phe Tyr Tyr His Phe Glu Gly He Arg Val Gin
115 120 125
Gly Phe Phe Lys Tyr Phe Ala His Phe Ala Gly Pro Val Lys Trp Leu
130 135 140
Ala Pro Phe Met Phe Pro He Glu He He Ser His Phe Ser Arg He 145 150 155 160
Val Ser Leu Ser Phe Arg Leu Phe Gly Asn He Lys Gly Asp Asp Met
165 170 175
Phe Leu Leu He Met Leu Leu Leu Val Pro Trp Ala Val Pro Val Ala
180 185 190
Pro Phe Met Val Leu Phe Phe Met Gly He Leu Gin Ala Phe Val Phe
195 200 205
Met He Leu Thr Tyr Val Tyr Leu Ala Gly Ala Val Leu Thr Asp Glu
210 215 220
Gly His 225
(2) INFORMATION FOR SEQ ID NO:435:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH : 787 base pairs
(B) TYPE : nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...734 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:435:
AGCCATAATA ACAACCAATG GCTAGAGCCT GATGATTTGT TGAAATTATT ATG AAT 56
Met Asn
1
TTT TTA GAA GAT TTG TTT TAC CCC TTA AGA TTG TTA GAA AAC AAG CGT 104 Phe Leu Glu Asp Leu Phe Tyr Pro Leu Arg Leu Leu Glu Asn Lys Arg 5 10 15
GTT TTA TTG CTC GTG AGC GGT TCT ATT GCA GCG TAT AAA TCC CTA GAA 152 Val Leu Leu Leu Val Ser Gly Ser He Ala Ala Tyr Lys Ser Leu Glu 20 25 30
TTA GTG CGC TTG TTG TTT AAA AGC GGG GCT AGT ATC CAA GTG GTG ATG 200 Leu Val Arg Leu Leu Phe Lys Ser Gly Ala Ser He Gin Val Val Met 35 40 45 50
AGT AAG GGT GCG AAA AAA TTC ATC AAA CCC TTA AGT TTT GAA GCT TTG 248 Ser Lys Gly Ala Lys Lys Phe He Lys Pro Leu Ser Phe Glu Ala Leu 55 60 65
AGC CAC CAT AAA GTC TTG CAT GAT CGT AAT GAA AAA TGG TAT TAC AAC 296 Ser His His Lys Val Leu His Asp Arg Asn Glu Lys Trp Tyr Tyr Asn 70 75 80
CAC CAA AAC GCC TTA CAC CAT AAC CAC ATC GCA TGC GCT GCT AAT GCT 344 His Gin Asn Ala Leu His His Asn His He Ala Cys Ala Ala Asn Ala 85 90 95
GAT TTG CTC ATC TTT GCC CCT TTA AGC ACT AAC AGC TTG TCT AAA ATC 392 Asp Leu Leu He Phe Ala Pro Leu Ser Thr Asn Ser Leu Ser Lys He 100 105 110
GCT CAC GCT TTA GCG GAT AAT ATC GTA AGC GCG ACT TTT TTA GCT TGC 440 Ala His Ala Leu Ala Asp Asn He Val Ser Ala Thr Phe Leu Ala Cys 115 120 125 130
GCT TCC CCT AAA ATC CTA GCC CCT AGC ATG AAC ACT AAC ATG CTC AAT 488 Ala Ser Pro Lys He Leu Ala Pro Ser Met Asn Thr Asn Met Leu Asn 135 140 145
TCC CCT ATC ACT CAA AGT AAT TTA AAA CGC TTG AAA GAT TCC AAC CAC 536 Ser Pro He Thr Gin Ser Asn Leu Lys Arg Leu Lys Asp Ser Asn His 150 155 160
ATT ATT TTA GAC ACC AAA AAC GCC CTT TTA GCA TGC GAC ACT AAA GGC 584 He He Leu Asp Thr Lys Asn Ala Leu Leu Ala Cys Asp Thr Lys Gly 165 170 175
GAT GGG GCG ATG GCT GAG CCT TTA GAA ATC CTT TTT AAA GCC GCT CAA 632 Asp Gly Ala Met Ala Glu Pro Leu Glu He Leu Phe Lys Ala Ala Gin 180 185 190
ACG CTC CTA AAA GAC GCT TAT TTT GAA AAC AGA GAA GTC ATA GTC ATG 680 Thr Leu Leu Lys Asp Ala Tyr Phe Glu Asn Arg Glu Val He Val Met 195 200 205 210
GGC GGC GCG AGT ATA GAA AAG ATT GAC AGC GTT CGA ACG ATT AGC AAT 728 Gly Gly Ala Ser He Glu Lys He Asp Ser Val Arg Thr He Ser Asn 215 220 225
ACT TTC TAGCGGGATT CAAGCGAGCG CTTTAGCTTT GGCGTTATAT TTTAAGGGAG CC 786 Thr Phe
787
(2) INFORMATION FOR SEQ ID NO: 436:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 228 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436:
Met Asn Phe Leu Glu Asp Leu Phe Tyr Pro Leu Arg Leu Leu Glu Asn
1 5 10 15
Lys Arg Val Leu Leu Leu Val Ser Gly Ser He Ala Ala Tyr Lys Ser
20 25 30
Leu Glu Leu Val Arg Leu Leu Phe Lys Ser Gly Ala Ser He Gin Val
35 40 45
Val Met Ser Lys Gly Ala Lys Lys Phe He Lys Pro Leu Ser Phe Glu
50 55 60
Ala Leu Ser His His Lys Val Leu His Asp Arg Asn Glu Lys Trp Tyr 65 70 75 80
Tyr Asn His Gin Asn Ala Leu His His Asn His He Ala Cys Ala Ala
85 90 95
Asn Ala Asp Leu Leu He Phe Ala Pro Leu Ser Thr Asn Ser Leu Ser 100 105 110 Lys He Ala His Ala Leu Ala Asp Asn He Val Ser Ala Thr Phe Leu
115 120 125
Ala Cys Ala Ser Pro Lys He Leu Ala Pro Ser Met Asn Thr Asn Met
130 135 140
Leu Asn Ser Pro He Thr Gin Ser Asn Leu Lys Arg Leu Lys Asp Ser 145 150 155 160
Asn His He He Leu Asp Thr Lys Asn Ala Leu Leu Ala Cys Asp Thr
165 170 175
Lys Gly Asp Gly Ala Met Ala Glu Pro Leu Glu He Leu Phe Lys Ala
180 185 190
Ala Gin Thr Leu Leu Lys Asp Ala Tyr Phe Glu Asn Arg Glu Val He
195 200 205
Val Met Gly Gly Ala Ser He Glu Lys He Asp Ser Val Arg Thr He
210 215 220
Ser Asn Thr Phe 225
(2) INFORMATION FOR SEQ ID NO: 437:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1078 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 71...1009 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:437:
TCCCAAGCCC TAAAGGCGCT TCAATCATTG ATTTTAAAGG CAGTTATGAA GAGTATTTGG 60
CGAGCAAAAA ATG AAA CCG CAA GAC ATT GAA ATC GTT CAA AGC GTT TTA 109
Met Lys Pro Gin Asp He Glu He Val Gin Ser Val Leu 1 5 10
GAG ATT ACA GGA CCG ATT AAG CCT ACT GAA GTG TAT GAT AAA GCC AAA 157 Glu He Thr Gly Pro He Lys Pro Thr Glu Val Tyr Asp Lys Ala Lys 15 20 25
GAG CTT TTT GAA AAA GGT GAG ATT ACA AAC ATG TTT GAT TGT GGG GGC 205 Glu Leu Phe Glu Lys Gly Glu He Thr Asn Met Phe Asp Cys Gly Gly 30 35 40 45
AAA ACC CCG CAC CAG AGC GTT AGT TCT TAT ATT TAT ACA GCC TTA AAC 253 Lys Thr Pro His Gin Ser Val Ser Ser Tyr He Tyr Thr Ala Leu Asn 50 55 60
AAG GGC GAA GAA CTG CCT TTT AAA AAA GTG CAA GAA AAC CCA ACC TTA 301 Lys Gly Glu Glu Leu Pro Phe Lys Lys Val Gin Glu Asn Pro Thr Leu 65 70 75
ATC GCT TTA AAA GAC GCG GCT AAA GAG CTA GGT TTA GAC GCT CAA AAA 349 He Ala Leu Lys Asp Ala Ala Lys Glu Leu Gly Leu Asp Ala Gin Lys 80 85 90
ATA AGC GCT CCA AGC TCT AAA ATC GCG CAT GAA AGG GAT TTG CAC CCC 397 He Ser Ala Pro Ser Ser Lys He Ala His Glu Arg Asp Leu His Pro 95 100 105
TTT TTA ACC TAC ATG GCT ATT AAT AAC GAA AAT TTG AAA TGC TAC ACG 445 Phe Leu Thr Tyr Met Ala He Asn Asn Glu Asn Leu Lys Cys Tyr Thr 110 115 120 125
AAA ACC ATT TTT CAT GAA GAG AGT TCA AAA TCA ATA AAA GGC ATG GAC 493 Lys Thr He Phe His Glu Glu Ser Ser Lys Ser He Lys Gly Met Asp 130 135 140
AGG TGG CTT TAT CCG GAC ATG GTG GGG GTT AGG TTT TTG CAC GCT GAA 541 Arg Trp Leu Tyr Pro Asp Met Val Gly Val Arg Phe Leu His Ala Glu 145 150 155
TTA TCT AAT GAA AAT TTA ATC GCT TTT TCT AAG AAA TTT GAC ACT TTA 589 Leu Ser Asn Glu Asn Leu He Ala Phe Ser Lys Lys Phe Asp Thr Leu 160 165 170
CCC ATT AAA CTG GTG AGC TTT GAA TTG AAA AAA GAA ATC AGC GTG CAT 637 Pro He Lys Leu Val Ser Phe Glu Leu Lys Lys Glu He Ser Val His 175 180 185
AAT TGC AGG GAG TGT TAC TTT CAA GCG ATT TCC AAC AGC TCG TGG GCT 685 Asn Cys Arg Glu Cys Tyr Phe Gin Ala He Ser Asn Ser Ser Trp Ala 190 195 200 205
AAT GAA GGG TAT TTA GTG GGC CGT CAT ATT GAT ACG CAC AAT CCT CAA 733 Asn Glu Gly Tyr Leu Val Gly Arg His He Asp Thr His Asn Pro Gin 210 215 220
CTC ATG GAT TTG TTG AAG CGT TTG CAT GCG AGT TTT GGG ATT GGC GTG 781 Leu Met Asp Leu Leu Lys Arg Leu His Ala Ser Phe Gly He Gly Val 225 230 235
ATT GAT TTA AGA ACT AAT GAG GAT AAA AGC GCT ATT TTA TTG AAC GCT 829 He Asp Leu Arg Thr Asn Glu Asp Lys Ser Ala He Leu Leu Asn Ala 240 245 250
AAA TAC AAA GAA AAG ATT GAT TAC ACC GTG GCT TCA GAG CTT AGC GCG 877 Lys Tyr Lys Glu Lys He Asp Tyr Thr Val Ala Ser Glu Leu Ser Ala 255 260 265
AAA AAT GAA AAA TTC AGC GGT TTT TTA AAG AGC GTT GTG GAT TAT GAC 925 Lys Asn Glu Lys Phe Ser Gly Phe Leu Lys Ser Val Val Asp Tyr Asp 270 275 280 285 CCA AAC CAC CCA CAA CGC TAT AAA GAT GAA TTT GAT GAG GTT AAA AAG 973 Pro Asn His Pro Gin Arg Tyr Lys Asp Glu Phe Asp Glu Val Lys Lys 290 295 300
AAA GAG GAG TTA TAC CCT AAC CCA TCG CTT TCT TTT TAAAAATGAG ATTTTA 1025 Lys Glu Glu Leu Tyr Pro Asn Pro Ser Leu Ser Phe 305 310
AAAAACGCTT TAAGTGTTTT TGTAAAAAAT AGCAAAGAGC TTGATTTTAA TCA 1078
(2) INFORMATION FOR SEQ ID NO: 438:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 313 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438:
Met Lys Pro Gin Asp He Glu He Val Gin Ser Val Leu Glu He Thr
1 5 10 15
Gly Pro He Lys Pro Thr Glu Val Tyr Asp Lys Ala Lys Glu Leu Phe
20 25 30
Glu Lys Gly Glu He Thr Asn Met Phe Asp Cys Gly Gly Lys Thr Pro
35 40 45
His Gin Ser Val Ser Ser Tyr He Tyr Thr Ala Leu Asn Lys Gly Glu
50 55 60
Glu Leu Pro Phe Lys Lys Val Gin Glu Asn Pro Thr Leu He Ala Leu 65 70 75 80
Lys Asp Ala Ala Lys Glu Leu Gly Leu Asp Ala Gin Lys He Ser Ala
85 90 95
Pro Ser Ser Lys He Ala His Glu Arg Asp Leu His Pro Phe Leu Thr
100 105 110
Tyr Met Ala He Asn Asn Glu Asn Leu Lys Cys Tyr Thr Lys Thr He
115 120 125
Phe His Glu Glu Ser Ser Lys Ser He Lys Gly Met Asp Arg Trp Leu
130 135 140
Tyr Pro Asp Met Val Gly Val Arg Phe Leu His Ala Glu Leu Ser Asn 145 150 155 160
Glu Asn Leu He Ala Phe Ser Lys Lys Phe Asp Thr Leu Pro He Lys
165 170 175
Leu Val Ser Phe Glu Leu Lys Lys Glu He Ser Val His Asn Cys Arg
180 185 190
Glu Cys Tyr Phe Gin Ala He Ser Asn Ser Ser Trp Ala Asn Glu Gly
195 200 205
Tyr Leu Val Gly Arg His He Asp Thr His Asn Pro Gin Leu Met Asp
210 215 220
Leu Leu Lys Arg Leu His Ala Ser Phe Gly He Gly Val He Asp Leu 225 230 235 240
Arg Thr Asn Glu Asp Lys Ser Ala He Leu Leu Asn Ala Lys Tyr Lys 245 250 255 Glu Lys He Asp Tyr Thr Val Ala Ser Glu Leu Ser Ala Lys Asn Glu
260 265 270
Lys Phe Ser Gly Phe Leu Lys Ser Val Val Asp Tyr Asp Pro Asn His
275 280 285
Pro Gin Arg Tyr Lys Asp Glu Phe Asp Glu Val Lys Lys Lys Glu Glu
290 295 300
Leu Tyr Pro Asn Pro Ser Leu Ser Phe 305 310
(2) INFORMATION FOR SEQ ID NO: 439:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 444 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 112...375 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439:
AACTCGCCTT AGGCGCGATG GCGTTTAAGG AAGTCAAGCT TTTTGAATTT GGCGAGCAAT 60 TAAGGGATTT TGTACTATAT TGAAGATGTG ATCCAAGCGA GCGTGAAAGC G ATG AAG 117
Met Lys 1
GCT CAA AAA AGC GGG GTT TAT AAT GTG GGT TAT TCC CAA GCC AGA AGT 165 Ala Gin Lys Ser Gly Val Tyr Asn Val Gly Tyr Ser Gin Ala Arg Ser 5 10 15
TAT AAT GAA ATC GTT AGC ATT TTA AAA GAG CAT TTA GGG GAT TTT AAA 213 Tyr Asn Glu He Val Ser He Leu Lys Glu His Leu Gly Asp Phe Lys 20 25 30
GTG AGT TAT ATC AAA AAC CCT TAT GCT TTC TTC CAA AAG CAC ACC CAA 261 Val Ser Tyr He Lys Asn Pro Tyr Ala Phe Phe Gin Lys His Thr Gin 35 40 45 50
GCA CAC ATT GAG CCT GCT ATT TTG GAT TTG GAT TAC ACC CCT TTA TAC 309 Ala His He Glu Pro Ala He Leu Asp Leu Asp Tyr Thr Pro Leu Tyr 55 60 65
GAT TTG GAA AGC GGC ATT AAA GAT TAT TTG CCC CAT ATC CAT GCG ATT 357 Asp Leu Glu Ser Gly He Lys Asp Tyr Leu Pro His He His Ala He 70 75 80
TTT AAA GGA CAG TGC GCA TGAAAAAAAT CTTAGTCATA GGCGATCTGA TCGCTGAT 413 Phe Lys Gly Gin Cys Ala 85
TATTATTTGT GGGGGAAGAG CGAACGGCTT T 444
(2) INFORMATION FOR SEQ ID NO: 440:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 88 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440:
Met Lys Ala Gin Lys Ser Gly Val Tyr Asn Val Gly Tyr Ser Gin Ala
1 5 10 15
Arg Ser Tyr Asn Glu He Val Ser He Leu Lys Glu His Leu Gly Asp
20 25 30
Phe Lys Val Ser Tyr He Lys Asn Pro Tyr Ala Phe Phe Gin Lys His
35 40 45
Thr Gin Ala His He Glu Pro Ala He Leu Asp Leu Asp Tyr Thr Pro
50 55 60
Leu Tyr Asp Leu Glu Ser Gly He Lys Asp Tyr Leu Pro His He His 65 70 75 80
Ala He Phe Lys Gly Gin Cys Ala 85
(2) INFORMATION FOR SEQ ID NO: 441:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 822 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 153...728 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441:
CAGCATGCCG TTGTTTTTAA TGGGGATTTT TTTAAGCAAA ATTTCCGTTT CTTACAGGAA 60
ATTTTTCAAT CTTTTGTCTA AAATTTTAAT GGGGGTTTTT GGGCTTTATA TCCTTTATAT 120
GGGGATCATG CTCATTAACC ACAAAATGCC TC ATG CCA TGC ATC ATC AAA ACA 173
Met Pro Cys He He Lys Thr ACA CCA CTC AGC ATG ATC ATA AAG GAG TGC ATT CGC ATG AAC ACT AAC 221 Thr Pro Leu Ser Met He He Lys Glu Cys He Arg Met Asn Thr Asn 10 15 20
AAA GCC CTT TTT TTG GAC AGA GAC GGC ATT ATC AAT ATT GAT AAA GGC 269 Lys Ala Leu Phe Leu Asp Arg Asp Gly He He Asn He Asp Lys Gly 25 30 35
TAT GTG AGT CAA AAA GAA GAT TTT GAG TTT CAA AAA GGG ATT TTT GAA 317 Tyr Val Ser Gin Lys Glu Asp Phe Glu Phe Gin Lys Gly He Phe Glu 40 45 50 55
TTG CTA AAG CAT GCG AAA TCT TTA GGC TAC AAA CTG CTT TTA ATC ACC 365 Leu Leu Lys His Ala Lys Ser Leu Gly Tyr Lys Leu Leu Leu He Thr 60 65 70
AAC CAA TCT GGG ATC AAC CGA GGC TAT TAC ACC CTT AAA GAT TTT GAA 413 Asn Gin Ser Gly He Asn Arg Gly Tyr Tyr Thr Leu Lys Asp Phe Glu 75 80 85
CAA CTC ACC CAA TAC CTC CAA GAA AGC TTG TTC AAA GAA TTA GGT TTT 461 Gin Leu Thr Gin Tyr Leu Gin Glu Ser Leu Phe Lys Glu Leu Gly Phe 90 95 100
AAT CTG GAT GGC ATC TAT TTT TGC AGG CAC GCC CCA GAA GAA AAT TGC 509 Asn Leu Asp Gly He Tyr Phe Cys Arg His Ala Pro Glu Glu Asn Cys 105 110 115
GCT TGC AGG AAG CCA AAG CCT TCT TTG ATT TTG CAA GCT GCT AAA GAG 557 Ala Cys Arg Lys Pro Lys Pro Ser Leu He Leu Gin Ala Ala Lys Glu 120 125 130 135
CAT CAA ATT TGC TTG GAG CAA TCT TTT ATG ATA GGC GAT AAA GAG AGC 605 His Gin He Cys Leu Glu Gin Ser Phe Met He Gly Asp Lys Glu Ser 140 145 150
GAC ATG TTA GCC GGC TTG AAC GCT AAA GTT AAA AAT AAC CTT TTG CTC 653 Asp Met Leu Ala Gly Leu Asn Ala Lys Val Lys Asn Asn Leu Leu Leu 155 160 165
ATT CAA AAC CCT TTA AAA ACT CCT CAT TCT TGG ATA CAA TGT AAA GAT 701 He Gin Asn Pro Leu Lys Thr Pro His Ser Trp He Gin Cys Lys Asp 170 175 180
TTT AAA GAG ATG ATA GAT CTA ATC AAA TAAGGACAAG AATGCGTTAT ATTGATG 755 Phe Lys Glu Met He Asp Leu He Lys 185 190
ATGAATTAGA AAATCAAACG ATTTTAATCA CCGGTGGGGC TGGCTTTGTA GGCAGTAATC 815 TAGCCTT 822
(2) INFORMATION FOR SEQ ID NO: 442: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 192 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:442:
Met Pro Cys He He Lys Thr Thr Pro Leu Ser Met He He Lys Glu
1 5 10 15
Cys He Arg Met Asn Thr Asn Lys Ala Leu Phe Leu Asp Arg Asp Gly
20 25 30
He He Asn He Asp Lys Gly Tyr Val Ser Gin Lys Glu Asp Phe Glu
35 40 45
Phe Gin Lys Gly He Phe Glu Leu Leu Lys His Ala Lys Ser Leu Gly
50 55 60
Tyr Lys Leu Leu Leu He Thr Asn Gin Ser Gly He Asn Arg Gly Tyr 65 70 75 80
Tyr Thr Leu Lys Asp Phe Glu Gin Leu Thr Gin Tyr Leu Gin Glu Ser
85 90 95
Leu Phe Lys Glu Leu Gly Phe Asn Leu Asp Gly He Tyr Phe Cys Arg
100 105 110
His Ala Pro Glu Glu Asn Cys Ala Cys Arg Lys Pro Lys Pro Ser Leu
115 120 125
He Leu Gin Ala Ala Lys Glu His Gin He Cys Leu Glu Gin Ser Phe
130 135 140
Met He Gly Asp Lys Glu Ser Asp Met Leu Ala Gly Leu Asn Ala Lys 145 150 155 160
Val Lys Asn Asn Leu Leu Leu He Gin Asn Pro Leu Lys Thr Pro His
165 170 175
Ser Trp He Gin Cys Lys Asp Phe Lys Glu Met He Asp Leu He Lys 180 185 190
(2) INFORMATION FOR SEQ ID NO: 443:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 831 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 88...756 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:443: AACAAAGAGC GAGAGAGCAT CAAGAAAGAG ATGAAAAAGA GCTTGAAGAA AGAAGAAAAG 60 CTTTAGAAAT GAATAAGAAG TAGGCCT ATG CCA GCT AGG CAA TCT TTT ACA GAT 114
Met Pro Ala Arg Gin Ser Phe Thr Asp 1 5
TTG AAA AAC CTG GTT TTG TGC GAT ATA GGC AAC ACG CGT ATC CAT TTT 162 Leu Lys Asn Leu Val Leu Cys Asp He Gly Asn Thr Arg He His Phe 10 15 20 25
GCA CAA AAC TAT CAG CTC TTT TCA AGC GCT AAA GAA GAT TTA AAG CGT 210 Ala Gin Asn Tyr Gin Leu Phe Ser Ser Ala Lys Glu Asp Leu Lys Arg 30 35 40
TTG GGT ATT CAA AAG GAA ATT TTT TAC ATT AGC GTG AAT GAA GAA AAT 258 Leu Gly He Gin Lys Glu He Phe Tyr He Ser Val Asn Glu Glu Asn 45 50 55
GAA AAA GCC CTT TTG AAT TGT TAC CCT AAC GCT AAA AAT ATT GCA GGG 306 Glu Lys Ala Leu Leu Asn Cys Tyr Pro Asn Ala Lys Asn He Ala Gly 60 65 70
TTT TTT CAT TTA GAA ACC GAC TAT GTA GGG CTT GGG ATA GAC CGG CAA 354 Phe Phe His Leu Glu Thr Asp Tyr Val Gly Leu Gly He Asp Arg Gin 75 80 85
ATG GCG TGT CTG GCG GTA AAT AAT GGC GTG GTG GTG GAT GCC GGG AGT 402 Met Ala Cys Leu Ala Val Asn Asn Gly Val Val Val Asp Ala Gly Ser 90 95 100 105
GCG ATT ACG ATA GAT TTA ATC AAA GAG GGC AAG CAT TTA GGA GGG TGT 450 Ala He Thr He Asp Leu He Lys Glu Gly Lys His Leu Gly Gly Cys 110 115 120
ATT TTA CCC GGT TTA GCC CAA TAT ATT CAT GCG TAT AAA AAA AGC GCT 498 He Leu Pro Gly Leu Ala Gin Tyr He His Ala Tyr Lys Lys Ser Ala 125 130 135
AAA ATT TTA GAG CAA CCT TTC AAG GCC TTA GAT TCT TTA GAA GTT TTA 546 Lys He Leu Glu Gin Pro Phe Lys Ala Leu Asp Ser Leu Glu Val Leu 140 145 150
CCT AAA AGC ACT AGA GAC GCT GTG AAT TAC GGC ATG GTT TTG AGC GTC 594 Pro Lys Ser Thr Arg Asp Ala Val Asn Tyr Gly Met Val Leu Ser Val 155 160 165
ATT GCT TGT ATC CAG CAT TTA GCC AAA AAT CAA AAA ATC TAT CTT TGT 642 He Ala Cys He Gin His Leu Ala Lys Asn Gin Lys He Tyr Leu Cys 170 175 180 185
GGG GGC GAT GCG AAG TAT TTG AGC GCG TTT TTA CCC CAT TCT GTT TGC 690 Gly Gly Asp Ala Lys Tyr Leu Ser Ala Phe Leu Pro His Ser Val Cys 190 195 200
AAG GAG CGT TTG GTT TTT GAC GGG ATG GAA ATC GCT CTT AAA AAA GCA 738 Lys Glu Arg Leu Val Phe Asp Gly Met Glu He Ala Leu Lys Lys Ala 205 210 215
GGG ATA CTA GAA TGC AAA TGATGCACAA TTTGAGTTTT TTGGGCATGT TTTTAGCC 794 Gly He Leu Glu Cys Lys 220
GCTTTGAGCA TGTCTTTAGG GCATTGTGTG GGCATGT 831
(2) INFORMATION FOR SEQ ID NO:444:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 223 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444:
Met Pro Ala Arg Gin Ser Phe Thr Asp Leu Lys Asn Leu Val Leu Cys
1 5 10 15
Asp He Gly Asn Thr Arg He His Phe Ala Gin Asn Tyr Gin Leu Phe
20 25 30
Ser Ser Ala Lys Glu Asp Leu Lys Arg Leu Gly He Gin Lys Glu He
35 40 45
Phe Tyr He Ser Val Asn Glu Glu Asn Glu Lys Ala Leu Leu Asn Cys
50 55 60
Tyr Pro Asn Ala Lys Asn He Ala Gly Phe Phe His Leu Glu Thr Asp 65 70 75 80
Tyr Val Gly Leu Gly He Asp Arg Gin Met Ala Cys Leu Ala Val Asn
85 90 95
Asn Gly Val Val Val Asp Ala Gly Ser Ala He Thr He Asp Leu He
100 105 110
Lys Glu Gly Lys His Leu Gly Gly Cys He Leu Pro Gly Leu Ala Gin
115 120 125
Tyr He His Ala Tyr Lys Lys Ser Ala Lys He Leu Glu Gin Pro Phe
130 135 140
Lys Ala Leu Asp Ser Leu Glu Val Leu Pro Lys Ser Thr Arg Asp Ala 145 150 155 160
Val Asn Tyr Gly Met Val Leu Ser Val He Ala Cys He Gin His Leu
165 170 175
Ala Lys Asn Gin Lys He Tyr Leu Cys Gly Gly Asp Ala Lys Tyr Leu
180 185 190
Ser Ala Phe Leu Pro His Ser Val Cys Lys Glu Arg Leu Val Phe Asp
195 200 205
Gly Met Glu He Ala Leu Lys Lys Ala Gly He Leu Glu Cys Lys 210 215 220
(2) INFORMATION FOR SEQ ID NO:445:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1780 base pairs
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 195...1709 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445:
TTGTTTTTAA TCTTTCTTAT TTTCATTAAT TGTTACGAAT AGAAATACTT AAGGGGGTTT 60
TTCATTCTTA AAAAAAGGAT TTTTTAAGGA AATTGAATCT TGTTAGTCTT TGTATAACAA 120
ATTATGTGAT AATCACCACA AGTAATCGGC TTAGTGTCAT ATTACGAAGA TTTAAGATCA 180
TAAAAGGAAA AAAG ATG GTT AAT AAA GAT GTG AAA CAA ACC ACT GCT TTT 230 Met Val Asn Lys Asp Val Lys Gin Thr Thr Ala Phe 1 5 10
GGC GCT CCC GTT TGG GAT GAC AAC AAT GTG ATT ACG GCC GGC CCT AGA 278 Gly Ala Pro Val Trp Asp Asp Asn Asn Val He Thr Ala Gly Pro Arg 15 20 25
GGT CCT GTT TTA TTA CAA AGC ACT TGG TTT TTG GAA AAG TTA GCG GCG 326 Gly Pro Val Leu Leu Gin Ser Thr Trp Phe Leu Glu Lys Leu Ala Ala 30 35 40
TTT GAC AGA GAA AGA ATC CCT GAA AGG GTG GTG CAT GCT AAA GGA AGC 374 Phe Asp Arg Glu Arg He Pro Glu Arg Val Val His Ala Lys Gly Ser 45 50 55 60
GGA GCT TAT GGC ACT TTC ACT GTG ACT AAA GAC ATC ACT AAA TAC ACT 422 Gly Ala Tyr Gly Thr Phe Thr Val Thr Lys Asp He Thr Lys Tyr Thr 65 70 75
AAA GCG AAA ATT TTC TCT AAA GTG GGC AAA AAA ACC GAA TGC TTC TTC 470 Lys Ala Lys He Phe Ser Lys Val Gly Lys Lys Thr Glu Cys Phe Phe 80 85 90
AGA TTT TCT ACT GTG GCT GGT GAA AGA GGC AGT GCG GAT GCG GTG AGA 518 Arg Phe Ser Thr Val Ala Gly Glu Arg Gly Ser Ala Asp Ala Val Arg 95 100 105
GAC CCT AGA GGT TTT GCG ATG AAG TAT TAC ACT GAA GAA GGT AAC TGG 566 Asp Pro Arg Gly Phe Ala Met Lys Tyr Tyr Thr Glu Glu Gly Asn Trp 110 115 120
GAT TTA GTG GGG AAC AAC ACG CCT GTT TTC TTT ATC CGT GAT GCG ATC 614 Asp Leu Val Gly Asn Asn Thr Pro Val Phe Phe He Arg Asp Ala He 125 130 135 140
AAA TTC CCT GAT TTC ATC CAC ACT CAA AAA CGA GAT CCT CAA ACC AAT 662 Lys Phe Pro Asp Phe He His Thr Gin Lys Arg Asp Pro Gin Thr Asn 145 150 155
TTG CCT AAC CAT GAC ATG GTA TGG GAT TTT TGG AGT AAT GTT CCT GAA 710 Leu Pro Asn His Asp Met Val Trp Asp Phe Trp Ser Asn Val Pro Glu 160 165 170
AGC TTA TAC CAA GTA ACA TGG GTT ATG AGC GAT AGG GGT ATT CCT AAA 758 Ser Leu Tyr Gin Val Thr Trp Val Met Ser Asp Arg Gly He Pro Lys 175 180 185
TCT TTC CGC CAC ATG GAT GGT TTT GGC AGC CAC ACT TTC AGT CTT ATC 806 Ser Phe Arg His Met Asp Gly Phe Gly Ser His Thr Phe Ser Leu He 190 195 200
AAC GCG AAA GGC GAA CGC TTT TGG GTG AAA TTC CAC TTT CAC ACC ATG 854 Asn Ala Lys Gly Glu Arg Phe Trp Val Lys Phe His Phe His Thr Met 205 210 215 220
CAA GGC GTT AAG CAT TTG ACT AAC GAA GAA GCC GCA GAA GTT AGG AAG 902 Gin Gly Val Lys His Leu Thr Asn Glu Glu Ala Ala Glu Val Arg Lys 225 230 235
TAT GAT CCG GAT TCC AAT CAA AGG GAT TTA TTC AAT GCG ATC GCT AGA 950 Tyr Asp Pro Asp Ser Asn Gin Arg Asp Leu Phe Asn Ala He Ala Arg 240 245 250
GGG GAT TTC CCA AAA TGG AAA TTA AGC ATT CAA GTG ATG CCA GAA GAA 998 Gly Asp Phe Pro Lys Trp Lys Leu Ser He Gin Val Met Pro Glu Glu 255 260 265
GAT GCT AAG AAG TAT CGA TTC CAT CCG TTT GAT GTA ACT AAA ATT TGG 1046 Asp Ala Lys Lys Tyr Arg Phe His Pro Phe Asp Val Thr Lys He Trp 270 275 280
TAT CTC CAA GAT TAT CCA TTG ATG GAA GTG GGC ATT GTG GAG TTG AAT 1094 Tyr Leu Gin Asp Tyr Pro Leu Met Glu Val Gly He Val Glu Leu Asn 285 290 295 300
AAA AAT CCT GAA AAC TAT TTC GCA GAA GTG GAA CAA GCG GCA TTC AGT 1142 Lys Asn Pro Glu Asn Tyr Phe Ala Glu Val Glu Gin Ala Ala Phe Ser 305 310 315
CCG GCT AAT GTC GTT CCT GGA ATT GGC TAT AGC CCT GAT AGG ATG TTA 1190 Pro Ala Asn Val Val Pro Gly He Gly Tyr Ser Pro Asp Arg Met Leu 320 325 330
CAA GGG CGC TTG TTC TCT TAT GGA GAC ACA CAC CGC TAC CGC TTA GGC 1238 Gin Gly Arg Leu Phe Ser Tyr Gly Asp Thr His Arg Tyr Arg Leu Gly 335 340 345
GTT AAT TAT CCT CAA ATA CCG GTT AAT AAA CCA AGA TGC CCA TTC CAC 1286 Val Asn Tyr Pro Gin He Pro Val Asn Lys Pro Arg Cys Pro Phe His 350 355 360 TCT TCT AGC AGA GAT GGT TAC ATG CAA AAC GGA TAC TAC GGC TCT TTA 1334 Ser Ser Ser Arg Asp Gly Tyr Met Gin Asn Gly Tyr Tyr Gly Ser Leu 365 370 375 380
CAA AAC TAT ACG CCT AGC TCA TTG CCT GGC TAT AAA GAA GAT AAG AGC 1382 Gin Asn Tyr Thr Pro Ser Ser Leu Pro Gly Tyr Lys Glu Asp Lys Ser 385 390 395
GCG AGA GAT CCT AAG TTC AAC TTA GCT CAT ATT GAG AAA GAG TTT GAA 1430 Ala Arg Asp Pro Lys Phe Asn Leu Ala His He Glu Lys Glu Phe Glu 400 405 410
GTG TGG AAT TGG GAT TAC AGA GCT GAT GAT AGC GAT TAC TAC ACC CAA 1478 Val Trp Asn Trp Asp Tyr Arg Ala Asp Asp Ser Asp Tyr Tyr Thr Gin 415 420 425
CCA GGT GAT TAC TAC CGC TCA TTG CCA GCT GAT GAA AAA GAA AGG TTG 1526 Pro Gly Asp Tyr Tyr Arg Ser Leu Pro Ala Asp Glu Lys Glu Arg Leu 430 435 440
CAT GAC ACT ATT GGA GAG TCT TTA GCT CAT GTT ACC CAT AAG GAA ATT 1574 His Asp Thr He Gly Glu Ser Leu Ala His Val Thr His Lys Glu He 445 450 455 460
GTG GAT AAA CAA TTG GAG CAT TTC AAG AAA GCT GAC CCC AAA TAC GCT 1622 Val Asp Lys Gin Leu Glu His Phe Lys Lys Ala Asp Pro Lys Tyr Ala 465 470 475
GAG GGA GTT AAA AAA GCT CTT GAA AAA CAC CAA AAA ATG ATG AAA GAC 1670 Glu Gly Val Lys Lys Ala Leu Glu Lys His Gin Lys Met Met Lys Asp 480 485 490
ATG CAT GGA AAA GAC ATG CAC CAC ACA AAA AAG AAA AAG TAACCCTTTT CT 1721 Met His Gly Lys Asp Met His His Thr Lys Lys Lys Lys 495 500 505
TTAAGCGTTC TTATTTTTTA GGAACGCTTT GTCTTTCAAA ATTTAGGTTT TTGGATACT 1780
(2) INFORMATION FOR SEQ ID NO: 446:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 505 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446:
Met Val Asn Lys Asp Val Lys Gin Thr Thr Ala Phe Gly Ala Pro Val
1 5 10 15
Trp Asp Asp Asn Asn Val He Thr Ala Gly Pro Arg Gly Pro Val Leu 20 25 30 Leu Gin Ser Thr Trp Phe Leu Glu Lys Leu Ala Ala Phe Asp Arg Glu
35 40 45
Arg He Pro Glu Arg Val Val His Ala Lys Gly Ser Gly Ala Tyr Gly
50 55 60
Thr Phe Thr Val Thr Lys Asp He Thr Lys Tyr Thr Lys Ala Lys He 65 70 75 80
Phe Ser Lys Val Gly Lys Lys Thr Glu Cys Phe Phe Arg Phe Ser Thr
85 90 95
Val Ala Gly Glu Arg Gly Ser Ala Asp Ala Val Arg Asp Pro Arg Gly
100 105 110
Phe Ala Met Lys Tyr Tyr Thr Glu Glu Gly Asn Trp Asp Leu Val Gly
115 120 125
Asn Asn Thr Pro Val Phe Phe He Arg Asp Ala He Lys Phe Pro Asp
130 135 140
Phe He His Thr Gin Lys Arg Asp Pro Gin Thr Asn Leu Pro Asn His 145 150 155 160
Asp Met Val Trp Asp Phe Trp Ser Asn Val Pro Glu Ser Leu Tyr Gin
165 170 175
Val Thr Trp Val Met Ser Asp Arg Gly He Pro Lys Ser Phe Arg His
180 185 190
Met Asp Gly Phe Gly Ser His Thr Phe Ser Leu He Asn Ala Lys Gly
195 200 205
Glu Arg Phe Trp Val Lys Phe His Phe His Thr Met Gin Gly Val Lys
210 215 220
His Leu Thr Asn Glu Glu Ala Ala Glu Val Arg Lys Tyr Asp Pro Asp 225 230 235 240
Ser Asn Gin Arg Asp Leu Phe Asn Ala He Ala Arg Gly Asp Phe Pro
245 250 255
Lys Trp Lys Leu Ser He Gin Val Met Pro Glu Glu Asp Ala Lys Lys
260 265 270
Tyr Arg Phe His Pro Phe Asp Val Thr Lys He Trp Tyr Leu Gin Asp
275 280 285
Tyr Pro Leu Met Glu Val Gly He Val Glu Leu Asn Lys Asn Pro Glu
290 295 300
Asn Tyr Phe Ala Glu Val Glu Gin Ala Ala Phe Ser Pro Ala Asn Val 305 310 315 320
Val Pro Gly He Gly Tyr Ser Pro Asp Arg Met Leu Gin Gly Arg Leu
325 330 335
Phe Ser Tyr Gly Asp Thr His Arg Tyr Arg Leu Gly Val Asn Tyr Pro
340 345 350
Gin He Pro Val Asn Lys Pro Arg Cys Pro Phe His Ser Ser Ser Arg
355 360 365
Asp Gly Tyr Met Gin Asn Gly Tyr Tyr Gly Ser Leu Gin Asn Tyr Thr
370 375 380
Pro Ser Ser Leu Pro Gly Tyr Lys Glu Asp Lys Ser Ala Arg Asp Pro 385 390 395 400
Lys Phe Asn Leu Ala His He Glu Lys Glu Phe Glu Val Trp Asn Trp
405 410 415
Asp Tyr Arg Ala Asp Asp Ser Asp Tyr Tyr Thr Gin Pro Gly Asp Tyr
420 425 430
Tyr Arg Ser Leu Pro Ala Asp Glu Lys Glu Arg Leu His Asp Thr He
435 440 445
Gly Glu Ser Leu Ala His Val Thr His Lys Glu He Val Asp Lys Gin
450 455 460
Leu Glu His Phe Lys Lys Ala Asp Pro Lys Tyr Ala Glu Gly Val Lys 465 470 475 480
Lys Ala Leu Glu Lys His Gin Lys Met Met Lys Asp Met His Gly Lys
485 490 495
Asp Met His His Thr Lys Lys Lys Lys 500 505
(2) INFORMATION FOR SEQ ID NO: 447:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 727 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...674 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447:
GAAGCGTGCC AAACCCGTTG CCCTTTGGCG TGGAGCTTGC CAAATATTGG ATG CAT 56
Met His
1
AAC GGC TTT GTG AAT ATC AAT AAC GAA AAA ATG TCT AAA AGT TTG GGG 104 Asn Gly Phe Val Asn He Asn Asn Glu Lys Met Ser Lys Ser Leu Gly 5 10 15
AAT AGC TTT TTT GTT AAA GAC GCT CTC AAA AAC TAT GAT GGC GAA ATT 152 Asn Ser Phe Phe Val Lys Asp Ala Leu Lys Asn Tyr Asp Gly Glu He 20 25 30
TTG CGC AAT TAC TTA CTA GGG GTG CAT TAT CGC TCT GTT TTG AAT TTC 200 Leu Arg Asn Tyr Leu Leu Gly Val His Tyr Arg Ser Val Leu Asn Phe 35 40 45 50
AAT GAA GAA GAC TTG TTA GTG AGT AAA AAA CGC TTG GAT AAA ATC TAT 248 Asn Glu Glu Asp Leu Leu Val Ser Lys Lys Arg Leu Asp Lys He Tyr 55 60 65
CGT TTA AAA CAG CGC GTT TTA GGG ACT CTT GGA GGA ATA AAT CCA AAC 296 Arg Leu Lys Gin Arg Val Leu Gly Thr Leu Gly Gly He Asn Pro Asn 70 75 80
TTT AAA AAA GAA ATT TTA GAG TGC ATG CAA GAT GAT TTA AAC GTT TCT 344 Phe Lys Lys Glu He Leu Glu Cys Met Gin Asp Asp Leu Asn Val Ser 85 90 95
AAA GCG TTG AGC GTT TTA GAA AGC ATG CTT TCT TCC ACT AAT GAA AAA 392 Lys Ala Leu Ser Val Leu Glu Ser Met Leu Ser Ser Thr Asn Glu Lys 100 105 110
TTG GAT CAA AAC CCT AAA AAC AAG GCT TTA AAG GGC GAA ATT TTA GCG 440 Leu Asp Gin Asn Pro Lys Asn Lys Ala Leu Lys Gly Glu He Leu Ala 115 120 125 130
AAT TTG AAA TTC ATA GAA GAA CTG CTT GGC ATC GGG TTT AAA GAC CCT 488 Asn Leu Lys Phe He Glu Glu Leu Leu Gly He Gly Phe Lys Asp Pro 135 140 145
AGC GCC TAT TTC CAA TTA GGC GTG AGT GAA AGC GAA AAA CAA GAA ATT 536 Ser Ala Tyr Phe Gin Leu Gly Val Ser Glu Ser Glu Lys Gin Glu He 150 155 160
GAA AAC AAG ATA GAA GAA AGA AAA CGC GCC AAA GAG CGA AAA GAT TTT 584 Glu Asn Lys He Glu Glu Arg Lys Arg Ala Lys Glu Arg Lys Asp Phe 165 170 175
TTA AAA GCC GAT AGC ATC AGA GAA GAG CTT TTG AAA CAA AAA ATC GCT 632 Leu Lys Ala Asp Ser He Arg Glu Glu Leu Leu Lys Gin Lys He Ala 180 185 190
TTG ATG GAC ACC CCA CAA GGC ACG ATC TGG GAG AAG TTT TTT TAAACACCT 683 Leu Met Asp Thr Pro Gin Gly Thr He Trp Glu Lys Phe Phe 195 200 205
CCAATTTTAC CTTTTTACAC ATTCTAGCAA CAACTTTCAG CATT 727
(2) INFORMATION FOR SEQ ID NO: 448:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 208 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448:
Met His Asn Gly Phe Val Asn He Asn Asn Glu Lys Met Ser Lys Ser
1 5 10 15
Leu Gly Asn Ser Phe Phe Val Lys Asp Ala Leu Lys Asn Tyr Asp Gly
20 25 30
Glu He Leu Arg Asn Tyr Leu Leu Gly Val His Tyr Arg Ser Val Leu
35 40 45
Asn Phe Asn Glu Glu Asp Leu Leu Val Ser Lys Lys Arg Leu Asp Lys
50 55 60
He Tyr Arg Leu Lys Gin Arg Val Leu Gly Thr Leu Gly Gly He Asn 65 70 75 80
Pro Asn Phe Lys Lys Glu He Leu Glu Cys Met Gin Asp Asp Leu Asn
85 90 95
Val Ser Lys Ala Leu Ser Val Leu Glu Ser Met Leu Ser Ser Thr Asn 100 105 110
Glu Lys Leu Asp Gin Asn Pro Lys Asn Lys Ala Leu Lys Gly Glu He
115 120 125
Leu Ala Asn Leu Lys Phe He Glu Glu Leu Leu Gly He Gly Phe Lys
130 135 140
Asp Pro Ser Ala Tyr Phe Gin Leu Gly Val Ser Glu Ser Glu Lys Gin 145 150 155 160
Glu He Glu Asn Lys He Glu Glu Arg Lys Arg Ala Lys Glu Arg Lys
165 170 175
Asp Phe Leu Lys Ala Asp Ser He Arg Glu Glu Leu Leu Lys Gin Lys
180 185 190
He Ala Leu Met Asp Thr Pro Gin Gly Thr He Trp Glu Lys Phe Phe 195 200 205
(2) INFORMATION FOR SEQ ID NO: 449:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 410 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 60...329 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: TTTTAAAAGC ATAGAGGATT TTAAGAAGCA TTGTGAAAAC TTATAAATAA GATTTAAAA 59
ATG CTG ACG ATT GAA ACC AGT AAA AAA TTT GAT AAG GAT CTT AAA ATT 107 Met Leu Thr He Glu Thr Ser Lys Lys Phe Asp Lys Asp Leu Lys He 1 5 10 15
CTT GTT AAA AAC GGG TTT GAT TTA AAG CTT TTG TAT AAA GTG GTT GGA 155 Leu Val Lys Asn Gly Phe Asp Leu Lys Leu Leu Tyr Lys Val Val Gly 20 25 30
AAT TTA GCC ACA GAG CAA CCC CTA GCT CCC AAA TAC AAA GAC CAC CCA 203 Asn Leu Ala Thr Glu Gin Pro Leu Ala Pro Lys Tyr Lys Asp His Pro 35 40 45
CTC AAA GGC GGT TTA AAA GAT TTT AGG GAA TGC CAC TTA AAA CCG GAT 251 Leu Lys Gly Gly Leu Lys Asp Phe Arg Glu Cys His Leu Lys Pro Asp 50 55 60
TTA TTG CTT GTC TAT CAA ATT AAA AAA CAA GAA AAC ACC CTC TTT TTA 299 Leu Leu Leu Val Tyr Gin He Lys Lys Gin Glu Asn Thr Leu Phe Leu 65 70 75 80
GTA AGG TTA GGC AGT CAT AGC GAG CTG TTT TGAACCGCCC ACACCCCTTA TAAC 353 Val Arg Leu Gly Ser His Ser Glu Leu Phe 85 90
GCTTAAACCA ACTACCCCCC TTTTTTAGGG ATAAATTTAG GGTTGAAACA CCGCTTA 410
(2) INFORMATION FOR SEQ ID NO: 450:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 90 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450:
Met Leu Thr He Glu Thr Ser Lys Lys Phe Asp Lys Asp Leu Lys He
1 5 10 15
Leu Val Lys Asn Gly Phe Asp Leu Lys Leu Leu Tyr Lys Val Val Gly
20 25 30
Asn Leu Ala Thr Glu Gin Pro Leu Ala Pro Lys Tyr Lys Asp His Pro
35 40 45
Leu Lys Gly Gly Leu Lys Asp Phe Arg Glu Cys His Leu Lys Pro Asp
50 55 60
Leu Leu Leu Val Tyr Gin He Lys Lys Gin Glu Asn Thr Leu Phe Leu 65 70 75 80
Val Arg Leu Gly Ser His Ser Glu Leu Phe 85 90
(2) INFORMATION FOR SEQ ID NO: 451:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 425 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 78...341 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451 AGATGTAGGT AACAAAGAGA CAGATTTGAT TGTTGAGGAT TTTTCTAGTT ACAGCAATGA 60 AAGAAAAAGG GCTTTAG GTG TTG AAG CTC AAT CTT AAA AAA TCT TTT CAA 110
Val Leu Lys Leu Asn Leu Lys Lys Ser Phe Gin 1 5 10
AAA GAT TTT GAT AAA TTG CTT TTG AAT GGG TTT GAT GAT AGC GTT TTG 158 Lys Asp Phe Asp Lys Leu Leu Leu Asn Gly Phe Asp Asp Ser Val Leu 15 20 25
AAT GAA GTC ATT CTA ACC TTA AGA AAA AAA GAA CCG CTA GAT CCA CAA 206 Asn Glu Val He Leu Thr Leu Arg Lys Lys Glu Pro Leu Asp Pro Gin 30 35 40
TTT CAA GAT CAT GCC TTA AAG GGA AAG TGG AAA CCT TTT AGG GAA TGC 254 Phe Gin Asp His Ala Leu Lys Gly Lys Trp Lys Pro Phe Arg Glu Cys 45 50 55
CAC ATT AAG CCT GAT GTT TTG CTT GTG TAT TTA GTG AAA GAT GAT GAA 302 His He Lys Pro Asp Val Leu Leu Val Tyr Leu Val Lys Asp Asp Glu 60 65 70 75
CTG ATT TTG TTA AGG TTA GGC AGT CAT AGC GAG CTG TTT TAATCCACCC AC 353 Leu He Leu Leu Arg Leu Gly Ser His Ser Glu Leu Phe 80 85
ACCCCTTATA ACGCTTAAAC CAAATCGCTT GCGCTATAAT GAACTGATAT TATATTTTAA 413 AAGGAATAAA CA 425
(2) INFORMATION FOR SEQ ID NO: 52:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 88 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:452:
Val Leu Lys Leu Asn Leu Lys Lys Ser Phe Gin Lys Asp Phe Asp Lys
1 5 10 15
Leu Leu Leu Asn Gly Phe Asp Asp Ser Val Leu Asn Glu Val He Leu
20 25 30
Thr Leu Arg Lys Lys Glu Pro Leu Asp Pro Gin Phe Gin Asp His Ala
35 40 45
Leu Lys Gly Lys Trp Lys Pro Phe Arg Glu Cys His He Lys Pro Asp
50 55 60
Val Leu Leu Val Tyr Leu Val Lys Asp Asp Glu Leu He Leu Leu Arg 65 70 75 80
Leu Gly Ser His Ser Glu Leu Phe 85
(2) INFORMATION FOR SEQ ID NO: 453: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 844 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 111...779 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:453:
GTCGTTATTC GCGCTTAATG AGAACAGAGG TTTTTAAAAC TATGGTTTCG TTTAGGTTTA 60 ATTAAATTTC GCTACAATTA AATAAAAACG ATAATTTTAG AGAGATTGGC ATG CAA 116
Met Gin 1
GGT TTA TGG ATT TAT CCA GAG GAT ACA GAA GTT TTA GGG GTT GCT TGT 164 Gly Leu Trp He Tyr Pro Glu Asp Thr Glu Val Leu Gly Val Ala Cys 5 10 15
AAG AGC CTT TTA AAA GCA CTA ACG CCA CGC TAT CAA AAA GTC GCC TTG 212 Lys Ser Leu Leu Lys Ala Leu Thr Pro Arg Tyr Gin Lys Val Ala Leu 20 25 30
TTT TCG CCC ATT AGT GGA GGG TGT GAG AGC TTG GAG GAG TGC GAG AGC 260 Phe Ser Pro He Ser Gly Gly Cys Glu Ser Leu Glu Glu Cys Glu Ser 35 40 45 50
TTG AAC CCT TTA GAA TTT CAT AGT GCG ATA AGC AAA CAA AAG GCT TTA 308 Leu Asn Pro Leu Glu Phe His Ser Ala He Ser Lys Gin Lys Ala Leu 55 60 65
GAG CTT GCG AGC ACC GCT CAA GAA GAG TTA CTA TTT GAA ACG ATT CTC 356 Glu Leu Ala Ser Thr Ala Gin Glu Glu Leu Leu Phe Glu Thr He Leu 70 75 80
AAA CGC TAT GAT GAA TTA CAA TCC ACG CAT GAT TTT GTC ATT AAT TTG 404 Lys Arg Tyr Asp Glu Leu Gin Ser Thr His Asp Phe Val He Asn Leu 85 90 95
GGG TGT GCG CCG AAG TTT TTC TTA AAC GCT CCT TTA GAT TTA AAC ACC 452 Gly Cys Ala Pro Lys Phe Phe Leu Asn Ala Pro Leu Asp Leu Asn Thr 100 105 110
ATT TTA GCC AAG CAT TTA AAC GCT TCT GTT GTG GCT GTC GCG CAA ACG 500 He Leu Ala Lys His Leu Asn Ala Ser Val Val Ala Val Ala Gin Thr 115 120 125 130 AGT TTG GAA TAT TTG AAA GCC ATG CAC TCT CAT ATT CTC AAA AAA GAA 548 Ser Leu Glu Tyr Leu Lys Ala Met His Ser His He Leu Lys Lys Glu 135 140 145
GCC CCT TTC GCT GTA GGG TTA TTT GCG GGC GAA ACG CTT GAA AAA CCA 596 Ala Pro Phe Ala Val Gly Leu Phe Ala Gly Glu Thr Leu Glu Lys Pro 150 155 160
CAT TTT TTA AGC ATG TCT CTT TGC AAG CAA CAA TGC GAA TTA GAA GCG 644 His Phe Leu Ser Met Ser Leu Cys Lys Gin Gin Cys Glu Leu Glu Ala 165 170 175
GAT CTG ATT GAA AGC GTG TTG CAA ATA AAA AGC GAG ATT ATT ACC CCT 692 Asp Leu He Glu Ser Val Leu Gin He Lys Ser Glu He He Thr Pro 180 185 190
TTA GCC TTT CAA AGG GGT TTG GAA AAA AAG GCT AAA AAA CAG ATT AAA 740 Leu Ala Phe Gin Arg Gly Leu Glu Lys Lys Ala Lys Lys Gin He Lys 195 200 205 210
AAA GTG GTT TTA CCA GAG AGC GAA AAG ATG AAA GGA TTT TGAAAGCTGC AC 791 Lys Val Val Leu Pro Glu Ser Glu Lys Met Lys Gly Phe 215 220
ATCGTTTGAA TTTAATGGGC GCGGTAGGAT TGATCTTATT AGGCGATAAA GAA 844
(2) INFORMATION FOR SEQ ID NO:454:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 223 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:454:
Met Gin Gly Leu Trp He Tyr Pro Glu Asp Thr Glu Val Leu Gly Val
1 5 10 15
Ala Cys Lys Ser Leu Leu Lys Ala Leu Thr Pro Arg Tyr Gin Lys Val
20 25 30
Ala Leu Phe Ser Pro He Ser Gly Gly Cys Glu Ser Leu Glu Glu Cys
35 40 45
Glu Ser Leu Asn Pro Leu Glu Phe His Ser Ala He Ser Lys Gin Lys
50 55 60
Ala Leu Glu Leu Ala Ser Thr Ala Gin Glu Glu Leu Leu Phe Glu Thr 65 70 75 80
He Leu Lys Arg Tyr Asp Glu Leu Gin Ser Thr His Asp Phe Val He
85 90 95
Asn Leu Gly Cys Ala Pro Lys Phe Phe Leu Asn Ala Pro Leu Asp Leu
100 105 110
Asn Thr He Leu Ala Lys His Leu Asn Ala Ser Val Val Ala Val Ala 115 120 125 Gin Thr Ser Leu Glu Tyr Leu Lys Ala Met His Ser His He Leu Lys
130 135 140
Lys Glu Ala Pro Phe Ala Val Gly Leu Phe Ala Gly Glu Thr Leu Glu 145 150 155 160
Lys Pro His Phe Leu Ser Met Ser Leu Cys Lys Gin Gin Cys Glu Leu
165 170 175
Glu Ala Asp Leu He Glu Ser Val Leu Gin He Lys Ser Glu He He
180 185 190
Thr Pro Leu Ala Phe Gin Arg Gly Leu Glu Lys Lys Ala Lys Lys Gin
195 200 205
He Lys Lys Val Val Leu Pro Glu Ser Glu Lys Met Lys Gly Phe 210 215 220
(2) INFORMATION FOR SEQ ID NO: 455:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 821 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 79...753 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:455:
TATGCCTGGA GGTTTGGTGT GGATGCGTCA AGATAATTTG CGNCTACAAC CGCAATTTAA 60 AGCCAGAAAT TGGGCAAA ATG TGG AAT TTT AAC ACC GAA TAC AGC AGT CAG 111
Met Trp Asn Phe Asn Thr Glu Tyr Ser Ser Gin 1 5 10
TAT TTT GAT TTT AGA GCC GCC GGT TTT GTC CAA TTG ATT TCT AAT TAC 159 Tyr Phe Asp Phe Arg Ala Ala Gly Phe Val Gin Leu He Ser Asn Tyr 15 20 25
ATC AAT CAA TTT TCT TCA ACG CTT TTT GTA ACC AAC TTG CCC GCA CAA 207 He Asn Gin Phe Ser Ser Thr Leu Phe Val Thr Asn Leu Pro Ala Gin 30 35 40
GAT ATT ATT TAT GTG CCT GGT TAT GAA GTT TCA GGG ACG GCT AAA TAC 255 Asp He He Tyr Val Pro Gly Tyr Glu Val Ser Gly Thr Ala Lys Tyr 45 50 55
AAG GGC TTT TCT TTA GGC TTG AGC GTG GCG CGA TCA TGG CCT TCT TTA 303 Lys Gly Phe Ser Leu Gly Leu Ser Val Ala Arg Ser Trp Pro Ser Leu 60 65 70 75
AAG GGG CGT TTG ATC GCT GAT GTG TAT GAA TTG GCG GCC ACG ACA GGC 351 Lys Gly Arg Leu He Ala Asp Val Tyr Glu Leu Ala Ala Thr Thr Gly 80 85 90
AAT GTG TTT ATT TTG ACG GCA AGT TAT AAA ATC CCA CGC ACT GGT CTT 399 Asn Val Phe He Leu Thr Ala Ser Tyr Lys He Pro Arg Thr Gly Leu 95 100 105
AGC ATC ACT TGG CTT TCA CGC TTC GTT ACG GAT TTG AGT TAT TGC TCT 447 Ser He Thr Trp Leu Ser Arg Phe Val Thr Asp Leu Ser Tyr Cys Ser 110 115 120
TAT AGC CCT TAT CGT AAC GGC CCT ACG GAT ATT GAC AGA CGG CCT AGT 495 Tyr Ser Pro Tyr Arg Asn Gly Pro Thr Asp He Asp Arg Arg Pro Ser 125 130 135
AAT TGC CCT AAA ACG CCC GGG ATT TTT CAT GTT CAT AAA CCC GGT TAT 543 Asn Cys Pro Lys Thr Pro Gly He Phe His Val His Lys Pro Gly Tyr 140 145 150 155
GGG GTG AGC AGT TTT TTT GTA ACC TAC AAA CCC ACC TAT AAG AAG CTT 591 Gly Val Ser Ser Phe Phe Val Thr Tyr Lys Pro Thr Tyr Lys Lys Leu 160 165 170
AAA GGG TTG AGC TTG AAT GCG GTG TTT AAC AAT GTT TTT AAC CAA CAA 639 Lys Gly Leu Ser Leu Asn Ala Val Phe Asn Asn Val Phe Asn Gin Gin 175 180 185
TAT ATT GAT CAA GCA AGC CCG GTG ATG AGC CCT GAT GAA CCC AAT CAA 687 Tyr He Asp Gin Ala Ser Pro Val Met Ser Pro Asp Glu Pro Asn Gin 190 195 200
GAC AAA TAC GCA AGA GGC ATG GCA GAG CCT GGC TTT AAC GCT AGA TTT 735 Asp Lys Tyr Ala Arg Gly Met Ala Glu Pro Gly Phe Asn Ala Arg Phe 205 210 215
GAA ATT TCC TAT AAG TTT TAATAATGGA TCTAAAAATA AGGATTTCAT GGGTAGCG 791 Glu He Ser Tyr Lys Phe 220 225
GATCTAATCA AAAATAAAAC ATTCTTTAGA 821
(2) INFORMATION FOR SEQ ID NO: 456:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 225 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456:
Met Trp Asn Phe Asn Thr Glu Tyr Ser Ser Gin Tyr Phe Asp Phe Arg 1 5 10 15
Ala Ala Gly Phe Val Gin Leu He Ser Asn Tyr He Asn Gin Phe Ser
20 25 30
Ser Thr Leu Phe Val Thr Asn Leu Pro Ala Gin Asp He He Tyr Val
35 40 45
Pro Gly Tyr Glu Val Ser Gly Thr Ala Lys Tyr Lys Gly Phe Ser Leu
50 55 60
Gly Leu Ser Val Ala Arg Ser Trp Pro Ser Leu Lys Gly Arg Leu He 65 70 75 80
Ala Asp Val Tyr Glu Leu Ala Ala Thr Thr Gly Asn Val Phe He Leu
85 90 95
Thr Ala Ser Tyr Lys He Pro Arg Thr Gly Leu Ser He Thr Trp Leu
100 105 110
Ser Arg Phe Val Thr Asp Leu Ser Tyr Cys Ser Tyr Ser Pro Tyr Arg
115 120 125
Asn Gly Pro Thr Asp He Asp Arg Arg Pro Ser Asn Cys Pro Lys Thr
130 135 140
Pro Gly He Phe His Val His Lys Pro Gly Tyr Gly Val Ser Ser Phe 145 150 155 160
Phe Val Thr Tyr Lys Pro Thr Tyr Lys Lys Leu Lys Gly Leu Ser Leu
165 170 175
Asn Ala Val Phe Asn Asn Val Phe Asn Gin Gin Tyr He Asp Gin Ala
180 185 190
Ser Pro Val Met Ser Pro Asp Glu Pro Asn Gin Asp Lys Tyr Ala Arg
195 200 205
Gly Met Ala Glu Pro Gly Phe Asn Ala Arg Phe Glu He Ser Tyr Lys
210 215 220
Phe 225
(2) INFORMATION FOR SEQ ID NO: 457:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1350 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 60...1202 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: AATTTTGAAA ACATTGACTC AGTTTCGCTC TCAAGGGCGT TTAATTCAAG GATCAAAGC 59
ATG AAT TTA AAT TTT ATG CCC CTA TTG CAT GCT TAT AAC CAT GCG AGC 107 Met Asn Leu Asn Phe Met Pro Leu Leu His Ala Tyr Asn His Ala Ser 1 5 10 15
ATT GAT TTT CAT TTC AAT TCT AGT GCT AGG GAT TTT TGC GTG CAT GAA 155 He Asp Phe His Phe Asn Ser Ser Ala Arg Asp Phe Cys Val His Glu 20 25 30
GTG CCT TTG TAT GAA TTT AGT AAC ACG GGC GAA CAT GCC GTT ATT CAA 203 Val Pro Leu Tyr Glu Phe Ser Asn Thr Gly Glu His Ala Val He Gin 35 40 45
GTG AGG AAA AGC GGT TTA AGC ACT TTA GAA ATG CTT CAG ATT TTT TCT 251 Val Arg Lys Ser Gly Leu Ser Thr Leu Glu Met Leu Gin He Phe Ser 50 55 60
CAA ATT TTA GGG GTA AGA ATC GCT GAA TTG GGT TAT GCG GGC TTG AAA 299 Gin He Leu Gly Val Arg He Ala Glu Leu Gly Tyr Ala Gly Leu Lys 65 70 75 80
GAT AAA AAC GCG CTG ACG ACT CAA TTC ATC TCA CTC CCT AAA AAA TAC 347 Asp Lys Asn Ala Leu Thr Thr Gin Phe He Ser Leu Pro Lys Lys Tyr 85 90 95
GCC CCT TTA TTA GAA AAA AAT ACG AGC AAC TTT CAA GAA AAA AAC CTT 395 Ala Pro Leu Leu Glu Lys Asn Thr Ser Asn Phe Gin Glu Lys Asn Leu 100 105 110
AAA ATC CTG TCT TTG AAT TAC CAC CAC AAT AAA ATC AAA TTG GGG CAT 443 Lys He Leu Ser Leu Asn Tyr His His Asn Lys He Lys Leu Gly His 115 120 125
TTG AAA GGG AAT CGC TTT TTT ATG CGT TTT AAA AAA ATG ACC CCT CTA 491 Leu Lys Gly Asn Arg Phe Phe Met Arg Phe Lys Lys Met Thr Pro Leu 130 135 140
AAC GCT CAA AAA ACA AAG CAG GTT TTA GAA CAA ATC GCG CAG TTT GGA 539 Asn Ala Gin Lys Thr Lys Gin Val Leu Glu Gin He Ala Gin Phe Gly 145 150 155 160
ATG CCT AAT TAT TTT GGC TCG CAA CGC TTT GGG AAG TTC AAT GAC AAC 587 Met Pro Asn Tyr Phe Gly Ser Gin Arg Phe Gly Lys Phe Asn Asp Asn 165 170 175
CAC CAA GAG GGT TTA AAA ATC TTA CAA AAT CAA ACG AAA TTC GCC CAT 635 His Gin Glu Gly Leu Lys He Leu Gin Asn Gin Thr Lys Phe Ala His 180 185 190
CAA AAA TTA AAC GCT TTT TTA ATT TCA AGC TAT CAA AGT TAT TTG TTT 683 Gin Lys Leu Asn Ala Phe Leu He Ser Ser Tyr Gin Ser Tyr Leu Phe 195 200 205
AAC GCG CTT TTA AGC AAA CGA TTA GAA ATC AGT AAA ATC ATT AGC GCT 731 Asn Ala Leu Leu Ser Lys Arg Leu Glu He Ser Lys He He Ser Ala 210 215 220
TTT AGT GTC AAA GAA AAT TTA GAA TTT TTT AAA CAA AAA AAT TTA AGC 779 Phe Ser Val Lys Glu Asn Leu Glu Phe Phe Lys Gin Lys Asn Leu Ser 225 230 235 240
GTT GAT TCA GAC ACT CTA AAA ACC CTT AAA AAC CAA GCC CAC CCC TTT 827 Val Asp Ser Asp Thr Leu Lys Thr Leu Lys Asn Gin Ala His Pro Phe 245 250 255
AAA ATC TTA GAA GGC GAT GTG ATG TGC CAT TAC CCT TAT GGG AAG TTT 875 Lys He Leu Glu Gly Asp Val Met Cys His Tyr Pro Tyr Gly Lys Phe 260 265 270
TTT GAC GCT TTA GAA TTA GAA AAA GAG GGC GAA AGG TTT TTG AAA AAA 923 Phe Asp Ala Leu Glu Leu Glu Lys Glu Gly Glu Arg Phe Leu Lys Lys 275 280 285
GAA GTT GCG CCT ACG GGG TTA CTA GAC GGC AAA AAA GCT CTT TAT GCA 971 Glu Val Ala Pro Thr Gly Leu Leu Asp Gly Lys Lys Ala Leu Tyr Ala 290 295 300
AAA AAT TTG AGT TTA GAA ATT GAA AAA GAA TTC CAG CAT AAC CTT TTA 1019 Lys Asn Leu Ser Leu Glu He Glu Lys Glu Phe Gin His Asn Leu Leu 305 310 315 320
AGT AGC CAT GCT AAA ACG CTA GGC TCT AGG CGG TTT TTT TGG GTG TTT 1067 Ser Ser His Ala Lys Thr Leu Gly Ser Arg Arg Phe Phe Trp Val Phe 325 330 335
GTA GAA AAT GTA ACT TCT CAA TAC GTG AAA GAA AAA GCG CAA TTT GAA 1115 Val Glu Asn Val Thr Ser Gin Tyr Val Lys Glu Lys Ala Gin Phe Glu 340 345 350
TTG GGA TTT TAC TTG CCT AAA GGG AGT TAT GCG AGC GCG TTG CTC AAA 1163 Leu Gly Phe Tyr Leu Pro Lys Gly Ser Tyr Ala Ser Ala Leu Leu Lys 355 360 365
GAA ATC AAG CAT GAG AAA GGA GAA AAT AAT GAC GAA TTT TGAAAAGATT ATC 1215 Glu He Lys His Glu Lys Gly Glu Asn Asn Asp Glu Phe 370 375 380
GCGCAAAACA GGATCAAAAC GAACGCGGTT TTAGCGACTT ATTGCGTGAT TTTTGCTTTT 1275 ATCGGGTTGT TGGTGGATGT CATTAGAATT AATGCTAATG ATTTAGGAAT AGCTCTTTTT 1335 AAACTCATGA CTTTT 1350
(2) INFORMATION FOR SEQ ID NO: 458:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 381 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458:
Met Asn Leu Asn Phe Met Pro Leu Leu His Ala Tyr Asn His Ala Ser
1 5 10 15
He Asp Phe His Phe Asn Ser Ser Ala Arg Asp Phe Cys Val His Glu
20 25 30
Val Pro Leu Tyr Glu Phe Ser Asn Thr Gly Glu His Ala Val He Gin
35 40 45
Val Arg Lys Ser Gly Leu Ser Thr Leu Glu Met Leu Gin He Phe Ser
50 55 60
Gin He Leu Gly Val Arg He Ala Glu Leu Gly Tyr Ala Gly Leu Lys 65 70 75 80
Asp Lys Asn Ala Leu Thr Thr Gin Phe He Ser Leu Pro Lys Lys Tyr
85 90 95
Ala Pro Leu Leu Glu Lys Asn Thr Ser Asn Phe Gin Glu Lys Asn Leu
100 105 110
Lys He Leu Ser Leu Asn Tyr His His Asn Lys He Lys Leu Gly His
115 120 125
Leu Lys Gly Asn Arg Phe Phe Met Arg Phe Lys Lys Met Thr Pro Leu
130 135 140
Asn Ala Gin Lys Thr Lys Gin Val Leu Glu Gin He Ala Gin Phe Gly 145 150 155 160
Met Pro Asn Tyr Phe Gly Ser Gin Arg Phe Gly Lys Phe Asn Asp Asn
165 170 175
His Gin Glu Gly Leu Lys He Leu Gin Asn Gin Thr Lys Phe Ala His
180 185 190
Gin Lys Leu Asn Ala Phe Leu He Ser Ser Tyr Gin Ser Tyr Leu Phe
195 200 205
Asn Ala Leu Leu Ser Lys Arg Leu Glu He Ser Lys He He Ser Ala
210 215 220
Phe Ser Val Lys Glu Asn Leu Glu Phe Phe Lys Gin Lys Asn Leu Ser 225 230 235 240
Val Asp Ser Asp Thr Leu Lys Thr Leu Lys Asn Gin Ala His Pro Phe
245 250 255
Lys He Leu Glu Gly Asp Val Met Cys His Tyr Pro Tyr Gly Lys Phe
260 265 270
Phe Asp Ala Leu Glu Leu Glu Lys Glu Gly Glu Arg Phe Leu Lys Lys
275 280 285
Glu Val Ala Pro Thr Gly Leu Leu Asp Gly Lys Lys Ala Leu Tyr Ala
290 295 300
Lys Asn Leu Ser Leu Glu He Glu Lys Glu Phe Gin His Asn Leu Leu 305 310 315 320
Ser Ser His Ala Lys Thr Leu Gly Ser Arg Arg Phe Phe Trp Val Phe
325 330 335
Val Glu Asn Val Thr Ser Gin Tyr Val Lys Glu Lys Ala Gin Phe Glu
340 345 350
Leu Gly Phe Tyr Leu Pro Lys Gly Ser Tyr Ala Ser Ala Leu Leu Lys
355 360 365
Glu He Lys His Glu Lys Gly Glu Asn Asn Asp Glu Phe 370 375 380
(2) INFORMATION FOR SEQ ID NO: 459:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1080 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 76...828 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459:
CTTTTTCAAT AAAATCATCA GCGATGAAAA ACAATGCTTT TTCCACGCTA AACCCTTACA 60 CCAGATCCCT TAAAA ATG AAA CTC CCG GTC GTT GAG AGC TTT TTT TCC TTA 111 Met Lys Leu Pro Val Val Glu Ser Phe Phe Ser Leu 1 5 10
CAA GGT GAA GGA AAA AGG ATA GGC AAG CCC AGT CTT TTT TTG CGC TTA 159 Gin Gly Glu Gly Lys Arg He Gly Lys Pro Ser Leu Phe Leu Arg Leu 15 20 25
GGG GGG TGT AAC CTT TCA TGC AAG GGC TTT AAT TGT AAA ACC TTA TTG 207 Gly Gly Cys Asn Leu Ser Cys Lys Gly Phe Asn Cys Lys Thr Leu Leu 30 35 40
AAT GAT GAA ATC CTA ACA GGT TGC GAC AGC TTG TAT GCG GTG CAT CCC 255 Asn Asp Glu He Leu Thr Gly Cys Asp Ser Leu Tyr Ala Val His Pro 45 50 55 60
AAA TTC AAA ACA TCT TGG GAT TAT TAT AAT GAG CCT AAG CCC TTG ATT 303 Lys Phe Lys Thr Ser Trp Asp Tyr Tyr Asn Glu Pro Lys Pro Leu He 65 70 75
GAA CGA TTA GAG GAT TTA GCC CCT AAT TAT AAG GAT TTT GAT TTC ATT 351 Glu Arg Leu Glu Asp Leu Ala Pro Asn Tyr Lys Asp Phe Asp Phe He 80 85 90
CTT ACA GGC GGG GAG CCA AGC TTG TAT TTC AAT AAC CCT ATT TTA ATC 399 Leu Thr Gly Gly Glu Pro Ser Leu Tyr Phe Asn Asn Pro He Leu He 95 100 105
AGC GTT TTA GAG CAT TTT TAT CGC CAA AAA ATC CCT TTA TGT GTA GAG 447 Ser Val Leu Glu His Phe Tyr Arg Gin Lys He Pro Leu Cys Val Glu 110 115 120
AGT AAT GGT TCT ATT TTT TTT GAA TTT AGC CCT ATT TTA AAA GAA TTG 495 Ser Asn Gly Ser He Phe Phe Glu Phe Ser Pro He Leu Lys Glu Leu 125 130 135 140
CAT TTC ACT CTA AGC GTC AAA CTC TCT TTT TCT TTA GAG GAA GAA AGC 543 His Phe Thr Leu Ser Val Lys Leu Ser Phe Ser Leu Glu Glu Glu Ser 145 150 155
AAG CGG ATC CAT CTT AAA GCC TTA CAA AAT ATC TTA AAT AAC GCT AAA 591 Lys Arg He His Leu Lys Ala Leu Gin Asn He Leu Asn Asn Ala Lys 160 165 170
AGC GCG CAT TTT AAA TTT GTT TTA GAG AGC CAA AAC GCC GCT CAA TCT 639 Ser Ala His Phe Lys Phe Val Leu Glu Ser Gin Asn Ala Ala Gin Ser 175 180 185
ATT ATA GAA ATT CAA AGC CTC TTG AAA CAA CTC TCC TTA AAA AAT AAT 687 He He Glu He Gin Ser Leu Leu Lys Gin Leu Ser Leu Lys Asn Asn 190 195 200
GAA ATC TTT TTA ATG CCT TTA GGC ACA AAT AAC AAC GAG CTA GAC AAA 735 Glu He Phe Leu Met Pro Leu Gly Thr Asn Asn Asn Glu Leu Asp Lys 205 210 215 220
AAT CTA AAA ACC CTA GCC CCC CTA GCC ATA AAG CAT GGT TTC AGG CTA 783 Asn Leu Lys Thr Leu Ala Pro Leu Ala He Lys His Gly Phe Arg Leu 225 230 235
AGC GAT AGG CTT CAT ATC CGC TTG TGG GAT AAT CAA AAA GGG TTT TAAAA 833 Ser Asp Arg Leu His He Arg Leu Trp Asp Asn Gin Lys Gly Phe 240 245 250
AGTTAATCAT GACCATCAAA GTTTTTTCGC CCAAATACCC CACTGAATTA GAAGAATTTT 893
ATGCTGAGCG TATCGCTGAC AACCCTTTAG GGTTTATCCA ACGCTTGGAT CTTTTGCCTA 953
GTATTAGCGG GTTCGTTCAA AAATTGCGCG AGCATGGCGG GGAATTTTTT GAAATGAGAG 1013
AGGGTAACAA GCTCATTGGG ATTTGTGGGC TTAATCCTAT CAATCAAACA GAAGCCGAGC 1073
TGTGCAA 1080
(2) INFORMATION FOR SEQ ID NO: 460:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 251 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460:
Met Lys Leu Pro Val Val Glu Ser Phe Phe Ser Leu Gin Gly Glu Gly
1 5 10 15
Lys Arg He Gly Lys Pro Ser Leu Phe Leu Arg Leu Gly Gly Cys Asn
20 25 30
Leu Ser Cys Lys Gly Phe Asn Cys Lys Thr Leu Leu Asn Asp Glu He
35 40 45
Leu Thr Gly Cys Asp Ser Leu Tyr Ala Val His Pro Lys Phe Lys Thr
50 55 60
Ser Trp Asp Tyr Tyr Asn Glu Pro Lys Pro Leu He Glu Arg Leu Glu 65 70 75 80 Asp Leu Ala Pro Asn Tyr Lys Asp Phe Asp Phe He Leu Thr Gly Gly
85 90 95
Glu Pro Ser Leu Tyr Phe Asn Asn Pro He Leu He Ser Val Leu Glu
100 105 110
His Phe Tyr Arg Gin Lys He Pro Leu Cys Val Glu Ser Asn Gly Ser
115 120 125
He Phe Phe Glu Phe Ser Pro He Leu Lys Glu Leu His Phe Thr Leu
130 135 140
Ser Val Lys Leu Ser Phe Ser Leu Glu Glu Glu Ser Lys Arg He His 145 150 155 160
Leu Lys Ala Leu Gin Asn He Leu Asn Asn Ala Lys Ser Ala His Phe
165 170 175
Lys Phe Val Leu Glu Ser Gin Asn Ala Ala Gin Ser He He Glu He
180 185 190
Gin Ser Leu Leu Lys Gin Leu Ser Leu Lys Asn Asn Glu He Phe Leu
195 200 205
Met Pro Leu Gly Thr Asn Asn Asn Glu Leu Asp Lys Asn Leu Lys Thr
210 215 220
Leu Ala Pro Leu Ala He Lys His Gly Phe Arg Leu Ser Asp Arg Leu 225 230 235 240
His He Arg Leu Trp Asp Asn Gin Lys Gly Phe 245 250
(2) INFORMATION FOR SEQ ID NO -.461:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1710 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 120...1559 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461:
TAAAGAATTT TGTGAATATT GATTGTCTCT TTTAATTGAA ATTTAAAGAT TAGTTTAAAG 60 GATTTTATTC GGTGGGATTG TCAGCATCAA GCCTCATTGT TCCTATTAGC GTTATTTTA 119
ATG GTG GTT TTT ACT AAA AGA GTC GCA CTC TCG TTA TTT GTG GGC ATT 167
Met Val Val Phe Thr Lys Arg Val Ala Leu Ser Leu Phe Val Gly He 1 5 10 15
TTA GTG AGC GCT GTT TTA ATG CAT TCG TTA CAC CTT TCC CAA CTC GTA 215
Leu Val Ser Ala Val Leu Met His Ser Leu His Leu Ser Gin Leu Val 20 25 30 GAA TAT ATT TAT CAT AAA ATC ACT TCC GTT TTT TAC ACT TAC GAG CCA 263 Glu Tyr He Tyr His Lys He Thr Ser Val Phe Tyr Thr Tyr Glu Pro 35 40 45
GAA AAG GGG CTT AAT TTC AAT CTT TCC AAC CTC TAT GTT TTT GGG TTT 311 Glu Lys Gly Leu Asn Phe Asn Leu Ser Asn Leu Tyr Val Phe Gly Phe 50 55 60
TTA ATC TTT TTA GGC GTC TTA AGC CAA GTG ATT TTA AAA TCC GGT AGC 359 Leu He Phe Leu Gly Val Leu Ser Gin Val He Leu Lys Ser Gly Ser 65 70 75 80
GTG CAA AAC TTT GTC AAA AAA GCT AAA AAA TAC TCA AAA AAC GCT AAA 407 Val Gin Asn Phe Val Lys Lys Ala Lys Lys Tyr Ser Lys Asn Ala Lys 85 90 95
ACT CCC GAA TTT ATC GCC TTT TTT TCA GGT ATC ATT ATT TTT GTA GAT 455 Thr Pro Glu Phe He Ala Phe Phe Ser Gly He He He Phe Val Asp 100 105 110
GAT TAT TTT AAC GCC CTA ACC GTG GGG CAA ATC TCA AAG TCT TTA AAC 503 Asp Tyr Phe Asn Ala Leu Thr Val Gly Gin He Ser Lys Ser Leu Asn 115 120 125
GAC GCT CAT AAC TCC ACA CGA GAG CGC TTG GCT TAT ATT ATA GAC TCC 551 Asp Ala His Asn Ser Thr Arg Glu Arg Leu Ala Tyr He He Asp Ser 130 135 140
ACT TCA GCG CCG GTG TGC TTG CTA GTC CCC ATT TCT AGT TGG GGG GCG 599 Thr Ser Ala Pro Val Cys Leu Leu Val Pro He Ser Ser Trp Gly Ala 145 150 155 160
TAT ATT ATG GGG ATC ATG AAT AAC GAC AGC TCG CCC TTA TTA AAA GAT 647 Tyr He Met Gly He Met Asn Asn Asp Ser Ser Pro Leu Leu Lys Asp 165 170 175
AGT TTT TCG GTG CTT GTG CAA AGC TTA AGC AGT AAT TAT TAT GCC ATT 695 Ser Phe Ser Val Leu Val Gin Ser Leu Ser Ser Asn Tyr Tyr Ala He 180 185 190
TTT GCA CTC ATT GCA GTC TTT CTC ACC ATT TTA TGG CAA ATC AAC CTC 743 Phe Ala Leu He Ala Val Phe Leu Thr He Leu Trp Gin He Asn Leu 195 200 205
CCT AGC ATG AGA AAG TAT CAA AAC ATA GGC GTG AAG GAT TTT TAT AGC 791 Pro Ser Met Arg Lys Tyr Gin Asn He Gly Val Lys Asp Phe Tyr Ser 210 215 220
GAA CAA GAA GAA AGC TCT TCA AAA CTA GCC CCC TTG AGT TTG TTA CCC 839 Glu Gin Glu Glu Ser Ser Ser Lys Leu Ala Pro Leu Ser Leu Leu Pro 225 230 235 240
CTT TCT ATT TTA TTG TTG ATT GTG TCC ATT TCA TCA TTG CTT TTT TAT 887 Leu Ser He Leu Leu Leu He Val Ser He Ser Ser Leu Leu Phe Tyr 245 250 255 ACA GGA GTG ATT TTA AAA AAC ACT GAT GCG AGT TTT TCG CTC TTT TAT 935 Thr Gly Val He Leu Lys Asn Thr Asp Ala Ser Phe Ser Leu Phe Tyr 260 265 270
GGA GGG TTG TTT TCG CTC ATC GTT ACT TAT CTT TTA GCT TAT AAG TTT 983 Gly Gly Leu Phe Ser Leu He Val Thr Tyr Leu Leu Ala Tyr Lys Phe 275 280 285
TTA GAA AAA GGG AGC TTT TTT AAA CTC ATG TTG GAT GGC TTT AAG AGT 1031 Leu Glu Lys Gly Ser Phe Phe Lys Leu Met Leu Asp Gly Phe Lys Ser 290 295 300
GTG GGG CCG GCG ATA CTA GTC TTA ACG CTC GCT TGG GCT ATC GGG CCT 1079 Val Gly Pro Ala He Leu Val Leu Thr Leu Ala Trp Ala He Gly Pro 305 310 315 320
GTG ATT AGA GAT GAC GCT CAA ACA GGG CTT TAC TTG GCT AAC ATC AGC 1127 Val He Arg Asp Asp Ala Gin Thr Gly Leu Tyr Leu Ala Asn He Ser 325 330 335
AAG GGG TTT TTA AAT AAT GGA GGA GGC GTG TAT ATG CCT TTA ATC TTT 1175 Lys Gly Phe Leu Asn Asn Gly Gly Gly Val Tyr Met Pro Leu He Phe 340 345 350
TTT TTA ATC TCT GGG TTT ATC GCT TTT TCT ACC GGC ACA AGC TGG GGA 1223 Phe Leu He Ser Gly Phe He Ala Phe Ser Thr Gly Thr Ser Trp Gly 355 360 365
GCG TTT GCG ATC ATG CTT CCC ATT GGA GCG GGC ATG GCT AGT GAA AGC 1271 Ala Phe Ala He Met Leu Pro He Gly Ala Gly Met Ala Ser Glu Ser 370 375 380
GAT ATT ATT TTG ATT GTT TCA GCG ATT CTC TCA GGC GCG GTT TAT GGC 1319 Asp He He Leu He Val Ser Ala He Leu Ser Gly Ala Val Tyr Gly 385 390 395 400
GAT CAC ACA AGC CCT ATT TCT GAC ACG ACT ATA CTA TCG GCT ACG GGG 1367 Asp His Thr Ser Pro He Ser Asp Thr Thr He Leu Ser Ala Thr Gly 405 410 415
GCA GGG TGT TCG GTG CAA AGC CAT TTT ATC ACG CAA CTC CCT TAT GCG 1415 Ala Gly Cys Ser Val Gin Ser His Phe He Thr Gin Leu Pro Tyr Ala 420 425 430
ACC ATT GCG ATG CTT TGC AGC GCG GTG AGT TTG GGG GTG GCA AGC TTT 1463 Thr He Ala Met Leu Cys Ser Ala Val Ser Leu Gly Val Ala Ser Phe 435 440 445
ATG TAT TCG CGT TCG CTC GCT CTT TTA ATC GGT GTG GCT TTG CTT GTG 1511 Met Tyr Ser Arg Ser Leu Ala Leu Leu He Gly Val Ala Leu Leu Val 450 455 460
GGG GTG TTT TAT CTT TTA AAA AAA TTT TAT GGT GAA AAT CTA AAA ACT TG 1561 Gly Val Phe Tyr Leu Leu Lys Lys Phe Tyr Gly Glu Asn Leu Lys Thr 465 470 475 480 AATATTGATT GAAGAAGCTT AAAAATCCCA TTTTTTAAAA TTAAAATAAG GTTTTATCGA 1621 TCCCTATTTG ACTCAAAAAG AGTCTTATTC CATTATCAAT CAATTAAAAA AGGTTATTCA 1681 AAAATAACCA TACAATTATA AAAATCTTC 1710
(2) INFORMATION FOR SEQ ID NO:462:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 480 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:462:
Met Val Val Phe Thr Lys Arg Val Ala Leu Ser Leu Phe Val Gly He
1 5 10 15
Leu Val Ser Ala Val Leu Met His Ser Leu His Leu Ser Gin Leu Val
20 25 30
Glu Tyr He Tyr His Lys He Thr Ser Val Phe Tyr Thr Tyr Glu Pro
35 40 45
Glu Lys Gly Leu Asn Phe Asn Leu Ser Asn Leu Tyr Val Phe Gly Phe
50 55 60
Leu He Phe Leu Gly Val Leu Ser Gin Val He Leu Lys Ser Gly Ser 65 70 75 80
Val Gin Asn Phe Val Lys Lys Ala Lys Lys Tyr Ser Lys Asn Ala Lys
85 90 95
Thr Pro Glu Phe He Ala Phe Phe Ser Gly He He He Phe Val Asp
100 105 110
Asp Tyr Phe Asn Ala Leu Thr Val Gly Gin He Ser Lys Ser Leu Asn
115 120 125
Asp Ala His Asn Ser Thr Arg Glu Arg Leu Ala Tyr He He Asp Ser
130 135 140
Thr Ser Ala Pro Val Cys Leu Leu Val Pro He Ser Ser Trp Gly Ala 145 150 155 160
Tyr He Met Gly He Met Asn Asn Asp Ser Ser Pro Leu Leu Lys Asp
165 170 175
Ser Phe Ser Val Leu Val Gin Ser Leu Ser Ser Asn Tyr Tyr Ala He
180 185 190
Phe Ala Leu He Ala Val Phe Leu Thr He Leu Trp Gin He Asn Leu
195 200 205
Pro Ser Met Arg Lys Tyr Gin Asn He Gly Val Lys Asp Phe Tyr Ser
210 215 220
Glu Gin Glu Glu Ser Ser Ser Lys Leu Ala Pro Leu Ser Leu Leu Pro 225 230 235 240
Leu Ser He Leu Leu Leu He Val Ser He Ser Ser Leu Leu Phe Tyr
245 250 255
Thr Gly Val He Leu Lys Asn Thr Asp Ala Ser Phe Ser Leu Phe Tyr
260 265 270
Gly Gly Leu Phe Ser Leu He Val Thr Tyr Leu Leu Ala Tyr Lys Phe
275 280 285
Leu Glu Lys Gly Ser Phe Phe Lys Leu Met Leu Asp Gly Phe Lys Ser 290 295 300 Val Gly Pro Ala He Leu Val Leu Thr Leu Ala Trp Ala He Gly Pro 305 310 315 320
Val He Arg Asp Asp Ala Gin Thr Gly Leu Tyr Leu Ala Asn He Ser
325 330 335
Lys Gly Phe Leu Asn Asn Gly Gly Gly Val Tyr Met Pro Leu He Phe
340 345 350
Phe Leu He Ser Gly Phe He Ala Phe Ser Thr Gly Thr Ser Trp Gly
355 360 365
Ala Phe Ala He Met Leu Pro He Gly Ala Gly Met Ala Ser Glu Ser
370 375 380
Asp He He Leu He Val Ser Ala He Leu Ser Gly Ala Val Tyr Gly 385 390 395 400
Asp His Thr Ser Pro He Ser Asp Thr Thr He Leu Ser Ala Thr Gly
405 410 415
Ala Gly Cys Ser Val Gin Ser His Phe He Thr Gin Leu Pro Tyr Ala
420 425 430
Thr He Ala Met Leu Cys Ser Ala Val Ser Leu Gly Val Ala Ser Phe
435 440 445
Met Tyr Ser Arg Ser Leu Ala Leu Leu He Gly Val Ala Leu Leu Val
450 455 460
Gly Val Phe Tyr Leu Leu Lys Lys Phe Tyr Gly Glu Asn Leu Lys Thr 465 470 475 480
(2) INFORMATION FOR SEQ ID NO:463:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 629 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 82...525 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:463:
CAAATCTCTA AAGAGTAACG CTTTTTAAAA AAATACATTT TTTTTAATTT TTTAATCAAT 60 CATTAAGGTG TTTTAAGTTA A ATT TCC TTA TCT GTT AAA CAT ACG GAT AAT 111
He Ser Leu Ser Val Lys His Thr Asp Asn 1 5 10
GTT ATA TCT TTA AGG AAA GAA AAT GGG GTT AGG ACA CTA ATA AGT TTA 159 Val He Ser Leu Arg Lys Glu Asn Gly Val Arg Thr Leu He Ser Leu 15 20 25
GGG ATT TTG TTA AGC GTT TTG AGT GGC GAT GAT CTG AAG TTG TAT TCA 207 Gly He Leu Leu Ser Val Leu Ser Gly Asp Asp Leu Lys Leu Tyr Ser 30 35 40 AAA CTT TCA GTC TAT TCG GCT GGA AGT GGG ATG ATT GGG ATT GAT ATT 255 Lys Leu Ser Val Tyr Ser Ala Gly Ser Gly Met He Gly He Asp He 45 50 55
GAC AAA CGG ACA TTT TAT AAG CGA GCG TTC GCT TTC ACG ATG AAA TCG 303 Asp Lys Arg Thr Phe Tyr Lys Arg Ala Phe Ala Phe Thr Met Lys Ser 60 65 70
TTG TTC GGT GAA AAC TTG CTT TTG TTT GTC AAA TTA AAG CAT TCT GCG 351 Leu Phe Gly Glu Asn Leu Leu Leu Phe Val Lys Leu Lys His Ser Ala 75 80 85 90
TTG ACG AGC AAA CAC ATG AAA GGG CCT TTA GAA AAC CGC CAT CAC CAT 399 Leu Thr Ser Lys His Met Lys Gly Pro Leu Glu Asn Arg His His His 95 100 105
TCT TTC ACT AAA AAT TAT GAA AAA GCG GTT AAT GGT TGT CAA AAG TAT 447 Ser Phe Thr Lys Asn Tyr Glu Lys Ala Val Asn Gly Cys Gin Lys Tyr 110 115 120
TTC CAT ATT AAA TTG CCT GAA GGC GCT CCT AGC AAC TTC AAA TCA GGT 495 Phe His He Lys Leu Pro Glu Gly Ala Pro Ser Asn Phe Lys Ser Gly 125 130 135
TCA TAC ATG GCC ACT ATG GTG GTG CGT TTT TAAAGCGTTA TTTGGGGTAT TCT 548 Ser Tyr Met Ala Thr Met Val Val Arg Phe 140 145
TTAATACCCT TATCGTCTTT TAAAATACCA TCTTTTAAAA GCACAAATTT ATTTTTTAGC 608 CCTTTTTTAA ATCTTCTTAA A 629
(2) INFORMATION FOR SEQ ID NO: 464:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 148 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464:
He Ser Leu Ser Val Lys His Thr Asp Asn Val He Ser Leu Arg Lys
1 5 10 15
Glu Asn Gly Val Arg Thr Leu He Ser Leu Gly He Leu Leu Ser Val
20 25 30
Leu Ser Gly Asp Asp Leu Lys Leu Tyr Ser Lys Leu Ser Val Tyr Ser
35 40 45
Ala Gly Ser Gly Met He Gly He Asp He Asp Lys Arg Thr Phe Tyr
50 55 60
Lys Arg Ala Phe Ala Phe Thr Met Lys Ser Leu Phe Gly Glu Asn Leu 65 70 75 80
Leu Leu Phe Val Lys Leu Lys His Ser Ala Leu Thr Ser Lys His Met 85 90 95
Lys Gly Pro Leu Glu Asn Arg His His His Ser Phe Thr Lys Asn Tyr
100 105 110
Glu Lys Ala Val Asn Gly Cys Gin Lys Tyr Phe His He Lys Leu Pro
115 120 125
Glu Gly Ala Pro Ser Asn Phe Lys Ser Gly Ser Tyr Met Ala Thr Met
130 135 140
Val Val Arg Phe 145
(2) INFORMATION FOR SEQ ID NO: 465:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 626 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 98...547 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:465:
ATGATTGTGC ACAGGAAGGA TTTGAAGAAG ACTTTGAGCG ATCTCATCGC TATGATGACG 60 CATAAGACTT CAAAGATTTT TTAAAGTTTT AACATTG ATG CGT TGC GTG GTG TAT 115
Met Arg Cys Val Val Tyr 1 5
TCT ATC GCT AAA AGT TCG CCT TTA GAG TTA GTG AAA ATC TAT CAA AAG 163 Ser He Ala Lys Ser Ser Pro Leu Glu Leu Val Lys He Tyr Gin Lys 10 15 20
CAA TGC AGG CAA TTT GAT TGC GAG CTG GAA TTG GTG GAT TTA TTC CCT 211 Gin Cys Arg Gin Phe Asp Cys Glu Leu Glu Leu Val Asp Leu Phe Pro 25 30 35
AAA AAT ACC GCC AAC GCT CAA AAA GTT TCT AAA AAA CTG GCT CAA AAA 259 Lys Asn Thr Ala Asn Ala Gin Lys Val Ser Lys Lys Leu Ala Gin Lys 40 45 50
AGC TAC TCT CTA GCT TTT GAG CCG TAT TTA AAC CCT AAG GCA AAA AAT 307 Ser Tyr Ser Leu Ala Phe Glu Pro Tyr Leu Asn Pro Lys Ala Lys Asn 55 60 65 70
ATC GCC TTA CAC CCT AAA GCT CAA AGG GGC GAT AGC TTT GCG TTT AGT 355 He Ala Leu His Pro Lys Ala Gin Arg Gly Asp Ser Phe Ala Phe Ser 75 80 85 AAA ATG TTA GAA AAT CAT CTT AAT ATT AAT TTT TTT ATC GCT GGA GCG 403 Lys Met Leu Glu Asn His Leu Asn He Asn Phe Phe He Ala Gly Ala 90 95 100
TAT GGG TTT GAA GAA AAT TTT TTA AAG GAT TGT CAA GCT TGG AGT TTG 451 Tyr Gly Phe Glu Glu Asn Phe Leu Lys Asp Cys Gin Ala Trp Ser Leu 105 110 115
AGC GAG ATG ACT TTT AGC CAT GAA GTG GCT AAA ATT GTC TTA TGC GAG 499 Ser Glu Met Thr Phe Ser His Glu Val Ala Lys He Val Leu Cys Glu 120 125 130
CAA ATC TAT AGG GCT TTA AGC ATT ATT TTT AAG CAT CCA TAC CAT AAA T 548 Gin He Tyr Arg Ala Leu Ser He He Phe Lys His Pro Tyr His Lys 135 140 145 150
AGGAGGTGCG CATGCGTTTT TACATTATCT TTACATTTTT GTTTATTGTG GGTTTTGGTG 608 TGTTTGTTTA TAGTATTG 626
(2) INFORMATION FOR SEQ ID NO: 466:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 150 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466:
Met Arg Cys Val Val Tyr Ser He Ala Lys Ser Ser Pro Leu Glu Leu
1 5 10 15
Val Lys He Tyr Gin Lys Gin Cys Arg Gin Phe Asp Cys Glu Leu Glu
20 25 30
Leu Val Asp Leu Phe Pro Lys Asn Thr Ala Asn Ala Gin Lys Val Ser
35 40 45
Lys Lys Leu Ala Gin Lys Ser Tyr Ser Leu Ala Phe Glu Pro Tyr Leu
50 55 60
Asn Pro Lys Ala Lys Asn He Ala Leu His Pro Lys Ala Gin Arg Gly 65 70 75 80
Asp Ser Phe Ala Phe Ser Lys Met Leu Glu Asn His Leu Asn He Asn
85 90 95
Phe Phe He Ala Gly Ala Tyr Gly Phe Glu Glu Asn Phe Leu Lys Asp
100 105 110
Cys Gin Ala Trp Ser Leu Ser Glu Met Thr Phe Ser His Glu Val Ala
115 120 125
Lys He Val Leu Cys Glu Gin He Tyr Arg Ala Leu Ser He He Phe
130 135 140
Lys His Pro Tyr His Lys 145 150
(2) INFORMATION FOR SEQ ID NO: 467: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1053 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 110...976 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467:
CTTTGTGTTT GTAAAAGAAA CCCTTGCATC ACTACTCTTA AGCTAATTTT GTTTATTATA 60 AGCAAAACTT GGATACAATC CTAACAAAAC TGCAAAATTA AGGAAAAAC ATG GGA TTT 118
Met Gly Phe
1
GCA GAT TTC TTT AAA AAT TTT AAG ATC AAT AAA TTG CGG ACA GCG CCA 166 Ala Asp Phe Phe Lys Asn Phe Lys He Asn Lys Leu Arg Thr Ala Pro 5 10 15
AGT AAG GAA GAA CAG CCA AGC CAT TGG GTG AAA TGC CCT AAA TGT TAT 214 Ser Lys Glu Glu Gin Pro Ser His Trp Val Lys Cys Pro Lys Cys Tyr 20 25 30 35
GCG TTA ATG TAT CAT AAA GAA GTG TTT AGT AAA TAC AGC GTG TGT TTG 262 Ala Leu Met Tyr His Lys Glu Val Phe Ser Lys Tyr Ser Val Cys Leu 40 45 50
AAA TGC CAT TAC CAT TTC CGC ATG AAA GCG GCT GAA AGG ATT GAA TTT 310 Lys Cys His Tyr His Phe Arg Met Lys Ala Ala Glu Arg He Glu Phe 55 60 65
TTA TGC GAT GTG GGG AGT TTT GAA GAG TTT GAC AAG CAT TTA CGG CCT 358 Leu Cys Asp Val Gly Ser Phe Glu Glu Phe Asp Lys His Leu Arg Pro 70 75 80
AAT GAT CCT TTA AAT TTC GTG GAT AAA GAG AGC TAT AAA CAA CGC ATT 406 Asn Asp Pro Leu Asn Phe Val Asp Lys Glu Ser Tyr Lys Gin Arg He 85 90 95
AAA AAA TAC GAA AAA AGG ACT AAC CGC CCA AGC TCA GTG ATC AGC GGT 454 Lys Lys Tyr Glu Lys Arg Thr Asn Arg Pro Ser Ser Val He Ser Gly 100 105 110 115
GAG GCT AAA ATC AAC CGC ATG CCT TTG CAG ATC GTG GTG TTT GAT TTT 502 Glu Ala Lys He Asn Arg Met Pro Leu Gin He Val Val Phe Asp Phe 120 125 130 AGC TTT ATG GGG GGG AGT TTA GGC TCT GTG GAG GGC GAA AAG ATC GTA 550 Ser Phe Met Gly Gly Ser Leu Gly Ser Val Glu Gly Glu Lys He Val 135 140 145
AGA GCA ATC AAT CGC GCG GTC GCT AAA AGA GAA GCG TTA TTG ATT GTT 598 Arg Ala He Asn Arg Ala Val Ala Lys Arg Glu Ala Leu Leu He Val 150 155 160
TCA GCG AGT GGG GGG GCT AGG ATG CAA GAA TCC ACT TAT TCG CTC ATG 646 Ser Ala Ser Gly Gly Ala Arg Met Gin Glu Ser Thr Tyr Ser Leu Met 165 170 175
CAA ATG GCT AAA ACG AGC GCG GCT TTG AAC CGA TTG AGT GAG GCC AAA 694 Gin Met Ala Lys Thr Ser Ala Ala Leu Asn Arg Leu Ser Glu Ala Lys 180 185 190 195
CTC CCT TTC ATT TCG CTC TTA AGC GAT CCC ACT TAT GGG GGC GTT AGC 742 Leu Pro Phe He Ser Leu Leu Ser Asp Pro Thr Tyr Gly Gly Val Ser 200 205 210
GCA TCT TTT GCT TTT TTA GGG GAT CTC ATT ATC GCA GAG CCA GGG GCG 790 Ala Ser Phe Ala Phe Leu Gly Asp Leu He He Ala Glu Pro Gly Ala 215 220 225
ATG ATA GGC TTT GCG GGG CCT AGG GTG ATT AAG CAA ACT ATA GGG GCG 838 Met He Gly Phe Ala Gly Pro Arg Val He Lys Gin Thr He Gly Ala 230 235 240
GAT TTG CCT GAG GGC TTT CAA ACA GCG GAA TTT TTA TTA GAG CAT GGC 886 Asp Leu Pro Glu Gly Phe Gin Thr Ala Glu Phe Leu Leu Glu His Gly 245 250 255
TTG ATT GAT ATG ATT GTG CAC AGG AAG GAT TTG AAG AAG ACT TTG AGC 934 Leu He Asp Met He Val His Arg Lys Asp Leu Lys Lys Thr Leu Ser 260 265 270 275
GAT CTC ATC GCT ATG ATG ACG CAT AAG ACT TCA AAG ATT TTT TAAAGTTTT 985 Asp Leu He Ala Met Met Thr His Lys Thr Ser Lys He Phe 280 285
AACATTGATG CGTTGCGTGG TGTATTCTAT CGCTAAAAGT TCGCCTTTAG AGTTAGTGAA 1045 AATCTATC 1053
(2) INFORMATION FOR SEQ ID NO: 468:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 289 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:468: Met Gly Phe Ala Asp Phe Phe Lys Asn Phe Lys He Asn Lys Leu Arg
1 5 10 15
Thr Ala Pro Ser Lys Glu Glu Gin Pro Ser His Trp Val Lys Cys Pro
20 25 30
Lys Cys Tyr Ala Leu Met Tyr His Lys Glu Val Phe Ser Lys Tyr Ser
35 40 45
Val Cys Leu Lys Cys His Tyr His Phe Arg Met Lys Ala Ala Glu Arg
50 55 60
He Glu Phe Leu Cys Asp Val Gly Ser Phe Glu Glu Phe Asp Lys His 65 70 75 80
Leu Arg Pro Asn Asp Pro Leu Asn Phe Val Asp Lys Glu Ser Tyr Lys
85 90 95
Gin Arg He Lys Lys Tyr Glu Lys Arg Thr Asn Arg Pro Ser Ser Val
100 105 110
He Ser Gly Glu Ala Lys He Asn Arg Met Pro Leu Gin He Val Val
115 120 125
Phe Asp Phe Ser Phe Met Gly Gly Ser Leu Gly Ser Val Glu Gly Glu
130 135 140
Lys He Val Arg Ala He Asn Arg Ala Val Ala Lys Arg Glu Ala Leu 145 150 155 160
Leu He Val Ser Ala Ser Gly Gly Ala Arg Met Gin Glu Ser Thr Tyr
165 170 175
Ser Leu Met Gin Met Ala Lys Thr Ser Ala Ala Leu Asn Arg Leu Ser
180 185 190
Glu Ala Lys Leu Pro Phe He Ser Leu Leu Ser Asp Pro Thr Tyr Gly
195 200 205
Gly Val Ser Ala Ser Phe Ala Phe Leu Gly Asp Leu He He Ala Glu
210 215 220
Pro Gly Ala Met He Gly Phe Ala Gly Pro Arg Val He Lys Gin Thr 225 230 235 240
He Gly Ala Asp Leu Pro Glu Gly Phe Gin Thr Ala Glu Phe Leu Leu
245 250 255
Glu His Gly Leu He Asp Met He Val His Arg Lys Asp Leu Lys Lys
260 265 270
Thr Leu Ser Asp Leu He Ala Met Met Thr His Lys Thr Ser Lys He
275 280 285
Phe
(2) INFORMATION FOR SEQ ID NO: 469:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 810 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 95...706 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:
TCTGCAAATC CCATGTTTTT CCTTAATTTT GCAGTTTTGT TAGGATTGTA TCCAAGTTTT 60 GCTTATAATA AACAAAATTA GCTTAAGAGT AGTG ATG CAA GGG TTT CTT TTA CAA 115
Met Gin Gly Phe Leu Leu Gin 1 5
ACA CAA AGC ATA AGA GAT GAA GAT TTG ATC GTG CGC GTT TTA ACC AAA 163 Thr Gin Ser He Arg Asp Glu Asp Leu He Val Arg Val Leu Thr Lys 10 15 20
AAC CAG CTC AAA ACC CTC TAT CGT TTC TAT GGC AAA CGC CAT AGC GTG 211 Asn Gin Leu Lys Thr Leu Tyr Arg Phe Tyr Gly Lys Arg His Ser Val 25 30 35
CTG AAT GTG GGG CGT AAA ATT GAT TTT GAA GAA GAA AAC GAT GAT AAG 259 Leu Asn Val Gly Arg Lys He Asp Phe Glu Glu Glu Asn Asp Asp Lys 40 45 50 55
TTT TTA CCC AAG TTA AGG AAT ATT TTG CAT TTA GGC TAT ATT TGG GAA 307 Phe Leu Pro Lys Leu Arg Asn He Leu His Leu Gly Tyr He Trp Glu 60 65 70
AGA GAA ATG GAG CGC TTG TTT TTT TGG CAA CGC TTT TGC GCT CTC TTG 355 Arg Glu Met Glu Arg Leu Phe Phe Trp Gin Arg Phe Cys Ala Leu Leu 75 80 85
TTT AGG CAT TTA GAA GGC GTG CAT TCT TTA GAT AGC GTC TAT TTT GAC 403 Phe Arg His Leu Glu Gly Val His Ser Leu Asp Ser Val Tyr Phe Asp 90 95 100
ACT TTA GAT GAT GGG GCT AAC AAA CTC GCC AAA CAG CAC CCC TTA AGA 451 Thr Leu Asp Asp Gly Ala Asn Lys Leu Ala Lys Gin His Pro Leu Arg 105 110 115
GTG ATT TTA GAA ATG TAT GCA ACG CTT TTG AAT TTT GAA GGG CGC TTG 499 Val He Leu Glu Met Tyr Ala Thr Leu Leu Asn Phe Glu Gly Arg Leu 120 125 130 135
CAA AGT TAC AAT TCT TGT TTT TTA TGC GAT GCA AAA TTA GAG CGT TCT 547 Gin Ser Tyr Asn Ser Cys Phe Leu Cys Asp Ala Lys Leu Glu Arg Ser 140 145 150
GTC GCT TTA GCG CAA GGG TTT ATT CTA GCG CAC CCC TCT TGT TTG AAA 595 Val Ala Leu Ala Gin Gly Phe He Leu Ala His Pro Ser Cys Leu Lys 155 160 165
GCT AAA AGC CTA AAT TTA GAA AAA ATC CAA GCT TTT TTT CGC ACT CAA 643 Ala Lys Ser Leu Asn Leu Glu Lys He Gin Ala Phe Phe Arg Thr Gin 170 175 180
AGC ACG ATT GAT TTA GAA ACA GAA GAA GTA GAA GAA TTA TGG CGC ACG 691 Ser Thr He Asp Leu Glu Thr Glu Glu Val Glu Glu Leu Trp Arg Thr 185 190 195
CTG AAT TTA GGG TTT TGAAAGGTTA AAAATGAAAT TTAAATTTTT GAATATGGAT A 747
Leu Asn Leu Gly Phe
200
ATGAAAGCGG TTTTATTTTG ATTGAAAAAG AATTGAAACG ATTAAACATT CTCGCTCAAG 807 TCA 810
(2) INFORMATION FOR SEQ ID NO: 470:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 204 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470:
Met Gin Gly Phe Leu Leu Gin Thr Gin Ser He Arg Asp Glu Asp Leu
1 5 10 15
He Val Arg Val Leu Thr Lys Asn Gin Leu Lys Thr Leu Tyr Arg Phe
20 25 30
Tyr Gly Lys Arg His Ser Val Leu Asn Val Gly Arg Lys He Asp Phe
35 40 45
Glu Glu Glu Asn Asp Asp Lys Phe Leu Pro Lys Leu Arg Asn He Leu
50 55 60
His Leu Gly Tyr He Trp Glu Arg Glu Met Glu Arg Leu Phe Phe Trp 65 70 75 80
Gin Arg Phe Cys Ala Leu Leu Phe Arg His Leu Glu Gly Val His Ser
85 90 95
Leu Asp Ser Val Tyr Phe Asp Thr Leu Asp Asp Gly Ala Asn Lys Leu
100 105 110
Ala Lys Gin His Pro Leu Arg Val He Leu Glu Met Tyr Ala Thr Leu
115 120 125
Leu Asn Phe Glu Gly Arg Leu Gin Ser Tyr Asn Ser Cys Phe Leu Cys
130 135 140
Asp Ala Lys Leu Glu Arg Ser Val Ala Leu Ala Gin Gly Phe He Leu 145 150 155 160
Ala His Pro Ser Cys Leu Lys Ala Lys Ser Leu Asn Leu Glu Lys He
165 170 175
Gin Ala Phe Phe Arg Thr Gin Ser Thr He Asp Leu Glu Thr Glu Glu
180 185 190
Val Glu Glu Leu Trp Arg Thr Leu Asn Leu Gly Phe 195 200
(2) INFORMATION FOR SEQ ID NO: 471:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 999 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 76...927 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471:
GCACATAAAA TCGCGCTACT AGGGTATGAA TTTGAAGCGA TCGCTCCTAA AGAATTTGAA 60 ATTTAAGGAT TGATC ATG AAC GCT TGG AAT ACG ATT TAT GAT CAA TTT AAC 111 Met Asn Ala Trp Asn Thr He Tyr Asp Gin Phe Asn 1 5 10
CCT ATC GCT TTT AGT CTT GGC AGT ATT GAA GTG CAT TGG TAT GGT TTG 159 Pro He Ala Phe Ser Leu Gly Ser He Glu Val His Trp Tyr Gly Leu 15 20 25
GCG TAT GCG TGT GCG ATT GTT ACC GCT TTT TAT ATG GCG TTA AGA ATG 207 Ala Tyr Ala Cys Ala He Val Thr Ala Phe Tyr Met Ala Leu Arg Met 30 35 40
ATC CAA AAA GAC CCC AAG CGA TTC CCC ATT GAA AGG AAG GAA TTT GAG 255 He Gin Lys Asp Pro Lys Arg Phe Pro He Glu Arg Lys Glu Phe Glu 45 50 55 60
AGT TAT TTT TTA TGG GCG GAG CTT GGC ATT GTG CTA GGG GCA AGG ATA 303 Ser Tyr Phe Leu Trp Ala Glu Leu Gly He Val Leu Gly Ala Arg He 65 70 75
GGA TAC ATT CTT ATT TAT GAG CCT AAT TCT GGC TAT TAT TTG ACG CAT 351 Gly Tyr He Leu He Tyr Glu Pro Asn Ser Gly Tyr Tyr Leu Thr His 80 85 90
TTT TGG CAA ATC TTT AAC CCT TTT GAT AGC CAT GGG AAT TTT GTA GGC 399 Phe Trp Gin He Phe Asn Pro Phe Asp Ser His Gly Asn Phe Val Gly 95 100 105
ATT CGT GGG ATG AGC TAT CAT GGG GGG TTG GTG GGG TTT TTG ATC GCT 447 He Arg Gly Met Ser Tyr His Gly Gly Leu Val Gly Phe Leu He Ala 110 115 120
TCG TAT CTT TAT AGC CGT AAG GAT TTG AAA AAG CTT TTG ATT TAT TTG 495 Ser Tyr Leu Tyr Ser Arg Lys Asp Leu Lys Lys Leu Leu He Tyr Leu 125 130 135 140
GAT TTG ATT GCG ATC AGC CTG CCT TTA GGG TAT GTT TTT GGG AGG ATT 543 Asp Leu He Ala He Ser Leu Pro Leu Gly Tyr Val Phe Gly Arg He 145 150 155 GGG AAT TTT TTA AAC CAG GAG CTT GTG GGA AGA ATT GTC CCC AAA GAC 591 Gly Asn Phe Leu Asn Gin Glu Leu Val Gly Arg He Val Pro Lys Asp 160 165 170
AGC CAT TTA GGG CAA ATC ATA GGC ATT ATG GTG GAT AAT GAG TTG CGT 639 Ser His Leu Gly Gin He He Gly He Met Val Asp Asn Glu Leu Arg 175 180 185
TAT CCC AGC CAA TTG ATT GAA GCG TTT TTA GAG GGG GTT ATC GTG TTT 687 Tyr Pro Ser Gin Leu He Glu Ala Phe Leu Glu Gly Val He Val Phe 190 195 200
TTA ATG GTA ATG TGG GCT AAA AAA CAC ACC AAA ACG CAT GGG TTG CTG 735 Leu Met Val Met Trp Ala Lys Lys His Thr Lys Thr His Gly Leu Leu 205 210 215 220
ATT GTG GTT TAT GGT TTG GGG TAT TCC TTG ATG CGC TTT ATT GCG GAA 783 He Val Val Tyr Gly Leu Gly Tyr Ser Leu Met Arg Phe He Ala Glu 225 230 235
TTT TAC AGA GAG CCG GAC AGC CAA ATG GGG GTT TAT TTT TTA AAT TTG 831 Phe Tyr Arg Glu Pro Asp Ser Gin Met Gly Val Tyr Phe Leu Asn Leu 240 245 250
AGC ATG GGG CAG ATT TTA AGC TTA TTT ATG GTA ATT GTT TCG TTA GGG 879 Ser Met Gly Gin He Leu Ser Leu Phe Met Val He Val Ser Leu Gly 255 260 265
ATT TTA TTG TAT GCT ACA AAA AAT TCT AAA AAA ATA AAG GAA AAT CAA T 928 He Leu Leu Tyr Ala Thr Lys Asn Ser Lys Lys He Lys Glu Asn Gin 270 275 280
GAAATTTTTG GATCAAGAAA AAAGAAGACA ATTATTAAAC GAGCGCCATT CTTGCAAGAT 988 GTTTGATAGC C 999
(2) INFORMATION FOR SEQ ID NO:472:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 284 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:472:
Met Asn Ala Trp Asn Thr He Tyr Asp Gin Phe Asn Pro He Ala Phe
1 5 10 15
Ser Leu Gly Ser He Glu Val His Trp Tyr Gly Leu Ala Tyr Ala Cys
20 25 30
Ala He Val Thr Ala Phe Tyr Met Ala Leu Arg Met He Gin Lys Asp
35 40 45
Pro Lys Arg Phe Pro He Glu Arg Lys Glu Phe Glu Ser Tyr Phe Leu 50 55 60
Trp Ala Glu Leu Gly He Val Leu Gly Ala Arg He Gly Tyr He Leu 65 70 75 80
He Tyr Glu Pro Asn Ser Gly Tyr Tyr Leu Thr His Phe Trp Gin He
85 90 95
Phe Asn Pro Phe Asp Ser His Gly Asn Phe Val Gly He Arg Gly Met
100 105 110
Ser Tyr His Gly Gly Leu Val Gly Phe Leu He Ala Ser Tyr Leu Tyr
115 120 125
Ser Arg Lys Asp Leu Lys Lys Leu Leu He Tyr Leu Asp Leu He Ala
130 135 140
He Ser Leu Pro Leu Gly Tyr Val Phe Gly Arg He Gly Asn Phe Leu 145 150 155 160
Asn Gin Glu Leu Val Gly Arg He Val Pro Lys Asp Ser His Leu Gly
165 170 175
Gin He He Gly He Met Val Asp Asn Glu Leu Arg Tyr Pro Ser Gin
180 185 190
Leu He Glu Ala Phe Leu Glu Gly Val He Val Phe Leu Met Val Met
195 200 205
Trp Ala Lys Lys His Thr Lys Thr His Gly Leu Leu He Val Val Tyr
210 215 220
Gly Leu Gly Tyr Ser Leu Met Arg Phe He Ala Glu Phe Tyr Arg Glu 225 230 235 240
Pro Asp Ser Gin Met Gly Val Tyr Phe Leu Asn Leu Ser Met Gly Gin
245 250 255
He Leu Ser Leu Phe Met Val He Val Ser" Leu Gly He Leu Leu Tyr
260 265 270
Ala Thr Lys Asn Ser Lys Lys He Lys Glu Asn Gin 275 280
(2) INFORMATION FOR SEQ ID NO: 473:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1104 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 86...994 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:473:
AGCGAATTAG CCTTGCTTTT AAAGGGTAAG AGTGTGCTAG AGAGCATGAA CGATTTGATC 60 AGACGCGCTT AAAAGGAAAG AGAGC ATG CAA GAT TTT TCA AGT TTA TTA TTA 112
Met Gin Asp Phe Ser Ser Leu Leu Leu 1 5 AAA CTA CAA GAG TAT TGG AAG AAT CAA GGC TGT TTG GTG ATC CAG CCT 160 Lys Leu Gin Glu Tyr Trp Lys Asn Gin Gly Cys Leu Val He Gin Pro 10 15 20 25
TAT GAT ATT CCT GCA GGA GCT GGG ACA TTC CAT CCG GCC ACG CTT TTA 208 Tyr Asp He Pro Ala Gly Ala Gly Thr Phe His Pro Ala Thr Leu Leu 30 35 40
AGG AGT TTG GAT AAA AAG CCG TGG AAT GTG GCG TAT GTC GCG CCC TCT 256 Arg Ser Leu Asp Lys Lys Pro Trp Asn Val Ala Tyr Val Ala Pro Ser 45 50 55
AGA AGG CCT ACT GAT GGG CGC TAT GGG GAA AAC CCT AAC CGC TTG GGG 304 Arg Arg Pro Thr Asp Gly Arg Tyr Gly Glu Asn Pro Asn Arg Leu Gly 60 65 70
AGT TAT TAC CAA TTC CAA GTA GTC ATC AAG CCC AGC CCT TCT AAT ATC 352 Ser Tyr Tyr Gin Phe Gin Val Val He Lys Pro Ser Pro Ser Asn He 75 80 85
CAG GAA CTC TAT TTA AAA AGC TTA GAA GTG TTA GGG ATA AAC CTT AAT 400 Gin Glu Leu Tyr Leu Lys Ser Leu Glu Val Leu Gly He Asn Leu Asn 90 95 100 105
GAG CAT GAT ATA CGA TTT GTA GAA GAC AAT TGG GAG AGT CCG ACT TTA 448 Glu His Asp He Arg Phe Val Glu Asp Asn Trp Glu Ser Pro Thr Leu 110 115 120
GGG GCA TGG GGG CTT GGC TGG GAA GTG TGG CTT GAT GGC ATG GAA GTT 496 Gly Ala Trp Gly Leu Gly Trp Glu Val Trp Leu Asp Gly Met Glu Val 125 130 135
ACG CAA TTC ACT TAT TTC CAG CAA GTG GGG GGC ATT GCT TGT AGC CCT 544 Thr Gin Phe Thr Tyr Phe Gin Gin Val Gly Gly He Ala Cys Ser Pro 140 145 150
ATT CCT GTA GAG ATC ACT TAC GGC TTA GAA AGA TTA GCG ATG TAT GTG 592 He Pro Val Glu He Thr Tyr Gly Leu Glu Arg Leu Ala Met Tyr Val 155 160 165
CAA AAA GTG GAA AAT ATC CTA GAG ATT GAA TGG GCT AAA AAA AAT CAT 640 Gin Lys Val Glu Asn He Leu Glu He Glu Trp Ala Lys Lys Asn His 170 175 180 185
GAC AGC GTG AAT TAC GCA CAA GTG CAT TTG GAA AGC GAA TAC GAA TTC 688 Asp Ser Val Asn Tyr Ala Gin Val His Leu Glu Ser Glu Tyr Glu Phe 190 195 200
AGC AAG TAT CAT TTT GAA ACA GCG AGC GTG AAA CGG CTA TTA GAA ATG 736 Ser Lys Tyr His Phe Glu Thr Ala Ser Val Lys Arg Leu Leu Glu Met 205 210 215
TTT AAA AAC GCT CAA GCC GAA GCC TTG CAT TGC TTG GAA AAC AAG CTC 784 Phe Lys Asn Ala Gin Ala Glu Ala Leu His Cys Leu Glu Asn Lys Leu 220 225 230 CCC TTG CCG GCT TAT GAT TTT GTG ATG TTA TGC TCG CAT TTT TTC AAT 832 Pro Leu Pro Ala Tyr Asp Phe Val Met Leu Cys Ser His Phe Phe Asn 235 240 245
ATT TTA GAC GCC AGA AAA GCG ATT TCG GTG GCT GAA AGG CAA AAT TAT 880 He Leu Asp Ala Arg Lys Ala He Ser Val Ala Glu Arg Gin Asn Tyr 250 255 260 265
ATT TTA CAA ATC AGG GAT TTA GCC AAA GGG TGT GCG CTT CTT TAT AAA 928 He Leu Gin He Arg Asp Leu Ala Lys Gly Cys Ala Leu Leu Tyr Lys 270 275 280
GAA CAA GAA GAA GAG AGG GAA GAG CGT TTA AAA AAC GCT TTA ACA AAG 976 Glu Gin Glu Glu Glu Arg Glu Glu Arg Leu Lys Asn Ala Leu Thr Lys 285 290 295
GCT GAA AAT GGC GTT AGT TAAGGAAGTG TTGGTAGTTT TGAATCGCCT TTCGCCTT 1032 Ala Glu Asn Gly Val Ser 300
TTGAACTCCA AGAATCATGG GATAATAGCG GGTTGAATGT GGGGAGTGAA AATAGTGAAT 1092 TTAGCGAGAT TG 1104
(2) INFORMATION FOR SEQ ID NO: 474:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 303 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474:
Met Gin Asp Phe Ser Ser Leu Leu Leu Lys Leu Gin Glu Tyr Trp Lys
1 5 10 15
Asn Gin Gly Cys Leu Val He Gin Pro Tyr Asp He Pro Ala Gly Ala
20 25 30
Gly Thr Phe His Pro Ala Thr Leu Leu Arg Ser Leu Asp Lys Lys Pro
35 40 45
Trp Asn Val Ala Tyr Val Ala Pro Ser Arg Arg Pro Thr Asp Gly Arg
50 55 60
Tyr Gly Glu Asn Pro Asn Arg Leu Gly Ser Tyr Tyr Gin Phe Gin Val 65 70 75 80
Val He Lys Pro Ser Pro Ser Asn He Gin Glu Leu Tyr Leu Lys Ser
85 90 95
Leu Glu Val Leu Gly He Asn Leu Asn Glu His Asp He Arg Phe Val
100 105 110
Glu Asp Asn Trp Glu Ser Pro Thr Leu Gly Ala Trp Gly Leu Gly Trp
115 120 125
Glu Val Trp Leu Asp Gly Met Glu Val Thr Gin Phe Thr Tyr Phe Gin
130 135 140
Gin Val Gly Gly He Ala Cys Ser Pro He Pro Val Glu He Thr Tyr 145 150 155 160
Gly Leu Glu Arg Leu Ala Met Tyr Val Gin Lys Val Glu Asn He Leu
165 170 175
Glu He Glu Trp Ala Lys Lys Asn His Asp Ser Val Asn Tyr Ala Gin
180 185 190
Val His Leu Glu Ser Glu Tyr Glu Phe Ser Lys Tyr His Phe Glu Thr
195 200 205
Ala Ser Val Lys Arg Leu Leu Glu Met Phe Lys Asn Ala Gin Ala Glu
210 215 220
Ala Leu His Cys Leu Glu Asn Lys Leu Pro Leu Pro Ala Tyr Asp Phe 225 230 235 240
Val Met Leu Cys Ser His Phe Phe Asn He Leu Asp Ala Arg Lys Ala
245 250 255
He Ser Val Ala Glu Arg Gin Asn Tyr He Leu Gin He Arg Asp Leu
260 265 270
Ala Lys Gly Cys Ala Leu Leu Tyr Lys Glu Gin Glu Glu Glu Arg Glu
275 280 285
Glu Arg Leu Lys Asn Ala Leu Thr Lys Ala Glu Asn Gly Val Ser 290 295 300
(2) INFORMATION FOR SEQ ID NO: 475:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1620 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 66...1538 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:475:
GGCAAAGGAA TGCGATTTTA GCTAACCACA TCACTCTCAT GCAAGAGCTT TAAAAAGTCC 60
TAAAA ATG GCG CAA AAA ACT CTT TTG ATT ATC ACT GAT GGC ATT GGG TAT 110
Met Ala Gin Lys Thr Leu Leu He He Thr Asp Gly He Gly Tyr
1 5 10 15
CGT AAA GAT AGC GAT CAT AAC GCT TTC TTC CAT GCC AAA AAA CCC ACT 158 Arg Lys Asp Ser Asp His Asn Ala Phe Phe His Ala Lys Lys Pro Thr 20 25 30
TAT GAT TTG ATG TTT AAA ACC TTG CCT TAT AGC CTG ATT GAT ACG CAT 206 Tyr Asp Leu Met Phe Lys Thr Leu Pro Tyr Ser Leu He Asp Thr His 35 40 45
GGC TTG AGC GTG GGC TTA CCT AAG GGG CAA ATG GGA AAT TCT GAA GTG 254 Gly Leu Ser Val Gly Leu Pro Lys Gly Gin Met Gly Asn Ser Glu Val 50 55 60
GGG CAT ATG TGT ATT GGG GCT GGT AGG GTG CTC TAT CAG GAT TTA GTC 302 Gly His Met Cys He Gly Ala Gly Arg Val Leu Tyr Gin Asp Leu Val 65 70 75
AAA ATT TCT TTA AGC CTT CAA AAC GAT GAA TTA AAA AAC AAC CCC GCT 350 Lys He Ser Leu Ser Leu Gin Asn Asp Glu Leu Lys Asn Asn Pro Ala 80 85 90 95
TTT TTA AAC ACG ATC CAA AAA AGC CCT GTG GTG CAT CTT ATG GGT TTA 398 Phe Leu Asn Thr He Gin Lys Ser Pro Val Val His Leu Met Gly Leu 100 105 110
ATG AGC GAT GGA GGC GTG CAT TCA CAC ATT GAG CAT TTT ATC GCT CTG 446 Met Ser Asp Gly Gly Val His Ser His He Glu His Phe He Ala Leu 115 120 125
GCT TTA GAG TGT GAA AAA TCC CAT AAA AAA GTC TGT CTG CAT TTA ATC 494 Ala Leu Glu Cys Glu Lys Ser His Lys Lys Val Cys Leu His Leu He 130 135 140
ACC GAT GGG CGC GAT GTC GCT CCT AAA AGC GCT TTA ACT TAT TTA AAA 542 Thr Asp Gly Arg Asp Val Ala Pro Lys Ser Ala Leu Thr Tyr Leu Lys 145 150 155
CAA ATG CAA AAT ATC TGC AAT GAA AGC ATT CAA ATC GCT ACC ATA AGC 590 Gin Met Gin Asn He Cys Asn Glu Ser He Gin He Ala Thr He Ser 160 165 170 175
GGT CGT TTT TAT GCC ATG GAT AGG GAT AAG CGC TTT GAA AGG ATT GAG 638 Gly Arg Phe Tyr Ala Met Asp Arg Asp Lys Arg Phe Glu Arg He Glu 180 185 190
CTT GCG TAT CAT AGC TTA ATG GGG CTT AAT CAC ACG CCT TTA AGC CCT 686 Leu Ala Tyr His Ser Leu Met Gly Leu Asn His Thr Pro Leu Ser Pro 195 200 205
AGC GAG TAT ATC CAA AGC CAG TAT GAT AAA AAT ATC ACC GAT GAA TTT 734 Ser Glu Tyr He Gin Ser Gin Tyr Asp Lys Asn He Thr Asp Glu Phe 210 215 220
ATC ATG CCC GCT TGT TTT AAA AAT TAT TGC GGC ATG CAA GAT GAT GAG 782 He Met Pro Ala Cys Phe Lys Asn Tyr Cys Gly Met Gin Asp Asp Glu 225 230 235
AGT TTT ATT TTT ATC AAT TTC AGG AAT GAT AGG GCT AGA GAA ATC GTG 830 Ser Phe He Phe He Asn Phe Arg Asn Asp Arg Ala Arg Glu He Val 240 245 250 255
AGC GCT TTA GGC CAA AAA CAA TTC AGT GGC TTT AAG CGC CAA GTT TTT 878 Ser Ala Leu Gly Gin Lys Gin Phe Ser Gly Phe Lys Arg Gin Val Phe 260 265 270
AAA AAA CTC CAT ATC GCT ACC ATG ACG CCT TAT GAT AAC ACT TTC CCC 926 Lys Lys Leu His He Ala Thr Met Thr Pro Tyr Asp Asn Thr Phe Pro 275 280 285
TAC CCT GTT TTA TTC CCC AAA GAA AGC GTT CAA AAC ACG CTC GCT GAA 974 Tyr Pro Val Leu Phe Pro Lys Glu Ser Val Gin Asn Thr Leu Ala Glu 290 295 300
GTG GTC TCT CAA CAC AAC CTG ACC CAA AGC CAT ATC GCT GAA ACT GAA 1022 Val Val Ser Gin His Asn Leu Thr Gin Ser His He Ala Glu Thr Glu 305 310 315
AAA TAC GCG CAT GTA ACC TTT TTC ATC AAT GGC GGA GTG GAG ACG CCT 1070 Lys Tyr Ala His Val Thr Phe Phe He Asn Gly Gly Val Glu Thr Pro 320 325 330 335
TTT AAA AAT GAA AAC CGG GTG CTT ATC CAA AGC CCT AAA GTT ACC ACT 1118 Phe Lys Asn Glu Asn Arg Val Leu He Gin Ser Pro Lys Val Thr Thr 340 345 350
TAT GAC TTA AAG CCT GAA ATG AGC GCT AAA GAA GTA ACC CTT GCG GTG 1166 Tyr Asp Leu Lys Pro Glu Met Ser Ala Lys Glu Val Thr Leu Ala Val 355 360 365
TTA GAG CAA ATG AAA CTA GGC ACG GAT TTG ATC ATT GTG AAT TTT GCT 1214 Leu Glu Gin Met Lys Leu Gly Thr Asp Leu He He Val Asn Phe Ala 370 375 380
AAT GGC GAT ATG GTA GGG CAT ACG GGG AAT TTT GAA GCG AGC GTC AAA 1262 Asn Gly Asp Met Val Gly His Thr Gly Asn Phe Glu Ala Ser Val Lys 385 390 395
GCG GTG GAA GCA GTG GAT GCA TGT TTA GGG GAA ATC CTT TCA CTG GCT 1310 Ala Val Glu Ala Val Asp Ala Cys Leu Gly Glu He Leu Ser Leu Ala 400 405 410 415
AAA AAA TTG GAT TAC GCC ATG CTT TTA ACC AGC GAT CAT GGG AAT TGC 1358 Lys Lys Leu Asp Tyr Ala Met Leu Leu Thr Ser Asp His Gly Asn Cys 420 425 430
GAG CGC ATG AAA GAC GAA AAC CAA AAC CCC TTA ACC AAC CAC ACC GCC 1406 Glu Arg Met Lys Asp Glu Asn Gin Asn Pro Leu Thr Asn His Thr Ala 435 440 445
GGG AGC GTG TAT TGC TTT GTT TTA GGG GAT GGA GTC AAA TCC ATA AAA 1454 Gly Ser Val Tyr Cys Phe Val Leu Gly Asp Gly Val Lys Ser He Lys 450 455 460
AAC GGA GCC TTA AAC AAT ATC GCT AGC AGC GTG TTA AAA CTC ATG GGC 1502 Asn Gly Ala Leu Asn Asn He Ala Ser Ser Val Leu Lys Leu Met Gly 465 470 475
CTT AAA GCC CCA GCA ACG ATG GAC GAA CCC CTA TTT TAAACTAAAG GAAAAG 1554 Leu Lys Ala Pro Ala Thr Met Asp Glu Pro Leu Phe 480 485 490 AATGCAAATT GATGACGCAT TATTGCAACG CTTGGAAAAA TTGAGCATGC TAGAGATTAA 1614 AGATGA 1620
(2) INFORMATION FOR SEQ ID NO: 476:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 491 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476:
Met Ala Gin Lys Thr Leu Leu He He Thr Asp Gly He Gly Tyr Arg
1 5 10 15
Lys Asp Ser Asp His Asn Ala Phe Phe His Ala Lys Lys Pro Thr Tyr
20 25 30
Asp Leu Met Phe Lys Thr Leu Pro Tyr Ser Leu He Asp Thr His Gly
35 40 45
Leu Ser Val Gly Leu Pro Lys Gly Gin Met Gly Asn Ser Glu Val Gly
50 55 60
His Met Cys He Gly Ala Gly Arg Val Leu Tyr Gin Asp Leu Val Lys 65 70 75 80
He Ser Leu Ser Leu Gin Asn Asp Glu Leu Lys Asn Asn Pro Ala Phe
85 90 95
Leu Asn Thr He Gin Lys Ser Pro Val Val His Leu Met Gly Leu Met
100 105 110
Ser Asp Gly Gly Val His Ser His He Glu His Phe He Ala Leu Ala
115 120 125
Leu Glu Cys Glu Lys Ser His Lys Lys Val Cys Leu His Leu He Thr
130 135 140
Asp Gly Arg Asp Val Ala Pro Lys Ser Ala Leu Thr Tyr Leu Lys Gin 145 150 155 160
Met Gin Asn He Cys Asn Glu Ser He Gin He Ala Thr He Ser Gly
165 170 175
Arg Phe Tyr Ala Met Asp Arg Asp Lys Arg Phe Glu Arg He Glu Leu
180 185 190
Ala Tyr His Ser Leu Met Gly Leu Asn His Thr Pro Leu Ser Pro Ser
195 200 205
Glu Tyr He Gin Ser Gin Tyr Asp Lys Asn He Thr Asp Glu Phe He
210 215 220
Met Pro Ala Cys Phe Lys Asn Tyr Cys Gly Met Gin Asp Asp Glu Ser 225 230 235 240
Phe He Phe He Asn Phe Arg Asn Asp Arg Ala Arg Glu He Val Ser
245 250 255
Ala Leu Gly Gin Lys Gin Phe Ser Gly Phe Lys Arg Gin Val Phe Lys
260 265 270
Lys Leu His He Ala Thr Met Thr Pro Tyr Asp Asn Thr Phe Pro Tyr
275 280 285
Pro Val Leu Phe Pro Lys Glu Ser Val Gin Asn Thr Leu Ala Glu Val
290 295 300
Val Ser Gin His Asn Leu Thr Gin Ser His He Ala Glu Thr Glu Lys 305 310 315 320
Tyr Ala His Val Thr Phe Phe He Asn Gly Gly Val Glu Thr Pro Phe
325 330 335
Lys Asn Glu Asn Arg Val Leu He Gin Ser Pro Lys Val Thr Thr Tyr
340 345 350
Asp Leu Lys Pro Glu Met Ser Ala Lys Glu Val Thr Leu Ala Val Leu
355 360 365
Glu Gin Met Lys Leu Gly Thr Asp Leu He He Val Asn Phe Ala Asn
370 375 380
Gly Asp Met Val Gly His Thr Gly Asn Phe Glu Ala Ser Val Lys Ala 385 390 395 400
Val Glu Ala Val Asp Ala Cys Leu Gly Glu He Leu Ser Leu Ala Lys
405 410 415
Lys Leu Asp Tyr Ala Met Leu Leu Thr Ser Asp His Gly Asn Cys Glu
420 425 430
Arg Met Lys Asp Glu Asn Gin Asn Pro Leu Thr Asn His Thr Ala Gly
435 440 445
Ser Val Tyr Cys Phe Val Leu Gly Asp Gly Val Lys Ser He Lys Asn
450 455 460
Gly Ala Leu Asn Asn He Ala Ser Ser Val Leu Lys Leu Met Gly Leu 465 470 475 480
Lys Ala Pro Ala Thr Met Asp Glu Pro Leu Phe 485 490
(2) INFORMATION FOR SEQ ID NO: 477:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1440 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 72...1379 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477:
ATAGTAAAAT CAAATAACCT TATTTTAACC AAAGGTTATT AAAATTATCC TTATTATAGA 60 GAGTTTTTAA C ATG AAT TTT CAA GAA AAT TTA GCC GCT TTG GAT TTG GAG 110 Met Asn Phe Gin Glu Asn Leu Ala Ala Leu Asp Leu Glu 1 5 10
TAT CTT TGG CAC CCT TGT TCG CAA ATG CAA GAG CAT CAA AAT TTC CCC 158 Tyr Leu Trp His Pro Cys Ser Gin Met Gin Glu His Gin Asn Phe Pro 15 20 25
ATT ATC CCC ATT AAA AAG GCT CAA GGG ATT TAC CTC TAT GAT TTT AAT 206 He He Pro He Lys Lys Ala Gin Gly He Tyr Leu Tyr Asp Phe Asn 30 35 40 45
GAT AAC GCT TAC ATG GAT TTG ATC AGC TCA TGG TGG GTG AAT CTT TTT 254 Asp Asn Ala Tyr Met Asp Leu He Ser Ser Trp Trp Val Asn Leu Phe 50 55 60
GGG CAT AAT AAC GCC TAC ATC AGC CAG CAA CTC AAA AAT CAA ATT GAT 302 Gly His Asn Asn Ala Tyr He Ser Gin Gin Leu Lys Asn Gin He Asp 65 70 75
GAT TTA GAG CAT GTC CTT TTG GCT TCT TTT AGC CAT AAG CCC ATT ATC 350 Asp Leu Glu His Val Leu Leu Ala Ser Phe Ser His Lys Pro He He 80 85 90
ACG CTC TCT CAA AGG CTT TGC CAG CTC ACT CAT ATG GAT AAA TGC TTT 398 Thr Leu Ser Gin Arg Leu Cys Gin Leu Thr His Met Asp Lys Cys Phe 95 100 105
TAT GCG GAT AAC GGC TCA TCT TGT GTT GAA ATC GCT TTG AAA ATG AGC 446 Tyr Ala Asp Asn Gly Ser Ser Cys Val Glu He Ala Leu Lys Met Ser 110 115 120 125
TAT CAC GCC CAT TTT TTA AAG AAT CAA ACG CGC CGC AAA AAG CTT TTT 494 Tyr His Ala His Phe Leu Lys Asn Gin Thr Arg Arg Lys Lys Leu Phe 130 135 140
TTA TCG CTC TCT AAT TCC TAT CAT GGC GAG ACT TTG GGA GCG TTA AGC 542 Leu Ser Leu Ser Asn Ser Tyr His Gly Glu Thr Leu Gly Ala Leu Ser 145 150 155
GTG GGC GAT GTG AAA CTT TAT AAA GAC ACT TAC ACC CCT TTA TTG CTC 590 Val Gly Asp Val Lys Leu Tyr Lys Asp Thr Tyr Thr Pro Leu Leu Leu 160 165 170
AAA AAT CTC ACC ACA CCT GTG CCT AAA AAC GAC CAT GAA ATA GAA AAT 638 Lys Asn Leu Thr Thr Pro Val Pro Lys Asn Asp His Glu He Glu Asn 175 180 185
AGT TTG AAC GCT TTA AAG CGT TTG TTA GAC AAG CAT AGT GAA GAA ATT 686 Ser Leu Asn Ala Leu Lys Arg Leu Leu Asp Lys His Ser Glu Glu He 190 195 200 205
TGC GCT TTC ATT GCA GAG CCT CTT TTG CAA TGC GCA GGG AAT ATG CAT 734 Cys Ala Phe He Ala Glu Pro Leu Leu Gin Cys Ala Gly Asn Met His 210 215 220
ATT TAT AGC GCA AGA TAT TTA AAA CAA GCC GTT TTA TTG TGC AAG CAA 782 He Tyr Ser Ala Arg Tyr Leu Lys Gin Ala Val Leu Leu Cys Lys Gin 225 230 235
AAA AAC ATC CAC ATT ATT TTT GAT GAA ATC GCT ACC GGG TTT GGG CGC 830 Lys Asn He His He He Phe Asp Glu He Ala Thr Gly Phe Gly Arg 240 245 250
ACA GGG AGC ATG TTT GCT TAT GAA CAA TGC GAA ATT AAG CCG GAT TTT 878 Thr Gly Ser Met Phe Ala Tyr Glu Gin Cys Glu He Lys Pro Asp Phe 255 260 265
TTA TGC TTG TCT AAG GGG ATT AGT GGG GGG TAT TTG CCT TTA AGC GCA 926 Leu Cys Leu Ser Lys Gly He Ser Gly Gly Tyr Leu Pro Leu Ser Ala 270 275 280 285
CTA TTA ACC CAT AAT GAA ATC TAT AAC CAA TTT TAC GCC CCC TAT GAA 974 Leu Leu Thr His Asn Glu He Tyr Asn Gin Phe Tyr Ala Pro Tyr Glu 290 295 300
GAA AAT AAA GCG TTT TTG CAT TCG CAC AGC TAC ACA GGA AAC GCT TTG 1022 Glu Asn Lys Ala Phe Leu His Ser His Ser Tyr Thr Gly Asn Ala Leu 305 310 315
GCA TGC GCA TGC GCG AAC GCT ACG CTG GAT ATT TTT GAA AAA GAA AAT 1070 Ala Cys Ala Cys Ala Asn Ala Thr Leu Asp He Phe Glu Lys Glu Asn 320 325 330
GTT ATT GAA AAG AAC AAG GCT TTA AGC GGG TTT ATT TTT AAT ACG CTC 1118 Val He Glu Lys Asn Lys Ala Leu Ser Gly Phe He Phe Asn Thr Leu 335 340 345
CAA AAC GCA TTA AAA CCC TTG ATG GAG CAA CAA GTG GTG TCT GAT TTA 1166 Gin Asn Ala Leu Lys Pro Leu Met Glu Gin Gin Val Val Ser Asp Leu 350 355 360 365
AGG CAT TTG GGC ATG GTC TTT GCC TTT GAA GTC TTT ATT CAA ACC AAA 1214 Arg His Leu Gly Met Val Phe Ala Phe Glu Val Phe He Gin Thr Lys 370 375 380
GAG CGT TTG AGT TTG GCG GTT TTT AAA AAA ACT CTA AAA AAA GGC CTG 1262 Glu Arg Leu Ser Leu Ala Val Phe Lys Lys Thr Leu Lys Lys Gly Leu 385 390 395
TTA TTA CGC CCT TTA AAC AAC ACC ATT TAC CTC ATG CCC CCT TAC ATT 1310 Leu Leu Arg Pro Leu Asn Asn Thr He Tyr Leu Met Pro Pro Tyr He 400 405 410
ATC ACG CAT GAA GAA GTC AAA AAG GCG GTT GCG GGG CTA GTG GAA ATT 1358 He Thr His Glu Glu Val Lys Lys Ala Val Ala Gly Leu Val Glu He 415 420 425
CTT GAT GAG TTA AGA AAA GGC TGAAAGCGTT TTTAAAAATA AAATAGTAAA AAGC 1413 Leu Asp Glu Leu Arg Lys Gly 430 435
TTGAATTGGA TCAAGCGATA GCTTTTT 1440
(2) INFORMATION FOR SEQ ID NO: 478:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 436 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:478:
Met Asn Phe Gin Glu Asn Leu Ala Ala Leu Asp Leu Glu Tyr Leu Trp
1 5 10 15
His Pro Cys Ser Gin Met Gin Glu His Gin Asn Phe Pro He He Pro
20 25 30
He Lys Lys Ala Gin Gly He Tyr Leu Tyr Asp Phe Asn Asp Asn Ala
35 40 45
Tyr Met Asp Leu He Ser Ser Trp Trp Val Asn Leu Phe Gly His Asn
50 55 60
Asn Ala Tyr He Ser Gin Gin Leu Lys Asn Gin He Asp Asp Leu Glu 65 70 75 80
His Val Leu Leu Ala Ser Phe Ser His Lys Pro He He Thr Leu Ser
85 90 95
Gin Arg Leu Cys Gin Leu Thr His Met Asp Lys Cys Phe Tyr Ala Asp
100 105 110
Asn Gly Ser Ser Cys Val Glu He Ala Leu Lys Met Ser Tyr His Ala
115 120 125
His Phe Leu Lys Asn Gin Thr Arg Arg Lys Lys Leu Phe Leu Ser Leu
130 135 140
Ser Asn Ser Tyr His Gly Glu Thr Leu Gly Ala Leu Ser Val Gly Asp 145 150 155 160
Val Lys Leu Tyr Lys Asp Thr Tyr Thr Pro Leu Leu Leu Lys Asn Leu
165 170 175
Thr Thr Pro Val Pro Lys Asn Asp His Glu He Glu Asn Ser Leu Asn
180 185 190
Ala Leu Lys Arg Leu Leu Asp Lys His Ser Glu Glu He Cys Ala Phe
195 200 205
He Ala Glu Pro Leu Leu Gin Cys Ala Gly Asn Met His He Tyr Ser
210 215 220
Ala Arg Tyr Leu Lys Gin Ala Val Leu Leu Cys Lys Gin Lys Asn He 225 230 235 240
His He He Phe Asp Glu He Ala Thr Gly Phe Gly Arg Thr Gly Ser
245 250 255
Met Phe Ala Tyr Glu Gin Cys Glu He Lys Pro Asp Phe Leu Cys Leu
260 265 270
Ser Lys Gly He Ser Gly Gly Tyr Leu Pro Leu Ser Ala Leu Leu Thr
275 280 285
His Asn Glu He Tyr Asn Gin Phe Tyr Ala Pro Tyr Glu Glu Asn Lys
290 295 300
Ala Phe Leu His Ser His Ser Tyr Thr Gly Asn Ala Leu Ala Cys Ala 305 310 315 320
Cys Ala Asn Ala Thr Leu Asp He Phe Glu Lys Glu Asn Val He Glu
325 330 335
Lys Asn Lys Ala Leu Ser Gly Phe He Phe Asn Thr Leu Gin Asn Ala
340 345 350
Leu Lys Pro Leu Met Glu Gin Gin Val Val Ser Asp Leu Arg His Leu
355 360 365
Gly Met Val Phe Ala Phe Glu Val Phe He Gin Thr Lys Glu Arg Leu 370 375 380 Ser Leu Ala Val Phe Lys Lys Thr Leu Lys Lys Gly Leu Leu Leu Arg 385 390 395 400
Pro Leu Asn Asn Thr He Tyr Leu Met Pro Pro Tyr He He Thr His
405 410 415
Glu Glu Val Lys Lys Ala Val Ala Gly Leu Val Glu He Leu Asp Glu
420 425 430
Leu Arg Lys Gly 435
(2) INFORMATION FOR SEQ ID NO: 479:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 360 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 82...294 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479:
AAAGCTAAAC AAGAGCAAGA GAAACGCATG GCATAAGAGC CATGCGCTTG GCAGTGAAAC 60 AAAAGAAAAA AAGGAAACAA C ATG GGA AAA ATG AAA CAA GAA ACA GCG ATT 111
Met Gly Lys Met Lys Gin Glu Thr Ala He 1 5 10
GAC TAT GAA AAA TTA GCG AAT CAT TGG AAT AAT AAT GAT GAA AAC AGC 159 Asp Tyr Glu Lys Leu Ala Asn His Trp Asn Asn Asn Asp Glu Asn Ser 15 20 25
GAA GCA CTA AAC GCT TTT GCA GAC GCT TAC CTT TAT AAA CAT GAG AAA 207 Glu Ala Leu Asn Ala Phe Ala Asp Ala Tyr Leu Tyr Lys His Glu Lys 30 35 40
AAG AGT CAA AAG ATT CGG GCA ATA GAG ATA AGT TCT CTA AAC AAA GCC 255 Lys Ser Gin Lys He Arg Ala He Glu He Ser Ser Leu Asn Lys Ala 45 50 55
TGC ATG GGA GAA TTT TAC CAC AAA AAC CCA AAA TTA TTT TAATAACGAT CG 306 Cys Met Gly Glu Phe Tyr His Lys Asn Pro Lys Leu Phe 60 65 70
CTCCAAGGAA CCAACGCCCC ATGACCTCAA GAAAAGAGAA TAGCTTGAAT CGGT 360
(2) INFORMATION FOR SEQ ID NO: 480:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 71 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480:
Met Gly Lys Met Lys Gin Glu Thr Ala He Asp Tyr Glu Lys Leu Ala
1 5 10 15
Asn His Trp Asn Asn Asn Asp Glu Asn Ser Glu Ala Leu Asn Ala Phe
20 25 30
Ala Asp Ala Tyr Leu Tyr Lys His Glu Lys Lys Ser Gin Lys He Arg
35 40 45
Ala He Glu He Ser Ser Leu Asn Lys Ala Cys Met Gly Glu Phe Tyr
50 55 60
His Lys Asn Pro Lys Leu Phe 65 70
(2) INFORMATION FOR SEQ ID NO: 481:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1300 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 62...1255 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481:
TTGCTAACTC AAAAGGCGAA TTGCAATATT CTAACACGCC TAATATTTAT AAGGCGATTA 60 A AGA CAT AGA AAC AGA GCT AGA TGC ACT AGA AAA CAG GCT AGA AAC AAT 109 Arg His Arg Asn Arg Ala Arg Cys Thr Arg Lys Gin Ala Arg Asn Asn 1 5 10 15
AAG AGT TTT AGG CAT GAA AAC TAT TTT TAT AAA GTT TTG GGT AGT GCA 157 Lys Ser Phe Arg His Glu Asn Tyr Phe Tyr Lys Val Leu Gly Ser Ala 20 25 30
ACT TCT CAA ATA GAA AGT TTG AAA AAA AGA GAA AAT GCC CTA TTT GAT 205 Thr Ser Gin He Glu Ser Leu Lys Lys Arg Glu Asn Ala Leu Phe Asp 35 40 45
CAT TTA GAT AGT CTA AAA AGT TTA TTA GAA AAA ACA CAT TGG GAA AAA 253 His Leu Asp Ser Leu Lys Ser Leu Leu Glu Lys Thr His Trp Glu Lys 50 55 60
GAA AAA TTC ACG CCC CCA ATA AAT GAA AAA GAA CTT AAT AGG CAA CTT 301 Glu Lys Phe Thr Pro Pro He Asn Glu Lys Glu Leu Asn Arg Gin Leu 65 70 75 80
AAA GAA GTG AGA TGG TTC AAT AAA GAA ACT CCA ACT TCT AAA AAC ACT 349 Lys Glu Val Arg Trp Phe Asn Lys Glu Thr Pro Thr Ser Lys Asn Thr 85 90 95
TAT AAG AAA ATT CAA AAA TTA GCT GTT TAT AAA AGC CCT TTA ATA AAA 397 Tyr Lys Lys He Gin Lys Leu Ala Val Tyr Lys Ser Pro Leu He Lys 100 105 110
GAT TAT CTT TAT ACC ATT AAA AAA CTT TTT GCC ACA CAA AAA AAG ATT 445 Asp Tyr Leu Tyr Thr He Lys Lys Leu Phe Ala Thr Gin Lys Lys He 115 120 125
ATA GAT TTA GAA AAA AAT TAT AAA GAT TTA AGA GCC TTA AAG GAA GAA 493 He Asp Leu Glu Lys Asn Tyr Lys Asp Leu Arg Ala Leu Lys Glu Glu 130 135 140
TTT AGC AAA GAT TTA GAA ACT GAT TTA TCC CAT TCA AAA AAA CGC TTT 541 Phe Ser Lys Asp Leu Glu Thr Asp Leu Ser His Ser Lys Lys Arg Phe 145 150 155 160
GAA CTT TAC ACT AGA CTA AAG AGC ATG AGC AAA GTT TTT ATA AGC AAA 589 Glu Leu Tyr Thr Arg Leu Lys Ser Met Ser Lys Val Phe He Ser Lys 165 170 175
AGC ATT GTT AAA AAT TTA GAA AAA ATT GCT TTA GAT TTT AAA AGC GAT 637 Ser He Val Lys Asn Leu Glu Lys He Ala Leu Asp Phe Lys Ser Asp 180 185 190
AGA CAT AGT ATT TCG CAA AGA GCT TTT GAA TTT TTT AAG TAT ATG AAT 685 Arg His Ser He Ser Gin Arg Ala Phe Glu Phe Phe Lys Tyr Met Asn 195 200 205
TAT CAA AAT TTA AGC TTG ACT GAT AAA GGC AAT ATG TTT TTA GTG GCT 733 Tyr Gin Asn Leu Ser Leu Thr Asp Lys Gly Asn Met Phe Leu Val Ala 210 215 220
AAG TTT TTT AAA GAT AGT GCT TTA CTT GTT AAT ATT GCT AGG TTT GAA 781 Lys Phe Phe Lys Asp Ser Ala Leu Leu Val Asn He Ala Arg Phe Glu 225 230 235 240
ATG AAA AAG ATA GAT GAT AGT GTT AAA AAT TCT AAC CCA CAA GAC AAT 829 Met Lys Lys He Asp Asp Ser Val Lys Asn Ser Asn Pro Gin Asp Asn 245 250 255
TTA TTA GAC AAA CAA GTT TGG CTC AAT CTT TTA GAG CAT TTA AAA AGA 877 Leu Leu Asp Lys Gin Val Trp Leu Asn Leu Leu Glu His Leu Lys Arg 260 265 270 CTT GAA GAG GAA AAT TAT TGT TTT GCT AAG AAA CGA AAA GAA TTC TTA 925 Leu Glu Glu Glu Asn Tyr Cys Phe Ala Lys Lys Arg Lys Glu Phe Leu 275 280 285
GAG ACT AGA GCG ATG GAG CTA TCA AAA GAT TTA AAA TTT TTA ACA CAG 973 Glu Thr Arg Ala Met Glu Leu Ser Lys Asp Leu Lys Phe Leu Thr Gin 290 295 300
GCT AAT GAA AAT GAT TTG CCC ATT TAT GAA AGA GGG CAA AGG GAT AAA 1021 Ala Asn Glu Asn Asp Leu Pro He Tyr Glu Arg Gly Gin Arg Asp Lys 305 310 315 320
ATC ATT AAA CGC TGT GAA AAA TCG CTT AAC TTT TTG CAG AAA GAA TTA 1069 He He Lys Arg Cys Glu Lys Ser Leu Asn Phe Leu Gin Lys Glu Leu 325 330 335
CAA TGC TTT AAA ACC TTA TTG AAA AGT GCA AGT ATA GCT TTA GAA AAC 1117 Gin Cys Phe Lys Thr Leu Leu Lys Ser Ala Ser He Ala Leu Glu Asn 340 345 350
TTG CAA AAT AAC CAT CAA ATC ACA GCC GTT ACA CAA GAC ACG CAA GAA 1165 Leu Gin Asn Asn His Gin He Thr Ala Val Thr Gin Asp Thr Gin Glu 355 360 365
AAC ACA AAC GCG CTC AAA AAT ACT ACT CAA GAT TTT AAC AAA ACT ACC 1213 Asn Thr Asn Ala Leu Lys Asn Thr Thr Gin Asp Phe Asn Lys Thr Thr 370 375 380
AAT GAA CCA ACA AAC CCT AAC AAT AAC TAT GGA ATG GAT TTT TAAAACCAT 1264 Asn Glu Pro Thr Asn Pro Asn Asn Asn Tyr Gly Met Asp Phe 385 390 395
CCATAAAATT AAATAACTTT ACTTAGCGTA TTTTTT 1300
(2) INFORMATION FOR SEQ ID NO: 482:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 398 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:482:
Arg His Arg Asn Arg Ala Arg Cys Thr Arg Lys Gin Ala Arg Asn Asn
1 5 10 15
Lys Ser Phe Arg His Glu Asn Tyr Phe Tyr Lys Val Leu Gly Ser Ala
20 25 30
Thr Ser Gin He Glu Ser Leu Lys Lys Arg Glu Asn Ala Leu Phe Asp
35 40 45
His Leu Asp Ser Leu Lys Ser Leu Leu Glu Lys Thr His Trp Glu Lys 50 55 60 Glu Lys Phe Thr Pro Pro He Asn Glu Lys Glu Leu Asn Arg Gin Leu 65 70 75 80
Lys Glu Val Arg Trp Phe Asn Lys Glu Thr Pro Thr Ser Lys Asn Thr
85 90 95
Tyr Lys Lys He Gin Lys Leu Ala Val Tyr Lys Ser Pro Leu He Lys
100 105 110
Asp Tyr Leu Tyr Thr He Lys Lys Leu Phe Ala Thr Gin Lys Lys He
115 120 125
He Asp Leu Glu Lys Asn Tyr Lys Asp Leu Arg Ala Leu Lys Glu Glu
130 135 140
Phe Ser Lys Asp Leu Glu Thr Asp Leu Ser His Ser Lys Lys Arg Phe 145 150 155 160
Glu Leu Tyr Thr Arg Leu Lys Ser Met Ser Lys Val Phe He Ser Lys
165 170 175
Ser He Val Lys Asn Leu Glu Lys He Ala Leu Asp Phe Lys Ser Asp
180 185 190
Arg His Ser He Ser Gin Arg Ala Phe Glu Phe Phe Lys Tyr Met Asn
195 200 205
Tyr Gin Asn Leu Ser Leu Thr Asp Lys Gly Asn Met Phe Leu Val Ala
210 215 220
Lys Phe Phe Lys Asp Ser Ala Leu Leu Val Asn He Ala Arg Phe Glu 225 230 235 240
Met Lys Lys He Asp Asp Ser Val Lys Asn Ser Asn Pro Gin Asp Asn
245 250 255
Leu Leu Asp Lys Gin Val Trp Leu Asn Leu Leu Glu His Leu Lys Arg
260 265 270
Leu Glu Glu Glu Asn Tyr Cys Phe Ala Lys Lys Arg Lys Glu Phe Leu
275 280 285
Glu Thr Arg Ala Met Glu Leu Ser Lys Asp Leu Lys Phe Leu Thr Gin
290 295 300
Ala Asn Glu Asn Asp Leu Pro He Tyr Glu Arg Gly Gin Arg Asp Lys 305 310 315 320
He He Lys Arg Cys Glu Lys Ser Leu Asn Phe Leu Gin Lys Glu Leu
325 330 335
Gin Cys Phe Lys Thr Leu Leu Lys Ser Ala Ser He Ala Leu Glu Asn
340 345 350
Leu Gin Asn Asn His Gin He Thr Ala Val Thr Gin Asp Thr Gin Glu
355 360 365
Asn Thr Asn Ala Leu Lys Asn Thr Thr Gin Asp Phe Asn Lys Thr Thr
370 375 380
Asn Glu Pro Thr Asn Pro Asn Asn Asn Tyr Gly Met Asp Phe 385 390 395
(2) INFORMATION FOR SEQ ID NO:483:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 630 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence (B) LOCATION: 105...518 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:483:
GTTGCAACTT ATCTTGTTGT TCTTTAGTGG GATACAAGCG GAATTTAAAA CCCTTATTGA 60 CTTTCATAGA AAGTATTTTA ACCTCTTTTT GTTAAAATAG GTCT ATG AAA AAA ATT 116
Met Lys Lys He 1
GAT GAT ATG AGA CAC GGA AGA CAT TGT GTT TTT TTA ATG CAT GTG CAT 164 Asp Asp Met Arg His Gly Arg His Cys Val Phe Leu Met His Val His 5 10 15 20
TTT GTA TTT GTT ACT AAA TAC AGG CGT TCA GCA TTC AAT AAG GAA GTG 212 Phe Val Phe Val Thr Lys Tyr Arg Arg Ser Ala Phe Asn Lys Glu Val 25 30 35
ATA GAT TTT TTA GGA TCG GTG TTT GCC AAA GTG TGT AAG GAC TTT GAG 260 He Asp Phe Leu Gly Ser Val Phe Ala Lys Val Cys Lys Asp Phe Glu 40 45 50
AGC GAA TTG GTA GAA TTT GAT GGG GAG AGC GAT CAT GTG CAT TTG CTT 308 Ser Glu Leu Val Glu Phe Asp Gly Glu Ser Asp His Val His Leu Leu 55 60 65
ATC AAC TAC CCT CCA AAA GTG AGC GTG AGT AAG TTA GTT AAT TCT TTA 356 He Asn Tyr Pro Pro Lys Val Ser Val Ser Lys Leu Val Asn Ser Leu 70 75 80
AAA GGC GTT AGC AGT CGT TTG ACT AGA CAA CAC CAT TTC AAA AGC GTT 404 Lys Gly Val Ser Ser Arg Leu Thr Arg Gin His His Phe Lys Ser Val 85 90 95 100
GAA GCT AGT TTG TGG GGG AAG CAT TTA TGG TCG CCT AGT TAT TTC GCT 452 Glu Ala Ser Leu Trp Gly Lys His Leu Trp Ser Pro Ser Tyr Phe Ala 105 110 115
GGG AGT TGT GGG GAC GCG CCT TTA GAG ATG ATT AAG CAA TAC ATA CAA 500 Gly Ser Cys Gly Asp Ala Pro Leu Glu Met He Lys Gin Tyr He Gin 120 125 130
GAT CAA GAA ACA CCG CAT TAAATTAGCT AACTTTGATT TTTAAGTAGA ACGCGCTA 556 Asp Gin Glu Thr Pro His 135
AAAAGCGAAT GGATCTAAGT GAAACAATGT TCAAATAGCC TAACGGCTAA ACGCTTACAT 616 CTCCGCCCTA AAGG 630
(2) INFORMATION FOR SEQ ID NO: 484:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 138 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484:
Met Lys Lys He Asp Asp Met Arg His Gly Arg His Cys Val Phe Leu
1 5 10 15
Met His Val His Phe Val Phe Val Thr Lys Tyr Arg Arg Ser Ala Phe
20 25 30
Asn Lys Glu Val He Asp Phe Leu Gly Ser Val Phe Ala Lys Val Cys
35 40 45
Lys Asp Phe Glu Ser Glu Leu Val Glu Phe Asp Gly Glu Ser Asp His
50 55 60
Val His Leu Leu He Asn Tyr Pro Pro Lys Val Ser Val Ser Lys Leu 65 70 75 80
Val Asn Ser Leu Lys Gly Val Ser Ser Arg Leu Thr Arg Gin His His
85 90 95
Phe Lys Ser Val Glu Ala Ser Leu Trp Gly Lys His Leu Trp Ser Pro
100 105 110
Ser Tyr Phe Ala Gly Ser Cys Gly Asp Ala Pro Leu Glu Met He Lys
115 120 125
Gin Tyr He Gin Asp Gin Glu Thr Pro His 130 135
(2) INFORMATION FOR SEQ ID NO: 485:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1080 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 101...1000 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485:
TGGATTTAAA AGTGAGCGAT TTGGTGCGTG TGGCCAATGA ATATTTTAAA GACACCCAAT 60 CAACCACCGT GTTTTTGAAA CCTTAAAAGA GCCTTATAAC ATG CAA TTT CAT TCA 115
Met Gin Phe His Ser 1 5
TCT AGC GCG TTG ATT ACG CCT TTT AAA AAA GAT TTG AGC GTT GAT GAG 163 Ser Ser Ala Leu He Thr Pro Phe Lys Lys Asp Leu Ser Val Asp Glu 10 15 20
GCC GCT TAT GAA ACC TTG ATC AAG CGC CAA ATT TTT CAG GGC ATG GAC 211 Ala Ala Tyr Glu Thr Leu He Lys Arg Gin He Phe Gin Gly Met Asp 25 30 35
GCA TGC GTG CCT GTT GGC ACG ACA GGA GAA TCC GCC ACG CTC ACC CAC 259 Ala Cys Val Pro Val Gly Thr Thr Gly Glu Ser Ala Thr Leu Thr His 40 45 50
AAA GAG CAC ATG CGT TGC ATT GAA ATC GCC ATA GAA ACT TGC AAA AAC 307 Lys Glu His Met Arg Cys He Glu He Ala He Glu Thr Cys Lys Asn 55 60 65
ACT AAA ACG CCC TCA AAT TCG CGC ATG AAA GTG TTA GCC GGC GTG GGC 355 Thr Lys Thr Pro Ser Asn Ser Arg Met Lys Val Leu Ala Gly Val Gly 70 75 80 85
AGT AAC GCC ACG AGC GAG TCC CTT TCT TTA GCA AAG TTC GCT CAA AAA 403 Ser Asn Ala Thr Ser Glu Ser Leu Ser Leu Ala Lys Phe Ala Gin Lys 90 95 100
ATC GGC GCG GAT GCG ATT TTA TGC GTA AGC CCT TAT TAT AAC CGC CCC 451 He Gly Ala Asp Ala He Leu Cys Val Ser Pro Tyr Tyr Asn Arg Pro 105 110 115
ACC CAA CAA GGC TTG TTT GAA CAT TAT AAA ACC ATC GCT CAA TCG GTG 499 Thr Gin Gin Gly Leu Phe Glu His Tyr Lys Thr He Ala Gin Ser Val 120 125 130
GAA ATC CCT GTC ATG CTT TAT GAT GTG CCA AGT AGA ACA GGC GTG TCT 547 Glu He Pro Val Met Leu Tyr Asp Val Pro Ser Arg Thr Gly Val Ser 135 140 145
ATT GAA GTT CCA ACC GCC CTC AAA CTC TTT AGA GAA GTC CCT AAC ATT 595 He Glu Val Pro Thr Ala Leu Lys Leu Phe Arg Glu Val Pro Asn He 150 155 160 165
AAA GCC ATT AAA GAA GCG TCT GGC TCT TTG AAA AGG GTA ACA GAA TTG 643 Lys Ala He Lys Glu Ala Ser Gly Ser Leu Lys Arg Val Thr Glu Leu 170 175 180
CAT TAT TAT GAA AAA GAT TTT AAA ATT TTT AGT GGG GAA GAT TCG CTC 691 His Tyr Tyr Glu Lys Asp Phe Lys He Phe Ser Gly Glu Asp Ser Leu 185 190 195
AAC CAC TCC ATC ATG TTT TCA GGG GGG TGC GGC GTG ATT TCA GTG ACC 739 Asn His Ser He Met Phe Ser Gly Gly Cys Gly Val He Ser Val Thr 200 205 210
GGT AAT TTA ATG CCC AAT CTG ATT TCA CAA ATG GTC AAT TGC GCG CTC 787 Gly Asn Leu Met Pro Asn Leu He Ser Gin Met Val Asn Cys Ala Leu 215 220 225 AAA CAA AAA TAC CAA CAA GCC CTA GAA ATC CAA AAT AAG CTT TTT TGT 835 Lys Gin Lys Tyr Gin Gin Ala Leu Glu He Gin Asn Lys Leu Phe Cys 230 235 240 245
TTG CAC CAA GCC CTT TTT GTA GAA ACA AAT CCC ATC CCT ATT AAA ATG 883 Leu His Gin Ala Leu Phe Val Glu Thr Asn Pro He Pro He Lys Met 250 255 260
GCT ATG CAT TTA GCC GGC TTG ATT GAA AAC CCA AGC TAC AGA CTG CCT 931 Ala Met His Leu Ala Gly Leu He Glu Asn Pro Ser Tyr Arg Leu Pro 265 270 275
TTA GTG GCC CCA AGC AAA GAA ACG ATT CAA CTT TTA GAA AAA ACT TTA 979 Leu Val Ala Pro Ser Lys Glu Thr He Gin Leu Leu Glu Lys Thr Leu 280 285 290
CAA CAA TAT GAG GTA ATT GCA TGAATGGTTC CAATCACATG AAAAATAAAA CCCT 1034 Gin Gin Tyr Glu Val He Ala 295 300
AGTGATCAGC GGCGCGACTA GAGGGATTGG CAAGGCGATA TTGTAC 1080
(2) INFORMATION FOR SEQ ID NO: 486:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 300 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS : single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:486:
Met Gin Phe His Ser Ser Ser Ala Leu He Thr Pro Phe Lys Lys Asp
1 5 10 15
Leu Ser Val Asp Glu Ala Ala Tyr Glu Thr Leu He Lys Arg Gin He
20 25 30
Phe Gin Gly Met Asp Ala Cys Val Pro Val Gly Thr Thr Gly Glu Ser
35 40 45
Ala Thr Leu Thr His Lys Glu His Met Arg Cys He Glu He Ala He
50 55 60
Glu Thr Cys Lys Asn Thr Lys Thr Pro Ser Asn Ser Arg Met Lys Val 65 70 75 80
Leu Ala Gly Val Gly Ser Asn Ala Thr Ser Glu Ser Leu Ser Leu Ala
85 90 95
Lys Phe Ala Gin Lys He Gly Ala Asp Ala He Leu Cys Val Ser Pro
100 105 110
Tyr Tyr Asn Arg Pro Thr Gin Gin Gly Leu Phe Glu His Tyr Lys Thr
115 120 125
He Ala Gin Ser Val Glu He Pro Val Met Leu Tyr Asp Val Pro Ser
130 135 140
Arg Thr Gly Val Ser He Glu Val Pro Thr Ala Leu Lys Leu Phe Arg 145 150 155 160 Glu Val Pro Asn He Lys Ala He Lys Glu Ala Ser Gly Ser Leu Lys
165 170 175
Arg Val Thr Glu Leu His Tyr Tyr Glu Lys Asp Phe Lys He Phe Ser
180 185 190
Gly Glu Asp Ser Leu Asn His Ser He Met Phe Ser Gly Gly Cys Gly
195 200 205
Val He Ser Val Thr Gly Asn Leu Met Pro Asn Leu He Ser Gin Met
210 215 220
Val Asn Cys Ala Leu Lys Gin Lys Tyr Gin Gin Ala Leu Glu He Gin 225 230 235 240
Asn Lys Leu Phe Cys Leu His Gin Ala Leu Phe Val Glu Thr Asn Pro
245 250 255
He Pro He Lys Met Ala Met His Leu Ala Gly Leu He Glu Asn Pro
260 265 270
Ser Tyr Arg Leu Pro Leu Val Ala Pro Ser Lys Glu Thr He Gin Leu
275 280 285
Leu Glu Lys Thr Leu Gin Gin Tyr Glu Val He Ala 290 295 300
(2) INFORMATION FOR SEQ ID NO: 487:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1709 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 68...1624 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:487:
AAAAAACCTT AAAACGCTTA AAAAACTATC GCTTGATCAA GCTCTTTTAC TATTTAATCT 60 TTAAAAA ATG CTT TTG ATC GTT TTT TTT AAA TTT TAT TTT CAA TAT TCA 109 Met Leu Leu He Val Phe Phe Lys Phe Tyr Phe Gin Tyr Ser 1 5 10
ATT AAA AAA AAA TCA TTT TAT TTT ATT TTT GTT ATA ATT CAA GCT ATT 157 He Lys Lys Lys Ser Phe Tyr Phe He Phe Val He He Gin Ala He 15 20 25 30
TTT ATT TTC AAT CTA AGG AGG TGT CGC ATG GAC AAT CAA AAG ATA ACG 205 Phe He Phe Asn Leu Arg Arg Cys Arg Met Asp Asn Gin Lys He Thr 35 40 45
CAT CAA AAT ATC ACG CAA AAA CAA GGC GAG CTT AAA AGA GAC ATG AAA 253 His Gin Asn He Thr Gin Lys Gin Gly Glu Leu Lys Arg Asp Met Lys 50 55 60 ATG CGC CAT CTC TTA ATG ATT GCA TTT GGA GGA GCG ATC GGC ACA GGG 301 Met Arg His Leu Leu Met He Ala Phe Gly Gly Ala He Gly Thr Gly 65 70 75
CTT TTT GTA GGC ACT GGG GGT AAT ATT GCG AGC GCT GGC CCT TTA GGG 349 Leu Phe Val Gly Thr Gly Gly Asn He Ala Ser Ala Gly Pro Leu Gly 80 85 90
ACC TTG ATC GCT TAT TGT TTT GGA GGG CTT GTG GTC TAT TGT ATC ATG 397 Thr Leu He Ala Tyr Cys Phe Gly Gly Leu Val Val Tyr Cys He Met 95 100 105 110
CTC TCT TTA GGC GAA TTG GCT AGC GTT TAT CCC ACT ACA GGA AGT TTT 445 Leu Ser Leu Gly Glu Leu Ala Ser Val Tyr Pro Thr Thr Gly Ser Phe 115 120 125
GGG GAT TAT GCG GCT AAA TTC ATA GGC CCT GGC ACG GGC TAT ATG GTT 493 Gly Asp Tyr Ala Ala Lys Phe He Gly Pro Gly Thr Gly Tyr Met Val 130 135 140
TTT TGG ATG TAT TGG CTT GGC TGG GTG ATC ACG GTG GCG TTA GAA TAC 541 Phe Trp Met Tyr Trp Leu Gly Trp Val He Thr Val Ala Leu Glu Tyr 145 150 155
ATC GCT ATA GGC ATG CTC ATG CAA CGC TGG TTT GCG GAT ATT CCC ATC 589 He Ala He Gly Met Leu Met Gin Arg Trp Phe Ala Asp He Pro He 160 165 170
CAT TAT TGG GTT ATT TTA TGC ATT GCG TTA GTT TTT TTA TTG AAC TTT 637 His Tyr Trp Val He Leu Cys He Ala Leu Val Phe Leu Leu Asn Phe 175 180 185 190
TTT TCG GTT AAA ATT TTT GCC GAG GGC GAG TTT TTC TTT AGC CTG ATT 685 Phe Ser Val Lys He Phe Ala Glu Gly Glu Phe Phe Phe Ser Leu He 195 200 205
AAA GTT TTA GCG GTG ATC GCT TTT ATA GGC ATT GGC GCG ATT GGG ATT 733 Lys Val Leu Ala Val He Ala Phe He Gly He Gly Ala He Gly He 210 215 220
ATT TAT CAA ATC TAT TCG CAT GGG TTT GGT TCT ATT TTT GAT AAT TTC 781 He Tyr Gin He Tyr Ser His Gly Phe Gly Ser He Phe Asp Asn Phe 225 230 235
CAT TTT GGC GAT AAG GGG TTT TTC CCT AAT GGG AGC GCA GCG GTT TTT 829 His Phe Gly Asp Lys Gly Phe Phe Pro Asn Gly Ser Ala Ala Val Phe 240 245 250
AGC GCG ATG CTC GCT GTT ATT TTT GCT TTC ACT GGC ACA GAG GTG ATT 877 Ser Ala Met Leu Ala Val He Phe Ala Phe Thr Gly Thr Glu Val He 255 260 265 270
GGG GTG GCT GTG GGA GAG ACT AAA AAC GCT AGC GAA GTG ATG CCC AAA 925 Gly Val Ala Val Gly Glu Thr Lys Asn Ala Ser Glu Val Met Pro Lys 275 280 285 GCG ATT AAA GCG ACC TTG TGG CGG ATT GTC TTT TTC TTT TTA GGC TCT 973 Ala He Lys Ala Thr Leu Trp Arg He Val Phe Phe Phe Leu Gly Ser 290 295 300
GTG TTT GTC ATT TCT GTT TTT TTA CCC ATG AAT GAT TCT TCT ATC ACG 1021 Val Phe Val He Ser Val Phe Leu Pro Met Asn Asp Ser Ser He Thr 305 310 315
CAA AGC CCT TTT GTG AGC GTT TTA GAA CGC ATT AAT TTG CCC TTT ATT 1069 Gin Ser Pro Phe Val Ser Val Leu Glu Arg He Asn Leu Pro Phe He 320 325 330
GGC ATG GGT ATC CCT TAT GTG GCT GAT ATA ATG AAC GCT GTT ATC ATT 1117 Gly Met Gly He Pro Tyr Val Ala Asp He Met Asn Ala Val He He 335 340 345 350
ACG GCG ATG TTT TCT ACC GCT AAT TCA GGG CTT TAT GGA GCG AGC CGC 1165 Thr Ala Met Phe Ser Thr Ala Asn Ser Gly Leu Tyr Gly Ala Ser Arg 355 360 365
ATG ATT TAT GGG CTG TCC AAA CAA AAG ATG TTT TTT AAG GTT TTT TCC 1213 Met He Tyr Gly Leu Ser Lys Gin Lys Met Phe Phe Lys Val Phe Ser 370 375 380
CAA CTC AAC CGA CAA GGC ACG CCC ACT TAT GCG ATG TTT TTT TCC CTT 1261 Gin Leu Asn Arg Gin Gly Thr Pro Thr Tyr Ala Met Phe Phe Ser Leu 385 390 395
TCT TTT TCT CTC ATA GGG CTT TTA GTC CAA ATT TAT GCC AAA GAA AAT 1309 Ser Phe Ser Leu He Gly Leu Leu Val Gin He Tyr Ala Lys Glu Asn 400 405 410
GTC GTG GAA GCT TTG ATT AAT GTG ATC AGT TTC ACG GTG ATT ATT GTG 1357 Val Val Glu Ala Leu He Asn Val He Ser Phe Thr Val He He Val 415 420 425 430
TGG GTT AGC GTG TCT GTT TCG CAA TAT TCT TTC CGC AAG CAA TAC TTA 1405 Trp Val Ser Val Ser Val Ser Gin Tyr Ser Phe Arg Lys Gin Tyr Leu 435 440 445
AAA GCC GGG CAT TCT TTA GAG GAT TTG CCT TAT AAA GCC CCC TTT CTA 1453 Lys Ala Gly His Ser Leu Glu Asp Leu Pro Tyr Lys Ala Pro Phe Leu 450 455 460
CCC TTT TTG CAA CTC ATA GGG ATC ACT GGG TGT GCC ATC GGC GTG ATT 1501 Pro Phe Leu Gin Leu He Gly He Thr Gly Cys Ala He Gly Val He 465 470 475
GGT TCG GCT ATG GAT AAG GAT CAA CGC ATT GGG ATG ATT TTA ACG ATT 1549 Gly Ser Ala Met Asp Lys Asp Gin Arg He Gly Met He Leu Thr He 480 485 490
GTT TTC GCT GTT ATT TGT TAC ATT GGA TAC TAT TTT ACA CAA AAA GCT 1597 Val Phe Ala Val He Cys Tyr He Gly Tyr Tyr Phe Thr Gin Lys Ala 495 500 505 510 AAT GAA AAT AAC AAA AAA GAT TTG ATA TAATCTTTTC TTAATTTTGA AGTTTAG 1651 Asn Glu Asn Asn Lys Lys Asp Leu He 515
CAAATTTTAA GGAAGTAACC ATGATGAAAA AAACCCTTTT TATCTCTTTG GCTTTAGC 1709
(2) INFORMATION FOR SEQ ID NO: 488:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 519 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488:
Met Leu Leu He Val Phe Phe Lys Phe Tyr Phe Gin Tyr Ser He Lys
1 5 10 15
Lys Lys Ser Phe Tyr Phe He Phe Val He He Gin Ala He Phe He
20 25 30
Phe Asn Leu Arg Arg Cys Arg Met Asp Asn Gin Lys He Thr His Gin
35 40 45
Asn He Thr Gin Lys Gin Gly Glu Leu Lys Arg Asp Met Lys Met Arg
50 55 60
His Leu Leu Met He Ala Phe Gly Gly Ala He Gly Thr Gly Leu Phe 65 70 75 80
Val Gly Thr Gly Gly Asn He Ala Ser Ala Gly Pro Leu Gly Thr Leu
85 90 95
He Ala Tyr Cys Phe Gly Gly Leu Val Val Tyr Cys He Met Leu Ser
100 105 110
Leu Gly Glu Leu Ala Ser Val Tyr Pro Thr Thr Gly Ser Phe Gly Asp
115 120 125
Tyr Ala Ala Lys Phe He Gly Pro Gly Thr Gly Tyr Met Val Phe Trp
130 135 140
Met Tyr Trp Leu Gly Trp Val He Thr Val Ala Leu Glu Tyr He Ala 145 150 155 160
He Gly Met Leu Met Gin Arg Trp Phe Ala Asp He Pro He His Tyr
165 170 175
Trp Val He Leu Cys He Ala Leu Val Phe Leu Leu Asn Phe Phe Ser
180 185 190
Val Lys He Phe Ala Glu Gly Glu Phe Phe Phe Ser Leu He Lys Val
195 200 205
Leu Ala Val He Ala Phe He Gly He Gly Ala He Gly He He Tyr
210 215 220
Gin He Tyr Ser His Gly Phe Gly Ser He Phe Asp Asn Phe His Phe 225 230 235 240
Gly Asp Lys Gly Phe Phe Pro Asn Gly Ser Ala Ala Val Phe Ser Ala
245 250 255
Met Leu Ala Val He Phe Ala Phe Thr Gly Thr Glu Val He Gly Val
260 265 270
Ala Val Gly Glu Thr Lys Asn Ala Ser Glu Val Met Pro Lys Ala He 275 280 285 Lys Ala Thr Leu Trp Arg He Val Phe Phe Phe Leu Gly Ser Val Phe
290 295 300
Val He Ser Val Phe Leu Pro Met Asn Asp Ser Ser He Thr Gin Ser 305 310 315 320
Pro Phe Val Ser Val Leu Glu Arg He Asn Leu Pro Phe He Gly Met
325 330 335
Gly He Pro Tyr Val Ala Asp He Met Asn Ala Val He He Thr Ala
340 345 350
Met Phe Ser Thr Ala Asn Ser Gly Leu Tyr Gly Ala Ser Arg Met He
355 360 365
Tyr Gly Leu Ser Lys Gin Lys Met Phe Phe Lys Val Phe Ser Gin Leu
370 375 380
Asn Arg Gin Gly Thr Pro Thr Tyr Ala Met Phe Phe Ser Leu Ser Phe 385 390 395 400
Ser Leu He Gly Leu Leu Val Gin He Tyr Ala Lys Glu Asn Val Val
405 410 415
Glu Ala Leu He Asn Val He Ser Phe Thr Val He He Val Trp Val
420 425 430
Ser Val Ser Val Ser Gin Tyr Ser Phe Arg Lys Gin Tyr Leu Lys Ala
435 440 445
Gly His Ser Leu Glu Asp Leu Pro Tyr Lys Ala Pro Phe Leu Pro Phe
450 455 460
Leu Gin Leu He Gly He Thr Gly Cys Ala He Gly Val He Gly Ser 465 470 475 480
Ala Met Asp Lys Asp Gin Arg He Gly Met He Leu Thr He Val Phe
485 490 495
Ala Val He Cys Tyr He Gly Tyr Tyr Phe Thr Gin Lys Ala Asn Glu
500 505 510
Asn Asn Lys Lys Asp Leu He 515
(2) INFORMATION FOR SEQ ID NO: 489:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1529 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 230...1390 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489:
TTAAGCTTGA ATGCGGGCAA TATCCAAATC CAGAGCATGC CCAAAGTTAA AGAGCGAGTG 60
AGTGTCCCCT CTAAAGACGA TACGGATCTA TTCTTACCAC GATTCTATTA AGGACTCTAT 120
TAAGGCGGTG GTGAATATCT CCACTGAAAA GAAGATTAAA AACAATTTTA TAGGTGGCGG 180
TGTGTTTAAT GACCCCTTTT TCCAACAATT TTTTGGGGAT TTGGGTGGC ATG ATT CCT 238 Met He Pro 1
AAA GAA AGA ATG GAA AGG GCT TTA GGC AGC GGC GTA ATC ATT TCT AAA 286 Lys Glu Arg Met Glu Arg Ala Leu Gly Ser Gly Val He He Ser Lys 5 10 15
GAC GGC TAT ATT GTA ACT AAT AAC CAT GTG ATT GAT GGC GCG GAT AAG 334 Asp Gly Tyr He Val Thr Asn Asn His Val He Asp Gly Ala Asp Lys 20 25 30 35
ATT AAA GTT ACC ATT CCA GGG AGC AAT AAA GAA TAT TCC GCC ACT CTA 382 He Lys Val Thr He Pro Gly Ser Asn Lys Glu Tyr Ser Ala Thr Leu 40 45 50
GTA GGC ACC GAT TCT GAA AGC GAT TTA GCG GTG ATT CGC ATC ACT AAA 430 Val Gly Thr Asp Ser Glu Ser Asp Leu Ala Val He Arg He Thr Lys 55 60 65
GAC AAT CTG CCC ACG ATC AAA TTC TCT GAT TCT AAT GAT ATT TCA GTG 478 Asp Asn Leu Pro Thr He Lys Phe Ser Asp Ser Asn Asp He Ser Val 70 75 80
GGC GAT TTG GTT TTT GCG ATT GGT AAC CCT TTT GGC GTG GGC GAA AGC 526 Gly Asp Leu Val Phe Ala He Gly Asn Pro Phe Gly Val Gly Glu Ser 85 90 95
GTT ACG CAA GGC ATT GTT TCA GCG CTC AAT AAA AGC GGG ATT GGG ATC 574 Val Thr Gin Gly He Val Ser Ala Leu Asn Lys Ser Gly He Gly He 100 105 110 115
AAC AGC TAT GAG AAT TTC ATT CAA ACA GAC GCT TCC ATC AAT CCT GGA 622 Asn Ser Tyr Glu Asn Phe He Gin Thr Asp Ala Ser He Asn Pro Gly 120 125 130
AAT TCC GGC GGC GCT TTA ATT GAT AGC CGT GGA GGG TTA GTG GGG ATT 670 Asn Ser Gly Gly Ala Leu He Asp Ser Arg Gly Gly Leu Val Gly He 135 140 145
AAT ACC GCT ATT ATC TCT AAA ACT GGG GGC AAC CAC GGC ATT GGC TTT 718 Asn Thr Ala He He Ser Lys Thr Gly Gly Asn His Gly He Gly Phe 150 155 160
GCC ATC CCT TCT AAC ATG GTT AAA GAT ACT GTA ACC CAA CTC ATC AAA 766 Ala He Pro Ser Asn Met Val Lys Asp Thr Val Thr Gin Leu He Lys 165 170 175
ACC GGT AAG ATT GAA AGA GGT TAC TTG GGC GTG GGC TTG CAA GAT TTG 814 Thr Gly Lys He Glu Arg Gly Tyr Leu Gly Val Gly Leu Gin Asp Leu 180 185 190 195
AGT GGC GAT TTG CAA AAT TCT TAT GAC AAC AAA GAA GGG GCG GTA GTC 862 Ser Gly Asp Leu Gin Asn Ser Tyr Asp Asn Lys Glu Gly Ala Val Val 200 205 210 ATT AGC GTA GAA AAA GAC TCT CCG GCT AAA AAA GCA GGG ATT TTG GTG 910 He Ser Val Glu Lys Asp Ser Pro Ala Lys Lys Ala Gly He Leu Val 215 220 225
TGG GAT TTG ATC ACC GAA GTC AAT GGG AAA AAG GTT AAA AAC ACG AAT 958 Trp Asp Leu He Thr Glu Val Asn Gly Lys Lys Val Lys Asn Thr Asn 230 235 240
GAG TTA AGA AAT CTA ATC GGC TCC ATG CTA CCC AAT CAA AGA GTA ACC 1006 Glu Leu Arg Asn Leu He Gly Ser Met Leu Pro Asn Gin Arg Val Thr 245 250 255
TTA AAA GTC ATT AGA GAC AAA AAA GAA CGC GCT TTC ACC CTC ACT CTA 1054 Leu Lys Val He Arg Asp Lys Lys Glu Arg Ala Phe Thr Leu Thr Leu 260 265 270 275
GCT GAA AGG AAA AAC CCT AAC AAA AAA GAA ACC ATT TCT GCT CAA AAC 1102 Ala Glu Arg Lys Asn Pro Asn Lys Lys Glu Thr He Ser Ala Gin Asn 280 285 290
GGC GCG CAA GGC CAA TTG AAC GGG CTT CAA GTA GAA GAT TTA ACT CAA 1150 Gly Ala Gin Gly Gin Leu Asn Gly Leu Gin Val Glu Asp Leu Thr Gin 295 300 305
GAA ACC AAA AGG TCT ATG CGT TTG AGC GAT GAT GTT CAA GGG GTT TTA 1198 Glu Thr Lys Arg Ser Met Arg Leu Ser Asp Asp Val Gin Gly Val Leu 310 315 320
GTC TCT CAA GTG AAT GAA AAT TCC CCA GCA GAG CAA GCC GGA TTT AGG 1246 Val Ser Gin Val Asn Glu Asn Ser Pro Ala Glu Gin Ala Gly Phe Arg 325 330 335
CAA GGT AAC ATT ATC ACA AAA ATT GAA GAG GTT GAA GTT AAA AGC GTT 1294 Gin Gly Asn He He Thr Lys He Glu Glu Val Glu Val Lys Ser Val 340 345 350 355
GCG GAT TTT AAC CAT GCT TTA GAA AAG TAT AAA GGC AAA CCC AAA CGA 1342 Ala Asp Phe Asn His Ala Leu Glu Lys Tyr Lys Gly Lys Pro Lys Arg 360 365 370
TTC TTA GTT TTA GAC TTG AAT CAA GGT TAT AGG ATC ATT TTG GTG AAA T 1391 Phe Leu Val Leu Asp Leu Asn Gin Gly Tyr Arg He He Leu Val Lys 375 380 385
GATAGGGGTG GGTCGTTAGT CGCATGTCTT TGATTAGAGT GAATGGGGAA GCTTTTAAAC 1451 TCTCTTTAGA AAGTTTAGAA GAAGACCCTT TTGAAACTAA AGAAACGCTA GAAACGCTTA 1511 TCAAACAAAC GAGCGTTG 1529
(2) INFORMATION FOR SEQ ID NO: 490:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 387 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490:
Met He Pro Lys Glu Arg Met Glu Arg Ala Leu Gly Ser Gly Val He
1 5 10 15
He Ser Lys Asp Gly Tyr He Val Thr Asn Asn His Val He Asp Gly
20 25 30
Ala Asp Lys He Lys Val Thr He Pro Gly Ser Asn Lys Glu Tyr Ser
35 40 45
Ala Thr Leu Val Gly Thr Asp Ser Glu Ser Asp Leu Ala Val He Arg
50 55 60
He Thr Lys Asp Asn Leu Pro Thr He Lys Phe Ser Asp Ser Asn Asp 65 70 75 80
He Ser Val Gly Asp Leu Val Phe Ala He Gly Asn Pro Phe Gly Val
85 90 95
Gly Glu Ser Val Thr Gin Gly He Val Ser Ala Leu Asn Lys Ser Gly
100 105 110
He Gly He Asn Ser Tyr Glu Asn Phe He Gin Thr Asp Ala Ser He
115 120 125
Asn Pro Gly Asn Ser Gly Gly Ala Leu He Asp Ser Arg Gly Gly Leu
130 135 140
Val Gly He Asn Thr Ala He He Ser Lys Thr Gly Gly Asn His Gly 145 150 155 160
He Gly Phe Ala He Pro Ser Asn Met Val Lys Asp Thr Val Thr Gin
165 170 175
Leu He Lys Thr Gly Lys He Glu Arg Gly Tyr Leu Gly Val Gly Leu
180 185 190
Gin Asp Leu Ser Gly Asp Leu Gin Asn Ser Tyr Asp Asn Lys Glu Gly
195 200 205
Ala Val Val He Ser Val Glu Lys Asp Ser Pro Ala Lys Lys Ala Gly
210 215 220
He Leu Val Trp Asp Leu He Thr Glu Val Asn Gly Lys Lys Val Lys 225 230 235 240
Asn Thr Asn Glu Leu Arg Asn Leu He Gly Ser Met Leu Pro Asn Gin
245 250 255
Arg Val Thr Leu Lys Val He Arg Asp Lys Lys Glu Arg Ala Phe Thr
260 265 270
Leu Thr Leu Ala Glu Arg Lys Asn Pro Asn Lys Lys Glu Thr He Ser
275 280 285
Ala Gin Asn Gly Ala Gin Gly Gin Leu Asn Gly Leu Gin Val Glu Asp
290 295 300
Leu Thr Gin Glu Thr Lys Arg Ser Met Arg Leu Ser Asp Asp Val Gin 305 310 315 320
Gly Val Leu Val Ser Gin Val Asn Glu Asn Ser Pro Ala Glu Gin Ala
325 330 335
Gly Phe Arg Gin Gly Asn He He Thr Lys He Glu Glu Val Glu Val
340 345 350
Lys Ser Val Ala Asp Phe Asn His Ala Leu Glu Lys Tyr Lys Gly Lys
355 360 365
Pro Lys Arg Phe Leu Val Leu Asp Leu Asn Gin Gly Tyr Arg He He
370 375 380
Leu Val Lys 385 (2) INFORMATION FOR SEQ ID NO: 491:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 990 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 39...902 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:491:
ATGATAGTAT TCTAATAAAA CTATTTTAAA GGTGATCC ATG AGT AAG AGT TTA TAC 56
Met Ser Lys Ser Leu Tyr 1 5
CAA ACT TTA AAT GTG AGC GAA AAC GCC AGC CAA GAT GAA ATC AAA AAA 104 Gin Thr Leu Asn Val Ser Glu Asn Ala Ser Gin Asp Glu He Lys Lys 10 15 20
TCC TAC CGC CGT TTA GCC CGA CAA TAC CAC CCG GAT TTG AAT AAA ACC 152 Ser Tyr Arg Arg Leu Ala Arg Gin Tyr His Pro Asp Leu Asn Lys Thr 25 30 35
AAA GAA GCC GAA GAG AAA TTC AAA GAA ATC AAC GCC GCT TAT GAA ATT 200 Lys Glu Ala Glu Glu Lys Phe Lys Glu He Asn Ala Ala Tyr Glu He 40 45 50
TTG AGC GAT GAA GAA AAA CGC CGC CAA TAC GAT CAG TTT GGC GAT AAC 248 Leu Ser Asp Glu Glu Lys Arg Arg Gin Tyr Asp Gin Phe Gly Asp Asn 55 60 65 70
ATG TTT GGC GGG CAG AAT TTC AGC GAT TTT GCC AGA AGC CGT GGT CCT 296 Met Phe Gly Gly Gin Asn Phe Ser Asp Phe Ala Arg Ser Arg Gly Pro 75 80 85
AGT GAA GAT TTA GAC GAT ATT TTA AGC TCT ATT TTT GGG AAA GGA GGC 344 Ser Glu Asp Leu Asp Asp He Leu Ser Ser He Phe Gly Lys Gly Gly 90 95 100
TTT TCG CAA AGA TTT TCT CAA AAC TCG CAA GGC TTT TCT GGC TTT AAT 392 Phe Ser Gin Arg Phe Ser Gin Asn Ser Gin Gly Phe Ser Gly Phe Asn 105 110 115
TTT TCC AAT TTC GCC CCT GAA AAT TTA GAC ATA ACC GCC GCT TTA AAT 440 Phe Ser Asn Phe Ala Pro Glu Asn Leu Asp He Thr Ala Ala Leu Asn 120 125 130 GTC TCT GTT TTA GAC ACC CTT TTA GGC AAT AAA AAA CAA GTG AGC ATC 488 Val Ser Val Leu Asp Thr Leu Leu Gly Asn Lys Lys Gin Val Ser He 135 140 145 150
AAT AAT GAG ACT TTT AGC CTT AAA ATC CCT ATT GGC GTG GAA GAG GGC 536 Asn Asn Glu Thr Phe Ser Leu Lys He Pro He Gly Val Glu Glu Gly 155 160 165
GAA AAG ATT AGG GTT CGC AAC AAG GGG AAA ACG GGG CGA ACG ACT AGG 584 Glu Lys He Arg Val Arg Asn Lys Gly Lys Thr Gly Arg Thr Thr Arg 170 175 180
GGC GAT TTG CTC TTA GAG ATC CAT ATT GAA GAA GAT GAA ATG TAT AGG 632 Gly Asp Leu Leu Leu Glu He His He Glu Glu Asp Glu Met Tyr Arg 185 190 195
CGC GAG AAA GAT GAT ATT ACC CAA ATC TTT GAT TTA CCC TTA AAA ACG 680 Arg Glu Lys Asp Asp He Thr Gin He Phe Asp Leu Pro Leu Lys Thr 200 205 210
GCT CTT TTT GGA GGG AAA ATT GAA ATC GCT ACT TGG CAT AAA ACC TTA 728 Ala Leu Phe Gly Gly Lys He Glu He Ala Thr Trp His Lys Thr Leu 215 220 225 230
ACC CTA ACC ATT CCC CCT AAC ACC AAA GCG ATG CAA AAA TTC CGC ATT 776 Thr Leu Thr He Pro Pro Asn Thr Lys Ala Met Gin Lys Phe Arg He 235 240 245
AAA GAA AAA GGG ATC AAA AAC AGA AAA ACT TCG CAT GTG GGG GAT TTG 824 Lys Glu Lys Gly He Lys Asn Arg Lys Thr Ser His Val Gly Asp Leu 250 255 260
TAT TTG CAG GCT CGT TTG ATT TTG CCT AAA ACT GAA ACG CTT TCT AAT 872 Tyr Leu Gin Ala Arg Leu He Leu Pro Lys Thr Glu Thr Leu Ser Asn 265 270 275
GAG TTG AAA GCG TTA TTA GAA AAA GAA TTG TAAGGAGGAA TCGTGTGCGA TTA 925 Glu Leu Lys Ala Leu Leu Glu Lys Glu Leu 280 285
TGATGAACCG CTTTATTTAA TCAGCGTCGT GGCTAAAATC TTAGGCGTGC ACCCTCAAAC 985 CTTGC 990
(2) INFORMATION FOR SEQ ID NO: 492:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 288 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:492: Met Ser Lys Ser Leu Tyr Gin Thr Leu Asn Val Ser Glu Asn Ala Ser
1 5 10 15
Gin Asp Glu He Lys Lys Ser Tyr Arg Arg Leu Ala Arg Gin Tyr His
20 25 30
Pro Asp Leu Asn Lys Thr Lys Glu Ala Glu Glu Lys Phe Lys Glu He
35 40 45
Asn Ala Ala Tyr Glu He Leu Ser Asp Glu Glu Lys Arg Arg Gin Tyr
50 55 60
Asp Gin Phe Gly Asp Asn Met Phe Gly Gly Gin Asn Phe Ser Asp Phe 65 70 75 80
Ala Arg Ser Arg Gly Pro Ser Glu Asp Leu Asp Asp He Leu Ser Ser
85 90 95
He Phe Gly Lys Gly Gly Phe Ser Gin Arg Phe Ser Gin Asn Ser Gin
100 105 110
Gly Phe Ser Gly Phe Asn Phe Ser Asn Phe Ala Pro Glu Asn Leu Asp
115 120 125
He Thr Ala Ala Leu Asn Val Ser Val Leu Asp Thr Leu Leu Gly Asn
130 135 140
Lys Lys Gin Val Ser He Asn Asn Glu Thr Phe Ser Leu Lys He Pro 145 150 155 160
He Gly Val Glu Glu Gly Glu Lys He Arg Val Arg Asn Lys Gly Lys
165 170 175
Thr Gly Arg Thr Thr Arg Gly Asp Leu Leu Leu Glu He His He Glu
180 185 190
Glu Asp Glu Met Tyr Arg Arg Glu Lys Asp Asp He Thr Gin He Phe
195 200 205
Asp Leu Pro Leu Lys Thr Ala Leu Phe Gly Gly Lys He Glu He Ala
210 215 220
Thr Trp His Lys Thr Leu Thr Leu Thr He Pro Pro Asn Thr Lys Ala 225 230 235 240
Met Gin Lys Phe Arg He Lys Glu Lys Gly He Lys Asn Arg Lys Thr
245 250 255
Ser His Val Gly Asp Leu Tyr Leu Gin Ala Arg Leu He Leu Pro Lys
260 265 270
Thr Glu Thr Leu Ser Asn Glu Leu Lys Ala Leu Leu Glu Lys Glu Leu 275 280 285
(2) INFORMATION FOR SEQ ID NO:493:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1350 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 113...1285 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:493:
ACCTGAATAA AGAGTTGCAA GACGCTCTGC ACAAACACTC TAAAAATACC AAAACCCCAA 60 CGAAAAATTT AAACACCCCT ACGAATTTTT ACGAATTGAT TTTATTTAAA AA ATG AGC 118
Met Ser
1
CTG ACT TCG CTT TTA AAC CCA AAA AGC CTA GAA GAT TTT TTA GGC CAA 166 Leu Thr Ser Leu Leu Asn Pro Lys Ser Leu Glu Asp Phe Leu Gly Gin 5 10 15
GAG CAT TTA GTA GGG AAA GAC GCC CCC TTA TTT AAA GCC CTA CAA TCC 214 Glu His Leu Val Gly Lys Asp Ala Pro Leu Phe Lys Ala Leu Gin Ser 20 25 30
AAA CAC TTC CCC CAT GCC TTT TTC TAT GGC CCT CCT GGC GTG GGT AAA 262 Lys His Phe Pro His Ala Phe Phe Tyr Gly Pro Pro Gly Val Gly Lys 35 40 45 50
ACA AGC CTG GCT CAA ATC ATC GCC TAT ATG CTA GAG CGC CCC ATT CTT 310 Thr Ser Leu Ala Gin He He Ala Tyr Met Leu Glu Arg Pro He Leu 55 60 65
TTA TTC AAT GCG ACG GAT TTT AAA TTA GAG GAT TTG CGC CTT AAG CTT 358 Leu Phe Asn Ala Thr Asp Phe Lys Leu Glu Asp Leu Arg Leu Lys Leu 70 75 80
AAA AAT TAC CAA AAT ACC CTT TTA AAA CCC GTT GTT TTT ATT GAT GAA 406 Lys Asn Tyr Gin Asn Thr Leu Leu Lys Pro Val Val Phe He Asp Glu 85 90 95
ACC CAC AGA TTG AAT AAA ACC CAA CAA GAA TTT TTA CTC CCC ATT ATG 454 Thr His Arg Leu Asn Lys Thr Gin Gin Glu Phe Leu Leu Pro He Met 100 105 110
GAA AAA GAT CAC GCT TTA ATT TTA GGG GCT AGC ACG CAA GAT CCT AAT 502 Glu Lys Asp His Ala Leu He Leu Gly Ala Ser Thr Gin Asp Pro Asn 115 120 125 130
TAC AGC CTA AGC CAT GCG ATC CGA TCA AGA AGT TTT ATT TTT GAA TTA 550 Tyr Ser Leu Ser His Ala He Arg Ser Arg Ser Phe He Phe Glu Leu 135 140 145
ACC CCC CTA AAC AAG AGC GAT TTA GAC AGG CTT TGC GCT AAA GCT TTA 598 Thr Pro Leu Asn Lys Ser Asp Leu Asp Arg Leu Cys Ala Lys Ala Leu 150 155 160
ACA TTG CTC AAA AAA CAA ATA GAG CCT GGC GCT AAA ACC TAT CTT TTA 646 Thr Leu Leu Lys Lys Gin He Glu Pro Gly Ala Lys Thr Tyr Leu Leu 165 170 175
AAC AAC AGC GCT GGC GAC GCT AGA GCG TTA TTA AAC CTT TTA GAT TTG 694 Asn Asn Ser Ala Gly Asp Ala Arg Ala Leu Leu Asn Leu Leu Asp Leu 180 185 190 AGC GCT AAA ATA GAA GAT CCT ATC ACT TTA AAA ACG CTA CAA TCC TTA 742 Ser Ala Lys He Glu Asp Pro He Thr Leu Lys Thr Leu Gin Ser Leu 195 200 205 210
CGG CCT CAT AGC CTA AAT GAT GGA TCT TAT AGC GAT GAT ACG CAT TAT 790 Arg Pro His Ser Leu Asn Asp Gly Ser Tyr Ser Asp Asp Thr His Tyr 215 220 225
AAC CTT ACT AGC GCG TTA ATC AAA TCT TTA AGA GGG AGC GAT GAA AAC 838 Asn Leu Thr Ser Ala Leu He Lys Ser Leu Arg Gly Ser Asp Glu Asn 230 235 240
GCT TCC ATC TAT TAT CTG GCG CGC TTG ATT GCT GGC GGG GAA AAC CCG 886 Ala Ser He Tyr Tyr Leu Ala Arg Leu He Ala Gly Gly Glu Asn Pro 245 250 255
GAA TTT ATC GCC AGA AGG CTG GTG ATT TTT GCG AGC GAA GAT ATT GGT 934 Glu Phe He Ala Arg Arg Leu Val He Phe Ala Ser Glu Asp He Gly 260 265 270
AAC GCT AAC CCG AAC GCC CTT AAT TTA GCC GCT TCT TGT TTG TTT GCA 982 Asn Ala Asn Pro Asn Ala Leu Asn Leu Ala Ala Ser Cys Leu Phe Ala 275 280 285 290
GTC AAA CAA ATC GGC TAC CCT GAA GCG CGC ATC ATT TTA AGC CAA TGC 1030 Val Lys Gin He Gly Tyr Pro Glu Ala Arg He He Leu Ser Gin Cys 295 300 305
GTG ATT TAT CTG GCT TGT TCG CCC AAG TCT AAC ACG GCT TAT AGA GCG 1078 Val He Tyr Leu Ala Cys Ser Pro Lys Ser Asn Thr Ala Tyr Arg Ala 310 315 320
ATC AAT CAG GCT TTG GAT TGC GTT CAA AAA GGC TCA CTC TAC CCT ATT 1126 He Asn Gin Ala Leu Asp Cys Val Gin Lys Gly Ser Leu Tyr Pro He 325 330 335
CCT AAA CAC CTG CTG CCT AAC GCT AAA GAT TAC CTT TAC CCG CAT GAT 1174 Pro Lys His Leu Leu Pro Asn Ala Lys Asp Tyr Leu Tyr Pro His Asp 340 345 350
TAT AAC GGC TAT GTC AAA CAA GAT TAT TTG GAA AAA CCC CTA GAT TTG 1222 Tyr Asn Gly Tyr Val Lys Gin Asp Tyr Leu Glu Lys Pro Leu Asp Leu 355 360 365 370
GTT TCT TCT CAA GGC ATA GGA TTT GAA AAA ACC CTT TTA GAA TGG CTT 1270 Val Ser Ser Gin Gly He Gly Phe Glu Lys Thr Leu Leu Glu Trp Leu 375 380 385
GAT AAG ATA AGA AAT TGATCTTATA AGTTACATTA AAATGCGACA ATGGTAATAA A 1326 Asp Lys He Arg Asn 390
AAATCAATAT TTTTGGATTG AATT 1350
(2) INFORMATION FOR SEQ ID NO: 494: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 391 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494:
Met Ser Leu Thr Ser Leu Leu Asn Pro Lys Ser Leu Glu Asp Phe Leu
1 5 10 15
Gly Gin Glu His Leu Val Gly Lys Asp Ala Pro Leu Phe Lys Ala Leu
20 25 30
Gin Ser Lys His Phe Pro His Ala Phe Phe Tyr Gly Pro Pro Gly Val
35 40 45
Gly Lys Thr Ser Leu Ala Gin He He Ala Tyr Met Leu Glu Arg Pro
50 55 60
He Leu Leu Phe Asn Ala Thr Asp Phe Lys Leu Glu Asp Leu Arg Leu 65 70 75 80
Lys Leu Lys Asn Tyr Gin Asn Thr Leu Leu Lys Pro Val Val Phe He
85 90 95
Asp Glu Thr His Arg Leu Asn Lys Thr Gin Gin Glu Phe Leu Leu Pro
100 105 110
He Met Glu Lys Asp His Ala Leu He Leu Gly Ala Ser Thr Gin Asp
115 120 125
Pro Asn Tyr Ser Leu Ser His Ala He Arg Ser Arg Ser Phe He Phe
130 135 140
Glu Leu Thr Pro Leu Asn Lys Ser Asp Leu Asp Arg Leu Cys Ala Lys 145 150 155 160
Ala Leu Thr Leu Leu Lys Lys Gin He Glu Pro Gly Ala Lys Thr Tyr
165 170 175
Leu Leu Asn Asn Ser Ala Gly Asp Ala Arg Ala Leu Leu Asn Leu Leu
180 185 190
Asp Leu Ser Ala Lys He Glu Asp Pro He Thr Leu Lys Thr Leu Gin
195 200 205
Ser Leu Arg Pro His Ser Leu Asn Asp Gly Ser Tyr Ser Asp Asp Thr
210 215 220
His Tyr Asn Leu Thr Ser Ala Leu He Lys Ser Leu Arg Gly Ser Asp 225 230 235 240
Glu Asn Ala Ser He Tyr Tyr Leu Ala Arg Leu He Ala Gly Gly Glu
245 250 255
Asn Pro Glu Phe He Ala Arg Arg Leu Val He Phe Ala Ser Glu Asp
260 265 270
He Gly Asn Ala Asn Pro Asn Ala Leu Asn Leu Ala Ala Ser Cys Leu
275 280 285
Phe Ala Val Lys Gin He Gly Tyr Pro Glu Ala Arg He He Leu Ser
290 295 300
Gin Cys Val He Tyr Leu Ala Cys Ser Pro Lys Ser Asn Thr Ala Tyr 305 310 315 320
Arg Ala He Asn Gin Ala Leu Asp Cys Val Gin Lys Gly Ser Leu Tyr
325 330 335
Pro He Pro Lys His Leu Leu Pro Asn Ala Lys Asp Tyr Leu Tyr Pro 340 345 350 His Asp Tyr Asn Gly Tyr Val Lys Gin Asp Tyr Leu Glu Lys Pro Leu
355 360 365
Asp Leu Val Ser Ser Gin Gly He Gly Phe Glu Lys Thr Leu Leu Glu
370 375 380
Trp Leu Asp Lys He Arg Asn 385 390
(2) INFORMATION FOR SEQ ID NO: 495:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 869 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 76...759 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495:
AAGAAACTTT TAACAAACAA TTCAAGGGAT TTGGCGATTT TGGGTCTTAA AAAATATGCT 60 ATTTTATGGT CTTTA ATG GGG TTT TAT GCA GGA TTG AAC GCG CTT GAT TAT 111 Met Gly Phe Tyr Ala Gly Leu Asn Ala Leu Asp Tyr 1 5 10
GAC ACC ATA GAC CCA AAA TAC TAC AAG TAT ATC AAG TAT TAT AAA GCC 159 Asp Thr He Asp Pro Lys Tyr Tyr Lys Tyr He Lys Tyr Tyr Lys Ala 15 20 25
TAT GAG GAT AAA GAA GTT GAA GAA TTG ATC AGA GAC TTA AAA AGG GCG 207 Tyr Glu Asp Lys Glu Val Glu Glu Leu He Arg Asp Leu Lys Arg Ala 30 35 40
AAC GCT AAA AGC GGG CTT ATT TTA GGG ATC AAT ACC GGG TTT TTT TAC 255 Asn Ala Lys Ser Gly Leu He Leu Gly He Asn Thr Gly Phe Phe Tyr 45 50 55 60
AAT CAT GAA ATC ATG GTT AGA ACT AAT AGC TCT AGC ATC ACG GGG AAT 303 Asn His Glu He Met Val Arg Thr Asn Ser Ser Ser He Thr Gly Asn 65 70 75
ATT TTA AAT TAT TTG TTC GCT TAC GGC TTG CGT TTT GGC TAT CAA ACT 351 He Leu Asn Tyr Leu Phe Ala Tyr Gly Leu Arg Phe Gly Tyr Gin Thr 80 85 90
TTC AGG CCG TCG TTT TTT GCG CGC TTG GTC AAG CCA AAT ATC ATT GGC 399 Phe Arg Pro Ser Phe Phe Ala Arg Leu Val Lys Pro Asn He He Gly 95 100 105 AGG CGC ATT TAT ATC CAA TAT TAT GGA GGA GCT CCT AAA AAA GCG GGC 447 Arg Arg He Tyr He Gin Tyr Tyr Gly Gly Ala Pro Lys Lys Ala Gly 110 115 120
TTT GGG GAT GTA GGG TTT CAA TCG GTT ATG CTG AAT GGG GAT TTT TTA 495 Phe Gly Asp Val Gly Phe Gin Ser Val Met Leu Asn Gly Asp Phe Leu 125 130 135 140
TTG GAT TTT CCT TTG CCT TTT GTG GGG AAA TAC CTT TAT ATG GGG GGT 543 Leu Asp Phe Pro Leu Pro Phe Val Gly Lys Tyr Leu Tyr Met Gly Gly 145 150 155
TAT ATG GGT TTA GGT TTG GGG GTT GTA GCG CAT GGG GTG AAT TAC ACG 591 Tyr Met Gly Leu Gly Leu Gly Val Val Ala His Gly Val Asn Tyr Thr 160 165 170
GCG GAA TGG GGG ATG TCT TTT AAC GCA GGA TTG GCT CTA ACG GTA TTA 639 Ala Glu Trp Gly Met Ser Phe Asn Ala Gly Leu Ala Leu Thr Val Leu 175 180 185
GAA AAA AAC CGC ATT GAA TTT GGA TTT AAA ATT TTG AAT AAT TTC CCT 687 Glu Lys Asn Arg He Glu Phe Gly Phe Lys He Leu Asn Asn Phe Pro 190 195 200
TTT TTG CAA TCT AAT TCT TCA AAA GAG ACT TGG TGG GGA GCT ATG GCA 735 Phe Leu Gin Ser Asn Ser Ser Lys Glu Thr Trp Trp Gly Ala Met Ala 205 210 215 220
AAC ATT GGG TAT CAA TAT GTG TTC TAAAAAAATA AGAAATCTCA TTTTATGCTT 789 Asn He Gly Tyr Gin Tyr Val Phe 225
TGGTTTTATG TTGGGCTTGC ACGCTGAAGA AAATACGACT GAAGGAAATA TGACTGAAGA 849 AAATATCTCT AAAGACGCTC 869
(2) INFORMATION FOR SEQ ID NO: 496:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 228 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496:
Met Gly Phe Tyr Ala Gly Leu Asn Ala Leu Asp Tyr Asp Thr He Asp
1 5 10 15
Pro Lys Tyr Tyr Lys Tyr He Lys Tyr Tyr Lys Ala Tyr Glu Asp Lys
20 25 30
Glu Val Glu Glu Leu He Arg Asp Leu Lys Arg Ala Asn Ala Lys Ser
35 40 45
Gly Leu He Leu Gly He Asn Thr Gly Phe Phe Tyr Asn His Glu He 50 55 60
Met Val Arg Thr Asn Ser Ser Ser He Thr Gly Asn He Leu Asn Tyr 65 70 75 80
Leu Phe Ala Tyr Gly Leu Arg Phe Gly Tyr Gin Thr Phe Arg Pro Ser
85 90 95
Phe Phe Ala Arg Leu Val Lys Pro Asn He He Gly Arg Arg He Tyr
100 105 110
He Gin Tyr Tyr Gly Gly Ala Pro Lys Lys Ala Gly Phe Gly Asp Val
115 120 125
Gly Phe Gin Ser Val Met Leu Asn Gly Asp Phe Leu Leu Asp Phe Pro
130 135 140
Leu Pro Phe Val Gly Lys Tyr Leu Tyr Met Gly Gly Tyr Met Gly Leu 145 150 155 160
Gly Leu Gly Val Val Ala His Gly Val Asn Tyr Thr Ala Glu Trp Gly
165 170 175
Met Ser Phe Asn Ala Gly Leu Ala Leu Thr Val Leu Glu Lys Asn Arg
180 185 190
He Glu Phe Gly Phe Lys He Leu Asn Asn Phe Pro Phe Leu Gin Ser
195 200 205
Asn Ser Ser Lys Glu Thr Trp Trp Gly Ala Met Ala Asn He Gly Tyr
210 215 220
Gin Tyr Val Phe 225
(2) INFORMATION FOR SEQ ID NO: 497:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1171 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 89...1096 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497:
GGAAAGAATT GATTCAAAAC GCTATCAAAC AATACGCTGA TGATGTGAAA AAGGGAAACT 60 TCCCTAACGA ATTAGAAAGT TATCATTA ATG AAA GAA CGG ATA GTC AAT TTA 112
Met Lys Glu Arg He Val Asn Leu 1 5
GAA ACT TTG GAT TTT GAA ATT TCT CAA GAA GTG AGT TTG CGC CCT AGT 160 Glu Thr Leu Asp Phe Glu He Ser Gin Glu Val Ser Leu Arg Pro Ser 10 15 20
CTT TGG GAA GAT TTT ATC GGT CAA GAA AAG ATT AAA AGC AAT TTG CAA 208 Leu Trp Glu Asp Phe He Gly Gin Glu Lys He Lys Ser Asn Leu Gin 25 30 35 40
ATT TCT ATT TGC GCG GCT AAA AAA CGC CAA GAA AGT TTG GAT CAC ATG 256 He Ser He Cys Ala Ala Lys Lys Arg Gin Glu Ser Leu Asp His Met 45 50 55
CTT TTT TTT GGC CCG CCC GGT TTG GGT AAA ACT TCA ATC AGC CAT ATC 304 Leu Phe Phe Gly Pro Pro Gly Leu Gly Lys Thr Ser He Ser His He 60 65 70
ATC GCT AAA GAA ATG GAA ACC AAT ATC AAG ATC ACC GCC GCT CCC ATG 352 He Ala Lys Glu Met Glu Thr Asn He Lys He Thr Ala Ala Pro Met 75 80 85
ATA GAA AAA AGC GGT GAT TTA GCC GCC ATT TTG ACC AAT TTG CAA GCT 400 He Glu Lys Ser Gly Asp Leu Ala Ala He Leu Thr Asn Leu Gin Ala 90 95 100
AAA GAC ATT CTT TTT ATT GAT GAA ATC CAC CGG CTC AGC CCA GCG ATT 448 Lys Asp He Leu Phe He Asp Glu He His Arg Leu Ser Pro Ala He 105 110 115 120
GAA GAG GTT TTA TAC CCG GCG ATG GAA GAT TTT AGG TTG GAT ATT ATC 496 Glu Glu Val Leu Tyr Pro Ala Met Glu Asp Phe Arg Leu Asp He He 125 130 135
ATA GGC TCA GGC CCA GCG GCT CAA ACC ATT AAA ATT GAT TTA CCC CCT 544 He Gly Ser Gly Pro Ala Ala Gin Thr He Lys He Asp Leu Pro Pro 140 145 150
TTC ACT CTC ATC GGC GCT ACC ACC AGA GCC GGA ATG CTC TCT AAC CCC 592 Phe Thr Leu He Gly Ala Thr Thr Arg Ala Gly Met Leu Ser Asn Pro 155 160 165
TTA AGA GAC AGA TTT GGC ATG AGT TTT AGA ATG CAA TTT TAT AAC CCT 640 Leu Arg Asp Arg Phe Gly Met Ser Phe Arg Met Gin Phe Tyr Asn Pro 170 175 180
AGC GAA CTG GCC CTC ATC ATT AAA AAA GCT GCC GTT AAA CTC AAC CAA 688 Ser Glu Leu Ala Leu He He Lys Lys Ala Ala Val Lys Leu Asn Gin 185 190 195 200
GAC ATC AAA CAA GAA AGT GCT GAT GAA ATC GCT AAA AGG AGT AGA GGC 736 Asp He Lys Gin Glu Ser Ala Asp Glu He Ala Lys Arg Ser Arg Gly 205 210 215
ACG CCA AGG ATC GCT TTA AGG CTT TTA AAA AGG GTG CGC GAT TTT GCG 784 Thr Pro Arg He Ala Leu Arg Leu Leu Lys Arg Val Arg Asp Phe Ala 220 225 230
CTA GTC AAA AAT TCA AGC TTG ATG GAT TTA AAC ATC ACT TTG CAT GCT 832 Leu Val Lys Asn Ser Ser Leu Met Asp Leu Asn He Thr Leu His Ala 235 240 245
TTG AAT GAA TTA GGC GTG AAT GAA TTA GGC TTT GAT GAA GCG GAT TTG 880 Leu Asn Glu Leu Gly Val Asn Glu Leu Gly Phe Asp Glu Ala Asp Leu 250 255 260
GCG TAT TTA TCT TTG TTG GCT AAC GCT CAA GGA AAG CCG GTG GGT TTG 928 Ala Tyr Leu Ser Leu Leu Ala Asn Ala Gin Gly Lys Pro Val Gly Leu 265 270 275 280
AAC ACG ATT GCA GCA TCT ATG AGA GAA GAT GAA GGC ACG ATT GAA GAC 976 Asn Thr He Ala Ala Ser Met Arg Glu Asp Glu Gly Thr He Glu Asp 285 290 295
GTG ATT GAG CCT TTT TTA CTC GCT AAT GGT TAT TTA GAG CGC ACC GCT 1024 Val He Glu Pro Phe Leu Leu Ala Asn Gly Tyr Leu Glu Arg Thr Ala 300 305 310
AAA GGC AGA ATC GCC ACG CCT AAA ACC CAT GAG CTC TTA AAA ATC CCC 1072 Lys Gly Arg He Ala Thr Pro Lys Thr His Glu Leu Leu Lys He Pro 315 320 325
ACT TTA AAC CCC CAA ACT TTA TTT TAATCTTGTT TAGAAAGAAA ATTACACTAC 1126 Thr Leu Asn Pro Gin Thr Leu Phe 330 335
AATAACGATA AAATTTTAAA GGGTGTAAAA GTAGATTGTT ATGTT 1171
(2) INFORMATION FOR SEQ ID NO:498:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 336 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498:
Met Lys Glu Arg He Val Asn Leu Glu Thr Leu Asp Phe Glu He Ser
1 5 10 15
Gin Glu Val Ser Leu Arg Pro Ser Leu Trp Glu Asp Phe He Gly Gin
20 25 30
Glu Lys He Lys Ser Asn Leu Gin He Ser He Cys Ala Ala Lys Lys
35 40 45
Arg Gin Glu Ser Leu Asp His Met Leu Phe Phe Gly Pro Pro Gly Leu
50 55 60
Gly Lys Thr Ser He Ser His He He Ala Lys Glu Met Glu Thr Asn 65 70 75 80
He Lys He Thr Ala Ala Pro Met He Glu Lys Ser Gly Asp Leu Ala
85 90 95
Ala He Leu Thr Asn Leu Gin Ala Lys Asp He Leu Phe He Asp Glu
100 105 110
He His Arg Leu Ser Pro Ala He Glu Glu Val Leu Tyr Pro Ala Met
115 120 125
Glu Asp Phe Arg Leu Asp He He He Gly Ser Gly Pro Ala Ala Gin 130 135 140
Thr He Lys He Asp Leu Pro Pro Phe Thr Leu He Gly Ala Thr Thr 145 150 155 160
Arg Ala Gly Met Leu Ser Asn Pro Leu Arg Asp Arg Phe Gly Met Ser
165 170 175
Phe Arg Met Gin Phe Tyr Asn Pro Ser Glu Leu Ala Leu He He Lys
180 185 190
Lys Ala Ala Val Lys Leu Asn Gin Asp He Lys Gin Glu Ser Ala Asp
195 200 205
Glu He Ala Lys Arg Ser Arg Gly Thr Pro Arg He Ala Leu Arg Leu
210 215 220
Leu Lys Arg Val Arg Asp Phe Ala Leu Val Lys Asn Ser Ser Leu Met 225 230 235 240
Asp Leu Asn He Thr Leu His Ala Leu Asn Glu Leu Gly Val Asn Glu
245 250 255
Leu Gly Phe Asp Glu Ala Asp Leu Ala Tyr Leu Ser Leu Leu Ala Asn
260 265 270
Ala Gin Gly Lys Pro Val Gly Leu Asn Thr He Ala Ala Ser Met Arg
275 280 285
Glu Asp Glu Gly Thr He Glu Asp Val He Glu Pro Phe Leu Leu Ala
290 295 300
Asn Gly Tyr Leu Glu Arg Thr Ala Lys Gly Arg He Ala Thr Pro Lys 305 310 315 320
Thr His Glu Leu Leu Lys He Pro Thr Leu Asn Pro Gin Thr Leu Phe 325 330 335
(2) INFORMATION FOR SEQ ID NO: 499:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 989 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 111...869 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499:
AGTTTCCAAT GAAGAAGCCT TAAACAAAGA AGTTTCAAGC GATGAATCCC CTAAAGAAGT 60 CCAATTAGCA ACCGATAACA ACACCAAAGA ACACGACAAA GAAAAAGAGA ATG TTT 116
Met Phe
1
GAA GAT TTA AAA CCG CAT TTA CAG GAA TTA AGA AAG CGT TTG ATG GTT 164 Glu Asp Leu Lys Pro His Leu Gin Glu Leu Arg Lys Arg Leu Met Val 5 10 15 TCT GTA GGA ACG ATT CTA GTG GCG TTT TTG GGG TGC TTT CAT TTT TGG 212 Ser Val Gly Thr He Leu Val Ala Phe Leu Gly Cys Phe His Phe Trp 20 25 30
AAA AGT ATT TTT GAA TTT GTT AAA AAT TCC TAT AAA GGC ACG CTC ATT 260 Lys Ser He Phe Glu Phe Val Lys Asn Ser Tyr Lys Gly Thr Leu He 35 40 45 50
CAG CTC TCC CCT ATT GAA GGG GTC ATG GTA GCG GTT AAA ATC AGT TTT 308 Gin Leu Ser Pro He Glu Gly Val Met Val Ala Val Lys He Ser Phe 55 60 65
TCA GCC GCT ATC GTC ATT TCC ATG CCC ATT ATT TTT TGG CAA TTA TGG 356 Ser Ala Ala He Val He Ser Met Pro He He Phe Trp Gin Leu Trp 70 75 80
CTC TTT ATC GCT CCA GGG CTT TAC AAG AAT GAA AAA AAA GTG ATT TTG 404 Leu Phe He Ala Pro Gly Leu Tyr Lys Asn Glu Lys Lys Val He Leu 85 90 95
CCT TTT GTG TTT TTT GGG AGT GGG ATG TTT TTG ATT GGG GCG GCG TTT 452 Pro Phe Val Phe Phe Gly Ser Gly Met Phe Leu He Gly Ala Ala Phe 100 105 110
TCT TAT TAT GTG GTG TTC CCT TTC ATT ATT GAA TAC TTA GCC ACT TTT 500 Ser Tyr Tyr Val Val Phe Pro Phe He He Glu Tyr Leu Ala Thr Phe 115 120 125 130
GGG AGC GAT GTG TTT GCG GCT AAT ATT TCT GCG TCC AGT TAC GTG AGC 548 Gly Ser Asp Val Phe Ala Ala Asn He Ser Ala Ser Ser Tyr Val Ser 135 140 145
TTT TTC ACG CGC TTG ATT TTA GGC TTT GGC GTG GCG TTT GAA TTG CCT 596 Phe Phe Thr Arg Leu He Leu Gly Phe Gly Val Ala Phe Glu Leu Pro 150 155 160
GTT TTG GCG TAT TTT TTG GCT AAA GTG GGC TTG ATT ACT GAT GCG AGC 644 Val Leu Ala Tyr Phe Leu Ala Lys Val Gly Leu He Thr Asp Ala Ser 165 170 175
TTG AAA GCG TAT TTT AAA TAC GCT ATT GTA GTG ATT TTT ATT GTA GCA 692 Leu Lys Ala Tyr Phe Lys Tyr Ala He Val Val He Phe He Val Ala 180 185 190
GCC ATT ATC ACT CCC CCT GAT GTG GTG AGT CAA ATC TTT ATG GCG TTG 740 Ala He He Thr Pro Pro Asp Val Val Ser Gin He Phe Met Ala Leu 195 200 205 210
CCC TTA GTG GGG CTT TAT GGG CTT TCT ATT TTA ATC GCC AAA ATG GTC 788 Pro Leu Val Gly Leu Tyr Gly Leu Ser He Leu He Ala Lys Met Val 215 220 225
AAT CCG GCT CCC AAA GAT AAC GAA AAT AAC AAC GAA AAT AAT AAC GAA 836 Asn Pro Ala Pro Lys Asp Asn Glu Asn Asn Asn Glu Asn Asn Asn Glu 230 235 240 AAT AAC ACC AAA GAG AAT ACA AAG AGC GAG TCG TAGTTGAAAG AATTTGATTT 889 Asn Asn Thr Lys Glu Asn Thr Lys Ser Glu Ser 245 250
AGAAAGCTAT GATTATTATT TGCCTAAGGA ATTGATCGCA AGCTACCCCG TTTTGCCCAA 949 AGAAAAGGCT AAATTACTCG TCTATGAAAG GCGTTCGCAA 989
(2) INFORMATION FOR SEQ ID NO: 500:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 253 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500:
Met Phe Glu Asp Leu Lys Pro His Leu Gin Glu Leu Arg Lys Arg Leu
1 5 10 15
Met Val Ser Val Gly Thr He Leu Val Ala Phe Leu Gly Cys Phe His
20 25 30
Phe Trp Lys Ser He Phe Glu Phe Val Lys Asn Ser Tyr Lys Gly Thr
35 40 45
Leu He Gin Leu Ser Pro He Glu Gly Val Met Val Ala Val Lys He
50 55 60
Ser Phe Ser Ala Ala He Val He Ser Met Pro He He Phe Trp Gin 65 70 75 80
Leu Trp Leu Phe He Ala Pro Gly Leu Tyr Lys Asn Glu Lys Lys Val
85 90 95
He Leu Pro Phe Val Phe Phe Gly Ser Gly Met Phe Leu He Gly Ala
100 105 110
Ala Phe Ser Tyr Tyr Val Val Phe Pro Phe He He Glu Tyr Leu Ala
115 120 125
Thr Phe Gly Ser Asp Val Phe Ala Ala Asn He Ser Ala Ser Ser Tyr
130 135 140
Val Ser Phe Phe Thr Arg Leu He Leu Gly Phe Gly Val Ala Phe Glu 145 150 155 160
Leu Pro Val Leu Ala Tyr Phe Leu Ala Lys Val Gly Leu He Thr Asp
165 170 175
Ala Ser Leu Lys Ala Tyr Phe Lys Tyr Ala He Val Val He Phe He
180 185 190
Val Ala Ala He He Thr Pro Pro Asp Val Val Ser Gin He Phe Met
195 200 205
Ala Leu Pro Leu Val Gly Leu Tyr Gly Leu Ser He Leu He Ala Lys
210 215 220
Met Val Asn Pro Ala Pro Lys Asp Asn Glu Asn Asn Asn Glu Asn Asn 225 230 235 240
Asn Glu Asn Asn Thr Lys Glu Asn Thr Lys Ser Glu Ser 245 250
(2) INFORMATION FOR SEQ ID NO: 501: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 655 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...600 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501:
GTGCATTATT TAAGAATTTT AATACTGAGT ATG AGT TTT TTA AAT ATT TTA AAT 54
Met Ser Phe Leu Asn He Leu Asn 1 5
GCT GAA AAT TTG AGT TAT ATG TCT TCT TCT TAT CAA ATA GGC ACG GTG 102 Ala Glu Asn Leu Ser Tyr Met Ser Ser Ser Tyr Gin He Gly Thr Val 10 15 20
TTT ATG CGC CCT TTA AAC ACC AAC AAG CTT TTA CAA GGG GCT TCA ATC 150 Phe Met Arg Pro Leu Asn Thr Asn Lys Leu Leu Gin Gly Ala Ser He 25 30 35 40
CTT CAA GGC TAT GAA GTG AAT CCT AAA AAC GAT TGG GCT TAT TCT AGG 198 Leu Gin Gly Tyr Glu Val Asn Pro Lys Asn Asp Trp Ala Tyr Ser Arg 45 50 55
TAT TAT TTC TTT ATA GAT TAT GGC AAT GTG CTT TTT AAT AAT GAC TCT 246 Tyr Tyr Phe Phe He Asp Tyr Gly Asn Val Leu Phe Asn Asn Asp Ser 60 65 70
ACT TTA CAA GCG AAC ATG TTC ACT TAT GGG GTG GGA GGG GAT TTT ATG 294 Thr Leu Gin Ala Asn Met Phe Thr Tyr Gly Val Gly Gly Asp Phe Met 75 80 85
GTC GCC TAC GCT AAA AAC CCT ATC AAC CGC TGG GCT TTT TTC TTT GGC 342 Val Ala Tyr Ala Lys Asn Pro He Asn Arg Trp Ala Phe Phe Phe Gly 90 95 100
TTG CAA CTG GCC GCT AAC ACA TGG ATA CTC AAC AAT AAA GTC AAA GAT 390 Leu Gin Leu Ala Ala Asn Thr Trp He Leu Asn Asn Lys Val Lys Asp 105 110 115 120
TTG GTG GTG AAT ACT TGG GAT TCA TTA AAA GAT TTC AAT TTT CAC AAC 438 Leu Val Val Asn Thr Trp Asp Ser Leu Lys Asp Phe Asn Phe His Asn 125 130 135
ACT TAT TTC AGG GCT ATT GGG AAG TTT GGG GTG CAG TTT CGC ACG ATC 486 Thr Tyr Phe Arg Ala He Gly Lys Phe Gly Val Gin Phe Arg Thr He 140 145 150
GTT TTG TAT CAT AAG GTG GAT GTA GAA ATT GGC ATG AAA ATC TTT CTA 534 Val Leu Tyr His Lys Val Asp Val Glu He Gly Met Lys He Phe Leu 155 160 165
ACT CCT GAA AGG CGC AGT TTG TTT GAA AGG AGC TTT TTG TTT TTT GTT 582 Thr Pro Glu Arg Arg Ser Leu Phe Glu Arg Ser Phe Leu Phe Phe Val 170 175 180
TCG CAT TCG TGG CAT TTT TAAATGGCGG AGAGAGAGGG ATTCGAACCC TCGAAGGC 638 Ser His Ser Trp His Phe 185 190
TTGCACCTTA CACGCGT 655
(2) INFORMATION FOR SEQ ID NO: 502:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 190 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:502:
Met Ser Phe Leu Asn He Leu Asn Ala Glu Asn Leu Ser Tyr Met Ser
1 5 10 15
Ser Ser Tyr Gin He Gly Thr Val Phe Met Arg Pro Leu Asn Thr Asn
20 25 30
Lys Leu Leu Gin Gly Ala Ser He Leu Gin Gly Tyr Glu Val Asn Pro
35 40 45
Lys Asn Asp Trp Ala Tyr Ser Arg Tyr Tyr Phe Phe He Asp Tyr Gly
50 55 60
Asn Val Leu Phe Asn Asn Asp Ser Thr Leu Gin Ala Asn Met Phe Thr 65 70 75 80
Tyr Gly Val Gly Gly Asp Phe Met Val Ala Tyr Ala Lys Asn Pro He
85 90 95
Asn Arg Trp Ala Phe Phe Phe Gly Leu Gin Leu Ala Ala Asn Thr Trp
100 105 110
He Leu Asn Asn Lys Val Lys Asp Leu Val Val Asn Thr Trp Asp Ser
115 120 125
Leu Lys Asp Phe Asn Phe His Asn Thr Tyr Phe Arg Ala He Gly Lys
130 135 140
Phe Gly Val Gin Phe Arg Thr He Val Leu Tyr His Lys Val Asp Val 145 150 155 160
Glu He Gly Met Lys He Phe Leu Thr Pro Glu Arg Arg Ser Leu Phe
165 170 175
Glu Arg Ser Phe Leu Phe Phe Val Ser His Ser Trp His Phe 180 185 190 (2) INFORMATION FOR SEQ ID NO: 503:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 830 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...714 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503:
AAAGCACTAT ATTAAGATTA GGATCATTTA ATG GCA GAT AAA GAA ATA CTG ATT 54
Met Ala Asp Lys Glu He Leu He
1 5
TTT GTA GAA GGT CCA AGC GAT AAG GTG TTT TTA GAA GTT TAT CTG TAT 102 Phe Val Glu Gly Pro Ser Asp Lys Val Phe Leu Glu Val Tyr Leu Tyr 10 15 20
TTT CTA GAA AGA TTT CCA ATC AAA AAC TTT AAA GTG CAA AAT GTA GAT 150 Phe Leu Glu Arg Phe Pro He Lys Asn Phe Lys Val Gin Asn Val Asp 25 30 35 40
GGA AAA GAT AAC CTG TCT AAA CGA TTG CTT GAA ATT GAA AAA TAC GAT 198 Gly Lys Asp Asn Leu Ser Lys Arg Leu Leu Glu He Glu Lys Tyr Asp 45 50 55
AAA ACA CTT ATC ATT TTT GAT GCG GAT AAA GAC TAT GAG AGT AAT AAA 246 Lys Thr Leu He He Phe Asp Ala Asp Lys Asp Tyr Glu Ser Asn Lys 60 65 70
AAA GAG ATT TTA AAA ATT GTA TCA GAA TCG AAA CAA ACT ATT TCA GAA 294 Lys Glu He Leu Lys He Val Ser Glu Ser Lys Gin Thr He Ser Glu 75 80 85
GAA CAA ATT TTT TTA TTT CCT AAT AAT CAA GAT GAT GGC GAT TTA GAA 342 Glu Gin He Phe Leu Phe Pro Asn Asn Gin Asp Asp Gly Asp Leu Glu 90 95 100
ACC CTA TTA TTA AAG ATT GCT AAC CAC AAA GAG TTC ATA AAT TGT TTT 390 Thr Leu Leu Leu Lys He Ala Asn His Lys Glu Phe He Asn Cys Phe 105 110 115 120
GAA AGC TAT TTG GAT TGT ATT AAA AAG AAA GAA CAT TAC AAA CCG ATT 438 Glu Ser Tyr Leu Asp Cys He Lys Lys Lys Glu His Tyr Lys Pro He 125 130 135 AAA AAC ATA AGA AAA AGT AAG TGG TAT GCC TAT TTA GAA GCG CTT GGA 486 Lys Asn He Arg Lys Ser Lys Trp Tyr Ala Tyr Leu Glu Ala Leu Gly 140 145 150
TTA GAA AAA TTT TTC CAA TAC ACA TGG GAC ACA AAG AAA AAG AAT AAT 534 Leu Glu Lys Phe Phe Gin Tyr Thr Trp Asp Thr Lys Lys Lys Asn Asn 155 160 165
AAA AAA AAG CTT ATC ATT GAC GAT AAA GAT GGA GAT GAG ATT GAG ATA 582 Lys Lys Lys Leu He He Asp Asp Lys Asp Gly Asp Glu He Glu He 170 175 180
AAA GAT CAA TAT AAA GGA GAT TAT GAA GAA CTA AAA AAA GTT CTT GAT 630 Lys Asp Gin Tyr Lys Gly Asp Tyr Glu Glu Leu Lys Lys Val Leu Asp 185 190 195 200
CTT AAC TCA AAA TCT CTT ATT CCC CTT AAA AAT TTT TTA GGG CAA TTT 678 Leu Asn Ser Lys Ser Leu He Pro Leu Lys Asn Phe Leu Gly Gin Phe 205 210 215
GCA GAA AAT AAT CAA AAA ACA AAT CCT AAA ATT TTC TAATTTAACA AAATAT 730 Ala Glu Asn Asn Gin Lys Thr Asn Pro Lys He Phe 220 225
ATTACAATTA CCAAAAAAGT ATTATTTTTC TTAAAAGGTG CGCTGTGAAA TTGTGGTTTC 790 CTTATTTTTT AGCGATTGTG TTCTTGCATG CATTGGGTTT 830
(2) INFORMATION FOR SEQ ID NO: 504:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 228 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:504:
Met Ala Asp Lys Glu He Leu He Phe Val Glu Gly Pro Ser Asp Lys
1 5 10 15
Val Phe Leu Glu Val Tyr Leu Tyr Phe Leu Glu Arg Phe Pro He Lys
20 25 30
Asn Phe Lys Val Gin Asn Val Asp Gly Lys Asp Asn Leu Ser Lys Arg
35 40 45
Leu Leu Glu He Glu Lys Tyr Asp Lys Thr Leu He He Phe Asp Ala
50 55 60
Asp Lys Asp Tyr Glu Ser Asn Lys Lys Glu He Leu Lys He Val Ser 65 70 75 80
Glu Ser Lys Gin Thr He Ser Glu Glu Gin He Phe Leu Phe Pro Asn
85 90 95
Asn Gin Asp Asp Gly Asp Leu Glu Thr Leu Leu Leu Lys He Ala Asn
100 105 110
His Lys Glu Phe He Asn Cys Phe Glu Ser Tyr Leu Asp Cys He Lys 115 120 125
Lys Lys Glu His Tyr Lys Pro He Lys Asn He Arg Lys Ser Lys Trp
130 135 140
Tyr Ala Tyr Leu Glu Ala Leu Gly Leu Glu Lys Phe Phe Gin Tyr Thr 145 150 155 160
Trp Asp Thr Lys Lys Lys Asn Asn Lys Lys Lys Leu He He Asp Asp
165 170 175
Lys Asp Gly Asp Glu He Glu He Lys Asp Gin Tyr Lys Gly Asp Tyr
180 185 190
Glu Glu Leu Lys Lys Val Leu Asp Leu Asn Ser Lys Ser Leu He Pro
195 200 205
Leu Lys Asn Phe Leu Gly Gin Phe Ala Glu Asn Asn Gin Lys Thr Asn
210 215 220
Pro Lys He Phe 225
(2) INFORMATION FOR SEQ ID NO:505:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1349 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 78...1298 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:505:
AAGAACAAAT TGAGCCTGCT ACCGCTCTCA CTCAATGGCC GCAAAAACAA GAAAAGAAAA 60 AATCATAAGG AAAAAAT ATG GCA AAA AGT ATT GAA TTG CAA GAG ATA GAA 110
Met Ala Lys Ser He Glu Leu Gin Glu He Glu 1 5 10
GTG TGG GAT GGC AAT ACC GCT AGT TCT AAC GCT TTA AGA CAG GCT CAA 158 Val Trp Asp Gly Asn Thr Ala Ser Ser Asn Ala Leu Arg Gin Ala Gin 15 20 25
ATT GAT GTC ATC GCA GCC TAT CCT ATC ACC CCA TCA ACG CCC ATT GTG 206 He Asp Val He Ala Ala Tyr Pro He Thr Pro Ser Thr Pro He Val 30 35 40
CAA AAT TAT GGC TCG TTT AAG GAT AAT GGC TAT GTT GAT GGC GAA TTC 254 Gin Asn Tyr Gly Ser Phe Lys Asp Asn Gly Tyr Val Asp Gly Glu Phe 45 50 55
GTT TTA GTG GAA TCT GAG CAT GCC GCC ATG AGC GCA TGC GTG GGA GCT 302 Val Leu Val Glu Ser Glu His Ala Ala Met Ser Ala Cys Val Gly Ala 60 65 70 75
GCC GCA GCT GGA GGG AGA GTC AGC ACT GCG ACT AGC TCT CAA GGT TTG 350 Ala Ala Ala Gly Gly Arg Val Ser Thr Ala Thr Ser Ser Gin Gly Leu 80 85 90
GCG TTA ATG GTA GAG GTT TTA TAC CAG GCT TCT GGA ATG CGT TTG CCT 398 Ala Leu Met Val Glu Val Leu Tyr Gin Ala Ser Gly Met Arg Leu Pro 95 100 105
ATC GTT TTG AAT TTA GTC AAT CGT GCT TTA GCA GCC CCT TTG AAT ATC 446 He Val Leu Asn Leu Val Asn Arg Ala Leu Ala Ala Pro Leu Asn He 110 115 120
CAT GGC GAT CAT TCT GAT ATG TAT TTA AGC AGG GAT TCT GGT TGG ATA 494 His Gly Asp His Ser Asp Met Tyr Leu Ser Arg Asp Ser Gly Trp He 125 130 135
AGT TTA TGC ACA TGC AAC CCC CAA GAA GCT TAT GAT TTC ACT TTA ATG 542 Ser Leu Cys Thr Cys Asn Pro Gin Glu Ala Tyr Asp Phe Thr Leu Met 140 145 150 155
GCG TTT AGA ATC GCA GAG CAT CAA AAG GTG CGC GTG CCT ACT ATT GTC 590 Ala Phe Arg He Ala Glu His Gin Lys Val Arg Val Pro Thr He Val 160 165 170
AAT CAA GAC GGG TTT TTA TGC TCG CAC ACC GTG CAA AAT GTC CGC CCT 638 Asn Gin Asp Gly Phe Leu Cys Ser His Thr Val Gin Asn Val Arg Pro 175 180 185
TTG AGC GAT GCA GTG GCT TAC CAA TTC GTG GGC GAA TAC CAA ACC AAG 686 Leu Ser Asp Ala Val Ala Tyr Gin Phe Val Gly Glu Tyr Gin Thr Lys 190 195 200
CAT TCC CTT TTG GAT TTT GAT AAA CCG GTA AGC TAT GGC GCG CAA GCT 734 His Ser Leu Leu Asp Phe Asp Lys Pro Val Ser Tyr Gly Ala Gin Ala 205 210 215
GAA GAA GAA TGG CAT TAT GAG CAT AAA GCC CAA CTC CAC CAT GCC ATC 782 Glu Glu Glu Trp His Tyr Glu His Lys Ala Gin Leu His His Ala He 220 225 230 235
ATG AGC GCG TCT TCT GTG ATT GAA GAA GTG TTC AAT GAT TTC GCT AAA 830 Met Ser Ala Ser Ser Val He Glu Glu Val Phe Asn Asp Phe Ala Lys 240 245 250
CTC ACA GGC AGG CAA TAC CAT TTA ACC AAA ACT TTC CAG CTA GAA GAC 878 Leu Thr Gly Arg Gin Tyr His Leu Thr Lys Thr Phe Gin Leu Glu Asp 255 260 265
GCT GAA ATC GCT ATC TTT GCG TTA GGC ACT ACT TAT GAA TCA GCG ATC 926 Ala Glu He Ala He Phe Ala Leu Gly Thr Thr Tyr Glu Ser Ala He 270 275 280
GTA GCG GCT AAA GAA ATG CGT AAA AAA GGC ATT AAG GCC GGC GTG GCT 974 Val Ala Ala Lys Glu Met Arg Lys Lys Gly He Lys Ala Gly Val Ala 285 290 295
ACC ATC CAT TCC TTG CGC CCC TTC CCT TAT GAA AGA TTA GGG CAG GAT 1022 Thr He His Ser Leu Arg Pro Phe Pro Tyr Glu Arg Leu Gly Gin Asp 300 305 310 315
TTG AAA AAT CTT AAA GCT TTA GCG ATT TTA GAC AAG AGC TCT CCA GCG 1070 Leu Lys Asn Leu Lys Ala Leu Ala He Leu Asp Lys Ser Ser Pro Ala 320 325 330
GGC ACT ATG GGG GCG ATG TTT AAT GAA GTA ACG AGC GCG GTG TAT CAA 1118 Gly Thr Met Gly Ala Met Phe Asn Glu Val Thr Ser Ala Val Tyr Gin 335 340 345
ACG CAA GGG ACT AAA CAC CCC GTG GTG TCT AAC TAC ATT TAT GGT TTA 1166 Thr Gin Gly Thr Lys His Pro Val Val Ser Asn Tyr He Tyr Gly Leu 350 355 360
GGC GAA AGG GAT ATG ACG ATC GCG CAT TTA TGC GAA ATT TTT GAA GAA 1214 Gly Glu Arg Asp Met Thr He Ala His Leu Cys Glu He Phe Glu Glu 365 370 375
ATC AAT GAA GAC GCT CTT AAA GGC ACG CTC ACG CAC CCT ACC CAA CAA 1262 He Asn Glu Asp Ala Leu Lys Gly Thr Leu Thr His Pro Thr Gin Gin 380 385 390 395
TTC GTA GGC TTG CAC GGC CCT AAA ATG AGC TTT TTT TAAAAAGGAA ATATCA 1314 Phe Val Gly Leu His Gly Pro Lys Met Ser Phe Phe 400 405
TGGTAAAAGA AGTCAAAACA CTCAAAGGTT TTAGC 1349
(2) INFORMATION FOR SEQ ID NO: 506:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 407 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506:
Met Ala Lys Ser He Glu Leu Gin Glu He Glu Val Trp Asp Gly Asn
1 5 10 15
Thr Ala Ser Ser Asn Ala Leu Arg Gin Ala Gin He Asp Val He Ala
20 25 30
Ala Tyr Pro He Thr Pro Ser Thr Pro He Val Gin Asn Tyr Gly Ser
35 40 45
Phe Lys Asp Asn Gly Tyr Val Asp Gly Glu Phe Val Leu Val Glu Ser
50 55 60
Glu His Ala Ala Met Ser Ala Cys Val Gly Ala Ala Ala Ala Gly Gly 65 70 75 80
Arg Val Ser Thr Ala Thr Ser Ser Gin Gly Leu Ala Leu Met Val Glu
85 90 95
Val Leu Tyr Gin Ala Ser Gly Met Arg Leu Pro He Val Leu Asn Leu
100 105 110
Val Asn Arg Ala Leu Ala Ala Pro Leu Asn He His Gly Asp His Ser
115 120 125
Asp Met Tyr Leu Ser Arg Asp Ser Gly Trp He Ser Leu Cys Thr Cys
130 135 140
Asn Pro Gin Glu Ala Tyr Asp Phe Thr Leu Met Ala Phe Arg He Ala 145 150 155 160
Glu His Gin Lys Val Arg Val Pro Thr He Val Asn Gin Asp Gly Phe
165 170 175
Leu Cys Ser His Thr Val Gin Asn Val Arg Pro Leu Ser Asp Ala Val
180 185 190
Ala Tyr Gin Phe Val Gly Glu Tyr Gin Thr Lys His Ser Leu Leu Asp
195 200 205
Phe Asp Lys Pro Val Ser Tyr Gly Ala Gin Ala Glu Glu Glu Trp His
210 215 220
Tyr Glu His Lys Ala Gin Leu His His Ala He Met Ser Ala Ser Ser 225 230 235 240
Val He Glu Glu Val Phe Asn Asp Phe Ala Lys Leu Thr Gly Arg Gin
245 250 255
Tyr His Leu Thr Lys Thr Phe Gin Leu Glu Asp Ala Glu He Ala He
260 265 270
Phe Ala Leu Gly Thr Thr Tyr Glu Ser Ala He Val Ala Ala Lys Glu
275 280 285
Met Arg Lys Lys Gly He Lys Ala Gly Val Ala Thr He His Ser Leu
290 295 300
Arg Pro Phe Pro Tyr Glu Arg Leu Gly Gin Asp Leu Lys Asn Leu Lys 305 310 315 320
Ala Leu Ala He Leu Asp Lys Ser Ser Pro Ala Gly Thr Met Gly Ala
325 330 335
Met Phe Asn Glu Val Thr Ser Ala Val Tyr Gin Thr Gin Gly Thr Lys
340 345 350
His Pro Val Val Ser Asn Tyr He Tyr Gly Leu Gly Glu Arg Asp Met
355 360 365
Thr He Ala His Leu Cys Glu He Phe Glu Glu He Asn Glu Asp Ala
370 375 380
Leu Lys Gly Thr Leu Thr His Pro Thr Gin Gin Phe Val Gly Leu His 385 390 395 400
Gly Pro Lys Met Ser Phe Phe 405
(2) INFORMATION FOR SEQ ID NO: 507:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 948 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...855 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:507:
AAA ATT GAA GTT TTA AGG GGT TTT TTG AAA CGA GCG TTA TAC TTA ATT 48 Lys He Glu Val Leu Arg Gly Phe Leu Lys Arg Ala Leu Tyr Leu He 1 5 10 15
TTA GGG CTT TTT TAC ACG CTT AAT GCA GAG AGC TTT AAA GAT GTT TTG 96 Leu Gly Leu Phe Tyr Thr Leu Asn Ala Glu Ser Phe Lys Asp Val Leu 20 25 30
ACT AAA GTG GAT TAC ACT TTT TTT AAT AAA AAG GTG GTT TCG CCC ATC 144 Thr Lys Val Asp Tyr Thr Phe Phe Asn Lys Lys Val Val Ser Pro He 35 40 45
AAA CGC TAT GCG GAT AGA TCG GCG TTT TAT CTG GGG CTT GGG TAT CAA 192 Lys Arg Tyr Ala Asp Arg Ser Ala Phe Tyr Leu Gly Leu Gly Tyr Gin 50 55 60
TTA GGG AGC ATT CAG CAC AAC TCT AGC AAC TTG AAT TTA TCC CAG CAA 240 Leu Gly Ser He Gin His Asn Ser Ser Asn Leu Asn Leu Ser Gin Gin 65 70 75 80
TTC AAT AAG AGT CAG ATT ATT TTC AGC GAT AGT CTA AGC CCT GTT TTT 288 Phe Asn Lys Ser Gin He He Phe Ser Asp Ser Leu Ser Pro Val Phe 85 90 95
AAA AAT TCG TAT GTG TCT AAT GGC CTT GGC GTG CAA GTG GGC TAT AAG 336 Lys Asn Ser Tyr Val Ser Asn Gly Leu Gly Val Gin Val Gly Tyr Lys 100 105 110
TGG GTG GGT AAG CAT GAA GAG ACG AAA TGG TTT GGC TTC AGG TGG GGG 384 Trp Val Gly Lys His Glu Glu Thr Lys Trp Phe Gly Phe Arg Trp Gly 115 120 125
CTG TTT TAT GAT TTG AGC GCC TCT CTT TAT GGC CAA AAA GAA TCA CAG 432 Leu Phe Tyr Asp Leu Ser Ala Ser Leu Tyr Gly Gin Lys Glu Ser Gin 130 135 140
TCT GTC ATC ATT TCC ACT TAC GGC ACT TAT ATG GAT TTA TTA TTG AAC 480 Ser Val He He Ser Thr Tyr Gly Thr Tyr Met Asp Leu Leu Leu Asn 145 150 155 160
GCT TAT AAT GGG GAT AAG TTT TTT GCT GGG TTC AAT CTG GGG ATT GCT 528 Ala Tyr Asn Gly Asp Lys Phe Phe Ala Gly Phe Asn Leu Gly He Ala 165 170 175
TTT GCT GGA GTG TAT GAC AAA GTG AGC GAT GCG TTA TTG TAT CAA GCC 576 Phe Ala Gly Val Tyr Asp Lys Val Ser Asp Ala Leu Leu Tyr Gin Ala 180 185 190
CTT CTT TTA GAC ACT TTT GGC GGG AAA GTG GAT CCA AAT GGC TTC CAG 624 Leu Leu Leu Asp Thr Phe Gly Gly Lys Val Asp Pro Asn Gly Phe Gin 195 200 205
TTT TTG GTA AAT TTA GGG GTT CGT TTA GGG AAT AAG CAC AAC CAA TTT 672 Phe Leu Val Asn Leu Gly Val Arg Leu Gly Asn Lys His Asn Gin Phe 210 215 220
GGC TTT GGG ATT AAA ATC CCT ACT TAT TAT TTT AAC CAT TAT TAT TCC 720 Gly Phe Gly He Lys He Pro Thr Tyr Tyr Phe Asn His Tyr Tyr Ser 225 230 235 240
ATG AAT AAC ATT AGC AAT AAT AGT GAA GAT GTC CTC AAA GTT TTA CGA 768 Met Asn Asn He Ser Asn Asn Ser Glu Asp Val Leu Lys Val Leu Arg 245 250 255
TTT TTA GAA TAC GGG ATC AAC AGC TTG TTA TAC CAA GTT GAT TTC AGG 816 Phe Leu Glu Tyr Gly He Asn Ser Leu Leu Tyr Gin Val Asp Phe Arg 260 265 270
CGC AAT TAC TCG GTT TAT TTC AAC TAC ACT TAT ATT TTT TAAGCGATAG CG 867 Arg Asn Tyr Ser Val Tyr Phe Asn Tyr Thr Tyr He Phe 275 280 285
TTTAAAGCGT TCTTAATTGA GCGATTTCGT CTCTCAAACG CATCGCTTCT TCAAAATCCA 927 AATTCTTCGT GCATTCTCGC A 948
(2) INFORMATION FOR SEQ ID NO: 508:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 285 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508:
Lys He Glu Val Leu Arg Gly Phe Leu Lys Arg Ala Leu Tyr Leu He
1 5 10 15
Leu Gly Leu Phe Tyr Thr Leu Asn Ala Glu Ser Phe Lys Asp Val Leu
20 25 30
Thr Lys Val Asp Tyr Thr Phe Phe Asn Lys Lys Val Val Ser Pro He
35 40 45
Lys Arg Tyr Ala Asp Arg Ser Ala Phe Tyr Leu Gly Leu Gly Tyr Gin
50 55 60
Leu Gly Ser He Gin His Asn Ser Ser Asn Leu Asn Leu Ser Gin Gin 65 70 75 80
Phe Asn Lys Ser Gin He He Phe Ser Asp Ser Leu Ser Pro Val Phe
85 90 95
Lys Asn Ser Tyr Val Ser Asn Gly Leu Gly Val Gin Val Gly Tyr Lys 100 105 110
Trp Val Gly Lys His Glu Glu Thr Lys Trp Phe Gly Phe Arg Trp Gly
115 120 125
Leu Phe Tyr Asp Leu Ser Ala Ser Leu Tyr Gly Gin Lys Glu Ser Gin
130 135 140
Ser Val He He Ser Thr Tyr Gly Thr Tyr Met Asp Leu Leu Leu Asn 145 150 155 160
Ala Tyr Asn Gly Asp Lys Phe Phe Ala Gly Phe Asn Leu Gly He Ala
165 170 175
Phe Ala Gly Val Tyr Asp Lys Val Ser Asp Ala Leu Leu Tyr Gin Ala
180 185 190
Leu Leu Leu Asp Thr Phe Gly Gly Lys Val Asp Pro Asn Gly Phe Gin
195 200 205
Phe Leu Val Asn Leu Gly Val Arg Leu Gly Asn Lys His Asn Gin Phe
210 215 220
Gly Phe Gly He Lys He Pro Thr Tyr Tyr Phe Asn His Tyr Tyr Ser 225 230 235 240
Met Asn Asn He Ser Asn Asn Ser Glu Asp Val Leu Lys Val Leu Arg
245 250 255
Phe Leu Glu Tyr Gly He Asn Ser Leu Leu Tyr Gin Val Asp Phe Arg
260 265 270
Arg Asn Tyr Ser Val Tyr Phe Asn Tyr Thr Tyr He Phe 275 280 285
(2) INFORMATION FOR SEQ ID NO: 509:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1840 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 35...1735 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509:
AAATTTTAAG CGTAACTGAT TGAAAGGAAA ACAG ATG AGA CGG AGT TTT TTG AAA 55
Met Arg Arg Ser Phe Leu Lys 1 5
ACG ATT GGC TTG GGT GTG ATA GCA CTC TTT TTG GGT TTG TTA AAC CCT 103 Thr He Gly Leu Gly Val He Ala Leu Phe Leu Gly Leu Leu Asn Pro 10 15 20
TTG AGT GCG GCG AGT TAC CCC CCC ATT AAA AAC ACT AAA GTA GGC TTA 151 Leu Ser Ala Ala Ser Tyr Pro Pro He Lys Asn Thr Lys Val Gly Leu 25 30 35 GCC CTT TCT AGC CAC CCG CTA GCT AGT GAG ATC GGG CAA AAG GTT TTA 199 Ala Leu Ser Ser His Pro Leu Ala Ser Glu He Gly Gin Lys Val Leu 40 45 50 55
GAA GAG GGA GGT AAT GCG ATT GAT GCG GCT GTA GCG ATA GGT TTT GCT 247 Glu Glu Gly Gly Asn Ala He Asp Ala Ala Val Ala He Gly Phe Ala 60 65 70
CTT GCG GTT GTC CAT CCG GCA GCA GGC AAT ATT GGT GGA GGC GGT TTT 295 Leu Ala Val Val His Pro Ala Ala Gly Asn He Gly Gly Gly Gly Phe 75 80 85
GCG GTT ATC CAT TTG GCT AAT GGT GAA AAT GTT GCG TTA GAT TTT AGA 343 Ala Val He His Leu Ala Asn Gly Glu Asn Val Ala Leu Asp Phe Arg 90 95 100
GAA AAA GCC CCC TTA AAA GCC ACT AAA AAC ATG TTT TTA GAC AAG CAA 391 Glu Lys Ala Pro Leu Lys Ala Thr Lys Asn Met Phe Leu Asp Lys Gin 105 110 115
GGC AAT GTA GTC CCT AAA CTC AGC GAA GAT GGC TAT TTG GCG GCC GGG 439 Gly Asn Val Val Pro Lys Leu Ser Glu Asp Gly Tyr Leu Ala Ala Gly 120 125 130 135
GTT CCT GGA ACG GTG GCA GGC ATG GAA GCG ATG CTG AAA AAA TAC GGC 487 Val Pro Gly Thr Val Ala Gly Met Glu Ala Met Leu Lys Lys Tyr Gly 140 145 150
ACT AAA AAA CTA TCG CAA CTC ATT GAT CCT GCC ATT AAA TTG GCT GAA 535 Thr Lys Lys Leu Ser Gin Leu He Asp Pro Ala He Lys Leu Ala Glu 155 160 165
AAT GGT TAT GCG ATT TCA CAA AGA CAA GCA GAA ACC CTA AAG GAA GCA 583 Asn Gly Tyr Ala He Ser Gin Arg Gin Ala Glu Thr Leu Lys Glu Ala 170 175 180
AGG GAG CGG TTT TTA AAA TAC AGT TCT AGC AAA AAG TAT TTT TTT AAA 631 Arg Glu Arg Phe Leu Lys Tyr Ser Ser Ser Lys Lys Tyr Phe Phe Lys 185 190 195
AAA GGC CAT CTT GAT TAT CAA GAA GGG GAT TTG TTT GTC CAA AAA GAT 679 Lys Gly His Leu Asp Tyr Gin Glu Gly Asp Leu Phe Val Gin Lys Asp 200 205 210 215
TTA GCC AAG ACT TTG AAT CAA ATC AAA ACG CTA GGC GCT AAA GGC TTT 727 Leu Ala Lys Thr Leu Asn Gin He Lys Thr Leu Gly Ala Lys Gly Phe 220 225 230
TAT CAA GGG CAA GTC GCT GAG CTT ATT GAG AAA GAC ATG AAA AAA AAT 775 Tyr Gin Gly Gin Val Ala Glu Leu He Glu Lys Asp Met Lys Lys Asn 235 240 245
GGA GGG ATT ATC ACT AAA GAA GAT TTA GCC AGT TAC AAT GTG AAA TGG 823 Gly Gly He He Thr Lys Glu Asp Leu Ala Ser Tyr Asn Val Lys Trp 250 255 260 CGC AAA CCC GTG GTA GGG AGT TAT CGT GGG TAT AAG ATC ATT TCT ATG 871 Arg Lys Pro Val Val Gly Ser Tyr Arg Gly Tyr Lys He He Ser Met 265 270 275
TCG CCG CCA AGT TCG GGA GGC ACG CAT TTG ATC CAG ATT TTA AAT GTC 919 Ser Pro Pro Ser Ser Gly Gly Thr His Leu He Gin He Leu Asn Val 280 285 290 295
ATG GAA AAT GCG GAT TTA AGC GCC CTT GGG TAT GGG GCT TCT AAG AAT 967 Met Glu Asn Ala Asp Leu Ser Ala Leu Gly Tyr Gly Ala Ser Lys Asn 300 305 310
ATC CAT ATC GCT GCC GAA GCG ATG CGT CAG GCT TAT GCG GAT AGA TCG 1015 He His He Ala Ala Glu Ala Met Arg Gin Ala Tyr Ala Asp Arg Ser 315 320 325
GTT TAT ATG GGA GAC GCT GAT TTT GTT TCG GTG CCG GTG GAT AAA TTG 1063 Val Tyr Met Gly Asp Ala Asp Phe Val Ser Val Pro Val Asp Lys Leu 330 335 340
ATT AAT AAA GCG TAT GCC AAA AAG ATT TTT GAC ACT ATC CAG CCA GAT 1111 He Asn Lys Ala Tyr Ala Lys Lys He Phe Asp Thr He Gin Pro Asp 345 350 355
ACG GTT ACG CCA AGC TCT CAA ATC AAA CCA GGA ATG GGG CAG TTG CAT 1159 Thr Val Thr Pro Ser Ser Gin He Lys Pro Gly Met Gly Gin Leu His 360 365 370 375
GAG GGG AGC AAT ACC ACG CAT TAT TCT GTA GCG GAC AGG TGG GGG AAT 1207 Glu Gly Ser Asn Thr Thr His Tyr Ser Val Ala Asp Arg Trp Gly Asn 380 385 390
GCA GTC AGC GTT ACT TAC ACC ATT AAC GCT TCT TAT GGA AGC GCT GCC 1255 Ala Val Ser Val Thr Tyr Thr He Asn Ala Ser Tyr Gly Ser Ala Ala 395 400 405
AGT ATT GAT GGG GCA GGA TTT TTA TTG AAC AAT GAA ATG GAT GAT TTT 1303 Ser He Asp Gly Ala Gly Phe Leu Leu Asn Asn Glu Met Asp Asp Phe 410 415 420
TCC ATC AAG CCA GGG AAT CCC AAT CTC TAT GGT TTA GTA GGG GGC GAT 1351 Ser He Lys Pro Gly Asn Pro Asn Leu Tyr Gly Leu Val Gly Gly Asp 425 430 435
GCG AAT GCG ATT GAA GCC AAT AAG CGC CCT TTA AGC TCC ATG TCG CCT 1399 Ala Asn Ala He Glu Ala Asn Lys Arg Pro Leu Ser Ser Met Ser Pro 440 445 450 455
ACG ATT GTG TTG AAA AAC AAT AAG GTT TTT TTG GTG GTG GGA AGC CCT 1447 Thr He Val Leu Lys Asn Asn Lys Val Phe Leu Val Val Gly Ser Pro 460 465 470
GGA GGG TCT AGG ATT ATC ACT ACG GTG CTG CAA GTG ATT TCT AAT GTC 1495 Gly Gly Ser Arg He He Thr Thr Val Leu Gin Val He Ser Asn Val 475 480 485 ATT GAT TAT AAT ATG AAT ATT TCT GAA GCG GTT TCA GCC CCA AGA TTT 1543 He Asp Tyr Asn Met Asn He Ser Glu Ala Val Ser Ala Pro Arg Phe 490 495 500
CAC ATG CAA TGG CTC CCT GAT GAA TTA AGG ATT GAA AAG TTT GGC ATG 1591 His Met Gin Trp Leu Pro Asp Glu Leu Arg He Glu Lys Phe Gly Met 505 510 515
CCC GCT GAT GTG AAA GAC AAC CTC ACT AAA ATG GGC TAT CAA ATC GTT 1639 Pro Ala Asp Val Lys Asp Asn Leu Thr Lys Met Gly Tyr Gin He Val 520 525 530 535
ACT AAG CCG GTC ATG GGT GAT GTG AAT GCG ATC CAA GTT TTG CCT AAA 1687 Thr Lys Pro Val Met Gly Asp Val Asn Ala He Gin Val Leu Pro Lys 540 545 550
ACT AAA GGG AGC GTT TTC TAT GGT TCA ACG GAT CCA AGG AAA GAA TTT T 1736 Thr Lys Gly Ser Val Phe Tyr Gly Ser Thr Asp Pro Arg Lys Glu Phe 555 560 565
AATTCTTTGT CATATACAGG TTTTTAATCC TATTTAGCCT TATTTTTTGG GATGGAGGGG 1796 GGCTTTTTAG CGAGAAAATC TTAAATTTAG TTTTAAAATT CATA 1840
(2) INFORMATION FOR SEQ ID NO: 510:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 567 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:510:
Met Arg Arg Ser Phe Leu Lys Thr He Gly Leu Gly Val He Ala Leu
1 5 10 15
Phe Leu Gly Leu Leu Asn Pro Leu Ser Ala Ala Ser Tyr Pro Pro He
20 25 30
Lys Asn Thr Lys Val Gly Leu Ala Leu Ser Ser His Pro Leu Ala Ser
35 40 45
Glu He Gly Gin Lys Val Leu Glu Glu Gly Gly Asn Ala He Asp Ala
50 55 60
Ala Val Ala He Gly Phe Ala Leu Ala Val Val His Pro Ala Ala Gly 65 70 75 80
Asn He Gly Gly Gly Gly Phe Ala Val He His Leu Ala Asn Gly Glu
85 90 95
Asn Val Ala Leu Asp Phe Arg Glu Lys Ala Pro Leu Lys Ala Thr Lys
100 105 110
Asn Met Phe Leu Asp Lys Gin Gly Asn Val Val Pro Lys Leu Ser Glu
115 120 125
Asp Gly Tyr Leu Ala Ala Gly Val Pro Gly Thr Val Ala Gly Met Glu
130 135 140
Ala Met Leu Lys Lys Tyr Gly Thr Lys Lys Leu Ser Gin Leu He Asp 145 150 155 160
Pro Ala He Lys Leu Ala Glu Asn Gly Tyr Ala He Ser Gin Arg Gin
165 170 175
Ala Glu Thr Leu Lys Glu Ala Arg Glu Arg Phe Leu Lys Tyr Ser Ser
180 185 190
Ser Lys Lys Tyr Phe Phe Lys Lys Gly His Leu Asp Tyr Gin Glu Gly
195 200 205
Asp Leu Phe Val Gin Lys Asp Leu Ala Lys Thr Leu Asn Gin He Lys
210 215 220
Thr Leu Gly Ala Lys Gly Phe Tyr Gin Gly Gin Val Ala Glu Leu He 225 230 235 240
Glu Lys Asp Met Lys Lys Asn Gly Gly He He Thr Lys Glu Asp Leu
245 250 255
Ala Ser Tyr Asn Val Lys Trp Arg Lys Pro Val Val Gly Ser Tyr Arg
260 265 270
Gly Tyr Lys He He Ser Met Ser Pro Pro Ser Ser Gly Gly Thr His
275 280 285
Leu He Gin He Leu Asn Val Met Glu Asn Ala Asp Leu Ser Ala Leu
290 295 300
Gly Tyr Gly Ala Ser Lys Asn He His He Ala Ala Glu Ala Met Arg 305 310 315 320
Gin Ala Tyr Ala Asp Arg Ser Val Tyr Met Gly Asp Ala Asp Phe Val
325 330 335
Ser Val Pro Val Asp Lys Leu He Asn Lys Ala Tyr Ala Lys Lys He
340 345 350
Phe Asp Thr He Gin Pro Asp Thr Val Thr Pro Ser Ser Gin He Lys
355 360 365
Pro Gly Met Gly Gin Leu His Glu Gly Ser Asn Thr Thr His Tyr Ser
370 375 380
Val Ala Asp Arg Trp Gly Asn Ala Val Ser Val Thr Tyr Thr He Asn 385 390 395 400
Ala Ser Tyr Gly Ser Ala Ala Ser He Asp Gly Ala Gly Phe Leu Leu
405 410 415
Asn Asn Glu Met Asp Asp Phe Ser He Lys Pro Gly Asn Pro Asn Leu
420 425 430
Tyr Gly Leu Val Gly Gly Asp Ala Asn Ala He Glu Ala Asn Lys Arg
435 440 445
Pro Leu Ser Ser Met Ser Pro Thr He Val Leu Lys Asn Asn Lys Val
450 455 460
Phe Leu Val Val Gly Ser Pro Gly Gly Ser Arg He He Thr Thr Val 465 470 475 480
Leu Gin Val He Ser Asn Val He Asp Tyr Asn Met Asn He Ser Glu
485 490 495
Ala Val Ser Ala Pro Arg Phe His Met Gin Trp Leu Pro Asp Glu Leu
500 505 510
Arg He Glu Lys Phe Gly Met Pro Ala Asp Val Lys Asp Asn Leu Thr
515 520 525
Lys Met Gly Tyr Gin He Val Thr Lys Pro Val Met Gly Asp Val Asn
530 535 540
Ala He Gin Val Leu Pro Lys Thr Lys Gly Ser Val Phe Tyr Gly Ser 545 550 555 560
Thr Asp Pro Arg Lys Glu Phe 565
(2) INFORMATION FOR SEQ ID NO: 511: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 719 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 76...630 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511:
TGCAACACTT GTATCCTTCA AGCGAACAAG CTAAAATGGC GCAAAAAATC TTAGAAAACA 60 AGGAGAAACA CCACC ATG CAA AAC CAT GAT TTA GAG TCA ATC AAA CAA GCC 111 Met Gin Asn His Asp Leu Glu Ser He Lys Gin Ala 1 5 10
GCT TTG ATT GAA TAT GAA GTG AGA GAA CAA GGC TCT AGT ATT GTG CTA 159 Ala Leu He Glu Tyr Glu Val Arg Glu Gin Gly Ser Ser He Val Leu 15 20 25
GAC AGC AAT ATT TCC AAA GAG CCT TTA GAG TTT ATT ATA GGC ACT AAT 207 Asp Ser Asn He Ser Lys Glu Pro Leu Glu Phe He He Gly Thr Asn 30 35 40
CAA ATC ATA GCA GGG TTA GAA AAG GCG GTA TTA AAG GCT CAA ATT GGC 255 Gin He He Ala Gly Leu Glu Lys Ala Val Leu Lys Ala Gin He Gly 45 50 55 60
GAG TGG GAA GAG GTT GTC ATC GCC CCA GAG GAA GCT TAT GGG GTT TAT 303 Glu Trp Glu Glu Val Val He Ala Pro Glu Glu Ala Tyr Gly Val Tyr 65 70 75
GAA AGC AGC TAT TTG CAA GAA GTC CCT AGA GAT CAA TTT GAA GGC ATT 351 Glu Ser Ser Tyr Leu Gin Glu Val Pro Arg Asp Gin Phe Glu Gly He 80 85 90
GAA TTA GAA AAA GGC ATG AGC GTT TTT GGG CAA ACT GAA GAC AAT CAA 399 Glu Leu Glu Lys Gly Met Ser Val Phe Gly Gin Thr Glu Asp Asn Gin 95 100 105
ACC ATT CAA GCC ATT ATC AAA GAC TTT AGC GCT ACG CAT GTG ATG GTG 447 Thr He Gin Ala He He Lys Asp Phe Ser Ala Thr His Val Met Val 110 115 120
GAT TAT AAC CAC CCG TTA GCC GGG AAA ACT TTA GCG TTT CGT TTC AAG 495 Asp Tyr Asn His Pro Leu Ala Gly Lys Thr Leu Ala Phe Arg Phe Lys 125 130 135 140 GTT TTA GGT TTT AGG GAA GTG AGC GAA GAA GAA ATT TTA GCT TCA CAC 543 Val Leu Gly Phe Arg Glu Val Ser Glu Glu Glu He Leu Ala Ser His 145 150 155
CAT GGC GGT GGG ACA GGT TGC TGT GGC GGT CAT GGG GGT CAT GGC GGA 591 His Gly Gly Gly Thr Gly Cys Cys Gly Gly His Gly Gly His Gly Gly 160 165 170
AAG AAA GGT GGG GGT TGT GGT TGC TCA TGT TCG CAT GGG TAGTAAGGTA TA 642 Lys Lys Gly Gly Gly Cys Gly Cys Ser Cys Ser His Gly 175 180 185
GGAGTATTTA AAAGGCAAGG TCATGAATAG TTCTAATCTC AAAAATTGGC TATTCCCTAC 702 CATTTGCTTT TTTTTAT 719
(2) INFORMATION FOR SEQ ID NO:512:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 185 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:512:
Met Gin Asn His Asp Leu Glu Ser He Lys Gin Ala Ala Leu He Glu
1 5 10 15
Tyr Glu Val Arg Glu Gin Gly Ser Ser He Val Leu Asp Ser Asn He
20 25 30
Ser Lys Glu Pro Leu Glu Phe He He Gly Thr Asn Gin He He Ala
35 40 45
Gly Leu Glu Lys Ala Val Leu Lys Ala Gin He Gly Glu Trp Glu Glu
50 55 60
Val Val He Ala Pro Glu Glu Ala Tyr Gly Val Tyr Glu Ser Ser Tyr 65 70 75 80
Leu Gin Glu Val Pro Arg Asp Gin Phe Glu Gly He Glu Leu Glu Lys
85 90 95
Gly Met Ser Val Phe Gly Gin Thr Glu Asp Asn Gin Thr He Gin Ala
100 105 110
He He Lys Asp Phe Ser Ala Thr His Val Met Val Asp Tyr Asn His
115 120 125
Pro Leu Ala Gly Lys Thr Leu Ala Phe Arg Phe Lys Val Leu Gly Phe
130 135 140
Arg Glu Val Ser Glu Glu Glu He Leu Ala Ser His His Gly Gly Gly 145 150 155 160
Thr Gly Cys Cys Gly Gly His Gly Gly His Gly Gly Lys Lys Gly Gly
165 170 175
Gly Cys Gly Cys Ser Cys Ser His Gly 180 185
(2) INFORMATION FOR SEQ ID NO:513: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 339 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...336 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:513:
ATC TCT TCT AAG CGT TTG GAG ATT TCA AGA CTT TTG GAG CGT AAA AAT 48 He Ser Ser Lys Arg Leu Glu He Ser Arg Leu Leu Glu Arg Lys Asn 1 5 10 15
GAA CGC AGT TTT TTA GCC GAA AAA TAC CAC AAA ATC CCC ACA AAT AAA 96 Glu Arg Ser Phe Leu Ala Glu Lys Tyr His Lys He Pro Thr Asn Lys 20 25 30
AGG AAA TTT AAA GAA CGC TCT ATA ATA TCT GTT TGT GAA ATA TCC AAT 144 Arg Lys Phe Lys Glu Arg Ser He He Ser Val Cys Glu He Ser Asn 35 40 45
CCA GTA GCG CAC AAA GGG CTT AAA AGG ATC AAA AAC CCT AAC ACC ATT 192 Pro Val Ala His Lys Gly Leu Lys Arg He Lys Asn Pro Asn Thr He 50 55 60
TTA ACT ACA AAC ATT CAT CAA CTC CCT AAA CCC ATA GCC ACA CGC TTG 240 Leu Thr Thr Asn He His Gin Leu Pro Lys Pro He Ala Thr Arg Leu 65 70 75 80
TTT AAC TCG TCT TCA AAT ACC GGC ATT TGC GCT TGC AAC TGC TCT TTT 288 Phe Asn Ser Ser Ser Asn Thr Gly He Cys Ala Cys Asn Cys Ser Phe 85 90 95
AGC GCT TGC TTT TCA TTT TGT AAT TGC TTC GCA AAC GCT TCA AAC TCT T 337 Ser Ala Cys Phe Ser Phe Cys Asn Cys Phe Ala Asn Ala Ser Asn Ser 100 105 110
GA 339
(2) INFORMATION FOR SEQ ID NO: 514:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 112 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:514:
He Ser Ser Lys Arg Leu Glu He Ser Arg Leu Leu Glu Arg Lys Asn
1 5 10 15
Glu Arg Ser Phe Leu Ala Glu Lys Tyr His Lys He Pro Thr Asn Lys
20 25 30
Arg Lys Phe Lys Glu Arg Ser He He Ser Val Cys Glu He Ser Asn
35 40 45
Pro Val Ala His Lys Gly Leu Lys Arg He Lys Asn Pro Asn Thr He
50 55 60
Leu Thr Thr Asn He His Gin Leu Pro Lys Pro He Ala Thr Arg Leu 65 70 75 80
Phe Asn Ser Ser Ser Asn Thr Gly He Cys Ala Cys Asn Cys Ser Phe
85 90 95
Ser Ala Cys Phe Ser Phe Cys Asn Cys Phe Ala Asn Ala Ser Asn Ser 100 105 110
(2) INFORMATION FOR SEQ ID NO: 515:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1440 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...1338 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:515:
TAATCTATAC CAAATAGTAA GGAGTTATCT AAC ATG GGG TTT TTC AAG CTT AAA 54
Met Gly Phe Phe Lys Leu Lys 1 5
GAA CAC AAC ACT AAC ATT GCC ACC GAG TTT AGA GCG GGT TTA ACG ACC 102 Glu His Asn Thr Asn He Ala Thr Glu Phe Arg Ala Gly Leu Thr Thr 10 15 20
TTT ATC ACC ATG ATT TAC ATC GTG CCC TTA AAC GCT CTT ATC CTT TCT 150 Phe He Thr Met He Tyr He Val Pro Leu Asn Ala Leu He Leu Ser 25 30 35
CAA GCC AAC ATG CCT TAT GAA GCC CTT TTA AGT GCA ACG GCC ATT ATC 198 Gin Ala Asn Met Pro Tyr Glu Ala Leu Leu Ser Ala Thr Ala He He 40 45 50 55 ACT ATC TTA TCG AGC GTG TTT AAC GGA TTG TGG GCA AAC ACC CCT ATC 246 Thr He Leu Ser Ser Val Phe Asn Gly Leu Trp Ala Asn Thr Pro He 60 65 70
GCT ATG AGC GTG GGC TTA GGG CTG TCA GCT TAT TTT AGC TTC GGG TTG 294 Ala Met Ser Val Gly Leu Gly Leu Ser Ala Tyr Phe Ser Phe Gly Leu 75 80 85
GTT CAA GGG TTA AAA CTC CCT TGG CAG AGC GCT TTA GGC ATC GTA GCG 342 Val Gin Gly Leu Lys Leu Pro Trp Gin Ser Ala Leu Gly He Val Ala 90 95 100
CTC TCG GGA GCG ATT TTT GTG ATT TTG TCT TTC ACT AAA TTT AGA AGT 390 Leu Ser Gly Ala He Phe Val He Leu Ser Phe Thr Lys Phe Arg Ser 105 110 115
TGG GTC ATG CGA AGC ATT CCT AGC GAT TTA AGG CGT GCG GTG AGT GCG 438 Trp Val Met Arg Ser He Pro Ser Asp Leu Arg Arg Ala Val Ser Ala 120 125 130 135
GGG ATA GGG GCT TTT ATC GCG TTT ATT GGC CTT AAA GAA ATG CAT ATC 486 Gly He Gly Ala Phe He Ala Phe He Gly Leu Lys Glu Met His He 140 145 150
GTC GTT ACC CAT AAR GCT ACG CTT GTA ACC TTA GGC GAT TTT GGC GAT 534 Val Val Thr His Xaa Ala Thr Leu Val Thr Leu Gly Asp Phe Gly Asp 155 160 165
CCG CAT GTG TTA TTG GGG GTT GTG GGG ATC ATT CTA ACT TTC GCG CTC 582 Pro His Val Leu Leu Gly Val Val Gly He He Leu Thr Phe Ala Leu 170 175 180
TAC ACG CTC AAA ATC AGG GGT TCT TTC ATT ATA GCG GTC TTA ATC ACT 630 Tyr Thr Leu Lys He Arg Gly Ser Phe He He Ala Val Leu He Thr 185 190 195
TCC ATT CTC GCA TGG GTT TTA AAG CTA GCC CCT TAC CCT AGC GAG TTT 678 Ser He Leu Ala Trp Val Leu Lys Leu Ala Pro Tyr Pro Ser Glu Phe 200 205 210 215
TTT TCC ATG CCC GCT AGC ATT GGC CCT ATC GCC TTT CAA TTA GAC TTT 726 Phe Ser Met Pro Ala Ser He Gly Pro He Ala Phe Gin Leu Asp Phe 220 225 230
AAG GGC ATT TTT TTT GAT GCG AGT GGG GCT TTC ACT TTA GCG TTA GTG 774 Lys Gly He Phe Phe Asp Ala Ser Gly Ala Phe Thr Leu Ala Leu Val 235 240 245
CCA GTT ATT ATC ACT TTT TTT GTA ACC GAT TTG TTT GAT TCT TTA GGC 822 Pro Val He He Thr Phe Phe Val Thr Asp Leu Phe Asp Ser Leu Gly 250 255 260
ACG CTT GCA GGG ATT GGC CAC AAG ACT GAT TTT TTC AAT GAT GAA GAA 870 Thr Leu Ala Gly He Gly His Lys Thr Asp Phe Phe Asn Asp Glu Glu 265 270 275 AAA AAC AAG GAA TTG GAA AAG ACT TTG GAA GCG GAT GCG GTG GCT TCT 918 Lys Asn Lys Glu Leu Glu Lys Thr Leu Glu Ala Asp Ala Val Ala Ser 280 285 290 295
TTA GGG AGC GCG GTG GTG GGC GTT TCT ACT ACG ACC GCT TTT ATA GAG 966 Leu Gly Ser Ala Val Val Gly Val Ser Thr Thr Thr Ala Phe He Glu 300 305 310
AGC GCG AGT GGG GTT GAA GAG GGG GGC CGC ACA GGG CTT ACA GCG GTT 1014 Ser Ala Ser Gly Val Glu Glu Gly Gly Arg Thr Gly Leu Thr Ala Val 315 320 325
TTT ACC GGA TTA TTT TTT GTT TTA ACG CTC TTT TGC TTG CCT CTT TTA 1062 Phe Thr Gly Leu Phe Phe Val Leu Thr Leu Phe Cys Leu Pro Leu Leu 330 335 340
AAA GCT ATT CCT AGC AAT GCG ATT TAT CCG GTG CTG GTG GTA GTA GGG 1110 Lys Ala He Pro Ser Asn Ala He Tyr Pro Val Leu Val Val Val Gly 345 350 355
GTT TTG ATG TTT AGC GTG TTA GAG GGG GTG AAT TTT AAA GAC ATG GCC 1158 Val Leu Met Phe Ser Val Leu Glu Gly Val Asn Phe Lys Asp Met Ala 360 365 370 375
ATT AGC GTT TCC ACT TTT TTA ACC GTG GTG ATG ATG CCC TTA ACC TTC 1206 He Ser Val Ser Thr Phe Leu Thr Val Val Met Met Pro Leu Thr Phe 380 385 390
TCC ATT GCC GAT GGC TTA GCC TTT GGC TTT TTG TCT TAT AGT ATC ATC 1254 Ser He Ala Asp Gly Leu Ala Phe Gly Phe Leu Ser Tyr Ser He He 395 400 405
AAA TTG GTT CAA AAA GAC TTC AAA GCA CTC AAT TCA GGC ATT ATC ATT 1302 Lys Leu Val Gin Lys Asp Phe Lys Ala Leu Asn Ser Gly He He He 410 415 420
CTC TGC ATC ATT TCT GTT TCT GTA TTT ATC TTT CGT TAAGCTCTTT TTAAGG 1354 Leu Cys He He Ser Val Ser Val Phe He Phe Arg 425 430 435
GGCTTTGCAT TTTTTACTCA TTTCATGCCT CTTTTTCTTT ATTTAGACAG ATTATTATCT 1414 TAAAATAATT GTAATATCAT TATTAT 1440
(2) INFORMATION FOR SEQ ID NO: 516:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 435 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516: Met Gly Phe Phe Lys Leu Lys Glu His Asn Thr Asn He Ala Thr Glu
1 5 10 15
Phe Arg Ala Gly Leu Thr Thr Phe He Thr Met He Tyr He Val Pro
20 25 30
Leu Asn Ala Leu He Leu Ser Gin Ala Asn Met Pro Tyr Glu Ala Leu
35 40 45
Leu Ser Ala Thr Ala He He Thr He Leu Ser Ser Val Phe Asn Gly
50 55 60
Leu Trp Ala Asn Thr Pro He Ala Met Ser Val Gly Leu Gly Leu Ser 65 70 75 80
Ala Tyr Phe Ser Phe Gly Leu Val Gin Gly Leu Lys Leu Pro Trp Gin
85 90 95
Ser Ala Leu Gly He Val Ala Leu Ser Gly Ala He Phe Val He Leu
100 105 110
Ser Phe Thr Lys Phe Arg Ser Trp Val Met Arg Ser He Pro Ser Asp
115 120 125
Leu Arg Arg Ala Val Ser Ala Gly He Gly Ala Phe He Ala Phe He
130 135 140
Gly Leu Lys Glu Met His He Val Val Thr His Xaa Ala Thr Leu Val 145 150 155 160
Thr Leu Gly Asp Phe Gly Asp Pro His Val Leu Leu Gly Val Val Gly
165 170 175
He He Leu Thr Phe Ala Leu Tyr Thr Leu Lys He Arg Gly Ser Phe
180 185 190
He He Ala Val Leu He Thr Ser He Leu Ala Trp Val Leu Lys Leu
195 200 205
Ala Pro Tyr Pro Ser Glu Phe Phe Ser Met Pro Ala Ser He Gly Pro
210 215 220
He Ala Phe Gin Leu Asp Phe Lys Gly He Phe Phe Asp Ala Ser Gly 225 230 235 240
Ala Phe Thr Leu Ala Leu Val Pro Val He He Thr Phe Phe Val Thr
245 250 255
Asp Leu Phe Asp Ser Leu Gly Thr Leu Ala Gly He Gly His Lys Thr
260 265 270
Asp Phe Phe Asn Asp Glu Glu Lys Asn Lys Glu Leu Glu Lys Thr Leu
275 280 285
Glu Ala Asp Ala Val Ala Ser Leu Gly Ser Ala Val Val Gly Val Ser
290 295 300
Thr Thr Thr Ala Phe He Glu Ser Ala Ser Gly Val Glu Glu Gly Gly 305 310 315 320
Arg Thr Gly Leu Thr Ala Val Phe Thr Gly Leu Phe Phe Val Leu Thr
325 330 335
Leu Phe Cys Leu Pro Leu Leu Lys Ala He Pro Ser Asn Ala He Tyr
340 345 350
Pro Val Leu Val Val Val Gly Val Leu Met Phe Ser Val Leu Glu Gly
355 360 365
Val Asn Phe Lys Asp Met Ala He Ser Val Ser Thr Phe Leu Thr Val
370 375 380
Val Met Met Pro Leu Thr Phe Ser He Ala Asp Gly Leu Ala Phe Gly 385 390 395 400
Phe Leu Ser Tyr Ser He He Lys Leu Val Gin Lys Asp Phe Lys Ala
405 410 415
Leu Asn Ser Gly He He He Leu Cys He He Ser Val Ser Val Phe
420 425 430
He Phe Arg 435 (2) INFORMATION FOR SEQ ID NO: 517:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 843 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 66...764 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517
GCTCACTTTT TGGGATTAAG CCCCCTAGAT TATGGCAAAA ACTTATTAAA CTTTAAAGGA 60 CAACC ATG ACC CCT CAC ATC AAC GCC AAA ATC GGC GAT TTT TAT CCT CAA 110 Met Thr Pro His He Asn Ala Lys He Gly Asp Phe Tyr Pro Gin 1 5 10 15
TGC CTT TTA TGC GGC GAT CCC TTA AGG GTG AGC TAC ATT GCA AAA AAA 158 Cys Leu Leu Cys Gly Asp Pro Leu Arg Val Ser Tyr He Ala Lys Lys 20 25 30
TTC TTA CAA GAC GCC AAA GAG ATC ACG AAT GTG CGT AAC ATG CTA GGC 206 Phe Leu Gin Asp Ala Lys Glu He Thr Asn Val Arg Asn Met Leu Gly 35 40 45
TTT AGC GGG AAG TAT AAG GGT AGG GGG ATT TCT TTA ATG GGG CAT GGC 254 Phe Ser Gly Lys Tyr Lys Gly Arg Gly He Ser Leu Met Gly His Gly 50 55 60
ATG GGC ATT GCG TCA TGC ACG ATT TAT GTA ACC GAA CTC ATT AAA ACC 302 Met Gly He Ala Ser Cys Thr He Tyr Val Thr Glu Leu He Lys Thr 65 70 75
TAT CAG GTT AAA GAG CTT TTA AGG ATT GGC ACT TGC GGG GCG ATT AGC 350 Tyr Gin Val Lys Glu Leu Leu Arg He Gly Thr Cys Gly Ala He Ser 80 85 90 95
CCA AAA GTT GGC CTG AAA GAC ATT ATC ATG GCG ACG GGG GCT TCA ACG 398 Pro Lys Val Gly Leu Lys Asp He He Met Ala Thr Gly Ala Ser Thr 100 105 110
GAT TCT AAA ACC AAT CGG GTG CGT TTT TTA AAC CAC GAT TTG AGC GCA 446 Asp Ser Lys Thr Asn Arg Val Arg Phe Leu Asn His Asp Leu Ser Ala 115 120 125 ACG CCT GAT TTT GAA TTG AGT TTA AGA GCG TAT CAA ACA GCA AAG CGT 494 Thr Pro Asp Phe Glu Leu Ser Leu Arg Ala Tyr Gin Thr Ala Lys Arg 130 135 140
TTG GGT ATT GAT TTG AAA GTG GGC AAT GTT TTT TCA AGC GAT TTT TTC 542 Leu Gly He Asp Leu Lys Val Gly Asn Val Phe Ser Ser Asp Phe Phe 145 150 155
TAT TCT TTT GAA ACG CAT GCC TTT GAT TTA ATG GCT AAA TAC AAC CAC 590 Tyr Ser Phe Glu Thr His Ala Phe Asp Leu Met Ala Lys Tyr Asn His 160 165 170 175
TTG GCT ATT GAA ATG GAA GCG GCG GGG TTA TAC GCC ACG GCG ATG GAA 638 Leu Ala He Glu Met Glu Ala Ala Gly Leu Tyr Ala Thr Ala Met Glu 180 185 190
CTA AAC GCT AAG GCT TTA TGC TTA TGC TCA GTC TCA GAT CAC TTA ATC 686 Leu Asn Ala Lys Ala Leu Cys Leu Cys Ser Val Ser Asp His Leu He 195 200 205
ACT AAA GAA GCC TTA AGC CCT AAA GAA AGG GTA GAA AGC TTT GAT AAC 734 Thr Lys Glu Ala Leu Ser Pro Lys Glu Arg Val Glu Ser Phe Asp Asn 210 215 220
ATG ATA ATT TTG GCT TTG GAG ATG ATG AGT TAGCCTTTTT TGCCCCCATA AGT 787 Met He He Leu Ala Leu Glu Met Met Ser 225 230
TAAGGATAAA ATTTAAAGGA AAACCCTTAA AGCTAAAAGC CTTAAGGGAA CTTTGG 843
(2) INFORMATION FOR SEQ ID NO:518:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 233 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:518:
Met Thr Pro His He Asn Ala Lys He Gly Asp Phe Tyr Pro Gin Cys
1 5 10 15
Leu Leu Cys Gly Asp Pro Leu Arg Val Ser Tyr He Ala Lys Lys Phe
20 25 30
Leu Gin Asp Ala Lys Glu He Thr Asn Val Arg Asn Met Leu Gly Phe
35 40 45
Ser Gly Lys Tyr Lys Gly Arg Gly He Ser Leu Met Gly His Gly Met
50 55 60
Gly He Ala Ser Cys Thr He Tyr Val Thr Glu Leu He Lys Thr Tyr
65 70 75 80
Gin Val Lys Glu Leu Leu Arg He Gly Thr Cys Gly Ala He Ser Pro 85 90 95 Lys Val Gly Leu Lys Asp He He Met Ala Thr Gly Ala Ser Thr Asp
100 105 110
Ser Lys Thr Asn Arg Val Arg Phe Leu Asn His Asp Leu Ser Ala Thr
115 120 125
Pro Asp Phe Glu Leu Ser Leu Arg Ala Tyr Gin Thr Ala Lys Arg Leu
130 135 140
Gly He Asp Leu Lys Val Gly Asn Val Phe Ser Ser Asp Phe Phe Tyr 145 150 155 160
Ser Phe Glu Thr His Ala Phe Asp Leu Met Ala Lys Tyr Asn His Leu
165 170 175
Ala He Glu Met Glu Ala Ala Gly Leu Tyr Ala Thr Ala Met Glu Leu
180 185 190
Asn Ala Lys Ala Leu Cys Leu Cys Ser Val Ser Asp His Leu He Thr
195 200 205
Lys Glu Ala Leu Ser Pro Lys Glu Arg Val Glu Ser Phe Asp Asn Met
210 215 220
He He Leu Ala Leu Glu Met Met Ser 225 230
(2) INFORMATION FOR SEQ ID NO: 519:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1440 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 79...1407 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519:
TACATTAAAC TTTTATTATC AATCCTACTT CCTCAAATTG ATGACAAAAG TTGGGCATTT 60 TCTTGTAAAA TAACCCGC ATG TTT AAG AAA ATT TTT CCA TTA GCG TTA GTG 111
Met Phe Lys Lys He Phe Pro Leu Ala Leu Val 1 5 10
TCA TCG TTG CGG TTT TTG GGG CTT TTT ATT GTT TTG CCG GTC ATT AGT 159 Ser Ser Leu Arg Phe Leu Gly Leu Phe He Val Leu Pro Val He Ser 15 20 25
TTG TAT GCG GAT AGT TTC CAT TCA AGC AGT CCC TTA CTC GTG GGG TTG 207 Leu Tyr Ala Asp Ser Phe His Ser Ser Ser Pro Leu Leu Val Gly Leu 30 35 40
GCT GTG GGC GGA GCG TAT CTT ACG CAA ATT GTT TTT CAA ACC CCC ATG 255 Ala Val Gly Gly Ala Tyr Leu Thr Gin He Val Phe Gin Thr Pro Met 45 50 55 GGC ATT CTT AGC GAT AAG ATA GGC CGT AAA GTG GTG GTT ATG GTG TGC 303 Gly He Leu Ser Asp Lys He Gly Arg Lys Val Val Val Met Val Cys 60 65 70 75
TTG CTG TTG TTT TTA GCC GGC TCG TTA GTG TGC TTT ATA GCG AAT GAT 351 Leu Leu Leu Phe Leu Ala Gly Ser Leu Val Cys Phe He Ala Asn Asp 80 85 90
ATT GTT TGG CTC GTT ATA GGG CGC TTC ATT CAA GGC ATG GGG GCT TTA 399 He Val Trp Leu Val He Gly Arg Phe He Gin Gly Met Gly Ala Leu 95 100 105
GGG GGG GTT ATT AGT GCG ATG GTG GCG GAT GAA GTG AAA GAA GAA GAG 447 Gly Gly Val He Ser Ala Met Val Ala Asp Glu Val Lys Glu Glu Glu 110 115 120
CGC ACC AAA GCC ATG GCC ATC ATG GGA GCG TTT ATT TTC ATT AGC TTC 495 Arg Thr Lys Ala Met Ala He Met Gly Ala Phe He Phe He Ser Phe 125 130 135
ACT ATA AGC ATG GCG ATT GGC CCT GGG GTT GTA GCG TTT TTG GGG GGG 543 Thr He Ser Met Ala He Gly Pro Gly Val Val Ala Phe Leu Gly Gly 140 145 150 155
GCA AAA TGG CTC TTT TTA CTC ACG GCG ATC TTA ACT TTA TTG AGT TTA 591 Ala Lys Trp Leu Phe Leu Leu Thr Ala He Leu Thr Leu Leu Ser Leu 160 165 170
TTG ATG CTT TTA AAA GTC AAA GAC GCC CCT AAA ATT TCT TAC CAG ATC 639 Leu Met Leu Leu Lys Val Lys Asp Ala Pro Lys He Ser Tyr Gin He 175 180 185
AAA AAC ATA AAA GCT TAC CAA CCC AAC TCT AAA GCC TTG TAT CTT TTG 687 Lys Asn He Lys Ala Tyr Gin Pro Asn Ser Lys Ala Leu Tyr Leu Leu 190 195 200
TAT CTA AGC TCT TTT TTT GAA AAA GCG TTC ATG ACG CTT ATT TTT GTG 735 Tyr Leu Ser Ser Phe Phe Glu Lys Ala Phe Met Thr Leu He Phe Val 205 210 215
CTG ATC CCT TTA GCC TTA GTG AAT GAA TTT CAT AAA GAT GAA AGC TTT 783 Leu He Pro Leu Ala Leu Val Asn Glu Phe His Lys Asp Glu Ser Phe 220 225 230 235
TTA ATC TTG GTG TAT GTG CCT GGA GCC TTA TTA GGG GTC TTA AGC ATG 831 Leu He Leu Val Tyr Val Pro Gly Ala Leu Leu Gly Val Leu Ser Met 240 245 250
GGA ATA GCG AGC GTT ATG GCT GAA AAA TAC AAC AAG CCT AAA GGA GTG 879 Gly He Ala Ser Val Met Ala Glu Lys Tyr Asn Lys Pro Lys Gly Val 255 260 265
ATG CTT TCT GGC GTA TTA TTG TTT ATT GTG AGT TAT TTG TGC TTG TTT 927 Met Leu Ser Gly Val Leu Leu Phe He Val Ser Tyr Leu Cys Leu Phe 270 275 280 TTA GCC GAC TCT AGC TTT TTA GGG AAA TAT TTA TGG CTT TTT ATT GTT 975 Leu Ala Asp Ser Ser Phe Leu Gly Lys Tyr Leu Trp Leu Phe He Val 285 290 295
GGG GTG GCG TTT TTC TTT ATT GGT TTT GCC ACC TTA GAG CCT ATC ATG 1023 Gly Val Ala Phe Phe Phe He Gly Phe Ala Thr Leu Glu Pro He Met 300 305 310 315
CAA TCT TTA GCG TCT AAA TTC GCC AAA GTG CAT GAA AAA GGC AAG GTT 1071 Gin Ser Leu Ala Ser Lys Phe Ala Lys Val His Glu Lys Gly Lys Val 320 325 330
TTA GGG CAA TTC ACT ACT TTT GGC TAT TTA GGG AGC TTT GTT GGG GGC 1119 Leu Gly Gin Phe Thr Thr Phe Gly Tyr Leu Gly Ser Phe Val Gly Gly 335 340 345
GTG AGC GGG GGG TTG AGC TAC CAT CAT TTA GGC GTT TCT AAC ACA AGC 1167 Val Ser Gly Gly Leu Ser Tyr His His Leu Gly Val Ser Asn Thr Ser 350 355 360
TTG ATC GTT GTA GCT TTA GGG CTT ATT TGG GGG CTA TCG CTC TTT TTA 1215 Leu He Val Val Ala Leu Gly Leu He Trp Gly Leu Ser Leu Phe Leu 365 370 375
CTC AAC AAC CCT TCC AAG CAA AAA AAT GTC TAT TTC CCC TTA GAC GCT 1263 Leu Asn Asn Pro Ser Lys Gin Lys Asn Val Tyr Phe Pro Leu Asp Ala 380 385 390 395
TAC AAT GAG GAA CAA TTT GAA ACT TTA GAG GAT AAA ATC ATT GAA TGG 1311 Tyr Asn Glu Glu Gin Phe Glu Thr Leu Glu Asp Lys He He Glu Trp 400 405 410
TAT GTT AAT ATT AGC GAA GAA ATC ATT ATT GTG AAA TAT AAT TCC GAT 1359 Tyr Val Asn He Ser Glu Glu He He He Val Lys Tyr Asn Ser Asp 415 420 425
CAC ATT AGC GAA GAA GAA ATC ATT CAC TTA GCG CAA AAC TTT AGA AAA T 1408 His He Ser Glu Glu Glu He He His Leu Ala Gin Asn Phe Arg Lys 430 435 440
AAAACAATTA AGGATCAAAA ATGGCCTATG AA 1440
(2) INFORMATION FOR SEQ ID NO:520:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 443 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520: Met Phe Lys Lys He Phe Pro Leu Ala Leu Val Ser Ser Leu Arg Phe
1 5 10 15
Leu Gly Leu Phe He Val Leu Pro Val He Ser Leu Tyr Ala Asp Ser
20 25 30
Phe His Ser Ser Ser Pro Leu Leu Val Gly Leu Ala Val Gly Gly Ala
35 40 45
Tyr Leu Thr Gin He Val Phe Gin Thr Pro Met Gly He Leu Ser Asp
50 55 60
Lys He Gly Arg Lys Val Val Val Met Val Cys Leu Leu Leu Phe Leu 65 70 75 80
Ala Gly Ser Leu Val Cys Phe He Ala Asn Asp He Val Trp Leu Val
85 90 95
He Gly Arg Phe He Gin Gly Met Gly Ala Leu Gly Gly Val He Ser
100 105 110
Ala Met Val Ala Asp Glu Val Lys Glu Glu Glu Arg Thr Lys Ala Met
115 120 125
Ala He Met Gly Ala Phe He Phe He Ser Phe Thr He Ser Met Ala
130 135 140
He Gly Pro Gly Val Val Ala Phe Leu Gly Gly Ala Lys Trp Leu Phe 145 150 155 160
Leu Leu Thr Ala He Leu Thr Leu Leu Ser Leu Leu Met Leu Leu Lys
165 170 175
Val Lys Asp Ala Pro Lys He Ser Tyr Gin He Lys Asn He Lys Ala
180 185 190
Tyr Gin Pro Asn Ser Lys Ala Leu Tyr Leu Leu Tyr Leu Ser Ser Phe
195 200 205
Phe Glu Lys Ala Phe Met Thr Leu He Phe Val Leu He Pro Leu Ala
210 215 220
Leu Val Asn Glu Phe His Lys Asp Glu Ser Phe Leu He Leu Val Tyr 225 230 235 240
Val Pro Gly Ala Leu Leu Gly Val Leu Ser Met Gly He Ala Ser Val
245 250 255
Met Ala Glu Lys Tyr Asn Lys Pro Lys Gly Val Met Leu Ser Gly Val
260 265 270
Leu Leu Phe He Val Ser Tyr Leu Cys Leu Phe Leu Ala Asp Ser Ser
275 280 285
Phe Leu Gly Lys Tyr Leu Trp Leu Phe He Val Gly Val Ala Phe Phe
290 295 300
Phe He Gly Phe Ala Thr Leu Glu Pro He Met Gin Ser Leu Ala Ser 305 310 315 320
Lys Phe Ala Lys Val His Glu Lys Gly Lys Val Leu Gly Gin Phe Thr
325 330 335
Thr Phe Gly Tyr Leu Gly Ser Phe Val Gly Gly Val Ser Gly Gly Leu
340 345 350
Ser Tyr His His Leu Gly Val Ser Asn Thr Ser Leu He Val Val Ala
355 360 365
Leu Gly Leu He Trp Gly Leu Ser Leu Phe Leu Leu Asn Asn Pro Ser
370 375 380
Lys Gin Lys Asn Val Tyr Phe Pro Leu Asp Ala Tyr Asn Glu Glu Gin 385 390 395 400
Phe Glu Thr Leu Glu Asp Lys He He Glu Trp Tyr Val Asn He Ser
405 410 415
Glu Glu He He He Val Lys Tyr Asn Ser Asp His He Ser Glu Glu
420 425 430
Glu He He His Leu Ala Gin Asn Phe Arg Lys 435 440
(2) INFORMATION FOR SEQ ID NO: 521:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1360 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 89...1237 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521:
GAAATCAAAA AAGGGGTTTT TACCACTTAT AAAGAGAAAA ACCCTATCGC TTTGAAAACT 60 TAACCAATAA ATAAGGTAAA ATTAAGAC ATG CAT GCA GAA TTT TTC ACT TTC 112
Met His Ala Glu Phe Phe Thr Phe 1 5
GCG CTC ATC ATG CTT TTA ATT GTG ATG GCC CCT TAT ATG TCT AGA ATC 160 Ala Leu He Met Leu Leu He Val Met Ala Pro Tyr Met Ser Arg He 10 15 20
TCT CGT TTG CCT ATC ACG GTT GTG GAG ATT TTA TTT GGA TCT GTT GGG 208 Ser Arg Leu Pro He Thr Val Val Glu He Leu Phe Gly Ser Val Gly 25 30 35 40
GCG TAT GTG GGT TTT ATT GAG CCT ACT AAG GGC TTT GAA ATC ATG TCC 256 Ala Tyr Val Gly Phe He Glu Pro Thr Lys Gly Phe Glu He Met Ser 45 50 55
GAA ATT GGC TTT TTG TTT TTA ATG TTT TTA TGC GGT TTG GAA GTG GAG 304 Glu He Gly Phe Leu Phe Leu Met Phe Leu Cys Gly Leu Glu Val Glu 60 65 70
ATT TAT CTG TTC AAA AAA TTA GGG GTT TCT CTT TTA AAA CGC ATT TTT 352 He Tyr Leu Phe Lys Lys Leu Gly Val Ser Leu Leu Lys Arg He Phe 75 80 85
GCT TAT CTG TTG ATT TTA TAC ACG CTT TCG TTT ATT CTT ACT TTT AGC 400 Ala Tyr Leu Leu He Leu Tyr Thr Leu Ser Phe He Leu Thr Phe Ser 90 95 100
CTT AAT TTA GAG CCT ATT TTT ATG GTG ATT TTC CCT ATT ATT AGT TTG 448 Leu Asn Leu Glu Pro He Phe Met Val He Phe Pro He He Ser Leu 105 110 115 120 GGC ATG ATC ATG ACT TTA GTC AAA GAT TAT CGT AAA GAG ATT TTG TGG 496 Gly Met He Met Thr Leu Val Lys Asp Tyr Arg Lys Glu He Leu Trp 125 130 135
CTT GAT TTG GTT TTG AAA GTG GGC GTT ATT GGG GAA TTG TTA AGC ATT 544 Leu Asp Leu Val Leu Lys Val Gly Val He Gly Glu Leu Leu Ser He 140 145 150
TTT GGT TTG GTG GTC GTG GAT GGG GTG TAT TCG CAT GGT TTG GGC ATG 592 Phe Gly Leu Val Val Val Asp Gly Val Tyr Ser His Gly Leu Gly Met 155 160 165
GAT TTG ATT AAA GAT TTA GGC ATT CTC ATT GTT TTT TTA ATC TTA ATT 640 Asp Leu He Lys Asp Leu Gly He Leu He Val Phe Leu He Leu He 170 175 180
ATC GTG GCG TTT CAA ATC TTT AAG ACT TTG TTT TGG TGG TTC CCG CAT 688 He Val Ala Phe Gin He Phe Lys Thr Leu Phe Trp Trp Phe Pro His 185 190 195 200
TTA AAG CTT TTT GTG ATG CCT AAA AGC AGT CAG TTT AAC CAA GAT GTG 736 Leu Lys Leu Phe Val Met Pro Lys Ser Ser Gin Phe Asn Gin Asp Val 205 210 215
CGT TTT TCG CTC ATG CTC TTT TTT TCC TTA GTT GCG ATC GTG GTG TGG 784 Arg Phe Ser Leu Met Leu Phe Phe Ser Leu Val Ala He Val Val Trp 220 225 230
CTC AAA ATA GAA ATG GTT TTA GGG GCG TTT CTA GCA GGG TTA GTC GTT 832 Leu Lys He Glu Met Val Leu Gly Ala Phe Leu Ala Gly Leu Val Val 235 240 245
TCT ACT TTT TTC CCT CAT AAA TCA GAA TTG ATC CAC AAG CTC AAT GAT 880 Ser Thr Phe Phe Pro His Lys Ser Glu Leu He His Lys Leu Asn Asp 250 255 260
GTG GGT TTT GGG TTT TTT GTG CCT TTG TTT TTC ATC CAT GTA GGC TCT 928 Val Gly Phe Gly Phe Phe Val Pro Leu Phe Phe He His Val Gly Ser 265 270 275 280
ACT TTA GAC TTA AAA TTA GTG TTT TTA AAC CCG CAT TTG ATT CTC CAA 976 Thr Leu Asp Leu Lys Leu Val Phe Leu Asn Pro His Leu He Leu Gin 285 290 295
GGG ATA TTG ATT GTC ATA GCG ATG TTG AGT TTG CAC TTG ATC ACT TCA 1024 Gly He Leu He Val He Ala Met Leu Ser Leu His Leu He Thr Ser 300 305 310
ACC TTA TTG TGG CGC AAA TAC TTT AAA GAG GCT AAG CAT TTA TTT TCA 1072 Thr Leu Leu Trp Arg Lys Tyr Phe Lys Glu Ala Lys His Leu Phe Ser 315 320 325
TTC GCT TTA GGG GCT TCT ATG CCT TTA ACT TTT TTA GTA ACC ACC GCA 1120 Phe Ala Leu Gly Ala Ser Met Pro Leu Thr Phe Leu Val Thr Thr Ala 330 335 340 GCA GTA GGC TTA AAA GCG CAA GCG ATC TCA CAA AAC ACC TAC TAC GCA 1168 Ala Val Gly Leu Lys Ala Gin Ala He Ser Gin Asn Thr Tyr Tyr Ala 345 350 355 360
TTG CTC ATG GCG GCT ATT TTT GAA GGG GTA TTA TTC ACG ATT GCG ATC 1216 Leu Leu Met Ala Ala He Phe Glu Gly Val Leu Phe Thr He Ala He 365 370 375
AAA ATA CTC AAC AAA AAA GCT TGAATGAAAG CTTAAGCGTC TAAATATTTA GCGT 1271 Lys He Leu Asn Lys Lys Ala 380
CGCTAAAGCT GTTCGCTTGA ACATTATTGA ACGCATTCTC TAAGCTATCA AAGAAACGAG 1331 GGTGCAAGTT TTGCATTTCT TTTAAGAAA 1360
(2) INFORMATION FOR SEQ ID NO:522:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 383 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522:
Met His Ala Glu Phe Phe Thr Phe Ala Leu He Met Leu Leu He Val
1 5 10 15
Met Ala Pro Tyr Met Ser Arg He Ser Arg Leu Pro He Thr Val Val
20 25 30
Glu He Leu Phe Gly Ser Val Gly Ala Tyr Val Gly Phe He Glu Pro
35 40 45
Thr Lys Gly Phe Glu He Met Ser Glu He Gly Phe Leu Phe Leu Met
50 55 60
Phe Leu Cys Gly Leu Glu Val Glu He Tyr Leu Phe Lys Lys Leu Gly 65 70 75 80
Val Ser Leu Leu Lys Arg He Phe Ala Tyr Leu Leu He Leu Tyr Thr
85 90 95
Leu Ser Phe He Leu Thr Phe Ser Leu Asn Leu Glu Pro He Phe Met
100 105 110
Val He Phe Pro He He Ser Leu Gly Met He Met Thr Leu Val Lys
115 120 125
Asp Tyr Arg Lys Glu He Leu Trp Leu Asp Leu Val Leu Lys Val Gly
130 135 140
Val He Gly Glu Leu Leu Ser He Phe Gly Leu Val Val Val Asp Gly 145 150 155 160
Val Tyr Ser His Gly Leu Gly Met Asp Leu He Lys Asp Leu Gly He
165 170 175
Leu He Val Phe Leu He Leu He He Val Ala Phe Gin He Phe Lys
180 185 190
Thr Leu Phe Trp Trp Phe Pro His Leu Lys Leu Phe Val Met Pro Lys
195 200 205
Ser Ser Gin Phe Asn Gin Asp Val Arg Phe Ser Leu Met Leu Phe Phe 210 215 220
Ser Leu Val Ala He Val Val Trp Leu Lys He Glu Met Val Leu Gly 225 230 235 240
Ala Phe Leu Ala Gly Leu Val Val Ser Thr Phe Phe Pro His Lys Ser
245 250 255
Glu Leu He His Lys Leu Asn Asp Val Gly Phe Gly Phe Phe Val Pro
260 265 270
Leu Phe Phe He His Val Gly Ser Thr Leu Asp Leu Lys Leu Val Phe
275 280 285
Leu Asn Pro His Leu He Leu Gin Gly He Leu He Val He Ala Met
290 295 300
Leu Ser Leu His Leu He Thr Ser Thr Leu Leu Trp Arg Lys Tyr Phe 305 310 315 320
Lys Glu Ala Lys His Leu Phe Ser Phe Ala Leu Gly Ala Ser Met Pro
325 330 335
Leu Thr Phe Leu Val Thr Thr Ala Ala Val Gly Leu Lys Ala Gin Ala
340 345 350
He Ser Gin Asn Thr Tyr Tyr Ala Leu Leu Met Ala Ala He Phe Glu
355 360 365
Gly Val Leu Phe Thr He Ala He Lys He Leu Asn Lys Lys Ala 370 375 380
(2) INFORMATION FOR SEQ ID NO:523:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1024 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 115...921 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:523:
AGTTGGCAAA AACGCAGAGA CAGTAACGCA AAGGCAAATA AAGAGACTCA TTTTAAACAA 60 GCGAATGCCA TTACAAATAT AATCAGATCA GTTGGTGGGT TTTTTACAAA GATT ATG 117
Met
1
AAG AGA GTT AGA GAA CTT GTA AAA AAA CAT CCC GAG AAA AGC AGT GTG 165 Lys Arg Val Arg Glu Leu Val Lys Lys His Pro Glu Lys Ser Ser Val 5 10 15
GCA TTA GTA GTA TTA ACC CAT GCT GCA TGC AAG AAA GCG AAA GAA TTG 213 Ala Leu Val Val Leu Thr His Ala Ala Cys Lys Lys Ala Lys Glu Leu 20 25 30 GAC GAT AAA GTC CAG GAT AAA TCC AAA CAA GCT GAA AAA GAA AAT CAA 261 Asp Asp Lys Val Gin Asp Lys Ser Lys Gin Ala Glu Lys Glu Asn Gin 35 40 45
ATC AAT TGG TGG AAA TAT TCA GGA TTA ACA ATA GCG ACA AGT TTA TTA 309 He Asn Trp Trp Lys Tyr Ser Gly Leu Thr He Ala Thr Ser Leu Leu 50 55 60 65
TTA GCC GCT TGT AGT GTT GGT GAT ATT GAT AAA CAG ATA GAG TTA GAA 357 Leu Ala Ala Cys Ser Val Gly Asp He Asp Lys Gin He Glu Leu Glu 70 75 80
CAA GAA AAA AAG GAA GCT GAA AAC GCT AGG GAT AGA GCG AAC AAG AGT 405 Gin Glu Lys Lys Glu Ala Glu Asn Ala Arg Asp Arg Ala Asn Lys Ser 85 90 95
GGG ATA GAA CTG GAA CAG GAA AAA CAA AAG ACC ATT AAA GAA CAA AAA 453 Gly He Glu Leu Glu Gin Glu Lys Gin Lys Thr He Lys Glu Gin Lys 100 105 110
GAT TTA GTT AAA AAA GCA GAA CAA AAT TGC CAA GAA AAT CAT GGC CAA 501 Asp Leu Val Lys Lys Ala Glu Gin Asn Cys Gin Glu Asn His Gly Gin 115 120 125
TTC TTT ATG AAA AAA TTA GGA ATT AAG GGT GGC ATT GCT ATA GAA GTA 549 Phe Phe Met Lys Lys Leu Gly He Lys Gly Gly He Ala He Glu Val 130 135 140 145
GAA GCT GAA TGC AAA ACC CCT AAA CCT GCA AAA ACC AAT CAA ACC CCT 597 Glu Ala Glu Cys Lys Thr Pro Lys Pro Ala Lys Thr Asn Gin Thr Pro 150 155 160
ATC CAG CCA AAA CAC CTC CCC AAC TCT AAA CAA CCC CAC TCT CAA AGA 645 He Gin Pro Lys His Leu Pro Asn Ser Lys Gin Pro His Ser Gin Arg 165 170 175
GGA TCA AAA GCG CAA GAG CTT ATC GCT TAT TTG CAA AAA GAG TTA GAA 693 Gly Ser Lys Ala Gin Glu Leu He Ala Tyr Leu Gin Lys Glu Leu Glu 180 185 190
TCT CTG CCC TAT TCA CAA AAA GCT ATC GCT AAA CAA GTG AAT TTT TAC 741 Ser Leu Pro Tyr Ser Gin Lys Ala He Ala Lys Gin Val Asn Phe Tyr 195 200 205
AGG CCA AGT TCT GTC GCT TAT TTA GAA CTA GAC CCT AGA GAT TTT AAG 789 Arg Pro Ser Ser Val Ala Tyr Leu Glu Leu Asp Pro Arg Asp Phe Lys 210 215 220 225
GTT ACA GAA GAA TGG CAA AAA GAA AAT CTA AAA ATA CGC TCT AAA GCT 837 Val Thr Glu Glu Trp Gin Lys Glu Asn Leu Lys He Arg Ser Lys Ala 230 235 240
CAA GCT AAA ATG CTT GGA AAT GAG AAA CCC ACA AGC CCA CCT TTC AAC 885 Gin Ala Lys Met Leu Gly Asn Glu Lys Pro Thr Ser Pro Pro Phe Asn 245 250 255 CTC TCA AAG CCT TTT GTT CGT TCA AAA AAT ATT TGC TGATGTTAAT AAAGAA 937 Leu Ser Lys Pro Phe Val Arg Ser Lys Asn He Cys 260 265
ATAGAAGCAG TTGCTAATAC TGAAAAGAAA GCAGAAAAAG MGGGTTATGG TTATAGTAAA 997 AGGATGTAGG CATAAGAAAA TAAGAAC 1024
(2) INFORMATION FOR SEQ ID NO:524:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 269 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524:
Met Lys Arg Val Arg Glu Leu Val Lys Lys His Pro Glu Lys Ser Ser
1 5 10 15
Val Ala Leu Val Val Leu Thr His Ala Ala Cys Lys Lys Ala Lys Glu
20 25 30
Leu Asp Asp Lys Val Gin Asp Lys Ser Lys Gin Ala Glu Lys Glu Asn
35 40 45
Gin He Asn Trp Trp Lys Tyr Ser Gly Leu Thr He Ala Thr Ser Leu
50 55 60
Leu Leu Ala Ala Cys Ser Val Gly Asp He Asp Lys Gin He Glu Leu 65 70 75 80
Glu Gin Glu Lys Lys Glu Ala Glu Asn Ala Arg Asp Arg Ala Asn Lys
85 90 95
Ser Gly He Glu Leu Glu Gin Glu Lys Gin Lys Thr He Lys Glu Gin
100 105 110
Lys Asp Leu Val Lys Lys Ala Glu Gin Asn Cys Gin Glu Asn His Gly
115 120 125
Gin Phe Phe Met Lys Lys Leu Gly He Lys Gly Gly He Ala He Glu
130 135 140
Val Glu Ala Glu Cys Lys Thr Pro Lys Pro Ala Lys Thr Asn Gin Thr 145 150 155 160
Pro He Gin Pro Lys His Leu Pro Asn Ser Lys Gin Pro His Ser Gin
165 170 175
Arg Gly Ser Lys Ala Gin Glu Leu He Ala Tyr Leu Gin Lys Glu Leu
180 185 190
Glu Ser Leu Pro Tyr Ser Gin Lys Ala He Ala Lys Gin Val Asn Phe
195 200 205
Tyr Arg Pro Ser Ser Val Ala Tyr Leu Glu Leu Asp Pro Arg Asp Phe
210 215 220
Lys Val Thr Glu Glu Trp Gin Lys Glu Asn Leu Lys He Arg Ser Lys 225 230 235 240
Ala Gin Ala Lys Met Leu Gly Asn Glu Lys Pro Thr Ser Pro Pro Phe
245 250 255
Asn Leu Ser Lys Pro Phe Val Arg Ser Lys Asn He Cys 260 265 (2) INFORMATION FOR SEQ ID NO: 525:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 535 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...482 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525:
GTGATCCCTA AAGAATATAT CCTGCGGTGG ATAAGGGTAT CCAAGAAGCG ATG CAA 56
Met Gin 1
AAT GGC GTT TTG GCA GGC TAT CCG GTG GTG GAT TTT AAA GTT ACC CTT 104 Asn Gly Val Leu Ala Gly Tyr Pro Val Val Asp Phe Lys Val Thr Leu 5 10 15
TAT GAT GGG AGC TAC CAT GAT GTG GAT TCT TCA GAA ATG GCG TTT AAA 152 Tyr Asp Gly Ser Tyr His Asp Val Asp Ser Ser Glu Met Ala Phe Lys 20 25 30
ATC GCT GGC TCT ATG GCG TTT AAA GAA GCG AGT CGC GCG GCT AAC CCG 200 He Ala Gly Ser Met Ala Phe Lys Glu Ala Ser Arg Ala Ala Asn Pro 35 40 45 50
GTT TTA CTA GAG CCT ATG ATG AAA GTG GAA GTG GAA GTC CCT GAA GAA 248 Val Leu Leu Glu Pro Met Met Lys Val Glu Val Glu Val Pro Glu Glu 55 60 65
TAC ATG GGC GAT GTG ATT GGC GAT TTG AAT AGA AGA AGA GGG CAA ATC 296 Tyr Met Gly Asp Val He Gly Asp Leu Asn Arg Arg Arg Gly Gin He 70 75 80
AAT TCT ATG GAC GAT AGA TTA GGC TTG AAA ATC GTG AAC GCT TTT GTG 344 Asn Ser Met Asp Asp Arg Leu Gly Leu Lys He Val Asn Ala Phe Val 85 90 95
CCG TTG GTG GAA ATG TTT GGC TAT TCT ACG GAT TTA CGA TCA GCC ACC 392 Pro Leu Val Glu Met Phe Gly Tyr Ser Thr Asp Leu Arg Ser Ala Thr 100 105 110
CAA GGG CGT GGG ACT TAC TCT ATG GAG TTT GAT CAT TAT GGC GAA GTG 440 Gin Gly Arg Gly Thr Tyr Ser Met Glu Phe Asp His Tyr Gly Glu Val 115 120 125 130 CCT AGC AAT ATC GCT AAG GAA ATT GTA GAA AAG CGC AAA GGC TGATTTAAT 491 Pro Ser Asn He Ala Lys Glu He Val Glu Lys Arg Lys Gly 135 140
TATAACGCTC TCTTATTTTT AGGGGGTGTT ATAGGTGCTG TTTA 535
(2) INFORMATION FOR SEQ ID NO: 526:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 144 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526:
Met Gin Asn Gly Val Leu Ala Gly Tyr Pro Val Val Asp Phe Lys Val
1 5 10 15
Thr Leu Tyr Asp Gly Ser Tyr His Asp Val Asp Ser Ser Glu Met Ala
20 25 30
Phe Lys He Ala Gly Ser Met Ala Phe Lys Glu Ala Ser Arg Ala Ala
35 40 45
Asn Pro Val Leu Leu Glu Pro Met Met Lys Val Glu Val Glu Val Pro
50 55 60
Glu Glu Tyr Met Gly Asp Val He Gly Asp Leu Asn Arg Arg Arg Gly 65 70 75 80
Gin He Asn Ser Met Asp Asp Arg Leu Gly Leu Lys He Val Asn Ala
85 90 95
Phe Val Pro Leu Val Glu Met Phe Gly Tyr Ser Thr Asp Leu Arg Ser
100 105 110
Ala Thr Gin Gly Arg Gly Thr Tyr Ser Met Glu Phe Asp His Tyr Gly
115 120 125
Glu Val Pro Ser Asn He Ala Lys Glu He Val Glu Lys Arg Lys Gly 130 135 140
(2) INFORMATION FOR SEQ ID NO: 527:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 740 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...671 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:527:
CTCTCTCTTC AAGCTTGGAT AACAAATGCG GTTTGAAGAG TTCTAACGCC ATG TTT 56
Met Phe
1
TTA GGC AAC CCG CAT TCA TCC ATT TTG AGA TTA GGC CCA ACC ACA ATC 104 Leu Gly Asn Pro His Ser Ser He Leu Arg Leu Gly Pro Thr Thr He 5 10 15
ACG CTT CTG CCT GAA AAA TCC ACG CGC TTA CCT AAA AGG TTT TGC CTG 152 Thr Leu Leu Pro Glu Lys Ser Thr Arg Leu Pro Lys Arg Phe Cys Leu 20 25 30
AAA CGC CCC TGC TTG CCT TTA ATG ATT TCA CTG AGC GAT TTT AAA GGG 200 Lys Arg Pro Cys Leu Pro Leu Met He Ser Leu Ser Asp Phe Lys Gly 35 40 45 50
CGT TTG TTA GCC CCT TTA ACC GCA TTA GTG CTG CGG CCG TTA TCA AAA 248 Arg Leu Leu Ala Pro Leu Thr Ala Leu Val Leu Arg Pro Leu Ser Lys 55 60 65
AGC ACA TCC ACG GCT TCT TGC AAC ATC CTT TTT TCA TTG CGC ACA ATG 296 Ser Thr Ser Thr Ala Ser Cys Asn He Leu Phe Ser Leu Arg Thr Met 70 75 80
ATT TCT GGC GCT CCA AGC TCC ATT AAG CGT TTC AAG CGT TGG TTA CGA 344 He Ser Gly Ala Pro Ser Ser He Lys Arg Phe Lys Arg Trp Leu Arg 85 90 95
TTG ATG ACA CGA CGA TAC AAT TCA TTC ACA TCG CTG ACT GCA AAC TTC 392 Leu Met Thr Arg Arg Tyr Asn Ser Phe Thr Ser Leu Thr Ala Asn Phe 100 105 110
CCG CCA TCT AGC GCG ACT AAA GGC CTT AAA TCC GGT GGC AAT ACC GGT 440 Pro Pro Ser Ser Ala Thr Lys Gly Leu Lys Ser Gly Gly Asn Thr Gly 115 120 125 130
AAA ACC GTG AGC ATC ATC CAT TCA GGC CTA TTA CCA GAA TTT AAA AAG 488 Lys Thr Val Ser He He His Ser Gly Leu Leu Pro Glu Phe Lys Lys 135 140 145
CTT TCT ACC ACT TTC AAA CGC TTA ATG AGT TTT TTC TTT TTC GCA TCA 536 Leu Ser Thr Thr Phe Lys Arg Leu Met Ser Phe Phe Phe Phe Ala Ser 150 155 160
GAA TTG GTG TCT TTC ACT TCT TCT TTC AAA CTC TGC AAT AAG GTG ATC 584 Glu Leu Val Ser Phe Thr Ser Ser Phe Lys Leu Cys Asn Lys Val He 165 170 175
AAA TCA ATT TCT TCT AAC AAA TCC TTG ATC GCT TCA CCG CCC ATT TGC 632 Lys Ser He Ser Ser Asn Lys Ser Leu He Ala Ser Pro Pro He Cys 180 185 190
GCT ACA AAG CCC CTG TCT TCG TAT CTT CGT GAG ATA TTT TGATACTGCT CT 683 Ala Thr Lys Pro Leu Ser Ser Tyr Leu Arg Glu He Phe 195 200 205
TCATTCAAAA TATCGTATTT CATCACAAGC TTAGTGCCTT CATTGTCATA AGCGGCT 740
(2) INFORMATION FOR SEQ ID NO:528:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 207 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 528:
Met Phe Leu Gly Asn Pro His Ser Ser He Leu Arg Leu Gly Pro Thr
1 5 10 15
Thr He Thr Leu Leu Pro Glu Lys Ser Thr Arg Leu Pro Lys Arg Phe
20 25 30
Cys Leu Lys Arg Pro Cys Leu Pro Leu Met He Ser Leu Ser Asp Phe
35 40 45
Lys Gly Arg Leu Leu Ala Pro Leu Thr Ala Leu Val Leu Arg Pro Leu
50 55 60
Ser Lys Ser Thr Ser Thr Ala Ser Cys Asn He Leu Phe Ser Leu Arg 65 70 75 80
Thr Met He Ser Gly Ala Pro Ser Ser He Lys Arg Phe Lys Arg Trp
85 90 95
Leu Arg Leu Met Thr Arg Arg Tyr Asn Ser Phe Thr Ser Leu Thr Ala
100 105 110
Asn Phe Pro Pro Ser Ser Ala Thr Lys Gly Leu Lys Ser Gly Gly Asn
115 120 125
Thr Gly Lys Thr Val Ser He He His Ser Gly Leu Leu Pro Glu Phe
130 135 140
Lys Lys Leu Ser Thr Thr Phe Lys Arg Leu Met Ser Phe Phe Phe Phe 145 150 155 160
Ala Ser Glu Leu Val Ser Phe Thr Ser Ser Phe Lys Leu Cys Asn Lys
165 170 175
Val He Lys Ser He Ser Ser Asn Lys Ser Leu He Ala Ser Pro Pro
180 185 190
He Cys Ala Thr Lys Pro Leu Ser Ser Tyr Leu Arg Glu He Phe 195 200 205
(2) INFORMATION FOR SEQ ID NO: 529:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 505 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...411 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529:
CAA TGT GGT GAG AGA GAT TTT AGA GGG CTT TTC TTT CAA GCA ATA CGA 48 Gin Cys Gly Glu Arg Asp Phe Arg Gly Leu Phe Phe Gin Ala He Arg 1 5 10 15
GTG TCT AAT TAC GCT AGA AAT TAT CAA GTC AAA CAC AAC TTG GCT TAC 96 Val Ser Asn Tyr Ala Arg Asn Tyr Gin Val Lys His Asn Leu Ala Tyr 20 25 30
TGG GGG GCT AAA GAT TAT TTA GGG TGC GGG GCT GGG GCT GTG GGC TGC 144 Trp Gly Ala Lys Asp Tyr Leu Gly Cys Gly Ala Gly Ala Val Gly Cys 35 40 45
GTG GCG AAT GAG CGC TTT TTT GCA AAA AAA CTC ATA GAA AAC TAC ATC 192 Val Ala Asn Glu Arg Phe Phe Ala Lys Lys Leu He Glu Asn Tyr He 50 55 60
AAA GAC CCC CTA CAA CGC CAA GTT GAG ACG CTT AAT AAA CAA GAC AAA 240 Lys Asp Pro Leu Gin Arg Gin Val Glu Thr Leu Asn Lys Gin Asp Lys 65 70 75 80
CGC TTA GAA AAG CTG TTT TTA GGC TTG AGG TGC GTG CTT GGG GTT GAG 288 Arg Leu Glu Lys Leu Phe Leu Gly Leu Arg Cys Val Leu Gly Val Glu 85 90 95
CTT AGT TTC TTA GAT GAA AAT AAA GTA AAG TTT TTG ATT GAA GAG AAC 336 Leu Ser Phe Leu Asp Glu Asn Lys Val Lys Phe Leu He Glu Glu Asn 100 105 110
AAG GCT TTC ATT AAA AAT AAC CGC TTG ATA GCG AGC GAT TTT TTC ATG 384 Lys Ala Phe He Lys Asn Asn Arg Leu He Ala Ser Asp Phe Phe Met 115 120 125
GCC GAT GAA ATG GCT TTG TGG CTG TTA TGATTGTAGG CTTTGCTTCA ATCAAGC 438 Ala Asp Glu Met Ala Leu Trp Leu Leu 130 135
GTTAATAAAA CGCTAGAAAG CGTTTTTTAA TGAATGCCTA CAAAATTTTT AGCCAAAAAC 498 CAACCAA 505
(2) INFORMATION FOR SEQ ID NO: 530:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 137 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530:
Gin Cys Gly Glu Arg Asp Phe Arg Gly Leu Phe Phe Gin Ala He Arg
1 5 10 15
Val Ser Asn Tyr Ala Arg Asn Tyr Gin Val Lys His Asn Leu Ala Tyr
20 25 30
Trp Gly Ala Lys Asp Tyr Leu Gly Cys Gly Ala Gly Ala Val Gly Cys
35 40 45
Val Ala Asn Glu Arg Phe Phe Ala Lys Lys Leu He Glu Asn Tyr He
50 55 60
Lys Asp Pro Leu Gin Arg Gin Val Glu Thr Leu Asn Lys Gin Asp Lys 65 70 75 80
Arg Leu Glu Lys Leu Phe Leu Gly Leu Arg Cys Val Leu Gly Val Glu
85 90 95
Leu Ser Phe Leu Asp Glu Asn Lys Val Lys Phe Leu He Glu Glu Asn
100 105 110
Lys Ala Phe He Lys Asn Asn Arg Leu He Ala Ser Asp Phe Phe Met
115 120 125
Ala Asp Glu Met Ala Leu Trp Leu Leu 130 135
(2) INFORMATION FOR SEQ ID NO: 531:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1260 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 55...1179 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531:
GTTGGTGTTT GAAAGGCTCT ATGAAGAAGA ATTAAGGCGC AAGGGTTTTT TATA ATG 57
Met
1
GTC TCT CTC TAT TTA GAA AAC GGG CTT TTT TTG CAA GCG CAA AGT TTT 105 Val Ser Leu Tyr Leu Glu Asn Gly Leu Phe Leu Gin Ala Gin Ser Phe 5 10 15
GGG GCT AGC GGC ACG CAA GCG GGC GAG CTT GTT TTT AAC ACT TCT ATG 153 Gly Ala Ser Gly Thr Gin Ala Gly Glu Leu Val Phe Asn Thr Ser Met 20 25 30 AGC GGT TAT CAA GAA GTC ATT AGC GAC CCT AGC TAT AAG GGG CAA TTT 201 Ser Gly Tyr Gin Glu Val He Ser Asp Pro Ser Tyr Lys Gly Gin Phe 35 40 45
GTG GTT TTT AGC ATG CCT GAG ATT GGG GTT GTG GGT GCT AAT TCT AAA 249 Val Val Phe Ser Met Pro Glu He Gly Val Val Gly Ala Asn Ser Lys 50 55 60 65
GAT GAT GAA TCC TTT TTT TCA TGC GCA GGG GTT TTA GCG CGC CAT TAC 297 Asp Asp Glu Ser Phe Phe Ser Cys Ala Gly Val Leu Ala Arg His Tyr 70 75 80
AAC GAA TTT TTT TCT AAC TCA AGG GCG GAT TTT AGC TTG AGC GCT TAT 345 Asn Glu Phe Phe Ser Asn Ser Arg Ala Asp Phe Ser Leu Ser Ala Tyr 85 90 95
TTG AAA GAG CGT GGC GTT TTA GGG GTT TGT GGC GTT GAT ACT AGG AGT 393 Leu Lys Glu Arg Gly Val Leu Gly Val Cys Gly Val Asp Thr Arg Ser 100 105 110
TTG ATT AAA ACC TTA CGC CAT CAT GGG TGC TTA ATG ATG GTC GCT TCC 441 Leu He Lys Thr Leu Arg His His Gly Cys Leu Met Met Val Ala Ser 115 120 125
ACG ATA GAG CAT GAC AAA AAC AAG CTT GAA GAA ATT TTA AAA AAC GCT 489 Thr He Glu His Asp Lys Asn Lys Leu Glu Glu He Leu Lys Asn Ala 130 135 140 145
CCT AAA ATT TCT CAC TCC CCC CTA GTG TCT AGC GTT TCT ACG CCA AAA 537 Pro Lys He Ser His Ser Pro Leu Val Ser Ser Val Ser Thr Pro Lys 150 155 160
ATA ACC ACG CAC CAG CGT GCG ACT TTT GAT TTC AAA ACC CTA GAT TAC 585 He Thr Thr His Gin Arg Ala Thr Phe Asp Phe Lys Thr Leu Asp Tyr 165 170 175
AAG CCT TTT GAT GAA AAA ACC TCT CAT AAA ATT ATC GCG GTG TTA GAC 633 Lys Pro Phe Asp Glu Lys Thr Ser His Lys He He Ala Val Leu Asp 180 185 190
TTT GGG GCT AAG GGC AAT ATT TTA AAC GAG CTT CAA AAT GTG GGG TTA 681 Phe Gly Ala Lys Gly Asn He Leu Asn Glu Leu Gin Asn Val Gly Leu 195 200 205
AAA GCC CTT ATT TAC CCG CAC CAC ACT AAA GCT AGC GAG CTG ATT AAA 729 Lys Ala Leu He Tyr Pro His His Thr Lys Ala Ser Glu Leu He Lys 210 215 220 225
GCC TAT GAA AAA AAA GAA ATT AGC GGG ATT TTC CTC TCT AAC GGG CCG 777 Ala Tyr Glu Lys Lys Glu He Ser Gly He Phe Leu Ser Asn Gly Pro 230 235 240
GGC GAT CCT TTA AGC TTG CAG CAA GAA ATT GGC GAA ATC AAA CAA CTC 825 Gly Asp Pro Leu Ser Leu Gin Gin Glu He Gly Glu He Lys Gin Leu 245 250 255 ATT AAC GCT AAA ATC CCC ATG CTT GGC ATT TGC TTA GGG CAT CAA TTG 873 He Asn Ala Lys He Pro Met Leu Gly He Cys Leu Gly His Gin Leu 260 265 270
CTC TCT ATC GCT CAA GGC TAC CCT ACT TAC AAG CTC AAA TTT GGT CAT 921 Leu Ser He Ala Gin Gly Tyr Pro Thr Tyr Lys Leu Lys Phe Gly His 275 280 285
CAT GGG AGC AAC CAC CCC GTT AAA AAC CTA AAA ACA AAC GCC GTA GAA 969 His Gly Ser Asn His Pro Val Lys Asn Leu Lys Thr Asn Ala Val Glu 290 295 300 305
ATC ACC GCG CAA AAC CAC AAC TAT TGC GTC CCT GAA GAC ATT GAA GAA 1017 He Thr Ala Gin Asn His Asn Tyr Cys Val Pro Glu Asp He Glu Glu 310 315 320
ATC GCC ATT ATC ACG CAC CGC AAT CTT TTT GAC AAC ACC ATT GAG GGC 1065 He Ala He He Thr His Arg Asn Leu Phe Asp Asn Thr He Glu Gly 325 330 335
GTG CGT TAT AAA AAC GCT CCC ATT ATC TCT GTC CAG CAC CAC CCA GAA 1113 Val Arg Tyr Lys Asn Ala Pro He He Ser Val Gin His His Pro Glu 340 345 350
AGT AGC CCA GGT CCT AAA GAG AGC CAC TAT ATT TTT AAA GAA TTT GTG 1161 Ser Ser Pro Gly Pro Lys Glu Ser His Tyr He Phe Lys Glu Phe Val 355 360 365
GAA TTG TTA AAG GAT TTT TAGGGGTTTT TAAAACAGCG CTTATAGAGA CTGAAAAG 1217 Glu Leu Leu Lys Asp Phe 370 375
CGCTTTAAAA ATAGATTTAA ATCTTTTTAT CAAAAAATCT CGC 1260
(2) INFORMATION FOR SEQ ID NO: 532:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 375 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:532:
Met Val Ser Leu Tyr Leu Glu Asn Gly Leu Phe Leu Gin Ala Gin Ser
1 5 10 15
Phe Gly Ala Ser Gly Thr Gin Ala Gly Glu Leu Val Phe Asn Thr Ser
20 25 30
Met Ser Gly Tyr Gin Glu Val He Ser Asp Pro Ser Tyr Lys Gly Gin
35 40 45
Phe Val Val Phe Ser Met Pro Glu He Gly Val Val Gly Ala Asn Ser 50 55 60 Lys Asp Asp Glu Ser Phe Phe Ser Cys Ala Gly Val Leu Ala Arg His 65 70 75 80
Tyr Asn Glu Phe Phe Ser Asn Ser Arg Ala Asp Phe Ser Leu Ser Ala
85 90 95
Tyr Leu Lys Glu Arg Gly Val Leu Gly Val Cys Gly Val Asp Thr Arg
100 105 110
Ser Leu He Lys Thr Leu Arg His His Gly Cys Leu Met Met Val Ala
115 120 125
Ser Thr He Glu His Asp Lys Asn Lys Leu Glu Glu He Leu Lys Asn
130 135 140
Ala Pro Lys He Ser His Ser Pro Leu Val Ser Ser Val Ser Thr Pro 145 150 155 160
Lys He Thr Thr His Gin Arg Ala Thr Phe Asp Phe Lys Thr Leu Asp
165 170 175
Tyr Lys Pro Phe Asp Glu Lys Thr Ser His Lys He He Ala Val Leu
180 185 190
Asp Phe Gly Ala Lys Gly Asn He Leu Asn Glu Leu Gin Asn Val Gly
195 200 205
Leu Lys Ala Leu He Tyr Pro His His Thr Lys Ala Ser Glu Leu He
210 215 220
Lys Ala Tyr Glu Lys Lys Glu He Ser Gly He Phe Leu Ser Asn Gly 225 230 235 240
Pro Gly Asp Pro Leu Ser Leu Gin Gin Glu He Gly Glu He Lys Gin
245 250 255
Leu He Asn Ala Lys He Pro Met Leu Gly He Cys Leu Gly His Gin
260 265 270
Leu Leu Ser He Ala Gin Gly Tyr Pro Thr Tyr Lys Leu Lys Phe Gly
275 280 285
His His Gly Ser Asn His Pro Val Lys Asn Leu Lys Thr Asn Ala Val
290 295 300
Glu He Thr Ala Gin Asn His Asn Tyr Cys Val Pro Glu Asp He Glu 305 310 315 320
Glu He Ala He He Thr His Arg Asn Leu Phe Asp Asn Thr He Glu
325 330 335
Gly Val Arg Tyr Lys Asn Ala Pro He He Ser Val Gin His His Pro
340 345 350
Glu Ser Ser Pro Gly Pro Lys Glu Ser His Tyr He Phe Lys Glu Phe
355 360 365
Val Glu Leu Leu Lys Asp Phe 370 375
(2) INFORMATION FOR SEQ ID NO: 533:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2790 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 116...2656 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:533:
TTCTCCTTTA ATTTTGGATG TTTAAAGTAT AATAAACTAT CTTTTTAAAA AAATAACTTA 60 AAAGAGCTAT AAAATAGCCT TAAAATACGC GATAAAACAA CAAAAAGGAA TACCC ATG 118
Met 1
GAT ATT CGC AAC GAA TTT TTA CAA TTT TTT CAA AAT AAA GGG CAT GCC 166 Asp He Arg Asn Glu Phe Leu Gin Phe Phe Gin Asn Lys Gly His Ala 5 10 15
GTT TAT CCT AGC ATG CCT TTA GTG CCT AAT GAC GCT ACC TTG CTT TTT 214 Val Tyr Pro Ser Met Pro Leu Val Pro Asn Asp Ala Thr Leu Leu Phe 20 25 30
ACC AAT GCC GGC ATG GTG CAA TTT AAA GAT ATT TTT ACC GGG ATT GTG 262 Thr Asn Ala Gly Met Val Gin Phe Lys Asp He Phe Thr Gly He Val 35 40 45
CCA CGC CCT AGC ATT CCT AGA GCG GCA AGC TCG CAA TTG TGC ATG CGC 310 Pro Arg Pro Ser He Pro Arg Ala Ala Ser Ser Gin Leu Cys Met Arg 50 55 60 65
GCA GGC GGC AAG CAT AAC GAT TTG GAA AAT GTC GGT TAT ACC GCA AGG 358 Ala Gly Gly Lys His Asn Asp Leu Glu Asn Val Gly Tyr Thr Ala Arg 70 75 80
CAC CAC ACG CTT TTT GAA ATG CTA GGG AAT TTC TCT TTT GGG GAT TAT 406 His His Thr Leu Phe Glu Met Leu Gly Asn Phe Ser Phe Gly Asp Tyr 85 90 95
TTC AAA GAA GAA GCG ATC TTG TTT GCG TGG GAA TTT GTA ACC AAG AAT 454 Phe Lys Glu Glu Ala He Leu Phe Ala Trp Glu Phe Val Thr Lys Asn 100 105 110
TTA GGG TTT AAG CCT AAA GAT TTA TAC ATC AGC GTG CAT GAA AAG GAC 502 Leu Gly Phe Lys Pro Lys Asp Leu Tyr He Ser Val His Glu Lys Asp 115 120 125
GAT GAA GCC GTT AAA TTA TGG GAA AAG TTT GTG CCT GTT GAT AGG ATT 550 Asp Glu Ala Val Lys Leu Trp Glu Lys Phe Val Pro Val Asp Arg He 130 135 140 145
AAA AAA ATG GGC GAT AAA GAT AAT TTT TGG CAA ATG GGC GAT AGC GGG 598 Lys Lys Met Gly Asp Lys Asp Asn Phe Trp Gin Met Gly Asp Ser Gly 150 155 160
CCT TGC GGG CCT TGC AGT GAA ATT TAC ATT GAT CAG GGC GAA AAA CAC 646 Pro Cys Gly Pro Cys Ser Glu He Tyr He Asp Gin Gly Glu Lys His 165 170 175 TTT AAG GGG AGC GAG GAT TAT TTT GGG GGC GAG GGC GAT AGG TTT TTA 694 Phe Lys Gly Ser Glu Asp Tyr Phe Gly Gly Glu Gly Asp Arg Phe Leu 180 185 190
GAA ATT TGG AAT CTG GTG TTC ATG CAA TAC GAA CGC TCT AAT GAT GGC 742 Glu He Trp Asn Leu Val Phe Met Gin Tyr Glu Arg Ser Asn Asp Gly 195 200 205
GTT TTA TCC CCC TTG CCA AAG CCT AGC ATT GAT ACA GGC ATG GGA TTA 790 Val Leu Ser Pro Leu Pro Lys Pro Ser He Asp Thr Gly Met Gly Leu 210 215 220 225
GAA AGG GTG CAA GCG CTA TTA GAA CAT AAG CTC AAT AAT TTT GAT TCT 838 Glu Arg Val Gin Ala Leu Leu Glu His Lys Leu Asn Asn Phe Asp Ser 230 235 240
TCA TTA TTT GCG CCC CTA ATG GAA GAA ATC AGC GAG CTT ACA AGC CTA 886 Ser Leu Phe Ala Pro Leu Met Glu Glu He Ser Glu Leu Thr Ser Leu 245 250 255
GAT TAT GCG AGC GAG TTC CAG CCA AGC TTT AGG GTA GTG GCC GAT CAC 934 Asp Tyr Ala Ser Glu Phe Gin Pro Ser Phe Arg Val Val Ala Asp His 260 265 270
GCA AGA GCG GTA GCA TTC TTG CTC GCT CAA GGG GTG CAT TTC AAT AAG 982 Ala Arg Ala Val Ala Phe Leu Leu Ala Gin Gly Val His Phe Asn Lys 275 280 285
GAA GGC CGT GGC TAT GTT TTA AGG CGC ATT TTA AGG CGA GCC TTA AGG 1030 Glu Gly Arg Gly Tyr Val Leu Arg Arg He Leu Arg Arg Ala Leu Arg 290 295 300 305
CAT GGG TAT TTA ATG GGC TTG AAA GAA GCG TTT TTA TAC AAA GTC GTG 1078 His Gly Tyr Leu Met Gly Leu Lys Glu Ala Phe Leu Tyr Lys Val Val 310 315 320
GGC GTG GTG TGC GAG CAA TTT GCT AAC ACG CAT GCG TAT TTG AAA GAG 1126 Gly Val Val Cys Glu Gin Phe Ala Asn Thr His Ala Tyr Leu Lys Glu 325 330 335
TCT AAA GAA ATG GTG GTA AAA GAA TGT TTT GAA GAA GAA GAG CAC TTT 1174 Ser Lys Glu Met Val Val Lys Glu Cys Phe Glu Glu Glu Glu His Phe 340 345 350
TTA GAG ACT TTG GAA TCG GGC ATG GAA TTG TTT AAC TTG TCT TTA AAG 1222 Leu Glu Thr Leu Glu Ser Gly Met Glu Leu Phe Asn Leu Ser Leu Lys 355 360 365
CAT TTG AAT GAA AAT AAA ATC TTT GAT GGC AAG ATC GCT TTC AAG CTT 1270 His Leu Asn Glu Asn Lys He Phe Asp Gly Lys He Ala Phe Lys Leu 370 375 380 385
TAT GAC ACT TTT GGT TTC CCT TTG GAT TTA ACA AAC GAC ATG TTA AGA 1318 Tyr Asp Thr Phe Gly Phe Pro Leu Asp Leu Thr Asn Asp Met Leu Arg 390 395 400 AGT CAT GGG GCG TGT GCG GAT ATG CAA GGC TTT GAA TTG TGC ATG CAA 1366 Ser His Gly Ala Cys Ala Asp Met Gin Gly Phe Glu Leu Cys Met Gin 405 410 415
GAG CAA GTG AAA CGC TCT AAA GCT TCA TGG AAA GGC AAA CAA AAC AAC 1414 Glu Gin Val Lys Arg Ser Lys Ala Ser Trp Lys Gly Lys Gin Asn Asn 420 425 430
GCC GAT TTT AGC GCT ATT TTA AAC GCT TAT GCA CCT AAT GTT TTT GTG 1462 Ala Asp Phe Ser Ala He Leu Asn Ala Tyr Ala Pro Asn Val Phe Val 435 440 445
GGG TAT GAA ACG ACA GAA TGT TCT GCT AAA GTT TTA GGG TTT TTT GAT 1510 Gly Tyr Glu Thr Thr Glu Cys Ser Ala Lys Val Leu Gly Phe Phe Asp 450 455 460 465
AGC GAT TTT AAA GAA ATA ACC GAT GCA AAT CCT AAC CAA GAA GTC TGG 1558 Ser Asp Phe Lys Glu He Thr Asp Ala Asn Pro Asn Gin Glu Val Trp 470 475 480
GTG TTG TTA GAA AAA ACC CCT TTT TAT GCA GAA GGT GGA GGG GCT ATA 1606 Val Leu Leu Glu Lys Thr Pro Phe Tyr Ala Glu Gly Gly Gly Ala He 485 490 495
GGC GAT AGG GGC GCG CTT TTT AAA GAC AAT GGA GAA GTG GCT ATC GTG 1654 Gly Asp Arg Gly Ala Leu Phe Lys Asp Asn Gly Glu Val Ala He Val 500 505 510
TTA GAT ACA AAA AAC TTT TTT GGG CTT AAT TTT TCA CTC CTT GAA ATC 1702 Leu Asp Thr Lys Asn Phe Phe Gly Leu Asn Phe Ser Leu Leu Glu He 515 520 525
AAA AAA GCG CTA AAA AAA GGC GAT CAA GTG ATC GCG CAA GTG AGC GAT 1750 Lys Lys Ala Leu Lys Lys Gly Asp Gin Val He Ala Gin Val Ser Asp 530 535 540 545
GAG CGC TTT GAA ATC GCC AAA CAC CAT AGT GCG ACT CAT TTA TTG CAG 1798 Glu Arg Phe Glu He Ala Lys His His Ser Ala Thr His Leu Leu Gin 550 555 560
AGC GCT TTA AGA GAA GTT TTA GGC TCG CAT GTG AGT CAA GCG GGG AGT 1846 Ser Ala Leu Arg Glu Val Leu Gly Ser His Val Ser Gin Ala Gly Ser 565 570 575
TTA GTG GAA TCC AAG CGA TTG CGC TTT GAT TTC TCG CAT GCT AAA GCG 1894 Leu Val Glu Ser Lys Arg Leu Arg Phe Asp Phe Ser His Ala Lys Ala 580 585 590
CTC AAT GAT GAA GAG CTA GAA AAA GTA GAA GAT TTA GTC AAC GCT CAA 1942 Leu Asn Asp Glu Glu Leu Glu Lys Val Glu Asp Leu Val Asn Ala Gin 595 600 605
ATT TTC AAG CAC CTA AAT AGC CAG GTG GAG CAT ATG CCT TTA AAC CAA 1990 He Phe Lys His Leu Asn Ser Gin Val Glu His Met Pro Leu Asn Gin 610 615 620 625 GCC AAA GAT AAG GGA GCG TTA GCG TTA TTC AGT GAA AAA TAC GCT GAA 2038 Ala Lys Asp Lys Gly Ala Leu Ala Leu Phe Ser Glu Lys Tyr Ala Glu 630 635 640
AAT GTG CGG GTG GTG AGC TTT AAA GAA GCG TCC ATT GAA TTG TGT GGG 2086 Asn Val Arg Val Val Ser Phe Lys Glu Ala Ser He Glu Leu Cys Gly 645 650 655
GGC ATT CAT GTG GAA AAT ACT GGG CTT ATT GGG GGG TTT AGG ATT GTA 2134 Gly He His Val Glu Asn Thr Gly Leu He Gly Gly Phe Arg He Val 660 665 670
AAA GAA AGC GGG GTG AGT AGT GGG GTC AGA CGC ATT GAA GCG GTG TGC 2182 Lys Glu Ser Gly Val Ser Ser Gly Val Arg Arg He Glu Ala Val Cys 675 680 685
GGG AAA GCC TTT TAC CAA CTG GCT AAA GAA GAA AAT AAA GAG CTT AAA 2230 Gly Lys Ala Phe Tyr Gin Leu Ala Lys Glu Glu Asn Lys Glu Leu Lys 690 695 700 705
AAC GCT AAG ACT TTA TTG AAA AAT AAC GAT GTG ATC GCC GGT ATC AAT 2278 Asn Ala Lys Thr Leu Leu Lys Asn Asn Asp Val He Ala Gly He Asn 710 715 720
AAG CTT AAA GAG AGC GTG AAA AAC AGC CAA AAA GCC CCC GTT TCT ATG 2326 Lys Leu Lys Glu Ser Val Lys Asn Ser Gin Lys Ala Pro Val Ser Met 725 730 735
GAT TTA CCG GTT GAA AAA ATC CAT GGC GTG AAT TTG GTG GTG GGC GTA 2374 Asp Leu Pro Val Glu Lys He His Gly Val Asn Leu Val Val Gly Val 740 745 750
GTG GAA CAA GGC GAC ATT AAA GAA ATG ATT GAC CGA TTG AAA AGT AAG 2422 Val Glu Gin Gly Asp He Lys Glu Met He Asp Arg Leu Lys Ser Lys 755 760 765
CAT GAA AGA TTG CTC GCT ATG GTG TTT AAA AAA GAA AAT GAG CGA ATC 2470 His Glu Arg Leu Leu Ala Met Val Phe Lys Lys Glu Asn Glu Arg He 770 775 780 785
ACT CTC GCA TGC GGG GTG AAA AAC GCG CCC ATA AAA GCG AAT GTG TGG 2518 Thr Leu Ala Cys Gly Val Lys Asn Ala Pro He Lys Ala Asn Val Trp 790 795 800
GCT AAT GAA GTG GCG CAA ATT TTA GGG GGC AAA GGG GGC GGG AGA GGT 2566 Ala Asn Glu Val Ala Gin He Leu Gly Gly Lys Gly Gly Gly Arg Gly 805 810 815
GAT TTT GCG AGC GCT GGA GGC AAG GAT ATT GAA AAT TTG CAA GCC GCA 2614 Asp Phe Ala Ser Ala Gly Gly Lys Asp He Glu Asn Leu Gin Ala Ala 820 825 830
CTC AAT TTA GCG AAA AAT ACC GCT CTT AAA GCT TTA GAG GGA TAGCATGGA 2665 Leu Asn Leu Ala Lys Asn Thr Ala Leu Lys Ala Leu Glu Gly 835 840 845 GCTTATTTTA GGCTCTCAAT CCAGCACTAG GGCGAATCTC TTAAAAGAGC ATGGGATTAA 2725 GTTTGAACAA AAAGCGCTCT ATTTTGATGA AGAAAGCCTA AAAACCACAG ACCCTAGGGA 2785 GTTTG 2790
(2) INFORMATION FOR SEQ ID NO:534:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 847 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534:
Met Asp He Arg Asn Glu Phe Leu Gin Phe Phe Gin Asn Lys Gly His
1 5 10 15
Ala Val Tyr Pro Ser Met Pro Leu Val Pro Asn Asp Ala Thr Leu Leu
20 25 30
Phe Thr Asn Ala Gly Met Val Gin Phe Lys Asp He Phe Thr Gly He
35 40 45
Val Pro Arg Pro Ser He Pro Arg Ala Ala Ser Ser Gin Leu Cys Met
50 55 60
Arg Ala Gly Gly Lys His Asn Asp Leu Glu Asn Val Gly Tyr Thr Ala 65 70 75 80
Arg His His Thr Leu Phe Glu Met Leu Gly Asn Phe Ser Phe Gly Asp
85 90 95
Tyr Phe Lys Glu Glu Ala He Leu Phe Ala Trp Glu Phe Val Thr Lys
100 105 110
Asn Leu Gly Phe Lys Pro Lys Asp Leu Tyr He Ser Val His Glu Lys
115 120 125
Asp Asp Glu Ala Val Lys Leu Trp Glu Lys Phe Val Pro Val Asp Arg
130 135 140
He Lys Lys Met Gly Asp Lys Asp Asn Phe Trp Gin Met Gly Asp Ser 145 150 155 160
Gly Pro Cys Gly Pro Cys Ser Glu He Tyr He Asp Gin Gly Glu Lys
165 170 175
His Phe Lys Gly Ser Glu Asp Tyr Phe Gly Gly Glu Gly Asp Arg Phe
180 185 190
Leu Glu He Trp Asn Leu Val Phe Met Gin Tyr Glu Arg Ser Asn Asp
195 200 205
Gly Val Leu Ser Pro Leu Pro Lys Pro Ser He Asp Thr Gly Met Gly
210 215 220
Leu Glu Arg Val Gin Ala Leu Leu Glu His Lys Leu Asn Asn Phe Asp 225 230 235 240
Ser Ser Leu Phe Ala Pro Leu Met Glu Glu He Ser Glu Leu Thr Ser
245 250 255
Leu Asp Tyr Ala Ser Glu Phe Gin Pro Ser Phe Arg Val Val Ala Asp
260 265 270
His Ala Arg Ala Val Ala Phe Leu Leu Ala Gin Gly Val His Phe Asn
275 280 285
Lys Glu Gly Arg Gly Tyr Val Leu Arg Arg He Leu Arg Arg Ala Leu 290 295 300 Arg His Gly Tyr Leu Met Gly Leu Lys Glu Ala Phe Leu Tyr Lys Val 305 310 315 320
Val Gly Val Val Cys Glu Gin Phe Ala Asn Thr His Ala Tyr Leu Lys
325 330 335
Glu Ser Lys Glu Met Val Val Lys Glu Cys Phe Glu Glu Glu Glu His
340 345 350
Phe Leu Glu Thr Leu Glu Ser Gly Met Glu Leu Phe Asn Leu Ser Leu
355 360 365
Lys His Leu Asn Glu Asn Lys He Phe Asp Gly Lys He Ala Phe Lys
370 375 380
Leu Tyr Asp Thr Phe Gly Phe Pro Leu Asp Leu Thr Asn Asp Met Leu 385 390 395 400
Arg Ser His Gly Ala Cys Ala Asp Met Gin Gly Phe Glu Leu Cys Met
405 410 415
Gin Glu Gin Val Lys Arg Ser Lys Ala Ser Trp Lys Gly Lys Gin Asn
420 425 430
Asn Ala Asp Phe Ser Ala He Leu Asn Ala Tyr Ala Pro Asn Val Phe
435 440 445
Val Gly Tyr Glu Thr Thr Glu Cys Ser Ala Lys Val Leu Gly Phe Phe
450 455 460
Asp Ser Asp Phe Lys Glu He Thr Asp Ala Asn Pro Asn Gin Glu Val 465 470 475 480
Trp Val Leu Leu Glu Lys Thr Pro Phe Tyr Ala Glu Gly Gly Gly Ala
485 490 495
He Gly Asp Arg Gly Ala Leu Phe Lys Asp Asn Gly Glu Val Ala He
500 505 510
Val Leu Asp Thr Lys Asn Phe Phe Gly Leu Asn Phe Ser Leu Leu Glu
515 520 525
He Lys Lys Ala Leu Lys Lys Gly Asp Gin Val He Ala Gin Val Ser
530 535 540
Asp Glu Arg Phe Glu He Ala Lys His His Ser Ala Thr His Leu Leu 545 550 555 560
Gin Ser Ala Leu Arg Glu Val Leu Gly Ser His Val Ser Gin Ala Gly
565 570 575
Ser Leu Val Glu Ser Lys Arg Leu Arg Phe Asp Phe Ser His Ala Lys
580 585 590
Ala Leu Asn Asp Glu Glu Leu Glu Lys Val Glu Asp Leu Val Asn Ala
595 600 605
Gin He Phe Lys His Leu Asn Ser Gin Val Glu His Met Pro Leu Asn
610 615 620
Gin Ala Lys Asp Lys Gly Ala Leu Ala Leu Phe Ser Glu Lys Tyr Ala 625 630 635 640
Glu Asn Val Arg Val Val Ser Phe Lys Glu Ala Ser He Glu Leu Cys
645 650 655
Gly Gly He His Val Glu Asn Thr Gly Leu He Gly Gly Phe Arg He
660 665 670
Val Lys Glu Ser Gly Val Ser Ser Gly Val Arg Arg He Glu Ala Val
675 680 685
Cys Gly Lys Ala Phe Tyr Gin Leu Ala Lys Glu Glu Asn Lys Glu Leu
690 695 700
Lys Asn Ala Lys Thr Leu Leu Lys Asn Asn Asp Val He Ala Gly He 705 710 715 720
Asn Lys Leu Lys Glu Ser Val Lys Asn Ser Gin Lys Ala Pro Val Ser
725 730 735
Met Asp Leu Pro Val Glu Lys He His Gly Val Asn Leu Val Val Gly 740 745 750
Val Val Glu Gin Gly Asp He Lys Glu Met He Asp Arg Leu Lys Ser
755 760 765
Lys His Glu Arg Leu Leu Ala Met Val Phe Lys Lys Glu Asn Glu Arg
770 775 780
He Thr Leu Ala Cys Gly Val Lys Asn Ala Pro He Lys Ala Asn Val 785 790 795 800
Trp Ala Asn Glu Val Ala Gin He Leu Gly Gly Lys Gly Gly Gly Arg
805 810 815
Gly Asp Phe Ala Ser Ala Gly Gly Lys Asp He Glu Asn Leu Gin Ala
820 825 830
Ala Leu Asn Leu Ala Lys Asn Thr Ala Leu Lys Ala Leu Glu Gly 835 840 845
(2) INFORMATION FOR SEQ ID NO: 535:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 720 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 100...636 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535:
TCGCACGCCA AAGAAAAACA CGAAAAAACC GAACACACGC ATTCTCACCA CACAGAGGAA 60 GCAGAAAGCG TAGGATCTCA TAGCGAATAA GGGCTTATC ATG TTT AAT AAA GTG 114
Met Phe Asn Lys Val
1 5
ATT ATG GTA GGG CGT TTG ACC AGG AAT GTG GAG TTG AAA TAT TTG CCT 162 He Met Val Gly Arg Leu Thr Arg Asn Val Glu Leu Lys Tyr Leu Pro 10 15 20
AGC GGT TCG GCT GCG GCT ACA ATA GGT TTA GCC ACA AGC AGG CGT TTT 210 Ser Gly Ser Ala Ala Ala Thr He Gly Leu Ala Thr Ser Arg Arg Phe 25 30 35
AAA AAA CAA GAC GGC ACG CTA GGC GAA GAG GTG TGC TTT ATA GAT GCG 258 Lys Lys Gin Asp Gly Thr Leu Gly Glu Glu Val Cys Phe He Asp Ala 40 45 50
CGT TTG TTT GGG CGA ACG GCT GAA ATC GCT AAC CAG TAT TTG AGC AAG 306 Arg Leu Phe Gly Arg Thr Ala Glu He Ala Asn Gin Tyr Leu Ser Lys 55 60 65 GGT TCA AGC GTT TTG ATA GAA GGG CGT TTG ACT TAT GAG AGT TGG ATG 354 Gly Ser Ser Val Leu He Glu Gly Arg Leu Thr Tyr Glu Ser Trp Met 70 75 80 85
GAT CAA ACG GGC AAA AAA AAT TCC CGC CAC ACT ATC ACA GCG GAC TCG 402 Asp Gin Thr Gly Lys Lys Asn Ser Arg His Thr He Thr Ala Asp Ser 90 95 100
TTG CAA TTT ATG GAT AAA AAG TCA GAC AAT CCC CAA GCA AAC GCT ATG 450 Leu Gin Phe Met Asp Lys Lys Ser Asp Asn Pro Gin Ala Asn Ala Met 105 110 115
CAA GAT AGT ATA ATG CAT GAG AAT TCC AAC AAC GCT TAT CCC GCT AAT 498 Gin Asp Ser He Met His Glu Asn Ser Asn Asn Ala Tyr Pro Ala Asn 120 125 130
CAT AAC GCT CCC AGC CAA GAT CCT TTT AAC CAA GCT TAT GCG CAA AAC 546 His Asn Ala Pro Ser Gin Asp Pro Phe Asn Gin Ala Tyr Ala Gin Asn 135 140 145
GCT TAC GCT AAA GAG AAT TTA CAA GCA CAG CCG TCC AAG TAT CAA AAC 594 Ala Tyr Ala Lys Glu Asn Leu Gin Ala Gin Pro Ser Lys Tyr Gin Asn 150 155 160 165
AGC GTG CCT GAA ATC AAT ATT GAT GAA GAA GAA ATC CCC TTT TAAGGGTTA 645 Ser Val Pro Glu He Asn He Asp Glu Glu Glu He Pro Phe 170 175
AAATTAAGGA GACATTATGG AAAGAAAACG CTATTCAAAA CGCTATTGCA AATACACTGA 705 AGCTAAAATC AGCTT 720
(2) INFORMATION FOR SEQ ID NO: 536:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 179 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536:
Met Phe Asn Lys Val He Met Val Gly Arg Leu Thr Arg Asn Val Glu
1 5 10 15
Leu Lys Tyr Leu Pro Ser Gly Ser Ala Ala Ala Thr He Gly Leu Ala
20 25 30
Thr Ser Arg Arg Phe Lys Lys Gin Asp Gly Thr Leu Gly Glu Glu Val
35 40 45
Cys Phe He Asp Ala Arg Leu Phe Gly Arg Thr Ala Glu He Ala Asn
50 55 60
Gin Tyr Leu Ser Lys Gly Ser Ser Val Leu He Glu Gly Arg Leu Thr 65 70 75 80
Tyr Glu Ser Trp Met Asp Gin Thr Gly Lys Lys Asn Ser Arg His Thr 85 90 95
He Thr Ala Asp Ser Leu Gin Phe Met Asp Lys Lys Ser Asp Asn Pro
100 105 110
Gin Ala Asn Ala Met Gin Asp Ser He Met His Glu Asn Ser Asn Asn
115 120 125
Ala Tyr Pro Ala Asn His Asn Ala Pro Ser Gin Asp Pro Phe Asn Gin
130 135 140
Ala Tyr Ala Gin Asn Ala Tyr Ala Lys Glu Asn Leu Gin Ala Gin Pro 145 150 155 160
Ser Lys Tyr Gin Asn Ser Val Pro Glu He Asn He Asp Glu Glu Glu
165 170 175
He Pro Phe
(2) INFORMATION FOR SEQ ID NO: 537:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 990 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 91...879 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537:
TCCAAAACGA TTGGGCTGAA ATTGAATTTT CTCACGAAAC AAAGGGCTAT GTGTTTTTAA 60 AACTTTTAAA AAAGGCTGAA TGAAAGAATA ATG AAA TTA AAA TCT TTT GGG GTT 114
Met Lys Leu Lys Ser Phe Gly Val 1 5
TTT GGA AAT CCC ATT AAG CAT TCC AAA TCG CCC TTA ATC CAT AAC GCT 162 Phe Gly Asn Pro He Lys His Ser Lys Ser Pro Leu He His Asn Ala 10 15 20
TGT TTT TTA ACT TTT CAA AAA GAA TTA AGG TTT TTG GGG CAT TAC CAC 210 Cys Phe Leu Thr Phe Gin Lys Glu Leu Arg Phe Leu Gly His Tyr His 25 30 35 40
CCC ATA TTA CTC CCT TTA GAA AGC CAC ATC AAA AGC GAG TTT TTG CAT 258 Pro He Leu Leu Pro Leu Glu Ser His He Lys Ser Glu Phe Leu His 45 50 55
TTG GGA TTG AGT GGG GCT AAT GTA ACC TTA CCC TTT AAA GAA AGG GCG 306 Leu Gly Leu Ser Gly Ala Asn Val Thr Leu Pro Phe Lys Glu Arg Ala 60 65 70 TTT CAA GTT TGC GAT AAA ATC AAA GGT ATC GCG CTT GAA TGC GGA GCG 354 Phe Gin Val Cys Asp Lys He Lys Gly He Ala Leu Glu Cys Gly Ala 75 80 85
GTC AAT ACG CTT GTT TTA GAA AAT GAT GAG CTT GTG GGT TAC AAT ACC 402 Val Asn Thr Leu Val Leu Glu Asn Asp Glu Leu Val Gly Tyr Asn Thr 90 95 100
GAC GCT TTA GGG TTT TAT CTT TCT TTA AAG CAA AAA AAC TAT CAA AAC 450 Asp Ala Leu Gly Phe Tyr Leu Ser Leu Lys Gin Lys Asn Tyr Gin Asn 105 110 115 120
GCT TTG ATT TTA GGA GCT GGG GGG AGC GCT AAA GCC CTA GCG TGT GAA 498 Ala Leu He Leu Gly Ala Gly Gly Ser Ala Lys Ala Leu Ala Cys Glu 125 130 135
TTG AAA AAA CAA GGC TTA CAA GTG AGC GTG TTG AAC CGC TCT TCT AGG 546 Leu Lys Lys Gin Gly Leu Gin Val Ser Val Leu Asn Arg Ser Ser Arg 140 145 150
GGA TTG GAT TTT TTC CAA CGC CTG GGC TGT GAT TGT TTT ATG GAG CCT 594 Gly Leu Asp Phe Phe Gin Arg Leu Gly Cys Asp Cys Phe Met Glu Pro 155 160 165
CCT AAA AGC GCT TTT GAT TTG ATT ATT AAC GCC ACT TCA GCG AGT TTG 642 Pro Lys Ser Ala Phe Asp Leu He He Asn Ala Thr Ser Ala Ser Leu 170 175 180
CAT AAC GAA TTG CCT TTG AAT AAA GAG GTT TTG AAA GGG TAT TTT AAA 690 His Asn Glu Leu Pro Leu Asn Lys Glu Val Leu Lys Gly Tyr Phe Lys 185 190 195 200
GAG GGC AAG CTC GCT TAT GAT TTG GCG TAT GGG TTT TTA ACG CCC TTT 738 Glu Gly Lys Leu Ala Tyr Asp Leu Ala Tyr Gly Phe Leu Thr Pro Phe 205 210 215
TTG TCT TTA GCC AAA GAG TTA AAA ACC CCT TTT CAA GAC GGA AAA GAC 786 Leu Ser Leu Ala Lys Glu Leu Lys Thr Pro Phe Gin Asp Gly Lys Asp 220 225 230
ATG CTC ATC TAT CAA GCT GCT TTA AGT TTT GAA AAA TTC AGC GCT TCT 834 Met Leu He Tyr Gin Ala Ala Leu Ser Phe Glu Lys Phe Ser Ala Ser 235 240 245
CAA ATC CCT TAT TCA AAA GCG TTT GAA GTC ATG CGA AGT GTT TTT TGATG 884 Gin He Pro Tyr Ser Lys Ala Phe Glu Val Met Arg Ser Val Phe 250 255 260
CAAGGGTTTT TAAGAAGCCT GTTTTTTGGG GTTAAAAAGA TCCCTAAACC ATTCGCTCCT 944 CTAGTAGAAA AGGGCGTTTT AAAAGAAGCG CTTGAATTGA AAAAGG 990
(2) INFORMATION FOR SEQ ID NO: 538:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 263 amino acids (B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538:
Met Lys Leu Lys Ser Phe Gly Val Phe Gly Asn Pro He Lys His Ser
1 5 10 15
Lys Ser Pro Leu He His Asn Ala Cys Phe Leu Thr Phe Gin Lys Glu
20 25 30
Leu Arg Phe Leu Gly His Tyr His Pro He Leu Leu Pro Leu Glu Ser
35 40 45
His He Lys Ser Glu Phe Leu His Leu Gly Leu Ser Gly Ala Asn Val
50 55 60
Thr Leu Pro Phe Lys Glu Arg Ala Phe Gin Val Cys Asp Lys He Lys 65 70 75 80
Gly He Ala Leu Glu Cys Gly Ala Val Asn Thr Leu Val Leu Glu Asn
85 90 95
Asp Glu Leu Val Gly Tyr Asn Thr Asp Ala Leu Gly Phe Tyr Leu Ser
100 105 110
Leu Lys Gin Lys Asn Tyr Gin Asn Ala Leu He Leu Gly Ala Gly Gly
115 120 125
Ser Ala Lys Ala Leu Ala Cys Glu Leu Lys Lys Gin Gly Leu Gin Val
130 135 140
Ser Val Leu Asn Arg Ser Ser Arg Gly Leu Asp Phe Phe Gin Arg Leu 145 150 155 160
Gly Cys Asp Cys Phe Met Glu Pro Pro Lys Ser Ala Phe Asp Leu He
165 170 175
He Asn Ala Thr Ser Ala Ser Leu His Asn Glu Leu Pro Leu Asn Lys
180 185 190
Glu Val Leu Lys Gly Tyr Phe Lys Glu Gly Lys Leu Ala Tyr Asp Leu
195 200 205
Ala Tyr Gly Phe Leu Thr Pro Phe Leu Ser Leu Ala Lys Glu Leu Lys
210 215 220
Thr Pro Phe Gin Asp Gly Lys Asp Met Leu He Tyr Gin Ala Ala Leu 225 230 235 240
Ser Phe Glu Lys Phe Ser Ala Ser Gin He Pro Tyr Ser Lys Ala Phe
245 250 255
Glu Val Met Arg Ser Val Phe 260
(2) INFORMATION FOR SEQ ID NO: 539:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1080 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 47...1033 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539:
TTTTTAGAGA GGGCGTGTTT GATAGCGTGG ATTTAAAGGA GCAAGC ATG AGC GCT 55
Met Ser Ala
1
TAT ATC ATT GAA ACC CTG ATT AAA ATT TTG ATT TTA GTC GCT GTT TTT 103 Tyr He He Glu Thr Leu He Lys He Leu He Leu Val Ala Val Phe 5 10 15
TCG GCT TTA GGA GGC TTT GCC ACT TAT ATT GAA AGG AAA GTG TTA GCC 151 Ser Ala Leu Gly Gly Phe Ala Thr Tyr He Glu Arg Lys Val Leu Ala 20 25 30 35
TAT TTC CAA CGC CGT TTA GGG CCT TGT TAT GTG GGG CCT TTT GGG CTT 199 Tyr Phe Gin Arg Arg Leu Gly Pro Cys Tyr Val Gly Pro Phe Gly Leu 40 45 50
TTG CAA GTC GCA GCA GAC GGC ATT AAG CTT TTC ACT AAA GAA GAC ATT 247 Leu Gin Val Ala Ala Asp Gly He Lys Leu Phe Thr Lys Glu Asp He 55 60 65
ATC CCT CAA GGC GCG AAC AAA TTC ATT TTC ACG CTA GCG CCC ATT ATT 295 He Pro Gin Gly Ala Asn Lys Phe He Phe Thr Leu Ala Pro He He 70 75 80
GCG ATG GTG AGT GCG TTT GTG TCC ATG GCG CCT ATC CCC TTT TTC CCT 343 Ala Met Val Ser Ala Phe Val Ser Met Ala Pro He Pro Phe Phe Pro 85 90 95
AAT TTC ACT CTG TTT GGC TAT GAG ATC AAG CCC CTT ATT TCT GAC ATC 391 Asn Phe Thr Leu Phe Gly Tyr Glu He Lys Pro Leu He Ser Asp He 100 105 110 115
AAC ATT GGC TTT TTG TTT TTC TTA GCC GTG GGT TCG GCA GGG ATT TAT 439 Asn He Gly Phe Leu Phe Phe Leu Ala Val Gly Ser Ala Gly He Tyr 120 125 130
GCG CCT ATT TTA GCC GGG CTT GCC TCT AAT AAC AAA TAC TCT TTA ATT 487 Ala Pro He Leu Ala Gly Leu Ala Ser Asn Asn Lys Tyr Ser Leu He 135 140 145
GGC TCC GCA AGA GCG ACG ATC CAA CTG CTC AGC TTT GAA GTG GTC AGC 535 Gly Ser Ala Arg Ala Thr He Gin Leu Leu Ser Phe Glu Val Val Ser 150 155 160
ACT TTA ACC ATT CTA GCC CCC TTA ATG GTG GTA GGA TCG CTC TCT TTA 583 Thr Leu Thr He Leu Ala Pro Leu Met Val Val Gly Ser Leu Ser Leu 165 170 175
GTG GAA ATC AAT CAT TAC CAA AGC GGT GGG TTT TTA GAC TGG CTT GTG 631 Val Glu He Asn His Tyr Gin Ser Gly Gly Phe Leu Asp Trp Leu Val 180 185 190 195
TTT AAG CAG CCT CTA GCG TTT GTT TTG TTT TTG ATC GCA AGT TAT GCC 679 Phe Lys Gin Pro Leu Ala Phe Val Leu Phe Leu He Ala Ser Tyr Ala 200 205 210
GAA TTG AAT CGA ACC CCC TTT GAC TTG CTA GAG CAT GAA GCC GAG ATC 727 Glu Leu Asn Arg Thr Pro Phe Asp Leu Leu Glu His Glu Ala Glu He 215 220 225
GTG GCG GGG TAT TGC ACC GAA TAC AGC GGC TTG AAA TGG GGC ATG TTC 775 Val Ala Gly Tyr Cys Thr Glu Tyr Ser Gly Leu Lys Trp Gly Met Phe 230 235 240
TTT TTA GCG GAA TAC GCG CAT TTA TTC GCT TTT TCT TTT GTG ATT TCT 823 Phe Leu Ala Glu Tyr Ala His Leu Phe Ala Phe Ser Phe Val He Ser 245 250 255
ATT GTG TTT TTT GGC GGG TTT AAC GCA TGG GGC TTT ATC CCT GGA GGC 871 He Val Phe Phe Gly Gly Phe Asn Ala Trp Gly Phe He Pro Gly Gly 260 265 270 275
ATA GCG ATT TTG ATT AAA GCG GGC TTT TTT GTC TTT TTA TCC ATG TGG 919 He Ala He Leu He Lys Ala Gly Phe Phe Val Phe Leu Ser Met Trp 280 285 290
GTT AGA GCG ACT TAT CCG CAT GTG CGC CCA GAC CAA CTG ATG GAT ATG 967 Val Arg Ala Thr Tyr Pro His Val Arg Pro Asp Gin Leu Met Asp Met 295 300 305
TGC TGG AAA ATC ATG CTG CCT TTA GCG TTA TTG AAC ATT GTG CTA ACG 1015 Cys Trp Lys He Met Leu Pro Leu Ala Leu Leu Asn He Val Leu Thr 310 315 320
GGC ATT ATC ATT TTA ATT TAAAGGAGGT TTTATGGCCA AACAAGAATA CAAGCAAC 1071 Gly He He He Leu He 325
TTCCTAAAC 1080
(2) INFORMATION FOR SEQ ID NO: 540:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 329 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540:
Met Ser Ala Tyr He He Glu Thr Leu He Lys He Leu He Leu Val
1 5 10 15
Ala Val Phe Ser Ala Leu Gly Gly Phe Ala Thr Tyr He Glu Arg Lys
20 25 30
Val Leu Ala Tyr Phe Gin Arg Arg Leu Gly Pro Cys Tyr Val Gly Pro
35 40 45
Phe Gly Leu Leu Gin Val Ala Ala Asp Gly He Lys Leu Phe Thr Lys
50 55 60
Glu Asp He He Pro Gin Gly Ala Asn Lys Phe He Phe Thr Leu Ala 65 70 75 80
Pro He He Ala Met Val Ser Ala Phe Val Ser Met Ala Pro He Pro
85 90 95
Phe Phe Pro Asn Phe Thr Leu Phe Gly Tyr Glu He Lys Pro Leu He
100 105 110
Ser Asp He Asn He Gly Phe Leu Phe Phe Leu Ala Val Gly Ser Ala
115 120 125
Gly He Tyr Ala Pro He Leu Ala Gly Leu Ala Ser Asn Asn Lys Tyr
130 135 140
Ser Leu He Gly Ser Ala Arg Ala Thr He Gin Leu Leu Ser Phe Glu 145 150 155 160
Val Val Ser Thr Leu Thr He Leu Ala Pro Leu Met Val Val Gly Ser
165 170 175
Leu Ser Leu Val Glu He Asn His Tyr Gin Ser Gly Gly Phe Leu Asp
180 185 190
Trp Leu Val Phe Lys Gin Pro Leu Ala Phe Val Leu Phe Leu He Ala
195 200 205
Ser Tyr Ala Glu Leu Asn Arg Thr Pro Phe Asp Leu Leu Glu His Glu
210 215 220
Ala Glu He Val Ala Gly Tyr Cys Thr Glu Tyr Ser Gly Leu Lys Trp 225 230 235 240
Gly Met Phe Phe Leu Ala Glu Tyr Ala His Leu Phe Ala Phe Ser Phe
245 250 255
Val He Ser He Val Phe Phe Gly Gly Phe Asn Ala Trp Gly Phe He
260 265 270
Pro Gly Gly He Ala He Leu He Lys Ala Gly Phe Phe Val Phe Leu
275 280 285
Ser Met Trp Val Arg Ala Thr Tyr Pro His Val Arg Pro Asp Gin Leu
290 295 300
Met Asp Met Cys Trp Lys He Met Leu Pro Leu Ala Leu Leu Asn He 305 310 315 320
Val Leu Thr Gly He He He Leu He 325
(2) INFORMATION FOR SEQ ID NO: 541:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1280 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 48...1226 (D) OTHER INFORMATION:
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 541:
TAAGGATAAA ATCAAGCGAT TAGCCCGAAT TTTAAGAGAG TATTAAG ATG AAT AAA 56
Met Asn Lys 1
AAA GCG TAT TTT GGG GAG TTT GGA GGG AGT TTT GTT TCG GAG TTG TTA 104 Lys Ala Tyr Phe Gly Glu Phe Gly Gly Ser Phe Val Ser Glu Leu Leu 5 10 15
GTG CCT GCA TTA AGA GAA TTA GAA CAG GCG TTT GAT GCG TGT TTG AAA 152 Val Pro Ala Leu Arg Glu Leu Glu Gin Ala Phe Asp Ala Cys Leu Lys 20 25 30 35
GAT GAA AAA TTC CAA AAA GAA TAT TTT CGT CTT TTA AAG GAT TTT GTG 200 Asp Glu Lys Phe Gin Lys Glu Tyr Phe Arg Leu Leu Lys Asp Phe Val 40 45 50
GGC CGT CCT AGC CCT TTA ACC TTG TGT CAA AAT ATC GTT TCT AAC CCT 248 Gly Arg Pro Ser Pro Leu Thr Leu Cys Gin Asn He Val Ser Asn Pro 55 60 65
AAA GTC AAG CTT TAT TTA AAA CGA GAG GAT TTA ATC CAT GGC GGG GCG 296 Lys Val Lys Leu Tyr Leu Lys Arg Glu Asp Leu He His Gly Gly Ala 70 75 80
CAT AAG ACT AAT CAA GCC TTA GGG CAA GCC CTT TTA GCG AAA AAA ATG 344 His Lys Thr Asn Gin Ala Leu Gly Gin Ala Leu Leu Ala Lys Lys Met 85 90 95
GGT AAA ACA AGG ATC ATC GCT GAA ACA GGC GCC GGT CAG CAT GGC GTG 392 Gly Lys Thr Arg He He Ala Glu Thr Gly Ala Gly Gin His Gly Val 100 105 110 115
GCG ACG GCT ATC GCT TGC GCA TTA TTG AAC TTA AAA TGC GTG GTT TTT 440 Ala Thr Ala He Ala Cys Ala Leu Leu Asn Leu Lys Cys Val Val Phe 120 125 130
ATG GGA TCT AAA GAC ATC AAG CGC CAG GAA ATG AAT GTT TTT AGA ATG 488 Met Gly Ser Lys Asp He Lys Arg Gin Glu Met Asn Val Phe Arg Met 135 140 145
CAC TTA TTA GGC GCT GAA GTG AGA GAG GTT AAT TCA GGG AGC GCG ACG 536 His Leu Leu Gly Ala Glu Val Arg Glu Val Asn Ser Gly Ser Ala Thr 150 155 160
CTT AAA GAC GCT GTG AAT GAA GCC TTA AGA GAT TGG GCG AGC AGT TAC 584 Leu Lys Asp Ala Val Asn Glu Ala Leu Arg Asp Trp Ala Ser Ser Tyr 165 170 175
AAG GAC ACG CAT TAT TTG CTA GGC ACA GCC GCC GGG CCA CAC CCT TAC 632 Lys Asp Thr His Tyr Leu Leu Gly Thr Ala Ala Gly Pro His Pro Tyr 180 185 190 195
CCC ACA ATG GTT AAA ACC TTT CAA AAA ATG ATA GGC GAT GAG GTT AAA 680 Pro Thr Met Val Lys Thr Phe Gin Lys Met He Gly Asp Glu Val Lys 200 205 210
AGC CAG ATT TTA GAA AAA GAA AAC CGC TTG CCT GAT TAT GTG ATC GCA 728 Ser Gin He Leu Glu Lys Glu Asn Arg Leu Pro Asp Tyr Val He Ala 215 220 225
TGC GTT GGA GGG GGG TCT AAC GCT ATA GGG ATA TTC AGC GCA TTT TTA 776 Cys Val Gly Gly Gly Ser Asn Ala He Gly He Phe Ser Ala Phe Leu 230 235 240
AAC GAC AAA GAA GTT AAA CTC ATA GGC GTA GAG CCG GCG GGT TTA GGG 824 Asn Asp Lys Glu Val Lys Leu He Gly Val Glu Pro Ala Gly Leu Gly 245 250 255
CTA GAA ACC AAT AAG CAT GGG GCG ACT TTG AAT AAG GGG CGT GTG GGG 872 Leu Glu Thr Asn Lys His Gly Ala Thr Leu Asn Lys Gly Arg Val Gly 260 265 270 275
ATT TTG CAT GGG AAT AAA ACC TAT CTT TTA CAA GAT GAT GAA GGC CAG 920 He Leu His Gly Asn Lys Thr Tyr Leu Leu Gin Asp Asp Glu Gly Gin 280 285 290
ATT GCA GAA AGC CAT AGC ATT AGC GCC GGG CTT GAT TAT CCA GGA GTG 968 He Ala Glu Ser His Ser He Ser Ala Gly Leu Asp Tyr Pro Gly Val 295 300 305
GGG CCA GAA CAC AGC TAT TTA AAA GAA AGT GGG CGT GCG GTT TAT GAA 1016 Gly Pro Glu His Ser Tyr Leu Lys Glu Ser Gly Arg Ala Val Tyr Glu 310 315 320
AGC GCA AGC GAT GCT GAA GCG CTA GAA GCC TTC AAG TTG TTG TGC CAA 1064 Ser Ala Ser Asp Ala Glu Ala Leu Glu Ala Phe Lys Leu Leu Cys Gin 325 330 335
AAA GAA GGC ATT ATC CCA GCG CTA GAA AGC TCA CAC GCC TTA GCG TAT 1112 Lys Glu Gly He He Pro Ala Leu Glu Ser Ser His Ala Leu Ala Tyr 340 345 350 355
GCC TTA AAG CTC GCT CAA AAA TGC GAA GAA GAA AGC ATC ATC GTA GTG 1160 Ala Leu Lys Leu Ala Gin Lys Cys Glu Glu Glu Ser He He Val Val 360 365 370
AAT TTA AGC GGC AGA GGG GAT AAG GAT TTA AGC ACC GTT TAT AAC GCT 1208 Asn Leu Ser Gly Arg Gly Asp Lys Asp Leu Ser Thr Val Tyr Asn Ala 375 380 385
TTA AAA GGA GGT TTA AAA TGAGGTATCA AAACATGTTT GAAACCTTAA AAAAACAC 1264 Leu Lys Gly Gly Leu Lys 390
GAAAAAATGG CGTTTA 1280
(2) INFORMATION FOR SEQ ID NO:542:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:542:
Met Asn Lys Lys Ala Tyr Phe Gly Glu Phe Gly Gly Ser Phe Val Ser
1 5 10 15
Glu Leu Leu Val Pro Ala Leu Arg Glu Leu Glu Gin Ala Phe Asp Ala
20 25 30
Cys Leu Lys Asp Glu Lys Phe Gin Lys Glu Tyr Phe Arg Leu Leu Lys
35 40 45
Asp Phe Val Gly Arg Pro Ser Pro Leu Thr Leu Cys Gin Asn He Val
50 55 60
Ser Asn Pro Lys Val Lys Leu Tyr Leu Lys Arg Glu Asp Leu He His 65 70 75 80
Gly Gly Ala His Lys Thr Asn Gin Ala Leu Gly Gin Ala Leu Leu Ala
85 90 95
Lys Lys Met Gly Lys Thr Arg He He Ala Glu Thr Gly Ala Gly Gin
100 105 110
His Gly Val Ala Thr Ala He Ala Cys Ala Leu Leu Asn Leu Lys Cys
115 120 125
Val Val Phe Met Gly Ser Lys Asp He Lys Arg Gin Glu Met Asn Val
130 135 140
Phe Arg Met His Leu Leu Gly Ala Glu Val Arg Glu Val Asn Ser Gly 145 150 155 160
Ser Ala Thr Leu Lys Asp Ala Val Asn Glu Ala Leu Arg Asp Trp Ala
165 170 175
Ser Ser Tyr Lys Asp Thr His Tyr Leu Leu Gly Thr Ala Ala Gly Pro
180 185 190
His Pro Tyr Pro Thr Met Val Lys Thr Phe Gin Lys Met He Gly Asp
195 200 205
Glu Val Lys Ser Gin He Leu Glu Lys Glu Asn Arg Leu Pro Asp Tyr
210 215 220
Val He Ala Cys Val Gly Gly Gly Ser Asn Ala He Gly He Phe Ser 225 230 235 240
Ala Phe Leu Asn Asp Lys Glu Val Lys Leu He Gly Val Glu Pro Ala
245 250 255
Gly Leu Gly Leu Glu Thr Asn Lys His Gly Ala Thr Leu Asn Lys Gly
260 265 270
Arg Val Gly He Leu His Gly Asn Lys Thr Tyr Leu Leu Gin Asp Asp
275 280 285
Glu Gly Gin He Ala Glu Ser His Ser He Ser Ala Gly Leu Asp Tyr 290 295 300
Pro Gly Val Gly Pro Glu His Ser Tyr Leu Lys Glu Ser Gly Arg Ala 305 310 315 320
Val Tyr Glu Ser Ala Ser Asp Ala Glu Ala Leu Glu Ala Phe Lys Leu
325 330 335
Leu Cys Gin Lys Glu Gly He He Pro Ala Leu Glu Ser Ser His Ala
340 345 350
Leu Ala Tyr Ala Leu Lys Leu Ala Gin Lys Cys Glu Glu Glu Ser He
355 360 365
He Val Val Asn Leu Ser Gly Arg Gly Asp Lys Asp Leu Ser Thr Val
370 375 380
Tyr Asn Ala Leu Lys Gly Gly Leu Lys 385 390
(2) INFORMATION FOR SEQ ID NO:543:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 559 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...513 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:543:
CAGGTAGCTT TGGGCGAGAA AGGAGAGAGC ATG AAT GTC AAA AAT CGT TTG AGC 54
Met Asn Val Lys Asn Arg Leu Ser 1 5
GAT TGG GAA TAT CAA TGG GCA GTG GCT CTA GTC TAT ACG ATA TGT ATC 102 Asp Trp Glu Tyr Gin Trp Ala Val Ala Leu Val Tyr Thr He Cys He 10 15 20
TCC ATA AAC GCT AGG ATT TTT TAT GAC ATA GAT GGT TCA GCT AGC GAT 150 Ser He Asn Ala Arg He Phe Tyr Asp He Asp Gly Ser Ala Ser Asp 25 30 35 40
TCG ATT TTT GAC CCT AAA AAT AGC TAT TAT ATG TGG CTA GTG GGT CTA 198 Ser He Phe Asp Pro Lys Asn Ser Tyr Tyr Met Trp Leu Val Gly Leu 45 50 55
ATA GCG GCT TTG TTG TCT AAC CTT TTA TTT GAC CCA CGA GGT AGG GAT 246 He Ala Ala Leu Leu Ser Asn Leu Leu Phe Asp Pro Arg Gly Arg Asp 60 65 70
TGT TAT AAA TCT TTC CAA GTA AGA TAC CCT AGG TTT CTC AAA GCC ATT 294 Cys Tyr Lys Ser Phe Gin Val Arg Tyr Pro Arg Phe Leu Lys Ala He 75 80 85
TTT AAG GCT AGG TTT TTT GGC GCG TTT TAT AAC GCT GTG TTA GGA TCA 342 Phe Lys Ala Arg Phe Phe Gly Ala Phe Tyr Asn Ala Val Leu Gly Ser 90 95 100
AGG CTA AGG GAT TTT TAT GTG ATG CTT TTA ACG ATA CCC TTT ATT GCC 390 Arg Leu Arg Asp Phe Tyr Val Met Leu Leu Thr He Pro Phe He Ala 105 110 115 120
GCT ATC CAT GAG GTT TCG GCG TAT TAC GGG CAT CCT AGC AAC TTC CTT 438 Ala He His Glu Val Ser Ala Tyr Tyr Gly His Pro Ser Asn Phe Leu 125 130 135
ATA GAG GGT TTG GTC ATT CTT GGC CTT GTG TGT GTT TTT GGG ATT TGT 486 He Glu Gly Leu Val He Leu Gly Leu Val Cys Val Phe Gly He Cys 140 145 150
TCT AGG CTT TGC GCT AAA TTA GGG TGG TGATTTAACT CAAATAGCAT TAAATGG 540 Ser Arg Leu Cys Ala Lys Leu Gly Trp 155 160
AGGGGGGAGT AAAAAATTA 559
(2) INFORMATION FOR SEQ ID NO:544:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 161 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:544:
Met Asn Val Lys Asn Arg Leu Ser Asp Trp Glu Tyr Gin Trp Ala Val
1 5 10 15
Ala Leu Val Tyr Thr He Cys He Ser He Asn Ala Arg He Phe Tyr
20 25 30
Asp He Asp Gly Ser Ala Ser Asp Ser He Phe Asp Pro Lys Asn Ser
35 40 45
Tyr Tyr Met Trp Leu Val Gly Leu He Ala Ala Leu Leu Ser Asn Leu
50 55 60
Leu Phe Asp Pro Arg Gly Arg Asp Cys Tyr Lys Ser Phe Gin Val Arg 65 70 75 80
Tyr Pro Arg Phe Leu Lys Ala He Phe Lys Ala Arg Phe Phe Gly Ala
85 90 95
Phe Tyr Asn Ala Val Leu Gly Ser Arg Leu Arg Asp Phe Tyr Val Met
100 105 110
Leu Leu Thr He Pro Phe He Ala Ala He His Glu Val Ser Ala Tyr
115 120 125
Tyr Gly His Pro Ser Asn Phe Leu He Glu Gly Leu Val He Leu Gly 130 135 140
Leu Val Cys Val Phe Gly He Cys Ser Arg Leu Cys Ala Lys Leu Gly 145 150 155 160
Trp
(2) INFORMATION FOR SEQ ID NO: 545:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 810 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 53...712 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:545:
TGAGATCAAA CCCGTAGAAC TTGTCAAGGT AATTCTTGCG TAAGGAAATA GC ATG TTA 58
Met Leu 1
ATA ACC ACC CAA CTA TCC AAA CGA TTT TAC GCC ACA CTC GCT CTT TCT 106 He Thr Thr Gin Leu Ser Lys Arg Phe Tyr Ala Thr Leu Ala Leu Ser 5 10 15
TGC GTG TTT TTA ACC ATC ACT AAC ATT CTT GTC AAA GGC TCG TTT ATC 154 Cys Val Phe Leu Thr He Thr Asn He Leu Val Lys Gly Ser Phe He 20 25 30
AAT CTT TTA GCA GGG CTT AGT GGG GTT TTG TAT GCG TTT TTT GCC GGA 202 Asn Leu Leu Ala Gly Leu Ser Gly Val Leu Tyr Ala Phe Phe Ala Gly 35 40 45 50
GAA AGG CAA ACG ATT TGC TTT GTG TTT GGT CTT GTT TAT AAT TTG AGT 250 Glu Arg Gin Thr He Cys Phe Val Phe Gly Leu Val Tyr Asn Leu Ser 55 60 65
TAC GCT TAT GTC GCT TAT CAG TGG AAA TTA AAC GCT GAT GTG ATT TTA 298 Tyr Ala Tyr Val Ala Tyr Gin Trp Lys Leu Asn Ala Asp Val He Leu 70 75 80
TGC CTT TTT TTG TAT ATG CCA GTA ACG ATT TAT GGG CTG TTC GCA TGG 346 Cys Leu Phe Leu Tyr Met Pro Val Thr He Tyr Gly Leu Phe Ala Trp 85 90 95
AAA AAG ACA GAG CAG CAT GAA GGC GTT ATC AAG GCT CAA AAA CTT TCC 394 Lys Lys Thr Glu Gin His Glu Gly Val He Lys Ala Gin Lys Leu Ser 100 105 110
AAA AAT TGG CGT TTT ATA CTC ATT TTA GGC GTA GGG GTT TTA ACT TGT 442 Lys Asn Trp Arg Phe He Leu He Leu Gly Val Gly Val Leu Thr Cys 115 120 125 130
GTG AGC GCT TTG TTT TTT AAA GAG ATT AAA ACG AAT TTT TTA TGG GCA 490 Val Ser Ala Leu Phe Phe Lys Glu He Lys Thr Asn Phe Leu Trp Ala 135 140 145
GAG AGT TTT AAT TTC GTC ATC TTT ATT ATT GCT TTT ATT TTA CAG GTT 538 Glu Ser Phe Asn Phe Val He Phe He He Ala Phe He Leu Gin Val 150 155 160
TTG CGC TAT ATA GAA AAT TAT GCG CTA GTA ACT TTG GGG AAT ATC GTA 586 Leu Arg Tyr He Glu Asn Tyr Ala Leu Val Thr Leu Gly Asn He Val 165 170 175
TCC ATT ATC GTG TGG TTT TGT ATT TTT CAA ATT TCT ACA GAG AGC TTG 634 Ser He He Val Trp Phe Cys He Phe Gin He Ser Thr Glu Ser Leu 180 185 190
GTG CAA CTC TTC ACA ACG ATC CTA TAC CTT TTT ATT GGC TTG TAT TAT 682 Val Gin Leu Phe Thr Thr He Leu Tyr Leu Phe He Gly Leu Tyr Tyr 195 200 205 210
TTT AAC CGG TGG AAT AAG TCA TGC AAG CAG TGATTTTAGC GAATGGGGAG TTT 735 Phe Asn Arg Trp Asn Lys Ser Cys Lys Gin 215 220
CCTAAATCTC AAAAATGCTT AGACCTTTTA AAAAACGCTC CCTTTTTAAT CGCATGCGAT 795 GGGGCTGTTA CCTCA 810
(2) INFORMATION FOR SEQ ID NO: 546:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 220 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546:
Met Leu He Thr Thr Gin Leu Ser Lys Arg Phe Tyr Ala Thr Leu Ala
1 5 10 15
Leu Ser Cys Val Phe Leu Thr He Thr Asn He Leu Val Lys Gly Ser
20 25 30
Phe He Asn Leu Leu Ala Gly Leu Ser Gly Val Leu Tyr Ala Phe Phe
35 40 45
Ala Gly Glu Arg Gin Thr He Cys Phe Val Phe Gly Leu Val Tyr Asn 50 55 60 Leu Ser Tyr Ala Tyr Val Ala Tyr Gin Trp Lys Leu Asn Ala Asp Val 65 70 75 80
He Leu Cys Leu Phe Leu Tyr Met Pro Val Thr He Tyr Gly Leu Phe
85 90 95
Ala Trp Lys Lys Thr Glu Gin His Glu Gly Val He Lys Ala Gin Lys
100 105 110
Leu Ser Lys Asn Trp Arg Phe He Leu He Leu Gly Val Gly Val Leu
115 120 125
Thr Cys Val Ser Ala Leu Phe Phe Lys Glu He Lys Thr Asn Phe Leu
130 135 140
Trp Ala Glu Ser Phe Asn Phe Val He Phe He He Ala Phe He Leu 145 150 155 160
Gin Val Leu Arg Tyr He Glu Asn Tyr Ala Leu Val Thr Leu Gly Asn
165 170 175
He Val Ser He He Val Trp Phe Cys He Phe Gin He Ser Thr Glu
180 185 190
Ser Leu Val Gin Leu Phe Thr Thr He Leu Tyr Leu Phe He Gly Leu
195 200 205
Tyr Tyr Phe Asn Arg Trp Asn Lys Ser Cys Lys Gin 210 215 220
(2) INFORMATION FOR SEQ ID NO: 547:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 451 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...398 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547:
AGTTTAAAGA AAAGATTAGA AAAATTAGAA GATAAAGGAG GTAACGACTG ATG AGA 56
Met Arg 1
CAC AAA CAC GGA TAC CGC AAG CTT GGG AGA ACC AGC TCG CAC AGA AAG 104 His Lys His Gly Tyr Arg Lys Leu Gly Arg Thr Ser Ser His Arg Lys 5 10 15
GCG TTA TTA AAG AAT TTA GCG ATC GCT TTG ATT GAG CAT AAC AAA ATT 152 Ala Leu Leu Lys Asn Leu Ala He Ala Leu He Glu His Asn Lys He 20 25 30
GAA ACA GGG ATT TAT AAG GCT AAG GAA TTG CGC AGT TAC ATT GAG AAA 200 Glu Thr Gly He Tyr Lys Ala Lys Glu Leu Arg Ser Tyr He Glu Lys 35 40 45 50
TTG ACG ACA GCG GCT CGT GTG GGC GAT TTT AAT GCG CAC CGC CAT GTT 248 Leu Thr Thr Ala Ala Arg Val Gly Asp Phe Asn Ala His Arg His Val 55 60 65
TTT GCA TAT TTG CAA AAC AAA GAA GCC ACC CAC AAG CTT GTA ACT GAA 296 Phe Ala Tyr Leu Gin Asn Lys Glu Ala Thr His Lys Leu Val Thr Glu 70 75 80
ATC GCG CCC AAA TAC GCG CAA AGG AAT GGC GGA TAC ACC AGG ATC CAA 344 He Ala Pro Lys Tyr Ala Gin Arg Asn Gly Gly Tyr Thr Arg He Gin 85 90 95
CGC ACC ACT TTT AGA AGA GGG GAC GCT TCC ACT CTA GCC ACC ATT GAA 392 Arg Thr Thr Phe Arg Arg Gly Asp Ala Ser Thr Leu Ala Thr He Glu 100 105 110
TTT GTA TGAAATTTGA TGAACTGCTA GCCAAGATTT AGTCTTGGTT GGTGGTTATC GC 450
Phe Val
115
451
(2) INFORMATION FOR SEQ ID NO: 548:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 116 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:548:
Met Arg His Lys His Gly Tyr Arg Lys Leu Gly Arg Thr Ser Ser His
1 5 10 15
Arg Lys Ala Leu Leu Lys Asn Leu Ala He Ala Leu He Glu His Asn
20 25 30
Lys He Glu Thr Gly He Tyr Lys Ala Lys Glu Leu Arg Ser Tyr He
35 40 45
Glu Lys Leu Thr Thr Ala Ala Arg Val Gly Asp Phe Asn Ala His Arg
50 55 60
His Val Phe Ala Tyr Leu Gin Asn Lys Glu Ala Thr His Lys Leu Val 65 70 75 80
Thr Glu He Ala Pro Lys Tyr Ala Gin Arg Asn Gly Gly Tyr Thr Arg
85 90 95
He Gin Arg Thr Thr Phe Arg Arg Gly Asp Ala Ser Thr Leu Ala Thr
100 105 110
He Glu Phe Val 115
(2) INFORMATION FOR SEQ ID NO: 549: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1204 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 36...1142 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549:
AAAATAAGCG TTTTGATGCC ATTTTTGGAG CGATC GTG GAA TTG AGT TAT TAT 53
Val Glu Leu Ser Tyr Tyr 1 5
GAA ATT TTA GAA GTG GAA AAA CAC AGC AAC CAA GAG ACC ATT AAA AAG 101 Glu He Leu Glu Val Glu Lys His Ser Asn Gin Glu Thr He Lys Lys 10 15 20
TCT TAC AGA AAG CTG GCT TTA AAA TAC CAC CCA GAC AGA AAC GCC GGC 149 Ser Tyr Arg Lys Leu Ala Leu Lys Tyr His Pro Asp Arg Asn Ala Gly 25 30 35
GAT AAA GAA GCC GAA GAA AAA TTC AAG CTC ATC AAT GAA GCC TAT GGG 197 Asp Lys Glu Ala Glu Glu Lys Phe Lys Leu He Asn Glu Ala Tyr Gly 40 45 50
GTG TTA AGC GAT GAA AAG AAG CGG GCC TTA TAC GAC AGG TAT GGT AAA 245 Val Leu Ser Asp Glu Lys Lys Arg Ala Leu Tyr Asp Arg Tyr Gly Lys 55 60 65 70
AAA GGC TTA AAC CAA GCC GGC GCA AGC CAA GGC GAT TTT TCT GAT TTT 293 Lys Gly Leu Asn Gin Ala Gly Ala Ser Gin Gly Asp Phe Ser Asp Phe 75 80 85
TTT GAA GAT TTA GGC TCG TTT TTT GAA GAC GCT TTT GGG TTT GGC GCT 341 Phe Glu Asp Leu Gly Ser Phe Phe Glu Asp Ala Phe Gly Phe Gly Ala 90 95 100
AGG GGG AGT AAA AGG CAA AAA AGC TCT ATC GCA CCG GAT TAT TTG CAA 389 Arg Gly Ser Lys Arg Gin Lys Ser Ser He Ala Pro Asp Tyr Leu Gin 105 110 115
ACC CTT GAA TTG AGT TTC AAA GAA GCG GTT TTT GGC TGT AAA AAA ACC 437 Thr Leu Glu Leu Ser Phe Lys Glu Ala Val Phe Gly Cys Lys Lys Thr 120 125 130
ATT AAA GTC CAA TAC CAG AGC GTT TGT GAA AGT TGC GAT GGC ACG GGC 485 He Lys Val Gin Tyr Gin Ser Val Cys Glu Ser Cys Asp Gly Thr Gly 135 140 145 150
GCT AAA GAC AAA GCC CTA GAG ACT TGC AAG CAA TGC AAT GGG CAG GGG 533 Ala Lys Asp Lys Ala Leu Glu Thr Cys Lys Gin Cys Asn Gly Gin Gly 155 160 165
CAG GTG TTT ATG CGT CAA GGT TTT ATG AGT TTT GCG CAA ACT TGT GGG 581 Gin Val Phe Met Arg Gin Gly Phe Met Ser Phe Ala Gin Thr Cys Gly 170 175 180
GCG TGT CAA GGC AAG GGC AAG ATC GTT AAA ACC CCA TGC CAA GCG TGC 629 Ala Cys Gin Gly Lys Gly Lys He Val Lys Thr Pro Cys Gin Ala Cys 185 190 195
AAG GGT AAA ACC TAT ATC CTT AAA GAT GAA GAA ATT GAT GCG ATA ATC 677 Lys Gly Lys Thr Tyr He Leu Lys Asp Glu Glu He Asp Ala He He 200 205 210
CCT GAG GGC ATT GAT GAT CAA AAC CGC ATG GTG CTT AAA AAT AAA GGC 725 Pro Glu Gly He Asp Asp Gin Asn Arg Met Val Leu Lys Asn Lys Gly 215 220 225 230
AAT GAA TAC GAG AAG GGA AAA AGA GGG GAT TTG TAT TTA GAA GCG CAA 773 Asn Glu Tyr Glu Lys Gly Lys Arg Gly Asp Leu Tyr Leu Glu Ala Gin 235 240 245
GTC AAA GAA GAT GAG CAT TTC AAG CGC GAA GGC TGC GAT TTA TTC ATT 821 Val Lys Glu Asp Glu His Phe Lys Arg Glu Gly Cys Asp Leu Phe He 250 255 260
AAA GCG CCG GTG TTT TTC ACC ACT ATC GCT TTA GGG CAT ACG ATT AAA 869 Lys Ala Pro Val Phe Phe Thr Thr He Ala Leu Gly His Thr He Lys 265 270 275
GTG CCG TCT TTA AAA GGG GAC GAA CTG GAA TTA AAA ATC CCT AGA AAC 917 Val Pro Ser Leu Lys Gly Asp Glu Leu Glu Leu Lys He Pro Arg Asn 280 285 290
GCC AGA GAC AAG CAG ACT TTT GCG TTT AGA AAC GAG GGC GTG AAA CAC 965 Ala Arg Asp Lys Gin Thr Phe Ala Phe Arg Asn Glu Gly Val Lys His 295 300 305 310
CCT GAA AGC TCT TAT AGA GGG AGT TTG ATC GTG GAA TTG CAA GTG ATT 1013 Pro Glu Ser Ser Tyr Arg Gly Ser Leu He Val Glu Leu Gin Val He 315 320 325
TAC CCT AAA AGT TTG AAT AAA GAG CAG CAA GAA TTG TTG GAA AAA TTG 1061 Tyr Pro Lys Ser Leu Asn Lys Glu Gin Gin Glu Leu Leu Glu Lys Leu 330 335 340
CAT GCG AGT TTT GGC TAT GAG GGC GAG CCG CAT AAA AGC GTT TTA GAA 1109 His Ala Ser Phe Gly Tyr Glu Gly Glu Pro His Lys Ser Val Leu Glu 345 350 355 ACC TGT ATT TCT AAA ATT AAA GAC TGG TTC AAA TAAAAGGTTG TTGATGCATG 1162 Thr Cys He Ser Lys He Lys Asp Trp Phe Lys 360 365
AGTTTCTAAA AGCTTTTAAA GACGCTTTCC CTCATACCAT TT 1204
(2) INFORMATION FOR SEQ ID NO: 550:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 369 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550:
Val Glu Leu Ser Tyr Tyr Glu He Leu Glu Val Glu Lys His Ser Asn
1 5 10 15
Gin Glu Thr He Lys Lys Ser Tyr Arg Lys Leu Ala Leu Lys Tyr His
20 25 30
Pro Asp Arg Asn Ala Gly Asp Lys Glu Ala Glu Glu Lys Phe Lys Leu
35 40 45
He Asn Glu Ala Tyr Gly Val Leu Ser Asp Glu Lys Lys Arg Ala Leu
50 55 60
Tyr Asp Arg Tyr Gly Lys Lys Gly Leu Asn Gin Ala Gly Ala Ser Gin 65 70 75 80
Gly Asp Phe Ser Asp Phe Phe Glu Asp Leu Gly Ser Phe Phe Glu Asp
85 90 95
Ala Phe Gly Phe Gly Ala Arg Gly Ser Lys Arg Gin Lys Ser Ser He
100 105 110
Ala Pro Asp Tyr Leu Gin Thr Leu Glu Leu Ser Phe Lys Glu Ala Val
115 120 125
Phe Gly Cys Lys Lys Thr He Lys Val Gin Tyr Gin Ser Val Cys Glu
130 135 140
Ser Cys Asp Gly Thr Gly Ala Lys Asp Lys Ala Leu Glu Thr Cys Lys 145 150 155 160
Gin Cys Asn Gly Gin Gly Gin Val Phe Met Arg Gin Gly Phe Met Ser
165 170 175
Phe Ala Gin Thr Cys Gly Ala Cys Gin Gly Lys Gly Lys He Val Lys
180 185 190
Thr Pro Cys Gin Ala Cys Lys Gly Lys Thr Tyr He Leu Lys Asp Glu
195 200 205
Glu He Asp Ala He He Pro Glu Gly He Asp Asp Gin Asn Arg Met
210 215 220
Val Leu Lys Asn Lys Gly Asn Glu Tyr Glu Lys Gly Lys Arg Gly Asp 225 230 235 240
Leu Tyr Leu Glu Ala Gin Val Lys Glu Asp Glu His Phe Lys Arg Glu
245 250 255
Gly Cys Asp Leu Phe He Lys Ala Pro Val Phe Phe Thr Thr He Ala
260 265 270
Leu Gly His Thr He Lys Val Pro Ser Leu Lys Gly Asp Glu Leu Glu 275 280 285 Leu Lys He Pro Arg Asn Ala Arg Asp Lys Gin Thr Phe Ala Phe Arg
290 295 300
Asn Glu Gly Val Lys His Pro Glu Ser Ser Tyr Arg Gly Ser Leu He 305 310 315 320
Val Glu Leu Gin Val He Tyr Pro Lys Ser Leu Asn Lys Glu Gin Gin
325 330 335
Glu Leu Leu Glu Lys Leu His Ala Ser Phe Gly Tyr Glu Gly Glu Pro
340 345 350
His Lys Ser Val Leu Glu Thr Cys He Ser Lys He Lys Asp Trp Phe
355 360 365
Lys
(2) INFORMATION FOR SEQ ID NO: 551:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 810 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 12...779 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551:
CGCATGGAGA A ATG AGG GTA ATG GCC AAA ATT GAA TTG TTA GCC AAA TTC 50 Met Arg Val Met Ala Lys He Glu Leu Leu Ala Lys Phe 1 5 10
ACG CAA ATC GCG CTC CCT AAC AGC CAC CCT TTA TTG AAA AAA GTT TTA 98 Thr Gin He Ala Leu Pro Asn Ser His Pro Leu Leu Lys Lys Val Leu 15 20 25
AAC TAC GCC AAA AAG CAT TTC AGC CAG TGC CAC ATG CTC TCT TCA TCG 146 Asn Tyr Ala Lys Lys His Phe Ser Gin Cys His Met Leu Ser Ser Ser 30 35 40 45
TTA CTC ATC TTA AAC GAC ACG GAA TGC TTT AAA AAA AAC TAC TTG CTT 194 Leu Leu He Leu Asn Asp Thr Glu Cys Phe Lys Lys Asn Tyr Leu Leu 50 55 60
AAT TGG GTC TAT CAT GCC CTT GAA TGC GTG CAT GAA AAA GAT ATT AGC 242 Asn Trp Val Tyr His Ala Leu Glu Cys Val His Glu Lys Asp He Ser 65 70 75
GCG CAT TCT TTA GAA GAG GTT TTA CAA AAA AGC CAC CTG CCC ATA CGC 290 Ala His Ser Leu Glu Glu Val Leu Gin Lys Ser His Leu Pro He Arg ATC AAA ATC ATG GCT CAA AAC ACG CTT TTA GAA AAG ATA GAA GTG AAA 338 He Lys He Met Ala Gin Asn Thr Leu Leu Glu Lys He Glu Val Lys 95 100 105
GTT TTA ACC TTT GGG GCG GAA TAT GCG CTT TTT ATC ACC AAA CAC CCT 386 Val Leu Thr Phe Gly Ala Glu Tyr Ala Leu Phe He Thr Lys His Pro 110 115 120 125
ATC GCC AAG CGG TTT TTA CGC CAA AAA TTT AGC GGC TGT GTG TTT TTA 434 He Ala Lys Arg Phe Leu Arg Gin Lys Phe Ser Gly Cys Val Phe Leu 130 135 140
GAA ACC CAA GAT GAA TTG CAT ATA AGA GGC GAT TCA GAG CGT TTT TGG 482 Glu Thr Gin Asp Glu Leu His He Arg Gly Asp Ser Glu Arg Phe Trp 145 150 155
GAA CTC ATT GTA ACG CTC AAT GAA AAT AGA ATC GTC CAT AAC GCA TGC 530 Glu Leu He Val Thr Leu Asn Glu Asn Arg He Val His Asn Ala Cys 160 165 170
TTA GAT TTC ATC TAC CCT AAT GGC TTT GGC AAG GAC AGC TAC ACC ACT 578 Leu Asp Phe He Tyr Pro Asn Gly Phe Gly Lys Asp Ser Tyr Thr Thr 175 180 185
ATG GCT GAA CGC AAA TTA AAA GAA TGC TAT AAA ACG CTA GGG TTT ATC 626 Met Ala Glu Arg Lys Leu Lys Glu Cys Tyr Lys Thr Leu Gly Phe He 190 195 200 205
AAG CAT GAA GAT TTC AGC GAA GTC AAA AAG CGC TAT TTA GAA TTG GCT 674 Lys His Glu Asp Phe Ser Glu Val Lys Lys Arg Tyr Leu Glu Leu Ala 210 215 220
AAA ACC TAC CAC CCT GAT TTA TGC GAT CTC AAA GAA AAA AAG GCT CTT 722 Lys Thr Tyr His Pro Asp Leu Cys Asp Leu Lys Glu Lys Lys Ala Leu 225 230 235
TAT GCC AAA CGC TTC GCT ATC ATT CAA GAG GCG TAT CGC CAC ATT AAA 770 Tyr Ala Lys Arg Phe Ala He He Gin Glu Ala Tyr Arg His He Lys 240 245 250
AAA CAC GCC TAAACCCCTA AACTAGCCCT AATCGCGCTA G 810
Lys His Ala 255
(2) INFORMATION FOR SEQ ID NO:552:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 256 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:552:
Met Arg Val Met Ala Lys He Glu Leu Leu Ala Lys Phe Thr Gin He
1 5 10 15
Ala Leu Pro Asn Ser His Pro Leu Leu Lys Lys Val Leu Asn Tyr Ala
20 25 30
Lys Lys His Phe Ser Gin Cys His Met Leu Ser Ser Ser Leu Leu He
35 40 45
Leu Asn Asp Thr Glu Cys Phe Lys Lys Asn Tyr Leu Leu Asn Trp Val
50 55 60
Tyr His Ala Leu Glu Cys Val His Glu Lys Asp He Ser Ala His Ser 65 70 75 80
Leu Glu Glu Val Leu Gin Lys Ser His Leu Pro He Arg He Lys He
85 90 95
Met Ala Gin Asn Thr Leu Leu Glu Lys He Glu Val Lys Val Leu Thr
100 105 110
Phe Gly Ala Glu Tyr Ala Leu Phe He Thr Lys His Pro He Ala Lys
115 120 125
Arg Phe Leu Arg Gin Lys Phe Ser Gly Cys Val Phe Leu Glu Thr Gin
130 135 140
Asp Glu Leu His He Arg Gly Asp Ser Glu Arg Phe Trp Glu Leu He 145 150 155 160
Val Thr Leu Asn Glu Asn Arg He Val His Asn Ala Cys Leu Asp Phe
165 170 175
He Tyr Pro Asn Gly Phe Gly Lys Asp Ser Tyr Thr Thr Met Ala Glu
180 185 190
Arg Lys Leu Lys Glu Cys Tyr Lys Thr Leu Gly Phe He Lys His Glu
195 200 205
Asp Phe Ser Glu Val Lys Lys Arg Tyr Leu Glu Leu Ala Lys Thr Tyr
210 215 220
His Pro Asp Leu Cys Asp Leu Lys Glu Lys Lys Ala Leu Tyr Ala Lys 225 230 235 240
Arg Phe Ala He He Gin Glu Ala Tyr Arg His He Lys Lys His Ala 245 250 255
(2) INFORMATION FOR SEQ ID NO:553:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 900 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 59...778 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:553: CATGATGATT ATTATAAAAT CCTAACGCCG CATGAACAAA TAGGATGGGT CAAAAAAG 58
ATG AAG TCA AAT AAA AAG TCC AAT CGT TTA AGA GCG ATT TAT AGA GCT 106 Met Lys Ser Asn Lys Lys Ser Asn Arg Leu Arg Ala He Tyr Arg Ala 1 5 10 15
TTA GTG ATC GCT ATA GGA CTA GCT GTT ATC ATC GTT TTC AAT TAC TTT 154 Leu Val He Ala He Gly Leu Ala Val He He Val Phe Asn Tyr Phe 20 25 30
AAC CGC AAA AAC AAT AAC GCC CGC TCC AGC CGT AGG GCT TGT TCG TGC 202 Asn Arg Lys Asn Asn Asn Ala Arg Ser Ser Arg Arg Ala Cys Ser Cys 35 40 45
TTT TTT TCC CTT ACC GGG GTT AAT TTA GAA AAA ATA GGC ACT TTT GAT 250 Phe Phe Ser Leu Thr Gly Val Asn Leu Glu Lys He Gly Thr Phe Asp 50 55 60
ACG GAC GCT AAA CTC ATT GTC TTA AAC CAC CAA AGC TTA CTA GAC ATC 298 Thr Asp Ala Lys Leu He Val Leu Asn His Gin Ser Leu Leu Asp He 65 70 75 80
ATT TAT TTA GAA GCC TAC CAC CCT AGA AAT ATT TGC TGG ATC GCT AAA 346 He Tyr Leu Glu Ala Tyr His Pro Arg Asn He Cys Trp He Ala Lys 85 90 95
AAA GAG CTG GGC GAA ATC CCT TTT TAT GGG CAT GCC TTA ACG GAT ACC 394 Lys Glu Leu Gly Glu He Pro Phe Tyr Gly His Ala Leu Thr Asp Thr 100 105 110
GGA ATG ATT TTA ATT GAC AGA GAG GAT AAA AAG GGG ATT GTG AGC CTT 442 Gly Met He Leu He Asp Arg Glu Asp Lys Lys Gly He Val Ser Leu 115 120 125
TTG AAA GCG TGT AAA GAA AAA TTA GAC CAA AAC CGC CCT TTA GTG ATT 490 Leu Lys Ala Cys Lys Glu Lys Leu Asp Gin Asn Arg Pro Leu Val He 130 135 140
TTC CCT GAA GGC ACT AGA GGC AAA GGA GGA GAA AAA TTC CTC CCT TTC 538 Phe Pro Glu Gly Thr Arg Gly Lys Gly Gly Glu Lys Phe Leu Pro Phe 145 150 155 160
AAG CAA GGG GCT AAA ATC ATC GCC GAA AAA TTC CAG CTC AAA ATC CAA 586 Lys Gin Gly Ala Lys He He Ala Glu Lys Phe Gin Leu Lys He Gin 165 170 175
CCC ATG GTG TTA ATC AAT TCC ATT AAA ATC TTT AAT TCC AAG CCT CTA 634 Pro Met Val Leu He Asn Ser He Lys He Phe Asn Ser Lys Pro Leu 180 185 190
GAA GCC TAT AAA GCG CGC ACC CGT TTA GTC ATG CTA GAA AGC TAT ACG 682 Glu Ala Tyr Lys Ala Arg Thr Arg Leu Val Met Leu Glu Ser Tyr Thr 195 200 205
CCT GAT TTT AAC TCG CCC ACC TGG TAT GAA GAA TTA CAA GAA CGC ATG 730 Pro Asp Phe Asn Ser Pro Thr Trp Tyr Glu Glu Leu Gin Glu Arg Met 210 215 220
CAA AAA GAG TAT TTA AAA CAC TAT CAT GAA TTA AAC CCT AGC GAA CAA TGA 781 Gin Lys Glu Tyr Leu Lys His Tyr His Glu Leu Asn Pro Ser Glu Gin 225 230 235 240
AGCTTTTTGA CTACGCTCCT TTGAGTTTGG CTTGGCGGGA GTTTTTGCAA AGCGAATTTA 841 AAAAGCCTTA TTTTTTAGAA ATAGAAAAAC GCTACCTAGA AGCCCTAAAA ATCCCTAAA 900
(2) INFORMATION FOR SEQ ID NO: 554:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 240 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:554:
Met Lys Ser Asn Lys Lys Ser Asn Arg Leu Arg Ala He Tyr Arg Ala
1 5 10 15
Leu Val He Ala He Gly Leu Ala Val He He Val Phe Asn Tyr Phe
20 25 30
Asn Arg Lys Asn Asn Asn Ala Arg Ser Ser Arg Arg Ala Cys Ser Cys
35 40 45
Phe Phe Ser Leu Thr Gly Val Asn Leu Glu Lys He Gly Thr Phe Asp
50 55 60
Thr Asp Ala Lys Leu He Val Leu Asn His Gin Ser Leu Leu Asp He 65 70 75 80
He Tyr Leu Glu Ala Tyr His Pro Arg Asn He Cys Trp He Ala Lys
85 90 95
Lys Glu Leu Gly Glu He Pro Phe Tyr Gly His Ala Leu Thr Asp Thr
100 105 110
Gly Met He Leu He Asp Arg Glu Asp Lys Lys Gly He Val Ser Leu
115 120 125
Leu Lys Ala Cys Lys Glu Lys Leu Asp Gin Asn Arg Pro Leu Val He
130 135 140
Phe Pro Glu Gly Thr Arg Gly Lys Gly Gly Glu Lys Phe Leu Pro Phe 145 150 155 160
Lys Gin Gly Ala Lys He He Ala Glu Lys Phe Gin Leu Lys He Gin
165 170 175
Pro Met Val Leu He Asn Ser He Lys He Phe Asn Ser Lys Pro Leu
180 185 190
Glu Ala Tyr Lys Ala Arg Thr Arg Leu Val Met Leu Glu Ser Tyr Thr
195 200 205
Pro Asp Phe Asn Ser Pro Thr Trp Tyr Glu Glu Leu Gin Glu Arg Met 210 215 220 Gin Lys Glu Tyr Leu Lys His Tyr His Glu Leu Asn Pro Ser Glu Gin 225 230 235 240
(2) INFORMATION FOR SEQ ID NO: 555:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 990 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 40...858 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555:
GGCCAAACTC GCTTTAACTA AAATGATGGA GTTATCTTA ATG GAG ATT AGA ACC 54
Met Glu He Arg Thr 1 5
TTT TTA GAA CGC GCT TTA AAA GAA GAT TTA GGG CAT GGG GAT TTG TTT 102 Phe Leu Glu Arg Ala Leu Lys Glu Asp Leu Gly His Gly Asp Leu Phe 10 15 20
GAA AGG GTG TTA GAA AAA GAT TTT AAG GCC ACA GCG TTT GTT AGG GCT 150 Glu Arg Val Leu Glu Lys Asp Phe Lys Ala Thr Ala Phe Val Arg Ala 25 30 35
AAA CAA GAG GGC GTG TTT TCA GGC GAA AAA TAC GCT TTA GAG TTG TTG 198 Lys Gin Glu Gly Val Phe Ser Gly Glu Lys Tyr Ala Leu Glu Leu Leu 40 45 50
GAA ATG ACC GGC ATT GAA TGC GTT CAA ACG ATT AAG GAT AAA GAA CGC 246 Glu Met Thr Gly He Glu Cys Val Gin Thr He Lys Asp Lys Glu Arg 55 60 65
TTC AAG CCT AAA GAC GCT TTA ATG GAA ATT AGG GGG GAT TTT AGC ATG 294 Phe Lys Pro Lys Asp Ala Leu Met Glu He Arg Gly Asp Phe Ser Met 70 75 80 85
CTT TTA AAG GTT GAG CGC ACC CTT TTA AAC CTT TTG CAA CAC AGC AGC 342 Leu Leu Lys Val Glu Arg Thr Leu Leu Asn Leu Leu Gin His Ser Ser 90 95 100
GGG ATT GCT ACT TTA ACG AGC CGT TTT GTA GAG GCT TTA AAT TCT CAT 390 Gly He Ala Thr Leu Thr Ser Arg Phe Val Glu Ala Leu Asn Ser His 105 110 115 AAA GTG CGT TTG TTG GAC ACG AGA AAA ACG AGA CCC CTT TTA AGG ATC 438 Lys Val Arg Leu Leu Asp Thr Arg Lys Thr Arg Pro Leu Leu Arg He 120 125 130
TTT GAA AAA TAT TCC GTG CTT AAT GGG GGA GCG AGC AAC CAC CGC TTA 486 Phe Glu Lys Tyr Ser Val Leu Asn Gly Gly Ala Ser Asn His Arg Leu 135 140 145
GGG CTA GAT GAC GCT TTA ATG CTT AAA GAC ACG CAT TTA AGG CAT GTG 534 Gly Leu Asp Asp Ala Leu Met Leu Lys Asp Thr His Leu Arg His Val 150 155 160 165
AAA GAT CTC AAA AGC TTT TTA ACG CAT GCC AGA AAA AAC TTG CCT TTC 582 Lys Asp Leu Lys Ser Phe Leu Thr His Ala Arg Lys Asn Leu Pro Phe 170 175 180
ACG GCT AAA ATT GAA ATT GAA TGC GAA AGC TTT GAA GAG GCC AAA AAC 630 Thr Ala Lys He Glu He Glu Cys Glu Ser Phe Glu Glu Ala Lys Asn 185 190 195
GCC ATG AAT GCG GGA GCG GAT ATT GTG ATG TGC GAT AAT TTG AGC GTT 678 Ala Met Asn Ala Gly Ala Asp He Val Met Cys Asp Asn Leu Ser Val 200 205 210
TTA GAG ACT AAA GAA ATT GCC GCT TAT AGA GAT GCG CAT TAT CCC TTT 726 Leu Glu Thr Lys Glu He Ala Ala Tyr Arg Asp Ala His Tyr Pro Phe 215 220 225
GTT TTA CTG GAA GCG AGC GGG AAC ATT TCA CTA GAG AGC ATT AAC GCT 774 Val Leu Leu Glu Ala Ser Gly Asn He Ser Leu Glu Ser He Asn Ala 230 235 240 245
TAC GCT AAA AGC GGC GTG GAT GCC ATT AGC GTA GGG GCT TTA ATC CAT 822 Tyr Ala Lys Ser Gly Val Asp Ala He Ser Val Gly Ala Leu He His 250 255 260
CAA GCC ACT TTT ATT GAC ATG CAC ATG AAA ATG GCT TAAAGACTTT AAAAAG 874 Gin Ala Thr Phe He Asp Met His Met Lys Met Ala 265 270
GGGTTATTAA CATGCTAAAA GAATATTTAG AAAGCATTAA AGATCTTACG CCTGAAAAGA 934 ATGAACTCAC GCACCGCCCT TCTTTATACA ACTTGCTTAA TCAGTTAAAA AACCAT 990
(2) INFORMATION FOR SEQ ID NO: 556:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 273 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 556: Met Glu He Arg Thr Phe Leu Glu Arg Ala Leu Lys Glu Asp Leu Gly
1 5 10 15
His Gly Asp Leu Phe Glu Arg Val Leu Glu Lys Asp Phe Lys Ala Thr
20 25 30
Ala Phe Val Arg Ala Lys Gin Glu Gly Val Phe Ser Gly Glu Lys Tyr
35 40 45
Ala Leu Glu Leu Leu Glu Met Thr Gly He Glu Cys Val Gin Thr He
50 55 60
Lys Asp Lys Glu Arg Phe Lys Pro Lys Asp Ala Leu Met Glu He Arg 65 70 75 80
Gly Asp Phe Ser Met Leu Leu Lys Val Glu Arg Thr Leu Leu Asn Leu
85 90 95
Leu Gin His Ser Ser Gly He Ala Thr Leu Thr Ser Arg Phe Val Glu
100 105 110
Ala Leu Asn Ser His Lys Val Arg Leu Leu Asp Thr Arg Lys Thr Arg
115 120 125
Pro Leu Leu Arg He Phe Glu Lys Tyr Ser Val Leu Asn Gly Gly Ala
130 135 140
Ser Asn His Arg Leu Gly Leu Asp Asp Ala Leu Met Leu Lys Asp Thr 145 150 155 160
His Leu Arg His Val Lys Asp Leu Lys Ser Phe Leu Thr His Ala Arg
165 170 175
Lys Asn Leu Pro Phe Thr Ala Lys He Glu He Glu Cys Glu Ser Phe
180 185 190
Glu Glu Ala Lys Asn Ala Met Asn Ala Gly Ala Asp He Val Met Cys
195 200 205
Asp Asn Leu Ser Val Leu Glu Thr Lys Glu He Ala Ala Tyr Arg Asp
210 215 220
Ala His Tyr Pro Phe Val Leu Leu Glu Ala Ser Gly Asn He Ser Leu 225 230 235 240
Glu Ser He Asn Ala Tyr Ala Lys Ser Gly Val Asp Ala He Ser Val
245 250 255
Gly Ala Leu He His Gin Ala Thr Phe He Asp Met His Met Lys Met
260 265 270
Ala
(2) INFORMATION FOR SEQ ID NO: 557:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1153 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1100 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:557:
GATCAAACGC TACAACCCTA GCTGTCTTTT AGAAGTGGAT GGGGGCGTGA ATG ATA 56
Met He 1
AAA ATA TCT TTG AAC TCC AAC AAG CGG GCG TGG ATG TGG TGG TTT CAG 104 Lys He Ser Leu Asn Ser Asn Lys Arg Ala Trp Met Trp Trp Phe Gin 5 10 15
GGA GTT ATA TTT TTA AAT CCA AAG ATC GTA AGC TGG CTA TTG AAG GCT 152 Gly Val He Phe Leu Asn Pro Lys He Val Ser Trp Leu Leu Lys Ala 20 25 30
TAC AGA ATG TCA GAC AAT CTC TTG CAT AAA GAC ATC CAA GCC CTA ATC 200 Tyr Arg Met Ser Asp Asn Leu Leu His Lys Asp He Gin Ala Leu He 35 40 45 50
GCT CGC TTA AAG CGC CAG GAC TTA AGC TTG GGC ATG CTA GAA AAA TCG 248 Ala Arg Leu Lys Arg Gin Asp Leu Ser Leu Gly Met Leu Glu Lys Ser 55 60 65
CTC TCT CGC CTT ATT CAT GAT GAA ATC AAT TTG GAG TAT TTG AAG GCG 296 Leu Ser Arg Leu He His Asp Glu He Asn Leu Glu Tyr Leu Lys Ala 70 75 80
TGC GGG CTC AAT TTC ATA GAA ACG AGC GAA AAT TTA ATC ACG CTC AAA 344 Cys Gly Leu Asn Phe He Glu Thr Ser Glu Asn Leu He Thr Leu Lys 85 90 95
AAC CTT AAA ACC CCC CTT AAA GAT GAG GTT TTT TCC TTT ATT GAT TTA 392 Asn Leu Lys Thr Pro Leu Lys Asp Glu Val Phe Ser Phe He Asp Leu 100 105 110
GAA ACC ACC GGA TCT TGC CCC ATA AAG CAT GAG ATT TTA GAA ATT GGG 440 Glu Thr Thr Gly Ser Cys Pro He Lys His Glu He Leu Glu He Gly 115 120 125 130
GCC GTG CAA GTG AAA GGG GGG GAA ATT ATC AAT CGT TTT GAA ACC CTT 488 Ala Val Gin Val Lys Gly Gly Glu He He Asn Arg Phe Glu Thr Leu 135 140 145
GTG AAA GTC AAA AGC GTG CCT GAT TAT ATC GCT GAG CTT ACA GGC ATC 536 Val Lys Val Lys Ser Val Pro Asp Tyr He Ala Glu Leu Thr Gly He 150 155 160
ACT TAT GAA GAC ACC CTA AAC GCC CCA AGC GCG CAT GAA GCT TTG CAA 584 Thr Tyr Glu Asp Thr Leu Asn Ala Pro Ser Ala His Glu Ala Leu Gin 165 170 175
GAA TTG CGG CTT TTT TTA GGC AAT AGC GTG TTT GTG GCC CAC AAC GCT 632 Glu Leu Arg Leu Phe Leu Gly Asn Ser Val Phe Val Ala His Asn Ala 180 185 190
AAT TTT GAT TAC AAC TTT TTG GGG CGT TAT TTT GTA GAA AAA TTG CAT 680 Asn Phe Asp Tyr Asn Phe Leu Gly Arg Tyr Phe Val Glu Lys Leu His 195 200 205 210
TGC CCT TTA TTG AAT TTA AAG CTT TGC ACT CTG GAT TTA TCC AAA CGA 728 Cys Pro Leu Leu Asn Leu Lys Leu Cys Thr Leu Asp Leu Ser Lys Arg 215 220 225
GCC ATT TTG TCC ATG CGT TAT TCT TTG AGC TTT TTA AAA GAG CTT TTA 776 Ala He Leu Ser Met Arg Tyr Ser Leu Ser Phe Leu Lys Glu Leu Leu 230 235 240
GGG TTT GGT ATA GAA GTC AGC CAT AGA GCC TAT GCG GAC GCT TTA GCG 824 Gly Phe Gly He Glu Val Ser His Arg Ala Tyr Ala Asp Ala Leu Ala 245 250 255
AGC TAT AAA CTC TTT GAA ATA TGC CTG TTA AAC TTG CCC AGC TAC ATC 872 Ser Tyr Lys Leu Phe Glu He Cys Leu Leu Asn Leu Pro Ser Tyr He 260 265 270
AAA ACG ACA ATG GAT TTG ATT GAT TTT TCT AAA TGT GCT AAC ACG CTA 920 Lys Thr Thr Met Asp Leu He Asp Phe Ser Lys Cys Ala Asn Thr Leu 275 280 285 290
ATC AAA AGA CCC CCA AAA GCC AGA TAC CAA GAG ATT CCA TCG CCA TTT 968 He Lys Arg Pro Pro Lys Ala Arg Tyr Gin Glu He Pro Ser Pro Phe 295 300 305
TCT CTT TTT GAA AAG ACA AAG GGC TTG TTC AAT CAT AAA AGC AAC CAG 1016 Ser Leu Phe Glu Lys Thr Lys Gly Leu Phe Asn His Lys Ser Asn Gin 310 315 320
TTA AAC GAA AGC TGT TTA ATG GGG TTT ATG GGG ACT GAA ATT TTA GCA 1064 Leu Asn Glu Ser Cys Leu Met Gly Phe Met Gly Thr Glu He Leu Ala 325 330 335
TCT CTA TTT GAT ACT TTT GAA TGT TGC CTA GTA TTT TGATTTTATC GGTTAC 1116 Ser Leu Phe Asp Thr Phe Glu Cys Cys Leu Val Phe 340 345 350
TTCGCACTCA TCGTATATCT TTTTGTATTC TTGTATG 1153
(2) INFORMATION FOR SEQ ID NO: 558:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 350 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558:
Met He Lys He Ser Leu Asn Ser Asn Lys Arg Ala Trp Met Trp Trp 1 5 10 15
Phe Gin Gly Val He Phe Leu Asn Pro Lys He Val Ser Trp Leu Leu
20 25 30
Lys Ala Tyr Arg Met Ser Asp Asn Leu Leu His Lys Asp He Gin Ala
35 40 45
Leu He Ala Arg Leu Lys Arg Gin Asp Leu Ser Leu Gly Met Leu Glu
50 55 60
Lys Ser Leu Ser Arg Leu He His Asp Glu He Asn Leu Glu Tyr Leu 65 70 75 80
Lys Ala Cys Gly Leu Asn Phe He Glu Thr Ser Glu Asn Leu He Thr
85 90 95
Leu Lys Asn Leu Lys Thr Pro Leu Lys Asp Glu Val Phe Ser Phe He
100 105 110
Asp Leu Glu Thr Thr Gly Ser Cys Pro He Lys His Glu He Leu Glu
115 120 125
He Gly Ala Val Gin Val Lys Gly Gly Glu He He Asn Arg Phe Glu
130 135 140
Thr Leu Val Lys Val Lys Ser Val Pro Asp Tyr He Ala Glu Leu Thr 145 150 155 160
Gly He Thr Tyr Glu Asp Thr Leu Asn Ala Pro Ser Ala His Glu Ala
165 170 175
Leu Gin Glu Leu Arg Leu Phe Leu Gly Asn Ser Val Phe Val Ala His
180 185 190
Asn Ala Asn Phe Asp Tyr Asn Phe Leu Gly Arg Tyr Phe Val Glu Lys
195 200 205
Leu His Cys Pro Leu Leu Asn Leu Lys Leu Cys Thr Leu Asp Leu Ser
210 215 220
Lys Arg Ala He Leu Ser Met Arg Tyr Ser Leu Ser Phe Leu Lys Glu 225 230 235 240
Leu Leu Gly Phe Gly He Glu Val Ser His Arg Ala Tyr Ala Asp Ala
245 250 255
Leu Ala Ser Tyr Lys Leu Phe Glu He Cys Leu Leu Asn Leu Pro Ser
260 265 270
Tyr He Lys Thr Thr Met Asp Leu He Asp Phe Ser Lys Cys Ala Asn
275 280 285
Thr Leu He Lys Arg Pro Pro Lys Ala Arg Tyr Gin Glu He Pro Ser
290 295 300
Pro Phe Ser Leu Phe Glu Lys Thr Lys Gly Leu Phe Asn His Lys Ser 305 310 315 320
Asn Gin Leu Asn Glu Ser Cys Leu Met Gly Phe Met Gly Thr Glu He
325 330 335
Leu Ala Ser Leu Phe Asp Thr Phe Glu Cys Cys Leu Val Phe 340 345 350
(2) INFORMATION FOR SEQ ID NO: 559:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 990 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...864 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559:
AAAAACACTT TTTAATGTTA TAATCTATCC TAAACAATAT AAGGGGTTTT T ATG GCA 57
Met Ala
1
AAA ATT GAA AGC AAT GAT TCC CAC CTA AGA GGT ATT TTA AAA GAC GAA 105 Lys He Glu Ser Asn Asp Ser His Leu Arg Gly He Leu Lys Asp Glu 5 10 15
CTC TAC TAT CAA ATC CCC ATC TAC CAA CGC CCT TAT CAA TGG ACA GAA 153 Leu Tyr Tyr Gin He Pro He Tyr Gin Arg Pro Tyr Gin Trp Thr Glu 20 25 30
GAA AAC TGC GAA AAA CTT TTA GAC GAT TTG TTT TTT AAT TAT GAA GAT 201 Glu Asn Cys Glu Lys Leu Leu Asp Asp Leu Phe Phe Asn Tyr Glu Asp 35 40 45 50
GAC AGA GAA GGC GAT TAT TTT TGC GGC TCA TTA GTC TTA ATT GCA ATC 249 Asp Arg Glu Gly Asp Tyr Phe Cys Gly Ser Leu Val Leu He Ala He 55 60 65
AGC AAA GAT TCT AAA GCC ACA ACC TAT GAT GTT GTA GAT GGC CAG CAA 297 Ser Lys Asp Ser Lys Ala Thr Thr Tyr Asp Val Val Asp Gly Gin Gin 70 75 80
CGC TTA AGC ACT TTC ATT CTG CTT GCA AAA GTT TTA GCC GAT CTT TAT 345 Arg Leu Ser Thr Phe He Leu Leu Ala Lys Val Leu Ala Asp Leu Tyr 85 90 95
AAT GAT TGT TTA GAC CCT AAG AAT TTA GAA CAT TTA CAA GAG GGT TGG 393 Asn Asp Cys Leu Asp Pro Lys Asn Leu Glu His Leu Gin Glu Gly Trp 100 105 110
AAA GAT AGG CAT ACA GAA AGA AAA CGA CTG AGT TTT AAC ACT ATA GGG 441 Lys Asp Arg His Thr Glu Arg Lys Arg Leu Ser Phe Asn Thr He Gly 115 120 125 130
TCT AAC GCT GAA TAT GAT TTT CAA GAT GCA TTA GAA CAT TTC AAC GAC 489 Ser Asn Ala Glu Tyr Asp Phe Gin Asp Ala Leu Glu His Phe Asn Asp 135 140 145
TCT CAA GCA AGC AAG AAT AAA AAT AAT AAG AAC AAT TAC CTA AAA AAT 537 Ser Gin Ala Ser Lys Asn Lys Asn Asn Lys Asn Asn Tyr Leu Lys Asn 150 155 160
GCG ATC TGT TTA AAA GAC TAT CTC ATG AAA AAA GAG ATT AAA AAC ATT 585 Ala He Cys Leu Lys Asp Tyr Leu Met Lys Lys Glu He Lys Asn He 165 170 175
AAC GAT TTC ATT GAG TGG CTG TAT TCT AAT GTT AAA TTT ATC ACC ATC 633 Asn Asp Phe He Glu Trp Leu Tyr Ser Asn Val Lys Phe He Thr He 180 185 190
ATT TGC CCA AAC ATA GAC AAG GCA TTA AGG ATT TTT AAT GTT TTA AAC 681 He Cys Pro Asn He Asp Lys Ala Leu Arg He Phe Asn Val Leu Asn 195 200 205 210
GCT AGG GGT TTG CCT TTG AAT GCG ACA GAT ATT TTT AAG GGG GAA TTA 729 Ala Arg Gly Leu Pro Leu Asn Ala Thr Asp He Phe Lys Gly Glu Leu 215 220 225
TTA AAA CAC GCT AAA GAG CAT GAG CGA GAA GAA TTT GTG TCT CGT TGG 777 Leu Lys His Ala Lys Glu His Glu Arg Glu Glu Phe Val Ser Arg Trp 230 235 240
AAC GCC TTA AGC CAA AAA TGC TCG GAC AAT GAT TTA ACA ATG GAG ACA 825 Asn Ala Leu Ser Gin Lys Cys Ser Asp Asn Asp Leu Thr Met Glu Thr 245 250 255
TTA TTC AGT TGG TAT AAC CTA TCT CAA TCC GGT AAC TTC TAGAGACAAA AT 876 Leu Phe Ser Trp Tyr Asn Leu Ser Gin Ser Gly Asn Phe 260 265 270
GGAAAAAGAG CTCGTTACTT GGTTCAACAT ACTTAACAAA CCCCCCCTAG AATACCTTAA 936 GGGCGTAGAG GATTTTTACA ACGCTTATGG TGAGGTGTTA GAAATGCAAG ATCG 990
(2) INFORMATION FOR SEQ ID NO: 560:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 271 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560:
Met Ala Lys He Glu Ser Asn Asp Ser His Leu Arg Gly He Leu Lys
1 5 10 15
Asp Glu Leu Tyr Tyr Gin He Pro He Tyr Gin Arg Pro Tyr Gin Trp
20 25 30
Thr Glu Glu Asn Cys Glu Lys Leu Leu Asp Asp Leu Phe Phe Asn Tyr
35 40 45
Glu Asp Asp Arg Glu Gly Asp Tyr Phe Cys Gly Ser Leu Val Leu He
50 55 60
Ala He Ser Lys Asp Ser Lys Ala Thr Thr Tyr Asp Val Val Asp Gly
65 70 75 80
Gin Gin Arg Leu Ser Thr Phe He Leu Leu Ala Lys Val Leu Ala Asp
85 90 95
Leu Tyr Asn Asp Cys Leu Asp Pro Lys Asn Leu Glu His Leu Gin Glu 100 105 110
Gly Trp Lys Asp Arg His Thr Glu Arg Lys Arg Leu Ser Phe Asn Thr
115 120 125
He Gly Ser Asn Ala Glu Tyr Asp Phe Gin Asp Ala Leu Glu His Phe
130 135 140
Asn Asp Ser Gin Ala Ser Lys Asn Lys Asn Asn Lys Asn Asn Tyr Leu 145 150 155 160
Lys Asn Ala He Cys Leu Lys Asp Tyr Leu Met Lys Lys Glu He Lys
165 170 175
Asn He Asn Asp Phe He Glu Trp Leu Tyr Ser Asn Val Lys Phe He
180 185 190
Thr He He Cys Pro Asn He Asp Lys Ala Leu Arg He Phe Asn Val
195 200 205
Leu Asn Ala Arg Gly Leu Pro Leu Asn Ala Thr Asp He Phe Lys Gly
210 215 220
Glu Leu Leu Lys His Ala Lys Glu His Glu Arg Glu Glu Phe Val Ser 225 230 235 240
Arg Trp Asn Ala Leu Ser Gin Lys Cys Ser Asp Asn Asp Leu Thr Met
245 250 255
Glu Thr Leu Phe Ser Trp Tyr Asn Leu Ser Gin Ser Gly Asn Phe 260 265 270
(2) INFORMATION FOR SEQ ID NO: 561:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 283 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...230 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561:
CCCTAGAATG CGTGCTTTTG GCTAAGGAAT TTTTGCCTAA CGCTAGGCTT ATG GTG 56
Met Val
1
GCT GGG GGG CGT GAA GTG GTG TTT AAA GAT AAT GAC AAA AAG GAA GCC 104 Ala Gly Gly Arg Glu Val Val Phe Lys Asp Asn Asp Lys Lys Glu Ala 5 10 15
AAG CTT TTT GAA TAC GGC ATC AAT GCG GTG GTG CTA GGG GAC TAT TTG 152 Lys Leu Phe Glu Tyr Gly He Asn Ala Val Val Leu Gly Asp Tyr Leu 20 25 30
ACC ACC AAA GGC AAA GCC CCT AAA AAA GAT ATA GAA AAA CTG CTC TCT 200 Thr Thr Lys Gly Lys Ala Pro Lys Lys Asp He Glu Lys Leu Leu Ser 35 40 45 50
TAT GGC TTG ACA ATG GCG ACA AGC TGT CAT TAATGAGAGA ACTTTTTAAA AGC 253 Tyr Gly Leu Thr Met Ala Thr Ser Cys His 55 60
GTTAGAGGGT TTTTTCGCCT TCTTAGAATG 283
(2) INFORMATION FOR SEQ ID NO: 562:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 60 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:562:
Met Val Ala Gly Gly Arg Glu Val Val Phe Lys Asp Asn Asp Lys Lys
1 5 10 15
Glu Ala Lys Leu Phe Glu Tyr Gly He Asn Ala Val Val Leu Gly Asp
20 25 30
Tyr Leu Thr Thr Lys Gly Lys Ala Pro Lys Lys Asp He Glu Lys Leu
35 40 45
Leu Ser Tyr Gly Leu Thr Met Ala Thr Ser Cys His 50 55 60
(2) INFORMATION FOR SEQ ID NO: 563:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 478 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...425 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:563:
CACTTGTTTG GAGTGCCTAT AGGCATAATA AGTCCTGTTT CTATTTTTAA TGG TGG 56
Trp Trp 1 TAT GAT AAC AAT GTC AAC TTA CAG CTT TTT TAT GGA TTT TTA CAC AAT 104 Tyr Asp Asn Asn Val Asn Leu Gin Leu Phe Tyr Gly Phe Leu His Asn 5 10 15
GTG TAT GAA AAT GAG AAG TTT TTC ATC GGT TAT TTT ATA GGG GCT GGG 152 Val Tyr Glu Asn Glu Lys Phe Phe He Gly Tyr Phe He Gly Ala Gly 20 25 30
CTA GGG GGT GAG AGC GTA ACA CCC AAT GTT CTT AAA GAT TTT GGT AAT 200 Leu Gly Gly Glu Ser Val Thr Pro Asn Val Leu Lys Asp Phe Gly Asn 35 40 45 50
ATG TTA GCG CAA TTA GTG CAA TTT CAG GGC TAT GGC TCA CTA GGG CTA 248 Met Leu Ala Gin Leu Val Gin Phe Gin Gly Tyr Gly Ser Leu Gly Leu 55 60 65
AGG ATG GGC GAT AAA CAC CAC ACG CTA GAA TTG AGC ACG AGC GTT CAT 296 Arg Met Gly Asp Lys His His Thr Leu Glu Leu Ser Thr Ser Val His 70 75 80
GGC GAC GCT CCT AGT TGT TCT TTA AAA AAG CTA AAG AGT TGC GAA AGT 344 Gly Asp Ala Pro Ser Cys Ser Leu Lys Lys Leu Lys Ser Cys Glu Ser 85 90 95
GCG AGG GTT TTA CAA GCA AAA ATC CCT AGG GGC ATT TTT GAA AGC TAT 392 Ala Arg Val Leu Gin Ala Lys He Pro Arg Gly He Phe Glu Ser Tyr 100 105 110
GTT ACT TGG AGC GCG GAT TAT GTT TAT CGT TTT TAAAAGTTTT TAAAAATTTA 445 Val Thr Trp Ser Ala Asp Tyr Val Tyr Arg Phe 115 120 125
ATGGCTTTGT TCCAATTGAA TAGGGGTAAT AAA 478
(2) INFORMATION FOR SEQ ID NO:564:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 125 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:564:
Trp Trp Tyr Asp Asn Asn Val Asn Leu Gin Leu Phe Tyr Gly Phe Leu
1 5 10 15
His Asn Val Tyr Glu Asn Glu Lys Phe Phe He Gly Tyr Phe He Gly
20 25 30
Ala Gly Leu Gly Gly Glu Ser Val Thr Pro Asn Val Leu Lys Asp Phe
35 40 45
Gly Asn Met Leu Ala Gin Leu Val Gin Phe Gin Gly Tyr Gly Ser Leu 50 55 60 Gly Leu Arg Met Gly Asp Lys His His Thr Leu Glu Leu Ser Thr Ser 65 70 75 80
Val His Gly Asp Ala Pro Ser Cys Ser Leu Lys Lys Leu Lys Ser Cys
85 90 95
Glu Ser Ala Arg Val Leu Gin Ala Lys He Pro Arg Gly He Phe Glu
100 105 110
Ser Tyr Val Thr Trp Ser Ala Asp Tyr Val Tyr Arg Phe 115 120 125
(2) INFORMATION FOR SEQ ID NO:565:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2169 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 53...2119 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:565:
CTTTTAAAAG GCTAATGCCT TTTTAAAAAA TTGAAATAAA GGAATAAAAG TT ATG ACG 58
Met Thr 1
GAT AAC AAC CAA AAC AAT GAA AAC CAT GAA AAC AGC AGT GAA AAT TCA 106 Asp Asn Asn Gin Asn Asn Glu Asn His Glu Asn Ser Ser Glu Asn Ser 5 10 15
AAA GCT GAT GAG ATG CGA GCC GGA GCG TTT GAG CGC TTC ACC AAC CGC 154 Lys Ala Asp Glu Met Arg Ala Gly Ala Phe Glu Arg Phe Thr Asn Arg 20 25 30
AAA AAG CGT TTC AGA GAA AAC GCG CAA AAA AAC GCA GAG TAT TCA AAC 202 Lys Lys Arg Phe Arg Glu Asn Ala Gin Lys Asn Ala Glu Tyr Ser Asn 35 40 45 50
CAT GAA GCG TCT TCG CAC CAT AAA AAA GAG CAT CGC CCT AAC AAA AAA 250 His Glu Ala Ser Ser His His Lys Lys Glu His Arg Pro Asn Lys Lys 55 60 65
CCA AAC AAC CAC CAC AAA CAA AAA CAT GCC AAA ACA CGA AAT TAC GCC 298 Pro Asn Asn His His Lys Gin Lys His Ala Lys Thr Arg Asn Tyr Ala 70 75 80
CAA GAA GAA TTG GAT AGC AAC AAA GTA GAG GGC GTT ACG GAA ATT TTG 346 Gin Glu Glu Leu Asp Ser Asn Lys Val Glu Gly Val Thr Glu He Leu 85 90 95
CAT GTG AAT GAG AGA GGG ACT TTA GGC TTT CAT AAG GAG TTA AAA AAG 394 His Val Asn Glu Arg Gly Thr Leu Gly Phe His Lys Glu Leu Lys Lys 100 105 110
GGC GTT GAA GCG AAT AAC AAG ATC CAA GTG GAG CAT TTA AAC CCG CAT 442 Gly Val Glu Ala Asn Asn Lys He Gin Val Glu His Leu Asn Pro His 115 120 125 130
TAT AAG ATG AAC TTA AAC TCT AAA GCG AGC GTT AAA ATC ACG CCT TTA 490 Tyr Lys Met Asn Leu Asn Ser Lys Ala Ser Val Lys He Thr Pro Leu 135 140 145
GGG GGC TTG GGT GAG ATT GGG GGG AAC ATG ATG GTC ATT GAA ACC CCA 538 Gly Gly Leu Gly Glu He Gly Gly Asn Met Met Val He Glu Thr Pro 150 155 160
AAA AGC GCG ATC GTG ATT GAT GCG GGC ATG AGC TTC CCT AAA GAG GGG 586 Lys Ser Ala He Val He Asp Ala Gly Met Ser Phe Pro Lys Glu Gly 165 170 175
CTC TTT GGC GTG GAT ATT TTA ATC CCG GAT TTT TCC TAC TTG CAC CAA 634 Leu Phe Gly Val Asp He Leu He Pro Asp Phe Ser Tyr Leu His Gin 180 185 190
ATC AAG GAC AAA ATC GCT GGC ATT ATC ATC ACC CAT GCC CAT GAA GAT 682 He Lys Asp Lys He Ala Gly He He He Thr His Ala His Glu Asp 195 200 205 210
CAC ATA GGG GCC ACG CCT TAT TTG TTT AAA GAG CTG CAA TTC CCC CTT 730 His He Gly Ala Thr Pro Tyr Leu Phe Lys Glu Leu Gin Phe Pro Leu 215 220 225
TAT GGC ACG CCC TTG AGT TTG GGG CTG ATT GGG AGC AAG TTT GAT GAA 778 Tyr Gly Thr Pro Leu Ser Leu Gly Leu He Gly Ser Lys Phe Asp Glu 230 235 240
CAT GGT TTG AAA AAA TAC CGC TCG TAT TTT AAA ATC GTA GAA AAG CGC 826 His Gly Leu Lys Lys Tyr Arg Ser Tyr Phe Lys He Val Glu Lys Arg 245 250 255
TGT CCC ATT AGC GTG GGC GAA TTT ATC ATT GAA TGG ATC CAC ATC ACG 874 Cys Pro He Ser Val Gly Glu Phe He He Glu Trp He His He Thr 260 265 270
CAT TCT ATC ATT GAC AGC AGC GCT TTA GCG ATC CAA ACT AAA GCC GGA 922 His Ser He He Asp Ser Ser Ala Leu Ala He Gin Thr Lys Ala Gly 275 280 285 290
ACG ATC ATC CAC ACC GGC GAT TTT AAA ATC GAT CAC ACC CCG GTG GAT 970 Thr He He His Thr Gly Asp Phe Lys He Asp His Thr Pro Val Asp 295 300 305
AAT TTG CCC ACG GAT TTG TAT CGT TTA GCG CAC TAT GGC GAA AAG GGG 1018 Asn Leu Pro Thr Asp Leu Tyr Arg Leu Ala His Tyr Gly Glu Lys Gly 310 315 320
GTG ATG CTT CTT TTA AGC GAT TCC ACC AAC TCC CAT AAA TCC GGG ACT 1066 Val Met Leu Leu Leu Ser Asp Ser Thr Asn Ser His Lys Ser Gly Thr 325 330 335
ACG CCG AGT GAA AGC ACC ATA GCG CCG GCT TTT GAT ACC CTT TTT AAA 1114 Thr Pro Ser Glu Ser Thr He Ala Pro Ala Phe Asp Thr Leu Phe Lys 340 345 350
GAA GCG CAA GGG AGG GTG ATT ATG AGC ACC TTC TCT AGC AAT ATC CAC 1162 Glu Ala Gin Gly Arg Val He Met Ser Thr Phe Ser Ser Asn He His 355 360 365 370
CGG GTC TAT CAA GCC ATA CAA TAC GGC ATT AAA TAC AAC CGC AAG ATC 1210 Arg Val Tyr Gin Ala He Gin Tyr Gly He Lys Tyr Asn Arg Lys He 375 380 385
GCT GTG ATC GGG CGC TCT ATG GAA AAA AAC CTA GAC ATC GCT AGA GAA 1258 Ala Val He Gly Arg Ser Met Glu Lys Asn Leu Asp He Ala Arg Glu 390 395 400
TTG GGC TAT ATC CAT TTG CCT TAT CAA TCT TTT ATT GAA GCC AAT GAA 1306 Leu Gly Tyr He His Leu Pro Tyr Gin Ser Phe He Glu Ala Asn Glu 405 410 415
GTC GCC AAA TAC CCG GAC AAT GAA ATC TTA ATC GTA ACG ACC GGC TCA 1354 Val Ala Lys Tyr Pro Asp Asn Glu He Leu He Val Thr Thr Gly Ser 420 425 430
CAA GGC GAA ACC ATG AGC GCG CTT TAT CGC ATG GCG ACT GAT GAA CAC 1402 Gin Gly Glu Thr Met Ser Ala Leu Tyr Arg Met Ala Thr Asp Glu His 435 440 445 450
CGT CAT ATT TCT ATC AAA CCC AAC GAT TTA GTC ATC ATT TCC GCT AAA 1450 Arg His He Ser He Lys Pro Asn Asp Leu Val He He Ser Ala Lys 455 460 465
GCC ATT CCT GGC AAT GAA GCG AGC GTT TCA GCG GTG TTG AAT TTC TTG 1498 Ala He Pro Gly Asn Glu Ala Ser Val Ser Ala Val Leu Asn Phe Leu 470 475 480
ATC AAA AAA GAA GCT AAA GTG GCT TAT CAA GAA TTT GAC AAT ATC CAT 1546 He Lys Lys Glu Ala Lys Val Ala Tyr Gin Glu Phe Asp Asn He His 485 490 495
GTG AGC GGG CAT GCC GCC CAA GAA GAG CAA AAG CTC ATG TTA AGA CTC 1594 Val Ser Gly His Ala Ala Gin Glu Glu Gin Lys Leu Met Leu Arg Leu 500 505 510
ATT AAG CCT AAG TTT TTC TTA CCC GTG CAT GGG GAA TAT AAC CAT GTC 1642 He Lys Pro Lys Phe Phe Leu Pro Val His Gly Glu Tyr Asn His Val 515 520 525 530 GCG CGC CAC AAA CAA ACC GCT ATT TCT TGC GGA GTG CCT GAA AAA AAT 1690 Ala Arg His Lys Gin Thr Ala He Ser Cys Gly Val Pro Glu Lys Asn 535 540 545
ATC TAT TTA ATG GAG GAT GGC GAT CAG GTG GAG GTT GGC CCT GCG TTC 1738 He Tyr Leu Met Glu Asp Gly Asp Gin Val Glu Val Gly Pro Ala Phe 550 555 560
ATC AAA AAA GTC GGC ACG ATT AAA AGC GGG AAA AGC TAT GTG GAT AAC 1786 He Lys Lys Val Gly Thr He Lys Ser Gly Lys Ser Tyr Val Asp Asn 565 570 575
CAA AGC AAT TTG AGT ATT GAT ACA AGC ATC GTG CAA CAA AGA GAA GAA 1834 Gin Ser Asn Leu Ser He Asp Thr Ser He Val Gin Gin Arg Glu Glu 580 585 590
GTC GCT AGC GCC GGG GTG TTT GTG GCT ACG ATT TTT GTG AAT AAA AAC 1882 Val Ala Ser Ala Gly Val Phe Val Ala Thr He Phe Val Asn Lys Asn 595 600 605 610
AAG CAA GCG CTT TTA GAA AGC TCT CAA TTT TCC AGT TTA GGG CTT GTG 1930 Lys Gin Ala Leu Leu Glu Ser Ser Gin Phe Ser Ser Leu Gly Leu Val 615 620 625
GGT TTC AAA GAT GAA AAG CCT TTG ATC AAA GAA ATT CAA GGG GGC TTA 1978 Gly Phe Lys Asp Glu Lys Pro Leu He Lys Glu He Gin Gly Gly Leu 630 635 640
GAG GTG TTA TTG AAA TCC AGC AAC GCC GAA ATT TTG AAT AAC CCT AAA 2026 Glu Val Leu Leu Lys Ser Ser Asn Ala Glu He Leu Asn Asn Pro Lys 645 650 655
AAA TTA GAA GAT CAC ACT CGT AAT TTC ATC AGA AAA GCG CTC TTT AAA 2074 Lys Leu Glu Asp His Thr Arg Asn Phe He Arg Lys Ala Leu Phe Lys 660 665 670
AAG TTT AGA AAA TAC CCG GCT ATC ATT TGT CAT GCC CAT TCT TTT TGATT 2124 Lys Phe Arg Lys Tyr Pro Ala He He Cys His Ala His Ser Phe 675 680 685
GTAACGCTAT TGCTTCACAA GTTTTAAAAG ATGAAGCGAG CGCGC 2169
(2) INFORMATION FOR SEQ ID NO: 566:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 689 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:566: Met Thr Asp Asn Asn Gin Asn Asn Glu Asn His Glu Asn Ser Ser Glu
1 5 10 15
Asn Ser Lys Ala Asp Glu Met Arg Ala Gly Ala Phe Glu Arg Phe Thr
20 25 30
Asn Arg Lys Lys Arg Phe Arg Glu Asn Ala Gin Lys Asn Ala Glu Tyr
35 40 45
Ser Asn His Glu Ala Ser Ser His His Lys Lys Glu His Arg Pro Asn
50 55 60
Lys Lys Pro Asn Asn His His Lys Gin Lys His Ala Lys Thr Arg Asn 65 70 75 80
Tyr Ala Gin Glu Glu Leu Asp Ser Asn Lys Val Glu Gly Val Thr Glu
85 90 95
He Leu His Val Asn Glu Arg Gly Thr Leu Gly Phe His Lys Glu Leu
100 105 110
Lys Lys Gly Val Glu Ala Asn Asn Lys He Gin Val Glu His Leu Asn
115 120 125
Pro His Tyr Lys Met Asn Leu Asn Ser Lys Ala Ser Val Lys He Thr
130 135 140
Pro Leu Gly Gly Leu Gly Glu He Gly Gly Asn Met Met Val He Glu 145 150 155 160
Thr Pro Lys Ser Ala He Val He Asp Ala Gly Met Ser Phe Pro Lys
165 170 175
Glu Gly Leu Phe Gly Val Asp He Leu He Pro Asp Phe Ser Tyr Leu
180 185 190
His Gin He Lys Asp Lys He Ala Gly He He He Thr His Ala His
195 200 205
Glu Asp His He Gly Ala Thr Pro Tyr Leu Phe Lys Glu Leu Gin Phe
210 215 220
Pro Leu Tyr Gly Thr Pro Leu Ser Leu Gly Leu He Gly Ser Lys Phe 225 230 235 240
Asp Glu His Gly Leu Lys Lys Tyr Arg Ser Tyr Phe Lys He Val Glu
245 250 255
Lys Arg Cys Pro He Ser Val Gly Glu Phe He He Glu Trp He His
260 265 270
He Thr His Ser He He Asp Ser Ser Ala Leu Ala He Gin Thr Lys
275 280 285
Ala Gly Thr He He His Thr Gly Asp Phe Lys He Asp His Thr Pro
290 295 300
Val Asp Asn Leu Pro Thr Asp Leu Tyr Arg Leu Ala His Tyr Gly Glu 305 310 315 320
Lys Gly Val Met Leu Leu Leu Ser Asp Ser Thr Asn Ser His Lys Ser
325 330 335
Gly Thr Thr Pro Ser Glu Ser Thr He Ala Pro Ala Phe Asp Thr Leu
340 345 350
Phe Lys Glu Ala Gin Gly Arg Val He Met Ser Thr Phe Ser Ser Asn
355 360 365
He His Arg Val Tyr Gin Ala He Gin Tyr Gly He Lys Tyr Asn Arg
370 375 380
Lys He Ala Val He Gly Arg Ser Met Glu Lys Asn Leu Asp He Ala 385 390 395 400
Arg Glu Leu Gly Tyr He His Leu Pro Tyr Gin Ser Phe He Glu Ala
405 410 415
Asn Glu Val Ala Lys Tyr Pro Asp Asn Glu He Leu He Val Thr Thr
420 425 430
Gly Ser Gin Gly Glu Thr Met Ser Ala Leu Tyr Arg Met Ala Thr Asp 435 440 445
Glu His Arg His He Ser He Lys Pro Asn Asp Leu Val He He Ser
450 455 460
Ala Lys Ala He Pro Gly Asn Glu Ala Ser Val Ser Ala Val Leu Asn 465 470 475 480
Phe Leu He Lys Lys Glu Ala Lys Val Ala Tyr Gin Glu Phe Asp Asn
485 490 495
He His Val Ser Gly His Ala Ala Gin Glu Glu Gin Lys Leu Met Leu
500 505 510
Arg Leu He Lys Pro Lys Phe Phe Leu Pro Val His Gly Glu Tyr Asn
515 520 525
His Val Ala Arg His Lys Gin Thr Ala He Ser Cys Gly Val Pro Glu
530 535 540
Lys Asn He Tyr Leu Met Glu Asp Gly Asp Gin Val Glu Val Gly Pro 545 550 555 560
Ala Phe He Lys Lys Val Gly Thr He Lys Ser Gly Lys Ser Tyr Val
565 570 575
Asp Asn Gin Ser Asn Leu Ser He Asp Thr Ser He Val Gin Gin Arg
580 585 590
Glu Glu Val Ala Ser Ala Gly Val Phe Val Ala Thr He Phe Val Asn
595 600 605
Lys Asn Lys Gin Ala Leu Leu Glu Ser Ser Gin Phe Ser Ser Leu Gly
610 615 620
Leu Val Gly Phe Lys Asp Glu Lys Pro Leu He Lys Glu He Gin Gly 625 630 635 640
Gly Leu Glu Val Leu Leu Lys Ser Ser Asn Ala Glu He Leu Asn Asn
645 650 655
Pro Lys Lys Leu Glu Asp His Thr Arg Asn Phe He Arg Lys Ala Leu
660 665 670
Phe Lys Lys Phe Arg Lys Tyr Pro Ala He He Cys His Ala His Ser
675 680 685
Phe
(2) INFORMATION FOR SEQ ID NO: 567:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2770 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 46...2721 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567: GAAGACAGAG TCTTTGTACA TGAAAATAAA ACGGTGGTGT TTTGA ATG CTT TTA GAT 57 Met Leu Leu Asp 1
TTC AGC AAC CTC AAT GAA GAA CCC TTA AAA AAC CAA ATC AAA GCC GAG 105 Phe Ser Asn Leu Asn Glu Glu Pro Leu Lys Asn Gin He Lys Ala Glu 5 10 15 20
TTT TTT AAG GAT AAG AAA TTC CTT TAT AGC GGG GAT AAA ATA GAT TTC 153 Phe Phe Lys Asp Lys Lys Phe Leu Tyr Ser Gly Asp Lys He Asp Phe 25 30 35
ATG CTA AGC TAT AAG CAT TCT AAC GCC ACC TTA CCC ATT TTA TGG GGC 201 Met Leu Ser Tyr Lys His Ser Asn Ala Thr Leu Pro He Leu Trp Gly 40 45 50
GAA GCT AAA AGG GGC GAT TTT GAT GAT TTG GAC AAA GCT TTC ACG CAA 249 Glu Ala Lys Arg Gly Asp Phe Asp Asp Leu Asp Lys Ala Phe Thr Gin 55 60 65
CTT CTT TTA ACC ATA GGC AAG CAC AGG CTT TAT ACC CAC CAC ACA CCA 297 Leu Leu Leu Thr He Gly Lys His Arg Leu Tyr Thr His His Thr Pro 70 75 80
CCT TAT TTG TGC GCT TTT AAC GCT TTT AAA ATG GAA TTT ATC GCC TTT 345 Pro Tyr Leu Cys Ala Phe Asn Ala Phe Lys Met Glu Phe He Ala Phe 85 90 95 100
GAT GAC ACG ATC ACA AGC TTT TTT TAT AAA AGC GAT ATA GAT TTT TCT 393 Asp Asp Thr He Thr Ser Phe Phe Tyr Lys Ser Asp He Asp Phe Ser 105 110 115
ATC ACC CCA AGC AAC CAC AAC ACA GAA GGT TTT AAA CAT GCT TTA GAC 441 He Thr Pro Ser Asn His Asn Thr Glu Gly Phe Lys His Ala Leu Asp 120 125 130
GCG TTT AAA GCC ATG AGC AAA TCC CAT AAA TTC GTT TTT GAC TTT AAA 489 Ala Phe Lys Ala Met Ser Lys Ser His Lys Phe Val Phe Asp Phe Lys 135 140 145
ACC CAA AGC CAA GAA TGC AAA GAA TTT ATC AAA AAC CGT TTA AAT TCT 537 Thr Gin Ser Gin Glu Cys Lys Glu Phe He Lys Asn Arg Leu Asn Ser 150 155 160
AGC CAT TTA CTC AGC AAA ATC CAA ATT GAC AAA AAC AAT TTC TTT ACG 585 Ser His Leu Leu Ser Lys He Gin He Asp Lys Asn Asn Phe Phe Thr 165 170 175 180
ATC TAT CAA AAG TGG CTT GAA ATT GTC AAA CCC ACC ATT GAC ATA AAT 633 He Tyr Gin Lys Trp Leu Glu He Val Lys Pro Thr He Asp He Asn 185 190 195
TGG GAG GTG GCT AAA ACT AAA GAC ATT TTA GAC GCA GAC TAT TAT TTA 681 Trp Glu Val Ala Lys Thr Lys Asp He Leu Asp Ala Asp Tyr Tyr Leu 200 205 210 GCG GAT TTG CTT AGC GAT GGC GAT AAA ACC ATT ATT GAG AAA TTG CAC 729 Ala Asp Leu Leu Ser Asp Gly Asp Lys Thr He He Glu Lys Leu His 215 220 225
ACG ATT TTA AGA TCG AGC CAT TAT AAA TTG AAT AGG GGT GTG AAT GAA 777 Thr He Leu Arg Ser Ser His Tyr Lys Leu Asn Arg Gly Val Asn Glu 230 235 240
TTA GGC AAA ATG GAT TTT ATG GAA GTT GGT TTC ACA GAC AGC CAA CAA 825 Leu Gly Lys Met Asp Phe Met Glu Val Gly Phe Thr Asp Ser Gin Gin 245 250 255 260
GCC CAT CAA GAA TTT TGG AGC GTT TAT GAA CGA CCG CCT AAA AGA GAA 873 Ala His Gin Glu Phe Trp Ser Val Tyr Glu Arg Pro Pro Lys Arg Glu 265 270 275
TTT CAA GCC TCT ATT TTA GAG CGG CGC GAC TTG TTA GTA CCA AGC GAT 921 Phe Gin Ala Ser He Leu Glu Arg Arg Asp Leu Leu Val Pro Ser Asp 280 285 290
GTG AGA GAA AGG AAA GGG GCG TTT TTC ACC CCT AAA ATC TGG GTA GAA 969 Val Arg Glu Arg Lys Gly Ala Phe Phe Thr Pro Lys He Trp Val Glu 295 300 305
AAG AGT CAA GAA TAT TTA GCT AAA GCT TTG GGG CAA GAT TAT CAA GAG 1017 Lys Ser Gin Glu Tyr Leu Ala Lys Ala Leu Gly Gin Asp Tyr Gin Glu 310 315 320
GAT TGT ATC ATT TGG GAT TGC GCT GGG GGG ACT GGG AAT TTG CTT CGA 1065 Asp Cys He He Trp Asp Cys Ala Gly Gly Thr Gly Asn Leu Leu Arg 325 330 335 340
GGT TTA TTG AAT AAG GCT AAT TTG TAT CTA TCC ACT TTA GAT CAT AAC 1113 Gly Leu Leu Asn Lys Ala Asn Leu Tyr Leu Ser Thr Leu Asp His Asn 345 350 355
GAT GTG GCA ATC GTT AAA GAT CTG GCT GCA AAA AAC CAC TTA AAA TTA 1161 Asp Val Ala He Val Lys Asp Leu Ala Ala Lys Asn His Leu Lys Leu 360 365 370
CTG GAA AAT CAT GTT TTC CAA TTT GAC TTT TTA AAC GAT GAT TTT TTC 1209 Leu Glu Asn His Val Phe Gin Phe Asp Phe Leu Asn Asp Asp Phe Phe 375 380 385
AGC GAT AAA ACG CCA AAA AGC TTG CAA GAA ATC TTA AAA GAC AAA GAG 1257 Ser Asp Lys Thr Pro Lys Ser Leu Gin Glu He Leu Lys Asp Lys Glu 390 395 400
AAA CGA AAA AAG CTC ATC ATT TAC ATC AAC CCG CCC TAT GCA GAA GCA 1305 Lys Arg Lys Lys Leu He He Tyr He Asn Pro Pro Tyr Ala Glu Ala 405 410 415 420
GGT AAT AAA TCT AAG ATG AGT GGC ACA GGC GAA CAT AAA GCC AAA GTG 1353 Gly Asn Lys Ser Lys Met Ser Gly Thr Gly Glu His Lys Ala Lys Val 425 430 435 GCA CGA GAC AAT CTC ATC TGT GAA AAA TAC AAA AAT GAA TTA GGC AAG 1401 Ala Arg Asp Asn Leu He Cys Glu Lys Tyr Lys Asn Glu Leu Gly Lys 440 445 450
GCT AAT AAT GAA GTT TTT GCA CAA TTT TTC ATG CGT ATT TAC AAA GAA 1449 Ala Asn Asn Glu Val Phe Ala Gin Phe Phe Met Arg He Tyr Lys Glu 455 460 465
TTA AAC GGT GTT ATC CTT GCA AGT TTT TCA ACT TTG AAA AAC TTG CAA 1497 Leu Asn Gly Val He Leu Ala Ser Phe Ser Thr Leu Lys Asn Leu Gin 470 475 480
GGA TCT AAT TTT AAA AAA TTC AGA GAA ATC TTT AAA GCT AAA TTT TTA 1545 Gly Ser Asn Phe Lys Lys Phe Arg Glu He Phe Lys Ala Lys Phe Leu 485 490 495 500
GAG GGG TTT ATG GTG CCA GCA GAC ACT TTT GAT AAT GTT AGG GGG CAA 1593 Glu Gly Phe Met Val Pro Ala Asp Thr Phe Asp Asn Val Arg Gly Gin 505 510 515
TTT CCT ATC GGC TTT TTA GTG TGG GAT ACA AGC TCT ATT CTT CCT AAA 1641 Phe Pro He Gly Phe Leu Val Trp Asp Thr Ser Ser He Leu Pro Lys 520 525 530
GAA AAC CCC CTA AAT TTA GGG GGC AAC TCT AAA GAA GAG AAA CAA AAC 1689 Glu Asn Pro Leu Asn Leu Gly Gly Asn Ser Lys Glu Glu Lys Gin Asn 535 540 545
TCC AAT TTA ATC TTA GAC CAA GAC AAT TTG AAA GAT AAT CCC TTG AAA 1737 Ser Asn Leu He Leu Asp Gin Asp Asn Leu Lys Asp Asn Pro Leu Lys 550 555 560
GAG CGT TTT TGC CTT TTA GAC ATA AAC GCT CCT AAT AGG AAG ATG TGT 1785 Glu Arg Phe Cys Leu Leu Asp He Asn Ala Pro Asn Arg Lys Met Cys 565 570 575 580
TCC CAA AGC AGA ACA AGA ACT AAG GGG ACA CAA AAG CAT TCT ACA GCA 1833 Ser Gin Ser Arg Thr Arg Thr Lys Gly Thr Gin Lys His Ser Thr Ala 585 590 595
GCC CCC TTT GAA ACC CCT TTA CAC ACT GTT AGT TTA GAA ATA TTT GAT 1881 Ala Pro Phe Glu Thr Pro Leu His Thr Val Ser Leu Glu He Phe Asp 600 605 610
AGT TTC GGC GGA TTT TTA GGC AGT AAA AAA ATA TAC ACT CAC ACA ATA 1929 Ser Phe Gly Gly Phe Leu Gly Ser Lys Lys He Tyr Thr His Thr He 615 620 625
GAC AAA ATG CTT ACT TTA GCG GAT TAT TTA CAA AAG TTT CAG CCA ACA 1977 Asp Lys Met Leu Thr Leu Ala Asp Tyr Leu Gin Lys Phe Gin Pro Thr 630 635 640
AAA AGA GAC ACT ATT TTT GGC TAT TTA GAT CCT GGT CGC AAT AGT TTT 2025 Lys Arg Asp Thr He Phe Gly Tyr Leu Asp Pro Gly Arg Asn Ser Phe 645 650 655 660 CAA CAT CAA AAT CTA ATT CAT ATT AGC ATT ATT GAC AAA TCA AAA CAA 2073 Gin His Gin Asn Leu He His He Ser He He Asp Lys Ser Lys Gin 665 670 675
TCG CAT GTA AAA TAT TTT CCA ATC ATT GCA ACT ACA ATT TTG TTG GTA 2121 Ser His Val Lys Tyr Phe Pro He He Ala Thr Thr He Leu Leu Val 680 685 690
TCT GTA TTT TTC TCC ATC CGC CAT TGC ATC AAA GCC ACA TGG CAA AAC 2169 Ser Val Phe Phe Ser He Arg His Cys He Lys Ala Thr Trp Gin Asn 695 700 705
GAT AGG GAT CAA TTT TAC GCC CCC TAT GAC GAT GCG TTC CAA GAC GAC 2217 Asp Arg Asp Gin Phe Tyr Ala Pro Tyr Asp Asp Ala Phe Gin Asp Asp 710 715 720
AGC GAG TTT AAA AAC AAT TGT TTG ATT TTC ATG CTT TTT CAC ACC CAG 2265 Ser Glu Phe Lys Asn Asn Cys Leu He Phe Met Leu Phe His Thr Gin 725 730 735 740
AAC CGC ATC ACT ACC GCT CAA GGG ACT AAC CAT TTT ATC CCC TTT AGC 2313 Asn Arg He Thr Thr Ala Gin Gly Thr Asn His Phe He Pro Phe Ser 745 750 755
GAA ACT GAA GTC AAT GCC AAA GAA AGA TAT TCT AGC CAC GCT CTA TTA 2361 Glu Thr Glu Val Asn Ala Lys Glu Arg Tyr Ser Ser His Ala Leu Leu 760 765 770
GAG TTT TTA AAA GGC GAA ATC AAA GAA CTT AAA GAG AAC GAT AGC CTC 2409 Glu Phe Leu Lys Gly Glu He Lys Glu Leu Lys Glu Asn Asp Ser Leu 775 780 785
TTT TTA AGT GCC AAA AAA GAA AAC AAG CCC CTG AAA TTC AGC CCG AGC 2457 Phe Leu Ser Ala Lys Lys Glu Asn Lys Pro Leu Lys Phe Ser Pro Ser 790 795 800
GCT TCA AAG GTG TTT GAC GCT AGC AGA GAG GTT TAT CGC TAT TAC CAC 2505 Ala Ser Lys Val Phe Asp Ala Ser Arg Glu Val Tyr Arg Tyr Tyr His 805 810 815 820
ACA CAA GAT TTC ACA AAC CGC CCC TAT AAC GCT AAC GCA AGC CTT TAT 2553 Thr Gin Asp Phe Thr Asn Arg Pro Tyr Asn Ala Asn Ala Ser Leu Tyr 825 830 835
GAC ATC AAA GAA TTT TTT CAA GGC CGT AAC AAG CAA GGC AAA TTA AAT 2601 Asp He Lys Glu Phe Phe Gin Gly Arg Asn Lys Gin Gly Lys Leu Asn 840 845 850
TTA CCC GCT AAA GCT AAA GAT GAA TAT TAC AAA CAG CTT TAC GCT AAT 2649 Leu Pro Ala Lys Ala Lys Asp Glu Tyr Tyr Lys Gin Leu Tyr Ala Asn 855 860 865
TTG CAA GAC GCC CTA AAA GAT CTC GCC AAA GAA ATA CAG CCT AAA GTC 2697 Leu Gin Asp Ala Leu Lys Asp Leu Ala Lys Glu He Gin Pro Lys Val 870 875 880 TAT GAA TAC GGG TTT TTA AGG GAG TGATTTTTAA GACAAATAAT CAAAAAGCTA 2751 Tyr Glu Tyr Gly Phe Leu Arg Glu 885 890
GAGCAAGCGG TATTTTTTA 2770
(2) INFORMATION FOR SEQ ID NO: 568:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 892 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:568:
Met Leu Leu Asp Phe Ser Asn Leu Asn Glu Glu Pro Leu Lys Asn Gin
1 5 10 15
He Lys Ala Glu Phe Phe Lys Asp Lys Lys Phe Leu Tyr Ser Gly Asp
20 25 30
Lys He Asp Phe Met Leu Ser Tyr Lys His Ser Asn Ala Thr Leu Pro
35 40 45
He Leu Trp Gly Glu Ala Lys Arg Gly Asp Phe Asp Asp Leu Asp Lys
50 55 60
Ala Phe Thr Gin Leu Leu Leu Thr He Gly Lys His Arg Leu Tyr Thr 65 70 75 80
His His Thr Pro Pro Tyr Leu Cys Ala Phe Asn Ala Phe Lys Met Glu
85 90 95
Phe He Ala Phe Asp Asp Thr He Thr Ser Phe Phe Tyr Lys Ser Asp
100 105 110
He Asp Phe Ser He Thr Pro Ser Asn His Asn Thr Glu Gly Phe Lys
115 120 125
His Ala Leu Asp Ala Phe Lys Ala Met Ser Lys Ser His Lys Phe Val
130 135 140
Phe Asp Phe Lys Thr Gin Ser Gin Glu Cys Lys Glu Phe He Lys Asn 145 150 155 160
Arg Leu Asn Ser Ser His Leu Leu Ser Lys He Gin He Asp Lys Asn
165 170 175
Asn Phe Phe Thr He Tyr Gin Lys Trp Leu Glu He Val Lys Pro Thr
180 185 190
He Asp He Asn Trp Glu Val Ala Lys Thr Lys Asp He Leu Asp Ala
195 200 205
Asp Tyr Tyr Leu Ala Asp Leu Leu Ser Asp Gly Asp Lys Thr He He
210 215 220
Glu Lys Leu His Thr He Leu Arg Ser Ser His Tyr Lys Leu Asn Arg 225 230 235 240
Gly Val Asn Glu Leu Gly Lys Met Asp Phe Met Glu Val Gly Phe Thr
245 250 255
Asp Ser Gin Gin Ala His Gin Glu Phe Trp Ser Val Tyr Glu Arg Pro
260 265 270
Pro Lys Arg Glu Phe Gin Ala Ser He Leu Glu Arg Arg Asp Leu Leu 275 280 285 Val Pro Ser Asp Val Arg Glu Arg Lys Gly Ala Phe Phe Thr Pro Lys
290 295 300
He Trp Val Glu Lys Ser Gin Glu Tyr Leu Ala Lys Ala Leu Gly Gin 305 310 315 320
Asp Tyr Gin Glu Asp Cys He He Trp Asp Cys Ala Gly Gly Thr Gly
325 330 335
Asn Leu Leu Arg Gly Leu Leu Asn Lys Ala Asn Leu Tyr Leu Ser Thr
340 345 350
Leu Asp His Asn Asp Val Ala He Val Lys Asp Leu Ala Ala Lys Asn
355 360 365
His Leu Lys Leu Leu Glu Asn His Val Phe Gin Phe Asp Phe Leu Asn
370 375 380
Asp Asp Phe Phe Ser Asp Lys Thr Pro Lys Ser Leu Gin Glu He Leu 385 390 395 400
Lys Asp Lys Glu Lys Arg Lys Lys Leu He He Tyr He Asn Pro Pro
405 410 415
Tyr Ala Glu Ala Gly Asn Lys Ser Lys Met Ser Gly Thr Gly Glu His
420 425 430
Lys Ala Lys Val Ala Arg Asp Asn Leu He Cys Glu Lys Tyr Lys Asn
435 440 445
Glu Leu Gly Lys Ala Asn Asn Glu Val Phe Ala Gin Phe Phe Met Arg
450 455 460
He Tyr Lys Glu Leu Asn Gly Val He Leu Ala Ser Phe Ser Thr Leu 465 470 475 480
Lys Asn Leu Gin Gly Ser Asn Phe Lys Lys Phe Arg Glu He Phe Lys
485 490 495
Ala Lys Phe Leu Glu Gly Phe Met Val Pro Ala Asp Thr Phe Asp Asn
500 505 510
Val Arg Gly Gin Phe Pro He Gly Phe Leu Val Trp Asp Thr Ser Ser
515 520 525
He Leu Pro Lys Glu Asn Pro Leu Asn Leu Gly Gly Asn Ser Lys Glu
530 535 540
Glu Lys Gin Asn Ser Asn Leu He Leu Asp Gin Asp Asn Leu Lys Asp 545 550 555 560
Asn Pro Leu Lys Glu Arg Phe Cys Leu Leu Asp He Asn Ala Pro Asn
565 570 575
Arg Lys Met Cys Ser Gin Ser Arg Thr Arg Thr Lys Gly Thr Gin Lys
580 585 590
His Ser Thr Ala Ala Pro Phe Glu Thr Pro Leu His Thr Val Ser Leu
595 600 605
Glu He Phe Asp Ser Phe Gly Gly Phe Leu Gly Ser Lys Lys He Tyr
610 615 620
Thr His Thr He Asp Lys Met Leu Thr Leu Ala Asp Tyr Leu Gin Lys 625 630 635 640
Phe Gin Pro Thr Lys Arg Asp Thr He Phe Gly Tyr Leu Asp Pro Gly
645 650 655
Arg Asn Ser Phe Gin His Gin Asn Leu He His He Ser He He Asp
660 665 670
Lys Ser Lys Gin Ser His Val Lys Tyr Phe Pro He He Ala Thr Thr
675 680 685
He Leu Leu Val Ser Val Phe Phe Ser He Arg His Cys He Lys Ala
690 695 700
Thr Trp Gin Asn Asp Arg Asp Gin Phe Tyr Ala Pro Tyr Asp Asp Ala 705 710 715 720
Phe Gin Asp Asp Ser Glu Phe Lys Asn Asn Cys Leu He Phe Met Leu 725 730 735
Phe His Thr Gin Asn Arg He Thr Thr Ala Gin Gly Thr Asn His Phe
740 745 750
He Pro Phe Ser Glu Thr Glu Val Asn Ala Lys Glu Arg Tyr Ser Ser
755 760 765
His Ala Leu Leu Glu Phe Leu Lys Gly Glu He Lys Glu Leu Lys Glu
770 775 780
Asn Asp Ser Leu Phe Leu Ser Ala Lys Lys Glu Asn Lys Pro Leu Lys 785 790 795 800
Phe Ser Pro Ser Ala Ser Lys Val Phe Asp Ala Ser Arg Glu Val Tyr
805 810 815
Arg Tyr Tyr His Thr Gin Asp Phe Thr Asn Arg Pro Tyr Asn Ala Asn
820 825 830
Ala Ser Leu Tyr Asp He Lys Glu Phe Phe Gin Gly Arg Asn Lys Gin
835 840 845
Gly Lys Leu Asn Leu Pro Ala Lys Ala Lys Asp Glu Tyr Tyr Lys Gin
850 855 860
Leu Tyr Ala Asn Leu Gin Asp Ala Leu Lys Asp Leu Ala Lys Glu He 865 870 875 880
Gin Pro Lys Val Tyr Glu Tyr Gly Phe Leu Arg Glu 885 890
(2) INFORMATION FOR SEQ ID NO: 569:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 996 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 70...948 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569:
AAAAGCCACG CAATCAGCCA CAAACATCAT CACACAAGCC TTTGGCTATC AATTATTAAT 60 GAGATAAAG ATG TTA GAA TTT ATT TTA AAA ATT CAA GCT AGA GAC TCT AAA 111 Met Leu Glu Phe He Leu Lys He Gin Ala Arg Asp Ser Lys 1 5 10
GGC TTG GTG AGC ACG ATT AGC ACC ACT ATC GCT AAC AAG GGC TAT AAC 159 Gly Leu Val Ser Thr He Ser Thr Thr He Ala Asn Lys Gly Tyr Asn 15 20 25 30
ATC GTC AAA AAC GAT GAA TTT GTT GAT CCC TTA AAA CAG CGT TTT TTC 207 He Val Lys Asn Asp Glu Phe Val Asp Pro Leu Lys Gin Arg Phe Phe 35 40 45 ATG CGG TTA AAA ATC CAA AAA GAA ATC AAG CCC TTG AAT ACT GAA ATT 255 Met Arg Leu Lys He Gin Lys Glu He Lys Pro Leu Asn Thr Glu He 50 55 60
AAA GAG CAA GAA GAG CAA TCC TTA AAG ACC GCT CTT TTT AAA GCC CTA 303 Lys Glu Gin Glu Glu Gin Ser Leu Lys Thr Ala Leu Phe Lys Ala Leu 65 70 75
GAA AAC TTT AAC GAG TTA TTG ATT GAA GTC ATT TTA ACG CAT AAA AAA 351 Glu Asn Phe Asn Glu Leu Leu He Glu Val He Leu Thr His Lys Lys 80 85 90
AAC ATC ATT CTG CTC GCT ACT AAA GAG AGC CAT TGC TTA GGG GAT TTG 399 Asn He He Leu Leu Ala Thr Lys Glu Ser His Cys Leu Gly Asp Leu 95 100 105 110
CTT TTA AGG GTG TAT GGG GGG GAA TTG AAC GCT CAA ATT TTA GGC GTT 447 Leu Leu Arg Val Tyr Gly Gly Glu Leu Asn Ala Gin He Leu Gly Val 115 120 125
ATT TCC AAC CAC GAG ATT TTA CGC CCT TTA GTG GAA AAA TTT GAC ATC 495 He Ser Asn His Glu He Leu Arg Pro Leu Val Glu Lys Phe Asp He 130 135 140
CCT TAT TTT TAT GCG CCT TGC GAC AAT CAA GTT TTG CAT GAA AAA GAA 543 Pro Tyr Phe Tyr Ala Pro Cys Asp Asn Gin Val Leu His Glu Lys Glu 145 150 155
GTT TTA GAA ATC ATT AAA AAC CTG GAA TTA AAG CAC AAA GTG AGT GCA 591 Val Leu Glu He He Lys Asn Leu Glu Leu Lys His Lys Val Ser Ala 160 165 170
GAC TTG CTC GTT TTA GCC AAA TAC ATG CGC ATT TTA AGC CAT GAT TTT 639 Asp Leu Leu Val Leu Ala Lys Tyr Met Arg He Leu Ser His Asp Phe 175 180 185 190
ACG AAG CGC TAT GAA AAC CAG ATC TTA AAT ATC CAT CAT AGT TTC TTG 687 Thr Lys Arg Tyr Glu Asn Gin He Leu Asn He His His Ser Phe Leu 195 200 205
CCC GCA TTC ATT GGG GCT AAT CCT TAC CAG CAA GCG TTT GAA AGG GGC 735 Pro Ala Phe He Gly Ala Asn Pro Tyr Gin Gin Ala Phe Glu Arg Gly 210 215 220
GTG AAA GTC ATC GGG GCC ACG GCG CAT TTT GTG AAT GAA AGC CTT GAT 783 Val Lys Val He Gly Ala Thr Ala His Phe Val Asn Glu Ser Leu Asp 225 230 235
GCT GGG CCG ATT ATC ATA CAA GAC ACT CTG CCC ATT AAC CAC AAT TAC 831 Ala Gly Pro He He He Gin Asp Thr Leu Pro He Asn His Asn Tyr 240 245 250
AGC GTG GAA AAA ATG CGC CTA GCG GGT AAG GAT ATA GAA AAA CTG GTT 879 Ser Val Glu Lys Met Arg Leu Ala Gly Lys Asp He Glu Lys Leu Val 255 260 265 270 TTA GCT AGG GCT TTA AAA CTC GTT TTA GAA GAC AGA GTC TTT GTA CAT 927 Leu Ala Arg Ala Leu Lys Leu Val Leu Glu Asp Arg Val Phe Val His 275 280 285
GAA AAT AAA ACG GTG GTG TTT TGAATGCTTT TAGATTTCAG CAACCTCAAT GAAG 982 Glu Asn Lys Thr Val Val Phe 290
AACCCTTAAA AAAC 996
(2) INFORMATION FOR SEQ ID NO: 570:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 293 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570:
Met Leu Glu Phe He Leu Lys He Gin Ala Arg Asp Ser Lys Gly Leu
1 5 10 15
Val Ser Thr He Ser Thr Thr He Ala Asn Lys Gly Tyr Asn He Val
20 25 30
Lys Asn Asp Glu Phe Val Asp Pro Leu Lys Gin Arg Phe Phe Met Arg
35 40 45
Leu Lys He Gin Lys Glu He Lys Pro Leu Asn Thr Glu He Lys Glu
50 55 60
Gin Glu Glu Gin Ser Leu Lys Thr Ala Leu Phe Lys Ala Leu Glu Asn 65 70 75 80
Phe Asn Glu Leu Leu He Glu Val He Leu Thr His Lys Lys Asn He
85 90 95
He Leu Leu Ala Thr Lys Glu Ser His Cys Leu Gly Asp Leu Leu Leu
100 105 110
Arg Val Tyr Gly Gly Glu Leu Asn Ala Gin He Leu Gly Val He Ser
115 120 125
Asn His Glu He Leu Arg Pro Leu Val Glu Lys Phe Asp He Pro Tyr
130 135 140
Phe Tyr Ala Pro Cys Asp Asn Gin Val Leu His Glu Lys Glu Val Leu 145 150 155 160
Glu He He Lys Asn Leu Glu Leu Lys His Lys Val Ser Ala Asp Leu
165 170 175
Leu Val Leu Ala Lys Tyr Met Arg He Leu Ser His Asp Phe Thr Lys
180 185 190
Arg Tyr Glu Asn Gin He Leu Asn He His His Ser Phe Leu Pro Ala
195 200 205
Phe He Gly Ala Asn Pro Tyr Gin Gin Ala Phe Glu Arg Gly Val Lys
210 215 220
Val He Gly Ala Thr Ala His Phe Val Asn Glu Ser Leu Asp Ala Gly 225 230 235 240
Pro He He He Gin Asp Thr Leu Pro He Asn His Asn Tyr Ser Val 245 250 255 Glu Lys Met Arg Leu Ala Gly Lys Asp He Glu Lys Leu Val Leu Ala
260 265 270
Arg Ala Leu Lys Leu Val Leu Glu Asp Arg Val Phe Val His Glu Asn
275 280 285
Lys Thr Val Val Phe 290
(2) INFORMATION FOR SEQ ID NO: 571:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 882 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 48...824 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571:
CGATAGTGTT GTAGGATACT TTTGAAATTT AAGCGGTAAG TTGGATA ATG GCG TTT 56
Met Ala Phe 1
TGG CAT AAA AGA TTA GCG GTT GGT TGT TGT ATC GTT TTA TTT TCA TGC 104 Trp His Lys Arg Leu Ala Val Gly Cys Cys He Val Leu Phe Ser Cys 5 10 15
ATG ATG AAC GCT AAT AGC ATT CAA ATC GTT AGA GAC GAT CCG CCC CTT 152 Met Met Asn Ala Asn Ser He Gin He Val Arg Asp Asp Pro Pro Leu 20 25 30 35
GAT CCA ACG CTC CCT GCA TGG GTT TAT TCT GTT GCG TTA TTA AAA GTG 200 Asp Pro Thr Leu Pro Ala Trp Val Tyr Ser Val Ala Leu Leu Lys Val 40 45 50
TAT TTT AGC GAT GGG ACT TAT AAA GAA GGC TAT GCG ACT TTG CTC AAA 248 Tyr Phe Ser Asp Gly Thr Tyr Lys Glu Gly Tyr Ala Thr Leu Leu Lys 55 60 65
AAC GGG CGT TAT ATC GCT TCT TCT GAA ACG CTT TAT TCT AAC GGC TTA 296 Asn Gly Arg Tyr He Ala Ser Ser Glu Thr Leu Tyr Ser Asn Gly Leu 70 75 80
TAC CCT AAA ACG ATT TTA GCC AAA ATG CAA GAC AGC AGC GCT AAA GAG 344 Tyr Pro Lys Thr He Leu Ala Lys Met Gin Asp Ser Ser Ala Lys Glu 85 90 95 CTG ATT TGT ATA GCT AGC CTA CGC CTT GAA GCG ATG GAT AGG AAT CAA 392 Leu He Cys He Ala Ser Leu Arg Leu Glu Ala Met Asp Arg Asn Gin 100 105 110 115
GGG CTT TCG CTT TTA AAA ACC GCC GAT TTT AGA GAC GAT TAT TGC CAT 440 Gly Leu Ser Leu Leu Lys Thr Ala Asp Phe Arg Asp Asp Tyr Cys His 120 125 130
AAA AGA GAA GAG AGC TAT TAT CAT GCA AGG ATT TAC ACA AAA TAC GCT 488 Lys Arg Glu Glu Ser Tyr Tyr His Ala Arg He Tyr Thr Lys Tyr Ala 135 140 145
CAA ACT TTT CAT TCA AAT CCC TAT ACC AAT CAA AAA ACA CCC AAT TCT 536 Gin Thr Phe His Ser Asn Pro Tyr Thr Asn Gin Lys Thr Pro Asn Ser 150 155 160
GAT CTC TAC TAC CCA GCG TTA AAT GAG GGG AAT TCT TTT TCT ATA CAG 584 Asp Leu Tyr Tyr Pro Ala Leu Asn Glu Gly Asn Ser Phe Ser He Gin 165 170 175
ATA ATG GGC ATT TCT GTG GCT GAA CTT TTG AAA TCT AAA AAA TTC CTT 632 He Met Gly He Ser Val Ala Glu Leu Leu Lys Ser Lys Lys Phe Leu 180 185 190 195
TCG CTT GAT GTT TCT TTT AAA AAG GGG AGC GTG TTG TGG GGA GGG AGG 680 Ser Leu Asp Val Ser Phe Lys Lys Gly Ser Val Leu Trp Gly Gly Arg 200 205 210
CCT TAT TTT AGC GAA GTG GGG GAG TTT ATG GGG ATG GCT AGC AGC ACT 728 Pro Tyr Phe Ser Glu Val Gly Glu Phe Met Gly Met Ala Ser Ser Thr 215 220 225
TTA GAA AAC CAA GAA AGT CTG GTG ATT ATC CCT AAA GAA AAG ATC GTG 776 Leu Glu Asn Gin Glu Ser Leu Val He He Pro Lys Glu Lys He Val 230 235 240
CAA TTT TTA AAC GCT CTA AAA AAT CAA AAT ATT TTC CCA AAC ATT CCC T 825 Gin Phe Leu Asn Ala Leu Lys Asn Gin Asn He Phe Pro Asn He Pro 245 250 255
AAGTTTAAGC AAACCTTAAG CTTTCTATGA CTATACTTTC ATTTCCTTGT TTCAGCA 882
(2) INFORMATION FOR SEQ ID NO: 572:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 259 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:572: Met Ala Phe Trp His Lys Arg Leu Ala Val Gly Cys Cys He Val Leu
1 5 10 15
Phe Ser Cys Met Met Asn Ala Asn Ser He Gin He Val Arg Asp Asp
20 25 30
Pro Pro Leu Asp Pro Thr Leu Pro Ala Trp Val Tyr Ser Val Ala Leu
35 40 45
Leu Lys Val Tyr Phe Ser Asp Gly Thr Tyr Lys Glu Gly Tyr Ala Thr
50 55 60
Leu Leu Lys Asn Gly Arg Tyr He Ala Ser Ser Glu Thr Leu Tyr Ser 65 70 75 80
Asn Gly Leu Tyr Pro Lys Thr He Leu Ala Lys Met Gin Asp Ser Ser
85 90 95
Ala Lys Glu Leu He Cys He Ala Ser Leu Arg Leu Glu Ala Met Asp
100 105 110
Arg Asn Gin Gly Leu Ser Leu Leu Lys Thr Ala Asp Phe Arg Asp Asp
115 120 125
Tyr Cys His Lys Arg Glu Glu Ser Tyr Tyr His Ala Arg He Tyr Thr
130 135 140
Lys Tyr Ala Gin Thr Phe His Ser Asn Pro Tyr Thr Asn Gin Lys Thr 145 150 155 160
Pro Asn Ser Asp Leu Tyr Tyr Pro Ala Leu Asn Glu Gly Asn Ser Phe
165 170 175
Ser He Gin He Met Gly He Ser Val Ala Glu Leu Leu Lys Ser Lys
180 185 190
Lys Phe Leu Ser Leu Asp Val Ser Phe Lys Lys Gly Ser Val Leu Trp
195 200 205
Gly Gly Arg Pro Tyr Phe Ser Glu Val Gly Glu Phe Met Gly Met Ala
210 215 220
Ser Ser Thr Leu Glu Asn Gin Glu Ser Leu Val He He Pro Lys Glu 225 230 235 240
Lys He Val Gin Phe Leu Asn Ala Leu Lys Asn Gin Asn He Phe Pro
245 250 255
Asn He Pro
(2) INFORMATION FOR SEQ ID NO:573:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 669 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 88...603 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:573: AAAATAAGGA GGAATTGTTT GATTTTACGA TTGGCTGGAG CAAGCGTTTT AACGGCTTGT 60 GTCTTTTCGG GGTGTTTTTT TTTAAAA ATG TTT GAT AAA AAA CTT TCT AGT AAC 114
Met Phe Asp Lys Lys Leu Ser Ser Asn 1 5
GAT TGG CAT ATC CAA AAA GTG GAA ATG AAC CAT CAA GTC TAT GAC ATT 162 Asp Trp His He Gin Lys Val Glu Met Asn His Gin Val Tyr Asp He 10 15 20 25
GAA ACC ATG CTC GCT GAT AGC GCT TTT AGA GAG CAT GAA GAA GAG CAA 210 Glu Thr Met Leu Ala Asp Ser Ala Phe Arg Glu His Glu Glu Glu Gin 30 35 40
GAT TCC TCT CTA AAT ACC GCT TTG CCT GAA GAT AAA ACA GCG ATT GAA 258 Asp Ser Ser Leu Asn Thr Ala Leu Pro Glu Asp Lys Thr Ala He Glu 45 50 55
GCC AAA GAG CAA GAG CAA AAA GAA AAA AGA AAA CGC TGG TAT GAG CTT 306 Ala Lys Glu Gin Glu Gin Lys Glu Lys Arg Lys Arg Trp Tyr Glu Leu 60 65 70
TTT AAA AAG AAA CCA AAG CCC AAA AGC TCT ATG GGA GAG TTT GTG TTT 354 Phe Lys Lys Lys Pro Lys Pro Lys Ser Ser Met Gly Glu Phe Val Phe 75 80 85
GAT CAA AAA GAA AAT CGT ATT TAT GGC AAA GGC TAT TGC AAC CGG TAT 402 Asp Gin Lys Glu Asn Arg He Tyr Gly Lys Gly Tyr Cys Asn Arg Tyr 90 95 100 105
TTT GCC AGC TAT GTA TGG CAG GGC GAT AGG CAC ATT GGG ATT GAA GAT 450 Phe Ala Ser Tyr Val Trp Gin Gly Asp Arg His He Gly He Glu Asp 110 115 120
AGC GGG ATT TCA AGA AAA GTG TGT AAA GAT GAG CAT TTA ATG GCG TTT 498 Ser Gly He Ser Arg Lys Val Cys Lys Asp Glu His Leu Met Ala Phe 125 130 135
GAA TTG GAA TTT ATG GAG AAT TTT AAG GGT AAT TTT ACG GTA ACT AAG 546 Glu Leu Glu Phe Met Glu Asn Phe Lys Gly Asn Phe Thr Val Thr Lys 140 145 150
GGC AAG GAC ACG CTC ATT TTA GAC AAC CAA AAA ATG AAA ATT TAT TTG 594 Gly Lys Asp Thr Leu He Leu Asp Asn Gin Lys Met Lys He Tyr Leu 155 160 165
AAA ACG CCT TGAGTGGGTT TTTGATTTCA AAACAATCTA AGATCACTAA ATTAGGGAT 652
Lys Thr Pro
170
TAAAAAGAAA TTTTTAA 669
(2) INFORMATION FOR SEQ ID NO:574:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 172 amino acids (B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574:
Met Phe Asp Lys Lys Leu Ser Ser Asn Asp Trp His He Gin Lys Val
1 5 10 15
Glu Met Asn His Gin Val Tyr Asp He Glu Thr Met Leu Ala Asp Ser
20 25 30
Ala Phe Arg Glu His Glu Glu Glu Gin Asp Ser Ser Leu Asn Thr Ala
35 40 45
Leu Pro Glu Asp Lys Thr Ala He Glu Ala Lys Glu Gin Glu Gin Lys
50 55 60
Glu Lys Arg Lys Arg Trp Tyr Glu Leu Phe Lys Lys Lys Pro Lys Pro 65 70 75 80
Lys Ser Ser Met Gly Glu Phe Val Phe Asp Gin Lys Glu Asn Arg He
85 90 95
Tyr Gly Lys Gly Tyr Cys Asn Arg Tyr Phe Ala Ser Tyr Val Trp Gin
100 105 110
Gly Asp Arg His He Gly He Glu Asp Ser Gly He Ser Arg Lys Val
115 120 125
Cys Lys Asp Glu His Leu Met Ala Phe Glu Leu Glu Phe Met Glu Asn
130 135 140
Phe Lys Gly Asn Phe Thr Val Thr Lys Gly Lys Asp Thr Leu He Leu 145 150 155 160
Asp Asn Gin Lys Met Lys He Tyr Leu Lys Thr Pro 165 170
(2) INFORMATION FOR SEQ ID NO: 575:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 290 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 53...235 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575:
TTGAGCTTAA GAAACATTGA TAATTTTGTG GAAAAAGGCT CTGCTTTGAT AG ATA AAT 58
He Asn 1 TTG ACG CTA ACC CCT ATA AAA CGA TTT TTG GAG AAA GGA AAT AAT CAT 106 Leu Thr Leu Thr Pro He Lys Arg Phe Leu Glu Lys Gly Asn Asn His 5 10 15
GAG AGC TAC GGC GAT AAA AAT CTT TTC ACT CTC ATC AGC ATT AGC CCT 154 Glu Ser Tyr Gly Asp Lys Asn Leu Phe Thr Leu He Ser He Ser Pro 20 25 30
ATT GCT TCA TGG TTG CTT GAG CAT CAA TTT AAA ACA AAT GCT ACC AGA 202 He Ala Ser Trp Leu Leu Glu His Gin Phe Lys Thr Asn Ala Thr Arg 35 40 45 50
GAT CAG AAC TTA CGA TTT GAA TGC GAG TTC TTT TGAAATCACG CAATGCGCTA 255 Asp Gin Asn Leu Arg Phe Glu Cys Glu Phe Phe 55 60
AACCTTTGAC TGAAGTGAGG CTCATTAGTA TTTTG 290
(2) INFORMATION FOR SEQ ID NO: 576:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 61 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576:
He Asn Leu Thr Leu Thr Pro He Lys Arg Phe Leu Glu Lys Gly Asn
1 5 10 15
Asn His Glu Ser Tyr Gly Asp Lys Asn Leu Phe Thr Leu He Ser He
20 25 30
Ser Pro He Ala Ser Trp Leu Leu Glu His Gin Phe Lys Thr Asn Ala
35 40 45
Thr Arg Asp Gin Asn Leu Arg Phe Glu Cys Glu Phe Phe 50 55 60
(2) INFORMATION FOR SEQ ID NO: 577:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 810 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 28...771 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577:
TATCAAAAAA TAAAGGGAAA AGACTGA ATG TTG AAA AGA ATG ATA TTA TTA GGG 54
Met Leu Lys Arg Met He Leu Leu Gly 1 5
GCT TTG GGT GTT TTA GCG AGC GCT GAA GAG AGT GCG GCT TTT GTG GGA 102 Ala Leu Gly Val Leu Ala Ser Ala Glu Glu Ser Ala Ala Phe Val Gly 10 15 20 25
GTC AAT TAC CAG GTG AGC ATG ATA CAA AAT CAG ACT AAA ATG GTG AAT 150 Val Asn Tyr Gin Val Ser Met He Gin Asn Gin Thr Lys Met Val Asn 30 35 40
GAC AAC GGC TTG CAA AAG CCT TTG ATA AAG TTT CCG CCT TAC GCA GGA 198 Asp Asn Gly Leu Gin Lys Pro Leu He Lys Phe Pro Pro Tyr Ala Gly 45 50 55
GCG GGT TTT GAA GTG GGC TAT AAG CAA TTT TTT GGT AAG AAA AAA TGG 246 Ala Gly Phe Glu Val Gly Tyr Lys Gin Phe Phe Gly Lys Lys Lys Trp 60 65 70
TTT GGC ATG CGT TAT TAT GGG TTT TTT GAC TAC GCG CAC AAC CGC TTT 294 Phe Gly Met Arg Tyr Tyr Gly Phe Phe Asp Tyr Ala His Asn Arg Phe 75 80 85
GGC GTG ATG AAA AAG GGC ATT CCG GTG GGC GAT AGT GGG TTT ATT TAC 342 Gly Val Met Lys Lys Gly He Pro Val Gly Asp Ser Gly Phe He Tyr 90 95 100 105
AAT AGT TTT AGT TTT GGA GGG AAC ACT TTA ACG GAA AGG GAT TCC TAT 390 Asn Ser Phe Ser Phe Gly Gly Asn Thr Leu Thr Glu Arg Asp Ser Tyr 110 115 120
CAG GGG CAA TAC TAT GTC AAT TTA TTC ACT TAT GGC GTG GGG TTA GAT 438 Gin Gly Gin Tyr Tyr Val Asn Leu Phe Thr Tyr Gly Val Gly Leu Asp 125 130 135
ACG CTG TGG AAT TTT GTG AAT AAA GAA AAC ATG GTT TTT GGT TTT GTG 486 Thr Leu Trp Asn Phe Val Asn Lys Glu Asn Met Val Phe Gly Phe Val 140 145 150
GTG GGG ATC CAA TTA GCG GGG GAT AGT TGG GCA ACG AGC ATC AGT AAA 534 Val Gly He Gin Leu Ala Gly Asp Ser Trp Ala Thr Ser He Ser Lys 155 160 165
GAA ATC GCT CAT TAT GCA AAA CAC CAC AGC AAT TCC AGT TAT AGC CCG 582 Glu He Ala His Tyr Ala Lys His His Ser Asn Ser Ser Tyr Ser Pro 170 175 180 185
GCC AAT TTC CAG TTT TTA TGG AAG TTT GGG GTC CGC ACC CAT ATC GCT 630 Ala Asn Phe Gin Phe Leu Trp Lys Phe Gly Val Arg Thr His He Ala 190 195 200 AAA CAC AAT AGC CTA GAA TTA GGG ATT AAA GTG CCT ACG ATC ACA CAC 678 Lys His Asn Ser Leu Glu Leu Gly He Lys Val Pro Thr He Thr His 205 210 215
CAG CTT TTC TCT CTT ACC AAC GAA AAG GGA TAC ACC TTA CAG GCT GAT 726 Gin Leu Phe Ser Leu Thr Asn Glu Lys Gly Tyr Thr Leu Gin Ala Asp 220 225 230
GTG CGT AGA GTT TAT GCG TTT CAA ATC AGT TAC TTG AGG GAT TTT TAACC 776 Val Arg Arg Val Tyr Ala Phe Gin He Ser Tyr Leu Arg Asp Phe 235 240 245
CCTTTTTAGA TACAATCACG CCTGAAACTA TCCA 810
(2) INFORMATION FOR SEQ ID NO: 578:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 248 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578:
Met Leu Lys Arg Met He Leu Leu Gly Ala Leu Gly Val Leu Ala Ser
1 5 10 15
Ala Glu Glu Ser Ala Ala Phe Val Gly Val Asn Tyr Gin Val Ser Met
20 25 30
He Gin Asn Gin Thr Lys Met Val Asn Asp Asn Gly Leu Gin Lys Pro
35 40 45
Leu He Lys Phe Pro Pro Tyr Ala Gly Ala Gly Phe Glu Val Gly Tyr
50 55 60
Lys Gin Phe Phe Gly Lys Lys Lys Trp Phe Gly Met Arg Tyr Tyr Gly 65 70 75 80
Phe Phe Asp Tyr Ala His Asn Arg Phe Gly Val Met Lys Lys Gly He
85 90 95
Pro Val Gly Asp Ser Gly Phe He Tyr Asn Ser Phe Ser Phe Gly Gly
100 105 110
Asn Thr Leu Thr Glu Arg Asp Ser Tyr Gin Gly Gin Tyr Tyr Val Asn
115 120 125
Leu Phe Thr Tyr Gly Val Gly Leu Asp Thr Leu Trp Asn Phe Val Asn
130 135 140
Lys Glu Asn Met Val Phe Gly Phe Val Val Gly He Gin Leu Ala Gly 145 150 155 160
Asp Ser Trp Ala Thr Ser He Ser Lys Glu He Ala His Tyr Ala Lys
165 170 175
His His Ser Asn Ser Ser Tyr Ser Pro Ala Asn Phe Gin Phe Leu Trp
180 185 190
Lys Phe Gly Val Arg Thr His He Ala Lys His Asn Ser Leu Glu Leu
195 200 205
Gly He Lys Val Pro Thr He Thr His Gin Leu Phe Ser Leu Thr Asn 210 215 220 Glu Lys Gly Tyr Thr Leu Gin Ala Asp Val Arg Arg Val Tyr Ala Phe 225 230 235 240
Gin He Ser Tyr Leu Arg Asp Phe 245
(2) INFORMATION FOR SEQ ID NO: 579:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1354 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 190...1299 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579:
AGTAAGCTTG ATTTTAAAAA ACCAAATGCC CCCAAAGCGA TAGGCCCCCT TAATAGCCAA 60
CTCAACGCTA TTAAGTGGGG CGAGTTCAGA TTGGGGGATT TGTTTGAAGT GTTGTCAAGT 120
AAGAAAATTT ATCATGCCAA CACGATAAAA ATCCATGACA CGCAAATAGA AAACAGCTAC 180
CCTTATGTC GTG CGC GCT GCA ACC AAT AAT GGT ATA AAA GGC TTT ATT ATA 231 Val Arg Ala Ala Thr Asn Asn Gly He Lys Gly Phe He He 1 5 10
GAT GAC CCT ACA TTT GCT AAT AAA AAA AAT ACC CTT TCG TTC GCG CAA 279 Asp Asp Pro Thr Phe Ala Asn Lys Lys Asn Thr Leu Ser Phe Ala Gin 15 20 25 30
GAC ACT TTC ACT GTG TTT TAT CAA AAA CAA CCT TAT TTT ACA GGC AAT 327 Asp Thr Phe Thr Val Phe Tyr Gin Lys Gin Pro Tyr Phe Thr Gly Asn 35 40 45
AAG GTT AAA ATT TTA AAA CCA AAA TTT GCT TTC AAA AGC CCT AAA ATT 375 Lys Val Lys He Leu Lys Pro Lys Phe Ala Phe Lys Ser Pro Lys He 50 55 60
TTA CAT TCT ATA AGC GCG ATT TTA CAA TTT ATT TTA AAA CCC TTA ACT 423 Leu His Ser He Ser Ala He Leu Gin Phe He Leu Lys Pro Leu Thr 65 70 75
TGG GGG CTA GGC TCT ACA ACA GAA AGC ATT GCG GAG TTT AAA TTT TCT 471 Trp Gly Leu Gly Ser Thr Thr Glu Ser He Ala Glu Phe Lys Phe Ser 80 85 90
CTA CCC CTA AAA CCC ACC GCT AAC GCT CAA ACC CTT GAG GAT ATT GAT 519 Leu Pro Leu Lys Pro Thr Ala Asn Ala Gin Thr Leu Glu Asp He Asp 95 100 105 110 TTT GAT TTC ATG GAA AAA TTC ATA GCC GAA CTT GAG CAG TGT CGG CTC 567 Phe Asp Phe Met Glu Lys Phe He Ala Glu Leu Glu Gin Cys Arg Leu 115 120 125
GCC GAA CTT GAG CAG TGT CGG CTC GCC GAA CTT CAG GCT TAT TTA AAA 615 Ala Glu Leu Glu Gin Cys Arg Leu Ala Glu Leu Gin Ala Tyr Leu Lys 130 135 140
GCT ACA GGG CTA GAA AAC ACC ACC CTT TCT AAC GAT GAA GAA AAC GCC 663 Ala Thr Gly Leu Glu Asn Thr Thr Leu Ser Asn Asp Glu Glu Asn Ala 145 150 155
CTT AAT GTT TTC AAT AAT TCT GGG GGG GGG GGG GGT AAT ACC CCA TGC 711 Leu Asn Val Phe Asn Asn Ser Gly Gly Gly Gly Gly Asn Thr Pro Cys 160 165 170
GGC TTA ACA TGG CAA AGC TTT AGA TTA GGG GAT TTG TTT GAA ATT GAA 759 Gly Leu Thr Trp Gin Ser Phe Arg Leu Gly Asp Leu Phe Glu He Glu 175 180 185 190
AAA ACC TTA AGC TTT AAT AAA GAC GCT TTA ACG CAA GGA GAA GAT TAT 807 Lys Thr Leu Ser Phe Asn Lys Asp Ala Leu Thr Gin Gly Glu Asp Tyr 195 200 205
GAT TAT ATT ACA AGG ACT TCG CAA AAT CAA GGC GTT TTG CAA ACT ACA 855 Asp Tyr He Thr Arg Thr Ser Gin Asn Gin Gly Val Leu Gin Thr Thr 210 215 220
GGA TTT GTC AAT GCA GAA AAT TTA AAC CCA CCA TTT ACT TGG AGT TTA 903 Gly Phe Val Asn Ala Glu Asn Leu Asn Pro Pro Phe Thr Trp Ser Leu 225 230 235
GGG CTT TTG CAA ATG GAT TTT TTC TAT CGT AAA AAG TCA TGG TAT GCG 951 Gly Leu Leu Gin Met Asp Phe Phe Tyr Arg Lys Lys Ser Trp Tyr Ala 240 245 250
GGA CAA TTC ATG CGA AAA ATC ACA CCA AAA ACT GAA ATT GAA AAT AAA 999 Gly Gin Phe Met Arg Lys He Thr Pro Lys Thr Glu He Glu Asn Lys 255 260 265 270
ATT GAT TTA CGC ATA GCC AAC TAC TTC ACA ACG CTT TTA AAC GCC TTA 1047 He Asp Leu Arg He Ala Asn Tyr Phe Thr Thr Leu Leu Asn Ala Leu 275 280 285
AAA CGC CCT TTA TTA AGC GTA TTG GTT AGA GAT ATT GAT AAA ACT TTT 1095 Lys Arg Pro Leu Leu Ser Val Leu Val Arg Asp He Asp Lys Thr Phe 290 295 300
AGG GAG CAA AAA ATC CAA CTA CCC CTA AAA CCC ACC GCT AAA ACT CAA 1143 Arg Glu Gin Lys He Gin Leu Pro Leu Lys Pro Thr Ala Lys Thr Gin 305 310 315
ACC CTT GAT GGT ATT GAT TTT GAT TTC ATG CAC ACC CTT ATC AAC GCC 1191 Thr Leu Asp Gly He Asp Phe Asp Phe Met His Thr Leu He Asn Ala 320 325 330 CTA ATG AAG CAA ACC ATT CAA GGC GTG GCT CAA TAC TGC GAC GCT AAA 1239 Leu Met Lys Gin Thr He Gin Gly Val Ala Gin Tyr Cys Asp Ala Lys 335 340 345 350
ATA CAA GCT ACA AAA GAG GTT ATC AGC CAA GAA GCG CCC GTT CAA AAA 1287 He Gin Ala Thr Lys Glu Val He Ser Gin Glu Ala Pro Val Gin Lys 355 360 365
GAC TCG TTA TTT TAAAAGGGGT TTTAAGCGCG CTAGCTTGTG TTACAATAAA CTTAA 1344 Asp Ser Leu Phe 370
AATTCGCTTG 1354
(2) INFORMATION FOR SEQ ID NO: 580:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 370 ammo acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580:
Val Arg Ala Ala Thr Asn Asn Gly He Lys Gly Phe He He Asp Asp
1 5 10 15
Pro Thr Phe Ala Asn Lys Lys Asn Thr Leu Ser Phe Ala Gin Asp Thr
20 25 30
Phe Thr Val Phe Tyr Gin Lys Gin Pro Tyr Phe Thr Gly Asn Lys Val
35 40 45
Lys He Leu Lys Pro Lys Phe Ala Phe Lys Ser Pro Lys He Leu His
50 55 60
Ser He Ser Ala He Leu Gin Phe He Leu Lys Pro Leu Thr Trp Gly 65 70 75 80
Leu Gly Ser Thr Thr Glu Ser He Ala Glu Phe Lys Phe Ser Leu Pro
85 90 95
Leu Lys Pro Thr Ala Asn Ala Gin Thr Leu Glu Asp He Asp Phe Asp
100 105 110
Phe Met Glu Lys Phe He Ala Glu Leu Glu Gin Cys Arg Leu Ala Glu
115 120 125
Leu Glu Gin Cys Arg Leu Ala Glu Leu Gin Ala Tyr Leu Lys Ala Thr
130 135 140
Gly Leu Glu Asn Thr Thr Leu Ser Asn Asp Glu Glu Asn Ala Leu Asn 145 150 155 160
Val Phe Asn Asn Ser Gly Gly Gly Gly Gly Asn Thr Pro Cys Gly Leu
165 170 175
Thr Trp Gin Ser Phe Arg Leu Gly Asp Leu Phe Glu He Glu Lys Thr
180 185 190
Leu Ser Phe Asn Lys Asp Ala Leu Thr Gin Gly Glu Asp Tyr Asp Tyr
195 200 205
He Thr Arg Thr Ser Gin Asn Gin Gly Val Leu Gin Thr Thr Gly Phe 210 215 220 Val Asn Ala Glu Asn Leu Asn Pro Pro Phe Thr Trp Ser Leu Gly Leu 225 230 235 240
Leu Gin Met Asp Phe Phe Tyr Arg Lys Lys Ser Trp Tyr Ala Gly Gin
245 250 255
Phe Met Arg Lys He Thr Pro Lys Thr Glu He Glu Asn Lys He Asp
260 265 270
Leu Arg He Ala Asn Tyr Phe Thr Thr Leu Leu Asn Ala Leu Lys Arg
275 280 285
Pro Leu Leu Ser Val Leu Val Arg Asp He Asp Lys Thr Phe Arg Glu
290 295 300
Gin Lys He Gin Leu Pro Leu Lys Pro Thr Ala Lys Thr Gin Thr Leu 305 310 315 320
Asp Gly He Asp Phe Asp Phe Met His Thr Leu He Asn Ala Leu Met
325 330 335
Lys Gin Thr He Gin Gly Val Ala Gin Tyr Cys Asp Ala Lys He Gin
340 345 350
Ala Thr Lys Glu Val He Ser Gin Glu Ala Pro Val Gin Lys Asp Ser
355 360 365
Leu Phe 370
(2) INFORMATION FOR SEQ ID NO: 581:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 450 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 164...367 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:581:
AATCCAGCAT GCCCTCTAAT TCAAACACGC CCTCTTCAAG CTTGTTTATG CCCTCTTGTT 60
TTAAGTCGTA TTCGTCGCTA ATCTCGCCCA TGATCTCTTC AATGATGTCT TCCATAGTGA 120
GCAACCCGGC TGTGCCGCCG TATTCATCAA TCACCAAAGC GGT ATG GAT TTG CTC 175
Met Asp Leu Leu 1
TTT TTT CAT TTT AAT AAG GAT TTG AGA AAT GGA AGC GCT TTC GGG GAC 223 Phe Phe His Phe Asn Lys Asp Leu Arg Asn Gly Ser Ala Phe Gly Asp 5 10 15 20
GAT GAT CAT TTT CCT AAC GAT TTG ATT GAA ATC ATG CAT TTT GGG GGT 271 Asp Asp His Phe Pro Asn Asp Leu He Glu He Met His Phe Gly Gly 25 30 35 AAA AAT AGA GCG CGA AAG CAA ATC CCT AAT ATG CAC CAT GCC GAT AAT 319 Lys Asn Arg Ala Arg Lys Gin He Pro Asn Met His His Ala Asp Asn 40 45 50
GTT ATC CTT AGA ACC CTT GCA ATA AGG GTA GCG CGT GAA ATG GCC TTT T 368 Val He Leu Arg Thr Leu Ala He Arg Val Ala Arg Glu Met Ala Phe 55 60 65
AAAACAATGT CTATATTTTC TTCATAGCTG TTTTCTTCAT CCAAACACAC CATGTCTTTT 428 CGTGGGGTCA TGATTTCTTT AG 450
(2) INFORMATION FOR SEQ ID NO: 582:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 68 ammo acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582:
Met Asp Leu Leu Phe Phe His Phe Asn Lys Asp Leu Arg Asn Gly Ser
1 5 10 15
Ala Phe Gly Asp Asp Asp His Phe Pro Asn Asp Leu He Glu He Met
20 25 30
His Phe Gly Gly Lys Asn Arg Ala Arg Lys Gin He Pro Asn Met His
35 40 45
His Ala Asp Asn Val He Leu Arg Thr Leu Ala He Arg Val Ala Arg
50 55 60
Glu Met Ala Phe 65
(2) INFORMATION FOR SEQ ID NO: 583:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1051 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51... 98 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:583 ACCAAGAGGT CGTTAAAAGC TATTACCAAC ATTTAAAACA AGGATAAAAC ATG CAA 56
Met Gin
1
GAA TTC AGT TTG TGG TGC GAT TTT ATA GAA AGG GAT TTT TTA GAA AAC 104 Glu Phe Ser Leu Trp Cys Asp Phe He Glu Arg Asp Phe Leu Glu Asn 5 10 15
GAC TTT TTA AAG CTC ATT AAT AAG GGG GCT ATT TGC GGG GCA ACG AGT 152 Asp Phe Leu Lys Leu He Asn Lys Gly Ala He Cys Gly Ala Thr Ser 20 25 30
AAC CCT AGT TTG TTT TGC GAA GCG ATC ACA AAA AGC GCG TTT TAT AAA 200 Asn Pro Ser Leu Phe Cys Glu Ala He Thr Lys Ser Ala Phe Tyr Lys 35 40 45 50
GAT GAA ATC GCT AAA CTC AAA GGC AAA AAA GCT AAA GAA ATT TAT GAA 248 Asp Glu He Ala Lys Leu Lys Gly Lys Lys Ala Lys Glu He Tyr Glu 55 60 65
ACT CTG GCG TTA AAG GAT ATT TTA CAA GCT TCT AGC GCG TTG ATG CCT 296 Thr Leu Ala Leu Lys Asp He Leu Gin Ala Ser Ser Ala Leu Met Pro 70 75 80
TTA TAT GAA AAA GAC CCT AAC AAT GGC TAC ATT AGC CTA GAA ATT GAC 344 Leu Tyr Glu Lys Asp Pro Asn Asn Gly Tyr He Ser Leu Glu He Asp 85 90 95
CCT TTT TTA GAA GAT GAT GCC GCT AAA AGC ATT GAT GAA GCC AAG CGG 392 Pro Phe Leu Glu Asp Asp Ala Ala Lys Ser He Asp Glu Ala Lys Arg 100 105 110
TTG TTC AAA ACA TTA AAC CGC CCT AAT GTG ATG ATT AAA GTC CCA GCG 440 Leu Phe Lys Thr Leu Asn Arg Pro Asn Val Met He Lys Val Pro Ala 115 120 125 130
AGT GAA AGC GGG ATT GAA GTG GTT AGC GCT TTA ACT CAA GCC TCT ATT 488 Ser Glu Ser Gly He Glu Val Val Ser Ala Leu Thr Gin Ala Ser He 135 140 145
CCT GTT AAT GTA ACT TTA GTC TTT TCG CCT AAA ATT GCC GGT GAA ATC 536 Pro Val Asn Val Thr Leu Val Phe Ser Pro Lys He Ala Gly Glu He 150 155 160
GCT CAA ATC TTA GCC AAA GAA GCG CAA AAA AGA GCG GTC ATT AGC GTG 584 Ala Gin He Leu Ala Lys Glu Ala Gin Lys Arg Ala Val He Ser Val 165 170 175
TTT GTC TCA CGA TTT GAC AAA GAA ATA GAC CCT TTA GTG CCA AAA AAT 632 Phe Val Ser Arg Phe Asp Lys Glu He Asp Pro Leu Val Pro Lys Asn 180 185 190
TTG CAA GCT CAA AGC GGG ATT ATC AAC GCT ACC GAG TGC TAT TAT CAA 680 Leu Gin Ala Gin Ser Gly He He Asn Ala Thr Glu Cys Tyr Tyr Gin 195 200 205 210 ATT AAT CAG CAT GCC AAT AAG CTA ACA AGC ACC CTT TTT GCA TCC ACA 728 He Asn Gin His Ala Asn Lys Leu Thr Ser Thr Leu Phe Ala Ser Thr 215 220 225
GGC GTT AAA TCC AAT TCT TTA GCT AAA GAT TAC TAC ATT AAA GCG CTG 776 Gly Val Lys Ser Asn Ser Leu Ala Lys Asp Tyr Tyr He Lys Ala Leu 230 235 240
TGT TTT AAA AAC TCT ATC AAT ACA GCC CCT CTA GAG GCT TTA AAC GCT 824 Cys Phe Lys Asn Ser He Asn Thr Ala Pro Leu Glu Ala Leu Asn Ala 245 250 255
TAT TTG CTT GAC CCA AAC ACC GAG TGT CAA ACC CCT TTA AAG ACT ACA 872 Tyr Leu Leu Asp Pro Asn Thr Glu Cys Gin Thr Pro Leu Lys Thr Thr 260 265 270
GAA ATT GAA GCG TTT AAA AAA GAA TTA AAA GTG CAC AAC ATT GAT TTA 920 Glu He Glu Ala Phe Lys Lys Glu Leu Lys Val His Asn He Asp Leu 275 280 285 290
GAA AAC ACC GCT CAA AAA CTC CTT AAA GAA GGC TTG ATA GCG TTC AAA 968 Glu Asn Thr Ala Gin Lys Leu Leu Lys Glu Gly Leu He Ala Phe Lys 295 300 305
CAA TCC TTT GAA AAG CTT TTA AGC AGT TTT TGATTTTTAA GGGTTTTTTG GAT 1021 Gin Ser Phe Glu Lys Leu Leu Ser Ser Phe 310 315
AGAATAAGCC CTTATTTTAT TTTAAAGGAT 1051
(2) INFORMATION FOR SEQ ID NO: 584:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 316 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584:
Met Gin Glu Phe Ser Leu Trp Cys Asp Phe He Glu Arg Asp Phe Leu
1 5 10 15
Glu Asn Asp Phe Leu Lys Leu He Asn Lys Gly Ala He Cys Gly Ala
20 25 30
Thr Ser Asn Pro Ser Leu Phe Cys Glu Ala He Thr Lys Ser Ala Phe
35 40 45
Tyr Lys Asp Glu He Ala Lys Leu Lys Gly Lys Lys Ala Lys Glu He
50 55 60
Tyr Glu Thr Leu Ala Leu Lys Asp He Leu Gin Ala Ser Ser Ala Leu 65 70 75 80
Met Pro Leu Tyr Glu Lys Asp Pro Asn Asn Gly Tyr He Ser Leu Glu
85 90 95 He Asp Pro Phe Leu Glu Asp Asp Ala Ala Lys Ser He Asp Glu Ala
100 105 110
Lys Arg Leu Phe Lys Thr Leu Asn Arg Pro Asn Val Met He Lys Val
115 120 125
Pro Ala Ser Glu Ser Gly He Glu Val Val Ser Ala Leu Thr Gin Ala
130 135 140
Ser He Pro Val Asn Val Thr Leu Val Phe Ser Pro Lys He Ala Gly 145 150 155 160
Glu He Ala Gin He Leu Ala Lys Glu Ala Gin Lys Arg Ala Val He
165 170 175
Ser Val Phe Val Ser Arg Phe Asp Lys Glu He Asp Pro Leu Val Pro
180 185 190
Lys Asn Leu Gin Ala Gin Ser Gly He He Asn Ala Thr Glu Cys Tyr
195 200 205
Tyr Gin He Asn Gin His Ala Asn Lys Leu Thr Ser Thr Leu Phe Ala
210 215 220
Ser Thr Gly Val Lys Ser Asn Ser Leu Ala Lys Asp Tyr Tyr He Lys 225 230 235 240
Ala Leu Cys Phe Lys Asn Ser He Asn Thr Ala Pro Leu Glu Ala Leu
245 250 255
Asn Ala Tyr Leu Leu Asp Pro Asn Thr Glu Cys Gin Thr Pro Leu Lys
260 265 270
Thr Thr Glu He Glu Ala Phe Lys Lys Glu Leu Lys Val His Asn He
275 280 285
Asp Leu Glu Asn Thr Ala Gin Lys Leu Leu Lys Glu Gly Leu He Ala
290 295 300
Phe Lys Gin Ser Phe Glu Lys Leu Leu Ser Ser Phe 305 310 315
(2) INFORMATION FOR SEQ ID NO: 585.
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1254 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 55...1215 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:585:
AGCTATAATA AAATAATTAA AAAAGTAACA CTTAAGCGGA GACCCTAGAG AGTG ATG 57
Met
1
CTC AAT TTT ATG ACA AAG AAG AAA AAT AGA ATG CAA GAT TGC AAA ATG 105 Leu Asn Phe Met Thr Lys Lys Lys Asn Arg Met Gin Asp Cys Lys Met 5 10 15
GTT TGT AAA AAT TTT AAT CGT AAG GAA TCT GTT TTG ATA GCT CAA TCT 153 Val Cys Lys Asn Phe Asn Arg Lys Glu Ser Val Leu He Ala Gin Ser 20 25 30
TTA GAT ATT TCT AAA AAA GGT TCG GTA ATT TTA GGC GCT CTT TTG AGT 201 Leu Asp He Ser Lys Lys Gly Ser Val He Leu Gly Ala Leu Leu Ser 35 40 45
TCG TTA TGG CTG ACA AAC CCC TTA AAT GCC CAT GAA AAG AAT GGC GCG 249 Ser Leu Trp Leu Thr Asn Pro Leu Asn Ala His Glu Lys Asn Gly Ala 50 55 60 65
TTT GTG GGG ATT AGC TTG GAA GTG GGT AGG GCC GAT CAA AAG ACA AAC 297 Phe Val Gly He Ser Leu Glu Val Gly Arg Ala Asp Gin Lys Thr Asn 70 75 80
GCT TAT AAA AAC GGC GAG TTG TTT CAA GTG CCT TTT GGC GAT GTT TCG 345 Ala Tyr Lys Asn Gly Glu Leu Phe Gin Val Pro Phe Gly Asp Val Ser 85 90 95
GCT AAT GAT GAT GGC AAA GTT CCT GAC GGG CAG ACC GGT GGC TGT CAG 393 Ala Asn Asp Asp Gly Lys Val Pro Asp Gly Gin Thr Gly Gly Cys Gin 100 105 110
CCA GCT TCA GGG ACG CCA GGA ACG CCA GGC TAC ACT AAA GCT AAC TGC 441 Pro Ala Ser Gly Thr Pro Gly Thr Pro Gly Tyr Thr Lys Ala Asn Cys 115 120 125
GTG GTC AAT TGG ACT TCG CGC ACC ATG CTT AGC ACC AAT AAA AAC ATT 489 Val Val Asn Trp Thr Ser Arg Thr Met Leu Ser Thr Asn Lys Asn He 130 135 140 145
CCT GGC CGT AAC CAG CCG ATG TAT GGG CTA GGC GTG ATG ACA GGC TAT 537 Pro Gly Arg Asn Gin Pro Met Tyr Gly Leu Gly Val Met Thr Gly Tyr 150 155 160
AAG CAT TTT ATC GGT AAA AAA AGA TGG TTT GGG TTG CGC TAT TAC GGC 585 Lys His Phe He Gly Lys Lys Arg Trp Phe Gly Leu Arg Tyr Tyr Gly 165 170 175
TTT TTT GAT TAT GGG CAT ACC AAT TTC TCT AAC TCC AGA GCC GCT AAC 633 Phe Phe Asp Tyr Gly His Thr Asn Phe Ser Asn Ser Arg Ala Ala Asn 180 185 190
GCT ATA TCG CCT TTT TAT TTG AGC GAT CAA AAA GCC GAC ATG TAT ACT 681 Ala He Ser Pro Phe Tyr Leu Ser Asp Gin Lys Ala Asp Met Tyr Thr 195 200 205
TAT GGT TTT GGC ACA GAC ATG CTT TTT AAC ATT ATA GAT AAG CCT AAA 729 Tyr Gly Phe Gly Thr Asp Met Leu Phe Asn He He Asp Lys Pro Lys 210 215 220 225
GCC ACG GCC GGG TTT TTT TTA GGC GTG AAT TTT GCG GGT AAC ACT TGG 777 Ala Thr Ala Gly Phe Phe Leu Gly Val Asn Phe Ala Gly Asn Thr Trp 230 235 240
ACT AAT AAT CGT GTG GGG TAT TTT AAG GAC GGG TAT GTT TAT GGC GTC 825 Thr Asn Asn Arg Val Gly Tyr Phe Lys Asp Gly Tyr Val Tyr Gly Val 245 250 255
AAT ACG GAC GCT GAC GCT TAC ATG ACT AAC GCT GAT GGC ACA ATC ACT 873 Asn Thr Asp Ala Asp Ala Tyr Met Thr Asn Ala Asp Gly Thr He Thr 260 265 270
TGC GGG GAC ACG ACG CCG GCG AGT TGC AAT GTG GGG ATT AAC CCT AAT 921 Cys Gly Asp Thr Thr Pro Ala Ser Cys Asn Val Gly He Asn Pro Asn 275 280 285
AGC GTC TAT ACC ACA GGA AAA TTG AAC GCT AAG GTG AAT CAC ACG ATT 969 Ser Val Tyr Thr Thr Gly Lys Leu Asn Ala Lys Val Asn His Thr He 290 295 300 305
TTC CAA TTT TTA GTG AAT GTG GGC ATT AGA ACT AAT ATT TTT GAA CAC 1017 Phe Gin Phe Leu Val Asn Val Gly He Arg Thr Asn He Phe Glu His 310 315 320
CAT GGC ATT GAG TTT GGC ATC AAA ATC CCC ACG CTC CCT AAC TAC TTT 1065 His Gly He Glu Phe Gly He Lys He Pro Thr Leu Pro Asn Tyr Phe 325 330 335
TTC AAA GGT TCT ACT ACC ATA AGA GCG AAA AAA CAA GGC CCG CTA GAG 1113 Phe Lys Gly Ser. Thr Thr He Arg Ala Lys Lys Gin Gly Pro Leu Glu 340 345 350
AAT GGC CAA CCA ACC ACT ATC ACC GGA GCA GAA ACC AAT TTC AGC TTA 1161 Asn Gly Gin Pro Thr Thr He Thr Gly Ala Glu Thr Asn Phe Ser Leu 355 360 365
ACC CAA ACC TTA CGC CGT CAG TAT TCT ATG TAT TTG CGC TAT GTT TAT 1209 Thr Gin Thr Leu Arg Arg Gin Tyr Ser Met Tyr Leu Arg Tyr Val Tyr 370 375 380 385
ACT TTT TAAGTTTGGT AGGGTTTTTA GGCAAGGCTT AGAGATGAA 1254
Thr Phe
(2) INFORMATION FOR SEQ ID NO: 586
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 387 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586:
Met Leu Asn Phe Met Thr Lys Lys Lys Asn Arg Met Gin Asp Cys Lys
1 5 10 15
Met Val Cys Lys Asn Phe Asn Arg Lys Glu Ser Val Leu He Ala Gin
20 25 30
Ser Leu Asp He Ser Lys Lys Gly Ser Val He Leu Gly Ala Leu Leu
35 40 45
Ser Ser Leu Trp Leu Thr Asn Pro Leu Asn Ala His Glu Lys Asn Gly
50 55 60
Ala Phe Val Gly He Ser Leu Glu Val Gly Arg Ala Asp Gin Lys Thr 65 70 75 80
Asn Ala Tyr Lys Asn Gly Glu Leu Phe Gin Val Pro Phe Gly Asp Val
85 90 95
Ser Ala Asn Asp Asp Gly Lys Val Pro Asp Gly Gin Thr Gly Gly Cys
100 105 110
Gin Pro Ala Ser Gly Thr Pro Gly Thr Pro Gly Tyr Thr Lys Ala Asn
115 120 125
Cys Val Val Asn Trp Thr Ser Arg Thr Met Leu Ser Thr Asn Lys Asn
130 135 140
He Pro Gly Arg Asn Gin Pro Met Tyr Gly Leu Gly Val Met Thr Gly 145 150 155 160
Tyr Lys His Phe He Gly Lys Lys Arg Trp Phe Gly Leu Arg Tyr Tyr
165 170 175
Gly Phe Phe Asp Tyr Gly His Thr Asn Phe Ser Asn Ser Arg Ala Ala
180 185 190
Asn Ala He Ser Pro Phe Tyr Leu Ser Asp Gin Lys Ala Asp Met Tyr
195 200 205
Thr Tyr Gly Phe Gly Thr Asp Met Leu Phe Asn He He Asp Lys Pro
210 215 220
Lys Ala Thr Ala Gly Phe Phe Leu Gly Val Asn Phe Ala Gly Asn Thr 225 230 235 240
Trp Thr Asn Asn Arg Val Gly Tyr Phe Lys Asp Gly Tyr Val Tyr Gly
245 250 255
Val Asn Thr Asp Ala Asp Ala Tyr Met Thr Asn Ala Asp Gly Thr He
260 265 270
Thr Cys Gly Asp Thr Thr Pro Ala Ser Cys Asn Val Gly He Asn Pro
275 280 285
Asn Ser Val Tyr Thr Thr Gly Lys Leu Asn Ala Lys Val Asn His Thr
290 295 300
He Phe Gin Phe Leu Val Asn Val Gly He Arg Thr Asn He Phe Glu 305 310 315 320
His His Gly He Glu Phe Gly He Lys He Pro Thr Leu Pro Asn Tyr
325 330 335
Phe Phe Lys Gly Ser Thr Thr He Arg Ala Lys Lys Gin Gly Pro Leu
340 345 350
Glu Asn Gly Gin Pro Thr Thr He Thr Gly Ala Glu Thr Asn Phe Ser
355 360 365
Leu Thr Gin Thr Leu Arg Arg Gin Tyr Ser Met Tyr Leu Arg Tyr Val
370 375 380
Tyr Thr Phe 385
(2) INFORMATION FOR SEQ ID NO: 587: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 534 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 47...481 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:587:
AGTAGATTAA AAACTCTTTA ATTTTCCCAT GAATAAAGGA TTTTAA ATG GAT GCG 55
Met Asp Ala
1
ATT TAT CCT TAT GTG TTG GTT GTT CAT TTA TTG TGC GCC ATT ATT TTT 103 He Tyr Pro Tyr Val Leu Val Val His Leu Leu Cys Ala He He Phe 5 10 15
ATT GGC TAC TTG TTT TTT GAT GGG GTA ATT TTC CCT AAT GTG AAG AAA 151 He Gly Tyr Leu Phe Phe Asp Gly Val He Phe Pro Asn Val Lys Lys 20 25 30 35
ATG TTT GGC GAA GAG TTT GCC AAT AAA GCG AAT ACA GGA ATC ACT CAA 199 Met Phe Gly Glu Glu Phe Ala Asn Lys Ala Asn Thr Gly He Thr Gin 40 45 50
AGA GCG ATC AAA ATC ATG CCC TTA TGC GTT TTA GGG CTT GTT TTA ACA 247 Arg Ala He Lys He Met Pro Leu Cys Val Leu Gly Leu Val Leu Thr 55 60 65
GGG GGC ATG ATG CTT AGC CAA TAC ATG GGG GGC GAT AAA GGC TGG TGT 295 Gly Gly Met Met Leu Ser Gin Tyr Met Gly Gly Asp Lys Gly Trp Cys 70 75 80
GAA ACC CCT TTT CAA AAG ATA CTC ATG CTT AAA GTG ATC TTA GCG TTA 343 Glu Thr Pro Phe Gin Lys He Leu Met Leu Lys Val He Leu Ala Leu 85 90 95
AGC ATT TTT CTT TTG GTG CTT TTT TCT TTA TCG TGT AAG TTT TTG GGC 391 Ser He Phe Leu Leu Val Leu Phe Ser Leu Ser Cys Lys Phe Leu Gly 100 105 110 115
AAG AAA AAC CCT ATT GGT AAA TAT ATC CAC CCT ATC GCT CTA ACT TTT 439 Lys Lys Asn Pro He Gly Lys Tyr He His Pro He Ala Leu Thr Phe 120 125 130
GGC TTT TTA ATC GCC ATT TTA GCC AAA ACG ATG TGG TTT GTT TAAGAGCGT 490 Gly Phe Leu He Ala He Leu Ala Lys Thr Met Trp Phe Val 135 140 145
TTCAACCTCA AAGAATTTAA GACCACTAAG AGTGAGCTAG CGCT 534
(2) INFORMATION FOR SEQ ID NO: 588:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 145 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588:
Met Asp Ala He Tyr Pro Tyr Val Leu Val Val His Leu Leu Cys Ala
1 5 10 15
He He Phe He Gly Tyr Leu Phe Phe Asp Gly Val He Phe Pro Asn
20 25 30
Val Lys Lys Met Phe Gly Glu Glu Phe Ala Asn Lys Ala Asn Thr Gly
35 40 45
He Thr Gin Arg Ala He Lys He Met Pro Leu Cys Val Leu Gly Leu
50 55 60
Val Leu Thr Gly Gly Met Met Leu Ser Gin Tyr Met Gly Gly Asp Lys 65 70 75 80
Gly Trp Cys Glu Thr Pro Phe Gin Lys He Leu Met Leu Lys Val He
85 90 95
Leu Ala Leu Ser He Phe Leu Leu Val Leu Phe Ser Leu Ser Cys Lys
100 105 110
Phe Leu Gly Lys Lys Asn Pro He Gly Lys Tyr He His Pro He Ala
115 120 125
Leu Thr Phe Gly Phe Leu He Ala He Leu Ala Lys Thr Met Trp Phe
130 135 140
Val 145
(2) INFORMATION FOR SEQ ID NO: 589:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 635 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 48...584 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:589:
ACCACCTCAC CGATGAAGAA ATTAAGATCA TTGAAGAGGG GCAGTGA ATG GAA AAG 56
Met Glu Lys
1
TTA TTT GAA AAG ATA TTG CAT GAA ATG AGA TCA AGA ACT TCT TTT CTG 104 Leu Phe Glu Lys He Leu His Glu Met Arg Ser Arg Thr Ser Phe Leu 5 10 15
CTT GCT TTT GTC GTT TCG CTT ATT GTT TTT ATT TTT AAT CTA AAA GGG 152 Leu Ala Phe Val Val Ser Leu He Val Phe He Phe Asn Leu Lys Gly 20 25 30 35
GTT TTT CAA TTG ATT TTT GAG TCT ATT TTC CAA TAC ACC CAA AAC AAA 200 Val Phe Gin Leu He Phe Glu Ser He Phe Gin Tyr Thr Gin Asn Lys 40 45 50
ATC CTT TCT TTT TCT CTT TCT TTT TTC TTT ATT TTC TTT TTC TTT TAC 248 He Leu Ser Phe Ser Leu Ser Phe Phe Phe He Phe Phe Phe Phe Tyr 55 60 65
GCT ATT TTT CTT ATT TTT TAT CAA ATA TTT CTT TGG TAT GGG GCT AAA 296 Ala He Phe Leu He Phe Tyr Gin He Phe Leu Trp Tyr Gly Ala Lys 70 75 80
AAA TAT AAA CAA AAT CAA AGA GAT AGT GAA ATT GTC TAT AAC ATT CAA 344 Lys Tyr Lys Gin Asn Gin Arg Asp Ser Glu He Val Tyr Asn He Gin 85 90 95
AAA TTC CCA AAT GAG ATA AAA GAA GAA CTT TAT CGT TGC TAT TCT AAA 392 Lys Phe Pro Asn Glu He Lys Glu Glu Leu Tyr Arg Cys Tyr Ser Lys 100 105 110 115
AAA CAA AAT AAA ATT CTT AGA ACG AAA AAA CTT GAT GAT TTG ATA GAT 440 Lys Gin Asn Lys He Leu Arg Thr Lys Lys Leu Asp Asp Leu He Asp 120 125 130
TAT CTT GAT TTA ATA GGT TTT TTT AAG TCG AGA GAT TTT TTT GAA CCG 488 Tyr Leu Asp Leu He Gly Phe Phe Lys Ser Arg Asp Phe Phe Glu Pro 135 140 145
ACA AAA GAC GAT TAT ATT GTC AAA CCG GAT GTC TTA AGG GCT ATA AAA 536 Thr Lys Asp Asp Tyr He Val Lys Pro Asp Val Leu Arg Ala He Lys 150 155 160
AAA TAC CAT AAA ATT GCT TTT AAA TCT GTA TAT TGG CAA CAG AAC AAA T 585 Lys Tyr His Lys He Ala Phe Lys Ser Val Tyr Trp Gin Gin Asn Lys 165 170 175
AAAAAGCGCT TGTTCTTTGT CTAAGAAACA CCCCCCTTTA AAAAAGGGGG 635
(2) INFORMATION FOR SEQ ID NO: 590: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 179 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590:
Met Glu Lys Leu Phe Glu Lys He Leu His Glu Met Arg Ser Arg Thr
1 5 10 15
Ser Phe Leu Leu Ala Phe Val Val Ser Leu He Val Phe He Phe Asn
20 25 30
Leu Lys Gly Val Phe Gin Leu He Phe Glu Ser He Phe Gin Tyr Thr
35 40 45
Gin Asn Lys He Leu Ser Phe Ser Leu Ser Phe Phe Phe He Phe Phe
50 55 60
Phe Phe Tyr Ala He Phe Leu He Phe Tyr Gin He Phe Leu Trp Tyr 65 70 75 80
Gly Ala Lys Lys Tyr Lys Gin Asn Gin Arg Asp Ser Glu He Val Tyr
85 90 95
Asn He Gin Lys Phe Pro Asn Glu He Lys Glu Glu Leu Tyr Arg Cys
100 105 110
Tyr Ser Lys Lys Gin Asn Lys He Leu Arg Thr Lys Lys Leu Asp Asp
115 120 125
Leu He Asp Tyr Leu Asp Leu He Gly Phe Phe Lys Ser Arg Asp Phe
130 135 140
Phe Glu Pro Thr Lys Asp Asp Tyr He Val Lys Pro Asp Val Leu Arg 145 150 155 160
Ala He Lys Lys Tyr His Lys He Ala Phe Lys Ser Val Tyr Trp Gin
165 170 175
Gin Asn Lys
(2) INFORMATION FOR SEQ ID NO: 591:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 292 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...239 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591: TTGCCCTTTT CAAAAATATA ATAGCTGTAT TTTAAAAGCT CTATCCATAG CCT GAG 56
Pro Glu
1
CTT GGT GAT TTC GCA AGA ATT GGG GTT AAT ATC CAC GCC AAA AAG GCA 104 Leu Gly Asp Phe Ala Arg He Gly Val Asn He His Ala Lys Lys Ala 5 10 15
GTT TTC AAT AAT GGA TTT TTT AAG ATT AAA AAG TTC TTT TTG GAT GTG 152 Val Phe Asn Asn Gly Phe Phe Lys He Lys Lys Phe Phe Leu Asp Val 20 25 30
GTG GTG GGG GTC GTT TTC GCT ATC TGG TTT TAT GTA GTT AAA GAT TTC 200 Val Val Gly Val Val Phe Ala He Trp Phe Tyr Val Val Lys Asp Phe 35 40 45 50
ACC CGT TGG CGT GTG GTG AAT GAT GAT TTC ATC GTT TTC TAATTTAAGA TC 251 Thr Arg Trp Arg Val Val Asn Asp Asp Phe He Val Phe 55 60
GTAGCGATAC AAGGAAGCAA TAAGTCCTAG CTCATAAGCA A 292
(2) INFORMATION FOR SEQ ID NO: 592:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 63 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592:
Pro Glu Leu Gly Asp Phe Ala Arg He Gly Val Asn He His Ala Lys
1 5 10 15
Lys Ala Val Phe Asn Asn Gly Phe Phe Lys He Lys Lys Phe Phe Leu
20 25 30
Asp Val Val Val Gly Val Val Phe Ala He Trp Phe Tyr Val Val Lys
35 40 45
Asp Phe Thr Arg Trp Arg Val Val Asn Asp Asp Phe He Val Phe 50 55 60
(2) INFORMATION FOR SEQ ID NO: 593:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...287 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593:
TAATTTAATA GTTTAGCTAT CATGGAGCAT TCTAAATTAA AGGCGATCAC ATG TTT 56
Met Phe 1
GAA AAA ATA CGC AAG ATT TTA GCG GAT ATT GAA GAT TCG CAA AAT GAA 104 Glu Lys He Arg Lys He Leu Ala Asp He Glu Asp Ser Gin Asn Glu 5 10 15
ATT GAA ATG CTT TTA AAA TTA GCG AAT TTG AGT TTG GGG GAT TTT ATT 152 He Glu Met Leu Leu Lys Leu Ala Asn Leu Ser Leu Gly Asp Phe He 20 25 30
GAG ATT AAA AGA GGG AGC ATG GAC ATG CCA AAG GGC GTG AAT GAA GCG 200 Glu He Lys Arg Gly Ser Met Asp Met Pro Lys Gly Val Asn Glu Ala 35 40 45 50
TTT TTT ACG CAA TTA AGC GAA GAA GTG GAG CGA TTG AAG GAG CTT ATT 248 Phe Phe Thr Gin Leu Ser Glu Glu Val Glu Arg Leu Lys Glu Leu He 55 60 65
AAC GCT TTG AAT AAA ATC AAA AAA GGG TTA TTG GTG TTT TAAATGTGTG GG 299 Asn Ala Leu Asn Lys He Lys Lys Gly Leu Leu Val Phe 70 75
ATTGTAGGTT ATATAGGGGA TAGCGAGAAA AAATCCGTTC T 340
(2) INFORMATION FOR SEQ ID NO: 594:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 79 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594:
Met Phe Glu Lys He Arg Lys He Leu Ala Asp He Glu Asp Ser Gin
1 5 10 15
Asn Glu He Glu Met Leu Leu Lys Leu Ala Asn Leu Ser Leu Gly Asp
20 25 30
Phe He Glu He Lys Arg Gly Ser Met Asp Met Pro Lys Gly Val Asn
35 40 45
Glu Ala Phe Phe Thr Gin Leu Ser Glu Glu Val Glu Arg Leu Lys Glu 50 55 60
Leu He Asn Ala Leu Asn Lys He Lys Lys Gly Leu Leu Val Phe 65 70 75
(2) INFORMATION FOR SEQ ID NO: 595:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3101 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 54...3050 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595:
GGGGGGCGTT GTTAATCAAT GAGCAAGAAA AGAAAATTGA AAATAAATAG GGA ATG 56
Met
1
ATC CAA TCC AGC CTT TAT AGA GCC TTA AAC AAA GGC TTT GAT TAC CAA 104 He Gin Ser Ser Leu Tyr Arg Ala Leu Asn Lys Gly Phe Asp Tyr Gin 5 10 15
ATA CTC GCT TGT AAG GAT TTT AAA GAA TCC GAG CTC GCT AAA GAA GTC 152 He Leu Ala Cys Lys Asp Phe Lys Glu Ser Glu Leu Ala Lys Glu Val 20 25 30
ATA AGC TAT TTT AAG CCA AAT ACC AAA GCC ATT CTT TTC CCG GAG TTT 200 He Ser Tyr Phe Lys Pro Asn Thr Lys Ala He Leu Phe Pro Glu Phe 35 40 45
AGG GCT AAA AAA AAC GAC GAT TTG CGT TCG TTT TTT GAA GAA TTT TTA 248 Arg Ala Lys Lys Asn Asp Asp Leu Arg Ser Phe Phe Glu Glu Phe Leu 50 55 60 65
CAG CTT TTA GGG GGT TTA AGG GAG TTT TAT CAA GCC TTA GAA AAC AAG 296 Gin Leu Leu Gly Gly Leu Arg Glu Phe Tyr Gin Ala Leu Glu Asn Lys 70 75 80
CAA GAA ACT ATC ATC ATT GCC CCG ATT AGC GCG TTA TTG CAC CCT TTA 344 Gin Glu Thr He He He Ala Pro He Ser Ala Leu Leu His Pro Leu 85 90 95
CCT AAA AAA GAA CTT TTA GAA AGC TTT AAA ATC ACT CTT TTA GAA AAA 392 Pro Lys Lys Glu Leu Leu Glu Ser Phe Lys He Thr Leu Leu Glu Lys 100 105 110 TAT AAC CTT AAG GAT TTG AAA GAC AAG CTC TTT TAT TAT GGC TAT GAA 440 Tyr Asn Leu Lys Asp Leu Lys Asp Lys Leu Phe Tyr Tyr Gly Tyr Glu 115 120 125
ATT TTA GAC TTA GTG GAA GTG GAA GGC GAA GCG AGC TTT AGG GGG GAT 488 He Leu Asp Leu Val Glu Val Glu Gly Glu Ala Ser Phe Arg Gly Asp 130 135 140 145
ATT GTG GAT ATT TAT GCG CCA AAT TCT AAA GCG TAT CGC TTG AGT TTT 536 He Val Asp He Tyr Ala Pro Asn Ser Lys Ala Tyr Arg Leu Ser Phe 150 155 160
TTT GAC ACC GAG TGT GAG AGC ATT AAG GAA TTT GAT CCC ATT ACT CAA 584 Phe Asp Thr Glu Cys Glu Ser He Lys Glu Phe Asp Pro He Thr Gin 165 170 175
ATG AGC CTT AAA GAA GAT TTG TTA GAA ATT GAA ATC CCC CCC ACG CTT 632 Met Ser Leu Lys Glu Asp Leu Leu Glu He Glu He Pro Pro Thr Leu 180 185 190
TTT AGT TTG GAC GAA TCA TCT TAT AAG GAT CTA AAA ACA AAA GTG GAA 680 Phe Ser Leu Asp Glu Ser Ser Tyr Lys Asp Leu Lys Thr Lys Val Glu 195 200 205
CAA AGC CCC TTA AAT AGC TTT TCT AAA GAT TTA ACC AGT TTT GGT TTG 728 Gin Ser Pro Leu Asn Ser Phe Ser Lys Asp Leu Thr Ser Phe Gly Leu 210 215 220 225
TGG TTT TTA GGA GAA AAA GCA CAA GAC TTA CTA ATC GTT TAT AAA AGC 776 Trp Phe Leu Gly Glu Lys Ala Gin Asp Leu Leu He Val Tyr Lys Ser 230 235 240
ATT ATA AGT CCT AGA GCT TTA GAA GAA ATT CAA GAA TTA GCG AGC TTA 824 He He Ser Pro Arg Ala Leu Glu Glu He Gin Glu Leu Ala Ser Leu 245 250 255
AAC GAA TTG GAT TGT GAG CGT TTC AAA TTT TTA AAG GTT TTA GAA AAC 872 Asn Glu Leu Asp Cys Glu Arg Phe Lys Phe Leu Lys Val Leu Glu Asn 260 265 270
GCG CAA GGC TAT GAA GAT TTA GAA ATC CAT GCG CAT GCC CTA GAA GGC 920 Ala Gin Gly Tyr Glu Asp Leu Glu He His Ala His Ala Leu Glu Gly 275 280 285
TTT ATC GCT TTG CAT TCA AAT CAT AAA ATC ACG CTC CTA GCC CCC AAT 968 Phe He Ala Leu His Ser Asn His Lys He Thr Leu Leu Ala Pro Asn 290 295 300 305
AAA ACG ATT TTA GAC AAC GCG ATA AGC GCG CTT GAT GCA GGC AAC ATG 1016 Lys Thr He Leu Asp Asn Ala He Ser Ala Leu Asp Ala Gly Asn Met 310 315 320
GAA TGC GTC ATC GCC CCC TTT GTG TTA AAC TTT AAA ACC CCT GAT GGG 1064 Glu Cys Val He Ala Pro Phe Val Leu Asn Phe Lys Thr Pro Asp Gly 325 330 335 ATT TTT ATT TCG CTC AAT TCT TTT GAA AGG AAG AAA AAA CGC CAA AAA 1112 He Phe He Ser Leu Asn Ser Phe Glu Arg Lys Lys Lys Arg Gin Lys 340 345 350
TCC AAG CTC GCT TTG AAT GAG TTG AAT CCG GGC GAA TGG GTG GTG CAT 1160 Ser Lys Leu Ala Leu Asn Glu Leu Asn Pro Gly Glu Trp Val Val His 355 360 365
GAT GAT TAT GGG GTG GGC GTG TTT TCT CAA TTA GTC CAG CAC AGC GTT 1208 Asp Asp Tyr Gly Val Gly Val Phe Ser Gin Leu Val Gin His Ser Val 370 375 380 385
TTA GGG AGC AAG AGG GAT TTT TTA GAA ATC GCT TAT TTG GGC GAA GAC 1256 Leu Gly Ser Lys Arg Asp Phe Leu Glu He Ala Tyr Leu Gly Glu Asp 390 395 400
AAA CTG CTG TTA CCG GTA GAA AAC TTG CAT CTC ATC GCT CGC TAT GTG 1304 Lys Leu Leu Leu Pro Val Glu Asn Leu His Leu He Ala Arg Tyr Val 405 410 415
GCG CAA AGC GAT AGC GTG CCA GCT AAA GAC CGG CTA GGG AAA GGG AGC 1352 Ala Gin Ser Asp Ser Val Pro Ala Lys Asp Arg Leu Gly Lys Gly Ser 420 425 430
TTT CTT AAA TTA AAA GCT AAA GTC AGG ACT AAG CTT TTA GAG ATT GCT 1400 Phe Leu Lys Leu Lys Ala Lys Val Arg Thr Lys Leu Leu Glu He Ala 435 440 445
AGC AAG ATC ATT GAA TTA GCG GCT GAA CGC AAT TTG ATC TTG GGT AAA 1448 Ser Lys He He Glu Leu Ala Ala Glu Arg Asn Leu He Leu Gly Lys 450 455 460 465
AAG ATG GAT GTG CAT TTA GCG GAG TTG GAA GTC TTT AAA TCG CAT GCG 1496 Lys Met Asp Val His Leu Ala Glu Leu Glu Val Phe Lys Ser His Ala 470 475 480
GGG TTT GAA TAC ACC AGC GAT CAA GAA AAG GCT ATC GCT GAA ATT TCA 1544 Gly Phe Glu Tyr Thr Ser Asp Gin Glu Lys Ala He Ala Glu He Ser 485 490 495
AAG GAT TTA AGC TCT CAC AGG GTG ATG GAT AGA TTA TTG AGT GGG GAT 1592 Lys Asp Leu Ser Ser His Arg Val Met Asp Arg Leu Leu Ser Gly Asp 500 505 510
GTG GGT TTT GGG AAA ACA GAA GTG GCG ATG CAT GCG ATT TTT TGC GCG 1640 Val Gly Phe Gly Lys Thr Glu Val Ala Met His Ala He Phe Cys Ala 515 520 525
TTT TTG AAC GGC TTT CAA AGC GCT TTA GTT GTG CCT ACC ACT TTA TTA 1688 Phe Leu Asn Gly Phe Gin Ser Ala Leu Val Val Pro Thr Thr Leu Leu 530 535 540 545
GCG CAC CAG CAT TTT GAG ACT TTA AGG GCG CGT TTT GAA AAT TTT GGC 1736 Ala His Gin His Phe Glu Thr Leu Arg Ala Arg Phe Glu Asn Phe Gly 550 555 560 GTT AAA GTG GCT CGT TTG GAC AGG TAT GCG AGC GAA AAA AAC AAG CTT 1784 Val Lys Val Ala Arg Leu Asp Arg Tyr Ala Ser Glu Lys Asn Lys Leu 565 570 575
TTA AAG GCG GTG GAA TTA GGG CAA GTT GAT GCG CTA ATA GGC ACG CAT 1832 Leu Lys Ala Val Glu Leu Gly Gin Val Asp Ala Leu He Gly Thr His 580 585 590
GCG ATT TTA GGC GCG AAA TTC AAA AAC CTG GGC TTG GTG GTG GTG GAT 1880 Ala He Leu Gly Ala Lys Phe Lys Asn Leu Gly Leu Val Val Val Asp 595 600 605
GAA GAG CAT AAA TTT GGC GTG AAA CAA AAA GAA GCT TTA AAA GAA TTG 1928 Glu Glu His Lys Phe Gly Val Lys Gin Lys Glu Ala Leu Lys Glu Leu 610 615 620 625
AGT AAG AGC GTG CAT TTT TTA AGC ATG TCC GCT ACG CCT ATC CCG CGC 1976 Ser Lys Ser Val His Phe Leu Ser Met Ser Ala Thr Pro He Pro Arg 630 635 640
ACT CTA AAC ATG GCG CTC TCT CAA ATT AAG GGC ATT AGT TCT TTA AAA 2024 Thr Leu Asn Met Ala Leu Ser Gin He Lys Gly He Ser Ser Leu Lys 645 650 655
ACC CCG CCC ACA GAC AGA AAG CCC AGC CGC ACT TTT TTG AAA GAA AAG 2072 Thr Pro Pro Thr Asp Arg Lys Pro Ser Arg Thr Phe Leu Lys Glu Lys 660 665 670
AAT GAC GAA CTC TTA AAA GAG ATT ATT TAC AGA GAA TTA CGC CGT AAC 2120 Asn Asp Glu Leu Leu Lys Glu He He Tyr Arg Glu Leu Arg Arg Asn 675 680 685
GGG CAA ATT TTT TAC ATC CAT AAC CAC ATC GCT AGC ATT TTA AAA GTC 2168 Gly Gin He Phe Tyr He His Asn His He Ala Ser He Leu Lys Val 690 695 700 705
AAA ACC AAG CTA GAA GAT TTA ATC CCT AAA CTC AAA ATC GCT ATT TTG 2216 Lys Thr Lys Leu Glu Asp Leu He Pro Lys Leu Lys He Ala He Leu 710 715 720
CAT TCC CAG ATT AAC GCT AAT GAG AGC GAA GAA ATC ATG CTA GAG TTT 2264 His Ser Gin He Asn Ala Asn Glu Ser Glu Glu He Met Leu Glu Phe 725 730 735
GCC AAG GGA AAT TAT CAG GTT TTA TTA TGC ACT TCT ATT GTG GAA TCA 2312 Ala Lys Gly Asn Tyr Gin Val Leu Leu Cys Thr Ser He Val Glu Ser 740 745 750
GGG ATT CAT TTG CCT AAC GCT AAC ACG ATC ATT ATA GAT AAT GCG CAA 2360 Gly He His Leu Pro Asn Ala Asn Thr He He He Asp Asn Ala Gin 755 760 765
AAT TTC GGG CTG GCT GAT TTG CAC CAA TTG AGA GGG CGT GTG GGG AGA 2408 Asn Phe Gly Leu Ala Asp Leu His Gin Leu Arg Gly Arg Val Gly Arg 770 775 780 785 GGT AAA AAA GAA GGC TTT TGT TAT TTC CTC ATA GAA GAT CAA AAA AGT 2456 Gly Lys Lys Glu Gly Phe Cys Tyr Phe Leu He Glu Asp Gin Lys Ser 790 795 800
TTG AAT GAA CAG GCT TTA AAA CGC TTG CTC GCT TTG GAA AAA AAT TCA 2504 Leu Asn Glu Gin Ala Leu Lys Arg Leu Leu Ala Leu Glu Lys Asn Ser 805 810 815
TAT TTA GGC AGC GGG GAG AGT GTC GCT TAT CAT GAT TTA GAA ATC AGG 2552 Tyr Leu Gly Ser Gly Glu Ser Val Ala Tyr His Asp Leu Glu He Arg 820 825 830
GGG GGC GGG AAT TTG CTC GGG CAA GAT CAG AGC GGG CAT ATT AAA AAC 2600 Gly Gly Gly Asn Leu Leu Gly Gin Asp Gin Ser Gly His He Lys Asn 835 840 845
ATT GGT TAT GCA CTC TAT ACG CGC ATG CTT GAA GAC GCG ATT TAT GAA 2648 He Gly Tyr Ala Leu Tyr Thr Arg Met Leu Glu Asp Ala He Tyr Glu 850 855 860 865
TTG AGT GGG GGG AAG AAA AGG CTT GAA AAG AGC GTA GAA ATC CAA CTT 2696 Leu Ser Gly Gly Lys Lys Arg Leu Glu Lys Ser Val Glu He Gin Leu 870 875 880
GGC GTG AGC GCT TTT TTA AAC CCT GAA CTC ATT GCA AGC GAT AGT TTG 2744 Gly Val Ser Ala Phe Leu Asn Pro Glu Leu He Ala Ser Asp Ser Leu 885 890 895
AGA TTG GAT TTA TAC CGC CGT TTG AGT TTG TGT GAA AAT ACA GAT GAG 2792 Arg Leu Asp Leu Tyr Arg Arg Leu Ser Leu Cys Glu Asn Thr Asp Glu 900 905 910
GTG GGG CAA ATC CAT GAA GAA ATA GAA GAC AGG TTT GGC AAA ATA GAC 2840 Val Gly Gin He His Glu Glu He Glu Asp Arg Phe Gly Lys He Asp 915 920 925
GAT TTG AGC GCT CAA TTT TTG CAA ATC ATT ACG CTT AAA ATT CTA GCC 2888 Asp Leu Ser Ala Gin Phe Leu Gin He He Thr Leu Lys He Leu Ala 930 935 940 945
AAC CAG CTT GGC ATC ATC AAA CTT TCT AAT TTC AAT CAA AAC ATC ACC 2936 Asn Gin Leu Gly He He Lys Leu Ser Asn Phe Asn Gin Asn He Thr 950 955 960
ATC ACT TAT AGC GAT GAA AAG AAA GAA AGC CTG AAA GCC CCA AGC AAA 2984 He Thr Tyr Ser Asp Glu Lys Lys Glu Ser Leu Lys Ala Pro Ser Lys 965 970 975
GAC GAT AAC GAT ATT TTA GAA ACC CTT TTG AAA CAT TTG CGC GCT CAA 3032 Asp Asp Asn Asp He Leu Glu Thr Leu Leu Lys His Leu Arg Ala Gin 980 985 990
ATT TCT TTA AAG CGG CGT TAAAAGCGTT TGATTTTAGC GTTAATTTTG TTATTTTA 3088 He Ser Leu Lys Arg Arg 995 1 AAAAGATTAT TAA 3101
(2) INFORMATION FOR SEQ ID NO: 596:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 999 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596:
Met He Gin Ser Ser Leu Tyr Arg Ala Leu Asn Lys Gly Phe Asp Tyr
1 5 10 15
Gin He Leu Ala Cys Lys Asp Phe Lys Glu Ser Glu Leu Ala Lys Glu
20 25 30
Val He Ser Tyr Phe Lys Pro Asn Thr Lys Ala He Leu Phe Pro Glu
35 40 45
Phe Arg Ala Lys Lys Asn Asp Asp Leu Arg Ser Phe Phe Glu Glu Phe
50 55 60
Leu Gin Leu Leu Gly Gly Leu Arg Glu Phe Tyr Gin Ala Leu Glu Asn 65 70 75 80
Lys Gin Glu Thr He He He Ala Pro He Ser Ala Leu Leu His Pro
85 90 95
Leu Pro Lys Lys Glu Leu Leu Glu Ser Phe Lys He Thr Leu Leu Glu
100 105 110
Lys Tyr Asn Leu Lys Asp Leu Lys Asp Lys Leu Phe Tyr Tyr Gly Tyr
115 120 125
Glu He Leu Asp Leu Val Glu Val Glu Gly Glu Ala Ser Phe Arg Gly
130 135 140
Asp He Val Asp He Tyr Ala Pro Asn Ser Lys Ala Tyr Arg Leu Ser 145 150 155 160
Phe Phe Asp Thr Glu Cys Glu Ser He Lys Glu Phe Asp Pro He Thr
165 170 175
Gin Met Ser Leu Lys Glu Asp Leu Leu Glu He Glu He Pro Pro Thr
180 185 190
Leu Phe Ser Leu Asp Glu Ser Ser Tyr Lys Asp Leu Lys Thr Lys Val
195 200 205
Glu Gin Ser Pro Leu Asn Ser Phe Ser Lys Asp Leu Thr Ser Phe Gly
210 215 220
Leu Trp Phe Leu Gly Glu Lys Ala Gin Asp Leu Leu He Val Tyr Lys 225 230 235 240
Ser He He Ser Pro Arg Ala Leu Glu Glu He Gin Glu Leu Ala Ser
245 250 255
Leu Asn Glu Leu Asp Cys Glu Arg Phe Lys Phe Leu Lys Val Leu Glu
260 265 270
Asn Ala Gin Gly Tyr Glu Asp Leu Glu He His Ala His Ala Leu Glu
275 280 285
Gly Phe He Ala Leu His Ser Asn His Lys He Thr Leu Leu Ala Pro
290 295 300
Asn Lys Thr He Leu Asp Asn Ala He Ser Ala Leu Asp Ala Gly Asn 305 310 315 320 Met Glu Cys Val He Ala Pro Phe Val Leu Asn Phe Lys Thr Pro Asp
325 330 335
Gly He Phe He Ser Leu Asn Ser Phe Glu Arg Lys Lys Lys Arg Gin
340 345 350
Lys Ser Lys Leu Ala Leu Asn Glu Leu Asn Pro Gly Glu Trp Val Val
355 360 365
His Asp Asp Tyr Gly Val Gly Val Phe Ser Gin Leu Val Gin His Ser
370 375 380
Val Leu Gly Ser Lys Arg Asp Phe Leu Glu He Ala Tyr Leu Gly Glu 385 390 395 400
Asp Lys Leu Leu Leu Pro Val Glu Asn Leu His Leu He Ala Arg Tyr
405 410 415
Val Ala Gin Ser Asp Ser Val Pro Ala Lys Asp Arg Leu Gly Lys Gly
420 425 430
Ser Phe Leu Lys Leu Lys Ala Lys Val Arg Thr Lys Leu Leu Glu He
435 440 445
Ala Ser Lys He He Glu Leu Ala Ala Glu Arg Asn Leu He Leu Gly
450 455 460
Lys Lys Met Asp Val His Leu Ala Glu Leu Glu Val Phe Lys Ser His 465 470 475 480
Ala Gly Phe Glu Tyr Thr Ser Asp Gin Glu Lys Ala He Ala Glu He
485 490 495
Ser Lys Asp Leu Ser Ser His Arg Val Met Asp Arg Leu Leu Ser Gly
500 505 510
Asp Val Gly Phe Gly Lys Thr Glu Val Ala Met His Ala He Phe Cys
515 520 525
Ala Phe Leu Asn Gly Phe Gin Ser Ala Leu Val Val Pro Thr Thr Leu
530 535 540
Leu Ala His Gin His Phe Glu Thr Leu Arg Ala Arg Phe Glu Asn Phe 545 550 555 560
Gly Val Lys Val Ala Arg Leu Asp Arg Tyr Ala Ser Glu Lys Asn Lys
565 570 575
Leu Leu Lys Ala Val Glu Leu Gly Gin Val Asp Ala Leu He Gly Thr
580 585 590
His Ala He Leu Gly Ala Lys Phe Lys Asn Leu Gly Leu Val Val Val
595 600 605
Asp Glu Glu His Lys Phe Gly Val Lys Gin Lys Glu Ala Leu Lys Glu
610 615 620
Leu Ser Lys Ser Val His Phe Leu Ser Met Ser Ala Thr Pro He Pro 625 630 635 640
Arg Thr Leu Asn Met Ala Leu Ser Gin He Lys Gly He Ser Ser Leu
645 650 655
Lys Thr Pro Pro Thr Asp Arg Lys Pro Ser Arg Thr Phe Leu Lys Glu
660 665 670
Lys Asn Asp Glu Leu Leu Lys Glu He He Tyr Arg Glu Leu Arg Arg
675 680 685
Asn Gly Gin He Phe Tyr He His Asn His He Ala Ser He Leu Lys
690 695 700
Val Lys Thr Lys Leu Glu Asp Leu He Pro Lys Leu Lys He Ala He 705 710 715 720
Leu His Ser Gin He Asn Ala Asn Glu Ser Glu Glu He Met Leu Glu
725 730 735
Phe Ala Lys Gly Asn Tyr Gin Val Leu Leu Cys Thr Ser He Val Glu
740 745 750
Ser Gly He His Leu Pro Asn Ala Asn Thr He He He Asp Asn Ala 755 760 765
Gin Asn Phe Gly Leu Ala Asp Leu His Gin Leu Arg Gly Arg Val Gly
770 775 780
Arg Gly Lys Lys Glu Gly Phe Cys Tyr Phe Leu He Glu Asp Gin Lys 785 790 795 800
Ser Leu Asn Glu Gin Ala Leu Lys Arg Leu Leu Ala Leu Glu Lys Asn
805 810 815
Ser Tyr Leu Gly Ser Gly Glu Ser Val Ala Tyr His Asp Leu Glu He
820 825 830
Arg Gly Gly Gly Asn Leu Leu Gly Gin Asp Gin Ser Gly His He Lys
835 840 845
Asn He Gly Tyr Ala Leu Tyr Thr Arg Met Leu Glu Asp Ala He Tyr
850 855 860
Glu Leu Ser Gly Gly Lys Lys Arg Leu Glu Lys Ser Val Glu He Gin 865 870 875 880
Leu Gly Val Ser Ala Phe Leu Asn Pro Glu Leu He Ala Ser Asp Ser
885 890 895
Leu Arg Leu Asp Leu Tyr Arg Arg Leu Ser Leu Cys Glu Asn Thr Asp
900 905 910
Glu Val Gly Gin He His Glu Glu He Glu Asp Arg Phe Gly Lys He
915 920 925
Asp Asp Leu Ser Ala Gin Phe Leu Gin He He Thr Leu Lys He Leu
930 935 940
Ala Asn Gin Leu Gly He He Lys Leu Ser Asn Phe Asn Gin Asn He 945 950 955 960
Thr He Thr Tyr Ser Asp Glu Lys Lys Glu Ser Leu Lys Ala Pro Ser
965 970 975
Lys Asp Asp Asn Asp He Leu Glu Thr Leu Leu Lys His Leu Arg Ala
980 985 990
Gin He Ser Leu Lys Arg Arg 995 1
(2) INFORMATION FOR SEQ ID NO: 597:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1217 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 46...1161 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597:
AGGCTATGAT ATAGTTTATT TTCACAAAAT CTCCAAGGAA AATTG ATG CAA CAT GAA 57
Met Gin His Glu 1 ATC CCT ATT GCC TTT GCC TTT GAT AAA AAC TAC CTA AAA ACA GGG GCT 105 He Pro He Ala Phe Ala Phe Asp Lys Asn Tyr Leu Lys Thr Gly Ala 5 10 15 20
GTG GCT CTC TAC TCT TTA TTG CAT GCC CAT CGT GCA GTT GAA GGG GTA 153 Val Ala Leu Tyr Ser Leu Leu His Ala His Arg Ala Val Glu Gly Val 25 30 35
TTT TTC AGT ATC TAT ATA TTC TAT AGC GGT TTG AAT GAA GAT GAT TTA 201 Phe Phe Ser He Tyr He Phe Tyr Ser Gly Leu Asn Glu Asp Asp Leu 40 45 50
AAC AGG CTC CAA GAA ACT ATC AAA CCT TTC AAA CAT TTT GCC GCT TTA 249 Asn Arg Leu Gin Glu Thr He Lys Pro Phe Lys His Phe Ala Ala Leu 55 60 65
AAA TGC CAA GAT ATT AGC GCC ACT CTT GAT TCT TTG CCC ACC ATC ACG 297 Lys Cys Gin Asp He Ser Ala Thr Leu Asp Ser Leu Pro Thr He Thr 70 75 80
GAT AGT GCA TGG GTT AAT CGC TAT TCT AGA ATG ATT TTG GTC AAA TAC 345 Asp Ser Ala Trp Val Asn Arg Tyr Ser Arg Met He Leu Val Lys Tyr 85 90 95 100
CTT CTC CCT AGT TTA TTC CCC CAA TAC AGC AAA ATG ATT TGG TCT GAT 393 Leu Leu Pro Ser Leu Phe Pro Gin Tyr Ser Lys Met He Trp Ser Asp 105 110 115
GTG GAT GTG GTC TTT TGC AGA GCT TTC GCT GAT GAT TTT ATC GCT TTA 441 Val Asp Val Val Phe Cys Arg Ala Phe Ala Asp Asp Phe He Ala Leu 120 125 130
GAC ACA AGC GAA TCT TTT CAT TTG AGT GGT GTG ATA AGT TTA GTA TCA 489 Asp Thr Ser Glu Ser Phe His Leu Ser Gly Val He Ser Leu Val Ser 135 140 145
CAA TCA GTT ACA GAG GGG TTT TGG TTT TGC AAT TTG GAT TAC ATG CGA 537 Gin Ser Val Thr Glu Gly Phe Trp Phe Cys Asn Leu Asp Tyr Met Arg 150 155 160
AAG CAC TCT TTC ACC CAA CAG GTC TTA GAA AAA TTT AAA ATT CAA GTA 585 Lys His Ser Phe Thr Gin Gin Val Leu Glu Lys Phe Lys He Gin Val 165 170 175 180
ATG CGT CCA TAT TTT AAA GAA CCT ACA TTA ATA CAC CAT TTG CAT GCT 633 Met Arg Pro Tyr Phe Lys Glu Pro Thr Leu He His His Leu His Ala 185 190 195
TAT ATT AAA GAA CTT CCC TTA CAC TAT TGC GTT CTG CCT TAT TAT TAT 681 Tyr He Lys Glu Leu Pro Leu His Tyr Cys Val Leu Pro Tyr Tyr Tyr 200 205 210
CAA GAA GAA CTT GAT GAT TTG AGA CAT AAA GCT TCC TTA CCC ATT CGG 729 Gin Glu Glu Leu Asp Asp Leu Arg His Lys Ala Ser Leu Pro He Arg 215 220 225 TTT GAA ATC ATC CAC CAA GAC AAA CCC AAT GAA TTT ATC CAT CGC CAG 777 Phe Glu He He His Gin Asp Lys Pro Asn Glu Phe He His Arg Gin 230 235 240
CAA ATC CCC TAT GAG ATC TCT CAA ATT CAA AAC ATT CTT TCA AAC CCT 825 Gin He Pro Tyr Glu He Ser Gin He Gin Asn He Leu Ser Asn Pro 245 250 255 260
ATT ATC ATG CAC TAT GAA TCT GAT AAA GAT GCT CTT GGA ATC TAC AAT 873 He He Met His Tyr Glu Ser Asp Lys Asp Ala Leu Gly He Tyr Asn 265 270 275
GGC AAA CCT TGG GAG TTC CCT TTG GGG AAT CAA TAC CAC CTG TGG TTA 921 Gly Lys Pro Trp Glu Phe Pro Leu Gly Asn Gin Tyr His Leu Trp Leu 280 285 290
GAG ATG CTT GCA CAC ACT CCA TTT TGG AAA GAC TTC ACT CTG GAA ATG 969 Glu Met Leu Ala His Thr Pro Phe Trp Lys Asp Phe Thr Leu Glu Met 295 300 305
CAA AAA AAA CGC ATA GAA TAC CGA GAT ATT GCT CAA AAA ATC CAT TAT 1017 Gin Lys Lys Arg He Glu Tyr Arg Asp He Ala Gin Lys He His Tyr 310 315 320
TTT TCT CAA GAT AAG CGT CTT TAT GAA GTG AGC ATA CGC TCC ATT AAG 1065 Phe Ser Gin Asp Lys Arg Leu Tyr Glu Val Ser He Arg Ser He Lys 325 330 335 340
GTT TTT GCA TCT CAT TAC TAT AAT TTA GTG GTT AAA GAA CGA TGG TCT 1113 Val Phe Ala Ser His Tyr Tyr Asn Leu Val Val Lys Glu Arg Trp Ser 345 350 355
AAA CCA ATA AAA ACT TTC TTT CAA AAA AAT TTT TTT CAA AAA AAG TTC T 1162 Lys Pro He Lys Thr Phe Phe Gin Lys Asn Phe Phe Gin Lys Lys Phe 360 365 370
AATTTGTTGG AATCACAAGG CGAAACCAAA TCCCTAATAC TTATGCTTTC TCAGG 1217
(2) INFORMATION FOR SEQ ID NO: 598:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 372 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 598:
Met Gin His Glu He Pro He Ala Phe Ala Phe Asp Lys Asn Tyr Leu
1 5 10 15
Lys Thr Gly Ala Val Ala Leu Tyr Ser Leu Leu His Ala His Arg Ala 20 25 30 Val Glu Gly Val Phe Phe Ser He Tyr He Phe Tyr Ser Gly Leu Asn
35 40 45
Glu Asp Asp Leu Asn Arg Leu Gin Glu Thr He Lys Pro Phe Lys His
50 55 60
Phe Ala Ala Leu Lys Cys Gin Asp He Ser Ala Thr Leu Asp Ser Leu 65 70 75 80
Pro Thr He Thr Asp Ser Ala Trp Val Asn Arg Tyr Ser Arg Met He
85 90 95
Leu Val Lys Tyr Leu Leu Pro Ser Leu Phe Pro Gin Tyr Ser Lys Met
100 105 110
He Trp Ser Asp Val Asp Val Val Phe Cys Arg Ala Phe Ala Asp Asp
115 120 125
Phe He Ala Leu Asp Thr Ser Glu Ser Phe His Leu Ser Gly Val He
130 135 140
Ser Leu Val Ser Gin Ser Val Thr Glu Gly Phe Trp Phe Cys Asn Leu 145 150 155 160
Asp Tyr Met Arg Lys His Ser Phe Thr Gin Gin Val Leu Glu Lys Phe
165 170 175
Lys He Gin Val Met Arg Pro Tyr Phe Lys Glu Pro Thr Leu He His
180 185 190
His Leu His Ala Tyr He Lys Glu Leu Pro Leu His Tyr Cys Val Leu
195 200 205
Pro Tyr Tyr Tyr Gin Glu Glu Leu Asp Asp Leu Arg H s Lys Ala Ser
210 215 220
Leu Pro He Arg Phe Glu He He His Gin Asp Lys Pro Asn Glu Phe 225 230 235 240
He His Arg Gin Gin He Pro Tyr Glu He Ser Gin He Gin Asn He
245 250 255
Leu Ser Asn Pro He He Met His Tyr Glu Ser Asp Lys Asp Ala Leu
260 265 270
Gly He Tyr Asn Gly Lys Pro Trp Glu Phe Pro Leu Gly Asn Gin Tyr
275 280 285
His Leu Trp Leu Glu Met Leu Ala His Thr Pro Phe Trp Lys Asp Phe
290 295 300
Thr Leu Glu Met Gin Lys Lys Arg He Glu Tyr Arg Asp He Ala Gin 305 310 315 320
Lys He His Tyr Phe Ser Gin Asp Lys Arg Leu Tyr Glu Val Ser He
325 330 335
Arg Ser He Lys Val Phe Ala Ser His Tyr Tyr Asn Leu Val Val Lys
340 345 350
Glu Arg Trp Ser Lys Pro He Lys Thr Phe Phe Gin Lys Asn Phe Phe
355 360 365
Gin Lys Lys Phe 370
(2) INFORMATION FOR SEQ ID NO -599:
(l) SEQUENCE CHARACTERISTICS
(A) LENGTH: 780 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 250...729 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:599:
TTTAGCTATG ATAAGCGTTT TAAAAACAAA CGAATTTTAA TCAAAATGAG ATTTAAGGGT 60
TAAAGAGTGA AAGCGTTTTT AGGAGCGTTA GAGTTTCAAG AGAATGAATA TGAAGAGCTT 120
AAAGAGCTTT ATGAGAGCTT AAAAACCAAG CAAAAGCCCC ACACTTTGTT CATTTCTTGT 180
GTGGATTCAC GAGTCGTGCC TAATTTAATC ACTGGCACCA AACCGGGCGA ATTGTATGTG 240
ATTTGCAAC ATG GGC AAT GTG AAC CCC CCT AAA ACA AGC TAT AAA GAG TCC 291
Met Gly Asn Val Asn Pro Pro Lys Thr Ser Tyr Lys Glu Ser 1 5 10
CTT TCT ACC ATT GCG AGC ATT GAA TAC GCT ATC GCG CAT GTG GGC GTT 339 Leu Ser Thr He Ala Ser He Glu Tyr Ala He Ala His Val Gly Val 15 20 25 30
CAA AAC TTA ATC ATT TGC GGG CAT AGC GAT TGT GGG GCT TGC GGG AGC 387 Gin Asn Leu He He Cys Gly His Ser Asp Cys Gly Ala Cys Gly Ser 35 40 45
GTT CAT TTA ATC CAT GAT GAA ACC ACC AAA GCT AAA ACC CCT TAC ATT 435 Val His Leu He His Asp Glu Thr Thr Lys Ala Lys Thr Pro Tyr He 50 55 60
GCA AAC TGG ATA CAA TTT TTA GAG CCT GTT AAA GAA GAG TTA AAA AAC 483 Ala Asn Trp He Gin Phe Leu Glu Pro Val Lys Glu Glu Leu Lys Asn 65 70 75
CAC CCG CAA TTC AGC AAC CAT TTC GCC AAG CGT TCA TGG CTT ACA GAG 531 His Pro Gin Phe Ser Asn His Phe Ala Lys Arg Ser Trp Leu Thr Glu 80 85 90
CGT TTG AAT GCG CGC TTG CAA CTC AAC AAC CTC TTA AGC TAT GAT TTC 579 Arg Leu Asn Ala Arg Leu Gin Leu Asn Asn Leu Leu Ser Tyr Asp Phe 95 100 105 110
ATT CAA GAG AAA GCG AGC AAG AAT GAA TTA AAA ATT TTT GGT TGG CAC 627 He Gin Glu Lys Ala Ser Lys Asn Glu Leu Lys He Phe Gly Trp His 115 120 125
TAC ATC ATA GAA ACA GGC AGG ATT TAT AAT TAT AAT TTT GAA AGC CAT 675 Tyr He He Glu Thr Gly Arg He Tyr Asn Tyr Asn Phe Glu Ser His 130 135 140
TTT TTT GAG CCG ATT GGA GAA ACC ATT AAA CAA AGG AAA AGT CAT GAA 723 Phe Phe Glu Pro He Gly Glu Thr He Lys Gin Arg Lys Ser His Glu 145 150 155
AAC TTC TAAAACAAAA ACCCCTAAAT CCGTTTTAAT CGCTGGGCCA TGCGTCATTG A 780 Asn Phe 160
(2) INFORMATION FOR SEQ ID NO: 600.
(l) SEQUENCE CHARACTERISTICS-
(A) LENGTH: 160 ammo acids
Figure imgf000926_0001
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600:
Met Gly Asn Val Asn Pro Pro Lys Thr Ser Tyr Lys Glu Ser Leu Ser
1 5 10 15
Thr He Ala Ser He Glu Tyr Ala He Ala His Val Gly Val Gin Asn
20 25 30
Leu He He Cys Gly His Ser Asp Cys Gly Ala Cys Gly Ser Val His
35 40 45
Leu He His Asp Glu Thr Thr Lys Ala Lys Thr Pro Tyr He Ala Asn
50 55 60
Trp He Gin Phe Leu Glu Pro Val Lys Glu Glu Leu Lys Asn His Pro 65 70 75 80
Gin Phe Ser Asn His Phe Ala Lys Arg Ser Trp Leu Thr Glu Arg Leu
85 90 95
Asn Ala Arg Leu Gin Leu Asn Asn Leu Leu Ser Tyr Asp Phe He Gin
100 105 110
Glu Lys Ala Ser Lys Asn Glu Leu Lys He Phe Gly Trp His Tyr He
115 120 125
He Glu Thr Gly Arg He Tyr Asn Tyr Asn Phe Glu Ser His Phe Phe
130 135 140
Glu Pro He Gly Glu Thr He Lys Gin Arg Lys Ser His Glu Asn Phe 145 150 155 160
(2) INFORMATION FOR SEQ ID NO: 601.
(l) SEQUENCE CHARACTERISTICS.
(A) LENGTH: 450 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(n) MOLECULE TYPE- Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 107...385 (D) OTHER INFORMATION. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601:
TAAAGATAAA AACCACCCCT TTTGACCCCC TTTTAGAGGA TTTTTATGGT TTTTTTGAGA 60 TAATAAAGAC TTATAAAGTT AATTAAAATT AAACAGAAGG GTTTTG ATG TCC GCT 115
Met Ser Ala
1
CAT TTT TTA AAA ATC GTT TTT TTA GTA GGC ATG TGC GTT TCA AGT TTG 163 His Phe Leu Lys He Val Phe Leu Val Gly Met Cys Val Ser Ser Leu 5 10 15
TTC GCT GAA GGT TTA GAG GGG TTT TTT AAC GCC CTA GAA GCC CAG CTC 211 Phe Ala Glu Gly Leu Glu Gly Phe Phe Asn Ala Leu Glu Ala Gin Leu 20 25 30 35
AAA AGC CCC ATC GCT AAG GGG ATT TTA ATG GTG ATT TTC ATA GGG ATC 259 Lys Ser Pro He Ala Lys Gly He Leu Met Val He Phe He Gly He 40 45 50
GCT ATT TAT GTG TGG AGG AAT TTG GAC CGG TGG AAA GAG ATC TTA TTC 307 Ala He Tyr Val Trp Arg Asn Leu Asp Arg Trp Lys Glu He Leu Phe 55 60 65
ACG ATC CTT GGC GTG GTG TTT GGG ATT TTT TTA TTC TTT AAA GCT CCG 355 Thr He Leu Gly Val Val Phe Gly He Phe Leu Phe Phe Lys Ala Pro 70 75 80
AGT TTA GCG AAT TGG TTT ATG GGA ATT TTT TAATGATTAT CCTGTCAGCG AGC 408 Ser Leu Ala Asn Trp Phe Met Gly He Phe 85 90
GTGAAGAATT TGCGTGAAAT TTCGGTTAAA GAAAAATTTT TA 450
(2) INFORMATION FOR SEQ ID NO: 602:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 93 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602:
Met Ser Ala His Phe Leu Lys He Val Phe Leu Val Gly Met Cys Val
1 5 10 15
Ser Ser Leu Phe Ala Glu Gly Leu Glu Gly Phe Phe Asn Ala Leu Glu
20 25 30
Ala Gin Leu Lys Ser Pro He Ala Lys Gly He Leu Met Val He Phe
35 40 45
He Gly He Ala He Tyr Val Trp Arg Asn Leu Asp Arg Trp Lys Glu
50 55 60
He Leu Phe Thr He Leu Gly Val Val Phe Gly He Phe Leu Phe Phe 65 70 75 80
Lys Ala Pro Ser Leu Ala Asn Trp Phe Met Gly He Phe 85 90
(2) INFORMATION FOR SEQ ID NO: 603:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1350 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 77...1291 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603:
TGAAGGCCGA TGATTTCATT TCCAAGTCTA ACCCCAAAGA CATCCAGCGA GTGGTTAAGC 60 AATTTTTGGA ATTAGC ATG AAA AAA TAC AGC ACT ATC CCC ACC CCT TGC TAC 112
Met Lys Lys Tyr Ser Thr He Pro Thr Pro Cys Tyr 1 5 10
GTG TTA GAG AGC GAA CGC TTA GAA AAA AAC GCC AAG ATT TTA GAA ATC 160 Val Leu Glu Ser Glu Arg Leu Glu Lys Asn Ala Lys He Leu Glu He 15 20 25
GTG CGC CAA CAA AGT GGG GCA AAG GTC TTG CTT GCT TTA AAG GGG TAT 208 Val Arg Gin Gin Ser Gly Ala Lys Val Leu Leu Ala Leu Lys Gly Tyr 30 35 40
GCG TTT TGG CGT GAG TTT GGG ATT TTG AGG CAA AAA TTG AAC GGG TGT 256 Ala Phe Trp Arg Glu Phe Gly He Leu Arg Gin Lys Leu Asn Gly Cys 45 50 55 60
TGC GCG AGC GGT CTT TAT GAG GCT AAG CTC GCT TTT GAA GAA TTT GGG 304 Cys Ala Ser Gly Leu Tyr Glu Ala Lys Leu Ala Phe Glu Glu Phe Gly 65 70 75
GGG CGA GAG AGC CAC AAA GAA ATT TGC GTT TAT AGC CCG GCT TTC AAA 352 Gly Arg Glu Ser His Lys Glu He Cys Val Tyr Ser Pro Ala Phe Lys 80 85 90
GAG GCT GAA ATG AGC GCG ATT TTA CCC CTA GCG ACA AGC ATT ATT TTT 400 Glu Ala Glu Met Ser Ala He Leu Pro Leu Ala Thr Ser He He Phe 95 100 105
AAC TCT TTT TAC CAA TAC GCT ACC TAT AAA GAC AGG ATT TTA GAT AAA 448 Asn Ser Phe Tyr Gin Tyr Ala Thr Tyr Lys Asp Arg He Leu Asp Lys 110 115 120 AAC AAG CAA TTA GAA AAC TTG GGC TTA AGC CCC ATT AAA ATG GGT TTG 496 Asn Lys Gin Leu Glu Asn Leu Gly Leu Ser Pro He Lys Met Gly Leu 125 130 135 140
AGG ATA AAC CCT CTC TAT AGC GAA GTA ACC CCA GCG ATC TAT AAC CCA 544 Arg He Asn Pro Leu Tyr Ser Glu Val Thr Pro Ala He Tyr Asn Pro 145 150 155
TGC TCT AAA GTG AGC CGG TTA GGG ATT ACG CCT AGC GGA TTT GAA AAG 592 Cys Ser Lys Val Ser Arg Leu Gly He Thr Pro Ser Gly Phe Glu Lys 160 165 170
GGG GTG AAA GAG CAT GGC TTA GAG GGG GTG AGC GGG TTG CAT TTC CAT 640 Gly Val Lys Glu His Gly Leu Glu Gly Val Ser Gly Leu His Phe His 175 180 185
ACG CAT TGC GAG CAA AAC GCT GAC GCT TTG TGC CGG ACT TTA GAG CAT 688 Thr His Cys Glu Gin Asn Ala Asp Ala Leu Cys Arg Thr Leu Glu His 190 195 200
GTA GAA AAG CAT TTC AGG CCC TAT TTA GAA AAC ATG GCG TGG GTG AAT 736 Val Glu Lys His Phe Arg Pro Tyr Leu Glu Asn Met Ala Trp Val Asn 205 210 215 220
TTT GGT GGG GGG CAT CAT ATC ACT AAG AGC GAT TAT GAT GTG AAT TTG 784 Phe Gly Gly Gly His His He Thr Lys Ser Asp Tyr Asp Val Asn Leu 225 230 235
CTC ATC CAA ACG ATT AAG GAT TTC AAA GAA CGC TAT CAT AAT ATA GAA 832 Leu He Gin Thr He Lys Asp Phe Lys Glu Arg Tyr His Asn He Glu 240 245 250
GTG ATT TTA GAG CCT GGG GAA GCC ATA GGG TGG CAA TGC GGG TTT TTA 880 Val He Leu Glu Pro Gly Glu Ala He Gly Trp Gin Cys Gly Phe Leu 255 260 265
ATC GCA AGC GTG ATA GAC ATC GTT CAA AAC GAT CAA GAA ATT GCG ATT 928 He Ala Ser Val He Asp He Val Gin Asn Asp Gin Glu He Ala He 270 275 280
CTA GAC GCT TCT TTT AGC GCT CAC ATG CCC GAT TGC TTA GAA ATG CCT 976 Leu Asp Ala Ser Phe Ser Ala His Met Pro Asp Cys Leu Glu Met Pro 285 290 295 300
TAT CGC CCT AGC ATT TTT AAA GTC TCC GTA GAA AAT GAT GAA GAG CTT 1024 Tyr Arg Pro Ser He Phe Lys Val Ser Val Glu Asn Asp Glu Glu Leu 305 310 315
ATT GAA GTT GAA AAG GGC GAA AAT CAA GGG GCG TTT TCT TAT TTT TTA 1072 He Glu Val Glu Lys Gly Glu Asn Gin Gly Ala Phe Ser Tyr Phe Leu 320 325 330
GGC GGC CCT ACT TGT TTA GCG GGG GAT TTT ATG GGG AGT TTT AGC TTT 1120 Gly Gly Pro Thr Cys Leu Ala Gly Asp Phe Met Gly Ser Phe Ser Phe 335 340 345 GAA ACG CCT TTA AAA AGG GGC GAT AAA ATC GTG TTT CAA GAC ATG CTC 1168 Glu Thr Pro Leu Lys Arg Gly Asp Lys He Val Phe Gin Asp Met Leu 350 355 360
CAT TAT ACG ATT GTC AAA AAC AAC TCG TTT AAT GGC GTG CCG CTC CCA 1216 His Tyr Thr He Val Lys Asn Asn Ser Phe Asn Gly Val Pro Leu Pro 365 370 375 380
AGC CTG GCT AGA TTG GAT CAA CAA GGG TTT AAA ATC CTT AAA AAC TTT 1264 Ser Leu Ala Arg Leu Asp Gin Gin Gly Phe Lys He Leu Lys Asn Phe 385 390 395
TCT TAT GAA GAC TAT AAA AAC AGA AAC TAAAGCTTTT GATTAAGGCT TTTTGGG 1318 Ser Tyr Glu Asp Tyr Lys Asn Arg Asn 400 405
GCTTGTAAAA AGNTTACGCA CAACATTCCA AC 1350
(2) INFORMATION FOR SEQ ID NO: 604:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 405 ammo acids
(B) TYPE: ammo acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(n) MOLECULE TYPE- protein (v) FRAGMENT TYPE- internal
(xi) SEQUENCE DESCRIPTION. SEQ ID NO: 604
Met Lys Lys Tyr Ser Thr He Pro Thr Pro Cys Tyr Val Leu Glu Ser
1 5 10 15
Glu Arg Leu Glu Lys Asn Ala Lys He Leu Glu He Val Arg Gin Gin
20 25 30
Ser Gly Ala Lys Val Leu Leu Ala Leu Lys Gly Tyr Ala Phe Trp Arg
35 40 45
Glu Phe Gly He Leu Arg Gin Lys Leu Asn Gly Cys Cys Ala Ser Gly
50 55 60
Leu Tyr Glu Ala Lys Leu Ala Phe Glu Glu Phe Gly Gly Arg Glu Ser 65 70 75 80
His Lys Glu He Cys Val Tyr Ser Pro Ala Phe Lys Glu Ala Glu Met
85 90 95
Ser Ala He Leu Pro Leu Ala Thr Ser He He Phe Asn Ser Phe Tyr
100 105 110
Gin Tyr Ala Thr Tyr Lys Asp Arg He Leu Asp Lys Asn Lys Gin Leu
115 120 125
Glu Asn Leu Gly Leu Ser Pro He Lys Met Gly Leu Arg He Asn Pro
130 135 140
Leu Tyr Ser Glu Val Thr Pro Ala He Tyr Asn Pro Cys Ser Lys Val 145 150 155 160
Ser Arg Leu Gly He Thr Pro Ser Gly Phe Glu Lys Gly Val Lys Glu
165 170 175
His Gly Leu Glu Gly Val Ser Gly Leu His Phe His Thr His Cys Glu 180 185 190 Gin Asn Ala Asp Ala Leu Cys Arg Thr Leu Glu His Val Glu Lys His
195 200 205
Phe Arg Pro Tyr Leu Glu Asn Met Ala Trp Val Asn Phe Gly Gly Gly
210 215 220
His His He Thr Lys Ser Asp Tyr Asp Val Asn Leu Leu He Gin Thr 225 230 235 240
He Lys Asp Phe Lys Glu Arg Tyr His Asn He Glu Val He Leu Glu
245 250 255
Pro Gly Glu Ala He Gly Trp Gin Cys Gly Phe Leu He Ala Ser Val
260 265 270
He Asp He Val Gin Asn Asp Gin Glu He Ala He Leu Asp Ala Ser
275 280 285
Phe Ser Ala His Met Pro Asp Cys Leu Glu Met Pro Tyr Arg Pro Ser
290 295 300
He Phe Lys Val Ser Val Glu Asn Asp Glu Glu Leu He Glu Val Glu 305 310 315 320
Lys Gly Glu Asn Gin Gly Ala Phe Ser Tyr Phe Leu Gly Gly Pro Thr
325 330 335
Cys Leu Ala Gly Asp Phe Met Gly Ser Phe Ser Phe Glu Thr Pro Leu
340 345 350
Lys Arg Gly Asp Lys He Val Phe Gin Asp Met Leu His Tyr Thr He
355 360 365
Val Lys Asn Asn Ser Phe Asn Gly Val Pro Leu Pro Ser Leu Ala Arg
370 375 380
Leu Asp Gin Gin Gly Phe Lys He Leu Lys Asn Phe Ser Tyr Glu Asp 385 390 395 400
Tyr Lys Asn Arg Asn 405
(2) INFORMATION FOR SEQ ID NO: 605:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 609 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 246...548 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605:
GAATCTTTAC TTTATAATTT GCCTGACCTT TTAAAAGAAC ACTCTAATGA AAATGTCTTG 60
CTCATCTTAC ACTTGCAGGC TCGCATGGCC CAAACTACGA CAACAAAGTG CCTTTAAATT 120
TTAGGGTGTT TAAGCCTTAT TGCTCAAGCG CTGATCTGTC TTCTTGCTCC AAAGAAAGCC 180
TGATTAACGC CTATGACAAC ACCATTTTTT ACAACGACTA TCTGCTAGAT CGAAAGATCA 240
TTAGC ATG CTT GAA AAC GCC AAG CAG CCC GCC TTA ATG ATC TAT TTA AGC 290 Met Leu Glu Asn Ala Lys Gin Pro Ala Leu Met He Tyr Leu Ser 1 5 10 15
GAT CAT GGC GAA AGT TTG GGC GAA GAA GCG TTC TAT TTG CAT GGC ATT 338 Asp His Gly Glu Ser Leu Gly Glu Glu Ala Phe Tyr Leu His Gly He 20 25 30
CCT AAA AGC ATC GCC CCC AAA GAA CAA TAC GAG ATC CCC TTT ATC GTT 386 Pro Lys Ser He Ala Pro Lys Glu Gin Tyr Glu He Pro Phe He Val 35 40 45
TAT GCT AAT GAG CCT TTC AAA GAA AAG CAT TCC ATC ATT CAA ACC CAA 434 Tyr Ala Asn Glu Pro Phe Lys Glu Lys His Ser He He Gin Thr Gin 50 55 60
ACC CCC ATT AAT CAA AAT GTG ATT TTC CAT AGC GTT TTA GGG GTG TTT 482 Thr Pro He Asn Gin Asn Val He Phe His Ser Val Leu Gly Val Phe 65 70 75
TTG GAT TTT AAA AAC CCA AGC GTT GTT TAT CGC CCT TCT TTA GAT CTG 530 Leu Asp Phe Lys Asn Pro Ser Val Val Tyr Arg Pro Ser Leu Asp Leu 80 85 90 95
CTT AAA CAC AAA AAA GAG TAAAATAACA CGCATGAAAA AATTCTTATT TAAACAAA 586 Leu Lys His Lys Lys Glu 100
AATTTTGTGA AAGCCTGCCC AAA 609
(2) INFORMATION FOR SEQ ID NO: 606:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 101 ammo acids
Figure imgf000932_0001
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
( i) SEQUENCE DESCRIPTION. SEQ ID NO: 606.
Met Leu Glu Asn Ala Lys Gin Pro Ala Leu Met He Tyr Leu Ser Asp
1 5 10 15
His Gly Glu Ser Leu Gly Glu Glu Ala Phe Tyr Leu His Gly He Pro
20 25 30
Lys Ser He Ala Pro Lys Glu Gin Tyr Glu He Pro Phe He Val Tyr
35 40 45
Ala Asn Glu Pro Phe Lys Glu Lys His Ser He He Gin Thr Gin Thr
50 55 60
Pro He Asn Gin Asn Val He Phe His Ser Val Leu Gly Val Phe Leu 65 70 75 80
Asp Phe Lys Asn Pro Ser Val Val Tyr Arg Pro Ser Leu Asp Leu Leu
85 90 95
Lys His Lys Lys Glu 100 (2) INFORMATION FOR SEQ ID NO 607
(l) SEQUENCE CHARACTERISTICS
(A) LENGTH 872 base pairs
(B) TYPE nucleic acid
(C) STRANDEDNESS single
(D) TOPOLOGY linear
(n) MOLECULE TYPE Genomic DNA (ix) FEATURE
(A) NAME/KEY Coding Sequence
(B) LOCATION 123 818 (D) OTHER INFORMATION
(xi) SEQUENCE DESCRIPTION SEQ ID NO 607
AGTTTTAGCG ATAATTATAT CTTTTATGGC AATAAAACGA GCGTTTGTAA GCAAATCGGT 60 AACGCTGTGC CTCCTCTTCT AGCCCTAGCC TTAGGCAAAG CGATCTTAAA AAGCTTAAGA 120 AA ATG ATA CAA ATT TAT CAC GCT GAC GCT TTT GAA ATC ATC AAA GAC 167 Met He Gin He Tyr His Ala Asp Ala Phe Glu He He Lys Asp 1 5 10 15
TTT TAC CAG CAA AAT TTA AAA GTG GAT GCG ATC ATC ACG GAC CCT CCT 215 Phe Tyr Gin Gin Asn Leu Lys Val Asp Ala He He Thr Asp Pro Pro 20 25 30
TAT AAC ATT TCG GTT AAA AAC AAT TTT CCC ACC CTA AAG AGC GCT AAA 263 Tyr Asn He Ser Val Lys Asn Asn Phe Pro Thr Leu Lys Ser Ala Lys 35 40 45
AGG CAA GGC ATA GAT TTT GGG GAA TGG GAT AAA AAT TTC AAG CTT TTA 311 Arg Gin Gly He Asp Phe Gly Glu Trp Asp Lys Asn Phe Lys Leu Leu 50 55 60
GAA TGG ATC GCA CGC TAC GCC CCC TTA GTC AAT CCA AAC GGC TGC ATG 359 Glu Trp He Ala Arg Tyr Ala Pro Leu Val Asn Pro Asn Gly Cys Met 65 70 75
GTT ATT TTT TGC TCT TAC AGG TTT ATA AGC TAT ATC GCT GAT TTT TTA 407 Val He Phe Cys Ser Tyr Arg Phe He Ser Tyr He Ala Asp Phe Leu 80 85 90 95
GAA GAA AAC GGC TTT GTG GTC AAA GAC TTT ATC CAA TGG GTT AAA AAT 455 Glu Glu Asn Gly Phe Val Val Lys Asp Phe He Gin Trp Val Lys Asn 100 105 110
AAT CCC ATG CCA AGA AAC ATT CAC CGG CGT TAT GTC CAA GAC ACG GAA 503 Asn Pro Met Pro Arg Asn He His Arg Arg Tyr Val Gin Asp Thr Glu 115 120 125
TTT GCT CTG TGG GCG GTT AAA AAG AAA GCC AAG TGG GTG TTT AAC AAA 551 Phe Ala Leu Trp Ala Val Lys Lys Lys Ala Lys Trp Val Phe Asn Lys 130 135 140
CCC AAA AAT GAA AAA TAT TTA CGG CCT TTG ATT TTA AAA AGC CCT GTG 599 Pro Lys Asn Glu Lys Tyr Leu Arg Pro Leu He Leu Lys Ser Pro Val 145 150 155
GTA AGC GGG CTT GAA AAA ACC AAA CAC CCC ACG CAA AAA AGC CTG GCC 647 Val Ser Gly Leu Glu Lys Thr Lys His Pro Thr Gin Lys Ser Leu Ala 160 165 170 175
TTA ATG GAA AAA ATC ATT TCC ATC CAC ACA AAC CCT AAT GAC ATC GTG 695 Leu Met Glu Lys He He Ser He His Thr Asn Pro Asn Asp He Val 180 185 190
CTA GAT CCT TTC ATG GGG AGC GGC ACC ACC GGC TTA GCG TGC AAA AAT 743 Leu Asp Pro Phe Met Gly Ser Gly Thr Thr Gly Leu Ala Cys Lys Asn 195 200 205
TTA GAA CGG AAT TTT ATC GGC ATA GAA TCA GAA AAA GAA TAT TTT CAA 791 Leu Glu Arg Asn Phe He Gly He Glu Ser Glu Lys Glu Tyr Phe Gin 210 215 220
ACC GCT AAA AAG CGT TTG AAT CTG TTT TAAAAACGCT ATTTGAATGA GATTGTG 845 Thr Ala Lys Lys Arg Leu Asn Leu Phe 225 230
TTATAGTTAT TTAAAAGGAT ATTTTGA 872
(2) INFORMATION FOR SEQ ID NO: 608:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 232 ammo acids
(B) TYPE: am o acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:608:
Met He Gin He Tyr His Ala Asp Ala Phe Glu He He Lys Asp Phe
1 5 10 15
Tyr Gin Gin Asn Leu Lys Val Asp Ala He He Thr Asp Pro Pro Tyr
20 25 30
Asn He Ser Val Lys Asn Asn Phe Pro Thr Leu Lys Ser Ala Lys Arg
35 40 45
Gin Gly He Asp Phe Gly Glu Trp Asp Lys Asn Phe Lys Leu Leu Glu
50 55 60
Trp He Ala Arg Tyr Ala Pro Leu Val Asn Pro Asn Gly Cys Met Val 65 70 75 80
He Phe Cys Ser Tyr Arg Phe He Ser Tyr He Ala Asp Phe Leu Glu
85 90 95
Glu Asn Gly Phe Val Val Lys Asp Phe He Gin Trp Val Lys Asn Asn 100 105 110
Pro Met Pro Arg Asn He His Arg Arg Tyr Val Gin Asp Thr Glu Phe
115 120 125
Ala Leu Trp Ala Val Lys Lys Lys Ala Lys Trp Val Phe Asn Lys Pro
130 135 140
Lys Asn Glu Lys Tyr Leu Arg Pro Leu He Leu Lys Ser Pro Val Val 145 150 155 160
Ser Gly Leu Glu Lys Thr Lys His Pro Thr Gin Lys Ser Leu Ala Leu
165 170 175
Met Glu Lys He He Ser He His Thr Asn Pro Asn Asp He Val Leu
180 185 190
Asp Pro Phe Met Gly Ser Gly Thr Thr Gly Leu Ala Cys Lys Asn Leu
195 200 205
Glu Arg Asn Phe He Gly He Glu Ser Glu Lys Glu Tyr Phe Gin Thr
210 215 220
Ala Lys Lys Arg Leu Asn Leu Phe 225 230
(2) INFORMATION FOR SEQ ID NO: 609:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1181 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 60...1124 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609: TTGCGTTTTG AAACCGATGA TTTTTCAACG CTTATTGATC GTATTTGTGA AAGCTTGAA 59
ATG AAT TAT AAA ATT TTA GAT TTA TTT TGT GGG GCT GGG GGT TTT AGC 107 Met Asn Tyr Lys He Leu Asp Leu Phe Cys Gly Ala Gly Gly Phe Ser 1 5 10 15
GCT GGG TTA GAG TGT TTA GAA GAG TTT GAC GCT TTA ATA GGG CTA GAT 155 Ala Gly Leu Glu Cys Leu Glu Glu Phe Asp Ala Leu He Gly Leu Asp 20 25 30
TGC GAT AAA CAA GCC CTA ATC ACT TTT GAA AAC AAC CAT AAA AAC GCC 203 Cys Asp Lys Gin Ala Leu He Thr Phe Glu Asn Asn His Lys Asn Ala 35 40 45
ATA GGC GTT TGT GGG GAC ATC ACT CAA ACC GAA ATT AAA GAA AAA GTC 251 He Gly Val Cys Gly Asp He Thr Gin Thr Glu He Lys Glu Lys Val 50 55 60
ATC AAA CTA GCT AAA AAA TTA GAA ATC AAC ATG ATC ATT GGC GGG CCT 299 He Lys Leu Ala Lys Lys Leu Glu He Asn Met He He Gly Gly Pro 65 70 75 80
CCA TGT CAA GGC TTT TCT AAT AAA GGG AAA AAT TTA GGG CTA AAA GAC 347 Pro Cys Gin Gly Phe Ser Asn Lys Gly Lys Asn Leu Gly Leu Lys Asp 85 90 95
CCT AGG AAT TTT TTA TTC TTA GAA TAT ATA GAA ATA GTC AAA GCC ATA 395 Pro Arg Asn Phe Leu Phe Leu Glu Tyr He Glu He Val Lys Ala He 100 105 110
AAG CCA GAA ATT TTT ATC ATT GAA AAC GTG AAA AAC CTC ATC TCT TGC 443 Lys Pro Glu He Phe He He Glu Asn Val Lys Asn Leu He Ser Cys 115 120 125
GCT AAA GGC TAT TTT TTA GAA GAA ATT AAA GAA AGG TTG AAC GCT TTA 491 Ala Lys Gly Tyr Phe Leu Glu Glu He Lys Glu Arg Leu Asn Ala Leu 130 135 140
GGG TAT CAA TTG AGC TAT CAA ATC CTA AAC GCT AAA GAT TAT GGC GTG 539 Gly Tyr Gin Leu Ser Tyr Gin He Leu Asn Ala Lys Asp Tyr Gly Val 145 150 155 160
CCT CAA AAC AGA GAG AGA GCC TTT ATT GTA GGG GCT AGT CGT TTC AGT 587 Pro Gin Asn Arg Glu Arg Ala Phe He Val Gly Ala Ser Arg Phe Ser 165 170 175
TTT GAT TTC AAT CTT TTA GAG CCT TCT CAA AGC GTG AAT GTT CAA GAT 635 Phe Asp Phe Asn Leu Leu Glu Pro Ser Gin Ser Val Asn Val Gin Asp 180 185 190
GCC ATA AGC GAT TTA GCC TAT CTT TGT TCT AAT GAG GGG GCG TTT GAG 683 Ala He Ser Asp Leu Ala Tyr Leu Cys Ser Asn Glu Gly Ala Phe Glu 195 200 205
AGC GAT TAT TTA AAC CCT ATC CAA TCA AGC TAT CAA GCT TTA ATG CGA 731 Ser Asp Tyr Leu Asn Pro He Gin Ser Ser Tyr Gin Ala Leu Met Arg 210 215 220
AAA GAT AGC CCT AAA TTA TAC AAC CAT CAA GCC ACC AAC CAC TCG CAA 779 Lys Asp Ser Pro Lys Leu Tyr Asn His Gin Ala Thr Asn His Ser Gin 225 230 235 240
GCC GCT TTA GAG AAA TTA AAA CTC ATT AAC AAA GAA CAA GGC AAA GAA 827 Ala Ala Leu Glu Lys Leu Lys Leu He Asn Lys Glu Gin Gly Lys Glu 245 250 255
TGC TTG CCT AAA AAC TTG CAT GGC AAA CAG CAA TTC AAA AGC ACA TGG 875 Cys Leu Pro Lys Asn Leu His Gly Lys Gin Gin Phe Lys Ser Thr Trp 260 265 270
GGG CGC CTG AAT TGG AAT AAA ATC AGC CCC ACC ATA GAC ACA CGA TTT 923 Gly Arg Leu Asn Trp Asn Lys He Ser Pro Thr He Asp Thr Arg Phe 275 280 285
GAC ACT CCC AGC AAT GGC ACC AAC TCC CAC CCC GAA TTG CAC CGC TCT 971 Asp Thr Pro Ser Asn Gly Thr Asn Ser His Pro Glu Leu His Arg Ser 290 295 300
ATC ACG CCT AGA GAA GCC GCT AGG ATA CAA AGT TTT AGC GAT AAT TAT 1019 He Thr Pro Arg Glu Ala Ala Arg He Gin Ser Phe Ser Asp Asn Tyr 305 310 315 320
ATC TTT TAT GGC AAT AAA ACG AGC GTT TGT AAG CAA ATC GGT AAC GCT 1067 He Phe Tyr Gly Asn Lys Thr Ser Val Cys Lys Gin He Gly Asn Ala 325 330 335
GTG CCT CCT CTT CTA GCC CTA GCC TTA GGC AAA GCG ATC TTA AAA AGC 1115 Val Pro Pro Leu Leu Ala Leu Ala Leu Gly Lys Ala He Leu Lys Ser 340 345 350
TTA AGA AAA TGATACAAAT TTATCACGCT GACGCTTTTG AAATCATCAA AGACTTTTAC 1174 Leu Arg Lys 355
CAGCAAA 1181
(2) INFORMATION FOR SEQ ID NO: 610:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 355 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610:
Met Asn Tyr Lys He Leu Asp Leu Phe Cys Gly Ala Gly Gly Phe Ser
1 5 10 15
Ala Gly Leu Glu Cys Leu Glu Glu Phe Asp Ala Leu He Gly Leu Asp
20 25 30
Cys Asp Lys Gin Ala Leu He Thr Phe Glu Asn Asn His Lys Asn Ala
35 40 45
He Gly Val Cys Gly Asp He Thr Gin Thr Glu He Lys Glu Lys Val
50 55 60
He Lys Leu Ala Lys Lys Leu Glu He Asn Met He He Gly Gly Pro 65 70 75 80
Pro Cys Gin Gly Phe Ser Asn Lys Gly Lys Asn Leu Gly Leu Lys Asp
85 90 95
Pro Arg Asn Phe Leu Phe Leu Glu Tyr He Glu He Val Lys Ala He
100 105 110
Lys Pro Glu He Phe He He Glu Asn Val Lys Asn Leu He Ser Cys 115 120 125 Ala Lys Gly Tyr Phe Leu Glu Glu He Lys Glu Arg Leu Asn Ala Leu
130 135 140
Gly Tyr Gin Leu Ser Tyr Gin He Leu Asn Ala Lys Asp Tyr Gly Val 145 150 155 160
Pro Gin Asn Arg Glu Arg Ala Phe He Val Gly Ala Ser Arg Phe Ser
165 170 175
Phe Asp Phe Asn Leu Leu Glu Pro Ser Gin Ser Val Asn Val Gin Asp
180 185 190
Ala He Ser Asp Leu Ala Tyr Leu Cys Ser Asn Glu Gly Ala Phe Glu
195 200 205
Ser Asp Tyr Leu Asn Pro He Gin Ser Ser Tyr Gin Ala Leu Met Arg
210 215 220
Lys Asp Ser Pro Lys Leu Tyr Asn His Gin Ala Thr Asn His Ser Gin 225 230 235 240
Ala Ala Leu Glu Lys Leu Lys Leu He Asn Lys Glu Gin Gly Lys Glu
245 250 255
Cys Leu Pro Lys Asn Leu His Gly Lys Gin Gin Phe Lys Ser Thr Trp
260 265 270
Gly Arg Leu Asn Trp Asn Lys He Ser Pro Thr He Asp Thr Arg Phe
275 280 285
Asp Thr Pro Ser Asn Gly Thr Asn Ser His Pro Glu Leu His Arg Ser
290 295 300
He Thr Pro Arg Glu Ala Ala Arg He Gin Ser Phe Ser Asp Asn Tyr 305 310 315 320
He Phe Tyr Gly Asn Lys Thr Ser Val Cys Lys Gin He Gly Asn Ala
325 330 335
Val Pro Pro Leu Leu Ala Leu Ala Leu Gly Lys Ala He Leu Lys Ser
340 345 350
Leu Arg Lys 355
(2) INFORMATION FOR SEQ ID NO: 611:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1361 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...1305 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611:
TATTGATAGC ATGAGTTGTT TTTGGTTTGG AATTTTAAGG AGTAGCTT ATG AAA GAG 57
Met Lys Glu 1 CAA TCA ATG ATT GAT TTT TTA AAA CTT AGA GAT TAT GAC ATT AGA AAA 105 Gin Ser Met He Asp Phe Leu Lys Leu Arg Asp Tyr Asp He Arg Lys 5 10 15
ACA CAA AAT GCG CGA TGG ATA GAT CAA AAA TGC ACC CCT GAT GTG TTG 153 Thr Gin Asn Ala Arg Trp He Asp Gin Lys Cys Thr Pro Asp Val Leu 20 25 30 35
TCT CTT GTT GCT GAT TGT ATT TTA GAG TTT ACG CAA TGT AAT ATT GGA 201 Ser Leu Val Ala Asp Cys He Leu Glu Phe Thr Gin Cys Asn He Gly 40 45 50
AAA TCA TTT TCT ATT AGG GAT ATT TGG GAT AGC CCT TAC ACC AAT GAA 249 Lys Ser Phe Ser He Arg Asp He Trp Asp Ser Pro Tyr Thr Asn Glu 55 60 65
AAT GTT AAA ATG ATT TTT TCT AAA CCT GAT TTA AAT TCT GAC TTT TCC 297 Asn Val Lys Met He Phe Ser Lys Pro Asp Leu Asn Ser Asp Phe Ser 70 75 80
ATG CAT GAA TAC GAT AAG TTT TTT TCT CAG CCT ATT AAA TTA TTA GCC 345 Met His Glu Tyr Asp Lys Phe Phe Ser Gin Pro He Lys Leu Leu Ala 85 90 95
TAT AGC GGT ATT TTA TTT GAA ACA AAA ACT GGC AAT AGA AAT ATT TAT 393 Tyr Ser Gly He Leu Phe Glu Thr Lys Thr Gly Asn Arg Asn He Tyr 100 105 110 115
ACC ATA CAA AAC ATA GAG CTA TTA GAA TAT CTC ATG CAA AGA GAA ACA 441 Thr He Gin Asn He Glu Leu Leu Glu Tyr Leu Met Gin Arg Glu Thr 120 125 130
AAC GCT TTG AAA TTC CTT ATT TTA TAT ATT CAA AAG GTA TTA ATG GAT 489 Asn Ala Leu Lys Phe Leu He Leu Tyr He Gin Lys Val Leu Met Asp 135 140 145
AGT GGG ATT TAT CCT TTA TTT GAC AAC TTT TTA CAA AAA CAA GAC ACA 537 Ser Gly He Tyr Pro Leu Phe Asp Asn Phe Leu Gin Lys Gin Asp Thr 150 155 160
GAA AGT TTT AAG CAA CTA AAA GAT GGT TTC ACT CAT TTT ACT ATC AAT 585 Glu Ser Phe Lys Gin Leu Lys Asp Gly Phe Thr His Phe Thr He Asn 165 170 175
AAC ACA GCA ATC AAT AAC GCT ACG GAA TGT TTT AGG ATT TTT ACT AAA 633 Asn Thr Ala He Asn Asn Ala Thr Glu Cys Phe Arg He Phe Thr Lys 180 185 190 195
ATT ATC AAT CCT TTA GCT TTT TAT TAT GGT AAA AAA GGC ACA AGA AAA 681 He He Asn Pro Leu Ala Phe Tyr Tyr Gly Lys Lys Gly Thr Arg Lys 200 205 210
GGG TAT TTG TCC AAC ACT ATA ATT ACA AAA GAT GAG CTT AAT TAT AAT 729 Gly Tyr Leu Ser Asn Thr He He Thr Lys Asp Glu Leu Asn Tyr Asn 215 220 225 CGT ATC AAT TGG CGA GAT ATA GGA AAA GAT AAA AAT ACC ACC AGA CAA 777 Arg He Asn Trp Arg Asp He Gly Lys Asp Lys Asn Thr Thr Arg Gin 230 235 240
GAA TAC GAT CTT ATA AAC TCT AAA AGG ATT GCT AAT TCT AAC TAT CTT 825 Glu Tyr Asp Leu He Asn Ser Lys Arg He Ala Asn Ser Asn Tyr Leu 245 250 255
ATT TCA AAA GCT AAG AAA GTG GTG AAA CGA TAT AAT GAT AGA TTT AAT 873 He Ser Lys Ala Lys Lys Val Val Lys Arg Tyr Asn Asp Arg Phe Asn 260 265 270 275
AAT TCT CTC TCT GAA GTA AAA CAA GAA AAA GAA GAG TCG CAA GCC ACA 921 Asn Ser Leu Ser Glu Val Lys Gin Glu Lys Glu Glu Ser Gin Ala Thr 280 285 290
CAA ATA CAC CAT ATT TTT CCC ATC CAA GAC TTT CCC ATT ATT GCT AAC 969 Gin He His His He Phe Pro He Gin Asp Phe Pro He He Ala Asn 295 300 305
TAT ATA GAG AAT CTT ATC GCA CTC ACT CCT AAT CAA CAT TTT ATT TAC 1017 Tyr He Glu Asn Leu He Ala Leu Thr Pro Asn Gin His Phe He Tyr 310 315 320
GCC CAC CCT AAT AAT CAA ACC CGC TTG ATT GAT AAA GAT TTT CAA TAT 1065 Ala His Pro Asn Asn Gin Thr Arg Leu He Asp Lys Asp Phe Gin Tyr 325 330 335
ATC TGC TTA TTA GCT AAA ACG ACC ACA ATT CTT AAT GAC ACT CAA GGC 1113 He Cys Leu Leu Ala Lys Thr Thr Thr He Leu Asn Asp Thr Gin Gly 340 345 350 355
GTA TAT GAT TGG AAT GAT TAT ATT GTT GTG TTG AAT ATG GGC CTC AAA 1161 Val Tyr Asp Trp Asn Asp Tyr He Val Val Leu Asn Met Gly Leu Lys 360 365 370
ACA ACT ATC TTT TCT CAA GTC AAG AAC GAA TGG GAA TTA TTA AAA GTA 1209 Thr Thr He Phe Ser Gin Val Lys Asn Glu Trp Glu Leu Leu Lys Val 375 380 385
ATA GAT GCT TTT TAT TTT GAT TTT AAC AAG AGC AAA GAT CCA AGT TGG 1257 He Asp Ala Phe Tyr Phe Asp Phe Asn Lys Ser Lys Asp Pro Ser Trp 390 395 400
TCA TAC TTG CTA GAT AAA AAC GAT TTA AGA GCT TTC AAG CTA AAA TTT T 1306 Ser Tyr Leu Leu Asp Lys Asn Asp Leu Arg Ala Phe Lys Leu Lys Phe 405 410 415
AATAAGTTTT ATTGAAACTG GCTATAAAAA CCCGCTTGAC TTATCTTATC CTTTT 1361
(2) INFORMATION FOR SEQ ID NO: 612:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 419 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612:
Met Lys Glu Gin Ser Met He Asp Phe Leu Lys Leu Arg Asp Tyr Asp
1 5 10 15
He Arg Lys Thr Gin Asn Ala Arg Trp He Asp Gin Lys Cys Thr Pro
20 25 30
Asp Val Leu Ser Leu Val Ala Asp Cys He Leu Glu Phe Thr Gin Cys
35 40 45
Asn He Gly Lys Ser Phe Ser He Arg Asp He Trp Asp Ser Pro Tyr
50 55 60
Thr Asn Glu Asn Val Lys Met He Phe Ser Lys Pro Asp Leu Asn Ser 65 70 75 80
Asp Phe Ser Met His Glu Tyr Asp Lys Phe Phe Ser Gin Pro He Lys
85 90 95
Leu Leu Ala Tyr Ser Gly He Leu Phe Glu Thr Lys Thr Gly Asn Arg
100 105 110
Asn He Tyr Thr He Gin Asn He Glu Leu Leu Glu Tyr Leu Met Gin
115 120 125
Arg Glu Thr Asn Ala Leu Lys Phe Leu He Leu Tyr He Gin Lys Val
130 135 140
Leu Met Asp Ser Gly He Tyr Pro Leu Phe Asp Asn Phe Leu Gin Lys 145 150 155 160
Gin Asp Thr Glu Ser Phe Lys Gin Leu Lys Asp Gly Phe Thr His Phe
165 170 175
Thr He Asn Asn Thr Ala He Asn Asn Ala Thr Glu Cys Phe Arg He
180 185 190
Phe Thr Lys He He Asn Pro Leu Ala Phe Tyr Tyr Gly Lys Lys Gly
195 200 205
Thr Arg Lys Gly Tyr Leu Ser Asn Thr He He Thr Lys Asp Glu Leu
210 215 220
Asn Tyr Asn Arg He Asn Trp Arg Asp He Gly Lys Asp Lys Asn Thr 225 230 235 240
Thr Arg Gin Glu Tyr Asp Leu He Asn Ser Lys Arg He Ala Asn Ser
245 250 255
Asn Tyr Leu He Ser Lys Ala Lys Lys Val Val Lys Arg Tyr Asn Asp
260 265 270
Arg Phe Asn Asn Ser Leu Ser Glu Val Lys Gin Glu Lys Glu Glu Ser
275 280 285
Gin Ala Thr Gin He His His He Phe Pro He Gin Asp Phe Pro He
290 295 300
He Ala Asn Tyr He Glu Asn Leu He Ala Leu Thr Pro Asn Gin His 305 310 315 320
Phe He Tyr Ala His Pro Asn Asn Gin Thr Arg Leu He Asp Lys Asp
325 330 335
Phe Gin Tyr He Cys Leu Leu Ala Lys Thr Thr Thr He Leu Asn Asp
340 345 350
Thr Gin Gly Val Tyr Asp Trp Asn Asp Tyr He Val Val Leu Asn Met
355 360 365
Gly Leu Lys Thr Thr He Phe Ser Gin Val Lys Asn Glu Trp Glu Leu 370 375 380
Leu Lys Val He Asp Ala Phe Tyr Phe Asp Phe Asn Lys Ser Lys Asp 385 390 395 400
Pro Ser Trp Ser Tyr Leu Leu Asp Lys Asn Asp Leu Arg Ala Phe Lys
405 410 415
Leu Lys Phe
(2) INFORMATION FOR SEQ ID NO: 613
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2610 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 90...2558 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613:
TAAAGAGGCT TTTGAAACCA TGCTTAAAGA AATTGAGAGC TTGAAACATT AATGCTTCAA 60 ATGAATTTGG TTTAATCCTT AATCCCCTT ATG CTC TTT GAT CAA ACC TTA ACC 113
Met Leu Phe Asp Gin Thr Leu Thr 1 5
TAT ATT TCT TTA TTT TCT GGG GCA GGA GTG GGG TGC TAT GGG CTT TTA 161 Tyr He Ser Leu Phe Ser Gly Ala Gly Val Gly Cys Tyr Gly Leu Leu 10 15 20
GAA GAG GGG TTT GAA TGC GTT GCT ACC AAT GAA ATT TTA GAA AAA CGC 209 Glu Glu Gly Phe Glu Cys Val Ala Thr Asn Glu He Leu Glu Lys Arg 25 30 35 40
TTG AAT ATC CAA AGG ATT AAT CGC AAA TGC AAA TTA GAT GAA AGC TAC 257 Leu Asn He Gin Arg He Asn Arg Lys Cys Lys Leu Asp Glu Ser Tyr 45 50 55
ATT AGT GGG GAC ATT AAA AAG CCA GAA ACA AAA GAA AAA ATT TTA AAG 305 He Ser Gly Asp He Lys Lys Pro Glu Thr Lys Glu Lys He Leu Lys 60 65 70
CAA ATT GAA TTT TAT TCT AAA AAA TTT GGT AAT GAT AGG GTT GAT TTA 353 Gin He Glu Phe Tyr Ser Lys Lys Phe Gly Asn Asp Arg Val Asp Leu 75 80 85
GTG GTA GCA ACC CCA CCT TGT CAA GGC ATG AGC GTA GCC AAT CAT AAG 401 Val Val Ala Thr Pro Pro Cys Gin Gly Met Ser Val Ala Asn His Lys 90 95 100
AAG AAA AAC GAT GAG ATC AAA CGG AAT TCT TTG GTG GTT GAA AGC ATT 449 Lys Lys Asn Asp Glu He Lys Arg Asn Ser Leu Val Val Glu Ser He 105 110 115 120
GAT TTG ATC AAA CAA ATC AAA CCC AGA TTT TTT ATT TTA GAA AAT GTC 497 Asp Leu He Lys Gin He Lys Pro Arg Phe Phe He Leu Glu Asn Val 125 130 135
CCT AGT TTT TAT AAA ACA GGT TGT ATA GAC AAA AAT GAT AAT TTG CTA 545 Pro Ser Phe Tyr Lys Thr Gly Cys He Asp Lys Asn Asp Asn Leu Leu 140 145 150
GAA ATA GGA TCT ATG ATA GAG CAA AAT TTG AGT GGC GAT TAT ATG CTC 593 Glu He Gly Ser Met He Glu Gin Asn Leu Ser Gly Asp Tyr Met Leu 155 160 165
TAT GAT GAG GTA ATC AAT TTT AAA AAT TTT GGA GCT AAT TCA AGC CGA 641 Tyr Asp Glu Val He Asn Phe Lys Asn Phe Gly Ala Asn Ser Ser Arg 170 175 180
ACA AGA ACT TTA GTG ATA GGG GTT TGT AAA GAG TTT AAA GAT TTT ATA 689 Thr Arg Thr Leu Val He Gly Val Cys Lys Glu Phe Lys Asp Phe He 185 190 195 200
AGC GCG TTA GAA TTT TTT CCT GAT TTC AAA CAA GAA AAA ACC TTA AAA 737 Ser Ala Leu Glu Phe Phe Pro Asp Phe Lys Gin Glu Lys Thr Leu Lys 205 210 215
GAA GTG ATA GGA TCG TTA AAA CCA CTT GCT TGG GGC GAG TAT GAC AAC 785 Glu Val He Gly Ser Leu Lys Pro Leu Ala Trp Gly Glu Tyr Asp Asn 220 225 230
ACG GAT TTT TAT CAT AGT TTT AGA ACT TAT CCA AAG CAT ATG CAA GAA 833 Thr Asp Phe Tyr His Ser Phe Arg Thr Tyr Pro Lys His Met Gin Glu 235 240 245
TGG ATT AAG GAT TTA AAA GAA GGA CAA AGC GCG TTT GAG AAT ACA GAA 881 Trp He Lys Asp Leu Lys Glu Gly Gin Ser Ala Phe Glu Asn Thr Glu 250 255 260
TTA AAC AAA AAA CCT CAT AGA ATT GTT GGC AGT AAG ATT GTC TTA AAT 929 Leu Asn Lys Lys Pro His Arg He Val Gly Ser Lys He Val Leu Asn 265 270 275 280
GTT TCT AAA AAT GGC GAT AAA TAT AAA AGA CAA AAA TAT CAT AGC GTT 977 Val Ser Lys Asn Gly Asp Lys Tyr Lys Arg Gin Lys Tyr His Ser Val 285 290 295
GCC CCT TGC ATT CAT ACA AGA AAC GAC CAA ATG GCT AGC CAA AAC ACG 1025 Ala Pro Cys He His Thr Arg Asn Asp Gin Met Ala Ser Gin Asn Thr 300 305 310
ATC CAC CCC AAA GAT GAT AGA GTG TTT TCC ATT AGA GAG CTG ATG CTT 1073 He His Pro Lys Asp Asp Arg Val Phe Ser He Arg Glu Leu Met Leu 315 320 325
TTA ATG AAT ATC CCT AGC CGT TTT AAG TGG TTA GAT TTA GAA TTA CAA 1121 Leu Met Asn He Pro Ser Arg Phe Lys Trp Leu Asp Leu Glu Leu Gin 330 335 340
GAA TTA AAC GCC CTT AAC CAA CAA GAA AAA GAA AAA ATC TCC AAA CAA 1169 Glu Leu Asn Ala Leu Asn Gin Gin Glu Lys Glu Lys He Ser Lys Gin 345 350 355 360
AAC GAA ATG AAT ATA AGA CAA AGC ATC GGT GAA GCT GTT CCA ACG ATT 1217 Asn Glu Met Asn He Arg Gin Ser He Gly Glu Ala Val Pro Thr He 365 370 375
ATT TTT AAG CAA ATT GCC ATA AAG ATA AAA AAT TTC ATG TCT CAA ACC 1265 He Phe Lys Gin He Ala He Lys He Lys Asn Phe Met Ser Gin Thr 380 385 390
CAC TTA GAG CCT AAA GAA ATC ATT AGG CTT ATT GAT GTG CAC CAT TTA 1313 His Leu Glu Pro Lys Glu He He Arg Leu He Asp Val His His Leu 395 400 405
TTA GAG CCA CAA AAT TTG AAG CGA TTT ATT TTA GAA AAT CAA AAC AAG 1361 Leu Glu Pro Gin Asn Leu Lys Arg Phe He Leu Glu Asn Gin Asn Lys 410 415 420
ATT GCA AGA GCG AGT TTA GTG AGT TTG GCA GAA ATG TCT AAT TCT AAA 1409 He Ala Arg Ala Ser Leu Val Ser Leu Ala Glu Met Ser Asn Ser Lys 425 430 435 440
CGC ATA GAA AAA AGC GCG TAT TTT ACA AAC CCT TTT ATT ATT AAT GAA 1457 Arg He Glu Lys Ser Ala Tyr Phe Thr Asn Pro Phe He He Asn Glu 445 450 455
ATA GCG AAG TTA TTG CCA AGC TTT AAA CAA GAG AGT GTT ACT ATT ATA 1505 He Ala Lys Leu Leu Pro Ser Phe Lys Gin Glu Ser Val Thr He He 460 465 470
GAG CCA AGT GCA GGG TGT GGG AAT TTC TTA AGT GCT CTT TTT AAA AAA 1553 Glu Pro Ser Ala Gly Cys Gly Asn Phe Leu Ser Ala Leu Phe Lys Lys 475 480 485
TAC ACT TCT GTT AAA AAA GTT TAT TTA AAG TGT ATA GAT ATT GAT AAA 1601 Tyr Thr Ser Val Lys Lys Val Tyr Leu Lys Cys He Asp He Asp Lys 490 495 500
AAT AGT TTA GAA ATT TTA GAG ATT TTA TAT AAA GAT TGC ATT CCT AAC 1649 Asn Ser Leu Glu He Leu Glu He Leu Tyr Lys Asp Cys He Pro Asn 505 510 515 520
AAT TTT GAG ATG GAA TTG ATT TGC AAA GAT TTT CTA GCC TAT GAA TGC 1697 Asn Phe Glu Met Glu Leu He Cys Lys Asp Phe Leu Ala Tyr Glu Cys 525 530 535 GGC AAA GTG GAT TTA ATT GTG GGC AAT CCG CCT TTT GGC AAA ACG CAT 1745 Gly Lys Val Asp Leu He Val Gly Asn Pro Pro Phe Gly Lys Thr His 540 545 550
GAA AGA TTC AAA GAT TAT AGT TTA AGA CTC ACT CAT TTA GCA GGG ATT 1793 Glu Arg Phe Lys Asp Tyr Ser Leu Arg Leu Thr His Leu Ala Gly He 555 560 565
TTT TTA GAA AAG TCT TTA AAA CTA GCC AAC TTT ACA GCG ATG GTT ATG 1841 Phe Leu Glu Lys Ser Leu Lys Leu Ala Asn Phe Thr Ala Met Val Met 570 575 580
CCT AAA AAC CTT TTA AAC ACT AAA GAG TAT GCA GAA ACT AGA ACT AAG 1889 Pro Lys Asn Leu Leu Asn Thr Lys Glu Tyr Ala Glu Thr Arg Thr Lys 585 590 595 600
CTT GAA AAA AAG GGA GTA GGA GCG ATT TTA GAC TTT GGC GAG CTT GGT 1937 Leu Glu Lys Lys Gly Val Gly Ala He Leu Asp Phe Gly Glu Leu Gly 605 610 615
TTT AAG GGT GTT TTG GTA GAA ACA ATT GCT ATT GTT ACA CAA AAA TCA 1985 Phe Lys Gly Val Leu Val Glu Thr He Ala He Val Thr Gin Lys Ser 620 625 630
AAA GAA GTT TTA GCG CGT TCG TTA CCC CTA AAT CTA AGC ATC AAG CAA 2033 Lys Glu Val Leu Ala Arg Ser Leu Pro Leu Asn Leu Ser He Lys Gin 635 640 645
AAG CCA AGC TAT ATT TTT GAC AAA CAA TTG CCC TAT TGG GTT ATC TAT 2081 Lys Pro Ser Tyr He Phe Asp Lys Gin Leu Pro Tyr Trp Val He Tyr 650 655 660
CGC AAC GCT TTT TTT GAT AAG GTG TTT CAT TCC ATG CAG TTT GGT CTT 2129 Arg Asn Ala Phe Phe Asp Lys Val Phe His Ser Met Gin Phe Gly Leu 665 670 675 680
TTT GAA GTG TTT AGA GAC AGA CAA ATC ACT AAT TCT GTG TTG GTT AAA 2177 Phe Glu Val Phe Arg Asp Arg Gin He Thr Asn Ser Val Leu Val Lys 685 690 695
AAT GGT ATT CGT GTG ATT AAA TCT CGC AAT ATT GAT GAA AAC GGA AAG 2225 Asn Gly He Arg Val He Lys Ser Arg Asn He Asp Glu Asn Gly Lys 700 705 710
ATT ATT AGC ATT GAA AAT TAC GAT AGC TAC ATT CAA AAA GAG GTT TTA 2273 He He Ser He Glu Asn Tyr Asp Ser Tyr He Gin Lys Glu Val Leu 715 720 725
AGT CCG TTT AAG ATA GCT TCA TTT TTA GAC AGA GAT GAT GTC TAT TTA 2321 Ser Pro Phe Lys He Ala Ser Phe Leu Asp Arg Asp Asp Val Tyr Leu 730 735 740
ACC CCC AAC ATG ACC TAT AAG CCA AGG ATT TTA AAA AAA GAA AAA GGC 2369 Thr Pro Asn Met Thr Tyr Lys Pro Arg He Leu Lys Lys Glu Lys Gly 745 750 755 760 TAT GTG GTT AAT GGC TCT GTG GCT ATT TTA ATC CCT AAA AAC CCC ATA 2417 Tyr Val Val Asn Gly Ser Val Ala He Leu He Pro Lys Asn Pro He 765 770 775
TCT TTA AGC AAG AAA CAA TGC GAT TAT ATC TCT AGC GTT GAA TTT AGA 2465 Ser Leu Ser Lys Lys Gin Cys Asp Tyr He Ser Ser Val Glu Phe Arg 780 785 790
GAT TTT TAT AAA ATC GCT AGG AAT TAT CAA ACG CGC ACC TTA AAT ATT 2513 Asp Phe Tyr Lys He Ala Arg Asn Tyr Gin Thr Arg Thr Leu Asn He 795 800 805
GAT AGC ATG AGT TGT TTT TGG TTT GGA ATT TTA AGG AGT AGC TTA TGAAA 2563 Asp Ser Met Ser Cys Phe Trp Phe Gly He Leu Arg Ser Ser Leu 810 815 820
GAGCAATCAA TGATTGATTT TTTAAAACTT AGAGATTATG ACATTAG 2610
(2) INFORMATION FOR SEQ ID NO: 614:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 823 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614:
Met Leu Phe Asp Gin Thr Leu Thr Tyr He Ser Leu Phe Ser Gly Ala
1 5 10 15
Gly Val Gly Cys Tyr Gly Leu Leu Glu Glu Gly Phe Glu Cys Val Ala
20 25 30
Thr Asn Glu He Leu Glu Lys Arg Leu Asn He Gin Arg He Asn Arg
35 40 45
Lys Cys Lys Leu Asp Glu Ser Tyr He Ser Gly Asp He Lys Lys Pro
50 55 60
Glu Thr Lys Glu Lys He Leu Lys Gin He Glu Phe Tyr Ser Lys Lys 65 70 75 80
Phe Gly Asn Asp Arg Val Asp Leu Val Val Ala Thr Pro Pro Cys Gin
85 90 95
Gly Met Ser Val Ala Asn His Lys Lys Lys Asn Asp Glu He Lys Arg
100 105 110
Asn Ser Leu Val Val Glu Ser He Asp Leu He Lys Gin He Lys Pro
115 120 125
Arg Phe Phe He Leu Glu Asn Val Pro Ser Phe Tyr Lys Thr Gly Cys
130 135 140
He Asp Lys Asn Asp Asn Leu Leu Glu He Gly Ser Met He Glu Gin 145 150 155 160
Asn Leu Ser Gly Asp Tyr Met Leu Tyr Asp Glu Val He Asn Phe Lys
165 170 175
Asn Phe Gly Ala Asn Ser Ser Arg Thr Arg Thr Leu Val He Gly Val 180 185 190 Cys Lys Glu Phe Lys Asp Phe He Ser Ala Leu Glu Phe Phe Pro Asp
195 200 205
Phe Lys Gin Glu Lys Thr Leu Lys Glu Val He Gly Ser Leu Lys Pro
210 215 220
Leu Ala Trp Gly Glu Tyr Asp Asn Thr Asp Phe Tyr His Ser Phe Arg 225 230 235 240
Thr Tyr Pro Lys His Met Gin Glu Trp He Lys Asp Leu Lys Glu Gly
245 250 255
Gin Ser Ala Phe Glu Asn Thr Glu Leu Asn Lys Lys Pro His Arg He
260 265 270
Val Gly Ser Lys He Val Leu Asn Val Ser Lys Asn Gly Asp Lys Tyr
275 280 285
Lys Arg Gin Lys Tyr His Ser Val Ala Pro Cys He His Thr Arg Asn
290 295 300
Asp Gin Met Ala Ser Gin Asn Thr He His Pro Lys Asp Asp Arg Val 305 310 315 320
Phe Ser He Arg Glu Leu Met Leu Leu Met Asn He Pro Ser Arg Phe
325 330 335
Lys Trp Leu Asp Leu Glu Leu Gin Glu Leu Asn Ala Leu Asn Gin Gin
340 345 350
Glu Lys Glu Lys He Ser Lys Gin Asn Glu Met Asn He Arg Gin Ser
355 360 365
He Gly Glu Ala Val Pro Thr He He Phe Lys Gin He Ala He Lys
370 375 380
He Lys Asn Phe Met Ser Gin Thr His Leu Glu Pro Lys Glu He He 385 390 395 400
Arg Leu He Asp Val His His Leu Leu Glu Pro Gin Asn Leu Lys Arg
405 410 415
Phe He Leu Glu Asn Gin Asn Lys He Ala Arg Ala Ser Leu Val Ser
420 425 430
Leu Ala Glu Met Ser Asn Ser Lys Arg He Glu Lys Ser Ala Tyr Phe
435 440 445
Thr Asn Pro Phe He He Asn Glu He Ala Lys Leu Leu Pro Ser Phe
450 455 460
Lys Gin Glu Ser Val Thr He He Glu Pro Ser Ala Gly Cys Gly Asn 465 470 475 480
Phe Leu Ser Ala Leu Phe Lys Lys Tyr Thr Ser Val Lys Lys Val Tyr
485 490 495
Leu Lys Cys He Asp He Asp Lys Asn Ser Leu Glu He Leu Glu He
500 505 510
Leu Tyr Lys Asp Cys He Pro Asn Asn Phe Glu Met Glu Leu He Cys
515 520 525
Lys Asp Phe Leu Ala Tyr Glu Cys Gly Lys Val Asp Leu He Val Gly
530 535 540
Asn Pro Pro Phe Gly Lys Thr His Glu Arg Phe Lys Asp Tyr Ser Leu 545 550 555 560
Arg Leu Thr His Leu Ala Gly He Phe Leu Glu Lys Ser Leu Lys Leu
565 570 575
Ala Asn Phe Thr Ala Met Val Met Pro Lys Asn Leu Leu Asn Thr Lys
580 585 590
Glu Tyr Ala Glu Thr Arg Thr Lys Leu Glu Lys Lys Gly Val Gly Ala
595 600 605
He Leu Asp Phe Gly Glu Leu Gly Phe Lys Gly Val Leu Val Glu Thr
610 615 620
He Ala He Val Thr Gin Lys Ser Lys Glu Val Leu Ala Arg Ser Leu 625 630 635 640
Pro Leu Asn Leu Ser He Lys Gin Lys Pro Ser Tyr He Phe Asp Lys
645 650 655
Gin Leu Pro Tyr Trp Val He Tyr Arg Asn Ala Phe Phe Asp Lys Val
660 665 670
Phe His Ser Met Gin Phe Gly Leu Phe Glu Val Phe Arg Asp Arg Gin
675 680 685
He Thr Asn Ser Val Leu Val Lys Asn Gly He Arg Val He Lys Ser
690 695 700
Arg Asn He Asp Glu Asn Gly Lys He He Ser He Glu Asn Tyr Asp 705 710 715 720
Ser Tyr He Gin Lys Glu Val Leu Ser Pro Phe Lys He Ala Ser Phe
725 730 735
Leu Asp Arg Asp Asp Val Tyr Leu Thr Pro Asn Met Thr Tyr Lys Pro
740 745 750
Arg He Leu Lys Lys Glu Lys Gly Tyr Val Val Asn Gly Ser Val Ala
755 760 765
He Leu He Pro Lys Asn Pro He Ser Leu Ser Lys Lys Gin Cys Asp
770 775 780
Tyr He Ser Ser Val Glu Phe Arg Asp Phe Tyr Lys He Ala Arg Asn 785 790 795 800
Tyr Gin Thr Arg Thr Leu Asn He Asp Ser Met Ser Cys Phe Trp Phe
805 810 815
Gly He Leu Arg Ser Ser Leu 820
(2) INFORMATION FOR SEQ ID NO: 615:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3666 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 54...3608 (D) OTHER INFORMATION:
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 615:
CCCGCTTAAA ACATGCTACA ATCAAGTCAA ATTCTTAAAT AAAAGGTAAG CTC ATG 56
Met 1
CAA AAA ATC ATT GAC GAT TCG CTA GAA TTA GCT AAA AAA CTG CAA GAT 104 Gin Lys He He Asp Asp Ser Leu Glu Leu Ala Lys Lys Leu Gin Asp 5 10 15
AGT ATC AGT AAC CAT TTG AGC GAT CAG GAA AAA GCG TTC CAC TCT AAA 152 Ser He Ser Asn His Leu Ser Asp Gin Glu Lys Ala Phe His Ser Lys 20 25 30
ATG CAA AAG CTT TTA AAC AAC CCT GAA AAC AAA GTC ATG CTC ATA GAG 200 Met Gin Lys Leu Leu Asn Asn Pro Glu Asn Lys Val Met Leu He Glu 35 40 45
CTT ATG GAT CGG AGT TTC AGG TGC TTG GAC AAT AAA GCC CGC TTT GAA 248 Leu Met Asp Arg Ser Phe Arg Cys Leu Asp Asn Lys Ala Arg Phe Glu 50 55 60 65
ATG ATT GAG CAT GTT TTA GAC AAA TAC AAA AGC CGT GAG ATT TTT TCT 296 Met He Glu His Val Leu Asp Lys Tyr Lys Ser Arg Glu He Phe Ser 70 75 80
CCG TTT GAA AAA GTG CTT TTA ATG GGG TTT TTA AGC TTT GGG AAA ATG 344 Pro Phe Glu Lys Val Leu Leu Met Gly Phe Leu Ser Phe Gly Lys Met 85 90 95
CTC CCT GAT ATG AGC GTG CCT TTC TTT GTC AAT AAA ATC AGA AGC GAC 392 Leu Pro Asp Met Ser Val Pro Phe Phe Val Asn Lys He Arg Ser Asp 100 105 110
ACG AAA GCG ATG GTC TTG GAT CAA GAA GAG AGC CAG TTA AAA GAG CGG 440 Thr Lys Ala Met Val Leu Asp Gin Glu Glu Ser Gin Leu Lys Glu Arg 115 120 125
ATT TTA AAA AGA AAA AAT GAA AAA ATC ATT TTG AAT GTG AAT TTT ATT 488 He Leu Lys Arg Lys Asn Glu Lys He He Leu Asn Val Asn Phe He 130 135 140 145
GGC GAA GAG GTT TTA GGC GAA GAA GAA GCT AAT GCG CGT TTT GAA AAA 536 Gly Glu Glu Val Leu Gly Glu Glu Glu Ala Asn Ala Arg Phe Glu Lys 150 155 160
TAC TCT CAA GCC CTA AAA TCC AAC TAC ATC CAA TAC ATT TCC ATT AAA 584 Tyr Ser Gin Ala Leu Lys Ser Asn Tyr He Gin Tyr He Ser He Lys 165 170 175
ATC ACG ACG ATT TTT TCT CAA ATC AAT ATC CTT GAT TTT GAA TAC TCT 632 He Thr Thr He Phe Ser Gin He Asn He Leu Asp Phe Glu Tyr Ser 180 185 190
AAA AAA GAG ATT GTC AAA CGC CTA GAC GCT CTT TAC GCC CTG GCT TTA 680 Lys Lys Glu He Val Lys Arg Leu Asp Ala Leu Tyr Ala Leu Ala Leu 195 200 205
GAA GAA GAA AAA AAA CAA GGC ATG CCT AAA TTC ATC AAT TTG GAT ATG 728 Glu Glu Glu Lys Lys Gin Gly Met Pro Lys Phe He Asn Leu Asp Met 210 215 220 225
GAA GAA TTT AGG GAT TTA GAG CTA ACA GTG GAG TCG TTT ATG GAA TCC 776 Glu Glu Phe Arg Asp Leu Glu Leu Thr Val Glu Ser Phe Met Glu Ser 230 235 240 ATC GCT AAA TTT GAT TTG AAC GCT GGT ATT GTG CTG CAA GCC TAT ATT 824 He Ala Lys Phe Asp Leu Asn Ala Gly He Val Leu Gin Ala Tyr He 245 250 255
CCG GAT TCT TAT GAA TAT TTG AAA AAA CTG CAC GCT TTT TCT AAA GAA 872 Pro Asp Ser Tyr Glu Tyr Leu Lys Lys Leu His Ala Phe Ser Lys Glu 260 265 270
AGG GTT TTA AAA GGG TTG AAG CCC ATT AAA ATC CGC TTT GTT AAG GGA 920 Arg Val Leu Lys Gly Leu Lys Pro He Lys He Arg Phe Val Lys Gly 275 280 285
GCG AAC ATG GAG AGC GAA GAA ACT ATC GCT TCC GTG AAA GAT TGG GCG 968 Ala Asn Met Glu Ser Glu Glu Thr He Ala Ser Val Lys Asp Trp Ala 290 295 300 305
TTA CCC ACA TTT TCC AAT AAG CAA GAC ACC GAT TCT AAT TAC AAT AAA 1016 Leu Pro Thr Phe Ser Asn Lys Gin Asp Thr Asp Ser Asn Tyr Asn Lys 310 315 320
ATG TTG GAT TTT GTT TTA GAG GGC GAT AAT TAT AAA TAC ATT CAT ATT 1064 Met Leu Asp Phe Val Leu Glu Gly Asp Asn Tyr Lys Tyr He His He 325 330 335
GGC GCA GCG AGT CAT AAT ATT TTT GAA ATC GCT TAT GTC TAT ACG CGT 1112 Gly Ala Ala Ser His Asn He Phe Glu He Ala Tyr Val Tyr Thr Arg 340 345 350
ATC CAT GCC ATT AAT GAT CCT GTT GTG TTA GAG CAT TTC AGC TTT GAA 1160 He His Ala He Asn Asp Pro Val Val Leu Glu His Phe Ser Phe Glu 355 360 365
ATG CTA GAG GGC ATG AGT TTG CAA GCG AGC CAG GAA CTA AAA GAG ATG 1208 Met Leu Glu Gly Met Ser Leu Gin Ala Ser Gin Glu Leu Lys Glu Met 370 375 380 385
CAC AAG CTC ATT CTT TAT GCG CCG GTG TGC GAT GAA GCG CAT TTT AAC 1256 His Lys Leu He Leu Tyr Ala Pro Val Cys Asp Glu Ala His Phe Asn 390 395 400
AAT GCG ATC GCT TAC TTG GTG AGG AGG TTA GAC GAA AAC ACC TCA AGC 1304 Asn Ala He Ala Tyr Leu Val Arg Arg Leu Asp Glu Asn Thr Ser Ser 405 410 415
GAT AAT TTC ATG AAG GCT TTC TTT AAC CTC AAA GTA GGC ACG AGC GAA 1352 Asp Asn Phe Met Lys Ala Phe Phe Asn Leu Lys Val Gly Thr Ser Glu 420 425 430
TGG AAA GAT CAA GAA CAA CGC TTT TTA AAC AGC CTT AAA GGA ATT GCC 1400 Trp Lys Asp Gin Glu Gin Arg Phe Leu Asn Ser Leu Lys Gly He Ala 435 440 445
ACT TTA GAC AAT GCC ACC CAT AGG ACT CAA GAT AGG AAC GCC AAA CAA 1448 Thr Leu Asp Asn Ala Thr His Arg Thr Gin Asp Arg Asn Ala Lys Gin 450 455 460 465 AGC GGG CAT ACC ACT TAC CCA AAC CAC TCC TTT AAA AAC GAA AGC GAT 1496 Ser Gly His Thr Thr Tyr Pro Asn His Ser Phe Lys Asn Glu Ser Asp 470 475 480
ACC GAT TTT ATT TTA AAA GCC AAC CGA GAA TGG GCT AAA AAA GTG CGC 1544 Thr Asp Phe He Leu Lys Ala Asn Arg Glu Trp Ala Lys Lys Val Arg 485 490 495
GAG AAA ATG CGT AAC GCT CCT ATT TTA GAG CTT TAC CCA GAG ATG GAT 1592 Glu Lys Met Arg Asn Ala Pro He Leu Glu Leu Tyr Pro Glu Met Asp 500 505 510
GGG AGG TTT GAA GAT CCT AAT CTA ACC CCT TTA GAA GTC TTT GAT AGA 1640 Gly Arg Phe Glu Asp Pro Asn Leu Thr Pro Leu Glu Val Phe Asp Arg 515 520 525
ATC CAT CAT AAA AAA ATC GCT AGC GTG CAT TTA GCG GAT AAG GAA GCG 1688 He His His Lys Lys He Ala Ser Val His Leu Ala Asp Lys Glu Ala 530 535 540 545
ATT TTA AAA GCC CTA GAA GTG GCT AAA AGC GAT AAG AGC CGT TTC AGT 1736 He Leu Lys Ala Leu Glu Val Ala Lys Ser Asp Lys Ser Arg Phe Ser 550 555 560
CAA AAA AGC TTT ACA GAA ATC CAT GCC TTA ATG AGT CAA ACC GCC CAG 1784 Gin Lys Ser Phe Thr Glu He His Ala Leu Met Ser Gin Thr Ala Gin 565 570 575
CTT TTT AGA GAA AGA AGA GGC GAT TTG ATA GGG ATT TCC GCT TTA GAA 1832 Leu Phe Arg Glu Arg Arg Gly Asp Leu He Gly He Ser Ala Leu Glu 580 585 590
GTG GGT AAG ACT TTC GCT GAA ACG GAC GCT GAA GTG AGC GAA GCC ATT 1880 Val Gly Lys Thr Phe Ala Glu Thr Asp Ala Glu Val Ser Glu Ala He 595 600 605
GAC TTT TTA GAG TTT TAC CCT TAC AGC TTA AGG GTG TTG CAA GAG CAA 1928 Asp Phe Leu Glu Phe Tyr Pro Tyr Ser Leu Arg Val Leu Gin Glu Gin 610 615 620 625
AAC ACA AAA ACG CAA TTC ACC CCT AAA GGC GTG GGC GTG GTC ATT GCC 1976 Asn Thr Lys Thr Gin Phe Thr Pro Lys Gly Val Gly Val Val He Ala 630 635 640
CCA TGG AAT TTC CCT GTG GGC ATT TCT GTA GGC ACT ATC GCT GCC CCC 2024 Pro Trp Asn Phe Pro Val Gly He Ser Val Gly Thr He Ala Ala Pro 645 650 655
CTA GCT ACG GGC AAT CGG GTG ATT TAC AAG CCC TCA AGT TTG TCT AGC 2072 Leu Ala Thr Gly Asn Arg Val He Tyr Lys Pro Ser Ser Leu Ser Ser 660 665 670
GTA ACG GGC TAT AAG CTT TGT GAG TGC TTT TGG GAT GCG GGC GTG CCT 2120 Val Thr Gly Tyr Lys Leu Cys Glu Cys Phe Trp Asp Ala Gly Val Pro 675 680 685 AGA GAT GCG CTC ATT TAC TTG CCC TCT AAA GGG AGC GAT ATT AGC GAA 2168 Arg Asp Ala Leu He Tyr Leu Pro Ser Lys Gly Ser Asp He Ser Glu 690 695 700 705
CAT CTT TTA AGA GAT GAA AGC ATC CAG TTT GCC ATT TTA ACC GGG GGC 2216 His Leu Leu Arg Asp Glu Ser He Gin Phe Ala He Leu Thr Gly Gly 710 715 720
GAA GAC ACC GCT TAT AAA ATG TTA AAA GCT AAC CCC ACT TTA GCC TTG 2264 Glu Asp Thr Ala Tyr Lys Met Leu Lys Ala Asn Pro Thr Leu Ala Leu 725 730 735
AGC GCT GAA ACA GGC GGT AAA AAC GCC ACC ATT GTG AGC AAA ATG GCA 2312 Ser Ala Glu Thr Gly Gly Lys Asn Ala Thr He Val Ser Lys Met Ala 740 745 750
GAC AGA GAC CAG GCG ATT AAG AAT GTT ATC CAT TCA GCT TTT AGC AAT 2360 Asp Arg Asp Gin Ala He Lys Asn Val He His Ser Ala Phe Ser Asn 755 760 765
TCG GGG CAA AAA TGC TCC GCC ACT TCG CTT TTA GTA TTA GAA AAA GAA 2408 Ser Gly Gin Lys Cys Ser Ala Thr Ser Leu Leu Val Leu Glu Lys Glu 770 775 780 785
GTC TAT GAA GAT GAG AAC TTT AAA AAG ACT CTA ATA GAT GCG ACT CTA 2456 Val Tyr Glu Asp Glu Asn Phe Lys Lys Thr Leu He Asp Ala Thr Leu 790 795 800
AGC CTT AGC GTG GGC GAT CCT TTT GAT TTC AAA AAC AAA ATC GGC GCT 2504 Ser Leu Ser Val Gly Asp Pro Phe Asp Phe Lys Asn Lys He Gly Ala 805 810 815
CTA GCG GAC AAG CCT AAT GAA AAG GTC ATC AAA GCC ATA GAT GAA TTA 2552 Leu Ala Asp Lys Pro Asn Glu Lys Val He Lys Ala He Asp Glu Leu 820 825 830
AAA AGC TAT GAA AAT TAC GAA ATC CCG GTA AGC TTT GTC AAT GAT AAC 2600 Lys Ser Tyr Glu Asn Tyr Glu He Pro Val Ser Phe Val Asn Asp Asn 835 840 845
CCC TAT TTG ATG AAG CCA AGC ATC AAA TAC GGC ACT AAA AAA GGC GAT 2648 Pro Tyr Leu Met Lys Pro Ser He Lys Tyr Gly Thr Lys Lys Gly Asp 850 855 860 865
TTC ACG CAC CAA ACT GAG CTT TTT ACG CCC ATT TTA TCC GTG ATG GAA 2696 Phe Thr His Gin Thr Glu Leu Phe Thr Pro He Leu Ser Val Met Glu 870 875 880
GCA AAA GAT TTA GAC GAA GCG ATA GAA ATA GCC AAT TCT ACC GGT TAC 2744 Ala Lys Asp Leu Asp Glu Ala He Glu He Ala Asn Ser Thr Gly Tyr 885 890 895
GGG CTG ACT AGC GCG TTA GAG TCG TTG GAC GAA AGG GAG TGG GAA TAT 2792 Gly Leu Thr Ser Ala Leu Glu Ser Leu Asp Glu Arg Glu Trp Glu Tyr 900 905 910 TAT TTA GAA CGC ATT GAA GCC GGT AAT ATC TAT ATC AAC AAG CCC ACC 2840 Tyr Leu Glu Arg He Glu Ala Gly Asn He Tyr He Asn Lys Pro Thr 915 920 925
ACA GGA GCG ATT GTC TTG CGC CAG CCT TTT GGG GGG GTT AAA AAA TCC 2888 Thr Gly Ala He Val Leu Arg Gin Pro Phe Gly Gly Val Lys Lys Ser 930 935 940 945
GCT GTG GGG TTT GGG AGG AAA GTA GGC ATT TTC AAC TAT ATC ACG CAA 2936 Ala Val Gly Phe Gly Arg Lys Val Gly He Phe Asn Tyr He Thr Gin 950 955 960
TTT GTG AAT ATC TGC CAA GAA GAA GAA GAC GAA AAC GCC TTA AAA AAC 2984 Phe Val Asn He Cys Gin Glu Glu Glu Asp Glu Asn Ala Leu Lys Asn 965 970 975
CCC TTA AGC GAA GCC TTA GAA AAC TTA ACT CAA AAA GGC TAT GAT GAG 3032 Pro Leu Ser Glu Ala Leu Glu Asn Leu Thr Gin Lys Gly Tyr Asp Glu 980 985 990
CAT ACG CAT GAG TTG AAG CGC GCG ATT TTT ATG GCA AAA AGC TAC GCT 3080 His Thr His Glu Leu Lys Arg Ala He Phe Met Ala Lys Ser Tyr Ala 995 1000 1005
TAT CAT TAC AAA CAT GAA TTC AGC CAA ACT AAA GAC TAT GTC AAA ATC 3128 Tyr His Tyr Lys His Glu Phe Ser Gin Thr Lys Asp Tyr Val Lys He 1010 1015 1020 1025
AGA GGC GAA GAC AAC CTT TTT TCC TAC ACT AAA GTT AAA AGC GTG GGC 3176 Arg Gly Glu Asp Asn Leu Phe Ser Tyr Thr Lys Val Lys Ser Val Gly 1030 1035 1040
TAT CGC ATC ACC GAA AAG GAC ACT TTA AGC GAC ATG TTA GGC GTT GCT 3224 Tyr Arg He Thr Glu Lys Asp Thr Leu Ser Asp Met Leu Gly Val Ala 1045 1050 1055
TTA GCA TGT TTA ATT TCT CAA ATC CCT TTA ACG CTC AGC ATA GAA AAC 3272 Leu Ala Cys Leu He Ser Gin He Pro Leu Thr Leu Ser He Glu Asn 1060 1065 1070
GAA CGA ACG AAC AAA GAT TTA ACC TTT TTC TTA GAA TGC TTA AAA GCG 3320 Glu Arg Thr Asn Lys Asp Leu Thr Phe Phe Leu Glu Cys Leu Lys Ala 1075 1080 1085
CTC CAA GCA AGC GCC CCT ATT GTT TAT GAA AGC TTG CAA AAA TTT AGC 3368 Leu Gin Ala Ser Ala Pro He Val Tyr Glu Ser Leu Gin Lys Phe Ser 1090 1095 1100 1105
GAG AAA TTG AAT ACT TTC AAT CGT GTC CGT TAT CTC AAA AGC GAT TTG 3416 Glu Lys Leu Asn Thr Phe Asn Arg Val Arg Tyr Leu Lys Ser Asp Leu 1110 1115 1120
GAT TTA TTG CAC GAA CAA GCG AGC GCT TTA GGG ATG GTT TTA GCC ACG 3464 Asp Leu Leu His Glu Gin Ala Ser Ala Leu Gly Met Val Leu Ala Thr 1125 1130 1135 GCT AAA CCA TGC CTA AAT GGG CGT TTT GAA TTG CTG TAT TAC CAC TTA 3512 Ala Lys Pro Cys Leu Asn Gly Arg Phe Glu Leu Leu Tyr Tyr His Leu 1140 1145 1150
GAG CGA TCG GTT AGC ATC TCT TAT CAC CGT TAT GGG AAT TTA GGC TCA 3560 Glu Arg Ser Val Ser He Ser Tyr His Arg Tyr Gly Asn Leu Gly Ser 1155 1160 1165
AGG GTT TTA AGG CAA CCC ACT TGC CAC AAA TCA TGC TGT GCT GAA AAA T 3609 Arg Val Leu Arg Gin Pro Thr Cys His Lys Ser Cys Cys Ala Glu Lys 1170 1175 1180 1185
AAATATTGTA TTAAATAAGG AGATCAAAAT GGGACATGTT GTTTTAAGTA CCCCTAT 3666
(2) INFORMATION FOR SEQ ID NO: 616:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1185 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616:
Met Gin Lys He He Asp Asp Ser Leu Glu Leu Ala Lys Lys Leu Gin
1 5 10 15
Asp Ser He Ser Asn His Leu Ser Asp Gin Glu Lys Ala Phe His Ser
20 25 30
Lys Met Gin Lys Leu Leu Asn Asn Pro Glu Asn Lys Val Met Leu He
35 40 45
Glu Leu Met Asp Arg Ser Phe Arg Cys Leu Asp Asn Lys Ala Arg Phe
50 55 60
Glu Met He Glu His Val Leu Asp Lys Tyr Lys Ser Arg Glu He Phe 65 70 75 80
Ser Pro Phe Glu Lys Val Leu Leu Met Gly Phe Leu Ser Phe Gly Lys
85 90 95
Met Leu Pro Asp Met Ser Val Pro Phe Phe Val Asn Lys He Arg Ser
100 105 110
Asp Thr Lys Ala Met Val Leu Asp Gin Glu Glu Ser Gin Leu Lys Glu
115 120 125
Arg He Leu Lys Arg Lys Asn Glu Lys He He Leu Asn Val Asn Phe
130 135 140
He Gly Glu Glu Val Leu Gly Glu Glu Glu Ala Asn Ala Arg Phe Glu 145 150 155 160
Lys Tyr Ser Gin Ala Leu Lys Ser Asn Tyr He Gin Tyr He Ser He
165 170 175
Lys He Thr Thr He Phe Ser Gin He Asn He Leu Asp Phe Glu Tyr
180 185 190
Ser Lys Lys Glu He Val Lys Arg Leu Asp Ala Leu Tyr Ala Leu Ala
195 200 205
Leu Glu Glu Glu Lys Lys Gin Gly Met Pro Lys Phe He Asn Leu Asp 210 215 220 Met Glu Glu Phe Arg Asp Leu Glu Leu Thr Val Glu Ser Phe Met Glu 225 230 235 240
Ser He Ala Lys Phe Asp Leu Asn Ala Gly He Val Leu Gin Ala Tyr
245 250 255
He Pro Asp Ser Tyr Glu Tyr Leu Lys Lys Leu His Ala Phe Ser Lys
260 265 270
Glu Arg Val Leu Lys Gly Leu Lys Pro He Lys He Arg Phe Val Lys
275 280 285
Gly Ala Asn Met Glu Ser Glu Glu Thr He Ala Ser Val Lys Asp Trp
290 295 300
Ala Leu Pro Thr Phe Ser Asn Lys Gin Asp Thr Asp Ser Asn Tyr Asn 305 310 315 320
Lys Met Leu Asp Phe Val Leu Glu Gly Asp Asn Tyr Lys Tyr He His
325 330 335
He Gly Ala Ala Ser His Asn He Phe Glu He Ala Tyr Val Tyr Thr
340 345 350
Arg He His Ala He Asn Asp Pro Val Val Leu Glu His Phe Ser Phe
355 360 365
Glu Met Leu Glu Gly Met Ser Leu Gin Ala Ser Gin Glu Leu Lys Glu
370 375 380
Met His Lys Leu He Leu Tyr Ala Pro Val Cys Asp Glu Ala His Phe 385 390 395 400
Asn Asn Ala He Ala Tyr Leu Val Arg Arg Leu Asp Glu Asn Thr Ser
405 410 415
Ser Asp Asn Phe Met Lys Ala Phe Phe Asn Leu Lys Val Gly Thr Ser
420 425 430
Glu Trp Lys Asp Gin Glu Gin Arg Phe Leu Asn Ser Leu Lys Gly He
435 440 445
Ala Thr Leu Asp Asn Ala Thr His Arg Thr Gin Asp Arg Asn Ala Lys
450 455 460
Gin Ser Gly His Thr Thr Tyr Pro Asn His Ser Phe Lys Asn Glu Ser 465 470 475 480
Asp Thr Asp Phe He Leu Lys Ala Asn Arg Glu Trp Ala Lys Lys Val
485 490 495
Arg Glu Lys Met Arg Asn Ala Pro He Leu Glu Leu Tyr Pro Glu Met
500 505 510
Asp Gly Arg Phe Glu Asp Pro Asn Leu Thr Pro Leu Glu Val Phe Asp
515 520 525
Arg He His His Lys Lys He Ala Ser Val His Leu Ala Asp Lys Glu
530 535 540
Ala He Leu Lys Ala Leu Glu Val Ala Lys Ser Asp Lys Ser Arg Phe 545 550 555 560
Ser Gin Lys Ser Phe Thr Glu He His Ala Leu Met Ser Gin Thr Ala
565 570 575
Gin Leu Phe Arg Glu Arg Arg Gly Asp Leu He Gly He Ser Ala Leu
580 585 590
Glu Val Gly Lys Thr Phe Ala Glu Thr Asp Ala Glu Val Ser Glu Ala
595 600 605
He Asp Phe Leu Glu Phe Tyr Pro Tyr Ser Leu Arg Val Leu Gin Glu
610 615 620
Gin Asn Thr Lys Thr Gin Phe Thr Pro Lys Gly Val Gly Val Val He 625 630 635 640
Ala Pro Trp Asn Phe Pro Val Gly He Ser Val Gly Thr He Ala Ala
645 650 655
Pro Leu Ala Thr Gly Asn Arg Val He Tyr Lys Pro Ser Ser Leu Ser 660 665 670
Ser Val Thr Gly Tyr Lys Leu Cys Glu Cys Phe Trp Asp Ala Gly Val
675 680 685
Pro Arg Asp Ala Leu He Tyr Leu Pro Ser Lys Gly Ser Asp He Ser
690 695 700
Glu His Leu Leu Arg Asp Glu Ser He Gin Phe Ala He Leu Thr Gly 705 710 715 720
Gly Glu Asp Thr Ala Tyr Lys Met Leu Lys Ala Asn Pro Thr Leu Ala
725 730 735
Leu Ser Ala Glu Thr Gly Gly Lys Asn Ala Thr He Val Ser Lys Met
740 745 750
Ala Asp Arg Asp Gin Ala He Lys Asn Val He His Ser Ala Phe Ser
755 760 765
Asn Ser Gly Gin Lys Cys Ser Ala Thr Ser Leu Leu Val Leu Glu Lys
770 775 780
Glu Val Tyr Glu Asp Glu Asn Phe Lys Lys Thr Leu He Asp Ala Thr 785 790 795 800
Leu Ser Leu Ser Val Gly Asp Pro Phe Asp Phe Lys Asn Lys He Gly
805 810 815
Ala Leu Ala Asp Lys Pro Asn Glu Lys Val He Lys Ala He Asp Glu
820 825 830
Leu Lys Ser Tyr Glu Asn Tyr Glu He Pro Val Ser Phe Val Asn Asp
835 840 845
Asn Pro Tyr Leu Met Lys Pro Ser He Lys Tyr Gly Thr Lys Lys Gly
850 855 860
Asp Phe Thr His Gin Thr Glu Leu Phe Thr Pro He Leu Ser Val Met 865 870 875 880
Glu Ala Lys Asp Leu Asp Glu Ala He Glu He Ala Asn Ser Thr Gly
885 890 895
Tyr Gly Leu Thr Ser Ala Leu Glu Ser Leu Asp Glu Arg Glu Trp Glu
900 905 910
Tyr Tyr Leu Glu Arg He Glu Ala Gly Asn He Tyr He Asn Lys Pro
915 920 925
Thr Thr Gly Ala He Val Leu Arg Gin Pro Phe Gly Gly Val Lys Lys
930 935 940
Ser Ala Val Gly Phe Gly Arg Lys Val Gly He Phe Asn Tyr He Thr 945 950 955 960
Gin Phe Val Asn He Cys Gin Glu Glu Glu Asp Glu Asn Ala Leu Lys
965 970 975
Asn Pro Leu Ser Glu Ala Leu Glu Asn Leu Thr Gin Lys Gly Tyr Asp
980 985 990
Glu His Thr His Glu Leu Lys Arg Ala He Phe Met Ala Lys Ser Tyr
995 1000 1005
Ala Tyr His Tyr Lys His Glu Phe Ser Gin Thr Lys Asp Tyr Val Lys
1010 1015 1020
He Arg Gly Glu Asp Asn Leu Phe Ser Tyr Thr Lys Val Lys Ser Val 025 1030 1035 1040
Gly Tyr Arg He Thr Glu Lys Asp Thr Leu Ser Asp Met Leu Gly Val
1045 1050 1055
Ala Leu Ala Cys Leu He Ser Gin He Pro Leu Thr Leu Ser He Glu
1060 1065 1070
Asn Glu Arg Thr Asn Lys Asp Leu Thr Phe Phe Leu Glu Cys Leu Lys
1075 1080 1085
Ala Leu Gin Ala Ser Ala Pro He Val Tyr Glu Ser Leu Gin Lys Phe 1090 1095 1100 Ser Glu Lys Leu Asn Thr Phe Asn Arg Val Arg Tyr Leu Lys Ser Asp 105 1110 1115 1120
Leu Asp Leu Leu His Glu Gin Ala Ser Ala Leu Gly Met Val Leu Ala
1125 1130 1135
Thr Ala Lys Pro Cys Leu Asn Gly Arg Phe Glu Leu Leu Tyr Tyr His
1140 1145 1150
Leu Glu Arg Ser Val Ser He Ser Tyr His Arg Tyr Gly Asn Leu Gly
1155 1160 1165
Ser Arg Val Leu Arg Gin Pro Thr Cys His Lys Ser Cys Cys Ala Glu
1170 1175 1180
Lys 185
(2) INFORMATION FOR SEQ ID NO: 617:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 810 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 107...673 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617:
AGCATGAAGA AGTGGATGTG AAGGTGTGCA GTATAGATTC ACAAAGCATT AAAGTGGGGC 60 TGTTTAAAGA TAACCAATTA ATCTATGAAA GCGAGGCAGA AAAATT ATG ATG ACT 115
Met Met Thr 1
AAG AAC GCG TAT GCG TTT GTT GTG ATT GAA GAA AGC GTT ATG GTG TTT 163 Lys Asn Ala Tyr Ala Phe Val Val He Glu Glu Ser Val Met Val Phe 5 10 15
AAA CGC ACC AAA GAT GAG GGG TTA ATG CCT ATC TTT GAA GGC TTT GTG 211 Lys Arg Thr Lys Asp Glu Gly Leu Met Pro He Phe Glu Gly Phe Val 20 25 30 35
CCT TTA AAA GAG GGC TTT TTG AAA AGT TTT AAA GAG CGT TGC AAT TTG 259 Pro Leu Lys Glu Gly Phe Leu Lys Ser Phe Lys Glu Arg Cys Asn Leu 40 45 50
GAA TTT TTA GAA AAT TTA GAC CTT TTG TTT TTG TAT GAC AAA CCA TCC 307 Glu Phe Leu Glu Asn Leu Asp Leu Leu Phe Leu Tyr Asp Lys Pro Ser 55 60 65
GCA CAC GAG ATC TTT TCC TTG TGC AAG GAG CTG AAA AAT TCC ATC TGG 355 Ala His Glu He Phe Ser Leu Cys Lys Glu Leu Lys Asn Ser He Trp 70 75 80
GAC AGG AAG CTT GTG GTA GCG CTA GTG GAG GCT TTA GAG GGG TTT AAG 403 Asp Arg Lys Leu Val Val Ala Leu Val Glu Ala Leu Glu Gly Phe Lys 85 90 95
GAT TGG AAT TTG TCG CTT AAA ATA GAA GAC AAG CGT TCT AAC AGC TTG 451 Asp Trp Asn Leu Ser Leu Lys He Glu Asp Lys Arg Ser Asn Ser Leu 100 105 110 115
GGT AAT GGC ACC AAA AAA TTG CTC ACC AAC GCT GAT TTA GGG AGC GAC 499 Gly Asn Gly Thr Lys Lys Leu Leu Thr Asn Ala Asp Leu Gly Ser Asp 120 125 130
TAT AAA ACA ATC GTG ATA GAC AGC ATG AAA ACA TAC CAC CAA AGC CAG 547 Tyr Lys Thr He Val He Asp Ser Met Lys Thr Tyr His Gin Ser Gin 135 140 145
CAA GAA AAA TAT AAA AGA GAA AGA GGC GAA ACG CTA GAG GTT CGC CCC 595 Gin Glu Lys Tyr Lys Arg Glu Arg Gly Glu Thr Leu Glu Val Arg Pro 150 155 160
ACA ACA CCC CCT AGC TAT GGG GGT GGA AGC ATT AGA ATC AGC GGC GAT 643 Thr Thr Pro Pro Ser Tyr Gly Gly Gly Ser He Arg He Ser Gly Asp 165 170 175
AAA AAG CCT GAT TTT GAT GAA GAA AAT TTT TAAAAGAAAG GACAACCGAT GAG 696 Lys Lys Pro Asp Phe Asp Glu Glu Asn Phe 180 185
CAGAGTGCAA ATGGATACCG AAGAGGTCAG GGAATTTGTA GGGCATTTAG AACGCTTTAA 756 AGAGTTACTA AGAGAGGAAG TGAACAGCTT GAGTAATCAT TTCCATAATT TAGA 810
(2) INFORMATION FOR SEQ ID NO: 618:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 189 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618:
Met Met Thr Lys Asn Ala Tyr Ala Phe Val Val He Glu Glu Ser Val
1 5 10 15
Met Val Phe Lys Arg Thr Lys Asp Glu Gly Leu Met Pro He Phe Glu
20 25 30
Gly Phe Val Pro Leu Lys Glu Gly Phe Leu Lys Ser Phe Lys Glu Arg
35 40 45
Cys Asn Leu Glu Phe Leu Glu Asn Leu Asp Leu Leu Phe Leu Tyr Asp 50 55 60 Lys Pro Ser Ala His Glu He Phe Ser Leu Cys Lys Glu Leu Lys Asn 65 70 75 80
Ser He Trp Asp Arg Lys Leu Val Val Ala Leu Val Glu Ala Leu Glu
85 90 95
Gly Phe Lys Asp Trp Asn Leu Ser Leu Lys He Glu Asp Lys Arg Ser
100 105 110
Asn Ser Leu Gly Asn Gly Thr Lys Lys Leu Leu Thr Asn Ala Asp Leu
115 120 125
Gly Ser Asp Tyr Lys Thr He Val He Asp Ser Met Lys Thr Tyr His
130 135 140
Gin Ser Gin Gin Glu Lys Tyr Lys Arg Glu Arg Gly Glu Thr Leu Glu 145 150 155 160
Val Arg Pro Thr Thr Pro Pro Ser Tyr Gly Gly Gly Ser He Arg He
165 170 175
Ser Gly Asp Lys Lys Pro Asp Phe Asp Glu Glu Asn Phe 180 185
(2) INFORMATION FOR SEQ ID NO: 619:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 940 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 58...852 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:619:
AGAAGGCTTA GACGATGTGA TCGCTTGGAT CAAGCGCAAC GCTTTATTGG AAGATTG ATG 60
Met
1
AAC ACT TAC GCT CAA GAA TCC AAG CTC AGG TTA AAA ACC AAA ATA GGG 108 Asn Thr Tyr Ala Gin Glu Ser Lys Leu Arg Leu Lys Thr Lys He Gly 5 10 15
GCT GAT GGG CGG TGC GTG ATT GAA GAC AAT TTT TTC ACG CCC CCC TTT 156 Ala Asp Gly Arg Cys Val He Glu Asp Asn Phe Phe Thr Pro Pro Phe 20 25 30
AAG CTC ATG GCG CCC TTT TAC CCT AAA GAC GAT TTA GCG GAA ATC ATG 204 Lys Leu Met Ala Pro Phe Tyr Pro Lys Asp Asp Leu Ala Glu He Met 35 40 45
CTT TTA GCG GTA AGC CCT GGC ATG ATG AGG GGC GAT GCG CAA GAT GTG 252 Leu Leu Ala Val Ser Pro Gly Met Met Arg Gly Asp Ala Gin Asp Val 50 55 60 65
CAA TTA AAC ATC GGT CCA AAT TGC AAG TTA AGG ATC ACT TCG CAA TCC 300 Gin Leu Asn He Gly Pro Asn Cys Lys Leu Arg He Thr Ser Gin Ser 70 75 80
TTT GAA AAA ATC CAT AAC ACT GAA GAT GGG TTT GCC AGC AGA GAC ATG 348 Phe Glu Lys He His Asn Thr Glu Asp Gly Phe Ala Ser Arg Asp Met 85 90 95
CAT ATT GTT GTG GGG GAA AAC GCT TTT TTA GAT TTT GCG CCT TTC CCG 396 His He Val Val Gly Glu Asn Ala Phe Leu Asp Phe Ala Pro Phe Pro 100 105 110
TTA ATC CCC TTT GAA AAC GCG CAT TTT AAG GGC AAC ACC ACG ATT TCT 444 Leu He Pro Phe Glu Asn Ala His Phe Lys Gly Asn Thr Thr He Ser 115 120 125
TTG CGC TCT AGC TCT CAA TTG CTC TAT AGT GAA ATC ATT GTC GCA GGG 492 Leu Arg Ser Ser Ser Gin Leu Leu Tyr Ser Glu He He Val Ala Gly 130 135 140 145
CGA GTG GCG CGC AAT GAG TTG TTT AAA TTC AAC CGC TTG CAC ACC AAA 540 Arg Val Ala Arg Asn Glu Leu Phe Lys Phe Asn Arg Leu His Thr Lys 150 155 160
ATC TCT ATT TTA CAA GAT GAG AAA CCC ATC TAT TAT GAC AAC ACG ATT 588 He Ser He Leu Gin Asp Glu Lys Pro He Tyr Tyr Asp Asn Thr He 165 170 175
TTA GAT CCC AAA ACC ACC GAC TTA AAT AAC ATG TGC ATG TTT GAT GGC 636 Leu Asp Pro Lys Thr Thr Asp Leu Asn Asn Met Cys Met Phe Asp Gly 180 185 190
TAT ACG CAT TAT TTG AAT TTG GTG CTT GTC AAT TGC CCC ATA GAG CTC 684 Tyr Thr His Tyr Leu Asn Leu Val Leu Val Asn Cys Pro He Glu Leu 195 200 205
TCT GGT GTG CGA GAA TGC ATT GAA GAA AGC GAA GGG GTG GAT GGG GCA 732 Ser Gly Val Arg Glu Cys He Glu Glu Ser Glu Gly Val Asp Gly Ala 210 215 220 225
GTG AGT GAA ACC GCT AGT TCT CAT TTA TGC GTG AAA GCT TTA GCG AAA 780 Val Ser Glu Thr Ala Ser Ser His Leu Cys Val Lys Ala Leu Ala Lys 230 235 240
GGC TCA GAA CCC TTA TTG CAT TTA AGA GAA AAA ATC GCT CGC TTG GTT 828 Gly Ser Glu Pro Leu Leu His Leu Arg Glu Lys He Ala Arg Leu Val 245 250 255
ACG CAA ACC ACC ACG CAA AAG GTT TGAAAGCACT TCAAAAAGAT TAAAGTCCTT 882 Thr Gin Thr Thr Thr Gin Lys Val 260 265
TAGTCTTTTT ACTCCCCCCT TTTTTTGACC CTATAAGCTG AAAGGCCTGA ATTCAGTA 940 (2) INFORMATION FOR SEQ ID NO: 620:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 265 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620:
Met Asn Thr Tyr Ala Gin Glu Ser Lys Leu Arg Leu Lys Thr Lys He
1 5 10 15
Gly Ala Asp Gly Arg Cys Val He Glu Asp Asn Phe Phe Thr Pro Pro
20 25 30
Phe Lys Leu Met Ala Pro Phe Tyr Pro Lys Asp Asp Leu Ala Glu He
35 40 45
Met Leu Leu Ala Val Ser Pro Gly Met Met Arg Gly Asp Ala Gin Asp
50 55 60
Val Gin Leu Asn He Gly Pro Asn Cys Lys Leu Arg He Thr Ser Gin 65 70 75 80
Ser Phe Glu Lys He His Asn Thr Glu Asp Gly Phe Ala Ser Arg Asp
85 90 95
Met His He Val Val Gly Glu Asn Ala Phe Leu Asp Phe Ala Pro Phe
100 105 110
Pro Leu He Pro Phe Glu Asn Ala His Phe Lys Gly Asn Thr Thr He
115 120 125
Ser Leu Arg Ser Ser Ser Gin Leu Leu Tyr Ser Glu He He Val Ala
130 135 140
Gly Arg Val Ala Arg Asn Glu Leu Phe Lys Phe Asn Arg Leu His Thr 145 150 155 160
Lys He Ser He Leu Gin Asp Glu Lys Pro He Tyr Tyr Asp Asn Thr
165 170 175
He Leu Asp Pro Lys Thr Thr Asp Leu Asn Asn Met Cys Met Phe Asp
180 185 190
Gly Tyr Thr His Tyr Leu Asn Leu Val Leu Val Asn Cys Pro He Glu
195 200 205
Leu Ser Gly Val Arg Glu Cys He Glu Glu Ser Glu Gly Val Asp Gly
210 215 220
Ala Val Ser Glu Thr Ala Ser Ser His Leu Cys Val Lys Ala Leu Ala 225 230 235 240
Lys Gly Ser Glu Pro Leu Leu His Leu Arg Glu Lys He Ala Arg Leu
245 250 255
Val Thr Gin Thr Thr Thr Gin Lys Val 260 265
(2) INFORMATION FOR SEQ ID NO: 621:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1815 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1757 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621:
ATGGCGCTAA AAGCGATGAC AACTATGTAA AAACAATTAA GGAGTAAGAA ATG AAA 56
Met Lys
1
AAG ATT AGC AGA AAA GAA TAT GTT TCT ATG TAT GGC CCT ACT ACA GGC 104 Lys He Ser Arg Lys Glu Tyr Val Ser Met Tyr Gly Pro Thr Thr Gly 5 10 15
GAT AAA GTG AGA TTG GGC GAT ACA GAC TTG ATC GCT GAA GTA GAA CAT 152 Asp Lys Val Arg Leu Gly Asp Thr Asp Leu He Ala Glu Val Glu His 20 25 30
GAC TAC ACC ATT TAT GGC GAA GAG CTT AAA TTC GGT GGC GGT AAA ACC 200 Asp Tyr Thr He Tyr Gly Glu Glu Leu Lys Phe Gly Gly Gly Lys Thr 35 40 45 50
CTG AGA GAA GGC ATG AGC CAA TCC AAC AAC CCT AGC AAA GAA GAA TTG 248 Leu Arg Glu Gly Met Ser Gin Ser Asn Asn Pro Ser Lys Glu Glu Leu 55 60 65
GAT CTA ATC ATC ACT AAC GCT TTA ATC GTG GAT TAC ACC GGT ATT TAT 296 Asp Leu He He Thr Asn Ala Leu He Val Asp Tyr Thr Gly He Tyr 70 75 80
AAA GCG GAT ATT GGT ATT AAA GAT GGC AAA ATC GCT GGC ATT GGT AAA 344 Lys Ala Asp He Gly He Lys Asp Gly Lys He Ala Gly He Gly Lys 85 90 95
GGC GGT AAC AAA GAC ATG CAA GAT GGC GTT AAA AAC AAT CTT AGC GTA 392 Gly Gly Asn Lys Asp Met Gin Asp Gly Val Lys Asn Asn Leu Ser Val 100 105 110
GGT CCT GCT ACT GAA GCC TTA GCC GGT GAA GGT TTG ATC GTA ACT GCT 440 Gly Pro Ala Thr Glu Ala Leu Ala Gly Glu Gly Leu He Val Thr Ala 115 120 125 130
GGT GGT ATT GAC ACA CAC ATC CAC TTC ATT TCA CCC CAA CAA ATC CCT 488 Gly Gly He Asp Thr His He His Phe He Ser Pro Gin Gin He Pro 135 140 145
ACA GCT TTT GCA AGC GGT GTA ACA ACC ATG ATT GGT GGC GGA ACT GGT 536 Thr Ala Phe Ala Ser Gly Val Thr Thr Met He Gly Gly Gly Thr Gly 150 155 160 CCT GCT GAT GGC ACT AAT GCG ACT ACT ATC ACT CCA GGC AGA AGA AAT 584 Pro Ala Asp Gly Thr Asn Ala Thr Thr He Thr Pro Gly Arg Arg Asn 165 170 175
TTA AAA TGG ATG CTC AGA GCG GCT GAA GAA TAT TCT ATG AAC TTA GGT 632 Leu Lys Trp Met Leu Arg Ala Ala Glu Glu Tyr Ser Met Asn Leu Gly 180 185 190
TTC TTG GCT AAA GGT AAC GCT TCT AAC GAC GCG AGC TTA GCC GAT CAA 680 Phe Leu Ala Lys Gly Asn Ala Ser Asn Asp Ala Ser Leu Ala Asp Gin 195 200 205 210
ATT GAA GCT GGT GCG ATT GGC TTT AAA ATC CAC GAA GAC TGG GGC ACC 728 He Glu Ala Gly Ala He Gly Phe Lys He His Glu Asp Trp Gly Thr 215 220 225
ACT CCT TCT GCA ATC AAT CAT GCG TTA GAT GTT GCA GAC AAA TAC GAT 776 Thr Pro Ser Ala He Asn His Ala Leu Asp Val Ala Asp Lys Tyr Asp 230 235 240
GTG CAA GTC GCT ATC CAC ACA GAC ACT TTG AAT GAA GCC GGT TGC GTG 824 Val Gin Val Ala He His Thr Asp Thr Leu Asn Glu Ala Gly Cys Val 245 250 255
GAA GAC ACT ATG GCA GCT ATT GCC GGA CGC ACT ATG CAC ACT TTC CAC 872 Glu Asp Thr Met Ala Ala He Ala Gly Arg Thr Met His Thr Phe His 260 265 270
ACT GAA GGT GCT GGC GGC GGA CAC GCT CCT GAT ATT ATT AAA GTA GCT 920 Thr Glu Gly Ala Gly Gly Gly His Ala Pro Asp He He Lys Val Ala 275 280 285 290
GGT GAA CAC AAC ATT CTT CCC GCT TCC ACT AAC CCC ACT ATC CCT TTC 968 Gly Glu His Asn He Leu Pro Ala Ser Thr Asn Pro Thr He Pro Phe 295 300 305
ACT GTG AAT ACA GAA GCA GAA CAC ATG GAC ATG CTT ATG GTG TGC CAC 1016 Thr Val Asn Thr Glu Ala Glu His Met Asp Met Leu Met Val Cys His 310 315 320
CAC TTG GAT AAA AGC ATT AAA GAA GAT GTT CAG TTC GCT GAT TCA AGG 1064 His Leu Asp Lys Ser He Lys Glu Asp Val Gin Phe Ala Asp Ser Arg 325 330 335
ATC CGC CCT CAA ACC ATT GCG GCT GAA GAC ACT TTG CAT GAC ATG GGG 1112 He Arg Pro Gin Thr He Ala Ala Glu Asp Thr Leu His Asp Met Gly 340 345 350
ATT TTC TCA ATC ACC AGC TCT GAC TCT CAA GCT ATG GGT CGT GTG GGT 1160 He Phe Ser He Thr Ser Ser Asp Ser Gin Ala Met Gly Arg Val Gly 355 360 365 370
GAA GTT ATC ACT AGA ACT TGG CAA ACA GCT GAC AAA AAC AAA AAA GAA 1208 Glu Val He Thr Arg Thr Trp Gin Thr Ala Asp Lys Asn Lys Lys Glu 375 380 385 TTT GGC CGC TTG AAA GAA GAA AAA GGC GAT AAC GAC AAC TTC AGG ATC 1256 Phe Gly Arg Leu Lys Glu Glu Lys Gly Asp Asn Asp Asn Phe Arg He 390 395 400
AAA CGC TAC TTG TCT AAA TAC ACC ATT AAC CCA GCG ATC GCT CAT GGG 1304 Lys Arg Tyr Leu Ser Lys Tyr Thr He Asn Pro Ala He Ala His Gly 405 410 415
ATT AGC GAG TAT GTA GGT TCT GTA GAA GTG GGC AAA GTG GCT GAC TTG 1352 He Ser Glu Tyr Val Gly Ser Val Glu Val Gly Lys Val Ala Asp Leu 420 425 430
GTA TTG TGG AGT CCC GCA TTC TTT GGC GTA AAA CCC AAC ATG ATC ATC 1400 Val Leu Trp Ser Pro Ala Phe Phe Gly Val Lys Pro Asn Met He He 435 440 445 450
AAA GGC GGG TTC ATT GCG TTG AGT CAA ATG GGT GAC GCG AAC GCT TCT 1448 Lys Gly Gly Phe He Ala Leu Ser Gin Met Gly Asp Ala Asn Ala Ser 455 460 465
ATC CCT ACC CCA CAA CCA GTT TAT TAC AGA GAA ATG TTC GCT CAT CAT 1496 He Pro Thr Pro Gin Pro Val Tyr Tyr Arg Glu Met Phe Ala His His 470 475 480
GGT AAA GCC AAA TAC GAT GCA AAC ATC ACT TTT GTG TCT CAA GCG GCT 1544 Gly Lys Ala Lys Tyr Asp Ala Asn He Thr Phe Val Ser Gin Ala Ala 485 490 495
TAT GAC AAA GGC ATT AAA GAA GAA TTA GGG CTT GAA AGA CAA GTG TTG 1592 Tyr Asp Lys Gly He Lys Glu Glu Leu Gly Leu Glu Arg Gin Val Leu 500 505 510
CCG GTA AAA AAT TGC AGA AAC ATC ACT AAA AAA GAC ATG CAA TTC AAC 1640 Pro Val Lys Asn Cys Arg Asn He Thr Lys Lys Asp Met Gin Phe Asn 515 520 525 530
GAC ACT ACC GCT CAC ATT GAA GTC AAT CCT GAA ACT TAC CAT GTG TTC 1688 Asp Thr Thr Ala His He Glu Val Asn Pro Glu Thr Tyr His Val Phe 535 540 545
GTG GAT GGC AAA GAA GTA ACT TCT AAA CCA GCC AAT AAA GTG AGC TTG 1736 Val Asp Gly Lys Glu Val Thr Ser Lys Pro Ala Asn Lys Val Ser Leu 550 555 560
GCG CAA CTC TTT AGC ATT TTC TAGGATTTTT TAGGAGCAAC GCTCCTTAAA TCCT 1791 Ala Gin Leu Phe Ser He Phe 565
TAGTTTTTAG CTCTCTGATT TTTT 1815
(2) INFORMATION FOR SEQ ID NO: 622:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 569 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622:
Met Lys Lys He Ser Arg Lys Glu Tyr Val Ser Met Tyr Gly Pro Thr
1 5 10 15
Thr Gly Asp Lys Val Arg Leu Gly Asp Thr Asp Leu He Ala Glu Val
20 25 30
Glu His Asp Tyr Thr He Tyr Gly Glu Glu Leu Lys Phe Gly Gly Gly
35 40 45
Lys Thr Leu Arg Glu Gly Met Ser Gin Ser Asn Asn Pro Ser Lys Glu
50 55 60
Glu Leu Asp Leu He He Thr Asn Ala Leu He Val Asp Tyr Thr Gly 65 70 75 80
He Tyr Lys Ala Asp He Gly He Lys Asp Gly Lys He Ala Gly He
85 90 95
Gly Lys Gly Gly Asn Lys Asp Met Gin Asp Gly Val Lys Asn Asn Leu
100 105 110
Ser Val Gly Pro Ala Thr Glu Ala Leu Ala Gly Glu Gly Leu He Val
115 120 125
Thr Ala Gly Gly He Asp Thr His He His Phe He Ser Pro Gin Gin
130 135 140
He Pro Thr Ala Phe Ala Ser Gly Val Thr Thr Met He Gly Gly Gly 145 150 155 160
Thr Gly Pro Ala Asp Gly Thr Asn Ala Thr Thr He Thr Pro Gly Arg
165 170 175
Arg Asn Leu Lys Trp Met Leu Arg Ala Ala Glu Glu Tyr Ser Met Asn
180 185 190
Leu Gly Phe Leu Ala Lys Gly Asn Ala Ser Asn Asp Ala Ser Leu Ala
195 200 205
Asp Gin He Glu Ala Gly Ala He Gly Phe Lys He His Glu Asp Trp
210 215 220
Gly Thr Thr Pro Ser Ala He Asn His Ala Leu Asp Val Ala Asp Lys 225 230 235 240
Tyr Asp Val Gin Val Ala He His Thr Asp Thr Leu Asn Glu Ala Gly
245 250 255
Cys Val Glu Asp Thr Met Ala Ala He Ala Gly Arg Thr Met His Thr
260 265 270
Phe His Thr Glu Gly Ala Gly Gly Gly His Ala Pro Asp He He Lys
275 280 285
Val Ala Gly Glu His Asn He Leu Pro Ala Ser Thr Asn Pro Thr He
290 295 300
Pro Phe Thr Val Asn Thr Glu Ala Glu His Met Asp Met Leu Met Val 305 310 315 320
Cys His His Leu Asp Lys Ser He Lys Glu Asp Val Gin Phe Ala Asp
325 330 335
Ser Arg He Arg Pro Gin Thr He Ala Ala Glu Asp Thr Leu His Asp
340 345 350
Met Gly He Phe Ser He Thr Ser Ser Asp Ser Gin Ala Met Gly Arg
355 360 365
Val Gly Glu Val He Thr Arg Thr Trp Gin Thr Ala Asp Lys Asn Lys 370 375 380
Lys Glu Phe Gly Arg Leu Lys Glu Glu Lys Gly Asp Asn Asp Asn Phe 385 390 395 400
Arg He Lys Arg Tyr Leu Ser Lys Tyr Thr He Asn Pro Ala He Ala
405 410 415
His Gly He Ser Glu Tyr Val Gly Ser Val Glu Val Gly Lys Val Ala
420 425 430
Asp Leu Val Leu Trp Ser Pro Ala Phe Phe Gly Val Lys Pro Asn Met
435 440 445
He He Lys Gly Gly Phe He Ala Leu Ser Gin Met Gly Asp Ala Asn
450 455 460
Ala Ser He Pro Thr Pro Gin Pro Val Tyr Tyr Arg Glu Met Phe Ala 465 470 475 480
His His Gly Lys Ala Lys Tyr Asp Ala Asn He Thr Phe Val Ser Gin
485 490 495
Ala Ala Tyr Asp Lys Gly He Lys Glu Glu Leu Gly Leu Glu Arg Gin
500 505 510
Val Leu Pro Val Lys Asn Cys Arg Asn He Thr Lys Lys Asp Met Gin
515 520 525
Phe Asn Asp Thr Thr Ala His He Glu Val Asn Pro Glu Thr Tyr His
530 535 540
Val Phe Val Asp Gly Lys Glu Val Thr Ser Lys Pro Ala Asn Lys Val 545 550 555 560
Ser Leu Ala Gin Leu Phe Ser He Phe 565
(2) INFORMATION FOR SEQ ID NO: 623:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 934 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...881 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623:
GAGATCTACA ATTTGAGCAA TGTAAATGAT TTTATATCAA AGGCACAAAA ATG GTA 56
Met Val 1
ATC GCG CAT TCT AAT GAA ATC GCA CGC CCC ATT TTT AAA AGC CAA GAC 104 He Ala His Ser Asn Glu He Ala Arg Pro He Phe Lys Ser Gin Asp 5 10 15
CAG CTT TTC ACT CTT TAT CAA GGG GAT TGT AAT GAG GTT TTG CCC CAA 152 Gin Leu Phe Thr Leu Tyr Gin Gly Asp Cys Asn Glu Val Leu Pro Gin 20 25 30
TTT GAA AAC CAG TTT GAT TTG ATT TTT GCT GAT CCG CCT TAT TTC CTC 200 Phe Glu Asn Gin Phe Asp Leu He Phe Ala Asp Pro Pro Tyr Phe Leu 35 40 45 50
TCT AAT GAC GGC TTA AGC ATA CAG AGC GGT AAA ATC GTG AGC GTC AAT 248 Ser Asn Asp Gly Leu Ser He Gin Ser Gly Lys He Val Ser Val Asn 55 60 65
AAA GGC GAT TGG GAT AAA GAA GAT GGG ATT AAT GGT ATT GAT GAG TTT 296 Lys Gly Asp Trp Asp Lys Glu Asp Gly He Asn Gly He Asp Glu Phe 70 75 80
AAT TAC CAG TGG ATA AAC AAC GCT AAA AAG GCT TTA AAA GAC ACA GGA 344 Asn Tyr Gin Trp He Asn Asn Ala Lys Lys Ala Leu Lys Asp Thr Gly 85 90 95
AGC CTT TTA ATC AGC GGG ACT TAC CAC AAC ATC TTT TCT TTG GGG TGT 392 Ser Leu Leu He Ser Gly Thr Tyr His Asn He Phe Ser Leu Gly Cys 100 105 110
GTT TTA CAA AAA TTG GAT TTT AAG ATT TTA AAC CTC ATC ACC TGG CAA 440 Val Leu Gin Lys Leu Asp Phe Lys He Leu Asn Leu He Thr Trp Gin 115 120 125 130
AAA ACC AAC CCT CCT CCC AAT TTC AGC TGC CGT TAT TTG ACG CAT TCA 488 Lys Thr Asn Pro Pro Pro Asn Phe Ser Cys Arg Tyr Leu Thr His Ser 135 140 145
GCT GAG CAA ATC ATT TGG GCG AGA AAA AGC CGC AAA CAC AAG CAT GTT 536 Ala Glu Gin He He Trp Ala Arg Lys Ser Arg Lys His Lys His Val 150 155 160
TTT AAC TAT GAG GTT TTA AAA AAG ATC AAT AAC GAC AAG CAA ATG CGC 584 Phe Asn Tyr Glu Val Leu Lys Lys He Asn Asn Asp Lys Gin Met Arg 165 170 175
GAT GTG TGG AGC TTC CCA GCG ATC GCT CCT TGG GAA AAA GTT AAT GGC 632 Asp Val Trp Ser Phe Pro Ala He Ala Pro Trp Glu Lys Val Asn Gly 180 185 190
AAG CAC CCC ACT CAA AAA CCC CTC GCT TTA TTA GTG CGC TTG CTT TTA 680 Lys His Pro Thr Gin Lys Pro Leu Ala Leu Leu Val Arg Leu Leu Leu 195 200 205 210
ATG GCG AGC GAT GAA AAT TCT CTC ATT GGC GAT CCT TTT AGC GGG AGC 728 Met Ala Ser Asp Glu Asn Ser Leu He Gly Asp Pro Phe Ser Gly Ser 215 220 225
TCT ACC ACA GGC ATT GCG GCT AAT CTT TTG AAG AGG GAA TTT ATT GGC 776 Ser Thr Thr Gly He Ala Ala Asn Leu Leu Lys Arg Glu Phe He Gly 230 235 240 ATA GAA AAA GAA AGC GAG TTT ATC AAA ATA TCC ATG GAT AGA AAA ATA 824 He Glu Lys Glu Ser Glu Phe He Lys He Ser Met Asp Arg Lys He 245 250 255
GAA TTA GAC GCT CGC TAT AAA GAA ATC CGA TCT AAA ATC AAA GAT TTA 872 Glu Leu Asp Ala Arg Tyr Lys Glu He Arg Ser Lys He Lys Asp Leu 260 265 270
AAC CAC CAA TAAAGCCTTT TTTTAAGCCA CTTTAAGCGT TATACTTTTG GGATTTTAC 930
Asn His Gin
275
CTCA 934
(2) INFORMATION FOR SEQ ID NO: 624:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 277 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624:
Met Val He Ala His Ser Asn Glu He Ala Arg Pro He Phe Lys Ser
1 5 10 15
Gin Asp Gin Leu Phe Thr Leu Tyr Gin Gly Asp Cys Asn Glu Val Leu
20 25 30
Pro Gin Phe Glu Asn Gin Phe Asp Leu He Phe Ala Asp Pro Pro Tyr
35 40 45
Phe Leu Ser Asn Asp Gly Leu Ser He Gin Ser Gly Lys He Val Ser
50 55 60
Val Asn Lys Gly Asp Trp Asp Lys Glu Asp Gly He Asn Gly He Asp 65 70 75 80
Glu Phe Asn Tyr Gin Trp He Asn Asn Ala Lys Lys Ala Leu Lys Asp
85 90 95
Thr Gly Ser Leu Leu He Ser Gly Thr Tyr His Asn He Phe Ser Leu
100 105 110
Gly Cys Val Leu Gin Lys Leu Asp Phe Lys He Leu Asn Leu He Thr
115 120 125
Trp Gin Lys Thr Asn Pro Pro Pro Asn Phe Ser Cys Arg Tyr Leu Thr
130 135 140
His Ser Ala Glu Gin He He Trp Ala Arg Lys Ser Arg Lys His Lys 145 150 155 160
His Val Phe Asn Tyr Glu Val Leu Lys Lys He Asn Asn Asp Lys Gin
165 170 175
Met Arg Asp Val Trp Ser Phe Pro Ala He Ala Pro Trp Glu Lys Val
180 185 190
Asn Gly Lys His Pro Thr Gin Lys Pro Leu Ala Leu Leu Val Arg Leu
195 200 205
Leu Leu Met Ala Ser Asp Glu Asn Ser Leu He Gly Asp Pro Phe Ser 210 215 220 Gly Ser Ser Thr Thr Gly He Ala Ala Asn Leu Leu Lys Arg Glu Phe 225 230 235 240
He Gly He Glu Lys Glu Ser Glu Phe He Lys He Ser Met Asp Arg
245 250 255
Lys He Glu Leu Asp Ala Arg Tyr Lys Glu He Arg Ser Lys He Lys
260 265 270
Asp Leu Asn His Gin 275
(2) INFORMATION FOR SEQ ID NO: 625:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 646 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 259...588 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625:
CTTGACTTAT TTTTTTGGCT ATTTCCAAGA TCCACGATAC TTTGATGCTA TATCCCCTTT 60
AATCAAGCAA ACCTTCACTC TACCCCCCCC CCCCCCGAAA ATAATAAGAA TAATAATAAA 120
AAAGAGGAAG AATATCAGTG CAAGCTTTCT TTGATTTTAG CCGCTAAAAA CAGCGTGTTT 180
GTGCATATAA GAAGAGGGGA TTATGTGGGG ATTGGCTGTC AGCTTGGTAT TGACTATCAA 240
AAAAAGGCGC TTGAGTAT ATG GCA AAG CGC GTG CCA AAC ATG GAG CTT TTT 291
Met Ala Lys Arg Val Pro Asn Met Glu Leu Phe 1 5 10
GTG TTT TGC GAA GAC TTA GAA TTC ACG CAA AAT CTT GAT CTT GGC TAC 339 Val Phe Cys Glu Asp Leu Glu Phe Thr Gin Asn Leu Asp Leu Gly Tyr 15 20 25
CCT TTT ATG GAC ATG ACC ACT AGG GAT AAA GAA GAA GAG GCG TAT TGG 387 Pro Phe Met Asp Met Thr Thr Arg Asp Lys Glu Glu Glu Ala Tyr Trp 30 35 40
GAC ATG CTG CTC ATG CAA TCT TGT CAG CAT GGC ATT ATC GCT AAT AGC 435 Asp Met Leu Leu Met Gin Ser Cys Gin His Gly He He Ala Asn Ser 45 50 55
ACT TAT AGC TGG TGG GCG GCC TAT TTG ATA GAA AAT CCA GAA AAA ATC 483 Thr Tyr Ser Trp Trp Ala Ala Tyr Leu He Glu Asn Pro Glu Lys He 60 65 70 75
ATT ATT GGC CCC AAA CAC TGG CTT TTT GGG CAT GAG AAT ATC CTT TGT 531 He He Gly Pro Lys His Trp Leu Phe Gly His Glu Asn He Leu Cys 80 85 90
AAG GAG TGG GTG AAA ATA GAA TCC CAT TTT GAG GTA AAA TCC CAA AAG 579 Lys Glu Trp Val Lys He Glu Ser His Phe Glu Val Lys Ser Gin Lys 95 100 105
TAT AAC GCT TAAAGTGGCT TAAAAAAAGG CTTTATTGGT GGTTTAAATC TTTGATTTT 637 Tyr Asn Ala 110
AGATCGGAT 646
(2) INFORMATION FOR SEQ ID NO: 626:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 110 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626:
Met Ala Lys Arg Val Pro Asn Met Glu Leu Phe Val Phe Cys Glu Asp
1 5 10 15
Leu Glu Phe Thr Gin Asn Leu Asp Leu Gly Tyr Pro Phe Met Asp Met
20 25 30
Thr Thr Arg Asp Lys Glu Glu Glu Ala Tyr Trp Asp Met Leu Leu Met
35 40 45
Gin Ser Cys Gin His Gly He He Ala Asn Ser Thr Tyr Ser Trp Trp
50 55 60
Ala Ala Tyr Leu He Glu Asn Pro Glu Lys He He He Gly Pro Lys 65 70 75 80
His Trp Leu Phe Gly His Glu Asn He Leu Cys Lys Glu Trp Val Lys
85 90 95
He Glu Ser His Phe Glu Val Lys Ser Gin Lys Tyr Asn Ala 100 105 110
(2) INFORMATION FOR SEQ ID NO: 627:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1027 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...974 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627:
GGCGCATGAA TGCATTAAAT GTAAAGAACG CGTGTTTTTA GAGGAAGATA ATG GCT 56
Met Ala 1
AAA GAA AAT CCG CCT ATC GTT TTT GGG CCT GTT TTA TCC AGG CGT TTT 104 Lys Glu Asn Pro Pro He Val Phe Gly Pro Val Leu Ser Arg Arg Phe 5 10 15
GGG AAG TCT TTG GGC GTG GAT CTA TCG CCC TCT AAA AAA CAA TGC AAT 152 Gly Lys Ser Leu Gly Val Asp Leu Ser Pro Ser Lys Lys Gin Cys Asn 20 25 30
TAC AAT TGC ATT TAT TGC GAG TTG GGT AAA GCC AAG CCC ATT GAA CGC 200 Tyr Asn Cys He Tyr Cys Glu Leu Gly Lys Ala Lys Pro He Glu Arg 35 40 45 50
ATG GAA GAA GTG ATC AAA GTG GAA ACC TTG ATT AAC GCC ATT CAA AAC 248 Met Glu Glu Val He Lys Val Glu Thr Leu He Asn Ala He Gin Asn 55 60 65
GCC CTA AAC AAC CTC ACC ACC CCC ATT GAT GTT TTA ACC ATT ACC GCT 296 Ala Leu Asn Asn Leu Thr Thr Pro He Asp Val Leu Thr He Thr Ala 70 75 80
AAT GGC GAA CCC ACG CTA TAC CCT CAT TTA TTA GAG CTT ATC CAA AGC 344 Asn Gly Glu Pro Thr Leu Tyr Pro His Leu Leu Glu Leu He Gin Ser 85 90 95
ATC AAG CCT TTT TTA AAG GGC GTT AAA ACT TTG ATT TTA AGC AAT GGC 392 He Lys Pro Phe Leu Lys Gly Val Lys Thr Leu He Leu Ser Asn Gly 100 105 110
TCG CTC TTT TAT GAG CCA AAA GTC CAG CAA GCC TTA AAG GAA TTT GAC 440 Ser Leu Phe Tyr Glu Pro Lys Val Gin Gin Ala Leu Lys Glu Phe Asp 115 120 125 130
ATC GTT AAA TTT TCT TTA GAC GCT ATT GAT TTG AAA GCC TTT GAA AGA 488 He Val Lys Phe Ser Leu Asp Ala He Asp Leu Lys Ala Phe Glu Arg 135 140 145
GTG GAT AAA CCC TAT TCT AAA GAC ATT AAT AAG ATT TTA GAG GGG ATT 536 Val Asp Lys Pro Tyr Ser Lys Asp He Asn Lys He Leu Glu Gly He 150 155 160
TTG CGC TTT TCT CAA ATT TAT CAA GGG CAA TTG GTG GCT GAA GTG TTG 584 Leu Arg Phe Ser Gin He Tyr Gin Gly Gin Leu Val Ala Glu Val Leu 165 170 175
TTA ATT AAG GGC GTG AAT GAT AGC GCG AAC AAC TTA AAA CTC ATC GCT 632 Leu He Lys Gly Val Asn Asp Ser Ala Asn Asn Leu Lys Leu He Ala 180 185 190 GCC TTT TTA AAA CAA ATC AAT ATA GCC AGA GTG GAT TTA AGC ACC ATA 680 Ala Phe Leu Lys Gin He Asn He Ala Arg Val Asp Leu Ser Thr He 195 200 205 210
GAC AGA CCC TCA AGC TTT AAA GCC CCT AAA TTA AGC GAA GAT GAA TTG 728 Asp Arg Pro Ser Ser Phe Lys Ala Pro Lys Leu Ser Glu Asp Glu Leu 215 220 225
TTA AAA TGC TCT TTA TTT TTT GAA GGG CTT TGC GTG AGT TTG CCT AAA 776 Leu Lys Cys Ser Leu Phe Phe Glu Gly Leu Cys Val Ser Leu Pro Lys 230 235 240
CGA TCC ATT ACT CAA GCT AAA AAA TTG ATT TCT TGC GGT ATA GAC GAA 824 Arg Ser He Thr Gin Ala Lys Lys Leu He Ser Cys Gly He Asp Glu 245 250 255
TTG CTC GCT TTA ATT TCC AGG CGC CCT TTA AGC GCA GAA GAA GCC CCC 872 Leu Leu Ala Leu He Ser Arg Arg Pro Leu Ser Ala Glu Glu Ala Pro 260 265 270
CTA ATT CTA GAT TCT AAC GCT TTT AAG CAT TTA GAA ACT TTG TTA AAC 920 Leu He Leu Asp Ser Asn Ala Phe Lys His Leu Glu Thr Leu Leu Asn 275 280 285 290
CAT AAG CAA ATT ACG ATT AAA AAA GTC GGC TCT TTG GAG TTT TAT TGC 968 His Lys Gin He Thr He Lys Lys Val Gly Ser Leu Glu Phe Tyr Cys 295 300 305
GCG TTT TAACCTCCAT TTGTAAGTTT TACCTTACTT TAGGGATAGC TTAAGCTTTT AA 1026 Ala Phe
A 1027
(2) INFORMATION FOR SEQ ID NO: 628:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 308 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628:
Met Ala Lys Glu Asn Pro Pro He Val Phe Gly Pro Val Leu Ser Arg
1 5 10 15
Arg Phe Gly Lys Ser Leu Gly Val Asp Leu Ser Pro Ser Lys Lys Gin
20 25 30
Cys Asn Tyr Asn Cys He Tyr Cys Glu Leu Gly Lys Ala Lys Pro He
35 40 45
Glu Arg Met Glu Glu Val He Lys Val Glu Thr Leu He Asn Ala He 50 55 60 Gin Asn Ala Leu Asn Asn Leu Thr Thr Pro He Asp Val Leu Thr He 65 70 75 80
Thr Ala Asn Gly Glu Pro Thr Leu Tyr Pro His Leu Leu Glu Leu He
85 90 95
Gin Ser He Lys Pro Phe Leu Lys Gly Val Lys Thr Leu He Leu Ser
100 105 110
Asn Gly Ser Leu Phe Tyr Glu Pro Lys Val Gin Gin Ala Leu Lys Glu
115 120 125
Phe Asp He Val Lys Phe Ser Leu Asp Ala He Asp Leu Lys Ala Phe
130 135 140
Glu Arg Val Asp Lys Pro Tyr Ser Lys Asp He Asn Lys He Leu Glu 145 150 155 160
Gly He Leu Arg Phe Ser Gin He Tyr Gin Gly Gin Leu Val Ala Glu
165 170 175
Val Leu Leu He Lys Gly Val Asn Asp Ser Ala Asn Asn Leu Lys Leu
180 185 190
He Ala Ala Phe Leu Lys Gin He Asn He Ala Arg Val Asp Leu Ser
195 200 205
Thr He Asp Arg Pro Ser Ser Phe Lys Ala Pro Lys Leu Ser Glu Asp
210 215 220
Glu Leu Leu Lys Cys Ser Leu Phe Phe Glu Gly Leu Cys Val Ser Leu 225 230 235 240
Pro Lys Arg Ser He Thr Gin Ala Lys Lys Leu He Ser Cys Gly He
245 250 255
Asp Glu Leu Leu Ala Leu He Ser Arg Arg Pro Leu Ser Ala Glu Glu
260 265 270
Ala Pro Leu He Leu Asp Ser Asn Ala Phe Lys His Leu Glu Thr Leu
275 280 285
Leu Asn His Lys Gin He Thr He Lys Lys Val Gly Ser Leu Glu Phe
290 295 300
Tyr Cys Ala Phe 305
(2) INFORMATION FOR SEQ ID NO: 629:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1350 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 87...1280 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629:
ATCAATCTAA CTTGAGTGGA TTTTTCGTAT TAGTTTCCAT GATATAATTT TGAAAAGTAA 60 GATTGTTTTT TAAAAAAAGG TTGGTA ATG GAA TCA GTA AAA ACA GGA AAA ACA 113 Met Glu Ser Val Lys Thr Gly Lys Thr 1 5
AAT AAG GTT GGC AAG AAT ACA GAG ATG GCT AAT ACA AAG GCA AAT AAA 161 Asn Lys Val Gly Lys Asn Thr Glu Met Ala Asn Thr Lys Ala Asn Lys 10 15 20 25
GAG ACT CAT TTT AAA CAA GTG AGC GCC ATT ACA AAT ATA ATC AGA TCA 209 Glu Thr His Phe Lys Gin Val Ser Ala He Thr Asn He He Arg Ser 30 35 40
GTT GGT GGG TTT TTT ACA AAA ATT GCA AAG AGA GTT AGA GGA CTT GTA 257 Val Gly Gly Phe Phe Thr Lys He Ala Lys Arg Val Arg Gly Leu Val 45 50 55
AAA AAA CAC CCC AAG AAA AGC AGT GCG GCA TTA GTA GTA TTG ACC CAT 305 Lys Lys His Pro Lys Lys Ser Ser Ala Ala Leu Val Val Leu Thr His 60 65 70
ATT GCG TGC AAG AAA GCG AAA GAA TTA GAC GAT AAA GTC CAA GAT AAA 353 He Ala Cys Lys Lys Ala Lys Glu Leu Asp Asp Lys Val Gin Asp Lys 75 80 85
TCC AAA CAA GCT GAA AAA GAA AAT CAA ATC AAT TGG TGG AAA TAT TCA 401 Ser Lys Gin Ala Glu Lys Glu Asn Gin He Asn Trp Trp Lys Tyr Ser 90 95 100 105
GGA TTA ACA ATA GCG GCA AGT TTA TTA TTA GCC GCT TGT AGC GCT GGT 449 Gly Leu Thr He Ala Ala Ser Leu Leu Leu Ala Ala Cys Ser Ala Gly 110 115 120
GAT ACT GAT AAA CAG ATA GAA CTA GAA CAA GAA AAA AAG GAA GCT GAA 497 Asp Thr Asp Lys Gin He Glu Leu Glu Gin Glu Lys Lys Glu Ala Glu 125 130 135
AAC GCT AGG GAT AGA GCG AAC AAG AGT GGG ATA GAA CTA GAA CAA GAA 545 Asn Ala Arg Asp Arg Ala Asn Lys Ser Gly He Glu Leu Glu Gin Glu 140 145 150
AGA CAG AAA ACA AAC AAG AGT GGG ATA GAA CTC GCT AAT AGT CAA ATA 593 Arg Gin Lys Thr Asn Lys Ser Gly He Glu Leu Ala Asn Ser Gin He 155 160 165
AAA GCA GAA CAA GAA AGA CAA AAG ACA GAA CAA GAA AAA CAA AAA GCA 641 Lys Ala Glu Gin Glu Arg Gin Lys Thr Glu Gin Glu Lys Gin Lys Ala 170 175 180 185
AAT AAG AGT GCG ATA GAG TTA GAA CAG CAA AAA CAA AAG ACC ATT AAT 689 Asn Lys Ser Ala He Glu Leu Glu Gin Gin Lys Gin Lys Thr He Asn 190 195 200
ACA CAA AGA GAT TTG ATT AAA GAA CAG AAA GAT TTC ATT AAA GAA ACA 737 Thr Gin Arg Asp Leu He Lys Glu Gin Lys Asp Phe He Lys Glu Thr 205 210 215 GAA CAA AAT TGC CAA GAA AAT CAT AAT CAA TTC TTT ATT AAA AAA TTA 785 Glu Gin Asn Cys Gin Glu Asn His Asn Gin Phe Phe He Lys Lys Leu 220 225 230
GGA ATT AAG GGT GGC ATT GCT ATA GAA GTA GAA GCT GAA TGC AAA ACC 833 Gly He Lys Gly Gly He Ala He Glu Val Glu Ala Glu Cys Lys Thr 235 240 245
CCT AAA CCT GCA AAA ACC AAT CAA ACC CCT ATC CAG CCA AAA CAC CTC 881 Pro Lys Pro Ala Lys Thr Asn Gin Thr Pro He Gin Pro Lys His Leu 250 255 260 265
CCA AAC TCT AAA CAA CCT CAT TCT CAA AGA GGA TCA AAA GCG CAA GAG 929 Pro Asn Ser Lys Gin Pro His Ser Gin Arg Gly Ser Lys Ala Gin Glu 270 275 280
TTT ATC GCT TAT TTG CAA AAA GAG CTA GAA TTT CTG CCC TAT TCG CAA 977 Phe He Ala Tyr Leu Gin Lys Glu Leu Glu Phe Leu Pro Tyr Ser Gin 285 290 295
AAA GCT ATC GCT AAA CAA GTG AAT TTC TAT AAA CCA AGT TCT ATC GCT 1025 Lys Ala He Ala Lys Gin Val Asn Phe Tyr Lys Pro Ser Ser He Ala 300 305 310
TAT TTA GAA CTA GAT CCT AGA GAT TTT AAG GTT ACA GAA GAA TGG CAA 1073 Tyr Leu Glu Leu Asp Pro Arg Asp Phe Lys Val Thr Glu Glu Trp Gin 315 320 325
AAA GAA AAT CTA AAA ATA CGC TCT AAA GCT CAA GCT AAA ATG CTT GAA 1121 Lys Glu Asn Leu Lys He Arg Ser Lys Ala Gin Ala Lys Met Leu Glu 330 335 340 345
ATG AGG GAT TTA AAA CCA GAC CCA CAA GCC CAC CTT CCA ACC TCT CAA 1169 Met Arg Asp Leu Lys Pro Asp Pro Gin Ala His Leu Pro Thr Ser Gin 350 355 360
AGC CTT TTG TTC GTT CAA AAA ATA TTT GCT GAT GTT AAT AAA GAA ATA 1217 Ser Leu Leu Phe Val Gin Lys He Phe Ala Asp Val Asn Lys Glu He 365 370 375
GAA GCA GTT GCT AAT ACT GAA AAG AAA GCA GAA AAA GCG GGT TAT GGT 1265 Glu Ala Val Ala Asn Thr Glu Lys Lys Ala Glu Lys Ala Gly Tyr Gly 380 385 390
TAT AGT AAA AGG ATG TAGGCATAAG AAAATAAGAA CACCATAAAA TCGTTTTTAG C 1321 Tyr Ser Lys Arg Met 395
TTCTAGGAGA CATCAGTCAG TTTCTTGCC 1350
(2) INFORMATION FOR SEQ ID NO: 630:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 398 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630:
Met Glu Ser Val Lys Thr Gly Lys Thr Asn Lys Val Gly Lys Asn Thr
1 5 10 15
Glu Met Ala Asn Thr Lys Ala Asn Lys Glu Thr His Phe Lys Gin Val
20 25 30
Ser Ala He Thr Asn He He Arg Ser Val Gly Gly Phe Phe Thr Lys
35 40 45
He Ala Lys Arg Val Arg Gly Leu Val Lys Lys His Pro Lys Lys Ser
50 55 60
Ser Ala Ala Leu Val Val Leu Thr His He Ala Cys Lys Lys Ala Lys 65 70 75 80
Glu Leu Asp Asp Lys Val Gin Asp Lys Ser Lys Gin Ala Glu Lys Glu
85 90 95
Asn Gin He Asn Trp Trp Lys Tyr Ser Gly Leu Thr He Ala Ala Ser
100 105 110
Leu Leu Leu Ala Ala Cys Ser Ala Gly Asp Thr Asp Lys Gin He Glu
115 120 125
Leu Glu Gin Glu Lys Lys Glu Ala Glu Asn Ala Arg Asp Arg Ala Asn
130 135 140
Lys Ser Gly He Glu Leu Glu Gin Glu Arg Gin Lys Thr Asn Lys Ser 145 150 155 160
Gly He Glu Leu Ala Asn Ser Gin He Lys Ala Glu Gin Glu Arg Gin
165 170 175
Lys Thr Glu Gin Glu Lys Gin Lys Ala Asn Lys Ser Ala He Glu Leu
180 185 190
Glu Gin Gin Lys Gin Lys Thr He Asn Thr Gin Arg Asp Leu He Lys
195 200 205
Glu Gin Lys Asp Phe He Lys Glu Thr Glu Gin Asn Cys Gin Glu Asn
210 215 220
His Asn Gin Phe Phe He Lys Lys Leu Gly He Lys Gly Gly He Ala 225 230 235 240
He Glu Val Glu Ala Glu Cys Lys Thr Pro Lys Pro Ala Lys Thr Asn
245 250 255
Gin Thr Pro He Gin Pro Lys His Leu Pro Asn Ser Lys Gin Pro His
260 265 270
Ser Gin Arg Gly Ser Lys Ala Gin Glu Phe He Ala Tyr Leu Gin Lys
275 280 285
Glu Leu Glu Phe Leu Pro Tyr Ser Gin Lys Ala He Ala Lys Gin Val
290 295 300
Asn Phe Tyr Lys Pro Ser Ser He Ala Tyr Leu Glu Leu Asp Pro Arg 305 310 315 320
Asp Phe Lys Val Thr Glu Glu Trp Gin Lys Glu Asn Leu Lys He Arg
325 330 335
Ser Lys Ala Gin Ala Lys Met Leu Glu Met Arg Asp Leu Lys Pro Asp
340 345 350
Pro Gin Ala His Leu Pro Thr Ser Gin Ser Leu Leu Phe Val Gin Lys
355 360 365
He Phe Ala Asp Val Asn Lys Glu He Glu Ala Val Ala Asn Thr Glu 370 375 380
Lys Lys Ala Glu Lys Ala Gly Tyr Gly Tyr Ser Lys Arg Met 385 390 395
(2) INFORMATION FOR SEQ ID NO: 631:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1939 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic RNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1886 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631:
AAGATTGCTA AATTTTAAGG TTTGAAGAAA GGAAACATAA GTTTTAAAGA ATG AGT 56
Met Ser 1
GCG GAA CTG ATT GCT GTT TAT AAA GAC GAG CAA ATA ATA GAT TTA GAG 104 Ala Glu Leu He Ala Val Tyr Lys Asp Glu Gin He He Asp Leu Glu 5 10 15
AGC GCG AAA GTC TTA GGG CTG AGC GAT GGG ATT AAA GCG TTA AAC GGG 152 Ser Ala Lys Val Leu Gly Leu Ser Asp Gly He Lys Ala Leu Asn Gly 20 25 30
ACA GAG CCG ATA TAT TTT GAT GAT TCG CCT TTG GCT TTA GAG GTG ATT 200 Thr Glu Pro He Tyr Phe Asp Asp Ser Pro Leu Ala Leu Glu Val He 35 40 45 50
AGG CAT TCA TGC GCG CAT TTG CTT GCG CAA AGC TTG AAA GCC CTT TAT 248 Arg His Ser Cys Ala His Leu Leu Ala Gin Ser Leu Lys Ala Leu Tyr 55 60 65
CCG GAC GCG AAA TTT TTT GTA GGC CCT GTG GTA GAA GAG GGG TTT TAT 296 Pro Asp Ala Lys Phe Phe Val Gly Pro Val Val Glu Glu Gly Phe Tyr 70 75 80
TAC GAT TTC AAG ACT TCT TCA AAA ATC AGC GAA GAG GAT TTG CCT AAA 344 Tyr Asp Phe Lys Thr Ser Ser Lys He Ser Glu Glu Asp Leu Pro Lys 85 90 95
ATT GAA GCG AAA ATG AAA GAG TTT GCG AAG TTG AAA CTC GCT ATC ACT 392 He Glu Ala Lys Met Lys Glu Phe Ala Lys Leu Lys Leu Ala He Thr 100 105 110 AAA GAG ACT TTA ACC AGA GAG CAA GCT TTG GAG CGT TTT AAG GGC GAT 440 Lys Glu Thr Leu Thr Arg Glu Gin Ala Leu Glu Arg Phe Lys Gly Asp 115 120 125 130
GAA TTA AAG CAT GCG GTG ATG AGT AAA ATC GGT GGC GAT GCC TTT GGC 488 Glu Leu Lys His Ala Val Met Ser Lys He Gly Gly Asp Ala Phe Gly 135 140 145
GTG TAT CAA CAA GGC GAG TTT GAA GAT TTG TGT AAG GGG CCG CAT CTC 536 Val Tyr Gin Gin Gly Glu Phe Glu Asp Leu Cys Lys Gly Pro His Leu 150 155 160
CCA AAC ACC CGT TTT TTA AAC CAT TTT AAG CTC ACT AAA CTG GCT GGG 584 Pro Asn Thr Arg Phe Leu Asn His Phe Lys Leu Thr Lys Leu Ala Gly 165 170 175
GCT TAT TTG GGC GGC GAT GAA AAC AAT GAA ATG CTC ATT AGA ATC TAT 632 Ala Tyr Leu Gly Gly Asp Glu Asn Asn Glu Met Leu He Arg He Tyr 180 185 190
GGA ATC GCT TTT GCC ACC AAA GAG GGT TTA AAA GAC TAT CTT TTC CAA 680 Gly He Ala Phe Ala Thr Lys Glu Gly Leu Lys Asp Tyr Leu Phe Gin 195 200 205 210
ATA GAA GAA GCG AAA AAA CGA GAT CAC AGA AAG CTA GGC GTG GAG CTA 728 He Glu Glu Ala Lys Lys Arg Asp His Arg Lys Leu Gly Val Glu Leu 215 220 225
GGG CTT TTT AGC TTT GAT GAT GAG ATA GGG GCG GGC TTA CCT TTA TGG 776 Gly Leu Phe Ser Phe Asp Asp Glu He Gly Ala Gly Leu Pro Leu Trp 230 235 240
CTG CCT AAA GGG GCA AGG CTT AGG AAG CGC ATT GAA GAT TTA TTG AGT 824 Leu Pro Lys Gly Ala Arg Leu Arg Lys Arg He Glu Asp Leu Leu Ser 245 250 255
CAA GCG TTA CTT TTA AGA GGC TAT GAG CCG GTT AAA GGT CCT GAG ATT 872 Gin Ala Leu Leu Leu Arg Gly Tyr Glu Pro Val Lys Gly Pro Glu He 260 265 270
TTA AAG AGC GAT GTG TGG AAA ATC AGC GGG CAT TAT GAC AAC TAT AAA 920 Leu Lys Ser Asp Val Trp Lys He Ser Gly His Tyr Asp Asn Tyr Lys 275 280 285 290
GAA AAC ATG TAT TTC ACC ACG ATT GAT GAG CAA GAA TAT GGC ATA AAG 968 Glu Asn Met Tyr Phe Thr Thr He Asp Glu Gin Glu Tyr Gly He Lys 295 300 305
CCT ATG AAC TGC GTG GGG CAT ATT AAA GTC TAT CAA AGC GCT TTG CAC 1016 Pro Met Asn Cys Val Gly His He Lys Val Tyr Gin Ser Ala Leu His 310 315 320
AGC TAC AGA GAT TTG CCC TTA AGG TTT TAT GAA TAC GGC GTG GTG CAT 1064 Ser Tyr Arg Asp Leu Pro Leu Arg Phe Tyr Glu Tyr Gly Val Val His 325 330 335 CGG CAT GAA AAA AGC GGC GTG TTG CAT GGG CTT TTA AGG GTT AGG GAA 1112 Arg His Glu Lys Ser Gly Val Leu His Gly Leu Leu Arg Val Arg Glu 340 345 350
TTT ACC CAA GAT GAT GCA CAT ATT TTT TGC TCT TTT GAA CAG ATC CAA 1160 Phe Thr Gin Asp Asp Ala His He Phe Cys Ser Phe Glu Gin He Gin 355 360 365 370
AGC GAA GTG AGC GCG ATT TTA GAT TTT ACG CAC AAA ATC ATG CAA GCG 1208 Ser Glu Val Ser Ala He Leu Asp Phe Thr His Lys He Met Gin Ala 375 380 385
TTT GAT TTT AGC TAT GAA ATG GAA TTA TCC ACA AGG CCG GCT AAA TCC 1256 Phe Asp Phe Ser Tyr Glu Met Glu Leu Ser Thr Arg Pro Ala Lys Ser 390 395 400
ATA GGC GAT GAT AAA GTT TGG GAA AAG GCC ACT AAC GCT TTA AAA GAA 1304 He Gly Asp Asp Lys Val Trp Glu Lys Ala Thr Asn Ala Leu Lys Glu 405 410 415
GCC TTA AAA GAA CAC CGC ATT GAT TAC AAG ATT GAT GAA GGG GGA GGG 1352 Ala Leu Lys Glu His Arg He Asp Tyr Lys He Asp Glu Gly Gly Gly 420 425 430
GCT TTC TAT GGG CCT AAG ATT GAC ATT AAA ATC ACT GAC GCT TTA AAG 1400 Ala Phe Tyr Gly Pro Lys He Asp He Lys He Thr Asp Ala Leu Lys 435 440 445 450
CGT AAA TGG CAG TGT GGC ACG ATT CAA GTG GAT ATG AAT TTG CCT GAA 1448 Arg Lys Trp Gin Cys Gly Thr He Gin Val Asp Met Asn Leu Pro Glu 455 460 465
CGC TTC AAG CTC GCT TTC ACT AAT GAG TAT AAT CAC GCT GAG CAG CCG 1496 Arg Phe Lys Leu Ala Phe Thr Asn Glu Tyr Asn His Ala Glu Gin Pro 470 475 480
GTG ATG ATC CAC AGA GCG ATT TTA GGC TCG TTT GAA AGG TTT ATT GCG 1544 Val Met He His Arg Ala He Leu Gly Ser Phe Glu Arg Phe He Ala 485 490 495
ATT TTG AGC GAA CAT TTT GGG GGG AAT TTC CCT TTC TTT GTC GCG CCC 1592 He Leu Ser Glu His Phe Gly Gly Asn Phe Pro Phe Phe Val Ala Pro 500 505 510
ACT CAA ATC GCT CTC ATC CCT ATT AAT GAA GAG CAT CAT GTT TTT GCT 1640 Thr Gin He Ala Leu He Pro He Asn Glu Glu His His Val Phe Ala 515 520 525 530
TTG AAA TTA AAA GAG GCG CTA AAA AAG CGC GAT ATT TTT GTA GAA GTG 1688 Leu Lys Leu Lys Glu Ala Leu Lys Lys Arg Asp He Phe Val Glu Val 535 540 545
TTA GAT AAA AAC GAC AGC TTG AAT AAA AAG GTG CGA TTA GCC GAA AAG 1736 Leu Asp Lys Asn Asp Ser Leu Asn Lys Lys Val Arg Leu Ala Glu Lys 550 555 560 CAA AAA ATC CCT ATG ATT TTA GTG TTA GGG AAT GAA GAA GTG GAG ACC 1784 Gin Lys He Pro Met He Leu Val Leu Gly Asn Glu Glu Val Glu Thr 565 570 575
GAA ATT TTA TCC ATT AGA GAC AGA GAA AAA CAA GAT CAA TAT AAA ATG 1832 Glu He Leu Ser He Arg Asp Arg Glu Lys Gin Asp Gin Tyr Lys Met 580 585 590
CCC TTA AAG GAG TTT TTA AAC ATG GTT GAA TCT AAG ATG CAA GAG GTT 1880 Pro Leu Lys Glu Phe Leu Asn Met Val Glu Ser Lys Met Gin Glu Val 595 600 605 610
AGT TTT TGAGTAGAAA CGAAGTGTTG TTAAACGGAG ACATTAATTT TAAAGAAGTG CG 1938 Ser Phe
T 1939
(2) INFORMATION FOR SEQ ID NO: 632:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 612 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632:
Met Ser Ala Glu Leu He Ala Val Tyr Lys Asp Glu Gin He He Asp
1 5 10 15
Leu Glu Ser Ala Lys Val Leu Gly Leu Ser Asp Gly He Lys Ala Leu
20 25 30
Asn Gly Thr Glu Pro He Tyr Phe Asp Asp Ser Pro Leu Ala Leu Glu
35 40 45
Val He Arg His Ser Cys Ala His Leu Leu Ala Gin Ser Leu Lys Ala
50 55 60
Leu Tyr Pro Asp Ala Lys Phe Phe Val Gly Pro Val Val Glu Glu Gly 65 70 75 80
Phe Tyr Tyr Asp Phe Lys Thr Ser Ser Lys He Ser Glu Glu Asp Leu
85 90 95
Pro Lys He Glu Ala Lys Met Lys Glu Phe Ala Lys Leu Lys Leu Ala
100 105 110
He Thr Lys Glu Thr Leu Thr Arg Glu Gin Ala Leu Glu Arg Phe Lys
115 120 125
Gly Asp Glu Leu Lys His Ala Val Met Ser Lys He Gly Gly Asp Ala
130 135 140
Phe Gly Val Tyr Gin Gin Gly Glu Phe Glu Asp Leu Cys Lys Gly Pro 145 150 155 160
His Leu Pro Asn Thr Arg Phe Leu Asn His Phe Lys Leu Thr Lys Leu
165 170 175
Ala Gly Ala Tyr Leu Gly Gly Asp Glu Asn Asn Glu Met Leu He Arg 180 185 190 He Tyr Gly He Ala Phe Ala Thr Lys Glu Gly Leu Lys Asp Tyr Leu
195 200 205
Phe Gin He Glu Glu Ala Lys Lys Arg Asp His Arg Lys Leu Gly Val
210 215 220
Glu Leu Gly Leu Phe Ser Phe Asp Asp Glu He Gly Ala Gly Leu Pro 225 230 235 240
Leu Trp Leu Pro Lys Gly Ala Arg Leu Arg Lys Arg He Glu Asp Leu
245 250 255
Leu Ser Gin Ala Leu Leu Leu Arg Gly Tyr Glu Pro Val Lys Gly Pro
260 265 270
Glu He Leu Lys Ser Asp Val Trp Lys He Ser Gly His Tyr Asp Asn
275 280 285
Tyr Lys Glu Asn Met Tyr Phe Thr Thr He Asp Glu Gin Glu Tyr Gly
290 295 300
He Lys Pro Met Asn Cys Val Gly His He Lys Val Tyr Gin Ser Ala 305 310 315 320
Leu His Ser Tyr Arg Asp Leu Pro Leu Arg Phe Tyr Glu Tyr Gly Val
325 330 335
Val His Arg His Glu Lys Ser Gly Val Leu His Gly Leu Leu Arg Val
340 345 350
Arg Glu Phe Thr Gin Asp Asp Ala His He Phe Cys Ser Phe Glu Gin
355 360 365
He Gin Ser Glu Val Ser Ala He Leu Asp Phe Thr His Lys He Met
370 375 380
Gin Ala Phe Asp Phe Ser Tyr Glu Met Glu Leu Ser Thr Arg Pro Ala 385 390 395 400
Lys Ser He Gly Asp Asp Lys Val Trp Glu Lys Ala Thr Asn Ala Leu
405 410 415
Lys Glu Ala Leu Lys Glu His Arg He Asp Tyr Lys He Asp Glu Gly
420 425 430
Gly Gly Ala Phe Tyr Gly Pro Lys He Asp He Lys He Thr Asp Ala
435 440 445
Leu Lys Arg Lys Trp Gin Cys Gly Thr He Gin Val Asp Met Asn Leu
450 455 460
Pro Glu Arg Phe Lys Leu Ala Phe Thr Asn Glu Tyr Asn His Ala Glu 465 470 475 480
Gin Pro Val Met He His Arg Ala He Leu Gly Ser Phe Glu Arg Phe
485 490 495
He Ala He Leu Ser Glu His Phe Gly Gly Asn Phe Pro Phe Phe Val
500 505 510
Ala Pro Thr Gin He Ala Leu He Pro He Asn Glu Glu His His Val
515 520 525
Phe Ala Leu Lys Leu Lys Glu Ala Leu Lys Lys Arg Asp He Phe Val
530 535 540
Glu Val Leu Asp Lys Asn Asp Ser Leu Asn Lys Lys Val Arg Leu Ala 545 550 555 560
Glu Lys Gin Lys He Pro Met He Leu Val Leu Gly Asn Glu Glu Val
565 570 575
Glu Thr Glu He Leu Ser He Arg Asp Arg Glu Lys Gin Asp Gin Tyr
580 585 590
Lys Met Pro Leu Lys Glu Phe Leu Asn Met Val Glu Ser Lys Met Gin
595 600 605
Glu Val Ser Phe 610 (2) INFORMATION FOR SEQ ID NO: 633:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1198 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1145 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633:
GCGATTGTGC GGAGTCGTTT TCTCAATTTA GCGCCAATCG GATTAGAGAC ATG TTT 56
Met Phe
1
AAA GTA ATG ATG CAA ATG GCG ATC GTT CTC ACT TTT GCT GGC TCT ATA 104 Lys Val Met Met Gin Met Ala He Val Leu Thr Phe Ala Gly Ser He 5 10 15
CCG ATC GTG AAA GTG GGG CGC ATT GCC GGG CAA TTT GCC AAG CCT CGC 152 Pro He Val Lys Val Gly Arg He Ala Gly Gin Phe Ala Lys Pro Arg 20 25 30
TCC AAT GCG ACT GAA ATG CTG GAT AAT GAA GAA GTG TTG AGT TAC AGA 200 Ser Asn Ala Thr Glu Met Leu Asp Asn Glu Glu Val Leu Ser Tyr Arg 35 40 45 50
GGG GAT ATT ATC AAT GGG ATT TCC AAA AAA GAA AGA GAG CCA AAT CCT 248 Gly Asp He He Asn Gly He Ser Lys Lys Glu Arg Glu Pro Asn Pro 55 60 65
GAA AGA ATG CTT AAG GCC TAC CAT CAA AGC GTA GCG ACT TTA AAC CTT 296 Glu Arg Met Leu Lys Ala Tyr His Gin Ser Val Ala Thr Leu Asn Leu 70 75 80
ATC AGA GCC TTT GCT CAA GGC GGG TTA GCG GAT TTG GAG CAA GTG CAT 344 He Arg Ala Phe Ala Gin Gly Gly Leu Ala Asp Leu Glu Gin Val His 85 90 95
CGT TTC AAT TTG GAT TTT GTC AAA AAC AAC GAC TTT GGG CAA AAA TAC 392 Arg Phe Asn Leu Asp Phe Val Lys Asn Asn Asp Phe Gly Gin Lys Tyr 100 105 110
CAG CAA ATC GCT GAC CGG ATC ACG CAA GCT TTA GGG TTT ATG CGA GCA 440 Gin Gin He Ala Asp Arg He Thr Gin Ala Leu Gly Phe Met Arg Ala 115 120 125 130 TGC GGG GTG GAG ATA GAG CGA ACG CCT ATT CTT AGG GAA GTG GAA TTT 488 Cys Gly Val Glu He Glu Arg Thr Pro He Leu Arg Glu Val Glu Phe 135 140 145
TAC ACC AGC CAC GAA GCG TTA CTG CTC CAT TAT GAA GAG CCG TTG GTG 536 Tyr Thr Ser His Glu Ala Leu Leu Leu His Tyr Glu Glu Pro Leu Val 150 155 160
CGT AAG GAT AGT CTG ACT AAC CAG TTT TAT GAT TGC TCC GCG CAC ATG 584 Arg Lys Asp Ser Leu Thr Asn Gin Phe Tyr Asp Cys Ser Ala His Met 165 170 175
CTA TGG ATT GGC GAA AGG ACA AGA GAC CCT AAG GGT GCG CAT GTG GAG 632 Leu Trp He Gly Glu Arg Thr Arg Asp Pro Lys Gly Ala His Val Glu 180 185 190
TTT TTA AGG GGG GTT TGT AAC CCT ATT GGC GTG AAA ATC GGG CCT AAT 680 Phe Leu Arg Gly Val Cys Asn Pro He Gly Val Lys He Gly Pro Asn 195 200 205 210
GCG AGC GTG AGC GAA GTG TTA GAA TTG TGC GAT GTT TTA AAC CCG CGC 728 Ala Ser Val Ser Glu Val Leu Glu Leu Cys Asp Val Leu Asn Pro Arg 215 220 225
AAC ATT AAG GGG CGT TTG AAT TTG ATC GTG CGC ATG GGT TCT AAG ATG 776 Asn He Lys Gly Arg Leu Asn Leu He Val Arg Met Gly Ser Lys Met 230 235 240
ATT AAA GAG CGT TTG CCT AAA CTT TTA CAA GGG GTG TTG GAA GAA AAA 824 He Lys Glu Arg Leu Pro Lys Leu Leu Gin Gly Val Leu Glu Glu Lys 245 250 255
CGC CAT ATT TTA TGG AGC ATT GAT CCC ATG CAT GGC AAC ACG GTT AAA 872 Arg His He Leu Trp Ser He Asp Pro Met His Gly Asn Thr Val Lys 260 265 270
ACC AGC TTG GGG GTT AAA ACA AGG GCT TTT GAT AGC GTG TTA GAT GAA 920 Thr Ser Leu Gly Val Lys Thr Arg Ala Phe Asp Ser Val Leu Asp Glu 275 280 285 290
GTG AAA AGC TTT TTT GAA ATC CAT AGG GCT GAA GGG AGT TTG GCT TCA 968 Val Lys Ser Phe Phe Glu He His Arg Ala Glu Gly Ser Leu Ala Ser 295 300 305
GGG GTT CAT TTG GAA ATG ACA GGT GAG AAT GTT ACA GAA TGT ATC GGT 1016 Gly Val His Leu Glu Met Thr Gly Glu Asn Val Thr Glu Cys He Gly 310 315 320
GGC TCG CAA GCG ATC ACC GAA GAG GGT TTG AGC TGC CAT TAC TAC ACG 1064 Gly Ser Gin Ala He Thr Glu Glu Gly Leu Ser Cys His Tyr Tyr Thr 325 330 335
CAA TGC GAT CCA AGA TTA AAC GCC ACC CAA GCC CTA GAA CTC GCC TTC 1112 Gin Cys Asp Pro Arg Leu Asn Ala Thr Gin Ala Leu Glu Leu Ala Phe 340 345 350 TTA ATC GCT GAC ATG CTC AAA AAA CAG CAC GCT TAGTTAAAAA GAGATTAATC 1165 Leu He Ala Asp Met Leu Lys Lys Gin His Ala 355 360 365
TTTTTTTAAC TCTTTTACTT TATAATTATC GTT 1198
(2) INFORMATION FOR SEQ ID NO : 634 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 365 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634:
Met Phe Lys Val Met Met Gin Met Ala He Val Leu Thr Phe Ala Gly
1 5 10 15
Ser He Pro He Val Lys Val Gly Arg He Ala Gly Gin Phe Ala Lys
20 25 30
Pro Arg Ser Asn Ala Thr Glu Met Leu Asp Asn Glu Glu Val Leu Ser
35 40 45
Tyr Arg Gly Asp He He Asn Gly He Ser Lys Lys Glu Arg Glu Pro
50 55 60
Asn Pro Glu Arg Met Leu Lys Ala Tyr His Gin Ser Val Ala Thr Leu 65 70 75 80
Asn Leu He Arg Ala Phe Ala Gin Gly Gly Leu Ala Asp Leu Glu Gin
85 90 95
Val His Arg Phe Asn Leu Asp Phe Val Lys Asn Asn Asp Phe Gly Gin
100 105 110
Lys Tyr Gin Gin He Ala Asp Arg He Thr Gin Ala Leu Gly Phe Met
115 120 125
Arg Ala Cys Gly Val Glu He Glu Arg Thr Pro He Leu Arg Glu Val
130 135 140
Glu Phe Tyr Thr Ser His Glu Ala Leu Leu Leu His Tyr Glu Glu Pro 145 150 155 160
Leu Val Arg Lys Asp Ser Leu Thr Asn Gin Phe Tyr Asp Cys Ser Ala
165 170 175
His Met Leu Trp He Gly Glu Arg Thr Arg Asp Pro Lys Gly Ala His
180 185 190
Val Glu Phe Leu Arg Gly Val Cys Asn Pro He Gly Val Lys He Gly
195 200 205
Pro Asn Ala Ser Val Ser Glu Val Leu Glu Leu Cys Asp Val Leu Asn
210 215 220
Pro Arg Asn He Lys Gly Arg Leu Asn Leu He Val Arg Met Gly Ser 225 230 235 240
Lys Met He Lys Glu Arg Leu Pro Lys Leu Leu Gin Gly Val Leu Glu
245 250 255
Glu Lys Arg His He Leu Trp Ser He Asp Pro Met His Gly Asn Thr
260 265 270
Val Lys Thr Ser Leu Gly Val Lys Thr Arg Ala Phe Asp Ser Val Leu 275 280 285 Asp Glu Val Lys Ser Phe Phe Glu He His Arg Ala Glu Gly Ser Leu
290 295 300
Ala Ser Gly Val His Leu Glu Met Thr Gly Glu Asn Val Thr Glu Cys 305 310 315 320
He Gly Gly Ser Gin Ala He Thr Glu Glu Gly Leu Ser Cys His Tyr
325 330 335
Tyr Thr Gin Cys Asp Pro Arg Leu Asn Ala Thr Gin Ala Leu Glu Leu
340 345 350
Ala Phe Leu He Ala Asp Met Leu Lys Lys Gin His Ala 355 360 365
(2) INFORMATION FOR SEQ ID NO: 635:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 388 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...335 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635:
TGATAGCGAC TTTTTGAGGC CCATGCACCC CAAAAACGGT TTTTAATTCA ATG TCA 56
Met Ser
1
GCT GTC CGG CTA GGC CCG CCA ATA AGG AGC ATG TTT GTG GGT AAT GCA 104 Ala Val Arg Leu Gly Pro Pro He Arg Ser Met Phe Val Gly Asn Ala 5 10 15
CCG TTT TGG CTT TGG TTT TTT AAA GCT TGC ATG CCT TCA CTC AAA TTG 152 Pro Phe Trp Leu Trp Phe Phe Lys Ala Cys Met Pro Ser Leu Lys Leu 20 25 30
CGC ACA ATG GAT TCT TTT TTC AAT AAG ATG ATG CAA TTA AGG GTG ATG 200 Arg Thr Met Asp Ser Phe Phe Asn Lys Met Met Gin Leu Arg Val Met 35 40 45 50
AGC GAA AGC AAT CGC GGG CTT GCA TGC GAA GAG ACC GCC CCA ATC ATG 248 Ser Glu Ser Asn Arg Gly Leu Ala Cys Glu Glu Thr Ala Pro He Met 55 60 65
CCC AAG CTT GAA ATC CCA CAA ACC CCA TGC AAT AAA GCC GTA TCA ATC 296 Pro Lys Leu Glu He Pro Gin Thr Pro Cys Asn Lys Ala Val Ser He 70 75 80 TCA AAC AAC TCT TCA CGC ATC GCT TCA ATT TCT TTA TCA TAAGGCTGTA AA 347 Ser Asn Asn Ser Ser Arg He Ala Ser He Ser Leu Ser 85 90 95
GTAAAATCCT TAAACGCTTC AAAATTCAAA TTCAAATCTG T 38£
(2) INFORMATION FOR SEQ ID NO: 636:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 95 ammo acids
(B) TYPE: ammo acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636:
Met Ser Ala Val Arg Leu Gly Pro Pro He Arg Ser Met Phe Val Gly
1 5 10 15
Asn Ala Pro Phe Trp Leu Trp Phe Phe Lys Ala Cys Met Pro Ser Leu
20 25 30
Lys Leu Arg Thr Met Asp Ser Phe Phe Asn Lys Met Met Gin Leu Arg
35 40 45
Val Met Ser Glu Ser Asn Arg Gly Leu Ala Cys Glu Glu Thr Ala Pro
50 55 60
He Met Pro Lys Leu Glu He Pro Gin Thr Pro Cys Asn Lys Ala Val 65 70 75 80
Ser He Ser Asn Asn Ser Ser Arg He Ala Ser He Ser Leu Ser 85 90 95
(2) INFORMATION FOR SEQ ID NO: 637:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1756 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1703 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637:
TTTTAAAGAG AACTAGCACT AAGAGAATAT TTTTAAAAAG GGATTTTTTA GTG CTA 56
Val Leu 1 GAA TTT CAT CAA ATT TAT GAT CCT TTG GGT AAT ATT TGG CTG AGC GCT 104 Glu Phe His Gin He Tyr Asp Pro Leu Gly Asn He Trp Leu Ser Ala 5 10 15
CTT GTG GCC TTA TTG CCG ATT TTA TTG TTT TTC TTA TCT TTA ATG GTT 152 Leu Val Ala Leu Leu Pro He Leu Leu Phe Phe Leu Ser Leu Met Val 20 25 30
TTT AAA CTC AAA GGT TAT ACA GCG GCC TTT TTG AGC GTG GCC TTA TCA 200 Phe Lys Leu Lys Gly Tyr Thr Ala Ala Phe Leu Ser Val Ala Leu Ser 35 40 45 50
GCC GTT ATT GCG GTT TTA GTG TAT AAA ATG CCT GTT AGC ATG GTG GGT 248 Ala Val He Ala Val Leu Val Tyr Lys Met Pro Val Ser Met Val Gly 55 60 65
TCA AGC TTC CTT TAC GGC TTT CTT TAT GGC TTA TGG CCG ATC GCT TGG 296 Ser Ser Phe Leu Tyr Gly Phe Leu Tyr Gly Leu Trp Pro He Ala Trp 70 75 80
ATC ATT ATT GCG GCG ATT TTT TTA TAC AAA CTC AGC GTT AAA TCC GGC 344 He He He Ala Ala He Phe Leu Tyr Lys Leu Ser Val Lys Ser Gly 85 90 95
TAT TTT GAA ATT TTA AAA GAA AGC GTC CAG TCC ATC ACT TTA GAT CAC 392 Tyr Phe Glu He Leu Lys Glu Ser Val Gin Ser He Thr Leu Asp His 100 105 110
CGC ATT TTA GTG ATT TTG ATT GGC TTT TGT TTT GGC TCG TTT TTA GAA 440 Arg He Leu Val He Leu He Gly Phe Cys Phe Gly Ser Phe Leu Glu 115 120 125 130
GGG GCG ATC GGC TTT GGA GGG CCT ATT GCC ATT ACC GCA GCG ATT TTA 488 Gly Ala He Gly Phe Gly Gly Pro He Ala He Thr Ala Ala He Leu 135 140 145
GTG GGC TTA GGG TTA AGC CCT TTG TAT TCT GCC GGG TTA TGT TTG ATC 536 Val Gly Leu Gly Leu Ser Pro Leu Tyr Ser Ala Gly Leu Cys Leu He 150 155 160
GCT AAT ACC GCT CCT GTA GCT TTT GGC GCG GTG GGT ATC CCT ATA AGT 584 Ala Asn Thr Ala Pro Val Ala Phe Gly Ala Val Gly He Pro He Ser 165 170 175
GCT ATG GCG AGC GCG GTA GGG GTG CCA GCG ATT TTA ATT TCA GCC ATG 632 Ala Met Ala Ser Ala Val Gly Val Pro Ala He Leu He Ser Ala Met 180 185 190
ACG GGT AAA ATC CTC TTT TTT GTG AGC TTG TTA GTG CCG TTT TTC ATT 680 Thr Gly Lys He Leu Phe Phe Val Ser Leu Leu Val Pro Phe Phe He 195 200 205 210
GTG TTT TTA ATG GAT GGC TTT AAA GGG ATT AAA GAA ACT TTT CCG GCC 728 Val Phe Leu Met Asp Gly Phe Lys Gly He Lys Glu Thr Phe Pro Ala 215 220 225 GTT TTT ATC GCG GCT TTT TCT TTC GCT GGT GCG CAA TTT TTA AGC TCT 776 Val Phe He Ala Ala Phe Ser Phe Ala Gly Ala Gin Phe Leu Ser Ser 230 235 240
AAT TAT TTA GGG CCA GAA TTG CCT GGT ATT ATT TCA GCC CTT GTT TCA 824 Asn Tyr Leu Gly Pro Glu Leu Pro Gly He He Ser Ala Leu Val Ser 245 250 255
CTC GTT GCA ACA GCG CTC TTT TTG AAA TTT TGG CAG CCT AAA GCG ATT 872 Leu Val Ala Thr Ala Leu Phe Leu Lys Phe Trp Gin Pro Lys Ala He 260 265 270
TTT AGA AGC GAC GGC AAA GCG GCT TCG TTC ACT AAG AGT AAC CAT CAT 920 Phe Arg Ser Asp Gly Lys Ala Ala Ser Phe Thr Lys Ser Asn His His 275 280 285 290
ATT TGT AAG ATC TAT GTC GCT TGG TCT CCT TTT GTG ATT TTA GTT TTA 968 He Cys Lys He Tyr Val Ala Trp Ser Pro Phe Val He Leu Val Leu 295 300 305
GTG ATT GTG CTA TGG ATA CAG CCT TTT TTT AAA GCC TTG TTT GAA AAA 1016 Val He Val Leu Trp He Gin Pro Phe Phe Lys Ala Leu Phe Glu Lys 310 315 320
GAC GGC TTG TTA GCT TTT TCT AAT TTT TAT TTT GAA TTC AAT AAC ATC 1064 Asp Gly Leu Leu Ala Phe Ser Asn Phe Tyr Phe Glu Phe Asn Asn He 325 330 335
AGT AAC CAC ATC TTT AAA AGC CCG CCT TTT GTA GAA GCC AAT CAA AGC 1112 Ser Asn His He Phe Lys Ser Pro Pro Phe Val Glu Ala Asn Gin Ser 340 345 350
GTG AGT TTT CCG GTG GTG TTT AAA TTT CTC TTA ATC AAC ACG GTT GGC 1160 Val Ser Phe Pro Val Val Phe Lys Phe Leu Leu He Asn Thr Val Gly 355 360 365 370
ACT TCC ATT TTT TTA GCC GCT CTT GTT AGC ATG CTC GTT TTA AGG GTG 1208 Thr Ser He Phe Leu Ala Ala Leu Val Ser Met Leu Val Leu Arg Val 375 380 385
CGA GTG AGC GAT GCG CTG AGC GTC TTT GGC GAG ACT TTA AAA GAA ATG 1256 Arg Val Ser Asp Ala Leu Ser Val Phe Gly Glu Thr Leu Lys Glu Met 390 395 400
CGT TAC CCC ATT CTC ACC ATT GGT TTA GTC TTA AGC TTT GCC TAT GTG 1304 Arg Tyr Pro He Leu Thr He Gly Leu Val Leu Ser Phe Ala Tyr Val 405 410 415
TCT AAT TAC AGC GGG ATT TCT TCC ACT CTA GCC TTA GCG CTC ACG CAT 1352 Ser Asn Tyr Ser Gly He Ser Ser Thr Leu Ala Leu Ala Leu Thr His 420 425 430
ACG GGT TTG GCT TTC ACC TTT TTC TCG CCC TTG ATC GGG TGG GTA GGC 1400 Thr Gly Leu Ala Phe Thr Phe Phe Ser Pro Leu He Gly Trp Val Gly 435 440 445 450 GTG TTT TTA ACC GGG AGC GAT ACG AGT TCC AAT CTT TTG TTT GGC TCT 1448 Val Phe Leu Thr Gly Ser Asp Thr Ser Ser Asn Leu Leu Phe Gly Ser 455 460 465
TTA CAG CAA CTC ACC GCC CAA CGA TTG CAC CTC CCT GAG GTT TTA ACC 1496 Leu Gin Gin Leu Thr Ala Gin Arg Leu His Leu Pro Glu Val Leu Thr 470 475 480
CTA ACG GCT AAT ACC GTG GGT GGC ACT TTA GGC AAG ATG ATA AGC CCT 1544 Leu Thr Ala Asn Thr Val Gly Gly Thr Leu Gly Lys Met He Ser Pro 485 490 495
CAA AGC ATC GCT ATC GCT TGC GCG GCG GTG GGG TTA GCC GGG AAA GAG 1592 Gin Ser He Ala He Ala Cys Ala Ala Val Gly Leu Ala Gly Lys Glu 500 505 510
AGC GAT TTG TTC AAA TTC ACG GTT AAA TAC TCC CTT ATT TTT GTA GCG 1640 Ser Asp Leu Phe Lys Phe Thr Val Lys Tyr Ser Leu He Phe Val Ala 515 520 525 530
ATC ATG GGA GTT GTG ATC AGC GCG ATT GCG TAT TTG ATC CCT GAA GTG 1688 He Met Gly Val Val He Ser Ala He Ala Tyr Leu He Pro Glu Val 535 540 545
GTG CCT GCG ATA AAG TAGGGCCATT TTAGATTTAG CAGGGTTTAA CCCCCAAATA A 1744 Val Pro Ala He Lys 550
ATTTTTTTGT TT 1756
(2) INFORMATION FOR SEQ ID NO: 638:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 551 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638:
Val Leu Glu Phe His Gin He Tyr Asp Pro Leu Gly Asn He Trp Leu
1 5 10 15
Ser Ala Leu Val Ala Leu Leu Pro He Leu Leu Phe Phe Leu Ser Leu
20 25 30
Met Val Phe Lys Leu Lys Gly Tyr Thr Ala Ala Phe Leu Ser Val Ala
35 40 45
Leu Ser Ala Val He Ala Val Leu Val Tyr Lys Met Pro Val Ser Met
50 55 60
Val Gly Ser Ser Phe Leu Tyr Gly Phe Leu Tyr Gly Leu Trp Pro He 65 70 75 80
Ala Trp He He He Ala Ala He Phe Leu Tyr Lys Leu Ser Val Lys 85 90 95 Ser Gly Tyr Phe Glu He Leu Lys Glu Ser Val Gin Ser He Thr Leu
100 105 110
Asp His Arg He Leu Val He Leu He Gly Phe Cys Phe Gly Ser Phe
115 120 125
Leu Glu Gly Ala He Gly Phe Gly Gly Pro He Ala He Thr Ala Ala
130 135 140
He Leu Val Gly Leu Gly Leu Ser Pro Leu Tyr Ser Ala Gly Leu Cys 145 150 155 160
Leu He Ala Asn Thr Ala Pro Val Ala Phe Gly Ala Val Gly He Pro
165 170 175
He Ser Ala Met Ala Ser Ala Val Gly Val Pro Ala He Leu He Ser
180 185 190
Ala Met Thr Gly Lys He Leu Phe Phe Val Ser Leu Leu Val Pro Phe
195 200 205
Phe He Val Phe Leu Met Asp Gly Phe Lys Gly He Lys Glu Thr Phe
210 215 220
Pro Ala Val Phe He Ala Ala Phe Ser Phe Ala Gly Ala Gin Phe Leu 225 230 235 240
Ser Ser Asn Tyr Leu Gly Pro Glu Leu Pro Gly He He Ser Ala Leu
245 250 255
Val Ser Leu Val Ala Thr Ala Leu Phe Leu Lys Phe Trp Gin Pro Lys
260 265 270
Ala He Phe Arg Ser Asp Gly Lys Ala Ala Ser Phe Thr Lys Ser Asn
275 280 285
His His He Cys Lys He Tyr Val Ala Trp Ser Pro Phe Val He Leu
290 295 300
Val Leu Val He Val Leu Trp He Gin Pro Phe Phe Lys Ala Leu Phe 305 310 315 320
Glu Lys Asp Gly Leu Leu Ala Phe Ser Asn Phe Tyr Phe Glu Phe Asn
325 330 335
Asn He Ser Asn His He Phe Lys Ser Pro Pro Phe Val Glu Ala Asn
340 345 350
Gin Ser Val Ser Phe Pro Val Val Phe Lys Phe Leu Leu He Asn Thr
355 360 365
Val Gly Thr Ser He Phe Leu Ala Ala Leu Val Ser Met Leu Val Leu
370 375 380
Arg Val Arg Val Ser Asp Ala Leu Ser Val Phe Gly Glu Thr Leu Lys 385 390 395 400
Glu Met Arg Tyr Pro He Leu Thr He Gly Leu Val Leu Ser Phe Ala
405 410 415
Tyr Val Ser Asn Tyr Ser Gly He Ser Ser Thr Leu Ala Leu Ala Leu
420 425 430
Thr His Thr Gly Leu Ala Phe Thr Phe Phe Ser Pro Leu He Gly Trp
435 440 445
Val Gly Val Phe Leu Thr Gly Ser Asp Thr Ser Ser Asn Leu Leu Phe
450 455 460
Gly Ser Leu Gin Gin Leu Thr Ala Gin Arg Leu His Leu Pro Glu Val 465 470 475 480
Leu Thr Leu Thr Ala Asn Thr Val Gly Gly Thr Leu Gly Lys Met He
485 490 495
Ser Pro Gin Ser He Ala He Ala Cys Ala Ala Val Gly Leu Ala Gly
500 505 510
Lys Glu Ser Asp Leu Phe Lys Phe Thr Val Lys Tyr Ser Leu He Phe
515 520 525
Val Ala He Met Gly Val Val He Ser Ala He Ala Tyr Leu He Pro 530 535 540
Glu Val Val Pro Ala He Lys 545 550
(2) INFORMATION FOR SEQ ID NO: 639:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 961 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...908 (D) OTHER INFORMATION: .
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639:
TGAACCACGC CATAAAAAAG TTCATGATAA TGGCATAAAG GAAAGTTGAA ATG GAT 56
Met Asp 1
TTT TTA AAC GAC CAT ATA AAT GTT TTT GGC TTG ATT GCA GCG CTT GTG 104 Phe Leu Asn Asp His He Asn Val Phe Gly Leu He Ala Ala Leu Val 5 10 15
ATT TTA GTT TTA ACC ATC TAT GAA TCC AGT TCG CTC ATT AAA GAA ATG 152 He Leu Val Leu Thr He Tyr Glu Ser Ser Ser Leu He Lys Glu Met 20 25 30
CGC GAC AGC AAA TCT CAA GGT GAG CTT GTA GAA AAT GGG CAT TTG ATT 200 Arg Asp Ser Lys Ser Gin Gly Glu Leu Val Glu Asn Gly His Leu He 35 40 45 50
GAT GGG ATA GGG GAG TTT GCC AAT AAT GTG CCA GTA GGC TGG ATC GCA 248 Asp Gly He Gly Glu Phe Ala Asn Asn Val Pro Val Gly Trp He Ala 55 60 65
AGC TTT ATG TGC ACG ATT GTG TGG GCT TTT TGG TAT TTC TTC TTT GGG 296 Ser Phe Met Cys Thr He Val Trp Ala Phe Trp Tyr Phe Phe Phe Gly 70 75 80
TAT CCG CTG AAT AGC TTT TCT CAA ATC GGG CAA TAC AAT GAA GAG GTT 344 Tyr Pro Leu Asn Ser Phe Ser Gin He Gly Gin Tyr Asn Glu Glu Val 85 90 95
AAA GCG CAC AAC CAA AAA TTT GAG GCC AAG TGG AAG CAT TTG GGT CAA 392 Lys Ala His Asn Gin Lys Phe Glu Ala Lys Trp Lys His Leu Gly Gin 100 105 110 AAG GAA CTG GTG GAT ATG GGT CAA GGC ATC TTT TTA GTC CAT TGT TCG 440 Lys Glu Leu Val Asp Met Gly Gin Gly He Phe Leu Val His Cys Ser 115 120 125 130
CAA TGC CAT GGC ATC ACC GCT GAG GGC TTG CAT GGG AGC GCT CAA AAT 488 Gin Cys His Gly He Thr Ala Glu Gly Leu His Gly Ser Ala Gin Asn 135 140 145
CTG GTG CGC TGG GGT AAA GAA GAG GGT ATT ATG GAT ACC ATT AAG CAT 536 Leu Val Arg Trp Gly Lys Glu Glu Gly He Met Asp Thr He Lys His 150 155 160
GGC TCT AAG GGC ATG GAT TAT CTC GCT GGG GAA ATG CCC GCT ATG GAA 584 Gly Ser Lys Gly Met Asp Tyr Leu Ala Gly Glu Met Pro Ala Met Glu 165 170 175
TTG GAC GAA AAA GAC GCT AAA GCG ATC GCA AGC TAT GTG ATG GCA GAA 632 Leu Asp Glu Lys Asp Ala Lys Ala He Ala Ser Tyr Val Met Ala Glu 180 185 190
CTT TCT AGC GTT AAA AAA ACC AAA AAC CCT CAA CTC ATT GAT AAA GGC 680 Leu Ser Ser Val Lys Lys Thr Lys Asn Pro Gin Leu He Asp Lys Gly 195 200 205 210
AAG GAA TTG TTT GAA AGC ATG GGT TGC ACA GGC TGT CAT GGC AAT GAT 728 Lys Glu Leu Phe Glu Ser Met Gly Cys Thr Gly Cys His Gly Asn Asp 215 220 225
GGT AAG GGC TTG CAA GAA AAT CAA GTG TTT GCA GCC GAT TTG ACC GCT 776 Gly Lys Gly Leu Gin Glu Asn Gin Val Phe Ala Ala Asp Leu Thr Ala 230 235 240
TAC GGC ACA GAG AAT TTT TTG AGA AAT ATC TTA ACG CAT GGC AAA AAG 824 Tyr Gly Thr Glu Asn Phe Leu Arg Asn He Leu Thr His Gly Lys Lys 245 250 255
GGC AAT ATA GGG CAT ATG CCA TCA TTC AAG TAT AAA AAC TTT AGC GAT 872 Gly Asn He Gly His Met Pro Ser Phe Lys Tyr Lys Asn Phe Ser Asp 260 265 270
TTG CAA GTT AAA GCG TTA CTG AAT TTA TCC AAT CGC TAAAACCCTT AGAAGA 924 Leu Gin Val Lys Ala Leu Leu Asn Leu Ser Asn Arg 275 280 285
TTAAAGGAAA AGAGATGAAA TTTTTAAACG GATTAGC 961
(2) INFORMATION FOR SEQ ID NO: 640:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 286 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640:
Met Asp Phe Leu Asn Asp His He Asn Val Phe Gly Leu He Ala Ala
1 5 10 15
Leu Val He Leu Val Leu Thr He Tyr Glu Ser Ser Ser Leu He Lys
20 25 30
Glu Met Arg Asp Ser Lys Ser Gin Gly Glu Leu Val Glu Asn Gly His
35 40 45
Leu He Asp Gly He Gly Glu Phe Ala Asn Asn Val Pro Val Gly Trp
50 55 60
He Ala Ser Phe Met Cys Thr He Val Trp Ala Phe Trp Tyr Phe Phe 65 70 75 80
Phe Gly Tyr Pro Leu Asn Ser Phe Ser Gin He Gly Gin Tyr Asn Glu
85 90 95
Glu Val Lys Ala His Asn Gin Lys Phe Glu Ala Lys Trp Lys His Leu
100 105 110
Gly Gin Lys Glu Leu Val Asp Met Gly Gin Gly He Phe Leu Val His
115 120 125
Cys Ser Gin Cys His Gly He Thr Ala Glu Gly Leu His Gly Ser Ala
130 135 140
Gin Asn Leu Val Arg Trp Gly Lys Glu Glu Gly He Met Asp Thr He 145 150 155 160
Lys His Gly Ser Lys Gly Met Asp Tyr Leu Ala Gly Glu Met Pro Ala
165 170 175
Met Glu Leu Asp Glu Lys Asp Ala Lys Ala He Ala Ser Tyr Val Met
180 185 190
Ala Glu Leu Ser Ser Val Lys Lys Thr Lys Asn Pro Gin Leu He Asp
195 200 205
Lys Gly Lys Glu Leu Phe Glu Ser Met Gly Cys Thr Gly Cys His Gly
210 215 220
Asn Asp Gly Lys Gly Leu Gin Glu Asn Gin Val Phe Ala Ala Asp Leu 225 230 235 240
Thr Ala Tyr Gly Thr Glu Asn Phe Leu Arg Asn He Leu Thr His Gly
245 250 255
Lys Lys Gly Asn He Gly His Met Pro Ser Phe Lys Tyr Lys Asn Phe
260 265 270
Ser Asp Leu Gin Val Lys Ala Leu Leu Asn Leu Ser Asn Arg 275 280 285
(2) INFORMATION FOR SEQ ID NO: 641:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 307 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...254 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641:
TACTGAATTT ATCCAATCGC TAAAACCCTT AGAAGATTAA AGGAAAAGAG ATG AAA 56
Met Lys 1
TTT TTA AAC GGA TTA GCA GGG AAT TTA CTG ATT GTG GTT ATT TTA TTG 104 Phe Leu Asn Gly Leu Ala Gly Asn Leu Leu He Val Val He Leu Leu 5 10 15
TGT GTG GCC GTT TTT TTT ACG CTC AAA GCG ATC CAT ATC CAA AAA GAG 152 Cys Val Ala Val Phe Phe Thr Leu Lys Ala He His He Gin Lys Glu 20 25 30
CAA GCC ACC AAT TAT TAC CGC TAT AAG GAT ATT AAC GCT TTA GAG ACA 200 Gin Ala Thr Asn Tyr Tyr Arg Tyr Lys Asp He Asn Ala Leu Glu Thr 35 40 45 50
AAA AAC ACC CAA AAC CGG GCT AAC TAT GAA TTA GTC AAT CAA GGG AGT 248 Lys Asn Thr Gin Asn Arg Ala Asn Tyr Glu Leu Val Asn Gin Gly Ser 55 60 65
AAA AAA TGAAATTCAC GACTTTAGAA AAAATTTTAG CTTTGATGGT AGTAGCGACC AT 306 Lys Lys
T 307
(2) INFORMATION FOR SEQ ID NO: 642:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 68 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642:
Met Lys Phe Leu Asn Gly Leu Ala Gly Asn Leu Leu He Val Val He
1 5 10 15
Leu Leu Cys Val Ala Val Phe Phe Thr Leu Lys Ala He His He Gin
20 25 30
Lys Glu Gin Ala Thr Asn Tyr Tyr Arg Tyr Lys Asp He Asn Ala Leu
35 40 45
Glu Thr Lys Asn Thr Gin Asn Arg Ala Asn Tyr Glu Leu Val Asn Gin
50 55 60
Gly Ser Lys Lys 65 (2) INFORMATION FOR SEQ ID NO: 643:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 843 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...770 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643:
TTATTAGCTA TCATGCTAAA GAAGTGGCTA ACTTATTACA AAGGAATTGA ATG GGA 56
Met Gly 1
CGA GCG TTT GAA TAC AGA AGA GCG GCT AAA GAA AAA CGA TGG GAT AAG 104 Arg Ala Phe Glu Tyr Arg Arg Ala Ala Lys Glu Lys Arg Trp Asp Lys 5 10 15
ATG AGT AAG GTT TTC CCA AAG CTC GCT AAA GCG ATC ACT CTA GCG GCA 152 Met Ser Lys Val Phe Pro Lys Leu Ala Lys Ala He Thr Leu Ala Ala 20 25 30
AAA GAT GGC GGG AGC GAA CCG GAC ACG AAC GCC AAA CTA CGA ACA GCG 200 Lys Asp Gly Gly Ser Glu Pro Asp Thr Asn Ala Lys Leu Arg Thr Ala 35 40 45 50
ATT TTA AAC GCT AAA GCG CAA AAC ATG CCT AAA GAC AAT ATT GAC GCA 248 He Leu Asn Ala Lys Ala Gin Asn Met Pro Lys Asp Asn He Asp Ala 55 60 65
GCG ATT AAA AGA GCG AGC AGT AAA GAA GGG AAT TTG AGT GAA ATC ACT 296 Ala He Lys Arg Ala Ser Ser Lys Glu Gly Asn Leu Ser Glu He Thr 70 75 80
TAT GAA GGT AAG GCG AAT TTT GGC GTG CTA ATC ATC ATG GAA TGC ATG 344 Tyr Glu Gly Lys Ala Asn Phe Gly Val Leu He lie Met Glu Cys Met 85 90 95
ACT GAT AAC CCC ACC AGA ACC ATT GCC AAC CTT AAA AGC TAT TTC AAT 392 Thr Asp Asn Pro Thr Arg Thr He Ala Asn Leu Lys Ser Tyr Phe Asn 100 105 110
AAA ACG CAA GGG GCA AGC ATC GTG CCT AAT GGC TCT TTA GAG TTT ATG 440 Lys Thr Gin Gly Ala Ser He Val Pro Asn Gly Ser Leu Glu Phe Met 115 120 125 130 TTT AAC CGA AAA AGC GTG TTT GAA TGC TTG AAA AAT GAA GTG GAA AAT 488 Phe Asn Arg Lys Ser Val Phe Glu Cys Leu Lys Asn Glu Val Glu Asn 135 140 145
TTA AAA CTC AGT CTA GAA GAT TTA GAA TTC GCT CTC ATT GAT TAT GGT 536 Leu Lys Leu Ser Leu Glu Asp Leu Glu Phe Ala Leu He Asp Tyr Gly 150 155 160
TTG GAA GAA TTA GAA GAA GTG GAA GAC AAG ATC ATT ATT AGG GGG GAT 584 Leu Glu Glu Leu Glu Glu Val Glu Asp Lys He He He Arg Gly Asp 165 170 175
TAT AAC AGC TTC AAG CTT TTA AAT GAG GGG TTT GAA AGC TTG AAA TTA 632 Tyr Asn Ser Phe Lys Leu Leu Asn Glu Gly Phe Glu Ser Leu Lys Leu 180 185 190
CCC ATT TTA AAA GCG AGT TTG CAA CGC ATC GCC ACA ACG CCC ATT GAA 680 Pro He Leu Lys Ala Ser Leu Gin Arg He Ala Thr Thr Pro He Glu 195 200 205 210
TTG AAT GAC GAA CAA ATG GAG CTT ACC GAA AAA TTA CTG GAC AGG ATT 728 Leu Asn Asp Glu Gin Met Glu Leu Thr Glu Lys Leu Leu Asp Arg He 215 220 225
GAA GAC GAT GAT GAT GTG GTC GCG CTT TAT ACC AAT ATT GAG TGAAATGCA 779 Glu Asp Asp Asp Asp Val Val Ala Leu Tyr Thr Asn He Glu 230 235 240
AAAAGACTCA AAGTATTTTT TTAAACCACC CAAGCATTCA TCCCAATAGG GAATGCGTTG 839 GAGA 843
(2) INFORMATION FOR SEQ ID NO: 644:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 240 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644:
Met Gly Arg Ala Phe Glu Tyr Arg Arg Ala Ala Lys Glu Lys Arg Trp
1 5 10 15
Asp Lys Met Ser Lys Val Phe Pro Lys Leu Ala Lys Ala He Thr Leu
20 25 30
Ala Ala Lys Asp Gly Gly Ser Glu Pro Asp Thr Asn Ala Lys Leu Arg
35 40 45
Thr Ala He Leu Asn Ala Lys Ala Gin Asn Met Pro Lys Asp Asn He
50 55 60
Asp Ala Ala He Lys Arg Ala Ser Ser Lys Glu Gly Asn Leu Ser Glu 65 70 75 80
He Thr Tyr Glu Gly Lys Ala Asn Phe Gly Val Leu He He Met Glu 85 90 95
Cys Met Thr Asp Asn Pro Thr Arg Thr He Ala Asn Leu Lys Ser Tyr
100 105 110
Phe Asn Lys Thr Gin Gly Ala Ser He Val Pro Asn Gly Ser Leu Glu
115 120 125
Phe Met Phe Asn Arg Lys Ser Val Phe Glu Cys Leu Lys Asn Glu Val
130 135 140
Glu Asn Leu Lys Leu Ser Leu Glu Asp Leu Glu Phe Ala Leu He Asp 145 150 155 160
Tyr Gly Leu Glu Glu Leu Glu Glu Val Glu Asp Lys He He He Arg
165 170 175
Gly Asp Tyr Asn Ser Phe Lys Leu Leu Asn Glu Gly Phe Glu Ser Leu
180 185 190
Lys Leu Pro He Leu Lys Ala Ser Leu Gin Arg He Ala Thr Thr Pro
195 200 205
He Glu Leu Asn Asp Glu Gin Met Glu Leu Thr Glu Lys Leu Leu Asp
210 215 220
Arg He Glu Asp Asp Asp Asp Val Val Ala Leu Tyr Thr Asn He Glu 225 230 235 240
(2) INFORMATION FOR SEQ ID NO: 645:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 451 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 108...392 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645:
ATCAGAGGGA TTACGCCTGG AAGGATAAGA ATTTTAGGGA TCTGTAAGAA GATCTAGAAG 60 AAGCGATAGA ATACAACAAA AAGGGTCAGG GGCATTTTAT AGGCACC ATG GTT GTC 116
Met Val Val
1
GCT AAG AAT GAA GAC AAC AAA AAA TTG TAT GAC ATC ATT GAC GGC CAG 164 Ala Lys Asn Glu Asp Asn Lys Lys Leu Tyr Asp He He Asp Gly Gin 5 10 15
CAA CGA ACG ACT ACC ATC TTC ATG CTC TTG CAT GTC TTG GCG AAC AAA 212 Gin Arg Thr Thr Thr He Phe Met Leu Leu His Val Leu Ala Asn Lys 20 25 30 35
CAA AAC GAG AAA GAC AAG CAA GAA ACA AGA AAA TAT CTA TAC CAA AAG 260 Gin Asn Glu Lys Asp Lys Gin Glu Thr Arg Lys Tyr Leu Tyr Gin Lys 40 45 50
GGG GAA TTA AAA TTA GAA GTC GCC CCC AAA AAC CAA AGC TTC TTC AAA 308 Gly Glu Leu Lys Leu Glu Val Ala Pro Lys Asn Gin Ser Phe Phe Lys 55 60 65
ACG CTC TTG GAA GCG GCA GAA AAG GAG AAT ATC AGC CAG AAA AAG ATG 356 Thr Leu Leu Glu Ala Ala Glu Lys Glu Asn He Ser Gin Lys Lys Met 70 75 80
CAG ACA CCG AGG GCA AGC AAA ATC TTT TTG AAG TTT TGAAGGCTAT CTTGGA 408 Gin Thr Pro Arg Ala Ser Lys He Phe Leu Lys Phe 85 90 95
TAAGGTCAGC AAATTGAGTG AAGAAGAAGT GAATGAGCGT TTG 451
(2) INFORMATION FOR SEQ ID NO: 646:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 95 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646:
Met Val Val Ala Lys Asn Glu Asp Asn Lys Lys Leu Tyr Asp He He
1 5 10 15
Asp Gly Gin Gin Arg Thr Thr Thr He Phe Met Leu Leu His Val Leu
20 25 30
Ala Asn Lys Gin Asn Glu Lys Asp Lys Gin Glu Thr Arg Lys Tyr Leu
35 • 40 45
Tyr Gin Lys Gly Glu Leu Lys Leu Glu Val Ala Pro Lys Asn Gin Ser
50 55 60
Phe Phe Lys Thr Leu Leu Glu Ala Ala Glu Lys Glu Asn He Ser Gin
65 70 75 80
Lys Lys Met Gin Thr Pro Arg Ala Ser Lys He Phe Leu Lys Phe 85 90 95
(2) INFORMATION FOR SEQ ID NO: 647:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 840 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 54...718 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647:
CAATGTCCAT GAGTTATTGG ACCAACACAA CGCTAACCTA AAAGGAGAAC ACC ATG 56
Met 1
AGT GAT AAT GAA CGA ACG ATT GTA GTT AGA GTG CTA AAA TTT GAC CCT 104 Ser Asp Asn Glu Arg Thr He Val Val Arg Val Leu Lys Phe Asp Pro 5 10 15
CAA AGC GCG GTG AGT AAG CCG CAT TTT AAA GAG TAT CAG TTG AAA GAA 152 Gin Ser Ala Val Ser Lys Pro His Phe Lys Glu Tyr Gin Leu Lys Glu 20 25 30
ACG CCA TCC ATG ACG CTT TTT ATC GCT TTG AAC CTC ATT AGA GAG CAT 200 Thr Pro Ser Met Thr Leu Phe He Ala Leu Asn Leu He Arg Glu His 35 40 45
CAA GAT CCG GAT TTG AGT TTT GAT TTT GTG TGC CGC GCT GGG ATT TGC 248 Gin Asp Pro Asp Leu Ser Phe Asp Phe Val Cys Arg Ala Gly He Cys 50 55 60 65
GGC TCT TGC GCG ATG ATG GTT AAT GGG AGA CCG AGG CTA GCT TGT AAA 296 Gly Ser Cys Ala Met Met Val Asn Gly Arg Pro Arg Leu Ala Cys Lys 70 75 80
ACC CTA ACT TCT AGC TTT GAA AGC GGG GTG ATC ACG CTC ATG CCC ATG 344 Thr Leu Thr Ser Ser Phe Glu Ser Gly Val He Thr Leu Met Pro Met 85 90 95
CCC AGT TTT ACG CTC ATT AAA GAT TTG AGC GTG AAT ACG GGC GAT TGG 392 Pro Ser Phe Thr Leu He Lys Asp Leu Ser Val Asn Thr Gly Asp Trp 100 105 110
TTT TTG GAT ATG ACT AAA AGG GTG GAG AGT TGG GCG CAT TCT AAA GAA 440 Phe Leu Asp Met Thr Lys Arg Val Glu Ser Trp Ala His Ser Lys Glu 115 120 125
GAA GTG GAT ATT ACT AGA CCG GAA AAA AGG GTT GAG CCT GAC GAA GCC 488 Glu Val Asp He Thr Arg Pro Glu Lys Arg Val Glu Pro Asp Glu Ala 130 135 140 145
CAA GAA GTC TTT GAA CTA GAC AGG TGT ATT GAA TGC GGG TGT TGT ATC 536 Gin Glu Val Phe Glu Leu Asp Arg Cys He Glu Cys Gly Cys Cys He 150 155 160
GCT TCT TGC GGG ACT AAA CTC ATG CGC CCT AAT TTC ATT GGA GCT GCT 584 Ala Ser Cys Gly Thr Lys Leu Met Arg Pro Asn Phe He Gly Ala Ala 165 170 175 GGC ATG AAC AGA GCC ATG CGT TTT ATG ATT GAC AGC CAC GAT GAA AGA 632 Gly Met Asn Arg Ala Met Arg Phe Met He Asp Ser His Asp Glu Arg 180 185 190
AAC GAT GAT GAT TTT TAT GAG TTA GTC GGC GAT GAT GAT GGT GTT TTT 680 Asn Asp Asp Asp Phe Tyr Glu Leu Val Gly Asp Asp Asp Gly Val Phe 195 200 205
GGG TGC ATG AGC CTT ATC GCT TGC CAT GAC ACT TGC CC TAAAGAATTA CCC 731 Gly Cys Met Ser Leu He Ala Cys His Asp Thr Cys Pro 210 215 220
TTGCAAAGCA GTATCGCCAC TTTGCGTAAC AGGATGTTGA AAGTGGGTAA AAGCCGCTAA 791 TTTCTTTTTA GTGGGTCGTT TTTGAAAATC TTTTTAGTCT TTTTAAGCG 840
(2) INFORMATION FOR SEQ ID NO: 648:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 222 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 648:
Met Ser Asp Asn Glu Arg Thr He Val Val Arg Val Leu Lys Phe Asp
1 5 10 15
Pro Gin Ser Ala Val Ser Lys Pro His Phe Lys Glu Tyr Gin Leu Lys
20 25 30
Glu Thr Pro Ser Met Thr Leu Phe He Ala Leu Asn Leu He Arg Glu
35 40 45
His Gin Asp Pro Asp Leu Ser Phe Asp Phe Val Cys Arg Ala Gly He
50 55 60
Cys Gly Ser Cys Ala Met Met Val Asn Gly Arg Pro Arg Leu Ala Cys 65 70 75 80
Lys Thr Leu Thr Ser Ser Phe Glu Ser Gly Val He Thr Leu Met Pro
85 90 95
Met Pro Ser Phe Thr Leu He Lys Asp Leu Ser Val Asn Thr Gly Asp
100 105 110
Trp Phe Leu Asp Met Thr Lys Arg Val Glu Ser Trp Ala His Ser Lys
115 120 125
Glu Glu Val Asp He Thr Arg Pro Glu Lys Arg Val Glu Pro Asp Glu
130 135 140
Ala Gin Glu Val Phe Glu Leu Asp Arg Cys He Glu Cys Gly Cys Cys 145 150 155 160
He Ala Ser Cys Gly Thr Lys Leu Met Arg Pro Asn Phe He Gly Ala
165 170 175
Ala Gly Met Asn Arg Ala Met Arg Phe Met He Asp Ser His Asp Glu
180 185 190
Arg Asn Asp Asp Asp Phe Tyr Glu Leu Val Gly Asp Asp Asp Gly Val
195 200 205
Phe Gly Cys Met Ser Leu He Ala Cys His Asp Thr Cys Pro 210 215 220
(2) INFORMATION FOR SEQ ID NO: 649:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 351 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 53...262 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649:
GAAAGCCAAA GTAGCCCCTT GATTGAAACA AGATTGAGCG ATCCCATAAG CG ATG GAT 58
Met Asp 1
TTA TTG TTC GCC ACC CCT ACA ATA AGC CCT TTT TTA CCT TTT AAA AAT 106 Leu Leu Phe Ala Thr Pro Thr He Ser Pro Phe Leu Pro Phe Lys Asn 5 10 15
CCC ATG ATT TTC CTT TAT AAA AAT GAA ATG ATT GTT TTA AAA TTT TCT 154 Pro Met He Phe Leu Tyr Lys Asn Glu Met He Val Leu Lys Phe Ser 20 25 30
AAT TCC CAA GAC GCG CTC CCG ATC AAC AAG CCA TCC ACG CTA TCA ATC 202 Asn Ser Gin Asp Ala Leu Pro He Asn Lys Pro Ser Thr Leu Ser He 35 40 45 50
CCT AAA ATT TCT TTA GCG TTT TGT GTG TTC ACG CTC CCC CCA TAC AAC 250 Pro Lys He Ser Leu Ala Phe Cys Val Phe Thr Leu Pro Pro Tyr Asn 55 60 65
AAG GGC GTT TTT TGATTTAAGA TTTGCTTTAA AAAACCATGC GTGAGATAAA TATCT 307 Lys Gly Val Phe 70
TCTAAAGAAG CGCTTTTTTT GGTGCCAATC GCCCAAATAG GCTC 351
(2) INFORMATION FOR SEQ ID NO: 650:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 70 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650:
Met Asp Leu Leu Phe Ala Thr Pro Thr He Ser Pro Phe Leu Pro Phe
1 5 10 15
Lys Asn Pro Met He Phe Leu Tyr Lys Asn Glu Met He Val Leu Lys
20 25 30
Phe Ser Asn Ser Gin Asp Ala Leu Pro He Asn Lys Pro Ser Thr Leu
35 40 45
Ser He Pro Lys He Ser Leu Ala Phe Cys Val Phe Thr Leu Pro Pro
50 55 60
Tyr Asn Lys Gly Val Phe 65 70
(2) INFORMATION FOR SEQ ID NO: 651:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1271 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 368...1210 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651:
TTTAAGGACT ATGCATGCAA GAGATTATCC CTATTGTCGT GGCTTTTGAT AACAACTATT 60
GTATCCCTGC TGGCGTGAGC TTATTTTCCA TGCTAGCAAA CGCCAAACGA GAGAGAGAGA 120
GAGAGAGAGA GTAAAACTCT TTTATCAAAT CCATTGTTTG GTGGAGAGTT TAACCCCAGA 180
GAATATAGCC AAATTAGAAG AAACGATCGC TCCTTTTAGA GCTTTTTCTA GCATAGAGTT 240
TTTGGATATT ACCGATAAAG AATTAGAACC ACGCCACAAT TATAATAAGC TTGATCCTTT 300
AATAGCGAGT GAAATTAAAA AATTGTATTT AAAACTCAAT GCTTTTTCGC AAAAACGCTT 360
TTCTAAA ATG ATC ATG TGC CGT TTC TTT TTT GCC TCC CTT TTC CCC CAA 409 Met He Met Cys Arg Phe Phe Phe Ala Ser Leu Phe Pro Gin 1 5 10
TAC GAT AAG ATG ATC ATG TTT GAT GTG GAC ACT TTG TTT GTG AAT GAT 457 Tyr Asp Lys Met He Met Phe Asp Val Asp Thr Leu Phe Val Asn Asp 15 20 25 30
ATT AGC GAG AGC TTT TTT ATC CCC CTT GAA ACG CAT TAT TTT GGG GCT 505 He Ser Glu Ser Phe Phe He Pro Leu Glu Thr His Tyr Phe Gly Ala 35 40 45
GTG AGG GAA AAA GAT TTG ATC GCT ATA AAT AGG AAT TCG GCT AAG GAT 553 Val Arg Glu Lys Asp Leu He Ala He Asn Arg Asn Ser Ala Lys Asp 50 55 60
TTA TAC GAA TTG CGC CAA ATG CAT GCA AAA TCT ATC GGC ATC GCC AAC 601 Leu Tyr Glu Leu Arg Gin Met His Ala Lys Ser He Gly He Ala Asn 65 70 75
GCT TTC CCT AAT TTA GAA GAA GCT CAA ATC CTT TTT GAC AAC TAC TTT 649 Ala Phe Pro Asn Leu Glu Glu Ala Gin He Leu Phe Asp Asn Tyr Phe 80 85 90
AAC GCC GGG TTT TTA GCC TTA AAT TTA AAA TCA TGG CGT AAA GAA AAT 697 Asn Ala Gly Phe Leu Ala Leu Asn Leu Lys Ser Trp Arg Lys Glu Asn 95 100 105 110
CTT GAA AAC CAA TTG ATT ACC TTT TTC ATT TTG AAA AAT GAA AAA CTT 745 Leu Glu Asn Gin Leu He Thr Phe Phe He Leu Lys Asn Glu Lys Leu 115 120 125
TTA TTT AAC GAT CAA GAT GCT TTG TGT TTT GTG TGC CGT GGG AGG ATT 793 Leu Phe Asn Asp Gin Asp Ala Leu Cys Phe Val Cys Arg Gly Arg He 130 135 140
TTA GAA TTG CCT TAT CCA TAC AAT GCC CAC CCT AGT TTC CTT GAT ACG 841 Leu Glu Leu Pro Tyr Pro Tyr Asn Ala His Pro Ser Phe Leu Asp Thr 145 150 155
CTC TCA TTC CCT AGC ATC AAA GAA GCG CGC ATG CTG CAT TTT TGG GGC 889 Leu Ser Phe Pro Ser He Lys Glu Ala Arg Met Leu His Phe Trp Gly 160 165 170
GAT AAA CCC TGG AAA CTC TTA AGC GTC ATT GGC GCG AAA AAA TGG CAT 937 Asp Lys Pro Trp Lys Leu Leu Ser Val He Gly Ala Lys Lys Trp His 175 180 185 190
GAA GCG TTG ATC CAA ACG CCT TTT AAA GAC GCC TAT TTC AAC GCT TCT 985 Glu Ala Leu He Gin Thr Pro Phe Lys Asp Ala Tyr Phe Asn Ala Ser 195 200 205
TTT TTA GAT CAC CTC TTT GAA TCC CTT CAA AAC AAG GAT AAT GAG ATC 1033 Phe Leu Asp His Leu Phe Glu Ser Leu Gin Asn Lys Asp Asn Glu He 210 215 220
AAA AGA AGA GAT GAA AGG ATC ATT GAA GCA CTT CAA GCA AGG GAT AAA 1081 Lys Arg Arg Asp Glu Arg He He Glu Ala Leu Gin Ala Arg Asp Lys 225 230 235
ATC CTG TCT TTT TCA GAC AAG CGA CAT TCT TTT GAA TCT CTT CTG CCC 1129 He Leu Ser Phe Ser Asp Lys Arg His Ser Phe Glu Ser Leu Leu Pro 240 245 250
AAG CTT TCT TCT AAA CTC CTT ATA GAA TTT TTG CTT TTT AAA GCC AAA 1177 Lys Leu Ser Ser Lys Leu Leu He Glu Phe Leu Leu Phe Lys Ala Lys 255 260 265 270 CAA AAA GTG AAG CGA CTG ATT AAA AGG GTT TTT TAAAACCCTT TTTAAACTAA 1230 Gin Lys Val Lys Arg Leu He Lys Arg Val Phe 275 280
TGCGAGCAAG CATGGGTTTG TGTGGGCTGG ATGTCCTTAT T 1271
(2) INFORMATION FOR SEQ ID NO: 652:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 281 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652:
Met He Met Cys Arg Phe Phe Phe Ala Ser Leu Phe Pro Gin Tyr Asp
1 5 10 15
Lys Met He Met Phe Asp Val Asp Thr Leu Phe Val Asn Asp He Ser
20 25 30
Glu Ser Phe Phe He Pro Leu Glu Thr His Tyr Phe Gly Ala Val Arg
35 40 45
Glu Lys Asp Leu He Ala He Asn Arg Asn Ser Ala Lys Asp Leu Tyr
50 55 60
Glu Leu Arg Gin Met His Ala Lys Ser He Gly He Ala Asn Ala Phe 65 70 75 80
Pro Asn Leu Glu Glu Ala Gin He Leu Phe Asp Asn Tyr Phe Asn Ala
85 90 95
Gly Phe Leu Ala Leu Asn Leu Lys Ser Trp Arg Lys Glu Asn Leu Glu
100 105 110
Asn Gin Leu He Thr Phe Phe He Leu Lys Asn Glu Lys Leu Leu Phe
115 120 125
Asn Asp Gin Asp Ala Leu Cys Phe Val Cys Arg Gly Arg He Leu Glu
130 135 140
Leu Pro Tyr Pro Tyr Asn Ala His Pro Ser Phe Leu Asp Thr Leu Ser 145 150 155 160
Phe Pro Ser He Lys Glu Ala Arg Met Leu His Phe Trp Gly Asp Lys
165 170 175
Pro Trp Lys Leu Leu Ser Val He Gly Ala Lys Lys Trp His Glu Ala
180 185 190
Leu He Gin Thr Pro Phe Lys Asp Ala Tyr Phe Asn Ala Ser Phe Leu
195 200 205
Asp His Leu Phe Glu Ser Leu Gin Asn Lys Asp Asn Glu He Lys Arg
210 215 220
Arg Asp Glu Arg He He Glu Ala Leu Gin Ala Arg Asp Lys He Leu 225 230 235 240
Ser Phe Ser Asp Lys Arg His Ser Phe Glu Ser Leu Leu Pro Lys Leu
245 250 255
Ser Ser Lys Leu Leu He Glu Phe Leu Leu Phe Lys Ala Lys Gin Lys
260 265 270
Val Lys Arg Leu He Lys Arg Val Phe 275 280 (2) INFORMATION FOR SEQ ID NO 653
(l) SEQUENCE CHARACTERISTICS
(A) LENGTH 1198 base pairs
(B) TYPE nucleic acid
(C) STRANDEDNESS single
(D) TOPOLOGY linear
(ii) MOLECULE TYPE Genomic DNA (ix) FEATURE
(A) NAME/KEY Coding Sequence
(B) LOCATION 51 1145 (D) OTHER INFORMATION
(xi) SEQUENCE DESCRIPTION SEQ ID NO 653
ATTTAGACAA TAACGCTACA ACTAGGATTG ACCCTAAAGT CAAAGAGATC ATG GAT 56
Met Asp
1
CCT TTT TTA AGG GAT CAT TAT GGG AAT CCT AGC TCG TTG CAC CAG TTT 104 Pro Phe Leu Arg Asp His Tyr Gly Asn Pro Ser Ser Leu His Gin Phe 5 10 15
GGC ACA GAA ACC CAC CCA GCC ATT GCA GAA GCG CTA GAT AAG CTT TAT 152 Gly Thr Glu Thr His Pro Ala He Ala Glu Ala Leu Asp Lys Leu Tyr 20 25 30
AAG GGC ATT AAC GCT AGG GAT ATA GAT GAT GTG ATC ATC ACT TCT TGT 200 Lys Gly He Asn Ala Arg Asp He Asp Asp Val He He Thr Ser Cys 35 40 45 50
GCG ACA GAA AGC AAT AAT TGG GTT TTA AAG GGC GTG TAT TTT GAT GAA 248 Ala Thr Glu Ser Asn Asn Trp Val Leu Lys Gly Val Tyr Phe Asp Glu 55 60 65
TGC TTG AAA AAA GGC AAA AAC CAT ATT GTA ACC ACG GTT GCA GAG CAT 296 Cys Leu Lys Lys Gly Lys Asn His He Val Thr Thr Val Ala Glu His 70 75 80
CCG GCG GTG CGA TCC ACT TGC AAT TTT TTA GAA AGC TTG GGG GTG GAG 344 Pro Ala Val Arg Ser Thr Cys Asn Phe Leu Glu Ser Leu Gly Val Glu 85 90 95
GTT ACT TAC TTG CCC ATT AAT GAG CAT GGG AGC ATC ACC GCA GAG CAA 392 Val Thr Tyr Leu Pro He Asn Glu His Gly Ser He Thr Ala Glu Gin 100 105 110
GTC AAA GAA GCG ATC ACA GAA AAA ACC GCT CTA GTG AGC GTG ATG TGG 440 Val Lys Glu Ala He Thr Glu Lys Thr Ala Leu Val Ser Val Met Trp 115 120 125 130 GCG AAT AAT GAA ACC GGT CTC ATT TTC CCT ATT GAA GAA ATT GGG GCT 488 Ala Asn Asn Glu Thr Gly Leu He Phe Pro He Glu Glu He Gly Ala 135 140 145
ATT TGT AAA GAA AAG GGC GTG TTG TTC CAT ACC GAT GCC GTG CAA GCG 536 He Cys Lys Glu Lys Gly Val Leu Phe His Thr Asp Ala Val Gin Ala 150 155 160
ATT GGT AAA ATC CCT GTA GAT GTG TTA AAA GCG AAT GCA GAT TTC CTT 584 He Gly Lys He Pro Val Asp Val Leu Lys Ala Asn Ala Asp Phe Leu 165 170 175
TCT TTT AGC GCG CAC AAG TTT CAT GGG CCT AAA GGC ATT GGG GGG TTG 632 Ser Phe Ser Ala His Lys Phe His Gly Pro Lys Gly He Gly Gly Leu 180 185 190
TAT ATT AGA AGT GGG GTG GGA TTG ACC CCT CTT TTT CAT GGC GGG GAG 680 Tyr He Arg Ser Gly Val Gly Leu Thr Pro Leu Phe His Gly Gly Glu 195 200 205 210
CAT ATG AAT GGC AGG CGC AGC GGG ACT TTG AAT GTG CCT TAT ATT GTG 728 His Met Asn Gly Arg Arg Ser Gly Thr Leu Asn Val Pro Tyr He Val 215 220 225
GGA ATG GGC GAA GCG ATG AAA TTA GCC GTA GAG CAT TTA GAC TAT GAA 776 Gly Met Gly Glu Ala Met Lys Leu Ala Val Glu His Leu Asp Tyr Glu 230 235 240
AAA GAA GTG GTG GGG AAA TTG CGC GAC AAA TTA GAA GAA GCG CTT TTG 824 Lys Glu Val Val Gly Lys Leu Arg Asp Lys Leu Glu Glu Ala Leu Leu 245 250 255
AAA ATC CCT GAT GTG ATG GTG GTG GGC GAT AGA ATC CAT CGT GTG CCT 872 Lys He Pro Asp Val Met Val Val Gly Asp Arg He His Arg Val Pro 260 265 270
AAC ACG ACT TTA GTC AGC GTG AGA GGG ATT GAA GGA GAG GCC ATG CTG 920 Asn Thr Thr Leu Val Ser Val Arg Gly He Glu Gly Glu Ala Met Leu 275 280 285 290
TGG GAT TTA AAC CGC TCT AAT ATC GCC GCT TCC ACA GGG AGC GCG TGC 968 Trp Asp Leu Asn Arg Ser Asn He Ala Ala Ser Thr Gly Ser Ala Cys 295 300 305
GCG AGT GAG GAT TTA GAG GCT AAT CCG GTG ATG GTA GCG ATT GGA GCG 1016 Ala Ser Glu Asp Leu Glu Ala Asn Pro Val Met Val Ala He Gly Ala 310 315 320
AGT AAG GAA TTG GCT CAT ACC GCT ATC AGG CTT TCA TTG AGC CGT TTT 1064 Ser Lys Glu Leu Ala His Thr Ala He Arg Leu Ser Leu Ser Arg Phe 325 330 335
AAC ACG GAA GCT GAA ATT GAC AAA ACG ATT GAA GTT TTC TCT CAA GCG 1112 Asn Thr Glu Ala Glu He Asp Lys Thr He Glu Val Phe Ser Gin Ala 340 345 350 GCT GTA AGG TTG AGA AAT ATT TCA AGC TCT TAT TAAAAAGAAT ATAAAGGAAT 1165 Ala Val Arg Leu Arg Asn He Ser Ser Ser Tyr 355 360 365
CAAAATGGCA AAACATGATT TAGTGGGTTC GGT 1198
(2) INFORMATION FOR SEQ ID NO: 654:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 365 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654:
Met Asp Pro Phe Leu Arg Asp His Tyr Gly Asn Pro Ser Ser Leu His
1 5 10 15
Gin Phe Gly Thr Glu Thr His Pro Ala He Ala Glu Ala Leu Asp Lys
20 25 30
Leu Tyr Lys Gly He Asn Ala Arg Asp He Asp Asp Val He He Thr
35 40 45
Ser Cys Ala Thr Glu Ser Asn Asn Trp Val Leu Lys Gly Val Tyr Phe
50 55 60
Asp Glu Cys Leu Lys Lys Gly Lys Asn His He Val Thr Thr Val Ala 65 70 75 80
Glu His Pro Ala Val Arg Ser Thr Cys Asn Phe Leu Glu Ser Leu Gly
85 90 95
Val Glu Val Thr Tyr Leu Pro He Asn Glu His Gly Ser He Thr Ala
100 105 110
Glu Gin Val Lys Glu Ala He Thr Glu Lys Thr Ala Leu Val Ser Val
115 120 125
Met Trp Ala Asn Asn Glu Thr Gly Leu He Phe Pro He Glu Glu He
130 135 140
Gly Ala He Cys Lys Glu Lys Gly Val Leu Phe His Thr Asp Ala Val 145 150 155 160
Gin Ala He Gly Lys He Pro Val Asp Val Leu Lys Ala Asn Ala Asp
165 170 175
Phe Leu Ser Phe Ser Ala His Lys Phe His Gly Pro Lys Gly He Gly
180 185 190
Gly Leu Tyr He Arg Ser Gly Val Gly Leu Thr Pro Leu Phe His Gly
195 200 205
Gly Glu His Met Asn Gly Arg Arg Ser Gly Thr Leu Asn Val Pro Tyr
210 215 220
He Val Gly Met Gly Glu Ala Met Lys Leu Ala Val Glu His Leu Asp 225 230 235 240
Tyr Glu Lys Glu Val Val Gly Lys Leu Arg Asp Lys Leu Glu Glu Ala
245 250 255
Leu Leu Lys He Pro Asp Val Met Val Val Gly Asp Arg He His Arg
260 265 270
Val Pro Asn Thr Thr Leu Val Ser Val Arg Gly He Glu Gly Glu Ala 275 280 285 Met Leu Trp Asp Leu Asn Arg Ser Asn He Ala Ala Ser Thr Gly Ser
290 295 300
Ala Cys Ala Ser Glu Asp Leu Glu Ala Asn Pro Val Met Val Ala He 305 310 315 320
Gly Ala Ser Lys Glu Leu Ala His Thr Ala He Arg Leu Ser Leu Ser
325 330 335
Arg Phe Asn Thr Glu Ala Glu He Asp Lys Thr He Glu Val Phe Ser
340 345 350
Gin Ala Ala Val Arg Leu Arg Asn He Ser Ser Ser Tyr 355 360 365
(2) INFORMATION FOR SEQ ID NO: 655:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1273 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1220 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655:
TCCGGTTTGA AGTTGGTTTT TAAATCTTTG GCTATAATCA AGCCATTCTA ATG AGA 56
Met Arg 1
AAG AAA GGC ATG TTT GAA AAG ATA CAA AAA GAA TGG CTG AGC AAC ATT 104 Lys Lys Gly Met Phe Glu Lys He Gin Lys Glu Trp Leu Ser Asn He 5 10 15
CAA AAG GAT TTG TTG TCT GGT TTT GTG GTG GGG CTT TCT GTG ATC CCA 152 Gin Lys Asp Leu Leu Ser Gly Phe Val Val Gly Leu Ser Val He Pro 20 25 30
GAG ACG GCC GGC TTT GCG ATC ATG GTG GGT TTA GAT GTG GGC GTG GCG 200 Glu Thr Ala Gly Phe Ala He Met Val Gly Leu Asp Val Gly Val Ala 35 40 45 50
TTT TAT ACG ACC TTT TAC ATG GCT TTT GTT TTG TCT CTT TTT GGG GCT 248 Phe Tyr Thr Thr Phe Tyr Met Ala Phe Val Leu Ser Leu Phe Gly Ala 55 60 65
AGA AAG GCG ATG ATT AGC GCA GCG GCC GGC TCA GTG GCG CTC ATT TTA 296 Arg Lys Ala Met He Ser Ala Ala Ala Gly Ser Val Ala Leu He Leu 70 75 80 GTG GGC GTG GTT AAA AAC TAT GGG CTT GAA TAC GCG GGC GTG GCG ACT 344 Val Gly Val Val Lys Asn Tyr Gly Leu Glu Tyr Ala Gly Val Ala Thr 85 90 95
CTT ATG GCA GGG GTG TTG CAA ATT CTT TTA GGC TAT TTG AAA ATA GGG 392 Leu Met Ala Gly Val Leu Gin He Leu Leu Gly Tyr Leu Lys He Gly 100 105 110
AAT CTT TTG AGG TTT ATC CCC CAA TCA GTG ATG TAT GGC TTT GTG AAC 440 Asn Leu Leu Arg Phe He Pro Gin Ser Val Met Tyr Gly Phe Val Asn 115 120 125 130
GCG CTA GGC ATT TTG CTT TTA ATG GAG CAA TTC AAA TTC CTT CAA AAC 488 Ala Leu Gly He Leu Leu Leu Met Glu Gin Phe Lys Phe Leu Gin Asn 135 140 145
CAA AAT TTG GGG GTG TTT GTC TTG CTC GCT ATT GGG ATA CTC ATC ATT 536 Gin Asn Leu Gly Val Phe Val Leu Leu Ala He Gly He Leu He He 150 155 160
TAT CTT TTT CCT CTA ATC ACT AAA AAA ATC CCC TCT AAT CTG ATT TGT 584 Tyr Leu Phe Pro Leu He Thr Lys Lys He Pro Ser Asn Leu He Cys 165 170 175
ATC CTT ATA GTG AGC GCG ATC GCT TTA ATT TTT GAT ATG CAT GCG CCG 632 He Leu He Val Ser Ala He Ala Leu He Phe Asp Met His Ala Pro 180 185 190
AAT TTG GGG AGC ATT GAG CAA GGG GTT TCA GGC TTT CAT TTC ATC ATT 680 Asn Leu Gly Ser He Glu Gin Gly Val Ser Gly Phe His Phe He He 195 200 205 210
ATC CCC AAA AAT TTG GAT TTT AAA ATA ATG ATA GAG TTG TTG CCT TAC 728 He Pro Lys Asn Leu Asp Phe Lys He Met He Glu Leu Leu Pro Tyr 215 220 225
GCT CTT TCT TTA GCA CTA GTG GGA ACG ATA GAA AGC TTA TTG ACG GCT 776 Ala Leu Ser Leu Ala Leu Val Gly Thr He Glu Ser Leu Leu Thr Ala 230 235 240
AAA ACT TTA GAT GTG ATT TTA AAA GAC GGC GTG AGC GAT AAA AAT AAA 824 Lys Thr Leu Asp Val He Leu Lys Asp Gly Val Ser Asp Lys Asn Lys 245 250 255
GAA ACT AAA GCG CAA GGC TTG GGG AAT ATC ATC TCA GGG CTT TTG GGG 872 Glu Thr Lys Ala Gin Gly Leu Gly Asn He He Ser Gly Leu Leu Gly 260 265 270
GGA ATG ACA GGG TGC GCT TTA GTG GGG CAG TCT ATC ATT AAC GCA AAA 920 Gly Met Thr Gly Cys Ala Leu Val Gly Gin Ser He He Asn Ala Lys 275 280 285 290
TCC GGG GCT AAA ACA AGG CTT TCT ACT TTT TTT GCC GGC TTT TCT TTA 968 Ser Gly Ala Lys Thr Arg Leu Ser Thr Phe Phe Ala Gly Phe Ser Leu 295 300 305 ATG GTG CTC ATA TTA GTG TTT AAT GAA TAT GTG GTT AAG ATC CCC ATT 1016 Met Val Leu He Leu Val Phe Asn Glu Tyr Val Val Lys He Pro He 310 315 320
GTG GCG GTT GTG GCG GTA ATG GTG ATG ATT TCT TTC ACC ACT TTT AAT 1064 Val Ala Val Val Ala Val Met Val Met He Ser Phe Thr Thr Phe Asn 325 330 335
TTC CAA TCC ATT ATT AAC ATT AAA AAA ATC AAG CTC TAT GAC ACG CTC 1112 Phe Gin Ser He He Asn He Lys Lys He Lys Leu Tyr Asp Thr Leu 340 345 350
AAC ATG CTC TTA GTC GTG GCG GTG GTT TTA TAC ACG CAT AAT TTA GCG 1160 Asn Met Leu Leu Val Val Ala Val Val Leu Tyr Thr His Asn Leu Ala 355 360 365 370
ATA GGG GTT GTG GTG GGG GTT TTA GTC AAT GCG TTA TGG ATC AAA TCT 1208 He Gly Val Val Val Gly Val Leu Val Asn Ala Leu Trp He Lys Ser 375 380 385
AAA GGG ATT GCA TGAAATTTTA TTTTAAAAAG TTGGGTAGCT AGAGATATGG CTCCA 1265 Lys Gly He Ala 390
GATGTAGG 1273
(2) INFORMATION FOR SEQ ID NO: 656:
(l) SEQUENCE CHARACTERISTICS.
(A) LENGTH: 390 ammo acids
Figure imgf001010_0001
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION. SEQ ID NO: 656:
Met Arg Lys Lys Gly Met Phe Glu Lys He Gin Lys Glu Trp Leu Ser
1 5 10 15
Asn He Gin Lys Asp Leu Leu Ser Gly Phe Val Val Gly Leu Ser Val
20 25 30
He Pro Glu Thr Ala Gly Phe Ala He Met Val Gly Leu Asp Val Gly
35 40 45
Val Ala Phe Tyr Thr Thr Phe Tyr Met Ala Phe Val Leu Ser Leu Phe
50 55 60
Gly Ala Arg Lys Ala Met He Ser Ala Ala Ala Gly Ser Val Ala Leu 65 70 75 80
He Leu Val Gly Val Val Lys Asn Tyr Gly Leu Glu Tyr Ala Gly Val
85 90 95
Ala Thr Leu Met Ala Gly Val Leu Gin He Leu Leu Gly Tyr Leu Lys
100 105 110
He Gly Asn Leu Leu Arg Phe He Pro Gin Ser Val Met Tyr Gly Phe 115 120 125 Val Asn Ala Leu Gly He Leu Leu Leu Met Glu Gin Phe Lys Phe Leu
130 135 140
Gin Asn Gin Asn Leu Gly Val Phe Val Leu Leu Ala He Gly He Leu 145 150 155 160
He He Tyr Leu Phe Pro Leu He Thr Lys Lys He Pro Ser Asn Leu
165 170 175
He Cys He Leu He Val Ser Ala He Ala Leu He Phe Asp Met His
180 185 190
Ala Pro Asn Leu Gly Ser He Glu Gin Gly Val Ser Gly Phe His Phe
195 200 205
He He He Pro Lys Asn Leu Asp Phe Lys He Met He Glu Leu Leu
210 215 220
Pro Tyr Ala Leu Ser Leu Ala Leu Val Gly Thr He Glu Ser Leu Leu 225 230 235 240
Thr Ala Lys Thr Leu Asp Val He Leu Lys Asp Gly Val Ser Asp Lys
245 250 255
Asn Lys Glu Thr Lys Ala Gin Gly Leu Gly Asn He He Ser Gly Leu
260 265 270
Leu Gly Gly Met Thr Gly Cys Ala Leu Val Gly Gin Ser He He Asn
275 280 285
Ala Lys Ser Gly Ala Lys Thr Arg Leu Ser Thr Phe Phe Ala Gly Phe
290 295 300
Ser Leu Met Val Leu He Leu Val Phe Asn Glu Tyr Val Val Lys He 305 310 315 320
Pro He Val Ala Val Val Ala Val Met Val Met He Ser Phe Thr Thr
325 330 335
Phe Asn Phe Gin Ser He He Asn He Lys Lys He Lys Leu Tyr Asp
340 345 350
Thr Leu Asn Met Leu Leu Val Val Ala Val Val Leu Tyr Thr His Asn
355 360 365
Leu Ala He Gly Val Val Val Gly Val Leu Val Asn Ala Leu Trp He
370 375 380
Lys Ser Lys Gly He Ala 385 390
(2) INFORMATION FOR SEQ ID NO 657
(l) SEQUENCE CHARACTERISTICS
(A) LENGTH 1020 base pairs
(B) TYPE nucleic acid
(C) STRANDEDNESS single
(D) TOPOLOGY linear
(n) MOLECULE TYPE Genomic DNA ( x) FEATURE
(A) NAME/KEY Coding Sequence
(B) LOCATION 386 964 (D) OTHER INFORMATION
(xi) SEQUENCE DESCRIPTION SEQ ID NO 657 GAAGAAAAAA TTTTAGAAAT GTTAGAAAGC GAATAAGGGG GAGTGAGTGG GAAATTTAGT 60
GATTGGCTCT AGGGGGAGCG AATTAGCCTT ATGGCAAGCG AATCACATTA AAGAACGCCT 120
GAAAAAAGAA TGCTTGATAG AAAGCGAAAT TCAAATCGTT AAGACTAAGG GCGATAAAAT 180
CTTAGACACC CCTTTAAATA AGATCGGCGG TAAGGGGCTA TTCACTAAGG AATTAGAAGA 240
ATTGCTTTTA AAGGGCGCAA TTGATTTGGC GGTGCATTCT TTAAAAGACG TGCCGGTCGT 300
GTTTGAAAAG GGGTTAGACT TGGCATGCAT CACCAAAAGG GCTGATGTGA GGGACACTTT 360
TTTAAGCGTG AAATTCCCTG ATTTG ATG AGT TTG CCT AAA GGG GCA AAG GTT 412
Met Ser Leu Pro Lys Gly Ala Lys Val 1 5
GGC ACG ACT TCT TTA AGG CGC TCT ATG CAG ATC AAA TTA AAG CGC CAG 460 Gly Thr Thr Ser Leu Arg Arg Ser Met Gin He Lys Leu Lys Arg Gin 10 15 20 25
GAT TTG GAC ACA GAA AGC TTG AGA GGG AAT GTC CAA ACC CGT TTG AAA 508 Asp Leu Asp Thr Glu Ser Leu Arg Gly Asn Val Gin Thr Arg Leu Lys 30 35 40
AAG CTT GAA TGC GGA GAA TTT GAC GCT ATC ATT TTA GCT GAA GCC GGG 556 Lys Leu Glu Cys Gly Glu Phe Asp Ala He He Leu Ala Glu Ala Gly 45 50 55
TTG TGC CGC CTA GAA ATT CAA GGA GCG AAA TAC CGC AAG GCT TTT AGC 604 Leu Cys Arg Leu Glu He Gin Gly Ala Lys Tyr Arg Lys Ala Phe Ser 60 65 70
GTA GAA GAA ATG ATT CCT AGC ATG GGT CAG GGG GCT TTA GGG GTA GAA 652 Val Glu Glu Met He Pro Ser Met Gly Gin Gly Ala Leu Gly Val Glu 75 80 85
ATG CTC AAA AAC CAC AAG CAT TTT GCC ACG CTT CAA AAA CTC AAC GAC 700 Met Leu Lys Asn His Lys His Phe Ala Thr Leu Gin Lys Leu Asn Asp 90 95 100 105
GAG AAA AGC GCG TTT TGC TGC CGT TTA GAA AGG GAG TTT ATC AAG GGG 748 Glu Lys Ser Ala Phe Cys Cys Arg Leu Glu Arg Glu Phe He Lys Gly 110 115 120
CTT AAT GGA GGG TGT CAG ATC CCT ATA GGC GTG CAT GCG AGT TTA ATG 796 Leu Asn Gly Gly Cys Gin He Pro He Gly Val His Ala Ser Leu Met 125 130 135
GGC GAT AGG GTT AAA ATC CAG GCG GTT TTA GGC TTG CCT AAC GGG AAA 844 Gly Asp Arg Val Lys He Gin Ala Val Leu Gly Leu Pro Asn Gly Lys 140 145 150
GAA GTC ATT ACT AAA GAA AAA CAA GGG GAT AAA ACT AAA GCG TTT GAT 892 Glu Val He Thr Lys Glu Lys Gin Gly Asp Lys Thr Lys Ala Phe Asp 155 160 165
TTA GTT CAA GAG CTT TTA GAA GAA TTT TTG CAA AGC GGG GCT AAA GAG 940 Leu Val Gin Glu Leu Leu Glu Glu Phe Leu Gin Ser Gly Ala Lys Glu 170 175 180 185
ATT TTA GAA AAG GCG CAG TTG TTT TAATGCGTTT GTTTATCGCG CTAGTTTTGT 994 He Leu Glu Lys Ala Gin Leu Phe 190
TTTGGTGGTG GTTAAGCTTG AACGCT 1020
(2) INFORMATION FOR SEQ ID NO -658
(l) SEQUENCE CHARACTERISTICS-
(A) LENGTH: 193 ammo acids
Figure imgf001013_0001
(C) STRANDEDNESS. single
(D) TOPOLOGY, linear
(ii) MOLECULE TYPE protein (v) FRAGMENT TYPE internal
(xi) SEQUENCE DESCRIPTION SEQ ID NO 658
Met Ser Leu Pro Lys Gly Ala Lys Val Gly Thr Thr Ser Leu Arg Arg
1 5 10 15
Ser Met Gin He Lys Leu Lys Arg Gin Asp Leu Asp Thr Glu Ser Leu
20 25 30
Arg Gly Asn Val Gin Thr Arg Leu Lys Lys Leu Glu Cys Gly Glu Phe
35 40 45
Asp Ala He He Leu Ala Glu Ala Gly Leu Cys Arg Leu Glu He Gin
50 55 60
Gly Ala Lys Tyr Arg Lys Ala Phe Ser Val Glu Glu Met He Pro Ser 65 70 75 80
Met Gly Gin Gly Ala Leu Gly Val Glu Met Leu Lys Asn His Lys His
85 90 95
Phe Ala Thr Leu Gin Lys Leu Asn Asp Glu Lys Ser Ala Phe Cys Cys
100 105 110
Arg Leu Glu Arg Glu Phe He Lys Gly Leu Asn Gly Gly Cys Gin He
115 120 125
Pro He Gly Val His Ala Ser Leu Met Gly Asp Arg Val Lys He Gin
130 135 140
Ala Val Leu Gly Leu Pro Asn Gly Lys Glu Val He Thr Lys Glu Lys 145 150 155 160
Gin Gly Asp Lys Thr Lys Ala Phe Asp Leu Val Gin Glu Leu Leu Glu
165 170 175
Glu Phe Leu Gin Ser Gly Ala Lys Glu He Leu Glu Lys Ala Gin Leu
180 185 190
Phe
(2) INFORMATION FOR SEQ ID NO: 659.
(l) SEQUENCE CHARACTERISTICS
(A) LENGTH: 265 base pairs
(B) TYPE- nucleic acid
(C) STRANDEDNESS- single
(D) TOPOLOGY, linear
(n) MOLECULE TYPE- Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...212 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659:
TAAGCGCTTT GCGTGTTTAA AAATTCCTTA TCCAAGCTCT CAGCGTCTTC ATG AAA 56
Met Lys
1
GCT GTA ACC ATC TTT CAT GAT AAA TTC CCT CGC TCT CAC CAG CCC AAA 104 Ala Val Thr He Phe His Asp Lys Phe Pro Arg Ser His Gin Pro Lys 5 10 15
TCT TGG GCG GAT TTC ATC ACG GAA TTT CGT GTG GAT TTG ATA GAG ATG 152 Ser Trp Ala Asp Phe He Thr Glu Phe Arg Val Asp Leu He Glu Met 20 25 30
GAC GGG CAA TTG CTT GTA ACT TTT AAT GAA ATT AGC GGC AAT TTC GGT 200 Asp Gly Gin Leu Leu Val Thr Phe Asn Glu He Ser Gly Asn Phe Gly 35 40 45 50
GAT ATT TTC TTC TAAAGTGGGG CTTAAAACAA AATCATTGTC TTTTCGGTCT TTAAA 257 Asp He Phe Phe
AACCAATA 265
(2) INFORMATION FOR SEQ ID NO: 660:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 54 amino acids
(B) TYPE: ammo acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 660:
Met Lys Ala Val Thr He Phe His Asp Lys Phe Pro Arg Ser His Gin
1 5 10 15
Pro Lys Ser Trp Ala Asp Phe He Thr Glu Phe Arg Val Asp Leu He
20 25 30
Glu Met Asp Gly Gin Leu Leu Val Thr Phe Asn Glu He Ser Gly Asn
35 40 45
Phe Gly Asp He Phe Phe 50
(2) INFORMATION FOR SEQ ID NO: 661: (I) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1350 base pairs
(B) TYPE- nucleic acid
(C) STRANDEDNESS single
(D) TOPOLOGY: linear
(II) MOLECULE TYPE- Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION- 73...1305 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION. SEQ ID NO : 661-
TTTTACTTTT TTTTTGGTAT TCTAACAAGC TTTTAAATAA TCCAATCTAC TTTGTTTTAA 60 GGATAATATT TT ATG GCA GAT GTC GTT GTG GGG ATC CAG TGG GGA GAT GAG 111 Met Ala Asp Val Val Val Gly He Gin Trp Gly Asp Glu 1 5 10
GGG AAG GGA AAA ATT GTT GAT AGG ATC GCT AAA GAT TAT GAC TTT GTG 159 Gly Lys Gly Lys He Val Asp Arg He Ala Lys Asp Tyr Asp Phe Val 15 20 25
GTG CGC TAT CAG GGC GGG CAT AAT GCT GGG CAT ACC ATT GTG CAT AAG 207 Val Arg Tyr Gin Gly Gly His Asn Ala Gly His Thr He Val His Lys 30 35 40 45
GGG GTT AAG CAT TCT TTG CAT TTA ATG CCT TCA GGG GTT TTA TAC CCC 255 Gly Val Lys His Ser Leu His Leu Met Pro Ser Gly Val Leu Tyr Pro 50 55 60
AAA TGC AAG AAC ATC ATT TCT AGC GCG GTG GTC GTG AGC GTT AAG GAT 303 Lys Cys Lys Asn He He Ser Ser Ala Val Val Val Ser Val Lys Asp 65 70 75
TTG TGC GAA GAA ATC AGC GCG TTT GAG GAT TTA GAA AAT CGT TTG TTT 351 Leu Cys Glu Glu He Ser Ala Phe Glu Asp Leu Glu Asn Arg Leu Phe 80 85 90
GTC AGC GAC AGA GCC CAT GTG ATC TTG CCC TAT CAT GCC AAA AAA GAC 399 Val Ser Asp Arg Ala His Val He Leu Pro Tyr His Ala Lys Lys Asp 95 100 105
GCT TTT AAA GAA AAA TCT CAA AAC ATC GGC ACG ACT AAA AAA GGC ATA 447 Ala Phe Lys Glu Lys Ser Gin Asn He Gly Thr Thr Lys Lys Gly He 110 115 120 125
GGC CCT TGC TAT GAG GAT AAA ATG GCC AGG AGC GGG ATA AGA ATG GGG 495 Gly Pro Cys Tyr Glu Asp Lys Met Ala Arg Ser Gly He Arg Met Gly 130 135 140 GAT TTA TTA GAC GAT AAA ATC TTA GAA GAA AAG CTA AAC GCT CAT TTC 543 Asp Leu Leu Asp Asp Lys He Leu Glu Glu Lys Leu Asn Ala His Phe 145 150 155
AAA GCC ATT GAG CCT TTT AAA AAA GCG TAT GAT TTG GGC GAG AAT TAC 591 Lys Ala He Glu Pro Phe Lys Lys Ala Tyr Asp Leu Gly Glu Asn Tyr 160 165 170
GAA AAA GAT TTG ATG GGG TAT TTT AAA ACT TAC GCT CCA AAA ATT TGC 639 Glu Lys Asp Leu Met Gly Tyr Phe Lys Thr Tyr Ala Pro Lys He Cys 175 180 185
CCC TTT ATC AAA GAC ACG ACA AGC ATG CTG ATA GAA GCG AAT CAA AAG 687 Pro Phe He Lys Asp Thr Thr Ser Met Leu He Glu Ala Asn Gin Lys 190 195 200 205
GGT GAA AAA ATC CTA TTA GAA GGG GCA CAA GGC ACG CTT TTA GAC ATT 735 Gly Glu Lys He Leu Leu Glu Gly Ala Gin Gly Thr Leu Leu Asp He 210 215 220
GAT TTA GGG ACT TAC CCT TTT GTA ACA AGC TCT AAC ACC ACG AGC GCT 783 Asp Leu Gly Thr Tyr Pro Phe Val Thr Ser Ser Asn Thr Thr Ser Ala 225 230 235
AGC GCA TGC GTG AGC ACC GGC TTA AAC CCT AAA GCG ATC AAT GAA GTC 831 Ser Ala Cys Val Ser Thr Gly Leu Asn Pro Lys Ala He Asn Glu Val 240 245 250
ATA GGT ATC ACA AAA GCC TAC TCC ACT CGT GTG GGT AAT GGG CCT TTC 879 He Gly He Thr Lys Ala Tyr Ser Thr Arg Val Gly Asn Gly Pro Phe 255 260 265
CCT AGC GAA GAC ACT ACA CCC ATG GGC GAT CAT TTA AGG ACT AAG GGT 927 Pro Ser Glu Asp Thr Thr Pro Met Gly Asp His Leu Arg Thr Lys Gly 270 275 280 285
GCG GAG TTT GGC ACG ACA ACC AAG CGC CCA AGG CGT TGC GGG TGG CTG 975 Ala Glu Phe Gly Thr Thr Thr Lys Arg Pro Arg Arg Cys Gly Trp Leu 290 295 300
GAT TTG GTG GCT TTG AAA TAC GCT TGC GCT TTG AAT GGT TGC ACG CAA 1023 Asp Leu Val Ala Leu Lys Tyr Ala Cys Ala Leu Asn Gly Cys Thr Gin 305 310 315
TTA GCC TTA ATG AAA TTA GAC GTT TTA GAC GGG ATT GAT GCG ATT AAG 1071 Leu Ala Leu Met Lys Leu Asp Val Leu Asp Gly He Asp Ala He Lys 320 325 330
GTG TGC GTG GCT TAT GAA AGA AAG GGC GAA AGA TTG GAG ATT TTC CCT 1119 Val Cys Val Ala Tyr Glu Arg Lys Gly Glu Arg Leu Glu He Phe Pro 335 340 345
AGC GAT TTG AAA GAT TGC GTG CCG ATC TAT CAA ACT TTT AAA GGT TGG 1167 Ser Asp Leu Lys Asp Cys Val Pro He Tyr Gin Thr Phe Lys Gly Trp 350 355 360 365 GAA AAA AGC GTG GGC GTG AGA AAA TTA GAC GAT TTA GAG CCA AAC GTT 1215 Glu Lys Ser Val Gly Val Arg Lys Leu Asp Asp Leu Glu Pro Asn Val 370 375 380
AGA GAG TAT ATC CGT TTT ATT GAA AAA GAA GTG GGG GTA AAA ATC CGC 1263 Arg Glu Tyr He Arg Phe He Glu Lys Glu Val Gly Val Lys He Arg 385 390 395
CTT ATT TCT ACA AGC CCT GAA AGA GAA GAC ACG ATT TTT CTA TGAAAAAAT 1314 Leu He Ser Thr Ser Pro Glu Arg Glu Asp Thr He Phe Leu 400 405 410
TCGCTTCTGT ATTGGTGCAA TTAAAAACCC TTGCGT 1350
(2) INFORMATION FOR SEQ ID NO: 662-
( ) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 411 ammo acids
Figure imgf001017_0001
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(n) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(XI ) SEQUENCE DESCRIPTION: SEQ ID NO: 662:
Met Ala Asp Val Val Val Gly He Gin Trp Gly Asp Glu Gly Lys Gly
1 5 10 15
Lys He Val Asp Arg He Ala Lys Asp Tyr Asp Phe Val Val Arg Tyr
20 25 30
Gin Gly Gly His Asn Ala Gly His Thr He Val His Lys Gly Val Lys
35 40 45
His Ser Leu His Leu Met Pro Ser Gly Val Leu Tyr Pro Lys Cys Lys
50 55 60
Asn He He Ser Ser Ala Val Val Val Ser Val Lys Asp Leu Cys Glu 65 70 75 80
Glu He Ser Ala Phe Glu Asp Leu Glu Asn Arg Leu Phe Val Ser Asp
85 90 95
Arg Ala H s Val He Leu Pro Tyr His Ala Lys Lys Asp Ala Phe Lys
100 105 110
Glu Lys Ser Gin Asn He Gly Thr Thr Lys Lys Gly He Gly Pro Cys
115 120 125
Tyr Glu Asp Lys Met Ala Arg Ser Gly He Arg Met Gly Asp Leu Leu
130 135 140
Asp Asp Lys He Leu Glu Glu Lys Leu Asn Ala His Phe Lys Ala He 145 150 155 160
Glu Pro Phe Lys Lys Ala Tyr Asp Leu Gly Glu Asn Tyr Glu Lys Asp
165 170 175
Leu Met Gly Tyr Phe Lys Thr Tyr Ala Pro Lys He Cys Pro Phe He
180 185 190
Lys Asp Thr Thr Ser Met Leu He Glu Ala Asn Gin Lys Gly Glu Lys
195 200 205
He Leu Leu Glu Gly Ala Gin Gly Thr Leu Leu Asp He Asp Leu Gly 210 215 220 Thr Tyr Pro Phe Val Thr Ser Ser Asn Thr Thr Ser Ala Ser Ala Cys 225 230 235 240
Val Ser Thr Gly Leu Asn Pro Lys Ala He Asn Glu Val He Gly He
245 250 255
Thr Lys Ala Tyr Ser Thr Arg Val Gly Asn Gly Pro Phe Pro Ser Glu
260 265 270
Asp Thr Thr Pro Met Gly Asp His Leu Arg Thr Lys Gly Ala Glu Phe
275 280 285
Gly Thr Thr Thr Lys Arg Pro Arg Arg Cys Gly Trp Leu Asp Leu Val
290 295 300
Ala Leu Lys Tyr Ala Cys Ala Leu Asn Gly Cys Thr Gin Leu Ala Leu 305 310 315 320
Met Lys Leu Asp Val Leu Asp Gly He Asp Ala He Lys Val Cys Val
325 330 335
Ala Tyr Glu Arg Lys Gly Glu Arg Leu Glu He Phe Pro Ser Asp Leu
340 345 350
Lys Asp Cys Val Pro He Tyr Gin Thr Phe Lys Gly Trp Glu Lys Ser
355 360 365
Val Gly Val Arg Lys Leu Asp Asp Leu Glu Pro Asn Val Arg Glu Tyr
370 375 380
He Arg Phe He Glu Lys Glu Val Gly Val Lys He Arg Leu He Ser 385 390 395 400
Thr Ser Pro Glu Arg Glu Asp Thr He Phe Leu 405 410
(2) INFORMATION FOR SEQ ID NO: 663:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1363 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1310 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663:
TTTTAGGGCT TTTCAATGAT CTCACTCGTT TGCTATAAAA GGGGGAGCTG GTG GAT 56
Val Asp 1
GTA TTG AGC GTG AGC GAA ATC AAT GCG CAA ATC AAA GCC CTT TTA GAA 104 Val Leu Ser Val Ser Glu He Asn Ala Gin He Lys Ala Leu Leu Glu 5 10 15
GCG ACT TTT TTG CAA GTT AGG GTT CAA GGG GAA GTG AGT AAT TTG ACT 152 Ala Thr Phe Leu Gin Val Arg Val Gin Gly Glu Val Ser Asn Leu Thr 20 25 30
ATC CAT AAG GTG AGC GGC CAT GCG TAT TTT TCG CTC AAA GAC AGC CAG 200 He His Lys Val Ser Gly His Ala Tyr Phe Ser Leu Lys Asp Ser Gin 35 40 45 50
TCG GTT ATT AAA TGC GTG CTG TTT AAA GGG AAC GCT AAC AGG CTC AAA 248 Ser Val He Lys Cys Val Leu Phe Lys Gly Asn Ala Asn Arg Leu Lys 55 60 65
TTC GCT TTA AAA GAA GGG CAG GAA GTG GTT GTT TTT GGG GGT ATT AGC 296 Phe Ala Leu Lys Glu Gly Gin Glu Val Val Val Phe Gly Gly He Ser 70 75 80
GTG TAT GTC CCA AGG GGG GAT TAT CAA ATC AAT TGC TTT GAA ATA GAG 344 Val Tyr Val Pro Arg Gly Asp Tyr Gin He Asn Cys Phe Glu He Glu 85 90 95
CCT AAG GAT ATA GGT TCA TTA ACT TTA GCT TTA GAG CAA TTG AAA GAA 392 Pro Lys Asp He Gly Ser Leu Thr Leu Ala Leu Glu Gin Leu Lys Glu 100 105 110
AAA TTA CGC CTT AAA GGC TAT TTT GAT GAA GAA AAT AAA TTA CCC AAA 440 Lys Leu Arg Leu Lys Gly Tyr Phe Asp Glu Glu Asn Lys Leu Pro Lys 115 120 125 130
CCG CAT TTT CCT AAA CGA GTG GCA GTC ATC ACT TCT CAA AAT TCA GCC 488 Pro His Phe Pro Lys Arg Val Ala Val He Thr Ser Gin Asn Ser Ala 135 140 145
GCT TGG GCG GAC ATG AAA AAG ATC GCT TCC AAA CGA TGG CCG ATG TGT 536 Ala Trp Ala Asp Met Lys Lys He Ala Ser Lys Arg Trp Pro Met Cys 150 155 160
GAA TTA GTT TGT ATC AAC ACC TTA ATG CAA GGG GAG GGC TGC GTT CAA 584 Glu Leu Val Cys He Asn Thr Leu Met Gin Gly Glu Gly Cys Val Gin 165 170 175
AGC GTG GTG GAA AGC ATC GTT TAT GCG GAT AGT TTT CAT GAC ACA AAA 632 Ser Val Val Glu Ser He Val Tyr Ala Asp Ser Phe His Asp Thr Lys 180 185 190
AAC GCT TTT GAT GCG ATT GTA GTG GCT AGG GGT GGG GGG AGC ATG GAG 680 Asn Ala Phe Asp Ala He Val Val Ala Arg Gly Gly Gly Ser Met Glu 195 200 205 210
GAT TTG TAT TCT TTC AAT GAT GAA AAA ATC GCT GAT GCT CTG TAT TTG 728 Asp Leu Tyr Ser Phe Asn Asp Glu Lys He Ala Asp Ala Leu Tyr Leu 215 220 225
GCC AAA ACC TTC AGC ATG TCA GCT ATT GGG CAT GAG AGC GAT TTT TTA 776 Ala Lys Thr Phe Ser Met Ser Ala He Gly His Glu Ser Asp Phe Leu 230 235 240
TTG AGC GAT TTA GTG GCG GAT TTA AGG GCT TCT ACG CCT TCA AAC GCG 824 Leu Ser Asp Leu Val Ala Asp Leu Arg Ala Ser Thr Pro Ser Asn Ala 245 250 255
ATG GAA ATT TTA CTC CCC AGC AGC GAT GAA TGG TTG CAA AGA CTT GAT 872 Met Glu He Leu Leu Pro Ser Ser Asp Glu Trp Leu Gin Arg Leu Asp 260 265 270
GGG TTT AAT GTG AAA TTG CAC CGC TCG TTT AAA ACT TTG CTC CAC CAA 920 Gly Phe Asn Val Lys Leu His Arg Ser Phe Lys Thr Leu Leu His Gin 275 280 285 290
AAA AAG GCG CAT TTA GAG CAT TTA GTG GCT TCT TTA AAA CGA TTG AGT 968 Lys Lys Ala His Leu Glu His Leu Val Ala Ser Leu Lys Arg Leu Ser 295 300 305
TTT GAA AAC AAG CAC CAT TTA AAC GCT TTA AAA CTA GAA AAA TTA AAA 1016 Phe Glu Asn Lys His His Leu Asn Ala Leu Lys Leu Glu Lys Leu Lys 310 315 320
ATC GCC CTA GAA AAT AAA ACT CTA GAA TTT TTA CGC TTT AAA AAA ACG 1064 He Ala Leu Glu Asn Lys Thr Leu Glu Phe Leu Arg Phe Lys Lys Thr 325 330 335
CTT TTA GAA AAA ATC TCT ACT CAA ACA TTA ACA AGC CCT TTT TTA CAA 1112 Leu Leu Glu Lys He Ser Thr Gin Thr Leu Thr Ser Pro Phe Leu Gin 340 345 350
ACT AAA ACA GAG CGA TTG AAC AGG CTA GAA AAC GCC CTT AAA CTC GCT 1160 Thr Lys Thr Glu Arg Leu Asn Arg Leu Glu Asn Ala Leu Lys Leu Ala 355 360 365 370
CAT GCT AAT TTG AAA TTA CCC CAA TTC GGG GCG TTG GTG AGC AAA AAT 1208 His Ala Asn Leu Lys Leu Pro Gin Phe Gly Ala Leu Val Ser Lys Asn 375 380 385
AAT CAA GCG ATA GAA TTA GAG GCA TTA AAA AGG GGC GAT AAA ATT GAA 1256 Asn Gin Ala He Glu Leu Glu Ala Leu Lys Arg Gly Asp Lys He Glu 390 395 400
TTA AGT AAT GAA AAA ACC AGA GCG AGC GCT GAA ATT TTG AGC GTG GAT 1304 Leu Ser Asn Glu Lys Thr Arg Ala Ser Ala Glu He Leu Ser Val Asp 405 410 415
AGG GTG TAGGGGTTTG AAAAATAATA TTTAAAACGC TATTTGTTTT ATCTTAAAAT TT 1362 Arg Val 420
C 1363
(2) INFORMATION FOR SEQ ID NO:664:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 420 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(11) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 664:
Val Asp Val Leu Ser Val Ser Glu He Asn Ala Gin He Lys Ala Leu
1 5 10 15
Leu Glu Ala Thr Phe Leu Gin Val Arg Val Gin Gly Glu Val Ser Asn
20 25 30
Leu Thr He His Lys Val Ser Gly His Ala Tyr Phe Ser Leu Lys Asp
35 40 45
Ser Gin Ser Val He Lys Cys Val Leu Phe Lys Gly Asn Ala Asn Arg
50 55 60
Leu Lys Phe Ala Leu Lys Glu Gly Gin Glu Val Val Val Phe Gly Gly 65 70 75 80
He Ser Val Tyr Val Pro Arg Gly Asp Tyr Gin He Asn Cys Phe Glu
85 90 95
He Glu Pro Lys Asp He Gly Ser Leu Thr Leu Ala Leu Glu Gin Leu
100 105 110
Lys Glu Lys Leu Arg Leu Lys Gly Tyr Phe Asp Glu Glu Asn Lys Leu
115 120 125
Pro Lys Pro His Phe Pro Lys Arg Val Ala Val He Thr Ser Gin Asn
130 135 140
Ser Ala Ala Trp Ala Asp Met Lys Lys He Ala Ser Lys Arg Trp Pro 145 150 155 160
Met Cys Glu Leu Val Cys He Asn Thr Leu Met Gin Gly Glu Gly Cys
165 170 175
Val Gin Ser Val Val Glu Ser He Val Tyr Ala Asp Ser Phe His Asp
180 185 190
Thr Lys Asn Ala Phe Asp Ala He Val Val Ala Arg Gly Gly Gly Ser
195 200 205
Met Glu Asp Leu Tyr Ser Phe Asn Asp Glu Lys He Ala Asp Ala Leu
210 215 220
Tyr Leu Ala Lys Thr Phe Ser Met Ser Ala He Gly His Glu Ser Asp 225 230 235 240
Phe Leu Leu Ser Asp Leu Val Ala Asp Leu Arg Ala Ser Thr Pro Ser
245 250 255
Asn Ala Met Glu He Leu Leu Pro Ser Ser Asp Glu Trp Leu Gin Arg
260 265 270
Leu Asp Gly Phe Asn Val Lys Leu His Arg Ser Phe Lys Thr Leu Leu
275 280 285
His Gin Lys Lys Ala His Leu Glu His Leu Val Ala Ser Leu Lys Arg
290 295 300
Leu Ser Phe Glu Asn Lys His His Leu Asn Ala Leu Lys Leu Glu Lys 305 310 315 320
Leu Lys He Ala Leu Glu Asn Lys Thr Leu Glu Phe Leu Arg Phe Lys
325 330 335
Lys Thr Leu Leu Glu Lys He Ser Thr Gin Thr Leu Thr Ser Pro Phe
340 345 350
Leu Gin Thr Lys Thr Glu Arg Leu Asn Arg Leu Glu Asn Ala Leu Lys
355 360 365
Leu Ala His Ala Asn Leu Lys Leu Pro Gin Phe Gly Ala Leu Val Ser 370 375 380 Lys Asn Asn Gin Ala He Glu Leu Glu Ala Leu Lys Arg Gly Asp Lys 385 390 395 400
He Glu Leu Ser Asn Glu Lys Thr Arg Ala Ser Ala Glu He Leu Ser
405 410 415
Val Asp Arg Val 420
(2) INFORMATION FOR SEQ ID NO: 665:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1083 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1031 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 665:
AAAGACGCGC ATGGATTCTG TTGGGTTTGA AGAACCTCAA AGGGGTTTGG GTG CTT 56
Val Leu
1
AAG GGG TTA AAA AAA GCG TTT AAG GAG AGG TTT TGC TCT CAA GTG TAT 104 Lys Gly Leu Lys Lys Ala Phe Lys Glu Arg Phe Cys Ser Gin Val Tyr 5 10 15
ATC TCT TTT AAT GTG GAT CAC AAT CTT TTA TCC ACT CAA GTC ATA AGG 152 He Ser Phe Asn Val Asp His Asn Leu Leu Ser Thr Gin Val He Arg 20 25 30
ATC AAA AAC GAT CGC ATT AAA GAG AAA TTT TTT AAA ACT TTT GAG ACT 200 He Lys Asn Asp Arg He Lys Glu Lys Phe Phe Lys Thr Phe Glu Thr 35 40 45 50
AAA GTG GAG ACT AAA AAT GGT GAA GTC CCT ATT CAA GCC TTA AAA ATC 248 Lys Val Glu Thr Lys Asn Gly Glu Val Pro He Gin Ala Leu Lys He 55 60 65
GCC AGA ACT TAT AGC CAA AAA TAC CCC TAC ACT TAT TTT AGC GCG ATG 296 Ala Arg Thr Tyr Ser Gin Lys Tyr Pro Tyr Thr Tyr Phe Ser Ala Met 70 75 80
AGT AAA GCT AAA GAG GTT TTA TGC GAA AAG CAG GCG TTT GAA CAA ATC 344 Ser Lys Ala Lys Glu Val Leu Cys Glu Lys Gin Ala Phe Glu Gin He 85 90 95 AAA CAA GAA AAT CAA GAT TAT CAT GCT TGT GAA GTC AAT CAA AAG TAT 392 Lys Gin Glu Asn Gin Asp Tyr His Ala Cys Glu Val Asn Gin Lys Tyr 100 105 110
TGC GTT TAT GTG GAA TCT AAG GAT TTT TTA AAG GAT TTT AAG CGT TTT 440 Cys Val Tyr Val Glu Ser Lys Asp Phe Leu Lys Asp Phe Lys Arg Phe 115 120 125 130
AAA ATC CAG GAT GTG GAT TTT TTG TTT TCG CCT TTT AGC CTT ATT TAT 488 Lys He Gin Asp Val Asp Phe Leu Phe Ser Pro Phe Ser Leu He Tyr 135 140 145
GAT TTT GTG CGC GAT AAT TTA GAA AAT AAG CCG TTG TTG TAT TTG CTT 536 Asp Phe Val Arg Asp Asn Leu Glu Asn Lys Pro Leu Leu Tyr Leu Leu 150 155 160
TTG GAG CGT TCA AGA TTT TAT TTT TTG ATT GCG GAT AAA AAA GAG ATT 584 Leu Glu Arg Ser Arg Phe Tyr Phe Leu He Ala Asp Lys Lys Glu He 165 170 175
TTT TTA GCC AAA TCC GTG TTT TTA GAA GAA CAA CCT GAA GAG TTT ATA 632 Phe Leu Ala Lys Ser Val Phe Leu Glu Glu Gin Pro Glu Glu Phe He 180 185 190
GAG AGC AAA GAA GAA GAT TTT ATG GGA ATG GAT AAT GAG GCT GTG GAT 680 Glu Ser Lys Glu Glu Asp Phe Met Gly Met Asp Asn Glu Ala Val Asp 195 200 205 210
TTG TTT TTG AGT GAA ATC CAA GAA GAT ATT GAC AGC CTT GAA GAA GCG 728 Leu Phe Leu Ser Glu He Gin Glu Asp He Asp Ser Leu Glu Glu Ala 215 220 225
ATA GGC CTA GAC AGC AGT AAG GAT AAT AGC GAA AAA ATA ACA GAG GAC 776 He Gly Leu Asp Ser Ser Lys Asp Asn Ser Glu Lys He Thr Glu Asp 230 235 240
GCT TAT AGT TTG ATT GAA GGC ATG ACG AAT ATC CCC TTA ATT GCA GAT 824 Ala Tyr Ser Leu He Glu Gly Met Thr Asn He Pro Leu He Ala Asp 245 250 255
GTT TTG CAA GAG GGA TTG CGT GGC GTC TAT CAT TCT AGA GAG ATA GAC 872 Val Leu Gin Glu Gly Leu Arg Gly Val Tyr His Ser Arg Glu He Asp 260 265 270
TTT GTA GAA AAA GTG GTT GTT TTA GAC AGC TGT CAA ATC CAC CAA AAA 920 Phe Val Glu Lys Val Val Val Leu Asp Ser Cys Gin He His Gin Lys 275 280 285 290
GCG TTA ATG CAT TTG CAA GAA ACT TTG ATG ATA GAA GTG GAT AGG CTT 968 Ala Leu Met His Leu Gin Glu Thr Leu Met He Glu Val Asp Arg Leu 295 300 305
GAT TTT TCT TTA GTG GAG CGC TTG AAC ATT TTA GCG CGC ATG GAG AAT 1016 Asp Phe Ser Leu Val Glu Arg Leu Asn He Leu Ala Arg Met Glu Asn 310 315 320 GAA AAG CAT GCG TTT TAGTTACATT GAGCCAAGAG CGAAATACCT TATCAGCAAG C 1072 Glu Lys His Ala Phe 325
TTTCTAAAAT T 1083
(2) INFORMATION FOR SEQ ID NO: 666:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 327 ammo acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 666:
Val Leu Lys Gly Leu Lys Lys Ala Phe Lys Glu Arg Phe Cys Ser Gin
1 5 10 15
Val Tyr He Ser Phe Asn Val Asp His Asn Leu Leu Ser Thr Gin Val
20 25 30
He Arg He Lys Asn Asp Arg He Lys Glu Lys Phe Phe Lys Thr Phe
35 40 45
Glu Thr Lys Val Glu Thr Lys Asn Gly Glu Val Pro He Gin Ala Leu
50 55 60
Lys He Ala Arg Thr Tyr Ser Gin Lys Tyr Pro Tyr Thr Tyr Phe Ser 65 70 75 80
Ala Met Ser Lys Ala Lys Glu Val Leu Cys Glu Lys Gin Ala Phe Glu
85 90 95
Gin He Lys Gin Glu Asn Gin Asp Tyr His Ala Cys Glu Val Asn Gin
100 105 110
Lys Tyr Cys Val Tyr Val Glu Ser Lys Asp Phe Leu Lys Asp Phe Lys
115 120 125
Arg Phe Lys He Gin Asp Val Asp Phe Leu Phe Ser Pro Phe Ser Leu
130 135 140
He Tyr Asp Phe Val Arg Asp Asn Leu Glu Asn Lys Pro Leu Leu Tyr 145 150 155 160
Leu Leu Leu Glu Arg Ser Arg Phe Tyr Phe Leu He Ala Asp Lys Lys
165 170 175
Glu He Phe Leu Ala Lys Ser Val Phe Leu Glu Glu Gin Pro Glu Glu
180 185 190
Phe He Glu Ser Lys Glu Glu Asp Phe Met Gly Met Asp Asn Glu Ala
195 200 205
Val Asp Leu Phe Leu Ser Glu He Gin Glu Asp He Asp Ser Leu Glu
210 215 220
Glu Ala He Gly Leu Asp Ser Ser Lys Asp Asn Ser Glu Lys He Thr 225 230 235 240
Glu Asp Ala Tyr Ser Leu He Glu Gly Met Thr Asn He Pro Leu He
245 250 255
Ala Asp Val Leu Gin Glu Gly Leu Arg Gly Val Tyr His Ser Arg Glu
260 265 270
He Asp Phe Val Glu Lys Val Val Val Leu Asp Ser Cys Gin He His 275 280 285 Gin Lys Ala Leu Met His Leu Gin Glu Thr Leu Met He Glu Val Asp
290 295 300
Arg Leu Asp Phe Ser Leu Val Glu Arg Leu Asn He Leu Ala Arg Met
305 310 315 320
Glu Asn Glu Lys His Ala Phe 325
(2) INFORMATION FOR SEQ ID NO: 667:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1233 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 58...1170 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667:
ATTCCAAACT TCCTAAAAAA TGATTACGCC ATTATATCCA AGATTAAGGC TTAAACG ATG 60
Met 1
GAT TTT CAA CTC CAA GCG ACT GAC AAT AAC GCG CGA GCT GGT CTT TTA 108 Asp Phe Gin Leu Gin Ala Thr Asp Asn Asn Ala Arg Ala Gly Leu Leu 5 10 15
AAT TTA GCC CAT TCT CAA GTG GCA ACG CCT GTT TTT ATG CCC GTA GGC 156 Asn Leu Ala His Ser Gin Val Ala Thr Pro Val Phe Met Pro Val Gly 20 25 30
ACG CAA GGC TGC ATC AAA TCT TTA GAC GCT ACA GAT GCG CAA GAA ATT 204 Thr Gin Gly Cys He Lys Ser Leu Asp Ala Thr Asp Ala Gin Glu He 35 40 45
TTA GGC GCT AAA CTC ATT TTA GCC AAC ACC TAT CAC ATG TAT TTA AGG 252 Leu Gly Ala Lys Leu He Leu Ala Asn Thr Tyr His Met Tyr Leu Arg 50 55 60 65
CCG GGT GAA AAG GTC GTT GAG GAG TTA GGG GGC TTG CAT CGT TTC GCT 300 Pro Gly Glu Lys Val Val Glu Glu Leu Gly Gly Leu His Arg Phe Ala 70 75 80
CAA TTT TAT GGG AGT TTT TTA ACC GAT AGT GGA GGG TTT CAA GCC TTT 348 Gin Phe Tyr Gly Ser Phe Leu Thr Asp Ser Gly Gly Phe Gin Ala Phe 85 90 95 AGT TTG AGC GAT AAT GTC AAA TTG CAA GAA GAT GGG ATC GTT TTT AAA 396 Ser Leu Ser Asp Asn Val Lys Leu Gin Glu Asp Gly He Val Phe Lys 100 105 110
TCC CAT ATT GAT GGG AGC AAG CAT CTA TTC ACG CCC GCT AAA GTT TTG 444 Ser His He Asp Gly Ser Lys His Leu Phe Thr Pro Ala Lys Val Leu 115 120 125
GAC ATT CAA TAT TCT TTG AAT AGC GAT ATT ATG ATG GTT TTA GAC GAT 492 Asp He Gin Tyr Ser Leu Asn Ser Asp He Met Met Val Leu Asp Asp 130 135 140 145
TTA GTG GGC TTG CCC GCT CCC TTA AAA CGC CTT GAA GAA TCC ATT AAA 540 Leu Val Gly Leu Pro Ala Pro Leu Lys Arg Leu Glu Glu Ser He Lys 150 155 160
AGA AGT GCT AAA TGG GCG AAT ATG AGC CTA GAA TAC CAC AAA GAA AAA 588 Arg Ser Ala Lys Trp Ala Asn Met Ser Leu Glu Tyr His Lys Glu Lys 165 170 175
AAC CGC CCG AGC AAC AAC CTT TTT GCC ATT ATC CAG GGC GGG ACG CAT 636 Asn Arg Pro Ser Asn Asn Leu Phe Ala He He Gin Gly Gly Thr His 180 185 190
TTG AAA ATG CGC AGC CTT AGC GTG GGA TTA ACG CAT GAG GGT TTT GAT 684 Leu Lys Met Arg Ser Leu Ser Val Gly Leu Thr His Glu Gly Phe Asp 195 200 205
GGC TAC GCT ATA GGC GGT TTA GCG GTG GGG GAA AGC GCT GAT GAA ATG 732 Gly Tyr Ala He Gly Gly Leu Ala Val Gly Glu Ser Ala Asp Glu Met 210 215 220 225
CTA GAA ACC ATC GCG CAC ACC GCC CCC TTG CTC CCC AAA GAC AAG CCT 780 Leu Glu Thr He Ala His Thr Ala Pro Leu Leu Pro Lys Asp Lys Pro 230 235 240
CGC TAC TTA ATG GGC GTA GGC ACG CCT GAA AAT ATC CTA GAC GCT ATC 828 Arg Tyr Leu Met Gly Val Gly Thr Pro Glu Asn He Leu Asp Ala He 245 250 255
AGT TTG GGG GTG GAT ATG TTT GAT TGC GTG ATG CCC ACC AGA AAC GCC 876 Ser Leu Gly Val Asp Met Phe Asp Cys Val Met Pro Thr Arg Asn Ala 260 265 270
AGA AAC GCC ACC CTT TTC ACG CAT TCT GGC AAA ATT TCT ATC AAA AAC 924 Arg Asn Ala Thr Leu Phe Thr His Ser Gly Lys He Ser He Lys Asn 275 280 285
GCG CCC TAT AAA TTG GAT AAT ACC CCT ATT GAA GAA AAT TGC GCA TGT 972 Ala Pro Tyr Lys Leu Asp Asn Thr Pro He Glu Glu Asn Cys Ala Cys 290 295 300 305
TAT GCT TGC AAA CGC TAT TCT AAA GCC TAT TTG CAC CAT TTA TTT AGG 1020 Tyr Ala Cys Lys Arg Tyr Ser Lys Ala Tyr Leu His His Leu Phe Arg 310 315 320 GCT AAA GAA CTC ACT TAC GCT CGT TTG GCC AGC TTG CAC AAT TTG CAT 1068 Ala Lys Glu Leu Thr Tyr Ala Arg Leu Ala Ser Leu His Asn Leu His 325 330 335
TTT TAT TTA GAG CTG GTG AAG AAC GCC AGA AAC GCC ATT TTA GAA AAG 1116 Phe Tyr Leu Glu Leu Val Lys Asn Ala Arg Asn Ala He Leu Glu Lys 340 345 350
CGG TTT TTG AGT TTT AAA AAA GAA TTT TTG GAG AAA TAC AAC TCC CGC 1164 Arg Phe Leu Ser Phe Lys Lys Glu Phe Leu Glu Lys Tyr Asn Ser Arg 355 360 365
TCT CAT TGAATGATGG AATGCAAAAA TACTAAAAAG CGTTTTTTAC CATCAATAAA AG 1222
Ser His
370
TTTTCTTAAA A 1233
(2) INFORMATION FOR SEQ ID NO: 668:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 371 ammo acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 668:
Met Asp Phe Gin Leu Gin Ala Thr Asp Asn Asn Ala Arg Ala Gly Leu
1 5 10 15
Leu Asn Leu Ala His Ser Gin Val Ala Thr Pro Val Phe Met Pro Val
20 25 30
Gly Thr Gin Gly Cys He Lys Ser Leu Asp Ala Thr Asp Ala Gin Glu
35 40 45
He Leu Gly Ala Lys Leu He Leu Ala Asn Thr Tyr His Met Tyr Leu
50 55 60
Arg Pro Gly Glu Lys Val Val Glu Glu Leu Gly Gly Leu His Arg Phe 65 70 75 80
Ala Gin Phe Tyr Gly Ser Phe Leu Thr Asp Ser Gly Gly Phe Gin Ala
85 90 95
Phe Ser Leu Ser Asp Asn Val Lys Leu Gin Glu Asp Gly He Val Phe
100 105 110
Lys Ser His He Asp Gly Ser Lys His Leu Phe Thr Pro Ala Lys Val
115 120 125
Leu Asp He Gin Tyr Ser Leu Asn Ser Asp He Met Met Val Leu Asp
130 135 140
Asp Leu Val Gly Leu Pro Ala Pro Leu Lys Arg Leu Glu Glu Ser He 145 150 155 160
Lys Arg Ser Ala Lys Trp Ala Asn Met Ser Leu Glu Tyr His Lys Glu
165 170 175
Lys Asn Arg Pro Ser Asn Asn Leu Phe Ala He He Gin Gly Gly Thr 180 185 190 His Leu Lys Met Arg Ser Leu Ser Val Gly Leu Thr His Glu Gly Phe
195 200 205
Asp Gly Tyr Ala He Gly Gly Leu Ala Val Gly Glu Ser Ala Asp Glu
210 215 220
Met Leu Glu Thr He Ala His Thr Ala Pro Leu Leu Pro Lys Asp Lys 225 230 235 240
Pro Arg Tyr Leu Met Gly Val Gly Thr Pro Glu Asn He Leu Asp Ala
245 250 255
He Ser Leu Gly Val Asp Met Phe Asp Cys Val Met Pro Thr Arg Asn
260 265 270
Ala Arg Asn Ala Thr Leu Phe Thr His Ser Gly Lys He Ser He Lys
275 280 285
Asn Ala Pro Tyr Lys Leu Asp Asn Thr Pro He Glu Glu Asn Cys Ala
290 295 300
Cys Tyr Ala Cys Lys Arg Tyr Ser Lys Ala Tyr Leu His His Leu Phe 305 310 315 320
Arg Ala Lys Glu Leu Thr Tyr Ala Arg Leu Ala Ser Leu His Asn Leu
325 330 335
His Phe Tyr Leu Glu Leu Val Lys Asn Ala Arg Asn Ala He Leu Glu
340 345 350
Lys Arg Phe Leu Ser Phe Lys Lys Glu Phe Leu Glu Lys Tyr Asn Ser
355 360 365
Arg Ser His 370
(2) INFORMATION FOR SEQ ID NO: 669:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1357 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1304 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669:
GTGGATGCTG ATTTTTTAGG CGATGGTTTT GGGAATAAAA GGGAACAAAA ATG AAA 56
Met Lys
1
AAA GTC TAT TTC AAA ACT TTT GGG TGC AGG ACG AAT CTT TTT GAC ACG 104 Lys Val Tyr Phe Lys Thr Phe Gly Cys Arg Thr Asn Leu Phe Asp Thr 5 10 15
CAA GTG ATG AGC GAG AAT TTG AAG GAC TTT AGC ACG ACC TTA GAA GAA 152 Gin Val Met Ser Glu Asn Leu Lys Asp Phe Ser Thr Thr Leu Glu Glu 20 25 30
CAA GAA GCC GAT ATT ATT ATC ATC AAT TCT TGC ACC GTG ACC AAT GGG 200 Gin Glu Ala Asp He He He He Asn Ser Cys Thr Val Thr Asn Gly 35 40 45 50
GCC GAT AGC GCG GTA AGG AGT TAC GCT AAA AAA ATG GCA CGG TTG GAT 248 Ala Asp Ser Ala Val Arg Ser Tyr Ala Lys Lys Met Ala Arg Leu Asp 55 60 65
AAG GAA GTG CTA TTT ACT GGT TGC GGG GTG AAA ACC CAA GGC AAA GAG 296 Lys Glu Val Leu Phe Thr Gly Cys Gly Val Lys Thr Gin Gly Lys Glu 70 75 80
CTT TTT GAA AAA GGG TTT TTA AAG GGC GTT TTT GGG CAT GAC AAT AAA 344 Leu Phe Glu Lys Gly Phe Leu Lys Gly Val Phe Gly His Asp Asn Lys 85 90 95
GAA AAG ATT AAC GCG CTT TTA CAA GAA AAA AAG CGT TTT TTT ATA GAT 392 Glu Lys He Asn Ala Leu Leu Gin Glu Lys Lys Arg Phe Phe He Asp 100 105 110
GAC AAT TTA GAA AAC AAG CAC TTA GAC ACC ACG ATG GTG AGC GAG TTT 440 Asp Asn Leu Glu Asn Lys His Leu Asp Thr Thr Met Val Ser Glu Phe 115 120 125 130
GTG GGA AAA ACT AGG GCG TTT ATT AAG ATC CAA GAA GGC TGT GAT TTT 488 Val Gly Lys Thr Arg Ala Phe He Lys He Gin Glu Gly Cys Asp Phe 135 140 145
GAT TGC AAT TAT TGC ATT ATC CCA AGC GTG AGA GGG AGG GCT AGG AGT 536 Asp Cys Asn Tyr Cys He He Pro Ser Val Arg Gly Arg Ala Arg Ser 150 155 160
TTT GAA GAG AGA AAA ATT TTA GAG CAA GTG GGC CTT TTA TGC TCT AAA 584 Phe Glu Glu Arg Lys He Leu Glu Gin Val Gly Leu Leu Cys Ser Lys 165 170 175
GGG GTT CAA GAA GTG GTT TTA ACC GGC ACC AAT GTG GGG AGC TAT GGG 632 Gly Val Gin Glu Val Val Leu Thr Gly Thr Asn Val Gly Ser Tyr Gly 180 185 190
AAA GAT AGA GGA AGC AAT ATC GCG CGA TTG ATT AAA AAA TTA AGC CAG 680 Lys Asp Arg Gly Ser Asn He Ala Arg Leu He Lys Lys Leu Ser Gin 195 200 205 210
ATC GCT GGA TTA AAA CGC ATA AGG ATT GGG AGC TTA GAA CCT AAT CAA 728 He Ala Gly Leu Lys Arg He Arg He Gly Ser Leu Glu Pro Asn Gin 215 220 225
ATT AAC GAT GAA TTT TTA GAG CTT TTA GAA GAG GAT TTT TTA GAA AAA 776 He Asn Asp Glu Phe Leu Glu Leu Leu Glu Glu Asp Phe Leu Glu Lys 230 235 240
CAT TTG CAT ATC GCT TTA CAG CAC AGC CAT GAT CTC ATG CTA GAG AGG 824 His Leu His He Ala Leu Gin His Ser His Asp Leu Met Leu Glu Arg 245 250 255
ATG AAT CGA AGA AAC CGC ACT AAA AGC GAT AGG GAA TTA TTA GAA ACA 872 Met Asn Arg Arg Asn Arg Thr Lys Ser Asp Arg Glu Leu Leu Glu Thr 260 265 270
ATC GCT TCT AAG AAT TTT GCT ATT GGC ACG GAT TTT ATT GTG GGG CAT 920 He Ala Ser Lys Asn Phe Ala He Gly Thr Asp Phe He Val Gly His 275 280 285 290
CCG GGC GAG AGC GGA AGC GTT TTT GAA AAA GCG TTT AAA AAT TTA GAA 968 Pro Gly Glu Ser Gly Ser Val Phe Glu Lys Ala Phe Lys Asn Leu Glu 295 300 305
AGC TTG CCT TTA ACG CAC ATC CAC CCT TTT ATT TAC AGC AAA CGA AAA 1016 Ser Leu Pro Leu Thr His He His Pro Phe He Tyr Ser Lys Arg Lys 310 315 320
GAC ACC CCC TCT AGC TTG ATG ACT GAT AGC GTG AGT TTG GAA GAT TCT 1064 Asp Thr Pro Ser Ser Leu Met Thr Asp Ser Val Ser Leu Glu Asp Ser 325 330 335
AAA AAG CGT TTG AAT GCG ATT AAA GAT TTG ATT TTT CAT AAA AAT AAG 1112 Lys Lys Arg Leu Asn Ala He Lys Asp Leu He Phe His Lys Asn Lys 340 345 350
GCG TTC AGG CAA TTG CAG CTC AAG CTC AAT ACG CCT CTA AAA GCC TTA 1160 Ala Phe Arg Gin Leu Gin Leu Lys Leu Asn Thr Pro Leu Lys Ala Leu 355 360 365 370
GTG GAA GTG CAA AAA GAC GGC GAA TTT AAA GCC TTA GAT CAA TTT TTC 1208 Val Glu Val Gin Lys Asp Gly Glu Phe Lys Ala Leu Asp Gin Phe Phe 375 380 385
AAC CCC ATT AAA ATC AAA AGC GAT AAG CCT CTA AGG GCT AGT TTT TTA 1256 Asn Pro He Lys He Lys Ser Asp Lys Pro Leu Arg Ala Ser Phe Leu 390 395 400
GAA ATC AAA GAG TAT GAA ATT AAG GAG AGG GAA AAT CAT GCC GTT TTC T 1305 Glu He Lys Glu Tyr Glu He Lys Glu Arg Glu Asn His Ala Val Phe 405 410 415
AAAAATTTAG AAAATCTTAC CGCTCCCTTC AAACGCATTA AAAACCGCTC GC 1357
(2) INFORMATION FOR SEQ ID NO : 670-
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 418 ammo acids
Figure imgf001030_0001
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 670:
Met Lys Lys Val Tyr Phe Lys Thr Phe Gly Cys Arg Thr Asn Leu Phe
1 5 10 15
Asp Thr Gin Val Met Ser Glu Asn Leu Lys Asp Phe Ser Thr Thr Leu
20 25 30
Glu Glu Gin Glu Ala Asp He He He He Asn Ser Cys Thr Val Thr
35 40 45
Asn Gly Ala Asp Ser Ala Val Arg Ser Tyr Ala Lys Lys Met Ala Arg
50 55 60
Leu Asp Lys Glu Val Leu Phe Thr Gly Cys Gly Val Lys Thr Gin Gly 65 70 75 80
Lys Glu Leu Phe Glu Lys Gly Phe Leu Lys Gly Val Phe Gly His Asp
85 90 95
Asn Lys Glu Lys He Asn Ala Leu Leu Gin Glu Lys Lys Arg Phe Phe
100 105 110
He Asp Asp Asn Leu Glu Asn Lys His Leu Asp Thr Thr Met Val Ser
115 120 125
Glu Phe Val Gly Lys Thr Arg Ala Phe He Lys He Gin Glu Gly Cys
130 135 140
Asp Phe Asp Cys Asn Tyr Cys He He Pro Ser Val Arg Gly Arg Ala 145 150 155 160
Arg Ser Phe Glu Glu Arg Lys He Leu Glu Gin Val Gly Leu Leu Cys
165 170 175
Ser Lys* Gly Val Gin Glu Val Val Leu Thr Gly Thr Asn Val Gly Ser
180 185 190
Tyr Gly Lys Asp Arg Gly Ser Asn He Ala Arg Leu He Lys Lys Leu
195 200 205
Ser Gin He Ala Gly Leu Lys Arg He Arg He Gly Ser Leu Glu Pro
210 215 220
Asn Gin He Asn Asp Glu Phe Leu Glu Leu Leu Glu Glu Asp Phe Leu 225 230 235 240
Glu Lys His Leu His He Ala Leu Gin His Ser His Asp Leu Met Leu
245 250 255
Glu Arg Met Asn Arg Arg Asn Arg Thr Lys Ser Asp Arg Glu Leu Leu
260 265 270
Glu Thr He Ala Ser Lys Asn Phe Ala He Gly Thr Asp Phe He Val
275 280 285
Gly His Pro Gly Glu Ser Gly Ser Val Phe Glu Lys Ala Phe Lys Asn
290 295 300
Leu Glu Ser Leu Pro Leu Thr His He His Pro Phe He Tyr Ser Lys 305 310 315 320
Arg Lys Asp Thr Pro Ser Ser Leu Met Thr Asp Ser Val Ser Leu Glu
325 330 335
Asp Ser Lys Lys Arg Leu Asn Ala He Lys Asp Leu He Phe His Lys
340 345 350
Asn Lys Ala Phe Arg Gin Leu Gin Leu Lys Leu Asn Thr Pro Leu Lys
355 360 365
Ala Leu Val Glu Val Gin Lys Asp Gly Glu Phe Lys Ala Leu Asp Gin
370 375 380
Phe Phe Asn Pro He Lys He Lys Ser Asp Lys Pro Leu Arg Ala Ser 385 390 395 400
Phe Leu Glu He Lys Glu Tyr Glu He Lys Glu Arg Glu Asn His Ala
405 410 415
Val Phe (2) INFORMATION FOR SEQ ID NO : 671-
(i) SEQUENCE CHARACTERISTICS-
(A) LENGTH: 574 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE. Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...521 (D) OTHER INFORMATION.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671:
AGTTTAAGGA TTGCAACCAT TTGTTGCAAA AATCTTAAAA AAGGTGGAAG ATG AAA 56
Met Lys
1
AAA TTT GGT TTG GGG GTG TAT TTG CTT CTT TTA GGT ATT TTG GGC GGC 104 Lys Phe Gly Leu Gly Val Tyr Leu Leu Leu Leu Gly He Leu Gly Gly 5 10 15
TCT TTG ATC ATT CTA GGA GCG ATA GTC GCG CCC ATT GTT TTC AAA GCT 152 Ser Leu He He Leu Gly Ala He Val Ala Pro He Val Phe Lys Ala 20 25 30
TCA AGC GTT TTA CCT GAA TTG CAT CTG ACT CCC TTT GAG AGC GGG AAA 200 Ser Ser Val Leu Pro Glu Leu His Leu Thr Pro Phe Glu Ser Gly Lys 35 40 45 50
CTC ATG GCG CAA ATC TTT GTG CGT TTC AAT TAT GTT TTA GGC GCG ATC 248 Leu Met Ala Gin He Phe Val Arg Phe Asn Tyr Val Leu Gly Ala He 55 60 65
GGT TTT GTA GTG TTA CTT TAT GAA ATC ATT TCG TTT ATT TAT TAC AAA 296 Gly Phe Val Val Leu Leu Tyr Glu He He Ser Phe He Tyr Tyr Lys 70 75 80
AGA TCG TTA GTG TAT TTG ATC CTT GGC GTG GCG ATA GGG GCG TTG TGT 344 Arg Ser Leu Val Tyr Leu He Leu Gly Val Ala He Gly Ala Leu Cys 85 90 95
TTG CTC TTT GTT TTT TAT TAC ACG CCT TAT ATT TTA AAC GCT CAA AAA 392 Leu Leu Phe Val Phe Tyr Tyr Thr Pro Tyr He Leu Asn Ala Gin Lys 100 105 110
GCG GGC GAA GCC GCG CTT CAA AGT GCT GAA TTT GCC CGC TCG CAC GCT 440 Ala Gly Glu Ala Ala Leu Gin Ser Ala Glu Phe Ala Arg Ser His Ala 115 120 125 130
CAA AGC GAA TGG TTG TTT AAG GAA TTG TTT GTG CTG GTG TGC GCT TTG 488 Gin Ser Glu Trp Leu Phe Lys Glu Leu Phe Val Leu Val Cys Ala Leu 135 140 145
TTT TTT TGG CGT TTG CTT GGA AAA AAT GTG CTT TAGTCCCTTT GATTTAATCA 541 Phe Phe Trp Arg Leu Leu Gly Lys Asn Val Leu 150 155
AATGAGAGAG TTTTTGGCTA CTATCTAGGA AAT 574
(2) INFORMATION FOR SEQ ID NO: 672.
(l) SEQUENCE CHARACTERISTICS.
(A) LENGTH: 157 ammo acids
Figure imgf001033_0001
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE, protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 672
Met Lys Lys Phe Gly Leu Gly Val Tyr Leu Leu Leu Leu Gly He Leu
1 5 10 15
Gly Gly Ser Leu He He Leu Gly Ala He Val Ala Pro He Val Phe
20 25 30
Lys Ala Ser Ser Val Leu Pro Glu Leu His Leu Thr Pro Phe Glu Ser
35 40 45
Gly Lys Leu Met Ala Gin He Phe Val Arg Phe Asn Tyr Val Leu Gly
50 55 60
Ala He Gly Phe Val Val Leu Leu Tyr Glu He He Ser Phe He Tyr 65 70 75 80
Tyr Lys Arg Ser Leu Val Tyr Leu He Leu Gly Val Ala He Gly Ala
85 90 95
Leu Cys Leu Leu Phe Val Phe Tyr Tyr Thr Pro Tyr He Leu Asn Ala
100 105 110
Gin Lys Ala Gly Glu Ala Ala Leu Gin Ser Ala Glu Phe Ala Arg Ser
115 120 125
His Ala Gin Ser Glu Trp Leu Phe Lys Glu Leu Phe Val Leu Val Cys
130 135 140
Ala Leu Phe Phe Trp Arg Leu Leu Gly Lys Asn Val Leu 145 150 155
(2) INFORMATION FOR SEQ ID NO:673.
(l) SEQUENCE CHARACTERISTICS.
(A) LENGTH: 1780 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1727 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673:
ATAACCCGGT GTTTTTGATA GACAAATACG AAGCCGTTGC TAAAAATTAA ATG ATT 56
Met He
1
TTT GGG GAT TTT AAA TAT CAA AAA AGC GTT AAA AAA CTC ACA GCC ACC 104 Phe Gly Asp Phe Lys Tyr Gin Lys Ser Val Lys Lys Leu Thr Ala Thr 5 10 15
AAT CTT AAT GAG CTT AAA AAC GCC CTG GAT TTC ATC TCT CAA AAT AGG 152 Asn Leu Asn Glu Leu Lys Asn Ala Leu Asp Phe He Ser Gin Asn Arg 20 25 30
GGG AAT GGG TAT TTT GTG GGG TAT CTT TTA TAT GAA GCG CGC TTA GCG 200 Gly Asn Gly Tyr Phe Val Gly Tyr Leu Leu Tyr Glu Ala Arg Leu Ala 35 40 45 50
TTT TTA GAT GAA AAT TTT CAA AGC CAA ACC CCT TTT TTG TAT TTT GAA 248 Phe Leu Asp Glu Asn Phe Gin Ser Gin Thr Pro Phe Leu Tyr Phe Glu 55 60 65
CAA TTT TTA GAA AGA AAA AAA TAT TCT TTA GAG CCT TTA AAA GAG CAT 296 Gin Phe Leu Glu Arg Lys Lys Tyr Ser Leu Glu Pro Leu Lys Glu His 70 75 80
GCG TTT TAC CCT AAA ATC CAT AGT TCT TTA GAT CAA AAA ACT TAT TTC 344 Ala Phe Tyr Pro Lys He His Ser Ser Leu Asp Gin Lys Thr Tyr Phe 85 90 95
AAG CAG TTT AAA GCC GTT AAA GAG CGT CTC AAA AAC GGC GAC ACC TAT 392 Lys Gin Phe Lys Ala Val Lys Glu Arg Leu Lys Asn Gly Asp Thr Tyr 100 105 110
CAA GTG AAT CTC ACA ATG GAT TTA TTT TTA GAC ACT AAA GCC AAA CCA 440 Gin Val Asn Leu Thr Met Asp Leu Phe Leu Asp Thr Lys Ala Lys Pro 115 120 125 130
AAG CGC GTT TTT AAG GAG GTG GTA CAC AAC CAA AAC ACG CCT TTT AAG 488 Lys Arg Val Phe Lys Glu Val Val His Asn Gin Asn Thr Pro Phe Lys 135 140 145
GCT TTT ATA GAA AAT GAG TTT GGG AGC GTT TTA AGC TTT TCG CCG GAA 536 Ala Phe He Glu Asn Glu Phe Gly Ser Val Leu Ser Phe Ser Pro Glu 150 155 160 TTG TTT TTT GAA TTA GAG TTT TTA GAT ACA GCG ATT AAG ATT ATT ACA 584 Leu Phe Phe Glu Leu Glu Phe Leu Asp Thr Ala He Lys He He Thr 165 170 175
AAA CCC ATG AAA GGC ACG ATC GCT CGC TCA AAA AAC CCC TTA ATA GAT 632 Lys Pro Met Lys Gly Thr He Ala Arg Ser Lys Asn Pro Leu He Asp 180 185 190
GAA AAA AAC CGA TTG TTT TTG CAA AAT GAT GAC AAA AAC AGA AGC GAA 680 Glu Lys Asn Arg Leu Phe Leu Gin Asn Asp Asp Lys Asn Arg Ser Glu 195 200 205 210
AAT GTG ATG ATT GTG GAT TTA TTG CGT AAC GAT TTG AGC CGC TTG GCC 728 Asn Val Met He Val Asp Leu Leu Arg Asn Asp Leu Ser Arg Leu Ala 215 220 225
TTA AAA AAT AGC GTG AAA GTC AAT CAA TTG TTT GAA ATC ATC AGC TTG 776 Leu Lys Asn Ser Val Lys Val Asn Gin Leu Phe Glu He He Ser Leu 230 235 240
CCT AGC GTG TAT CAA ATG ATA AGC GAG ATT GAA GCG AAA TTG CCC CTA 824 Pro Ser Val Tyr Gin Met He Ser Glu He Glu Ala Lys Leu Pro Leu 245 250 255
AAA ACC AGC TTG TTT GAG ATT TTT AAG GCG TTG TTC CCT TGC GGC TCT 872 Lys Thr Ser Leu Phe Glu He Phe Lys Ala Leu Phe Pro Cys Gly Ser 260 265 270
GTG ACC GGA TGC CCT AAA ATC AAA ACC ATG CAA ATC ATT GAA AGT TTA 920 Val Thr Gly Cys Pro Lys He Lys Thr Met Gin He He Glu Ser Leu 275 280 285 290
GAA AAA CGC CCT AGG GGG GTG TAT TGC GGG GCG ATA GGC ATG GTT GAA 968 Glu Lys Arg Pro Arg Gly Val Tyr Cys Gly Ala He Gly Met Val Glu 295 300 305
GAA AAA AAA GCC CTT TTT AGC GTG CCT ATC CGC ACT TTA GAA AAA AGA 1016 Glu Lys Lys Ala Leu Phe Ser Val Pro He Arg Thr Leu Glu Lys Arg 310 315 320
GTG CAC GAA AAT TTT TTG CAT TTA GGG GTA GGG AGT GGG GTA ACT TAT 1064 Val His Glu Asn Phe Leu His Leu Gly Val Gly Ser Gly Val Thr Tyr 325 330 335
AAA AGT AAA GCG CCA AAA GAA TAT GAA GAA AGC TTT TTG AAA TCC TTT 1112 Lys Ser Lys Ala Pro Lys Glu Tyr Glu Glu Ser Phe Leu Lys Ser Phe 340 345 350
TTT GTG ATG CCC AAA ATA GAA TTT GAG ATT GTA GAA ACG ATG AAA ATT 1160 Phe Val Met Pro Lys He Glu Phe Glu He Val Glu Thr Met Lys He 355 360 365 370
ATC AAA AAG GAT CAA AAA TTA GAG ATT AAT AAT AAA AAC GCC CAT AAA 1208 He Lys Lys Asp Gin Lys Leu Glu He Asn Asn Lys Asn Ala His Lys 375 380 385 GAA CGC TTA ATG AAT AGC ACT CGA TAT TTT AAC TTT AAA TAC GAT GAA 1256 Glu Arg Leu Met Asn Ser Thr Arg Tyr Phe Asn Phe Lys Tyr Asp Glu 390 395 400
AAT CTT TTA GAT TTT GAA TTA GAA AAA GAA GGG GTT TTA AGG GTT TTA 1304 Asn Leu Leu Asp Phe Glu Leu Glu Lys Glu Gly Val Leu Arg Val Leu 405 410 415
CTC AAT AAA AAG GGC AAG CTC ATT AAA GAA TAC AAA ACC TTA GAG CCT 1352 Leu Asn Lys Lys Gly Lys Leu He Lys Glu Tyr Lys Thr Leu Glu Pro 420 425 430
TTA AAA AGC CTA GAA ATC CGT TTG AGT GAA GCC CCC ATT GAT AAA CGC 1400 Leu Lys Ser Leu Glu He Arg Leu Ser Glu Ala Pro He Asp Lys Arg 435 440 445 450
AAT GAT TTT TTA TAC CAT AAG ACC ACT TAT GCC CCT TTT TAT CAA AAG 1448 Asn Asp Phe Leu Tyr His Lys Thr Thr Tyr Ala Pro Phe Tyr Gin Lys 455 460 465
GCT CGA GCG CTC ATT AAA AAG GGC GTT ATG TTT GAT GAA ATC TTT TAT 1496 Ala Arg Ala Leu He Lys Lys Gly Val Met Phe Asp Glu He Phe Tyr 470 475 480
AAC CAG GAT TTG GAA CTC ACT GAG GGC GCT AGG AGC AAT CTT GTT TTA 1544 Asn Gin Asp Leu Glu Leu Thr Glu Gly Ala Arg Ser Asn Leu Val Leu 485 490 495
GAA ATC CAT AAC AGG CTT TTA ACC CCT TAT TTT AGC GCG GGC GCG TTA 1592 Glu He His Asn Arg Leu Leu Thr Pro Tyr Phe Ser Ala Gly Ala Leu 500 505 510
AAC GGG ACG GGT GTT GTG GGG TTG TTA AAA AAG GGT CTT GTT GGG CAT 1640 Asn Gly Thr Gly Val Val Gly Leu Leu Lys Lys Gly Leu Val Gly His 515 520 525 530
GCA CCT TTG AAA TTG CAA GAT TTG CAA AAA GCG TCT AAA ATC TAT TGT 1688 Ala Pro Leu Lys Leu Gin Asp Leu Gin Lys Ala Ser Lys He Tyr Cys 535 540 545
ATT AAC GCG CTA TAT GGC TTA GTG GAA GTG AAA ATA AAA TAACTATAAA AA 1739 He Asn Ala Leu Tyr Gly Leu Val Glu Val Lys He Lys 550 555
CAGAGCGGCT AAAACCTCAT TTTTAGAAAT AGGTTACCCA A 1780
(2) INFORMATION FOR SEQ ID NO: 674:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 559 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674:
Met He Phe Gly Asp Phe Lys Tyr Gin Lys Ser Val Lys Lys Leu Thr
1 5 10 15
Ala Thr Asn Leu Asn Glu Leu Lys Asn Ala Leu Asp Phe He Ser Gin
20 25 30
Asn Arg Gly Asn Gly Tyr Phe Val Gly Tyr Leu Leu Tyr Glu Ala Arg
35 40 45
Leu Ala Phe Leu Asp Glu Asn Phe Gin Ser Gin Thr Pro Phe Leu Tyr
50 55 60
Phe Glu Gin Phe Leu Glu Arg Lys Lys Tyr Ser Leu Glu Pro Leu Lys 65 70 75 80
Glu His Ala Phe Tyr Pro Lys He His Ser Ser Leu Asp Gin Lys Thr
85 90 95
Tyr Phe Lys Gin Phe Lys Ala Val Lys Glu Arg Leu Lys Asn Gly Asp
100 105 110
Thr Tyr Gin Val Asn Leu Thr Met Asp Leu Phe Leu Asp Thr Lys Ala
115 120 125
Lys Pro Lys Arg Val Phe Lys Glu Val Val His Asn Gin Asn Thr Pro
130 135 140
Phe Lys Ala Phe He Glu Asn Glu Phe Gly Ser Val Leu Ser Phe Ser 145 150 155 160
Pro Glu Leu Phe Phe Glu Leu Glu Phe Leu Asp Thr Ala He Lys He
165 170 175
He Thr Lys Pro Met Lys Gly Thr He Ala Arg Ser Lys Asn Pro Leu
180 185 190
He Asp Glu Lys Asn Arg Leu Phe Leu Gin Asn Asp Asp Lys Asn Arg
195 200 205
Ser Glu Asn Val Met He Val Asp Leu Leu Arg Asn Asp Leu Ser Arg
210 215 220
Leu Ala Leu Lys Asn Ser Val Lys Val Asn Gin Leu Phe Glu He He 225 230 235 240
Ser Leu Pro Ser Val Tyr Gin Met He Ser Glu He Glu Ala Lys Leu
245 250 255
Pro Leu Lys Thr Ser Leu Phe Glu He Phe Lys Ala Leu Phe Pro Cys
260 265 270
Gly Ser Val Thr Gly Cys Pro Lys He Lys Thr Met Gin He He Glu
275 280 285
Ser Leu Glu Lys Arg Pro Arg Gly Val Tyr Cys Gly Ala He Gly Met
290 295 300
Val Glu Glu Lys Lys Ala Leu Phe Ser Val Pro He Arg Thr Leu Glu 305 310 315 320
Lys Arg Val His Glu Asn Phe Leu His Leu Gly Val Gly Ser Gly Val
325 330 335
Thr Tyr Lys Ser Lys Ala Pro Lys Glu Tyr Glu Glu Ser Phe Leu Lys
340 345 350
Ser Phe Phe Val Met Pro Lys He Glu Phe Glu He Val Glu Thr Met
355 360 365
Lys He He Lys Lys Asp Gin Lys Leu Glu He Asn Asn Lys Asn Ala
370 375 380
His Lys Glu Arg Leu Met Asn Ser Thr Arg Tyr Phe Asn Phe Lys Tyr 385 390 395 400
Asp Glu Asn Leu Leu Asp Phe Glu Leu Glu Lys Glu Gly Val Leu Arg 405 410 415
Val Leu Leu Asn Lys Lys Gly Lys Leu He Lys Glu Tyr Lys Thr Leu
420 425 430
Glu Pro Leu Lys Ser Leu Glu He Arg Leu Ser Glu Ala Pro He Asp
435 440 445
Lys Arg Asn Asp Phe Leu Tyr His Lys Thr Thr Tyr Ala Pro Phe Tyr
450 455 460
Gin Lys Ala Arg Ala Leu He Lys Lys Gly Val Met Phe Asp Glu He 465 470 475 480
Phe Tyr Asn Gin Asp Leu Glu Leu Thr Glu Gly Ala Arg Ser Asn Leu
485 490 495
Val Leu Glu He His Asn Arg Leu Leu Thr Pro Tyr Phe Ser Ala Gly
500 505 510
Ala Leu Asn Gly Thr Gly Val Val Gly Leu Leu Lys Lys Gly Leu Val
515 520 525
Gly His Ala Pro Leu Lys Leu Gin Asp Leu Gin Lys Ala Ser Lys He
530 535 540
Tyr Cys He Asn Ala Leu Tyr Gly Leu Val Glu Val Lys He Lys 545 550 555
(2) INFORMATION FOR SEQ ID NO: 675:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 958 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...905 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675:
TTAGTGGATA TTTTATACGC TTTTATTGAT CCTAGAATAA GGTTGTCATA ATG GAG 56
Met Glu 1
TCT TTT AGA GAG TTT ATC CAA CAA TTC AAA AAA AAT AAG GCA GCG GTC 104 Ser Phe Arg Glu Phe He Gin Gin Phe Lys Lys Asn Lys Ala Ala Val 5 10 15
GTT GGG GCT TGG ATT GTG CTT TTA TTG GTA ATT TGC GCG ATT TTT GCG 152 Val Gly Ala Trp He Val Leu Leu Leu Val He Cys Ala He Phe Ala 20 25 30
CCC CTT TTA GCC CCG CAT GAT CCT TAT GTC CAA AAC GCG CAA GAT CGC 200 Pro Leu Leu Ala Pro His Asp Pro Tyr Val Gin Asn Ala Gin Asp Arg 35 40 45 50 CTT TTG AAG CCT ATA TGG GAG CAT GGA GGG AAT GCT AAA TAC CTT TTA 248 Leu Leu Lys Pro He Trp Glu His Gly Gly Asn Ala Lys Tyr Leu Leu 55 60 65
GGC ACC GAT GAT TTG GGG CGC GAT ATT TTG AGC CGC TTG ATC TAT GGG 296 Gly Thr Asp Asp Leu Gly Arg Asp He Leu Ser Arg Leu He Tyr Gly 70 75 80
GCC AGG ATT TCT TTA ACC ATA GGG ATT GTT TCT ATG GGG ATT GCG GTG 344 Ala Arg He Ser Leu Thr He Gly He Val Ser Met Gly He Ala Val 85 90 95
TTT TTT GGC ACG ATA CTA GGG CTA ATA GCG GGG TAT TTT GGG GGG AAA 392 Phe Phe Gly Thr He Leu Gly Leu He Ala Gly Tyr Phe Gly Gly Lys 100 105 110
ACA GAT GCA ATT ATC ATG CGT ATC ATG GAC ATC ATG TTC GCT TTG CCC 440 Thr Asp Ala He He Met Arg He Met Asp He Met Phe Ala Leu Pro 115 120 125 130
TCT ATT TTA TTG ATC GTG ATT GTG GTC GCG GTG TTA GGG CCT TCA CTC 488 Ser He Leu Leu He Val He Val Val Ala Val Leu Gly Pro Ser Leu 135 140 145
ACT AAC GCC ATG CTC GCT ATT GGG TTT GTG GGG ATT CCT GGG TTT GCA 536 Thr Asn Ala Met Leu Ala He Gly Phe Val Gly He Pro Gly Phe Ala 150 155 160
AGA TTG GTG CGC AGT TCC GTG CTA GGT GAA AAA GAA AAA GAA TAC GTG 584 Arg Leu Val Arg Ser Ser Val Leu Gly Glu Lys Glu Lys Glu Tyr Val 165 170 175
ATC GCT TCT AAA ATC AAT GGC TCT TCG CAT CTT CGT TTG ATG TGT AAG 632 He Ala Ser Lys He Asn Gly Ser Ser His Leu Arg Leu Met Cys Lys 180 185 190
GTG ATT TTC CCT AAT TGC ATT ATC CCT TTA ATC GTT CAA ACG ACA ATG 680 Val He Phe Pro Asn Cys He He Pro Leu He Val Gin Thr Thr Met 195 200 205 210
GGT TTT GCT TCC ACG GTT TTA GAA GCG GCT GCA CTG AGC TTC TTA GGT 728 Gly Phe Ala Ser Thr Val Leu Glu Ala Ala Ala Leu Ser Phe Leu Gly 215 220 225
CTT GGG GCC CAA CCT CCC AAA CCC GAA TGG GGA GCG ATG TTG ATG AAT 776 Leu Gly Ala Gin Pro Pro Lys Pro Glu Trp Gly Ala Met Leu Met Asn 230 235 240
TCC ATG CAA TAC ATC GCT ACC GCT CCT TGG ATG CTT GTT TTC CCT GGG 824 Ser Met Gin Tyr He Ala Thr Ala Pro Trp Met Leu Val Phe Pro Gly 245 250 255
GTG ATG ATT TTT TTA ACG GTC ATG AGT TTT AAT CTG GTA GGC GAT GGC 872 Val Met He Phe Leu Thr Val Met Ser Phe Asn Leu Val Gly Asp Gly 260 265 270 ATC ATG GAC GCT TTA GAT CCT AAA CGC ACC TCT TAAAAGGAGC TTGCATGATT 925 He Met Asp Ala Leu Asp Pro Lys Arg Thr Ser 275 280 285
TTAGAAGTTA AAGATTTAAA AACTTATTTT TTC 958
(2) INFORMATION FOR SEQ ID NO: 676:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 285 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 676:
Met Glu Ser Phe Arg Glu Phe He Gin Gin Phe Lys Lys Asn Lys Ala
1 5 10 15
Ala Val Val Gly Ala Trp He Val Leu Leu Leu Val He Cys Ala He
20 25 30
Phe Ala Pro Leu Leu Ala Pro His Asp Pro Tyr Val Gin Asn Ala Gin
35 40 45
Asp Arg Leu Leu Lys Pro He Trp Glu His Gly Gly Asn Ala Lys Tyr
50 55 60
Leu Leu Gly Thr Asp Asp Leu Gly Arg Asp He Leu Ser Arg Leu He 65 70 75 80
Tyr Gly Ala Arg He Ser Leu Thr He Gly He Val Ser Met Gly He
85 90 95
Ala Val Phe Phe Gly Thr He Leu Gly Leu He Ala Gly Tyr Phe Gly
100 105 110
Gly Lys Thr Asp Ala He He Met Arg He Met Asp He Met Phe Ala
115 120 125
Leu Pro Ser He Leu Leu He Val He Val Val Ala Val Leu Gly Pro
130 135 140
Ser Leu Thr Asn Ala Met Leu Ala He Gly Phe Val Gly He Pro Gly 145 150 155 160
Phe Ala Arg Leu Val Arg Ser Ser Val Leu Gly Glu Lys Glu Lys Glu
165 170 175
Tyr Val He Ala Ser Lys He Asn Gly Ser Ser His Leu Arg Leu Met
180 185 190
Cys Lys Val He Phe Pro Asn Cys He He Pro Leu He Val Gin Thr
195 200 205
Thr Met Gly Phe Ala Ser Thr Val Leu Glu Ala Ala Ala Leu Ser Phe
210 215 220
Leu Gly Leu Gly Ala Gin Pro Pro Lys Pro Glu Trp Gly Ala Met Leu 225 230 235 240
Met Asn Ser Met Gin Tyr He Ala Thr Ala Pro Trp Met Leu Val Phe
245 250 255
Pro Gly Val Met He Phe Leu Thr Val Met Ser Phe Asn Leu Val Gly
260 265 270
Asp Gly He Met Asp Ala Leu Asp Pro Lys Arg Thr Ser 275 280 285 (2) INFORMATION FOR SEQ ID NO: 677:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 791 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 95...727 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 677:
CCTTGTTAAA AATGTTAGTT GGTGCAAGCT TGCTGACACA CGCCTTAATA GCTAAAGAAG 60 AAAGCGCAGC NCTTCTTGGA CAAAAAATTT GTAT ATG GGA GTC AAT TAC CAA ACA 115
Met Gly Val Asn Tyr Gin Thr 1 5
GGT TCT ATC AAT TTA ATG ACT AAT ATC CAT GAA GTT AGA GAA GTT ACT 163 Gly Ser He Asn Leu Met Thr Asn He His Glu Val Arg Glu Val Thr 10 15 20
AAC TAT CAA ACC GGT TAC ACC AAT ATT ATA ACT AGC GTT AAT AGC GTT 211 Asn Tyr Gin Thr Gly Tyr Thr Asn He He Thr Ser Val Asn Ser Val 25 30 35
AAA AAG CTC ACC AAC ATG GGA TCT AAT GGG ATT GGA TTA GTC ATG GGT 259 Lys Lys Leu Thr Asn Met Gly Ser Asn Gly He Gly Leu Val Met Gly 40 45 50 55
TAT AAC CAC TTT TTC CAT CCG GAT AAA ATC TTG GGC TTG CGC TAT TTC 307 Tyr Asn His Phe Phe His Pro Asp Lys He Leu Gly Leu Arg Tyr Phe 60 65 70
GCT TTT TTA GAT TGG CAA GGC TAT GGC ATG AGA TAC CCT AAA GGC TAT 355 Ala Phe Leu Asp Trp Gin Gly Tyr Gly Met Arg Tyr Pro Lys Gly Tyr 75 80 85
TAT GGC GGC AAT AAC ATG ATC ACT TAT GGC GTG GGC GTG GAT GCA GTG 403 Tyr Gly Gly Asn Asn Met He Thr Tyr Gly Val Gly Val Asp Ala Val 90 95 100
TGG AAT TTC TTT CAA GGG AGT TTC TAT CAA GAT GAC ATT AGC GTG GAT 451 Trp Asn Phe Phe Gin Gly Ser Phe Tyr Gin Asp Asp He Ser Val Asp 105 110 115
ATT GGC GTT TTT GGG GGG ATT GCG ATT GCG GGG AAT AGC TGG TAT ATT 499 He Gly Val Phe Gly Gly He Ala He Ala Gly Asn Ser Trp Tyr He 120 125 130 135
GGC AGT AAA GGG CAG GAA TTG TTA GGT ATC ACT AAC AGC AGC GCG GTT 547 Gly Ser Lys Gly Gin Glu Leu Leu Gly He Thr Asn Ser Ser Ala Val 140 145 150
GAT AAC ACC TCT TTT CAA TTC CTC TTT AAC TTT GGC CTC AAG GCT TTA 595 Asp Asn Thr Ser Phe Gin Phe Leu Phe Asn Phe Gly Leu Lys Ala Leu 155 160 165
TTT GTA GAT GAG CAT GAA TTT GAA ATC GGT TTT AAA TTC CCC ACC ATT 643 Phe Val Asp Glu His Glu Phe Glu He Gly Phe Lys Phe Pro Thr He 170 175 180
AAT AAC AAA TAC TAC ACC ACT GAC GCG CTC AAG GTT CAA ATG CGT AGG 691 Asn Asn Lys Tyr Tyr Thr Thr Asp Ala Leu Lys Val Gin Met Arg Arg 185 190 195
GTC TTT GCC TTT TAT GTG GGG TAT AAT TAC CAC TTC TAAAGGGCTT TTAAAA 743 Val Phe Ala Phe Tyr Val Gly Tyr Asn Tyr His Phe 200 205 210
CCCAACGCAA CTCCCTAACA TCTTTTGGTA ATAGCTCTTG GCTTTGAG 791
(2) INFORMATION FOR SEQ ID NO: 678:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 211 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 678:
Met Gly Val Asn Tyr Gin Thr Gly Ser He Asn Leu Met Thr Asn He
1 5 10 15
His Glu Val Arg Glu Val Thr Asn Tyr Gin Thr Gly Tyr Thr Asn He
20 25 30
He Thr Ser Val Asn Ser Val Lys Lys Leu Thr Asn Met Gly Ser Asn
35 40 45
Gly He Gly Leu Val Met Gly Tyr Asn His Phe Phe His Pro Asp Lys
50 55 60
He Leu Gly Leu Arg Tyr Phe Ala Phe Leu Asp Trp Gin Gly Tyr Gly 65 70 75 80
Met Arg Tyr Pro Lys Gly Tyr Tyr Gly Gly Asn Asn Met He Thr Tyr
85 90 95
Gly Val Gly Val Asp Ala Val Trp Asn Phe Phe Gin Gly Ser Phe Tyr
100 105 110
Gin Asp Asp He Ser Val Asp He Gly Val Phe Gly Gly He Ala He
115 120 125
Ala Gly Asn Ser Trp Tyr He Gly Ser Lys Gly Gin Glu Leu Leu Gly 130 135 140 He Thr Asn Ser Ser Ala Val Asp Asn Thr Ser Phe Gin Phe Leu Phe 145 150 155 160
Asn Phe Gly Leu Lys Ala Leu Phe Val Asp Glu His Glu Phe Glu He
165 170 175
Gly Phe Lys Phe Pro Thr He Asn Asn Lys Tyr Tyr Thr Thr Asp Ala
180 185 190
Leu Lys Val Gin Met Arg Arg Val Phe Ala Phe Tyr Val Gly Tyr Asn
195 200 205
Tyr His Phe 210
(2) INFORMATION FOR SEQ ID NO: 679:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 517 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...464 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:679:
AGATTTCATT CGAGGTAGAA AATACATTGA AAAAGCGTGT GAATTAAACG ATG GTA 56
Met Val 1
GGG GGT GGA ACG GTA AAA AAA GAC TTG AAG AAA GCC ATT CAA TAC TAT 104 Gly Gly Gly Thr Val Lys Lys Asp Leu Lys Lys Ala He Gin Tyr Tyr 5 10 15
GTT AAA GCG TGT GAA TTG AAT GAA ATG TTT GGG TGT CTG TCA TTA GTT 152 Val Lys Ala Cys Glu Leu Asn Glu Met Phe Gly Cys Leu Ser Leu Val 20 25 30
TCG AAC TCT CAA ATA AAC AAA CAA AAA CTC TTT CAA TAT CTC TCT AAA 200 Ser Asn Ser Gin He Asn Lys Gin Lys Leu Phe Gin Tyr Leu Ser Lys 35 40 45 50
GCT TGT GAA TTA AAT AGT GGT AAT GGA TGT AGG TTT TTA GGG GAT TTT 248 Ala Cys Glu Leu Asn Ser Gly Asn Gly Cys Arg Phe Leu Gly Asp Phe 55 60 65
TAT GAG AAT GGA AAA TAT GTA AAA AAG GAT TTA AGA AAA GCT GCT CAA 296 Tyr Glu Asn Gly Lys Tyr Val Lys Lys Asp Leu Arg Lys Ala Ala Gin 70 75 80 TAC TAC TCT AAA GCT TGT GGA TTA AAT GAT CAA GAT GGG TGT TTA ATA 344 Tyr Tyr Ser Lys Ala Cys Gly Leu Asn Asp Gin Asp Gly Cys Leu He 85 90 95
CTA GGA TAT AAG CAA TAT GCT GGC AAG GGC GTA GTC AAA AAT GAA AAA 392 Leu Gly Tyr Lys Gin Tyr Ala Gly Lys Gly Val Val Lys Asn Glu Lys 100 105 110
CAA GCG GTG AAA ACC TTT GAA AAG GCT TGT AGG TTA GGA TCT GAA GAC 440 Gin Ala Val Lys Thr Phe Glu Lys Ala Cys Arg Leu Gly Ser Glu Asp 115 120 125 130
GCA TGT GGT ATT TTA AAC AAC TAC TAGATTTGAA ATAAATGCTG TTTTTTAGCT 494 Ala Cys Gly He Leu Asn Asn Tyr 135
GGCTTTCATG TTTTTGTAAC CCC 517
(2) INFORMATION FOR SEQ ID NO: 680:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 138 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 680:
Met Val Gly Gly Gly Thr Val Lys Lys Asp Leu Lys Lys Ala He Gin
1 5 10 15
Tyr Tyr Val Lys Ala Cys Glu Leu Asn Glu Met Phe Gly Cys Leu Ser
20 25 30
Leu Val Ser Asn Ser Gin He Asn Lys Gin Lys Leu Phe Gin Tyr Leu
35 40 45
Ser Lys Ala Cys Glu Leu Asn Ser Gly Asn Gly Cys Arg Phe Leu Gly
50 55 60
Asp Phe Tyr Glu Asn Gly Lys Tyr Val Lys Lys Asp Leu Arg Lys Ala 65 70 75 80
Ala Gin Tyr Tyr Ser Lys Ala Cys Gly Leu Asn Asp Gin Asp Gly Cys
85 90 95
Leu He Leu Gly Tyr Lys Gin Tyr Ala Gly Lys Gly Val Val Lys Asn
100 105 110
Glu Lys Gin Ala Val Lys Thr Phe Glu Lys Ala Cys Arg Leu Gly Ser
115 120 125
Glu Asp Ala Cys Gly He Leu Asn Asn Tyr 130 135
(2) INFORMATION FOR SEQ ID NO: 681:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 451 base pairs
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...398 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681:
GTGGAACGCT CTGTCTTAGC AAATTGATCT TAGCGGCGTC GTTTTTGATA GTG GAT 56
Val Asp 1
TCA GAG GGG TTT TCG CCT TCT ATT TAT ACC GAC AAG ACA GGG CAT CCC 104 Ser Glu Gly Phe Ser Pro Ser He Tyr Thr Asp Lys Thr Gly His Pro 5 10 15
ACG ATT GGC TAT GGC TAT AAT TTG AGC GTT TAT TCT TAT GAG GGT AAG 152 Thr He Gly Tyr Gly Tyr Asn Leu Ser Val Tyr Ser Tyr Glu Gly Lys 20 25 30
CGT ATC ACC AAA ACA TAT GGG CTT TTA ACT GAC ATA CTC TCT TAT GGG 200 Arg He Thr Lys Thr Tyr Gly Leu Leu Thr Asp He Leu Ser Tyr Gly 35 40 45 50
TGG TAT AAA AAT TTG GAC GCA ATG AGG AGA ATG GTC ATC TTG GAT TTG 248 Trp Tyr Lys Asn Leu Asp Ala Met Arg Arg Met Val He Leu Asp Leu 55 60 65
AGC TAC AAT TTA GGC TTG AAC GGA CTG CTC AAA TTC AAG CAA TTC ATC 296 Ser Tyr Asn Leu Gly Leu Asn Gly Leu Leu Lys Phe Lys Gin Phe He 70 75 80
AAG GCC ATA GAG GAT AAA AAT TAT GCT TTG GCT GTG GAG AGA CTG CAA 344 Lys Ala He Glu Asp Lys Asn Tyr Ala Leu Ala Val Glu Arg Leu Gin 85 90 95
AAA AGC CCG TAT TTC AAT CAA GTG AAA AAA GAG CGT CAA GGA ATA TGG 392 Lys Ser Pro Tyr Phe Asn Gin Val Lys Lys Glu Arg Gin Gly He Trp 100 105 110
AAA TTT TGAAATTGGA GGGTTGCGAA AAACATTGTA AGAAAAAATA CGCAATAGAA AA 450
Lys Phe
115
G 451
(2) INFORMATION FOR SEQ ID NO:682: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 116 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 682:
Val Asp Ser Glu Gly Phe Ser Pro Ser He Tyr Thr Asp Lys Thr Gly
1 5 10 15
His Pro Thr He Gly Tyr Gly Tyr Asn Leu Ser Val Tyr Ser Tyr Glu
20 25 30
Gly Lys Arg He Thr Lys Thr Tyr Gly Leu Leu Thr Asp He Leu Ser
35 40 45
Tyr Gly Trp Tyr Lys Asn Leu Asp Ala Met Arg Arg Met Val He Leu
50 55 60
Asp Leu Ser Tyr Asn Leu Gly Leu Asn Gly Leu Leu Lys Phe Lys Gin 65 70 75 80
Phe He Lys Ala He Glu Asp Lys Asn Tyr Ala Leu Ala Val Glu Arg
85 90 95
Leu Gin Lys Ser Pro Tyr Phe Asn Gin Val Lys Lys Glu Arg Gin Gly
100 105 110
He Trp Lys Phe 115
(2) INFORMATION FOR SEQ ID NO: 683:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 399 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 50...346 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 683:
ACACAACCAT AGCGACCAAA AACCCGACAG ACCCGCCAAG GAGCGATAA AAC CCA CAA 58
Asn Pro Gin
1
AGC TTT TTC AGG AAT ACG CCA CAT TTT TTG ATC AGC CGA ATT TTT ATC 106 Ser Phe Phe Arg Asn Thr Pro His Phe Leu He Ser Arg He Phe He 5 10 15 CAC CAG CAT GAG AAT AAA AAT CAG AAT ATT GAC TAT GAG CAA CCA AAC 154 His Gin His Glu Asn Lys Asn Gin Asn He Asp Tyr Glu Gin Pro Asn 20 25 30 35
GAT AGA AGC AAA TTC CAC GCT CAC CCT TTC AAG AGC GTT TTA ACA ACC 202 Asp Arg Ser Lys Phe His Ala His Pro Phe Lys Ser Val Leu Thr Thr 40 45 50
CAA ACG CTA CCA CTT GGT TTT TTA GAG AGA GAA AGA GAG AGA AAG CAA 250 Gin Thr Leu Pro Leu Gly Phe Leu Glu Arg Glu Arg Glu Arg Lys Gin 55 60 65
AAT TTT AAG ATT GAT TCT CAA ATC TAT TCC TTT GCA AAA GTT AAG ATT 298 Asn Phe Lys He Asp Ser Gin He Tyr Ser Phe Ala Lys Val Lys He 70 75 80
GGG TGT TTT AAC ATG ATT TTT GGC CTG CTC GCA TCA AGC CCT TAT TTT T 347 Gly Cys Phe Asn Met He Phe Gly Leu Leu Ala Ser Ser Pro Tyr Phe 85 90 95
AACATTTCCG CTCCCTTGCT TTTTTAAAGC CTCCCTAAAT TACTACACCA CT 399
(2) INFORMATION FOR SEQ ID NO: 684:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 99 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684:
Asn Pro Gin Ser Phe Phe Arg Asn Thr Pro His Phe Leu He Ser Arg
1 5 10 15
He Phe He His Gin His Glu Asn Lys Asn Gin Asn He Asp Tyr Glu
20 25 30
Gin Pro Asn Asp Arg Ser Lys Phe His Ala His Pro Phe Lys Ser Val
35 40 45
Leu Thr Thr Gin Thr Leu Pro Leu Gly Phe Leu Glu Arg Glu Arg Glu
50 55 60
Arg Lys Gin Asn Phe Lys He Asp Ser Gin He Tyr Ser Phe Ala Lys 65 70 75 80
Val Lys He Gly Cys Phe Asn Met He Phe Gly Leu Leu Ala Ser Ser
85 90 95
Pro Tyr Phe
(2) INFORMATION FOR SEQ ID NO: 685:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 522 base pairs
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...470 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685:
AAAGACAACC CTAACAAACG CTTCAAAAAC AACAAGAGGG ATAAAAAATA ATG TCT 56
Met Ser 1
TAT TTT TTT AAA ATC ATT CTG GGC ACA AGC GTG ATC GTG GGG GTG TTG 104 Tyr Phe Phe Lys He He Leu Gly Thr Ser Val He Val Gly Val Leu 5 10 15
TTG GGC TTG TGG CGT TTG ACT TAC GAT AAG TTC TAT TTC TCG TTG GTC 152 Leu Gly Leu Trp Arg Leu Thr Tyr Asp Lys Phe Tyr Phe Ser Leu Val 20 25 30
TTT GTG TTG CTG ATA CTA GGG ATT GTC GCT TGT AGC TAT ATT TCT TTA 200 Phe Val Leu Leu He Leu Gly He Val Ala Cys Ser Tyr He Ser Leu 35 40 45 50
AAA ATG CAT CAG AGG AAA TGC TTC GCC AAG TGT TTC GTG AAT AGT GAA 248 Lys Met His Gin Arg Lys Cys Phe Ala Lys Cys Phe Val Asn Ser Glu 55 60 65
TCT TTT TTA TCC AAG ATG TTA CAC TCC CCA ATA ATG GTA ATT TGC TTT 296 Ser Phe Leu Ser Lys Met Leu His Ser Pro He Met Val He Cys Phe 70 75 80
TAT TTT ATT TTT TCA ATT TTC ACA TCC ATA TCC ATC GTC TAT AGC GTG 344 Tyr Phe He Phe Ser He Phe Thr Ser He Ser He Val Tyr Ser Val 85 90 95
CTG GAC TAT GAT CAG ATG ATG TGG GGG TTT GTT TTT TGC ACT ATC GTT 392 Leu Asp Tyr Asp Gin Met Met Trp Gly Phe Val Phe Cys Thr He Val 100 105 110
GTT TGC GCT GTG GTG TTT GGC ACG CTT GAA AAA AAT GCT CAA GAG TAC 440 Val Cys Ala Val Val Phe Gly Thr Leu Glu Lys Asn Ala Gin Glu Tyr 115 120 125 130
CAT CAA AGA AGA TTA TTT GAT GCT GAT GTC TAGAGAAGTG AGTGCTTTTG TGG 493 His Gin Arg Arg Leu Phe Asp Ala Asp Val 135 140 GGACTCTTTT CTTCATTGGC TTGAGTTGC 522
(2) TNFORMATION FOR SEQ ID NO: 686:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 140 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 686:
Met Ser Tyr Phe Phe Lys He He Leu Gly Thr Ser Val He Val Gly
1 5 10 15
Val Leu Leu Gly Leu Trp Arg Leu Thr Tyr Asp Lys Phe Tyr Phe Ser
20 25 30
Leu Val Phe Val Leu Leu He Leu Gly He Val Ala Cys Ser Tyr He
35 40 45
Ser Leu Lys Met His Gin Arg Lys Cys Phe Ala Lys Cys Phe Val Asn
50 55 60
Ser Glu Ser Phe Leu Ser Lys Met Leu His Ser Pro He Met Val He 65 70 75 80
Cys Phe Tyr Phe He Phe Ser He Phe Thr Ser He Ser He Val Tyr
85 90 95
Ser Val Leu Asp Tyr Asp Gin Met Met Trp Gly Phe Val Phe Cys Thr
100 105 110
He Val Val Cys Ala Val Val Phe Gly Thr Leu Glu Lys Asn Ala Gin
115 120 125
Glu Tyr His Gin Arg Arg Leu Phe Asp Ala Asp Val 130 135 140
(2) INFORMATION FOR SEQ ID NO: 687:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 976 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...923 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687: CCCACTGAAG CGTATTCTAA CACCCTTTTA GAATTAGCTA AAAAAGATGA AAA AAT 56 Lys Asn
1
CGT AGG CGT AAC CGC GGC GAT GCT AGC GGC ACA GGA TTA GAC AAA CTC 104 Arg Arg Arg Asn Arg Gly Asp Ala Ser Gly Thr Gly Leu Asp Lys Leu 5 10 15
ATT GAC GCT TAN CCT TTG CGC TTT TTT GAT GTC GCT ATC GCT GAG CAA 152 He Asp Ala Xaa Pro Leu Arg Phe Phe Asp Val Ala He Ala Glu Gin 20 25 30
CAC GCT TTA ACT TCT AGC AGC GCT ATG GCT AAA GAG GGG TTT AAA CCT 200 His Ala Leu Thr Ser Ser Ser Ala Met Ala Lys Glu Gly Phe Lys Pro 35 40 45 50
TTT GTG AGC ATC TAT TCT ACT TTT TTG CAG AGG GCT TAT GAT TCT ATT 248 Phe Val Ser He Tyr Ser Thr Phe Leu Gin Arg Ala Tyr Asp Ser He 55 60 65
GTG CAT GAC GCT TGT ATT TCT AGC TTG CCG ATT AAA TTA GCC ATT GAC 296 Val His Asp Ala Cys He Ser Ser Leu Pro He Lys Leu Ala He Asp 70 75 80
AGG GCT GGG ATT GTG GGC GAA GAT GGC GAG ACG CAC CAA GGG CTT TTA 344 Arg Ala Gly He Val Gly Glu Asp Gly Glu Thr His Gin Gly Leu Leu 85 90 95
GAC GTG TCG TAT TTG CGC TCT ATC CCT AAC ATG GTC ATT TTT GCC CCA 392 Asp Val Ser Tyr Leu Arg Ser He Pro Asn Met Val He Phe Ala Pro 100 105 110
CGA GAC AAT GAG ACT TTA AAA AAC GCC GTG CGT TTT GCC AAT GAA CAC 440 Arg Asp Asn Glu Thr Leu Lys Asn Ala Val Arg Phe Ala Asn Glu His 115 120 125 130
GAT TCA AGC CCT TGC GCG TTC CGA TAC CCT AGG GGG TCG TTT GCG TTA 488 Asp Ser Ser Pro Cys Ala Phe Arg Tyr Pro Arg Gly Ser Phe Ala Leu 135 140 145
AAA GAG GGG GTT TTT GAG CCT AGC GGT TTT GTT TTA GGC CAA AGC GAA 536 Lys Glu Gly Val Phe Glu Pro Ser Gly Phe Val Leu Gly Gin Ser Glu 150 155 160
TTG TTG AAA AAA GAG GGC GAA ATT TTA CTC ATA GGC TAT GGT AAT GGC 584 Leu Leu Lys Lys Glu Gly Glu He Leu Leu He Gly Tyr Gly Asn Gly 165 170 175
GTG GGG CGG GCG CAT TTA GTC CAA CTG GCT TTA AAA GAA AAA AAC ATA 632 Val Gly Arg Ala His Leu Val Gin Leu Ala Leu Lys Glu Lys Asn He 180 185 190
GAA TGC GCT CTC TTG GAT CTC AGG TTT TTA AAG CCT TTA GAT CCA AAT 680 Glu Cys Ala Leu Leu Asp Leu Arg Phe Leu Lys Pro Leu Asp Pro Asn 195 200 205 210 TTA AGC GCG ATC GTT GCC CCT TAT CAA AAG CTC TAT GTT TTT AGC GAT 728 Leu Ser Ala He Val Ala Pro Tyr Gin Lys Leu Tyr Val Phe Ser Asp 215 220 225
AAT TAC AAG CTT GGA GGG GTG GCT AGC GCG ATT TTA GAG TTT TTG AGC 776 Asn Tyr Lys Leu Gly Gly Val Ala Ser Ala He Leu Glu Phe Leu Ser 230 235 240
GAA CAA AAT ATT TTA AAG CCT GTT AAA AGC TTT GAA ATC ATT GAT GAA 824 Glu Gin Asn He Leu Lys Pro Val Lys Ser Phe Glu He He Asp Glu 245 250 255
TTT ATC ATG CAT GGG AAC ACC GCT TTA GTG GAA AAA TCC TTA GGA TTA 872 Phe He Met His Gly Asn Thr Ala Leu Val Glu Lys Ser Leu Gly Leu 260 265 270
GAC ACA GAG AGT TTG ACT GAC GCT ATT TTA AAA GAT TTA GGA CAA GAG 920 Asp Thr Glu Ser Leu Thr Asp Ala He Leu Lys Asp Leu Gly Gin Glu 275 280 285 290
AGA TGAAAACAAA AGCGCCAATG AAAAATATCC GCAATTTTTC CATTATCGCT CAC 976 Arg
(2) INFORMATION FOR SEQ ID NO: 688:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 291 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 688:
Lys Asn Arg Arg Arg Asn Arg Gly Asp Ala Ser Gly Thr Gly Leu Asp
1 5 10 15
Lys Leu He Asp Ala Xaa Pro Leu Arg Phe Phe Asp Val Ala He Ala
20 25 30
Glu Gin His Ala Leu Thr Ser Ser Ser Ala Met Ala Lys Glu Gly Phe
35 40 45
Lys Pro Phe Val Ser He Tyr Ser Thr Phe Leu Gin Arg Ala Tyr Asp
50 55 60
Ser He Val His Asp Ala Cys He Ser Ser Leu Pro He Lys Leu Ala 65 70 75 80
He Asp Arg Ala Gly He Val Gly Glu Asp Gly Glu Thr His Gin Gly
85 90 95
Leu Leu Asp Val Ser Tyr Leu Arg Ser He Pro Asn Met Val He Phe
100 105 110
Ala Pro Arg Asp Asn Glu Thr Leu Lys Asn Ala Val Arg Phe Ala Asn
115 120 125
Glu His Asp Ser Ser Pro Cys Ala Phe Arg Tyr Pro Arg Gly Ser Phe 130 135 140
Ala Leu Lys Glu Gly Val Phe Glu Pro Ser Gly Phe Val Leu Gly Gin 145 150 155 160
Ser Glu Leu Leu Lys Lys Glu Gly Glu He Leu Leu He Gly Tyr Gly
165 170 175
Asn Gly Val Gly Arg Ala His Leu Val Gin Leu Ala Leu Lys Glu Lys
180 185 190
Asn He Glu Cys Ala Leu Leu Asp Leu Arg Phe Leu Lys Pro Leu Asp
195 200 205
Pro Asn Leu Ser Ala He Val Ala Pro Tyr Gin Lys Leu Tyr Val Phe
210 215 220
Ser Asp Asn Tyr Lys Leu Gly Gly Val Ala Ser Ala He Leu Glu Phe 225 230 235 240
Leu Ser Glu Gin Asn He Leu Lys Pro Val Lys Ser Phe Glu He He
245 250 255
Asp Glu Phe He Met His Gly Asn Thr Ala Leu Val Glu Lys Ser Leu
260 265 270
Gly Leu Asp Thr Glu Ser Leu Thr Asp Ala He Leu Lys Asp Leu Gly
275 280 285
Gin Glu Arg 290
(2) INFORMATION FOR SEQ ID NO: 689:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1135 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1082 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 689:
GGATTAAGAT GTTATAATAG TTGTTATTTT TTCATTTTAA AAGGGGTTTT ATG GCA 56
Met Ala 1
TTA TTA TTC ACA GGA GCG TGC GGG TAT ATA GGC TCG CAT ACC GCA AGG 104 Leu Leu Phe Thr Gly Ala Cys Gly Tyr He Gly Ser His Thr Ala Arg 5 10 15
GCG TTT TTA GAA AAA ACC AAA GAA AAT ATC ATT ATT GTA GAT GAC TTA 152 Ala Phe Leu Glu Lys Thr Lys Glu Asn He He He Val Asp Asp Leu 20 25 30
AGC ACC GGT TTT TTA GAG CAC CTC AAA GCG TTA GAG CAT TAT TAC CCT 200 Ser Thr Gly Phe Leu Glu His Leu Lys Ala Leu Glu His Tyr Tyr Pro 35 40 45 50
AAT AGG GTT GTG TTT ATT CAA GCG AAT TTG AAT GAA ACG CAC AAA TTA 248 Asn Arg Val Val Phe He Gin Ala Asn Leu Asn Glu Thr His Lys Leu 55 60 65
GAC GCC TTT TTG AAT AAG CAG CAG CTA AAA GAT CCC ATT GAA GCC ATC 296 Asp Ala Phe Leu Asn Lys Gin Gin Leu Lys Asp Pro He Glu Ala He 70 75 80
TTG CAC TTT GGG GCT AAA ATC TCA GTA GAA GAA TCC ACG CAC TTG CCT 344 Leu His Phe Gly Ala Lys He Ser Val Glu Glu Ser Thr His Leu Pro 85 90 95
TTA GAA TAC TAC ACC AAC AAC ACG CTC AAC ACT TTA GAG CTT GTC AAA 392 Leu Glu Tyr Tyr Thr Asn Asn Thr Leu Asn Thr Leu Glu Leu Val Lys 100 105 110
CTT TGC TTA AAA CAT GCA ATC AAG CGT TTT ATT TTT TCT TCT ACG GCC 440 Leu Cys Leu Lys His Ala He Lys Arg Phe He Phe Ser Ser Thr Ala 115 120 125 130
GTG GTT TAT GGC GAA TCT AGT TCA AGT TTG AAT GAA GAA AGC CCC TTA 488 Val Val Tyr Gly Glu Ser Ser Ser Ser Leu Asn Glu Glu Ser Pro Leu 135 140 145
AAC CCC ATT AAT CCT TAT GGA GCG TCT AAA ATG ATG AGC GAA AGA ATC 536 Asn Pro He Asn Pro Tyr Gly Ala Ser Lys Met Met Ser Glu Arg He 150 155 160
TTG TTA GAC ACT TCT AAA ATA GCG GAT TTT AAA TGC GTT ATT TTG CGC 584 Leu Leu Asp Thr Ser Lys He Ala Asp Phe Lys Cys Val He Leu Arg 165 170 175
TAT TTC AAT GTG GCT GGG GCA TGC ATG CAC AAT GAT TAT ACC ACC CCT 632 Tyr Phe Asn Val Ala Gly Ala Cys Met His Asn Asp Tyr Thr Thr Pro 180 185 190
TAC ACG CTA GGG CAA CGC ACG CTC AAC GCC ACG CAT TTG ATC AAA ATC 680 Tyr Thr Leu Gly Gin Arg Thr Leu Asn Ala Thr His Leu He Lys He 195 200 205 210
GCA TGC GAA TGC GCG GTG GGG AAA AGG AAA AAA ATG GGG ATT TTT GGC 728 Ala Cys Glu Cys Ala Val Gly Lys Arg Lys Lys Met Gly He Phe Gly 215 220 225
ACT AAC TAC CCC ACA AGA GAT GGC ACT TGC ATT AGG GAT TAT ATC CAT 776 Thr Asn Tyr Pro Thr Arg Asp Gly Thr Cys He Arg Asp Tyr He His 230 235 240
GTA GAT GAT TTG GCT AAC GCA CAT TTA GCG AGC TAT CAA ACC CTT TTA 824 Val Asp Asp Leu Ala Asn Ala His Leu Ala Ser Tyr Gin Thr Leu Leu 245 250 255 GAA AAA AAT AAG AGC GAG ATC TAT AAT GTC GGC TAC AAT CAA GGC CAT 872 Glu Lys Asn Lys Ser Glu He Tyr Asn Val Gly Tyr Asn Gin Gly His 260 265 270
AGC GTG AAA GAA GTG ATA GAA AAG GTC AAA GAA ATC TCA AAC AAC GAT 920 Ser Val Lys Glu Val He Glu Lys Val Lys Glu He Ser Asn Asn Asp 275 280 285 290
TTT TTA GTG GAA ATT TTA GAC AAA CGA CAG GGC GAT CCA GCA AGC CTT 968 Phe Leu Val Glu He Leu Asp Lys Arg Gin Gly Asp Pro Ala Ser Leu
295 300 305
)
ATT GCC AAT AAC GCT AAA ATC TTA CAA AAC ACC TCT TTC AAA CCC CTT 1016 He Ala Asn Asn Ala Lys He Leu Gin Asn Thr Ser Phe Lys Pro Leu 310 315 320
TAT AAC AAC CTA GAC ACC ATT ATC AAA AGC GCT CTA GAT TGG GAA GAA 1064 Tyr Asn Asn Leu Asp Thr He He Lys Ser Ala Leu Asp Trp Glu Glu 325 330 335
CAC CTT TTG AGG TTT CAA TAATACACCC TGTGCAAATA CAAGCCATTA GCCATTAT 1120 His Leu Leu Arg Phe Gin 340
GGGCGTTCTT ATAGT 1135
(2) INFORMATION FOR SEQ ID NO: 690:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 344 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 690:
Met Ala Leu Leu Phe Thr Gly Ala Cys Gly Tyr He Gly Ser His Thr
1 5 10 15
Ala Arg Ala Phe Leu Glu Lys Thr Lys Glu Asn He He He Val Asp
20 25 30
Asp Leu Ser Thr Gly Phe Leu Glu His Leu Lys Ala Leu Glu His Tyr
35 40 45
Tyr Pro Asn Arg Val Val Phe He Gin Ala Asn Leu Asn Glu Thr His
50 55 60
Lys Leu Asp Ala Phe Leu Asn Lys Gin Gin Leu Lys Asp Pro He Glu 65 70 75 80
Ala He Leu His Phe Gly Ala Lys He Ser Val Glu Glu Ser Thr His
85 90 95
Leu Pro Leu Glu Tyr Tyr Thr Asn Asn Thr Leu Asn Thr Leu Glu Leu
100 105 110
Val Lys Leu Cys Leu Lys His Ala He Lys Arg Phe He Phe Ser Ser 115 120 125 Thr Ala Val Val Tyr Gly Glu Ser Ser Ser Ser Leu Asn Glu Glu Ser
130 135 140
Pro Leu Asn Pro He Asn Pro Tyr Gly Ala Ser Lys Met Met Ser Glu 145 150 155 160
Arg He Leu Leu Asp Thr Ser Lys He Ala Asp Phe Lys Cys Val He
165 170 175
Leu Arg Tyr Phe Asn Val Ala Gly Ala Cys Met His Asn Asp Tyr Thr
180 185 190
Thr Pro Tyr Thr Leu Gly Gin Arg Thr Leu Asn Ala Thr His Leu He
195 200 205
Lys He Ala Cys Glu Cys Ala Val Gly Lys Arg Lys Lys Met Gly He
210 215 220
Phe Gly Thr Asn Tyr Pro Thr Arg Asp Gly Thr Cys He Arg Asp Tyr 225 230 235 240
He His Val Asp Asp Leu Ala Asn Ala His Leu Ala Ser Tyr Gin Thr
245 250 255
Leu Leu Glu Lys Asn Lys Ser Glu He Tyr Asn Val Gly Tyr Asn Gin
260 265 270
Gly His Ser Val Lys Glu Val He Glu Lys Val Lys Glu He Ser Asn
275 280 285
Asn Asp Phe Leu Val Glu He Leu Asp Lys Arg Gin Gly Asp Pro Ala
290 295 300
Ser Leu He Ala Asn Asn Ala Lys He Leu Gin Asn Thr Ser Phe Lys 305 310 315 320
Pro Leu Tyr Asn Asn Leu Asp Thr He He Lys Ser Ala Leu Asp Trp
325 330 335
Glu Glu His Leu Leu Arg Phe Gin 340
(2) INFORMATION FOR SEQ ID NO: 691:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1170 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 97...1119 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691:
GGAGTTTTTT CTGGCTTTTA GGGGGTTTTA AATTCTTTTA AGGTATTCTA ACAAGACTAT 60 ATCATTGAGA TAGTTTTAAG GAAATTAAGG AACAAA ATG GAA GTT TCA CGC AAG 114
Met Glu Val Ser Arg Lys 1 5
AAA ATT TAC AAC CCC AAT TCT ACA GAA AGT GTG AAT GAA AGA AAG ATT 162 Lys He Tyr Asn Pro Asn Ser Thr Glu Ser Val Asn Glu Arg Lys He 10 15 20
TTT GGG GGC AAT CCT ACA AGC ATG TTT GAT TTG AAT AAG ATC AAG TAT 210 Phe Gly Gly Asn Pro Thr Ser Met Phe Asp Leu Asn Lys He Lys Tyr 25 30 35
CAA TGG GCG GAT CAT TTG TGG AAA ACG ATG CTC GCT AAC ACC TGG TTT 258 Gin Trp Ala Asp His Leu Trp Lys Thr Met Leu Ala Asn Thr Trp Phe 40 45 50
GCT GAA GAA GTG AGC ATG AAT GAT GAC AAA AGG GAT TAT TTG AAA TTA 306 Ala Glu Glu Val Ser Met Asn Asp Asp Lys Arg Asp Tyr Leu Lys Leu 55 60 65 70
AGC GCA GAG GAA AAG ATC GGT TAT GAC AGA GCT TTA GCG CAA CTC ATT 354 Ser Ala Glu Glu Lys He Gly Tyr Asp Arg Ala Leu Ala Gin Leu He 75 80 85
TTT ATG GAC AGC TTG CAA GCG AAT AAT TTA ATT GAC AAT ATC AAT CCC 402 Phe Met Asp Ser Leu Gin Ala Asn Asn Leu He Asp Asn He Asn Pro 90 95 100
TTC ATC ACC AGC CCC GAA ATC AAT TTG TGT TTG GTG CGT CAA GCT TAT 450 Phe He Thr Ser Pro Glu He Asn Leu Cys Leu Val Arg Gin Ala Tyr 105 110 115
GAA GAA GCC CTA CAC AGC CAT GCG TAT GCG GTG ATG GTA GAA AGC ATA 498 Glu Glu Ala Leu His Ser His Ala Tyr Ala Val Met Val Glu Ser He 120 125 130
AGT GCG AAT ACT GAA GAG ATT TAT GAC ATG TGG CGT AAC GAT ATG CAA 546 Ser Ala Asn Thr Glu Glu He Tyr Asp Met Trp Arg Asn Asp Met Gin 135 140 145 150
TTA AAA AGC AAG AAC GAC TAT ATC GCG CAA GTG TAT ATG GAA TTA GCC 594 Leu Lys Ser Lys Asn Asp Tyr He Ala Gin Val Tyr Met Glu Leu Ala 155 160 165
AAA AAC CCC ACA GAA GAA AAC ATT CTC AAA GCG CTT TTT GCT AAC CAG 642 Lys Asn Pro Thr Glu Glu Asn He Leu Lys Ala Leu Phe Ala Asn Gin 170 175 180
ATT TTA GAG GGG ATT TAT TTT TAT AGC GGG TTT AGC TAT TTT TAC ACT 690 He Leu Glu Gly He Tyr Phe Tyr Ser Gly Phe Ser Tyr Phe Tyr Thr 185 190 195
TTG GCT AGG AGC GGT AAA ATG CTA GGA TCG GCA CAA ATG ATT CGT TTT 738 Leu Ala Arg Ser Gly Lys Met Leu Gly Ser Ala Gin Met He Arg Phe 200 205 210
ATC CAA AGA GAT GAG GTA ACG CAT TTG ATT TTG TTC CAA AAC ATG ATC 786 He Gin Arg Asp Glu Val Thr His Leu He Leu Phe Gin Asn Met He 215 220 225 230 AAC GCT TTA AGG AAT GAA AGA GCG GAT CTC TTC ACG CCG CAA TTG ATT 834 Asn Ala Leu Arg Asn Glu Arg Ala Asp Leu Phe Thr Pro Gin Leu He 235 240 245
AAT GAA GTC ATA GGA ATG TTT AAA AAA GCG GTA GAA ATT GAA GCT TTG 882 Asn Glu Val He Gly Met Phe Lys Lys Ala Val Glu He Glu Ala Leu 250 255 260
TGG GGG GAT TAT ATC ACG CAA GGC AAG ATT TTA GGG CTC ACT TCA AGC 930 Trp Gly Asp Tyr He Thr Gin Gly Lys He Leu Gly Leu Thr Ser Ser 265 270 275
TTG ATT GAG CAA TAC ATC CAG TTT TTA GCG GAT AGC CGT TTG AGT AAG 978 Leu He Glu Gin Tyr He Gin Phe Leu Ala Asp Ser Arg Leu Ser Lys 280 285 290
GTG GGC ATC GCT AAA GTT TAT GGC GTC CAA CAC CCC ATT AAA TGG GTA 1026 Val Gly He Ala Lys Val Tyr Gly Val Gin His Pro He Lys Trp Val 295 300 305 310
GAG AGC TTT TCA AGT TTC AAT GAG CAG CGC TCT AAT TTC TTT GAG GCT 1074 Glu Ser Phe Ser Ser Phe Asn Glu Gin Arg Ser Asn Phe Phe Glu Ala 315 320 325
AGG GTG AGC AAT TAC GCT AAA GGG AGC GTG AGT TTT GAT GAT TTT TAAGG 1124 Arg Val Ser Asn Tyr Ala Lys Gly Ser Val Ser Phe Asp Asp Phe 330 335 340
GGCTTGTTTG AATAGTATTA AAAACCATTT AATGTGTGAA GAAATC 1170
(2) INFORMATION FOR SEQ ID NO: 692:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 341 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 692:
Met Glu Val Ser Arg Lys Lys He Tyr Asn Pro Asn Ser Thr Glu Ser
1 5 10 15
Val Asn Glu Arg Lys He Phe Gly Gly Asn Pro Thr Ser Met Phe Asp
20 25 30
Leu Asn Lys He Lys Tyr Gin Trp Ala Asp His Leu Trp Lys Thr Met
35 40 45
Leu Ala Asn Thr Trp Phe Ala Glu Glu Val Ser Met Asn Asp Asp Lys
50 55 60
Arg Asp Tyr Leu Lys Leu Ser Ala Glu Glu Lys He Gly Tyr Asp Arg 65 70 75 80
Ala Leu Ala Gin Leu He Phe Met Asp Ser Leu Gin Ala Asn Asn Leu 85 90 95 He Asp Asn He Asn Pro Phe He Thr Ser Pro Glu He Asn Leu Cys
100 105 110
Leu Val Arg Gin Ala Tyr Glu Glu Ala Leu His Ser His Ala Tyr Ala
115 120 125
Val Met Val Glu Ser He Ser Ala Asn Thr Glu Glu He Tyr Asp Met
130 135 140
Trp Arg Asn Asp Met Gin Leu Lys Ser Lys Asn Asp Tyr He Ala Gin 145 150 155 160
Val Tyr Met Glu Leu Ala Lys Asn Pro Thr Glu Glu Asn He Leu Lys
165 170 175
Ala Leu Phe Ala Asn Gin He Leu Glu Gly He Tyr Phe Tyr Ser Gly
180 185 190
Phe Ser Tyr Phe Tyr Thr Leu Ala Arg Ser Gly Lys Met Leu Gly Ser
195 200 205
Ala Gin Met He Arg Phe He Gin Arg Asp Glu Val Thr His Leu He
210 215 220
Leu Phe Gin Asn Met He Asn Ala Leu Arg Asn Glu Arg Ala Asp Leu 225 230 235 240
Phe Thr Pro Gin Leu He Asn Glu Val He Gly Met Phe Lys Lys Ala
245 250 255
Val Glu He Glu Ala Leu Trp Gly Asp Tyr He Thr Gin Gly Lys He
260 265 270
Leu Gly Leu Thr Ser Ser Leu He Glu Gin Tyr He Gin Phe Leu Ala
275 280 285
Asp Ser Arg Leu Ser Lys Val Gly He Ala Lys Val Tyr Gly Val Gin
290 295 300
His Pro He Lys Trp Val Glu Ser Phe Ser Ser Phe Asn Glu Gin Arg 305 310 315 320
Ser Asn Phe Phe Glu Ala Arg Val Ser Asn Tyr Ala Lys Gly Ser Val
325 330 335
Ser Phe Asp Asp Phe 340
(2) INFORMATION FOR SEQ ID NO:693:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 689 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 139...627 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693:
CAAAAGCTCA TTGAAGAAAC CCCGGCAGTG GTTTTAGAAG AGGGCGTGCG TGACGTTTGC 60 TAGAAACAGC GATCAAGGCC GCTAAATATA TCGGCTRRTG TGGGGGCGGG GACTTTTGAA 120 TTTTTGTTGG ATTCTAAC ATG AAA GAT TTT TAT TTC ATG GAG ATG AAC ACT 171
Met Lys Asp Phe Tyr Phe Met Glu Met Asn Thr 1 5 10
CGT TTG CAA GTG GAA CAC ACC ATT AGC GAA ATG GTG AGC GGG TTA AAC 219 Arg Leu Gin Val Glu His Thr He Ser Glu Met Val Ser Gly Leu Asn 15 20 25
CTC ATT GAG TGG ATG ATT AAA ATC GCT CAA GGC GAA AAA TTG CCC AAG 267 Leu He Glu Trp Met He Lys He Ala Gin Gly Glu Lys Leu Pro Lys 30 35 40
CAA GAA AGC TTT TCT CTC AAA GGG CAT GCG ATA GAA TGC CGA ATC ACG 315 Gin Glu Ser Phe Ser Leu Lys Gly His Ala He Glu Cys Arg He Thr 45 50 55
GCA GAA GAT CCT AAA AAA TTC TAC CCA AGC CCG GGC AAA ATT ACC GAA 363 Ala Glu Asp Pro Lys Lys Phe Tyr Pro Ser Pro Gly Lys He Thr Glu 60 65 70 75
TGG ATC GCT CCT GGT GGG GTG AAT GTG CGC CTT GAT TCG CAC GCG CAT 411 Trp He Ala Pro Gly Gly Val Asn Val Arg Leu Asp Ser His Ala His 80 85 90
GCC AAT TAT GTC GTG CCT ACG CAC TAT GAT TCG ATG ATT GGC AAG CTC 459 Ala Asn Tyr Val Val Pro Thr His Tyr Asp Ser Met He Gly Lys Leu 95 100 105
ATT GTG TGG GGT GAA AAC AGA GAA AGA GCG ATC GCT AAG ATG AAA AGG 507 He Val Trp Gly Glu Asn Arg Glu Arg Ala He Ala Lys Met Lys Arg 110 115 120
GCT TTA AAG GAA TTT AAA GTA GAA GGC ATT AAA ACG ACC ATT CCT TTC 555 Ala Leu Lys Glu Phe Lys Val Glu Gly He Lys Thr Thr He Pro Phe 125 130 135
CAC CTT GAA ATG CTT GAA AAT GCG GAT TTC AGG CAA GCA AAA ATC CAC 603 His Leu Glu Met Leu Glu Asn Ala Asp Phe Arg Gin Ala Lys He His 140 145 150 155
ACG AAG TAT TTA GAA GAA AAT TTT TAAGTTTTAA GGATTCTTTT AAGCATAGTT 657 Thr Lys Tyr Leu Glu Glu Asn Phe 160
TAAGGGTTTT AAGCGATCAG AAAAAGTCAG CA 689
(2) INFORMATION FOR SEQ ID NO: 694:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 163 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 694:
Met Lys Asp Phe Tyr Phe Met Glu Met Asn Thr Arg Leu Gin Val Glu
1 5 10 15
His Thr He Ser Glu Met Val Ser Gly Leu Asn Leu He Glu Trp Met
20 25 30
He Lys He Ala Gin Gly Glu Lys Leu Pro Lys Gin Glu Ser Phe Ser
35 40 45
Leu Lys Gly His Ala He Glu Cys Arg He Thr Ala Glu Asp Pro Lys
50 55 60
Lys Phe Tyr Pro Ser Pro Gly Lys He Thr Glu Trp He Ala Pro Gly 65 70 75 80
Gly Val Asn Val Arg Leu Asp Ser His Ala His Ala Asn Tyr Val Val
85 90 95
Pro Thr His Tyr Asp Ser Met He Gly Lys Leu He Val Trp Gly Glu
100 105 110
Asn Arg Glu Arg Ala He Ala Lys Met Lys Arg Ala Leu Lys Glu Phe
115 120 125
Lys Val Glu Gly He Lys Thr Thr He Pro Phe His Leu Glu Met Leu
130 135 140
Glu Asn Ala Asp Phe Arg Gin Ala Lys He His Thr Lys Tyr Leu Glu 145 150 155 160
Glu Asn Phe
(2) INFORMATION FOR SEQ ID NO: 695:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1960 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1907 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695:
AACGAGTATT TGCACAATGA ACTCCAAAAG CTTTTAGAAA AAATCTCATC ATG TTC 56
Met Phe
1
TAT CAC TTA ATC GCT CCT TTA AAA AAT AAA ACC CCC CCT TTA ACC TAT 104 Tyr His Leu He Ala Pro Leu Lys Asn Lys Thr Pro Pro Leu Thr Tyr 5 10 15 TTT TCT AAA GAG CAA CAC CAA AAA GGA GCG TTA GTC AAT ATC CCT TTA 152 Phe Ser Lys Glu Gin His Gin Lys Gly Ala Leu Val Asn He Pro Leu 20 25 30
AGG AAT AAA ACG CTT TTA GGC GTC GTC CTT GAA GAA GTT TCA AAA CCC 200 Arg Asn Lys Thr Leu Leu Gly Val Val Leu Glu Glu Val Ser Lys Pro 35 40 45 50
TCT TTT GAA TGC CTA GAG CTA GAA AAA ACC CCT TAT TTT TTA CTC CCC 248 Ser Phe Glu Cys Leu Glu Leu Glu Lys Thr Pro Tyr Phe Leu Leu Pro 55 60 65
TTT CAA ATG GAG CTC GCT ATT TTT ATC GCT CAA TAT TAC TCA GCT AAT 296 Phe Gin Met Glu Leu Ala He Phe He Ala Gin Tyr Tyr Ser Ala Asn 70 75 80
CTT TCT TCA GTT TTA AGC CTT TTT GCC CCT TTT AAA GAA TGC GAT TTA 344 Leu Ser Ser Val Leu Ser Leu Phe Ala Pro Phe Lys Glu Cys Asp Leu 85 90 95
GTG GGG TTA GAA AAA ATT GAG CCT ATT CTT AAT ATA TTA AGC CAA ACG 392 Val Gly Leu Glu Lys He Glu Pro He Leu Asn He Leu Ser Gin Thr 100 105 110
CAA ACA AAC GCT TTA AAA GAA TTG CAA AAA CAT TCA GCA AGC TTG CTC 440 Gin Thr Asn Ala Leu Lys Glu Leu Gin Lys His Ser Ala Ser Leu Leu 115 120 125 130
TTT GGC GAT ACG GGT AGC GGG AAA ACC GAG ATT TAT ATG CAT GCA ATC 488 Phe Gly Asp Thr Gly Ser Gly Lys Thr Glu He Tyr Met His Ala He 135 140 145
GCC CAA ACT TTA GAG CAA AAA AAA AGC GCT TTA TTG TTG GTG CCA GAA 536 Ala Gin Thr Leu Glu Gin Lys Lys Ser Ala Leu Leu Leu Val Pro Glu 150 155 160
ATC GCT CTC ACC CCT CAA ATG CAA CAA CGC CTT AAA AGG GTT TTT AAA 584 He Ala Leu Thr Pro Gin Met Gin Gin Arg Leu Lys Arg Val Phe Lys 165 170 175
GAA AAT TTA GGC TTG TGG CAT AGC AAA CTC TCT CAA AAT CAA AAA AAA 632 Glu Asn Leu Gly Leu Trp His Ser Lys Leu Ser Gin Asn Gin Lys Lys 180 185 190
CAA TTT TTA GAA AAG CTT TAT TCG CAA GAA ATC AAA TTA GTG GTA GGC 680 Gin Phe Leu Glu Lys Leu Tyr Ser Gin Glu He Lys Leu Val Val Gly 195 200 205 210
ACA CGA AGC GCG TTG TTT TTA CCC CTT AAA GAG CTG GGT TTA ATC ATT 728 Thr Arg Ser Ala Leu Phe Leu Pro Leu Lys Glu Leu Gly Leu He He 215 220 225
GTA GAT GAA GAG CAT GAC TTT TCT TAT AAA TCC CAT CAA AGC CCT ATG 776 Val Asp Glu Glu His Asp Phe Ser Tyr Lys Ser His Gin Ser Pro Met 230 235 240 TAT AAC GCT AGG GAT TTA TGC TTG TAT TTA TCT CAT AAA TTC CCT ATT 824 Tyr Asn Ala Arg Asp Leu Cys Leu Tyr Leu Ser His Lys Phe Pro He 245 250 255
CAA GTG ATC TTA GGC TCT GCT ACG CCA AGT TTG AAT AGT TAT AAA CGC 872 Gin Val He Leu Gly Ser Ala Thr Pro Ser Leu Asn Ser Tyr Lys Arg 260 265 270
TTT AAA GAT AAG GCT TTA GTG CGC TTA AAG GGG CGC TAC ACC CCC ACG 920 Phe Lys Asp Lys Ala Leu Val Arg Leu Lys Gly Arg Tyr Thr Pro Thr 275 280 285 290
CAA AAA AAC ATT ATT TTT GAA AAA ACC GAG CGT TTT ATC ACG CCC AAA 968 Gin Lys Asn He He Phe Glu Lys Thr Glu Arg Phe He Thr Pro Lys 295 300 305
CTC CTA GAA GCG CTA CAA CAA GTC CTA GAC AAA AAC GAG CAA GCC ATT 1016 Leu Leu Glu Ala Leu Gin Gin Val Leu Asp Lys Asn Glu Gin Ala He 310 315 320
ATT TTT GTG CCT ACA AGG GCT AAT TTC AAA ACC TTG CTG TGC CAA AGT 1064 He Phe Val Pro Thr Arg Ala Asn Phe Lys Thr Leu Leu Cys Gin Ser 325 330 335
TGT TAC AAA AGC GTT CAA TGC CCC TTT TGC AGC GTG AAT ATG AGC TTG 1112 Cys Tyr Lys Ser Val Gin Cys Pro Phe Cys Ser Val Asn Met Ser Leu 340 345 350
CAT TTA AAG ACC AAC AAA CTC ATG TGC CAT TAT TGC CAT TTT TCA AGC 1160 His Leu Lys Thr Asn Lys Leu Met Cys His Tyr Cys His Phe Ser Ser 355 360 365 370
CCT ATC CCT AAA ATT TGC AGC GCG TGT CAA AGC GAA GTC TTA GTG GGT 1208 Pro He Pro Lys He Cys Ser Ala Cys Gin Ser Glu Val Leu Val Gly 375 380 385
AAA AGG ATA GGC ACT ATG CAA GTG CTA AAG GAA TTA GAG AGC CTT TTA 1256 Lys Arg He Gly Thr Met Gin Val Leu Lys Glu Leu Glu Ser Leu Leu 390 395 400
GAG GGG GCT AAA ATA GCG ATT TTA GAT AAA GAT CAC ACT AGC ACG CAA 1304 Glu Gly Ala Lys He Ala He Leu Asp Lys Asp His Thr Ser Thr Gin 405 410 415
AAA AAA CTC CAC AAT ATT TTA AAC GAT TTC AAC GCT CAA AAA ACG AAT 1352 Lys Lys Leu His Asn He Leu Asn Asp Phe Asn Ala Gin Lys Thr Asn 420 425 430
ATC TTA ATC GGC ACT CAA ATG ATA AGC AAA GGG CAT GAT TAC GCT AAA 1400 He Leu He Gly Thr Gin Met He Ser Lys Gly His Asp Tyr Ala Lys 435 440 445 450
GTG AGT TTG GCG GTT GTT TTA GGC ATA GAC AAT ATC ATC AAA TCT AAT 1448 Val Ser Leu Ala Val Val Leu Gly He Asp Asn He He Lys Ser Asn 455 460 465 AGT TAT AGG GCT TTA GAA GAA GGC GTG TCG TTA CTT TAT CAA ATC GCT 1496 Ser Tyr Arg Ala Leu Glu Glu Gly Val Ser Leu Leu Tyr Gin He Ala 470 475 480
GGG AGG AGC GCT AGG CAA ATT TCT GGC CAA GTG TTC ATT CAA AGC ACC 1544 Gly Arg Ser Ala Arg Gin He Ser Gly Gin Val Phe He Gin Ser Thr 485 490 495
GAA ACC GAT CTG TTA GAA AAT TTC TTA GAA GAT TAT GAA GAT TTT TTA 1592 Glu Thr Asp Leu Leu Glu Asn Phe Leu Glu Asp Tyr Glu Asp Phe Leu 500 505 510
CAA TAC GAA TTG CAA GAA AGG TGC GAA CTC TAC CCG CCT TTT TCT AGG 1640 Gin Tyr Glu Leu Gin Glu Arg Cys Glu Leu Tyr Pro Pro Phe Ser Arg 515 520 525 530
CTG TGT TTG TTG GAG TTT AAG CAT AAA AAC GAA GAA AAA GCC CAA CAA 1688 Leu Cys Leu Leu Glu Phe Lys His Lys Asn Glu Glu Lys Ala Gin Gin 535 540 545
TTG AGC CTA AAA GCC TCT CAA ACC CTT TCT TCG TGT TTA GAA AAG GGC 1736 Leu Ser Leu Lys Ala Ser Gin Thr Leu Ser Ser Cys Leu Glu Lys Gly 550 555 560
GTA ACG CTC TCT AAT TTC AAA GCC CCC ATT GAA AAA ATC GCT TCT TCT 1784 Val Thr Leu Ser Asn Phe Lys Ala Pro He Glu Lys He Ala Ser Ser 565 570 575
TAT CGC TAC CTT ATT TTA TTG CGT TCC AAA AAC CCT TTA AGC CTA ATC 1832 Tyr Arg Tyr Leu He Leu Leu Arg Ser Lys Asn Pro Leu Ser Leu He 580 585 590
AAA AGC GTG CAT GCG TTT TTA AAA TCC GCC CCT AGT ATC CCT TGC AGC 1880 Lys Ser Val His Ala Phe Leu Lys Ser Ala Pro Ser He Pro Cys Ser 595 600 605 610
GTG AAC ATG GAT CCT GTG GAT ATT TTT TAAAAAACTC ATGTTTTATA TATTATT 1934 Val Asn Met Asp Pro Val Asp He Phe 615
TCAAAAAACT TAAGTTTTTC TGGCGA 1960
(2) INFORMATION FOR SEQ ID NO: 696:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 619 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696: Met Phe Tyr His Leu He Ala Pro Leu Lys Asn Lys Thr Pro Pro Leu
1 5 10 15
Thr Tyr Phe Ser Lys Glu Gin His Gin Lys Gly Ala Leu Val Asn He
20 25 30
Pro Leu Arg Asn Lys Thr Leu Leu Gly Val Val Leu Glu Glu Val Ser
35 40 45
Lys Pro Ser Phe Glu Cys Leu Glu Leu Glu Lys Thr Pro Tyr Phe Leu
50 55 60
Leu Pro Phe Gin Met Glu Leu Ala He Phe He Ala Gin Tyr Tyr Ser 65 70 75 80
Ala Asn Leu Ser Ser Val Leu Ser Leu Phe Ala Pro Phe Lys Glu Cys
85 90 95
Asp Leu Val Gly Leu Glu Lys He Glu Pro He Leu Asn He Leu Ser
100 105 110
Gin Thr Gin Thr Asn Ala Leu Lys Glu Leu Gin Lys His Ser Ala Ser
115 120 125
Leu Leu Phe Gly Asp Thr Gly Ser Gly Lys Thr Glu He Tyr Met His
130 135 140
Ala He Ala Gin Thr Leu Glu Gin Lys Lys Ser Ala Leu Leu Leu Val 145 150 155 160
Pro Glu He Ala Leu Thr Pro Gin Met Gin Gin Arg Leu Lys Arg Val
165 170 175
Phe Lys Glu Asn Leu Gly Leu Trp His Ser Lys Leu Ser Gin Asn Gin
180 185 190
Lys Lys Gin Phe Leu Glu Lys Leu Tyr Ser Gin Glu He Lys Leu Val
195 200 205
Val Gly Thr Arg Ser Ala Leu Phe Leu Pro Leu Lys Glu Leu Gly Leu
210 215 220
He He Val Asp Glu Glu His Asp Phe Ser Tyr Lys Ser His Gin Ser 225 230 235 240
Pro Met Tyr Asn Ala Arg Asp Leu Cys Leu Tyr Leu Ser His Lys Phe
245 250 255
Pro He Gin Val He Leu Gly Ser Ala Thr Pro Ser Leu Asn Ser Tyr
260 265 270
Lys Arg Phe Lys Asp Lys Ala Leu Val Arg Leu Lys Gly Arg Tyr Thr
275 280 285
Pro Thr Gin Lys Asn He He Phe Glu Lys Thr Glu Arg Phe He Thr
290 295 300
Pro Lys Leu Leu Glu Ala Leu Gin Gin Val Leu Asp Lys Asn Glu Gin 305 310 315 320
Ala He He Phe Val Pro Thr Arg Ala Asn Phe Lys Thr Leu Leu Cys
325 330 335
Gin Ser Cys Tyr Lys Ser Val Gin Cys Pro Phe Cys Ser Val Asn Met
340 345 350
Ser Leu His Leu Lys Thr Asn Lys Leu Met Cys His Tyr Cys His Phe
355 360 365
Ser Ser Pro He Pro Lys He Cys Ser Ala Cys Gin Ser Glu Val Leu
370 375 380
Val Gly Lys Arg He Gly Thr Met Gin Val Leu Lys Glu Leu Glu Ser 385 390 395 400
Leu Leu Glu Gly Ala Lys He Ala He Leu Asp Lys Asp His Thr Ser
405 410 415
Thr Gin Lys Lys Leu His Asn He Leu Asn Asp Phe Asn Ala Gin Lys
420 425 430
Thr Asn He Leu He Gly Thr Gin Met He Ser Lys Gly His Asp Tyr 435 440 445
Ala Lys Val Ser Leu Ala Val Val Leu Gly He Asp Asn He He Lys
450 455 460
Ser Asn Ser Tyr Arg Ala Leu Glu Glu Gly Val Ser Leu Leu Tyr Gin 465 470 475 480
He Ala Gly Arg Ser Ala Arg Gin He Ser Gly Gin Val Phe He Gin
485 490 495
Ser Thr Glu Thr Asp Leu Leu Glu Asn Phe Leu Glu Asp Tyr Glu Asp
500 505 510
Phe Leu Gin Tyr Glu Leu Gin Glu Arg Cys Glu Leu Tyr Pro Pro Phe
515 520 525
Ser Arg Leu Cys Leu Leu Glu Phe Lys His Lys Asn Glu Glu Lys Ala
530 535 540
Gin Gin Leu Ser Leu Lys Ala Ser Gin Thr Leu Ser Ser Cys Leu Glu 545 550 555 560
Lys Gly Val Thr Leu Ser Asn Phe Lys Ala Pro He Glu Lys He Ala
565 570 575
Ser Ser Tyr Arg Tyr Leu He Leu Leu Arg Ser Lys Asn Pro Leu Ser
580 585 590
Leu He Lys Ser Val His Ala Phe Leu Lys Ser Ala Pro Ser He Pro
595 600 605
Cys Ser Val Asn Met Asp Pro Val Asp He Phe 610 615
(2) INFORMATION FOR SEQ ID NO: 697:
(i) SEQUENCE CHARACTERISTICS: - (A) LENGTH: 2438 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 82...2373 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697:
AAGATTAGCC ATGCTGACTT GCCAGATCAA TGATTTGCGC AGTTTCTTTG AAACTGATTT 60 GAGAGTGTTG GAGAGCTTTT A ATG AAA CTG AGC ATT AAT GAT TTG AAT GTT 111
Met Lys Leu Ser He Asn Asp Leu Asn Val 1 5 10
TTT GTC AAT ACG CCT AAA GAT ATA GCC AAA CTC TGT GAG GAT TTG AGT 159 Phe Val Asn Thr Pro Lys Asp He Ala Lys Leu Cys Glu Asp Leu Ser 15 20 25
CGC TTA GGT TTA GAA GTG GAA AGC TGT ATC CCT TGT ATC GCT CCT AAA 207 Arg Leu Gly Leu Glu Val Glu Ser Cys He Pro Cys He Ala Pro Lys 30 35 40
AAT GTG GTT GTG GGT AAA ATT TTA GAA AAA GCC CCC CAT AAA AAC GCT 255 Asn Val Val Val Gly Lys He Leu Glu Lys Ala Pro His Lys Asn Ala 45 50 55
GAA AAA CTC AGC GTG TGT CAA GTG GAT GTG GGT AAA GAA GTG TTG CAA 303 Glu Lys Leu Ser Val Cys Gin Val Asp Val Gly Lys Glu Val Leu Gin 60 65 70
ATC GTG TGT GGG GCT AAA AAT GTC GCG CCA AAC CAA TTC GTG CCA GTC 351 He Val Cys Gly Ala Lys Asn Val Ala Pro Asn Gin Phe Val Pro Val 75 80 85 90
GCT TTA AAC GGG GCG CTA ATC GGC TCA ACC ACC ATC GCT AAA ACG GAG 399 Ala Leu Asn Gly Ala Leu He Gly Ser Thr Thr He Ala Lys Thr Glu 95 100 105
CTT AGG GGG GTT GAA AGC CAT GGC ATG ATT TGC TCT AGC ATT GAA TTA 447 Leu Arg Gly Val Glu Ser His Gly Met He Cys Ser Ser He Glu Leu 110 115 120
GGC TTC CCT AAA ATC AAT GAT GGC ATC TTG GAA TTA GAT GAG AGC GTT 495 Gly Phe Pro Lys He Asn Asp Gly He Leu Glu Leu Asp Glu Ser Val 125 130 135
GGG GAG TTG GTT TTA GGG AAA GAA TTA AAC GAA TAC GCC CCT TTC AAC 543 Gly Glu Leu Val Leu Gly Lys Glu Leu Asn Glu Tyr Ala Pro Phe Asn 140 145 150
ACG CAT GTT TTA GAA ATT TCA TTG ACT CCC AAT CGT GGG GAT TGC TTG 591 Thr His Val Leu Glu He Ser Leu Thr Pro Asn Arg Gly Asp Cys Leu 155 160 165 170
AGC GTT TTA GGT ATT GCC AGA GAA ATT AGC GCC TTT TAT CAC ACG CCC 639 Ser Val Leu Gly He Ala Arg Glu He Ser Ala Phe Tyr His Thr Pro 175 180 185
CTA AAG CCT ATT AAG GCT TTA AAT TTT ACG CCC AAA AGC GGT TTG ATC 687 Leu Lys Pro He Lys Ala Leu Asn Phe Thr Pro Lys Ser Gly Leu He 190 195 200
ACG CTT AGT GCG GGT GAA AAT ATT GAA TCG CAT CTG GCT TAT TAT TTG 735 Thr Leu Ser Ala Gly Glu Asn He Glu Ser His Leu Ala Tyr Tyr Leu 205 210 215
ATT TGC AAC CAT TCA TTA AAA ACC CCT TTA AAT ATC AAA CTT TCG CTC 783 He Cys Asn His Ser Leu Lys Thr Pro Leu Asn He Lys Leu Ser Leu 220 225 230
GCT CAT AAT AAT GCC TTG AGT GAG AAC GAT CTG AAC AAT TTC ATA GAA 831 Ala His Asn Asn Ala Leu Ser Glu Asn Asp Leu Asn Asn Phe He Glu 235 240 245 250
TTT AGC ACG CAT TTT AGT GGG GTA ATA ATG AAC GCT TAT AGC CTA AAT 879 Phe Ser Thr His Phe Ser Gly Val He Met Asn Ala Tyr Ser Leu Asn 255 260 265
ACA ACC CCT ATG GAT TTG AGC GTG AAA AAC GAT GAA AAC AAC CTT GAA 927 Thr Thr Pro Met Asp Leu Ser Val Lys Asn Asp Glu Asn Asn Leu Glu 270 275 280
AGC GTT TAT ATC AAC CAT CAA AAA CGC TCC ACG ATC GCT ATC AAG CAT 975 Ser Val Tyr He Asn His Gin Lys Arg Ser Thr He Ala He Lys His 285 290 295
CAA GTT CAA AAA GAT TTG AGC GAG TGT TTG CTT TTA GAG GCA AGT TAC 1023 Gin Val Gin Lys Asp Leu Ser Glu Cys Leu Leu Leu Glu Ala Ser Tyr 300 305 310
ACC GAT CCG ATA AGC CTG TCT TTA AAA TTA CAC GCC CTA AAA GAT AAA 1071 Thr Asp Pro He Ser Leu Ser Leu Lys Leu His Ala Leu Lys Asp Lys 315 320 325 330
ACG CTT CAA AAA GAC AAC GCC CTT ATT TAT AGA AGC GCT AGG GGG AGT 1119 Thr Leu Gin Lys Asp Asn Ala Leu He Tyr Arg Ser Ala Arg Gly Ser 335 340 345
AAC CCT AAT TTA TCA GAC GGC TTG AAT TTT TTA AGC GCT CAT TTG AAA 1167 Asn Pro Asn Leu Ser Asp Gly Leu Asn Phe Leu Ser Ala His Leu Lys 350 355 360
GCC ACG ATT TTA GAA AGC AAA CAA ACT GAG CAT TCT TTA AAA GAT CGC 1215 Ala Thr He Leu Glu Ser Lys Gin Thr Glu His Ser Leu Lys Asp Arg 365 370 375
ACC CTT ACA TTC CAG CTT GAA GAC ATT ACT GAA ATT TTG GGG CTT GCT 1263 Thr Leu Thr Phe Gin Leu Glu Asp He Thr Glu He Leu Gly Leu Ala 380 385 390
GTA GAG AAA GAA AAA ATT CAA GGC ATT TTA AAA AAT TTA GGC TTT AAA 1311 Val Glu Lys Glu Lys He Gin Gly He Leu Lys Asn Leu Gly Phe Lys 395 400 405 410
GTC AGC GTA AAA GAG CCA AAC TCA AAA CCC CAA ATT TTA GAG GTT ATT 1359 Val Ser Val Lys Glu Pro Asn Ser Lys Pro Gin He Leu Glu Val He 415 420 425
GCG CCA AAT TTC AGG CAT GAC ATT AAA ACG ATC CAA GAT ATT GCT GAA 1407 Ala Pro Asn Phe Arg His Asp He Lys Thr He Gin Asp He Ala Glu 430 435 440
GAA ATT TTG CGC TTT GTA GGG ATT GAT AAT CTA GTC TCA AAG CCC CTT 1455 Glu He Leu Arg Phe Val Gly He Asp Asn Leu Val Ser Lys Pro Leu 445 450 455
CAT TGT GTC AGT AGC AAA AAT TCA AAC CCC AAT TAC GAC ACG CAC CGC 1503 His Cys Val Ser Ser Lys Asn Ser Asn Pro Asn Tyr Asp Thr His Arg 460 465 470 TTT TTT GAA AAC CTT AAA CAC AAG GCT CTC GCT TGC GGT TTT AAA GAA 1551 Phe Phe Glu Asn Leu Lys His Lys Ala Leu Ala Cys Gly Phe Lys Glu 475 480 485 490
GTC ATT CAT TAC GTG TTT TAC TCT AAA GAA AAA CAG CAA AAA TTA GGC 1599 Val He His Tyr Val Phe Tyr Ser Lys Glu Lys Gin Gin Lys Leu Gly 495 500 505
TTT GAA GTT TTA GAA GAT CCC CTA GAA TTG CAA AAC CCT ATC ACA ACG 1647 Phe Glu Val Leu Glu Asp Pro Leu Glu Leu Gin Asn Pro He Thr Thr 510 515 520
GAG TTA AAC ACC CTA AGG ACG AGT CTT GTT TGC GGG CTT TTA GAC GCC 1695 Glu Leu Asn Thr Leu Arg Thr Ser Leu Val Cys Gly Leu Leu Asp Ala 525 530 535
AGT TTA AGG AAT AAA AAT TTA GGG TTT AAA AGC ATA GCC CTT TAT GAA 1743 Ser Leu Arg Asn Lys Asn Leu Gly Phe Lys Ser He Ala Leu Tyr Glu 540 545 550
AAG GGG AGC GTG TAT AAC TCT AAA AGA GAA GAA ATC CAA AAA CTA GGC 1791 Lys Gly Ser Val Tyr Asn Ser Lys Arg Glu Glu He Gin Lys Leu Gly 555 560 565 570
TTT TTA ATA AGC GGC TTG CAA AAA AAA GAA AGC TAC CCT GAT ACT AAG 1839 Phe Leu He Ser Gly Leu Gin Lys Lys Glu Ser Tyr Pro Asp Thr Lys 575 580 585
GGC AAG GCT TGG GAT TTT TAC TCT TTT GCC GAA TGC GTT TCA AAA GTT 1887 Gly Lys Ala Trp Asp Phe Tyr Ser Phe Ala Glu Cys Val Ser Lys Val 590 595 600
ATA GGG GAT TTC AGC TTG GAA AAA CTA ACC ACT CAA ACC CCC ATT AAC 1935 He Gly Asp Phe Ser Leu Glu Lys Leu Thr Thr Gin Thr Pro He Asn 605 610 615
CAC CCC TAC CAG AGC GCT AAA ATC ATT CAA AAT CAT GAA ATC ATA GGC 1983 His Pro Tyr Gin Ser Ala Lys He He Gin Asn His Glu He He Gly 620 625 630
GTG ATC GCT AAA ATC CAC CCT AAA GTG ATC CAG GAA TTG GAT TTG TTT 2031 Val He Ala Lys He His Pro Lys Val He Gin Glu Leu Asp Leu Phe 635 640 645 650
GAA AGC TAT TAC GCT GAG ATA GAC GCT TTT AAA CTC AAA CGC CCT GCT 2079 Glu Ser Tyr Tyr Ala Glu He Asp Ala Phe Lys Leu Lys Arg Pro Ala 655 660 665
ATG CTA TTA AAA CCC TTT AGC ATT TAT CCT AGC AGT GTT AGG GAT TTG 2127 Met Leu Leu Lys Pro Phe Ser He Tyr Pro Ser Ser Val Arg Asp Leu 670 675 680
ACT CTC ATC ATT GAT GAG AAT ACC GCT TTT AGT GGG ATT AAA AAA GCC 2175 Thr Leu He He Asp Glu Asn Thr Ala Phe Ser Gly He Lys Lys Ala 685 690 695 CTA AAG GAC GCT CAA ATC CCT AAT TTA AGC GAG ATT CTA CCC CTT GAT 2223 Leu Lys Asp Ala Gin He Pro Asn Leu Ser Glu He Leu Pro Leu Asp 700 705 710
ATT TTT AAA GAA AGT AAT AAT TCC ATA GCC TTA AGC GTG CGT TGC GTG 2271 He Phe Lys Glu Ser Asn Asn Ser He Ala Leu Ser Val Arg Cys Val 715 720 725 730
ATC CAT TCT TTA GAA AAA ACC CTG AAT GAT GAA GAG GTC AAT TCA GCC 2319 He His Ser Leu Glu Lys Thr Leu Asn Asp Glu Glu Val Asn Ser Ala 735 740 745
GTG CAA AAA GCA CTT GAA ATT TTA GAA AAA GAA TTT AAC GCC CGC CTT 2367 Val Gin Lys Ala Leu Glu He Leu Glu Lys Glu Phe Asn Ala Arg Leu 750 755 760
AAA GGA TAATATAAAG GATAATATGT GATAGAGCTT GACATTAACG CTAGCGATAA AT 2425 Lys Gly
CGCTCTCACA CAG 243!
(2) INFORMATION FOR SEQ ID NO: 698:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 764 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698:
Met Lys Leu Ser He Asn Asp Leu Asn Val Phe Val Asn Thr Pro Lys
1 5 10 15
Asp He Ala Lys Leu Cys Glu Asp Leu Ser Arg Leu Gly Leu Glu Val
20 25 30
Glu Ser Cys He Pro Cys He Ala Pro Lys Asn Val Val Val Gly Lys
35 40 45
He Leu Glu Lys Ala Pro His Lys Asn Ala Glu Lys Leu Ser Val Cys
50 55 60
Gin Val Asp Val Gly Lys Glu Val Leu Gin He Val Cys Gly Ala Lys 65 70 75 80
Asn Val Ala Pro Asn Gin Phe Val Pro Val Ala Leu Asn Gly Ala Leu
85 90 95
He Gly Ser Thr Thr He Ala Lys Thr Glu Leu Arg Gly Val Glu Ser
100 105 110
His Gly Met He Cys Ser Ser He Glu Leu Gly Phe Pro Lys He Asn
115 120 125
Asp Gly He Leu Glu Leu Asp Glu Ser Val Gly Glu Leu Val Leu Gly
130 135 140
Lys Glu Leu Asn Glu Tyr Ala Pro Phe Asn Thr His Val Leu Glu He 145 150 155 160 Ser Leu Thr Pro Asn Arg Gly Asp Cys Leu Ser Val Leu Gly He Ala
165 170 175
Arg Glu He Ser Ala Phe Tyr His Thr Pro Leu Lys Pro He Lys Ala
180 185 190
Leu Asn Phe Thr Pro Lys Ser Gly Leu He Thr Leu Ser Ala Gly Glu
195 200 205
Asn He Glu Ser His Leu Ala Tyr Tyr Leu He Cys Asn His Ser Leu
210 215 220
Lys Thr Pro Leu Asn He Lys Leu Ser Leu Ala His Asn Asn Ala Leu 225 230 235 240
Ser Glu Asn Asp Leu Asn Asn Phe He Glu Phe Ser Thr His Phe Ser
245 250 255
Gly Val He Met Asn Ala Tyr Ser Leu Asn Thr Thr Pro Met Asp Leu
260 265 270
Ser Val Lys Asn Asp Glu Asn Asn Leu Glu Ser Val Tyr He Asn His
275 280 285
Gin Lys Arg Ser Thr He Ala He Lys His Gin Val Gin Lys Asp Leu
290 295 300
Ser Glu Cys Leu Leu Leu Glu Ala Ser Tyr Thr Asp Pro He Ser Leu 305 310 315 320
Ser Leu Lys Leu His Ala Leu Lys Asp Lys Thr Leu Gin Lys Asp Asn
325 330 335
Ala Leu He Tyr Arg Ser Ala Arg Gly Ser Asn Pro Asn Leu Ser Asp
340 345 350
Gly Leu Asn Phe Leu Ser Ala His Leu Lys Ala Thr He Leu Glu Ser
355 360 365
Lys Gin Thr Glu His Ser Leu Lys Asp Arg Thr Leu Thr Phe Gin Leu
370 375 380
Glu Asp He Thr Glu He Leu Gly Leu Ala Val Glu Lys Glu Lys He 385 390 395 400
Gin Gly He Leu Lys Asn Leu Gly Phe Lys Val Ser Val Lys Glu Pro
405 410 415
Asn Ser Lys Pro Gin He Leu Glu Val He Ala Pro Asn Phe Arg His
420 425 430
Asp He Lys Thr He Gin Asp He Ala Glu Glu He Leu Arg Phe Val
435 440 445
Gly He Asp Asn Leu Val Ser Lys Pro Leu His Cys Val Ser Ser Lys
450 455 460
Asn Ser Asn Pro Asn Tyr Asp Thr His Arg Phe Phe Glu Asn Leu Lys 465 470 475 480
His Lys Ala Leu Ala Cys Gly Phe Lys Glu Val He His Tyr Val Phe
485 490 495
Tyr Ser Lys Glu Lys Gin Gin Lys Leu Gly Phe Glu Val Leu Glu Asp
500 505 510
Pro Leu Glu Leu Gin Asn Pro He Thr Thr Glu Leu Asn Thr Leu Arg
515 520 525
Thr Ser Leu Val Cys Gly Leu Leu Asp Ala Ser Leu Arg Asn Lys Asn
530 535 540
Leu Gly Phe Lys Ser He Ala Leu Tyr Glu Lys Gly Ser Val Tyr Asn 545 550 555 560
Ser Lys Arg Glu Glu He Gin Lys Leu Gly Phe Leu He Ser Gly Leu
565 570 575
Gin Lys Lys Glu Ser Tyr Pro Asp Thr Lys Gly Lys Ala Trp Asp Phe
580 585 590
Tyr Ser Phe Ala Glu Cys Val Ser Lys Val He Gly Asp Phe Ser Leu 595 600 605
Glu Lys Leu Thr Thr Gin Thr Pro He Asn His Pro Tyr Gin Ser Ala
610 615 620
Lys He He Gin Asn His Glu He He Gly Val He Ala Lys He His 625 630 635 640
Pro Lys Val He Gin Glu Leu Asp Leu Phe Glu Ser Tyr Tyr Ala Glu
645 650 655
He Asp Ala Phe Lys Leu Lys Arg Pro Ala Met Leu Leu Lys Pro Phe
660 665 670
Ser He Tyr Pro Ser Ser Val Arg Asp Leu Thr Leu He He Asp Glu
675 680 685
Asn Thr Ala Phe Ser Gly He Lys Lys Ala Leu Lys Asp Ala Gin He
690 695 700
Pro Asn Leu Ser Glu He Leu Pro Leu Asp He Phe Lys Glu Ser Asn 705 710 715 720
Asn Ser He Ala Leu Ser Val Arg Cys Val He His Ser Leu Glu Lys
725 730 735
Thr Leu Asn Asp Glu Glu Val Asn Ser Ala Val Gin Lys Ala Leu Glu
740 745 750
He Leu Glu Lys Glu Phe Asn Ala Arg Leu Lys Gly 755 760
(2) INFORMATION FOR SEQ ID NO: 699:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1097 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 492...1040 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 699:
TTTCTTTTAT TAAAGTTGTA TTATAGCTTT TTAAAAATAA AATATAAGGA TCGTTATTGC 60
ACACCTTAAT AGAGCGATTA GAAAAGGTTA CTAATAGCAA AGAGTTAGAA GAAGCGCGCT 120
TGAATGCTTT GGGTAAAAAA GGGGTTTTTG CGGATAAATT CAACCAGCTC AAACATCTGA 180
ACGGCGAAGA AAAAAACGCC TTTGCTAAAG AAATCCACCA TTATAAACAA GCGTTTGAAA 240
AAGCCTTTGA ATGGAAAAAA AAGGCTATTA TAGAGCTTGA ATTAGAAGAA CGCTTGAAAA 300
AAGAAAAAAT TGATGTGAGC TTGTTTAACG CTATCAAAAC AAGCTCTTCT CACCCTTTAA 360
ACTACACTAA AAATAAAATC ATTGAATTTT TCACCCCATT AGGATACAAG CTTGAAATCG 420
GCTCTTTAGT GGAAGATGAT TTCCATAATT TCAGCGCTTT AAACTTGCCC CCTTACCATC 480
CTGCAAGAGA C ATG CAA GAC ACT TTT TAT TTT AAA GAT CAC AAG CTT TTA 530 Met Gin Asp Thr Phe Tyr Phe Lys Asp His Lys Leu Leu 1 5 10
AGG ACC CAC ACT TCG CCC GTG CAA ATC CAC ACC ATG CAA GAA CAA ACC 578 Arg Thr His Thr Ser Pro Val Gin He His Thr Met Gin Glu Gin Thr 15 20 25
CCA CCC ATT AAG ATG ATT TGT TTA GGC GAA ACC TTT AGG CGC GAT TAT 626 Pro Pro He Lys Met He Cys Leu Gly Glu Thr Phe Arg Arg Asp Tyr 30 35 40 45
GAT TTG ACC CAC ACG CCC ATG TTC CAC CAA ATT GAA GGG CTT GTC GTG 674 Asp Leu Thr His Thr Pro Met Phe His Gin He Glu Gly Leu Val Val 50 55 60
GAT CAA AAA GGG AAT ATC CGT TTC ACA CAT TTA AAA GGT GTG ATC GAA 722 Asp Gin Lys Gly Asn He Arg Phe Thr His Leu Lys Gly Val He Glu 65 70 75
GAC TTT TTG CAT TAT TTC TTT GGG GGC GTG AAG TTA AGG TGG CGC TCT 770 Asp Phe Leu His Tyr Phe Phe Gly Gly Val Lys Leu Arg Trp Arg Ser 80 85 90
AGC TTT TTC CCT TTC ACA GAG CCA AGC GCT GAA GTG GAT ATT AGT TGC 818 Ser Phe Phe Pro Phe Thr Glu Pro Ser Ala Glu Val Asp He Ser Cys 95 100 105
GTG TTT TGC AAG CAA GAA GGC TGT AGG GTT TGC TCG CAC ACA GGC TGG 866 Val Phe Cys Lys Gin Glu Gly Cys Arg Val Cys Ser His Thr Gly Trp 110 115 120 125
TTA GAA GTG TTG GGC TGT GGC ATG GTC AAT AAT GCG GTG TTT GAA GCC 914 Leu Glu Val Leu Gly Cys Gly Met Val Asn Asn Ala Val Phe Glu Ala 130 135 140
ATA GGG TAT GAG AAT GTG AGC GGG TTT GCT TTT GGC ATG GGG ATT GAA 962 He Gly Tyr Glu Asn Val Ser Gly Phe Ala Phe Gly Met Gly He Glu 145 150 155
AGA TTA GCC ATG CTG ACT TGC CAG ATC AAT GAT TTG CGC AGT TTC TTT 1010 Arg Leu Ala Met Leu Thr Cys Gin He Asn Asp Leu Arg Ser Phe Phe 160 165 170
GAA ACT GAT TTG AGA GTG TTG GAG AGC TTT TAATGAAACT GAGCATTAAT GAT 1063 Glu Thr Asp Leu Arg Val Leu Glu Ser Phe 175 180
TTGAATGTTT TTGTCAATAC GCCTAAAGAT ATAG 1097
(2) INFORMATION FOR SEQ ID NO: 700:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 183 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:700:
Met Gin Asp Thr Phe Tyr Phe Lys Asp His Lys Leu Leu Arg Thr His
1 5 10 15
Thr Ser Pro Val Gin He His Thr Met Gin Glu Gin Thr Pro Pro He
20 25 30
Lys Met He Cys Leu Gly Glu Thr Phe Arg Arg Asp Tyr Asp Leu Thr
35 40 45
His Thr Pro Met Phe His Gin He Glu Gly Leu Val Val Asp Gin Lys
50 55 60
Gly Asn He Arg Phe Thr His Leu Lys Gly Val He Glu Asp Phe Leu 65 70 75 80
His Tyr Phe Phe Gly Gly Val Lys Leu Arg Trp Arg Ser Ser Phe Phe
85 90 95
Pro Phe Thr Glu Pro Ser Ala Glu Val Asp He Ser Cys Val Phe Cys
100 105 110
Lys Gin Glu Gly Cys Arg Val Cys Ser His Thr Gly Trp Leu Glu Val
115 120 125
Leu Gly Cys Gly Met Val Asn Asn Ala Val Phe Glu Ala He Gly Tyr
130 135 140
Glu Asn Val Ser Gly Phe Ala Phe Gly Met Gly He Glu Arg Leu Ala 145 150 155 160
Met Leu Thr Cys Gin He Asn Asp Leu Arg Ser Phe Phe Glu Thr Asp
165 170 175
Leu Arg Val Leu Glu Ser Phe 180
(2) INFORMATION FOR SEQ ID NO: 701:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 517 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(n) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...464 (D) OTHER INFORMATION.
(xi) SEQUENCE DESCRIPTION. SEQ ID NO: 701:
TATTGACTTT CATAGAAAGT ATTTTAACCT CTTTTTGTTA AAATAGGTCT ATG AAA 56
Met Lys
1
AAA ATT GAT GAT ATG AGA CAC GGA AGA CAT TGT GTT TTT TTA ATG CAT 104 Lys He Asp Asp Met Arg His Gly Arg His Cys Val Phe Leu Met His 5 10 15 GTG CAT TTT GTA TTT GTT ACT AAA TAC AGG CGT TCA GCA TTC AAT AAG 152 Val His Phe Val Phe Val Thr Lys Tyr Arg Arg Ser Ala Phe Asn Lys 20 25 30
GAA GTG ATA GAT TTT TTA GGA TCG GTG TTT GCC AAA GTG TGT AAG GAC 200 Glu Val He Asp Phe Leu Gly Ser Val Phe Ala Lys Val Cys Lys Asp 35 40 45 50
TTT GAG AGC GAA TTG GTA GAA TTT GAT GGG GAG AGC GAT CAT GTG CAT 248 Phe Glu Ser Glu Leu Val Glu Phe Asp Gly Glu Ser Asp His Val His 55 60 65
TTG CTT ATC AAC TAC CCT CCA AAA GTG AGC GTG AGT AAG TTA GTT AAT 296 Leu Leu He Asn Tyr Pro Pro Lys Val Ser Val Ser Lys Leu Val Asn 70 75 80
TCT TTA AAA GGC GTT AGC AGT CGT TTG ACT AGA CAA CAC CAT TTC AAA 344 Ser Leu Lys Gly Val Ser Ser Arg Leu Thr Arg Gin His His Phe Lys 85 90 95
AGC GTT GAA GCT AGT TTG TGG GGG AAG CAT TTA TGG TCG CCT AGT TAT 392 Ser Val Glu Ala Ser Leu Trp Gly Lys His Leu Trp Ser Pro Ser Tyr 100 105 110
TTC GCT GGG AGT TGT GGG GAC GCG CCT TTA GAG ATG ATT AAG CAA TAC 440 Phe Ala Gly Ser Cys Gly Asp Ala Pro Leu Glu Met He Lys Gin Tyr 115 120 125 130
ATA CAA GAT CAA GAA ACA CCG CAT TAAATTAGCT AACTTTGATT TTTAAGTAGA 494 He Gin Asp Gin Glu Thr Pro His 135
ACGCGCTAAA AAGCGAATGG ATC 517
(2) INFORMATION FOR SEQ ID NO: 702:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 138 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 702:
Met Lys Lys He Asp Asp Met Arg His Gly Arg His Cys Val Phe Leu
1 5 10 15
Met His Val His Phe Val Phe Val Thr Lys Tyr Arg Arg Ser Ala Phe
20 25 30
Asn Lys Glu Val He Asp Phe Leu Gly Ser Val Phe Ala Lys Val Cys
35 40 45
Lys Asp Phe Glu Ser Glu Leu Val Glu Phe Asp Gly Glu Ser Asp His 50 55 60 Val His Leu Leu He Asn Tyr Pro Pro Lys Val Ser Val Ser Lys Leu 65 70 75 80
Val Asn Ser Leu Lys Gly Val Ser Ser Arg Leu Thr Arg Gin His His
85 90 95
Phe Lys Ser Val Glu Ala Ser Leu Trp Gly Lys His Leu Trp Ser Pro
100 105 110
Ser Tyr Phe Ala Gly Ser Cys Gly Asp Ala Pro Leu Glu Met He Lys
115 120 125
Gin Tyr He Gin Asp Gin Glu Thr Pro His 130 135
(2) INFORMATION FOR SEQ ID NO: 703:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1786 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1733 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 703:
AAAAATAACC CCCATCTCTT TAAAACCTTA TCATAATGAA AGGATAAAAA ATG CAA 56
Met Gin 1
GAA GTC CAT GAT TAT GGG ATT AAA TTT TGG AGC AAT AAC GAA TTT AAG 104 Glu Val His Asp Tyr Gly He Lys Phe Trp Ser Asn Asn Glu Phe Lys 5 10 15
ATA GAA AAA GGC TTG GTT AAA GTC TGT CAT GGT AAA AAC CCC TCG CTT 152 He Glu Lys Gly Leu Val Lys Val Cys His Gly Lys Asn Pro Ser Leu 20 25 30
TTA GAA ATC GTT CAA AGC GTG CGC GAT AAG GGC TAT AGA GGA CCT TTG 200 Leu Glu He Val Gin Ser Val Arg Asp Lys Gly Tyr Arg Gly Pro Leu 35 40 45 50
TTG GTG CGA TTC CCC CAT TTG GTG CAA AAA CAA ATC AAA AGC CTG TTT 248 Leu Val Arg Phe Pro His Leu Val Gin Lys Gin He Lys Ser Leu Phe 55 60 65
GAT GCG TTT TCT TCA GCG ATT AAA GAG TAT CAA TAC AGC GGG GCT TTT 296 Asp Ala Phe Ser Ser Ala He Lys Glu Tyr Gin Tyr Ser Gly Ala Phe 70 75 80 AAG GCG GTT TTC CCT TTA AAA GTC AAT CAA ATG CCC TCG TTT GTT TTC 344 Lys Ala Val Phe Pro Leu Lys Val Asn Gin Met Pro Ser Phe Val Phe 85 90 95
CCT TTA GTG CAG GGG GCT AAG GGT TTG AAT TAC GGA TTA GAG GCT GGG 392 Pro Leu Val Gin Gly Ala Lys Gly Leu Asn Tyr Gly Leu Glu Ala Gly 100 105 110
AGC AAG TCT GAA CTC ATC ATC GCA ATG AGT TAC ACT AAC CCT AAA GCC 440 Ser Lys Ser Glu Leu He He Ala Met Ser Tyr Thr Asn Pro Lys Ala 115 120 125 130
CCT ATC ACC GTG AAT GGC TTT AAA GAC AAA GAA ATG ATT GAG CTT GGC 488 Pro He Thr Val Asn Gly Phe Lys Asp Lys Glu Met He Glu Leu Gly 135 140 145
TTT ATC GCT AAA AGC ATG CAG CAT GAG ATC ACT TTA ACG ATT GAG GGT 536 Phe He Ala Lys Ser Met Gin His Glu He Thr Leu Thr He Glu Gly 150 155 160
TTG AAT GAA TTG AAA ACC ATT ATC GCC GTG GCT AAA CAA AAC GAG TTT 584 Leu Asn Glu Leu Lys Thr He He Ala Val Ala Lys Gin Asn Glu Phe 165 170 175
TTA GCC TGC CCT AAA ATT GGC ATC CGC ATC CGT TTG CAC AGC ACT GGC 632 Leu Ala Cys Pro Lys He Gly He Arg He Arg Leu His Ser Thr Gly 180 185 190
ACT GGC GTT TGG GCA AAG AGT GGG GGG ATC AAT TCT AAA TTT GGT CTT 680 Thr Gly Val Trp Ala Lys Ser Gly Gly He Asn Ser Lys Phe Gly Leu 195 200 205 210
AGC AGC ACT GAA GTT TTA GAG GCG ATG CGC CTT TTA GAA GAA AAC GAC 728 Ser Ser Thr Glu Val Leu Glu Ala Met Arg Leu Leu Glu Glu Asn Asp 215 220 225
TTG TTA GAG CAT TTC CAC ATG ATA CAT TTC CAT ATA GGC TCT CAA ATC 776 Leu Leu Glu His Phe His Met He His Phe His He Gly Ser Gin He 230 235 240
AGC GAT ATT TCG CCC TTA AAA AAG GCT TTA AGA GAA GCG GGT AAC TTG 824 Ser Asp He Ser Pro Leu Lys Lys Ala Leu Arg Glu Ala Gly Asn Leu 245 250 255
TAT GCA GAA TTG CGT AAA ATG GGC GCT AAA AAT CTT AAT AGC GTG AAT 872 Tyr Ala Glu Leu Arg Lys Met Gly Ala Lys Asn Leu Asn Ser Val Asn 260 265 270
ATT GGA GGG GGG TTA GCC GTA GAA TAC ACC CAA CAC AAG CAC CAC CAA 920 He Gly Gly Gly Leu Ala Val Glu Tyr Thr Gin His Lys His His Gin 275 280 285 290
GAC AAA AAC TAC ACT TTA GAG GAA TTC AGC GCT GAT GTG GTG TTT TTA 968 Asp Lys Asn Tyr Thr Leu Glu Glu Phe Ser Ala Asp Val Val Phe Leu 295 300 305 TTG AGA GAA ATT GTG AAA AAT AAG CAG GAA ATC GAG CCG GAC ATT TTC 1016 Leu Arg Glu He Val Lys Asn Lys Gin Glu He Glu Pro Asp He Phe 310 315 320
ATT GAA TCA GGC CGT TAT ATT TCC GCT AAC CAT GCC GTT TTA GTG GCC 1064 He Glu Ser Gly Arg Tyr He Ser Ala Asn His Ala Val Leu Val Ala 325 330 335
CCG GTG TTA GAA TTG TTT TCG CAT GAA TAC AAT GAA AAA TCC CTA AAA 1112 Pro Val Leu Glu Leu Phe Ser His Glu Tyr Asn Glu Lys Ser Leu Lys 340 345 350
ATC AAA GAA AAT AAT AAC CCC CCT TTG ATT GAT GAA ATG CTA GAC TTG 1160 He Lys Glu Asn Asn Asn Pro Pro Leu He Asp Glu Met Leu Asp Leu 355 360 365 370
CTC GCT AAT ATC AAT GAA AAA AAC GCC ATT GAA TAC TTG CAT GAT AGT 1208 Leu Ala Asn He Asn Glu Lys Asn Ala He Glu Tyr Leu His Asp Ser 375 380 385
TTT GAT CAC ACC GAG TCG CTA TTC ACG CTT TTT GAT CTG GGC TAT ATT 1256 Phe Asp His Thr Glu Ser Leu Phe Thr Leu Phe Asp Leu Gly Tyr He 390 395 400
GAT TTG ATT GAC AGG AGC AAC ACT GAA GTT TTA GCC CAT TTG ATC GTC 1304 Asp Leu He Asp Arg Ser Asn Thr Glu Val Leu Ala His Leu He Val 405 410 415
AAA AAA GCG GTG CAA TTG CTT TAT GTT AAG GAT CAT AAC GAT ATT TTA 1352 Lys Lys Ala Val Gin Leu Leu Tyr Val Lys Asp His Asn Asp He Leu 420 425 430
CGC ATT CAA GAG CAG GTC CAA GAG CGC TAT TTA TTG AAT TGC TCG TTT 1400 Arg He Gin Glu Gin Val Gin Glu Arg Tyr Leu Leu Asn Cys Ser Phe 435 440 445 450
TTC CAA AGC TTG CCG GAT TAT TGG GGC TTG AGA CAG AAT TTC CCG GTC 1448 Phe Gin Ser Leu Pro Asp Tyr Trp Gly Leu Arg Gin Asn Phe Pro Val 455 460 465
ATG CCC TTG AAT AAA TTA GAT GAA AAG CCC ACC AGG AGT GCG AGC TTG 1496 Met Pro Leu Asn Lys Leu Asp Glu Lys Pro Thr Arg Ser Ala Ser Leu 470 475 480
TGG GAT ATT ACT TGC GAT AGC GAT GGG GAA ATC GCT TTT GAT TCC ACG 1544 Trp Asp He Thr Cys Asp Ser Asp Gly Glu He Ala Phe Asp Ser Thr 485 490 495
AAG CCC TTG TTT TTG CAC GAT ATA GAT ATA GAT GAA GAA GAA TAC TTT 1592 Lys Pro Leu Phe Leu His Asp He Asp He Asp Glu Glu Glu Tyr Phe 500 505 510
TTA GCG TTC TTT TTA GTG GGA GCG TAT CAA GAA GTT TTA GGC ATG AAA 1640 Leu Ala Phe Phe Leu Val Gly Ala Tyr Gin Glu Val Leu Gly Met Lys 515 520 525 530 CAC AAT TTA TTC ACG CAC CTA CGG AAT TTA GCG TGG TTT TTG ATG AAA 1688 His Asn Leu Phe Thr His Leu Arg Asn Leu Ala Trp Phe Leu Met Lys 535 540 545
AAG GCG ATT ATG AAG TGG AAG ATA TTT GTG AAG CCC AAA CGA TTT TAGAT 1738 Lys Ala He Met Lys Trp Lys He Phe Val Lys Pro Lys Arg Phe 550 555 560
GTGCTAGACG ATTTAGACTA TGACACTAAA GAGATCGAGC GCCTTTTA 1786
(2) INFORMATION FOR SEQ ID NO: 704:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 561 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 704:
Met Gin Glu Val His Asp Tyr Gly He Lys Phe Trp Ser Asn Asn Glu
1 5 10 15
Phe Lys He Glu Lys Gly Leu Val Lys Val Cys His Gly Lys Asn Pro
20 25 30
Ser Leu Leu Glu He Val Gin Ser Val Arg Asp Lys Gly Tyr Arg Gly
35 40 45
Pro Leu Leu Val Arg Phe Pro His Leu Val Gin Lys Gin He Lys Ser
50 55 60
Leu Phe Asp Ala Phe Ser Ser Ala He Lys Glu Tyr Gin Tyr Ser Gly 65 70 75 80
Ala Phe Lys Ala Val Phe Pro Leu Lys Val Asn Gin Met Pro Ser Phe
85 90 95
Val Phe Pro Leu Val Gin Gly Ala Lys Gly Leu Asn Tyr Gly Leu Glu
100 105 110
Ala Gly Ser Lys Ser Glu Leu He He Ala Met Ser Tyr Thr Asn Pro
115 120 125
Lys Ala Pro He Thr Val Asn Gly Phe Lys Asp Lys Glu Met He Glu
130 135 140
Leu Gly Phe He Ala Lys Ser Met Gin His Glu He Thr Leu Thr He 145 150 155 160
Glu Gly Leu Asn Glu Leu Lys Thr He He Ala Val Ala Lys Gin Asn
165 170 175
Glu Phe Leu Ala Cys Pro Lys He Gly He Arg He Arg Leu His Ser
180 185 190
Thr Gly Thr Gly Val Trp Ala Lys Ser Gly Gly He Asn Ser Lys Phe
195 200 205
Gly Leu Ser Ser Thr Glu Val Leu Glu Ala Met Arg Leu Leu Glu Glu
210 215 220
Asn Asp Leu Leu Glu His Phe His Met He His Phe His He Gly Ser 225 230 235 240
Gin He Ser Asp He Ser Pro Leu Lys Lys Ala Leu Arg Glu Ala Gly 245 250 255 Asn Leu Tyr Ala Glu Leu Arg Lys Met Gly Ala Lys Asn Leu Asn Ser
260 265 270
Val Asn He Gly Gly Gly Leu Ala Val Glu Tyr Thr Gin His Lys His
275 280 285
His Gin Asp Lys Asn Tyr Thr Leu Glu Glu Phe Ser Ala Asp Val Val
290 295 300
Phe Leu Leu Arg Glu He Val Lys Asn Lys Gin Glu He Glu Pro Asp 305 310 315 320
He Phe He Glu Ser Gly Arg Tyr He Ser Ala Asn His Ala Val Leu
325 330 335
Val Ala Pro Val Leu Glu Leu Phe Ser His Glu Tyr Asn Glu Lys Ser
340 345 350
Leu Lys He Lys Glu Asn Asn Asn Pro Pro Leu He Asp Glu Met Leu
355 360 365
Asp Leu Leu Ala Asn He Asn Glu Lys Asn Ala He Glu Tyr Leu His
370 375 380
Asp Ser Phe Asp His Thr Glu Ser Leu Phe Thr Leu Phe Asp Leu Gly 385 390 395 400
Tyr He Asp Leu He Asp Arg Ser Asn Thr Glu Val Leu Ala His Leu
405 410 415
He Val Lys Lys Ala Val Gin Leu Leu Tyr Val Lys Asp His Asn Asp
420 425 430
He Leu Arg He Gin Glu Gin Val Gin Glu Arg Tyr Leu Leu Asn Cys
435 440 445
Ser Phe Phe Gin Ser Leu Pro Asp Tyr Trp Gly Leu Arg Gin Asn Phe
450 455 460
Pro Val Met Pro Leu Asn Lys Leu Asp Glu Lys Pro Thr Arg Ser Ala 465 470 475 480
Ser Leu Trp Asp He Thr Cys Asp Ser Asp Gly Glu He Ala Phe Asp
485 490 495
Ser Thr Lys Pro Leu Phe Leu His Asp He Asp He Asp Glu Glu Glu
500 505 510
Tyr Phe Leu Ala Phe Phe Leu Val Gly Ala Tyr Gin Glu Val Leu Gly
515 520 525
Met Lys His Asn Leu Phe Thr His Leu Arg Asn Leu Ala Trp Phe Leu
530 535 540
Met Lys Lys Ala He Met Lys Trp Lys He Phe Val Lys Pro Lys Arg 545 550 555 560
Phe
(2) INFORMATION FOR SEQ ID NO: 705:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 676 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...623 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 705:
TTAAAGAGGA TAAAGAGAAT AGTGATAGCG ATAATGATAC TGCAGACACG ATG GAT 56
Met Asp 1
GAA GTC TTA AAA GAG ATT TTA TCA AGT TAT CAA AAA AGA GCT TTA AAA 104 Glu Val Leu Lys Glu He Leu Ser Ser Tyr Gin Lys Arg Ala Leu Lys 5 10 15
TTA ACC AAA AGA GTT AGA AAG AAG ATT TTT AAG AAT GAT CCC ACA GAA 152 Leu Thr Lys Arg Val Arg Lys Lys He Phe Lys Asn Asp Pro Thr Glu 20 25 30
AAT CAA AAA AAA GCC ATA AAG ATC GCT CTA AAT ACC CCT GAT ATT GCT 200 Asn Gin Lys Lys Ala He Lys He Ala Leu Asn Thr Pro Asp He Ala 35 40 45 50
ATT ATC CAA GGG CCT CCT GGA ACG GGC AAA ACC ACT GTG ATC AAT GCC 248 He He Gin Gly Pro Pro Gly Thr Gly Lys Thr Thr Val He Asn Ala 55 60 65
ATT TGT GAG AGA TTG TTT GAA GAA TAC CCT AAG GAT AAA AAT ATC AAG 296 He Cys Glu Arg Leu Phe Glu Glu Tyr Pro Lys Asp Lys Asn He Lys 70 75 80
GGG CAA ATT TTA CTG TGC GCT CAA GGG CAT GAT GCG ACT AAC AAT GCG 344 Gly Gin He Leu Leu Cys Ala Gin Gly His Asp Ala Thr Asn Asn Ala 85 90 95
CGT GAG CGC ATC AAA GTA GGG GGA TTG CCC ACT TTT AAA TTT GGT GCT 392 Arg Glu Arg He Lys Val Gly Gly Leu Pro Thr Phe Lys Phe Gly Ala 100 105 110
AAA AAA AAT GCT AAA GAA GAA CAA TAC AAG CAA GAT GAA AGA TTG AAT 440 Lys Lys Asn Ala Lys Glu Glu Gin Tyr Lys Gin Asp Glu Arg Leu Asn 115 120 125 130
GAG CGA TTG AGA GAG TTT GCT GAA ACG CTC ATA GAA AGC GTG AGA AAA 488 Glu Arg Leu Arg Glu Phe Ala Glu Thr Leu He Glu Ser Val Arg Lys 135 140 145
AAA CTG CAA AAA TTA GGG GAT TAT GAA AAT ATA GAA AAA ATT TTG GAT 536 Lys Leu Gin Lys Leu Gly Asp Tyr Glu Asn He Glu Lys He Leu Asp 150 155 160
TTA GAA GAA GCC CTT AGA CGC TAC TAT AGT TCG CCT ATC AGT GAA TTG 584 Leu Glu Glu Ala Leu Arg Arg Tyr Tyr Ser Ser Pro He Ser Glu Leu 165 170 175
GAA TTT TTA AAA GAA ATA GAA AAA AAT GAG AGC TTT TTT TAATTCTTCT AT 635 Glu Phe Leu Lys Glu He Glu Lys Asn Glu Ser Phe Phe 180 185 190
GCGTGAAAAG CTAAGCCAAT TAAAAGCAAG GCAGCAAAAA C 676
(2) INFORMATION FOR SEQ ID NO: 706:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 191 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706:
Met Asp Glu Val Leu Lys Glu He Leu Ser Ser Tyr Gin Lys Arg Ala
1 5 10 15
Leu Lys Leu Thr Lys Arg Val Arg Lys Lys He Phe Lys Asn Asp Pro
20 25 30
Thr Glu Asn Gin Lys Lys Ala He Lys He Ala Leu Asn Thr Pro Asp
35 40 45
He Ala He He Gin Gly Pro Pro Gly Thr Gly Lys Thr Thr Val He
50 55 60
Asn Ala He Cys Glu Arg Leu Phe Glu Glu Tyr Pro Lys Asp Lys Asn 65 70 75 80
He Lys Gly Gin He Leu Leu Cys Ala Gin Gly His Asp Ala Thr Asn
85 90 95
Asn Ala Arg Glu Arg He Lys Val Gly Gly Leu Pro Thr Phe Lys Phe
100 105 110
Gly Ala Lys Lys Asn Ala Lys Glu Glu Gin Tyr Lys Gin Asp Glu Arg
115 120 125
Leu Asn Glu Arg Leu Arg Glu Phe Ala Glu Thr Leu He Glu Ser Val
130 135 140
Arg Lys Lys Leu Gin Lys Leu Gly Asp Tyr Glu Asn He Glu Lys He 145 150 155 160
Leu Asp Leu Glu Glu Ala Leu Arg Arg Tyr Tyr Ser Ser Pro He Ser
165 170 175
Glu Leu Glu Phe Leu Lys Glu He Glu Lys Asn Glu Ser Phe Phe 180 185 190
(2) INFORMATION FOR SEQ ID NO: 707:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 913 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence (B) LOCATION: 86...862 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707:
ATTTCATATT TTCTTTTAGG GGCATTCTTT AGAGAATGCA ATTATTTTCT AATCTCTAAG 60 AAATCAATAT CTAGGAATTA GGACC ATG CAA GAA AGA GTT TTT AAA AGA AAA 112
Met Gin Glu Arg Val Phe Lys Arg Lys 1 5
GTT TTA GAT GCG AAT ATC TTA AAA GAA ATG CAT GCG AAC AAT GTC TGT 160 Val Leu Asp Ala Asn He Leu Lys Glu Met His Ala Asn Asn Val Cys 10 15 20 25
TAT TCC AAG CAT TCA AAA GAT AGG TTT ATT CCT TTC AAA TTT GAT AAA 208 Tyr Ser Lys His Ser Lys Asp Arg Phe He Pro Phe Lys Phe Asp Lys 30 35 40
TTT GGT TAT GTT GGA TGT AAA CTT TTT AAA AAG ATA TTA AAC TTT CCT 256 Phe Gly Tyr Val Gly Cys Lys Leu Phe Lys Lys He Leu Asn Phe Pro 45 50 55
AGC AAT ACA ACT TTC TTT GGT GGC ACA GGT TGT AAG AAA CTC ATG GAA 304 Ser Asn Thr Thr Phe Phe Gly Gly Thr Gly Cys Lys Lys Leu Met Glu 60 65 70
CTT TTA AGT GAA ATC GTT ATA GAT TCT AGA AGT TCT AAA ATT GCG TTA 352 Leu Leu Ser Glu He Val He Asp Ser Arg Ser Ser Lys He Ala Leu 75 80 85
AAC CGC CAT TAT GCC TTA ACT CGC TTG CAA TGG TGC GAT AGA ACC TTA 400 Asn Arg His Tyr Ala Leu Thr Arg Leu Gin Trp Cys Asp Arg Thr Leu 90 95 100 105
AGA CAT AAT CTC CAA ATT TTA GAG AGA ATA GGA TTT CTA ACT GCT TTT 448 Arg His Asn Leu Gin He Leu Glu Arg He Gly Phe Leu Thr Ala Phe 110 115 120
AAG AAC AAA AAA GGT TAT ATT TTT TTG TCT ATG CAT GAC TTC ACT AAA 496 Lys Asn Lys Lys Gly Tyr He Phe Leu Ser Met His Asp Phe Thr Lys 125 130 135
ATA GAA AAC TAC GAA CAT TCA GGT TTG AAT GGG GAG AGC AAT TTA CCT 544 He Glu Asn Tyr Glu His Ser Gly Leu Asn Gly Glu Ser Asn Leu Pro 140 145 150
AAT AGC TTC TTT TTA GGA ATT TGT GGG TAT TTG AAA AAA CTC TTC AAG 592 Asn Ser Phe Phe Leu Gly He Cys Gly Tyr Leu Lys Lys Leu Phe Lys 155 160 165
AAA TTA AAA GAT AGA GCA TTC AGG CTC GCA AAC AAG CAC GGT GTA TTC 640 Lys Leu Lys Asp Arg Ala Phe Arg Leu Ala Asn Lys His Gly Val Phe 170 175 180 185
TTT TTG AAA ATT CCT AAG CAT TTT CAA ATG CAA AAC TTT AAC AAT ATT 688 Phe Leu Lys He Pro Lys His Phe Gin Met Gin Asn Phe Asn Asn He 190 195 200
TTT TTG GAG TTT GTG TCG GTT AAT AAT CCT TGT TTT TCT TAT AGA TTG 736 Phe Leu Glu Phe Val Ser Val Asn Asn Pro Cys Phe Ser Tyr Arg Leu 205 210 215
ACT TAT GAT CAA CTT GTT GGT AAA AAA ATT CCA AAT ATC AAG TGC TCT 784 Thr Tyr Asp Gin Leu Val Gly Lys Lys He Pro Asn He Lys Cys Ser 220 225 230
TAC CAA CAA GCA ATT GTA AAA AAG AAT ATC CAT AGA GCA TTA GAT GAA 832 Tyr Gin Gin Ala He Val Lys Lys Asn He His Arg Ala Leu Asp Glu 235 240 245
CTA TCT ATA GAT AAG GAA ATT TTA GCA TCA TAAAGAAGAC AAAGGATAAA AAT 885 Leu Ser He Asp Lys Glu He Leu Ala Ser 250 255
GCAATTCCCA CTCAAAAAAG ATTTAAGA 913
(2) INFORMATION FOR SEQ ID NO: 08:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 259 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 708:
Met Gin Glu Arg Val Phe Lys Arg Lys Val Leu Asp Ala Asn He Leu
1 5 10 15
Lys Glu Met His Ala Asn Asn Val Cys Tyr Ser Lys His Ser Lys Asp
20 25 30
Arg Phe He Pro Phe Lys Phe Asp Lys Phe Gly Tyr Val Gly Cys Lys
35 40 45
Leu Phe Lys Lys He Leu Asn Phe Pro Ser Asn Thr Thr Phe Phe Gly
50 55 60
Gly Thr Gly Cys Lys Lys Leu Met Glu Leu Leu Ser Glu He Val He 65 70 75 80
Asp Ser Arg Ser Ser Lys He Ala Leu Asn Arg His Tyr Ala Leu Thr
85 90 95
Arg Leu Gin Trp Cys Asp Arg Thr Leu Arg His Asn Leu Gin He Leu
100 105 110
Glu Arg He Gly Phe Leu Thr Ala Phe Lys Asn Lys Lys Gly Tyr He
115 120 125
Phe Leu Ser Met His Asp Phe Thr Lys He Glu Asn Tyr Glu His Ser 130 135 140 Gly Leu Asn Gly Glu Ser Asn Leu Pro Asn Ser Phe Phe Leu Gly He 145 150 155 160
Cys Gly Tyr Leu Lys Lys Leu Phe Lys Lys Leu Lys Asp Arg Ala Phe
165 170 175
Arg Leu Ala Asn Lys His Gly Val Phe Phe Leu Lys He Pro Lys His
180 185 190
Phe Gin Met Gin Asn Phe Asn Asn He Phe Leu Glu Phe Val Ser Val
195 200 205
Asn Asn Pro Cys Phe Ser Tyr Arg Leu Thr Tyr Asp Gin Leu Val Gly
210 215 220
Lys Lys He Pro Asn He Lys Cys Ser Tyr Gin Gin Ala He Val Lys 225 230 235 240
Lys Asn He His Arg Ala Leu Asp Glu Leu Ser He Asp Lys Glu He
245 250 255
Leu Ala Ser
(2) INFORMATION FOR SEQ ID NO: 709:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3166 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...3113 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:709:
ACTATTTGAA TAATGTCGAA TATGAAAAAC TTCTTAAAAA GTAGCATGCA ATG GAT 56
Met Asp 1
CTA GAA GAA CTC TAT GCG CCT AAT CAC ATA GAG CGT TTG AAA GCG CGG 104 Leu Glu Glu Leu Tyr Ala Pro Asn His He Glu Arg Leu Lys Ala Arg 5 10 15
AGT TTT TTA AGA TCG ATT GCT TTT TTT GAT GAT TTT AGC GCT TCT TTT 152 Ser Phe Leu Arg Ser He Ala Phe Phe Asp Asp Phe Ser Ala Ser Phe 20 25 30
GAA TAC AGA GAT CTA TTT AGC GTT TTG GAA AAT ATC GTG CAA TTT GAT 200 Glu Tyr Arg Asp Leu Phe Ser Val Leu Glu Asn He Val Gin Phe Asp 35 40 45 50
TAT GAA AAA AAG CCG TAT AAA GAT GAT TTG TAT TTT TTG TGC AAA TTT 248 Tyr Glu Lys Lys Pro Tyr Lys Asp Asp Leu Tyr Phe Leu Cys Lys Phe 55 60 65
GTG GAG CCA GCC CTA AAG GCT ATC TTT AGC AAT CTA AAT ACC AAT ATC 296 Val Glu Pro Ala Leu Lys Ala He Phe Ser Asn Leu Asn Thr Asn He 70 75 80
TAC CGA AAA CAT TTA AAA ATG CCT TTA GAA AAG GCT AGG GAA TTT GAC 344 Tyr Arg Lys His Leu Lys Met Pro Leu Glu Lys Ala Arg Glu Phe Asp 85 90 95
GCT AAA TGC GCG TTG GAT TTA GCC AAG CGA CCA GGT CGT AGT TTG AAA 392 Ala Lys Cys Ala Leu Asp Leu Ala Lys Arg Pro Gly Arg Ser Leu Lys 100 105 110
GAA AAG TTG TGC GAC AAT AAA GTA TTG AGC GTC AAG CGT TAT GTG AAT 440 Glu Lys Leu Cys Asp Asn Lys Val Leu Ser Val Lys Arg Tyr Val Asn 115 120 125 130
GCC AAT ACG CAT GAA AAC AGG TTT CTC AAG CGT TTC ATT AAA GAA CTT 488 Ala Asn Thr His Glu Asn Arg Phe Leu Lys Arg Phe He Lys Glu Leu 135 140 145
TTA AGA ATA ATT CAT TGG CGC GAG ATA GAA TTC CAA CAG GTT TTT GAA 536 Leu Arg He He His Trp Arg Glu He Glu Phe Gin Gin Val Phe Glu 150 155 160
GAG TTA ATT TTC AGC ATA ACA AGT TTT TTA AAG AAT GGA GTA GCC CAA 584 Glu Leu He Phe Ser He Thr Ser Phe Leu Lys Asn Gly Val Ala Gin 165 170 175
CAA ATT GAT GAA AAA CAA GCC ATC ATT CCT AAT AAC TTG TTG CAT TTT 632 Gin He Asp Glu Lys Gin Ala He He Pro Asn Asn Leu Leu His Phe 180 185 190
GAT AAG CAC TAC AAA CGC ATT TTT AAA GCC CAT GAT TGG CTT TAT GAT 680 Asp Lys His Tyr Lys Arg He Phe Lys Ala His Asp Trp Leu Tyr Asp 195 200 205 210
GGT GTG GGG TCA TTG ATG AAT TTG GAT CAA ATT TTC TAT TTG GAG TGT 728 Gly Val Gly Ser Leu Met Asn Leu Asp Gin He Phe Tyr Leu Glu Cys 215 220 225
TTA TAC CAA GCC CAA TTT TAT ACT TCT AAA AAC ATT GAA CCC ACG CTA 776 Leu Tyr Gin Ala Gin Phe Tyr Thr Ser Lys Asn He Glu Pro Thr Leu 230 235 240
ATT AGA AAT GAA CAA GAT TTA TAC GCG CTA ATT AAA AAT AGT TTT CCA 824 He Arg Asn Glu Gin Asp Leu Tyr Ala Leu He Lys Asn Ser Phe Pro 245 250 255
ATA AAA GAT TTA TCG TTT GAA AAG ATG CGT TTA AAA GCG AAA GAG TTT 872 He Lys Asp Leu Ser Phe Glu Lys Met Arg Leu Lys Ala Lys Glu Phe 260 265 270
TTT GAA AAT GAA TTA AGA CAG CCT ATA AAT TTA GAT CAA GAA ATT CCG 920 Phe Glu Asn Glu Leu Arg Gin Pro He Asn Leu Asp Gin Glu He Pro 275 280 285 290
CAA TTG GAA TTG TGT AAG GGA GTT TAT AAA GAA ATG TAT ATT GAT ATG 968 Gin Leu Glu Leu Cys Lys Gly Val Tyr Lys Glu Met Tyr He Asp Met 295 300 305
TTT AGC CCT GAA CCT TTC GCT TTG TTA GTG GGT AAT GGC AAT GAA GAA 1016 Phe Ser Pro Glu Pro Phe Ala Leu Leu Val Gly Asn Gly Asn Glu Glu 310 315 320
AAG ATT TTA AAG CTC CCC CTT TTA GTC AAA AAG CAG GAG AAT AAT ACT 1064 Lys He Leu Lys Leu Pro Leu Leu Val Lys Lys Gin Glu Asn Asn Thr 325 330 335
TAT ATC AAC GCT AAT GGC GCT AAG GGT AAG ATA GAT GAA AAA GGT TAT 1112 Tyr He Asn Ala Asn Gly Ala Lys Gly Lys He Asp Glu Lys Gly Tyr 340 345 350
TTG GCC AAC GCT CTC AAA AAC TAT GAT GAG ACT CTT GTG GAA GCT TTT 1160 Leu Ala Asn Ala Leu Lys Asn Tyr Asp Glu Thr Leu Val Glu Ala Phe 355 360 365 370
ATG AGA GAT TTC AAG GAA CGC TAT AAG ATA GAA AAA CTA TAT TAT TTA 1208 Met Arg Asp Phe Lys Glu Arg Tyr Lys He Glu Lys Leu Tyr Tyr Leu 375 380 385
TTA GAT GAT AAT ATT AAA AAT TTT GAA TTT GCT AAG ATC AAG CAT AAA 1256 Leu Asp Asp Asn He Lys Asn Phe Glu Phe Ala Lys He Lys His Lys 390 395 400
ATA AGC TTG TAT TTT AAA GAC GCA AAA TTC TAT CCT AAA AGC GTT GCT 1304 He Ser Leu Tyr Phe Lys Asp Ala Lys Phe Tyr Pro Lys Ser Val Ala 405 410 415
TTA GGA TTT AGT TCT TTG TTT GAA AAT AAA TTA AAG AAA AAT GAG CGT 1352 Leu Gly Phe Ser Ser Leu Phe Glu Asn Lys Leu Lys Lys Asn Glu Arg 420 425 430
TTG CGT TAT AAC AGC GTG GAT TTG GTC GTT AAA GAA AAC CAT AAA AGT 1400 Leu Arg Tyr Asn Ser Val Asp Leu Val Val Lys Glu Asn His Lys Ser 435 440 445 450
AAG ACC TTT AAT GAT TGT GGC TTG GTT TTG GAG AGG CAA AAA AGC GAT 1448 Lys Thr Phe Asn Asp Cys Gly Leu Val Leu Glu Arg Gin Lys Ser Asp 455 460 465
GAT TCA AAA GAG TTC CTT ATT CTA CAA GAT TCT TTT ATC AAA AAA GCT 1496 Asp Ser Lys Glu Phe Leu He Leu Gin Asp Ser Phe He Lys Lys Ala 470 475 480
TTA AAA AAT TTT AAA AGA GCC TTA GGA TTA GAA AAA GAA GGC TTT ATT 1544 Leu Lys Asn Phe Lys Arg Ala Leu Gly Leu Glu Lys Glu Gly Phe He 485 490 495 CTG TAT AAA GAA TGC TTG CCT AAG CTC TCT ATG GAA GTG GTT AAA GAC 1592 Leu Tyr Lys Glu Cys Leu Pro Lys Leu Ser Met Glu Val Val Lys Asp 500 505 510
GGG CGG TTT AAA AAT TTT GAG ATC ATT AAA GAT AAA ACC ATT TTA GGA 1640 Gly Arg Phe Lys Asn Phe Glu He He Lys Asp Lys Thr He Leu Gly 515 520 525 530
GAT AAA GAA ACC CTA GAG ATT GAA ACG CCT TTT ATT ATC CCT AAA GGG 1688 Asp Lys Glu Thr Leu Glu He Glu Thr Pro Phe He He Pro Lys Gly 535 540 545
CGA GAA AGT TTT GCT TTG CCC TTG ATC CTA AAT GAA GAA AAA ATC GCC 1736 Arg Glu Ser Phe Ala Leu Pro Leu He Leu Asn Glu Glu Lys He Ala 550 555 560
TAT CAA GGT AAA ATC ACC TCT AAA GAT TTT CCC CTA GAA AAT GAC GAA 1784 Tyr Gin Gly Lys He Thr Ser Lys Asp Phe Pro Leu Glu Asn Asp Glu 565 570 575
GAA TAC AAA CTC ACG CTC ACT TAT GAC ATT GGC ACC GAG TTT AAC TAT 1832 Glu Tyr Lys Leu Thr Leu Thr Tyr Asp He Gly Thr Glu Phe Asn Tyr 580 585 590
GTG TTA GAG TTT AAA CCT GTC AAT AAT GAT TTA AAG CCC ATT GTC ATG 1880 Val Leu Glu Phe Lys Pro Val Asn Asn Asp Leu Lys Pro He Val Met 595 600 605 610
GAA TGG CAG CGT ATT GAT AGG GTT GAA CTC CCT ACG CCC GAT TCC ATC 1928 Glu Trp Gin Arg He Asp Arg Val Glu Leu Pro Thr Pro Asp Ser He 615 620 625
AAA AAA CCA AGT ATT GAT GAA CTA AAA AAT GAC TTT AAT CCT AAA AGG 1976 Lys Lys Pro Ser He Asp Glu Leu Lys Asn Asp Phe Asn Pro Lys Arg 630 635 640
GGC AAA AGT TCT GAT TTG TTT GAG TGG GCG CTA GAG CAA TTA GAG ACA 2024 Gly Lys Ser Ser Asp Leu Phe Glu Trp Ala Leu Glu Gin Leu Glu Thr 645 650 655
TTG AAA GAT TTA AAT AGT CCA CCC AGA TTT GTT TTA GAG AAA AAA CTA 2072 Leu Lys Asp Leu Asn Ser Pro Pro Arg Phe Val Leu Glu Lys Lys Leu 660 665 670
GAA TGC GGT GGA ATC TCA ATA ATA GGG GAA GAT AGA AAC AAT GAA CTT 2120 Glu Cys Gly Gly He Ser He He Gly Glu Asp Arg Asn Asn Glu Leu 675 680 685 690
TTT TAC ATA ATG GAA ACA AAT GGT AAA AAA GTT TTT TGT CAT AGC CGT 2168 Phe Tyr He Met Glu Thr Asn Gly Lys Lys Val Phe Cys His Ser Arg 695 700 705
CAA TGC AAA GGG AGC GTG AAC AAA GAT GAG CTT TCA TTA GGC GCG CGA 2216 Gin Cys Lys Gly Ser Val Asn Lys Asp Glu Leu Ser Leu Gly Ala Arg 710 715 720 GTG TGT TTG GAA GTG GGG CCA GAT AAG AAC GAC CAT GGT AAA TAT CGA 2264 Val Cys Leu Glu Val Gly Pro Asp Lys Asn Asp His Gly Lys Tyr Arg 725 730 735
GGT AAA ATT TAT GGT TTG GAA AAA AAT AGA GAA ATT GTT TTA TTA AAT 2312 Gly Lys He Tyr Gly Leu Glu Lys Asn Arg Glu He Val Leu Leu Asn 740 745 750
ACA GCT AAA AAT TCT TAT CAA AGA AAA CCT CTA GAT GAG AAA ATT AAA 2360 Thr Ala Lys Asn Ser Tyr Gin Arg Lys Pro Leu Asp Glu Lys He Lys 755 760 765 770
CAC AGA ATA GAA GCG CTC AAA AGA ATC AAG TAT CCT TGT TTA AAA ATT 2408 His Arg He Glu Ala Leu Lys Arg He Lys Tyr Pro Cys Leu Lys He 775 780 785
TTT TCA CAT TAC ATG CTT GAA GAG TTA GAA ACC TTA AAT CCT GAA TTT 2456 Phe Ser His Tyr Met Leu Glu Glu Leu Glu Thr Leu Asn Pro Glu Phe 790 795 800
GCT ACT CCC TTT AAA GAA TAT TTG AAG CGG TTA GAA GAA TAT TAT TTT 2504 Ala Thr Pro Phe Lys Glu Tyr Leu Lys Arg Leu Glu Glu Tyr Tyr Phe 805 810 815
GAC CCA CAA ACA GAC AGA GAT TTT AAA AAA GGA CTC TTG GAT TTC TTT 2552 Asp Pro Gin Thr Asp Arg Asp Phe Lys Lys Gly Leu Leu Asp Phe Phe 820 825 830
AGC CGC TTG AAT GAT AGT ATT CCC GCA AAA TTA CAA CAA GAA TTT ATT 2600 Ser Arg Leu Asn Asp Ser He Pro Ala Lys Leu Gin Gin Glu Phe He 835 840 845 850
AAT TTA CCT TCT ACG GAT TTT TTA AGC AGA TGT TTA GGC TCT CTT GAA 2648 Asn Leu Pro Ser Thr Asp Phe Leu Ser Arg Cys Leu Gly Ser Leu Glu 855 860 865
AAA GAC TTT CAA AAA ACG ATT TTT AAG AAG CTT AAA GTT ACT AAC CTA 2696 Lys Asp Phe Gin Lys Thr He Phe Lys Lys Leu Lys Val Thr Asn Leu 870 875 880
AAG ACT TTA AGT ATT GTG GCT AGG GCT AGT TGG AAT AAT GAG AAA TTT 2744 Lys Thr Leu Ser He Val Ala Arg Ala Ser Trp Asn Asn Glu Lys Phe 885 890 895
TTA GAG AAC TTG ATG GCT CAA ACC AGC TTG GAG CAG CAA AAA GAC TTT 2792 Leu Glu Asn Leu Met Ala Gin Thr Ser Leu Glu Gin Gin Lys Asp Phe 900 905 910
TTG AAG CGT ATA GAA GAG TGT TTG AAA AAT CCT GAG TCA TTT TAT TTC 2840 Leu Lys Arg He Glu Glu Cys Leu Lys Asn Pro Glu Ser Phe Tyr Phe 915 920 925 930
AGT AGC GCA TGC GAA TTG CTG TTA GCG TTT TTG TCT TAT CGC AAC GCT 2888 Ser Ser Ala Cys Glu Leu Leu Leu Ala Phe Leu Ser Tyr Arg Asn Ala 935 940 945 AAA AGA GAG TTG GAA TTG ATC CCT GAA AGC GAA AAA ACC ATG CGT TTA 2936 Lys Arg Glu Leu Glu Leu He Pro Glu Ser Glu Lys Thr Met Arg Leu 950 955 960
TTG GAC AGC ATA GAT AAA GCG ATA GAA AAA GAG ACT GAA ATT AAA AGC 2984 Leu Asp Ser He Asp Lys Ala He Glu Lys Glu Thr Glu He Lys Ser 965 970 975
TTT GTA AAA TTA GAG CTA AAA AAT CAA AGC TTC AAC AAT ATC CCA CCT 3032 Phe Val Lys Leu Glu Leu Lys Asn Gin Ser Phe Asn Asn He Pro Pro 980 985 990
TTG TTG TCG GCG TTA CGC TTG TAT TTA AGG GGG GAT TTG GAA GGT GTT 3080 Leu Leu Ser Ala Leu Arg Leu Tyr Leu Arg Gly Asp Leu Glu Gly Val 995 1000 1005 1010
GGA ATT GAA ATT AAT GGG ACA GAA GAG GAT GAA TAAATCAAAC AAATTAGTCA 3133 Gly He Glu He Asn Gly Thr Glu Glu Asp Glu 1015 1020
TTATCAATCG CGCCATTCCA GGTGGGGGCA AGA 3166
(2) INFORMATION FOR SEQ ID NO: 710:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1021 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 710:
Met Asp Leu Glu Glu Leu Tyr Ala Pro Asn His He Glu Arg Leu Lys
1 5 10 15
Ala Arg Ser Phe Leu Arg Ser He Ala Phe Phe Asp Asp Phe Ser Ala
20 25 30
Ser Phe Glu Tyr Arg Asp Leu Phe Ser Val Leu Glu Asn He Val Gin
35 40 45
Phe Asp Tyr Glu Lys Lys Pro Tyr Lys Asp Asp Leu Tyr Phe Leu Cys
50 55 60
Lys Phe Val Glu Pro Ala Leu Lys Ala He Phe Ser Asn Leu Asn Thr 65 70 75 80
Asn He Tyr Arg Lys His Leu Lys Met Pro Leu Glu Lys Ala Arg Glu
85 90 95
Phe Asp Ala Lys Cys Ala Leu Asp Leu Ala Lys Arg Pro Gly Arg Ser
100 105 110
Leu Lys Glu Lys Leu Cys Asp Asn Lys Val Leu Ser Val Lys Arg Tyr
115 120 125
Val Asn Ala Asn Thr His Glu Asn Arg Phe Leu Lys Arg Phe He Lys
130 135 140
Glu Leu Leu Arg He He His Trp Arg Glu He Glu Phe Gin Gin Val 145 150 155 160 Phe Glu Glu Leu He Phe Ser He Thr Ser Phe Leu Lys Asn Gly Val
165 170 175
Ala Gin Gin He Asp Glu Lys Gin Ala He He Pro Asn Asn Leu Leu
180 185 190
His Phe Asp Lys His Tyr Lys Arg He Phe Lys Ala His Asp Trp Leu
195 200 205
Tyr Asp Gly Val Gly Ser Leu Met Asn Leu Asp Gin He Phe Tyr Leu
210 215 220
Glu Cys Leu Tyr Gin Ala Gin Phe Tyr Thr Ser Lys Asn He Glu Pro 225 230 235 240
Thr Leu He Arg Asn Glu Gin Asp Leu Tyr Ala Leu He Lys Asn Ser
245 250 255
Phe Pro He Lys Asp Leu Ser Phe Glu Lys Met Arg Leu Lys Ala Lys
260 265 270
Glu Phe Phe Glu Asn Glu Leu Arg Gin Pro He Asn Leu Asp Gin Glu
275 280 285
He Pro Gin Leu Glu Leu Cys Lys Gly Val Tyr Lys Glu Met Tyr He
290 295 300
Asp Met Phe Ser Pro Glu Pro Phe Ala Leu Leu Val Gly Asn Gly Asn 305 310 315 320
Glu Glu Lys He Leu Lys Leu Pro Leu Leu Val Lys Lys Gin Glu Asn
325 330 335
Asn Thr Tyr He Asn Ala Asn Gly Ala Lys Gly Lys He Asp Glu Lys
340 345 350
Gly Tyr Leu Ala Asn Ala Leu Lys Asn Tyr Asp Glu Thr Leu Val Glu
355 360 365
Ala Phe Met Arg Asp Phe Lys Glu Arg Tyr Lys He Glu Lys Leu Tyr
370 375 380
Tyr Leu Leu Asp Asp Asn He Lys Asn Phe Glu Phe Ala Lys He Lys 385 390 395 400
His Lys He Ser Leu Tyr Phe Lys Asp Ala Lys Phe Tyr Pro Lys Ser
405 410 415
Val Ala Leu Gly Phe Ser Ser Leu Phe Glu Asn Lys Leu Lys Lys Asn
420 425 430
Glu Arg Leu Arg Tyr Asn Ser Val Asp Leu Val Val Lys Glu Asn His
435 440 445
Lys Ser Lys Thr Phe Asn Asp Cys Gly Leu Val Leu Glu Arg Gin Lys
450 455 460
Ser Asp Asp Ser Lys Glu Phe Leu He Leu Gin Asp Ser Phe He Lys 465 470 475 480
Lys Ala Leu Lys Asn Phe Lys Arg Ala Leu Gly Leu Glu Lys Glu Gly
485 490 495
Phe He Leu Tyr Lys Glu Cys Leu Pro Lys Leu Ser Met Glu Val Val
500 505 510
Lys Asp Gly Arg Phe Lys Asn Phe Glu He He Lys Asp Lys Thr He
515 520 525
Leu Gly Asp Lys Glu Thr Leu Glu He Glu Thr Pro Phe He He Pro
530 535 540
Lys Gly Arg Glu Ser Phe Ala Leu Pro Leu He Leu Asn Glu Glu Lys 545 550 555 560
He Ala Tyr Gin Gly Lys He Thr Ser Lys Asp Phe Pro Leu Glu Asn
565 570 575
Asp Glu Glu Tyr Lys Leu Thr Leu Thr Tyr Asp He Gly Thr Glu Phe
580 585 590
Asn Tyr Val Leu Glu Phe Lys Pro Val Asn Asn Asp Leu Lys Pro He 595 600 605
Val Met Glu Trp Gin Arg He Asp Arg Val Glu Leu Pro Thr Pro Asp
610 615 620
Ser He Lys Lys Pro Ser He Asp Glu Leu Lys Asn Asp Phe Asn Pro 625 630 635 640
Lys Arg Gly Lys Ser Ser Asp Leu Phe Glu Trp Ala Leu Glu Gin Leu
645 650 655
Glu Thr Leu Lys Asp Leu Asn Ser Pro Pro Arg Phe Val Leu Glu Lys
660 665 670
Lys Leu Glu Cys Gly Gly He Ser He He Gly Glu Asp Arg Asn Asn
675 680 685
Glu Leu Phe Tyr He Met Glu Thr Asn Gly Lys Lys Val Phe Cys His
690 695 700
Ser Arg Gin Cys Lys Gly Ser Val Asn Lys Asp Glu Leu Ser Leu Gly 705 710 715 720
Ala Arg Val Cys Leu Glu Val Gly Pro Asp Lys Asn Asp His Gly Lys
725 730 735
Tyr Arg Gly Lys He Tyr Gly Leu Glu Lys Asn Arg Glu He Val Leu
740 745 750
Leu Asn Thr Ala Lys Asn Ser Tyr Gin Arg Lys Pro Leu Asp Glu Lys
755 760 765
He Lys His Arg lie Glu Ala Leu Lys Arg He Lys Tyr Pro Cys Leu
770 775 780
Lys He Phe Ser His Tyr Met Leu Glu Glu Leu Glu Thr Leu Asn Pro 785 790 795 800
Glu Phe Ala Thr Pro Phe Lys Glu Tyr Leu Lys Arg Leu Glu Glu Tyr
805 810 815
Tyr Phe Asp Pro Gin Thr Asp Arg Asp Phe Lys Lys Gly Leu Leu Asp
820 825 830
Phe Phe Ser Arg Leu Asn Asp Ser He Pro Ala Lys Leu Gin Gin Glu
835 840 845
Phe He Asn Leu Pro Ser Thr Asp Phe Leu Ser Arg Cys Leu Gly Ser
850 855 860
Leu Glu Lys Asp Phe Gin Lys Thr He Phe Lys Lys Leu Lys Val Thr 865 870 875 880
Asn Leu Lys Thr Leu Ser He Val Ala Arg Ala Ser Trp Asn Asn Glu
885 890 895
Lys Phe Leu Glu Asn Leu Met Ala Gin Thr Ser Leu Glu Gin Gin Lys
900 905 910
Asp Phe Leu Lys Arg He Glu Glu Cys Leu Lys Asn Pro Glu Ser Phe
915 920 925
Tyr Phe Ser Ser Ala Cys Glu Leu Leu Leu Ala Phe Leu Ser Tyr Arg
930 935 940
Asn Ala Lys Arg Glu Leu Glu Leu He Pro Glu Ser Glu Lys Thr Met 945 950 955 960
Arg Leu Leu Asp Ser He Asp Lys Ala He Glu Lys Glu Thr Glu He
965 970 975
Lys Ser Phe Val Lys Leu Glu Leu Lys Asn Gin Ser Phe Asn Asn He
980 985 990
Pro Pro Leu Leu Ser Ala Leu Arg Leu Tyr Leu Arg Gly Asp Leu Glu
995 1000 1005
Gly Val Gly He Glu He Asn Gly Thr Glu Glu Asp Glu 1010 1015 1020
(2) INFORMATION FOR SEQ ID NO: 711: (I) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 281 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(II) MOLECULE TYPE: Genomic DNA ( x) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...180 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION. SEQ ID NO: 711:
AAC AAA ACT TTT TTA AGG GGT AAA AAC CCT TTA GTT ACA AGA AAG AGA 48 Asn Lys Thr Phe Leu Arg Gly Lys Asn Pro Leu Val Thr Arg Lys Arg 1 5 10 15
AGT TTT GTC ATC AAA ATG AAG TTT TTT AAA GAA AAA GAA AAA GAA GTT 96 Ser Phe Val He Lys Met Lys Phe Phe Lys Glu Lys Glu Lys Glu Val 20 25 30
TCA AAA ATT AAA AGT TTG AGA AAG TTT GAG TCA AAT CCG CTA GTA AGA 144 Ser Lys He Lys Ser Leu Arg Lys Phe Glu Ser Asn Pro Leu Val Arg 35 40 45
TTT GAC CCT AGC GCT CTT GCG CTA GAG CCA AAA TTT TAGTATAATG GCGTTG 196 Phe Asp Pro Ser Ala Leu Ala Leu Glu Pro Lys Phe 50 55 60
CGGATACCAT TAAAACTATG ATTACTAATC TCATGTTAGG TTATCTCCTT TTCTGAGGTA 256 ACCACCCACT ATTACCCCCA ACCCT 281
(2) INFORMATION FOR SEQ ID NO: 712:
(l) SEQUENCE CHARACTERISTICS.
(A) LENGTH: 60 ammo acids
Figure imgf001092_0001
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 712:
Asn Lys Thr Phe Leu Arg Gly Lys Asn Pro Leu Val Thr Arg Lys Arg
1 5 10 15
Ser Phe Val He Lys Met Lys Phe Phe Lys Glu Lys Glu Lys Glu Val
20 25 30
Ser Lys He Lys Ser Leu Arg Lys Phe Glu Ser Asn Pro Leu Val Arg 35 40 45
Phe Asp Pro Ser Ala Leu Ala Leu Glu Pro Lys Phe 50 55 60
(2) INFORMATION FOR SEQ ID NO: 713:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 745 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...692 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 713:
GCGCGAATTC TTCTTGGACA TTGATTTTAT AGCGCGATTT AGAAATGGCC ATG CCC 56
Met Pro 1
ACA ATG AAC GCT CCT AAA GAC ATA GAA AAC CCA AAA AAA TGG CTC AAC 104 Thr Met Asn Ala Pro Lys Asp He Glu Asn Pro Lys Lys Trp Leu Asn 5 10 15
CCT GCT GCG CTG CAA ACA ATC ACT AAA ATC GTG CCT ATA AAA ATT TCA 152 Pro Ala Ala Leu Gin Thr He Thr Lys He Val Pro He Lys He Ser 20 25 30
GGC AGG CGC GTG TCT TTT GCT TGC TCT AAG ATG AGA TTA GCC CCT TTT 200 Gly Arg Arg Val Ser Phe Ala Cys Ser Lys Met Arg Leu Ala Pro Phe 35 40 45 50
TTT CCA GGC AAT AAT AAA AGA ACT AAA ATA ATC CCT GCT GAA ATA AAG 248 Phe Pro Gly Asn Asn Lys Arg Thr Lys He He Pro Ala Glu He Lys 55 60 65
GTT TTA AGA ATG AGT AAA TTA ACA TTA GAA TCT TTA CTA CCT AGA ATA 296 Val Leu Arg Met Ser Lys Leu Thr Leu Glu Ser Leu Leu Pro Arg He 70 75 80
GTG AGG ATT AAA AGC ATG GGA ATG GCT GCA ATA TCT TGG AAA ATC AAA 344 Val Arg He Lys Ser Met Gly Met Ala Ala He Ser Trp Lys He Lys 85 90 95
ATC CCC ACC GCG CTC TTT CCC ATG GGC GTG CTA AGC TGT TTG GAA TCT 392 He Pro Thr Ala Leu Phe Pro Met Gly Val Leu Ser Cys Leu Glu Ser 100 105 110 TCA AAG AAT TTC AGC ACA ATA GCG GTT GAA GAG AGC GAA AGC CCC ATG 440
Ser Lys Asn Phe Ser Thr He Ala Val Glu Glu Ser Glu Ser Pro Met
115 120 125 130
CCT AAA ACA AGG GAA AAA ATG GGT GAA AGA CCC AAA ACA AAA TAC CCC 488
Pro Lys Thr Arg Glu Lys Met Gly Glu Arg Pro Lys Thr Lys Tyr Pro 135 140 145
AAT AAA AAA GCG ATT AAA GCG CAT AAA ACC ACT TGT AAA AGC CCA AAA 536
Asn Lys Lys Ala He Lys Ala His Lys Thr Thr Cys Lys Ser Pro Lys 150 155 160
ACC AGC ACT TCT TGT TTG ATG GAT TTG AGC TTG TCA AAA TTA AAC TCA 584
Thr Ser Thr Ser Cys Leu Met Asp Leu Ser Leu Ser Lys Leu Asn Ser 165 170 175
ATG CCT ATC ATA AAC ATT AAA AAG ACG ATA CCA AAT TCG CCA ATA TCA 632
Met Pro He He Asn He Lys Lys Thr He Pro Asn Ser Pro He Ser 180 185 190
GAC AAC AAA TCA AAA TCA TTA ATT TTA AAA AAA GCC GCT AAG ACC GTT 680
Asp Asn Lys Ser Lys Ser Leu He Leu Lys Lys Ala Ala Lys Thr Val
195 200 205 210
CCT GTG CAA ATG TAACCAATGA TAACAGGCAT GTCTAATTTC TTTAAAAAGA TTCCA 737 Pro Val Gin Met
AAGCCCAC 745
(2) INFORMATION FOR SEQ ID NO: 714:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 214 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714:
Met Pro Thr Met Asn Ala Pro Lys Asp He Glu Asn Pro Lys Lys Trp
1 5 10 15
Leu Asn Pro Ala Ala Leu Gin Thr He Thr Lys He Val Pro He Lys
20 25 30
He Ser Gly Arg Arg Val Ser Phe Ala Cys Ser Lys Met Arg Leu Ala
35 40 45
Pro Phe Phe Pro Gly Asn Asn Lys Arg Thr Lys He He Pro Ala Glu
50 55 60
He Lys Val Leu Arg Met Ser Lys Leu Thr Leu Glu Ser Leu Leu Pro 65 70 75 80
Arg He Val Arg He Lys Ser Met Gly Met Ala Ala He Ser Trp Lys 85 90 95 He Lys He Pro Thr Ala Leu Phe Pro Met Gly Val Leu Ser Cys Leu
100 105 110
Glu Ser Ser Lys Asn Phe Ser Thr He Ala Val Glu Glu Ser Glu Ser
115 120 125
Pro Met Pro Lys Thr Arg Glu Lys Met Gly Glu Arg Pro Lys Thr Lys
130 135 140
Tyr Pro Asn Lys Lys Ala He Lys Ala His Lys Thr Thr Cys Lys Ser 145 150 155 160
Pro Lys Thr Ser Thr Ser Cys Leu Met Asp Leu Ser Leu Ser Lys Leu
165 170 175
Asn Ser Met Pro He He Asn He Lys Lys Thr He Pro Asn Ser Pro
180 185 190
He Ser Asp Asn Lys Ser Lys Ser Leu He Leu Lys Lys Ala Ala Lys
195 200 205
Thr Val Pro Val Gin Met 210
(2) INFORMATION FOR SEQ ID NO: 715:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 601 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...491 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715:
GCCTTTGCTC ACCCTAAAGA TAAAAATAAA ATTCACGAAG GTTACGCTTA ATG CGT 56
Met Arg
1
TTA GAA AAT CTA AGC CAG CAA AAA ATT CTT CAA CTC TCT GGC GGG CAA 104 Leu Glu Asn Leu Ser Gin Gin Lys He Leu Gin Leu Ser Gly Gly Gin 5 10 15
GCC CAA CGA GTC GCT TTA GCA AGA GCT TTA ATC GCA GCC AAG AAT CTA 152 Ala Gin Arg Val Ala Leu Ala Arg Ala Leu He Ala Ala Lys Asn Leu 20 25 30
TTG CTT TTA GAT GAG CCT TTA AAC GCC TTA GAT AAC GCC TTA AAA AAC 200 Leu Leu Leu Asp Glu Pro Leu Asn Ala Leu Asp Asn Ala Leu Lys Asn 35 40 45 50
GAA GTG CAA CAA GGT TTG CT 'GAT TTT ATC AAG CGT GAA AAT TTA AGC 248 Glu Val Gin Gin Gly Leu Leu Asp Phe He Lys Arg Glu Asn Leu Ser 55 60 65
GTG TTA TTG GTA AGC CAT AAC CCC AAT GAA ATC ACC AAG CTC GCG CAA 296 Val Leu Leu Val Ser His Asn Pro Asn Glu He Thr Lys Leu Ala Gin 70 75 80
ACT TTC CTC TTT TTA AAC AAT GGC GTT ATT GAT CCT AAT CAA GAA AAT 344 Thr Phe Leu Phe Leu Asn Asn Gly Val He Asp Pro Asn Gin Glu Asn 85 90 95
CGG CTT TTT TCA AAC CGC TTA TTA ATA AAA CCT CTC TTT GAA GAT GAA 392 Arg Leu Phe Ser Asn Arg Leu Leu He Lys Pro Leu Phe Glu Asp Glu 100 105 110
AAT TAT TGC CAT TAT GAG GTC ATT TCT CAA ACG ATT AGT TTG CCC AAA 440 Asn Tyr Cys His Tyr Glu Val He Ser Gin Thr He Ser Leu Pro Lys 115 120 125 130
GAT TGT CTG AAC CCA ACT TTT AAG CTT GAT TTC AAT CAA AAC AAA AAA 488 Asp Cys Leu Asn Pro Thr Phe Lys Leu Asp Phe Asn Gin Asn Lys Lys 135 140 145
TTT TAGAAATATT TTTTCATTTT CCTCTTAAAA CCCTCTTATT TTTCAAAAGG AGTTGC 547 Phe
TTAACAACCG CTAAAATCAA ACTCTTTTAT TTTAATACCC AATGAAAACA GAGC 601
(2) INFORMATION FOR SEQ ID NO: 716:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 147 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 716:
Met Arg Leu Glu Asn Leu Ser Gin Gin Lys He Leu Gin Leu Ser Gly
1 5 10 15
Gly Gin Ala Gin Arg Val Ala Leu Ala Arg Ala Leu He Ala Ala Lys
20 25 30
Asn Leu Leu Leu Leu Asp Glu Pro Leu Asn Ala Leu Asp Asn Ala Leu
35 40 45
Lys Asn Glu Val Gin Gin Gly Leu Leu Asp Phe He Lys Arg Glu Asn
50 55 60
Leu Ser Val Leu Leu Val Ser His Asn Pro Asn Glu He Thr Lys Leu 65 70 75 80
Ala Gin Thr Phe Leu Phe Leu Asn Asn Gly Val He Asp Pro Asn Gin
85 90 95
Glu Asn Arg Leu Phe Ser Asn Arg Leu Leu He Lys Pro Leu Phe Glu 100 105 110 Asp Glu Asn Tyr Cys His Tyr Glu Val He Ser Gin Thr He Ser Leu
115 120 125
Pro Lys Asp Cys Leu Asn Pro Thr Phe Lys Leu Asp Phe Asn Gin Asn
130 135 140
Lys Lys Phe 145
(2) INFORMATION FOR SEQ ID NO: 717:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1530 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...1440 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717:
TTAAGCCATT TTTGAATACA ATAAGAGTAT TTTATTTTTA AGGTTAGAAC A ATG AGT 57
Met Ser 1
TTG ATC GTT ACG CGC TTC GCT CCA TCG CCC ACT GGC TAC CTC CAC ATA 105 Leu He Val Thr Arg Phe Ala Pro Ser Pro Thr Gly Tyr Leu His He 5 10 15
GGA GGC TTA AGA ACA GCC ATT TTC AAT TAT CTT TTT GCA CGA GCC AAT 153 Gly Gly Leu Arg Thr Ala He Phe Asn Tyr Leu Phe Ala Arg Ala Asn 20 25 30
CAA GGA AAA TTT TTT TTA CGC ATT GAA GAC ACG GAT TTG AGC CGT AAC 201 Gin Gly Lys Phe Phe Leu Arg He Glu Asp Thr Asp Leu Ser Arg Asn 35 40 45 50
TCT ATA GAA GCG GCT AAC GCC ATT ATA GAA GCT TTC AAA TGG GTA GGG 249 Ser He Glu Ala Ala Asn Ala He He Glu Ala Phe Lys Trp Val Gly 55 60 65
CTA GAA TAC GAT GGC GAA ATC CTC TAC CAA TCC AAA CGC TTT GAG ATT 297 Leu Glu Tyr Asp Gly Glu He Leu Tyr Gin Ser Lys Arg Phe Glu He 70 75 80
TAT AAA GAA TAC ATT CAA AAA CTC TTA GAT GAA GAC AAA GCC TAT TAT 345 Tyr Lys Glu Tyr He Gin Lys Leu Leu Asp Glu Asp Lys Ala Tyr Tyr 85 90 95 TGT TAC ATG AGC AAA GAA GAG TTG GAC GCT TTG AGA GAA GAG CAA AAA 393 Cys Tyr Met Ser Lys Glu Glu Leu Asp Ala Leu Arg Glu Glu Gin Lys 100 105 110
GCC AGG AAA GAA ACC CCA CGC TAT GAC AAT CGC TAT CGT GAT TTT AAA 441 Ala Arg Lys Glu Thr Pro Arg Tyr Asp Asn Arg Tyr Arg Asp Phe Lys 115 120 125 130
GGC ACG CCT CCT AAA GGC ATA GAG CCT GTG GTA AGG ATT AAA GTC CCC 489 Gly Thr Pro Pro Lys Gly He Glu Pro Val Val Arg He Lys Val Pro 135 140 145
CAA AAT GAG GTG ATT GGT TTT AAT GAC GGG GTT AAA GGC GAA GTG AAA 537 Gin Asn Glu Val He Gly Phe Asn Asp Gly Val Lys Gly Glu Val Lys 150 155 160
GTG AAT ACT AAC GAA TTA GAC GAT TTT ATT ATC GCC AGG AGC GAT GGG 585 Val Asn Thr Asn Glu Leu Asp Asp Phe He He Ala Arg Ser Asp Gly 165 170 175
ACA CCC ACT TAT AAC TTT GTG GTT ACT ATT GAT GAC GCT TTA ATG GGG 633 Thr Pro Thr Tyr Asn Phe Val Val Thr He Asp Asp Ala Leu Met Gly 180 185 190
ATT ACT GAT GTG ATT AGA GGC GAT GAT CAC CTT TCT AAC ACC CCT AAA 681 He Thr Asp Val He Arg Gly Asp Asp His Leu Ser Asn Thr Pro Lys 195 200 205 210
CAA ATC GTT CTT TAT AAG GCT TTG AAT TTT AAA ATC CCT AAT TTT TTC 729 Gin He Val Leu Tyr Lys Ala Leu Asn Phe Lys He Pro Asn Phe Phe 215 220 225
CAT GTG CCG ATG ATT TTG AAT GAA GAA GGG CAA AAA TTA AGC AAA CGC 777 His Val Pro Met He Leu Asn Glu Glu Gly Gin Lys Leu Ser Lys Arg 230 235 240
CAT GGG GCC ACT AAT GTG ATG GAC TAT CAA GAA ATG GGC TAT CTT AAG 825 His Gly Ala Thr Asn Val Met Asp Tyr Gin Glu Met Gly Tyr Leu Lys 245 250 255
GAA GCT TTA GTG AAT TTT TTA GCG CGT TTG GGG TGG AGC TAT CAG GAT 873 Glu Ala Leu Val Asn Phe Leu Ala Arg Leu Gly Trp Ser Tyr Gin Asp 260 265 270
AAA GAG GTT TTT AGC ATG CAA GAA TTG CTA GAA TTA TTT GAT CCT AAA 921 Lys Glu Val Phe Ser Met Gin Glu Leu Leu Glu Leu Phe Asp Pro Lys 275 280 285 290
GAT TTG AAT TCT TCG CCC AGT TGC TTC AGC TGG CAC AAG CTT AAT TGG 969 Asp Leu Asn Ser Ser Pro Ser Cys Phe Ser Trp His Lys Leu Asn Trp 295 300 305
CTC AAC GCT CAT TAT TTA AAA AAC CAA AGT GTG CAA GAA TTG TTA AAA 1017 Leu Asn Ala His Tyr Leu Lys Asn Gin Ser Val Gin Glu Leu Leu Lys 310 315 320 CTT TTA AAG CCT TTT AGT TTT AGC GAT CTC TCG CAT TTA AAC CCC ACT 1065 Leu Leu Lys Pro Phe Ser Phe Ser Asp Leu Ser His Leu Asn Pro Thr 325 330 335
CAA TTG GAT CGC TTG TTA GAC GCT CTC AAA GAA AGA TCT CAA ACA CTA 1113 Gin Leu Asp Arg Leu Leu Asp Ala Leu Lys Glu Arg Ser Gin Thr Leu 340 345 350
AAA GAA TTA GCC CTT AAA ATA GAT GAG GTT TTA ATC GCC CCT GTG GAG 1161 Lys Glu Leu Ala Leu Lys He Asp Glu Val Leu He Ala Pro Val Glu 355 360 365 370
TAT GAA GAA AAG GTT TTT AAA AAA CTC AAT CAA GCG CTC GTT ATG CCC 1209 Tyr Glu Glu Lys Val Phe Lys Lys Leu Asn Gin Ala Leu Val Met Pro 375 380 385
TTG TTA GAA AAG TTT AAG CTA GAA TTA AAC AAA GCC AAT TTC AAC GAT 1257 Leu Leu Glu Lys Phe Lys Leu Glu Leu Asn Lys Ala Asn Phe Asn Asp 390 395 400
GAA AGC GCG CTA GAA AAC GCC ATG CGC CAA ATC ATT GAA GAA GAA AAG 1305 Glu Ser Ala Leu Glu Asn Ala Met Arg Gin He He Glu Glu Glu Lys 405 410 415
ATT AAA GCG GGT AGT TTT ATG CAG CCT TTA AGA TTG GCC CTT TTG GGT 1353 He Lys Ala Gly Ser Phe Met Gin Pro Leu Arg Leu Ala Leu Leu Gly 420 425 430
AAG GGA GGC GGG ATA GGC CTT AAA GAA GCG CTT TTT ATT TTA GGC AAA 1401 Lys Gly Gly Gly He Gly Leu Lys Glu Ala Leu Phe He Leu Gly Lys 435 440 445 450
ACA GAG AGC GTC AAA AGA ATA GAG GAT TTT TTG AAA AAC TAAAAAATTG GC 1452 Thr Glu Ser Val Lys Arg He Glu Asp Phe Leu Lys Asn 455 460
TCTGTTTTCA TTGGGTATTA AAATAAAAGA GTTTGATTTT AGCGGTTGTT AAGCAACTCC 1512 TTTTGAAAAA TAAGAGGG 1530
(2) INFORMATION FOR SEQ ID NO: 718:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 463 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718:
Met Ser Leu He Val Thr Arg Phe Ala Pro Ser Pro Thr Gly Tyr Leu
1 5 10 15
His He Gly Gly Leu Arg Thr Ala He Phe Asn Tyr Leu Phe Ala Arg 20 25 30
Ala Asn Gin Gly Lys Phe Phe Leu Arg He Glu Asp Thr Asp Leu Ser
35 40 45
Arg Asn Ser He Glu Ala Ala Asn Ala He He Glu Ala Phe Lys Trp
50 55 60
Val Gly Leu Glu Tyr Asp Gly Glu He Leu Tyr Gin Ser Lys Arg Phe 65 70 75 80
Glu He Tyr Lys Glu Tyr He Gin Lys Leu Leu Asp Glu Asp Lys Ala
85 90 95
Tyr Tyr Cys Tyr Met Ser Lys Glu Glu Leu Asp Ala Leu Arg Glu Glu
100 105 110
Gin Lys Ala Arg Lys Glu Thr Pro Arg Tyr Asp Asn Arg Tyr Arg Asp
115 120 125
Phe Lys Gly Thr Pro Pro Lys Gly He Glu Pro Val Val Arg He Lys
130 135 140
Val Pro Gin Asn Glu Val He Gly Phe Asn Asp Gly Val Lys Gly Glu 145 150 155 160
Val Lys Val Asn Thr Asn Glu Leu Asp Asp Phe He He Ala Arg Ser
165 170 175
Asp Gly Thr Pro Thr Tyr Asn Phe Val Val Thr He Asp Asp Ala Leu
180 185 190
Met Gly He Thr Asp Val He Arg Gly Asp Asp His Leu Ser Asn Thr
195 200 205
Pro Lys Gin He Val Leu Tyr Lys Ala Leu Asn Phe Lys He Pro Asn
210 215 220
Phe Phe His Val Pro Met He Leu Asn Glu Glu Gly Gin Lys Leu Ser 225 230 235 240
Lys Arg His Gly Ala Thr Asn Val Met Asp Tyr Gin Glu Met Gly Tyr
245 250 255
Leu Lys Glu Ala Leu Val Asn Phe Leu Ala Arg Leu Gly Trp Ser Tyr
260 265 270
Gin Asp Lys Glu Val Phe Ser Met Gin Glu Leu Leu Glu Leu Phe Asp
275 280 285
Pro Lys Asp Leu Asn Ser Ser Pro Ser Cys Phe Ser Trp His Lys Leu
290 295 300
Asn Trp Leu Asn Ala His Tyr Leu Lys Asn Gin Ser Val Gin Glu Leu 305 310 315 320
Leu Lys Leu Leu Lys Pro Phe Ser Phe Ser Asp Leu Ser His Leu Asn
325 330 335
Pro Thr Gin Leu Asp Arg Leu Leu Asp Ala Leu Lys Glu Arg Ser Gin
340 345 350
Thr Leu Lys Glu Leu Ala Leu Lys He Asp Glu Val Leu He Ala Pro
355 360 365
Val Glu Tyr Glu Glu Lys Val Phe Lys Lys Leu Asn Gin Ala Leu Val
370 375 380
Met Pro Leu Leu Glu Lys Phe Lys Leu Glu Leu Asn Lys Ala Asn Phe 385 390 395 400
Asn Asp Glu Ser Ala Leu Glu Asn Ala Met Arg Gin He He Glu Glu
405 410 415
Glu Lys He Lys Ala Gly Ser Phe Met Gin Pro Leu Arg Leu Ala Leu
420 425 430
Leu Gly Lys Gly Gly Gly He Gly Leu Lys Glu Ala Leu Phe He Leu
435 440 445
Gly Lys Thr Glu Ser Val Lys Arg He Glu Asp Phe Leu Lys Asn 450 455 460 (2) INFORMATION FOR SEQ ID NO: 719:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 382 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...329 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:719:
CCTAATGATT TTTTCTATCT TGCCCCCCTT ATGTGTTAGA AAATTCTAAA ATG TTT 56
Met Phe 1
AAG GGG ATT TAT CCT ATG CGT AAT TTT CCT ATC CAC CAT AAT GGT TTT 104 Lys Gly He Tyr Pro Met Arg Asn Phe Pro He His His Asn Gly Phe 5 10 15
AAA CAT GAA GTG TTA GCT CAC ATG CTA AAA AGG CAT AAA GAG CCA TTT 152 Lys His Glu Val Leu Ala His Met Leu Lys Arg His Lys Glu Pro Phe 20 25 30
ATT TTA AGC TAT AAT GAC TGC GAA TTT GTA AGG AAT GCT TAT AAA GAT 200 He Leu Ser Tyr Asn Asp Cys Glu Phe Val Arg Asn Ala Tyr Lys Asp 35 40 45 50
TTT AAA ATT TTA GAA CCA TCT TGG CAA TAC ACT ATG GGA CAA GGC GAG 248 Phe Lys He Leu Glu Pro Ser Trp Gin Tyr Thr Met Gly Gin Gly Glu 55 60 65
ATC AGA ATG GGT AAA AAT CGC TTA GAA AGA GGC GAT AAT AAC CAT GTC 296 He Arg Met Gly Lys Asn Arg Leu Glu Arg Gly Asp Asn Asn His Val 70 75 80
AAA CAA TCT CAT GAG TTA TTG ATT ATC AAG GAG TAAAAATGCA TATTAGCGAA 349 Lys Gin Ser His Glu Leu Leu He He Lys Glu 85 90
GTCAAAACTG CCTTTAAAAT CGCTGATGTA GAA 382
(2) INFORMATION FOR SEQ ID NO: 720:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 93 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS single
(D) TOPOLOGY linear
(11) MOLECULE TYPE protein (v) FRAGMENT TYPE internal
(xi) SEQUENCE DESCRIPTION SEQ ID NO 720
Met Phe Lys Gly He Tyr Pro Met Arg Asn Phe Pro He His His Asn
1 5 10 15
Gly Phe Lys His Glu Val Leu Ala His Met Leu Lys Arg His Lys Glu
20 25 30
Pro Phe He Leu Ser Tyr Asn Asp Cys Glu Phe Val Arg Asn Ala Tyr
35 40 45
Lys Asp Phe Lys He Leu Glu Pro Ser Trp Gin Tyr Thr Met Gly Gin
50 55 60
Gly Glu He Arg Met Gly Lys Asn Arg Leu Glu Arg Gly Asp Asn Asn 65 70 75 80
His Val Lys Gin Ser His Glu Leu Leu He He Lys Glu 85 90
(2) INFORMATION FOR SEQ ID NO 721
(l) SEQUENCE CHARACTERISTICS
(A) LENGTH 376 base pairs
(B) TYPE nucleic acid
(C) STRANDEDNESS single
(D) TOPOLOGY linear
(n) MOLECULE TYPE Genomic DNA (ix) FEATURE
(A) NAME/KEY Coding Sequence
(B) LOCATION 183 323 (D) OTHER INFORMATION
(xi) SEQUENCE DESCRIPTION SEQ ID NO 721
ATCCACAAAT TCCACTTCAT TTTCTTTGCA AAATTCAAAA AATTCTTTGA TCTTGCTTTC 60
ACTATTTTGA GTTCTTACTA TCATTAACCA CCTTTAATTG TTATGAATTG AGTTTGATTG 120
ATAGGGGTGA TTATAGCATT TATGGGGCAA AAAAAGTAGA ATCTGTATCA AGTTTTATTA 180
AG ATG CAT GCG GTA AAA TCC GCT AAA TCA AGG AGT GTT ATT ATG GAA 227 Met His Ala Val Lys Ser Ala Lys Ser Arg Ser Val He Met Glu 1 5 10 15
GCA GAC GCA ACC ACA CTA TTA GGA TTT TTT GAA GAA AAT CAA AAC AAT 275 Ala Asp Ala Thr Thr Leu Leu Gly Phe Phe Glu Glu Asn Gin Asn Asn 20 25 30
CAA TTT GTC ATT CCT ATC TAT CAG AGG TTG TAT AGT TGG AAA AAG GAA T 324 Gin Phe Val He Pro He Tyr Gin Arg Leu Tyr Ser Trp Lys Lys Glu 35 40 45 AATGCGAACA ATTATGGGAT GATATTATAA AAATTGGTGG GAATGATAAG AT 376
(2) INFORMATION FOR SEQ ID NO: 722:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 47 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722:
Met His Ala Val Lys Ser Ala Lys Ser Arg Ser Val He Met Glu Ala
1 5 10 15
Asp Ala Thr Thr Leu Leu Gly Phe Phe Glu Glu Asn Gin Asn Asn Gin
20 25 30
Phe Val He Pro He Tyr Gin Arg Leu Tyr Ser Trp Lys Lys Glu 35 40 45
(2) INFORMATION FOR SEQ ID NO:723:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1021 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...968 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:723:
TATTAATGGG GTTTGTGGGA TTGAATGCTA GTGATCGTTT GTTAGAAATC ATG CGC 56
Met Arg
1
CTT TAT CAA AAA CAA GGC TTG GAA ATG GTG GGT CAA AAG TTG GAT TCT 104 Leu Tyr Gin Lys Gin Gly Leu Glu Met Val Gly Gin Lys Leu Asp Ser 5 10 15
TAT TTA GCG GAT AAA TCT TTT TGG GCA GAA GAA CTT CAA AAC AAG GAC 152 Tyr Leu Ala Asp Lys Ser Phe Trp Ala Glu Glu Leu Gin Asn Lys Asp 20 25 30
ACG GAT TTT GGC TAT TAT CAA AAC AAG CAG TTT TTA TTT GTG GCT AAT 200 Thr Asp Phe Gly Tyr Tyr Gin Asn Lys Gin Phe Leu Phe Val Ala Asn 35 40 45 50
AAA TCC AAG CCC AGT TTG GAG TTT TAT GAG ATA GAA AAT AAC ATG CTT 248 Lys Ser Lys Pro Ser Leu Glu Phe Tyr Glu He Glu Asn Asn Met Leu 55 60 65
AAA AAA ATC AAC AGC TCT AAA GCT CTT GTA GGC TCT AAA AAG GGC GAT 296 Lys Lys He Asn Ser Ser Lys Ala Leu Val Gly Ser Lys Lys Gly Asp 70 75 80
AAG ACT TTA GAG GGC GAT TTG GCC ACG CCT ATT GGA GTG TAT CGT ATC 344 Lys Thr Leu Glu Gly Asp Leu Ala Thr Pro He Gly Val Tyr Arg He 85 90 95
ACG CAG AAA TTA GAG CGC TTG GAT CAA TAT TAT GGC GTT TTG GCT TTT 392 Thr Gin Lys Leu Glu Arg Leu Asp Gin Tyr Tyr Gly Val Leu Ala Phe 100 105 110
GTA ACG AAT TAC CCT AAT TTG TAT GAT ACC TTG AAA AAA CGC ACC GGG 440 Val Thr Asn Tyr Pro Asn Leu Tyr Asp Thr Leu Lys Lys Arg Thr Gly 115 120 125 130
CAT GGC ATT TGG GTG CAT GGA ATG CCT TTA AAT GGC GAT CGG AAT GAA 488 His Gly He Trp Val His Gly Met Pro Leu Asn Gly Asp Arg Asn Glu 135 140 145
TTG AAC ACC AAG GGC TGT ATT GCG ATT GAA AAC CCG CTT TTA AGC TCT 536 Leu Asn Thr Lys Gly Cys He Ala He Glu Asn Pro Leu Leu Ser Ser 150 155 160
TAT GAC AAA GTG TTA AAA GGC GAA AAA GCG TTC CTC ATC ACC TAT GAA 584 Tyr Asp Lys Val Leu Lys Gly Glu Lys Ala Phe Leu He Thr Tyr Glu 165 170 175
GAC AAG TTT TTC CCA AGC ACC AAA GAA GAA TTG AGC ATG ATT TTA AGC 632 Asp Lys Phe Phe Pro Ser Thr Lys Glu Glu Leu Ser Met He Leu Ser 180 185 190
TCC CTT TTT CAA TGG AAA GAA GCC TGG GCT AGG GGC GAT TTT GAA CGC 680 Ser Leu Phe Gin Trp Lys Glu Ala Trp Ala Arg Gly Asp Phe Glu Arg 195 200 205 210
TAC ATG CGT TTT TAT AAC CCC AAT TTC ACT CGC TAT GAC GGC ATG AAA 728 Tyr Met Arg Phe Tyr Asn Pro Asn Phe Thr Arg Tyr Asp Gly Met Lys 215 220 225
TTT AAC GCT TTT AAA GAG TAT AAA AAA AGG GTG TTT GCA AAA AAC GAA 776 Phe Asn Ala Phe Lys Glu Tyr Lys Lys Arg Val Phe Ala Lys Asn Glu 230 235 240
AAA AAG AAT ATC GCT TTT TCC TCT ATC AAT GTG ATC CCT TAC CCC AAC 824 Lys Lys Asn He Ala Phe Ser Ser He Asn Val He Pro Tyr Pro Asn 245 250 255 TCT CAG AAC AAA CGC TTG TTT TAT GTG GTG TTT GAC CAA GAT TAT AAA 872 Ser Gin Asn Lys Arg Leu Phe Tyr Val Val Phe Asp Gin Asp Tyr Lys 260 265 270
GCC TAC CAG CAT AAC AAG CTC TCT TAT AGC TCC AAT TCT CAA AAA GAA 920 Ala Tyr Gin His Asn Lys Leu Ser Tyr Ser Ser Asn Ser Gin Lys Glu 275 280 285 290
CTC TAT ATA GAG ATT GAA AAC AAT CAA GTG TCT ATT ATA ATG GAA AAA T 969 Leu Tyr He Glu He Glu Asn Asn Gin Val Ser He He Met Glu Lys 295 300 305
AAGAAAAATA GGGCTTTGTT TTAATTAGGA TAATCTAAGC GGATTTTTCT AA 1021
(2) INFORMATION FOR SEQ ID NO: 724:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 306 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 724:
Met Arg Leu Tyr Gin Lys Gin Gly Leu Glu Met Val Gly Gin Lys Leu
1 5 10 15
Asp Ser Tyr Leu Ala Asp Lys Ser Phe Trp Ala Glu Glu Leu Gin Asn
20 25 30
Lys Asp Thr Asp Phe Gly Tyr Tyr Gin Asn Lys Gin Phe Leu Phe Val
35 40 45
Ala Asn Lys Ser Lys Pro Ser Leu Glu Phe Tyr Glu He Glu Asn Asn
50 55 60
Met Leu Lys Lys He Asn Ser Ser Lys Ala Leu Val Gly Ser Lys Lys 65 70 75 80
Gly Asp Lys Thr Leu Glu Gly Asp Leu Ala Thr Pro He Gly Val Tyr
85 90 95
Arg He Thr Gin Lys Leu Glu Arg Leu Asp Gin Tyr Tyr Gly Val Leu
100 105 110
Ala Phe Val Thr Asn Tyr Pro Asn Leu Tyr Asp Thr Leu Lys Lys Arg
115 120 125
Thr Gly His Gly He Trp Val His Gly Met Pro Leu Asn Gly Asp Arg
130 135 140
Asn Glu Leu Asn Thr Lys Gly Cys He Ala He Glu Asn Pro Leu Leu 145 150 155 160
Ser Ser Tyr Asp Lys Val Leu Lys Gly Glu Lys Ala Phe Leu He Thr
165 170 175
Tyr Glu Asp Lys Phe Phe Pro Ser Thr Lys Glu Glu Leu Ser Met He
180 185 190
Leu Ser Ser Leu Phe Gin Trp Lys Glu Ala Trp Ala Arg Gly Asp Phe
195 200 205
Glu Arg Tyr Met Arg Phe Tyr Asn Pro Asn Phe Thr Arg Tyr Asp Gly 210 215 220 Met Lys Phe Asn Ala Phe Lys Glu Tyr Lys Lys Arg Val Phe Ala Lys 225 230 235 240
Asn Glu Lys Lys Asn He Ala Phe Ser Ser He Asn Val He Pro Tyr
245 250 255
Pro Asn Ser Gin Asn Lys Arg Leu Phe Tyr Val Val Phe Asp Gin Asp
260 265 270
Tyr Lys Ala Tyr Gin His Asn Lys Leu Ser Tyr Ser Ser Asn Ser Gin
275 280 285
Lys Glu Leu Tyr He Glu He Glu Asn Asn Gin Val Ser He He Met
290 295 300
Glu Lys 305
(2) INFORMATION FOR SEQ ID NO: 725:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 990 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 43...870 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 725:
AAAACAAGCT AAAGAATGGC TCAAATTGTA AAAGGATCTG AC ATG TTT AAA GAT 54
Met Phe Lys Asp 1
TTT TAT CGC ACC ACC CTC TCT TTT TTA AAG CCT TTA TTG CTT TTA CTA 102 Phe Tyr Arg Thr Thr Leu Ser Phe Leu Lys Pro Leu Leu Leu Leu Leu 5 10 15 20
GTT TTA TTA TTG CCG TTT TCA CTT TGT ATA GCT GAT GAA TAT ATT AGC 150 Val Leu Leu Leu Pro Phe Ser Leu Cys He Ala Asp Glu Tyr He Ser 25 30 35
ATA AGT GAT GAT TGG GAT GAA ATT GTG CGA AAT CAT AAG ACA TAT TAT 198 He Ser Asp Asp Trp Asp Glu He Val Arg Asn His Lys Thr Tyr Tyr 40 45 50
TTT GAA AAT GGT TTA GAC CAT TTT AAT CAA GGC CAA TAC CAG CAA GCC 246 Phe Glu Asn Gly Leu Asp His Phe Asn Gin Gly Gin Tyr Gin Gin Ala 55 60 65
TTT AAA GAT TTT AGA TTG GCG CAA GAA TAC AGC ATC GGG CTT GGC AGT 294 Phe Lys Asp Phe Arg Leu Ala Gin Glu Tyr Ser He Gly Leu Gly Ser 70 75 80
GTT TAT TTA GCC AAA ATG TAT TTG GAG GGA AAG GGC GTG AAA GTG GAT 342 Val Tyr Leu Ala Lys Met Tyr Leu Glu Gly Lys Gly Val Lys Val Asp 85 90 95 100
TAC AAA AAA GCA CAA TTT TAT GCA GAA AAC GCT ATC AAA GGG TAT GGG 390 Tyr Lys Lys Ala Gin Phe Tyr Ala Glu Asn Ala He Lys Gly Tyr Gly 105 110 115
AGC GGA TTG TTA GGG GGT GCT CTT ATT TTA GGA CGC ATG CAA GCA GAA 438 Ser Gly Leu Leu Gly Gly Ala Leu He Leu Gly Arg Met Gin Ala Glu 120 125 130
GGC TTA GGG ATG AAA AAG GAT TTG AAA CAA GCG CTC AAG ACT TAT AGG 486 Gly Leu Gly Met Lys Lys Asp Leu Lys Gin Ala Leu Lys Thr Tyr Arg 135 140 145
CAT GTG GTT CGC ATG TTT TCT AAT AAA AGC ACA AAT TTT GCT AAC AAT 534 His Val Val Arg Met Phe Ser Asn Lys Ser Thr Asn Phe Ala Asn Asn 150 155 160
TTT AGA TTA CCA AAC CTT GCG GAA TTT ACT AGT ATG CTT ATT GGA TCG 582 Phe Arg Leu Pro Asn Leu Ala Glu Phe Thr Ser Met Leu He Gly Ser 165 170 175 180
CGA TTC ATT GAT CTT TCA GGT TTG AGC GCG AAT CCT ATA AAA TTT GGA 630 Arg Phe He Asp Leu Ser Gly Leu Ser Ala Asn Pro He Lys Phe Gly 185 190 195
AAG AAA TTT GGA ATA CTT GTT AAG AAA TCC ACT CAA ATC AAA GAT AAG 678 Lys Lys Phe Gly He Leu Val Lys Lys Ser Thr Gin He Lys Asp Lys 200 205 210
ACA CTT CTT TGG GAA GAT ATT GCT GAA ATT TCA AGC AAT ATT ACT TTA 726 Thr Leu Leu Trp Glu Asp He Ala Glu He Ser Ser Asn He Thr Leu 215 220 225
CTC AAA CAA CAA ATG GGG GAG ATC CTT TAT AGG ATT GGG ATC GCT TAT 774 Leu Lys Gin Gin Met Gly Glu He Leu Tyr Arg He Gly He Ala Tyr 230 235 240
AAA GAA GGG CTT GGC ACT AGA AAG AAA AAG GAC AGG GCT AAA AAA TTC 822 Lys Glu Gly Leu Gly Thr Arg Lys Lys Lys Asp Arg Ala Lys Lys Phe 245 250 255 260
CTG CAA AAA TCC GCA GAA TTT GGC TAT GAA AAA GCC ATG GAA GCT CTG T 871 Leu Gin Lys Ser Ala Glu Phe Gly Tyr Glu Lys Ala Met Glu Ala Leu 265 270 275
AGTTTTTTAA TCAAACTTGT ATCAAGCTTG ACTGAATGGG TTAGAAAAAT CCGCTTAGAT 931 TATCCTAATT AAAACAAAGC CCTATTTTTC TTATTTTTCC ATTATAATAG ACACTTGAT 990
(2) INFORMATION FOR SEQ ID NO: 726: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 276 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 726:
Met Phe Lys Asp Phe Tyr Arg Thr Thr Leu Ser Phe Leu Lys Pro Leu
1 5 10 15
Leu Leu Leu Leu Val Leu Leu Leu Pro Phe Ser Leu Cys He Ala Asp
20 25 30
Glu Tyr He Ser He Ser Asp Asp Trp Asp Glu He Val Arg Asn His
35 40 45
Lys Thr Tyr Tyr Phe Glu Asn Gly Leu Asp His Phe Asn Gin Gly Gin
50 55 60
Tyr Gin Gin Ala Phe Lys Asp Phe Arg Leu Ala Gin Glu Tyr Ser He 65 70 75 80
Gly Leu Gly Ser Val Tyr Leu Ala Lys Met Tyr Leu Glu Gly Lys Gly
85 90 95
Val Lys Val Asp Tyr Lys Lys Ala Gin Phe Tyr Ala Glu Asn Ala He
100 105 110
Lys Gly Tyr Gly Ser Gly Leu Leu Gly Gly Ala Leu He Leu Gly Arg
115 120 125
Met Gin Ala Glu Gly Leu Gly Met Lys Lys Asp Leu Lys Gin Ala Leu
130 135 140
Lys Thr Tyr Arg His Val Val Arg Met Phe Ser Asn Lys Ser Thr Asn 145 150 155 160
Phe Ala Asn Asn Phe Arg Leu Pro Asn Leu Ala Glu Phe Thr Ser Met
165 170 175
Leu He Gly Ser Arg Phe He Asp Leu Ser Gly Leu Ser Ala Asn Pro
180 185 190
He Lys Phe Gly Lys Lys Phe Gly He Leu Val Lys Lys Ser Thr Gin
195 200 205
He Lys Asp Lys Thr Leu Leu Trp Glu Asp He Ala Glu He Ser Ser
210 215 220
Asn He Thr Leu Leu Lys Gin Gin Met Gly Glu He Leu Tyr Arg He 225 230 235 240
Gly He Ala Tyr Lys Glu Gly Leu Gly Thr Arg Lys Lys Lys Asp Arg
245 250 255
Ala Lys Lys Phe Leu Gin Lys Ser Ala Glu Phe Gly Tyr Glu Lys Ala
260 265 270
Met Glu Ala Leu 275
(2) INFORMATION FOR SEQ ID NO: 727:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2685 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...2601 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 727:
TTG GAT TGC GTA TCT CAA GCC AAA ACT GAA GCT GAG AAA AAA GAA TGC 48 Leu Asp Cys Val Ser Gin Ala Lys Thr Glu Ala Glu Lys Lys Glu Cys 1 5 10 15
GAG AAA TTA CTC ACC CCT GAA GCG AGA AAA CTC TTA GAA GAA GCT AAA 96 Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Glu Ala Lys 20 25 30
GAG AGC GTT AAA GCT TAT AAA GAC TGC GTA TCA AAA GCT AGG AAT GAA 144 Glu Ser Val Lys Ala Tyr Lys Asp Cys Val Ser Lys Ala Arg Asn Glu 35 40 45
AAA GAG AAA AAA GAA TGC GAG AAA TTA CTC ACG CCT GAA GCG AAA AAA 192 Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys Lys 50 55 60
CTT TTA GAG CAA CAA GTG CTA GAT TGT TTG AAA AAC GCT AAA ACC GAA 240 Leu Leu Glu Gin Gin Val Leu Asp Cys Leu Lys Asn Ala Lys Thr Glu 65 70 75 80
GCT GAT AAA AAA AGG TGT GTC AAA GAT CTC CCT AAA GAC TTG CAG AAA 288 Ala Asp Lys Lys Arg Cys Val Lys Asp Leu Pro Lys Asp Leu Gin Lys 85 90 9£
AAG GTT TTA GCT AAA GAG AGC GTT AAG GCT TAT TTG GAC TGC GTA TCA 336 Lys Val Leu Ala Lys Glu Ser Val Lys Ala Tyr Leu Asp Cys Val Ser 100 105 110
AGA GCT AGG AAT GAA AAA GAG AAA AAA GAA TGC GAG AAA TTG CTC ACC 384 Arg Ala Arg Asn Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr 115 120 125
CCT GAA GCG AAA AAA CTT TTA GAA GAA GCC AAA GAG AGT CTT AAA GCT 432 Pro Glu Ala Lys Lys Leu Leu Glu Glu Ala Lys Glu Ser Leu Lys Ala 130 135 140
TAT AAA GAC TGC CTC TCT CAA GCT AGA AAT GAA GAA GAA AGG AGA GCT 480 Tyr Lys Asp Cys Leu Ser Gin Ala Arg Asn Glu Glu Glu Arg Arg Ala 145 150 155 160
TGC GAG AAA CTA CTC ACG CCT GAA GCG AGA AAA CTC TTA GAG CAA GAA 528 Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Gin Glu 165 170 175 GTT AAG AAA AGC ATT AAG GCT TAT TTG GAC TGC GTA TCA AGA GCT AGG 576 Val Lys Lys Ser He Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg 180 185 190
AAT GAA AAA GAG AAA AAA GAA TGC GAG AAA TTA CTC ACG CCT GAA GCG 624 Asn Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala 195 200 205
AGA AAA TTT TTA GCG AAG CAA GTG CTA AAT TGT TTG GAA AAA GCT GGA 672 Arg Lys Phe Leu Ala Lys Gin Val Leu Asn Cys Leu Glu Lys Ala Gly 210 215 220
AAT GAA GAA GAA AGA AAA GCA TGT CTT AAA AAT CTC CCT AAA GAC TTA 720 Asn Glu Glu Glu Arg Lys Ala Cys Leu Lys Asn Leu Pro Lys Asp Leu 225 230 235 240
CAG GAA AAT ATT TTA GCT AAA GAG AGT CTT AAA GCT TAT AAA GAC TGC 768 Gin Glu Asn He Leu Ala Lys Glu Ser Leu Lys Ala Tyr Lys Asp Cys 245 250 255
CTC TCT CAA GCT AGA AAT GAA GAA GAA AGG AGA GCT TGC GAG AAA CTA 816 Leu Ser Gin Ala Arg Asn Glu Glu Glu Arg Arg Ala Cys Glu Lys Leu 260 265 270
CTC ACG CCT GAA GCG AGA AAA CTC TTA GAG CAA GAA GTT AAG AAA AGC 864 Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Gin Glu Val Lys Lys Ser 275 280 285
GTT AAG GCT TAT TTG GAC TGC GTA TCA AGA GCT AGG AAT GAA AAA GAG 912 Val Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg Asn Glu Lys Glu 290 295 300
AAA AAA GAA TGC GAG AAA TTA CTC ACG CCT GAA GCG AGA AAA TTT TTA 960 Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Phe Leu 305 310 315 320
GCG AAA GAA CTC CAA CAA AAA GAT AAA GCG ATC AAA GAT TGC TTG AAA 1008 Ala Lys Glu Leu Gin Gin Lys Asp Lys Ala He Lys Asp Cys Leu Lys 325 330 335
AAC GCC GAT CCT AAC GAC AGA GCG GCT ATC ATG AAG TGT TTG GAT GGT 1056 Asn Ala Asp Pro Asn Asp Arg Ala Ala He Met Lys Cys Leu Asp Gly 340 345 350
TTG AGC GAT GAA GAG AAG CTC AAA TAC CTG CAA GAA GCT AGA GAA AAG 1104 Leu Ser Asp Glu Glu Lys Leu Lys Tyr Leu Gin Glu Ala Arg Glu Lys 355 360 365
GCT GTT GCG GAT TGT TTG GCT ATG GCT AAA ACC GAT GAA GAA AAA AGG 1152 Ala Val Ala Asp Cys Leu Ala Met Ala Lys Thr Asp Glu Glu Lys Arg 370 375 380
AAA TGC CAA AAC CTT TAT AGC GAT TTG ATC CAA GAA ATC CAA AAT AAA 1200 Lys Cys Gin Asn Leu Tyr Ser Asp Leu He Gin Glu He Gin Asn Lys 385 390 395 400 AGG ACA CAA AAC AAA CAA AAT CAA TTG AGT AAA ACA GAA AGG TTG CAT 1248 Arg Thr Gin Asn Lys Gin Asn Gin Leu Ser Lys Thr Glu Arg Leu His 405 410 415
CAA GCA AGC GAG TGC TTG GAT AAC TTA GAT GAC CCT ACT GAT CAA GAG 1296 Gin Ala Ser Glu Cys Leu Asp Asn Leu Asp Asp Pro Thr Asp Gin Glu 420 425 430
GCC ATA GAG CAA TGT TTA GAG GGC TTG AGC GAT AGT GAA AGG GCG CTA 1344 Ala He Glu Gin Cys Leu Glu Gly Leu Ser Asp Ser Glu Arg Ala Leu 435 440 445
ATT CTA GGA ATT AAA CGA CAA GCT GAT GAA GTG GAT CTG ATT TAT AGC 1392 He Leu Gly He Lys Arg Gin Ala Asp Glu Val Asp Leu He Tyr Ser 450 455 460
GAT CTA AGA AAC CGT AAA ACC TTT GAT AAC ATG GCG GCT AAA GGT TAT 1440 Asp Leu Arg Asn Arg Lys Thr Phe Asp Asn Met Ala Ala Lys Gly Tyr 465 470 475 480
CCA TTG TTA CCA ATG GAT TTC AAA AAT GGC GGC GAT ATT GCC ACT ATT 1488 Pro Leu Leu Pro Met Asp Phe Lys Asn Gly Gly Asp He Ala Thr He 485 490 495
AAC GCC ACT AAT GTT GAT GCG GAC AAA ATA GCT AGC GAT AAT CCT ATT 1536 Asn Ala Thr Asn Val Asp Ala Asp Lys He Ala Ser Asp Asn Pro He 500 505 510
TAT GCT TCC ATA GAG CCT GAT ATT GCC AAG CAA TAC GAA ACA GAA AAA 1584 Tyr Ala Ser He Glu Pro Asp He Ala Lys Gin Tyr Glu Thr Glu Lys 515 520 525
ACC ATT AAG GAT AAG AAT TTA GAA GCT AAA TTA GCT AAG GCT TTA GGT 1632 Thr He Lys Asp Lys Asn Leu Glu Ala Lys Leu Ala Lys Ala Leu Gly 530 535 540
GGC AAT AAA AAA GAT GAC GAT AAA GAA AAA AGT AAA AAA TCC ACA GCA 1680 Gly Asn Lys Lys Asp Asp Asp Lys Glu Lys Ser Lys Lys Ser Thr Ala 545 550 555 560
GAA GCT AAA GCA GAA AAC AAT AAG ATA GAC AAA GAT GTC GCA GAA ACT 1728 Glu Ala Lys Ala Glu Asn Asn Lys He Asp Lys Asp Val Ala Glu Thr 565 570 575
GCC AAG AAT ATC AGT GAA ATC GCT CTT AAG AAC AAA AAA GAA AAG AGT 1776 Ala Lys Asn He Ser Glu He Ala Leu Lys Asn Lys Lys Glu Lys Ser 580 585 590
GGG GAA TTT GTA GAT GAA AAT GGT AAT CCC ATT GAT GAC AAA AAG AAA 1824 Gly Glu Phe Val Asp Glu Asn Gly Asn Pro He Asp Asp Lys Lys Lys 595 600 605
GCA GAA AAA CAA GAT GAA ACA AGC CCT GTC AAA CAG GCC TTT ATA GGC 1872 Ala Glu Lys Gin Asp Glu Thr Ser Pro Val Lys Gin Ala Phe He Gly 610 615 620 AAG AGT GAT CCC ACA TTT GTT TTA GCG CAA TAC ACC CCC ATT GAA ATC 1920 Lys Ser Asp Pro Thr Phe Val Leu Ala Gin Tyr Thr Pro He Glu He 625 630 635 640
ACT CTG ACT TCT AAA GTA GAT GCC ACT CTC ACA GGT ATA GTG AGT GGG 1968 Thr Leu Thr Ser Lys Val Asp Ala Thr Leu Thr Gly He Val Ser Gly 645 650 655
GTT GTA GCC AAA GAT GTA TGG AAC ATG AAC GGC ACT ATG ATC TTA TTA 2016 Val Val Ala Lys Asp Val Trp Asn Met Asn Gly Thr Met He Leu Leu 660 665 670
GAC AAA GGC ACT AAG GTG TAT GGG AAT TAT CAA AGC GTG AAA GGT GGC 2064 Asp Lys Gly Thr Lys Val Tyr Gly Asn Tyr Gin Ser Val Lys Gly Gly 675 680 685
ACA CCC ATT ATG ACA CGC TTA ATG ATA GTC TTT ACT AAA GCC ATT ACG 2112 Thr Pro He Met Thr Arg Leu Met He Val Phe Thr Lys Ala He Thr 690 695 700
CCT GAT GGT GTG ATA ATA CCT CTA GCA AAC GCT CAA GCA GCA GGC ATG 2160 Pro Asp Gly Val He He Pro Leu Ala Asn Ala Gin Ala Ala Gly Met 705 710 715 720
TTG GGT GAA GCA GGG GTA GAT GGC TAT GTG AAT AAT CAC TTT ATG AAG 2208 Leu Gly Glu Ala Gly Val Asp Gly Tyr Val Asn Asn His Phe Met Lys 725 730 735
CGC ATA GGC TTT GCT GTG ATA GCA AGC GTG GTT AAT AGC TTC TTG CAA 2256 Arg He Gly Phe Ala Val He Ala Ser Val Val Asn Ser Phe Leu Gin 740 745 750
ACT GCG CCT ATC ATA GCT CTA GAT AAA CTC ATA GGC CTT GGC AAA GGT 2304 Thr Ala Pro He He Ala Leu Asp Lys Leu He Gly Leu Gly Lys Gly 755 760 765
AGA AGT GAA AGG ACA CCT GAA TTT AAT TAC GCT TTG GGT CAA GCT ATC 2352 Arg Ser Glu Arg Thr Pro Glu Phe Asn Tyr Ala Leu Gly Gin Ala He 770 775 780
AAT GGT AGC ATG CAA AGT TCA GCT CAG ATG TCT AAT CAA ATT CTA GGG 2400 Asn Gly Ser Met Gin Ser Ser Ala Gin Met Ser Asn Gin He Leu Gly 785 790 795 800
CAA CTG ATG AAT ATC CCC CCA AGT TTT TAC AAA AAC GAG GGC GAT AGT 2448 Gin Leu Met Asn He Pro Pro Ser Phe Tyr Lys Asn Glu Gly Asp Ser 805 810 815
ATT AAG ATT CTC ACA ATG GAC GAT ATT GAT TTT AGC GGT GTG TAT GAT 2496 He Lys He Leu Thr Met Asp Asp He Asp Phe Ser Gly Val Tyr Asp 820 825 830
GTT AAA ATT ACT AAC AAA TCT GTG GTA GAT GAA ATT ATC AAA CAA AGC 2544 Val Lys He Thr Asn Lys Ser Val Val Asp Glu He He Lys Gin Ser 835 840 845 ACC AAA ACT TTG TCT AGA GAA CAT GAA GAA ATC ACC ACA AGC CCC AAA 2592 Thr Lys Thr Leu Ser Arg Glu His Glu Glu He Thr Thr Ser Pro Lys 850 855 860
GGT GGC AAT TAATTCAAGA GAAAGGATAA AATATATTCA TGTTACTAAA CTCGGTTCT 2650
Gly Gly Asn
865
TTACAAAATA AAAGACAAAA CCAACAACAG GCTCT 2685
(2) INFORMATION FOR SEQ ID NO: 728:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 867 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 728:
Leu Asp Cys Val Ser Gin Ala Lys Thr Glu Ala Glu Lys Lys Glu Cys
1 5 10 15
Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Glu Ala Lys
20 25 30
Glu Ser Val Lys Ala Tyr Lys Asp Cys Val Ser Lys Ala Arg Asn Glu
35 40 45
Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys Lys
50 55 60
Leu Leu Glu Gin Gin Val Leu Asp Cys Leu Lys Asn Ala Lys Thr Glu 65 70 75 80
Ala Asp Lys Lys Arg Cys Val Lys Asp Leu Pro Lys Asp Leu Gin Lys
85 90 95
Lys Val Leu Ala Lys Glu Ser Val Lys Ala Tyr Leu Asp Cys Val Ser
100 105 110
Arg Ala Arg Asn Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr
115 120 125
Pro Glu Ala Lys Lys Leu Leu Glu Glu Ala Lys Glu Ser Leu Lys Ala
130 135 140
Tyr Lys Asp Cys Leu Ser Gin Ala Arg Asn Glu Glu Glu Arg Arg Ala 145 150 155 160
Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Gin Glu
165 170 175
Val Lys Lys Ser He Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg
180 185 190
Asn Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala
195 200 205
Arg Lys Phe Leu Ala Lys Gin Val Leu Asn Cys Leu Glu Lys Ala Gly
210 215 220
Asn Glu Glu Glu Arg Lys Ala Cys Leu Lys Asn Leu Pro Lys Asp Leu 225 230 235 240
Gin Glu Asn He Leu Ala Lys Glu Ser Leu Lys Ala Tyr Lys Asp Cys 245 250 255 Leu Ser Gin Ala Arg Asn Glu Glu Glu Arg Arg Ala Cys Glu Lys Leu
260 265 270
Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Gin Glu Val Lys Lys Ser
275 280 285
Val Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg Asn Glu Lys Glu
290 295 300
Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Phe Leu 305 310 315 320
Ala Lys Glu Leu Gin Gin Lys Asp Lys Ala He Lys Asp Cys Leu Lys
325 330 335
Asn Ala Asp Pro Asn Asp Arg Ala Ala He Met Lys Cys Leu Asp Gly
340 345 350
Leu Ser Asp Glu Glu Lys Leu Lys Tyr Leu Gin Glu Ala Arg Glu Lys
355 360 365
Ala Val Ala Asp Cys Leu Ala Met Ala Lys Thr Asp Glu Glu Lys Arg
370 375 380
Lys Cys Gin Asn Leu Tyr Ser Asp Leu He Gin Glu He Gin Asn Lys 385 390 395 400
Arg Thr Gin Asn Lys Gin Asn Gin Leu Ser Lys Thr Glu Arg Leu His
405 410 415
Gin Ala Ser Glu Cys Leu Asp Asn Leu Asp Asp Pro Thr Asp Gin Glu
420 425 430
Ala He Glu Gin Cys Leu Glu Gly Leu Ser Asp Ser Glu Arg Ala Leu
435 440 445
He Leu Gly He Lys Arg Gin Ala Asp Glu Val Asp Leu He Tyr Ser
450 455 460
Asp Leu Arg Asn Arg Lys Thr Phe Asp Asn Met Ala Ala Lys Gly Tyr 465 470 475 480
Pro Leu Leu Pro Met Asp Phe Lys Asn Gly Gly Asp He Ala Thr He
485 490 495
Asn Ala Thr Asn Val Asp Ala Asp Lys He Ala Ser Asp Asn Pro He
500 505 510
Tyr Ala Ser He Glu Pro Asp He Ala Lys Gin Tyr Glu Thr Glu Lys
515 520 525
Thr He Lys Asp Lys Asn Leu Glu Ala Lys Leu Ala Lys Ala Leu Gly
530 535 540
Gly Asn Lys Lys Asp Asp Asp Lys Glu Lys Ser Lys Lys Ser Thr Ala 545 550 555 560
Glu Ala Lys Ala Glu Asn Asn Lys He Asp Lys Asp Val Ala Glu Thr
565 570 575
Ala Lys Asn He Ser Glu He Ala Leu Lys Asn Lys Lys Glu Lys Ser
580 585 590
Gly Glu Phe Val Asp Glu Asn Gly Asn Pro He Asp Asp Lys Lys Lys
595 600 605
Ala Glu Lys Gin Asp Glu Thr Ser Pro Val Lys Gin Ala Phe He Gly
610 615 620
Lys Ser Asp Pro Thr Phe Val Leu Ala Gin Tyr Thr Pro He Glu He 625 630 635 640
Thr Leu Thr Ser Lys Val Asp Ala Thr Leu Thr Gly He Val Ser Gly
645 650 655
Val Val Ala Lys Asp Val Trp Asn Met Asn Gly Thr Met He Leu Leu
660 665 670
Asp Lys Gly Thr Lys Val Tyr Gly Asn Tyr Gin Ser Val Lys Gly Gly
675 680 685
Thr Pro He Met Thr Arg Leu Met He Val Phe Thr Lys Ala He Thr 690 695 700
Pro Asp Gly Val He He Pro Leu Ala Asn Ala Gin Ala Ala Gly Met 705 710 715 720
Leu Gly Glu Ala Gly Val Asp Gly Tyr Val Asn Asn His Phe Met Lys
725 730 735
Arg He Gly Phe Ala Val He Ala Ser Val Val Asn Ser Phe Leu Gin
740 745 750
Thr Ala Pro He He Ala Leu Asp Lys Leu He Gly Leu Gly Lys Gly
755 760 765
Arg Ser Glu Arg Thr Pro Glu Phe Asn Tyr Ala Leu Gly Gin Ala He
770 775 780
Asn Gly Ser Met Gin Ser Ser Ala Gin Met Ser Asn Gin He Leu Gly 785 790 795 800
Gin Leu Met Asn He Pro Pro Ser Phe Tyr Lys Asn Glu Gly Asp Ser
805 810 815
He Lys He Leu Thr Met Asp Asp He Asp Phe Ser Gly Val Tyr Asp
820 825 830
Val Lys He Thr Asn Lys Ser Val Val Asp Glu He He Lys Gin Ser
835 840 845
Thr Lys Thr Leu Ser Arg Glu His Glu Glu He Thr Thr Ser Pro Lys
850 855 860
Gly Gly Asn 865
(2) INFORMATION FOR SEQ ID NO: 729:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 877 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 65...688 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 729:
TTTGGTGGCA CTAATGGTGT TGTGATTTTC AAAAAAGCCT AGTTTTACAA AGTTAGGATT 60 TTGA ATG GCC GTT TAT TTA GAT TTT GAA AAT CAT ATT AAA GAG ATT CAA 109 Met Ala Val Tyr Leu Asp Phe Glu Asn His He Lys Glu He Gin 1 5 10 15
AAT GAA ATT GAA TTA GCC CTT ATT AGA GGC GAT GAG GAC GCT AAA GAA 157 Asn Glu He Glu Leu Ala Leu He Arg Gly Asp Glu Asp Ala Lys Glu 20 25 30
ATC TTA GAA AAA AGA TTG GAT AAG GAG GTT AAA AGC ATT TAT TCC AAT 205 He Leu Glu Lys Arg Leu Asp Lys Glu Val Lys Ser He Tyr Ser Asn 35 40 45
CTC ACT GAT TTT CAA AAA CTC CAA TTA GCA AGA CAC CCT GAC AGA CCC 253 Leu Thr Asp Phe Gin Lys Leu Gin Leu Ala Arg His Pro Asp Arg Pro 50 55 60
TAC GCT ATG GAT TAC ATT GAT CTC ATC TTA AAA GAT AAA TAT GAA GTC 301 Tyr Ala Met Asp Tyr He Asp Leu He Leu Lys Asp Lys Tyr Glu Val 65 70 75
TTT GGG GAT AGG CAT TAT AAC GAT GAT AAA GCG ATC GTG TGC TTT GTA 349 Phe Gly Asp Arg His Tyr Asn Asp Asp Lys Ala He Val Cys Phe Val 80 85 90 95
GGG AAA ATT GAT AAT GTC CCA GTT GTG GTG ATC GGA GAA GAA AAG GGC 397 Gly Lys He Asp Asn Val Pro Val Val Val He Gly Glu Glu Lys Gly 100 105 110
AGA GGG ACT AAA AAC AAA CTC TTA AGA AAT TTT GGC ATG CCT AAC CCT 445 Arg Gly Thr Lys Asn Lys Leu Leu Arg Asn Phe Gly Met Pro Asn Pro 115 120 125
TGT GGC TAT CGT AAG GCT TTG AAA ATG GCA AAG TTT GCT GAA AAG TTT 493 Cys Gly Tyr Arg Lys Ala Leu Lys Met Ala Lys Phe Ala Glu Lys Phe 130 135 140
AAT TTG CCT ATT TTA ATG CTT GTG GAT ACA GCC GGG GCG TAT CCG GGG 541 Asn Leu Pro He Leu Met Leu Val Asp Thr Ala Gly Ala Tyr Pro Gly 145 150 155
ATT GGT GCA GAA GAA AGG GGG CAA AGT GAA GCG ATC GCT AAA AAT CTC 589 He Gly Ala Glu Glu Arg Gly Gin Ser Glu Ala He Ala Lys Asn Leu 160 165 170 175
CAA GAG TTC GCC TCT TTA AAA GTC CCT ACT ATT TCT GTA ATT ATC GGT 637 Gin Glu Phe Ala Ser Leu Lys Val Pro Thr He Ser Val He He Gly 180 185 190
GAG GGG GGC AGT GGT GGT GCG CTA CGA TTG CAG TGG CTG ACA AAT TGG 685 Glu Gly Gly Ser Gly Gly Ala Leu Arg Leu Gin Trp Leu Thr Asn Trp 195 200 205
CTA TGATGGAATA TTCCATTTTT AGCGTTATAT CCCCAGAAGG TTGTGCGGCG ATTCTT 744 Leu
TGGGATGACC CTAGCAAGAC TGAAGTGGCT ATTAAAGCGA TGAAAATCAC GCCTAGAGAC 804 TTAAAGGAGG CGGGGCTTAT TGATGATATT ATCTTAGAGC CTAGCAAAGG GGCTCATAGA 864 GACAAATTTT CAG 877
(2) INFORMATION FOR SEQ ID NO: 730:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 208 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 730:
Met Ala Val Tyr Leu Asp Phe Glu Asn His He Lys Glu He Gin Asn
1 5 10 15
Glu He Glu Leu Ala Leu He Arg Gly Asp Glu Asp Ala Lys Glu He
20 25 30
Leu Glu Lys Arg Leu Asp Lys Glu Val Lys Ser He Tyr Ser Asn Leu
35 40 45
Thr Asp Phe Gin Lys Leu Gin Leu Ala Arg His Pro Asp Arg Pro Tyr
50 55 60
Ala Met Asp Tyr He Asp Leu He Leu Lys Asp Lys Tyr Glu Val Phe 65 70 75 80
Gly Asp Arg His Tyr Asn Asp Asp Lys Ala He Val Cys Phe Val Gly
85 90 95
Lys He Asp Asn Val Pro Val Val Val He Gly Glu Glu Lys Gly Arg
100 105 110
Gly Thr Lys Asn Lys Leu Leu Arg Asn Phe Gly Met Pro Asn Pro Cys
115 120 125
Gly Tyr Arg Lys Ala Leu Lys Met Ala Lys Phe Ala Glu Lys Phe Asn
130 135 140
Leu Pro He Leu Met Leu Val Asp Thr Ala Gly Ala Tyr Pro Gly He 145 150 155 160
Gly Ala Glu Glu Arg Gly Gin Ser Glu Ala He Ala Lys Asn Leu Gin
165 170 175
Glu Phe Ala Ser Leu Lys Val Pro Thr He Ser Val He He Gly Glu
180 185 190
Gly Gly Ser Gly Gly Ala Leu Arg Leu Gin Trp Leu Thr Asn Trp Leu 195 200 205
(2) INFORMATION FOR SEQ ID NO: 731:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 804 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 67...744 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731: AAGGCGTTAT GAGTCAAGAC TATAATAGAC TTTAAGAAAA ATTTAAAAAT TAAGGATTAT 60
TGAATA ATG CAA TTC ACA GGG AAA AAT GTT CTC ATT ACT GGG GCT TCT 108 Met Gin Phe Thr Gly Lys Asn Val Leu He Thr Gly Ala Ser 1 5 10
AAA GGC ATT GGG GCT GAA ATC GCC AAA ACT CTC GCT TCT ATG GGG CTG 156 Lys Gly He Gly Ala Glu He Ala Lys Thr Leu Ala Ser Met Gly Leu 15 20 25 30
AAA GTT TGG ATC AAT TAC CGC AGT AAT GCT GAA GTG GCT GAC GCT TTG 204 Lys Val Trp He Asn Tyr Arg Ser Asn Ala Glu Val Ala Asp Ala Leu 35 40 45
AAA AAT GAG CTT GAA GAA AAA GGC TAT AAG GCA GCT GTC ATT AAA TTT 252 Lys Asn Glu Leu Glu Glu Lys Gly Tyr Lys Ala Ala Val He Lys Phe 50 55 60
GAT GCG GCT TCT GAA AGC GAT TTT ATT GAA GCG ATA CAA ACC ATC GTC 300 Asp Ala Ala Ser Glu Ser Asp Phe He Glu Ala He Gin Thr He Val 65 70 75
CAA AGC GAT GGG GGG TTG TCT TAC TTG GTG AAT AAC GCC GGT GTG GTG 348 Gin Ser Asp Gly Gly Leu Ser Tyr Leu Val Asn Asn Ala Gly Val Val 80 85 90
CGC GAT AAA TTA GCG ATC AAA ATG AAA ACA GAA GAC TTT CAC CAT GTC 396 Arg Asp Lys Leu Ala He Lys Met Lys Thr Glu Asp Phe His His Val 95 100 105 110
ATA GAC AAT AAC CTC ACT TCA GCC TTT ATA GGT TGC CGA GAG GCT TTA 444 He Asp Asn Asn Leu Thr Ser Ala Phe He Gly Cys Arg Glu Ala Leu 115 120 125
AAG GTG ATG AGC AAG AGT CGT TTT GGG AGC GTG GTC AAT GTC GCT TCT 492 Lys Val Met Ser Lys Ser Arg Phe Gly Ser Val Val Asn Val Ala Ser 130 135 140
ATC ATT GGT GAA AGA GGC AAT ATG GGG CAG ACA AAC TAC TCA GCG AGT 540 He He Gly Glu Arg Gly Asn Met Gly Gin Thr Asn Tyr Ser Ala Ser 145 150 155
AAG GGG GGA ATG ATT GCA ATG AGC AAG TCC TTT GCT TAT GAG GGA GCT 588 Lys Gly Gly Met He Ala Met Ser Lys Ser Phe Ala Tyr Glu Gly Ala 160 165 170
TTA AGG AAT ATT CGT TTC AAC TCT GTA ACG CCC GGT TTT ATA GAA ACC 636 Leu Arg Asn He Arg Phe Asn Ser Val Thr Pro Gly Phe He Glu Thr 175 180 185 190
GAC ATG AAC GCC AAT TTG AAA GAC GAA CTC AAA GCG GAT TAT GTT AAA 684 Asp Met Asn Ala Asn Leu Lys Asp Glu Leu Lys Ala Asp Tyr Val Lys 195 200 205
AAC ATT CCT TTA AAC AGG CTA GGG TCT GCT AAG GAA GTG GCA GAA GCG 732 Asn He Pro Leu Asn Arg Leu Gly Ser Ala Lys Glu Val Ala Glu Ala 210 215 220
GTA GGN TTC TTT TGAGTGATCA CTCTAGTTAC ATCACTGGAG AGACTCTCAA AGTCA 789 Val Xaa Phe Phe 225
ATGGCGGGCT TTATA 804
(2) INFORMATION FOR SEQ ID NO: 732:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 226 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 732:
Met Gin Phe Thr Gly Lys Asn Val Leu He Thr Gly Ala Ser Lys Gly
1 5 10 15
He Gly Ala Glu He Ala Lys Thr Leu Ala Ser Met Gly Leu Lys Val
20 25 30
Trp He Asn Tyr Arg Ser Asn Ala Glu Val Ala Asp Ala Leu Lys Asn
35 40 45
Glu Leu Glu Glu Lys Gly Tyr Lys Ala Ala Val He Lys Phe Asp Ala
50 55 60
Ala Ser Glu Ser Asp Phe He Glu Ala He Gin Thr_ He Val Gin Ser 65 70 75 80
Asp Gly Gly Leu Ser Tyr Leu Val Asn Asn Ala Gly Val Val Arg Asp
85 90 95
Lys Leu Ala He Lys Met Lys Thr Glu Asp Phe His His Val He Asp
100 105 110
Asn Asn Leu Thr Ser Ala Phe He Gly Cys Arg Glu Ala Leu Lys Val
115 120 125
Met Ser Lys Ser Arg Phe Gly Ser Val Val Asn Val Ala Ser He He
130 135 140
Gly Glu Arg Gly Asn Met Gly Gin Thr Asn Tyr Ser Ala Ser Lys Gly 145 150 155 160
Gly Met He Ala Met Ser Lys Ser Phe Ala Tyr Glu Gly Ala Leu Arg
165 170 175
Asn He Arg Phe Asn Ser Val Thr Pro Gly Phe He Glu Thr Asp Met
180 185 190
Asn Ala Asn Leu Lys Asp Glu Leu Lys Ala Asp Tyr Val Lys Asn He
195 200 205
Pro Leu Asn Arg Leu Gly Ser Ala Lys Glu Val Ala Glu Ala Val Xaa
210 215 220
Phe Phe 225
(2) INFORMATION FOR SEQ ID NO: 733:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 373 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 103...312 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:733:
GAAAATTCTT GACACCCAGT TAATTCGTCT TATATAATAC TATTTCATTT GTAGCCATTG 60 TTAAGCAATG ATTTTAAGCT ATGGAAAAGA GGTGATAGCA GT ATG CCA GGG ATT 114
Met Pro Gly He
1
AAG GTT AGA GAA GGC GAT GCG TTT GAT GAA GCT TAT AGG AGA TTC AAA 162 Lys Val Arg Glu Gly Asp Ala Phe Asp Glu Ala Tyr Arg Arg Phe Lys 5 10 15 20
AAG CAA ACC GAT CGC AAT TTA GTG GTA ACA GAA TGC CGT GCT AGA AGG 210 Lys Gin Thr Asp Arg Asn Leu Val Val Thr Glu Cys Arg Ala Arg Arg 25 30 35
TTC TTT GAG TCT AAG ACT GAA AAA CGC AAA AAA CAA AAA ATC AGC GCT 258 Phe Phe Glu Ser Lys Thr Glu Lys Arg Lys Lys Gin Lys He Ser Ala 40 45 50
AAA AAG AAG GTT TTG AAG CGT CTT TAT ATG TTA AGG CGT TAT GAG TCA 306 Lys Lys Lys Val Leu Lys Arg Leu Tyr Met Leu Arg Arg Tyr Glu Ser 55 60 65
AGA CTA TAATAGACTT TAAGAAAAAT TTAAAAATTA AGGATTATTG AATAATGCAA TT 364 Arg Leu 70
CACAGGGAA 373
(2) INFORMATION FOR SEQ ID NO: 734:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 70 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 734:
Met Pro Gly He Lys Val Arg Glu Gly Asp Ala Phe Asp Glu Ala Tyr
1 5 10 15
Arg Arg Phe Lys Lys Gin Thr Asp Arg Asn Leu Val Val Thr Glu Cys
20 25 30
Arg Ala Arg Arg Phe Phe Glu Ser Lys Thr Glu Lys Arg Lys Lys Gin
35 40 45
Lys He Ser Ala Lys Lys Lys Val Leu Lys Arg Leu Tyr Met Leu Arg
50 55 60
Arg Tyr Glu Ser Arg Leu 65 70
(2) INFORMATION FOR SEQ ID NO: 735
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1613 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 64...1551 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 735:
TAAGAAAAAC CGCTAGAGTG CAATACAATT CTTGAAAGAT ATGAAATTAA AAAAGGAGAC 60
TTT ATG TTA AAA ATC AAA TTA GAA AAA ACC ACT TTT GAA AAC GCA AAA 108 Met Leu Lys He Lys Leu Glu Lys Thr Thr Phe Glu Asn Ala Lys 1 5 10 15
GCT GAA TGC AGT TTA GTT TTT ATT ATC AAT AAG GAT TTT AGC CAC GCT 156 Ala Glu Cys Ser Leu Val Phe He He Asn Lys Asp Phe Ser His Ala 20 25 30
TGG GTC AAA AAT AAA GAG TTG CTA GAA ACC TTT AAA TAC GAA GGC GAA 204 Trp Val Lys Asn Lys Glu Leu Leu Glu Thr Phe Lys Tyr Glu Gly Glu 35 40 45
GGC GTA TTT TTA GAC CAA GAA AAT AAA ATC CTG TAT GCG GGC GTT AAA 252 Gly Val Phe Leu Asp Gin Glu Asn Lys He Leu Tyr Ala Gly Val Lys 50 55 60
GAA GAT GAT GTG CAT TTA TTG AGA GAG AGC GCG TGT TTA GCC GTT CGC 300 Glu Asp Asp Val His Leu Leu Arg Glu Ser Ala Cys Leu Ala Val Arg 65 70 75
ACC CTT AAA AAA CTC GCT TTT AAA AGC GTT AAA GTG GGC GTT TAT ACT 348 Thr Leu Lys Lys Leu Ala Phe Lys Ser Val Lys Val Gly Val Tyr Thr 80 85 90 95
TGT GGT GCA CAT TCT AAA GAT AAC GCG CTT TTA GAA AAC TTG AAA GCG 396 Cys Gly Ala His Ser Lys Asp Asn Ala Leu Leu Glu Asn Leu Lys Ala 100 105 110
CTG TTT TTG GGC TTG AAA TTA GGT TTG TAT GAA TAC GAC ACT TTT AAA 444 Leu Phe Leu Gly Leu Lys Leu Gly Leu Tyr Glu Tyr Asp Thr Phe Lys 115 120 125
TCC AAC AAA AAA GAA AGC GTT TTA AAA GAA GCC ATT GTC GCT TTA GAA 492 Ser Asn Lys Lys Glu Ser Val Leu Lys Glu Ala He Val Ala Leu Glu 130 135 140
TTG CAC AAA CCT TGC GAA AAA ACT TGC GCA AAT TCT TTA GAA AAG AGT 540 Leu His Lys Pro Cys Glu Lys Thr Cys Ala Asn Ser Leu Glu Lys Ser 145 150 155
GCT AAA GAA GCG TTA AAA TAC GCT GAA ATC ATG ACA GAA AGC TTG AAT 588 Ala Lys Glu Ala Leu Lys Tyr Ala Glu He Met Thr Glu Ser Leu Asn 160 165 170 175
ATC GTT AAA GAT CTA GTC AAT ACC CCC CCT ATG ATT GGC ACT CCG GTT 636 He Val Lys Asp Leu Val Asn Thr Pro Pro Met He Gly Thr Pro Val 180 185 190
TAT ATG GCT GAA GTG GCG CAA AAA GTG GCT AAA GAA AAC CAT TTA GAA 684 Tyr Met Ala Glu Val Ala Gin Lys Val Ala Lys Glu Asn His Leu Glu 195 200 205
ATC CAT GTT CAT GAT GAA AAA TTT TTA GAA GAA AAG AAA ATG AAC GCC 732 He His Val His Asp Glu Lys Phe Leu Glu Glu Lys Lys Met Asn Ala 210 215 220
TTT TTA GCG GTC AAT AAA GCC TCT CTT AGC GTC AAT CCT CCT CGC TTG 780 Phe Leu Ala Val Asn Lys Ala Ser Leu Ser Val Asn Pro Pro Arg Leu 225 230 235
ATC CAT TTA GTC TAT AAG CCT AAA AAA GCG AAG AAA AAA ATC GCT TTA 828 He His Leu Val Tyr Lys Pro Lys Lys Ala Lys Lys Lys He Ala Leu 240 245 250 255
GTG GGT AAG GGC TTG ACT TAT GAT TGT GGG GGT TTG AGC TTG AAA CCG 876 Val Gly Lys Gly Leu Thr Tyr Asp Cys Gly Gly Leu Ser Leu Lys Pro 260 265 270
GCC GAT TAC ATG GTT ACT ATG AAA GCG GAT AAA GGC GGT GGC TCT GCG 924 Ala Asp Tyr Met Val Thr Met Lys Ala Asp Lys Gly Gly Gly Ser Ala 275 280 285
GTG ATT GGG CTT TTA AAC GCA TTA GCC AAA CTA GGC GTG GAG GCT GAA 972 Val He Gly Leu Leu Asn Ala Leu Ala Lys Leu Gly Val Glu Ala Glu 290 295 300 GTG CAT GGC ATT ATT GGG GCT ACA GAA AAC ATG ATA GGC CCA GCC GCT 1020 Val His Gly He He Gly Ala Thr Glu Asn Met He Gly Pro Ala Ala 305 310 315
TAT AAA CCA GAT GAT ATT TTG ATC TCC AAA GAA GGC AAG AGC ATA GAG 1068 Tyr Lys Pro Asp Asp He Leu He Ser Lys Glu Gly Lys Ser He Glu 320 325 330 335
GTC CGT AAT ACC GAC GCT GAG GGG CGT TTG GTT TTA GCG GAT TGT TTG 1116 Val Arg Asn Thr Asp Ala Glu Gly Arg Leu Val Leu Ala Asp Cys Leu 340 345 350
AGC TAC GCT CAA GAT TTA AAC CCT GAT GTG ATC GTG GAT TTT GCG ACC 1164 Ser Tyr Ala Gin Asp Leu Asn Pro Asp Val He Val Asp Phe Ala Thr 355 360 365
CTT ACT GGG GCA TGC GTT GTA GGC TTA GGC GAA TTC ACT TCA GCG ATC 1212 Leu Thr Gly Ala Cys Val Val Gly Leu Gly Glu Phe Thr Ser Ala He 370 375 380
ATG GGG CAT AAT GAA GAG TTA AAA AAC CTC TTT GAA ACT TCA GGG TTA 1260 Met Gly His Asn Glu Glu Leu Lys Asn Leu Phe Glu Thr Ser Gly Leu 385 390 395
GAA TCC GGC GAA TTA TTA GCC AAA CTC CCC TTT AAC CGC CAT TTA AAG 1308 Glu Ser Gly Glu Leu Leu Ala Lys Leu Pro Phe Asn Arg His Leu Lys 400 405 410 415
AAA TTG ATT GAA TCT AAA ATC GCT GAT GTG TGC AAT ATT TCT TCT TCA 1356 Lys Leu He Glu Ser Lys He Ala Asp Val Cys Asn He Ser Ser Ser 420 425 430
CGC TAT GGC GGT GCG ATC ACA GCG GGC TTG TTT TTA AAT GAA TTT ATT 1404 Arg Tyr Gly Gly Ala He Thr Ala Gly Leu Phe Leu Asn Glu Phe He 435 440 445
AGA GAT GAG TTT AAG GAT AAG TGG CTA CAC ATT GAC ATT GCA GGC CCT 1452 Arg Asp Glu Phe Lys Asp Lys Trp Leu His He Asp He Ala Gly Pro 450 455 460
GCT TAT GTG GAA AAA GAA TGG GAT GTG AAT AGC TTT GGA GCG AGT GGG 1500 Ala Tyr Val Glu Lys Glu Trp Asp Val Asn Ser Phe Gly Ala Ser Gly 465 470 475
GCT GGC GTG AGA GCT TGC ACA GCT TTT GTG GAA GAG CTT TTG AAA AAG 1548 Ala Gly Val Arg Ala Cys Thr Ala Phe Val Glu Glu Leu Leu Lys Lys 480 485 490 495
GCT TGAAATGGGC TTGTCTGTAG GCATTGTGGG TTTGCCTAAT GTGGGCAAAT CCAGCA 1607 Ala
CCTTTA 1613
(2) INFORMATION FOR SEQ ID NO: 736: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 496 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 736:
Met Leu Lys He Lys Leu Glu Lys Thr Thr Phe Glu Asn Ala Lys Ala
1 5 10 15
Glu Cys Ser Leu Val Phe He He Asn Lys Asp Phe Ser His Ala Trp
20 25 30
Val Lys Asn Lys Glu Leu Leu Glu Thr Phe Lys Tyr Glu Gly Glu Gly
35 40 45
Val Phe Leu Asp Gin Glu Asn Lys He Leu Tyr Ala Gly Val Lys Glu
50 55 60
Asp Asp Val His Leu Leu Arg Glu Ser Ala Cys Leu Ala Val Arg Thr 65 70 75 80
Leu Lys Lys Leu Ala Phe Lys Ser Val Lys Val Gly Val Tyr Thr Cys
85 90 95
Gly Ala His Ser Lys Asp Asn Ala Leu Leu Glu Asn Leu Lys Ala Leu
100 105 110
Phe Leu Gly Leu Lys Leu Gly Leu Tyr Glu Tyr Asp Thr Phe Lys Ser
115 120 125
Asn Lys Lys Glu Ser Val Leu Lys Glu Ala He Val Ala Leu Glu Leu
130 135 140
His Lys Pro Cys Glu Lys Thr Cys Ala Asn Ser Leu Glu Lys Ser Ala 145 150 155 160
Lys Glu Ala Leu Lys Tyr Ala Glu He Met Thr Glu Ser Leu Asn He
165 170 175
Val Lys Asp Leu Val Asn Thr Pro Pro Met He Gly Thr Pro Val Tyr
180 185 190
Met Ala Glu Val Ala Gin Lys Val Ala Lys Glu Asn His Leu Glu He
195 200 205
His Val His Asp Glu Lys Phe Leu Glu Glu Lys Lys Met Asn Ala Phe
210 215 220
Leu Ala Val Asn Lys Ala Ser Leu Ser Val Asn Pro Pro Arg Leu He 225 230 235 240
His Leu Val Tyr Lys Pro Lys Lys Ala Lys Lys Lys He Ala Leu Val
245 250 255
Gly Lys Gly Leu Thr Tyr Asp Cys Gly Gly Leu Ser Leu Lys Pro Ala
260 265 270
Asp Tyr Met Val Thr Met Lys Ala Asp Lys Gly Gly Gly Ser Ala Val
275 280 285
He Gly Leu Leu Asn Ala Leu Ala Lys Leu Gly Val Glu Ala Glu Val
290 295 300
His Gly He He Gly Ala Thr Glu Asn Met He Gly Pro Ala Ala Tyr 305 310 315 320
Lys Pro Asp Asp He Leu He Ser Lys Glu Gly Lys Ser He Glu Val
325 330 335
Arg Asn Thr Asp Ala Glu Gly Arg Leu Val Leu Ala Asp Cys Leu Ser
340 345 350
Tyr Ala Gin Asp Leu Asn Pro Asp Val He Val Asp Phe Ala Thr Leu 355 360 365
Thr Gly Ala Cys Val Val Gly Leu Gly Glu Phe Thr Ser Ala He Met
370 375 380
Gly His Asn Glu Glu Leu Lys Asn Leu Phe Glu Thr Ser Gly Leu Glu 385 390 395 400
Ser Gly Glu Leu Leu Ala Lys Leu Pro Phe Asn Arg His Leu Lys Lys
405 410 415
Leu He Glu Ser Lys He Ala Asp Val Cys Asn He Ser Ser Ser Arg
420 425 430
Tyr Gly Gly Ala He Thr Ala Gly Leu Phe Leu Asn Glu Phe He Arg
435 440 445
Asp Glu Phe Lys Asp Lys Trp Leu His He Asp He Ala Gly Pro Ala
450 455 460
Tyr Val Glu Lys Glu Trp Asp Val Asn Ser Phe Gly Ala Ser Gly Ala 465 470 475 480
Gly Val Arg Ala Cys Thr Ala Phe Val Glu Glu Leu Leu Lys Lys Ala 485 490 495
(2) INFORMATION FOR SEQ ID NO: 737:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 560 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...492 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 737:
GGCGAAATCG GGTTAATTTT AGCAGGGATT GCCAGCTATA CCGGTCAT ATG CAT TTA 57
Met His Leu 1
GGG TTA GCC ATT TTA GTC GCA GGG ATT GGG GGC TTT GTG GGG GAT CAG 105 Gly Leu Ala He Leu Val Ala Gly He Gly Gly Phe Val Gly Asp Gin 5 10 15
ATC TAT TTT TAC ATC GGC CGC ACC AAT AAA GCT TAC ATC CAA AAA AAG 153 He Tyr Phe Tyr He Gly Arg Thr Asn Lys Ala Tyr He Gin Lys Lys 20 25 30 35
CTA GAA AAA CAA CGC CGA AAA CTA GCC CTA GCC CAT TTA TTG TTG CAA 201 Leu Glu Lys Gin Arg Arg Lys Leu Ala Leu Ala His Leu Leu Leu Gin 40 45 50
AAA CAC GGC TGG TTT ATC ATT TTT ATC CAA CGC TAT ATG TAT GGC ATG 249 Lys His Gly Trp Phe He He Phe He Gin Arg Tyr Met Tyr Gly Met 55 60 65
CGC ACC ATC ATT CCC ATT AGC ATA GGT CTC ACG CGT TAT AGC GCT TTA 297
Arg Thr He He Pro He Ser He Gly Leu Thr Arg Tyr Ser Ala Leu 70 75 80
AAA TTC GCT ATC ATC AAT CTC ATT AGC GCG ATG GTG TGG GCG AGC ATT 345
Lys Phe Ala He He Asn Leu He Ser Ala Met Val Trp Ala Ser He
85 90 95
ACC ATT ATT CTA GCG TGG TAT TTA GGA GAA GAG TTA TTG CAT GCG TTA 393
Thr He He Leu Ala Trp Tyr Leu Gly Glu Glu Leu Leu His Ala Leu 100 105 110 115
GGG TGG CTT AAA AAA CAC CCT TAT GCG CTA ATA TTA CTA TTA GTA TCT 441
Gly Trp Leu Lys Lys His Pro Tyr Ala Leu He Leu Leu Leu Val Ser 120 125 130
TTC TTG GCG TTA GTG CTG TGG TAT TTC CAA TAC TAT AGT AAG AAA AAC 489
Phe Leu Ala Leu Val Leu Trp Tyr Phe Gin Tyr Tyr Ser Lys Lys Asn 135 140 145
CGC TAGAGTGCAA TACAATTCTT GAAAGATATG AAATTAAAAA AGGAGACTTT ATGTTA 548 Arg
AAAATCAAAT TA 560
(2) INFORMATION FOR SEQ ID NO: 738:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 148 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 738:
Met His Leu Gly Leu Ala He Leu Val Ala Gly He Gly Gly Phe Val
1 5 10 15
Gly Asp Gin He Tyr Phe Tyr He Gly Arg Thr Asn Lys Ala Tyr He
20 25 30
Gin Lys Lys Leu Glu Lys Gin Arg Arg Lys Leu Ala Leu Ala His Leu
35 40 45
Leu Leu Gin Lys His Gly Trp Phe He He Phe He Gin Arg Tyr Met
50 55 60
Tyr Gly Met Arg Thr He He Pro He Ser He Gly Leu Thr Arg Tyr
65 70 75 80
Ser Ala Leu Lys Phe Ala He He Asn Leu He Ser Ala Met Val Trp
85 90 95
Ala Ser He Thr He He Leu Ala Trp Tyr Leu Gly Glu Glu Leu Leu 100 105 110 His Ala Leu Gly Trp Leu Lys Lys His Pro Tyr Ala Leu He Leu Leu
115 120 125
Leu Val Ser Phe Leu Ala Leu Val Leu Trp Tyr Phe Gin Tyr Tyr Ser
130 135 140
Lys Lys Asn Arg 145
(2) INFORMATION FOR SEQ ID NO: 739:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 609 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 61...600 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 739:
TAAAAACGCT ATAATAAATC AAAATTCTAC AACCAATCCG TTATATTAAA GGAAATCAAA 60
ATG AAT GAA ACG CTC AAA GAA GAA CTT TTA CAA AGC ATC AGA GAA GTG 108 Met Asn Glu Thr Leu Lys Glu Glu Leu Leu Gin Ser He Arg Glu Val 1 5 10 15
AAA GAC TAC CCT AAA AAA GGG ATT TTA TTC AAA GAC ATT ACC ACG CTA 156 Lys Asp Tyr Pro Lys Lys Gly He Leu Phe Lys Asp He Thr Thr Leu 20 25 30
CTC AAC TAC CCT AAA CTC TTT AAC AAA CTC ATT GAC ACG CTC AAA AAA 204 Leu Asn Tyr Pro Lys Leu Phe Asn Lys Leu He Asp Thr Leu Lys Lys 35 40 45
CGC TAT CTC GCT CTC AAT ATA GAC TTT ATC GTG GGC ATT GAA GCG AGA 252 Arg Tyr Leu Ala Leu Asn He Asp Phe He Val Gly He Glu Ala Arg 50 55 60
GGG TTT ATT TTA GGC TCT GCT CTC GCT TAT GCG CTT GGG GTG GGT TTT 300 Gly Phe He Leu Gly Ser Ala Leu Ala Tyr Ala Leu Gly Val Gly Phe 65 70 75 80
GTG CCT GTG AGG AAA AAG GGC AAA CTC CCC GCA CAC ACC CTA TCT CAA 348 Val Pro Val Arg Lys Lys Gly Lys Leu Pro Ala His Thr Leu Ser Gin 85 90 95
AGC TAC AGC CTA GAA TAC GGG AGC GAC AGC ATA GAA ATC CAC TCC GAC 396 Ser Tyr Ser Leu Glu Tyr Gly Ser Asp Ser He Glu He His Ser Asp 100 105 110 GCT TTT AGG GGA ATT AAG GGG GTA AGG GTG GTG TTG ATT GAT GAT TTA 444 Ala Phe Arg Gly He Lys Gly Val Arg Val Val Leu He Asp Asp Leu 115 120 125
TTA GCC ACT GGA GGC ACA GCT TTA GCG AGC CTT GAG CTT ATC AAA GCC 492 Leu Ala Thr Gly Gly Thr Ala Leu Ala Ser Leu Glu Leu He Lys Ala 130 135 140
CTA CAA GCC GAA TGC ATA GAA GCA TGC TTT TTG ATA GGG TTA AAA GAA 540 Leu Gin Ala Glu Cys He Glu Ala Cys Phe Leu He Gly Leu Lys Glu 145 150 155 160
TTA CCG GGT ATC CAA CTT TTA GAA GAA CGC GTG AAA ACC TTT TGT TTG 588 Leu Pro Gly He Gin Leu Leu Glu Glu Arg Val Lys Thr Phe Cys Leu 165 170 175
TTA GAG TTA GAA TAAGGGTGA 609
Leu Glu Leu Glu 180
(2) INFORMATION FOR SEQ ID NO: 740
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 180 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740:
Met Asn Glu Thr Leu Lys Glu Glu Leu Leu Gin Ser He Arg Glu Val
1 5 10 15
Lys Asp Tyr Pro Lys Lys Gly He Leu Phe Lys Asp He Thr Thr Leu
20 25 30
Leu Asn Tyr Pro Lys Leu Phe Asn Lys Leu He Asp Thr Leu Lys Lys
35 40 45
Arg Tyr Leu Ala Leu Asn He Asp Phe He Val Gly He Glu Ala Arg
50 55 60
Gly Phe He Leu Gly Ser Ala Leu Ala Tyr Ala Leu Gly Val Gly Phe 65 70 75 80
Val Pro Val Arg Lys Lys Gly Lys Leu Pro Ala His Thr Leu Ser Gin
85 90 95
Ser Tyr Ser Leu Glu Tyr Gly Ser Asp Ser He Glu He His Ser Asp
100 105 110
Ala Phe Arg Gly He Lys Gly Val Arg Val Val Leu He Asp Asp Leu
115 120 125
Leu Ala Thr Gly Gly Thr Ala Leu Ala Ser Leu Glu Leu He Lys Ala
130 135 140
Leu Gin Ala Glu Cys He Glu Ala Cys Phe Leu He Gly Leu Lys Glu 145 150 155 160
Leu Pro Gly He Gin Leu Leu Glu Glu Arg Val Lys Thr Phe Cys Leu 165 170 175
Leu Glu Leu Glu 180
(2) INFORMATION FOR SEQ ID NO: 741:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 374 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 28...357 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741:
TTTCGCTAAA AAGGATATTT TAACAGA ATG TTT ACC CAA TGG TTT ATT CTC ACT 54
Met Phe Thr Gin Trp Phe He Leu Thr 1 5
ATC GCT ATT GTT TTT ATC CTT TAT ATG GGT GTG CGC ACT TTC TTT TTT 102 He Ala He Val Phe He Leu Tyr Met Gly Val Arg Thr Phe Phe Phe 10 15 20 25
AAA ACC GTG GCT AAA CGG CAA GAA CGC ACC AAC GCA TCC ATG AAG CTC 150 Lys Thr Val Ala Lys Arg Gin Glu Arg Thr Asn Ala Ser Met Lys Leu 30 35 40
ACC TTA CAA GAA GCT GAA ATT TTG ATC CAA AAA CAC CAG TTG CAA CTC 198 Thr Leu Gin Glu Ala Glu He Leu He Gin Lys His Gin Leu Gin Leu 45 50 55
CAA AGG GCT TTG GGC AAT ATT GAT ATT CTC ACC CAA GAA ATG AGC TCG 246 Gin Arg Ala Leu Gly Asn He Asp He Leu Thr Gin Glu Met Ser Ser 60 65 70
TTA AAA ACA GAA CTA AAA GCC CTT AAA CAG CGC AAC TCT GAA TAC AAA 294 Leu Lys Thr Glu Leu Lys Ala Leu Lys Gin Arg Asn Ser Glu Tyr Lys 75 80 85
GGC GAA TCG GAT AAA TAT AAA AAT CGT ATT AAA GAA TTG GAG CAA AAA 342 Gly Glu Ser Asp Lys Tyr Lys Asn Arg He Lys Glu Leu Glu Gin Lys 90 95 100 105
ATA GAA GCT CTC CTT TAAAAACGCT ATAATAA 374
He Glu Ala Leu Leu 110 (2) INFORMATION FOR SEQ ID NO : 742 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 110 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 742:
Met Phe Thr Gin Trp Phe He Leu Thr He Ala He Val Phe He Leu
1 5 10 15
Tyr Met Gly Val Arg Thr Phe Phe Phe Lys Thr Val Ala Lys Arg Gin
20 25 30
Glu Arg Thr Asn Ala Ser Met Lys Leu Thr Leu Gin Glu Ala Glu He
35 40 45
Leu He Gin Lys His Gin Leu Gin Leu Gin Arg Ala Leu Gly Asn He
50 55 60
Asp He Leu Thr Gin Glu Met Ser Ser Leu Lys Thr Glu Leu Lys Ala 65 70 75 80
Leu Lys Gin Arg Asn Ser Glu Tyr Lys Gly Glu Ser Asp Lys Tyr Lys
85 90 95
Asn Arg He Lys Glu Leu Glu Gin Lys He Glu Ala Leu Leu 100 105 110
(2) INFORMATION FOR SEQ ID NO: 743:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 778 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 78...728 (D) OTHER INFORMATTON:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 743:
AAAAAAGAAA ACGCAACGCA TTAAGGTTTT TTGTGCAATT TTTTGATTTC TCTTTAGAAA 60 GTTTTATTAC CACCTTA ATG AAA ATC CTA GCC CTT TTA ATC GCT ATC ATA 110
Met Lys He Leu Ala Leu Leu He Ala He He 1 5 10
GGG CAT GAG ATC ATG CAT GGC TTG AGC GCG TTT TTA TTT GGG GAT AGG 158 Gly His Glu He Met His Gly Leu Ser Ala Phe Leu Phe Gly Asp Arg 15 20 25 AGC ACT AAA GAC GCT AGG CGT TTG AGT TTA AAC CCT ATC AGG CAT TTA 206 Ser Thr Lys Asp Ala Arg Arg Leu Ser Leu Asn Pro He Arg His Leu 30 35 40
GAC ATG ATG GGT TCG GTG CTT TTA CCG GCT TTA TTA CTC ATT TTT CAA 254 Asp Met Met Gly Ser Val Leu Leu Pro Ala Leu Leu Leu He Phe Gin 45 50 55
GCC CCT TTT TTG TTT GGG TGG GCC AAA CCC GTG CCT GTT GAT ATG CGC 302 Ala Pro Phe Leu Phe Gly Trp Ala Lys Pro Val Pro Val Asp Met Arg 60 65 70 75
TAC ATT GTC TCT CAA AAA GGC TCT CTA GCA TGC GTA GTG GTG AGT TTA 350 Tyr He Val Ser Gin Lys Gly Ser Leu Ala Cys Val Val Val Ser Leu 80 85 90
GCC GGG GTG GCT TAT AAT TTC ACT CTG GCC GTT CTG CTC GCT TTC ATC 398 Ala Gly Val Ala Tyr Asn Phe Thr Leu Ala Val Leu Leu Ala Phe He 95 100 105
ACG CAT TGG AGC TTC CAA CAA CTA GGG ATC AAC GCT TTA AGC ATT GAT 446 Thr His Trp Ser Phe Gin Gin Leu Gly He Asn Ala Leu Ser He Asp 110 115 120
GAA TTG AAT CTT TAT CAG CTC GCT TTA GTA ACC TTT CTC ATT CAA GGC 494 Glu Leu Asn Leu Tyr Gin Leu Ala Leu Val Thr Phe Leu He Gin Gly 125 130 135
ATT CTT TAT AAT CTT GTC TTA GGC GTT TTC AAT AGC CTC CCT ATC CCG 542 He Leu Tyr Asn Leu Val Leu Gly Val Phe Asn Ser Leu Pro He Pro 140 145 150 155
CCC TTA GAC GGC TCC AAA GCG TTA GGC TTT TTA GCG TTG CAT TTT AAA 590 Pro Leu Asp Gly Ser Lys Ala Leu Gly Phe Leu Ala Leu His Phe Lys 160 165 170
AGT GCG TTT TTA TTG GAA TGG TTT TCT AAA ATG GAA CGC TAC GGC TTG 638 Ser Ala Phe Leu Leu Glu Trp Phe Ser Lys Met Glu Arg Tyr Gly Leu 175 180 185
TTG GTA GTG TTT ATT TTT TTG TTT ATC CCC CCT TTA TCG GAG TTT TTT 686 Leu Val Val Phe He Phe Leu Phe He Pro Pro Leu Ser Glu Phe Phe 190 195 200
ATC CAT GCG CCC ACA AGA TTT TTA TTT TCT TTA CTC CTC TCT TAATCTTTT 737 He His Ala Pro Thr Arg Phe Leu Phe Ser Leu Leu Leu Ser 205 210 215
ATCAAGGAGA GTTTATGAAT AAGCTCTTAA AGTTTTCTCA A 77 £
(2) INFORMATION FOR SEQ ID NO: 744:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 217 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 744:
Met Lys He Leu Ala Leu Leu He Ala He He Gly His Glu He Met
1 5 10 15
His Gly Leu Ser Ala Phe Leu Phe Gly Asp Arg Ser Thr Lys Asp Ala
20 25 30
Arg Arg Leu Ser Leu Asn Pro He Arg His Leu Asp Met Met Gly Ser
35 40 45
Val Leu Leu Pro Ala Leu Leu Leu He Phe Gin Ala Pro Phe Leu Phe
50 55 60
Gly Trp Ala Lys Pro Val Pro Val Asp Met Arg Tyr He Val Ser Gin 65 70 75 80
Lys Gly Ser Leu Ala Cys Val Val Val Ser Leu Ala Gly Val Ala Tyr
85 90 95
Asn Phe Thr Leu Ala Val Leu Leu Ala Phe He Thr His Trp Ser Phe
100 105 110
Gin Gin Leu Gly He Asn Ala Leu Ser He Asp Glu Leu Asn Leu Tyr
115 120 125
Gin Leu Ala Leu Val Thr Phe Leu He Gin Gly He Leu Tyr Asn Leu
130 135 140
Val Leu Gly Val Phe Asn Ser Leu Pro He Pro Pro Leu Asp Gly Ser 145 150 155 160
Lys Ala Leu Gly Phe Leu Ala Leu His Phe Lys Ser Ala Phe Leu Leu
165 170 175
Glu Trp Phe Ser Lys Met Glu Arg Tyr Gly Leu Leu Val Val Phe He
180 185 190
Phe Leu Phe He Pro Pro Leu Ser Glu Phe Phe He His Ala Pro Thr
195 200 205
Arg Phe Leu Phe Ser Leu Leu Leu Ser 210 215
(2) INFORMATION FOR SEQ ID NO: 745:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 373 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 70...336 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 745 CATCAGATAT TATCCAAGCG CCTTTTAAAA TCTTGCGCCG TATTTTCACA CCTATTGACA 60 TCATCGTGG ATG AAG TCA AAA AAA ACA TTG ATT CAA AAA GGA AGT AAA ATG 111 Met Lys Ser Lys Lys Thr Leu He Gin Lys Gly Ser Lys Met 1 5 10
ACG CTC AAT GAA GCC ATT AAA GAC AAA GTT TAT GAA ATC GTA GAA ATC 159 Thr Leu Asn Glu Ala He Lys Asp Lys Val Tyr Glu He Val Glu He 15 20 25 30
GCT AAC TGC GAT GAA GCC CTT AAA AAA CGC TTT CTC TCT TTT GGT ATC 207 Ala Asn Cys Asp Glu Ala Leu Lys Lys Arg Phe Leu Ser Phe Gly He 35 40 45
CAT GAA GGG GTT CAA TGC ATT CTT TTG CAT TAT TCC ATG AAA AAA GCC 255 His Glu Gly Val Gin Cys He Leu Leu His Tyr Ser Met Lys Lys Ala 50 55 60
ACG CTT TCG GTT AAA ATC AAC CGC ATT CAA GTG GCT TTA AGA TCC CAT 303 Thr Leu Ser Val Lys He Asn Arg He Gin Val Ala Leu Arg Ser His 65 70 75
GAA GCA CAA TAC CTT GTC ATC AAA GAA AGC GTG TGAAAATGGG TTTAAAACGC 356 Glu Ala Gin Tyr Leu Val He Lys Glu Ser Val 80 85
GCTAAACGCT ATAATAA 373
(2) INFORMATION FOR SEQ ID NO: 746:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 89 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 746:
Met Lys Ser Lys Lys Thr Leu He Gin Lys Gly Ser Lys Met Thr Leu
1 5 10 15
Asn Glu Ala He Lys Asp Lys Val Tyr Glu He Val Glu He Ala Asn
20 25 30
Cys Asp Glu Ala Leu Lys Lys Arg Phe Leu Ser Phe Gly He His Glu
35 40 45
Gly Val Gin Cys He Leu Leu His Tyr Ser Met Lys Lys Ala Thr Leu
50 55 60
Ser Val Lys He Asn Arg He Gin Val Ala Leu Arg Ser His Glu Ala 65 70 75 80
Gin Tyr Leu Val He Lys Glu Ser Val 85
(2) INFORMATION FOR SEQ ID NO: 747:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 450 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 46...375 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 747:
GAGCCAGTTA ATTCCATGGT TTCATAAGTG ATTTTTTGGG GCTGT ATG AGG AGC TGT 57
Met Arg Ser Cys
1
TTG TTT TTG AAA ACT AAT TCG GTT TTA TCC ATT TTA ATG GGC GAT AAG 105 Leu Phe Leu Lys Thr Asn Ser Val Leu Ser He Leu Met Gly Asp Lys 5 10 15 20
CCA TCA TTA AAA ACG ACT GAA GGC TTC ATC AAA GTG GCT TTA ATT ACA 153 Pro Ser Leu Lys Thr Thr Glu Gly Phe He Lys Val Ala Leu He Thr 25 30 35
GAA TTT TTT AAA AGC GAT GGG ACA AAC TCG CTA GGA GTG AAA TTG GCT 201 Glu Phe Phe Lys Ser Asp Gly Thr Asn Ser Leu Gly Val Lys Leu Ala 40 45 50
TTG ATT GAA GCG TTA TCA ATC TTA AAG CTA GCG AAT TGG ATC TTA TCA 249 Leu He Glu Ala Leu Ser He Leu Lys Leu Ala Asn Trp He Leu Ser 55 60 65
AAA ATC CAT GTT TTT AAA TTT TTT TGC GAT TGG CGT TGG AAA AGA GGC 297 Lys He His Val Phe Lys Phe Phe Cys Asp Trp Arg Trp Lys Arg Gly 70 75 80
TTT AAA AAC GCC AGG CTT TTC ATT ACA GAA GTG TTA ATT TTT AAT TCT 345 Phe Lys Asn Ala Arg Leu Phe He Thr Glu Val Leu He Phe Asn Ser 85 90 95 100
ATG GTT TTT AAA TCG GTT AGC CCT TGC AAA TAAATTGCAG CGCTGGGTTC GAT 398 Met Val Phe Lys Ser Val Ser Pro Cys Lys 105 110
TAAGGGCTTG ACAATCAAAT TAAACGCCAT TTTCCTAGCT TTGGGTGAAT AG 450
(2) INFORMATION FOR SEQ ID NO: 748:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 110 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 748:
Met Arg Ser Cys Leu Phe Leu Lys Thr Asn Ser Val Leu Ser He Leu
1 5 10 15
Met Gly Asp Lys Pro Ser Leu Lys Thr Thr Glu Gly Phe He Lys Val
20 25 30
Ala Leu He Thr Glu Phe Phe Lys Ser Asp Gly Thr Asn Ser Leu Gly
35 40 45
Val Lys Leu Ala Leu He Glu Ala Leu Ser He Leu Lys Leu Ala Asn
50 55 60
Trp He Leu Ser Lys He His Val Phe Lys Phe Phe Cys Asp Trp Arg 65 70 75 80
Trp Lys Arg Gly Phe Lys Asn Ala Arg Leu Phe He Thr Glu Val Leu
85 90 95
He Phe Asn Ser Met Val Phe Lys Ser Val Ser Pro Cys Lys 100 105 110
(2) INFORMATION FOR SEQ ID NO: 749:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 450 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 56...394 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 749:
GGCAAATTTT GATTGCTAGG GCTTTAAATG CAGCTTTTAG CACAAAGGAG AATGA ATG 58
Met
1
GCT AAA ATG AGC GCT CCA GAT GGG GTT GCC GTT TGG GTG AAT GAA GAC 106 Ala Lys Met Ser Ala Pro Asp Gly Val Ala Val Trp Val Asn Glu Asp 5 10 15
AGG TGT AAG GGT TGT GAT ATT TGC GTA TCG GTA TGC CCT GCT GGG GTT 154 Arg Cys Lys Gly Cys Asp He Cys Val Ser Val Cys Pro Ala Gly Val 20 25 30
CTT GGC ATG GGG ATT GAA AAA GAA AGG GTG CTT GGA AAA GTG GCC AAA 202 Leu Gly Met Gly He Glu Lys Glu Arg Val Leu Gly Lys Val Ala Lys 35 40 45
GTA GCC TAC CCA GAG AGC TGT ATC GGT TGC GTG CAA TGC GAG TTG CAC 250 Val Ala Tyr Pro Glu Ser Cys He Gly Cys Val Gin Cys Glu Leu His 50 55 60 65
TGC CCG GAT TTT GCG ATT TAT GTG GCT GAC AGG AAG GAT TTC AAA TTC 298 Cys Pro Asp Phe Ala He Tyr Val Ala Asp Arg Lys Asp Phe Lys Phe 70 75 80
GCT AAA GTT TCT AAA GAA GCC CAA GAA AGA AGC GAA AAG GTT AAG GCC 346 Ala Lys Val Ser Lys Glu Ala Gin Glu Arg Ser Glu Lys Val Lys Ala 85 90 95
AAT AAA TAC ATG CTC TTA GAA GAG ACT ATT TTA GAA GGG AGA GAC AAA T 395 Asn Lys Tyr Met Leu Leu Glu Glu Thr He Leu Glu Gly Arg Asp Lys 100 105 110
AATGCGTGAG ATTATTTCTG ATGGGAATGA ATTAGTCGCT AAAGCGGCGA TTGAA 450
(2) INFORMATION FOR SEQ ID NO: 750:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 113 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:750:
Met Ala Lys Met Ser Ala Pro Asp Gly Val Ala Val Trp Val Asn Glu
1 5 10 15
Asp Arg Cys Lys Gly Cys Asp He Cys Val Ser Val Cys Pro Ala Gly
20 25 30
Val Leu Gly Met Gly He Glu Lys Glu Arg Val Leu Gly Lys Val Ala
35 40 45
Lys Val Ala Tyr Pro Glu Ser Cys He Gly Cys Val Gin Cys Glu Leu
50 55 60
His Cys Pro Asp Phe Ala He Tyr Val Ala Asp Arg Lys Asp Phe Lys 65 70 75 80
Phe Ala Lys Val Ser Lys Glu Ala Gin Glu Arg Ser Glu Lys Val Lys
85 90 95
Ala Asn Lys Tyr Met Leu Leu Glu Glu Thr He Leu Glu Gly Arg Asp
100 105 110
Lys
(2) INFORMATION FOR SEQ ID NO: 751:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1350 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 127...1251 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 751:
TGTGGCTGAC AGGAAGGATT TCAAATTCGC TAAAGTTTCT AAAGAAGCCC AAGAAAGAAG 60
CGAAAAGGTT AAGGCCAATA AATACATGCT CTTAGAAGAG ACTATTTTAG AAGGGAGAGA 120
CAAATA ATG CGT GAG ATT ATT TCT GAT GGG AAT GAA TTA GTC GCT AAA 168 Met Arg Glu He He Ser Asp Gly Asn Glu Leu Val Ala Lys 1 5 10
GCG GCG ATT GAA GTG GGG TGT CGG TTT TTT GGG GGC TAT CCT ATC ACG 216 Ala Ala He Glu Val Gly Cys Arg Phe Phe Gly Gly Tyr Pro He Thr 15 20 25 30
CCA AGT TCG GAT ATT ATG CAT GCG ATG AGC GTG GCT TTA CCC AAA TGC 264 Pro Ser Ser Asp He Met His Ala Met Ser Val Ala Leu Pro Lys Cys 35 40 45
GGC GGT CAT TTT ATC CAA ATG GAA GAT GAA ATC AGC GGG ATT AGC GTG 312 Gly Gly His Phe He Gin Met Glu Asp Glu He Ser Gly He Ser Val 50 55 60
TCT TTA GGA GCG AGC ATG AGC GGG ACG AAG TCT ATG ACA GCA AGC TCT 360 Ser Leu Gly Ala Ser Met Ser Gly Thr Lys Ser Met Thr Ala Ser Ser 65 70 75
GGG CCT GGT ATT TCA TTG AAA GTG GAG CAA ATC GGT TAT TCT TTC ATG 408 Gly Pro Gly He Ser Leu Lys Val Glu Gin He Gly Tyr Ser Phe Met 80 85 90
GCG GAA ATC CCT TTA GTG ATC GCT GAT GTG ATG CGT TCA GGC CCA TCA 456 Ala Glu He Pro Leu Val He Ala Asp Val Met Arg Ser Gly Pro Ser 95 100 105 110
ACC GGA ATG CCC ACT CGT GTG GCT CAA GGC GAT GTG AAT TTC TTA AGA 504 Thr Gly Met Pro Thr Arg Val Ala Gin Gly Asp Val Asn Phe Leu Arg 115 120 125
CAC CCC ATA CAT GGG GAT TTT AAA GCC GTC GCG CTC GCT CCT GCG AAT 552 His Pro He His Gly Asp Phe Lys Ala Val Ala Leu Ala Pro Ala Asn 130 135 140
TTA GAA GAA GCT TAC ACC GAA ACC GTT CGC GCG TTC AAT TTG GCT GAA 600 Leu Glu Glu Ala Tyr Thr Glu Thr Val Arg Ala Phe Asn Leu Ala Glu 145 150 155 ATG CTC ATG ACT CCT GTA TTC TTG CTC ATG GAT GAA ACC GTG GGG CAT 648 Met Leu Met Thr Pro Val Phe Leu Leu Met Asp Glu Thr Val Gly His 160 165 170
ATG TAT GGC AAG GTG CAA ATC CCA GAT TTA GAA GAA GTG CAA AAG ATG 696 Met Tyr Gly Lys Val Gin He Pro Asp Leu Glu Glu Val Gin Lys Met 175 180 185 190
ACT ATT AAT CGT AAG GAA TTT CTG GGC GAT AAA AAA GAC TAC AAG CCT 744 Thr He Asn Arg Lys Glu Phe Leu Gly Asp Lys Lys Asp Tyr Lys Pro 195 200 205
TAT GGG GTC GCA CAA GAC GAG CCG GCT GTT TTG AAC CCT TTC TTT AAA 792 Tyr Gly Val Ala Gin Asp Glu Pro Ala Val Leu Asn Pro Phe Phe Lys 210 215 220
GGT TAT CGC TAC CAT GTT TCA GGC TTG CAC CAT GGG CCT ATT GGC TTT 840 Gly Tyr Arg Tyr His Val Ser Gly Leu His His Gly Pro He Gly Phe 225 230 235
CCT ACT GAA GAC GCT AAA ATT GGT GGG GAT TTG ATT GAC AGA TTA TTT 888 Pro Thr Glu Asp Ala Lys He Gly Gly Asp Leu He Asp Arg Leu Phe 240 245 250
AAT AAG ATT GAA TCC AAG CAA GAC ATT ATC AAC GAA AAT GAG GAA ATG 936 Asn Lys He Glu Ser Lys Gin Asp He He Asn Glu Asn Glu Glu Met 255 260 265 270
GAT TTA GAG GGT GCT GAA ATC GTT GTT ATC GCT TAC GGT TCG GTT TCT 984 Asp Leu Glu Gly Ala Glu He Val Val He Ala Tyr Gly Ser Val Ser 275 280 285
TTG GCG GTT AAA GAG GCC TTG AAA GAT TAC CAT AAA GAA AGC AAG CAA 1032 Leu Ala Val Lys Glu Ala Leu Lys Asp Tyr His Lys Glu Ser Lys Gin 290 295 300
AAA GTC GGC TTT TTC AGG CCT AAA ACC TTA TGG CCA AGC CCG GCT AAA 1080 Lys Val Gly Phe Phe Arg Pro Lys Thr Leu Trp Pro Ser Pro Ala Lys 305 310 315
CGC TTG AAA GAA ATA GGG GAT AAA TAC GAA AAA ATC CTT GTG ATT GAA 1128 Arg Leu Lys Glu He Gly Asp Lys Tyr Glu Lys He Leu Val He Glu 320 325 330
TTG AAT AAA GGG CAG TAT TTA GAA GAA ATT GAA AGG GCT ATG CAA AGA 1176 Leu Asn Lys Gly Gin Tyr Leu Glu Glu He Glu Arg Ala Met Gin Arg 335 340 345 350
AAG GTG CAT TTC TTG GGG CAA GCC AAT GGG CGC ACG ATT TCG CCT AAA 1224 Lys Val His Phe Leu Gly Gin Ala Asn Gly Arg Thr He Ser Pro Lys 355 360 365
CAA ATC ATC GCA AAA TTG AAG GAG CTT TAAAATGGCG TTTAATTATG ATGAATA 1278 Gin He He Ala Lys Leu Lys Glu Leu 370 375 TTTGCGTGTG GATAAAATAC CCACTTTGTG GTGTTGGGGC TGTGGCGATG GCGTGATTTT 1338 GAAATCCATT AT 1350
(2) INFORMATION FOR SEQ ID NO: 752:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 375 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 752:
Met Arg Glu He He Ser Asp Gly Asn Glu Leu Val Ala Lys Ala Ala
1 5 10 15
He Glu Val Gly Cys Arg Phe Phe Gly Gly Tyr Pro He Thr Pro Ser
20 25 30
Ser Asp He Met His Ala Met Ser Val Ala Leu Pro Lys Cys Gly Gly
35 40 45
His Phe He Gin Met Glu Asp Glu He Ser Gly He Ser Val Ser Leu
50 55 60
Gly Ala Ser Met Ser Gly Thr Lys Ser Met Thr Ala Ser Ser Gly Pro 65 70 75 80
Gly He Ser Leu Lys Val Glu Gin He Gly Tyr Ser Phe Met Ala Glu
85 90 95
He Pro Leu Val He Ala Asp Val Met Arg Ser Gly Pro Ser Thr Gly
100 105 110
Met Pro Thr Arg Val Ala Gin Gly Asp Val Asn Phe Leu Arg His Pro
115 120 125
He His Gly Asp Phe Lys Ala Val Ala Leu Ala Pro Ala Asn Leu Glu
130 135 140
Glu Ala Tyr Thr Glu Thr Val Arg Ala Phe Asn Leu Ala Glu Met Leu 145 150 155 160
Met Thr Pro Val Phe Leu Leu Met Asp Glu Thr Val Gly His Met Tyr
165 170 175
Gly Lys Val Gin He Pro Asp Leu Glu Glu Val Gin Lys Met Thr He
180 185 190
Asn Arg Lys Glu Phe Leu Gly Asp Lys Lys Asp Tyr Lys Pro Tyr Gly
195 200 205
Val Ala Gin Asp Glu Pro Ala Val Leu Asn Pro Phe Phe Lys Gly Tyr
210 215 220
Arg Tyr His Val Ser Gly Leu His His Gly Pro He Gly Phe Pro Thr 225 230 235 240
Glu Asp Ala Lys He Gly Gly Asp Leu He Asp Arg Leu Phe Asn Lys
245 250 255
He Glu Ser Lys Gin Asp He He Asn Glu Asn Glu Glu Met Asp Leu
260 265 270
Glu Gly Ala Glu He Val Val He Ala Tyr Gly Ser Val Ser Leu Ala
275 280 285
Val Lys Glu Ala Leu Lys Asp Tyr His Lys Glu Ser Lys Gin Lys Val
290 295 300
Gly Phe Phe Arg Pro Lys Thr Leu Trp Pro Ser Pro Ala Lys Arg Leu 305 310 315 320 Lys Glu He Gly Asp Lys Tyr Glu Lys He Leu Val He Glu Leu Asn
325 330 335
Lys Gly Gin Tyr Leu Glu Glu He Glu Arg Ala Met Gin Arg Lys Val
340 345 350
His Phe Leu Gly Gin Ala Asn Gly Arg Thr He Ser Pro Lys Gin He
355 360 365
He Ala Lys Leu Lys Glu Leu 370 375
(2) INFORMATION FOR SEQ ID NO: 753:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 192 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 54...164 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 753:
TATGATAAAA GCTCTCATAT AACCCGCTAC TAGCCATAAG CAACAGCAAG GCA ATG 56
Met 1
TAT TTA GGC TTA AAC CCA AAA CGC ACC ACC AAA AGC GCC ACA GCC CCT 104 Tyr Leu Gly Leu Asn Pro Lys Arg Thr Thr Lys Ser Ala Thr Ala Pro 5 10 15
ATT AAA ATC ATG TTG ATG CGT TGC GCC CAG CAA AAA ATA CAA GGC GAA 152 He Lys He Met Leu Met Arg Cys Ala Gin Gin Lys He Gin Gly Glu 20 25 30
TCT TTC AAA ACA TAGCCAAAAT AACCTTAAAA AACGCTTT 192
Ser Phe Lys Thr 35
(2) INFORMATION FOR SEQ ID NO: 754:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 37 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 754:
Met Tyr Leu Gly Leu Asn Pro Lys Arg Thr Thr Lys Ser Ala Thr Ala
1 5 10 15
Pro He Lys He Met Leu Met Arg Cys Ala Gin Gin Lys He Gin Gly
20 25 30
Glu Ser Phe Lys Thr 35
(2) INFORMATION FOR SEQ ID NO: 755:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1080 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 30...1049 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 755:
ACCATAATTA GACAAACCTT TAAGGATTT ATG ATG ATT TTC ATT GAT GCA TGT 53
Met Met He Phe He Asp Ala Cys 1 5
TTT AGA AAG GAA ACG CCT TAC ACG CCC ATT TGG ATG ATG AGG CAA GCG 101 Phe Arg Lys Glu Thr Pro Tyr Thr Pro He Trp Met Met Arg Gin Ala 10 15 20
GGG CGT TAC CTT AGC GAA TAC CAA GAG AGC CGT AAA AAA GCG GGG AGT 149 Gly Arg Tyr Leu Ser Glu Tyr Gin Glu Ser Arg Lys Lys Ala Gly Ser 25 30 35 40
TTC TTG GAA TTG TGT AAA AAT AGC GAT CTA GCC ACA GAA GTT ACC TTA 197 Phe Leu Glu Leu Cys Lys Asn Ser Asp Leu Ala Thr Glu Val Thr Leu 45 50 55
CAG CCG GTA GAG ATT TTA GGC GTG GAT GCG GCT ATT TTG TTT AGC GAT 245 Gin Pro Val Glu He Leu Gly Val Asp Ala Ala He Leu Phe Ser Asp 60 65 70
ATT TTA GTA GTG CCT TTG GAA ATG GGC TTG AAT TTG GAG TTT ATC CCC 293 He Leu Val Val Pro Leu Glu Met Gly Leu Asn Leu Glu Phe He Pro 75 80 85
AAA AAG GGG CCG CAT TTT TTA GAG ACG ATT ACG GAT TTA AAA AGC GTG 341 Lys Lys Gly Pro His Phe Leu Glu Thr He Thr Asp Leu Lys Ser Val 90 95 100 GAA AGC CTA AAA GTA GGG GCT TAT AAA CAA CTA AAC TAT GTC TAT GAT 389 Glu Ser Leu Lys Val Gly Ala Tyr Lys Gin Leu Asn Tyr Val Tyr Asp 105 110 115 120
ACG ATT TCT CAA ACG CGC CAA AAG CTT TCT AGA GAG AAA GCG TTA ATC 437 Thr He Ser Gin Thr Arg Gin Lys Leu Ser Arg Glu Lys Ala Leu He 125 130 135
GGT TTT TGC GGA TCG CCT TGG ACT TTA GCG ACT TAC ATG ATA GAA GGC 485 Gly Phe Cys Gly Ser Pro Trp Thr Leu Ala Thr Tyr Met He Glu Gly 140 145 150
GAG GGG AGC AAA TCG TAT GCC AAA AGC AAG AAA ATG CTT TAT AGC GAG 533 Glu Gly Ser Lys Ser Tyr Ala Lys Ser Lys Lys Met Leu Tyr Ser Glu 155 160 165
CCT GAA GTT TTA AAA GCG CTT TTA GAA AAA TTA AGC CTT GAA TTG ATA 581 Pro Glu Val Leu Lys Ala Leu Leu Glu Lys Leu Ser Leu Glu Leu He 170 175 180
GAG TAT TTG AGC CTT CAA ATC CAA GCA GGG GTC AAT GCA GTG ATG ATC 629 Glu Tyr Leu Ser Leu Gin He Gin Ala Gly Val Asn Ala Val Met He 185 190 195 200
TTT GAC TCA TGG GCT AGC GCT TTA GAA AAA GAA GCG TAT TTG AAA TTC 677 Phe Asp Ser Trp Ala Ser Ala Leu Glu Lys Glu Ala Tyr Leu Lys Phe 205 210 215
AGT TGG GAT TAT TTG AAA AAA ATC TCT AAA GAG CTT AAA AAA CGC TAT 725 Ser Trp Asp Tyr Leu Lys Lys He Ser Lys Glu Leu Lys Lys Arg Tyr 220 225 230
GCG CAT ATC CCA GTT ATC CTT TTC CCT AAA GGG ATT GGC GCT TAT TTG 773 Ala His He Pro Val He Leu Phe Pro Lys Gly He Gly Ala Tyr Leu 235 240 245
GAT AGC ATA GAT GGG GAA TTT GAT GTG TTT GGC GTG GAT TGG GGC ACG 821 Asp Ser He Asp Gly Glu Phe Asp Val Phe Gly Val Asp Trp Gly Thr 250 255 260
CCT TTA ACT GCG GCA AAA AAG ATT TTA GGC GGT AAG TAT GTT TTG CAA 869 Pro Leu Thr Ala Ala Lys Lys He Leu Gly Gly Lys Tyr Val Leu Gin 265 270 275 280
GGG AAT TTA GAA CCC ACC CGC CTT TAT GAT AAA AAC GCT TTA GAA GAA 917 Gly Asn Leu Glu Pro Thr Arg Leu Tyr Asp Lys Asn Ala Leu Glu Glu 285 290 295
GGG GTT GAA ACG ATT CTA AAA GTC ATG GGC AAT CAA GGG CAT ATT TTT 965 Gly Val Glu Thr He Leu Lys Val Met Gly Asn Gin Gly His He Phe 300 305 310
AAT TTA GGG CAT GGG ATG TTG CCG GAT TTA CCC AGA GAA AAC GCC AAA 1013 Asn Leu Gly His Gly Met Leu Pro Asp Leu Pro Arg Glu Asn Ala Lys 315 320 325 TAT TTA GTG CAA TTA GTG CAT GCT AAA ACC AGA CGA TAGGGGGATT GATGAA 1065 Tyr Leu Val Gin Leu Val His Ala Lys Thr Arg Arg 330 335 340
TACTATCATA AGATA 1080
(2) INFORMATION FOR SEQ ID NO: 756:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 756:
Met Met He Phe He Asp Ala Cys Phe Arg Lys Glu Thr Pro Tyr Thr
1 5 10 15
Pro He Trp Met Met Arg Gin Ala Gly Arg Tyr Leu Ser Glu Tyr Gin
20 25 30
Glu Ser Arg Lys Lys Ala Gly Ser Phe Leu Glu Leu Cys Lys Asn Ser
35 40 45
Asp Leu Ala Thr Glu Val Thr Leu Gin Pro Val Glu He Leu Gly Val
50 55 60
Asp Ala Ala He Leu Phe Ser Asp He Leu Val Val Pro Leu Glu Met 65 70 75 80
Gly Leu Asn Leu Glu Phe He Pro Lys Lys Gly Pro His Phe Leu Glu
85 90 95
Thr He Thr Asp Leu Lys Ser Val Glu Ser Leu Lys Val Gly Ala Tyr
100 105 110
Lys Gin Leu Asn Tyr Val Tyr Asp Thr He Ser Gin Thr Arg Gin Lys
115 120 125
Leu Ser Arg Glu Lys Ala Leu He Gly Phe Cys Gly Ser Pro Trp Thr
130 135 140
Leu Ala Thr Tyr Met He Glu Gly Glu Gly Ser Lys Ser Tyr Ala Lys 145 150 155 160
Ser Lys Lys Met Leu Tyr Ser Glu Pro Glu Val Leu Lys Ala Leu Leu
165 170 175
Glu Lys Leu Ser Leu Glu Leu He Glu Tyr Leu Ser Leu Gin He Gin
180 185 190
Ala Gly Val Asn Ala Val Met He Phe Asp Ser Trp Ala Ser Ala Leu
195 200 205
Glu Lys Glu Ala Tyr Leu Lys Phe Ser Trp Asp Tyr Leu Lys Lys He
210 215 220
Ser Lys Glu Leu Lys Lys Arg Tyr Ala His He Pro Val He Leu Phe 225 230 235 240
Pro Lys Gly He Gly Ala Tyr Leu Asp Ser He Asp Gly Glu Phe Asp
245 250 255
Val Phe Gly Val Asp Trp Gly Thr Pro Leu Thr Ala Ala Lys Lys He
260 265 270
Leu Gly Gly Lys Tyr Val Leu Gin Gly Asn Leu Glu Pro Thr Arg Leu
275 280 285
Tyr Asp Lys Asn Ala Leu Glu Glu Gly Val Glu Thr He Leu Lys Val 290 295 300
Met Gly Asn Gin Gly His He Phe Asn Leu Gly His Gly Met Leu Pro 305 310 315 320
Asp Leu Pro Arg Glu Asn Ala Lys Tyr Leu Val Gin Leu Val His Ala
325 330 335
Lys Thr Arg Arg 340
(2) INFORMATION FOR SEQ ID NO: 757:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 766 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...732 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 757:
TAGACGACTA TGTGCATTAA GGGAATGAAA ATG ATA CGA AAA ATT TTA ATA GGA 54
Met He Arg Lys He Leu He Gly 1 5
CTT TTT TTG AGT TTT TTG AGC ATG GAA GCT GGC GAA AAA GTG TAT GCG 102 Leu Phe Leu Ser Phe Leu Ser Met Glu Ala Gly Glu Lys Val Tyr Ala 10 15 20
ATT TTC AAT GTG AAA GCG ACA CAA GAT TCC AAA CTC ACC TTA GAC AGC 150 He Phe Asn Val Lys Ala Thr Gin Asp Ser Lys Leu Thr Leu Asp Ser 25 30 35 40
ACA GGA ATT GTG GAT AGC ATT AAG GTT ACT GAG GGG AGC GTG GTC AAA 198 Thr Gly He Val Asp Ser He Lys Val Thr Glu Gly Ser Val Val Lys 45 50 55
AAG GGC GAT GTT TTG TTG CTT TTA TAT AAT CAA GAC AAA CAG GCT CAA 246 Lys Gly Asp Val Leu Leu Leu Leu Tyr Asn Gin Asp Lys Gin Ala Gin 60 65 70
AGC GAT TCC ACC GAA CAA CAA CTC ATT TTC GCT AAA AAG CAA TAC CAA 294 Ser Asp Ser Thr Glu Gin Gin Leu He Phe Ala Lys Lys Gin Tyr Gin 75 80 85
CGA TAC AGC AAA ATT GGG GGC GCT GTG GAT AAA AAC ACT CTA GAG GGT 342 Arg Tyr Ser Lys He Gly Gly Ala Val Asp Lys Asn Thr Leu Glu Gly 90 95 100 TAT GAG TTC ACT TAC AGG CGC TTG GAG TCT GAT TAC GCT TAT TCT ATT 390 Tyr Glu Phe Thr Tyr Arg Arg Leu Glu Ser Asp Tyr Ala Tyr Ser He 105 110 115 120
GCG GTA TTG AAT AAA ACC ATT TTA AGA GCC CCT TTT GAT GGC GTG ATA 438 Ala Val Leu Asn Lys Thr He Leu Arg Ala Pro Phe Asp Gly Val He 125 130 135
GCG AGT AAA AAC ATT CAA GTG GGC GAA GGG GTG AGC GCG AAT AAC ACG 486 Ala Ser Lys Asn He Gin Val Gly Glu Gly Val Ser Ala Asn Asn Thr 140 145 150
GTG TTA TTG AGA TTA GTC AGC CAT GCT AGG AAA TTA GTT ATT GAA TTT 534 Val Leu Leu Arg Leu Val Ser His Ala Arg Lys Leu Val He Glu Phe 155 160 165
GAT TCT AAA TAT ATT AAT GCG GTC AAA GTA GGG GAC ACT TAC ACC TAT 582 Asp Ser Lys Tyr He Asn Ala Val Lys Val Gly Asp Thr Tyr Thr Tyr 170 175 180
TCT ATA GAC GGG GAT TCT AAT CAG CAT GAA GCT AAA ATC ACT AAG ATT 630 Ser He Asp Gly Asp Ser Asn Gin His Glu Ala Lys He Thr Lys He 185 190 195 200
TAC CCC ACG GTT GAT GAA AAC ACC AGG AAA GTG AGC GCT GAA GCC CTT 678 Tyr Pro Thr Val Asp Glu Asn Thr Arg Lys Val Ser Ala Glu Ala Leu 205 210 215
TTA TCT AAG CCT ATG GCA GTG GGG CTT TTT GGC GAT GGG TTT ATC CAA 726 Leu Ser Lys Pro Met Ala Val Gly Leu Phe Gly Asp Gly Phe He Gin 220 225 230
ACG AAA TAATAGGATA TTTTGATGTA TAAAACAGCG ATTA 766
Thr Lys
(2) INFORMATION FOR SEQ ID NO: 758:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 234 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:758:
Met He Arg Lys He Leu He Gly Leu Phe Leu Ser Phe Leu Ser Met
1 5 10 15
Glu Ala Gly Glu Lys Val Tyr Ala He Phe Asn Val Lys Ala Thr Gin
20 25 30
Asp Ser Lys Leu Thr Leu Asp Ser Thr Gly He Val Asp Ser He Lys 35 40 45 Val Thr Glu Gly Ser Val Val Lys Lys Gly Asp Val Leu Leu Leu Leu
50 55 60
Tyr Asn Gin Asp Lys Gin Ala Gin Ser Asp Ser Thr Glu Gin Gin Leu 65 70 75 80
He Phe Ala Lys Lys Gin Tyr Gin Arg Tyr Ser Lys He Gly Gly Ala
85 90 95
Val Asp Lys Asn Thr Leu Glu Gly Tyr Glu Phe Thr Tyr Arg Arg Leu
100 105 110
Glu Ser Asp Tyr Ala Tyr Ser He Ala Val Leu Asn Lys Thr He Leu
115 120 125
Arg Ala Pro Phe Asp Gly Val He Ala Ser Lys Asn He Gin Val Gly
130 135 140
Glu Gly Val Ser Ala Asn Asn Thr Val Leu Leu Arg Leu Val Ser His 145 150 155 160
Ala Arg Lys Leu Val He Glu Phe Asp Ser Lys Tyr He Asn Ala Val
165 170 175
Lys Val Gly Asp Thr Tyr Thr Tyr Ser He Asp Gly Asp Ser Asn Gin
180 185 190
His Glu Ala Lys He Thr Lys He Tyr Pro Thr Val Asp Glu Asn Thr
195 200 205
Arg Lys Val Ser Ala Glu Ala Leu Leu Ser Lys Pro Met Ala Val Gly
210 215 220
Leu Phe Gly Asp Gly Phe He Gin Thr Lys 225 230
(2) INFORMATION FOR SEQ ID NO: 759:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 630 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 62...544 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:759:
ACGCCAAAGA GAGCAACGGG GAGTTTTTAG TCGCTTTAGC GNAGCGNTTG TGCTGATTTA 60
T ATG ATT TTA GCG GCG TTG TAT GAG TCC ATT TTA GAG CCT TTT ATC ATC 109
Met He Leu Ala Ala Leu Tyr Glu Ser He Leu Glu Pro Phe He He 1 5 10 15
ATG GTT ACC ATG CCT TTA AGT TTT TCA GGG GCG TTT TTT GCT CTA GGT 157 Met Val Thr Met Pro Leu Ser Phe Ser Gly Ala Phe Phe Ala Leu Gly 20 25 30
TTA GTC CAT CAG CCT TTG AGC ATG TTC TCT ATG ATA GGC TTG ATT TTG 205 Leu Val His Gin Pro Leu Ser Met Phe Ser Met He Gly Leu He Leu 35 40 45
CTC ATT GGT ATG GTG GGT AAA AAC GCC ACG CTT TTA ATT GAT GTG GCG 253 Leu He Gly Met Val Gly Lys Asn Ala Thr Leu Leu He Asp Val Ala 50 55 60
AAT GAA GAG CGT AAA AAA GGT TTG AAT ATC CAA GAG GCC ATT TTA TTT 301 Asn Glu Glu Arg Lys Lys Gly Leu Asn He Gin Glu Ala He Leu Phe 65 70 75 80
GCC GGC AAA ACC CGT CTA AGA CCG ATT TTA ATG ACG ACC ATT GCG ATG 349 Ala Gly Lys Thr Arg Leu Arg Pro He Leu Met Thr Thr He Ala Met 85 90 95
GTT TGC GGG ATG CTG CCT TTA GCG TTG GCG AGT GGG GAT GGA GCG GCG 397 Val Cys Gly Met Leu Pro Leu Ala Leu Ala Ser Gly Asp Gly Ala Ala 100 105 110
ATG AAA TCC CCT ATA GGG ATT GCG ATG AGT GGG GGC TTG ATG ATT TCT 445 Met Lys Ser Pro He Gly He Ala Met Ser Gly Gly Leu Met He Ser 115 120 125
ATG GTG TTA AGC TTA CTC ATT GTG CCG GTG TTT TAT CGT TTG CTC GCT 493 Met Val Leu Ser Leu Leu He Val Pro Val Phe Tyr Arg Leu Leu Ala 130 135 140
CCC ATA GAC GAC AAA ATC AAG CGG TTT TAT CAA AAC CAA AAA ACT TTA 541 Pro He Asp Asp Lys He Lys Arg Phe Tyr Gin Asn Gin Lys Thr Leu 145 150 155 160
GAA TGAAAAAAAT TGCTTTCATT TTGGCTTTAT GGGTGGGCTT GTTAGGGGCG TTTGAG 600 Glu
CCTAAAAAAA GTCATATTTA TTTTGGGGCT 630
(2) INFORMATION FOR SEQ ID NO: 760:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 161 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 760:
Met He Leu Ala Ala Leu Tyr Glu Ser He Leu Glu Pro Phe He He
1 5 10 15
Met Val Thr Met Pro Leu Ser Phe Ser Gly Ala Phe Phe Ala Leu Gly
20 25 30
Leu Val His Gin Pro Leu Ser Met Phe Ser Met He Gly Leu He Leu
35 40 45
Leu He Gly Met Val Gly Lys Asn Ala Thr Leu Leu He Asp Val Ala 50 55 60
Asn Glu Glu Arg Lys Lys Gly Leu Asn He Gin Glu Ala He Leu Phe 65 70 75 80
Ala Gly Lys Thr Arg Leu Arg Pro He Leu Met Thr Thr He Ala Met
85 90 95
Val Cys Gly Met Leu Pro Leu Ala Leu Ala Ser Gly Asp Gly Ala Ala
100 105 110
Met Lys Ser Pro He Gly He Ala Met Ser Gly Gly Leu Met He Ser
115 120 125
Met Val Leu Ser Leu Leu He Val Pro Val Phe Tyr Arg Leu Leu Ala
130 135 140
Pro He Asp Asp Lys He Lys Arg Phe Tyr Gin Asn Gin Lys Thr Leu 145 150 155 160
Glu
(2) INFORMATION FOR SEQ ID NO: 761:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1007 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...945 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 761:
TAAAAGGTTT TTACAAAC ATG ATA AAA AGC CAA AAA GAA TAT TTA GAA AGA 51
Met He Lys Ser Gin Lys Glu Tyr Leu Glu Arg 1 5 10
ATT GCA TAT TTA AAC ACC CTA TCG CAC CAT TAT TAC AAC CTT GAT GAA 99 He Ala Tyr Leu Asn Thr Leu Ser His His Tyr Tyr Asn Leu Asp Glu 15 20 25
CCC ATC GTA AGC GAT GCG ATC TAT GAT GAA CTT TAC CAA GAA TTG AAA 147 Pro He Val Ser Asp Ala He Tyr Asp Glu Leu Tyr Gin Glu Leu Lys 30 35 40
GCT TAT GAA GAA AAA AAC CCT AAT GGC ATT CAA GCT AAT TCC CCT ACC 195 Ala Tyr Glu Glu Lys Asn Pro Asn Gly He Gin Ala Asn Ser Pro Thr 45 50 55
CAA AAA GTG GGG GCT ACT ACC ACC AAT TCG TTC AAT AAA AAC CCC CAT 243 Gin Lys Val Gly Ala Thr Thr Thr Asn Ser Phe Asn Lys Asn Pro His 60 65 70 75 TTA ATG CGG ATG TGG AGC TTA GAT GAT GTG TTC AAT CAA AGC GAA TTG 291 Leu Met Arg Met Trp Ser Leu Asp Asp Val Phe Asn Gin Ser Glu Leu 80 85 90
CAA GCG TGG TTG CAA CGC ATT TTA AAA GCC TAT CCT AGT GCT TCG TTC 339 Gin Ala Trp Leu Gin Arg He Leu Lys Ala Tyr Pro Ser Ala Ser Phe 95 100 105
GTG TGT TCG CCC AAA CTT GAT GGG GTT TCG CTC AAT CTT TTG TAT CAA 387 Val Cys Ser Pro Lys Leu Asp Gly Val Ser Leu Asn Leu Leu Tyr Gin 110 115 120
CAT GGC AAG CTA GTG AAG GCG ACC ACT AGG GGC AAC GGC TTA GAA GGA 435 His Gly Lys Leu Val Lys Ala Thr Thr Arg Gly Asn Gly Leu Glu Gly 125 130 135
GAA TTA GTT AGC GCA AAC GCT AAA CAC ATC GCT AAT ATC CCC CAC GCT 483 Glu Leu Val Ser Ala Asn Ala Lys His He Ala Asn He Pro His Ala 140 145 150 155
ATC GCT TAT AAT GGA GAA ATA GAA ATC AGG GGC GAA GTG ATC ATT TCT 531 He Ala Tyr Asn Gly Glu He Glu He Arg Gly Glu Val He He Ser 160 165 170
AAA AAG GAT TTT GAC GCT TTG AAT CAA GAG CGC TTA AAC GCT AAT GAA 579 Lys Lys Asp Phe Asp Ala Leu Asn Gin Glu Arg Leu Asn Ala Asn Glu 175 180 185
CCC CTA TTC GCT AAC CCC AGA AAC GCC GCA TCA GGG AGT TTG AGG CAA 627 Pro Leu Phe Ala Asn Pro Arg Asn Ala Ala Ser Gly Ser Leu Arg Gin 190 195 200
CTT GAT AGC GAA ATC ACT AAA AAG CGT AAA TTG CAA TTC ATT CCT TGG 675 Leu Asp Ser Glu He Thr Lys Lys Arg Lys Leu Gin Phe He Pro Trp 205 210 215
GGC GTG GGC AAG CAT TCT TTA AAT TTT TTA AGC TTT AAG GAG TGT TTG 723 Gly Val Gly Lys His Ser Leu Asn Phe Leu Ser Phe Lys Glu Cys Leu 220 225 230 235
GAT TTT ATC GTC TCG TTA GGT TTT AGC GCC ATT CAA TAC TTA AGC CTA 771 Asp Phe He Val Ser Leu Gly Phe Ser Ala He Gin Tyr Leu Ser Leu 240 245 250
AAC AAA AAC CAC CAA GAA ATA GAA GAC AAT TAC CAC ACC CTA ATT AGA 819 Asn Lys Asn His Gin Glu He Glu Asp Asn Tyr His Thr Leu He Arg 255 260 265
GAA AGG GAG GGC TTT TTT GCC CTT TTA GAC GGC ATG GTG ATC GTT GTG 867 Glu Arg Glu Gly Phe Phe Ala Leu Leu Asp Gly Met Val He Val Val 270 275 280
AAT GAA TTA AAT ATT CAA AAG GAG CTA GGC TAC ACG CAA AAA TCC CCT 915 Asn Glu Leu Asn He Gin Lys Glu Leu Gly Tyr Thr Gin Lys Ser Pro 285 290 295 AAA TNG CTT GCG CTT ATA AAT TCC CGG CTT TAGAAAAACA CACCAAAATT GTA 968 Lys Xaa Leu Ala Leu He Asn Ser Arg Leu 300 305
GGAGTCATTA ACCAAGTGGG GCGCASSGGG CGATCACAC 1007
(2) INFORMATION FOR SEQ ID NO:762:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 309 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 762:
Met He Lys Ser Gin Lys Glu Tyr Leu Glu Arg He Ala Tyr Leu Asn
1 5 10 15
Thr Leu Ser His His Tyr Tyr Asn Leu Asp Glu Pro He Val Ser Asp
20 25 30
Ala He Tyr Asp Glu Leu Tyr Gin Glu Leu Lys Ala Tyr Glu Glu Lys
35 40 45
Asn Pro Asn Gly He Gin Ala Asn Ser Pro Thr Gin Lys Val Gly Ala
50 55 60
Thr Thr Thr Asn Ser Phe Asn Lys Asn Pro His Leu Met Arg Met Trp 65 70 75 80
Ser Leu Asp Asp Val Phe Asn Gin Ser Glu Leu Gin Ala Trp Leu Gin
85 90 95
Arg He Leu Lys Ala Tyr Pro Ser Ala Ser Phe Val Cys Ser Pro Lys
100 105 110
Leu Asp Gly Val Ser Leu Asn Leu Leu Tyr Gin His Gly Lys Leu Val
115 120 125
Lys Ala Thr Thr Arg Gly Asn Gly Leu Glu Gly Glu Leu Val Ser Ala
130 135 140
Asn Ala Lys His He Ala Asn He Pro His Ala He Ala Tyr Asn Gly 145 150 155 160
Glu He Glu He Arg Gly Glu Val He He Ser Lys Lys Asp Phe Asp
165 170 175
Ala Leu Asn Gin Glu Arg Leu Asn Ala Asn Glu Pro Leu Phe Ala Asn
180 185 190
Pro Arg Asn Ala Ala Ser Gly Ser Leu Arg Gin Leu Asp Ser Glu He
195 200 205
Thr Lys Lys Arg Lys Leu Gin Phe He Pro Trp Gly Val Gly Lys His
210 215 220
Ser Leu Asn Phe Leu Ser Phe Lys Glu Cys Leu Asp Phe He Val Ser 225 230 235 240
Leu Gly Phe Ser Ala He Gin Tyr Leu Ser Leu Asn Lys Asn His Gin
245 250 255
Glu He Glu Asp Asn Tyr His Thr Leu He Arg Glu Arg Glu Gly Phe
260 265 270
Phe Ala Leu Leu Asp Gly Met Val He Val Val Asn Glu Leu Asn He
275 280 285
Gin Lys Glu Leu Gly Tyr Thr Gin Lys Ser Pro Lys Xaa Leu Ala Leu 290 295 300
He Asn Ser Arg Leu 305
(2) INFORMATION FOR SEQ ID NO: 763:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 937 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 44...880 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 763:
GTGCTTTAGA TTAGATGCAG AAAAAGACGC CCAACTTTAT GGC ATG AAT ATT TTT 55
Met Asn He Phe 1
AAG ATC CGA GAA ATT ATC CAT TAT GAC GGG GAG GTT ACA GAG ATT CTT 103 Lys He Arg Glu He He His Tyr Asp Gly Glu Val Thr Glu He Leu 5 10 15 20
GGG GGG AGC GAT GGC GTG ATG CTC GGG TTT CTT AGC GTT AGG GGC GAG 151 Gly Gly Ser Asp Gly Val Met Leu Gly Phe Leu Ser Val Arg Gly Glu 25 30 35
TCT ATC CCT TTA GTG GAT GTG AAA AGG TGG TTG CAT TAT AAC GCT AAT 199 Ser He Pro Leu Val Asp Val Lys Arg Trp Leu His Tyr Asn Ala Asn 40 45 50
GAT CCG AGC CGT GAT CTA AAA GAA TGC AGC GTT AAA GAT GAC CAT AAT 247 Asp Pro Ser Arg Asp Leu Lys Glu Cys Ser Val Lys Asp Asp His Asn 55 60 65
TTG GTG ATT GTG TGC CAT TTT TCT AAC CAT TCC ATC GCT CTA AAG GTT 295 Leu Val He Val Cys His Phe Ser Asn His Ser He Ala Leu Lys Val 70 75 80
TTA AAA ATT GAA AGG ATC ATC CAT AAA AAT TGG ACT GAG ATT AGC GCT 343 Leu Lys He Glu Arg He He His Lys Asn Trp Thr Glu He Ser Ala 85 90 95 100
GGG GAC AAA CAA GGC ATT AAT GAA GAG GGT AAG CTT AGC GCT ATC ACT 391 Gly Asp Lys Gin Gly He Asn Glu Glu Gly Lys Leu Ser Ala He Thr 105 110 115 CGT TTT GAT GAA GAA CGA GTG GTG CAG ATC TTA GAT GTG GAA AAA ATG 439 Arg Phe Asp Glu Glu Arg Val Val Gin He Leu Asp Val Glu Lys Met 120 125 130
ATT AGC GAT GTT TTC CCT AGC TTG AAA GAT TTA GAC GAT TTG ACT TTG 487 He Ser Asp Val Phe Pro Ser Leu Lys Asp Leu Asp Asp Leu Thr Leu 135 140 145
CGT TGC ATA GAA GCC ATT CAA AGC CAA AAA CTC ATT TTA ATC GCT GAA 535 Arg Cys He Glu Ala He Gin Ser Gin Lys Leu He Leu He Ala Glu 150 155 160
GAC TCC CTA AGC GCT CTT AAA ACC TTA GAA AAG ATC GTT CAA ACT TTA 583 Asp Ser Leu Ser Ala Leu Lys Thr Leu Glu Lys He Val Gin Thr Leu 165 170 175 180
GAA TTG CGT TAT TTA GCT TTT CCA AAC GGG AGG GAA TTG TTG GAT TAT 631 Glu Leu Arg Tyr Leu Ala Phe Pro Asn Gly Arg Glu Leu Leu Asp Tyr 185 190 195
TTG TAT GAA AAA GAA CAT TAC CAA CAA GTT GGC GTG GTC ATT ACG GAT 679 Leu Tyr Glu Lys Glu His Tyr Gin Gin Val Gly Val Val He Thr Asp 200 205 210
TTA GAA ATG CCT AAC ATT TCA GGG TTT GAA GTG TTA AAA ACC ATT AAA 727 Leu Glu Met Pro Asn He Ser Gly Phe Glu Val Leu Lys Thr He Lys 215 220 225
GCT GAT CAT AGA ACT GAG CAT CTT CCT GTG ATT ATC AAT TCG TCC ATG 775 Ala Asp His Arg Thr Glu His Leu Pro Val He He Asn Ser Ser Met 230 235 240
AGC AGC GAT TCT AAC CGC CAG TTA GCC CAA TCT TTA GAA GCG GAT GGT 823 Ser Ser Asp Ser Asn Arg Gin Leu Ala Gin Ser Leu Glu Ala Asp Gly 245 250 255 260
TTT GTG GTA AAA TCT AAC ATT CTT GAA ATC CAT GAA ATG CTT AAA AAA 871 Phe Val Val Lys Ser Asn He Leu Glu He His Glu Met Leu Lys Lys 265 270 275
ACG CTT TCA TAAATTTAAT TTTTGTTTTA ATTTAAAGGG ATAAAACATG CGAAGTCAT 929 Thr Leu Ser
TTTTGCAC 937
(2) INFORMATION FOR SEQ ID NO: 764:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 279 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION. SEQ ID NO: 764.
Met Asn He Phe Lys He Arg Glu He He His Tyr Asp Gly Glu Val
1 5 10 15
Thr Glu He Leu Gly Gly Ser Asp Gly Val Met Leu Gly Phe Leu Ser
20 25 30
Val Arg Gly Glu Ser He Pro Leu Val Asp Val Lys Arg Trp Leu His
35 40 45
Tyr Asn Ala Asn Asp Pro Ser Arg Asp Leu Lys Glu Cys Ser Val Lys
50 55 60
Asp Asp His Asn Leu Val He Val Cys His Phe Ser Asn His Ser He 65 70 75 80
Ala Leu Lys Val Leu Lys He Glu Arg He He His Lys Asn Trp Thr
85 90 95
Glu He Ser Ala Gly Asp Lys Gin Gly He Asn Glu Glu Gly Lys Leu
100 105 110
Ser Ala He Thr Arg Phe Asp Glu Glu Arg Val Val Gin He Leu Asp
115 120 125
Val Glu Lys Met He Ser Asp Val Phe Pro Ser Leu Lys Asp Leu Asp
130 135 140
Asp Leu Thr Leu Arg Cys He Glu Ala He Gin Ser Gin Lys Leu He 145 150 155 160
Leu He Ala Glu Asp Ser Leu Ser Ala Leu Lys Thr Leu Glu Lys He
165 170 175
Val Gin Thr Leu Glu Leu Arg Tyr Leu Ala Phe Pro Asn Gly Arg Glu
180 185 190
Leu Leu Asp Tyr Leu Tyr Glu Lys Glu His Tyr Gin Gin Val Gly Val
195 200 205
Val He Thr Asp Leu Glu Met Pro Asn He Ser Gly Phe Glu Val Leu
210 215 220
Lys Thr He Lys Ala Asp His Arg Thr Glu His Leu Pro Val He He 225 230 235 240
Asn Ser Ser Met Ser Ser Asp Ser Asn Arg Gin Leu Ala Gin Ser Leu
245 250 255
Glu Ala Asp Gly Phe Val Val Lys Ser Asn He Leu Glu He His Glu
260 265 270
Met Leu Lys Lys Thr Leu Ser 275
(2) INFORMATION FOR SEQ ID NO: 765:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 630 base pairs
(B) TYPE: nucleic acid
(C) S RANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 21...593 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 765:
ATAATTTAAA AGGATACGAT ATG AAA CAA CTA TTT TTG ATC ATT GGA GCC 50
Met Lys Gin Leu Phe Leu He He Gly Ala 1 5 10
CCA GGG AGT GGT AAA ACC ACT GAT GCA GAG CTT ATC GCT AAA AAT AAC 98 Pro Gly Ser Gly Lys Thr Thr Asp Ala Glu Leu He Ala Lys Asn Asn 15 20 25
AGC GAA ACA ATC GCT CAT TTT TCT ACC GGG GAT TTA CTC AGG GCT GAG 146 Ser Glu Thr He Ala His Phe Ser Thr Gly Asp Leu Leu Arg Ala Glu 30 35 40
AGC GCT AAA AAG ACC GAG CGA GGC TTA TTG ATT GAA AAA TTC ACT TCT 194 Ser Ala Lys Lys Thr Glu Arg Gly Leu Leu He Glu Lys Phe Thr Ser 45 50 55
CAA GGC GAA TTA GTG CCT TTA GAA ATT GTG GTA GAA ACG ATC CTT TCA 242 Gin Gly Glu Leu Val Pro Leu Glu He Val Val Glu Thr He Leu Ser 60 65 70
GCG ATT AAA AGC TCT GGT AAA GGG ATC ATT TTA ATT GAT GGT TAT CCT 290 Ala He Lys Ser Ser Gly Lys Gly He He Leu He Asp Gly Tyr Pro 75 80 85 90
AGG AGC GTG GAA CAA ATG CAG GCT TTG GAT AAG GAA TTG AAC GCT CAA 338 Arg Ser Val Glu Gin Met Gin Ala Leu Asp Lys Glu Leu Asn Ala Gin 95 100 105
AAC GAA GTG ATC TTA AAA AGC GTG ATT GAA GTA GAA GTG AGT GAA AAC 386 Asn Glu Val He Leu Lys Ser Val He Glu Val Glu Val Ser Glu Asn 110 115 120
ACT GCT AAA GAA AGG GTT TTA GGG CGC TCT AGG GGG GCT GAT GAT AAT 434 Thr Ala Lys Glu Arg Val Leu Gly Arg Ser Arg Gly Ala Asp Asp Asn 125 130 135
GAA AAG GTG TTT CAT AAC CGC ATG CGG GTG TTT TTG GAT CCG TTG GGC 482 Glu Lys Val Phe His Asn Arg Met Arg Val Phe Leu Asp Pro Leu Gly 140 145 150
GAG ATC CAA AAT TTT TAC AAG AAT AAG AAG GTG TAT AAA GCG ATC GAT 530 Glu He Gin Asn Phe Tyr Lys Asn Lys Lys Val Tyr Lys Ala He Asp 155 160 165 170
GGG GAG AGG AGC ATT GAA GAG ATT GTG GGC GAA ATG CAA GAG TAT ATC 578 Gly Glu Arg Ser He Glu Glu He Val Gly Glu Met Gin Glu Tyr He 175 180 185
TTG TCT TTC GGT AAT TAAAATGCAC TCTCAAGGAG AATAGCTGTG ATTTCTG 630
Leu Ser Phe Gly Asn 190 (2) INFORMATION FOR SEQ ID NO: 766:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 191 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:766:
Met Lys Gin Leu Phe Leu He He Gly Ala Pro Gly Ser Gly Lys Thr
1 5 10 15
Thr Asp Ala Glu Leu He Ala Lys Asn Asn Ser Glu Thr He Ala His
20 25 30
Phe Ser Thr Gly Asp Leu Leu Arg Ala Glu Ser Ala Lys Lys Thr Glu
35 40 45
Arg Gly Leu Leu He Glu Lys Phe Thr Ser Gin Gly Glu Leu Val Pro
50 55 60
Leu Glu He Val Val Glu Thr He Leu Ser Ala He Lys Ser Ser Gly 65 70 75 80
Lys Gly He He Leu He Asp Gly Tyr Pro Arg Ser Val Glu Gin Met
85 90 95
Gin Ala Leu Asp Lys Glu Leu Asn Ala Gin Asn Glu Val He Leu Lys
100 105 110
Ser Val He Glu Val Glu Val Ser Glu Asn Thr Ala Lys Glu Arg Val
115 120 125
Leu Gly Arg Ser Arg Gly Ala Asp Asp Asn Glu Lys Val Phe His Asn
130 135 140
Arg Met Arg Val Phe Leu Asp Pro Leu Gly Glu He Gin Asn Phe Tyr 145 150 155 160
Lys Asn Lys Lys Val Tyr Lys Ala He Asp Gly Glu Arg Ser He Glu
165 170 175
Glu He Val Gly Glu Met Gin Glu Tyr He Leu Ser Phe Gly Asn 180 185 190
(2) INFORMATION FOR SEQ ID NO: 767:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 777 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...717 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 767:
AGT TAC CCC CCC CCC CCC AAT CCC ACA CAA GAA ACG CAA CAA GAT TTT 48 Ser Tyr Pro Pro Pro Pro Asn Pro Thr Gin Glu Thr Gin Gin Asp Phe 1 5 10 15
ATT ATT GAA GCA CAA CAA GAT TTG ATT ATT GAA ACG CAA CAA GAC CCC 96 He He Glu Ala Gin Gin Asp Leu He He Glu Thr Gin Gin Asp Pro 20 25 30
AAA GAA CTA CCT GAG TCT TGC AAA ATA ACG CCC CAA AAA ATC TCT TTT 144 Lys Glu Leu Pro Glu Ser Cys Lys He Thr Pro Gin Lys He Ser Phe 35 40 45
AAC CAA GTG GTT TTT AAA AAA ATT AAA AGA AAA CTC AAC CGC TTC ATT 192 Asn Gin Val Val Phe Lys Lys He Lys Arg Lys Leu Asn Arg Phe He 50 55 60
GGA AGC ATT TTA GCT CGG ACA GAA GTG TAT AAG AAT CTC GTG GCA AAA 240 Gly Ser He Leu Ala Arg Thr Glu Val Tyr Lys Asn Leu Val Ala Lys 65 70 75 80
TAC GAT GAA CTC ACA GGA AAA TAC GAA TCA TTA TTG GCA AAA GAG GCA 288 Tyr Asp Glu Leu Thr Gly Lys Tyr Glu Ser Leu Leu Ala Lys Glu Ala 85 90 95
AAC ATC AAA GAG ACC TTT TGG GAA AGG CGT GCT GAT AGC GAA AAA GAA 336 Asn He Lys Glu Thr Phe Trp Glu Arg Arg Ala Asp Ser Glu Lys Glu 100 105 110
GCC TTT TTT TTA GAG CAT TTT TAC CTC ACT AGC GTG TAT GTG GCT TCT 384 Ala Phe Phe Leu Glu His Phe Tyr Leu Thr Ser Val Tyr Val Ala Ser 115 120 125
ACA GCA GGA TAC TAT ATC ACG CCT AAG GGC GCT AAA ACC TTT ATA GAA 432 Thr Ala Gly Tyr Tyr He Thr Pro Lys Gly Ala Lys Thr Phe He Glu 130 135 140
GCC ACG GAG CGT TTT AAA ATC ATA GAG CCG GTG GAT ATG TTC ATA AAC 480 Ala Thr Glu Arg Phe Lys He He Glu Pro Val Asp Met Phe He Asn 145 150 155 160
AAC CCC ACT TAC CAT GAT GTG GCT AAT TTT ACC TAT TTG CCT TGC CCT 528 Asn Pro Thr Tyr His Asp Val Ala Asn Phe Thr Tyr Leu Pro Cys Pro 165 170 175
GTT TCT TTA AAC AAG CAT GCT TTC AAT AGC ACC ATT CAA AAT GCA AAA 576 Val Ser Leu Asn Lys His Ala Phe Asn Ser Thr He Gin Asn Ala Lys 180 185 190
AAG CCT GAC ATT TCA TTA AAA CCC CCT AGA AAA TCC TAT TTT GAT AAT 624 Lys Pro Asp He Ser Leu Lys Pro Pro Arg Lys Ser Tyr Phe Asp Asn 195 200 205
CTT TTT TAT GAT CAA TTA AAC ACT AGA AAG TGC TTA AAA GCC TTT CAC 672 Leu Phe Tyr Asp Gin Leu Asn Thr Arg Lys Cys Leu Lys Ala Phe His 210 215 220
AAA TAC AGC AGA CGA TAC GCT CCT TTA AAA ACC CCT AAA GAG GTT TAAAA 722 Lys Tyr Ser Arg Arg Tyr Ala Pro Leu Lys Thr Pro Lys Glu Val 225 230 235
AGAGCGGGCT TTATGTTAGA ATAAGTCTTT TTATTCAAAG GAGATTGCAA TGAAT 777
(2) INFORMATION FOR SEQ ID NO: 768:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 239 ammo acids
(B) TYPE: ammo acid
(C) STRANDEDNESS- single
(D) TOPOLOGY: linear
(n) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 768:
Ser Tyr Pro Pro Pro Pro Asn Pro Thr Gin Glu Thr Gin Gin Asp Phe
1 5 10 15
He He Glu Ala Gin Gin Asp Leu He He Glu Thr Gin Gin Asp Pro
20 25 30
Lys Glu Leu Pro Glu Ser Cys Lys He Thr Pro Gin Lys He Ser Phe
35 40 45
Asn Gin Val Val Phe Lys Lys He Lys Arg Lys Leu Asn Arg Phe He
50 55 60
Gly Ser He Leu Ala Arg Thr Glu Val Tyr Lys Asn Leu Val Ala Lys 65 70 75 80
Tyr Asp Glu Leu Thr Gly Lys Tyr Glu Ser Leu Leu Ala Lys Glu Ala
85 90 95
Asn He Lys Glu Thr Phe Trp Glu Arg Arg Ala Asp Ser Glu Lys Glu
100 105 110
Ala Phe Phe Leu Glu His Phe Tyr Leu Thr Ser Val Tyr Val Ala Ser
115 120 125
Thr Ala Gly Tyr Tyr He Thr Pro Lys Gly Ala Lys Thr Phe He Glu
130 135 140
Ala Thr Glu Arg Phe Lys He He Glu Pro Val Asp Met Phe He Asn 145 150 155 160
Asn Pro Thr Tyr His Asp Val Ala Asn Phe Thr Tyr Leu Pro Cys Pro
165 170 175
Val Ser Leu Asn Lys His Ala Phe Asn Ser Thr He Gin Asn Ala Lys
180 185 190
Lys Pro Asp He Ser Leu Lys Pro Pro Arg Lys Ser Tyr Phe Asp Asn
195 200 205
Leu Phe Tyr Asp Gin Leu Asn Thr Arg Lys Cys Leu Lys Ala Phe His
210 215 220
Lys Tyr Ser Arg Arg Tyr Ala Pro Leu Lys Thr Pro Lys Glu Val 225 230 235
(2) INFORMATION FOR SEQ ID NO: 769:
(l) SEQUENCE CHARACTERISTICS. (A) LENGTH: 1350 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(11) MOLECULE TYPE. Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 50...1252 (D) OTHER INFORMATION:
(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 769:
AAGTGGAAAA TTTAGCTAAA GAAAGAGAAA AAAGTTTAAA GGATTAGGC ATG ATC AAT 58
Met He Asn
1
AAG TTT AAA AAT TTT GTG AGC AAC TAC CAG CAA TCT AAC CAC TAT AAA 106 Lys Phe Lys Asn Phe Val Ser Asn Tyr Gin Gin Ser Asn His Tyr Lys 5 10 15
GAG CCT TTA GGT TTT GGC ATT GCC AGA GTG GAT ATT GCC CCT ATT TCC 154 Glu Pro Leu Gly Phe Gly He Ala Arg Val Asp He Ala Pro He Ser 20 25 30 35
AAA AAG ATT TTA TGC GCC ACT TAC CCT GTT TTG AAT TGG AAA GAT GAA 202 Lys Lys He Leu Cys Ala Thr Tyr Pro Val Leu Asn Trp Lys Asp Glu 40 45 50
AAT TTA GGC TCT TAT GCG GTG TTT TGC AAC TCG CTT TCA AAA GAA AAA 250 Asn Leu Gly Ser Tyr Ala Val Phe Cys Asn Ser Leu Ser Lys Glu Lys 55 60 65
ATC CTA AAA GAG AGC GCG AGC GAG CGC GTT ATT GAG ATT GAT GAA AGT 298 He Leu Lys Glu Ser Ala Ser Glu Arg Val He Glu He Asp Glu Ser 70 75 80
TTT GTG TTA AAA GCG TTG GAT TTT TAT ACG CCC TTT TTG AAT GAA GCC 346 Phe Val Leu Lys Ala Leu Asp Phe Tyr Thr Pro Phe Leu Asn Glu Ala 85 90 95
TAT TCT AAT AAA ATG GCT CAT AAA AAC ATC CAA GTG GTT TTA GAG CTT 394 Tyr Ser Asn Lys Met Ala His Lys Asn He Gin Val Val Leu Glu Leu 100 105 110 115
TTA AAG GCT TTA GAA GAA AAT CGT TTG AAA AAT AGC GAT GGG GAG TCT 442 Leu Lys Ala Leu Glu Glu Asn Arg Leu Lys Asn Ser Asp Gly Glu Ser 120 125 130
CTT TAT CGC TTG GTG ATC TTG TAT GAA GAT AAG CCT TGC GAG AGC GTG 490 Leu Tyr Arg Leu Val He Leu Tyr Glu Asp Lys Pro Cys Glu Ser Val 135 140 145 GAG AGC GCG TAT ATG AAA CTT TTA GCG CTC TCT TTA GGT AAA GCC CCT 538 Glu Ser Ala Tyr Met Lys Leu Leu Ala Leu Ser Leu Gly Lys Ala Pro 150 155 160
TTG AGG AGT TTG AAT TTA GAG GGT ATT TTT AAC CAG CTT TCT AAT GCG 586 Leu Arg Ser Leu Asn Leu Glu Gly He Phe Asn Gin Leu Ser Asn Ala 165 170 175
GCC TGG AGC GGT AAC AAG CCC TAT GAA TTA GAA TGG CTT AGA ATG AAC 634 Ala Trp Ser Gly Asn Lys Pro Tyr Glu Leu Glu Trp Leu Arg Met Asn 180 185 190 195
GAA GTG GCT TTA AAA ATG CGA GAC CAT TTC CCT AGC ATT GAT TTC ATA 682 Glu Val Ala Leu Lys Met Arg Asp His Phe Pro Ser He Asp Phe He 200 205 210
GAT AAA TTC CCA CGC TAT TTG ATG CAA TTA ATC CCT GAG TTT GAT AAT 730 Asp Lys Phe Pro Arg Tyr Leu Met Gin Leu He Pro Glu Phe Asp Asn 215 220 225
ATC CGC TTA TTG GAT AGC TCA AAA ACG CGC TTT GGG GCG TAT TTA GGG 778 He Arg Leu Leu Asp Ser Ser Lys Thr Arg Phe Gly Ala Tyr Leu Gly 230 235 240
ACT GGA GGT TAT ACC CAA ATG CCT GGG GCT AGT TAT GTG AAT TTT AAC 826 Thr Gly Gly Tyr Thr Gin Met Pro Gly Ala Ser Tyr Val Asn Phe Asn 245 250 255
GCA GGG GCT ATG GGA GTG TGC ATG AAT GAG GGG CGT ATT TCT TCA TCG 874 Ala Gly Ala Met Gly Val Cys Met Asn Glu Gly Arg He Ser Ser Ser 260 265 270 275
GTG GTG GTT GGA GCA GGC ACT GAT ATT GGT GGG GGA GCG AGC GTG TTA 922 Val Val Val Gly Ala Gly Thr Asp He Gly Gly Gly Ala Ser Val Leu 280 285 290
GGC GTT TTA AGT GGA GGG AAT AAC AAC CCC ATT AGC ATC GGG AAA AAT 970 Gly Val Leu Ser Gly Gly Asn Asn Asn Pro He Ser He Gly Lys Asn 295 300 305
TGT TTG CTA GGG GCT AAT AGC GTT ACT GGA ATT AGT CTA GGC GAT GGC 1018 Cys Leu Leu Gly Ala Asn Ser Val Thr Gly He Ser Leu Gly Asp Gly 310 315 320
TGT ATC GTG GAT GCA GGC GTT GCG ATA CTA GCC GGG AGC GTG ATA GAA 1066 Cys He Val Asp Ala Gly Val Ala He Leu Ala Gly Ser Val He Glu 325 330 335
ATT GAA GAA AAT GAG TTT AAA AAG CTT TTA GAA GTG AAT AGC GCT TTA 1114 He Glu Glu Asn Glu Phe Lys Lys Leu Leu Glu Val Asn Ser Ala Leu 340 345 350 355
GAA AAA CAT GCC AAC AAC CTT TAC AAA GGC AAA GAA CTT TCC GGA AAA 1162 Glu Lys His Ala Asn Asn Leu Tyr Lys Gly Lys Glu Leu Ser Gly Lys 360 365 370 AAT GGC GTG CAT TTT CGT TCC AAT AGT CAG AAT GGC AAG CTG ATT GCT 1210 Asn Gly Val His Phe Arg Ser Asn Ser Gin Asn Gly Lys Leu He Ala 375 380 385
TTT AGG AGC GTG AAA AAA ATT GAG TTG AAT CAA AAC CTG CAT TAAGGATTA 1261 Phe Arg Ser Val Lys Lys He Glu Leu Asn Gin Asn Leu His 390 395 400
AAAGAATGCT CAAAAAAAGT TTGTTATTGC TTGTTTTTTT AGTCTTACAG CTTAGCGGCG 1321 CTGAAGAAAA CAATCAAGCC CCAAAAAAC 1350
(2) INFORMATION FOR SEQ ID NO: 770:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 401 ammo acids
Figure imgf001160_0001
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE, protein
(xi) SEQUENCE DESCRIPTION. SEQ ID NO: 770:
Met He Asn Lys Phe Lys Asn Phe Val Ser Asn Tyr Gin Gin Ser Asn
1 5 10 15
His Tyr Lys Glu Pro Leu Gly Phe Gly He Ala Arg Val Asp He Ala
20 25 30
Pro He Ser Lys Lys He Leu Cys Ala Thr Tyr Pro Val Leu Asn Trp
35 40 45
Lys Asp Glu Asn Leu Gly Ser Tyr Ala Val Phe Cys Asn Ser Leu Ser
50 55 60
Lys Glu Lys He Leu Lys Glu Ser Ala Ser Glu Arg Val He Glu He 65 70 75 80
Asp Glu Ser Phe Val Leu Lys Ala Leu Asp Phe Tyr Thr Pro Phe Leu
85 90 95
Asn Glu Ala Tyr Ser Asn Lys Met Ala His Lys Asn He Gin Val Val
100 105 110
Leu Glu Leu Leu Lys Ala Leu Glu Glu Asn Arg Leu Lys Asn Ser Asp
115 120 125
Gly Glu Ser Leu Tyr Arg Leu Val He Leu Tyr Glu Asp Lys Pro Cys
130 135 140
Glu Ser Val Glu Ser Ala Tyr Met Lys Leu Leu Ala Leu Ser Leu Gly 145 150 155 160
Lys Ala Pro Leu Arg Ser Leu Asn Leu Glu Gly He Phe Asn Gin Leu
165 170 175
Ser Asn Ala Ala Trp Ser Gly Asn Lys Pro Tyr Glu Leu Glu Trp Leu
180 185 190
Arg Met Asn Glu Val Ala Leu Lys Met Arg Asp His Phe Pro Ser He
195 200 205
Asp Phe He Asp Lys Phe Pro Arg Tyr Leu Met Gin Leu He Pro Glu
210 215 220
Phe Asp Asn He Arg Leu Leu Asp Ser Ser Lys Thr Arg Phe Gly Ala 225 230 235 240
Tyr Leu Gly Thr Gly Gly Tyr Thr Gin Met Pro Gly Ala Ser Tyr Val 245 250 255 Asn Phe Asn Ala Gly Ala Met Gly Val Cys Met Asn Glu Gly Arg He
260 265 270
Ser Ser Ser Val Val Val Gly Ala Gly Thr Asp He Gly Gly Gly Ala
275 280 285
Ser Val Leu Gly Val Leu Ser Gly Gly Asn Asn Asn Pro He Ser He
290 295 300
Gly Lys Asn Cys Leu Leu Gly Ala Asn Ser Val Thr Gly He Ser Leu 305 310 315 320
Gly Asp Gly Cys He Val Asp Ala Gly Val Ala He Leu Ala Gly Ser
325 330 335
Val He Glu He Glu Glu Asn Glu Phe Lys Lys Leu Leu Glu Val Asn
340 345 350
Ser Ala Leu Glu Lys His Ala Asn Asn Leu Tyr Lys Gly Lys Glu Leu
355 360 365
Ser Gly Lys Asn Gly Val His Phe Arg Ser Asn Ser Gin Asn Gly Lys
370 375 380
Leu He Ala Phe Arg Ser Val Lys Lys He Glu Leu Asn Gin Asn Leu 385 390 395 400
His
(2) INFORMATION FOR SEQ ID NO: 771:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1304 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 50...1201 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 771:
TTATTTTTAT TATGTTAAGA TAATGAAAAT TTCTAATTAA GGAGTGGTC ATG TTC TAC 58
Met Phe Tyr 1
GAT GAA AAA AAG ACC TAT CAA AAG ATT GAA GAA CGC CTT GAT ATA GTC 106 Asp Glu Lys Lys Thr Tyr Gin Lys He Glu Glu Arg Leu Asp He Val 5 10 15
CGT TCG TTT AAC GCT CAC AAC GAG CAT AAA AAC TTG CAA GAC GAG TTT 154 Arg Ser Phe Asn Ala His Asn Glu His Lys Asn Leu Gin Asp Glu Phe 20 25 30 35
AAA GGG GCG GGC ATT TCT AGG CGC GAT TTA TTG AAG TGG GCG GGC ATG 202 Lys Gly Ala Gly He Ser Arg Arg Asp Leu Leu Lys Trp Ala Gly Met 40 45 50 ATG AGC ACA GCG TTA GCC TTG CCG GCT AGT TTT GCT CCC TTG ACT TTG 250 Met Ser Thr Ala Leu Ala Leu Pro Ala Ser Phe Ala Pro Leu Thr Leu 55 60 65
AAG GCG GTG GAA GTG GCT AAC AGA TTG CCC GTG ATT TGG TTG CAC ATG 298 Lys Ala Val Glu Val Ala Asn Arg Leu Pro Val He Trp Leu His Met 70 75 80
GCA GAA TGC ACC GGT TGT AGC GAA AGT TTG TTA AGG AGC GCA GAC CCC 346 Ala Glu Cys Thr Gly Cys Ser Glu Ser Leu Leu Arg Ser Ala Asp Pro 85 90 95
ACC ATT GAT AGC ATT ATC TTT GAT TAC ATC AAC CTA GAA TAC CAT GAG 394 Thr He Asp Ser He He Phe Asp Tyr He Asn Leu Glu Tyr His Glu 100 105 110 115
ACC ATC ATG GTA GCG AGC GGT TTT CAA GCT GAA AAA AGC TTG CAT GAC 442 Thr He Met Val Ala Ser Gly Phe Gin Ala Glu Lys Ser Leu His Asp 120 125 130
GCC ATA GAA AAG CAT AAA AAC AAT TAC ATT TTA ATG GTA GAA GGG GGT 490 Ala He Glu Lys His Lys Asn Asn Tyr He Leu Met Val Glu Gly Gly 135 140 145
ATC CCC CAA GGC ACG GAA TAC TTC CTC ACT CAA GGC CCA AAC GCT GAA 538 He Pro Gin Gly Thr Glu Tyr Phe Leu Thr Gin Gly Pro Asn Ala Glu 150 155 160
ACG GGA GCT GAA GAG TGT AGG AAA GCC GCT CAA TAC GCA GCC GCT ATT 586 Thr Gly Ala Glu Glu Cys Arg Lys Ala Ala Gin Tyr Ala Ala Ala He 165 170 175
TTT GCC ATA GGC ACA TGC TCA AGT TTT GGG GGC GTT CAA GCG GCT TAC 634 Phe Ala He Gly Thr Cys Ser Ser Phe Gly Gly Val Gin Ala Ala Tyr 180 185 190 195
CCT AAC CCC TCT AAC GCG CAA CCC TTG CAT AAA ATC ATT GAT AAA CCC 682 Pro Asn Pro Ser Asn Ala Gin Pro Leu His Lys He He Asp Lys Pro 200 205 210
GTG ATC AAT GTT CCT GGT TGC CCG CCT AGT GAA AAA AAT ATC GTG GGT 730 Val He Asn Val Pro Gly Cys Pro Pro Ser Glu Lys Asn He Val Gly 215 220 225
AAT GTG CTT TAT TAC TTG ATG TTT GGG GCT CTC CCT AAA TTG GAT GCG 778 Asn Val Leu Tyr Tyr Leu Met Phe Gly Ala Leu Pro Lys Leu Asp Ala 230 235 240
TAT AAC CGC CCC TCT TGG GCT TAT GGG AAC AGG ATC CAT GAT TTG TGC 826 Tyr Asn Arg Pro Ser Trp Ala Tyr Gly Asn Arg He His Asp Leu Cys 245 250 255
GAA AGG AGA GGG CAT TTT GAT GCG GGC GAA TTT GTG GAG CAT TTT GGC 874 Glu Arg Arg Gly His Phe Asp Ala Gly Glu Phe Val Glu His Phe Gly 260 265 270 275 GAT GAA AAC GCT AAA AGG GGC TTT TGT TTG TAT AAA ATG GGC TGT AAA 922 Asp Glu Asn Ala Lys Arg Gly Phe Cys Leu Tyr Lys Met Gly Cys Lys 280 285 290
GGG CCT TAC ACT TTC AAC AAT TGC TCC AAA CTC CGC TTC AAT TCA CAC 970 Gly Pro Tyr Thr Phe Asn Asn Cys Ser Lys Leu Arg Phe Asn Ser His 295 300 305
ACC TCT TGG CCC ATA GGT GCA GGG CAT GGG TGT ATA GGG TGT TCT GAG 1018 Thr Ser Trp Pro He Gly Ala Gly His Gly Cys He Gly Cys Ser Glu 310 315 320
CCT AAT TTT TGG GAT ACG ATG AGT CCT TTT GAA GAG CCT TTA GCG AAT 1066 Pro Asn Phe Trp Asp Thr Met Ser Pro Phe Glu Glu Pro Leu Ala Asn 325 330 335
CGT TCC ATT AAA ACC GCT TTT GAC GGA TTA GGG GCT GAT AAA GTA GCG 1114 Arg Ser He Lys Thr Ala Phe Asp Gly Leu Gly Ala Asp Lys Val Ala 340 345 350 355
GAT AAA GTA GGC ACG ACT TTG CTG AGC GCA ACC GCT ATT GGC ATT GTT 1162 Asp Lys Val Gly Thr Thr Leu Leu Ser Ala Thr Ala He Gly He Val 360 365 370
GCG CAT GCG CTC CTT TCT AAA GCG ATC AAA AAC AAA GAG TAAGGGATTA AC 1213 Ala His Ala Leu Leu Ser Lys Ala He Lys Asn Lys Glu 375 380
ATGTCAAAAA AAATCGTAGT CGATCCTATC ACTAGGATTG AGGGGCATTT AAGGATTGAA 1273 GTGATCGTAG ATGATGATAA CGTGATCACT G 1304
(2) INFORMATION FOR SEQ ID NO:772:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 384 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 772:
Met Phe Tyr Asp Glu Lys Lys Thr Tyr Gin Lys He Glu Glu Arg Leu
1 5 10 15
Asp He Val Arg Ser Phe Asn Ala His Asn Glu His Lys Asn Leu Gin
20 25 30
Asp Glu Phe Lys Gly Ala Gly He Ser Arg Arg Asp Leu Leu Lys Trp
35 40 45
Ala Gly Met Met Ser Thr Ala Leu Ala Leu Pro Ala Ser Phe Ala Pro
50 55 60
Leu Thr Leu Lys Ala Val Glu Val Ala Asn Arg Leu Pro Val He Trp 65 70 75 80
Leu His Met Ala Glu Cys Thr Gly Cys Ser Glu Ser Leu Leu Arg Ser 85 90 95 Ala Asp Pro Thr He Asp Ser He He Phe Asp Tyr He Asn Leu Glu
100 105 110
Tyr His Glu Thr He Met Val Ala Ser Gly Phe Gin Ala Glu Lys Ser
115 120 125
Leu His Asp Ala He Glu Lys His Lys Asn Asn Tyr He Leu Met Val
130 135 140
Glu Gly Gly He Pro Gin Gly Thr Glu Tyr Phe Leu Thr Gin Gly Pro 145 150 155 160
Asn Ala Glu Thr Gly Ala Glu Glu Cys Arg Lys Ala Ala Gin Tyr Ala
165 170 175
Ala Ala He Phe Ala He Gly Thr Cys Ser Ser Phe Gly Gly Val Gin
180 185 190
Ala Ala Tyr Pro Asn Pro Ser Asn Ala Gin Pro Leu His Lys He He
195 200 205
Asp Lys Pro Val He Asn Val Pro Gly Cys Pro Pro Ser Glu Lys Asn
210 215 220
He Val Gly Asn Val Leu Tyr Tyr Leu Met Phe Gly Ala Leu Pro Lys 225 230 235 240
Leu Asp Ala Tyr Asn Arg Pro Ser Trp Ala Tyr Gly Asn Arg He His
245 250 255
Asp Leu Cys Glu Arg Arg Gly His Phe Asp Ala Gly Glu Phe Val Glu
260 265 270
His Phe Gly Asp Glu Asn Ala Lys Arg Gly Phe Cys Leu Tyr Lys Met
275 280 285
Gly Cys Lys Gly Pro Tyr Thr Phe Asn Asn Cys Ser Lys Leu Arg Phe
290 295 300
Asn Ser His Thr Ser Trp Pro He Gly Ala Gly His Gly Cys He Gly 305 310 315 320
Cys Ser Glu Pro Asn Phe Trp Asp Thr Met Ser Pro Phe Glu Glu Pro
325 330 335
Leu Ala Asn Arg Ser He Lys Thr Ala Phe Asp Gly Leu Gly Ala Asp
340 345 350
Lys Val Ala Asp Lys Val Gly Thr Thr Leu Leu Ser Ala Thr Ala He
355 360 365
Gly He Val Ala His Ala Leu Leu Ser Lys Ala He Lys Asn Lys Glu 370 375 380
(2) INFORMATION FOR SEQ ID NO: 773:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 810 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 39...710 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 773 TAGAGCCTAA TTTCGCTAAA TTCTAAAAAG GGTTACGC ATG GAT AAA ATG AAT AAG 56
Met Asp Lys Met Asn Lys 1 5
GTC GTT TTA CAC AAA GAA TAT TCC GGT TTT GTG CGC TTT TTC CAT TGG 104 Val Val Leu His Lys Glu Tyr Ser Gly Phe Val Arg Phe Phe His Trp 10 15 20
GTT AGG GCT TTG AGT ATT TTC GCT TTA ATC GCT ACA GGG TTT TAC ATC 152 Val Arg Ala Leu Ser He Phe Ala Leu He Ala Thr Gly Phe Tyr He 25 30 35
GCT TAC CCT TTT TTG CAG CCT AAT TCC AGC TTT TAT AAA GGG GTG TAT 200 Ala Tyr Pro Phe Leu Gin Pro Asn Ser Ser Phe Tyr Lys Gly Val Tyr 40 45 50
CTT TTA CAA GCT TAT GTG CGT TCT TTT CAT GTC ATG TTT GGG TTT TTG 248 Leu Leu Gin Ala Tyr Val Arg Ser Phe His Val Met Phe Gly Phe Leu 55 60 65 70
CTC ATT AGC GCA TTA ATC TTT AGA ACC TAT CTT TTT TTC ACT AAA GAA 296 Leu He Ser Ala Leu He Phe Arg Thr Tyr Leu Phe Phe Thr Lys Glu 75 80 85
AGC TTG ATG GAA CGC AAG AGT TTT AGC CAA CTT TTA AGC CCA AAA GCC 344 Ser Leu Met Glu Arg Lys Ser Phe Ser Gin Leu Leu Ser Pro Lys Ala 90 95 100
TGG ATT GAT CAG ATG AAA GCG TAT TTT CTT ATC AGC GGC AAA CCC CAC 392 Trp He Asp Gin Met Lys Ala Tyr Phe Leu He Ser Gly Lys Pro His 105 110 115
ACT AAA GGA GCG TAT AAC CCT ATC CAA CTC GTG GCT TAT TCC ACT TTG 440 Thr Lys Gly Ala Tyr Asn Pro He Gin Leu Val Ala Tyr Ser Thr Leu 120 125 130
ATT GTT TTG ATC GTG TTG ATG AGT TTG AGC GGG ATG GTG CTG TAT TAT 488 He Val Leu He Val Leu Met Ser Leu Ser Gly Met Val Leu Tyr Tyr 135 140 145 150
AAT GTC TAT CAT GCG GGG CTT GGA GCG TTT TTA GGA AGC GCT TTT AAG 536 Asn Val Tyr His Ala Gly Leu Gly Ala Phe Leu Gly Ser Ala Phe Lys 155 160 165
TGG TTT GAA ACG CTT TGT GGA GGG TTA GCG AAT GTT CGT TTC ATC CAC 584 Trp Phe Glu Thr Leu Cys Gly Gly Leu Ala Asn Val Arg Phe He His 170 175 180
CAC TTA GCG ACT TGG GGG TTT ATT TTG TTT GTC CCT GTG CAT GTT TAT 632 His Leu Ala Thr Trp Gly Phe He Leu Phe Val Pro Val His Val Tyr 185 190 195
ATG GTG TTT TTC CAT TCT ATC AGG TAT GAA AGT TCG GGG GCG GAT TCT 680 Met Val Phe Phe His Ser He Arg Tyr Glu Ser Ser Gly Ala Asp Ser 200 205 210 ATG ATT AAT GGC TAT GGT TAT ACC AAA GAA TGAGTCAAAA AATCCTAATT CTA 733 Met He Asn Gly Tyr Gly Tyr Thr Lys Glu 215 220
GGTATTGGCA ATATCCTTTT TGGCGATGAA GGGATTGGGG TGCATTTAGC CCACTACCTC 793 AAAAAAAATT TTTCTTT 810
(2) INFORMATION FOR SEQ ID NO: 774:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 224 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 774:
Met Asp Lys Met Asn Lys Val Val Leu His Lys Glu Tyr Ser Gly Phe
1 5 10 15
Val Arg Phe Phe His Trp Val Arg Ala Leu Ser He Phe Ala Leu He
20 25 30
Ala Thr Gly Phe Tyr He Ala Tyr Pro Phe Leu Gin Pro Asn Ser Ser
35 40 45
Phe Tyr Lys Gly Val Tyr Leu Leu Gin Ala Tyr Val Arg Ser Phe His
50 55 60
Val Met Phe Gly Phe Leu Leu He Ser Ala Leu He Phe Arg Thr Tyr 65 70 75 80
Leu Phe Phe Thr Lys Glu Ser Leu Met Glu Arg Lys Ser Phe Ser Gin
85 90 95
Leu Leu Ser Pro Lys Ala Trp He Asp Gin Met Lys Ala Tyr Phe Leu
100 105 110
He Ser Gly Lys Pro His Thr Lys Gly Ala Tyr Asn Pro He Gin Leu
115 120 125
Val Ala Tyr Ser Thr Leu He Val Leu He Val Leu Met Ser Leu Ser
130 135 140
Gly Met Val Leu Tyr Tyr Asn Val Tyr His Ala Gly Leu Gly Ala Phe 145 150 155 160
Leu Gly Ser Ala Phe Lys Trp Phe Glu Thr Leu Cys Gly Gly Leu Ala
165 170 175
Asn Val Arg Phe He His His Leu Ala Thr Trp Gly Phe He Leu Phe
180 185 190
Val Pro Val His Val Tyr Met Val Phe Phe His Ser He Arg Tyr Glu
195 200 205
Ser Ser Gly Ala Asp Ser Met He Asn Gly Tyr Gly Tyr Thr Lys Glu 210 215 220
(2) INFORMATION FOR SEQ ID NO: 775:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1543 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 32...1495 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 775:
AAGCCTGAAT TTTACGCCCC TTTTAGAGCG A ATG GCA TGC AAT TTG CAA GCG 52
Met Ala Cys Asn Leu Gin Ala 1 5
CGT TTT TAT AGC GTT TAT AAG GAT AAT ACC ACT TCT TTC TAC CTC CAA 100 Arg Phe Tyr Ser Val Tyr Lys Asp Asn Thr Thr Ser Phe Tyr Leu Gin 10 15 20
GCG AGC GCT GAA ACC ACT TTA GAG TTC GCG CAA AAA CTC AGC GAA ATT 148 Ala Ser Ala Glu Thr Thr Leu Glu Phe Ala Gin Lys Leu Ser Glu He 25 30 35
CTG CCC TTT TCT TTA GAT TTT AGC TTT TTG TCT TTA AAG GAA ATC ACA 196 Leu Pro Phe Ser Leu Asp Phe Ser Phe Leu Ser Leu Lys Glu He Thr 40 45 50 55
GAG CCT TTA GAT GAA AAT CTT TTC CAA ACA GCA AGC CTT TCA AAG CCC 244 Glu Pro Leu Asp Glu Asn Leu Phe Gin Thr Ala Ser Leu Ser Lys Pro 60 65 70
CTT TTT ATG AAC GCT AAA GAG CAT CAA GAT TTT TTA GAC AAA AAT TCA 292 Leu Phe Met Asn Ala Lys Glu His Gin Asp Phe Leu Asp Lys Asn Ser 75 80 85
TCT TTG TAT GCC GAT ACT CTG GGC TTG ATT AAA AAC ACC GCT TTT AAG 340 Ser Leu Tyr Ala Asp Thr Leu Gly Leu He Lys Asn Thr Ala Phe Lys 90 95 100
GGG GAT ATA ATC CAT AGC CCT AAA GAG CTT ATA GAT TGC TTA ACC CAA 388 Gly Asp He He His Ser Pro Lys Glu Leu He Asp Cys Leu Thr Gin 105 110 115
TTA AAA GGC ATG CTC AAA ACG CAA GAT TTT ATC CCT ATT TTC ACT TCT 436 Leu Lys Gly Met Leu Lys Thr Gin Asp Phe He Pro He Phe Thr Ser 120 125 130 135
AGA GAG GCG TTA TCC CTT TCT TTA AAA AAT CCC TCT CCA AGC GTT ATT 484 Arg Glu Ala Leu Ser Leu Ser Leu Lys Asn Pro Ser Pro Ser Val He 140 145 150
TTT AGC GAT CTT TCT AGC GTT TTG AGC TGC ACT AAA TTG CCT TTA GAG 532 Phe Ser Asp Leu Ser Ser Val Leu Ser Cys Thr Lys Leu Pro Leu Glu 155 160 165 GAC GCT AAA TAT TTG GCC AGT TTG GAA AAA CCC TCC ATC AAA GCC CCA 580 Asp Ala Lys Tyr Leu Ala Ser Leu Glu Lys Pro Ser He Lys Ala Pro 170 175 180
TTA AAA AGC GTG TTT AAA GAC ACT TTC AAA AAC GAT GAA ATC ATC GCC 628 Leu Lys Ser Val Phe Lys Asp Thr Phe Lys Asn Asp Glu He He Ala 185 190 195
CAG CTA CCC TAT GAC CCC ATA TTG AAT TTA TTG TGC CAT ATT TTA CAA 676 Gin Leu Pro Tyr Asp Pro He Leu Asn Leu Leu Cys His He Leu Gin 200 205 210 215
GAT GAG GGG ATA GAA TTT GTT TTT ATG CAT GAA AGC CGT TCT TGT GAA 724 Asp Glu Gly He Glu Phe Val Phe Met His Glu Ser Arg Ser Cys Glu 220 225 230
GCG CTT TTG TAT TAT GAA GCG CTT TTT AAA ACC CCT AAA CGC TTG ATC 772 Ala Leu Leu Tyr Tyr Glu Ala Leu Phe Lys Thr Pro Lys Arg Leu He 235 240 245
ACA CCC ACT AAA AAA TTC GTG CTA GAA AAT AAT TTT TCT ACC TTT CCC 820 Thr Pro Thr Lys Lys Phe Val Leu Glu Asn Asn Phe Ser Thr Phe Pro 250 255 260
TTT AAA GAT GAA TTA GAG TTT TTA AGC GCA ACC CCC AAT TCT ATC GTT 868 Phe Lys Asp Glu Leu Glu Phe Leu Ser Ala Thr Pro Asn Ser He Val 265 270 ' 275
TTG TAT CTC AGT TTC AAG CGC CCT ACA AGG TTG TTA TTG CAT GCT AAT 916 Leu Tyr Leu Ser Phe Lys Arg Pro Thr Arg Leu Leu Leu His Ala Asn 280 285 290 295
GGT TCT TTA AAA ACG CTT TTA AGC GTC AGT TTT GAT TTT AAC AAA ATG 964 Gly Ser Leu Lys Thr Leu Leu Ser Val Ser Phe Asp Phe Asn Lys Met 300 305 310
TTT AAC GCG CTC AAA CAA GAT GAA AAA GCC TCC AGA ATG CTA CAA AAC 1012 Phe Asn Ala Leu Lys Gin Asp Glu Lys Ala Ser Arg Met Leu Gin Asn 315 320 325
TAC GCC ACT AAA TTC CCT GAT TTT TAC GCG CGC ATT GTA GAG CTT TCT 1060 Tyr Ala Thr Lys Phe Pro Asp Phe Tyr Ala Arg He Val Glu Leu Ser 330 335 340
AAA TAC GAT CTA GGG GGC GCG AAT TTA TTG GAT TTT TTT TGC ATT TTA 1108 Lys Tyr Asp Leu Gly Gly Ala Asn Leu Leu Asp Phe Phe Cys He Leu 345 350 355
GGG TTT GTT TTG GGC TAT AGC GAG GAT TTT TGC ACA CAG AGC GTT ATT 1156 Gly Phe Val Leu Gly Tyr Ser Glu Asp Phe Cys Thr Gin Ser Val He 360 365 370 375
CCT TTG GCT AAA GAA TGC TTA CGC CCT AAA GGC CCT AGG ATT GAT TAT 1204 Pro Leu Ala Lys Glu Cys Leu Arg Pro Lys Gly Pro Arg He Asp Tyr 380 385 390 AAA ATC CTT AAA GAC AAT TCT TTG AAA ATG GCT TTA AAC TTT TCA AAG 1252 Lys He Leu Lys Asp Asn Ser Leu Lys Met Ala Leu Asn Phe Ser Lys 395 400 405
ATC ATG CAC AGT GCG ATG AGT TTC AGG CTC GCA GGC GTG GAA AAT GAA 1300 He Met His Ser Ala Met Ser Phe Arg Leu Ala Gly Val Glu Asn Glu 410 415 420
ATT TTG AGT TTG GGG ATT TTG GAT TCT TTA GCG GAG TTT TTA GGG AAT 1348 He Leu Ser Leu Gly He Leu Asp Ser Leu Ala Glu Phe Leu Gly Asn 425 430 435
TTC ATT TGG GAT AAC GCG CAA AAT TTT AGC GTT CAA GAA GTA ACG ATC 1396 Phe He Trp Asp Asn Ala Gin Asn Phe Ser Val Gin Glu Val Thr He 440 445 450 455
GCT GGG GAT TTC TTT GGC GAA AAA GTG TTT TTG GAT TTG TTT GTG CGG 1444 Ala Gly Asp Phe Phe Gly Glu Lys Val Phe Leu Asp Leu Phe Val Arg 460 465 470
TAT TTC CCT AAA ACC CTA GCC CTT AAA ACG CAT GCA TTT TTG GAT TAT 1492 Tyr Phe Pro Lys Thr Leu Ala Leu Lys Thr His Ala Phe Leu Asp Tyr 475 480 485
GAA TAAGGGCTTA AAAGCGGATG TGCATCATCA GCCCGCCGTC CATGTATT 1543
Glu
(2) INFORMATION FOR SEQ ID NO: 776:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 488 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 776:
Met Ala Cys Asn Leu Gin Ala Arg Phe Tyr Ser Val Tyr Lys Asp Asn
1 5 10 15
Thr Thr Ser Phe Tyr Leu Gin Ala Ser Ala Glu Thr Thr Leu Glu Phe
20 25 30
Ala Gin Lys Leu Ser Glu He Leu Pro Phe Ser Leu Asp Phe Ser Phe
35 40 45
Leu Ser Leu Lys Glu He Thr Glu Pro Leu Asp Glu Asn Leu Phe Gin
50 55 60
Thr Ala Ser Leu Ser Lys Pro Leu Phe Met Asn Ala Lys Glu His Gin 65 70 75 80
Asp Phe Leu Asp Lys Asn Ser Ser Leu Tyr Ala Asp Thr Leu Gly Leu
85 90 95
He Lys Asn Thr Ala Phe Lys Gly Asp He He His Ser Pro Lys Glu 100 105 110 Leu He Asp Cys Leu Thr Gin Leu Lys Gly Met Leu Lys Thr Gin Asp
115 120 125
Phe He Pro He Phe Thr Ser Arg Glu Ala Leu Ser Leu Ser Leu Lys
130 135 140
Asn Pro Ser Pro Ser Val He Phe Ser Asp Leu Ser Ser Val Leu Ser 145 150 155 160
Cys Thr Lys Leu Pro Leu Glu Asp Ala Lys Tyr Leu Ala Ser Leu Glu
165 170 175
Lys Pro Ser He Lys Ala Pro Leu Lys Ser Val Phe Lys Asp Thr Phe
180 185 190
Lys Asn Asp Glu He He Ala Gin Leu Pro Tyr Asp Pro He Leu Asn
195 200 205
Leu Leu Cys His He Leu Gin Asp Glu Gly He Glu Phe Val Phe Met
210 215 220
His Glu Ser Arg Ser Cys Glu Ala Leu Leu Tyr Tyr Glu Ala Leu Phe 225 230 235 240
Lys Thr Pro Lys Arg Leu He Thr Pro Thr Lys Lys Phe Val Leu Glu
245 250 255
Asn Asn Phe Ser Thr Phe Pro Phe Lys Asp Glu Leu Glu Phe Leu Ser
260 265 270
Ala Thr Pro Asn Ser He Val Leu Tyr Leu Ser Phe Lys Arg Pro Thr
275 280 285
Arg Leu Leu Leu His Ala Asn Gly Ser Leu Lys Thr Leu Leu Ser Val
290 295 300
Ser Phe Asp Phe Asn Lys Met Phe Asn Ala Leu Lys Gin Asp Glu Lys 305 310 315 320
Ala Ser Arg Met Leu Gin Asn Tyr Ala Thr Lys Phe Pro Asp Phe Tyr
325 330 335
Ala Arg He Val Glu Leu Ser Lys Tyr Asp Leu Gly Gly Ala Asn Leu
340 345 350
Leu Asp Phe Phe Cys He Leu Gly Phe Val Leu Gly Tyr Ser Glu Asp
355 360 365
Phe Cys Thr Gin Ser Val He Pro Leu Ala Lys Glu Cys Leu Arg Pro
370 375 380
Lys Gly Pro Arg He Asp Tyr Lys He Leu Lys Asp Asn Ser Leu Lys 385 390 395 400
Met Ala Leu Asn Phe Ser Lys He Met His Ser Ala Met Ser Phe Arg
405 410 415
Leu Ala Gly Val Glu Asn Glu He Leu Ser Leu Gly He Leu Asp Ser
420 425 430
Leu Ala Glu Phe Leu Gly Asn Phe He Trp Asp Asn Ala Gin Asn Phe
435 440 445
Ser Val Gin Glu Val Thr He Ala Gly Asp Phe Phe Gly Glu Lys Val
450 455 460
Phe Leu Asp Leu Phe Val Arg Tyr Phe Pro Lys Thr Leu Ala Leu Lys 465 470 475 480
Thr His Ala Phe Leu Asp Tyr Glu 485
(2) INFORMATION FOR SEQ ID NO: 777:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 715 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 17...694 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 777:
TTTTTAAGGA ATTTTG ATG GAA CAA AAA ATT TGC GTG ATC GGT TTT AGC GGC 52
Met Glu Gin Lys He Cys Val He Gly Phe Ser Gly 1 5 10
GGG CAA GAC AGC ACC ACT TTA GCC GTG TGG GCG AAA AAG CGT TTT AAA 100 Gly Gin Asp Ser Thr Thr Leu Ala Val Trp Ala Lys Lys Arg Phe Lys 15 20 25
AAA GTC TGT TTA GTG GGG TTT GAT TAT GCG CAA AAA CAC TCT GTG GAA 148 Lys Val Cys Leu Val Gly Phe Asp Tyr Ala Gin Lys His Ser Val Glu 30 35 40
TTA GAA TGC GCT CAA AAA ATC GCT TCT CTT TTA CAA CTC CCT TAT GAA 196 Leu Glu Cys Ala Gin Lys He Ala Ser Leu Leu Gin Leu Pro Tyr Glu 45 50 55 60
ATC ATC CCA TTA GAT TTT TTA GAA AAT ATC ACC CGC TCT GCG CTT TTT 244 He He Pro Leu Asp Phe Leu Glu Asn He Thr Arg Ser Ala Leu Phe 65 70 75
AAA AAC TCT AAC GAT TTA ATA GGG CAT TCG CAT GCG CAA AAT AAA GAT 292 Lys Asn Ser Asn Asp Leu He Gly His Ser His Ala Gin Asn Lys Asp 80 85 90
TTA CCC AAT TCT TTT GTG CCT AAT CGT AAC GCT ATT TTT ATC ACC CTT 340 Leu Pro Asn Ser Phe Val Pro Asn Arg Asn Ala He Phe He Thr Leu 95 100 105
TTG CAT TCT TAC GCG CAA AAA CTA GGG GCT AGC AAT ATC GCT TTA GGA 388 Leu His Ser Tyr Ala Gin Lys Leu Gly Ala Ser Asn He Ala Leu Gly 110 115 120
GTT TCG CAA GCG GAT TTT AGC GGC TAT CCG GAT TGT AAA GAA GAT TTT 436 Val Ser Gin Ala Asp Phe Ser Gly Tyr Pro Asp Cys Lys Glu Asp Phe 125 130 135 140
ATT AAA AGC ATC GAG CAT GCC TTA AAT TTA GGA TCA AAC ACG GCG ATT 484 He Lys Ser He Glu His Ala Leu Asn Leu Gly Ser Asn Thr Ala He 145 150 155
AAA ATC CTA ACG CCT TTA ATG TTT TTG AAT AAA GCG CAA GAA TTT CAA 532 Lys He Leu Thr Pro Leu Met Phe Leu Asn Lys Ala Gin Glu Phe Gin 160 165 170
ATG GCT AAA GAT TTG GGC GTC TTG GAT TTA GTC ATC AAA GAA ACG CAC 580 Met Ala Lys Asp Leu Gly Val Leu Asp Leu Val He Lys Glu Thr His 175 180 185
ACC TGC TAT CAA GGA GAG CGA AAG ATT TTG CAT GCT TAT GGT TAT GGT 628 Thr Cys Tyr Gin Gly Glu Arg Lys He Leu His Ala Tyr Gly Tyr Gly 190 195 200
TGC GAT AAA TGC CCG GCA TGC CAA TTG AGA AAA AAA GGC TTT GAA GAG 676 Cys Asp Lys Cys Pro Ala Cys Gin Leu Arg Lys Lys Gly Phe Glu Glu 205 210 215 220
TTT CAA GCT AAT AAA AAA TAAGGTTTTT TAAAAAACCA A 715
Phe Gin Ala Asn Lys Lys 225
(2) INFORMATION FOR SEQ ID NO: 778:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 226 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 778:
Met Glu Gin Lys He Cys Val He Gly Phe Ser Gly Gly Gin Asp Ser
1 5 10 15
Thr Thr Leu Ala Val Trp Ala Lys Lys Arg Phe Lys Lys Val Cys Leu
20 25 30
Val Gly Phe Asp Tyr Ala Gin Lys His Ser Val Glu Leu Glu Cys Ala
35 40 45
Gin Lys He Ala Ser Leu Leu Gin Leu Pro Tyr Glu He He Pro Leu
50 55 60
Asp Phe Leu Glu Asn He Thr Arg Ser Ala Leu Phe Lys Asn Ser Asn 65 70 75 80
Asp Leu He Gly His Ser His Ala Gin Asn Lys Asp Leu Pro Asn Ser
85 90 95
Phe Val Pro Asn Arg Asn Ala He Phe He Thr Leu Leu His Ser Tyr
100 105 110
Ala Gin Lys Leu Gly Ala Ser Asn He Ala Leu Gly Val Ser Gin Ala
115 120 125
Asp Phe Ser Gly Tyr Pro Asp Cys Lys Glu Asp Phe He Lys Ser He
130 135 140
Glu His Ala Leu Asn Leu Gly Ser Asn Thr Ala He Lys He Leu Thr 145 150 155 160
Pro Leu Met Phe Leu Asn Lys Ala Gin Glu Phe Gin Met Ala Lys Asp
165 170 175
Leu Gly Val Leu Asp Leu Val He Lys Glu Thr His Thr Cys Tyr Gin 180 185 190 Gly Glu Arg Lys He Leu H s Ala Tyr Gly Tyr Gly Cys Asp Lys Cys
195 200 205
Pro Ala Cys Gin Leu Arg Lys Lys Gly Phe Glu Glu Phe Gin Ala Asn
210 215 220
Lys Lys 225
(2) INFORMATION FOR SEQ ID NO: 779:
(l) SEQUENCE CHARACTERISTICS.
(A) LENGTH: 1201 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...1155 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 779:
TCCTCTCATG AGCTTTACTT GGTAGGGGGG TGCGTGCGCG ATTATTTA ATG GGC ATT 57
Met Gly He 1
ACC CCA AAA GAT TAC GAT TTA ACC TCA AAC GCT TTA GTC AAT GAA AGC 105 Thr Pro Lys Asp Tyr Asp Leu Thr Ser Asn Ala Leu Val Asn Glu Ser 5 10 15
AAA GAG CTT CTT TTA AAG CGC CAT TTT AGG GTG CTA GAA ACC GGT ATC 153 Lys Glu Leu Leu Leu Lys Arg His Phe Arg Val Leu Glu Thr Gly He 20 25 30 35
AAA CAT GGT ACG ATC ACG GCT CTT AAA AAC CAT CAA AGC TAT GAA ATC 201 Lys His Gly Thr He Thr Ala Leu Lys Asn His Gin Ser Tyr Glu He 40 45 50
ACA ACT TTT AGA ATT GAA AAG GGG CAT ATC AAA CAC CGA AAG CCT AAA 249 Thr Thr Phe Arg He Glu Lys Gly His He Lys His Arg Lys Pro Lys 55 60 65
GAA TTG GTT TTT AGC GTT CAT TTA ACA GAC GAT TTA AAG CGG CGC GAT 297 Glu Leu Val Phe Ser Val His Leu Thr Asp Asp Leu Lys Arg Arg Asp 70 75 80
TTT AGC ATG AAT GCG ATC GCT TAT AGC CCT ACA AAA GGG CTG ATT GAT 345 Phe Ser Met Asn Ala He Ala Tyr Ser Pro Thr Lys Gly Leu He Asp 85 90 95
CCT TTT AAA GGG CAG AAT GCG ATT GAA AAT CAA ATG ATT GAA TGC GTG 393 Pro Phe Lys Gly Gin Asn Ala He Glu Asn Gin Met He Glu Cys Val 100 105 110 115
GGG GAA GCG CGA TTA AGG TTT TTT GAA GAC GCT TTA AGG ATT TTA AGA 441 Gly Glu Ala Arg Leu Arg Phe Phe Glu Asp Ala Leu Arg He Leu Arg 120 125 130
TCG CTG CGA TTC AGT GCA ACT TTA GGC TTT AAG ATA GCG CCA AAC ACC 489 Ser Leu Arg Phe Ser Ala Thr Leu Gly Phe Lys He Ala Pro Asn Thr 135 140 145
AAA GAA GCG GTT TTT GCG TGT AAG GAT TTG TTA AAA CAC CTT TCT AAA 537 Lys Glu Ala Val Phe Ala Cys Lys Asp Leu Leu Lys His Leu Ser Lys 150 155 160
GAA CGC TTA CAA AGT GAA TTG AAT AAG CTT CTT ATG GGG AAA AAC GCC 585 Glu Arg Leu Gin Ser Glu Leu Asn Lys Leu Leu Met Gly Lys Asn Ala 165 170 175
TAT GAA GTG GCT AAA GAA TAT CAA GAA ATT TTA GAG TTG GTT ATT CAA 633 Tyr Glu Val Ala Lys Glu Tyr Gin Glu He Leu Glu Leu Val He Gin 180 185 190 195
GAA AAA ATA GAA AAT TTA GGG TTT TTA AAA AAC GCG CCT TTC AAT CTG 681 Glu Lys He Glu Asn Leu Gly Phe Leu Lys Asn Ala Pro Phe Asn Leu 200 205 210
GAA TTA AGA TTG TTA GGG TTT TTT AAG CAT CAA AAA AGT TTA GAA AGT 729 Glu Leu Arg Leu Leu Gly Phe Phe Lys His Gin Lys Ser Leu Glu Ser 215 220 225
TTA CGC TAC CCT AAA AAA ACG ATC GTT TTA TTT TCC AAA GCT AAA GAA 777 Leu Arg Tyr Pro Lys Lys Thr He Val Leu Phe Ser Lys Ala Lys Glu 230 235 240
TGC CAT AAA TCT TTT TTA AAT ATT CAT AAC AAA ACA GAG TTA AAA TTT 825 Cys His Lys Ser Phe Leu Asn He His Asn Lys Thr Glu Leu Lys Phe 245 250 255
TTA TTG AAA AAC TAC GAT TTA GAG CCT TTT AAT TTG GCT TTA GAT TTT 873 Leu Leu Lys Asn Tyr Asp Leu Glu Pro Phe Asn Leu Ala Leu Asp Phe 260 265 270 275
TAT GCG CTC AAA AAC CCC AAA CAT GCT TTA AAA ATT AAA GGC TTG TTA 921 Tyr Ala Leu Lys Asn Pro Lys His Ala Leu Lys He Lys Gly Leu Leu 280 285 290
AAA GAA ATC TTT GAT TCT AAC GAG CCT TTT AAA AAA GAA CAC TTG GCC 969 Lys Glu He Phe Asp Ser Asn Glu Pro Phe Lys Lys Glu His Leu Ala 295 300 305
CTT AAG GGC GGT GCG CTT CAA AGC TTG GGT TAC CAG CAC CAA AAA ATC 1017 Leu Lys Gly Gly Ala Leu Gin Ser Leu Gly Tyr Gin His Gin Lys He 310 315 320 GGC GAA ATT TTA AAC GCA TGC TTA GAT TTA GTC ATC GCT AAC CCT AAA 1065 Gly Glu He Leu Asn Ala Cys Leu Asp Leu Val He Ala Asn Pro Lys 325 330 335
AAT AAC GCT TTA GAA TGG CTG ATT GAA TGG GTT AAG GGT CAT TAT TTA 1113 Asn Asn Ala Leu Glu Trp Leu He Glu Trp Val Lys Gly His Tyr Leu 340 345 350 355
CCT AAT GAT ACT ATA AAT CTT TCG CCA ATA GGC AGA AGA AAT TAAAAACAG 1164 Pro Asn Asp Thr He Asn Leu Ser Pro He Gly Arg Arg Asn 360 365
AGAAAACATG ATAACGATGA ATGCGATTCA ATGGCCT 1201
(2) INFORMATION FOR SEQ ID NO: 780:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 369 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 780:
Met Gly He Thr Pro Lys Asp Tyr Asp Leu Thr Ser Asn Ala Leu Val
1 5 10 15
Asn Glu Ser Lys Glu Leu Leu Leu Lys Arg His Phe Arg Val Leu Glu
20 25 30
Thr Gly He Lys His Gly Thr He Thr Ala Leu Lys Asn His Gin Ser
35 40 45
Tyr Glu He Thr Thr Phe Arg He Glu Lys Gly His He Lys His Arg
50 55 60
Lys Pro Lys Glu Leu Val Phe Ser Val His Leu Thr Asp Asp Leu Lys 65 70 75 80
Arg Arg Asp Phe Ser Met Asn Ala He Ala Tyr Ser Pro Thr Lys Gly
85 90 95
Leu He Asp Pro Phe Lys Gly Gin Asn Ala He Glu Asn Gin Met He
100 105 110
Glu Cys Val Gly Glu Ala Arg Leu Arg Phe Phe Glu Asp Ala Leu Arg
115 120 125
He Leu Arg Ser Leu Arg Phe Ser Ala Thr Leu Gly Phe Lys He Ala
130 135 140
Pro Asn Thr Lys Glu Ala Val Phe Ala Cys Lys Asp Leu Leu Lys His 145 150 155 160
Leu Ser Lys Glu Arg Leu Gin Ser Glu Leu Asn Lys Leu Leu Met Gly
165 170 175
Lys Asn Ala Tyr Glu Val Ala Lys Glu Tyr Gin Glu He Leu Glu Leu
180 185 190
Val He Gin Glu Lys He Glu Asn Leu Gly Phe Leu Lys Asn Ala Pro
195 200 205
Phe Asn Leu Glu Leu Arg Leu Leu Gly Phe Phe Lys His Gin Lys Ser
210 215 220
Leu Glu Ser Leu Arg Tyr Pro Lys Lys Thr He Val Leu Phe Ser Lys 225 230 235 240
Ala Lys Glu Cys His Lys Ser Phe Leu Asn He His Asn Lys Thr Glu
245 250 255
Leu Lys Phe Leu Leu Lys Asn Tyr Asp Leu Glu Pro Phe Asn Leu Ala
260 265 270
Leu Asp Phe Tyr Ala Leu Lys Asn Pro Lys His Ala Leu Lys He Lys
275 280 285
Gly Leu Leu Lys Glu He Phe Asp Ser Asn Glu Pro Phe Lys Lys Glu
290 295 300
His Leu Ala Leu Lys Gly Gly Ala Leu Gin Ser Leu Gly Tyr Gin His 305 310 315 320
Gin Lys He Gly Glu He Leu Asn Ala Cys Leu Asp Leu Val He Ala
325 330 335
Asn Pro Lys Asn Asn Ala Leu Glu Trp Leu He Glu Trp Val Lys Gly
340 345 350
His Tyr Leu Pro Asn Asp Thr He Asn Leu Ser Pro He Gly Arg Arg
355 360 365
Asn
(2) INFORMATION FOR SEQ ID NO: 781:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 360 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 50...340 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:781:
TTTTCCCCTA TATCCAAAGC CATCATCAAG AAGTTTTAAG GCTCAAAGC ATG ATT TTT 58
Met He Phe
1
TCC ACT CTT ATT AAT GCG ATA GCG GTG ATT TTA AGC TCG CTC ATT ACG 106 Ser Thr Leu He Asn Ala He Ala Val He Leu Ser Ser Leu He Thr 5 10 15
ATT TAT ATG TGG ATA GTA ATC ATT TAT TCG CTT ATC AGT TTC GTG CAG 154 He Tyr Met Trp He Val He He Tyr Ser Leu He Ser Phe Val Gin 20 25 30 35
CCT AAC CCC AAT AAC CCC ATC ATG CAA ATT CTC GCT CGC TTG TGT GAG 202 Pro Asn Pro Asn Asn Pro He Met Gin He Leu Ala Arg Leu Cys Glu 40 45 50 CCG GTG TTT TAT TTT TTA CGC TCT AGA TTC AAG CTG GTG TTT AAC GGG 250 Pro Val Phe Tyr Phe Leu Arg Ser Arg Phe Lys Leu Val Phe Asn Gly 55 60 65
TTG GAT TTC TCT CCT TTA GTG GTG GTC ATT GTT TTG AAA TTC TTG GAT 298 Leu Asp Phe Ser Pro Leu Val Val Val He Val Leu Lys Phe Leu Asp 70 75 80
CTC ACG CTC ATT CAG TGG CTT TTC ATG CTC GCT AAA AAC CTT TAAAGAAAA 349 Leu Thr Leu He Gin Trp Leu Phe Met Leu Ala Lys Asn Leu 85 90 95
TCATGCGTTT T 360
(2) INFORMATION FOR SEQ ID NO: 782:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 97 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 782:
Met He Phe Ser Thr Leu He Asn Ala He Ala Val He Leu Ser Ser
1 5 10 15
Leu He Thr He Tyr Met Trp He Val He He Tyr Ser Leu He Ser
20 25 30
Phe Val Gin Pro Asn Pro Asn Asn Pro He Met Gin He Leu Ala Arg
35 40 45
Leu Cys Glu Pro Val Phe Tyr Phe Leu Arg Ser Arg Phe Lys Leu Val
50 55 60
Phe Asn Gly Leu Asp Phe Ser Pro Leu Val Val Val He Val Leu Lys 65 70 75 80
Phe Leu Asp Leu Thr Leu He Gin Trp Leu Phe Met Leu Ala Lys Asn
85 90 95
Leu
(2) INFORMATION FOR SEQ ID NO: 783:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1740 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...1701 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:783:
TAAAAACCTT TAAAGAAAAT C ATG CGT TTT TTT ACC TTG TTT TTT ATC GGT 51
Met Arg Phe Phe Thr Leu Phe Phe He Gly 1 5 10
ATG CTT GGC GTT GGT TTT TCT CAA ACC GAG TTA AAT TTA AAA GAT TTA 99 Met Leu Gly Val Gly Phe Ser Gin Thr Glu Leu Asn Leu Lys Asp Leu 15 20 25
GAA AAA AAG CCC GCC GGG ATC GTT AGG GAT TAT TAT TTG TGG CGT TAT 147 Glu Lys Lys Pro Ala Gly He Val Arg Asp Tyr Tyr Leu Trp Arg Tyr 30 35 40
ATT AGC GAT AAA AAA ACC AGT TTA GAA AAC GCT AAA AAA GCC TAT GAA 195 He Ser Asp Lys Lys Thr Ser Leu Glu Asn Ala Lys Lys Ala Tyr Glu 45 50 55
TTG ACT CAA AAT AAA AAC AAC GCC CTA CAA AAG GCC ATG CAA GAA AAA 243 Leu Thr Gin Asn Lys Asn Asn Ala Leu Gin Lys Ala Met Gin Glu Lys 60 65 70
GGC TCA GAC AAT GCA GAA AAA AAC CCT GAT GTT AAA TTG CCT GAA GAT 291 Gly Ser Asp Asn Ala Glu Lys Asn Pro Asp Val Lys Leu Pro Glu Asp 75 80 85 90
ATT TAT TGC AAG CAA ACG GCT TTA GAA AGC ATG CTA GAA ACA ACA GAC 339 He Tyr Cys Lys Gin Thr Ala Leu Glu Ser Met Leu Glu Thr Thr Asp 95 100 105
ACT TTC CAA GCA AGC TGC ATC GCT ATC GCT TTA AAA TCA AAG ATC AGA 387 Thr Phe Gin Ala Ser Cys He Ala He Ala Leu Lys Ser Lys He Arg 110 115 120
GAT TTT GAT AAA ATC CCT ATT GAA ACC CTT AAG CCC TTA CAA ATT AAA 435 Asp Phe Asp Lys He Pro He Glu Thr Leu Lys Pro Leu Gin He Lys 125 130 135
ATC AAA GAG GCT TAC CCC GTT CTT TAT GAA GAA TTA GAA ATT TTG CAA 483 He Lys Glu Ala Tyr Pro Val Leu Tyr Glu Glu Leu Glu He Leu Gin 140 145 150
AGT AAG CAT GTG AGC GCT TCT TTG TTT AAG GCT AAC GCG CAA GTG TTT 531 Ser Lys His Val Ser Ala Ser Leu Phe Lys Ala Asn Ala Gin Val Phe 155 160 165 170
AGC GCG CTT TTC AAT CAT TTG AGT TAT GAA AAA AAG CTC CAA ATT TTT 579 Ser Ala Leu Phe Asn His Leu Ser Tyr Glu Lys Lys Leu Gin He Phe 175 180 185
GAA AAG CAT ATC CCC ATT AAA GAG TTA AAC CGT CTT TTA GAC GAA AAT 627 Glu Lys His He Pro He Lys Glu Leu Asn Arg Leu Leu Asp Glu Asn 190 195 200
TAT CCG GCG TTT AAC CGC TTG ATC TAT CAG GTT ATT TTA GAT CCT AAA 675 Tyr Pro Ala Phe Asn Arg Leu He Tyr Gin Val He Leu Asp Pro Lys 205 210 215
TTG GAT CAT TTT AAA GAC GCT CTC ACT AAA AGT AAC GCT ACC CAC AGC 723 Leu Asp His Phe Lys Asp Ala Leu Thr Lys Ser Asn Ala Thr His Ser 220 225 230
AAC GCG CAA ACC TTT TTT ATT CTA GGG ATT AAT GAA ATC TTG CGC AAA 771 Asn Ala Gin Thr Phe Phe He Leu Gly He Asn Glu He Leu Arg Lys 235 240 245 250
AAA CCC TCT AAA GCG CTC AAG TAT TTT GAA CGA TCA GAA GCG GTT GTC 819 Lys Pro Ser Lys Ala Leu Lys Tyr Phe Glu Arg Ser Glu Ala Val Val 255 260 265
AAA GAC GAT GAT TTT TCA AAA GAC AGA GCG ATT TTT TGG CAG TAT TTA 867 Lys Asp Asp Asp Phe Ser Lys Asp Arg Ala He Phe Trp Gin Tyr Leu 270 275 280
GTT TCT AAA AAG AAA AAA ACT TTA GAA CGC CTT TCA CAA AGC CCA GCT 915 Val Ser Lys Lys Lys Lys Thr Leu Glu Arg Leu Ser Gin Ser Pro Ala 285 290 295
TTA AAT CTC TAT AGT CTT TAT GCG AGC CGC AAA CTC AAA ACC ACG CCC 963 Leu Asn Leu Tyr Ser Leu Tyr Ala Ser Arg Lys Leu Lys Thr Thr Pro 300 305 310
AGT TAC CGC ATC ATT TCA CGC ATC CAG AAT TTA AGC CAA GAA GAT CCT 1011 Ser Tyr Arg He He Ser Arg He Gin Asn Leu Ser Gin Glu Asp Pro 315 320 325 330
CCT TTT AAC ACT TAT GAC CCT TTT TCG TGG CAA ATT TTT AAG GAA AAA 1059 Pro Phe Asn Thr Tyr Asp Pro Phe Ser Trp Gin He Phe Lys Glu Lys 335 340 345
ACC TTG AGT TTG AAA GAT GAG GGC GCG TTT AAT GCG ATG CTA AAA AGC 1107 Thr Leu Ser Leu Lys Asp Glu Gly Ala Phe Asn Ala Met Leu Lys Ser 350 355 360
CTG TAT TAT GAA AAA AGC GCT CCT GAA TTG ACC TAT CTT TTA AGC CAA 1155 Leu Tyr Tyr Glu Lys Ser Ala Pro Glu Leu Thr Tyr Leu Leu Ser Gin 365 370 375
CGC AAT AAA GAC AAG ATT TAT TAT TAT TTA TCC CCT TAT GAG GGC ATT 1203 Arg Asn Lys Asp Lys He Tyr Tyr Tyr Leu Ser Pro Tyr Glu Gly He 380 385 390
ATT GAA TGG CAA AAT ACT GAT GAA AAG GCT ATG GCG TAT GCG ATC GCT 1251 He Glu Trp Gin Asn Thr Asp Glu Lys Ala Met Ala Tyr Ala He Ala 395 400 405 410
AGG CAA GAA AGC TTT TTG CTC CCG GCA GTC ATT TCG CGC TCG TTC GCT 1299 Arg Gin Glu Ser Phe Leu Leu Pro Ala Val He Ser Arg Ser Phe Ala 415 420 425
CTG GGG CTT ATG CAA ATC ATG CCC TTT AAT GTA GGG CCT TTC GCT AAA 1347 Leu Gly Leu Met Gin He Met Pro Phe Asn Val Gly Pro Phe Ala Lys 430 435 440
AGC CTT GGC ATG GAT AAC ATT GAT CTA AAC GAC ATG TTT AAC CCC AAC 1395 Ser Leu Gly Met Asp Asn He Asp Leu Asn Asp Met Phe Asn Pro Asn 445 450 455
ATC GCT CTC AAA TTT GGC AAT TAT TAC TTG AAC CAT TTG AAA AAA GAA 1443 He Ala Leu Lys Phe Gly Asn Tyr Tyr Leu Asn His Leu Lys Lys Glu 460 465 470
TTC AAC CAC CCC CTT TTT GTC GCC TAC GCT TAT AAC GCT GGG CCT GGG 1491 Phe Asn His Pro Leu Phe Val Ala Tyr Ala Tyr Asn Ala Gly Pro Gly 475 480 485 490
TTT TTA AGG AGG TGG TTA GAA AGT TCC AAA CGA TTT AAA GAA AAA AAT 1539 Phe Leu Arg Arg Trp Leu Glu Ser Ser Lys Arg Phe Lys Glu Lys Asn 495 500 505
CAT TTT GAG CCA TGG CTT AGC ATG GAG CTT ATG CCT TAT AGC GAG ACT 1587 His Phe Glu Pro Trp Leu Ser Met Glu Leu Met Pro Tyr Ser Glu Thr 510 515 520
CGC ATG TAT GGC TTT AGG GTC ATG CTC AAT TAC TTG ATT TAT CAA GAA 1635 Arg Met Tyr Gly Phe Arg Val Met Leu Asn Tyr Leu He Tyr Gin Glu 525 530 535
ATT TTT GGG AAT TTC ATC CCT ATT GAT GGA TTT TTA GAA CAA ACT CTT 1683 He Phe Gly Asn Phe He Pro He Asp Gly Phe Leu Glu Gin Thr Leu 540 545 550
AAC TCA AAG GAC AAA CCA TGATTAAAAA ATGCCTTTTT CCTGCTGCGG GCTATGGC 1739 Asn Ser Lys Asp Lys Pro 555 560
A 1740
(2) INFORMATION FOR SEQ ID NO: 784:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 560 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 784:
Met Arg Phe Phe Thr Leu Phe Phe He Gly Met Leu Gly Val Gly Phe
1 5 10 15 Ser Gin Thr Glu Leu Asn Leu Lys Asp Leu Glu Lys Lys Pro Ala Gly
20 25 30
He Val Arg Asp Tyr Tyr Leu Trp Arg Tyr He Ser Asp Lys Lys Thr
35 40 45
Ser Leu Glu Asn Ala Lys Lys Ala Tyr Glu Leu Thr Gin Asn Lys Asn
50 55 60
Asn Ala Leu Gin Lys Ala Met Gin Glu Lys Gly Ser Asp Asn Ala Glu 65 70 75 80
Lys Asn Pro Asp Val Lys Leu Pro Glu Asp He Tyr Cys Lys Gin Thr
85 90 95
Ala Leu Glu Ser Met Leu Glu Thr Thr Asp Thr Phe Gin Ala Ser Cys
100 105 110
He Ala He Ala Leu Lys Ser Lys He Arg Asp Phe Asp Lys He Pro
115 120 125
He Glu Thr Leu Lys Pro Leu Gin He Lys He Lys Glu Ala Tyr Pro
130 135 140
Val Leu Tyr Glu Glu Leu Glu He Leu Gin Ser Lys His Val Ser Ala 145 150 155 160
Ser Leu Phe Lys Ala Asn Ala Gin Val Phe Ser Ala Leu Phe Asn His
165 170 175
Leu Ser Tyr Glu Lys Lys Leu Gin He Phe Glu Lys His He Pro He
180 185 190
Lys Glu Leu Asn Arg Leu Leu Asp Glu Asn Tyr Pro Ala Phe Asn Arg
195 200 205
Leu He Tyr Gin Val He Leu Asp Pro Lys Leu Asp His Phe Lys Asp
210 215 220
Ala Leu Thr Lys Ser Asn Ala Thr His Ser Asn Ala Gin Thr Phe Phe 225 230 235 240
He Leu Gly He Asn Glu He Leu Arg Lys Lys Pro Ser Lys Ala Leu
245 250 255
Lys Tyr Phe Glu Arg Ser Glu Ala Val Val Lys Asp Asp Asp Phe Ser
260 265 270
Lys Asp Arg Ala He Phe Trp Gin Tyr Leu Val Ser Lys Lys Lys Lys
275 280 285
Thr Leu Glu Arg Leu Ser Gin Ser Pro Ala Leu Asn Leu Tyr Ser Leu
290 295 300
Tyr Ala Ser Arg Lys Leu Lys Thr Thr Pro Ser Tyr Arg He He Ser 305 310 315 320
Arg He Gin Asn Leu Ser Gin Glu Asp Pro Pro Phe Asn Thr Tyr Asp
325 330 335
Pro Phe Ser Trp Gin He Phe Lys Glu Lys Thr Leu Ser Leu Lys Asp
340 345 350
Glu Gly Ala Phe Asn Ala Met Leu Lys Ser Leu Tyr Tyr Glu Lys Ser
355 360 365
Ala Pro Glu Leu Thr Tyr Leu Leu Ser Gin Arg Asn Lys Asp Lys He
370 375 380
Tyr Tyr Tyr Leu Ser Pro Tyr Glu Gly He He Glu Trp Gin Asn Thr 385 390 395 400
Asp Glu Lys Ala Met Ala Tyr Ala He Ala Arg Gin Glu Ser Phe Leu
405 410 415
Leu Pro Ala Val He Ser Arg Ser Phe Ala Leu Gly Leu Met Gin He
420 425 430
Met Pro Phe Asn Val Gly Pro Phe Ala Lys Ser Leu Gly Met Asp Asn
435 440 445
He Asp Leu Asn Asp Met Phe Asn Pro Asn He Ala Leu Lys Phe Gly 450 455 460
Asn Tyr Tyr Leu Asn His Leu Lys Lys Glu Phe Asn His Pro Leu Phe 465 470 475 480
Val Ala Tyr Ala Tyr Asn Ala Gly Pro Gly Phe Leu Arg Arg Trp Leu
485 490 495
Glu Ser Ser Lys Arg Phe Lys Glu Lys Asn His Phe Glu Pro Trp Leu
500 505 510
Ser Met Glu Leu Met Pro Tyr Ser Glu Thr Arg Met Tyr Gly Phe Arg
515 520 525
Val Met Leu Asn Tyr Leu He Tyr Gin Glu He Phe Gly Asn Phe He
530 535 540
Pro He Asp Gly Phe Leu Glu Gin Thr Leu Asn Ser Lys Asp Lys Pro 545 550 555 560
(2) INFORMATION FOR SEQ ID NO: 785:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 770 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...738 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 785:
TAAAAAGAAG GACAA ATG ATG CCA TTT GAA GCT GTA ATC GGG CTA GAA GTC 51 Met Met Pro Phe Glu Ala Val He Gly Leu Glu Val 1 5 10
CAT GTC CAA CTC AAC ACC AAA ACC AAA ATC TTT TGC TCT TGC TCT ACA 99 His Val Gin Leu Asn Thr Lys Thr Lys He Phe Cys Ser Cys Ser Thr 15 20 25
AGC TTT GGA GAA TCC CCT AAT TCT AAC ACC TGC CCT GTG TGT TTG GGT 147 Ser Phe Gly Glu Ser Pro Asn Ser Asn Thr Cys Pro Val Cys Leu Gly 30 35 40
TTA CCG GGA GCT TTG CCG GTA TTG AAT AAA GAA GTG GTT AAA AAA GCC 195 Leu Pro Gly Ala Leu Pro Val Leu Asn Lys Glu Val Val Lys Lys Ala 45 50 55 60
ATC CAA TTA GGC ACA GCC ATT GAA GCC AAT ATC AAC CAA TAT TCT ATT 243 He Gin Leu Gly Thr Ala He Glu Ala Asn He Asn Gin Tyr Ser He 65 70 75
TTT GCG AGG AAA AAT TAT TTT TAC CCT GAT TTG CCT AAG GCT TAT CAA 291 Phe Ala Arg Lys Asn Tyr Phe Tyr Pro Asp Leu Pro Lys Ala Tyr Gin 80 85 90
ATT TCG CAG TTT GAA GTC CCT ATT GTG AGC GAT GGG AAA TTA GAG ATT 339 He Ser Gin Phe Glu Val Pro He Val Ser Asp Gly Lys Leu Glu He 95 100 105
GAC ACT AAA GAG GGT GCA AAA ATC GTG CGT ATT GAA AGG GCC CAC ATG 387 Asp Thr Lys Glu Gly Ala Lys He Val Arg He Glu Arg Ala His Met 110 115 120
GAA GAA GAC GCC GGT AAA AAT ATC CAT GAG GGC AGT TAT TCT TTA GTG 435 Glu Glu Asp Ala Gly Lys Asn He His Glu Gly Ser Tyr Ser Leu Val 125 130 135 140
GAT TTG AAC CGC GCT TGC ACC CCT TTA TTA GAA ATT GTC AGT AAG CCG 483 Asp Leu Asn Arg Ala Cys Thr Pro Leu Leu Glu He Val Ser Lys Pro 145 150 155
GAC ATG CGA AAT AGT GAA GAA GCT ATA GCG TAT TTG AAA AAG CTC CAT 531 Asp Met Arg Asn Ser Glu Glu Ala He Ala Tyr Leu Lys Lys Leu His 160 165 170
GCT ATC GTG CGT TTT ATA GGG ATT TCT GAT GCG AAC ATG CAA GAG GGG 579 Ala He Val Arg Phe He Gly He Ser Asp Ala Asn Met Gin Glu Gly 175 180 185
AAT TTC AGG TGC GAT GCG AAC GTG TCC ATT AGA CCC AAA GGC GAT GAA 627 Asn Phe Arg Cys Asp Ala Asn Val Ser He Arg Pro Lys Gly Asp Glu 190 195 200
AAG CTT TAT ACG AGA GTA GAG ATT AAA AAT CTA AAT AGC TTT AGA TTC 675 Lys Leu Tyr Thr Arg Val Glu He Lys Asn Leu Asn Ser Phe Arg Phe 205 210 215 220
ATT GCT AAA GCG ATT GAA TAC GAG ATA GAG CGC CAA AGC GCG GAC GTG 723 He Ala Lys Ala He Glu Tyr Glu He Glu Arg Gin Ser Ala Asp Val 225 230 235
GGA GAA CGG GCG CTA TAATGAAGAG GTGGTTCAAG AAACGCGCCT TT 770
Gly Glu Arg Ala Leu 240
(2) INFORMATION FOR SEQ ID NO: 786:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 241 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 786: Met Met Pro Phe Glu Ala Val He Gly Leu Glu Val His Val Gin Leu
1 5 10 15
Asn Thr Lys Thr Lys He Phe Cys Ser Cys Ser Thr Ser Phe Gly Glu
20 25 30
Ser Pro Asn Ser Asn Thr Cys Pro Val Cys Leu Gly Leu Pro Gly Ala
35 40 45
Leu Pro Val Leu Asn Lys Glu Val Val Lys Lys Ala He Gin Leu Gly
50 55 60
Thr Ala He Glu Ala Asn He Asn Gin Tyr Ser He Phe Ala Arg Lys 65 70 75 80
Asn Tyr Phe Tyr Pro Asp Leu Pro Lys Ala Tyr Gin He Ser Gin Phe
85 90 95
Glu Val Pro He Val Ser Asp Gly Lys Leu Glu He Asp Thr Lys Glu
100 105 110
Gly Ala Lys He Val Arg He Glu Arg Ala His Met Glu Glu Asp Ala
115 120 125
Gly Lys Asn He His Glu Gly Ser Tyr Ser Leu Val Asp Leu Asn Arg
130 135 140
Ala Cys Thr Pro Leu Leu Glu He Val Ser Lys Pro Asp Met Arg Asn 145 150 155 160
Ser Glu Glu Ala He Ala Tyr Leu Lys Lys Leu His Ala He Val Arg
165 170 175
Phe He Gly He Ser Asp Ala Asn Met Gin Glu Gly Asn Phe Arg Cys
180 185 190
Asp Ala Asn Val Ser He Arg Pro Lys Gly Asp Glu Lys Leu Tyr Thr
195 200 205
Arg Val Glu He Lys Asn Leu Asn Ser Phe Arg Phe He Ala Lys Ala
210 215 220
He Glu Tyr Glu He Glu Arg Gin Ser Ala Asp Val Gly Glu Arg Ala 225 230 235 240
Leu
(2) INFORMATION FOR SEQ ID NO: 787:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 487 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...444 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 787:
ATGGGAGTGG ATTGA ATG CAA GAA ATT GAA ATT TTT TGC GAT GGC TCT TCT 51
Met Gin Glu He Glu He Phe Cys Asp Gly Ser Ser 1 5 10 TTA GGC AAT CCC GGG CCA GGC GGT TAT GCG GCG ATT TTA CGC TAT AAA 99 Leu Gly Asn Pro Gly Pro Gly Gly Tyr Ala Ala He Leu Arg Tyr Lys 15 20 25
GAT AAA GAA AAA ACC ATC AGT GGG GGC GAA GAA TTC ACC ACG AAT AAC 147 Asp Lys Glu Lys Thr He Ser Gly Gly Glu Glu Phe Thr Thr Asn Asn 30 35 40
CGC ATG GAA TTA AGA GCG CTC AAT GAA GCG TTA AAA ATT TTG AAA CGC 195 Arg Met Glu Leu Arg Ala Leu Asn Glu Ala Leu Lys He Leu Lys Arg 45 50 55 60
CCA TGC CGT ATC ACG CTT TAT AGC GAT TCG CAA TAC GTG TGC CAA GCG 243 Pro Cys Arg He Thr Leu Tyr Ser Asp Ser Gin Tyr Val Cys Gin Ala 65 70 75
ATC AAT GTG TGG CTA GCT AAC TGG CAA AAA AAG AAT TTT TCT AAA GTT 291 He Asn Val Trp Leu Ala Asn Trp Gin Lys Lys Asn Phe Ser Lys Val 80 85 90
AAA AAT GTG GAT TTA TGG AAA GAA TTT TTA GAA GTC TCT AAA GGG CAT 339 Lys Asn Val Asp Leu Trp Lys Glu Phe Leu Glu Val Ser Lys Gly His 95 100 105
TCT ATT GTG GCT GTT TGG ATC AAG GGG CAT AAC GGG CAT GCC GAG AAT 387 Ser He Val Ala Val Trp He Lys Gly His Asn Gly His Ala Glu Asn 110 115 120
GAA CGA TGC GAT AGC CTC GCT AAA TTA GAG GCG CAA AAA CGG GTC AAA 435 Glu Arg Cys Asp Ser Leu Ala Lys Leu Glu Ala Gin Lys Arg Val Lys 125 130 135 140
ACG ACC ACT TAAAGGGAAA AATGATGAAA AACAAACGCT CTCAAAACAG CCC 487
Thr Thr Thr
(2) INFORMATION FOR SEQ ID NO: 788:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 143 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 788:
Met Gin Glu He Glu He Phe Cys Asp Gly Ser Ser Leu Gly Asn Pro
1 5 10 15
Gly Pro Gly Gly Tyr Ala Ala He Leu Arg Tyr Lys Asp Lys Glu Lys
20 25 30
Thr He Ser Gly Gly Glu Glu Phe Thr Thr Asn Asn Arg Met Glu Leu 35 40 45 Arg Ala Leu Asn Glu Ala Leu Lys He Leu Lys Arg Pro Cys Arg He
50 55 60
Thr Leu Tyr Ser Asp Ser Gin Tyr Val Cys Gin Ala He Asn Val Trp 65 70 75 80
Leu Ala Asn Trp Gin Lys Lys Asn Phe Ser Lys Val Lys Asn Val Asp
85 90 95
Leu Trp Lys Glu Phe Leu Glu Val Ser Lys Gly His Ser He Val Ala
100 105 110
Val Trp He Lys Gly His Asn Gly His Ala Glu Asn Glu Arg Cys Asp
115 120 125
Ser Leu Ala Lys Leu Glu Ala Gin Lys Arg Val Lys Thr Thr Thr 130 135 140
(2) INFORMATION FOR SEQ ID NO: 789:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1217 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 48...1181 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 789:
ATGTATGCGA CCGCTAAAGG CAAAAGCAAA AAAGAAGCCG AACAGCA ATG CGC TTA 56
Met Arg Leu 1
TCA AGC GCT TCA AAA ACT GAA GGA AGC CAA ATG AAC ACT TTG GGG CGT 104 Ser Ser Ala Ser Lys Thr Glu Gly Ser Gin Met Asn Thr Leu Gly Arg 5 10 15
TTT TTA AGG CTC ACG ACT TTT GGG GAA TCG CAT GGG GAT GTG ATA GGG 152 Phe Leu Arg Leu Thr Thr Phe Gly Glu Ser His Gly Asp Val He Gly 20 25 30 35
GGG GTA TTA GAC GGC ATG CCT AGC GGG ATT AAA ATA GAC TAT GCG CTA 200 Gly Val Leu Asp Gly Met Pro Ser Gly He Lys He Asp Tyr Ala Leu 40 45 50
TTA GAA AAT GAA ATG AAG CGC CGC CAA GGG GGG AGG AAC GTT TTC ATT 248 Leu Glu Asn Glu Met Lys Arg Arg Gin Gly Gly Arg Asn Val Phe He 55 60 65
ACG CCA CGA AAA GAA GAC GAT AAA GTG GAA ATA ACA AGC GGG GTT TTT 296 Thr Pro Arg Lys Glu Asp Asp Lys Val Glu He Thr Ser Gly Val Phe 70 75 80 GAA GAT TTT AGC ACA GGG ACT CCT ATA GGG TTT TTA ATC CAC AAC CAA 344 Glu Asp Phe Ser Thr Gly Thr Pro He Gly Phe Leu He His Asn Gin 85 90 95
AGG GCT AGG AGC AAG GAT TAC GAT AAC ATT AAA AAC CTT TTT AGG CCT 392 Arg Ala Arg Ser Lys Asp Tyr Asp Asn He Lys Asn Leu Phe Arg Pro 100 105 110 115
AGC CAT GCG GAT TTC ACT TAT TTT CAT AAA TAC GGC ATT AGG GAT TTT 440 Ser His Ala Asp Phe Thr Tyr Phe His Lys Tyr Gly He Arg Asp Phe 120 125 130
AGG GGT GGG GGG AGG AGT TCG GCC AGA GAG AGT GCT ATA AGA GTG GCT 488 Arg Gly Gly Gly Arg Ser Ser Ala Arg Glu Ser Ala He Arg Val Ala 135 140 145
GCT GGG GCG TTT GCT AAA ATG CTT TTA AGA GAA ATC GGT ATT GTT TGT 536 Ala Gly Ala Phe Ala Lys Met Leu Leu Arg Glu He Gly He Val Cys 150 155 160
GAA AGC GGG ATT ATA GAA ATT GGG GGT ATT AAA GCC AAA AAT TAT GAT 584 Glu Ser Gly He He Glu He Gly Gly He Lys Ala Lys Asn Tyr Asp 165 170 175
TTT AAT CAC GCC TTA AAA AGC GAG ATT TTT GCC CTA GAT GAA GAA CAA 632 Phe Asn His Ala Leu Lys Ser Glu He Phe Ala Leu Asp Glu Glu Gin 180 185 190 195
GAA GAA GCG CAA AAA ACA GCC ATT CAA AAC GCT ATC AAA AAC CAC GAT 680 Glu Glu Ala Gin Lys Thr Ala He Gin Asn Ala He Lys Asn His Asp 200 205 210
AGC ATA GGG GGT GTG GCT TTG ATT AGA GCG AGG AGC ATA AAA ACC AAT 728 Ser He Gly Gly Val Ala Leu He Arg Ala Arg Ser He Lys Thr Asn 215 220 225
CAA AAG CTC CCC ATT GGC TTA GGT CAA GGG CTA TAC GCT AAA TTA GAC 776 Gin Lys Leu Pro He Gly Leu Gly Gin Gly Leu Tyr Ala Lys Leu Asp 230 235 240
GCT AAA ATC GCT GAA GCG ATG ATG GGG CTT AAT GGG GTG AAA GCG GTT 824 Ala Lys He Ala Glu Ala Met Met Gly Leu Asn Gly Val Lys Ala Val 245 250 255
GAA ATA GGC AAG GGG GTA GAA AGC TCT TTA TTA AAA GGC TCA GAG TAT 872 Glu He Gly Lys Gly Val Glu Ser Ser Leu Leu Lys Gly Ser Glu Tyr 260 265 270 275
AAT GAT TTA ATG GAT CAA AAG GGG TTT TTG AGC AAT CGT AGC GGA GGG 920 Asn Asp Leu Met Asp Gin Lys Gly Phe Leu Ser Asn Arg Ser Gly Gly 280 285 290
GTT TTA GGG GGC ATG AGC AAT GGG GAA GAA ATC ATT GTT AGA GTG CAT 968 Val Leu Gly Gly Met Ser Asn Gly Glu Glu He He Val Arg Val His 295 300 305 TTC AAA CCC ACG CCA AGC ATT TTC CAA CCT CAA CGA ACC ATA GAC ATT 1016 Phe Lys Pro Thr Pro Ser He Phe Gin Pro Gin Arg Thr He Asp He 310 315 320
AAT GGC AAT GAG TGC GAA TGC TTG TTA AAG GGC AGG CAT GAT CCT TGC 1064 Asn Gly Asn Glu Cys Glu Cys Leu Leu Lys Gly Arg His Asp Pro Cys 325 330 335
ATT GCG ATT AGA GGG AGC GTG GTG TGC GAG AGT TTG TTA GCG TTG GTG 1112 He Ala He Arg Gly Ser Val Val Cys Glu Ser Leu Leu Ala Leu Val 340 345 350 355
TTG GCT GAT ATG GTA TTA CTC AAT TTG ACT TCA AAA ATA GAG TAT TTA 1160 Leu Ala Asp Met Val Leu Leu Asn Leu Thr Ser Lys He Glu Tyr Leu 360 365 370
AAA ACG ATT TAT AAT GAG AAT TAAACGAAAT TGGATACAAT CAGCTTAAAA AGGA 1215 Lys Thr He Tyr Asn Glu Asn 375
TA 1217
(2) INFORMATION FOR SEQ ID NO: 790:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 378 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790:
Met Arg Leu Ser Ser Ala Ser Lys Thr Glu Gly Ser Gin Met Asn Thr
1 5 10 15
Leu Gly Arg Phe Leu Arg Leu Thr Thr Phe Gly Glu Ser His Gly Asp
20 25 30
Val He Gly Gly Val Leu Asp Gly Met Pro Ser Gly He Lys He Asp
35 40 45
Tyr Ala Leu Leu Glu Asn Glu Met Lys Arg Arg Gin Gly Gly Arg Asn
50 55 60
Val Phe He Thr Pro Arg Lys Glu Asp Asp Lys Val Glu He Thr Ser 65 70 75 80
Gly Val Phe Glu Asp Phe Ser Thr Gly Thr Pro He Gly Phe Leu He
85 90 95
His Asn Gin Arg Ala Arg Ser Lys Asp Tyr Asp Asn He Lys Asn Leu
100 105 110
Phe Arg Pro Ser His Ala Asp Phe Thr Tyr Phe His Lys Tyr Gly He
115 120 125
Arg Asp Phe Arg Gly Gly Gly Arg Ser Ser Ala Arg Glu Ser Ala He
130 135 140
Arg Val Ala Ala Gly Ala Phe Ala Lys Met Leu Leu Arg Glu He Gly 145 150 155 160
He Val Cys Glu Ser Gly He He Glu He Gly Gly He Lys Ala Lys 165 170 175
Asn Tyr Asp Phe Asn His Ala Leu Lys Ser Glu He Phe Ala Leu Asp
180 185 190
Glu Glu Gin Glu Glu Ala Gin Lys Thr Ala He Gin Asn Ala He Lys
195 200 205
Asn His Asp Ser He Gly Gly Val Ala Leu He Arg Ala Arg Ser He
210 215 220
Lys Thr Asn Gin Lys Leu Pro He Gly Leu Gly Gin Gly Leu Tyr Ala 225 230 235 240
Lys Leu Asp Ala Lys He Ala Glu Ala Met Met Gly Leu Asn Gly Val
245 250 255
Lys Ala Val Glu He Gly Lys Gly Val Glu Ser Ser Leu Leu Lys Gly
260 265 270
Ser Glu Tyr Asn Asp Leu Met Asp Gin Lys Gly Phe Leu Ser Asn Arg
275 280 285
Ser Gly Gly Val Leu Gly Gly Met Ser Asn Gly Glu Glu He He Val
290 295 300
Arg Val His Phe Lys Pro Thr Pro Ser He Phe Gin Pro Gin Arg Thr 305 310 315 320
He Asp He Asn Gly Asn Glu Cys Glu Cys Leu Leu Lys Gly Arg His
325 330 335
Asp Pro Cys He Ala He Arg Gly Ser Val Val Cys Glu Ser Leu Leu
340 345 350
Ala Leu Val Leu Ala Asp Met Val Leu Leu Asn Leu Thr Ser Lys He
355 360 365
Glu Tyr Leu Lys Thr He Tyr Asn Glu Asn 370 375
(2) INFORMATION FOR SEQ ID NO: 791:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 588 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 20...535 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 791:
TCATTGTGCC TGAAGCCAC ATG CGC TAC ATG CTC ATC AAC GAT TAT TAC AAG 52
Met Arg Tyr Met Leu He Asn Asp Tyr Tyr Lys 1 5 10
GTG TTT TTG GGC GAA AAA GAT AAG GAT TTG TAT GTG AAG CGC TTG GAA 100 Val Phe Leu Gly Glu Lys Asp Lys Asp Leu Tyr Val Lys Arg Leu Glu 15 20 25 AAA ATC ACG CCT AAA ATC TAT CTG GCG AGC GTG TTT TTA GAG AAA CAC 148 Lys He Thr Pro Lys He Tyr Leu Ala Ser Val Phe Leu Glu Lys His 30 35 40
ACC CCT TTA AAA AGT CTT TTA GAA AAA ATC CCT AAG GGA AAA AAA GAG 196 Thr Pro Leu Lys Ser Leu Leu Glu Lys He Pro Lys Gly Lys Lys Glu 45 50 55
ACT ATC ACC TAT CAT AAC CCT TGT CAT GCC AAA AAA ACC CTA AAC GCT 244 Thr He Thr Tyr His Asn Pro Cys His Ala Lys Lys Thr Leu Asn Ala 60 65 70 75
CAC AAA GAA GTG CGC AAC TTG CTC AAT TTG CAT TAT GAA ATT AAA GAA 292 His Lys Glu Val Arg Asn Leu Leu Asn Leu His Tyr Glu He Lys Glu 80 85 90
ATG CCG GAC AAT TGT TGC GGT TTT GGG GGG ATT ACG ATG CAA ACA CAA 340 Met Pro Asp Asn Cys Cys Gly Phe Gly Gly He Thr Met Gin Thr Gin 95 100 105
AAG GCG GGA TTT TCT TTA AAA GTG GGG CTT CTT AGG GCT AAA GAA ATC 388 Lys Ala Gly Phe Ser Leu Lys Val Gly Leu Leu Arg Ala Lys Glu He 110 115 120
ATA GAC ACC AAA GCT GCA ATT TTG AGC GCT GAA TGC GGG GCA TGC CAT 436 He Asp Thr Lys Ala Ala He Leu Ser Ala Glu Cys Gly Ala Cys His 125 130 135
ATG CAA TTA AAC AAC GCT TTA AAG TCT TTA GAC GAC CCT AAC ACT CCG 484 Met Gin Leu Asn Asn Ala Leu Lys Ser Leu Asp Asp Pro Asn Thr Pro 140 145 150 155
CCA TTT TTG CAC CCT TTA GAA CTC ATC GCT AAA GCC TTA AAA AGC GCT 532 Pro Phe Leu His Pro Leu Glu Leu He Ala Lys Ala Leu Lys Ser Ala 160 165 170
GAA TAAAAAGCCT TTTTAACCCC ATTCTCCAAC ATCTTTTTAT ATAATACAGA GCT 58Ϊ Glu
(2) INFORMATION FOR SEQ ID NO: 792:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 172 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 792:
Met Arg Tyr Met Leu He Asn Asp Tyr Tyr Lys Val Phe Leu Gly Glu 1 5 10 15 Lys Asp Lys Asp Leu Tyr Val Lys Arg Leu Glu Lys He Thr Pro Lys
20 25 30
He Tyr Leu Ala Ser Val Phe Leu Glu Lys His Thr Pro Leu Lys Ser
35 40 45
Leu Leu Glu Lys He Pro Lys Gly Lys Lys Glu Thr He Thr Tyr His
50 55 60
Asn Pro Cys His Ala Lys Lys Thr Leu Asn Ala His Lys Glu Val Arg 65 70 75 80
Asn Leu Leu Asn Leu His Tyr Glu He Lys Glu Met Pro Asp Asn Cys
85 90 95
Cys Gly Phe Gly Gly He Thr Met Gin Thr Gin Lys Ala Gly Phe Ser
100 105 110
Leu Lys Val Gly Leu Leu Arg Ala Lys Glu He He Asp Thr Lys Ala
115 120 125
Ala He Leu Ser Ala Glu Cys Gly Ala Cys His Met Gin Leu Asn Asn
130 135 140
Ala Leu Lys Ser Leu Asp Asp Pro Asn Thr Pro Pro Phe Leu His Pro 145 150 155 160
Leu Glu Leu He Ala Lys Ala Leu Lys Ser Ala Glu 165 170
(2) INFORMATION FOR SEQ ID NO:793:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 350 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 30...317 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:793:
GGCGTTAAAG CTCTGTATTA TATAAAAAG ATG TTG GAG AAT GGG GTT AAA AAG 53
Met Leu Glu Asn Gly Val Lys Lys 1 5
GCT TTT TAT TCA GCG CTT TTT AAG GCT TTA GCG ATG AGT TCT AAA GGG 101 Ala Phe Tyr Ser Ala Leu Phe Lys Ala Leu Ala Met Ser Ser Lys Gly 10 15 20
TGC AAA AAT GGC GGA GTG TTA GGG TCG TCT AAA GAC TTT AAA GCG TTG 149 Cys Lys Asn Gly Gly Val Leu Gly Ser Ser Lys Asp Phe Lys Ala Leu 25 30 35 40
TTT AAT TGC ATA TGG CAT GCC CCG CAT TCA GCG CTC AAA ATT GCA GCT 197 Phe Asn Cys He Trp His Ala Pro His Ser Ala Leu Lys He Ala Ala 45 50 55 TTG GTG TCT ATG ATT TCT TTA GCC CTA AGA AGC CCC ACT TTT AAA GAA 245 Leu Val Ser Met He Ser Leu Ala Leu Arg Ser Pro Thr Phe Lys Glu 60 65 70
AAT CCC GCC TTT TGT GTT TGC ATC GTA ATC CCC CCA AAA CCG CAA CAA 293 Asn Pro Ala Phe Cys Val Cys He Val He Pro Pro Lys Pro Gin Gin 75 80 85
TTG TCC GGC ATT TCT TTA ATT TCA TAATGCAAAT TGAGCAACCT TTTGCATTCT 347 Leu Ser Gly He Ser Leu He Ser 90 95
TAC 350
(2) INFORMATION FOR SEQ ID NO: 794:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 96 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 794:
Met Leu Glu Asn Gly Val Lys Lys Ala Phe Tyr Ser Ala Leu Phe Lys
1 5 10 15
Ala Leu Ala Met Ser Ser Lys Gly Cys Lys Asn Gly Gly Val Leu Gly
20 25 30
Ser Ser Lys Asp Phe Lys Ala Leu Phe Asn Cys He Trp His Ala Pro
35 40 45
His Ser Ala Leu Lys He Ala Ala Leu Val Ser Met He Ser Leu Ala
50 55 60
Leu Arg Ser Pro Thr Phe Lys Glu Asn Pro Ala Phe Cys Val Cys He 65 70 75 80
Val He Pro Pro Lys Pro Gin Gin Leu Ser Gly He Ser Leu He Ser 85 90 95
(2) INFORMATION FOR SEQ ID NO: 795:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1800 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 69...1718 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 795:
TCATCATCTC CACTTCTAAT TTAACCTCTA ACGCCCTTAA CGCAATTGAG CAAATCAGAA 60
GCACAGGA ATG GGG ATT GAC ATT GAT GAA ATC ACT GAA GAG GAT TTT ATC 110 Met Gly He Asp He Asp Glu He Thr Glu Glu Asp Phe He 1 5 10
TAT TCT CGC ATT GAT TGG GAA AAG TTT GAT CCC ACA AAA ACG CAA GAC 158 Tyr Ser Arg He Asp Trp Glu Lys Phe Asp Pro Thr Lys Thr Gin Asp 15 20 25 30
GAA ATC CCC TTA TGC GAT AAG AAA AAG CCG CGC TCG CAT CAA ACA GAA 206 Glu He Pro Leu Cys Asp Lys Lys Lys Pro Arg Ser His Gin Thr Glu 35 40 45
GCC ATA AAC GCC ACT AAA GAG TAT TTT TCT GAC CCT AAA AAC GCT AGA 254 Ala He Asn Ala Thr Lys Glu Tyr Phe Ser Asp Pro Lys Asn Ala Arg 50 55 60
GGC AAG CTC ATT ATG GCA TGC GGG ACA GGC AAA ACC TAC ACT TCT TTA 302 Gly Lys Leu He Met Ala Cys Gly Thr Gly Lys Thr Tyr Thr Ser Leu 65 70 75
AAA ATC ATG GAA GCT TTA GAC TCT AAG ATC ACG CTT TTT CTA GCA CCC 350 Lys He Met Glu Ala Leu Asp Ser Lys He Thr Leu Phe Leu Ala Pro 80 85 90
AGC ATC GCT TTG CTT TCT CAA ACT TTT AGA GAA TAC GCG CAA GAA AAA 398 Ser He Ala Leu Leu Ser Gin Thr Phe Arg Glu Tyr Ala Gin Glu Lys 95 100 105 110
AGT GAG CCG TTT TAC GCT TCT ATC GTG TGC AGC GAT GAT AAA GTC GGG 446 Ser Glu Pro Phe Tyr Ala Ser He Val Cys Ser Asp Asp Lys Val Gly 115 120 125
AAA AGT AAA GAC GAA GAC AAT GAT GAT ATT AAA TTT TCT GAG CTC CCT 494 Lys Ser Lys Asp Glu Asp Asn Asp Asp He Lys Phe Ser Glu Leu Pro 130 135 140
TTA AAG CCC TCC ACT CGC CTT GAA GAC ATT TTA AGC GTT CGA AAA AAA 542 Leu Lys Pro Ser Thr Arg Leu Glu Asp He Leu Ser Val Arg Lys Lys 145 150 155
GCG CAA AAA GAA AAC AAG CGC TTC ATT ATT TTT TCA ACC TAT CAA AGC 590 Ala Gin Lys Glu Asn Lys Arg Phe He He Phe Ser Thr Tyr Gin Ser 160 165 170
GCG TTG CGT ATT AAA GAA GCG CAA GAA GCG GGT TTG GGC GGA ATC GAT 638 Ala Leu Arg He Lys Glu Ala Gin Glu Ala Gly Leu Gly Gly He Asp 175 180 185 190
CTT ATT ATT TGC GAT GAA GCC CAC AGA ACG GTA GGG GCT ATG TAT TCT 686 Leu He He Cys Asp Glu Ala His Arg Thr Val Gly Ala Met Tyr Ser 195 200 205 AGT AAT GAA AGG GAC GAT AAA AAC GCT TTC ACG CTT TGC CAT AGC GAT 734 Ser Asn Glu Arg Asp Asp Lys Asn Ala Phe Thr Leu Cys His Ser Asp 210 215 220
AAA AAT ATC AAA GCG AAA AAA CGC CTG TAT ATG ACC GCC ACG CCT AAA 782 Lys Asn He Lys Ala Lys Lys Arg Leu Tyr Met Thr Ala Thr Pro Lys 225 230 235
GTT TAT AGC GAA AGC TCC AAA GCT AAA GCC AAA GAG AGC GAT AAT GTT 830 Val Tyr Ser Glu Ser Ser Lys Ala Lys Ala Lys Glu Ser Asp Asn Val 240 245 250
ATC TAT TCT ATG GAC GAT GCA GAG ATT TTT GGC GAA GAA ATC TAT ACG 878 He Tyr Ser Met Asp Asp Ala Glu He Phe Gly Glu Glu He Tyr Thr 255 260 265 270
CTC AAT TTT TCA AAA GCG ATC GCT TTG GAT CTC TTA ACC GAT TAT AAA 926 Leu Asn Phe Ser Lys Ala He Ala Leu Asp Leu Leu Thr Asp Tyr Lys 275 280 285
GTC ATC ATT TTA GCG GTG CGA AAA GAA AAT TTA AGC GGC GTT ACT AAC 974 Val He He Leu Ala Val Arg Lys Glu Asn Leu Ser Gly Val Thr Asn 290 295 300
AGC GTG AAT AAA AAG ATC AGC CAG CTC AAA GCC GAA GGC ACT AAA TTA 1022 Ser Val Asn Lys Lys He Ser Gin Leu Lys Ala Glu Gly Thr Lys Leu 305 310 315
GAT AAA AAG CTC ATC AAT AAC GAA TTT GTT TGT AAG ATC ATC GGC ACT 1070 Asp Lys Lys Leu He Asn Asn Glu Phe Val Cys Lys He He Gly Thr 320 325 330
CAT AAA GGG TTA GCC AAG CAG GAT TTA ATC GTT TTA AAC GAG AAA AAC 1118 His Lys Gly Leu Ala Lys Gin Asp Leu He Val Leu Asn Glu Lys Asn 335 340 345 350
AAA GAA GAT CAC AAC TTG CAA AAC CAA TAC GAC ACC GCT CCC TCT CAA 1166 Lys Glu Asp His Asn Leu Gin Asn Gin Tyr Asp Thr Ala Pro Ser Gin 355 360 365
AGA GCC ATA AAC TTT TGT AAA AGC ATT AAC ACG AGC AAG AAC ATT AAA 1214 Arg Ala He Asn Phe Cys Lys Ser He Asn Thr Ser Lys Asn He Lys 370 375 380
GAC TCC TTT GAA ACG ATT ATG GAA TGC TAT GAT GAA GAG TTG AAG AAA 1262 Asp Ser Phe Glu Thr He Met Glu Cys Tyr Asp Glu Glu Leu Lys Lys 385 390 395
AAG AGT TTT AAA AAC CTA AAA ATC AGC ATC GAT CAC ATT GAT GGC ACC 1310 Lys Ser Phe Lys Asn Leu Lys He Ser He Asp His He Asp Gly Thr 400 405 410
ATG AAT TGT AAG GAT AGG CTT GAA AAA TTA GAA GAG CTC AAT CAA TTT 1358 Met Asn Cys Lys Asp Arg Leu Glu Lys Leu Glu Glu Leu Asn Gin Phe 415 420 425 430 GAG CCC AAC ACT TGC AAG GTT TTA AGC AAC GCC AGG TGT TTG AGC GAA 1406 Glu Pro Asn Thr Cys Lys Val Leu Ser Asn Ala Arg Cys Leu Ser Glu 435 440 445
GGG GTG GAT GTC CCA GCG TTA GAT AGC ATC GTC TTT TTT GAT GGC AAA 1454 Gly Val Asp Val Pro Ala Leu Asp Ser He Val Phe Phe Asp Gly Lys 450 455 460
AGC GCT ATG GTG GAT ATT ATC CAA GCG GTG GGT AGG GTG ATG CGA AAA 1502 Ser Ala Met Val Asp He He Gin Ala Val Gly Arg Val Met Arg Lys 465 470 475
GCC AAA CGC AAG AAA AGA GGC TAT ATC ATT TTG CCT ATC GCT TTA GAA 1550 Ala Lys Arg Lys Lys Arg Gly Tyr He He Leu Pro He Ala Leu Glu 480 485 490
GAG AGT GAA ATC CAA AAC CTG GAT GAA GCC GTC AAT AAC ACC AAT TTC 1598 Glu Ser Glu He Gin Asn Leu Asp Glu Ala Val Asn Asn Thr Asn Phe 495 500 505 510
AAA AAC ATT TGG AAA GTG ATA AAA GCC TTA AGA AGC CAT GAC CCA AGC 1646 Lys Asn He Trp Lys Val He Lys Ala Leu Arg Ser His Asp Pro Ser 515 520 525
CTG GTT GAT GAA GCC ACT TTT AAA GAA AAA ATC AAA ATC TTT GGA AGC 1694 Leu Val Asp Glu Ala Thr Phe Lys Glu Lys He Lys He Phe Gly Ser 530 535 540
GAT GAT GGC AAC CAA TCA CAA CGA TGAAAAAACC CTTTTTGACG CTATCTTACT 1748 Asp Asp Gly Asn Gin Ser Gin Arg 545 550
GCAAGATCTA GCGGACGCTA TGTATAATGT CATGCCCACT AAATTAGGGG AC 1800
(2) INFORMATION FOR SEQ ID NO: 796:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 550 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 796:
Met Gly He Asp He Asp Glu He Thr Glu Glu Asp Phe He Tyr Ser
1 5 10 15
Arg He Asp Trp Glu Lys Phe Asp Pro Thr Lys Thr Gin Asp Glu He
20 25 30
Pro Leu Cys Asp Lys Lys Lys Pro Arg Ser His Gin Thr Glu Ala He
35 40 45
Asn Ala Thr Lys Glu Tyr Phe Ser Asp Pro Lys Asn Ala Arg Gly Lys
50 55 60
Leu He Met Ala Cys Gly Thr Gly Lys Thr Tyr Thr Ser Leu Lys He 65 70 75 80
Met Glu Ala Leu Asp Ser Lys He Thr Leu Phe Leu Ala Pro Ser He
85 90 95
Ala Leu Leu Ser Gin Thr Phe Arg Glu Tyr Ala Gin Glu Lys Ser Glu
100 105 110
Pro Phe Tyr Ala Ser He Val Cys Ser Asp Asp Lys Val Gly Lys Ser
115 120 125
Lys Asp Glu Asp Asn Asp Asp He Lys Phe Ser Glu Leu Pro Leu Lys
130 135 140
Pro Ser Thr Arg Leu Glu Asp He Leu Ser Val Arg Lys Lys Ala Gin 145 150 155 160
Lys Glu Asn Lys Arg Phe He He Phe Ser Thr Tyr Gin Ser Ala Leu
165 170 175
Arg He Lys Glu Ala Gin Glu Ala Gly Leu Gly Gly He Asp Leu He
180 185 190
He Cys Asp Glu Ala His Arg Thr Val Gly Ala Met Tyr Ser Ser Asn
195 200 205
Glu Arg Asp Asp Lys Asn Ala Phe Thr Leu Cys His Ser Asp Lys Asn
210 215 220
He Lys Ala Lys Lys Arg Leu Tyr Met Thr Ala Thr Pro Lys Val Tyr 225 230 235 240
Ser Glu Ser Ser Lys Ala Lys Ala Lys Glu Ser Asp Asn Val He Tyr
245 250 255
Ser Met Asp Asp Ala Glu He Phe Gly Glu Glu He Tyr Thr Leu Asn
260 265 270
Phe Ser Lys Ala He Ala Leu Asp Leu Leu Thr Asp Tyr Lys Val He
275 280 285
He Leu Ala Val Arg Lys Glu Asn Leu Ser Gly Val Thr Asn Ser Val
290 295 300
Asn Lys Lys He Ser Gin Leu Lys Ala Glu Gly Thr Lys Leu Asp Lys 305 310 315 320
Lys Leu He Asn Asn Glu Phe Val Cys Lys He He Gly Thr His Lys
325 330 335
Gly Leu Ala Lys Gin Asp Leu He Val Leu Asn Glu Lys Asn Lys Glu
340 345 350
Asp His Asn Leu Gin Asn Gin Tyr Asp Thr Ala Pro Ser Gin Arg Ala
355 360 365
He Asn Phe Cys Lys Ser He Asn Thr Ser Lys Asn He Lys Asp Ser
370 375 380
Phe Glu Thr He Met Glu Cys Tyr Asp Glu Glu Leu Lys Lys Lys Ser 385 390 395 400
Phe Lys Asn Leu Lys He Ser He Asp His He Asp Gly Thr Met Asn
405 410 415
Cys Lys Asp Arg Leu Glu Lys Leu Glu Glu Leu Asn Gin Phe Glu Pro
420 425 430
Asn Thr Cys Lys Val Leu Ser Asn Ala Arg Cys Leu Ser Glu Gly Val
435 440 445
Asp Val Pro Ala Leu Asp Ser He Val Phe Phe Asp Gly Lys Ser Ala
450 455 460
Met Val Asp He He Gin Ala Val Gly Arg Val Met Arg Lys Ala Lys 465 470 475 480
Arg Lys Lys Arg Gly Tyr He He Leu Pro He Ala Leu Glu Glu Ser
485 490 495
Glu He Gin Asn Leu Asp Glu Ala Val Asn Asn Thr Asn Phe Lys Asn 500 505 510 He Trp Lys Val He Lys Ala Leu Arg Ser His Asp Pro Ser Leu Val
515 520 525
Asp Glu Ala Thr Phe Lys Glu Lys' He Lys He Phe Gly Ser Asp Asp
530 535 540
Gly Asn Gin Ser Gin Arg 545 550
(2) INFORMATION FOR SEQ ID NO: 797:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2880 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...2814 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 797:
AAGCCTTAAG AAGCC ATG ACC CAA GCC TGG TTG ATG AAG CCA CTT TTA AAG 51 Met Thr Gin Ala Trp Leu Met Lys Pro Leu Leu Lys 1 5 10
AAA AAA TCA AAA TCT TTG GAA GCG ATG ATG GCA ACC AAT CAC AAC GAT 99 Lys Lys Ser Lys Ser Leu Glu Ala Met Met Ala Thr Asn His Asn Asp 15 20 25
GAA AAA ACC CTT TTT GAC GCT ATC TTA CTG CAA GAT CTA GCG GAC GCT 147 Glu Lys Thr Leu Phe Asp Ala He Leu Leu Gin Asp Leu Ala Asp Ala 30 35 40
ATG TAT AAT GTC ATG CCC ACT AAA TTA GGG GAC AGG AAT TAT TGG GAA 195 Met Tyr Asn Val Met Pro Thr Lys Leu Gly Asp Arg Asn Tyr Trp Glu 45 50 55 60
AAT TTC ACT AAA AAA ACG GGC AAC ATC GCA AGG ACC TTG AAC AAC CGC 243 Asn Phe Thr Lys Lys Thr Gly Asn He Ala Arg Thr Leu Asn Asn Arg 65 70 75
CTA AAA ATT ATT TTT GAC AAA AAC CCT GAA TTT TTC CAC GGC TTT TTG 291 Leu Lys He He Phe Asp Lys Asn Pro Glu Phe Phe His Gly Phe Leu 80 85 90
GAT TCC TTA AGG GAA AAT ATC CAT CAA AAC ATT AAA GAA GAT GAA GCC 339 Asp Ser Leu Arg Glu Asn He His Gin Asn He Lys Glu Asp Glu Ala 95 100 105
TTA GAC ATG ATC ACT TCT CAC ATC ATC ACT AAG CCC ATT TTT GAT GCA 387 Leu Asp Met He Thr Ser His He He Thr Lys Pro He Phe Asp Ala 110 115 120
CTT TTT GGG GAC AAC ATC AAA AAC CCT ATC GCT AAA GCC TTG GAT AAA 435 Leu Phe Gly Asp Asn He Lys Asn Pro He Ala Lys Ala Leu Asp Lys 125 130 135 140
ATG GTA GAA AAA CTC TCC ACT TTA GGA TTA GAA GGA GAA ACT AAA GAT 483 Met Val Glu Lys Leu Ser Thr Leu Gly Leu Glu Gly Glu Thr Lys Asp 145 150 155
CTG AAA AAC CTC TAT GAA AGC GTG AAA ACC GAA GCC TTG CAC GCC AAA 531 Leu Lys Asn Leu Tyr Glu Ser Val Lys Thr Glu Ala Leu His Ala Lys 160 165 170
AGC CAA AAA AGC CAA CAA GAA CTC ATT AAA AAC CTC TAC AAC ACT TTC 579 Ser Gin Lys Ser Gin Gin Glu Leu He Lys Asn Leu Tyr Asn Thr Phe 175 180 185
TTT AAA GAA GCC TTT AAA AAG CAA AGC GAA AAA CTA GGG ATC GTT TAT 627 Phe Lys Glu Ala Phe Lys Lys Gin Ser Glu Lys Leu Gly He Val Tyr 190 195 200
ACG CCC ATA GAG GTG GTG GAT TTC ATT TTA AGA GCC ACT AAC GGC ATT 675 Thr Pro He Glu Val Val Asp Phe He Leu Arg Ala Thr Asn Gly He 205 210 215 220
TTG AAA AAG CAT TTC AAC ACG GAT TTT AAC GAT CAA AGC ATC ACG ATT 723 Leu Lys Lys His Phe Asn Thr Asp Phe Asn Asp Gin Ser He Thr He 225 230 235
TTT GAC CCA TTC ACC GGC ACC GGG AGT TTT ATC GCT CGT TTG CTT TCT 771 Phe Asp Pro Phe Thr Gly Thr Gly Ser Phe He Ala Arg Leu Leu Ser 240 245 250
AAA GAA AAC GCG CTC ATT AGC GAT GAA GCC TTA AAA GAG AAG TTT CAA 819 Lys Glu Asn Ala Leu He Ser Asp Glu Ala Leu Lys Glu Lys Phe Gin 255 260 265
AAA AAT TTG TTC GCT TTT GAC ATC GTG CTT TTG TCT TAT TAT ATC GCT 867 Lys Asn Leu Phe Ala Phe Asp He Val Leu Leu Ser Tyr Tyr He Ala 270 275 280
TTA ATC AAT ATC ACC CAA GCC GCG CAA AAT AGG GAT GGC TCG TTA AAC 915 Leu He Asn He Thr Gin Ala Ala Gin Asn Arg Asp Gly Ser Leu Asn 285 290 295 300
AAT TTC AAA AAC ATC GCG CTC ACG GAC AGC CTG GAT TAT TTA GAA GAA 963 Asn Phe Lys Asn He Ala Leu Thr Asp Ser Leu Asp Tyr Leu Glu Glu 305 310 315
AAA ACC AAT AAA GGG GTG CTC CCT TTA TAT GAG GAT TTG AAA GAA AAC 1011 Lys Thr Asn Lys Gly Val Leu Pro Leu Tyr Glu Asp Leu Lys Glu Asn 320 325 330 AAA GGC ATC AAA GAC ACT CTA GCC AAC CAA AAT ATT AGA GTC ATC ATC 1059 Lys Gly He Lys Asp Thr Leu Ala Asn Gin Asn He Arg Val He He 335 340 345
GGC AAC CCG CCT TAT TCA GCC GGC GCA AAG AGC CAA AAC GAT AAC AAC 1107 Gly Asn Pro Pro Tyr Ser Ala Gly Ala Lys Ser Gin Asn Asp Asn Asn 350 355 360
CAA AAC CTC TCA CAC CCA AAG CTT GAA AAA TTA GTT TAT GAA AAA TAC 1155 Gin Asn Leu Ser His Pro Lys Leu Glu Lys Leu Val Tyr Glu Lys Tyr 365 370 375 380
GGA AAA AAT TCC ACA TCT AGA AGT GTG GGA AAA ACC ACA CGA GAC ACG 1203 Gly Lys Asn Ser Thr Ser Arg Ser Val Gly Lys Thr Thr Arg Asp Thr 385 390 395
CTC ATT CAA AGC ATC CGC ATG GCG AGC GAT GTT GTT AAA GAT AGG GGG 1251 Leu He Gin Ser He Arg Met Ala Ser Asp Val Val Lys Asp Arg Gly 400 405 410
GTG ATA GGC TTT GTG GTG AAC GGG GGT TTT ATT GAC TCT AAA AGC GCG 1299 Val He Gly Phe Val Val Asn Gly Gly Phe He Asp Ser Lys Ser Ala 415 420 425
GAT GGG TTC AGA AAA TGC GTG GCC AAA GAA TTT TCG CAT CTT TAT GTA 1347 Asp Gly Phe Arg Lys Cys Val Ala Lys Glu Phe Ser His Leu Tyr Val 430 435 440
TTG AAT TTG AGA GGC AAT CAG CGC ACT TCT GGG GAA GTG TCA AAA AAA 1395 Leu Asn Leu Arg Gly Asn Gin Arg Thr Ser Gly Glu Val Ser Lys Lys 445 450 455 460
GAG GGA GGG AAA ATC TTT GAT AGC GGA TCG AGG GCG ACG GTA GCG ATT 1443 Glu Gly Gly Lys He Phe Asp Ser Gly Ser Arg Ala Thr Val Ala He 465 470 475
ATC TTT TTT GTG AAA GAT AAG AGC ACT CCT GAT AAT ACG ATT TTT TAT 1491 He Phe Phe Val Lys Asp Lys Ser Thr Pro Asp Asn Thr He Phe Tyr 480 485 490
TAT GAA GTG GAA GAT TAC TTG AAA AGA GAA GCC AAA CTC AAC TGG CTC 1539 Tyr Glu Val Glu Asp Tyr Leu Lys Arg Glu Ala Lys Leu Asn Trp Leu 495 500 505
GCC AAT TTT GAA AAT TTG GAT TTT GTG CCT TTT GAG AAA ATC ACC CCG 1587 Ala Asn Phe Glu Asn Leu Asp Phe Val Pro Phe Glu Lys He Thr Pro 510 515 520
AAT GAT AAA GGC GAT TGG ATC AAC CAA AGG AAT GAC GCT TTT GAA AAA 1635 Asn Asp Lys Gly Asp Trp He Asn Gin Arg Asn Asp Ala Phe Glu Lys 525 530 535 540
CTC ATC CCT TTA AAA AGA GAC AAA ACA CTC CAA AAC GAC AGC GTT TTT 1683 Leu He Pro Leu Lys Arg Asp Lys Thr Leu Gin Asn Asp Ser Val Phe 545 550 555 GAC ATC AAT TCT CTT GGC GTG GTG AGC GGT CGT GAT CCT TGG GTG TAT 1731 Asp He Asn Ser Leu Gly Val Val Ser Gly Arg Asp Pro Trp Val Tyr 560 565 570
AAC TTT TCT CCA AAC ATT TTA ACC CAA TCG GTG CAA AAA TGC ATT GAC 1779 Asn Phe Ser Pro Asn He Leu Thr Gin Ser Val Gin Lys Cys He Asp 575 580 585
ACT TAT AAC GCT GAT TTG AAG CGC TTC AAT GCG CGT TTT AGG GAA GCT 1827 Thr Tyr Asn Ala Asp Leu Lys Arg Phe Asn Ala Arg Phe Arg Glu Ala 590 595 600
TTC AAA CAA CGC GCT CAA AGC GTC AAA GCA GGC GAT CTT TAC AAA CAA 1875 Phe Lys Gin Arg Ala Gin Ser Val Lys Ala Gly Asp Leu Tyr Lys Gin 605 610 615 620
CTT AAT GAT AAA GAA ATC ACC ACC GAT AAA ACG AAA ATC GCT TGG ACT 1923 Leu Asn Asp Lys Glu He Thr Thr Asp Lys Thr Lys He Ala Trp Thr 625 630 635
GAT GGT TTG AAA AAC AAA CTC ATT AAA AAT AAA TCT GCA AGA GAA AGC 1971 Asp Gly Leu Lys Asn Lys Leu He Lys Asn Lys Ser Ala Arg Glu Ser 640 645 650
AGT GAG GAG CGT GTA AGG TTG GCC TTG TAT CGC CCT TTT AAC AAA CAA 2019 Ser Glu Glu Arg Val Arg Leu Ala Leu Tyr Arg Pro Phe Asn Lys Gin 655 660 665
TGG CTT TAT TGG GAT AAG GAT TGG ATA AAC AGG CAA AGA GAA TTT TCA 2067 Trp Leu Tyr Trp Asp Lys Asp Trp He Asn Arg Gin Arg Glu Phe Ser 670 675 680
AAA ATT TTC CCG GAT AAA GAC GCT CAG AAT GTG GTG ATT AAT ACC GGT 2115 Lys He Phe Pro Asp Lys Asp Ala Gin Asn Val Val He Asn Thr Gly 685 690 695 700
GTG GGA AAT GGT AAA GAT TTT AGC GCT TTG GTA AGC GAT TTT ATT TCT 2163 Val Gly Asn Gly Lys Asp Phe Ser Ala Leu Val Ser Asp Phe He Ser 705 710 715
GAT TAT AGT TTG ATC TCA CCC AAT CAA GCT TAC CCC TTG TAT TAT TAC 2211 Asp Tyr Ser Leu He Ser Pro Asn Gin Ala Tyr Pro Leu Tyr Tyr Tyr 720 725 730
GAT GAT TTG GGG AAT CGC CAT TAC GCC ATC AGC GGC TAT TGC TTA AAC 2259 Asp Asp Leu Gly Asn Arg His Tyr Ala He Ser Gly Tyr Cys Leu Asn 735 740 745
CTC TTC AGG AGG CAT TAT GGG GAT AAT CTG ATC GCT GAA GAA GAG ATT 2307 Leu Phe Arg Arg His Tyr Gly Asp Asn Leu He Ala Glu Glu Glu He 750 755 760
TTT TAT TAC ATT TAT GCG ATT TTC CAC CAT AAA GGC TAT TTA GAA AAA 2355 Phe Tyr Tyr He Tyr Ala He Phe His His Lys Gly Tyr Leu Glu Lys 765 770 775 780 TAC AAA AAT TCC CTC GCC AAA GAA GCG CCG CGC ATC GCT TTG AGC GAA 2403 Tyr Lys Asn Ser Leu Ala Lys Glu Ala Pro Arg He Ala Leu Ser Glu 785 790 795
GAT TTT AAA GAA CTC TCT GTG CTT GGC AAA GAA TTG GCC GAA TTG CAC 2451 Asp Phe Lys Glu Leu Ser Val Leu Gly Lys Glu Leu Ala Glu Leu His 800 805 810
CTG AAC TAT GAG AGT GGG GAA ATG CAT GAT AAT ATT AAA TAC ACC ACA 2499 Leu Asn Tyr Glu Ser Gly Glu Met His Asp Asn He Lys Tyr Thr Thr 815 820 825
CTG ATG AAC GCC GAA ATA GAG GGT TAT TAT GAT GTG GAT AAA ATG ACC 2547 Leu Met Asn Ala Glu He Glu Gly Tyr Tyr Asp Val Asp Lys Met Thr 830 835 840
AAA AAA GGG GAT TGC ATC ATC TAT AAC CAA AAC ATC GCT ATC ACT AAG 2595 Lys Lys Gly Asp Cys He He Tyr Asn Gin Asn He Ala He Thr Lys 845 850 855 860
ATC CCT AAA AAA GCC TTT GAC TAT GTC ATT AAT GGC AAG AGC GCG ATT 2643 He Pro Lys Lys Ala Phe Asp Tyr Val He Asn Gly Lys Ser Ala He 865 870 875
GAC TGG GTG ATC GAA CGC TAT CAA AAA ACT ATG GAT AAA GAA AGC CTG 2691 Asp Trp Val He Glu Arg Tyr Gin Lys Thr Met Asp Lys Glu Ser Leu 880 885 890
ATT GAA AAC AAC CCG AAC GAT TAC GCC GGC GGA AAA TAC GTT TTT GAA 2739 He Glu Asn Asn Pro Asn Asp Tyr Ala Gly Gly Lys Tyr Val Phe Glu 895 900 905
CTC CTT TGT AGG GTC ATC ACA CTT TCG GTA AAA AGC GTG GAT TTG ATA 2787 Leu Leu Cys Arg Val He Thr Leu Ser Val Lys Ser Val Asp Leu He 910 915 920
GAA AAG ATC AGC GAA AAG AGG TTT GAG TGATTACATC GCTTGGGGGT GTGGAAT 2841 Glu Lys He Ser Glu Lys Arg Phe Glu 925 930
ATTTTGAAAG GCAATGTCTT GCTTTCTTAA AAAATCCAC 2880
(2) INFORMATION FOR SEQ ID NO: 798:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 933 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 798: Met Thr Gin Ala Trp Leu Met Lys Pro Leu Leu Lys Lys Lys Ser Lys 1 5 10 15
Ser Leu Glu Ala Met Met Ala Thr Asn His Asn Asp Glu Lys Thr Leu
20 25 30
Phe Asp Ala He Leu Leu Gin Asp Leu Ala Asp Ala Met Tyr Asn Val
35 40 45
Met Pro Thr Lys Leu Gly Asp Arg Asn Tyr Trp Glu Asn Phe Thr Lys
50 55 60
Lys Thr Gly Asn He Ala Arg Thr Leu Asn Asn Arg Leu Lys He He 65 70 75 80
Phe Asp Lys Asn Pro Glu Phe Phe His Gly Phe Leu Asp Ser Leu Arg
85 90 95
Glu Asn He His Gin Asn He Lys Glu Asp Glu Ala Leu Asp Met He
100 105 110
Thr Ser His He He Thr Lys Pro He Phe Asp Ala Leu Phe Gly Asp
115 120 125
Asn He Lys Asn Pro He Ala Lys Ala Leu Asp Lys Met Val Glu Lys
130 135 140
Leu Ser Thr Leu Gly Leu Glu Gly Glu Thr Lys Asp Leu Lys Asn Leu 145 150 155 160
Tyr Glu Ser Val Lys Thr Glu Ala Leu His Ala Lys Ser Gin Lys Ser
165 170 175
Gin Gin Glu Leu He Lys Asn Leu Tyr Asn Thr Phe Phe Lys Glu Ala
180 185 190
Phe Lys Lys Gin Ser Glu Lys Leu Gly He Val Tyr Thr Pro He Glu
195 200 205
Val Val Asp Phe He Leu Arg Ala Thr Asn Gly He Leu Lys Lys His
210 215 220
Phe Asn Thr Asp Phe Asn Asp Gin Ser He Thr He Phe Asp Pro Phe 225 230 235 240
Thr Gly Thr Gly Ser Phe He Ala Arg Leu Leu Ser Lys Glu Asn Ala
245 250 255
Leu He Ser Asp Glu Ala Leu Lys Glu Lys Phe Gin Lys Asn Leu Phe
260 265 270
Ala Phe Asp He Val Leu Leu Ser Tyr Tyr He Ala Leu He Asn He
275 280 285
Thr Gin Ala Ala Gin Asn Arg Asp Gly Ser Leu Asn Asn Phe Lys Asn
290 295 300
He Ala Leu Thr Asp Ser Leu Asp Tyr Leu Glu Glu Lys Thr Asn Lys 305 310 315 320
Gly Val Leu Pro Leu Tyr Glu Asp Leu Lys Glu Asn Lys Gly He Lys
325 330 335
Asp Thr Leu Ala Asn Gin Asn He Arg Val He He Gly Asn Pro Pro
340 345 350
Tyr Ser Ala Gly Ala Lys Ser Gin Asn Asp Asn Asn Gin Asn Leu Ser
355 360 365
His Pro Lys Leu Glu Lys Leu Val Tyr Glu Lys Tyr Gly Lys Asn Ser
370 375 380
Thr Ser Arg Ser Val Gly Lys Thr Thr Arg Asp Thr Leu He Gin Ser 385 390 395 400
He Arg Met Ala Ser Asp Val Val Lys Asp Arg Gly Val He Gly Phe
405 410 415
Val Val Asn Gly Gly Phe He Asp Ser Lys Ser Ala Asp Gly Phe Arg
420 425 430
Lys Cys Val Ala Lys Glu Phe Ser His Leu Tyr Val Leu Asn Leu Arg 435 440 445 Gly Asn Gin Arg Thr Ser Gly Glu Val Ser Lys Lys Glu Gly Gly Lys
450 455 460
He Phe Asp Ser Gly Ser Arg Ala Thr Val Ala He He Phe Phe Val 465 470 475 480
Lys Asp Lys Ser Thr Pro Asp Asn Thr He Phe Tyr Tyr Glu Val Glu
485 490 495
Asp Tyr Leu Lys Arg Glu Ala Lys Leu Asn Trp Leu Ala Asn Phe Glu
500 505 510
Asn Leu Asp Phe Val Pro Phe Glu Lys He Thr Pro Asn Asp Lys Gly
515 520 525
Asp Trp He Asn Gin Arg Asn Asp Ala Phe Glu Lys Leu He Pro Leu
530 535 540
Lys Arg Asp Lys Thr Leu Gin Asn Asp Ser Val Phe Asp He Asn Ser 545 550 555 560
Leu Gly Val Val Ser Gly Arg Asp Pro Trp Val Tyr Asn Phe Ser Pro
565 570 575
Asn He Leu Thr Gin Ser Val Gin Lys Cys He Asp Thr Tyr Asn Ala
580 585 590
Asp Leu Lys Arg Phe Asn Ala Arg Phe Arg Glu Ala Phe Lys Gin Arg
595 600 605
Ala Gin Ser Val Lys Ala Gly Asp Leu Tyr Lys Gin Leu Asn Asp Lys
610 615 620
Glu He Thr Thr Asp Lys Thr Lys He Ala Trp Thr Asp Gly Leu Lys 625 630 635 640
Asn Lys Leu He Lys Asn Lys Ser Ala Arg Glu Ser Ser Glu Glu Arg
645 650 655
Val Arg Leu Ala Leu Tyr Arg Pro Phe Asn Lys Gin Trp Leu Tyr Trp
660 665 670
Asp Lys Asp Trp He Asn Arg Gin Arg Glu Phe Ser Lys He Phe Pro
675 680 685
Asp Lys Asp Ala Gin Asn Val Val He Asn Thr Gly Val Gly Asn Gly
690 695 700
Lys Asp Phe Ser Ala Leu Val Ser Asp Phe He Ser Asp Tyr Ser Leu 705 710 715 720
He Ser Pro Asn Gin Ala Tyr Pro Leu Tyr Tyr Tyr Asp Asp Leu Gly
725 730 735
Asn Arg His Tyr Ala He Ser Gly Tyr Cys Leu Asn Leu Phe Arg Arg
740 745 750
His Tyr Gly Asp Asn Leu He Ala Glu Glu Glu He Phe Tyr Tyr He
755 760 765
Tyr Ala He Phe His His Lys Gly Tyr Leu Glu Lys Tyr Lys Asn Ser
770 775 780
Leu Ala Lys Glu Ala Pro Arg He Ala Leu Ser Glu Asp Phe Lys Glu 785 790 795 800
Leu Ser Val Leu Gly Lys Glu Leu Ala Glu Leu His Leu Asn Tyr Glu
805 810 815
Ser Gly Glu Met His Asp Asn He Lys Tyr Thr Thr Leu Met Asn Ala
820 825 830
Glu He Glu Gly Tyr Tyr Asp Val Asp Lys Met Thr Lys Lys Gly Asp
835 840 845
Cys He He Tyr Asn Gin Asn He Ala He Thr Lys He Pro Lys Lys
850 855 860
Ala Phe Asp Tyr Val He Asn Gly Lys Ser Ala He Asp Trp Val He 865 870 875 880
Glu Arg Tyr Gin Lys Thr Met Asp Lys Glu Ser Leu He Glu Asn Asn 885 890 895
Pro Asn Asp Tyr Ala Gly Gly Lys Tyr Val Phe Glu Leu Leu Cys Arg
900 905 910
Val He Thr Leu Ser Val Lys Ser Val Asp Leu He Glu Lys He Ser
915 920 925
Glu Lys Arg Phe Glu 930
(2) INFORMATION FOR SEQ ID NO: 799:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1440 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 24...1370 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 799:
AAGATCAGCG AAAAGAGGTT TGA GTG ATT ACA TCG CTT GGG GGT GTG GAA TAT 53
Val He Thr Ser Leu Gly Gly Val Glu Tyr 1 5 10
TTT GAA AGG CAA TGT CTT GCT TTC TTA AAA AAT CCA CAA ACT AAT CCA 101 Phe Glu Arg Gin Cys Leu Ala Phe Leu Lys Asn Pro Gin Thr Asn Pro 15 20 25
CAA AAT GAG CAA TAC ATT CCA GGA GTG TTT TCG TAT CAA GAA AAC AAA 149 Gin Asn Glu Gin Tyr He Pro Gly Val Phe Ser Tyr Gin Glu Asn Lys 30 35 40
ATT TCT TTT TCT TTT TTG GTT TTA GGA GAA ATT GAA GAG ATC CAC TCT 197 He Ser Phe Ser Phe Leu Val Leu Gly Glu He Glu Glu He His Ser 45 50 55
TTG CAA TAC CAA ACG CTC TAT ATT GTG GAT AAC AAA AAA AGA TAC ACT 245 Leu Gin Tyr Gin Thr Leu Tyr He Val Asp Asn Lys Lys Arg Tyr Thr 60 65 70
CTT TAC AAG CTT TAT GAT CGC ATT ATT TTG GGT CAT ACT TTA GGG TAT 293 Leu Tyr Lys Leu Tyr Asp Arg He He Leu Gly His Thr Leu Gly Tyr 75 80 85 90
TCT GCA CCA ATC ACG CTC TAT TAT GAA TGG CTG TTT GAT GAT TGG ATC 341 Ser Ala Pro He Thr Leu Tyr Tyr Glu Trp Leu Phe Asp Asp Trp He 95 100 105 GAT CCA GAA AAA ATT ATG GGC GAT CGT TTT GTT TGT AGG ACA AAT TAT 389 Asp Pro Glu Lys He Met Gly Asp Arg Phe Val Cys Arg Thr Asn Tyr 110 115 120
TTA GAA AGT TTT TTT ACG ACC AAG AAG CAT TTG CTA CCT GAT ACA TTA 437 Leu Glu Ser Phe Phe Thr Thr Lys Lys His Leu Leu Pro Asp Thr Leu 125 130 135
TTT AAA GTA GAT GAA AGT GGG TGT GAA AGT TAT CAT GAG AAT AAC GAT 485 Phe Lys Val Asp Glu Ser Gly Cys Glu Ser Tyr His Glu Asn Asn Asp 140 145 150
AAG GAC TTT ATC CTA CAA TCA TTT TAT ATT CAA AAT GAT TTT TTA TCC 533 Lys Asp Phe He Leu Gin Ser Phe Tyr He Gin Asn Asp Phe Leu Ser 155 160 165 170
CAA AGA TAT GAA AAA GAC AAG ATA AAA GCA AAA TCT AAT TTG ATT CCT 581 Gin Arg Tyr Glu Lys Asp Lys He Lys Ala Lys Ser Asn Leu He Pro 175 180 185
AAA AGA CAG AAT CGT TTA TTA ACT TAT CAA TTT GAT TTG TCT TTG GAA 629 Lys Arg Gin Asn Arg Leu Leu Thr Tyr Gin Phe Asp Leu Ser Leu Glu 190 195 200
TGC AAT ATA ATT TTT GAA ACC CTT GAA AAA TTA GCA CTT ATT GCT GGA 677 Cys Asn He He Phe Glu Thr Leu Glu Lys Leu Ala Leu He Ala Gly 205 210 215
GCG ATT AAA AAC TTT TTT ATT TTG ATT TAT GCT CAT TCT AAT TTT GAC 725 Ala He Lys Asn Phe Phe He Leu He Tyr Ala His Ser Asn Phe Asp 220 225 230
ATC CAA ATT GAC TAT ATC CAA TTC AAG CTT TCT AAT AAA GAC ATT ACA 773 He Gin He Asp Tyr He Gin Phe Lys Leu Ser Asn Lys Asp He Thr 235 240 245 250
GCA ATA AGA AAC ACT TAC AAA AAA GAT AAA AAG TCT ATG GAG ATA GAT 821 Ala He Arg Asn Thr Tyr Lys Lys Asp Lys Lys Ser Met Glu He Asp 255 260 265
CTT TAT GGG ATT GCT ATA AAT TTC CAA CGG ATA GAC AAT TTT TCT GTA 869 Leu Tyr Gly He Ala He Asn Phe Gin Arg He Asp Asn Phe Ser Val 270 275 280
ATA CTT GAA AAA TGG ATT GTT TTT TAT ATC AAA GAC AAT AGA GAT TTC 917 He Leu Glu Lys Trp He Val Phe Tyr He Lys Asp Asn Arg Asp Phe 285 290 295
CAA CTT GCA AGT ATT TTA GAC ATT ATT AAT AAA AAA GAT CCA ATT ATT 965 Gin Leu Ala Ser He Leu Asp He He Asn Lys Lys Asp Pro He He 300 305 310
CAC TTG TAT TTG GAC ATG TTT GTA TTG ATT AGC ATG ATT GAA AGT TTT 1013 His Leu Tyr Leu Asp Met Phe Val Leu He Ser Met He Glu Ser Phe 315 320 325 330 TTA AAG AAA CCA CAA CAA ACA AAA CTC CAT GAA AAA CTC TCT GAA TTT 1061 Leu Lys Lys Pro Gin Gin Thr Lys Leu His Glu Lys Leu Ser Glu Phe 335 340 345
TTT AAA ATT TCA TTA TCT AGG ACA AAA TGC GAT CAA ACG AAA AAT TAT 1109 Phe Lys He Ser Leu Ser Arg Thr Lys Cys Asp Gin Thr Lys Asn Tyr 350 355 360
TTT AAT GAT AAA TGT CAA GAA GAT CTA ATC CAA CAG ATT GTT GAC TGC 1157 Phe Asn Asp Lys Cys Gin Glu Asp Leu He Gin Gin He Val Asp Cys 365 370 375
CGT AAC TCT CTA GCG CAC GGA AGA AGT TTA AAG CTT GAT ACA AAC AAA 1205 Arg Asn Ser Leu Ala His Gly Arg Ser Leu Lys Leu Asp Thr Asn Lys 380 385 390
GCT ACA GAC ATT AGC CAT GCT TTT ATA GAT TTC AAG CAA ATT GTC ATT 1253 Ala Thr Asp He Ser His Ala Phe He Asp Phe Lys Gin He Val He 395 400 405 410
GAA TTT TTC TTT GGC GAG ATA GGA TTG AGC GAT TTT ATT ACA AAC AAT 1301 Glu Phe Phe Phe Gly Glu He Gly Leu Ser Asp Phe He Thr Asn Asn 415 420 425
TTT GGT TTT CTT AAC AAA GTT AAA TTA AGA AAC CCC CCA AAA ACA GAA 1349 Phe Gly Phe Leu Asn Lys Val Lys Leu Arg Asn Pro Pro Lys Thr Glu 430 435 440
AAA ATC ACC GAG CCA AAC CGC TAAAACCCCT TAGAAAATTT AAAATTTTAA GTTT 1404 Lys He Thr Glu Pro Asn Arg 445
TAGGGGTGTT TTTCTTAAGA ATTTAGGTTT TTTATA 1440
(2) INFORMATION FOR SEQ ID NO: 800:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 449 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 800:
Val He Thr Ser Leu Gly Gly Val Glu Tyr Phe Glu Arg Gin Cys Leu
1 5 10 15
Ala Phe Leu Lys Asn Pro Gin Thr Asn Pro Gin Asn Glu Gin Tyr He
20 25 30
Pro Gly Val Phe Ser Tyr Gin Glu Asn Lys He Ser Phe Ser Phe Leu
35 40 45
Val Leu Gly Glu He Glu Glu He His Ser Leu Gin Tyr Gin Thr Leu
50 55 60
Tyr He Val Asp Asn Lys Lys Arg Tyr Thr Leu Tyr Lys Leu Tyr Asp 65 70 75 80
Arg He He Leu Gly His Thr Leu Gly Tyr Ser Ala Pro He Thr Leu
85 90 95
Tyr Tyr Glu Trp Leu Phe Asp Asp Trp He Asp Pro Glu Lys He Met
100 105 110
Gly Asp Arg Phe Val Cys Arg Thr Asn Tyr Leu Glu Ser Phe Phe Thr
115 120 125
Thr Lys Lys His Leu Leu Pro Asp Thr Leu Phe Lys Val Asp Glu Ser
130 135 140
Gly Cys Glu Ser Tyr His Glu Asn Asn Asp Lys Asp Phe He Leu Gin 145 150 155 160
Ser Phe Tyr He Gin Asn Asp Phe Leu Ser Gin Arg Tyr Glu Lys Asp
165 170 175
Lys He Lys Ala Lys Ser Asn Leu He Pro Lys Arg Gin Asn Arg Leu
180 185 190
Leu Thr Tyr Gin Phe Asp Leu Ser Leu Glu Cys Asn He He Phe Glu
195 200 205
Thr Leu Glu Lys Leu Ala Leu He Ala Gly Ala He Lys Asn Phe Phe
210 215 220
He Leu He Tyr Ala His Ser Asn Phe Asp He Gin He Asp Tyr He 225 230 235 240
Gin Phe Lys Leu Ser Asn Lys Asp He Thr Ala He Arg Asn Thr Tyr
245 250 255
Lys Lys Asp Lys Lys Ser Met Glu He Asp Leu Tyr Gly He Ala He
260 265 270
Asn Phe Gin Arg He Asp Asn Phe Ser Val He Leu Glu Lys Trp He
275 280 285
Val Phe Tyr He Lys Asp Asn Arg Asp Phe Gin Leu Ala Ser He Leu
290 295 300
Asp He He Asn Lys Lys Asp Pro He He His Leu Tyr Leu Asp Met 305 310 315 320
Phe Val Leu He Ser Met He Glu Ser Phe Leu Lys Lys Pro Gin Gin
325 330 335
Thr Lys Leu His Glu Lys Leu Ser Glu Phe Phe Lys He Ser Leu Ser
340 345 350
Arg Thr Lys Cys Asp Gin Thr Lys Asn Tyr Phe Asn Asp Lys Cys Gin
355 360 365
Glu Asp Leu He Gin Gin He Val Asp Cys Arg Asn Ser Leu Ala His
370 375 380
Gly Arg Ser Leu Lys Leu Asp Thr Asn Lys Ala Thr Asp He Ser His 385 390 395 400
Ala Phe He Asp Phe Lys Gin He Val He Glu Phe Phe Phe Gly Glu
405 410 415
He Gly Leu Ser Asp Phe He Thr Asn Asn Phe Gly Phe Leu Asn Lys
420 425 430
Val Lys Leu Arg Asn Pro Pro Lys Thr Glu Lys He Thr Glu Pro Asn
435 440 445
Arg
(2) INFORMATION FOR SEQ ID NO: 801:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1345 base pairs
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25...1302 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 801:
CTAAATGAAT TTAAAAGGAA AAAT ATG GCA GTA AGA TTT GGG ATT ATC TTT 51
Met Ala Val Arg Phe Gly He He Phe 1 5
ATA TCT GAC TCT ATT GAT GAT TAT AAA GCC AAA CAA TTA AGA TCA ATT 99 He Ser Asp Ser He Asp Asp Tyr Lys Ala Lys Gin Leu Arg Ser He 10 15 20 25
TTA GAA CGC AAG AAA GAG TGT AAT TTT ATA TGG TTT AAT GAA TCA AGT 147 Leu Glu Arg Lys Lys Glu Cys Asn Phe He Trp Phe Asn Glu Ser Ser 30 35 40
GCT ATA ATT CAC AAT ACT CCT AAA GTT TTT GAA GGA GAG AGT TTT TTT 195 Ala He He His Asn Thr Pro Lys Val Phe Glu Gly Glu Ser Phe Phe 45 50 55
GAT CAT CTT TTC GTT AGT GCA AAA ATT ACT GCT TTT GTG GTA TCC ACA 243 Asp His Leu Phe Val Ser Ala Lys He Thr Ala Phe Val Val Ser Thr 60 65 70
AAC GAA TCA GAT ACA ATA TTC AAT TTA AAA AAC TAC TTG CTA GTA TTA 291 Asn Glu Ser Asp Thr He Phe Asn Leu Lys Asn Tyr Leu Leu Val Leu 75 80 85
GCC AAA AAT CTC AAT AAT AGA GAT ATT TGG TAT TGT GAA AAC ACT ATT 339 Ala Lys Asn Leu Asn Asn Arg Asp He Trp Tyr Cys Glu Asn Thr He 90 95 100 105
TGC GAT AAA AAA GGC ACT TAT AAT ATA GAA ATA GAA TTA GTG AGC AAT 387 Cys Asp Lys Lys Gly Thr Tyr Asn He Glu He Glu Leu Val Ser Asn 110 115 120
GCT AAT GAT TTT AGA GGA GTG TTT GGA GAA GTG TTA GGT ATA GTC AAA 435 Ala Asn Asp Phe Arg Gly Val Phe Gly Glu Val Leu Gly He Val Lys 125 130 135
GAC ACT TTC GGT GAT TTA CTG CAA CTT CTT ACA AAT TTA AAG AAC AAG 483 Asp Thr Phe Gly Asp Leu Leu Gin Leu Leu Thr Asn Leu Lys Asn Lys 140 145 150
GAA ATT GAA TTT AAT TTT CAT AAA AAA ATT AAT TAC GGA TTG CCT TTT 531 Glu He Glu Phe Asn Phe His Lys Lys He Asn Tyr Gly Leu Pro Phe 155 160 165
GGG ATT ATC TTT ATC GCT AGC AAC TCT GAC AAC CCT ATT GAT ATT GAC 579 Gly He He Phe He Ala Ser Asn Ser Asp Asn Pro He Asp He Asp 170 175 180 185
AAT AAA ACC AAA AAG TTA AAA TCA TGC TTT CGT GAT GAT GAG AGT AAC 627 Asn Lys Thr Lys Lys Leu Lys Ser Cys Phe Arg Asp Asp Glu Ser Asn 190 195 200
TGT TTT ATT GAC TGC CCA ATT ACA ATT GAG GAT TAT TTA ATT TTA GAT 675 Cys Phe He Asp Cys Pro He Thr He Glu Asp Tyr Leu He Leu Asp 205 210 215
AAT CTA AAA AGC TGT TTT GTA ATC CAA AAT AAG CCA AAT GTA ACA TTA 723 Asn Leu Lys Ser Cys Phe Val He Gin Asn Lys Pro Asn Val Thr Leu 220 225 230
TTT GAT AAC GAC GAG AAC GAT AGA CCA TTC AAT TTA AAG CGA TAC TTG 771 Phe Asp Asn Asp Glu Asn Asp Arg Pro Phe Asn Leu Lys Arg Tyr Leu 235 240 245
TTA GGA TTG AAA GAA AAG TTA GGG TTT GAG CCA ACG GGT ATT TTC TAT 819 Leu Gly Leu Lys Glu Lys Leu Gly Phe Glu Pro Thr Gly He Phe Tyr 250 255 260 265
TGC GAA AAC GCA AAC ACA CAC AAA ATT GAA TTG ATT GGT AAT GAT TCT 867 Cys Glu Asn Ala Asn Thr His Lys He Glu Leu He Gly Asn Asp Ser 270 275 280
GAT TTC AGA GAG GTA TTA CTT GAA TTT TCA GAG AAT ATA CCA AAA GCC 915 Asp Phe Arg Glu Val Leu Leu Glu Phe Ser Glu Asn He Pro Lys Ala 285 290 295
CCT AAT GAA CTA CCA CAA TTT CTT ACA AAC TTT AAA AAT TCA AAA ATC 963 Pro Asn Glu Leu Pro Gin Phe Leu Thr Asn Phe Lys Asn Ser Lys He 300 305 310
CCC AAT GGA AAC ATT TCA TTT TCG CCA CCA AAA AAT TCT CCA TCA ATT 1011 Pro Asn Gly Asn He Ser Phe Ser Pro Pro Lys Asn Ser Pro Ser He 315 320 325
TCT TCA TAT GCT TTA TCT GAT AAG ATT AAA AGA GAA GTA AGA GAT ACC 1059 Ser Ser Tyr Ala Leu Ser Asp Lys He Lys Arg Glu Val Arg Asp Thr 330 335 340 345
TTT GAT CGC TAT TTG TGG CAT GGT TAT TCT AAA ATT CCA CAG GAG AAA 1107 Phe Asp Arg Tyr Leu Trp His Gly Tyr Ser Lys He Pro Gin Glu Lys 350 355 360
AGG ATA GCC AAA ATA AAA GAG CAA GTG AAG GAA GAA ATT AAA CTA AAT 1155 Arg He Ala Lys He Lys Glu Gin Val Lys Glu Glu He Lys Leu Asn 365 370 375 CCT TCT TTT CGT AAT TAT AGA GTA GAC TCT GAA CAA AAC CGC AAG ATC 1203 Pro Ser Phe Arg Asn Tyr Arg Val Asp Ser Glu Gin Asn Arg Lys He 380 385 390
AAT GAA ATT GCT GAG GGT TTA AAA AGT GGT AAG ATA ATT GGT AAA AAG 1251 Asn Glu He Ala Glu Gly Leu Lys Ser Gly Lys He He Gly Lys Lys 395 400 405
GTT ATT GCT AAT GCG TTC GAT CTA AAT GCT AGC TTA TTG TTT TAT TAC 1299 Val He Ala Asn Ala Phe Asp Leu Asn Ala Ser Leu Leu Phe Tyr Tyr 410 415 420 425
TCC TGATGATTTA AAGAATTTAA AGGAACGATT ATTTATAGAT ATT 1345
Ser
(2) INFORMATION FOR SEQ ID NO: 802:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 426 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 802:
Met Ala Val Arg Phe Gly He He Phe He Ser Asp Ser He Asp Asp
1 5 10 15
Tyr Lys Ala Lys Gin Leu Arg Ser He Leu Glu Arg Lys Lys Glu Cys
20 25 30
Asn Phe He Trp Phe Asn Glu Ser Ser Ala He He His Asn Thr Pro
35 40 45
Lys Val Phe Glu Gly Glu Ser Phe Phe Asp His Leu Phe Val Ser Ala
50 55 60
Lys He Thr Ala Phe Val Val Ser Thr Asn Glu Ser Asp Thr He Phe 65 70 75 80
Asn Leu Lys Asn Tyr Leu Leu Val Leu Ala Lys Asn Leu Asn Asn Arg
85 90 95
Asp He Trp Tyr Cys Glu Asn Thr He Cys Asp Lys Lys Gly Thr Tyr
100 105 110
Asn He Glu He Glu Leu Val Ser Asn Ala Asn Asp Phe Arg Gly Val
115 120 125
Phe Gly Glu Val Leu Gly He Val Lys Asp Thr Phe Gly Asp Leu Leu
130 135 140
Gin Leu Leu Thr Asn Leu Lys Asn Lys Glu He Glu Phe Asn Phe His 145 150 155 160
Lys Lys He Asn Tyr Gly Leu Pro Phe Gly He He Phe He Ala Ser
165 170 175
Asn Ser Asp Asn Pro He Asp He Asp Asn Lys Thr Lys Lys Leu Lys
180 185 190
Ser Cys Phe Arg Asp Asp Glu Ser Asn Cys Phe He Asp Cys Pro He 195 200 205 Thr He Glu Asp Tyr Leu He Leu Asp Asn Leu Lys Ser Cys Phe Val
210 215 220
He Gin Asn Lys Pro Asn Val Thr Leu Phe Asp Asn Asp Glu Asn Asp 225 230 235 240
Arg Pro Phe Asn Leu Lys Arg Tyr Leu Leu Gly Leu Lys Glu Lys Leu
245 250 255
Gly Phe Glu Pro Thr Gly He Phe Tyr Cys Glu Asn Ala Asn Thr His
260 265 270
Lys He Glu Leu He Gly Asn Asp Ser Asp Phe Arg Glu Val Leu Leu
275 280 285
Glu Phe Ser Glu Asn He Pro Lys Ala Pro Asn Glu Leu Pro Gin Phe
290 295 300
Leu Thr Asn Phe Lys Asn Ser Lys He Pro Asn Gly Asn He Ser Phe 305 310 315 320
Ser Pro Pro Lys Asn Ser Pro Ser He Ser Ser Tyr Ala Leu Ser Asp
325 330 335
Lys He Lys Arg Glu Val Arg Asp Thr Phe Asp Arg Tyr Leu Trp His
340 345 350
Gly Tyr Ser Lys He Pro Gin Glu Lys Arg He Ala Lys He Lys Glu
355 360 365
Gin Val Lys Glu Glu He Lys Leu Asn Pro Ser Phe Arg Asn Tyr Arg
370 375 380
Val Asp Ser Glu Gin Asn Arg Lys He Asn Glu He Ala Glu Gly Leu 385 390 395 400
Lys Ser Gly Lys He He Gly Lys Lys Val He Ala Asn Ala Phe Asp
405 410 415
Leu Asn Ala Ser Leu Leu Phe Tyr Tyr Ser 420 425
(2) INFORMATION FOR SEQ ID NO: 803:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 720 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...705 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 803:
TTGCTAATGC GTTCGATCTA A ATG CTA GCT TAT TGT TTT ATT ACT CCT GAT 51
Met Leu Ala Tyr Cys Phe He Thr Pro Asp 1 5 10
GAT TTA AAG AAT TTA AAG GAA CGA TTA TTT ATA GAT ATT ATC AAT GCT 99 Asp Leu Lys Asn Leu Lys Glu Arg Leu Phe He Asp He He Asn Ala 15 20 25 ATC AAC CAA AAA AAG AGA GTC GCG CTC GAT CAT GCT CAA ATA GAT GAC 147 He Asn Gin Lys Lys Arg Val Ala Leu Asp His Ala Gin He Asp Asp 30 35 40
ATC CAG TAT AAT GTG CTT GAT AAT GCG TTT TAT TTT ATC TTT GAT GTT 195 He Gin Tyr Asn Val Leu Asp Asn Ala Phe Tyr Phe He Phe Asp Val 45 50 55
GGT AAC CCT TCT CAA TTA GCT ATT AAA GTG CCT AGA AAA TCT TTA GAA 243 Gly Asn Pro Ser Gin Leu Ala He Lys Val Pro Arg Lys Ser Leu Glu 60 65 70
AAT GAT GAG TTG CCC AAC ACT AAA AAA AAC ATA TTC AAT GGA TTA ATA 291 Asn Asp Glu Leu Pro Asn Thr Lys Lys Asn He Phe Asn Gly Leu He 75 80 85 90
AGA ACT ATC TAT GGG TGT ATT GAT GAT GAA AAT TCA TTT TTA TTA GAA 339 Arg Thr He Tyr Gly Cys He Asp Asp Glu Asn Ser Phe Leu Leu Glu 95 100 105
AAC GAT AAA ACC ATC AAG GAT TTA AAT ATT CAG GAT TTA TTG GGG CCA 387 Asn Asp Lys Thr He Lys Asp Leu Asn He Gin Asp Leu Leu Gly Pro 110 115 120
TTA AAA ACT CAA GCA TTT CCA TTA TCA TAC ATT ATT ACT GAC GCT ATC 435 Leu Lys Thr Gin Ala Phe Pro Leu Ser Tyr He He Thr Asp Ala He 125 130 135
AAT CAA AAA GAA GGG GTG GCT CTC GAT TAC GCT CTA ATA AAC GAT ATT 483 Asn Gin Lys Glu Gly Val Ala Leu Asp Tyr Ala Leu He Asn Asp He 140 145 150
AAG TAT AAT TTG CTT GAT AAC ACA TTC CAT TTT ATC TTT GAT GTT GGT 531 Lys Tyr Asn Leu Leu Asp Asn Thr Phe His Phe He Phe Asp Val Gly 155 160 165 170
AAT CCT TTG TTG AAA GAG TCA AGT CAA TTT ATT ATT GAA GTG CCT AGA 579 Asn Pro Leu Leu Lys Glu Ser Ser Gin Phe He He Glu Val Pro Arg 175 180 185
GAG GCG TTG GAT CTA GAG AAT GTT GAT CGG CTT GTT GAA TAT ACG CTG 627 Glu Ala Leu Asp Leu Glu Asn Val Asp Arg Leu Val Glu Tyr Thr Leu 190 195 200
TCT CCT AAT AAT CAT AGT CAA AGT TCT TTA GTG TAT CAT ATT TCT GAA 675 Ser Pro Asn Asn His Ser Gin Ser Ser Leu Val Tyr His He Ser Glu 205 210 215
GGC TCT TAT ATC ATT CAC TTA ATA GAT GAC TAAACTTAAA TGAAA 720
Gly Ser Tyr He He His Leu He Asp Asp 220 225
(2) INFORMATION FOR SEQ ID NO:804 (I) SEQUENCE CHARACTERISTICS-
(A) LENGTH: 228 ammo acids
Figure imgf001213_0001
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(II) MOLECULE TYPE: protein
(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 804:
Met Leu Ala Tyr Cys Phe He Thr Pro Asp Asp Leu Lys Asn Leu Lys
1 5 10 15
Glu Arg Leu Phe He Asp He He Asn Ala He Asn Gin Lys Lys Arg
20 25 30
Val Ala Leu Asp His Ala Gin He Asp Asp He Gin Tyr Asn Val Leu
35 40 45
Asp Asn Ala Phe Tyr Phe He Phe Asp Val Gly Asn Pro Ser Gin Leu
50 55 60
Ala He Lys Val Pro Arg Lys Ser Leu Glu Asn Asp Glu Leu Pro Asn 65 70 75 80
Thr Lys Lys Asn He Phe Asn Gly Leu He Arg Thr He Tyr Gly Cys
85 90 95
He Asp Asp Glu Asn Ser Phe Leu Leu Glu Asn Asp Lys Thr He Lys
100 105 110
Asp Leu Asn He Gin Asp Leu Leu Gly Pro Leu Lys Thr Gin Ala Phe
115 120 125
Pro Leu Ser Tyr He He Thr Asp Ala He Asn Gin Lys Glu Gly Val
130 135 140
Ala Leu Asp Tyr Ala Leu He Asn Asp He Lys Tyr Asn Leu Leu Asp 145 150 155 160
Asn Thr Phe His Phe He Phe Asp Val Gly Asn Pro Leu Leu Lys Glu
165 170 175
Ser Ser Gin Phe He He Glu Val Pro Arg Glu Ala Leu Asp Leu Glu
180 185 190
Asn Val Asp Arg Leu Val Glu Tyr Thr Leu Ser Pro Asn Asn His Ser
195 200 205
Gin Ser Ser Leu Val Tyr His He Ser Glu Gly Ser Tyr He He His
210 215 220
Leu He Asp Asp 225
(2) INFORMATION FOR SEQ ID NO: 805:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 611 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25...561 (D) OTHER INFORMATION- (xi) SEQUENCE DESCRIPTION: SEQ ID NO:805:
TTTGGGCATG CTTGGTGATC TTAA ATG AGT CAA GGT GAT GGG GTG GAA GGA 51
Met Ser Gin Gly Asp Gly Val Glu Gly 1 5
AAT AAT ATG GAT ACT ACG AAA GAG AAC TTG AAT GGC TCA AAA GAG CGT 99 Asn Asn Met Asp Thr Thr Lys Glu Asn Leu Asn Gly Ser Lys Glu Arg 10 15 20 25
TTG AGC GAT TGG GAA TAT CGA TGG GCA ATG GCT CTA GTC TAT GGA GGA 147 Leu Ser Asp Trp Glu Tyr Arg Trp Ala Met Ala Leu Val Tyr Gly Gly 30 35 40
TGT ATC TCC ATA ACC ACT AGG ATT TTT TAT GAC ATA AAT GGT TCA GCT 195 Cys He Ser He Thr Thr Arg He Phe Tyr Asp He Asn Gly Ser Ala 45 50 55
AGC GAT CCG CTT TTT GAC CCT AAA TAC AGC TAT TAT GTG TGG TTA GTG 243 Ser Asp Pro Leu Phe Asp Pro Lys Tyr Ser Tyr Tyr Val Trp Leu Val 60 65 70
GCT CTA ATA GCG GCT TTG TTG TCT AAT CTC TTG TTT AAT CCT AAA GGC 291 Ala Leu He Ala Ala Leu Leu Ser Asn Leu Leu Phe Asn Pro Lys Gly 75 80 85
AGG TCG GTA GGT TAT TTA ATG ATT GAA ACT TGG CAA GGG TTC CCC AAG 339 Arg Ser Val Gly Tyr Leu Met He Glu Thr Trp Gin Gly Phe Pro Lys 90 95 100 105 τττ τττ pjij GC ATT τττ AAG Gcτ AGG τττ τττ QGT GCG τττ TAT GAC 387
Phe Phe Lys Ala He Phe Lys Ala Arg Phe Phe Gly Ala Phe Tyr Asp 110 115 120
GCT GTG TTA GGA TCA AGG CTA AGG GAT TTT TAT GTG ATG CTT TTA ACG 435 Ala Val Leu Gly Ser Arg Leu Arg Asp Phe Tyr Val Met Leu Leu Thr 125 130 135
ATG CCC TTT ATT GCC GCT ATC CAT GAG GTT TCG GCG TAT TGT GGG CAT 483 Met Pro Phe He Ala Ala He His Glu Val Ser Ala Tyr Cys Gly His 140 145 150
CCT AGC AAT CTC CTT GTA GAG GGT TTG GTC ATT TTG GGG TTT CAA GGT 531 Pro Ser Asn Leu Leu Val Glu Gly Leu Val He Leu Gly Phe Gin Gly 155 160 165
TTT CTT AAG CTT TGC GCT AAA TGG GGG TGG TGATTTAACC CAAATGTCAT TAA 584 Phe Leu Lys Leu Cys Ala Lys Trp Gly Trp 170 175
ATGGAGGGGG TATAAAAAAA TTAAAAA 611 (2) INFORMATION FOR SEQ ID NO: 806:
(l) SEQUENCE CHARACTERISTICS
(A) LENGTH: 179 ammo acids
(B) TYPE: ammo acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION- SEQ ID NO: 806-
Met Ser Gin Gly Asp Gly Val Glu Gly Asn Asn Met Asp Thr Thr Lys
1 5 10 15
Glu Asn Leu Asn Gly Ser Lys Glu Arg Leu Ser Asp Trp Glu Tyr Arg
20 25 30
Trp Ala Met Ala Leu Val Tyr Gly Gly Cys He Ser He Thr Thr Arg
35 40 45
He Phe Tyr Asp He Asn Gly Ser Ala Ser Asp Pro Leu Phe Asp Pro
50 55 60
Lys Tyr Ser Tyr Tyr Val Trp Leu Val Ala Leu He Ala Ala Leu Leu 65 70 75 80
Ser Asn Leu Leu Phe Asn Pro Lys Gly Arg Ser Val Gly Tyr Leu Met
85 90 95
He Glu Thr Trp Gin Gly Phe Pro Lys Phe Phe Lys Ala He Phe Lys
100 105 110
Ala Arg Phe Phe Gly Ala Phe Tyr Asp Ala Val Leu Gly Ser Arg Leu
115 120 125
Arg Asp Phe Tyr Val Met Leu Leu Thr Met Pro Phe He Ala Ala He
130 135 140
His Glu Val Ser Ala Tyr Cys Gly His Pro Ser Asn Leu Leu Val Glu 145 150 155 160
Gly Leu Val He Leu Gly Phe Gin Gly Phe Leu Lys Leu Cys Ala Lys
165 170 175
Trp Gly Trp
(2) INFORMATION FOR SEQ ID NO: 807:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 424 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(n) MOLECULE TYPE. Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 72...404 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 807: GTTGTGGTAT AAATATTCTT ATCAAGGTGT GCCAAACATG CCTTGAATCT CAATTTTTGA 60
ATCTCAATTT T ATG AAA GGA TTT GTT ATG AGT GGA TTA AGA ACA TTT AGT 110
Met Lys Gly Phe Val Met Ser Gly Leu Arg Thr Phe Ser 1 5 10
TGT GTA GTG GTT TTA TGC GGT GCA ATG GCT AAT GTG GCT ATA GCT AGT 158 Cys Val Val Val Leu Cys Gly Ala Met Ala Asn Val Ala He Ala Ser 15 20 25
CCT AAA ATA GAG GCA AGG GGT GAA TTA GGC AAA TTT ATA GGG GGT GGT 206 Pro Lys He Glu Ala Arg Gly Glu Leu Gly Lys Phe He Gly Gly Gly 30 35 40 45
GTT GGG GGT TTT GTT GGT GAT AAA ATG GGC GGA TTT GTT GGT GGT GCA 254 Val Gly Gly Phe Val Gly Asp Lys Met Gly Gly Phe Val Gly Gly Ala 50 55 60
ATA GGA GGA TAT ATT GGG TCT GAA ATA GGC GAT AGG GTA GAA GAT TAT 302 He Gly Gly Tyr He Gly Ser Glu He Gly Asp Arg Val Glu Asp Tyr 65 70 75
ATC CGT GGT GTT GAT AGA GAG CCA CAA AAC AAA GAA CCA CAA GCC CCA 350 He Arg Gly Val Asp Arg Glu Pro Gin Asn Lys Glu Pro Gin Ala Pro 80 85 90
AGA GAA CCT ATC CGT GAT CTT TAT GAT TAC GGC TAT AGT TTT GGG CAT 398 Arg Glu Pro He Arg Asp Leu Tyr Asp Tyr Gly Tyr Ser Phe Gly His 95 100 105
GCT TGG TGATCTTAAA TGAGTCAAGA 424
Ala Trp
110
(2) INFORMATION FOR SEQ ID NO: 808:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 111 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 808:
Met Lys Gly Phe Val Met Ser Gly Leu Arg Thr Phe Ser Cys Val Val
1 5 10 15
Val Leu Cys Gly Ala Met Ala Asn Val Ala He Ala Ser Pro Lys He
20 25 30
Glu Ala Arg Gly Glu Leu Gly Lys Phe He Gly Gly Gly Val Gly Gly
35 40 45
Phe Val Gly Asp Lys Met Gly Gly Phe Val Gly Gly Ala He Gly Gly
50 55 60
Tyr He Gly Ser Glu He Gly Asp Arg Val Glu Asp Tyr He Arg Gly 65 70 75 80
Val Asp Arg Glu Pro Gin Asn Lys Glu Pro Gin Ala Pro Arg Glu Pro
85 90 95
He Arg Asp Leu Tyr Asp Tyr Gly Tyr Ser Phe Gly His Ala Trp 100 105 110
(2) INFORMATION FOR SEQ ID NO: 809:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 630 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 132...569 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 809:
GATTTTAGTG ATGACGAGTT TCACCCGTTT GATCGTGGTG TTTTCTTTTT TAAGGACCGC 60 TTTGGGCACG CAACAAACCC CCCCCCACTC AAATTTTAGT CTCGCTCTCT TTGATATTGA 120 CTTTTTTCAT C ATG GAA CCT AGC CTA AAA AAG GCC TAT GAT ACA GGG ATT 170 Met Glu Pro Ser Leu Lys Lys Ala Tyr Asp Thr Gly He 1 5 10
AAG CCT TAT ATG GAT AAA AAG ATT TCT TAC ACC GAA GCG TTT GAA AAA 218 Lys Pro Tyr Met Asp Lys Lys He Ser Tyr Thr Glu Ala Phe Glu Lys 15 20 25
AGC GCT CTG CCC TTC AAG GAA TTC ATG CTT AAA AAC ACA CGA GAA AAG 266 Ser Ala Leu Pro Phe Lys Glu Phe Met Leu Lys Asn Thr Arg Glu Lys 30 35 40 45
GAT CTA GCC CTT TTT TTT AGG ATT AGA AAC CTC CCT AAC CCT AAA ACC 314 Asp Leu Ala Leu Phe Phe Arg He Arg Asn Leu Pro Asn Pro Lys Thr 50 55 60
CCT GAT GAG GTG AGT TTG AGC GTT TTG ATC CCG GCA TTT ATG ATA AGC 362 Pro Asp Glu Val Ser Leu Ser Val Leu He Pro Ala Phe Met He Ser 65 70 75
GAG TTG AAA ACA GCG TTT CAA ATC GGC TTT TTA CTC TAC TTG CCT TTT 410 Glu Leu Lys Thr Ala Phe Gin He Gly Phe Leu Leu Tyr Leu Pro Phe 80 85 90
TTG GTG ATT GAT ATG GTG ATC AGC TCT ATT TTA ATG GCG ATG GGC ATG 458 Leu Val He Asp Met Val He Ser Ser He Leu Met Ala Met Gly Met 95 100 105 ATG ATG CTC CCG CCT GTA ATG ATT TCT CTG CCT TTT AAA ATT TTA GTG 506 Met Met Leu Pro Pro Val Met He Ser Leu Pro Phe Lys He Leu Val 110 115 120 125
TTT ATT CTG GTA GAT GGG TTT AAT TTA TTG ACC GAA AAT TTA GTG GCG 554 Phe He Leu Val Asp Gly Phe Asn Leu Leu Thr Glu Asn Leu Val Ala 130 135 140
AGT TTT AAA ATG GTT TGATATTAAC AAGCATTCAA GCGATAAAAG CTTGAAGCTA G 610 Ser Phe Lys Met Val 145
TTTAAAACTC ATAATTCAAA 630
(2) INFORMATION FOR SEQ ID NO: 810:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 146 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 810:
Met Glu Pro Ser Leu Lys Lys Ala Tyr Asp Thr Gly He Lys Pro Tyr
1 5 10 15
Met Asp Lys Lys He Ser Tyr Thr Glu Ala Phe Glu Lys Ser Ala Leu
20 25 30
Pro Phe Lys Glu Phe Met Leu Lys Asn Thr Arg Glu Lys Asp Leu Ala
35 40 45
Leu Phe Phe Arg He Arg Asn Leu Pro Asn Pro Lys Thr Pro Asp Glu
50 55 60
Val Ser Leu Ser Val Leu He Pro Ala Phe Met He Ser Glu Leu Lys 65 70 75 80
Thr Ala Phe Gin He Gly Phe Leu Leu Tyr Leu Pro Phe Leu Val He
85 90 95
Asp Met Val He Ser Ser He Leu Met Ala Met Gly Met Met Met Leu
100 105 110
Pro Pro Val Met He Ser Leu Pro Phe Lys He Leu Val Phe He Leu
115 120 125
Val Asp Gly Phe Asn Leu Leu Thr Glu Asn Leu Val Ala Ser Phe Lys
130 135 140
Met Val 145
(2) INFORMATION FOR SEQ ID NO: 811:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2352 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 13...2313 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 811:
AAAAGGTGGT AA ATG AAA AGA ATT TTA GTC TCT TTG GCT GTT TTG AGT CAT 51 Met Lys Arg He Leu Val Ser Leu Ala Val Leu Ser His 1 5 10
AGC GCG CAT GCT GTC AAA ACT CAT AAT TTG GAA AGG GTG GAA GCT TCA 99 Ser Ala His Ala Val Lys Thr His Asn Leu Glu Arg Val Glu Ala Ser 15 20 25
GGG GTG GCT AAC GAT AAG GAA GCG CCT TTA AGC TGG AGG AGC AAG GAA 147 Gly Val Ala Asn Asp Lys Glu Ala Pro Leu Ser Trp Arg Ser Lys Glu 30 35 40 45
GTG AGA AAC TAT ATG GGA TCT CGC ACG GTG ATT TCT AAC AAG CAA CTC 195 Val Arg Asn Tyr Met Gly Ser Arg Thr Val He Ser Asn Lys Gin Leu 50 55 60
ACT AAA AGC GCC AAT CAG AGC ATT GAA GAA GCT TTG CAA AAT GTG CCA 243 Thr Lys Ser Ala Asn Gin Ser He Glu Glu Ala Leu Gin Asn Val Pro 65 70 75
GGC GTG CAT ATT AGA AAC GCT ACG GGT ATT GGA GCT GTG CCT AGC TTT 291 Gly Val His He Arg Asn Ala Thr Gly He Gly Ala Val Pro Ser Phe 80 85 90
TCT GTT AGG GGC TTT GGT GGG GGA AGT TCA GGG CAT TCC AAT ACG GCT 339 Ser Val Arg Gly Phe Gly Gly Gly Ser Ser Gly His Ser Asn Thr Ala 95 100 105
ATG GTT TTA GTC AAT GGG ATC CCT ATT TAT GTT GCG CCC TAT GTT GAT 387 Met Val Leu Val Asn Gly He Pro He Tyr Val Ala Pro Tyr Val Asp 110 115 120 125
ATT AGC ATT CCT ATT TTC CCT GTA ACC TTT CAA TCT GTA GAT AGA ATC 435 He Ser He Pro He Phe Pro Val Thr Phe Gin Ser Val Asp Arg He 130 135 140
AGC GTA ACC AAG GGT GGG GAG AGC GTG CGT TAT GGC CCT AAT GTT TTT 483 Ser Val Thr Lys Gly Gly Glu Ser Val Arg Tyr Gly Pro Asn Val Phe 145 150 155
GGC GGT GTG ATT AAT GTG ATC ACT AAG GGC ATT CCT ACC AAG TGG GAG 531 Gly Gly Val He Asn Val He Thr Lys Gly He Pro Thr Lys Trp Glu 160 165 170 AGT CAG GTG AGC GAG AGG GCC ACT TTT TGG GGC AAA TCT GAA AAT GGG 579 Ser Gin Val Ser Glu Arg Ala Thr Phe Trp Gly Lys Ser Glu Asn Gly 175 180 185
GGC TTT TTC AAT CAA AAT TCT AAA AAC CTT GAC AAA AGC TTA GCC AAT 627 Gly Phe Phe Asn Gin Asn Ser Lys Asn Leu Asp Lys Ser Leu Ala Asn 190 195 200 205
AAC ATG CTT TTT GAC ACT TAC TTA AGA ACA GGG GGC ATG ATG AAT AAG 675 Asn Met Leu Phe Asp Thr Tyr Leu Arg Thr Gly Gly Met Met Asn Lys 210 215 220
CAT TTT GGA ATC CAA GCT CAA GCC AAC TGG CTT AAA GGG CAA GGG TTT 723 His Phe Gly He Gin Ala Gin Ala Asn Trp Leu Lys Gly Gin Gly Phe 225 230 235
AGA TAC AAC AGC CCT ACG AAC ATT CAA AAC TAC ATG CTA GAT TCC TTG 771 Arg Tyr Asn Ser Pro Thr Asn He Gin Asn Tyr Met Leu Asp Ser Leu 240 245 250
TAT CAA ATT AAT GAT AGT AAT AAG ATC ACT GCT TTT TTC CAA TAC TAT 819 Tyr Gin He Asn Asp Ser Asn Lys He Thr Ala Phe Phe Gin Tyr Tyr 255 260 265
AAT TAT TTT ATG GCA GAC CCC GGA TCT TTA GGC ATA GAA GCG TAT AAT 867 Asn Tyr Phe Met Ala Asp Pro Gly Ser Leu Gly He Glu Ala Tyr Asn 270 275 280 285
CAA AAT CGT TTT CAA AAC AAC CGC CCT AAT AAC AAT AAA AGC GGG AGA 915 Gin Asn Arg Phe Gin Asn Asn Arg Pro Asn Asn Asn Lys Ser Gly Arg 290 295 300
GCG AAG CGR TGG GGA GCT GTG TAT CAA AAC TTT TTT GGG GAT ACG GAC 963 Ala Lys Xaa Trp Gly Ala Val Tyr Gin Asn Phe Phe Gly Asp Thr Asp 305 310 315
AAA ATA GGT GGG GAT TTC ACT TTT AGT TAC TAT GGG CAT GAC ATG TCA 1011 Lys He Gly Gly Asp Phe Thr Phe Ser Tyr Tyr Gly His Asp Met Ser 320 325 330
AGG GAT TTT CAA TTT GAT TCT AAT TTT TTG AAT GTC AAT ACC AAT CCT 1059 Arg Asp Phe Gin Phe Asp Ser Asn Phe Leu Asn Val Asn Thr Asn Pro 335 340 345
AAA TTA GGC CCT GTT TAT ACC GAT CAA AAT TAT CCA GGA TTT TTT ATT 1107 Lys Leu Gly Pro Val Tyr Thr Asp Gin Asn Tyr Pro Gly Phe Phe He 350 355 360 365
TTT GAT CAT TTA AGG CGT TAC ATA ATG AAC GCT TTT GAG CCT AAT TTG 1155 Phe Asp His Leu Arg Arg Tyr He Met Asn Ala Phe Glu Pro Asn Leu 370 375 380
AAC TTA GTT GTC AAT ACC AAT AAA GTT AAG CAA ACT TTT AAT GTG GGC 1203 Asn Leu Val Val Asn Thr Asn Lys Val Lys Gin Thr Phe Asn Val Gly 385 390 395 ATG CGT TTT ATG ACA ATG GAT ATG TAT TTC AGA TTG GAT CAA AGC ACA 1251 Met Arg Phe Met Thr Met Asp Met Tyr Phe Arg Leu Asp Gin Ser Thr 400 405 410
TGC GAA AAA ACC GAT ATT TTT AAT GGG GTG TGC CGC ATG CCT CCT TTT 1299 Cys Glu Lys Thr Asp He Phe Asn Gly Val Cys Arg Met Pro Pro Phe 415 420 425
GTT CTT TCT AAA AAA CCC AGC AAC AAT CAA AAC CTG TTT AAC AAC TAT 1347 Val Leu Ser Lys Lys Pro Ser Asn Asn Gin Asn Leu Phe Asn Asn Tyr 430 435 440 445
ACA GCG GTA TGG TTG AGC GAT AAA ATA GAG CTT TTT GAT TCT AAA TTG 1395 Thr Ala Val Trp Leu Ser Asp Lys He Glu Leu Phe Asp Ser Lys Leu 450 455 460
GTG ATA ACT CCA GGG CTT AGA TAC ACT TTT TTG AAC TAT AAC AAC AAA 1443 Val He Thr Pro Gly Leu Arg Tyr Thr Phe Leu Asn Tyr Asn Asn Lys 465 470 475
GAG CCA GAA AAG CAT GAT TTT TCT GTG TGG AAT ATT ACA AAA AAG CGT 1491 Glu Pro Glu Lys His Asp Phe Ser Val Trp Asn He Thr Lys Lys Arg 480 485 490
CAA AAC GAA TGG AGT CCC GCC CTT AAC ATT GGC TAT AAA CCT ATG GAA 1539 Gin Asn Glu Trp Ser Pro Ala Leu Asn He Gly Tyr Lys Pro Met Glu 495 500 505
AAT TGG ATA TGG TAT GCG AAC TAC CGC CGC AGT TTT ATC CCC CCA CAA 1587 Asn Trp He Trp Tyr Ala Asn Tyr Arg Arg Ser Phe He Pro Pro Gin 510 515 520 525
CAT ACA ATG CTA GGC ATT ACT AGG ACT AAT TAC AAC CAA ATT TTT AAT 1635 His Thr Met Leu Gly He Thr Arg Thr Asn Tyr Asn Gin He Phe Asn 530 535 540
GAA ATT GAA GTG GGG CAA CGC TAT AGT TAT AAA AAT CTA TTG AGC TTT 1683 Glu He Glu Val Gly Gin Arg Tyr Ser Tyr Lys Asn Leu Leu Ser Phe 545 550 555
AAC ACG AAT TAT TTT GTG ATT TTT GCC AAG CGT TAC TAT GCG GGA GGC 1731 Asn Thr Asn Tyr Phe Val He Phe Ala Lys Arg Tyr Tyr Ala Gly Gly 560 565 570
TAT AGC CCA CAG CCT ATT AAC GCT AGG AGT CAA GGG GTA GAA TTG GAA 1779 Tyr Ser Pro Gin Pro He Asn Ala Arg Ser Gin Gly Val Glu Leu Glu 575 580 585
TTG TAT TAC GCG CCG ATT AGG GGT TTG CAA TTC CAT GTG GCT TAC ACC 1827 Leu Tyr Tyr Ala Pro He Arg Gly Leu Gin Phe His Val Ala Tyr Thr 590 595 600 605
TAT ATT GAT GCA CGC ATC ACT TCT AAC GCT GAT GAT ATT GCT TAT TAT 1875 Tyr He Asp Ala Arg He Thr Ser Asn Ala Asp Asp He Ala Tyr Tyr 610 615 620 TTT ACA GGC ATT GTC AAT AAA CCC TTT GAC ATT AAA GGG AAG CGT TTG 1923 Phe Thr Gly He Val Asn Lys Pro Phe Asp He Lys Gly Lys Arg Leu 625 630 635
CCT TAT GTG AGT CCT AAC CAA TTC ATA TTT GAC ATG ATG TAT ACT TAC 1971 Pro Tyr Val Ser Pro Asn Gin Phe He Phe Asp Met Met Tyr Thr Tyr 640 645 650
AAG CAC ACG ACT TTT GGT ATT AGC AGC TAT TTT TAT AGC CGT GCT TAT 2019 Lys His Thr Thr Phe Gly He Ser Ser Tyr Phe Tyr Ser Arg Ala Tyr 655 660 665
AGT TCT ATG CTC AAT CAG GCC AAA AGC CAA ACC GTG TGC CTG CCC TTA 2067 Ser Ser Met Leu Asn Gin Ala Lys Ser Gin Thr Val Cys Leu Pro Leu 670 675 680 685
AAC CCA GAA TAC ACA GGG GGG CTA GAG TAT GGT TGT AAT TCA GTA GGG 2115 Asn Pro Glu Tyr Thr Gly Gly Leu Glu Tyr Gly Cys Asn Ser Val Gly 690 695 700
TTA TTG CCC TTG TAT TTT GTG TTG AAC GTT CAA GTA AGC TCG GTT TTA 2163 Leu Leu Pro Leu Tyr Phe Val Leu Asn Val Gin Val Ser Ser Val Leu 705 710 715
TGG CAA AGC GGT AGG CAT AAA ATC ACA GGG AGT TTG CAA ATC AAT AAT 2211 Trp Gin Ser Gly Arg His Lys He Thr Gly Ser Leu Gin He Asn Asn 720 725 730
CTT TTT AAC ATG AAG TAT TAT TTT AGG GGA ATT GGC ACA AGC CCT ACA 2259 Leu Phe Asn Met Lys Tyr Tyr Phe Arg Gly He Gly Thr Ser Pro Thr 735 740 745
GGA AGA GAG CCC GCA CCA GGG CGA TCC ATT ACA GCG TAT TTG AAT TAT 2307 Gly Arg Glu Pro Ala Pro Gly Arg Ser He Thr Ala Tyr Leu Asn Tyr 750 755 760 765
GAG TTT TAAACTAGCT TCAAGCTTTT ATCGCTTGAA TGCTTGTTA 2352 Glu Phe
(2) INFORMATION FOR SEQ ID NO: 812:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 767 ammo acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 812:
Met Lys Arg He Leu Val Ser Leu Ala Val Leu Ser Hi s Ser Ala Hi s 1 5 10 15 Ala Val Lys Thr His Asn Leu Glu Arg Val Glu Ala Ser Gly Val Ala
20 25 30
Asn Asp Lys Glu Ala Pro Leu Ser Trp Arg Ser Lys Glu Val Arg Asn
35 40 45
Tyr Met Gly Ser Arg Thr Val He Ser Asn Lys Gin Leu Thr Lys Ser
50 55 60
Ala Asn Gin Ser He Glu Glu Ala Leu Gin Asn Val Pro Gly Val His 65 70 75 80
He Arg Asn Ala Thr Gly He Gly Ala Val Pro Ser Phe Ser Val Arg
85 90 95
Gly Phe Gly Gly Gly Ser Ser Gly His Ser Asn Thr Ala Met Val Leu
100 105 110
Val Asn Gly He Pro He Tyr Val Ala Pro Tyr Val Asp He Ser He
115 120 125
Pro He Phe Pro Val Thr Phe Gin Ser Val Asp Arg He Ser Val Thr
130 135 140
Lys Gly Gly Glu Ser Val Arg Tyr Gly Pro Asn Val Phe Gly Gly Val 145 150 155 160
He Asn Val He Thr Lys Gly He Pro Thr Lys Trp Glu Ser Gin Val
165 170 175
Ser Glu Arg Ala Thr Phe Trp Gly Lys Ser Glu Asn Gly Gly Phe Phe
180 185 190
Asn Gin Asn Ser Lys Asn Leu Asp Lys Ser Leu Ala Asn Asn Met Leu
195 200 205
Phe Asp Thr Tyr Leu Arg Thr Gly Gly Met Met Asn Lys His Phe Gly
210 215 220
He Gin Ala Gin Ala Asn Trp Leu Lys Gly Gin Gly Phe Arg Tyr Asn 225 230 235 240
Ser Pro Thr Asn He Gin Asn Tyr Met Leu Asp Ser Leu Tyr Gin He
245 250 255
Asn Asp Ser Asn Lys He Thr Ala Phe Phe Gin Tyr Tyr Asn Tyr Phe
260 265 270
Met Ala Asp Pro Gly Ser Leu Gly He Glu Ala Tyr Asn Gin Asn Arg
275 280 285
Phe Gin Asn Asn Arg Pro Asn Asn Asn Lys Ser Gly Arg Ala Lys Xaa
290 295 300
Trp Gly Ala Val Tyr Gin Asn Phe Phe Gly Asp Thr Asp Lys He Gly 305 310 315 320
Gly Asp Phe Thr Phe Ser Tyr Tyr Gly His Asp Met Ser Arg Asp Phe
325 330 335
Gin Phe Asp Ser Asn Phe Leu Asn Val Asn Thr Asn Pro Lys Leu Gly
340 345 350
Pro Val Tyr Thr Asp Gin Asn Tyr Pro Gly Phe Phe He Phe Asp His
355 360 365
Leu Arg Arg Tyr He Met Asn Ala Phe Glu Pro Asn Leu Asn Leu Val
370 375 380
Val Asn Thr Asn Lys Val Lys Gin Thr Phe Asn Val Gly Met Arg Phe 385 390 395 400
Met Thr Met Asp Met Tyr Phe Arg Leu Asp Gin Ser Thr Cys Glu Lys
405 410 415
Thr Asp He Phe Asn Gly Val Cys Arg Met Pro Pro Phe Val Leu Ser
420 425 430
Lys Lys Pro Ser Asn Asn Gin Asn Leu Phe Asn Asn Tyr Thr Ala Val
435 440 445
Trp Leu Ser Asp Lys He Glu Leu Phe Asp Ser Lys Leu Val He Thr 450 455 460
Pro Gly Leu Arg Tyr Thr Phe Leu Asn Tyr Asn Asn Lys Glu Pro Glu 465 470 475 480
Lys His Asp Phe Ser Val Trp Asn He Thr Lys Lys Arg Gin Asn Glu
485 490 495
Trp Ser Pro Ala Leu Asn He Gly Tyr Lys Pro Met Glu Asn Trp He
500 505 510
Trp Tyr Ala Asn Tyr Arg Arg Ser Phe He Pro Pro Gin His Thr Met
515 520 525
Leu Gly He Thr Arg Thr Asn Tyr Asn Gin He Phe Asn Glu He Glu
530 535 540
Val Gly Gin Arg Tyr Ser Tyr Lys Asn Leu Leu Ser Phe Asn Thr Asn 545 550 555 560
Tyr Phe Val He Phe Ala Lys Arg Tyr Tyr Ala Gly Gly Tyr Ser Pro
565 570 575
Gin Pro He Asn Ala Arg Ser Gin Gly Val Glu Leu Glu Leu Tyr Tyr
580 585 590
Ala Pro He Arg Gly Leu Gin Phe His Val Ala Tyr Thr Tyr He Asp
595 600 605
Ala Arg He Thr Ser Asn Ala Asp Asp He Ala Tyr Tyr Phe Thr Gly
610 615 620
He Val Asn Lys Pro Phe Asp He Lys Gly Lys Arg Leu Pro Tyr Val 625 630 635 640
Ser Pro Asn Gin Phe He Phe Asp Met Met Tyr Thr Tyr Lys His Thr
645 650 655
Thr Phe Gly He Ser Ser Tyr Phe Tyr Ser Arg Ala Tyr Ser Ser Met
660 665 670
Leu Asn Gin Ala Lys Ser Gin Thr Val Cys Leu Pro Leu Asn Pro Glu
675 680 685
Tyr Thr Gly Gly Leu Glu Tyr Gly Cys Asn Ser Val Gly Leu Leu Pro
690 695 700
Leu Tyr Phe Val Leu Asn Val Gin Val Ser Ser Val Leu Trp Gin Ser 705 710 715 720
Gly Arg His Lys He Thr Gly Ser Leu Gin He Asn Asn Leu Phe Asn
725 730 735
Met Lys Tyr Tyr Phe Arg Gly He Gly Thr Ser Pro Thr Gly Arg Glu
740 745 750
Pro Ala Pro Gly Arg Ser He Thr Ala Tyr Leu Asn Tyr Glu Phe 755 760 765
(2) INFORMATION FOR SEQ ID NO: 813:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 888 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...837 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 813:
AGATAGGAAT GTAAAGGA ATG GAA TTT ATG AAA AAG TTT GTA GCT TTA GGG 51
Met Glu Phe Met Lys Lys Phe Val Ala Leu Gly 1 5 10
CTT CTA TCC GCA GTT TTA AGC TCT TCG TTG TTA GCC GAA GGT GAT GGT 99 Leu Leu Ser Ala Val Leu Ser Ser Ser Leu Leu Ala Glu Gly Asp Gly 15 20 25
GTT TAT ATA GGG ACT AAT TAT CAG CTT GGA CAA GCC CGT TTG AAT AGT 147 Val Tyr He Gly Thr Asn Tyr Gin Leu Gly Gin Ala Arg Leu Asn Ser 30 35 40
AAT ATT TAT AAT ACA GGG GAT TGC ACA GGG AGT GTT GTA GGT TGC CCC 195 Asn He Tyr Asn Thr Gly Asp Cys Thr Gly Ser Val Val Gly Cys Pro 45 50 55
CCA GGT CTT ACC GCT AAT AAG CAT AAT CCA GGA GGC ACC AAT ATC AAT 243 Pro Gly Leu Thr Ala Asn Lys His Asn Pro Gly Gly Thr Asn He Asn 60 65 70 75
TGG CAT GCT AAA TAC GCT AAT GGG GCT TTG AAT GGT CTT GGG TTG AAT 291 Trp His Ala Lys Tyr Ala Asn Gly Ala Leu Asn Gly Leu Gly Leu Asn 80 85 90
GTG GGT TAT AAG AAG TTC TTC CAG TTC AAG TCT TTT GAT ATG ACA AGC 339 Val Gly Tyr Lys Lys Phe Phe Gin Phe Lys Ser Phe Asp Met Thr Ser 95 100 105
AAG TGG TTT GGT TTT AGA GTG TAT GGG CTT TTT GAT TAT GGG CAT GCC 387 Lys Trp Phe Gly Phe Arg Val Tyr Gly Leu Phe Asp Tyr Gly His Ala 110 115 120
ACT TTA GGC AAG CAA GTT TAT GCA CCT AAT AAA ATC CAG TTG GAT ATG 435 Thr Leu Gly Lys Gin Val Tyr Ala Pro Asn Lys He Gin Leu Asp Met 125 130 135
GTC TCT TGG GGT GTG GGG AGC GAT TTG TTA GCT GAT ATT ATT GAT AAC 483 Val Ser Trp Gly Val Gly Ser Asp Leu Leu Ala Asp He He Asp Asn 140 145 150 155
GAT AAC GCT TCT TTT GGT ATT TTT GGT GGG GTC GCT ATC GGC GGT AAC 531 Asp Asn Ala Ser Phe Gly He Phe Gly Gly Val Ala He Gly Gly Asn 160 165 170
ACT TGG AAA AGC TCA GCG GCA AAC TAT TGG AAA GAG CAA ATC ATT GAA 579 Thr Trp Lys Ser Ser Ala Ala Asn Tyr Trp Lys Glu Gin He He Glu 175 180 185
GCT AAG GGT CCT GAT GTT TGT ACC CCT ACT TAT TGT AAC CCT AAC GCT 627 Ala Lys Gly Pro Asp Val Cys Thr Pro Thr Tyr Cys Asn Pro Asn Ala 190 195 200 CCT TAT AGC ACC AAA ACT TCA ACC GTC GCT TTT CAG GTA TGG TTG AAT 675 Pro Tyr Ser Thr Lys Thr Ser Thr Val Ala Phe Gin Val Trp Leu Asn 205 210 215
TTT GGG GTG AGA GCC AAT ATT TAC AAG CAT AAT GGC GTA GAG TTT GGC 723 Phe Gly Val Arg Ala Asn He Tyr Lys His Asn Gly Val Glu Phe Gly 220 225 230 235
GTG AGA GTG CCG CTA CTC ATC AAC AAG TTT TTG AGT GCG GGT CCT AAC 771 Val Arg Val Pro Leu Leu He Asn Lys Phe Leu Ser Ala Gly Pro Asn 240 245 250
GCT ACT AAT CTT TAT TAC CAT TTG AAA CGG GAT TAT TCG CTT TAT TTA 819 Ala Thr Asn Leu Tyr Tyr His Leu Lys Arg Asp Tyr Ser Leu Tyr Leu 255 260 265
GGG TAT AAC TAC ACT TTT TAAACCCTTT AAAAGGGTGT CTTTAAGCCC TTTTTAGT 875 Gly Tyr Asn Tyr Thr Phe 270
CCTTATAAAA AGG 88S
(2) INFORMATION FOR SEQ ID NO: 814:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 273 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 814:
Met Glu Phe Met Lys Lys Phe Val Ala Leu Gly Leu Leu Ser Ala Val
1 5 10 15
Leu Ser Ser Ser Leu Leu Ala Glu Gly Asp Gly Val Tyr He Gly Thr
20 25 30
Asn Tyr Gin Leu Gly Gin Ala Arg Leu Asn Ser Asn He Tyr Asn Thr
35 40 45
Gly Asp Cys Thr Gly Ser Val Val Gly Cys Pro Pro Gly Leu Thr Ala
50 55 60
Asn Lys His Asn Pro Gly Gly Thr Asn He Asn Trp His Ala Lys Tyr 65 70 75 80
Ala Asn Gly Ala Leu Asn Gly Leu Gly Leu Asn Val Gly Tyr Lys Lys
85 90 95
Phe Phe Gin Phe Lys Ser Phe Asp Met Thr Ser Lys Trp Phe Gly Phe
100 105 110
Arg Val Tyr Gly Leu Phe Asp Tyr Gly His Ala Thr Leu Gly Lys Gin
115 120 125
Val Tyr Ala Pro Asn Lys He Gin Leu Asp Met Val Ser Trp Gly Val
130 135 140
Gly Ser Asp Leu Leu Ala Asp He He Asp Asn Asp Asn Ala Ser Phe 145 150 155 160
Gly He Phe Gly Gly Val Ala He Gly Gly Asn Thr Trp Lys Ser Ser 165 170 175
Ala Ala Asn Tyr Trp Lys Glu Gin He He Glu Ala Lys Gly Pro Asp
180 185 190
Val Cys Thr Pro Thr Tyr Cys Asn Pro Asn Ala Pro Tyr Ser Thr Lys
195 200 205
Thr Ser Thr Val Ala Phe Gin Val Trp Leu Asn Phe Gly Val Arg Ala
210 215 220
Asn He Tyr Lys His Asn Gly Val Glu Phe Gly Val Arg Val Pro Leu 225 230 235 240
Leu He Asn Lys Phe Leu Ser Ala Gly Pro Asn Ala Thr Asn Leu Tyr
245 250 255
Tyr His Leu Lys Arg Asp Tyr Ser Leu Tyr Leu Gly Tyr Asn Tyr Thr
260 265 270
Phe
(2) INFORMATION FOR SEQ ID NO: 815
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 560 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 37...522 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 815:
CACGGTCATT TTGTCTGTGT TTTTAGTCGC GTATTC ATG CAA CTG CTT ATG GAA 54
Met Gin Leu Leu Met Glu
1 5
CGC CTT TCT CTT GTC TTC AGG CAT CTT TTC CAT GCG CTT TTT CAA CTC 102 Arg Leu Ser Leu Val Phe Arg His Leu Phe His Ala Leu Phe Gin Leu 10 15 20
TTT TGT GTA ATC CAC AAT ATC CTG CGG AGK ACA ACG CCA GCC ATT TTA 150 Phe Cys Val He His Asn He Leu Arg Xaa Thr Thr Pro Ala He Leu 25 30 35
GCC AAA TCT TCA TCG CTT GTT TTG CTG AAA TCT TTG GCG TTC AAA GCC 198 Ala Lys Ser Ser Ser Leu Val Leu Leu Lys Ser Leu Ala Phe Lys Ala 40 45 50
ACA AAT AGC AAC GCG CTA ACA GAA AGT ATT TTC AAC GCT TTT TTC ATT 246 Thr Asn Ser Asn Ala Leu Thr Glu Ser He Phe Asn Ala Phe Phe He 55 60 65 70 TTT TAT CCT TTT AAA TTA AAT TTA TCT CAC TTA GGA GAG CAA TGC TCG 294 Phe Tyr Pro Phe Lys Leu Asn Leu Ser His Leu Gly Glu Gin Cys Ser 75 80 85
TCT TTT TTC TTA ACA GCC CTA CAC CAA ACT TTT CTC GTA TCG CCG CTG 342 Ser Phe Phe Leu Thr Ala Leu His Gin Thr Phe Leu Val Ser Pro Leu 90 95 100
CAA ACG CTC ACA TTA AGT CCT TTT GCC TTG ATT TCT TCA TCG CTC AAA 390 Gin Thr Leu Thr Leu Ser Pro Phe Ala Leu He Ser Ser Ser Leu Lys 105 110 115
CCT TTG GTT TTT TCT TCT AAT TCT TTA CGC ACT TCT TCA CGC ATT TTT 438 Pro Leu Val Phe Ser Ser Asn Ser Leu Arg Thr Ser Ser Arg He Phe 120 125 130
TTG AAA TCC TTC GCT CAT TTT GGA AAG ATT CTT CCT AGC GAT CCG GCT 486 Leu Lys Ser Phe Ala His Phe Gly Lys He Leu Pro Ser Asp Pro Ala 135 140 145 150
GAA ATT CGC GCG GAA TTT CTT AGC GTC CTC AGC GTT TAAAGTTTTA AGGCGT 538 Glu He Arg Ala Glu Phe Leu Ser Val Leu Ser Val 155 160
TTAGACACTT CCATGCGATA AT 560
(2) INFORMATION FOR SEQ ID NO: 816:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 162 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 816:
Met Gin Leu Leu Met Glu Arg Leu Ser Leu Val Phe Arg His Leu Phe
1 5 10 15
His Ala Leu Phe Gin Leu Phe Cys Val He His Asn He Leu Arg Xaa
20 25 30
Thr Thr Pro Ala He Leu Ala Lys Ser Ser Ser Leu Val Leu Leu Lys
35 40 45
Ser Leu Ala Phe Lys Ala Thr Asn Ser Asn Ala Leu Thr Glu Ser He
50 55 60
Phe Asn Ala Phe Phe He Phe Tyr Pro Phe Lys Leu Asn Leu Ser His 65 70 75 80
Leu Gly Glu Gin Cys Ser Ser Phe Phe Leu Thr Ala Leu His Gin Thr
85 90 95
Phe Leu Val Ser Pro Leu Gin Thr Leu Thr Leu Ser Pro Phe Ala Leu
100 105 110
He Ser Ser Ser Leu Lys Pro Leu Val Phe Ser Ser Asn Ser Leu Arg
115 120 125
Thr Ser Ser Arg He Phe Leu Lys Ser Phe Ala His Phe Gly Lys He 130 135 140
Leu Pro Ser Asp Pro Ala Glu He Arg Ala Glu Phe Leu Ser Val Leu 145 150 155 160
Ser Val
(2) INFORMATION FOR SEQ ID NO: 817:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1196 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 41...1132 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 817:
AAGGCATCCA CCATTAACCC TTTTTAAATT TATTTTAACG ATG TTT TAC TAT ACT 55
Met Phe Tyr Tyr Thr 1 5
ATA AAA TCT TTT AAT TTC AAA AGG TGG TCC ATA ATG AGA ATA TTT TTG 103 He Lys Ser Phe Asn Phe Lys Arg Trp Ser He Met Arg He Phe Leu 10 15 20
AAA TTG TTG ATT CTT TTA TTT TGT TTG AAG GGG CAG GTT ATG GCT CAA 151 Lys Leu Leu He Leu Leu Phe Cys Leu Lys Gly Gin Val Met Ala Gin 25 30 35
AAT TTA CCC ACC ATT GCT TTA CTG GCG ACA GGG GGG ACG ATT GCA GGG 199 Asn Leu Pro Thr He Ala Leu Leu Ala Thr Gly Gly Thr He Ala Gly 40 45 50
AGT GGT GCG AGC GCG AGT TTG GGT AGT TAT AAG AGT GGT GAG TTG GGC 247 Ser Gly Ala Ser Ala Ser Leu Gly Ser Tyr Lys Ser Gly Glu Leu Gly 55 60 65
ATC AAA GAG CTT TTG AAG GCT ATC CCT AGT CTT AAC AGA CTC GCT CGC 295 He Lys Glu Leu Leu Lys Ala He Pro Ser Leu Asn Arg Leu Ala Arg 70 75 80 85
ATT CAA GGG GAG CAG ATT TCT AAC ATC GGC TCA CAA GAC ATG AAT GAA 343 He Gin Gly Glu Gin He Ser Asn He Gly Ser Gin Asp Met Asn Glu 90 95 100
GAG GTA TGG TTC AAG CTC GCC AAA CGT GCC CAA GAA TTG CTA GAT GAT 391 Glu Val Trp Phe Lys Leu Ala Lys Arg Ala Gin Glu Leu Leu Asp Asp 105 110 115
AGC CGT ATT CAA GGC GTG GTC ATC ACG CAT GGC ACG GAC ACT TTA GAA 439 Ser Arg He Gin Gly Val Val He Thr His Gly Thr Asp Thr Leu Glu 120 125 130
GAG AGC GCG TAT TTT TTA AAC TTA GTT TTA CGC TCC ACA AAA CCG GTC 487 Glu Ser Ala Tyr Phe Leu Asn Leu Val Leu Arg Ser Thr Lys Pro Val 135 140 145
GTG CTG GTG GGA GCG ATG CGT AAT GCT GCT TCT TTG AGC GCG GAT GGG 535 Val Leu Val Gly Ala Met Arg Asn Ala Ala Ser Leu Ser Ala Asp Gly 150 155 160 165
GCT TTG AAT TTA TAT AAT GCT GTG AGC GTA GCG CTC AAT GAA AAA AGT 583 Ala Leu Asn Leu Tyr Asn Ala Val Ser Val Ala Leu Asn Glu Lys Ser 170 175 180
GCG AAT AAA GGC GTG TTA GTG GTG ATG GAC GAT AAT ATT TTT AGC GCT 631 Ala Asn Lys Gly Val Leu Val Val Met Asp Asp Asn He Phe Ser Ala 185 190 195
AGA GAA GTG ATT AAA ACG CAC ACC ACC CAC ACT TCC ACC TTT AAA GCC 679 Arg Glu Val He Lys Thr His Thr Thr His Thr Ser Thr Phe Lys Ala 200 205 210
TTA AAT AGC GGC GCG ATA GGG AGC GTG TAT TAT GGC AAA ACG CGC TAT 727 Leu Asn Ser Gly Ala He Gly Ser Val Tyr Tyr Gly Lys Thr Arg Tyr 215 220 225
TAC ATG CAG CCT TTG AGA AAA CAC ACC ACA GAG AGC GAA TTT TCC CTT 775 Tyr Met Gin Pro Leu Arg Lys His Thr Thr Glu Ser Glu Phe Ser Leu 230 235 240 245
TCA CAA CTC AAA ACC CCC CTG CCT AAA GTG GAT ATT ATT TAC ACG CAT 823 Ser Gin Leu Lys Thr Pro Leu Pro Lys Val Asp He He Tyr Thr His 250 255 260
GCT GGC ATG ACC CCT GAT TTA TTC CAA GCG AGC CTA AAC TCG CAT GCA 871 Ala Gly Met Thr Pro Asp Leu Phe Gin Ala Ser Leu Asn Ser His Ala 265 270 275
AAA GGC GTT GTG ATA GCC GGG GTG GGT AAT GGG AAT GTG AGC GCT GGG 919 Lys Gly Val Val He Ala Gly Val Gly Asn Gly Asn Val Ser Ala Gly 280 285 290
TTT TTA AAA GCG ATG CAA GAA GCG AGC CAA ATG GGG GTG GTT ATT GTT 967 Phe Leu Lys Ala Met Gin Glu Ala Ser Gin Met Gly Val Val He Val 295 300 305
CGT TCT AGC AGG GTA AAT AGC GGT GAG ATT ACT TCA GGC GAG ATT GAT 1015 Arg Ser Ser Arg Val Asn Ser Gly Glu He Thr Ser Gly Glu He Asp 310 315 320 325
GAC AAG GCC TTC ATC ACA AGC GAC AAT TTA AAC CCC CAA AAA GCT AGG 1063 Asp Lys Ala Phe He Thr Ser Asp Asn Leu Asn Pro Gin Lys Ala Arg 330 335 340
GTG CTT TTA CAA CTC GCT TTA ACT AAA ACA AAT AAT AAA GAA AAA ATC 1111 Val Leu Leu Gin Leu Ala Leu Thr Lys Thr Asn Asn Lys Glu Lys He 345 350 355
CAA GAA ATG TTT GAA GAG TAT TGAAAGATTC TCTTAAATCA CCCAATTATC AAAG 1166 Gin Glu Met Phe Glu Glu Tyr 360
ATAATTGGGT GATTTGGTTT ATTTTGTTTT 1196
(2) INFORMATION FOR SEQ ID NO: 818:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 364 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 818:
Met Phe Tyr Tyr Thr He Lys Ser Phe Asn Phe Lys Arg Trp Ser He
1 5 10 15
Met Arg He Phe Leu Lys Leu Leu He Leu Leu Phe Cys Leu Lys Gly
20 25 30
Gin Val Met Ala Gin Asn Leu Pro Thr He Ala Leu Leu Ala Thr Gly
35 40 45
Gly Thr He Ala Gly Ser Gly Ala Ser Ala Ser Leu Gly Ser Tyr Lys
50 55 60
Ser Gly Glu Leu Gly He Lys Glu Leu Leu Lys Ala He Pro Ser Leu 65 70 75 80
Asn Arg Leu Ala Arg He Gin Gly Glu Gin He Ser Asn He Gly Ser
85 90 95
Gin Asp Met Asn Glu Glu Val Trp Phe Lys Leu Ala Lys Arg Ala Gin
100 105 110
Glu Leu Leu Asp Asp Ser Arg He Gin Gly Val Val He Thr His Gly
115 120 125
Thr Asp Thr Leu Glu Glu Ser Ala Tyr Phe Leu Asn Leu Val Leu Arg
130 135 140
Ser Thr Lys Pro Val Val Leu Val Gly Ala Met Arg Asn Ala Ala Ser 145 150 155 160
Leu Ser Ala Asp Gly Ala Leu Asn Leu Tyr Asn Ala Val Ser Val Ala
165 170 175
Leu Asn Glu Lys Ser Ala Asn Lys Gly Val Leu Val Val Met Asp Asp
180 185 190
Asn He Phe Ser Ala Arg Glu Val He Lys Thr His Thr Thr His Thr
195 200 205
Ser Thr Phe Lys Ala Leu Asn Ser Gly Ala He Gly Ser Val Tyr Tyr
210 215 220
Gly Lys Thr Arg Tyr Tyr Met Gin Pro Leu Arg Lys His Thr Thr Glu 225 230 235 240 Ser Glu Phe Ser Leu Ser Gin Leu Lys Thr Pro Leu Pro Lys Val Asp
245 250 255
He He Tyr Thr His Ala Gly Met Thr Pro Asp Leu Phe Gin Ala Ser
260 265 270
Leu Asn Ser His Ala Lys Gly Val Val He Ala Gly Val Gly Asn Gly
275 280 285
Asn Val Ser Ala Gly Phe Leu Lys Ala Met Gin Glu Ala Ser Gin Met
290 295 300
Gly Val Val He Val Arg Ser Ser Arg Val Asn Ser Gly Glu He Thr 305 310 315 320
Ser Gly Glu He Asp Asp Lys Ala Phe He Thr Ser Asp Asn Leu Asn
325 330 335
Pro Gin Lys Ala Arg Val Leu Leu Gin Leu Ala Leu Thr Lys Thr Asn
340 345 350
Asn Lys Glu Lys He Gin Glu Met Phe Glu Glu Tyr 355 360
(2) INFORMATION FOR SEQ ID NO: 819:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 678 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 37...612 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 819:
AGTAAAACAT CGTTAAAATA AATTTAAAAA GGGTTA ATG GTG GAT GCC TTT TTC 54
Met Val Asp Ala Phe Phe 1 5
CAA ATT GCA GTG TTA CTT TTT TCG CTT TTT TTA GGG GCA AGG CTA GGG 102 Gin He Ala Val Leu Leu Phe Ser Leu Phe Leu Gly Ala Arg Leu Gly 10 15 20
GGC TTG GGA GTG GGC TAT GCG GGG GGC TTG GGC GTG CTT ATT TTA TGC 150 Gly Leu Gly Val Gly Tyr Ala Gly Gly Leu Gly Val Leu He Leu Cys 25 30 35
TTA TTT TTG GGG CTA AAT CCG GGC AAA ATC CCT TTT GAT GTG ATT TTA 198 Leu Phe Leu Gly Leu Asn Pro Gly Lys He Pro Phe Asp Val He Leu 40 45 50
ATC ATC ATG GCA GTC ATT AGC GCT ATT AGC GCG ATG CAA AAA GCG GGG 246 He He Met Ala Val He Ser Ala He Ser Ala Met Gin Lys Ala Gly 55 60 65 70 GGC TTG GAT TAC TTA GTC AAA ATC GCT GAA AAA ATT TTA AGG AAA CAC 294 Gly Leu Asp Tyr Leu Val Lys He Ala Glu Lys He Leu Arg Lys His 75 80 85
CCC AAG CAA ATC AAT TAC CTT GCG CCA AGC GTG GCG TAT TGT TTA ACG 342 Pro Lys Gin He Asn Tyr Leu Ala Pro Ser Val Ala Tyr Cys Leu Thr 90 95 100
ATA CTA GCC GGC ACC GGG CAT ACG GTT TTT TCC TTG ATC CCG GTG ATT 390 He Leu Ala Gly Thr Gly His Thr Val Phe Ser Leu He Pro Val He 105 110 115
GTG GAA GTG AGC CAG AGC CAA AAC ATC AAG CCT AAA GCG CCT TTA AGC 438 Val Glu Val Ser Gin Ser Gin Asn He Lys Pro Lys Ala Pro Leu Ser 120 125 130
TTA GCG GTA GTC TCT AGT CAA GTC GCT ATT ACT GCA AGC CCG GTG AGC 486 Leu Ala Val Val Ser Ser Gin Val Ala He Thr Ala Ser Pro Val Ser 135 140 145 150
GCA GCG GTN GGT GTT TAT GAG CGG CAT TTT AGA GCC TTT AGG AGC AAA 534 Ala Ala Xaa Gly Val Tyr Glu Arg His Phe Arg Ala Phe Arg Ser Lys 155 160 165
TTA CTT GAC CCT TTT AAT GGT TTG GAT CCC TAC GAC TTT TTT AGC ATG 582 Leu Leu Asp Pro Phe Asn Gly Leu Asp Pro Tyr Asp Phe Phe Ser Met 170 175 180
CAT GCT CAC GGC ATT TAT TAT GGG TTT TAC TGATTTGAAA TTAGACAGCG ATC 635 His Ala His Gly He Tyr Tyr Gly Phe Tyr 185 190
CGCATTATTT AGAGCGCTTG AAAGCGGGCA AAATCTCGCC CCC 67 £
(2) INFORMATION FOR SEQ ID NO: 820:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 192 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 820:
Met Val Asp Ala Phe Phe Gin He Ala Val Leu Leu Phe Ser Leu Phe
1 5 10 15
Leu Gly Ala Arg Leu Gly Gly Leu Gly Val Gly Tyr Ala Gly Gly Leu
20 25 30
Gly Val Leu He Leu Cys Leu Phe Leu Gly Leu Asn Pro Gly Lys He
35 40 45
Pro Phe Asp Val He Leu He He Met Ala Val He Ser Ala He Ser
50 55 60
Ala Met Gin Lys Ala Gly Gly Leu Asp Tyr Leu Val Lys He Ala Glu 65 70 75 80
Lys He Leu Arg Lys His Pro Lys Gin He Asn Tyr Leu Ala Pro Ser
85 90 95
Val Ala Tyr Cys Leu Thr He Leu Ala Gly Thr Gly His Thr Val Phe
100 105 110
Ser Leu He Pro Val He Val Glu Val Ser Gin Ser Gin Asn He Lys
115 120 125
Pro Lys Ala Pro Leu Ser Leu Ala Val Val Ser Ser Gin Val Ala He
130 135 140
Thr Ala Ser Pro Val Ser Ala Ala Xaa Gly Val Tyr Glu Arg His Phe 145 150 155 160
Arg Ala Phe Arg Ser Lys Leu Leu Asp Pro Phe Asn Gly Leu Asp Pro
165 170 175
Tyr Asp Phe Phe Ser Met His Ala His Gly He Tyr Tyr Gly Phe Tyr 180 185 190
(2) INFORMATION FOR SEQ ID NO: 821:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1038 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...1005 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 821:
AAAACTTTGA TATGAGAATA A ATG GAC TTT AAA AAT AAA AAA TGG CTT TTT 51
Met Asp Phe Lys Asn Lys Lys Trp Leu Phe 1 5 10
CTA GCC CCT TTA GCA GGC TAT ACG GAT TTG CCT TTC AGG AGC GTG GTG 99 Leu Ala Pro Leu Ala Gly Tyr Thr Asp Leu Pro Phe Arg Ser Val Val 15 20 25
AAA AAA TTT GGC GTG GAT GTT ACC ACG AGC GAA ATG GTG AGC TCG CAT 147 Lys Lys Phe Gly Val Asp Val Thr Thr Ser Glu Met Val Ser Ser His 30 35 40
TCG TTG GTG TAT GCG TTT GAT AAA ACT TCT AAA ATG TTG GAA AAA TCC 195 Ser Leu Val Tyr Ala Phe Asp Lys Thr Ser Lys Met Leu Glu Lys Ser 45 50 55
CCT TTA GAA GAT CAT TTC ATG GCG CAA ATT TCA GGC TCT AAA GAA AGC 243 Pro Leu Glu Asp His Phe Met Ala Gin He Ser Gly Ser Lys Glu Ser 60 65 70 GTA GTC AAA GAA GCG GTG GAG AAA ATC AAC GCT TTA GAG CAT GTG AAT 291 Val Val Lys Glu Ala Val Glu Lys He Asn Ala Leu Glu His Val Asn 75 80 85 90
GGG ATT GAT TTT AAT TGC GGT TGT CCC GCT CCT AAA GTG GCT AAT CAT 339 Gly He Asp Phe Asn Cys Gly Cys Pro Ala Pro Lys Val Ala Asn His 95 100 105
GGT AAT GGT AGT GGG TTA TTG AAG GAT TTA AAC CAC TTA GTG AAG CTT 387 Gly Asn Gly Ser Gly Leu Leu Lys Asp Leu Asn His Leu Val Lys Leu 110 115 120
TTA AAA ACC ATC AGA GAA AAC ACT AGT AAA AAA ATC ACA AGC GTG AAA 435 Leu Lys Thr He Arg Glu Asn Thr Ser Lys Lys He Thr Ser Val Lys 125 130 135
GTG CGT TTA GGC TTT GAA AAG AAA ATC CCT AAA GAA ATC GCT CAT GCC 483 Val Arg Leu Gly Phe Glu Lys Lys He Pro Lys Glu He Ala His Ala 140 145 150
CTA AAT GAC GCA CCG GTG GAT TAT GTG GTG GTG CAT GGG AGG ACA CGA 531 Leu Asn Asp Ala Pro Val Asp Tyr Val Val Val His Gly Arg Thr Arg 155 160 165 170
AGC GAT AAA TAC CAA AAA GAC AAA ATA GAT TAC GAA AGC ATC GCT TTA 579 Ser Asp Lys Tyr Gin Lys Asp Lys He Asp Tyr Glu Ser He Ala Leu 175 180 185
ATG AAA AAG ATT TTA AAA AAG CCG GTG ATA GCC AAT GGC GAA ATT GAC 627 Met Lys Lys He Leu Lys Lys Pro Val He Ala Asn Gly Glu He Asp 190 195 200
AGC GTG AAA AAG GCT TTT GAA GTT TTA CAA ATC ACT CAA GCG GAT GGG 675 Ser Val Lys Lys Ala Phe Glu Val Leu Gin He Thr Gin Ala Asp Gly 205 210 215
CTA ATG ATA GGG CGA GCG GCC TTA AGA GCC CCA TGG ATA TTT TGG CAA 723 Leu Met He Gly Arg Ala Ala Leu Arg Ala Pro Trp He Phe Trp Gin 220 225 230
ATC AGA AAC AAC ACC ACA AAA TTA CCC GCA GTC GTG AAA AAA GAC CTG 771 He Arg Asn Asn Thr Thr Lys Leu Pro Ala Val Val Lys Lys Asp Leu 235 240 245 250
GTT TTA GAA CAT TTT GAT AAA ATG GTG GAG TTT TAT GGG GAT ATG GGG 819 Val Leu Glu His Phe Asp Lys Met Val Glu Phe Tyr Gly Asp Met Gly 255 260 265
GTA ATC ATG TTT AGG AAA AAT TTG CAT GCT TAC GCT AAG GGC GAA ATG 867 Val He Met Phe Arg Lys Asn Leu His Ala Tyr Ala Lys Gly Glu Met 270 275 280
CAA GCG AGC GCG TTT CGT AAC TGC GTC AAT ACC CTT ACA GAA ATA AAG 915 Gin Ala Ser Ala Phe Arg Asn Cys Val Asn Thr Leu Thr Glu He Lys 285 290 295 AGC ATG CGA GAG AGC ATA GAG GAA TTT TTT AAT CAA GAA ATG TTG CAA 963 Ser Met Arg Glu Ser He Glu Glu Phe Phe Asn Gin Glu Met Leu Gin 300 305 310
AGT GAA GTG CCG TTA TGG GTA GAA TTG AAT CAA AAA AGC GTT TGAAAGCGC 1014 Ser Glu Val Pro Leu Trp Val Glu Leu Asn Gin Lys Ser Val 315 320 325
TTGTTTTTTT AGCCAGCTTG GGGG 103!
(2) INFORMATION FOR SEQ ID NO: 822:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 328 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 822:
Met Asp Phe Lys Asn Lys Lys Trp Leu Phe Leu Ala Pro Leu Ala Gly
1 5 10 15
Tyr Thr Asp Leu Pro Phe Arg Ser Val Val Lys Lys Phe Gly Val Asp
20 25 30
Val Thr Thr Ser Glu Met Val Ser Ser His Ser Leu Val Tyr Ala Phe
35 40 45
Asp Lys Thr Ser Lys Met Leu Glu Lys Ser Pro Leu Glu Asp His Phe
50 55 60
Met Ala Gin He Ser Gly Ser Lys Glu Ser Val Val Lys Glu Ala Val 65 70 75 80
Glu Lys He Asn Ala Leu Glu His Val Asn Gly He Asp Phe Asn Cys
85 90 95
Gly Cys Pro Ala Pro Lys Val Ala Asn His Gly Asn Gly Ser GI}. Leu
100 105 110
Leu Lys Asp Leu Asn His Leu Val Lys Leu Leu Lys Thr He Arg Glu
115 120 125
Asn Thr Ser Lys Lys He Thr Ser Val Lys Val Arg Leu Gly Phe Glu
130 135 140
Lys Lys He Pro Lys Glu He Ala His Ala Leu Asn Asp Ala Pro Val 145 150 155 160
Asp Tyr Val Val Val His Gly Arg Thr Arg Ser Asp Lys Tyr Gin Lys
165 170 175
Asp Lys He Asp Tyr Glu Ser He Ala Leu Met Lys Lys He Leu Lys
180 185 190
Lys Pro Val He Ala Asn Gly Glu He Asp Ser Val Lys Lys Ala Phe
195 200 205
Glu Val Leu Gin He Thr Gin Ala Asp Gly Leu Met He Gly Arg Ala
210 215 220
Ala Leu Arg Ala Pro Trp He Phe Trp Gin He Arg Asn Asn Thr Thr 225 230 235 240
Lys Leu Pro Ala Val Val Lys Lys Asp Leu Val Leu Glu His Phe Asp
245 250 255
Lys Met Val Glu Phe Tyr Gly Asp Met Gly Val He Met Phe Arg Lys 260 265 270
Asn Leu His Ala Tyr Ala Lys Gly Glu Met Gin Ala Ser Ala Phe Arg
275 280 285
Asn Cys Val Asn Thr Leu Thr Glu He Lys Ser Met Arg Glu Ser He
290 295 300
Glu Glu Phe Phe Asn Gin Glu Met Leu Gin Ser Glu Val Pro Leu Trp 305 310 315 320
Val Glu Leu Asn Gin Lys Ser Val 325
(2) INFORMATION FOR SEQ ID NO: 823:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1170 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 24...1130 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 823:
TCAAAAATTA AAGGTTAATT TTA ATG TTG CTT TTC ACT CCA GGC CCT GTA GCC 53
Met Leu Leu Phe Thr Pro Gly Pro Val Ala 1 5 10
ATT AAT GAA GAG ATG CGC ACA AGC TTT TCT CAG CCA ATG CCC CAC CAC 101 He Asn Glu Glu Met Arg Thr Ser Phe Ser Gin Pro Met Pro His His 15 20 25
CGC ACT AAA GAT TTT GAA AAG ATT TTC CAA AGC GTG CGA GAA AAT TTG 149 Arg Thr Lys Asp Phe Glu Lys He Phe Gin Ser Val Arg Glu Asn Leu 30 35 40
AAA AAA ATG ACC GGT TTA GAA GAG GTT TTG CTT CTA AGC AGC AGC GGG 197 Lys Lys Met Thr Gly Leu Glu Glu Val Leu Leu Leu Ser Ser Ser Gly 45 50 55
ACA GGG GCT ATG GAA GCG AGC GTG ATT TCC TTG TGT CAA AAA GAG TTG 245 Thr Gly Ala Met Glu Ala Ser Val He Ser Leu Cys Gin Lys Glu Leu 60 65 70
CTT TTT GTT AAT GCG GGC AAG TTT GGC GAA AGG TTT GGC AAG ATC GCT 293 Leu Phe Val Asn Ala Gly Lys Phe Gly Glu Arg Phe Gly Lys He Ala 75 80 85 90
AAA GCC CAT TCT ATC AAA GCC CAT GAA TTA GTC TAT GAA TGG GAC ACA 341 Lys Ala His Ser He Lys Ala His Glu Leu Val Tyr Glu Trp Asp Thr 95 100 105
CCA GCT CAA GTA GAT GAA ATA TTA AGC GTT CTT AAA GCC AAC CCT AAC 389 Pro Ala Gin Val Asp Glu He Leu Ser Val Leu Lys Ala Asn Pro Asn 110 115 120
ATT GAT GCG TTT TGC ATT CAA GCA TGC GAG TCT AGT GGG GGG TTA CGA 437 He Asp Ala Phe Cys He Gin Ala Cys Glu Ser Ser Gly Gly Leu Arg 125 130 135
CAC CCT GTG GAA AAA ATC GCT CAA GCG ATC AAA GAA ACT AAC CCG AAT 485 His Pro Val Glu Lys He Ala Gin Ala He Lys Glu Thr Asn Pro Asn 140 145 150
GTT TTT GTA ATT GTA GAT GCT ATC ACC GCT TTA GGG GTT GAG CCT TTA 533 Val Phe Val He Val Asp Ala He Thr Ala Leu Gly Val Glu Pro Leu 155 160 165 170
GAA ATA ACG CAT GTT GAT GCG CTC ATT GGA GGG AGT CAA AAA GCG TTC 581 Glu He Thr His Val Asp Ala Leu He Gly Gly Ser Gin Lys Ala Phe 175 180 185
ATG CTG CCT CCT GCG ATG AGC CTA GTC GCA TTG AGC CAG AAT GCA ATT 629 Met Leu Pro Pro Ala Met Ser Leu Val Ala Leu Ser Gin Asn Ala He 190 195 200
GAG CGT ATA GAA GAA CGC AAT GTG GGG TTT TAT TTC AAT TTA AAG AGC 677 Glu Arg He Glu Glu Arg Asn Val Gly Phe Tyr Phe Asn Leu Lys Ser 205 210 215
GAA TTG AAA AAC CAA AGG AAT AAC ACC ACA AGC TAC ACC GCT CCT ATT 725 Glu Leu Lys Asn Gin Arg Asn Asn Thr Thr Ser Tyr Thr Ala Pro He 220 225 230
TTA CAC ACT TTA GGG TTG CAA CGC TAT TTT GAA TTG GTG CAA AAT TTA 773 Leu His Thr Leu Gly Leu Gin Arg Tyr Phe Glu Leu Val Gin Asn Leu 235 240 245 250
GGG GGC TTT GAA GCG CTC TAT AGA GAG ACT AAA AAA GCC GCT TTG GCC 821 Gly Gly Phe Glu Ala Leu Tyr Arg Glu Thr Lys Lys Ala Ala Leu Ala 255 260 265
ACT CAA AAA GCC GTT TTA GCT TTA GGT TTA AAG ATT TTC CCT AAA AGC 869 Thr Gin Lys Ala Val Leu Ala Leu Gly Leu Lys He Phe Pro Lys Ser 270 275 280
CCA AGC TTG AGC ATG ACA ACG ATT GTT AAT GAG CAT GCC AAA GAA TTG 917 Pro Ser Leu Ser Met Thr Thr He Val Asn Glu His Ala Lys Glu Leu 285 290 295
AGA AAC CTT TTA AAA GAA AAA TAC CAG GTG CAA TTT GCG GGC GGT CAA 965 Arg Asn Leu Leu Lys Glu Lys Tyr Gin Val Gin Phe Ala Gly Gly Gin 300 305 310
GAG CCT TAT AAA GAT GCG CTC ATT CGT ATC AAC CAC ATG GGG ATC ATT 1013 Glu Pro Tyr Lys Asp Ala Leu He Arg He Asn His Met Gly He He 315 320 325 330
CCT GTT TAT AAA AGC GCT TAC GCT TTA AAC GCC CTA GAG TTA GCC CTA 1061 Pro Val Tyr Lys Ser Ala Tyr Ala Leu Asn Ala Leu Glu Leu Ala Leu 335 340 345
AAC GAC TTG GAT TTA AGG GAA TTT GAT GGC GTG GCG AAC GCA ACT TTT 1109 Asn Asp Leu Asp Leu Arg Glu Phe Asp Gly Val Ala Asn Ala Thr Phe 350 355 360
TTA AAG CAA TAT TAT GGA ATT TAAGGATCAC AATGCATTAT TCTTATGAAA CCTT 1164 Leu Lys Gin Tyr Tyr Gly He 365
TTTAAA 1170
(2) INFORMATION FOR SEQ ID NO: 824:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 369 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 824:
Met Leu Leu Phe Thr Pro Gly Pro Val Ala He Asn Glu Glu Met Arg
1 5 10 15
Thr Ser Phe Ser Gin Pro Met Pro His His Arg Thr Lys Asp Phe Glu
20 25 30
Lys He Phe Gin Ser Val Arg Glu Asn Leu Lys Lys Met Thr Gly Leu
35 40 45
Glu Glu Val Leu Leu Leu Ser Ser Ser Gly Thr Gly Ala Met Glu Ala
50 55 60
Ser Val He Ser Leu Cys Gin Lys Glu Leu Leu Phe Val Asn Ala Gly 65 70 75 80
Lys Phe Gly Glu Arg Phe Gly Lys He Ala Lys Ala His Ser He Lys
85 90 95
Ala His Glu Leu Val Tyr Glu Trp Asp Thr Pro Ala Gin Val Asp Glu
100 105 110
He Leu Ser Val Leu Lys Ala Asn Pro Asn He Asp Ala Phe Cys He
115 120 125
Gin Ala Cys Glu Ser Ser Gly Gly Leu Arg His Pro Val Glu Lys He
130 135 140
Ala Gin Ala He Lys Glu Thr Asn Pro Asn Val Phe Val He Val Asp 145 150 155 160
Ala He Thr Ala Leu Gly Val Glu Pro Leu Glu He Thr His Val Asp
165 170 175
Ala Leu He Gly Gly Ser Gin Lys Ala Phe Met Leu Pro Pro Ala Met
180 185 190
Ser Leu Val Ala Leu Ser Gin Asn Ala He Glu Arg He Glu Glu Arg 195 200 205 Asn Val Gly Phe Tyr Phe Asn Leu Lys Ser Glu Leu Lys Asn Gin Arg
210 215 220
Asn Asn Thr Thr Ser Tyr Thr Ala Pro He Leu His Thr Leu Gly Leu 225 230 235 240
Gin Arg Tyr Phe Glu Leu Val Gin Asn Leu Gly Gly Phe Glu Ala Leu
245 250 255
Tyr Arg Glu Thr Lys Lys Ala Ala Leu Ala Thr Gin Lys Ala Val Leu
260 265 270
Ala Leu Gly Leu Lys He Phe Pro Lys Ser Pro Ser Leu Ser Met Thr
275 280 285
Thr He Val Asn Glu His Ala Lys Glu Leu Arg Asn Leu Leu Lys Glu
290 295 300
Lys Tyr Gin Val Gin Phe Ala Gly Gly Gin Glu Pro Tyr Lys Asp Ala 305 310 315 320
Leu He Arg He Asn His Met Gly He He Pro Val Tyr Lys Ser Ala
325 330 335
Tyr Ala Leu Asn Ala Leu Glu Leu Ala Leu Asn Asp Leu Asp Leu Arg
340 345 350
Glu Phe Asp Gly Val Ala Asn Ala Thr Phe Leu Lys Gin Tyr Tyr Gly
355 360 365
He
(2) INFORMATION FOR SEQ ID NO: 825:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 285 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 46...270 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 825:
AAGAGGCATG ACAGCTCTTA CATTGTGATA GACGAATTAG TGGGC ATG TGG TTG GCG 57
Met Trp Leu Ala 1
ATG GCG ATT AGC GGG TTA TCG TTA GCG GGT GTG ATC TTG AGT TTT ATC 105 Met Ala He Ser Gly Leu Ser Leu Ala Gly Val He Leu Ser Phe He 5 10 15 20
TTT TTT AGG ATC TAT GAT ATT ACT AAA CCC TCA CTC ATT GGC AAG ATA 153 Phe Phe Arg He Tyr Asp He Thr Lys Pro Ser Leu He Gly Lys He 25 30 35
GAT AAA GAA GTT AAA GGG GGC TTA GGG GTT GTG GCT GAT GAC GCT TTA 201 Asp Lys Glu Val Lys Gly Gly Leu Gly Val Val Ala Asp Asp Ala Leu 40 45 50
GCG GGT GTT TTA GCC GGA TTG AGC GCG TTA TTA GTC ATC CAT ATT TTA 249
Ala Gly Val Leu Ala Gly Leu Ser Ala Leu Leu Val He His He Leu 55 60 65
GGA TTT TTT AAC ATT AAA CTT TAATTTTAAG AAAAT 285
Gly Phe Phe Asn He Lys Leu
70 75
(2) INFORMATION FOR SEQ ID NO: 826:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 75 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 826:
Met Trp Leu Ala Met Ala He Ser Gly Leu Ser Leu Ala Gly Val He
1 5 10 15
Leu Ser Phe He Phe Phe Arg He Tyr Asp He Thr Lys Pro Ser Leu
20 25 30
He Gly Lys He Asp Lys Glu Val Lys Gly Gly Leu Gly Val Val Ala
35 40 45
Asp Asp Ala Leu Ala Gly Val Leu Ala Gly Leu Ser Ala Leu Leu Val
50 55 60
He His He Leu Gly Phe Phe Asn He Lys Leu 65 70 75
(2) INFORMATION FOR SEQ ID NO: 827:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1021 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 70...957 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 827: AAAGATAGGA TTAAATATTT TATTTTTTTA GATGAAAACC ATCATTTTTA TTTGATTGAA 60 GAATCCAAC ATG CAT TCA AAA TAC TTC GCT CAA ATC AAA GAA AAA AAA TTA 111 Met His Ser Lys Tyr Phe Ala Gin He Lys Glu Lys Lys Leu 1 5 10
CCT CCC CTA ATC CTC ACA CAC AAT GGC TTG CTT AAA AAC TCA TTT TTA 159 Pro Pro Leu He Leu Thr His Asn Gly Leu Leu Lys Asn Ser Phe Leu 15 20 25 30
GGT GCT AAG ATT ATA GAA TTG CCT TTA GTG ATC AAT CTC GTG CAT GGG 207 Gly Ala Lys He He Glu Leu Pro Leu Val He Asn Leu Val His Gly 35 40 45
GGC GAT GGC GAA GAT GGG AAA TTA GCG AGC TTG TTA GAA TTT TAT CGT 255 Gly Asp Gly Glu Asp Gly Lys Leu Ala Ser Leu Leu Glu Phe Tyr Arg 50 55 60
ATC GCT TTT ATA GGC CCT AGG ATT GAA GCG AGC GTG CTG AGT TAT AAC 303 He Ala Phe He Gly Pro Arg He Glu Ala Ser Val Leu Ser Tyr Asn 65 70 75
AAA TAT TTA ACC AAG CTT TAC GCC AAA GAC TTA GGG GTA AAG ACT TTA 351 Lys Tyr Leu Thr Lys Leu Tyr Ala Lys Asp Leu Gly Val Lys Thr Leu 80 85 90
GAT CAT GTT CTT TTG AAT GAA AAA AAC CGC GCT AAC GCC TTG GAT TTG 399 Asp His Val Leu Leu Asn Glu Lys Asn Arg Ala Asn Ala Leu Asp Leu 95 100 105 110
ATG AAC TTT AAT TTC CCT TTC ATA ATC AAG CCT AAT AAC GCC GGA AGC 447 Met Asn Phe Asn Phe Pro Phe He He Lys Pro Asn Asn Ala Gly Ser 115 120 125
TCT TTA GGG GTG AAT GTT GTG AAA GAA GAA AAA GAA TTG GTT TAC GCT 495 Ser Leu Gly Val Asn Val Val Lys Glu Glu Lys Glu Leu Val Tyr Ala 130 135 140
TTA GAC GGT GCG TTT GAA TAT TCT AAA GAG GTC TTG ATA GAG CCT TTC 543 Leu Asp Gly Ala Phe Glu Tyr Ser Lys Glu Val Leu He Glu Pro Phe 145 150 155
ATT CAG GGA GTG AAA GAA TAC AAT TTG GCC GGT TGC AAG ATC AAA AAG 591 He Gin Gly Val Lys Glu Tyr Asn Leu Ala Gly Cys Lys He Lys Lys 160 165 170
GAT TTT TGT TTT TCC TAT GTG GAA GAG CCT AAC AAA CAG GAA TTT TTA 639 Asp Phe Cys Phe Ser Tyr Val Glu Glu Pro Asn Lys Gin Glu Phe Leu 175 180 185 190
GAT TTC AAA CAA AAA TAT TTG GAT TTT TCA CGC AAT AAA GCC CCT AAA 687 Asp Phe Lys Gin Lys Tyr Leu Asp Phe Ser Arg Asn Lys Ala Pro Lys 195 200 205
GCG AAT CTT TCT AAC GCC CTA GAA GAG CAA TTA AAA GAA AAT TTT AAA 735 Ala Asn Leu Ser Asn Ala Leu Glu Glu Gin Leu Lys Glu Asn Phe Lys 210 215 220 AAA CTC TAT AAC GAT TTG TTT GAT GGC GCG ATC ATT CGT TGC GAT TTT 783 Lys Leu Tyr Asn Asp Leu Phe Asp Gly Ala He He Arg Cys Asp Phe 225 230 235
TTT GTC ATA AAA AAT GAA GTG TAT CTT AAT GAG ATC AAC CCC ATT CCT 831 Phe Val He Lys Asn Glu Val Tyr Leu Asn Glu He Asn Pro He Pro 240 245 250
GGC AGT TTG GCC AAT TAT TTG TTT GAT GAT TTT AAA ACA ACG CTA GAA 879 Gly Ser Leu Ala Asn Tyr Leu Phe Asp Asp Phe Lys Thr Thr Leu Glu 255 260 265 270
AAT TTA GCG CAA TCA TTA CCC AAA ACC CCT AAG ATC CAA ATC AAA AAC 927 Asn Leu Ala Gin Ser Leu Pro Lys Thr Pro Lys He Gin He Lys Asn 275 280 285
TCT TAT TTG TTG CAA ATC CAA AAG AAT AAG TAATGGCCAA ACGCAGTATC GCT 980 Ser Tyr Leu Leu Gin He Gin Lys Asn Lys 290 295
TATTTGGATA GCGTTTTTGA CATTTCCTAC ACTTTTATAG A 1021
(2) INFORMATION FOR SEQ ID NO: 828:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 296 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 828:
Met His Ser Lys Tyr Phe Ala Gin He Lys Glu Lys Lys Leu Pro Pro
1 5 10 15
Leu He Leu Thr His Asn Gly Leu Leu Lys Asn Ser Phe Leu Gly Ala
20 25 30
Lys He He Glu Leu Pro Leu Val He Asn Leu Val His Gly Gly Asp
35 40 45
Gly Glu Asp Gly Lys Leu Ala Ser Leu Leu Glu Phe Tyr Arg He Ala
50 55 60
Phe He Gly Pro Arg He Glu Ala Ser Val Leu Ser Tyr Asn Lys Tyr 65 70 75 80
Leu Thr Lys Leu Tyr Ala Lys Asp Leu Gly Val Lys Thr Leu Asp His
85 90 95
Val Leu Leu Asn Glu Lys Asn Arg Ala Asn Ala Leu Asp Leu Met Asn
100 105 110
Phe Asn Phe Pro Phe He He Lys Pro Asn Asn Ala Gly Ser Ser Leu
115 120 125
Gly Val Asn Val Val Lys Glu Glu Lys Glu Leu Val Tyr Ala Leu Asp
130 135 140
Gly Ala Phe Glu Tyr Ser Lys Glu Val Leu He Glu Pro Phe He Gin 145 150 155 160
Gly Val Lys Glu Tyr Asn Leu Ala Gly Cys Lys He Lys Lys Asp Phe 165 170 175
Cys Phe Ser Tyr Val Glu Glu Pro Asn Lys Gin Glu Phe Leu Asp Phe
180 185 190
Lys Gin Lys Tyr Leu Asp Phe Ser Arg Asn Lys Ala Pro Lys Ala Asn
195 200 205
Leu Ser Asn Ala Leu Glu Glu Gin Leu Lys Glu Asn Phe Lys Lys Leu
210 215 220
Tyr Asn Asp Leu Phe Asp Gly Ala He He Arg Cys Asp Phe Phe Val 225 230 235 240
He Lys Asn Glu Val Tyr Leu Asn Glu He Asn Pro He Pro Gly Ser
245 250 255
Leu Ala Asn Tyr Leu Phe Asp Asp Phe Lys Thr Thr Leu Glu Asn Leu
260 265 270
Ala Gin Ser Leu Pro Lys Thr Pro Lys He Gin He Lys Asn Ser Tyr
275 280 285
Leu Leu Gin He Gin Lys Asn Lys 290 295
(2) INFORMATION FOR SEQ ID NO: 829:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 686 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 47...628 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 829:
CAAAGCCTAT GCGAGAGTCA ATTCCGTTGC ATTTTCATTT AGAGGC ATG GAA GTC 55
Met Glu Val
1
CCC ATT GAA GGT TTA GAA GAA TTG GTA GAT GAA ACG AAA AAA TGC TTG 103 Pro He Glu Gly Leu Glu Glu Leu Val Asp Glu Thr Lys Lys Cys Leu 5 10 15
ATA GAA GCT AAG AAA AAC AAA CAA AAC CAT TTC TTG CTG ATT CAA AAA 151 He Glu Ala Lys Lys Asn Lys Gin Asn His Phe Leu Leu He Gin Lys 20 25 30 35
GCT AAC ATC CAA GCA AGA AAA CAA GCC ATG ATA GAT GAA AGT AAA ACC 199 Ala Asn He Gin Ala Arg Lys Gin Ala Met He Asp Glu Ser Lys Thr 40 45 50
ATT ATC CAT GTT GCA TCA GGA GCG GCT GGA GCG GCC GGG CTT ATC CCC 247 He He His Val Ala Ser Gly Ala Ala Gly Ala Ala Gly Leu He Pro 55 60 65
ATA CCC TTT AGC GAT GCA CTC GCT ATC GCG CCC ATT CAA GCA GGA ATG 295 He Pro Phe Ser Asp Ala Leu Ala He Ala Pro He Gin Ala Gly Met 70 75 80
ATC TAC AAA ATG AAT GAC GCT TTT GGA ATG GAT TTG GAT AAA TCT GTA 343 He Tyr Lys Met Asn Asp Ala Phe Gly Met Asp Leu Asp Lys Ser Val 85 90 95
GCC GCA TCA TTA ATC ACC GGA TTG TTA GGC GTA ACC GCT GTC GCG CAA 391 Ala Ala Ser Leu He Thr Gly Leu Leu Gly Val Thr Ala Val Ala Gin 100 105 110 115
GTG GGG AGA ACG CTT GTT AAT GGT TTC CTT AAA TTC ATT CCT GTT GTG 439 Val Gly Arg Thr Leu Val Asn Gly Phe Leu Lys Phe He Pro Val Val 120 125 130
GGG AGT GTT GCA GGG GGC ACA ACC GCT GTA ATT ATC ACA GAA GGC ATT 487 Gly Ser Val Ala Gly Gly Thr Thr Ala Val He He Thr Glu Gly He 135 140 145
GGG TTT GCG TAT TTG AAA GTG CTA GAA AAG TGC TTT AAT GAT GAG ACG 535 Gly Phe Ala Tyr Leu Lys Val Leu Glu Lys Cys Phe Asn Asp Glu Thr 150 155 160
GGC GAA GTC AAT TTG CCT GAT GAA GTT GGC ATG ATA ACT TCT CTC TTT 583 Gly Glu Val Asn Leu Pro Asp Glu Val Gly Met He Thr Ser Leu Phe 165 170 175
AAG GAG AAT TAT CTC AAC TTG GAT ACA ATC AAG AAA TTA ACA CAA TAAGA 633 Lys Glu Asn Tyr Leu Asn Leu Asp Thr He Lys Lys Leu Thr Gin 180 185 190
TTAGGGGTTA TGAAAAACGC ATGGCATTAG ACAAAAGGAT TTGGATGCAT TTT 686
(2) INFORMATION FOR SEQ ID NO: 830:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 194 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 830:
Met Glu Val Pro He Glu Gly Leu Glu Glu Leu Val Asp Glu Thr Lys
1 5 10 15
Lys Cys Leu He Glu Ala Lys Lys Asn Lys Gin Asn His Phe Leu Leu
20 25 30
He Gin Lys Ala Asn He Gin Ala Arg Lys Gin Ala Met He Asp Glu
35 40 45
Ser Lys Thr He He His Val Ala Ser Gly Ala Ala Gly Ala Ala Gly 50 55 60
Leu He Pro He Pro Phe Ser Asp Ala Leu Ala He Ala Pro He Gin 65 70 75 80
Ala Gly Met He Tyr Lys Met Asn Asp Ala Phe Gly Met Asp Leu Asp
85 90 95
Lys Ser Val Ala Ala Ser Leu He Thr Gly Leu Leu Gly Val Thr Ala
100 105 110
Val Ala Gin Val Gly Arg Thr Leu Val Asn Gly Phe Leu Lys Phe He
115 120 125
Pro Val Val Gly Ser Val Ala Gly Gly Thr Thr Ala Val He He Thr
130 135 140
Glu Gly He Gly Phe Ala Tyr Leu Lys Val Leu Glu Lys Cys Phe Asn 145 150 155 160
Asp Glu Thr Gly Glu Val Asn Leu Pro Asp Glu Val Gly Met He Thr
165 170 175
Ser Leu Phe Lys Glu Asn Tyr Leu Asn Leu Asp Thr He Lys Lys Leu
180 185 190
Thr Gin
(2) INFORMATION FOR SEQ ID NO: 831:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 900 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 6...821 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 831:
AGAAG ATG GGG AAG TTT ATG AAT ACT CTT AAA AAG CAT TTA GCC TTT ATC 50
Met Gly Lys Phe Met Asn Thr Leu Lys Lys His Leu Ala Phe He
1 5 10 15
ATT CCC CTA GTA GCG TTA TTG TTT AGC TTG GAG TGC GTG TTA TTT ATC 98 He Pro Leu Val Ala Leu Leu Phe Ser Leu Glu Cys Val Leu Phe He 20 25 30
AAT CAA GCG ATC GAA CAG AAA GAA AAA AAA TTG ATT GAA GAT TAT TCG 146 Asn Gin Ala He Glu Gin Lys Glu Lys Lys Leu He Glu Asp Tyr Ser 35 40 45
GTC GTG TTG GCC AGC ACG CAA AAA TTA AAC TTG GAA TTG TTG CGT CAA 194 Val Val Leu Ala Ser Thr Gin Lys Leu Asn Leu Glu Leu Leu Arg Gin 50 55 60 AAT TTT AGC GAA ATC ATA GCG TTA AAA GAA ATT GAT CCT AAT TAT TCT 242 Asn Phe Ser Glu He He Ala Leu Lys Glu He Asp Pro Asn Tyr Ser 65 70 75
TTA GAA CCT CTT CAA AAA ACC TTA GGC ATA GAT GGG CTT AAG GAA TTA 290 Leu Glu Pro Leu Gin Lys Thr Leu Gly He Asp Gly Leu Lys Glu Leu 80 85 90 95
AGA AAA AAT TTG CCC TTT TTT TAT TCT TTA CAA CTT TCC ACA TTC CCC 338 Arg Lys Asn Leu Pro Phe Phe Tyr Ser Leu Gin Leu Ser Thr Phe Pro 100 105 110
ACT CAA GAG CGT TTA GAA AAC ATT AAA GAA AAA TTG CTC AAA ATC CCT 386 Thr Gin Glu Arg Leu Glu Asn He Lys Glu Lys Leu Leu Lys He Pro 115 120 125
GGC GTT CAA AAA GTT GAA GTC TTT GCC AAA ACT TAC ATG CAA GTG TAT 434 Gly Val Gin Lys Val Glu Val Phe Ala Lys Thr Tyr Met Gin Val Tyr 130 135 140
GAT CTC TTG AGT TTT ATT AAA ACA GCG GTC TAT ATC TTT GCG TTA GTG 482 Asp Leu Leu Ser Phe He Lys Thr Ala Val Tyr He Phe Ala Leu Val 145 150 155
GTC TTT GTT TTA TCG GTT TTA TTG ATG TTT AAA CAA GTC CGC ATC TGG 530 Val Phe Val Leu Ser Val Leu Leu Met Phe Lys Gin Val Arg He Trp 160 165 170 175
ATC TAT CAA TAC CAT GAG AGA TTA GAG ATC ATG GAT TTA TTA GGG GCT 578 He Tyr Gin Tyr His Glu Arg Leu Glu He Met Asp Leu Leu Gly Ala 180 185 190
TCG GTG TCT TTT AAA AAC GGG TTT TTG TAT AAA ATA GCT TTA ATG GAT 626 Ser Val Ser Phe Lys Asn Gly Phe Leu Tyr Lys He Ala Leu Met Asp 195 200 205
TCT GTA ATC GCT AGT TTT TTA GCC CCC ATG CTC ATG CTC TAT ACC ACT 674 Ser Val He Ala Ser Phe Leu Ala Pro Met Leu Met Leu Tyr Thr Thr 210 215 220
TCG CAA AAA GGT TTT GAA AAA ACG ATG GAT ACT TTG GGT ATT ATA GGA 722 Ser Gin Lys Gly Phe Glu Lys Thr Met Asp Thr Leu Gly He He Gly 225 230 235
GGC GCG TTT GTT TTA AAC CAT TTT TTA TGG GGA CTG CTT TTT AGC CTT 770 Gly Ala Phe Val Leu Asn His Phe Leu Trp Gly Leu Leu Phe Ser Leu 240 245 250 255
GTG GTC TCA TTT GTT TCT GTT TTA CTT GTA GCT TGG AGG ACT AGG CAT 818 Val Val Ser Phe Val Ser Val Leu Leu Val Ala Trp Arg Thr Arg His 260 265 270
GTA TAAATTAGGG GTGTTTTTGT TAGCCACCTT ACTATCAGCT AACACGCAAA AAGTGA 877 Val GCGATATTGC TAAAGATATC CAA 900
(2) INFORMATION FOR SEQ ID NO: 832:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 272 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 832:
Met Gly Lys Phe Met Asn Thr Leu Lys Lys His Leu Ala Phe He He
1 5 10 15
Pro Leu Val Ala Leu Leu Phe Ser Leu Glu Cys Val Leu Phe He Asn
20 25 30
Gin Ala He Glu Gin Lys Glu Lys Lys Leu He Glu Asp Tyr Ser Val
35 40 45
Val Leu Ala Ser Thr Gin Lys Leu Asn Leu Glu Leu Leu Arg Gin Asn
50 55 60
Phe Ser Glu He He Ala Leu Lys Glu He Asp Pro Asn Tyr Ser Leu 65 70 75 80
Glu Pro Leu Gin Lys Thr Leu Gly He Asp Gly Leu Lys Glu Leu Arg
85 90 95
Lys Asn Leu Pro Phe Phe Tyr Ser Leu Gin Leu Ser Thr Phe Pro Thr
100 105 110
Gin Glu Arg Leu Glu Asn He Lys Glu Lys Leu Leu Lys He Pro Gly
115 120 125
Val Gin Lys Val Glu Val Phe Ala Lys Thr Tyr Met Gin Val Tyr Asp
130 135 140
Leu Leu Ser Phe He Lys Thr Ala Val Tyr He Phe Ala Leu Val Val 145 150 155 160
Phe Val Leu Ser Val Leu Leu Met Phe Lys Gin Val Arg He Trp He
165 170 175
Tyr Gin Tyr His Glu Arg Leu Glu He Met Asp Leu Leu Gly Ala Ser
180 185 190
Val Ser Phe Lys Asn Gly Phe Leu Tyr Lys He Ala Leu Met Asp Ser
195 200 205
Val He Ala Ser Phe Leu Ala Pro Met Leu Met Leu Tyr Thr Thr Ser
210 215 220
Gin Lys Gly Phe Glu Lys Thr Met Asp Thr Leu Gly He He Gly Gly 225 230 235 240
Ala Phe Val Leu Asn His Phe Leu Trp Gly Leu Leu Phe Ser Leu Val
245 250 255
Val Ser Phe Val Ser Val Leu Leu Val Ala Trp Arg Thr Arg His Val 260 265 270
(2) INFORMATION FOR SEQ ID NO:833:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 701 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...672 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 833:
AGG TTG TCT GAA CCC ATA GAT AGA TTC ACG CGC ATA AGG TGG TTG TTT 48 Arg Leu Ser Glu Pro He Asp Arg Phe Thr Arg He Arg Trp Leu Phe 1 5 10 15
AAA AAC GAT TTT GAA AAA ATC CGC CAA CAA AGG GTT TTA ATC TGT GGC 96 Lys Asn Asp Phe Glu Lys He Arg Gin Gin Arg Val Leu He Cys Gly 20 25 30
GTG GGG GGC GTT GGG GGC TTT GCG CTA GAC GCT TTG TAT CGT GTG GGG 144 Val Gly Gly Val Gly Gly Phe Ala Leu Asp Ala Leu Tyr Arg Val Gly 35 40 45
ATA GGG CAA ATC ACT ATC ATT GAT AAA GAC GTG TTT GAT GTT ACC AAT 192 He Gly Gin He Thr He He Asp Lys Asp Val Phe Asp Val Thr Asn 50 55 60
CAA AAC CGC CAG ATT GGC TCA GAA AGG ATA GGA GAA TCT AAA GTG TTG 240 Gin Asn Arg Gin He Gly Ser Glu Arg He Gly Glu Ser Lys Val Leu 65 70 75 80
GTG TTG CAA GAT CTC TAT AAG GGC ATT CAA GCT TTG AAC TTG CAT ATA 288 Val Leu Gin Asp Leu Tyr Lys Gly He Gin Ala Leu Asn Leu His He 85 90 95
GAT GAA GCG TTT TTA AAT TCA TTT AAT TTT AGA GAT TAT GAT TAC ATT 336 Asp Glu Ala Phe Leu Asn Ser Phe Asn Phe Arg Asp Tyr Asp Tyr He 100 105 110
TTA GAT TGC ATG GAC GAT TTG CCT ATT AAA ACA AGC TTA GCG ATA AAA 384 Leu Asp Cys Met Asp Asp Leu Pro He Lys Thr Ser Leu Ala He Lys 115 120 125
TGC CAG AAT TTC GCT TAC GGA AAA TTT ATC AGC TCT ATG GGG AGT GCG 432 Cys Gin Asn Phe Ala Tyr Gly Lys Phe He Ser Ser Met Gly Ser Ala 130 135 140
AAA CGC TTG AAC CCT AAA CAC ATC CAA GTG GGG AGC GTG TGG GAA AGC 480 Lys Arg Leu Asn Pro Lys His He Gin Val Gly Ser Val Trp Glu Ser 145 150 155 160
TAT GGC GAT AAA TTC GGG CGT AAA TTT AGG GAT TTT TTA AAA AAA CGC 528 Tyr Gly Asp Lys Phe Gly Arg Lys Phe Arg Asp Phe Leu Lys Lys Arg 165 170 175
CGT TTT AAA GGG GAT TTT AAA GTG GTT TTT AGC CCT GAA ATT CCG CAT 576 Arg Phe Lys Gly Asp Phe Lys Val Val Phe Ser Pro Glu He Pro His 180 185 190
TGC ATA GAG CTT GGG AGT TTT AAT GCG GTT ACG GCG AGT TTT GGT TTG 624 Cys He Glu Leu Gly Ser Phe Asn Ala Val Thr Ala Ser Phe Gly Leu 195 200 205
CAA ATA GCG AGT GAA GTC GTG CAA GAC ATT ATC AAC GAT AAA AGG AAG T 673 Gin He Ala Ser Glu Val Val Gin Asp He He Asn Asp Lys Arg Lys 210 215 220
GAGATGAAAG ATTACGAAGA CGAATTGG 701
(2) INFORMATION FOR SEQ ID NO: 834:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 224 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 834:
Arg Leu Ser Glu Pro He Asp Arg Phe Thr Arg He Arg Trp Leu Phe
1 5 10 15
Lys Asn Asp Phe Glu Lys He Arg Gin Gin Arg Val Leu He Cys Gly
20 25 30
Val Gly Gly Val Gly Gly Phe Ala Leu Asp Ala Leu Tyr Arg Val Gly
35 40 45
He Gly Gin He Thr He He Asp Lys Asp Val Phe Asp Val Thr Asn
50 55 60
Gin Asn Arg Gin He Gly Ser Glu Arg He Gly Glu Ser Lys Val Leu 65 70 75 80
Val Leu Gin Asp Leu Tyr Lys Gly He Gin Ala Leu Asn Leu His He
85 90 95
Asp Glu Ala Phe Leu Asn Ser Phe Asn Phe Arg Asp Tyr Asp Tyr He
100 105 110
Leu Asp Cys Met Asp Asp Leu Pro He Lys Thr Ser Leu Ala He Lys
115 120 125
Cys Gin Asn Phe Ala Tyr Gly Lys Phe He Ser Ser Met Gly Ser Ala
130 135 140
Lys Arg Leu Asn Pro Lys His He Gin Val Gly Ser Val Trp Glu Ser 145 150 155 160
Tyr Gly Asp Lys Phe Gly Arg Lys Phe Arg Asp Phe Leu Lys Lys Arg
165 170 175
Arg Phe Lys Gly Asp Phe Lys Val Val Phe Ser Pro Glu He Pro His
180 185 190
Cys He Glu Leu Gly Ser Phe Asn Ala Val Thr Ala Ser Phe Gly Leu
195 200 205
Gin He Ala Ser Glu Val Val Gin Asp He He Asn Asp Lys Arg Lys 210 215 220
(2) INFORMATION FOR SEQ ID NO: 835:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1260 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 73...1236 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:835:
CTCTCGGTTT TTTTTGTGGG TAAGATTTCG CACCATCACA TTGTGGCTTT AGGGGTGGGC 60 TTGCAATTTT TG ATG CTT TTT TAT GGC ATC AAC ACG ATT TTA TAC ACC GGC 111 Met Leu Phe Tyr Gly He Asn Thr He Leu Tyr Thr Gly 1 5 10
ACT AAC GCC ATT CTT TCT AGG CTT GTG GGG GCT AGG GAT TTT ACT CAA 159 Thr Asn Ala He Leu Ser Arg Leu Val Gly Ala Arg Asp Phe Thr Gin 15 20 25
ATC AAC CAC GCT TTT TCC AGT ATT TTC ATA GGG GCT TTT ATG ATC TGT 207 He Asn His Ala Phe Ser Ser He Phe He Gly Ala Phe Met He Cys 30 35 40 45
TTG GGC GTG CTG TTT GTT TCT TAT TTT TTG ATT GAG CCT TTT TTA AAT 255 Leu Gly Val Leu Phe Val Ser Tyr Phe Leu He Glu Pro Phe Leu Asn 50 55 60
TGG ATG CAA TTA CAA GAT CCT TCG CGC CAA TTG ACG CAA GAT TAT TTA 303 Trp Met Gin Leu Gin Asp Pro Ser Arg Gin Leu Thr Gin Asp Tyr Leu 65 70 75
GAA GTC TTA GTT GTA GCG CTA CCG AGT ATT TTT TTA AAA AAT ATT TTA 351 Glu Val Leu Val Val Ala Leu Pro Ser He Phe Leu Lys Asn He Leu 80 85 90
GTT TCA GCG CTC GCT AGT TTT TCA GAC ACC CTA ACC CCC TTT ATT GTC 399 Val Ser Ala Leu Ala Ser Phe Ser Asp Thr Leu Thr Pro Phe He Val 95 100 105
AAA ATC ATC ATG GTC ATT GCA TGC ATT TTT TTG AAT CAA GCC TTG ATT 447 Lys He He Met Val He Ala Cys He Phe Leu Asn Gin Ala Leu He 110 115 120 125
TTT GGG GAT TTT GGT TTT AAA GAA ATG GGG ATT GTA GGC TCT GCT TTA 495 Phe Gly Asp Phe Gly Phe Lys Glu Met Gly He Val Gly Ser Ala Leu 130 135 140
GCG AAT GTG GTT GTC TCT TAT TTG GAA TTA CTC GCA CTT GGC GTT TGG 543 Ala Asn Val Val Val Ser Tyr Leu Glu Leu Leu Ala Leu Gly Val Trp 145 150 155
ATA CAA ATC AAA AAA ATC CCT TTA AAA TTC AAA ATA ACC TTT CAT TTT 591 He Gin He Lys Lys He Pro Leu Lys Phe Lys He Thr Phe His Phe 160 165 170
TCT TTT TTA AAA ACC ATG TTT AGA GTG GGT TGG CCA GCC GGG TTT GAG 639 Ser Phe Leu Lys Thr Met Phe Arg Val Gly Trp Pro Ala Gly Phe Glu 175 180 185
CGC TTA TTG AGT TTA TTT TCT TTA ATC CTC TTA TCC AAA TTT GTA GCG 687 Arg Leu Leu Ser Leu Phe Ser Leu He Leu Leu Ser Lys Phe Val Ala 190 195 200 205
AGC TAT GGG GAT AAA GTG TTA GCG GGC ATG CAA ATA GGC ATT AGG GTT 735 Ser Tyr Gly Asp Lys Val Leu Ala Gly Met Gin He Gly He Arg Val 210 215 220
GAA ACC TTT TCG TTC ATG CCC GGA TTT GGG TTT ATG ATC GCA GCG ATG 783 Glu Thr Phe Ser Phe Met Pro Gly Phe Gly Phe Met He Ala Ala Met 225 230 235
GTT TTA ACA GGG CAA AAT TTA GGG GCA AAC AAG CCA AAG ATC GCC ACA 831 Val Leu Thr Gly Gin Asn Leu Gly Ala Asn Lys Pro Lys He Ala Thr 240 245 250
GAA TAC GCG CAT TTG ATT TTA AAA ATC TCT ATG GGT TTA ATG GGG GTT 879 Glu Tyr Ala His Leu He Leu Lys He Ser Met Gly Leu Met Gly Val 255 260 265
TTA GGG ATT GTT TTA GTC TTA TTC GCT AAA GAA TTT GCG AGC CTT TTT 927 Leu Gly He Val Leu Val Leu Phe Ala Lys Glu Phe Ala Ser Leu Phe 270 275 280 285
TCT CAA GAT GAA GAA GTC TTG GAA GTG GCG CGA TCT TAT TTG ATC GCT 975 Ser Gin Asp Glu Glu Val Leu Glu Val Ala Arg Ser Tyr Leu He Ala 290 295 300
GTG GGC CTC TCT CAA GCC CCC TTA ATT GGG TAT TTT GTG CTA GAT GGA 1023 Val Gly Leu Ser Gin Ala Pro Leu He Gly Tyr Phe Val Leu Asp Gly 305 310 315
GTT TTT AGA GGG GCT GGC ATT TCT AAA GTC TCA CTG TAT ATT AAC ACC 1071 Val Phe Arg Gly Ala Gly He Ser Lys Val Ser Leu Tyr He Asn Thr 320 325 330
CTA AGC TTA TGG GGG TTA AGG ATC ATG CCC ATT TAC TTG CTT TTA ATT 1119 Leu Ser Leu Trp Gly Leu Arg He Met Pro He Tyr Leu Leu Leu He 335 340 345 CAT CAT TTT AAG GTG GAA TTT ATT TTT GTA GTG ATC GCA TCA GAA ACT 1167 His His Phe Lys Val Glu Phe He Phe Val Val He Ala Ser Glu Thr 350 355 360 365
TTT TTG CGC TCA TTC ATC TAT TAT AAA GTT TTT TCT AAA GGC ATT TGG 1215 Phe Leu Arg Ser Phe He Tyr Tyr Lys Val Phe Ser Lys Gly He Trp 370 375 380
AAA AGG TGC GGG AAA AAG GCT TGATTATTGC TTGAGCGTAG CGGT 1260
Lys Arg Cys Gly Lys Lys Ala 385
(2) INFORMATION FOR SEQ ID NO: 836:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 388 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 836:
Met Leu Phe Tyr Gly He Asn Thr He Leu Tyr Thr Gly Thr Asn Ala
1 5 10 15
He Leu Ser Arg Leu Val Gly Ala Arg Asp Phe Thr Gin He Asn His
20 25 30
Ala Phe Ser Ser He Phe He Gly Ala Phe Met He Cys Leu Gly Val
35 40 45
Leu Phe Val Ser Tyr Phe Leu He Glu Pro Phe Leu Asn Trp Met Gin
50 55 60
Leu Gin Asp Pro Ser Arg Gin Leu Thr Gin Asp Tyr Leu Glu Val Leu 65 70 75 80
Val Val Ala Leu Pro Ser He Phe Leu Lys Asn He Leu Val Ser Ala
85 90 95
Leu Ala Ser Phe Ser Asp Thr Leu Thr Pro Phe He Val Lys He He
100 105 110
Met Val He Ala Cys He Phe Leu Asn Gin Ala Leu He Phe Gly Asp
115 120 125
Phe Gly Phe Lys Glu Met Gly He Val Gly Ser Ala Leu Ala Asn Val
130 135 140
Val Val Ser Tyr Leu Glu Leu Leu Ala Leu Gly Val Trp He Gin He 145 150 155 160
Lys Lys He Pro Leu Lys Phe Lys He Thr Phe His Phe Ser Phe Leu
165 170 175
Lys Thr Met Phe Arg Val Gly Trp Pro Ala Gly Phe Glu Arg Leu Leu
180 185 190
Ser Leu Phe Ser Leu He Leu Leu Ser Lys Phe Val Ala Ser Tyr Gly
195 200 205
Asp Lys Val Leu Ala Gly Met Gin He Gly He Arg Val Glu Thr Phe
210 215 220
Ser Phe Met Pro Gly Phe Gly Phe Met He Ala Ala Met Val Leu Thr 225 230 235 240 Gly Gin Asn Leu Gly Ala Asn Lys Pro Lys He Ala Thr Glu Tyr Ala
245 250 255
His Leu He Leu Lys He Ser Met Gly Leu Met Gly Val Leu Gly He
260 265 270
Val Leu Val Leu Phe Ala Lys Glu Phe Ala Ser Leu Phe Ser Gin Asp
275 280 285
Glu Glu Val Leu Glu Val Ala Arg Ser Tyr Leu He Ala Val Gly Leu
290 295 300
Ser Gin Ala Pro Leu He Gly Tyr Phe Val Leu Asp Gly Val Phe Arg 305 310 315 320
Gly Ala Gly He Ser Lys Val Ser Leu Tyr He Asn Thr Leu Ser Leu
325 330 335
Trp Gly Leu Arg He Met Pro He Tyr Leu Leu Leu He His His Phe
340 345 350
Lys Val Glu Phe He Phe Val Val He Ala Ser Glu Thr Phe Leu Arg
355 360 365
Ser Phe He Tyr Tyr Lys Val Phe Ser Lys Gly He Trp Lys Arg Cys
370 375 380
Gly Lys Lys Ala 385
(2) INFORMATION FOR SEQ ID NO: 837:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1327 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...1305 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:837:
TAAACATCTA AAGGATAAAA C ATG CCA TTG CCA TTT ATT ATA GCA GCT GGA 51
Met Pro Leu Pro Phe He He Ala Ala Gly 1 5 10
GTG GCC TTA GTG GCC GCA GNA TAC GGA GTT AAA AAA AAA GTT GAT GCA 99 Val Ala Leu Val Ala Ala Xaa Tyr Gly Val Lys Lys Lys Val Asp Ala 15 20 25
GAC ATT CTC AGT GAA GAG ACC AAT GAA TAT ATT AAG TAT ATC AAT GAA 147 Asp He Leu Ser Glu Glu Thr Asn Glu Tyr He Lys Tyr He Asn Glu 30 35 40
GGC AAT GAC TTG CTA GAG GAA GCA GAA GAA GTT ATT AAA GCT GTG GCT 195 Gly Asn Asp Leu Leu Glu Glu Ala Glu Glu Val He Lys Ala Val Ala 45 50 55 TCT GAT TGT GAG TTT GCT CTT GCG AGA TTT GAA GAG AAA AGG TGC TAT 243 Ser Asp Cys Glu Phe Ala Leu Ala Arg Phe Glu Glu Lys Arg Cys Tyr 60 65 70
ATT AGA AAT CAT GTA ATT TCA GAA TTT TTG CAC CAT TTT AAT CAA TTA 291 He Arg Asn His Val He Ser Glu Phe Leu His His Phe Asn Gin Leu 75 80 85 90
GAA GGA TTC GAG CTT ACC AAC AAA AAA GAT AGC ATG GAA AAT ATC CAA 339 Glu Gly Phe Glu Leu Thr Asn Lys Lys Asp Ser Met Glu Asn He Gin 95 100 105
CTC GAT GTA TCA AAT ACA CTA AAA ATT ATT GAT AAA AAT CTC AAG ATG 387 Leu Asp Val Ser Asn Thr Leu Lys He He Asp Lys Asn Leu Lys Met 110 115 120
AGC TCT TTT GAC ACC CTT GGT GCC GTT GGA AAT GTT GTG GGA GGT TTT 435 Ser Ser Phe Asp Thr Leu Gly Ala Val Gly Asn Val Val Gly Gly Phe 125 130 135
TCT ATG GGA TTT GGT TTG GCT GCT GGA GGT ATA GTT GGA AGT GTA GGG 483 Ser Met Gly Phe Gly Leu Ala Ala Gly Gly He Val Gly Ser Val Gly 140 145 150
CTT TTA GCC GGA CCC ACA CTC GCT ATT TTT GGA GCT TTG AGA GCT GCT 531 Leu Leu Ala Gly Pro Thr Leu Ala He Phe Gly Ala Leu Arg Ala Ala 155 160 165 170
GAA ATG GAA AAA AAA TTA GAA GAT GCT AAG GCT TAT TGC TCT CAA GTT 579 Glu Met Glu Lys Lys Leu Glu Asp Ala Lys Ala Tyr Cys Ser Gin Val 175 180 185
GAA GCA GCC GTC AAA AAA GCC GAT GCG ATG ATT GAT AAT CTT CAA GCC 627 Glu Ala Ala Val Lys Lys Ala Asp Ala Met He Asp Asn Leu Gin Ala 190 195 200
GTT AGG AAA ATG GCA GAT CTT TTC ACT AGG CAG ATC ACA AAA TTT GAC 675 Val Arg Lys Met Ala Asp Leu Phe Thr Arg Gin He Thr Lys Phe Asp 205 210 215
GCA CTG TTT TTC TCG CTT GCT CAA GAG GCA ATC GCC ACG ATG AAA AAG 723 Ala Leu Phe Phe Ser Leu Ala Gin Glu Ala He Ala Thr Met Lys Lys 220 225 230
CAC AAC TAC GAT TTT TCG CAT TAC AAT CAA AAA GAA CAA GAT CAG CTA 771 His Asn Tyr Asp Phe Ser His Tyr Asn Gin Lys Glu Gin Asp Gin Leu 235 240 245 250
GCT ACT GCT TCT TCA ACC CTT AAA ACT TTG GGT GCT TTT TTG AAA GTG 819 Ala Thr Ala Ser Ser Thr Leu Lys Thr Leu Gly Ala Phe Leu Lys Val 255 260 265
CCT ATC ATG GAC AAA CAC CAA AAG CTC AAT GAA GCT ACA CAA AGT AAG 867 Pro He Met Asp Lys His Gin Lys Leu Asn Glu Ala Thr Gin Ser Lys 270 275 280 CTA GAG TTT ATG CAA AGG GAG ATG AGT AGC CTA GAA GCT AAG CAT TAT 915 Leu Glu Phe Met Gin Arg Glu Met Ser Ser Leu Glu Ala Lys His Tyr 285 290 295
GAT TCA GTT AAA ATC AAA TTT GGA TTG GTA CGC AGA TTA TTT GAA TTT 963 Asp Ser Val Lys He Lys Phe Gly Leu Val Arg Arg Leu Phe Glu Phe 300 305 310
TTT AGA TCG CTT TGG GGA AAA AAT GGA AGA ATC CAA AGA GCG AAA ACA 1011 Phe Arg Ser Leu Trp Gly Lys Asn Gly Arg He Gin Arg Ala Lys Thr 315 320 325 330
ACT CCT GAT CGC TTC CCT TGC ACC TCT TGC GGG CTT TGC TGC AAG AAT 1059 Thr Pro Asp Arg Phe Pro Cys Thr Ser Cys Gly Leu Cys Cys Lys Asn 335 340 345
ATC GCC GGG ATT ATT GAG CTT ATT GGG TTT GAT GCT GGC AAT GGG GTG 1107 He Ala Gly He He Glu Leu He Gly Phe Asp Ala Gly Asn Gly Val 350 355 360
TGC AAA TTT TTG GAT TTA GAA ACC AAT CTG TGC AAG ATT TAT GAA TCG 1155 Cys Lys Phe Leu Asp Leu Glu Thr Asn Leu Cys Lys He Tyr Glu Ser 365 370 375
CGC CCG TTA ATT TGC AGG ATT GAT GAA GCG CAC AAA AAG CTT TAT CCC 1203 Arg Pro Leu He Cys Arg He Asp Glu Ala His Lys Lys Leu Tyr Pro 380 385 390
CAC ATC CCG CTT AAG GAG TTT TAT GCC AAA AAC GCA GAG GTT TGT AAC 1251 His He Pro Leu Lys Glu Phe Tyr Ala Lys Asn Ala Glu Val Cys Asn 395 400 405 410
GCT TTG CAA GAA GCA AAC CAT ATG GAT AAG AGC TTT AGG GTT ATT CTT 1299 Ala Leu Gin Glu Ala Asn His Met Asp Lys Ser Phe Arg Val He Leu 415 420 425
AAG AAA TAATTTAGAA TTTATTGTCC CA 1327
Lys Lys
(2) INFORMATION FOR SEQ ID NO: 838:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 428 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 838:
Met Pro Leu Pro Phe He He Ala Ala Gly Val Ala Leu Val Ala Ala 1 5 10 15 Xaa Tyr Gly Val Lys Lys Lys Val Asp Ala Asp He Leu Ser Glu Glu
20 25 30
Thr Asn Glu Tyr He Lys Tyr He Asn Glu Gly Asn Asp Leu Leu Glu
35 40 45
Glu Ala Glu Glu Val He Lys Ala Val Ala Ser Asp Cys Glu Phe Ala
50 55 60
Leu Ala Arg Phe Glu Glu Lys Arg Cys Tyr He Arg Asn His Val He 65 70 75 80
Ser Glu Phe Leu His His Phe Asn Gin Leu Glu Gly Phe Glu Leu Thr
85 90 95
Asn Lys Lys Asp Ser Met Glu Asn He Gin Leu Asp Val Ser Asn Thr
100 105 110
Leu Lys He He Asp Lys Asn Leu Lys Met Ser Ser Phe Asp Thr Leu
115 120 125
Gly Ala Val Gly Asn Val Val Gly Gly Phe Ser Met Gly Phe Gly Leu
130 135 140
Ala Ala Gly Gly He Val Gly Ser Val Gly Leu Leu Ala Gly Pro Thr 145 150 155 160
Leu Ala He Phe Gly Ala Leu Arg Ala Ala Glu Met Glu Lys Lys Leu
165 170 175
Glu Asp Ala Lys Ala Tyr Cys Ser Gin Val Glu Ala Ala Val Lys Lys
180 185 190
Ala Asp Ala Met He Asp Asn Leu Gin Ala Val Arg Lys Met Ala Asp
195 200 205
Leu Phe Thr Arg Gin He Thr Lys Phe Asp Ala Leu Phe Phe Ser Leu
210 215 220
Ala Gin Glu Ala He Ala Thr Met Lys Lys His Asn Tyr Asp Phe Ser 225 230 235 240
His Tyr Asn Gin Lys Glu Gin Asp Gin Leu Ala Thr Ala Ser Ser Thr
245 250 255
Leu Lys Thr Leu Gly Ala Phe Leu Lys Val Pro He Met Asp Lys His
260 265 270
Gin Lys Leu Asn Glu Ala Thr Gin Ser Lys Leu Glu Phe Met Gin Arg
275 280 285
Glu Met Ser Ser Leu Glu Ala Lys His Tyr Asp Ser Val Lys He Lys
290 295 300
Phe Gly Leu Val Arg Arg Leu Phe Glu Phe Phe Arg Ser Leu Trp Gly 305 310 315 320
Lys Asn Gly Arg He Gin Arg Ala Lys Thr Thr Pro Asp Arg Phe Pro
325 330 335
Cys Thr Ser Cys Gly Leu Cys Cys Lys Asn He Ala Gly He He Glu
340 345 350
Leu He Gly Phe Asp Ala Gly Asn Gly Val Cys Lys Phe Leu Asp Leu
355 360 365
Glu Thr Asn Leu Cys Lys He Tyr Glu Ser Arg Pro Leu He Cys Arg
370 375 380
He Asp Glu Ala His Lys Lys Leu Tyr Pro His He Pro Leu Lys Glu 385 390 395 400
Phe Tyr Ala Lys Asn Ala Glu Val Cys Asn Ala Leu Gin Glu Ala Asn
405 410 415
His Met Asp Lys Ser Phe Arg Val He Leu Lys Lys 420 425
(2) INFORMATION FOR SEQ ID NO: 839: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 894 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 30...851 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 839:
TATTAACACC GTTAAAGAGG GGGTAATTC ATG TTA GAA AAC ATG CAA GAT ATT 53
Met Leu Glu Asn Met Gin Asp He 1 5
TCA TTG CAA AGC TCT CAT GAA GTA GGA GTG GAT ATT ACA GAG AGC AAA 101 Ser Leu Gin Ser Ser His Glu Val Gly Val Asp He Thr Glu Ser Lys 10 15 20
ATG CTT ACA AAA TTT GCA TCC TCG TTA TTA ATG AAT TTA TAT GAA TAT 149 Met Leu Thr Lys Phe Ala Ser Ser Leu Leu Met Asn Leu Tyr Glu Tyr 25 30 35 40
ATT GGA AAT GGC AAG GAT CCC AAA GAA GCG TCC GAT CAT GCC ATG AGG 197 He Gly Asn Gly Lys Asp Pro Lys Glu Ala Ser Asp His Ala Met Arg 45 50 55
GAT GCA AAG GAT GTG GTG CTT AGT TGT GGT AGA GTA GCC TTT CTT AAA 245 Asp Ala Lys Asp Val Val Leu Ser Cys Gly Arg Val Ala Phe Leu Lys 60 65 70
GAC ATA GTT TCA AAT AGT CCA AAC GAA ACA ATC CAA AGT TTT GAT GGA 293 Asp He Val Ser Asn Ser Pro Asn Glu Thr He Gin Ser Phe Asp Gly 75 80 85
GAC TTA GAA GTT GCG ATG CAT TTA GAA AAA ATT GGC ATA GAA TGT TAT 341 Asp Leu Glu Val Ala Met His Leu Glu Lys He Gly He Glu Cys Tyr 90 95 100
AAG ATA TTT ATT GAC TAT GGT TCT CAA AAG ATC GAT GAT AAT GAG CTT 389 Lys He Phe He Asp Tyr Gly Ser Gin Lys He Asp Asp Asn Glu Leu 105 110 115 120
TCT TGT CGT TTG TTA CAC ACT GGC ACG AAA ATT TTA GGC ACA AAA GCT 437 Ser Cys Arg Leu Leu His Thr Gly Thr Lys He Leu Gly Thr Lys Ala 125 130 135
ATG GCA GTT GTT GGT CAA ACA TTC ATC CCC ATT CCT GGA GTT GGA GCG 485 Met Ala Val Val Gly Gin Thr Phe He Pro He Pro Gly Val Gly Ala 140 145 150
ATA ATT GGA AAT TTT GTG GGT GCA TTA CTG AGC AAA ACT CTC TGT GAA 533 He He Gly Asn Phe Val Gly Ala Leu Leu Ser Lys Thr Leu Cys Glu 155 160 165
AAT TTG CGA GAT GTT TTA AAA GAG GCT AAA TTG GCG CGC CAA AGG CGT 581 Asn Leu Arg Asp Val Leu Lys Glu Ala Lys Leu Ala Arg Gin Arg Arg 170 175 180
ATA GAG ATT GAA AAA GAA TGC CGT GAA AGT ATT AGG CTG TTA GAG ATC 629 He Glu He Glu Lys Glu Cys Arg Glu Ser He Arg Leu Leu Glu He 185 190 195 200
TAT CGC AAT CAA TTT AAG GAA GTG TTT GAG CGG TAT TTT CAT GGG AAT 677 Tyr Arg Asn Gin Phe Lys Glu Val Phe Glu Arg Tyr Phe His Gly Asn 205 210 215
GTA AAA TTC TTT AAT GAG AAT TTT AAT AAT CTT GAG AGG GCG CTT TAT 725 Val Lys Phe Phe Asn Glu Asn Phe Asn Asn Leu Glu Arg Ala Leu Tyr 220 225 230
GCA GGA GAT GCG GAT TTG GCC ATA GGA GTC AAT AAT GAG ATT CAA GAA 773 Ala Gly Asp Ala Asp Leu Ala He Gly Val Asn Asn Glu He Gin Glu 235 240 245
AGA CTA GGT CAA AAA CCC TTG TTT AAT AAT ACC CAA GAA TTT TTG GAA 821 Arg Leu Gly Gin Lys Pro Leu Phe Asn Asn Thr Gin Glu Phe Leu Glu 250 255 260
CTC ATG AAT AAT GGT GGA AAA ATA GAA ATT TAAAGGAGAA ATCATGGAAG AAC 874 Leu Met Asn Asn Gly Gly Lys He Glu He 265 270
AAAAGGATAT GGGTCAAAGT 894
(2) INFORMATION FOR SEQ ID NO: 840:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 274 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 840:
Met Leu Glu Asn Met Gin Asp He Ser Leu Gin Ser Ser His Glu Val
1 5 10 15
Gly Val Asp He Thr Glu Ser Lys Met Leu Thr Lys Phe Ala Ser Ser
20 25 30
Leu Leu Met Asn Leu Tyr Glu Tyr He Gly Asn Gly Lys Asp Pro Lys
35 40 45
Glu Ala Ser Asp His Ala Met Arg Asp Ala Lys Asp Val Val Leu Ser 50 55 60
Cys Gly Arg Val Ala Phe Leu Lys Asp He Val Ser Asn Ser Pro Asn 65 70 75 80
Glu Thr He Gin Ser Phe Asp Gly Asp Leu Glu Val Ala Met His Leu
85 90 95
Glu Lys He Gly He Glu Cys Tyr Lys He Phe He Asp Tyr Gly Ser
100 105 110
Gin Lys He Asp Asp Asn Glu Leu Ser Cys Arg Leu Leu His Thr Gly
115 120 125
Thr Lys He Leu Gly Thr Lys Ala Met Ala Val Val Gly Gin Thr Phe
130 135 140
He Pro He Pro Gly Val Gly Ala He He Gly Asn Phe Val Gly Ala 145 150 155 160
Leu Leu Ser Lys Thr Leu Cys Glu Asn Leu Arg Asp Val Leu Lys Glu
165 170 175
Ala Lys Leu Ala Arg Gin Arg Arg He Glu He Glu Lys Glu Cys Arg
180 185 190
Glu Ser He Arg Leu Leu Glu He Tyr Arg Asn Gin Phe Lys Glu Val
195 200 205
Phe Glu Arg Tyr Phe His Gly Asn Val Lys Phe Phe Asn Glu Asn Phe
210 215 220
Asn Asn Leu Glu Arg Ala Leu Tyr Ala Gly Asp Ala Asp Leu Ala He 225 230 235 240
Gly Val Asn Asn Glu He Gin Glu Arg Leu Gly Gin Lys Pro Leu Phe
245 250 255
Asn Asn Thr Gin Glu Phe Leu Glu Leu Met Asn Asn Gly Gly Lys He
260 265 270
Glu He
(2) INFORMATION FOR SEQ ID NO: 841:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1338 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 70...1281 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 841:
CTTCATTACG CTTACGCTAC AACCCTTAAG ATCACCAATG TTGTGCCTTT TGGCTCTAGC 60 AGCGTTAAA ATG GTG TTC AAT CAA GAG GTT AAA AAA TTC AAA GAA GTT TCG 111 Met Val Phe Asn Gin Glu Val Lys Lys Phe Lys Glu Val Ser 1 5 10
CTC AAA AAT TTC AAG AGT TAT TTG GAA TTA GAA GCC ATT TTA ACC ATT 159 Leu Lys Asn Phe Lys Ser Tyr Leu Glu Leu Glu Ala He Leu Thr He 15 20 25 30
CCT AAA AAG CAT TAC CAA TTC TCC AAG CAA TCG TTC ATC ACG ATC GCG 207 Pro Lys Lys His Tyr Gin Phe Ser Lys Gin Ser Phe He Thr He Ala 35 40 45
CAA TTC AGC CCT AAG TTA GTG CGA GTG GTT ATC GGC TAT GCT CCT AAG 255 Gin Phe Ser Pro Lys Leu Val Arg Val Val He Gly Tyr Ala Pro Lys 50 55 60
ATG ACT TAT GAA GTT AAA ATC CTT AAA GAC AAG CTT TAT GTT TCT ATC 303 Met Thr Tyr Glu Val Lys He Leu Lys Asp Lys Leu Tyr Val Ser He 65 70 75
GTG GAG AAA AAG CCC TTA ATT AGG CAT CAA ATG GCG TTA AAA CCA CCC 351 Val Glu Lys Lys Pro Leu He Arg His Gin Met Ala Leu Lys Pro Pro 80 85 90
AAA CAC CAT GCA CTC AAA CAC ACA ACG CCA AAA CCC GCC CAT AAG CCC 399 Lys His His Ala Leu Lys His Thr Thr Pro Lys Pro Ala His Lys Pro 95 100 105 110
ATT AAA AAA GAG GCT AAA AAG GTT AAA GAA AAA ACG CCA ACT AAA CAT 447 He Lys Lys Glu Ala Lys Lys Val Lys Glu Lys Thr Pro Thr Lys His 115 120 125
GCG CAT TCA AAA CAC ACG CAT TCC CCA TTG AAC GAA AGG AGC ACT AAA 495 Ala His Ser Lys His Thr His Ser Pro Leu Asn Glu Arg Ser Thr Lys 130 135 140
AAA GAA ATT CCT AAA AAA GAA ATT CCT AAA AAA GAA GCG GAA AAT GAG 543 Lys Glu He Pro Lys Lys Glu He Pro Lys Lys Glu Ala Glu Asn Glu 145 150 155
AGC AAG AAC CAA GTC TTT ATA GCA GAA AAA AAT GAT ACT TTC ATC AAA 591 Ser Lys Asn Gin Val Phe He Ala Glu Lys Asn Asp Thr Phe He Lys 160 165 170
ACC AAA CGC AAA AAA CAC AAA AAG ATC GTT TTA GAC GCT GGG CAT GGG 639 Thr Lys Arg Lys Lys His Lys Lys He Val Leu Asp Ala Gly His Gly 175 180 185 190
GGG AAA GAT TGC GGG GCG ATG AGC GCG AAT TTG GTG TGT GAA AAA GAC 687 Gly Lys Asp Cys Gly Ala Met Ser Ala Asn Leu Val Cys Glu Lys Asp 195 200 205
ATT GTT TTA GAA GTG GTG AAG TTT TTA CAC AAA GAG CTT AAA AAA AGA 735 He Val Leu Glu Val Val Lys Phe Leu His Lys Glu Leu Lys Lys Arg 210 215 220
GAT TAT AGC GTT TTA TTG ACA AGG GAT AAG GAT ATT TAT ATT GAT TTA 783 Asp Tyr Ser Val Leu Leu Thr Arg Asp Lys Asp He Tyr He Asp Leu 225 230 235 GTG GCT CGC ACG GAA TTA GCC AAT AAA AAA AGC GCG GAT TTA TTC ATC 831 Val Ala Arg Thr Glu Leu Ala Asn Lys Lys Ser Ala Asp Leu Phe He 240 245 250
TCA GTG CAT GCC AAT TCC ATC CCC AAA CAT TCC ACT TCT AAC GCT CAT 879 Ser Val His Ala Asn Ser He Pro Lys His Ser Thr Ser Asn Ala His 255 260 265 270
GGT ATA GAG ACT TAT TTT TTA TCC ACC GCA AGG AGC GAA AGG GCT AGG 927 Gly He Glu Thr Tyr Phe Leu Ser Thr Ala Arg Ser Glu Arg Ala Arg 275 280 285
AAA GTG GCT GAG CAA GAA AAT AAA GAC GAT GTG AAT TTA ATG GAT TAT 975 Lys Val Ala Glu Gin Glu Asn Lys Asp Asp Val Asn Leu Met Asp Tyr 290 295 300
TTT TCT AAA AGT TTG TTT TTA AAT TCA TTG AAC ACG CAG CGA TTG ATC 1023 Phe Ser Lys Ser Leu Phe Leu Asn Ser Leu Asn Thr Gin Arg Leu He 305 310 315
GTC TCT AAC AAA TTA GCG ATT GAT GTG CAA TAC GGC ATG CTC CAA AGC 1071 Val Ser Asn Lys Leu Ala He Asp Val Gin Tyr Gly Met Leu Gin Ser 320 325 330
GTC CGC AAA AAT TAC CCT GAT GTG GTG GAT GGA GGC GTG AGA GAG GGG 1119 Val Arg Lys Asn Tyr Pro Asp Val Val Asp Gly Gly Val Arg Glu Gly 335 340 345 350
CCT TTT TGG GTG TTG GCC GGG GCT TTA ATG CCT TCA ATC TTA ATA GAA 1167 Pro Phe Trp Val Leu Ala Gly Ala Leu Met Pro Ser He Leu He Glu 355 360 365
ATT GGT TAT AAT TCC CAT GCG ATA GAA TCT AAA CGC ATC CAA AGC AAA 1215 He Gly Tyr Asn Ser His Ala He Glu Ser Lys Arg He Gin Ser Lys 370 375 380
CCG TAT CAA AAG ATC TTG GCT AAG GGC ATT GCT GAT GGC ATT GAT AGT 1263 Pro Tyr Gin Lys He Leu Ala Lys Gly He Ala Asp Gly He Asp Ser 385 390 395
TTC TTC AGC AAG AAT GAT TAGGCAATGA TTAGGTTGTA GATGAATTTT TATCAAAA 1319 Phe Phe Ser Lys Asn Asp 400
AATATACACT CATAAAGTC 133!
(2) INFORMATION FOR SEQ ID NO: 842:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 404 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 842:
Met Val Phe Asn Gin Glu Val Lys Lys Phe Lys Glu Val Ser Leu Lys
1 5 10 15
Asn Phe Lys Ser Tyr Leu Glu Leu Glu Ala He Leu Thr He Pro Lys
20 25 30
Lys His Tyr Gin Phe Ser Lys Gin Ser Phe He Thr He Ala Gin Phe
35 40 45
Ser Pro Lys Leu Val Arg Val Val He Gly Tyr Ala Pro Lys Met Thr
50 55 60
Tyr Glu Val Lys He Leu Lys Asp Lys Leu Tyr Val Ser He Val Glu 65 70 75 80
Lys Lys Pro Leu He Arg His Gin Met Ala Leu Lys Pro Pro Lys His
85 90 95
His Ala Leu Lys His Thr Thr Pro Lys Pro Ala His Lys Pro He Lys
100 105 110
Lys Glu Ala Lys Lys Val Lys Glu Lys Thr Pro Thr Lys His Ala His
115 120 125
Ser Lys His Thr His Ser Pro Leu Asn Glu Arg Ser Thr Lys Lys Glu
130 135 140
He Pro Lys Lys Glu He Pro Lys Lys Glu Ala Glu Asn Glu Ser Lys 145 150 155 160
Asn Gin Val Phe He Ala Glu Lys Asn Asp Thr Phe He Lys Thr Lys
165 170 175
Arg Lys Lys His Lys Lys He Val Leu Asp Ala Gly His Gly Gly Lys
180 185 190
Asp Cys Gly Ala Met Ser Ala Asn Leu Val Cys Glu Lys Asp He Val
195 200 205
Leu Glu Val Val Lys Phe Leu His Lys Glu Leu Lys Lys Arg Asp Tyr
210 215 220
Ser Val Leu Leu Thr Arg Asp Lys Asp He Tyr He Asp Leu Val Ala 225 230 235 240
Arg Thr Glu Leu Ala Asn Lys Lys Ser Ala Asp Leu Phe He Ser Val
245 250 255
His Ala Asn Ser He Pro Lys His Ser Thr Ser Asn Ala His Gly He
260 265 270
Glu Thr Tyr Phe Leu Ser Thr Ala Arg Ser Glu Arg Ala Arg Lys Val
275 280 285
Ala Glu Gin Glu Asn Lys Asp Asp Val Asn Leu Met Asp Tyr Phe Ser
290 295 300
Lys Ser Leu Phe Leu Asn Ser Leu Asn Thr Gin Arg Leu He Val Ser 305 310 315 320
Asn Lys Leu Ala He Asp Val Gin Tyr Gly Met Leu Gin Ser Val Arg
325 330 335
Lys Asn Tyr Pro Asp Val Val Asp Gly Gly Val Arg Glu Gly Pro Phe
340 345 350
Trp Val Leu Ala Gly Ala Leu Met Pro Ser He Leu He Glu He Gly
355 360 365
Tyr Asn Ser His Ala He Glu Ser Lys Arg He Gin Ser Lys Pro Tyr
370 375 380
Gin Lys He Leu Ala Lys Gly He Ala Asp Gly He Asp Ser Phe Phe 385 390 395 400
Ser Lys Asn Asp (2) INFORMATION FOR SEQ ID NO:843:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1161 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 37...1125 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 843:
TGAAATTAAA TATTAACTAA TATTAAGGAA AGAGCT ATG GTA TCA ACA CTC AAA 54
Met Val Ser Thr Leu Lys 1 5
CCG CTA AAA ATC GGT AAG CAC ACC ATA AAA TTC CCT ATC TTT CAA GGG 102 Pro Leu Lys He Gly Lys His Thr He Lys Phe Pro He Phe Gin Gly 10 15 20
GGA ATG GGT GTG GGG ATT AGC TGG GAT GAA CTA GCT GGA AAT GTT GCC 150 Gly Met Gly Val Gly He Ser Trp Asp Glu Leu Ala Gly Asn Val Ala 25 30 35
AAA GAA GGG GCT TTA GGA GTG ATT TCA GCC GTA GGG ACT GGT TAT TAT 198 Lys Glu Gly Ala Leu Gly Val He Ser Ala Val Gly Thr Gly Tyr Tyr 40 45 50
AAA AAC ATG CGT TTT GTA GAA AGG ATT GTG GCT AAA AAA CCC TTT GAA 246 Lys Asn Met Arg Phe Val Glu Arg He Val Ala Lys Lys Pro Phe Glu 55 60 65 70
GCC TTG AAT TTT TAC TCC AAA AAA GCG TTG AAT GAG ATT TTT GCA AAC 294 Ala Leu Asn Phe Tyr Ser Lys Lys Ala Leu Asn Glu He Phe Ala Asn 75 80 85
GCT AGG AAG ATT TGC GGG AAC AAC CCT TTA GGA GCG AAT ATT TTA TAC 342 Ala Arg Lys He Cys Gly Asn Asn Pro Leu Gly Ala Asn He Leu Tyr 90 95 100
GCT ATC AAT GAC TAT GGC CGT GTT TTA AGG GAC TCT TGT GAA GCG GGA 390 Ala He Asn Asp Tyr Gly Arg Val Leu Arg Asp Ser Cys Glu Ala Gly 105 110 115
GCG AAT ATC ATT ATT ACA GGG GCT GGT TTG CCC ACC AAC ATG CCT GAA 438 Ala Asn He He He Thr Gly Ala Gly Leu Pro Thr Asn Met Pro Glu 120 125 130 TTC GCT AAG GAT TTT AGC GAT GTG GCG CTC ATC CCT ATT ATT TCT TCA 486 Phe Ala Lys Asp Phe Ser Asp Val Ala Leu He Pro He He Ser Ser 135 140 145 150
GCG AAG GCT TTA AAA ATC CTT TGT AAA AGA TGG AGC GAT CGC TAT AAA 534 Ala Lys Ala Leu Lys He Leu Cys Lys Arg Trp Ser Asp Arg Tyr Lys 155 160 165
AGA ATC CCG GAC GCG TTC ATT GTG GAA GGG CCT TTG AGT GGG GGG CAT 582 Arg He Pro Asp Ala Phe He Val Glu Gly Pro Leu Ser Gly Gly His 170 175 180
CAG GGC TTT AAA TAC GAA GAT TGT TTC AAA GAA GAA TTC CGA TTA GAA 630 Gin Gly Phe Lys Tyr Glu Asp Cys Phe Lys Glu Glu Phe Arg Leu Glu 185 190 195
AAC TTA GTG CCT AAA GTC GTG GAA GCT TCT AAA GAA TGG GGG AAT ATC 678 Asn Leu Val Pro Lys Val Val Glu Ala Ser Lys Glu Trp Gly Asn He 200 205 210
CCT ATC ATC GCC GCT GGG GGG ATT TGG GAT AGG AAG GAT ATA GAC ACC 726 Pro He He Ala Ala Gly Gly He Trp Asp Arg Lys Asp He Asp Thr 215 220 225 230
ATG TTA AGT CTT GGA GCG AGT GGG GTG CAG ATG GCG ACT CGT TTT TTA 774 Met Leu Ser Leu Gly Ala Ser Gly Val Gin Met Ala Thr Arg Phe Leu 235 240 245
GGC ACG AAA GAA TGC GAC GCT AAA GTG TAT GCC GAT CTT TTG CCC ACG 822 Gly Thr Lys Glu Cys Asp Ala Lys Val Tyr Ala Asp Leu Leu Pro Thr 250 255 260
CTC AAA AAA GAA GAT ATT TTA CTC ATT AAA TCG CCT GTA GGT TAT CCG 870 Leu Lys Lys Glu Asp He Leu Leu He Lys Ser Pro Val Gly Tyr Pro 265 270 275
GCT AGG GCT ATT AAT ACG GGA GTG ATC AAG CGC ATT GAA GAG GGT AAC 918 Ala Arg Ala He Asn Thr Gly Val He Lys Arg He Glu Glu Gly Asn 280 285 290
GCG CCC AAA ATC GCA TGC GTG AGC AAT TGT GTA GCG CCT TGC AAC AGG 966 Ala Pro Lys He Ala Cys Val Ser Asn Cys Val Ala Pro Cys Asn Arg 295 300 305 310
GGT GAA GAG GCT AAA AAG GTG GGC TAT TGT ATC GCT GAT GGT TTG GGG 1014 Gly Glu Glu Ala Lys Lys Val Gly Tyr Cys He Ala Asp Gly Leu Gly 315 320 325
CGC AGT TAT TTA GGG AAC AGA GAA GAG GGG CTT TAT TTT ACC GGG GCT 1062 Arg Ser Tyr Leu Gly Asn Arg Glu Glu Gly Leu Tyr Phe Thr Gly Ala 330 335 340
AAT GGC TAT AGA GTG GAT AAG ATT ATC AGC GTG CAT GAA TTG ATT AAA 1110 Asn Gly Tyr Arg Val Asp Lys He He Ser Val His Glu Leu He Lys 345 350 355 GAG CTT ACA GAG GGT TAATTTGTAG TGCTTGTGAG GTTAGGGGTT GTTGCA 1161
Glu Leu Thr Glu Gly 360
(2) INFORMATION FOR SEQ ID NO: 844:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 363 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 844:
Met Val Ser Thr Leu Lys Pro Leu Lys He Gly Lys His Thr He Lys
1 5 10 15
Phe Pro He Phe Gin Gly Gly Met Gly Val Gly He Ser Trp Asp Glu
20 25 30
Leu Ala Gly Asn Val Ala Lys Glu Gly Ala Leu Gly Val He Ser Ala
35 40 45
Val Gly Thr Gly Tyr Tyr Lys Asn Met Arg Phe Val Glu Arg He Val
50 55 60
Ala Lys Lys Pro Phe Glu Ala Leu Asn Phe Tyr Ser Lys Lys Ala Leu 65 70 75 80
Asn Glu He Phe Ala Asn Ala Arg Lys He Cys Gly Asn Asn Pro Leu
85 90 95
Gly Ala Asn He Leu Tyr Ala He Asn Asp Tyr Gly Arg Val Leu Arg
100 105 110
Asp Ser Cys Glu Ala Gly Ala Asn He He He Thr Gly Ala Gly Leu
115 120 125
Pro Thr Asn Met Pro Glu Phe Ala Lys Asp Phe Ser Asp Val Ala Leu
130 135 140
He Pro He He Ser Ser Ala Lys Ala Leu Lys He Leu Cys Lys Arg 145 150 155 160
Trp Ser Asp Arg Tyr Lys Arg He Pro Asp Ala Phe He Val Glu Gly
165 170 175
Pro Leu Ser Gly Gly His Gin Gly Phe Lys Tyr Glu Asp Cys Phe Lys
180 185 190
Glu Glu Phe Arg Leu Glu Asn Leu Val Pro Lys Val Val Glu Ala Ser
195 200 205
Lys Glu Trp Gly Asn He Pro He He Ala Ala Gly Gly He Trp Asp
210 215 220
Arg Lys Asp He Asp Thr Met Leu Ser Leu Gly Ala Ser Gly Val Gin 225 230 235 240
Met Ala Thr Arg Phe Leu Gly Thr Lys Glu Cys Asp Ala Lys Val Tyr
245 250 255
Ala Asp Leu Leu Pro Thr Leu Lys Lys Glu Asp He Leu Leu He Lys
260 265 270
Ser Pro Val Gly Tyr Pro Ala Arg Ala He Asn Thr Gly Val He Lys
275 280 285
Arg He Glu Glu Gly Asn Ala Pro Lys He Ala Cys Val Ser Asn Cys 290 295 300 Val Ala Pro Cys Asn Arg Gly Glu Glu Ala Lys Lys Val Gly Tyr Cys 305 310 315 320
He Ala Asp Gly Leu Gly Arg Ser Tyr Leu Gly Asn Arg Glu Glu Gly
325 330 335
Leu Tyr Phe Thr Gly Ala Asn Gly Tyr Arg Val Asp Lys He He Ser
340 345 350
Val His Glu Leu He Lys Glu Leu Thr Glu Gly 355 360
(2) INFORMATION FOR SEQ ID NO: 845:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2373 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 13...2337 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 845:
TAGACAGGAT AG ATG AAC GAA ATT GAT AAA TCC GTT GAT ATC GGA TTC TTA 51 Met Asn Glu He Asp Lys Ser Val Asp He Gly Phe Leu 1 5 10
CGG ATT CTT GAT GTT ATT AAA AAA GTT ACG ACC CCA AAG GGT GGC ATT 99 Arg He Leu Asp Val He Lys Lys Val Thr Thr Pro Lys Gly Gly He 15 20 25
GAA ATC TTA AGG ACT TTA ATT GAT TTT ACG CCC AAA ATT GAA AAC GCC 147 Glu He Leu Arg Thr Leu He Asp Phe Thr Pro Lys He Glu Asn Ala 30 35 40 45
CTG AAT TTA GCG GCC AAA AGC CAT AAG GGG CAA TAC AGA AAA AGC GGC 195 Leu Asn Leu Ala Ala Lys Ser His Lys Gly Gin Tyr Arg Lys Ser Gly 50 55 60
GAG CCT TAT ATT GTC CAT CCT ATT TGC GTG GCA AGC TTG GTA GCG TTT 243 Glu Pro Tyr He Val His Pro He Cys Val Ala Ser Leu Val Ala Phe 65 70 75
TGT GGG GGC GAT GAG GCG ATG GTG TGT GCT GCG CTT TTG CAT GAT GTG 291 Cys Gly Gly Asp Glu Ala Met Val Cys Ala Ala Leu Leu His Asp Val 80 85 90
GTG GAA GAC ACG CCT TGT AAG ATT GAA ACG ATT GAG CAA GAA TTT GGG 339 Val Glu Asp Thr Pro Cys Lys He Glu Thr He Glu Gin Glu Phe Gly 95 100 105 CAA GAT GTG GCT AAT TTA GTG GAT GCG CTC ACT AAA ATC ACT GAA ATC 387 Gin Asp Val Ala Asn Leu Val Asp Ala Leu Thr Lys He Thr Glu He 110 115 120 125
AGG AAA GAA GAA TTA GGC GTG AGC TCT CAA GAT CCC AGA ATG GTG GTT 435 Arg Lys Glu Glu Leu Gly Val Ser Ser Gin Asp Pro Arg Met Val Val 130 135 140
TCA GCG CTC ACT TTC AGA AAG ATT TTA ATT AGC GCG ATA CAA GAT CCA 483 Ser Ala Leu Thr Phe Arg Lys He Leu He Ser Ala He Gin Asp Pro 145 150 155
AGA GCC TTA GTG GTA AAG ATT AGC GAC AGG TTG CAC AAC ATG CTC ACC 531 Arg Ala Leu Val Val Lys He Ser Asp Arg Leu His Asn Met Leu Thr 160 165 170
TTA GAC GCC TTG CCT CAT GAC AAG CAA GTG CGT ATT TCT AAA GAG ACT 579 Leu Asp Ala Leu Pro His Asp Lys Gin Val Arg He Ser Lys Glu Thr 175 180 185
CTA GCG GTG TAT GCC CCT ATA GCG AGC CGA TTG GGC ATG TCT TCA ATC 627 Leu Ala Val Tyr Ala Pro He Ala Ser Arg Leu Gly Met Ser Ser He 190 195 200 205
AAA AAT GAA TTA GAA GAC AAG AGC TTT TAT TAT ATT TAT CCA GAA GAG 675 Lys Asn Glu Leu Glu Asp Lys Ser Phe Tyr Tyr He Tyr Pro Glu Glu 210 215 220
TAT AAA AAT ATC AAG GAA TAT TTG CAC AAA AAC AAG CAG TCT TTA CTC 723 Tyr Lys Asn He Lys Glu Tyr Leu His Lys Asn Lys Gin Ser Leu Leu 225 230 235
TTA AAG CTC AAC GCT TTT GCG AGC AAG TTA GAA AAA AAA CTT TTT GAT 771 Leu Lys Leu Asn Ala Phe Ala Ser Lys Leu Glu Lys Lys Leu Phe Asp 240 245 250
AGT GGG TTT AGC CAT TCG GAT TTT AAA CTC GTT ACA AGG GTG AAA CGC 819 Ser Gly Phe Ser His Ser Asp Phe Lys Leu Val Thr Arg Val Lys Arg 255 260 265
CCT TAT TCT ATC TAT CTT AAG ATG CAA CGA AAG GGC GCG GTT AAT ATT 867 Pro Tyr Ser He Tyr Leu Lys Met Gin Arg Lys Gly Ala Val Asn He 270 275 280 285
GAT GAA ATT TTG GAC TTG TTA GCC ATT AGG ATT TTA TTG AAA AAC CCG 915 Asp Glu He Leu Asp Leu Leu Ala He Arg He Leu Leu Lys Asn Pro 290 295 300
ATT GAT TGC TAT AAA GTT TTA GGG ATT ATC CAT TTG AAT TTC AAA CCC 963 He Asp Cys Tyr Lys Val Leu Gly He He His Leu Asn Phe Lys Pro 305 310 315
ATT GTC TCT CGT TTT AAA GAT TAC ATC GCT TTG CCC AAA GAA AAT GGC 1011 He Val Ser Arg Phe Lys Asp Tyr He Ala Leu Pro Lys Glu Asn Gly 320 325 330 TAT AAG ACG ATA CAC ACG ACC ATT TTT GAT GAA TCT TCT GTT TAT GAA 1059 Tyr Lys Thr He His Thr Thr He Phe Asp Glu Ser Ser Val Tyr Glu 335 340 345
GTG CAG ATC CGC ACC TTT GAT ATG CAT ATG GGG GCG GAG TAT GGT AAT 1107 Val Gin He Arg Thr Phe Asp Met His Met Gly Ala Glu Tyr Gly Asn 350 355 360 365
TCA GCC CAC TGG AAG TAT AAA GCC GGG GGC GTG GAT CAT GAA GAT CAT 1155 Ser Ala His Trp Lys Tyr Lys Ala Gly Gly Val Asp His Glu Asp His 370 375 380
CAT GAG GGC ATG AGA TGG TTG CAA AAT TTT AAA TAC CAT GAC AGC GAT 1203 His Glu Gly Met Arg Trp Leu Gin Asn Phe Lys Tyr His Asp Ser Asp 385 390 395
TTG AAA AAC GAC CCT AAG GAA TTT TAC GAA CTC GCT AAG AAC GAT TTG 1251 Leu Lys Asn Asp Pro Lys Glu Phe Tyr Glu Leu Ala Lys Asn Asp Leu 400 405 410
TAT CGT GAA GAT ATT GTC GTT TTT TCG CCT CAT GGG GAC ACT TAC ACT 1299 Tyr Arg Glu Asp He Val Val Phe Ser Pro His Gly Asp Thr Tyr Thr 415 420 425
TTA CCG GTG GGT GCG ATC GCT TTA GAT TTT GCT TAC ATG GTG CAT AGC 1347 Leu Pro Val Gly Ala He Ala Leu Asp Phe Ala Tyr Met Val His Ser 430 435 440 445
GAT TTG GGC GAT AAA GCC ACG GAC GCT TAT ATC AAT AGT AAA AAA GCC 1395 Asp Leu Gly Asp Lys Ala Thr Asp Ala Tyr He Asn Ser Lys Lys Ala 450 455 460
TTA CTC AAT CAG GAA TTA AGA AGT GGG GAT GTG GTT AAA ATC ATT AAA 1443 Leu Leu Asn Gin Glu Leu Arg Ser Gly Asp Val Val Lys He He Lys 465 470 475
GGC GAT AAA ATA ATA CCT CGT TTC ATT TGG ATG GAT CAG CTT AAA ACT 1491 Gly Asp Lys He He Pro Arg Phe He Trp Met Asp Gin Leu Lys Thr 480 485 490
TCT AAG GCT AAA AAC CAT TTG CGC ATC CAA AGA AGA AAC CGC TTG AAA 1539 Ser Lys Ala Lys Asn His Leu Arg He Gin Arg Arg Asn Arg Leu Lys 495 500 505
GAG GTT GAC ACT AAG AGC ATG ATC AAT ATC TTA GCG ACT TTT TTT GGG 1587 Glu Val Asp Thr Lys Ser Met He Asn He Leu Ala Thr Phe Phe Gly 510 515 520 525
CGC TCT GTT TTT GAA GAC ATG GAT TTA AAG GAT TAT AAA AAC TTT GAA 1635 Arg Ser Val Phe Glu Asp Met Asp Leu Lys Asp Tyr Lys Asn Phe Glu 530 535 540
GAA AGA TTA ACA GAT TGC GGG GTG GAG ACC ACC TTA ACA GAA GCG ATG 1683 Glu Arg Leu Thr Asp Cys Gly Val Glu Thr Thr Leu Thr Glu Ala Met 545 550 555 AAA AGT TTT GAA AAT TTA GCC AAA CTC ACT GAA GAA ATT GAA AAT AAG 1731 Lys Ser Phe Glu Asn Leu Ala Lys Leu Thr Glu Glu He Glu Asn Lys 560 565 570
GTG TTT TCT TTA AAA GAA GAT GCG ATT TTA GAA TAC CAA GAG ATG AGT 1779 Val Phe Ser Leu Lys Glu Asp Ala He Leu Glu Tyr Gin Glu Met Ser 575 580 585
TTA TGG ACT CGA GGT TTA AGG TAT TTG GGC TTT AAA ACC AAT GTC TTG 1827 Leu Trp Thr Arg Gly Leu Arg Tyr Leu Gly Phe Lys Thr Asn Val Leu 590 595 600 605
AAT TTT TTA GCC CCC AAT CGG CAG TGG CAG TGT AAG GAA TTA GAA CAT 1875 Asn Phe Leu Ala Pro Asn Arg Gin Trp Gin Cys Lys Glu Leu Glu His 610 615 620
TTT AGC GTT TGT TCA AGC AAC GCT TTA GAA ATC AAA CAG GTG TTG TTG 1923 Phe Ser Val Cys Ser Ser Asn Ala Leu Glu He Lys Gin Val Leu Leu 625 630 635
AAT GAT TGT TGT TAC CCT AAA TAT GGC GAT GAA ATC ATT GCG ATT GTA 1971 Asn Asp Cys Cys Tyr Pro Lys Tyr Gly Asp Glu He He Ala He Val 640 645 650
ACG GAT TTA AAA GAT CCA AAA GCG ATT GCG CAC CAT AAA TTT TGC AAA 2019 Thr Asp Leu Lys Asp Pro Lys Ala He Ala His His Lys Phe Cys Lys 655 660 665
AAA GCG ATG GCG GAA GTA GAT GCT AAA GTG CCT ATG GTT TAT ATA GAA 2067 Lys Ala Met Ala Glu Val Asp Ala Lys Val Pro Met Val Tyr He Glu 670 675 680 685
TGG CAC AAG CGG GAT CGA ACG ATT TAT AAA ATG ATG TTT TAT TTG GGC 2115 Trp His Lys Arg Asp Arg Thr He Tyr Lys Met Met Phe Tyr Leu Gly 690 695 700
GAA AAA AAG TCG GTT TTA GCG GGT TTA TTA ACT TTT TTA AAC AGG AAT 2163 Glu Lys Lys Ser Val Leu Ala Gly Leu Leu Thr Phe Leu Asn Arg Asn 705 710 715
GAA TGC AAC ATT GTG GGC GTG TCT TAT TTG GGC TAT AAA GAC AAG TAT 2211 Glu Cys Asn He Val Gly Val Ser Tyr Leu Gly Tyr Lys Asp Lys Tyr 720 725 730
TCT AGC CAT TGT GAA GTG AGT TTT GAA ATA GCC ACA GAT AAG GCG GAT 2259 Ser Ser His Cys Glu Val Ser Phe Glu He Ala Thr Asp Lys Ala Asp 735 740 745
TGG ATC AGA GCC TTA ATC AAT CGC AAA TAT CAG GAT AGG ATT GTA GAA 2307 Trp He Arg Ala Leu He Asn Arg Lys Tyr Gin Asp Arg He Val Glu 750 755 760 765
TTA TCC AGT CTG GAT GAC GCT TAT GAA TCA TAATAAGCCC TAATTAAGGA ATG 2360 Leu Ser Ser Leu Asp Asp Ala Tyr Glu Ser 770 775 AACATGGAAC AAA 2373
(2) INFORMATION FOR SEQ ID NO: 846:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 775 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 846:
Met Asn Glu He Asp Lys Ser Val Asp He Gly Phe Leu Arg He Leu
1 5 10 15
Asp Val He Lys Lys Val Thr Thr Pro Lys Gly Gly He Glu He Leu
20 25 30
Arg Thr Leu He Asp Phe Thr Pro Lys He Glu Asn Ala Leu Asn Leu
35 40 45
Ala Ala Lys Ser His Lys Gly Gin Tyr Arg Lys Ser Gly Glu Pro Tyr
50 55 60
He Val His Pro He Cys Val Ala Ser Leu Val Ala Phe Cys Gly Gly 65 70 75 80
Asp Glu Ala Met Val Cys Ala Ala Leu Leu His Asp Val Val Glu Asp
85 90 95
Thr Pro Cys Lys He Glu Thr He Glu Gin Glu Phe Gly Gin Asp Val
100 105 110
Ala Asn Leu Val Asp Ala Leu Thr Lys He Thr Glu He Arg Lys Glu
115 120 125
Glu Leu Gly Val Ser Ser Gin Asp Pro Arg Met Val Val Ser Ala Leu
130 135 140
Thr Phe Arg Lys He Leu He Ser Ala He Gin Asp Pro Arg Ala Leu 145 150 155 160
Val Val Lys He Ser Asp Arg Leu His Asn Met Leu Thr Leu Asp Ala
165 170 175
Leu Pro His Asp Lys Gin Val Arg He Ser Lys Glu Thr Leu Ala Val
180 185 190
Tyr Ala Pro He Ala Ser Arg Leu Gly Met Ser Ser He Lys Asn Glu
195 200 205
Leu Glu Asp Lys Ser Phe Tyr Tyr He Tyr Pro Glu Glu Tyr Lys Asn
210 215 220
He Lys Glu Tyr Leu His Lys Asn Lys Gin Ser Leu Leu Leu Lys Leu 225 230 235 240
Asn Ala Phe Ala Ser Lys Leu Glu Lys Lys Leu Phe Asp Ser Gly Phe
245 250 255
Ser His Ser Asp Phe Lys Leu Val Thr Arg Val Lys Arg Pro Tyr Ser
260 265 270
He Tyr Leu Lys Met Gin Arg Lys Gly Ala Val Asn He Asp Glu He
275 280 285
Leu Asp Leu Leu Ala He Arg He Leu Leu Lys Asn Pro He Asp Cys
290 295 300
Tyr Lys Val Leu Gly He He His Leu Asn Phe Lys Pro He Val Ser 305 310 315 320
Arg Phe Lys Asp Tyr He Ala Leu Pro Lys Glu Asn Gly Tyr Lys Thr 325 330 335
He His Thr Thr He Phe Asp Glu Ser Ser Val Tyr Glu Val Gin He
340 345 350
Arg Thr Phe Asp Met His Met Gly Ala Glu Tyr Gly Asn Ser Ala His
355 360 365
Trp Lys Tyr Lys Ala Gly Gly Val Asp His Glu Asp His His Glu Gly
370 375 380
Met Arg Trp Leu Gin Asn Phe Lys Tyr His Asp Ser Asp Leu Lys Asn 385 390 395 400
Asp Pro Lys Glu Phe Tyr Glu Leu Ala Lys Asn Asp Leu Tyr Arg Glu
405 410 415
Asp He Val Val Phe Ser Pro His Gly Asp Thr Tyr Thr Leu Pro Val
420 425 430
Gly Ala He Ala Leu Asp Phe Ala Tyr Met Val His Ser Asp Leu Gly
435 440 445
Asp Lys Ala Thr Asp Ala Tyr He Asn Ser Lys Lys Ala Leu Leu Asn
450 455 460
Gin Glu Leu Arg Ser Gly Asp Val Val Lys He He Lys Gly Asp Lys 465 470 475 480
He He Pro Arg Phe He Trp Met Asp Gin Leu Lys Thr Ser Lys Ala
485 490 495
Lys Asn His Leu Arg He Gin Arg Arg Asn Arg Leu Lys Glu Val Asp
500 505 510
Thr Lys Ser Met He Asn He Leu Ala Thr Phe Phe Gly Arg Ser Val
515 520 525
Phe Glu Asp Met Asp Leu Lys Asp Tyr Lys Asn Phe Glu Glu Arg Leu
530 535 540
Thr Asp Cys Gly Val Glu Thr Thr Leu Thr Glu Ala Met Lys Ser Phe 545 550 555 560
Glu Asn Leu Ala Lys Leu Thr Glu Glu He Glu Asn Lys Val Phe Ser
565 570 575
Leu Lys Glu Asp Ala He Leu Glu Tyr Gin Glu Met Ser Leu Trp Thr
580 585 590
Arg Gly Leu Arg Tyr Leu Gly Phe Lys Thr Asn Val Leu Asn Phe Leu
595 600 605
Ala Pro Asn Arg Gin Trp Gin Cys Lys Glu Leu Glu His Phe Ser Val
610 615 620
Cys Ser Ser Asn Ala Leu Glu He Lys Gin Val Leu Leu Asn Asp Cys 625 630 635 640
Cys Tyr Pro Lys Tyr Gly Asp Glu He He Ala He Val Thr Asp Leu
645 650 655
Lys Asp Pro Lys Ala He Ala His His Lys Phe Cys Lys Lys Ala Met
660 665 670
Ala Glu Val Asp Ala Lys Val Pro Met Val Tyr He Glu Trp His Lys
675 680 685
Arg Asp Arg Thr He Tyr Lys Met Met Phe Tyr Leu Gly Glu Lys Lys
690 695 700
Ser Val Leu Ala Gly Leu Leu Thr Phe Leu Asn Arg Asn Glu Cys Asn 705 710 715 720
He Val Gly Val Ser Tyr Leu Gly Tyr Lys Asp Lys Tyr Ser Ser His
725 730 735
Cys Glu Val Ser Phe Glu He Ala Thr Asp Lys Ala Asp Trp He Arg
740 745 750
Ala Leu He Asn Arg Lys Tyr Gin Asp Arg He Val Glu Leu Ser Ser 755 760 765 Leu Asp Asp Ala Tyr Glu Ser 770 775
(2) INFORMATION FOR SEQ ID NO: 847:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 310 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 10...279 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 847:
AAAAGGAGA GTG GCG GTG AAA AAA ATC GTT GTG AGT TGG TGT GTG GCG TTG 51 Val Ala Val Lys Lys He Val Val Ser Trp Cys Val Ala Leu 1 5 10
GCT TTT TTA AGC GCG GAT TCA GCA CAA GCC AAT AAA GCG ATC AGT AAT 99 Ala Phe Leu Ser Ala Asp Ser Ala Gin Ala Asn Lys Ala He Ser Asn 15 20 25 30
GCG GAT TTG ATT AAA GAG ATA AGG GAT TTA AAA AAA ATC ATC AGC GCG 147 Ala Asp Leu He Lys Glu He Arg Asp Leu Lys Lys He He Ser Ala 35 40 45
CAA AAC ACT GAG ATT AAC AAC TTA AGA AAA GTG CAA GAA GTG TTG TCT 195 Gin Asn Thr Glu He Asn Asn Leu Arg Lys Val Gin Glu Val Leu Ser 50 55 60
GGG CAA TTA GGG GAC ATG CGT AAG GAT ATA TTA AGC ACT AGA GAT TAT 243 Gly Gin Leu Gly Asp Met Arg Lys Asp He Leu Ser Thr Arg Asp Tyr 65 70 75
TGC ATT AGC TTA AGG CCT TAT ATC TAT AAT TGG CGC TAGGGGATAA TCCAAA 295 Cys He Ser Leu Arg Pro Tyr He Tyr Asn Trp Arg 80 85 90
AAATGAAAGC ATGCG 310
(2) INFORMATION FOR SEQ ID NO: 848:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 90 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 848:
Val Ala Val Lys Lys He Val Val Ser Trp Cys Val Ala Leu Ala Phe
1 5 10 15
Leu Ser Ala Asp Ser Ala Gin Ala Asn Lys Ala He Ser Asn Ala Asp
20 25 30
Leu He Lys Glu He Arg Asp Leu Lys Lys He He Ser Ala Gin Asn
35 40 45
Thr Glu He Asn Asn Leu Arg Lys Val Gin Glu Val Leu Ser Gly Gin
50 55 60
Leu Gly Asp Met Arg Lys Asp He Leu Ser Thr Arg Asp Tyr Cys He 65 70 75 80
Ser Leu Arg Pro Tyr He Tyr Asn Trp Arg 85 90
(2) INFORMATION FOR SEQ ID NO: 849:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1631 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...1569 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 849:
TGAAAAAGAA CTCAAAGAGC TGCAAAAAAA ACAAAAACAC GAGTAACAAC C ATG ATT 57
Met He 1
AAC ACG ATG TTT TGC GCG ACC ATG CAA AGG GGA GTG GCG GAA ATC GTG 105 Asn Thr Met Phe Cys Ala Thr Met Gin Arg Gly Val Ala Glu He Val 5 10 15
GCT GTG GAA GCG ACT TTC ACA AGG GCT TTG CCG GCG TTT GTG ATT TCA 153 Ala Val Glu Ala Thr Phe Thr Arg Ala Leu Pro Ala Phe Val He Ser 20 25 30
GGG TTA GCT AAT AGC TCT ATC CAA GAA GCC AAA CAG CGG GTT CAA TCG 201 Gly Leu Ala Asn Ser Ser He Gin Glu Ala Lys Gin Arg Val Gin Ser 35 40 45 50
GCT TTA CAA AAT AAC GAT TTC ACT TTC CCG CCT TTA AAA ATC ACC ATC 249 Ala Leu Gin Asn Asn Asp Phe Thr Phe Pro Pro Leu Lys He Thr He 55 60 65 AAC CTT TCC CCC TCA GAT TTG CCT AAA TCC GGG AGT CAT TTT GAT TTG 297 Asn Leu Ser Pro Ser Asp Leu Pro Lys Ser Gly Ser His Phe Asp Leu 70 75 80
CCT ATC GCT CTT TTA ATC GCT TTG CAA AAA CAA GAG TTG GCT TTT AAA 345 Pro He Ala Leu Leu He Ala Leu Gin Lys Gin Glu Leu Ala Phe Lys 85 90 95
GAG TGG TTT GCT TTT GGG GAG TTA GGG CTT GAT GGC AAG ATC AAA CCC 393 Glu Trp Phe Ala Phe Gly Glu Leu Gly Leu Asp Gly Lys He Lys Pro 100 105 110
AAT CCT AAC ATT TTC CCC ATG CTT TTA GAC ATT GCC ATT AAA CAC CCC 441 Asn Pro Asn He Phe Pro Met Leu Leu Asp He Ala He Lys His Pro 115 120 125 130
CAT GCT AAG ATC ATT GCG CCT AAG GCC AAT GAA GAG CTT TTT TCG CTT 489 His Ala Lys He He Ala Pro Lys Ala Asn Glu Glu Leu Phe Ser Leu 135 140 145
ATC CCT AAT TTG CAA TGC TTT TTT GTG GGG CAT TTT AAA GAA GCG TTA 537 He Pro Asn Leu Gin Cys Phe Phe Val Gly His Phe Lys Glu Ala Leu 150 155 160
GAA ATC TTG CAA AAC CCT GAA ACC AAA GCA GAC ACC CAC ACG AAA AAA 585 Glu He Leu Gin Asn Pro Glu Thr Lys Ala Asp Thr His Thr Lys Lys 165 170 175
CTA CCC TTT AAA ACG ATA GAA TTG AAC GAT AAA GAG TAT TAT TTT TCA 633 Leu Pro Phe Lys Thr He Glu Leu Asn Asp Lys Glu Tyr Tyr Phe Ser 180 185 190
GAC GCC TAT GCC TTA GAT TTT AAA GAA GTT AAG GGG CAA GCT GTC GCT 681 Asp Ala Tyr Ala Leu Asp Phe Lys Glu Val Lys Gly Gin Ala Val Ala 195 200 205 210
AAA GAG GCC GCT TTG ATC GCT AGC GCT GGG TTT CAT AAC TTG ATT TTA 729 Lys Glu Ala Ala Leu He Ala Ser Ala Gly Phe His Asn Leu He Leu 215 220 225
GAG GGA AGT CCA GGG TGT GGG AAA AGC ATG ATC ATT AAT CGC ATG CGT 777 Glu Gly Ser Pro Gly Cys Gly Lys Ser Met He He Asn Arg Met Arg 230 235 240
TAT ATC TTG CCT CCA TTA AGC CTG AAT GAA ATC CTA GAA GCG ACA AAA 825 Tyr He Leu Pro Pro Leu Ser Leu Asn Glu He Leu Glu Ala Thr Lys 245 250 255
TTA CGC ATT TTA AGC GAG CAA GAC AGT GCC TAT TAC CCC TTA AGG AGT 873 Leu Arg He Leu Ser Glu Gin Asp Ser Ala Tyr Tyr Pro Leu Arg Ser 260 265 270
TTT AGA AAC CCT CAC CAA AGC GCT TCA AAA TCC AGT ATT TTA GGC TCA 921 Phe Arg Asn Pro His Gin Ser Ala Ser Lys Ser Ser He Leu Gly Ser 275 280 285 290 AGC TCT CTA AGA GAG CCA AAA CCT GGC GAA ATC GCG CTA GCG CAT AAC 969 Ser Ser Leu Arg Glu Pro Lys Pro Gly Glu He Ala Leu Ala His Asn 295 300 305
GGC ATG CTT TTT TTT GAT GAA TTG CCT CAT TTT AAA AAG GAT ATT TTG 1017 Gly Met Leu Phe Phe Asp Glu Leu Pro His Phe Lys Lys Asp He Leu 310 315 320
GAA GCT TTA AGA GAG CCT TTA GAA AAC AAT AAA TTG GTG GTT TCA AGA 1065 Glu Ala Leu Arg Glu Pro Leu Glu Asn Asn Lys Leu Val Val Ser Arg 325 330 335
GTG CAT AGC AAA ATT GAA TAC GAA ACC TCT TTT TTA TTT GTA GGG GCT 1113 Val His Ser Lys He Glu Tyr Glu Thr Ser Phe Leu Phe Val Gly Ala 340 345 350
CAA AAC CCT TGC TTG TGC GGG AAT TTA CTC AGC GCG ACC AAA GCA TGC 1161 Gin Asn Pro Cys Leu Cys Gly Asn Leu Leu Ser Ala Thr Lys Ala Cys 355 360 365 370
CGT TGC CAA GAC AGA GAA ATC ACG CAG TAT AAA AAC CGC TTG AGC GAG 1209 Arg Cys Gin Asp Arg Glu He Thr Gin Tyr Lys Asn Arg Leu Ser Glu 375 380 385
CCT TTT TTG GAT AGG ATT GAT TTG TTT GTG CAA ATG GAA GAG GGG AAT 1257 Pro Phe Leu Asp Arg He Asp Leu Phe Val Gin Met Glu Glu Gly Asn 390 395 400
TAT AAA GAC ACG CCG TCG CAT TCT TGG ACT TCA AAA GAG ATG CAT GAA 1305 Tyr Lys Asp Thr Pro Ser His Ser Trp Thr Ser Lys Glu Met His Glu 405 410 415
TTG GTG TTA TTA GCT TTC AAG CAG CAA AAG TTA AGG AAA CAG AGC GTT 1353 Leu Val Leu Leu Ala Phe Lys Gin Gin Lys Leu Arg Lys Gin Ser Val 420 425 430
TTT AAT GGT AAG CTT AAT GAA GAG CAG ATA GAA CGA TTT TGC CCC TTA 1401 Phe Asn Gly Lys Leu Asn Glu Glu Gin He Glu Arg Phe Cys Pro Leu 435 440 445 450
AAC GCT GAA GCA AAA AAG TTG TTG GAG CAG GCG GTT GAA AGG TTT AAT 1449 Asn Ala Glu Ala Lys Lys Leu Leu Glu Gin Ala Val Glu Arg Phe Asn 455 460 465
CTC TCC ATG CGC TCT ATT AAT AAG GTC AAA AAA GTC GCT AGG ACG ATT 1497 Leu Ser Met Arg Ser He Asn Lys Val Lys Lys Val Ala Arg Thr He 470 475 480
GCG GAT TTA AAC GCT TGC GAG GAT ATA GAA AAA TCT CAC ATG CTT AAA 1545 Ala Asp Leu Asn Ala Cys Glu Asp He Glu Lys Ser His Met Leu Lys 485 490 495
GCG CTG AGT TTT AGA AAG ATT TCT TAAAAGGATT TTTATAAGGG AGAAAAAATG 1599 Ala Leu Ser Phe Arg Lys He Ser 500 505 CAAGAATACC ACATTCATAA TTTGGATTGC CC 1631
(2) TNFORMATION FOR SEQ ID NO: 850:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 506 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 850:
Met He Asn Thr Met Phe Cys Ala Thr Met Gin Arg Gly Val Ala Glu
1 5 10 15
He Val Ala Val Glu Ala Thr Phe Thr Arg Ala Leu Pro Ala Phe Val
20 25 30
He Ser Gly Leu Ala Asn Ser Ser He Gin Glu Ala Lys Gin Arg Val
35 40 45
Gin Ser Ala Leu Gin Asn Asn Asp Phe Thr Phe Pro Pro Leu Lys He
50 55 60
Thr He Asn Leu Ser Pro Ser Asp Leu Pro Lys Ser Gly Ser His Phe 65 70 75 80
Asp Leu Pro He Ala Leu Leu He Ala Leu Gin Lys Gin Glu Leu Ala
85 90 95
Phe Lys Glu Trp Phe Ala Phe Gly Glu Leu Gly Leu Asp Gly Lys He
100 105 110
Lys Pro Asn Pro Asn He Phe Pro Met Leu Leu Asp He Ala He Lys
115 120 125
His Pro His Ala Lys He He Ala Pro Lys Ala Asn Glu Glu Leu Phe
130 135 140
Ser Leu He Pro Asn Leu Gin Cys Phe Phe Val Gly His Phe Lys Glu 145 150 155 160
Ala Leu Glu He Leu Gin Asn Pro Glu Thr Lys Ala Asp Thr His Thr
165 170 175
Lys Lys Leu Pro Phe Lys Thr He Glu Leu Asn Asp Lys Glu Tyr Tyr
180 185 190
Phe Ser Asp Ala Tyr Ala Leu Asp Phe Lys Glu Val Lys Gly Gin Ala
195 200 205
Val Ala Lys Glu Ala Ala Leu He Ala Ser Ala Gly Phe His Asn Leu
210 215 220
He Leu Glu Gly Ser Pro Gly Cys Gly Lys Ser Met He He Asn Arg 225 230 235 240
Met Arg Tyr He Leu Pro Pro Leu Ser Leu Asn Glu He Leu Glu Ala
245 250 255
Thr Lys Leu Arg He Leu Ser Glu Gin Asp Ser Ala Tyr Tyr Pro Leu
260 265 270
Arg Ser Phe Arg Asn Pro His Gin Ser Ala Ser Lys Ser Ser He Leu
275 280 285
Gly Ser Ser Ser Leu Arg Glu Pro Lys Pro Gly Glu He Ala Leu Ala
290 295 300
His Asn Gly Met Leu Phe Phe Asp Glu Leu Pro His Phe Lys Lys Asp 305 310 315 320
He Leu Glu Ala Leu Arg Glu Pro Leu Glu Asn Asn Lys Leu Val Val 325 330 335
Ser Arg Val His Ser Lys He Glu Tyr Glu Thr Ser Phe Leu Phe Val
340 345 350
Gly Ala Gin Asn Pro Cys Leu Cys Gly Asn Leu Leu Ser Ala Thr Lys
355 360 365
Ala Cys Arg Cys Gin Asp Arg Glu He Thr Gin Tyr Lys Asn Arg Leu
370 375 380
Ser Glu Pro Phe Leu Asp Arg He Asp Leu Phe Val Gin Met Glu Glu 385 390 395 400
Gly Asn Tyr Lys Asp Thr Pro Ser His Ser Trp Thr Ser Lys Glu Met
405 410 415
His Glu Leu Val Leu Leu Ala Phe Lys Gin Gin Lys Leu Arg Lys Gin
420 425 430
Ser Val Phe Asn Gly Lys Leu Asn Glu Glu Gin He Glu Arg Phe Cys
435 440 445
Pro Leu Asn Ala Glu Ala Lys Lys Leu Leu Glu Gin Ala Val Glu Arg
450 455 460
Phe Asn Leu Ser Met Arg Ser He Asn Lys Val Lys Lys Val Ala Arg 465 470 475 480
Thr He Ala Asp Leu Asn Ala Cys Glu Asp He Glu Lys Ser His Met
485 490 495
Leu Lys Ala Leu Ser Phe Arg Lys He Ser 500 505
(2) INFORMATION FOR SEQ ID NO: 851:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 605 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 26...547 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 851:
TTACAGAAAA ACGTGAAGTG ATTGC ATG GCG TTA TTA GAG ATT ATC CAT TAC 52
Met Ala Leu Leu Glu He He His Tyr
1 5
CCT TCT AAA ATC TTA AGA ACG ATT TCT AAA GAG GTC GTT TCT TTT GAT 100 Pro Ser Lys He Leu Arg Thr He Ser Lys Glu Val Val Ser Phe Asp 10 15 20 25
TCA AAA CTC CAC CAA CAG CTA GAT GAC ATG CAT GAG ACT ATG ATC GCT 148 Ser Lys Leu His Gin Gin Leu Asp Asp Met His Glu Thr Met He Ala 30 35 40 AGT GAG GGG ATA GGG TTA GCC GCT ATT CAA GTG GGT TTG CCT TTA AGA 196 Ser Glu Gly He Gly Leu Ala Ala He Gin Val Gly Leu Pro Leu Arg 45 50 55
ATG CTC ATC ATC AAC CTC CCG CAA GAA GAC GGC GTG CAA CAC AAA GAA 244 Met Leu He He Asn Leu Pro Gin Glu Asp Gly Val Gin His Lys Glu 60 65 70
GAC TGC TTG GAA ATC ATT AAC CCT AAG TTT ATA GAA ACT GGG GGA TCA 292 Asp Cys Leu Glu He He Asn Pro Lys Phe He Glu Thr Gly Gly Ser 75 80 85
ATG ATG TAT AGA GAA GGG TGC TTG TCT GTG CCG GGA TTT TAC GAA GAA 340 Met Met Tyr Arg Glu Gly Cys Leu Ser Val Pro Gly Phe Tyr Glu Glu 90 95 100 105
GTG GAG CGT TTT GAA AAG GTT AAG ATA GAG TAT CAA AAC CGC TTC GCT 388 Val Glu Arg Phe Glu Lys Val Lys He Glu Tyr Gin Asn Arg Phe Ala 110 115 120
GAA GTG AAA GTT TTA GAA GCG AGC GAG CTT TTA GCG GTA GCC ATT CAG 436 Glu Val Lys Val Leu Glu Ala Ser Glu Leu Leu Ala Val Ala He Gin 125 130 135
CAT GAG ATC GAT CAC CTC AAT GGC GTG TTA TTC GTG GAT AAA TTA TCC 484 His Glu He Asp His Leu Asn Gly Val Leu Phe Val Asp Lys Leu Ser 140 145 150
ATT TTG AAG CGT AAG AAA TTT GAA AAA GAA CTC AAA GAG CTG CAA AAA 532 He Leu Lys Arg Lys Lys Phe Glu Lys Glu Leu Lys Glu Leu Gin Lys 155 160 165
AAA CAA AAA CAC GAG TAACAACCAT GATTAACACG ATGTTTTGCG CGACCATGCA A 588
Lys Gin Lys His Glu
170
AGGGGAGTGG CGGAAAT 605
(2) INFORMATION FOR SEQ ID NO: 852:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 174 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 852:
Met Ala Leu Leu Glu He He His Tyr Pro Ser Lys He Leu Arg Thr
1 5 10 15
He Ser Lys Glu Val Val Ser Phe Asp Ser Lys Leu His Gin Gin Leu
20 25 30
Asp Asp Met His Glu Thr Met He Ala Ser Glu Gly He Gly Leu Ala 35 40 45
Ala He Gin Val Gly Leu Pro Leu Arg Met Leu He He Asn Leu Pro
50 55 60
Gin Glu Asp Gly Val Gin His Lys Glu Asp Cys Leu Glu He He Asn 65 70 75 80
Pro Lys Phe He Glu Thr Gly Gly Ser Met Met Tyr Arg Glu Gly Cys
85 90 95
Leu Ser Val Pro Gly Phe Tyr Glu Glu Val Glu Arg Phe Glu Lys Val
100 105 110
Lys He Glu Tyr Gin Asn Arg Phe Ala Glu Val Lys Val Leu Glu Ala
115 120 125
Ser Glu Leu Leu Ala Val Ala He Gin His Glu He Asp His Leu Asn
130 135 140
Gly Val Leu Phe Val Asp Lys Leu Ser He Leu Lys Arg Lys Lys Phe 145 150 155 160
Glu Lys Glu Leu Lys Glu Leu Gin Lys Lys Gin Lys His Glu 165 170
(2) INFORMATION FOR SEQ ID NO: 853:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 564 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...495 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 853:
CCCCAAACAA TAGGATAAAA A ATG CCG CTC ACT CAT TTG AAT GAA GAA AAT 51
Met Pro Leu Thr His Leu Asn Glu Glu Asn 1 5 10
CAG CCT AAA ATG GTG GAT ATA GGG GAT AAA GAA ACC ACT GAA AGA ATC 99 Gin Pro Lys Met Val Asp He Gly Asp Lys Glu Thr Thr Glu Arg He 15 20 25
GCT TTA GCA AGC GGT CGT ATC AGC ATG AAT AAA GAG GCT TAT GAC GCT 147 Ala Leu Ala Ser Gly Arg He Ser Met Asn Lys Glu Ala Tyr Asp Ala 30 35 40
ATT ATC AAT CAT TGC GTC AAA AAG GGT CCG GTG TTA CAG ACT GCT ATT 195 He He Asn His Cys Val Lys Lys Gly Pro Val Leu Gin Thr Ala He 45 50 55
ATT GCT GGA ATT ATG GGG GCT AAA AAG ACA AGC GAG CTC ATT CCC ATG 243 He Ala Gly He Met Gly Ala Lys Lys Thr Ser Glu Leu He Pro Met 60 65 70
TGC CAT CCA ATC ATG CTC AAT GGG GTG GAT ATT GAT ATT TTA GAA GAA 291 Cys His Pro He Met Leu Asn Gly Val Asp He Asp He Leu Glu Glu 75 80 85 90
AAA GAG ACT TGT AGT TTT AAA CTC TAT GCG AGA GTC AAA ACT CAA GCT 339 Lys Glu Thr Cys Ser Phe Lys Leu Tyr Ala Arg Val Lys Thr Gin Ala 95 100 105
AAA ACG GGC GTA GAA ATG GAA GCG CTA ATG AGT GTG AGC ATA GGG CTT 387 Lys Thr Gly Val Glu Met Glu Ala Leu Met Ser Val Ser He Gly Leu 110 115 120
TTA ACC ATT TAT GAC ATG GTG AAA GCC ATT GAC AAG AGC ATG ACA ATT 435 Leu Thr He Tyr Asp Met Val Lys Ala He Asp Lys Ser Met Thr He 125 130 135
AGC GGT GTG ATG TTG GAG CAT AAA AGT GGA GGC AAA AGT GGG GAT TAT 483 Ser Gly Val Met Leu Glu His Lys Ser Gly Gly Lys Ser Gly Asp Tyr 140 145 150
AAC GCT AAA AAA TAGAAAAAGA CCAATAATCT AAAGATGTTA GGGTAAAATA ACATT 540
Asn Ala Lys Lys
155
TTGACAACAA AAGCGTGTTG GTTG 564
(2) INFORMATION FOR SEQ ID NO: 854:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 158 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 854:
Met Pro Leu Thr His Leu Asn Glu Glu Asn Gin Pro Lys Met Val Asp
1 5 10 15
He Gly Asp Lys Glu Thr Thr Glu Arg He Ala Leu Ala Ser Gly Arg
20 25 30
He Ser Met Asn Lys Glu Ala Tyr Asp Ala He He Asn His Cys Val
35 40 45
Lys Lys Gly Pro Val Leu Gin Thr Ala He He Ala Gly He Met Gly
50 55 60
Ala Lys Lys Thr Ser Glu Leu He Pro Met Cys His Pro He Met Leu 65 70 75 80
Asn Gly Val Asp He Asp He Leu Glu Glu Lys Glu Thr Cys Ser Phe
85 90 95
Lys Leu Tyr Ala Arg Val Lys Thr Gin Ala Lys Thr Gly Val Glu Met
100 105 110
Glu Ala Leu Met Ser Val Ser He Gly Leu Leu Thr He Tyr Asp Met 115 120 125
Val Lys Ala He Asp Lys Ser Met Thr He Ser Gly Val Met Leu Glu
130 135 140
His Lys Ser Gly Gly Lys Ser Gly Asp Tyr Asn Ala Lys Lys 145 150 155
(2) INFORMATION FOR SEQ ID NO: 855:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 605 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25...552 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 855:
CTTTTAGCTT AAAAAGGAGT TCAA ATG CAA ACG ATT CAT ATA GGC GTT TTG 51
Met Gin Thr He His He Gly Val Leu 1 5
AGC GCG AGC GAT AGA GCG TCA AAA GGG ATT TAT GAA GAT TTA AGC GGT 99 Ser Ala Ser Asp Arg Ala Ser Lys Gly He Tyr Glu Asp Leu Ser Gly 10 15 20 25
AAG GCG ATA CAA GAA GTG TTG AGC GAA TAC TTG CTC AAT CCT TTA GAA 147 Lys Ala He Gin Glu Val Leu Ser Glu Tyr Leu Leu Asn Pro Leu Glu 30 35 40
TTT TAT TAC GAA ATT GTC GCT GAT GAA AGG GAT TTA ATT GAA AAA TCA 195 Phe Tyr Tyr Glu He Val Ala Asp Glu Arg Asp Leu He Glu Lys Ser 45 50 55
CTG ATT AAA ATG TGC GAT GAA TAC CAA TGC GAT CTA GTC GTT ACT ACA 243 Leu He Lys Met Cys Asp Glu Tyr Gin Cys Asp Leu Val Val Thr Thr 60 65 70
GGA GGC ACA GGC CCT GCT TTA AGA GAT ATA ACC CCA GAA GCC ACA GAA 291 Gly Gly Thr Gly Pro Ala Leu Arg Asp He Thr Pro Glu Ala Thr Glu 75 80 85
AAA GTG TGC CAA AAA ATG CTT CCT GGT TTT GGA GAG CTT ATG CGA ATG 339 Lys Val Cys Gin Lys Met Leu Pro Gly Phe Gly Glu Leu Met Arg Met 90 95 100 105
ACT AGT TTA AAA TAT GTG CCT ACA GCG ATC CTG TCG CGC CAG AGC GCT 387 Thr Ser Leu Lys Tyr Val Pro Thr Ala He Leu Ser Arg Gin Ser Ala 110 115 120
GGT ATT AGG AAT AAG AGT TTG ATT ATT AAT CTC CCT GGT AAG CCA AAA 435 Gly He Arg Asn Lys Ser Leu He He Asn Leu Pro Gly Lys Pro Lys 125 130 135
AGT ATT AGA GAA TGC TTA GAG GCG GTT TTT CCA GCG ATT CCT TAT TGC 483 Ser He Arg Glu Cys Leu Glu Ala Val Phe Pro Ala He Pro Tyr Cys 140 145 150
GTG GAT TTG ATT TTA GGG AAT TAT ATG CAA GTG AAT GAA AAA AAC ATT 531 Val Asp Leu He Leu Gly Asn Tyr Met Gin Val Asn Glu Lys Asn He 155 160 165
CAA GCG TTT CGC CCC AAA CAA TAGGATAAAA AATGCCGCTC ACTCATTTGA ATGA 586 Gin Ala Phe Arg Pro Lys Gin 170 175
AGAAAATCAG CCTAAAATG 605
(2) INFORMATION FOR SEQ ID NO: 856:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 176 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 856:
Met Gin Thr He His He Gly Val Leu Ser Ala Ser Asp Arg Ala Ser
1 5 10 15
Lys Gly He Tyr Glu Asp Leu Ser Gly Lys Ala He Gin Glu Val Leu
20 25 30
Ser Glu Tyr Leu Leu Asn Pro Leu Glu Phe Tyr Tyr Glu He Val Ala
35 40 45
Asp Glu Arg Asp Leu He Glu Lys Ser Leu He Lys Met Cys Asp Glu
50 55 60
Tyr Gin Cys Asp Leu Val Val Thr Thr Gly Gly Thr Gly Pro Ala Leu 65 70 75 80
Arg Asp He Thr Pro Glu Ala Thr Glu Lys Val Cys Gin Lys Met Leu
85 90 95
Pro Gly Phe Gly Glu Leu Met Arg Met Thr Ser Leu Lys Tyr Val Pro
100 105 110
Thr Ala He Leu Ser Arg Gin Ser Ala Gly He Arg Asn Lys Ser Leu
115 120 125
He He Asn Leu Pro Gly Lys Pro Lys Ser He Arg Glu Cys Leu Glu
130 135 140
Ala Val Phe Pro Ala He Pro Tyr Cys Val Asp Leu He Leu Gly Asn 145 150 155 160
Tyr Met Gin Val Asn Glu Lys Asn He Gin Ala Phe Arg Pro Lys Gin 165 170 175 (2) INFORMATION FOR SEQ ID NO: 857:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 659 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...630 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 857:
GCAATTACTA GAAGAATACA CCCACAAATT ATG CCA AAT CAT CAG CCA GTA AAA 54
Met Pro Asn His Gin Pro Val Lys 1 5
AAA TTT AAG ATT ATT GGG GGG GCT TGT AAG GGA TTA GGC TTG AAT TTG 102 Lys Phe Lys He He Gly Gly Ala Cys Lys Gly Leu Gly Leu Asn Leu 10 15 20
CCT AAC ATT TCT AGC ACG CGC CCC ACC AAA GCG ATC GTA AGA GAG TCG 150 Pro Asn He Ser Ser Thr Arg Pro Thr Lys Ala He Val Arg Glu Ser 25 30 35 40
TTT TTT AAC ACC TTG CAA GCA GAA ATT AAT GGA GCG CAT TTT ATA GAA 198 Phe Phe Asn Thr Leu Gin Ala Glu He Asn Gly Ala His Phe He Glu 45 50 55
GTG TTT TCA GGC AGC GCT TCT ATG GGT TTA GAG GCT TTG AGT AGG GGG 246 Val Phe Ser Gly Ser Ala Ser Met Gly Leu Glu Ala Leu Ser Arg Gly 60 65 70
GCT AAA AGT GCG GTG TTT TTT GAA CAA AAC AAA AGC GCT TAT AAG ACG 294 Ala Lys Ser Ala Val Phe Phe Glu Gin Asn Lys Ser Ala Tyr Lys Thr 75 80 85
CTT TTA GAA AAT ATT TCC CTT TTT AAA AAC CGC TTG AAA AAA GAA ATG 342 Leu Leu Glu Asn He Ser Leu Phe Lys Asn Arg Leu Lys Lys Glu Met 90 95 100
GAA ATT CAA ACC TTT TTA GAT GAC GCT TTC AAG CTT TTG CCC ACG CTG 390 Glu He Gin Thr Phe Leu Asp Asp Ala Phe Lys Leu Leu Pro Thr Leu 105 110 115 120
TGT TTA AAA AAT GGC GTT TTG AAT ATT ATT TAT TTG GAT CCT CCT TTT 438 Cys Leu Lys Asn Gly Val Leu Asn He He Tyr Leu Asp Pro Pro Phe 125 130 135 GAA ACA AGT GGG TTT TTA GGG ATT TAT GAA AAG TGT TTT CAA GCT TTA 486 Glu Thr Ser Gly Phe Leu Gly He Tyr Glu Lys Cys Phe Gin Ala Leu 140 145 150
GAA AGG TTA TTG AAA CGC TTT AAT CCA AAA AAT CTT TTA GTG GTT TTT 534 Glu Arg Leu Leu Lys Arg Phe Asn Pro Lys Asn Leu Leu Val Val Phe 155 160 165
GAG CAT GAA AGC ATG CAT GAA ATG CCT AAA AGT CTT GTA ACT TTA GCT 582 Glu His Glu Ser Met His Glu Met Pro Lys Ser Leu Val Thr Leu Ala 170 175 180
ATA ATC AAA CAA AAA AAA TTT GGA AAA ACC ACT TTA ACT TAT TTT CAA T 631 He He Lys Gin Lys Lys Phe Gly Lys Thr Thr Leu Thr Tyr Phe Gin 185 190 195 200
AGGAATAGGC ATGGCAGAAG AACAAGAA 659
(2) INFORMATION FOR SEQ ID NO: 858:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 200 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 858:
Met Pro Asn His Gin Pro Val Lys Lys Phe Lys He He Gly Gly Ala
1 5 10 15
Cys Lys Gly Leu Gly Leu Asn Leu Pro Asn He Ser Ser Thr Arg Pro
20 25 30
Thr Lys Ala He Val Arg Glu Ser Phe Phe Asn Thr Leu Gin Ala Glu
35 40 45
He Asn Gly Ala His Phe He Glu Val Phe Ser Gly Ser Ala Ser Met
50 55 60
Gly Leu Glu Ala Leu Ser Arg Gly Ala Lys Ser Ala Val Phe Phe Glu 65 70 75 80
Gin Asn Lys Ser Ala Tyr Lys Thr Leu Leu Glu Asn He Ser Leu Phe
85 90 95
Lys Asn Arg Leu Lys Lys Glu Met Glu He Gin Thr Phe Leu Asp Asp
100 105 110
Ala Phe Lys Leu Leu Pro Thr Leu Cys Leu Lys Asn Gly Val Leu Asn
115 120 125
He He Tyr Leu Asp Pro Pro Phe Glu Thr Ser Gly Phe Leu Gly He
130 135 140
Tyr Glu Lys Cys Phe Gin Ala Leu Glu Arg Leu Leu Lys Arg Phe Asn 145 150 155 160
Pro Lys Asn Leu Leu Val Val Phe Glu His Glu Ser Met His Glu Met
165 170 175
Pro Lys Ser Leu Val Thr Leu Ala He He Lys Gin Lys Lys Phe Gly
180 185 190
Lys Thr Thr Leu Thr Tyr Phe Gin 195 200
(2) INFORMATION FOR SEQ ID NO: 859:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 695 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 26...655 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 859:
CGTGCCTACT TCGCTTTTTG CCGCC ATG ACA GGC ACA CAT GCG CGT TAC GTG 52
Met Thr Gly Thr His Ala Arg Tyr Val 1 5
AAA GCC GCT TAC AAA GAG ATA AAA ATT GTT TTT TTA AAC CCT AAA ATC 100 Lys Ala Ala Tyr Lys Glu He Lys He Val Phe Leu Asn Pro Lys He 10 15 20 25
AAT TTA AAC GAA ACC ATC AAA AAT TTA GTG GAA TTA GCC ACT CTG GCT 148 Asn Leu Asn Glu Thr He Lys Asn Leu Val Glu Leu Ala Thr Leu Ala 30 35 40
AGA AAA GAT GGG GTG TTG AGT TTA GAG GGG CGA GTG GCG CAA ATT GAA 196 Arg Lys Asp Gly Val Leu Ser Leu Glu Gly Arg Val Ala Gin He Glu 45 50 55
GAC GAT TTC ACC CGT AAT GGC TTG TCT ATG ATC ATA GAT GGC AAG GAT 244 Asp Asp Phe Thr Arg Asn Gly Leu Ser Met He He Asp Gly Lys Asp 60 65 70
TTA AAA TCC GTT AAG GAA AGC TTA GAA ATC AGC ATT GAA GAA ATG GAA 292 Leu Lys Ser Val Lys Glu Ser Leu Glu He Ser He Glu Glu Met Glu 75 80 85
GAG TAT TAC CAC GGC GCC GCT CAT TAT TGG GAG ACG GCC GGT GAG ACC 340 Glu Tyr Tyr His Gly Ala Ala His Tyr Trp Glu Thr Ala Gly Glu Thr 90 95 100 105
GCT CCT ACT ATG GGG TTA GTG GGG GCG GTT ATG GGG CTT ATG TTA GCC 388 Ala Pro Thr Met Gly Leu Val Gly Ala Val Met Gly Leu Met Leu Ala 110 115 120
TTG CAA AAA CTA GAC AAC CCG GCT GAA ATG GCA GCA GGG ATC GCT GGG 436 Leu Gin Lys Leu Asp Asn Pro Ala Glu Met Ala Ala Gly He Ala Gly 125 130 135
GCT TTT ACG GCT ACT GTT ACA GGG ATT ATG TGT TCT TAT GCG ATT TTT 484 Ala Phe Thr Ala Thr Val Thr Gly He Met Cys Ser Tyr Ala He Phe 140 145 150
GGC CCT TTT GGG CAT AAG CTC AAA GCT AAG TCT AAA GAC ATT ATC AAA 532 Gly Pro Phe Gly His Lys Leu Lys Ala Lys Ser Lys Asp He He Lys 155 160 165
GAA AAA ACC GTT CTT TTA GAG GGG ATT TTA GGC ATC GCT AAT GGG GAA 580 Glu Lys Thr Val Leu Leu Glu Gly He Leu Gly He Ala Asn Gly Glu 170 175 180 185
AAC CCA AGG GAT TTA GAA AAC AAA CTC TTA AAC TAC ATC GCT CCC GGT 628 Asn Pro Arg Asp Leu Glu Asn Lys Leu Leu Asn Tyr He Ala Pro Gly 190 195 200
GAA CCT AAA AAA TCT CAA TTT GAG GGC TAAAGATGGC TAAGAAAAAC AMACCCA 682 Glu Pro Lys Lys Ser Gin Phe Glu Gly 205 210
CCGAATGCCC CGC 695
(2) INFORMATION FOR SEQ ID NO: 860:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 210 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 860:
Met Thr Gly Thr His Ala Arg Tyr Val Lys Ala Ala Tyr Lys Glu He
1 5 10 15
Lys He Val Phe Leu Asn Pro Lys He Asn Leu Asn Glu Thr He Lys
20 25 30
Asn Leu Val Glu Leu Ala Thr Leu Ala Arg Lys Asp Gly Val Leu Ser
35 40 45
Leu Glu Gly Arg Val Ala Gin He Glu Asp Asp Phe Thr Arg Asn Gly
50 55 60
Leu Ser Met He He Asp Gly Lys Asp Leu Lys Ser Val Lys Glu Ser 65 70 75 80
Leu Glu He Ser He Glu Glu Met Glu Glu Tyr Tyr His Gly Ala Ala
85 90 95
His Tyr Trp Glu Thr Ala Gly Glu Thr Ala Pro Thr Met Gly Leu Val
100 105 110
Gly Ala Val Met Gly Leu Met Leu Ala Leu Gin Lys Leu Asp Asn Pro
115 120 125
Ala Glu Met Ala Ala Gly He Ala Gly Ala Phe Thr Ala Thr Val Thr
130 135 140
Gly He Met Cys Ser Tyr Ala He Phe Gly Pro Phe Gly His Lys Leu 145 150 155 160
Lys Ala Lys Ser Lys Asp He He Lys Glu Lys Thr Val Leu Leu Glu
165 170 175
Gly He Leu Gly He Ala Asn Gly Glu Asn Pro Arg Asp Leu Glu Asn
180 185 190
Lys Leu Leu Asn Tyr He Ala Pro Gly Glu Pro Lys Lys Ser Gin Phe
195 200 205
Glu Gly 210
(2) INFORMATION FOR SEQ ID NO: 861:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 810 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 13...783 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 861:
TGAGGGCTAA AG ATG GCT AAG AAA AAC AMA CCC ACC GAA TGC CCC GCC GGT 51 Met Ala Lys Lys Asn Xaa Pro Thr Glu Cys Pro Ala Gly 1 5 10
GAA AAA TGG GCG GTT CCT TAT GCG GAC TTT TTG TCG TTG TTG CTC GCG 99 Glu Lys Trp Ala Val Pro Tyr Ala Asp Phe Leu Ser Leu Leu Leu Ala 15 20 25
CTT TTT ATC GCT CTT TAT GCC ATT TCA GCG GTC AAC AAA TCC AAA GTG 147 Leu Phe He Ala Leu Tyr Ala He Ser Ala Val Asn Lys Ser Lys Val 30 35 40 45
GAA GCC TTA AAA ACC GAA TTT ATT AAG ATT TTT AAT TAC GCT CCC AAG 195 Glu Ala Leu Lys Thr Glu Phe He Lys He Phe Asn Tyr Ala Pro Lys 50 55 60
CCA GAG GCG ATG CAG CCG GTT GTA GTG ATC CCG CCT GAT TCA GGG AAA 243 Pro Glu Ala Met Gin Pro Val Val Val He Pro Pro Asp Ser Gly Lys 65 70 75
GAA GAA GAA CAA ATG GCG AGC GAA AGC TCC AAA CCG GCT TCG CAA AAT 291 Glu Glu Glu Gin Met Ala Ser Glu Ser Ser Lys Pro Ala Ser Gin Asn 80 85 90
ACC GAA ACA AAA GCC ACT ATC GCT CGC AAA GGC GAA GGC AGT GTT TTA 339 Thr Glu Thr Lys Ala Thr He Ala Arg Lys Gly Glu Gly Ser Val Leu 95 100 105
GAG CAA ATT GAT CAA GGC TCT ATC TTA AAG CTC CCC TCT AAT TTG CTG 387 Glu Gin He Asp Gin Gly Ser He Leu Lys Leu Pro Ser Asn Leu Leu 110 115 120 125
TTT GAA AAC GCT ACT TCA GAC GCT ATC AAT CAA GAC ATG ATG CTT TAT 435 Phe Glu Asn Ala Thr Ser Asp Ala He Asn Gin Asp Met Met Leu Tyr 130 135 140
ATT GAA CGG ATC GCT AAA ATC ATT CAA AAA CTC CCT AAA AGG GTG CAT 483 He Glu Arg He Ala Lys He He Gin Lys Leu Pro Lys Arg Val His 145 150 155
ATT AAT GTG AGA GGC TTT ACG GAT GAT ACG CCT TTA GTT AAA ACC CGT 531 He Asn Val Arg Gly Phe Thr Asp Asp Thr Pro Leu Val Lys Thr Arg 160 165 170
TTT AAA AGC CAT TAT GAA TTA GCC GCC AAT CGC GCT TAT AGG GTG ATG 579 Phe Lys Ser His Tyr Glu Leu Ala Ala Asn Arg Ala Tyr Arg Val Met 175 180 185
AAA GTC CTT ATA CAA TAC GGC GTA AAT CCT AAC CAA TTG TCT TTT TCT 627 Lys Val Leu He Gin Tyr Gly Val Asn Pro Asn Gin Leu Ser Phe Ser 190 195 200 205
TCT TAC GGC TCT ACC AAC CCT ATC GCG CCT AAC GAC TCC CTA GAG AAC 675 Ser Tyr Gly Ser Thr Asn Pro He Ala Pro Asn Asp Ser Leu Glu Asn 210 215 220
AGA ATG AAA AAC AAT CGT GTG GAA ATC TTT TTT TCA ACC GAT GCG AAC 723 Arg Met Lys Asn Asn Arg Val Glu He Phe Phe Ser Thr Asp Ala Asn 225 230 235
GAT TTG AGT AAA ATT CAT TCT ATT TTA GAT AAT GAG TTC AAT CCC CAC 771 Asp Leu Ser Lys He His Ser He Leu Asp Asn Glu Phe Asn Pro His 240 245 250
AAA CAG CAA GAA TGAATCGCAT GAATAAAAAT TATCTTT 810
Lys Gin Gin Glu 255
(2) INFORMATION FOR SEQ ID NO: 862:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 257 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 862 Met Ala Lys Lys Asn Xaa Pro Thr Glu Cys Pro Ala Gly Glu Lys Trp
1 5 10 15
Ala Val Pro Tyr Ala Asp Phe Leu Ser Leu Leu Leu Ala Leu Phe He
20 25 30
Ala Leu Tyr Ala He Ser Ala Val Asn Lys Ser Lys Val Glu Ala Leu
35 40 45
Lys Thr Glu Phe He Lys He Phe Asn Tyr Ala Pro Lys Pro Glu Ala
50 55 60
Met Gin Pro Val Val Val He Pro Pro Asp Ser Gly Lys Glu Glu Glu 65 70 75 80
Gin Met Ala Ser Glu Ser Ser Lys Pro Ala Ser Gin Asn Thr Glu Thr
85 90 95
Lys Ala Thr He Ala Arg Lys Gly Glu Gly Ser Val Leu Glu Gin He
100 105 110
Asp Gin Gly Ser He Leu Lys Leu Pro Ser Asn Leu Leu Phe Glu Asn
115 120 125
Ala Thr Ser Asp Ala He Asn Gin Asp Met Met Leu Tyr He Glu Arg
130 135 140
He Ala Lys He He Gin Lys Leu Pro Lys Arg Val His He Asn Val 145 150 155 160
Arg Gly Phe Thr Asp Asp Thr Pro Leu Val Lys Thr Arg Phe Lys Ser
165 170 175
His Tyr Glu Leu Ala Ala Asn Arg Ala Tyr Arg Val Met Lys Val Leu
180 185 190
He Gin Tyr Gly Val Asn Pro Asn Gin Leu Ser Phe Ser Ser Tyr Gly
195 200 205
Ser Thr Asn Pro He Ala Pro Asn Asp Ser Leu Glu Asn Arg Met Lys
210 215 220
Asn Asn Arg Val Glu He Phe Phe Ser Thr Asp Ala Asn Asp Leu Ser 225 230 235 240
Lys He His Ser He Leu Asp Asn Glu Phe Asn Pro His Lys Gin Gin
245 250 255
Glu
(2) INFORMATION FOR SEQ ID NO: 863
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 549 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...474 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 863: TGTTAAGATC AGTTT ATG GAA CAA AAT ATT TTC TCC TTA CTC ATT CAA AAA 51 Met Glu Gin Asn He Phe Ser Leu Leu He Gin Lys 1 5 10
AAG TCT TAT AAA AAG CTT GAA ACC CTT TTG AAA CTC AAA AAG CTT AAG 99 Lys Ser Tyr Lys Lys Leu Glu Thr Leu Leu Lys Leu Lys Lys Leu Lys 15 20 25
GTT TTT ATG CCT TTA AGT TTA CAA GAA AAT TTG CTT TTT ATC TTC ATA 147 Val Phe Met Pro Leu Ser Leu Gin Glu Asn Leu Leu Phe He Phe He 30 35 40
AAA GAC TCT AAA TTG CTT TTT GCG TTT AAA GAC ATT TGG GCT TCT AAA 195 Lys Asp Ser Lys Leu Leu Phe Ala Phe Lys Asp He Trp Ala Ser Lys 45 50 55 60
GAA TTT AAC CAA CGA TTC GCT AAA GAA ATC AGC CAT TTT TTA AAC ACG 243 Glu Phe Asn Gin Arg Phe Ala Lys Glu He Ser His Phe Leu Asn Thr 65 70 75
CAA GGG CAT GCT TAT GGG TTT GAC GGG TTG AAT GGG TTA GAA ATT TTA 291 Gin Gly His Ala Tyr Gly Phe Asp Gly Leu Asn Gly Leu Glu He Leu 80 85 90
GGT TAT GTG CCT AAA GAC GCG CTA AAA AAA TCC AAT TTT TAT GCC CCC 339 Gly Tyr Val Pro Lys Asp Ala Leu Lys Lys Ser Asn Phe Tyr Ala Pro 95 100 105
ATT AAA AAA CAA GCC CGT TTT TTT CGC CCT AGT GCT TTA GGG TTG TTC 387 He Lys Lys Gin Ala Arg Phe Phe Arg Pro Ser Ala Leu Gly Leu Phe 110 115 120
CAT AAC CCC ATT AAA GAC GCT CGT TTG CAT GAA TGT TTT GAA AAA GCG 435 His Asn Pro He Lys Asp Ala Arg Leu His Glu Cys Phe Glu Lys Ala 125 130 135 140
CGC GCT TTG ATC CAC TAC CAA CGA AGT TTT TTT GAG GAA TGAATGGCTG AT 486 Arg Ala Leu He His Tyr Gin Arg Ser Phe Phe Glu Glu 145 150
TTATTGTCCA GTTTAAAAAA CCTTCCTAAC AGCAGTGGCG TGTATCAATA TTTTGATAAA 546 AAC 549
(2) INFORMATION FOR SEQ ID NO: 864:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 153 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 864: Met Glu Gin Asn He Phe Ser Leu Leu He Gin Lys Lys Ser Tyr Lys 1 5 10 15
Lys Leu Glu Thr Leu Leu Lys Leu Lys Lys Leu Lys Val Phe Met Pro
20 25 30
Leu Ser Leu Gin Glu Asn Leu Leu Phe He Phe He Lys Asp Ser Lys
35 40 45
Leu Leu Phe Ala Phe Lys Asp He Trp Ala Ser Lys Glu Phe Asn Gin
50 55 60
Arg Phe Ala Lys Glu He Ser His Phe Leu Asn Thr Gin Gly His Ala 65 70 75 80
Tyr Gly Phe Asp Gly Leu Asn Gly Leu Glu He Leu Gly Tyr Val Pro
85 90 95
Lys Asp Ala Leu Lys Lys Ser Asn Phe Tyr Ala Pro He Lys Lys Gin
100 105 110
Ala Arg Phe Phe Arg Pro Ser Ala Leu Gly Leu Phe His Asn Pro He
115 120 125
Lys Asp Ala Arg Leu His Glu Cys Phe Glu Lys Ala Arg Ala Leu He
130 135 140
His Tyr Gin Arg Ser Phe Phe Glu Glu 145 150
(2) INFORMATION FOR SEQ ID NO: 865:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 352 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...318 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 865:
AAG TGT TAC TTT TTT ATA ACT TTT TCA TAT TCT TAT GGG TAT GTT GTC 48 Lys Cys Tyr Phe Phe He Thr Phe Ser Tyr Ser Tyr Gly Tyr Val Val 1 5 10 15
ATT TTT TTA CCG GAG AAT TTT ATC TTG AGA AAC ATT TAT GTA GGG AAT 96 He Phe Leu Pro Glu Asn Phe He Leu Arg Asn He Tyr Val Gly Asn 20 25 30
TTG GTT TAT AGC GCT ACC AGT GAG CAA GTC AAG GAG CTT TTC AGT CAA 144 Leu Val Tyr Ser Ala Thr Ser Glu Gin Val Lys Glu Leu Phe Ser Gin 35 40 45
TTT GGC AAA GTT TTT AAT GTC AAG CTG ATT TAT GAC AGA GAA ACG AAG 192 Phe Gly Lys Val Phe Asn Val Lys Leu He Tyr Asp Arg Glu Thr Lys 50 55 60 AAA CCT AAA GGT TTT GGC TTT GTA GAA ATG CAA GAA GAG AGC GTT AGT 240 Lys Pro Lys Gly Phe Gly Phe Val Glu Met Gin Glu Glu Ser Val Ser 65 70 75 80
GAA GCG ATC GCT AAA TTA GAC AAT ACG GAT TTT ATG GGC AGA ACG ATT 288 Glu Ala He Ala Lys Leu Asp Asn Thr Asp Phe Met Gly Arg Thr He 85 90 95
AGG GTA ACC GAA GCT AAT CCT AAA AAG TCT TAGTAACATT AGAAAATAAT TTT 341 Arg Val Thr Glu Ala Asn Pro Lys Lys Ser 100 105
CTAATGCGCT T 352
(2) INFORMATION FOR SEQ ID NO: 866:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 106 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 866:
Lys Cys Tyr Phe Phe He Thr Phe Ser Tyr Ser Tyr Gly Tyr Val Val
1 5 10 15
He Phe Leu Pro Glu Asn Phe He Leu Arg Asn He Tyr Val Gly Asn
20 25 30
Leu Val Tyr Ser Ala Thr Ser Glu Gin Val Lys Glu Leu Phe Ser Gin
35 40 45
Phe Gly Lys Val Phe Asn Val Lys Leu He Tyr Asp Arg Glu Thr Lys
50 55 60
Lys Pro Lys Gly Phe Gly Phe Val Glu Met Gin Glu Glu Ser Val Ser 65 70 75 80
Glu Ala He Ala Lys Leu Asp Asn Thr Asp Phe Met Gly Arg Thr He
85 90 95
Arg Val Thr Glu Ala Asn Pro Lys Lys Ser 100 105
(2) INFORMATION FOR SEQ ID NO: 867:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1558 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...1473 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 867:
TTAGATTTAA AATTAGATTA AGGATAGAAA ATG AGA ATT TTA CAA AGG GCT TTG 54
Met Arg He Leu Gin Arg Ala Leu 1 5
ACT TTT GAA GAT GTG TTG ATG GTG CCT AGA AAG TCT AGC GTT TTA CCT 102 Thr Phe Glu Asp Val Leu Met Val Pro Arg Lys Ser Ser Val Leu Pro 10 15 20
AAA GAT GTG AGC TTA AAG TCT CGC TTA ACT AAA AAC ATT CGT TTG AAT 150 Lys Asp Val Ser Leu Lys Ser Arg Leu Thr Lys Asn He Arg Leu Asn 25 30 35 40
ATC CCC TTT ATC AGT GCG GCT ATG GAT ACG GTT ACA GAG CAT AAA ACC 198 He Pro Phe He Ser Ala Ala Met Asp Thr Val Thr Glu His Lys Thr 45 50 55
GCT ATC GCT ATG GCG CGC CTT GGG GGT ATT GGC ATC GTG CAT AAA AAC 246 Ala He Ala Met Ala Arg Leu Gly Gly He Gly He Val His Lys Asn 60 65 70
ATG GAT ATT CAA ACG CAA GTT AAA GAA ATC ACT AAG GTT AAA AAA AGC 294 Met Asp He Gin Thr Gin Val Lys Glu He Thr Lys Val Lys Lys Ser 75 80 85
GAG AGC GGG GTG ATT AAT GAT CCT ATT TTT ATC CAT GCG CAC AGG ACG 342 Glu Ser Gly Val He Asn Asp Pro He Phe He His Ala His Arg Thr 90 95 100
CTA GCG GAC GCT AAA GTC ATA ACG GAT AAT TAC AAG ATT TCA GGC GTG 390 Leu Ala Asp Ala Lys Val He Thr Asp Asn Tyr Lys He Ser Gly Val 105 110 115 120
CCT GTG GTA GAT GAT AAG GGG TTG TTG ATT GGG ATT TTA ACC AAC AGA 438 Pro Val Val Asp Asp Lys Gly Leu Leu He Gly He Leu Thr Asn Arg 125 130 135
GAT GTG CGC TTT GAA ACC GAT TTG AGT AAA AAA GTG GGC GAT GTG ATG 486 Asp Val Arg Phe Glu Thr Asp Leu Ser Lys Lys Val Gly Asp Val Met 140 145 150
ACT AAA ATG CCT TTA GTT ACC GCT CAT GTG GGT ATC AGT TTG GAT GAA 534 Thr Lys Met Pro Leu Val Thr Ala His Val Gly He Ser Leu Asp Glu 155 160 165
GCG AGC GAT TTG ATG CAC AAG CAT AAG ATT GAA AAA TTG CCC ATT GTG 582 Ala Ser Asp Leu Met His Lys His Lys He Glu Lys Leu Pro He Val 170 175 180
GAT AAA GAT AAT GTC TTA AAA GGC TTG ATC ACG ATC AAA GAT ATT CAA 630 Asp Lys Asp Asn Val Leu Lys Gly Leu He Thr He Lys Asp He Gin 185 190 195 200
AAA CGC ATT GAA TAC CCT GAG GCC AAT AAA GAT GAT TTT GGG AGG TTG 678 Lys Arg He Glu Tyr Pro Glu Ala Asn Lys Asp Asp Phe Gly Arg Leu 205 210 215
AGA GTG GGG GCG GCT ATT GGA GTG GGG CAG TTG GAT AGG GCT GAG ATG 726 Arg Val Gly Ala Ala He Gly Val Gly Gin Leu Asp Arg Ala Glu Met 220 225 230
TTA GTT AAA GCG GGG GTG GAT GCA CTG GTG CTA GAC AGC GCA CAT GGG 774 Leu Val Lys Ala Gly Val Asp Ala Leu Val Leu Asp Ser Ala His Gly 235 240 245
CAT TCA GCC AAT ATC TTA CAC ACT TTA GAA GAG ATT AAA AAA AGC TTG 822 His Ser Ala Asn He Leu His Thr Leu Glu Glu He Lys Lys Ser Leu 250 255 260
GTA GTG GAT GTG ATT GTG GGG AAT GTG GTT ACT AAA GAA GCC ACA AGC 870 Val Val Asp Val He Val Gly Asn Val Val Thr Lys Glu Ala Thr Ser 265 270 275 280
GAT TTG ATT AGC GCG GGA GCA GAC GCT ATT AAA GTG GGT ATT GGG CCA 918 Asp Leu He Ser Ala Gly Ala Asp Ala He Lys Val Gly He Gly Pro 285 290 295
GGA AGC ATT TGC ACC ACT AGG ATT GTG GCT GGG GTG GGA ATG CCC CAA 966 Gly Ser He Cys Thr Thr Arg He Val Ala Gly Val Gly Met Pro Gin 300 305 310
GTG AGC GCG ATT GAT AAT TGC GTA GAA GTG GCG TCT AAA TTT GAT ATT 1014 Val Ser Ala He Asp Asn Cys Val Glu Val Ala Ser Lys Phe Asp He 315 320 325
CCT GTG ATT GCA GAT GGA GGG ATC CGC TAT TCA GGC GAT GTG GCT AAG 1062 Pro Val He Ala Asp Gly Gly He Arg Tyr Ser Gly Asp Val Ala Lys 330 335 340
GCT TTG GCT TTG GGG GCA TCA AGC GTG ATG ATA GGC TCT TTA CTC GCT 1110 Ala Leu Ala Leu Gly Ala Ser Ser Val Met He Gly Ser Leu Leu Ala 345 350 355 360
GGC ACA GAA GAA TCT CCT GGG GAT TTT ATG ATC TAT CAA GGG AGG CAA 1158 Gly Thr Glu Glu Ser Pro Gly Asp Phe Met He Tyr Gin Gly Arg Gin 365 370 375
TAT AAA AGC TAT AGG GGC ATG GGC AGC ATT GGG GCT ATG ACT AAA GGG 1206 Tyr Lys Ser Tyr Arg Gly Met Gly Ser He Gly Ala Met Thr Lys Gly 380 385 390
AGC TCT GAT AGG TAT TTT CAA GAG GGC GTA GCG AGT GAA AAG TTA GTC 1254 Ser Ser Asp Arg Tyr Phe Gin Glu Gly Val Ala Ser Glu Lys Leu Val 395 400 405
CCA GAA GGC ATT GAA GGG CGT GTG CCT TAT CGT GGT AAG GTT TCG GAT 1302 Pro Glu Gly He Glu Gly Arg Val Pro Tyr Arg Gly Lys Val Ser Asp 410 415 420
ATG ATT TTC CAA TTA GTA GGG GGC GTG CGC TCT TCT ATG GGG TAT CAG 1350 Met He Phe Gin Leu Val Gly Gly Val Arg Ser Ser Met Gly Tyr Gin 425 430 435 440
GGG GCG AAA AAT ATT TTG GAA TTG TAT CAA AAC GCT GAA TTT GTA GAA 1398 Gly Ala Lys Asn He Leu Glu Leu Tyr Gin Asn Ala Glu Phe Val Glu 445 450 455
ATC ACT AGC GCG GGG TTA AAA GAA AGC CAT GTG CAT GGC GTG GAT ATT 1446 He Thr Ser Ala Gly Leu Lys Glu Ser His Val His Gly Val Asp He 460 465 470
ACT AAA GAA GCC CCT AAT TAT TAT GGG TGAATTGTAA AAGAAAACAA GACAAAT 1500 Thr Lys Glu Ala Pro Asn Tyr Tyr Gly 475 480
CGTTAAAAAA CTCGTTAAAA AGCTTGGTTT AATGAGTTTT TAAAACTTAA TTGCTACA 155 £
(2) INFORMATION FOR SEQ ID NO: 868:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 481 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 868:
Met Arg He Leu Gin Arg Ala Leu Thr Phe Glu Asp Val Leu Met Val
1 5 10 15
Pro Arg Lys Ser Ser Val Leu Pro Lys Asp Val Ser Leu Lys Ser Arg
20 25 30
Leu Thr Lys Asn He Arg Leu Asn He Pro Phe He Ser Ala Ala Met
35 40 45
Asp Thr Val Thr Glu His Lys Thr Ala He Ala Met Ala Arg Leu Gly
50 55 60
Gly He Gly He Val His Lys Asn Met Asp He Gin Thr Gin Val Lys 65 70 75 80
Glu He Thr Lys Val Lys Lys Ser Glu Ser Gly Val He Asn Asp Pro
85 90 95
He Phe He His Ala His Arg Thr Leu Ala Asp Ala Lys Val He Thr
100 105 110
Asp Asn Tyr Lys He Ser Gly Val Pro Val Val Asp Asp Lys Gly Leu
115 120 125
Leu He Gly He Leu Thr Asn Arg Asp Val Arg Phe Glu Thr Asp Leu
130 135 140
Ser Lys Lys Val Gly Asp Val Met Thr Lys Met Pro Leu Val Thr Ala 145 150 155 160
His Val Gly He Ser Leu Asp Glu Ala Ser Asp Leu Met His Lys His 165 170 175 Lys He Glu Lys Leu Pro He Val Asp Lys Asp Asn Val Leu Lys Gly
180 185 190
Leu He Thr He Lys Asp He Gin Lys Arg He Glu Tyr Pro Glu Ala
195 200 205
Asn Lys Asp Asp Phe Gly Arg Leu Arg Val Gly Ala Ala He Gly Val
210 215 220
Gly Gin Leu Asp Arg Ala Glu Met Leu Val Lys Ala Gly Val Asp Ala 225 230 235 240
Leu Val Leu Asp Ser Ala His Gly His Ser Ala Asn He Leu His Thr
245 250 255
Leu Glu Glu He Lys Lys Ser Leu Val Val Asp Val He Val Gly Asn
260 265 270
Val Val Thr Lys Glu Ala Thr Ser Asp Leu He Ser Ala Gly Ala Asp
275 280 285
Ala He Lys Val Gly He Gly Pro Gly Ser He Cys Thr Thr Arg He
290 295 300
Val Ala Gly Val Gly Met Pro Gin Val Ser Ala He Asp Asn Cys Val 305 310 315 320
Glu Val Ala Ser Lys Phe Asp He Pro Val He Ala Asp Gly Gly He
325 330 335
Arg Tyr Ser Gly Asp Val Ala Lys Ala Leu Ala Leu Gly Ala Ser Ser
340 345 350
Val Met He Gly Ser Leu Leu Ala Gly Thr Glu Glu Ser Pro Gly Asp
355 360 365
Phe Met He Tyr Gin Gly Arg Gin Tyr Lys Ser Tyr Arg Gly Met Gly
370 375 380
Ser He Gly Ala Met Thr Lys Gly Ser Ser Asp Arg Tyr Phe Gin Glu 385 390 395 400
Gly Val Ala Ser Glu Lys Leu Val Pro Glu Gly He Glu Gly Arg Val
405 410 415
Pro Tyr Arg Gly Lys Val Ser Asp Met He Phe Gin Leu Val Gly Gly
420 425 430
Val Arg Ser Ser Met Gly Tyr Gin Gly Ala Lys Asn He Leu Glu Leu
435 440 445
Tyr Gin Asn Ala Glu Phe Val Glu He Thr Ser Ala Gly Leu Lys Glu
450 455 460
Ser His Val His Gly Val Asp He Thr Lys Glu Ala Pro Asn Tyr Tyr 465 470 475 480
Gly
(2) INFORMATION FOR SEQ ID NO: 869:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 919 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 58...876 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 869:
TAATGAAAAA TAGTTCATGA ACGCTTTTGC ATTAAGGCTC AAAAAAAGCG CCGTTTA ATG 60
Met
1
GAT TTT TGT AAA ATA AAA GAA ATT TTA AGG AGG CTT GTG GTG TTG AAA 108 Asp Phe Cys Lys He Lys Glu He Leu Arg Arg Leu Val Val Leu Lys 5 10 15
GAA TTA CGC CAA AAA CGC CCT TTA GTG CAT AAT ATC ACC AAT TAT GTG 156 Glu Leu Arg Gin Lys Arg Pro Leu Val His Asn He Thr Asn Tyr Val 20 25 30
GCG GCG CAA TTT GTG GCT AAT GGT TTG TTA GCT TTA GGG GCA TCG CCT 204 Ala Ala Gin Phe Val Ala Asn Gly Leu Leu Ala Leu Gly Ala Ser Pro 35 40 45
TTA ATG AGC GAT GCG ATT GAT GAA ATG CGA GAT TTA GCG AAA ATT TCT 252 Leu Met Ser Asp Ala He Asp Glu Met Arg Asp Leu Ala Lys He Ser 50 55 60 65
GAC GCG CTC GCT ATC AAT ATT GGC ACC CTT AAT GAT CGC GCT ATT TTA 300 Asp Ala Leu Ala He Asn He Gly Thr Leu Asn Asp Arg Ala He Leu 70 75 80
TGC GCT AAA GAG GCT ATC AAG CAT TAC AAG GCT TTG AAC AAA CCC ATT 348 Cys Ala Lys Glu Ala He Lys His Tyr Lys Ala Leu Asn Lys Pro He 85 90 95
GTG TTA GAT CCT GTG GGG TGT TCA GCG AGC GCT TTG CGT CAT GAC ACC 396 Val Leu Asp Pro Val Gly Cys Ser Ala Ser Ala Leu Arg His Asp Thr 100 105 110
AGT TTA GAG CTT TTG AAA AGT GGT GGG ATT AGC GCG CTT AGG GGT AAT 444 Ser Leu Glu Leu Leu Lys Ser Gly Gly He Ser Ala Leu Arg Gly Asn 115 120 125
GCT GCA GAA TTA GGC TCT TTA GTG GGG ATT TCT TGC GAA AGT AAG GGG 492 Ala Ala Glu Leu Gly Ser Leu Val Gly He Ser Cys Glu Ser Lys Gly 130 135 140 145
CTA GAC TCT AAT GAT GCC GCC ACG CCT GTA GAA ATA ATC AAA TTA GCG 540 Leu Asp Ser Asn Asp Ala Ala Thr Pro Val Glu He He Lys Leu Ala 150 155 160
GCT CAA AAA TAT TCT GTG ATA GCG GTA ATG ACG GGT AAA ACA GAT TAC 588 Ala Gin Lys Tyr Ser Val He Ala Val Met Thr Gly Lys Thr Asp Tyr 165 170 175
GTG AGC GAT GGG AAA AAG GTT TTG AGT ATT ACT GGG GGG AGC GAG TAT 636 Val Ser Asp Gly Lys Lys Val Leu Ser He Thr Gly Gly Ser Glu Tyr 180 185 190
TTA GCG CTC ATT ACT GGG GCT GGG TGT TTG CAT GCC GCA GCA TGC GCG 684 Leu Ala Leu He Thr Gly Ala Gly Cys Leu His Ala Ala Ala Cys Ala 195 200 205
AGC TTT TTA AGT TTG AAA AAA GAC CCC TTA GAT TCT ATG GCG CAA CTT 732 Ser Phe Leu Ser Leu Lys Lys Asp Pro Leu Asp Ser Met Ala Gin Leu 210 215 220 225
TGC GCG CTC TAT AAA CAA GCC GCT TTT AAC GCG CAA AAA AAG GTG TTG 780 Cys Ala Leu Tyr Lys Gin Ala Ala Phe Asn Ala Gin Lys Lys Val Leu 230 235 240
GAA AAT AAC GGC TCT AAT GGT TCG TTC TTG TTT TAT TTT TTA GAT GCT 828 Glu Asn Asn Gly Ser Asn Gly Ser Phe Leu Phe Tyr Phe Leu Asp Ala 245 250 255
CTA AGC TTG CCC ATA GAG TTA GAA AAC AGC CTT ATT AAG GAA GAG TGG T 877 Leu Ser Leu Pro He Glu Leu Glu Asn Ser Leu He Lys Glu Glu Trp 260 265 270
GAAAATTTAC CCGCAAGTTT TAAGCATTGC TGGCAGCGAT AG 919
(2) INFORMATION FOR SEQ ID NO: 870:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 273 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 870:
Met Asp Phe Cys Lys He Lys Glu He Leu Arg Arg Leu Val Val Leu
1 5 10 15
Lys Glu Leu Arg Gin Lys Arg Pro Leu Val His Asn He Thr Asn Tyr
20 25 30
Val Ala Ala Gin Phe Val Ala Asn Gly Leu Leu Ala Leu Gly Ala Ser
35 40 45
Pro Leu Met Ser Asp Ala He Asp Glu Met Arg Asp Leu Ala Lys He
50 55 60
Ser Asp Ala Leu Ala He Asn He Gly Thr Leu Asn Asp Arg Ala He 65 70 75 80
Leu Cys Ala Lys Glu Ala He Lys His Tyr Lys Ala Leu Asn Lys Pro
85 90 95
He Val Leu Asp Pro Val Gly Cys Ser Ala Ser Ala Leu Arg His Asp
100 105 110
Thr Ser Leu Glu Leu Leu Lys Ser Gly Gly He Ser Ala Leu Arg Gly
115 120 125
Asn Ala Ala Glu Leu Gly Ser Leu Val Gly He Ser Cys Glu Ser Lys
130 135 140
Gly Leu Asp Ser Asn Asp Ala Ala Thr Pro Val Glu He He Lys Leu 145 150 155 160
Ala Ala Gin Lys Tyr Ser Val He Ala Val Met Thr Gly Lys Thr Asp
165 170 175
Tyr Val Ser Asp Gly Lys Lys Val Leu Ser He Thr Gly Gly Ser Glu
180 185 190
Tyr Leu Ala Leu He Thr Gly Ala Gly Cys Leu His Ala Ala Ala Cys
195 200 205
Ala Ser Phe Leu Ser Leu Lys Lys Asp Pro Leu Asp Ser Met Ala Gin
210 215 220
Leu Cys Ala Leu Tyr Lys Gin Ala Ala Phe Asn Ala Gin Lys Lys Val 225 230 235 240
Leu Glu Asn Asn Gly Ser Asn Gly Ser Phe Leu Phe Tyr Phe Leu Asp
245 250 255
Ala Leu Ser Leu Pro He Glu Leu Glu Asn Ser Leu He Lys Glu Glu
260 265 270
Trp
(2) INFORMATION FOR SEQ ID NO: 871:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1010 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 78...971 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 871:
ATCCAAATAA TTGGGCGATT AAAGAGGGAA TTTATTCAAT CAAACCAAAT AAAAAAATAG 60
TATTTCCAAG ATTTTTA ATG TTT TGC TTT GAA AAT TTG AAT ATT CAA AAT 110
Met Phe Cys Phe Glu Asn Leu Asn He Gin Asn 1 5 10
GMT ATA AAA AGT AAA AGT TTT GGA GGA ATA GTT AAA AGT ATA TCA ATG 158 Xaa He Lys Ser Lys Ser Phe Gly Gly He Val Lys Ser He Ser Met 15 20 25
AAC GAT TTA CAA CAA ATA ACC ATC CCC ATC CCA CCC CTA GAG ATC CAA 206 Asn Asp Leu Gin Gin He Thr He Pro He Pro Pro Leu Glu He Gin 30 35 40
CAA GAG ATC GTT AAG ATT TTG GAC GCT TTC ACA GAA TTA AAC ACA GAA 254 Gin Glu He Val Lys He Leu Asp Ala Phe Thr Glu Leu Asn Thr Glu 45 50 55
TTA AAC ACA GAA TTA AAA GCG CGC AAA AAG CAA TAT GAG TAT TAC CAA 302 Leu Asn Thr Glu Leu Lys Ala Arg Lys Lys Gin Tyr Glu Tyr Tyr Gin 60 65 70 75
AAC ATG CTT TTA GAC TTT AAC GAT ATT AAT CAA AAC CAC AAA GAC GCC 350 Asn Met Leu Leu Asp Phe Asn Asp He Asn Gin Asn His Lys Asp Ala 80 85 90
AAA ATA AAA ACC TAC CCT AAA CGC TTG AAA ACC TTA CTC CAC ACT TTA 398 Lys He Lys Thr Tyr Pro Lys Arg Leu Lys Thr Leu Leu His Thr Leu 95 100 105
GCG CCT AAG GGG GTG GAG TTT AGG AAA TTG GGG GAG GTG TGT GAA AGC 446 Ala Pro Lys Gly Val Glu Phe Arg Lys Leu Gly Glu Val Cys Glu Ser 110 115 120
ACA AAT AAA AAA ACA CTC AAA ATA AGC GAA GTA AGT GAA GTA AAA AAT 494 Thr Asn Lys Lys Thr Leu Lys He Ser Glu Val Ser Glu Val Lys Asn 125 130 135
AAG GGA ATG TAT CCA GTG ATA AAT TCA GGG AGG GAT TTG TAT GGT TAT 542 Lys Gly Met Tyr Pro Val He Asn Ser Gly Arg Asp Leu Tyr Gly Tyr 140 145 150 155
TAC CAT GAT TTT AAC AAT GAT GGA GAA AAT ATA ACT ATT GCA TCT AGG 590 Tyr His Asp Phe Asn Asn Asp Gly Glu Asn He Thr He Ala Ser Arg 160 165 170
GGA GAA TAT GCA GGA TTT ATA AAC TAT TTC AAT GAA AAA TTT TTT GCA 638 Gly Glu Tyr Ala Gly Phe He Asn Tyr Phe Asn Glu Lys Phe Phe Ala 175 180 185
GGG GGT CTA TGT TAT CCC TAT AAA GTT AAA GAC ACT AAC GAG CTT TTA 686 Gly Gly Leu Cys Tyr Pro Tyr Lys Val Lys Asp Thr Asn Glu Leu Leu 190 195 200
ACA AAA TTT TTA TAC TTT TAT CTC AAA ACT AAT GAA ATC CAA ATT ATG 734 Thr Lys Phe Leu Tyr Phe Tyr Leu Lys Thr Asn Glu He Gin He Met 205 210 215
GAG AAC CTT GTT TTT CGT GGC AGT ATC CCC GCA CTC AAT AAA GCA GAT 782 Glu Asn Leu Val Phe Arg Gly Ser He Pro Ala Leu Asn Lys Ala Asp 220 225 230 235
ATT GAA ACT TTA ACA ATC CCC ATC CCA CCT CTA GAG ATC CAA CAA GAG 830 He Glu Thr Leu Thr He Pro He Pro Pro Leu Glu He Gin Gin Glu 240 245 250
ATC GTT AAG ATT TTG GAT CAA TTT TCA GCC CTA ACC ACC GAT TTA TTA 878 He Val Lys He Leu Asp Gin Phe Ser Ala Leu Thr Thr Asp Leu Leu 255 260 265
GCC GGT ATC CCC GCT GAA ATA AAA GCC CGA AAA AAG CAA TAC GAA TAT 926 Ala Gly He Pro Ala Glu He Lys Ala Arg Lys Lys Gin Tyr Glu Tyr 270 275 280 TAC CGA GAA AAA CTA CTG ACC TTC AAA CCT CTC CAA AAC AAG GAA TAACA 976 Tyr Arg Glu Lys Leu Leu Thr Phe Lys Pro Leu Gin Asn Lys Glu 285 290 295
TGAGTTACGA AACGATCGCA GAAAGCAATG AAAG 1010
(2) INFORMATION FOR SEQ ID NO:872:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 298 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 872:
Met Phe Cys Phe Glu Asn Leu Asn He Gin Asn Xaa He Lys Ser Lys
1 5 10 15
Ser Phe Gly Gly He Val Lys Ser He Ser Met Asn Asp Leu Gin Gin
20 25 30
He Thr He Pro He Pro Pro Leu Glu He Gin Gin Glu He Val Lys
35 40 45
He Leu Asp Ala Phe Thr Glu Leu Asn Thr Glu Leu Asn Thr Glu Leu
50 55 60
Lys Ala Arg Lys Lys Gin Tyr Glu Tyr Tyr Gin Asn Met Leu Leu Asp 65 70 75 80
Phe Asn Asp He Asn Gin Asn His Lys Asp Ala Lys He Lys Thr Tyr
85 90 95
Pro Lys Arg Leu Lys Thr Leu Leu His Thr Leu Ala Pro Lys Gly Val
100 105 110
Glu Phe Arg Lys Leu Gly Glu Val Cys Glu Ser Thr Asn Lys Lys Thr
115 120 125
Leu Lys He Ser Glu Val Ser Glu Val Lys Asn Lys Gly Met Tyr Pro
130 135 140
Val He Asn Ser Gly Arg Asp Leu Tyr Gly Tyr Tyr His Asp Phe Asn 145 150 155 160
Asn Asp Gly Glu Asn He Thr He Ala Ser Arg Gly Glu Tyr Ala Gly
165 170 175
Phe He Asn Tyr Phe Asn Glu Lys Phe Phe Ala Gly Gly Leu Cys Tyr
180 185 190
Pro Tyr Lys Val Lys Asp Thr Asn Glu Leu Leu Thr Lys Phe Leu Tyr
195 200 205
Phe Tyr Leu Lys Thr Asn Glu He Gin He Met Glu Asn Leu Val Phe
210 215 220
Arg Gly Ser He Pro Ala Leu Asn Lys Ala Asp He Glu Thr Leu Thr 225 230 235 240
He Pro He Pro Pro Leu Glu He Gin Gin Glu He Val Lys He Leu
245 250 255
Asp Gin Phe Ser Ala Leu Thr Thr Asp Leu Leu Ala Gly He Pro Ala
260 265 270
Glu He Lys Ala Arg Lys Lys Gin Tyr Glu Tyr Tyr Arg Glu Lys Leu
275 280 285
Leu Thr Phe Lys Pro Leu Gin Asn Lys Glu 290 295
(2) INFORMATION FOR SEQ ID NO: 873:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1305 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 42...1253 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 873:
GATTAGGGGA GTTAGAAACC ATTTGCGTGG AAGAAGATCC C ATG TAT GAA TGC GAA 56
Met Tyr Glu Cys Glu 1 5
GTG GCG ATT GAA AAA ATC CTA GAA GAT TTA GGC ATT CCT AGC TCT AAA 104 Val Ala He Glu Lys He Leu Glu Asp Leu Gly He Pro Ser Ser Lys 10 15 20
CAC AAC GAT TTG ATG AAA ACC CTG CCA AGC AGC GAT AAA TTT AAA ATC 152 His Asn Asp Leu Met Lys Thr Leu Pro Ser Ser Asp Lys Phe Lys He 25 30 35
CTT CTC GCT CAA GTC TTG TTC CCT AAA CCG GAT ATT TTG CTT TTA GAT 200 Leu Leu Ala Gin Val Leu Phe Pro Lys Pro Asp He Leu Leu Leu Asp 40 45 50
GAG CCG ACC AAC AAC CTG GAT TTA AAC GCC ATT GAA TGG CTA GAA AAC 248 Glu Pro Thr Asn Asn Leu Asp Leu Asn Ala He Glu Trp Leu Glu Asn 55 60 65
AAC CTC AAA CGC CAT GAA GGC ACG ATG GTC GTC ATT AGC CAT GAC AGG 296 Asn Leu Lys Arg His Glu Gly Thr Met Val Val He Ser His Asp Arg 70 75 80 85
CAT TTT TTA AAT GCG GTA TGC ACG CAT ATT TTG GAT TTG GAT TTC CAC 344 His Phe Leu Asn Ala Val Cys Thr His He Leu Asp Leu Asp Phe His 90 95 100
AGC GTG CGC GAA TTT AGC GGG AAT TAT GAC GAT TGG TAT ATC GCT TCC 392 Ser Val Arg Glu Phe Ser Gly Asn Tyr Asp Asp Trp Tyr He Ala Ser 105 110 115
ACT CTG ATC GCT AAA CAG CAA GAG GCC GAA CGC AAT AAA AAA CTC AAA 440 Thr Leu He Ala Lys Gin Gin Glu Ala Glu Arg Asn Lys Lys Leu Lys 120 125 130
GAA AAA GAA GAG CTA GAA AAA TTC ATC GCG CGC TTT ARN NNN NAC GCT 488 Glu Lys Glu Glu Leu Glu Lys Phe He Ala Arg Phe Xaa Xaa Xaa Ala 135 140 145
TCT AAA GCC AAG CAA GCC ACC AGC CGC CAA AAA CAA CTG GAT AAA TTA 536 Ser Lys Ala Lys Gin Ala Thr Ser Arg Gin Lys Gin Leu Asp Lys Leu 150 155 160 165
GAC ATT CAA AGT TTA GCG GTA TCT AGC AGG AGG GAT CCT AGC ATT ATT 584 Asp He Gin Ser Leu Ala Val Ser Ser Arg Arg Asp Pro Ser He He 170 175 180
TTT AAA CCC AAA CGC ACC ATT GGT AAT GAA GCC TTA GAG TGC GAA AAC 632 Phe Lys Pro Lys Arg Thr He Gly Asn Glu Ala Leu Glu Cys Glu Asn 185 190 195
ATC TCT AAA AGT TAT GAC GAC CAA ATC GTT TTA AAT CAA GTG AGC TTG 680 He Ser Lys Ser Tyr Asp Asp Gin He Val Leu Asn Gin Val Ser Leu 200 205 210
AAA GTG ATG CCT AAA GAC AAG ATC GCC CTC ATA GGG CCA AAC GGC GTG 728 Lys Val Met Pro Lys Asp Lys He Ala Leu He Gly Pro Asn Gly Val 215 220 225
GGT AAA TCC ACG CTT TGT AAA ATT CTA GTA GAA GAA TTA AAG CCG GAT 776 Gly Lys Ser Thr Leu Cys Lys He Leu Val Glu Glu Leu Lys Pro Asp 230 235 240 245
AAG GGC GTG GTG AAA TGG GGG GCG ACG GTT TCA AAA GGC TAT TTC CCT 824 Lys Gly Val Val Lys Trp Gly Ala Thr Val Ser Lys Gly Tyr Phe Pro 250 255 260
CAA AAC GTG AGC GAA GAA ATT AGC GGG GAA GAG ACC TTG TAT CAA TGG 872 Gin Asn Val Ser Glu Glu He Ser Gly Glu Glu Thr Leu Tyr Gin Trp 265 270 275
CTC TTT AAC TTC AAT AAA AAG ATT GAA AGC GCT GAG GTT AGG AAC GCT 920 Leu Phe Asn Phe Asn Lys Lys He Glu Ser Ala Glu Val Arg Asn Ala 280 285 290
TTA GGG AGG ATG CTG TTT AAT GGC GAA GAG CAA GAA AAA TGC GTG AAC 968 Leu Gly Arg Met Leu Phe Asn Gly Glu Glu Gin Glu Lys Cys Val Asn 295 300 305
GCT TTA AGT GGG GGC GAA AAA CAC CGA ATG GTT TTA TCC AAG CTC ATG 1016 Ala Leu Ser Gly Gly Glu Lys His Arg Met Val Leu Ser Lys Leu Met 310 315 320 325
CTA GAG GGG GGG AAT TTT TTA GTC TTA GAT GAG CCA ACC AAC CAT TTG 1064 Leu Glu Gly Gly Asn Phe Leu Val Leu Asp Glu Pro Thr Asn His Leu 330 335 340
GAT TTA GAA GCG ATT ATC GCT TTA GGC GAA GCG CTC TTT AAA TTT GAT 1112 Asp Leu Glu Ala He He Ala Leu Gly Glu Ala Leu Phe Lys Phe Asp 345 350 355
GGG GCG CTG ATT TGC GTA AGC CAT GAC AGA GAG CTC ATT GAT GCG TAT 1160 Gly Ala Leu He Cys Val Ser His Asp Arg Glu Leu He Asp Ala Tyr 360 365 370
GCT AAT AGG ATC ATT GAA TTA GTC CCA AGC CCT AAA GGC GCT TCA ATC 1208 Ala Asn Arg He He Glu Leu Val Pro Ser Pro Lys Gly Ala Ser He 375 380 385
ATT GAT TTT AAA GGC AGT TAT GAA GAG TAT TTG GCG AGC AAA AAA TGAAA 1258 He Asp Phe Lys Gly Ser Tyr Glu Glu Tyr Leu Ala Ser Lys Lys 390 395 400
CCGCAAGACA TTGAAATCGT TCAAAGCGTT TTAGAGATTA CAGGACC 1305
(2) INFORMATION FOR SEQ ID NO: 874:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 404 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 874:
Met Tyr Glu Cys Glu Val Ala He Glu Lys He Leu Glu Asp Leu Gly
1 5 10 15
He Pro Ser Ser Lys His Asn Asp Leu Met Lys Thr Leu Pro Ser Ser
20 25 30
Asp Lys Phe Lys He Leu Leu Ala Gin Val Leu Phe Pro Lys Pro Asp
35 40 45
He Leu Leu Leu Asp Glu Pro Thr Asn Asn Leu Asp Leu Asn Ala He
50 55 60
Glu Trp Leu Glu Asn Asn Leu Lys Arg His Glu Gly Thr Met Val Val 65 70 75 80
He Ser His Asp Arg His Phe Leu Asn Ala Val Cys Thr His He Leu
85 90 95
Asp Leu Asp Phe His Ser Val Arg Glu Phe Ser Gly Asn Tyr Asp Asp
100 105 110
Trp Tyr He Ala Ser Thr Leu He Ala Lys Gin Gin Glu Ala Glu Arg
115 120 125
Asn Lys Lys Leu Lys Glu Lys Glu Glu Leu Glu Lys Phe He Ala Arg
130 135 140
Phe Xaa Xaa Xaa Ala Ser Lys Ala Lys Gin Ala Thr Ser Arg Gin Lys 145 150 155 160
Gin Leu Asp Lys Leu Asp He Gin Ser Leu Ala Val Ser Ser Arg Arg
165 170 175
Asp Pro Ser He He Phe Lys Pro Lys Arg Thr He Gly Asn Glu Ala
180 185 190
Leu Glu Cys Glu Asn He Ser Lys Ser Tyr Asp Asp Gin He Val Leu 195 200 205 Asn Gin Val Ser Leu Lys Val Met Pro Lys Asp Lys He Ala Leu He
210 215 220
Gly Pro Asn Gly Val Gly Lys Ser Thr Leu Cys Lys He Leu Val Glu 225 230 235 240
Glu Leu Lys Pro Asp Lys Gly Val Val Lys Trp Gly Ala Thr Val Ser
245 250 255
Lys Gly Tyr Phe Pro Gin Asn Val Ser Glu Glu He Ser Gly Glu Glu
260 265 270
Thr Leu Tyr Gin Trp Leu Phe Asn Phe Asn Lys Lys He Glu Ser Ala
275 280 285
Glu Val Arg Asn Ala Leu Gly Arg Met Leu Phe Asn Gly Glu Glu Gin
290 295 300
Glu Lys Cys Val Asn Ala Leu Ser Gly Gly Glu Lys His Arg Met Val 305 310 315 320
Leu Ser Lys Leu Met Leu Glu Gly Gly Asn Phe Leu Val Leu Asp Glu
325 330 335
Pro Thr Asn His Leu Asp Leu Glu Ala He He Ala Leu Gly Glu Ala
340 345 350
Leu Phe Lys Phe Asp Gly Ala Leu He Cys Val Ser His Asp Arg Glu
355 360 365
Leu He Asp Ala Tyr Ala Asn Arg He He Glu Leu Val Pro Ser Pro
370 375 380
Lys Gly Ala Ser He He Asp Phe Lys Gly Ser Tyr Glu Glu Tyr Leu 385 390 395 400
Ala Ser Lys Lys
(2) INFORMATION FOR SEQ ID NO: 875:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 801 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...756 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 875:
AAAAGCAGGG ATACTAGA ATG CAA ATG ATG CAC AAT TTG AGT TTT TTG GGC 51
Met Gin Met Met His Asn Leu Ser Phe Leu Gly 1 5 10
ATG TTT TTA GCC GCT TTG AGC ATG TCT TTA GGG CAT TGT GTG GGC ATG 99 Met Phe Leu Ala Ala Leu Ser Met Ser Leu Gly His Cys Val Gly Met 15 20 25
TGT GGG GGG ATT GTG AGC GCG TTC AGT CAA ATA AGA TTT TCT AAA GTT 147 Cys Gly Gly He Val Ser Ala Phe Ser Gin He Arg Phe Ser Lys Val 30 35 40
ACA AGC TTT TCT TAC CAG CTC ACT TGC CAT GCC CTT TAT AAT GTA GGG 195 Thr Ser Phe Ser Tyr Gin Leu Thr Cys His Ala Leu Tyr Asn Val Gly 45 50 55
AGG ATC AGC ACT TAC ATG CTT TTA GGG GCT ATA GCG GCA AGT TTG GGG 243 Arg He Ser Thr Tyr Met Leu Leu Gly Ala He Ala Ala Ser Leu Gly 60 65 70 75
CAT AGT CTT AGC GTG AGC ATG GGT TTT AGG GGT GTT TTA TTC ATT AGC 291 His Ser Leu Ser Val Ser Met Gly Phe Arg Gly Val Leu Phe He Ser 80 85 90
ATG GGG ATT ATT TTG ATC TGT TTA GCG TTG CTA GGG GCA AGA ATG GAA 339 Met Gly He He Leu He Cys Leu Ala Leu Leu Gly Ala Arg Met Glu 95 100 105
AAA TTA AGC TTT CAA ATC CCT TTT ATT TCT TTT TTG ATG AAA AAA ACC 387 Lys Leu Ser Phe Gin He Pro Phe He Ser Phe Leu Met Lys Lys Thr 110 115 120
TTG CAA TCT CAA AAC ATT CTA GGG CTG TAT TTC TTA GGC GTG TTG AAC 435 Leu Gin Ser Gin Asn He Leu Gly Leu Tyr Phe Leu Gly Val Leu Asn 125 130 135
GGG TTT TTA CCT TGC ATG ATG GTG TAT TCG TTT TTA GCG AGC GTG ATT 483 Gly Phe Leu Pro Cys Met Met Val Tyr Ser Phe Leu Ala Ser Val He 140 145 150 155
CTC AGT CAT AGC GCG TTT ATG GGA GCG ATG CTA GGC CTT TCT TTT GGG 531 Leu Ser His Ser Ala Phe Met Gly Ala Met Leu Gly Leu Ser Phe Gly 160 165 170
CTT GGC ACC AGC ATG CCG TTG TTT TTA ATG GGG ATT TTT TTA AGC AAA 579 Leu Gly Thr Ser Met Pro Leu Phe Leu Met Gly He Phe Leu Ser Lys 175 180 185
ATT TCC GTT TCT TAC AGG AAA TTT TTC AAT CTT TTG TCT AAA ATT TTA 627 He Ser Val Ser Tyr Arg Lys Phe Phe Asn Leu Leu Ser Lys He Leu 190 195 200
ATG GGG GTT TTT GGG CTT TAT ATC CTT TAT ATG GGG ATC ATG CTC ATT 675 Met Gly Val Phe Gly Leu Tyr He Leu Tyr Met Gly He Met Leu He 205 210 215
AAC CAC AAA ATG CCT CAT GCC ATG CAT CAT CAA AAC AAC ACC ACT CAG 723 Asn His Lys Met Pro His Ala Met His His Gin Asn Asn Thr Thr Gin 220 225 230 235
CAT GAT CAT AAA GGA GTG CAT TCG CAT GAA CAC TAACAAAGCC CTTTTTTTGG 776 His Asp His Lys Gly Val His Ser His Glu His 240 245 ACAGAGACGG CATTATCAAT ATTGA 801
(2) INFORMATION FOR SEQ ID NO: 876:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 246 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 876:
Met Gin Met Met His Asn Leu Ser Phe Leu Gly Met Phe Leu Ala Ala
1 5 10 15
Leu Ser Met Ser Leu Gly His Cys Val Gly Met Cys Gly Gly He Val
20 25 30
Ser Ala Phe Ser Gin He Arg Phe Ser Lys Val Thr Ser Phe Ser Tyr
35 40 45
Gin Leu Thr Cys His Ala Leu Tyr Asn Val Gly Arg He Ser Thr Tyr
50 55 60
Met Leu Leu Gly Ala He Ala Ala Ser Leu Gly His Ser Leu Ser Val 65 70 75 80
Ser Met Gly Phe Arg Gly Val Leu Phe He Ser Met Gly He He Leu
85 90 95
He Cys Leu Ala Leu Leu Gly Ala Arg Met Glu Lys Leu Ser Phe Gin
100 105 110
He Pro Phe He Ser Phe Leu Met Lys Lys Thr Leu Gin Ser Gin Asn
115 120 125
He Leu Gly Leu Tyr Phe Leu Gly Val Leu Asn Gly Phe Leu Pro Cys
130 135 140
Met Met Val Tyr Ser Phe Leu Ala Ser Val He Leu Ser His Ser Ala 145 150 155 160
Phe Met Gly Ala Met Leu Gly Leu Ser Phe Gly Leu Gly Thr Ser Met
165 170 175
Pro Leu Phe Leu Met Gly He Phe Leu Ser Lys He Ser Val Ser Tyr
180 185 190
Arg Lys Phe Phe Asn Leu Leu Ser Lys He Leu Met Gly Val Phe Gly
195 200 205
Leu Tyr He Leu Tyr Met Gly He Met Leu He Asn His Lys Met Pro
210 215 220
His Ala Met His His Gin Asn Asn Thr Thr Gin His Asp His Lys Gly 225 230 235 240
Val His Ser His Glu His 245
(2) INFORMATION FOR SEQ ID NO: 877:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 735 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...693 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 877:
AAC TGC CTN TCN TCG CTC AAC ACG ATT GTA TTA AAC CAT AAT AAA TTG 48 Asn Cys Xaa Xaa Ser Leu Asn Thr He Val Leu Asn His Asn Lys Leu 1 5 10 15
TAT TCT TTA GAA AAA CGA GGG TAT GTG ATA GAG GTG GAT TTA AAT GAT 96 Tyr Ser Leu Glu Lys Arg Gly Tyr Val He Glu Val Asp Leu Asn Asp 20 25 30
TTT GAT TCG TAT AAT GTC TAT AAA ACG CCA ACT ATA GGC AGT TTT AAG 144 Phe Asp Ser Tyr Asn Val Tyr Lys Thr Pro Thr He Gly Ser Phe Lys 35 40 45
TTT TTT TCA TCT AAT CGT TTG GAT AAA GGG GTG TTT TAT GAT AAA AAT 192 Phe Phe Ser Ser Asn Arg Leu Asp Lys Gly Val Phe Tyr Asp Lys Asn 50 55 60
CGG GTG TAT TAC GAT CGC TAC TAT TTA GAT TAT AAC GAT TTT AAA CCA 240 Arg Val Tyr Tyr Asp Arg Tyr Tyr Leu Asp Tyr Asn Asp Phe Lys Pro 65 70 75 80
AAA CTT TAT CCC GTT GTG GAA AAA TCG GCA TCT AAA AAA TCT CAA AAA 288 Lys Leu Tyr Pro Val Val Glu Lys Ser Ala Ser Lys Lys Ser Gin Lys 85 90 95
GGC GAA AAA GGG AAC GCT CCT ATT TAT TTG CAA GAA AGG CAT AAA GCT 336 Gly Glu Lys Gly Asn Ala Pro He Tyr Leu Gin Glu Arg His Lys Ala 100 105 110
AAA GAA AAT AAA CAG CCT TTA GAA GAA AAC AAA GTT AAA CCA AGA AAT 384 Lys Glu Asn Lys Gin Pro Leu Glu Glu Asn Lys Val Lys Pro Arg Asn 115 120 125
AGC GGG TTT GAA GAA GAA GAG GTT AAA ACC AGA AGG CCT GAG CCT ATT 432 Ser Gly Phe Glu Glu Glu Glu Val Lys Thr Arg Arg Pro Glu Pro He 130 135 140
AGG GAT CAA AAT AAC GCC ACC CAA CAA GGC GAA ACA AAA AAC AAT GAA 480 Arg Asp Gin Asn Asn Ala Thr Gin Gin Gly Glu Thr Lys Asn Asn Glu 145 150 155 160
AGT AAA AAC GCT CCT GTC TTA AAA GAA AAC GCC GCT AAA AAA GAA GTG 528 Ser Lys Asn Ala Pro Val Leu Lys Glu Asn Ala Ala Lys Lys Glu Val 165 170 175 CCA AAA CCA AAT TCT AAA GAA GAA AAA CGC CGC TTG AAA GAA GAA AAG 576 Pro Lys Pro Asn Ser Lys Glu Glu Lys Arg Arg Leu Lys Glu Glu Lys 180 185 190
AAA AAA GCC AAA GCC GAA CAA AGA GCG AGA GAA TTT GAA CAA AGA GCG 624 Lys Lys Ala Lys Ala Glu Gin Arg Ala Arg Glu Phe Glu Gin Arg Ala 195 200 205
AGA GAG CAT CAA GAA AGA GAT GAA AAA GAG CTT GAA GAA AGA AGA AAA 672 Arg Glu His Gin Glu Arg Asp Glu Lys Glu Leu Glu Glu Arg Arg Lys 210 215 220
GCT TTA GAA ATG AAT AAG AAG TAGGCCTATG CCAGCTAGGC AATCTTTTAC AGAT 727 Ala Leu Glu Met Asn Lys Lys 225 230
TTGAAAAA 735
(2) INFORMATION FOR SEQ ID NO: 878:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 231 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 878:
Asn Cys Xaa Xaa Ser Leu Asn Thr He Val Leu Asn His Asn Lys Leu
1 5 10 15
Tyr Ser Leu Glu Lys Arg Gly Tyr Val He Glu Val Asp Leu Asn Asp
20 25 30
Phe Asp Ser Tyr Asn Val Tyr Lys Thr Pro Thr He Gly Ser Phe Lys
35 40 45
Phe Phe Ser Ser Asn Arg Leu Asp Lys Gly Val Phe Tyr Asp Lys Asn
50 55 60
Arg Val Tyr Tyr Asp Arg Tyr Tyr Leu Asp Tyr Asn Asp Phe Lys Pro 65 70 75 80
Lys Leu Tyr Pro Val Val Glu Lys Ser Ala Ser Lys Lys Ser Gin Lys
85 90 95
Gly Glu Lys Gly Asn Ala Pro He Tyr Leu Gin Glu Arg His Lys Ala
100 105 110
Lys Glu Asn Lys Gin Pro Leu Glu Glu Asn Lys Val Lys Pro Arg Asn
115 120 125
Ser Gly Phe Glu Glu Glu Glu Val Lys Thr Arg Arg Pro Glu Pro He
130 135 140
Arg Asp Gin Asn Asn Ala Thr Gin Gin Gly Glu Thr Lys Asn Asn Glu 145 150 155 160
Ser Lys Asn Ala Pro Val Leu Lys Glu Asn Ala Ala Lys Lys Glu Val
165 170 175
Pro Lys Pro Asn Ser Lys Glu Glu Lys Arg Arg Leu Lys Glu Glu Lys
180 185 190
Lys Lys Ala Lys Ala Glu Gin Arg Ala Arg Glu Phe Glu Gin Arg Ala 195 200 205
Arg Glu His Gin Glu Arg Asp Glu Lys Glu Leu Glu Glu Arg Arg Lys
210 215 220
Ala Leu Glu Met Asn Lys Lys 225 230
(2) INFORMATION FOR SEQ ID NO: 879:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1047 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...1005 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 879:
AGAAAGAAAC CATTCAAGGA ACGCATTGAT TTG ATG AAT AAA CCA TTT TTA ATC 54
Met Asn Lys Pro Phe Leu He 1 5
TTA CTC ATA GCC CTA ATT GTC TTT AGC GGC TGT AAC ATG AGA AAA TAT 102 Leu Leu He Ala Leu He Val Phe Ser Gly Cys Asn Met Arg Lys Tyr 10 15 20
TTC AAA CCC GCT AAA CAC CAA ATT AAA GGC GAA GCG TAT TTC CCT AAC 150 Phe Lys Pro Ala Lys His Gin He Lys Gly Glu Ala Tyr Phe Pro Asn 25 30 35
CAT TTG CAA GAA AGT ATC GTT TCG TCT AAT CGT TAT GGA GCC ATT TTG 198 His Leu Gin Glu Ser He Val Ser Ser Asn Arg Tyr Gly Ala He Leu 40 45 50 55
AAA AAT GGA GCG GTT ATA GGC GAT AAA GGT TTA ACG CAG CTA AGA ATC 246 Lys Asn Gly Ala Val He Gly Asp Lys Gly Leu Thr Gin Leu Arg He 60 65 70
GGT AAG AAC TTC AAT TAC GAA AGC AGT TTT TTA AAT GAG AGT CAA GGG 294 Gly Lys Asn Phe Asn Tyr Glu Ser Ser Phe Leu Asn Glu Ser Gin Gly 75 80 85
TTT TTT ATT CTT GCG CAA GAT TGT TTG AAC AAG ATT GAT AAA AAA ACA 342 Phe Phe He Leu Ala Gin Asp Cys Leu Asn Lys He Asp Lys Lys Thr 90 95 100
AAC AAA AGC AAG GTG GCT AAG ACT GAA GAA ACG GAA TTG AAA TTA AAG 390 Asn Lys Ser Lys Val Ala Lys Thr Glu Glu Thr Glu Leu Lys Leu Lys 105 110 115
GGC GTT GAA GCG GAA GTC CAA GAT AAA GTC TGT CAT CAA GTG GAA TTG 438 Gly Val Glu Ala Glu Val Gin Asp Lys Val Cys His Gin Val Glu Leu 120 125 130 135
ATT AGC AAT AAC CCT AAC GCC AGC CAA CAA TCT ATC GTT ATT CCT TTG 486 He Ser Asn Asn Pro Asn Ala Ser Gin Gin Ser He Val He Pro Leu 140 145 150
GAG ACT TTT GCC TTG AGC GCA AGC GTT AAA GGG AAT CTT TTA GCG GTG 534 Glu Thr Phe Ala Leu Ser Ala Ser Val Lys Gly Asn Leu Leu Ala Val 155 160 165
GTG TTA GCG GAC AAT TCA GCG AAC TTA TAC GAC ATC ACT TCT CAA AAA 582 Val Leu Ala Asp Asn Ser Ala Asn Leu Tyr Asp He Thr Ser Gin Lys 170 175 180
TTG CTT TTT AGT GAG AAA GGT TCC CCA AGC ACC ACG ATC AAT TCT TTA 630 Leu Leu Phe Ser Glu Lys Gly Ser Pro Ser Thr Thr He Asn Ser Leu 185 190 195
ATG GCG ATG CCT ATT TTT ATG GAT ACG GTC GTG GTG TTC CCC ATG CTA 678 Met Ala Met Pro He Phe Met Asp Thr Val Val Val Phe Pro Met Leu 200 205 210 215
GAT GGG CGC TTG TTG GTC GTG GAT TAT GTG CAC GGA AAC CCT ACG CCT 726 Asp Gly Arg Leu Leu Val Val Asp Tyr Val His Gly Asn Pro Thr Pro 220 225 230
ATT AGA AAC ATT GTT ATC AGC AGC GAT AAG TTT TTT AAC AAT ATC ACC 774 He Arg Asn He Val He Ser Ser Asp Lys Phe Phe Asn Asn He Thr 235 240 245
TAC CTT ATC GTA GAT GGC AAT AAC ATG ATC GCT TCT ACA GGG AAA AGG 822 Tyr Leu He Val Asp Gly Asn Asn Met He Ala Ser Thr Gly Lys Arg 250 255 260
ATA CTC TCA GTA GTG AGC GGT CAA GAG TTC AAC TAT GAT GGG GAT ATT 870 He Leu Ser Val Val Ser Gly Gin Glu Phe Asn Tyr Asp Gly Asp He 265 270 275
GTG GAT TTG CTT TAT GAT AAG GGG ACT TTA TAT GTG CTC ACG CTA GAC 918 Val Asp Leu Leu Tyr Asp Lys Gly Thr Leu Tyr Val Leu Thr Leu Asp 280 285 290 295
GGG CAG ATT TTG CAA ATG GAT AAG AGT TTG AGG GAA TTA AAC AGC GTG 966 Gly Gin He Leu Gin Met Asp Lys Ser Leu Arg Glu Leu Asn Ser Val 300 305 310
AAA CTG CCT NTC NTC GCT CAA CAC GAT TGT ATT AAA CCA TAATAAATTG TA 1017 Lys Leu Pro Xaa Xaa Ala Gin His Asp Cys He Lys Pro 315 320
TTCTTTAGAA AAACGAGGGT ATGTGATAGA 1047 (2) INFORMATION FOR SEQ ID NO: 880:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 324 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 880:
Met Asn Lys Pro Phe Leu He Leu Leu He Ala Leu He Val Phe Ser
1 5 10 15
Gly Cys Asn Met Arg Lys Tyr Phe Lys Pro Ala Lys His Gin He Lys
20 25 30
Gly Glu Ala Tyr Phe Pro Asn His Leu Gin Glu Ser He Val Ser Ser
35 40 45
Asn Arg Tyr Gly Ala He Leu Lys Asn Gly Ala Val He Gly Asp Lys
50 55 60
Gly Leu Thr Gin Leu Arg He Gly Lys Asn Phe Asn Tyr Glu Ser Ser 65 70 75 80
Phe Leu Asn Glu Ser Gin Gly Phe Phe He Leu Ala Gin Asp Cys Leu
85 90 95
Asn Lys He Asp Lys Lys Thr Asn Lys Ser Lys Val Ala Lys Thr Glu
100 105 110
Glu Thr Glu Leu Lys Leu Lys Gly Val Glu Ala Glu Val Gin Asp Lys
115 120 125
Val Cys His Gin Val Glu Leu He Ser Asn Asn Pro Asn Ala Ser Gin
130 135 140
Gin Ser He Val He Pro Leu Glu Thr Phe Ala Leu Ser Ala Ser Val 145 150 155 160
Lys Gly Asn Leu Leu Ala Val Val Leu Ala Asp Asn Ser Ala Asn Leu
165 170 175
Tyr Asp He Thr Ser Gin Lys Leu Leu Phe Ser Glu Lys Gly Ser Pro
180 185 190
Ser Thr Thr He Asn Ser Leu Met Ala Met Pro He Phe Met Asp Thr
195 200 205
Val Val Val Phe Pro Met Leu Asp Gly Arg Leu Leu Val Val Asp Tyr
210 215 220
Val His Gly Asn Pro Thr Pro He Arg Asn He Val He Ser Ser Asp 225 230 235 240
Lys Phe Phe Asn Asn He Thr Tyr Leu He Val Asp Gly Asn Asn Met
245 250 255
He Ala Ser Thr Gly Lys Arg He Leu Ser Val Val Ser Gly Gin Glu
260 265 270
Phe Asn Tyr Asp Gly Asp He Val Asp Leu Leu Tyr Asp Lys Gly Thr
275 280 285
Leu Tyr Val Leu Thr Leu Asp Gly Gin He Leu Gin Met Asp Lys Ser
290 295 300
Leu Arg Glu Leu Asn Ser Val Lys Leu Pro Xaa Xaa Ala Gin His Asp 305 310 315 320
Cys He Lys Pro (2) INFORMATION FOR SEQ ID NO: 881:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 410 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...366 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 881:
AGATTAAGGT TTAGT ATG CAT GAA TAC TCG GTC GTT TCT TCT TTA ATC GCT 51 Met His Glu Tyr Ser Val Val Ser Ser Leu He Ala 1 5 10
CTT TGC GAA GAG CAT GCG AAG AAA AAT CAA GCC CAT AAG ATT GAA AGA 99 Leu Cys Glu Glu His Ala Lys Lys Asn Gin Ala His Lys He Glu Arg 15 20 25
GTC GTG GTC GGT ATT GGT GAA AGA AGT GCT ATG GAT AAG AGC TTG TTT 147 Val Val Val Gly He Gly Glu Arg Ser Ala Met Asp Lys Ser Leu Phe 30 35 40
GTG AGT GCG TTT GAG ACT TTT AGA GAA GAA TCT TTG GTG TGT AAA GAC 195 Val Ser Ala Phe Glu Thr Phe Arg Glu Glu Ser Leu Val Cys Lys Asp 45 50 55 60
GCT ATT TTA GAC ATT GTA GAT GAA AAG GTT GAA TTA GAA TGC AAG GAT 243 Ala He Leu Asp He Val Asp Glu Lys Val Glu Leu Glu Cys Lys Asp 65 70 75
TGT TCG CAT GTT TTT AAG CCT AAC GCG CTA GAT TAT GGG GTG TGT GAG 291 Cys Ser His Val Phe Lys Pro Asn Ala Leu Asp Tyr Gly Val Cys Glu 80 85 90
AAA TGC CAC AGC AAG AAT GTT ATT ATC ACT CAA GGC AAT GAA ATG CGT 339 Lys Cys His Ser Lys Asn Val He He Thr Gin Gly Asn Glu Met Arg 95 100 105
TTG TTG TCT TTA GAA ATG TTA GCG GAA TAACCGATGC AAGAAGAATT GAACGCT 393 Leu Leu Ser Leu Glu Met Leu Ala Glu 110 115
TACCAGCAAG AAATTGA 410
(2) INFORMATION FOR SEQ ID NO: 882: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 117 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:882:
Met His Glu Tyr Ser Val Val Ser Ser Leu He Ala Leu Cys Glu Glu
1 5 10 15
His Ala Lys Lys Asn Gin Ala His Lys He Glu Arg Val Val Val Gly
20 25 30
He Gly Glu Arg Ser Ala Met Asp Lys Ser Leu Phe Val Ser Ala Phe
35 40 45
Glu Thr Phe Arg Glu Glu Ser Leu Val Cys Lys Asp Ala He Leu Asp
50 55 60
He Val Asp Glu Lys Val Glu Leu Glu Cys Lys Asp Cys Ser His Val 65 70 75 80
Phe Lys Pro Asn Ala Leu Asp Tyr Gly Val Cys Glu Lys Cys His Ser
85 90 95
Lys Asn Val He He Thr Gin Gly Asn Glu Met Arg Leu Leu Ser Leu
100 105 110
Glu Met Leu Ala Glu 115
(2) INFORMATION FOR SEQ ID NO: 883:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 840 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 38...769 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 883:
TTGCAAAAAC TCTCATTAAA AACAAGGAGC AAAAAAG ATG AAA AAG GCG GGC TTT 55
Met Lys Lys Ala Gly Phe 1 5
CTT TTT TTA GCG GTA ATG GCT ATC GTT GTT ATG AGT TTA AAC GCT AAA 103 Leu Phe Leu Ala Val Met Ala He Val Val Met Ser Leu Asn Ala Lys 10 15 20
GAT CCG AAT GTG TTG CGT AAG ATT GTT TTT GAG AAA TGT CTG CCT AAT 151 Asp Pro Asn Val Leu Arg Lys He Val Phe Glu Lys Cys Leu Pro Asn 25 30 35
TAT GAG AAA AAT CAG AAT CCT TCG CCA TGC ATA GAA GTC AAA CCC GAT 199 Tyr Glu Lys Asn Gin Asn Pro Ser Pro Cys He Glu Val Lys Pro Asp 40 45 50
GCC GGC TAT GTG GTT TTA AAA GAT ATT AAC GGC CCG TTG CAA TAT TTG 247 Ala Gly Tyr Val Val Leu Lys Asp He Asn Gly Pro Leu Gin Tyr Leu 55 60 65 70
TTG ATG CCA ACA ACT CAC ATT AGC GGT ATT GAA AGC CCT TTG TTA CTT 295 Leu Met Pro Thr Thr His He Ser Gly He Glu Ser Pro Leu Leu Leu 75 80 85
GAT CCT TCT ACG CCT AAC TTT TTT TAT TTA TCC TGG CAA GCG CGT GAT 343 Asp Pro Ser Thr Pro Asn Phe Phe Tyr Leu Ser Trp Gin Ala Arg Asp 90 95 100
TTT ATG AGT AAA AAA TAC GGC CAA CCC ATT CCT GAT TAT GCG ATT TCT 391 Phe Met Ser Lys Lys Tyr Gly Gin Pro He Pro Asp Tyr Ala He Ser 105 110 115
TTG ACG ATT AAC TCT AGC AAA GGG CGA TCG CAA AAC CAT TTT CAT ATC 439 Leu Thr He Asn Ser Ser Lys Gly Arg Ser Gin Asn His Phe His He 120 125 130
CAT ATC TCT TGC ATT AGT CTT GAA GCA CGC AAA CAG CTG GAT AAT AAC 487 His He Ser Cys He Ser Leu Glu Ala Arg Lys Gin Leu Asp Asn Asn 135 140 145 150
CTA AAA AAA ATC AAC AGC CGT TGG TCG CCA TTA CCG GGC GGT TTG AAT 535 Leu Lys Lys He Asn Ser Arg Trp Ser Pro Leu Pro Gly Gly Leu Asn 155 160 165
GGG CAT AAA TAC TTG GCG CGT CGG GTA ACA GAG AGC GAG TTA GTG CAA 583 Gly His Lys Tyr Leu Ala Arg Arg Val Thr Glu Ser Glu Leu Val Gin 170 175 180
AAA AGC CCG TTT GTC ATG CTT AAT AAA GAA GTG CCT AAT GCG TAC AAA 631 Lys Ser Pro Phe Val Met Leu Asn Lys Glu Val Pro Asn Ala Tyr Lys 185 190 195
CGC ATG GGG GAC TAT GGC TTA GCG GTG GTG CAA CAA AGC GAT AAC TCC 679 Arg Met Gly Asp Tyr Gly Leu Ala Val Val Gin Gin Ser Asp Asn Ser 200 205 210
TTT GTC TTA TTA GCG ACA CAA TTT AAC CCA TTG ACT TTA AAT CGC GCT 727 Phe Val Leu Leu Ala Thr Gin Phe Asn Pro Leu Thr Leu Asn Arg Ala 215 220 225 230
TCA GCC GAA GAG ATT CAA GAT CAT GAA TGC GCG ATT TTG CAC TAAAGCGAG 778 Ser Ala Glu Glu He Gin Asp His Glu Cys Ala He Leu His 235 240 TTAGATTCTT AAGCTTGAGC GATAACCTTT AAAAAGCGTT ATGGGGTGGT GTTGCAAAAC 838 CC 840
(2) INFORMATION FOR SEQ ID NO: 884:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 244 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 884:
Met Lys Lys Ala Gly Phe Leu Phe Leu Ala Val Met Ala He Val Val
1 5 10 15
Met Ser Leu Asn Ala Lys Asp Pro Asn Val Leu Arg Lys He Val Phe
20 25 30
Glu Lys Cys Leu Pro Asn Tyr Glu Lys Asn Gin Asn Pro Ser Pro Cys
35 40 45
He Glu Val Lys Pro Asp Ala Gly Tyr Val Val Leu Lys Asp He Asn
50 55 60
Gly Pro Leu Gin Tyr Leu Leu Met Pro Thr Thr His He Ser Gly He 65 70 75 80
Glu Ser Pro Leu Leu Leu Asp Pro Ser Thr Pro Asn Phe Phe Tyr Leu
85 90 95
Ser Trp Gin Ala Arg Asp Phe Met Ser Lys Lys Tyr Gly Gin Pro He
100 105 110
Pro Asp Tyr Ala He Ser Leu Thr He Asn Ser Ser Lys Gly Arg Ser
115 120 125
Gin Asn His Phe His He His He Ser Cys He Ser Leu Glu Ala Arg
130 135 140
Lys Gin Leu Asp Asn Asn Leu Lys Lys He Asn Ser Arg Trp Ser Pro 145 150 155 160
Leu Pro Gly Gly Leu Asn Gly His Lys Tyr Leu Ala Arg Arg Val Thr
165 170 175
Glu Ser Glu Leu Val Gin Lys Ser Pro Phe Val Met Leu Asn Lys Glu
180 185 190
Val Pro Asn Ala Tyr Lys Arg Met Gly Asp Tyr Gly Leu Ala Val Val
195 200 205
Gin Gin Ser Asp Asn Ser Phe Val Leu Leu Ala Thr Gin Phe Asn Pro
210 215 220
Leu Thr Leu Asn Arg Ala Ser Ala Glu Glu He Gin Asp His Glu Cys 225 230 235 240
Ala He Leu His
(2) INFORMATION FOR SEQ ID NO: 885
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 481 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...441 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 885:
ATATTGAAAG ATAATCAAAA A ATG AAG ACA AGC GCT AAA GTA TTA TTG ACT 51
Met Lys Thr Ser Ala Lys Val Leu Leu Thr 1 5 10
TTA TTG ATT GTA ATA TCA TTA GGT AAG GGA TTA AAT AGT CTC ATA TCA 99 Leu Leu He Val He Ser Leu Gly Lys Gly Leu Asn Ser Leu He Ser 15 20 25
GCT TGG CGT GGC AAA GAT GAT GCG ATC CCC ATT GAA ACA AGA CTC CAT 147 Ala Trp Arg Gly Lys Asp Asp Ala He Pro He Glu Thr Arg Leu His 30 35 40
AAA AAC AAA CTG ACA ATC ATT TCT AAA ACA GAC AGC ATA GAA ATC CAA 195 Lys Asn Lys Leu Thr He He Ser Lys Thr Asp Ser He Glu He Gin 45 50 55
GAC ATT CAG TTT AAT AGA GAG AAT TGT TCT CAC ACT TAT ACT AGT AAG 243 Asp He Gin Phe Asn Arg Glu Asn Cys Ser His Thr Tyr Thr Ser Lys 60 65 70
GAT TTG GAA AAA ATT CAA AAA GAT TTA GAA GAG CTT GAA GAA GGA GTG 291 Asp Leu Glu Lys He Gin Lys Asp Leu Glu Glu Leu Glu Glu Gly Val 75 80 85 90
CCT GAA TTG TTC GAG GAG CTT GAG CGT GAT GAA GAG TCC ATC GCT AAA 339 Pro Glu Leu Phe Glu Glu Leu Glu Arg Asp Glu Glu Ser He Ala Lys 95 100 105
AAT AAA AAA ACG ATC CAA GAG TAT CAA AAT AAA ATT GCT AAT TTT CAA 387 Asn Lys Lys Thr He Gin Glu Tyr Gin Asn Lys He Ala Asn Phe Gin 110 115 120
AAA TAC TAT AAA GAT ATA AAA GAT ATT GAC GAT TAT TCG GCG TTA ATG 435 Lys Tyr Tyr Lys Asp He Lys Asp He Asp Asp Tyr Ser Ala Leu Met 125 130 135
GCT CAA TGAACATAAA GATTCTTATA CTTGGGATAA TGATCTTGAT 481
Ala Gin 140
(2) INFORMATION FOR SEQ ID NO: 886 (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 140 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 886:
Met Lys Thr Ser Ala Lys Val Leu Leu Thr Leu Leu He Val He Ser
1 5 10 15
Leu Gly Lys Gly Leu Asn Ser Leu He Ser Ala Trp Arg Gly Lys Asp
20 25 30
Asp Ala He Pro He Glu Thr Arg Leu His Lys Asn Lys Leu Thr He
35 40 45
He Ser Lys Thr Asp Ser He Glu He Gin Asp He Gin Phe Asn Arg
50 55 60
Glu Asn Cys Ser His Thr Tyr Thr Ser Lys Asp Leu Glu Lys He Gin 65 70 75 80
Lys Asp Leu Glu Glu Leu Glu Glu Gly Val Pro Glu Leu Phe Glu Glu
85 90 95
Leu Glu Arg Asp Glu Glu Ser He Ala Lys Asn Lys Lys Thr He Gin
100 105 110
Glu Tyr Gin Asn Lys He Ala Asn Phe Gin Lys Tyr Tyr Lys Asp He
115 120 125
Lys Asp He Asp Asp Tyr Ser Ala Leu Met Ala Gin 130 135 140
(2) INFORMATION FOR SEQ ID NO: 887:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 540 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 28...486 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 887:
GAAAATTAGG TAATAAATAC AACCAGT ATG CTA AAA AAA ATA TTT TTA ACC AAC 54
Met Leu Lys Lys He Phe Leu Thr Asn 1 5
AGC TTA GGG ATT TTA TGC TCT AGG ATT TTT GGC TTT TTA CGG GAT TTG 102 Ser Leu Gly He Leu Cys Ser Arg He Phe Gly Phe Leu Arg Asp Leu 10 15 20 25 ATG ATG GCT AAT ATT CTA GGG GCT GGG GTG TAT AGC GAT ATT TTC TTT 150 Met Met Ala Asn He Leu Gly Ala Gly Val Tyr Ser Asp He Phe Phe 30 35 40
GTG GCT TTC AAA TTG CCT AAT TTA TTC AGG CGT ATT TTT GCG GAG GGC 198 Val Ala Phe Lys Leu Pro Asn Leu Phe Arg Arg He Phe Ala Glu Gly 45 50 55
TCT TTT TCA CAA AGC TTT TTA CCG AGC TTC ATA CGA AGT TCT ATT AAA 246 Ser Phe Ser Gin Ser Phe Leu Pro Ser Phe He Arg Ser Ser He Lys 60 65 70
GGG AGC TTT GCG AGT TTG GTA GGG CTT ATT TTT TGT ATC GTT TTA TTC 294 Gly Ser Phe Ala Ser Leu Val Gly Leu He Phe Cys He Val Leu Phe 75 80 85
ATG TGG TGC TTA TTG GTG GCG TTA AAT CCC TTA TGG CTA GCT AAA CTC 342 Met Trp Cys Leu Leu Val Ala Leu Asn Pro Leu Trp Leu Ala Lys Leu 90 95 100 105
CTA GCT TAC GGC TTT GAT GAA GAA ACG CTC AAA TTA TGC GCC CCT ATT 390 Leu Ala Tyr Gly Phe Asp Glu Glu Thr Leu Lys Leu Cys Ala Pro He 110 115 120
GTA GCG ATC AAT TTT TGG NAT CTT TTA TTG GTG TTT ATC ACC ACC TTT 438 Val Ala He Asn Phe Trp Xaa Leu Leu Leu Val Phe He Thr Thr Phe 125 130 135
TTA GGC GCG CTT TTA CAA NTA CAA ACA CAG CTT TTT TGC CAG CGC TTA T 487 Leu Gly Ala Leu Leu Gin Xaa Gin Thr Gin Leu Phe Cys Gin Arg Leu 140 145 150
AGCGCAAGCT TACTCAATGT ATGCATGATT TTAGCCCTTT TGATTTCTAA AGA 540
(2) INFORMATION FOR SEQ ID NO: 888:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 153 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 888:
Met Leu Lys Lys He Phe Leu Thr Asn Ser Leu Gly He Leu Cys Ser
1 5 10 15
Arg He Phe Gly Phe Leu Arg Asp Leu Met Met Ala Asn He Leu Gly
20 25 30
Ala Gly Val Tyr Ser Asp He Phe Phe Val Ala Phe Lys Leu Pro Asn
35 40 45
Leu Phe Arg Arg He Phe Ala Glu Gly Ser Phe Ser Gin Ser Phe Leu
50 55 60
Pro Ser Phe He Arg Ser Ser He Lys Gly Ser Phe Ala Ser Leu Val 65 70 75 80
Gly Leu He Phe Cys He Val Leu Phe Met Trp Cys Leu Leu Val Ala
85 90 95
Leu Asn Pro Leu Trp Leu Ala Lys Leu Leu Ala Tyr Gly Phe Asp Glu
100 105 110
Glu Thr Leu Lys Leu Cys Ala Pro He Val Ala He Asn Phe Trp Xaa
115 120 125
Leu Leu Leu Val Phe He Thr Thr Phe Leu Gly Ala Leu Leu Gin Xaa
130 135 140
Gin Thr Gin Leu Phe Cys Gin Arg Leu 145 150
(2) INFORMATION FOR SEQ ID NO: 889:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1080 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 39...1016 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:889:
TTCTTTGTTC AGATTAATCG TTCTTAAAAG GAAGCGTG ATG CTT AAA ACC TAT CAT 56
Met Leu Lys Thr Tyr His 1 5
ATC GCC TTA GCT TGC GTG ATT TTA GCG GTG GTG GTG CTG TTG TTT GGA 104 He Ala Leu Ala Cys Val He Leu Ala Val Val Val Leu Leu Phe Gly 10 15 20
GGG GAG TCC TTG AGC TTG GAA GAA TGG CAA GAA GTG TGC CTT AAT GTG 152 Gly Glu Ser Leu Ser Leu Glu Glu Trp Gin Glu Val Cys Leu Asn Val 25 30 35
AAA AAC CAC TTT TTG CAC AAT GAA GAA CTG AGC TCT TTA AGT ATT ATT 200 Lys Asn His Phe Leu His Asn Glu Glu Leu Ser Ser Leu Ser He He 40 45 50
ATT TTA GAA ATA CGA CTA CCA CGA GTG ATT TTA GCG CTC CTG GTG GGA 248 He Leu Glu He Arg Leu Pro Arg Val He Leu Ala Leu Leu Val Gly 55 60 65 70
GCG AGT TTG TCT GGG AGT GGG GTG GTG ATG CAA ACG ATT TTT AGA AAC 296 Ala Ser Leu Ser Gly Ser Gly Val Val Met Gin Thr He Phe Arg Asn 75 80 85 CCC TTA GTG GAT CCC TTT TTA CTA GGG ATT TCT AGC GGG GCG ATG CTA 344 Pro Leu Val Asp Pro Phe Leu Leu Gly He Ser Ser Gly Ala Met Leu 90 95 100
GGC GTG GCG ATG GCG ATA GCG GTA GTG GAG TCT AAC ATT GCG ATT TTG 392 Gly Val Ala Met Ala He Ala Val Val Glu Ser Asn He Ala He Leu 105 110 115
GCG TTT TTT GGG GCG ATT TTA GCT AGC CTT GCT GTT TTG GCG ATG AAT 440 Ala Phe Phe Gly Ala He Leu Ala Ser Leu Ala Val Leu Ala Met Asn 120 125 130
AGG GTT TTG GGT AAT TCC GTC CTT TCG TTG GTG CTT TCA GGG GTG GTG 488 Arg Val Leu Gly Asn Ser Val Leu Ser Leu Val Leu Ser Gly Val Val 135 140 145 150
TTG AGC GCG TTT TTA AGC GCC TTA GCC GGA GCG ATA AAA TTC TTT GTG 536 Leu Ser Ala Phe Leu Ser Ala Leu Ala Gly Ala He Lys Phe Phe Val 155 160 165
ATC CCC CAA AAA GCG CAA GCG ATT GTC GTG TGG CTT TTA GGG AGC TTG 584 He Pro Gin Lys Ala Gin Ala He Val Val Trp Leu Leu Gly Ser Leu 170 175 180
TCG TTG AGC AGT TAT AAG GAT TGC TTG ATC GCT TTC ATA GGG CTA TCT 632 Ser Leu Ser Ser Tyr Lys Asp Cys Leu He Ala Phe He Gly Leu Ser 185 190 195
TTA GGC TTT ATC CCG CTT TTT TTG TTA AGG TGG CGC ATC AAT TTA TTG 680 Leu Gly Phe He Pro Leu Phe Leu Leu Arg Trp Arg He Asn Leu Leu 200 205 210
AGC TTG AGC GAT GCG CAA AGT TTG AGC TTG GGG ATT AAC CCG GTG CTG 728 Ser Leu Ser Asp Ala Gin Ser Leu Ser Leu Gly He Asn Pro Val Leu 215 220 225 230
TTG CGA TCG CTT TGT TTG GTG TGC GTG AGC GTT GCG AGC GCT TTA GCG 776 Leu Arg Ser Leu Cys Leu Val Cys Val Ser Val Ala Ser Ala Leu Ala 235 240 245
GTG AGC GTG TCC GGC ACG ATT GGC TGG ATT GGG TTA GTC ATT CCG CAT 824 Val Ser Val Ser Gly Thr He Gly Trp He Gly Leu Val He Pro His 250 255 260
GTG GCT AGG TTG TTT TTT GGG GCG AAT TTG CAA AAA CTG CTT TTA AGT 872 Val Ala Arg Leu Phe Phe Gly Ala Asn Leu Gin Lys Leu Leu Leu Ser 265 270 275
TCT TTG TTA ATG GGA GCG TTT TTC TTG CTT CTA GCG GAT GTG GTG GCT 920 Ser Leu Leu Met Gly Ala Phe Phe Leu Leu Leu Ala Asp Val Val Ala 280 285 290
AAA ACC ATT ACC CCC TAT GAT TTA CCG GTA GGC ATT GCG ACA AGC GTT 968 Lys Thr He Thr Pro Tyr Asp Leu Pro Val Gly He Ala Thr Ser Val 295 300 305 310 TTA GGA GCG CCT TTC TTC TTG TGG CTT TTG TTT AGA ACT AGG GGG GTG T 1017 Leu Gly Ala Pro Phe Phe Leu Trp Leu Leu Phe Arg Thr Arg Gly Val 315 320 325
GATGGTTTTA GAAGTTAAAA ACCTGTCCTT TAAATATTCT CAAAAACTCA TTTTGGATAA 1077 ATT 1080
(2) INFORMATION FOR SEQ ID NO: 890:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 326 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 890:
Met Leu Lys Thr Tyr His He Ala Leu Ala Cys Val He Leu Ala Val
1 5 10 15
Val Val Leu Leu Phe Gly Gly Glu Ser Leu Ser Leu Glu Glu Trp Gin
20 25 30
Glu Val Cys Leu Asn Val Lys Asn His Phe Leu His Asn Glu Glu Leu
35 40 45
Ser Ser Leu Ser He He He Leu Glu He Arg Leu Pro Arg Val He
50 55 60
Leu Ala Leu Leu Val Gly Ala Ser Leu Ser Gly Ser Gly Val Val Met 65 70 75 80
Gin Thr He Phe Arg Asn Pro Leu Val Asp Pro Phe Leu Leu Gly He
85 90 95
Ser Ser Gly Ala Met Leu Gly Val Ala Met Ala He Ala Val Val Glu
100 105 110
Ser Asn He Ala He Leu Ala Phe Phe Gly Ala He Leu Ala Ser Leu
115 120 125
Ala Val Leu Ala Met Asn Arg Val Leu Gly Asn Ser Val Leu Ser Leu
130 135 140
Val Leu Ser Gly Val Val Leu Ser Ala Phe Leu Ser Ala Leu Ala Gly 145 150 155 160
Ala He Lys Phe Phe Val He Pro Gin Lys Ala Gin Ala He Val Val
165 170 175
Trp Leu Leu Gly Ser Leu Ser Leu Ser Ser Tyr Lys Asp Cys Leu He
180 185 190
Ala Phe He Gly Leu Ser Leu Gly Phe He Pro Leu Phe Leu Leu Arg
195 200 205
Trp Arg He Asn Leu Leu Ser Leu Ser Asp Ala Gin Ser Leu Ser Leu
210 215 220
Gly He Asn Pro Val Leu Leu Arg Ser Leu Cys Leu Val Cys Val Ser 225 230 235 240
Val Ala Ser Ala Leu Ala Val Ser Val Ser Gly Thr He Gly Trp He
245 250 255
Gly Leu Val He Pro His Val Ala Arg Leu Phe Phe Gly Ala Asn Leu
260 265 270
Gin Lys Leu Leu Leu Ser Ser Leu Leu Met Gly Ala Phe Phe Leu Leu 275 280 285 Leu Ala Asp Val Val Ala Lys Thr He Thr Pro Tyr Asp Leu Pro Val
290 295 300
Gly He Ala Thr Ser Val Leu Gly Ala Pro Phe Phe Leu Trp Leu Leu 305 310 315 320
Phe Arg Thr Arg Gly Val 325
(2) INFORMATION FOR SEQ ID NO: 891:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 410 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 37...363 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:891:
CTAGCTAAAA TCGCCCCAAA AAACGCCAAA ATCGCA ATG TTA GAC TCC ACT ACC 54
Met Leu Asp Ser Thr Thr 1 5
GCT ATC GCC ATC GCC ACG CCT AGC ATC GCC CCG CTA GAA ATC CCT AGT 102 Ala He Ala He Ala Thr Pro Ser He Ala Pro Leu Glu He Pro Ser 10 15 20
AAA AAG GGA TCC ACT AAG GGG TTT CTA AAA ATC GTT TGC ATC ACC ACC 150 Lys Lys Gly Ser Thr Lys Gly Phe Leu Lys He Val Cys He Thr Thr 25 30 35
CCA CTC CCA GAC AAA CTC GCT CCC ACC AGG AGC GCT AAA ATC ACT CGT 198 Pro Leu Pro Asp Lys Leu Ala Pro Thr Arg Ser Ala Lys He Thr Arg 40 45 50
GGT AGT CGT ATT TCT AAA ATA ATA ATA CTT AAA GAG CTC AGT TCT TCA 246 Gly Ser Arg He Ser Lys He He He Leu Lys Glu Leu Ser Ser Ser 55 60 65 70
TTG TGC AAA AAG TGG TTT TTC ACA TTA AGG CAC ACT TCT TGC CAT TCT 294 Leu Cys Lys Lys Trp Phe Phe Thr Leu Arg His Thr Ser Cys His Ser 75 80 85
TCC AAG CTC AAG GAC TCC CCT CCA AAC AAC AGC ACC ACC ACC GCT AAA 342 Ser Lys Leu Lys Asp Ser Pro Pro Asn Asn Ser Thr Thr Thr Ala Lys 90 95 100
ATC ACG CAA GCT AAG GCG ATA TGATAGGTTT TAAGCATCAC GCTTCCTTTT AAGA 397 He Thr Gin Ala Lys Ala He 105
ACGATTAATC TGA 410
(2) INFORMATION FOR SEQ ID NO: 892:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 109 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 892:
Met Leu Asp Ser Thr Thr Ala He Ala He Ala Thr Pro Ser He Ala
1 5 10 15
Pro Leu Glu He Pro Ser Lys Lys Gly Ser Thr Lys Gly Phe Leu Lys
20 25 30
He Val Cys He Thr Thr Pro Leu Pro Asp Lys Leu Ala Pro Thr Arg
35 40 45
Ser Ala Lys He Thr Arg Gly Ser Arg He Ser Lys He He He Leu
50 55 60
Lys Glu Leu Ser Ser Ser Leu Cys Lys Lys Trp Phe Phe Thr Leu Arg 65 70 75 80
His Thr Ser Cys His Ser Ser Lys Leu Lys Asp Ser Pro Pro Asn Asn
85 90 95
Ser Thr Thr Thr Ala Lys He Thr Gin Ala Lys Ala He 100 105
(2) INFORMATION FOR SEQ ID NO: 893:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 711 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 39...662 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 893:
AGGAAAAATG GCTGGGGTGC AAGGATTCGA ACCTCGGA ATG CCA GGA CCA AAA CCT 56
Met Pro Gly Pro Lys Pro 1 5 GGT GCC TTA CCG CTT GGC GAC ACC CCA AAA ACT AAA GAA AGC ATT ATA 104 Gly Ala Leu Pro Leu Gly Asp Thr Pro Lys Thr Lys Glu Ser He He 10 15 20
CAA AAG CTT TTT AAA AAA GTC AAG CTA AAA CGC TAT AAT TTT ATC ATG 152 Gin Lys Leu Phe Lys Lys Val Lys Leu Lys Arg Tyr Asn Phe He Met 25 30 35
GAA AAT GGA TTT GAC CCC ATC ATT TAT AAA CGC TAT TTG AAA AAG AAA 200 Glu Asn Gly Phe Asp Pro He He Tyr Lys Arg Tyr Leu Lys Lys Lys 40 45 50
GAA ACC TTT TTG CTG TTT AAA AAA ATC GCT CAA GCG TCT GCG TTT AAA 248 Glu Thr Phe Leu Leu Phe Lys Lys He Ala Gin Ala Ser Ala Phe Lys 55 60 65 70
AAT TTA AAA CTC CAA CTC AAA CGA AGA GAA ATA ATC AAC CGC TAT GTT 296 Asn Leu Lys Leu Gin Leu Lys Arg Arg Glu He He Asn Arg Tyr Val 75 80 85
TCT CAA GCT TTG GGG GAT TTA AAA AAA GGG TTT AGA TAC GCT AAA GTA 344 Ser Gin Ala Leu Gly Asp Leu Lys Lys Gly Phe Arg Tyr Ala Lys Val 90 95 100
GAA CAC CAA ATC CTA AAA ATC TAT TTC ACG CAC CCT AGC TAT TTG AAA 392 Glu His Gin He Leu Lys He Tyr Phe Thr His Pro Ser Tyr Leu Lys 105 110 115
GCC TTT AAA ATA GAA GAA GCC TAT TAC ACC AAC CAC CTG AAA GCC CAT 440 Ala Phe Lys He Glu Glu Ala Tyr Tyr Thr Asn His Leu Lys Ala His 120 125 130
TTA AAA GAA ACG CAA AAA ACC CTA AAA GCC CTA GAT TAC CCC TTT GAT 488 Leu Lys Glu Thr Gin Lys Thr Leu Lys Ala Leu Asp Tyr Pro Phe Asp 135 140 145 150
TTT AAG ACT ATC CAA GCG AGC GTG AAA AAA AGG GCT TAT CAA AAA CCA 536 Phe Lys Thr He Gin Ala Ser Val Lys Lys Arg Ala Tyr Gin Lys Pro 155 160 165
GTT GTT AAA AAA GAA AAA CCC CCT AAA AGC GTG AAT GTC AAT TGC GAA 584 Val Val Lys Lys Glu Lys Pro Pro Lys Ser Val Asn Val Asn Cys Glu 170 175 180
GGT TTG AGC GAT TTC ACT AAA AAG CAA TTT TTA AAG CTC AAA CGC GCT 632 Gly Leu Ser Asp Phe Thr Lys Lys Gin Phe Leu Lys Leu Lys Arg Ala 185 190 195
TGT AAC GAT AAT ACG CTG CGC ACG CCC CCT TGAGAGCTGA CCATGCAACT GCC 685 Cys Asn Asp Asn Thr Leu Arg Thr Pro Pro 200 205
GATCGGGTTT TGCGGGGTGC AAGTTT 711
(2) INFORMATION FOR SEQ ID NO: 894: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 208 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 894:
Met Pro Gly Pro Lys Pro Gly Ala Leu Pro Leu Gly Asp Thr Pro Lys
1 5 10 15
Thr Lys Glu Ser He He Gin Lys Leu Phe Lys Lys Val Lys Leu Lys
20 25 30
Arg Tyr Asn Phe He Met Glu Asn Gly Phe Asp Pro He He Tyr Lys
35 40 45
Arg Tyr Leu Lys Lys Lys Glu Thr Phe Leu Leu Phe Lys Lys He Ala
50 55 60
Gin Ala Ser Ala Phe Lys Asn Leu Lys Leu Gin Leu Lys Arg Arg Glu 65 70 75 80
He He Asn Arg Tyr Val Ser Gin Ala Leu Gly Asp Leu Lys Lys Gly
85 90 95
Phe Arg Tyr Ala Lys Val Glu His Gin He Leu Lys He Tyr Phe Thr
100 105 110
His Pro Ser Tyr Leu Lys Ala Phe Lys He Glu Glu Ala Tyr Tyr Thr
115 120 125
Asn His Leu Lys Ala His Leu Lys Glu Thr Gin Lys Thr Leu Lys Ala
130 135 140
Leu Asp Tyr Pro Phe Asp Phe Lys Thr He Gin Ala Ser Val Lys Lys 145 150 155 160
Arg Ala Tyr Gin Lys Pro Val Val Lys Lys Glu Lys Pro Pro Lys Ser
165 170 175
Val Asn Val Asn Cys Glu Gly Leu Ser Asp Phe Thr Lys Lys Gin Phe
180 185 190
Leu Lys Leu Lys Arg Ala Cys Asn Asp Asn Thr Leu Arg Thr Pro Pro 195 200 205
(2) INFORMATION FOR SEQ ID NO: 895:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 486 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 85...426 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 895: TGAATCACAG CTGAGACCAT TAGACCCGCT TTACAAATCA TCAAAACTAA ACCCGGCGTG 60 AGCCTGGTTT CAAGCGTGTT TTTA ATG TGT TTA GAC ACT CAA GTG CTA GTC 111
Met Cys Leu Asp Thr Gin Val Leu Val 1 5
TTT GGG GAT TGC GCG ATT ATC CCT AAC CCT AGC CCT AAA GAA TTA GCC 159 Phe Gly Asp Cys Ala He He Pro Asn Pro Ser Pro Lys Glu Leu Ala 10 15 20 25
GAG ATC GCT ACC ACT TCC GCA CAA ACC GCC AAG CAA TTC AAT ATT GCG 207 Glu He Ala Thr Thr Ser Ala Gin Thr Ala Lys Gin Phe Asn He Ala 30 35 40
CCT AAA GTG GCC TTG CTT TCT TAT GCG ACA GGC GAT TCC GCT CAA GGC 255 Pro Lys Val Ala Leu Leu Ser Tyr Ala Thr Gly Asp Ser Ala Gin Gly 45 50 55
GAA ATG ATA GAC AAA ATC AAC GAA GCT TTA ACA ATC GCT CAA AAG TTG 303 Glu Met He Asp Lys He Asn Glu Ala Leu Thr He Ala Gin Lys Leu 60 65 70
GAT CCC CAA TTA GAA ATT GAT GGC CCC TTA CAA TTT GAC GCT TCC ATT 351 Asp Pro Gin Leu Glu He Asp Gly Pro Leu Gin Phe Asp Ala Ser He 75 80 85
GAT AAA AGC GTA GCC AAG AAA AAA TGC CTA ACA GCC AAG TGG CTG GGC 399 Asp Lys Ser Val Ala Lys Lys Lys Cys Leu Thr Ala Lys Trp Leu Gly 90 95 100 105
AAG CTA GCG TTT TTA TTT TCC CGG ATT TAAACGCTGG GAACATCGCT TATAAAG 453 Lys Leu Ala Phe Leu Phe Ser Arg lie 110
CGGTGCAACG GAGCGCTAAA GCCGTGGCGA TAG 486
(2) INFORMATION FOR SEQ ID NO: 896:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 114 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 896:
Met Cys Leu Asp Thr Gin Val Leu Val Phe Gly Asp Cys Ala He He
1 5 10 15
Pro Asn Pro Ser Pro Lys Glu Leu Ala Glu He Ala Thr Thr Ser Ala
20 25 30
Gin Thr Ala Lys Gin Phe Asn He Ala Pro Lys Val Ala Leu Leu Ser
35 40 45
Tyr Ala Thr Gly Asp Ser Ala Gin Gly Glu Met He Asp Lys He Asn 50 55 60 Glu Ala Leu Thr He Ala Gin Lys Leu Asp Pro Gin Leu Glu He Asp 65 70 75 80
Gly Pro Leu Gin Phe Asp Ala Ser He Asp Lys Ser Val Ala Lys Lys
85 90 95
Lys Cys Leu Thr Ala Lys Trp Leu Gly Lys Leu Ala Phe Leu Phe Ser
100 105 110
Arg He
(2) INFORMATION FOR SEQ ID NO: 897:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1151 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 38...1111 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 897:
AGGGCAGGTT TTCACCCCTA AAAAGATAGT GGATTTC ATG CTC ACT CTC AAA CAC 55
Met Leu Thr Leu Lys His 1 5
AAT CAT GGG AGT GTT TTA GAA CCG AGT GCT GGC GAT GGG AGT TTT TTA 103 Asn His Gly Ser Val Leu Glu Pro Ser Ala Gly Asp Gly Ser Phe Leu 10 15 20
AAG CGC TTA AAA AAG GCC GTA AGG ATT GAA ATC GAT CCT AAA ATC TGC 151 Lys Arg Leu Lys Lys Ala Val Arg He Glu He Asp Pro Lys He Cys 25 30 35
CCT AAA AAT GCC CTT TGC ATG GAC TTT TTT GAC TAC CCT TTA GAA AAT 199 Pro Lys Asn Ala Leu Cys Met Asp Phe Phe Asp Tyr Pro Leu Glu Asn 40 45 50
CAA TTT GAC ACC ATT ATT GGT AAC CCG CCC TAT GTC AAG CAC AAG GAT 247 Gin Phe Asp Thr He He Gly Asn Pro Pro Tyr Val Lys His Lys Asp 55 60 65 70
ATT GCG CCA AGC ACC AAA GAA AAA CTC CAT TAC AGC CTT TTT GAT GAA 295 He Ala Pro Ser Thr Lys Glu Lys Leu His Tyr Ser Leu Phe Asp Glu 75 80 85
AGG AGT AAT CTC TAC TTG TTT TTC ATA GAA AAA GCG ATC AAG CAT TTA 343 Arg Ser Asn Leu Tyr Leu Phe Phe He Glu Lys Ala He Lys His Leu 90 95 100 AAA CCT AAA GGC GAA TTG ATT TTC ATC ACC CCA AGG GAT TTT TTA AAA 391 Lys Pro Lys Gly Glu Leu He Phe He Thr Pro Arg Asp Phe Leu Lys 105 110 115
TCC ACT TCT AGC GTG AAA TTA AAC GAA TGG ATT TAT AAA GAA GGC ACG 439 Ser Thr Ser Ser Val Lys Leu Asn Glu Trp He Tyr Lys Glu Gly Thr 120 125 130
ATA ACG CAT TTT TTT GAA CTG GGC GAT CAA AAG GTT TTC CCA AAC GCC 487 He Thr His Phe Phe Glu Leu Gly Asp Gin Lys Val Phe Pro Asn Ala 135 140 145 150
ATG CCT AAT TGC GTG ATT TTT CGT TTT TGT AAG GGT AAT TTC AGT AGA 535 Met Pro Asn Cys Val He Phe Arg Phe Cys Lys Gly Asn Phe Ser Arg 155 160 165
ATC ACC AAC GAT GGT TTG CAA TTT TTG TGC AAA AAA GGC ATT TTG TAT 583 He Thr Asn Asp Gly Leu Gin Phe Leu Cys Lys Lys Gly He Leu Tyr 170 175 180
TTC CTC AAC CAA TCT TAC ACG CAA AAA TTA AGC GAG GTT TTT AAG GTT 631 Phe Leu Asn Gin Ser Tyr Thr Gin Lys Leu Ser Glu Val Phe Lys Val 185 190 195
AAA GTG GGG GCA GTG AGC GGG TGC GAT AAG ATT TTT AAA AAT GAA AAA 679 Lys Val Gly Ala Val Ser Gly Cys Asp Lys He Phe Lys Asn Glu Lys 200 205 210
TAC GGG AAT TTA GAA TTT GTC ACC TCA ATC ACG AAA AGA ACC AAT GCT 727 Tyr Gly Asn Leu Glu Phe Val Thr Ser He Thr Lys Arg Thr Asn Ala 215 220 225 230
TTA GAA AAA ATG GTT TTT GTC AAT GAG CCT AAT GAT TAT TTA CTC CAG 775 Leu Glu Lys Met Val Phe Val Asn Glu Pro Asn Asp Tyr Leu Leu Gin 235 240 245
CAT AAA GAC AGC TTA ATG CAA AGA AAG ATT AAA AAA TTC AAT GAA AAT 823 His Lys Asp Ser Leu Met Gin Arg Lys He Lys Lys Phe Asn Glu Asn 250 255 260
AAC TGG TTT GAG TGG GGG AGA ATG CAT CAC ATA TCC CCT AAA AAA CGC 871 Asn Trp Phe Glu Trp Gly Arg Met His His He Ser Pro Lys Lys Arg 265 270 275
ATT TAT GTC AAC GCC AAA ACG CAC CAA AAA AAC CCC TTT TTT ATC CAC 919 He Tyr Val Asn Ala Lys Thr His Gin Lys Asn Pro Phe Phe He His 280 285 290
CAA TGC CCT AAT TAT GAC GGC TCT ATT TTA GCG CTA TTC CCT TAT AAC 967 Gin Cys Pro Asn Tyr Asp Gly Ser He Leu Ala Leu Phe Pro Tyr Asn 295 300 305 310
CAA AAC CTG GAC TTA CAA AAT CTC TGC GAC AAA CTC AAC GCT ATC AAC 1015 Gin Asn Leu Asp Leu Gin Asn Leu Cys Asp Lys Leu Asn Ala He Asn 315 320 325 TGG CAA GAA TTA GGC TTT GTG TGC GGC GGG CGT TTT TTG TTT TCG CAG 1063 Trp Gin Glu Leu Gly Phe Val Cys Gly Gly Arg Phe Leu Phe Ser Gin 330 335 340
CGC TCT TTA GAA AAC GCG CTT TTG CCT AAA GAC TTT TTA AAT CTA GGA T 1112 Arg Ser Leu Glu Asn Ala Leu Leu Pro Lys Asp Phe Leu Asn Leu Gly 345 350 355
AAAACTTGTT AGAAACTTTG CAATTAAACC CTGAGCAGC 1151
(2) INFORMATION FOR SEQ ID NO: 898:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 358 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 898:
Met Leu Thr Leu Lys His Asn His Gly Ser Val Leu Glu Pro Ser Ala
1 5 10 15
Gly Asp Gly Ser Phe Leu Lys Arg Leu Lys Lys Ala Val Arg He Glu
20 25 30
He Asp Pro Lys He Cys Pro Lys Asn Ala Leu Cys Met Asp Phe Phe
35 40 45
Asp Tyr Pro Leu Glu Asn Gin Phe Asp Thr He He Gly Asn Pro Pro
50 55 60
Tyr Val Lys His Lys Asp He Ala Pro Ser Thr Lys Glu Lys Leu His 65 70 75 80
Tyr Ser Leu Phe Asp Glu Arg Ser Asn Leu Tyr Leu Phe Phe He Glu
85 90 95
Lys Ala He Lys His Leu Lys Pro Lys Gly Glu Leu He Phe He Thr
100 105 110
Pro Arg Asp Phe Leu Lys Ser Thr Ser Ser Val Lys Leu Asn Glu Trp
115 120 125
He Tyr Lys Glu Gly Thr He Thr His Phe Phe Glu Leu Gly Asp Gin
130 135 140
Lys Val Phe Pro Asn Ala Met Pro Asn Cys Val He Phe Arg Phe Cys 145 150 155 160
Lys Gly Asn Phe Ser Arg He Thr Asn Asp Gly Leu Gin Phe Leu Cys
165 170 175
Lys Lys Gly He Leu Tyr Phe Leu Asn Gin Ser Tyr Thr Gin Lys Leu
180 185 190
Ser Glu Val Phe Lys Val Lys Val Gly Ala Val Ser Gly Cys Asp Lys
195 200 205
He Phe Lys Asn Glu Lys Tyr Gly Asn Leu Glu Phe Val Thr Ser He
210 215 220
Thr Lys Arg Thr Asn Ala Leu Glu Lys Met Val Phe Val Asn Glu Pro 225 230 235 240
Asn Asp Tyr Leu Leu Gin His Lys Asp Ser Leu Met Gin Arg Lys He
245 250 255
Lys Lys Phe Asn Glu Asn Asn Trp Phe Glu Trp Gly Arg Met His His 260 265 270
He Ser Pro Lys Lys Arg He Tyr Val Asn Ala Lys Thr His Gin Lys
275 280 285
Asn Pro Phe Phe He His Gin Cys Pro Asn Tyr Asp Gly Ser He Leu
290 295 300
Ala Leu Phe Pro Tyr Asn Gin Asn Leu Asp Leu Gin Asn Leu Cys Asp 305 310 315 320
Lys Leu Asn Ala He Asn Trp Gin Glu Leu Gly Phe Val Cys Gly Gly
325 330 335
Arg Phe Leu Phe Ser Gin Arg Ser Leu Glu Asn Ala Leu Leu Pro Lys
340 345 350
Asp Phe Leu Asn Leu Gly 355
(2) INFORMATION FOR SEQ ID NO: 899:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1183 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1130 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 899:
CCTAAAGCGG AGAATAAAGT TACAGAAGTC CTAGCGAGCA AAACAATGTG ATG GCT 56
Met Ala 1
AAG ATC AAT GGT TAT TTG AGC GAA AGG GAT ATT TTA ACG CTC AGT TAT 104 Lys He Asn Gly Tyr Leu Ser Glu Arg Asp He Leu Thr Leu Ser Tyr 5 10 15
AAC ATG ACC AGA GAC AAC GCT AAC CGC CCT TTA AGA GCG AAT TTT ACA 152 Asn Met Thr Arg Asp Asn Ala Asn Arg Pro Leu Arg Ala Asn Phe Thr 20 25 30
GGC ACT TTT TTA CCC TAT TCT TGC GGT GAT TTT AAC GCT TTC CCT AAC 200 Gly Thr Phe Leu Pro Tyr Ser Cys Gly Asp Phe Asn Ala Phe Pro Asn 35 40 45 50
GAG AAA AAC CCT AGC GAT TGT TTG TTT GAA AAC GAC GCT AGT TTG TTT 248 Glu Lys Asn Pro Ser Asp Cys Leu Phe Glu Asn Asp Ala Ser Leu Phe 55 60 65
AAA ACT TAT AGC GTC AAT TTA GTG CAT AAT GTG AGT TTG AAT TAT GAA 296 Lys Thr Tyr Ser Val Asn Leu Val His Asn Val Ser Leu Asn Tyr Glu 70 75 80
AGA GAA GGG GGG AGC CGT TTT GGT GAT CCT AAA TTA AAA ATC AAT GGC 344 Arg Glu Gly Gly Ser Arg Phe Gly Asp Pro Lys Leu Lys He Asn Gly 85 90 95
TAT ACA AGC ATT AGG AAT GTC CAA ATT GAT CCG CTT TTT AAG CCT AAC 392 Tyr Thr Ser He Arg Asn Val Gin He Asp Pro Leu Phe Lys Pro Asn 100 105 110
GAC ATA GCG GCT AGT ATT CCT TTC ACC CCA AAC CCA AAA CTT GGC GAA 440 Asp He Ala Ala Ser He Pro Phe Thr Pro Asn Pro Lys Leu Gly Glu 115 120 125 130
GAG AAT GAA TGC GTG GCG CAA GGG GGC ATT TAT GAC GCT CTT AAA CAA 488 Glu Asn Glu Cys Val Ala Gin Gly Gly He Tyr Asp Ala Leu Lys Gin 135 140 145
ACT TGC TCC ATC ACT TTT AAA AGC CTT GGA GGG GGT TCT GTG GTG GCT 536 Thr Cys Ser He Thr Phe Lys Ser Leu Gly Gly Gly Ser Val Val Ala 150 155 160
AAT AAA AAT TTA TTC ATC ATC AAT TCT GGG TTT AAT GCG AAC GTG ATC 584 Asn Lys Asn Leu Phe He He Asn Ser Gly Phe Asn Ala Asn Val He 165 170 175
CAC ACC ATA GAC CAT AAG AAT GAC AAC CTT TTG GAA TAC GGG TTG AAT 632 His Thr He Asp His Lys Asn Asp Asn Leu Leu Glu Tyr Gly Leu Asn 180 185 190
TAC CAA AAC TTA ACC ACT TTT GAT AAA GCG ATC CCT AAT AGC GAA TTA 680 Tyr Gin Asn Leu Thr Thr Phe Asp Lys Ala He Pro Asn Ser Glu Leu 195 200 205 210
GTC AAA CCC GGC GAT GCC CCT GAC GCA TGC TTA AGG GTT ACA AGC CCC 728 Val Lys Pro Gly Asp Ala Pro Asp Ala Cys Leu Arg Val Thr Ser Pro 215 220 225
AAT GAT CCC AAC ATG AAC GGG CGT TGC CAA CGA AAT GGC GCT ACG GCG 776 Asn Asp Pro Asn Met Asn Gly Arg Cys Gin Arg Asn Gly Ala Thr Ala 230 235 240
AAT GTG ATT GGG GTG TAT GCG CAA GCG AAT TAC ACC TTG CAT CCT ATG 824 Asn Val He Gly Val Tyr Ala Gin Ala Asn Tyr Thr Leu His Pro Met 245 250 255
GTA ACT TTA GGG GCA GGG ACT CGT TAT GAT GTC TAT ACT TTA GTG GAT 872 Val Thr Leu Gly Ala Gly Thr Arg Tyr Asp Val Tyr Thr Leu Val Asp 260 265 270
AAA GAC TGG CAA TTG CAC ATA ACC CAA GGG TTT AGC CCT AGC GCG GCT 920 Lys Asp Trp Gin Leu His He Thr Gin Gly Phe Ser Pro Ser Ala Ala 275 280 285 290
TTA AAT GTC TCG CCT TTA GAA AAT TTG AAT TTC AGG CTT TCT TAT GCG 968 Leu Asn Val Ser Pro Leu Glu Asn Leu Asn Phe Arg Leu Ser Tyr Ala 295 300 305
TAT GTA ACC AGA GGC CCT ATG CCT GGA GGT TTG GTG TGG ATG CGT CAA 1016 Tyr Val Thr Arg Gly Pro Met Pro Gly Gly Leu Val Trp Met Arg Gin 310 315 320
GAT AAT TTG CGN CTA CAA CCG CAA TTT AAA GCC AGA AAT TGG GCA AAA 1064 Asp Asn Leu Xaa Leu Gin Pro Gin Phe Lys Ala Arg Asn Trp Ala Lys 325 330 335
TGT GGA ATT TTA ACA CCG AAT ACA GCA GTC AGT ATT TTG ATT TTA GAG 1112 Cys Gly He Leu Thr Pro Asn Thr Ala Val Ser He Leu He Leu Glu 340 345 350
CCG CCG GTT TTG TCC AAT TGATTTCTAA TTACATCAAT CAATTTTCTT CAACGCTT 1168 Pro Pro Val Leu Ser Asn 355 360
TTTGTAACCA ACTTG 1183
(2) INFORMATION FOR SEQ ID NO: 900:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 360 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 900:
Met Ala Lys He Asn Gly Tyr Leu Ser Glu Arg Asp He Leu Thr Leu
1 5 10 15
Ser Tyr Asn Met Thr Arg Asp Asn Ala Asn Arg Pro Leu Arg Ala Asn
20 25 30
Phe Thr Gly Thr Phe Leu Pro Tyr Ser Cys Gly Asp Phe Asn Ala Phe
35 40 45
Pro Asn Glu Lys Asn Pro Ser Asp Cys Leu Phe Glu Asn Asp Ala Ser
50 55 60
Leu Phe Lys Thr Tyr Ser Val Asn Leu Val His Asn Val Ser Leu Asn 65 70 75 80
Tyr Glu Arg Glu Gly Gly Ser Arg Phe Gly Asp Pro Lys Leu Lys He
85 90 95
Asn Gly Tyr Thr Ser He Arg Asn Val Gin He Asp Pro Leu Phe Lys
100 105 110
Pro Asn Asp He Ala Ala Ser He Pro Phe Thr Pro Asn Pro Lys Leu
115 120 125
Gly Glu Glu Asn Glu Cys Val Ala Gin Gly Gly He Tyr Asp Ala Leu
130 135 140
Lys Gin Thr Cys Ser He Thr Phe Lys Ser Leu Gly Gly Gly Ser Val 145 150 155 160
Val Ala Asn Lys Asn Leu Phe He He Asn Ser Gly Phe Asn Ala Asn 165 170 175 Val He His Thr He Asp His Lys Asn Asp Asn Leu Leu Glu Tyr Gly
180 185 190
Leu Asn Tyr Gin Asn Leu Thr Thr Phe Asp Lys Ala He Pro Asn Ser
195 200 205
Glu Leu Val Lys Pro Gly Asp Ala Pro Asp Ala Cys Leu Arg Val Thr
210 215 220
Ser Pro Asn Asp Pro Asn Met Asn Gly Arg Cys Gin Arg Asn Gly Ala 225 230 235 240
Thr Ala Asn Val He Gly Val Tyr Ala Gin Ala Asn Tyr Thr Leu His
245 250 255
Pro Met Val Thr Leu Gly Ala Gly Thr Arg Tyr Asp Val Tyr Thr Leu
260 265 270
Val Asp Lys Asp Trp Gin Leu His He Thr Gin Gly Phe Ser Pro Ser
275 280 285
Ala Ala Leu Asn Val Ser Pro Leu Glu Asn Leu Asn Phe Arg Leu Ser
290 295 300
Tyr Ala Tyr Val Thr Arg Gly Pro Met Pro Gly Gly Leu Val Trp Met 305 310 315 320
Arg Gin Asp Asn Leu Xaa Leu Gin Pro Gin Phe Lys Ala Arg Asn Trp
325 330 335
Ala Lys Cys Gly He Leu Thr Pro Asn Thr Ala Val Ser He Leu He
340 345 350
Leu Glu Pro Pro Val Leu Ser Asn 355 360
(2) INFORMATION FOR SEQ ID NO: 901:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 431 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...387 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 901:
AGACTGAATA AAATCGCACT CGCTCCCGCA ATG ACA ACC TGG AAC ATG GGG CTG 54
Met Thr Thr Trp Asn Met Gly Leu 1 5
CCC AAA AAC AAG TTA ATG AGC GAA CAC ACC ACC ACC ACA ATC AAA GCG 102 Pro Lys Asn Lys Leu Met Ser Glu His Thr Thr Thr Thr He Lys Ala 10 15 20
ATG AAG AGC ATT TTA CCC ATA TTC GCT AGA TCG TTT TTA GTC TTA AGG 150 Met Lys Ser He Leu Pro He Phe Ala Arg Ser Phe Leu Val Leu Arg 25 30 35 40 GCA TAC ACG CTC ATC AAA CCA AAG ACA ATA GTT GTC ATG CCC AAA GCC 198 Ala Tyr Thr Leu He Lys Pro Lys Thr He Val Val Met Pro Lys Ala 45 50 55
TGC CAA ATC GCT CCT AAA CCA GCT TTT GCA ATC ACC ATA CCC AAC AAA 246 Cys Gin He Ala Pro Lys Pro Ala Phe Ala He Thr He Pro Asn Lys 60 65 70
GGC ACT AGC GTA ACC CCT GAT AAT GAA GTG AAA GCA AAC AGC ATG AAC 294 Gly Thr Ser Val Thr Pro Asp Asn Glu Val Lys Ala Asn Ser Met Asn 75 80 85
AGA TTC AAT CCG GGT TTA GAT TTA GAA AAC ATC AAA CCA AAA AAC GCC 342 Arg Phe Asn Pro Gly Leu Asp Leu Glu Asn He Lys Pro Lys Asn Ala 90 95 100
GCA ATT TCA GCG ATA AAA AAC ACC CAT TTA TAC TGC ACT ACG GCT TGAAA 392 Ala He Ser Ala He Lys Asn Thr His Leu Tyr Cys Thr Thr Ala 105 110 115
ATTCATTAAA CCTAGTAACG CCCCAATAGT CGCTAATAA 431
(2) INFORMATION FOR SEQ ID NO: 902:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 119 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 902:
Met Thr Thr Trp Asn Met Gly Leu Pro Lys Asn Lys Leu Met Ser Glu
1 5 10 15
His Thr Thr Thr Thr He Lys Ala Met Lys Ser He Leu Pro He Phe
20 25 30
Ala Arg Ser Phe Leu Val Leu Arg Ala Tyr Thr Leu He Lys Pro Lys
35 40 45
Thr He Val Val Met Pro Lys Ala Cys Gin He Ala Pro Lys Pro Ala
50 55 60
Phe Ala He Thr He Pro Asn Lys Gly Thr Ser Val Thr Pro Asp Asn 65 70 75 80
Glu Val Lys Ala Asn Ser Met Asn Arg Phe Asn Pro Gly Leu Asp Leu
85 90 95
Glu Asn He Lys Pro Lys Asn Ala Ala He Ser Ala He Lys Asn Thr
100 105 110
His Leu Tyr Cys Thr Thr Ala 115
(2) INFORMATION FOR SEQ ID NO:903:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 671 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(11) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 21...599 (D) OTHER INFORMATION.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 903:
AAGTTTGATA TACTAACAGA ATG AAT ACT TAT AAA AAC AGC TTG AAT CAC 50
Met Asn Thr Tyr Lys Asn Ser Leu Asn His 1 5 10
TTT TTA AAT TTA GTG GAT TGT TTA GAA AAA ATC CCC AAT GTG GGT AAA 98 Phe Leu Asn Leu Val Asp Cys Leu Glu Lys He Pro Asn Val Gly Lys 15 20 25
AAG TCC GCC TTT AAA ATG GCG TAT CAT TTG GGT TTA GAA AAC CCC TAT 146 Lys Ser Ala Phe Lys Met Ala Tyr His Leu Gly Leu Glu Asn Pro Tyr 30 35 40
CTG GCG CTA AAA ATC ACG CAC GCT TTA GAG AAC GCC CTA GAA AAC CTT 194 Leu Ala Leu Lys He Thr His Ala Leu Glu Asn Ala Leu Glu Asn Leu 45 50 55
AAA ACA TGT TCA TCT TGT AAC GCG CTC AGC GAG AGT GAG GTT TGT GAG 242 Lys Thr Cys Ser Ser Cys Asn Ala Leu Ser Glu Ser Glu Val Cys Glu 60 65 70
ATT TGC TCT GAT GAA AGC CGA CAA AAT TCT CAG CTT TGC ATG GTT TTA 290 He Cys Ser Asp Glu Ser Arg Gin Asn Ser Gin Leu Cys Met Val Leu 75 80 85 90
CAC CCA AGA GAT GTG TTT ATT TTA GAA GAT TTA AAG GAT TTT TTA GGG 338 His Pro Arg Asp Val Phe He Leu Glu Asp Leu Lys Asp Phe Leu Gly 95 100 105
CGC TAT TAT GTG TTA AAC TCC ATA GAA GAA GTG GAT TTT AAC GCC CTA 386 Arg Tyr Tyr Val Leu Asn Ser He Glu Glu Val Asp Phe Asn Ala Leu 110 115 120
GAA AAA CGC CTG ATT GAA GAA AAC ATT AAA GAA ATC ATT TTT GCT TTC 434 Glu Lys Arg Leu He Glu Glu Asn He Lys Glu He He Phe Ala Phe 125 130 135
CCT CCC ACT TTA GCT AAT GAT TCT CTA ATG CTT TAT ATT GAA GAC AAA 482 Pro Pro Thr Leu Ala Asn Asp Ser Leu Met Leu Tyr He Glu Asp Lys 140 145 150 TTA CAG CAT TTC CAC CTC ACT TTC ACT AAA ATC GCT CAA GGC GTG CCT 530 Leu Gin His Phe His Leu Thr Phe Thr Lys He Ala Gin Gly Val Pro 155 160 165 170
ACT GGA GTG AAT TTT GAA AAC ATT GAC TCA GTT TCG CTC TCA AGG GCG 578 Thr Gly Val Asn Phe Glu Asn He Asp Ser Val Ser Leu Ser Arg Ala 175 180 185
TTT AAT TCA AGG ATC AAA GCA TGAATTTAAA TTTTATGCCC CTATTGCATG CTTA 633 Phe Asn Ser Arg He Lys Ala 190
TAACCATGCG AGCATTGATT TTCATTTCAA TTCTAGTG 671
(2) INFORMATION FOR SEQ ID NO: 904:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 193 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 904:
Met Asn Thr Tyr Lys Asn Ser Leu Asn His Phe Leu Asn Leu Val Asp
1 5 10 15
Cys Leu Glu Lys He Pro Asn Val Gly Lys Lys Ser Ala Phe Lys Met
20 25 30
Ala Tyr His Leu Gly Leu Glu Asn Pro Tyr Leu Ala Leu Lys He Thr
35 40 45
His Ala Leu Glu Asn Ala Leu Glu Asn Leu Lys Thr Cys Ser Ser Cys
50 55 60
Asn Ala Leu Ser Glu Ser Glu Val Cys Glu He Cys Ser Asp Glu Ser 65 70 75 80
Arg Gin Asn Ser Gin Leu Cys Met Val Leu His Pro Arg Asp Val Phe
85 90 95
He Leu Glu Asp Leu Lys Asp Phe Leu Gly Arg Tyr Tyr Val Leu Asn
100 105 110
Ser He Glu Glu Val Asp Phe Asn Ala Leu Glu Lys Arg Leu He Glu
115 120 125
Glu Asn He Lys Glu He He Phe Ala Phe Pro Pro Thr Leu Ala Asn
130 135 140
Asp Ser Leu Met Leu Tyr He Glu Asp Lys Leu Gin His Phe His Leu 145 150 155 160
Thr Phe Thr Lys He Ala Gin Gly Val Pro Thr Gly Val Asn Phe Glu
165 170 175
Asn He Asp Ser Val Ser Leu Ser Arg Ala Phe Asn Ser Arg He Lys
180 185 190
Ala
(2) INFORMATION FOR SEQ ID NO: 905: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 846 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 26...793 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 905:
AAAAAATTCA GGATTAAAAT ATAAA ATG AAA AAA GTT TTA TTT TTG TNG GTA 52
Met Lys Lys Val Leu Phe Leu Xaa Val 1 5
ATA AGC TTT TTT GGG GGT TTT TTG AAC GCT TCT AGC TTG TAT GAA AAA 100 He Ser Phe Phe Gly Gly Phe Leu Asn Ala Ser Ser Leu Tyr Glu Lys 10 15 20 25
CTG ATT AAT AAA GAA ACG ATC AGC GTT GGC ACA GAA GGC ATT TAC CCC 148 Leu He Asn Lys Glu Thr He Ser Val Gly Thr Glu Gly He Tyr Pro 30 35 40
CCT TTC ACT TAC CAC AAT AAA GAA GGC AAG CTC ACC GGC TAT GAT GTG 196 Pro Phe Thr Tyr His Asn Lys Glu Gly Lys Leu Thr Gly Tyr Asp Val 45 50 55
GAA GTG GCT AGG GAG TTG GCC AAA GAG CTT GGC GTG AAG ATC AAA TTC 244 Glu Val Ala Arg Glu Leu Ala Lys Glu Leu Gly Val Lys He Lys Phe 60 65 70
CAC GAA ACT TCA TGG GAT ATC ATG CTG ACA GGT TTG AAA TCG GGG CGT 292 His Glu Thr Ser Trp Asp He Met Leu Thr Gly Leu Lys Ser Gly Arg 75 80 85
TTT GAT ATG GTC GCT AAC CAA GTG AGT TTG GCG ACT AAA AAA CGC CAA 340 Phe Asp Met Val Ala Asn Gin Val Ser Leu Ala Thr Lys Lys Arg Gin 90 95 100 105
GCG GCT TTT GAT AAA AGC TTG CCT TAT AGC TAT TCA GGC ACG ATC ATG 388 Ala Ala Phe Asp Lys Ser Leu Pro Tyr Ser Tyr Ser Gly Thr He Met 110 115 120
CTG GTC AGG AAA GAT GAA AAC CGC ATT AAA GAT ATT AAA GAC ATC AAG 436 Leu Val Arg Lys Asp Glu Asn Arg He Lys Asp He Lys Asp He Lys 125 130 135
GGT TTG AGA GCG GCT AAC ACT TTA AGC TCC ACT TAT GGG GAA ATC GCT 484 Gly Leu Arg Ala Ala Asn Thr Leu Ser Ser Thr Tyr Gly Glu He Ala 140 145 150
TTT AAA TAC GAC GCT CAA ATC GTT TCG GTG GAT TCT ATG GCG CAA GCT 532 Phe Lys Tyr Asp Ala Gin He Val Ser Val Asp Ser Met Ala Gin Ala 155 160 165
TTG TTG CTG GTG GCG CAA AAA CGA GCC GAT TTG ACC TTA AAT AGT TCT 580 Leu Leu Leu Val Ala Gin Lys Arg Ala Asp Leu Thr Leu Asn Ser Ser 170 175 180 185
TTA GCG ATC TTA AAC TAC CTT AAC ACC CAC AAA GAT AAC CCC TTT AAA 628 Leu Ala He Leu Asn Tyr Leu Asn Thr His Lys Asp Asn Pro Phe Lys 190 195 200
ATC GCA TGG GAG TCC AAA GAA AAA GAT GGG GGC GCT TCC TTT GTT ATT 676 He Ala Trp Glu Ser Lys Glu Lys Asp Gly Gly Ala Ser Phe Val He 205 210 215
AAC AAG CAC CAA GAA AAA GCC TTA GAG CTT ATC AAC CAA GCG ATG CAA 724 Asn Lys His Gin Glu Lys Ala Leu Glu Leu He Asn Gin Ala Met Gin 220 225 230
AGA TTG ATC AAC AAA GGG GTT TTA AAA CGC TTA GGC GAA CAA TTT TTT 772 Arg Leu He Asn Lys Gly Val Leu Lys Arg Leu Gly Glu Gin Phe Phe 235 240 245
GGA AAA GAT GTC AGC CAG CCC TAATCTGTCT TTGTTTTTTG AATCTTTAGA TTTG 827 Gly Lys Asp Val Ser Gin Pro 250 255
AGCAAGGAGC GTTTGGAAT 846
(2) INFORMATION FOR SEQ ID NO: 906:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 256 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 906:
Met Lys Lys Val Leu Phe Leu Xaa Val He Ser Phe Phe Gly Gly Phe
1 5 10 15
Leu Asn Ala Ser Ser Leu Tyr Glu Lys Leu He Asn Lys Glu Thr He
20 25 30
Ser Val Gly Thr Glu Gly He Tyr Pro Pro Phe Thr Tyr His Asn Lys
35 40 45
Glu Gly Lys Leu Thr Gly Tyr Asp Val Glu Val Ala Arg Glu Leu Ala
50 55 60
Lys Glu Leu Gly Val Lys He Lys Phe His Glu Thr Ser Trp Asp He 65 70 75 80
Met Leu Thr Gly Leu Lys Ser Gly Arg Phe Asp Met Val Ala Asn Gin 85 90 95
Val Ser Leu Ala Thr Lys Lys Arg Gin Ala Ala Phe Asp Lys Ser Leu
100 105 110
Pro Tyr Ser Tyr Ser Gly Thr He Met Leu Val Arg Lys Asp Glu Asn
115 120 125
Arg He Lys Asp He Lys Asp He Lys Gly Leu Arg Ala Ala Asn Thr
130 135 140
Leu Ser Ser Thr Tyr Gly Glu He Ala Phe Lys Tyr Asp Ala Gin He 145 150 155 160
Val Ser Val Asp Ser Met Ala Gin Ala Leu Leu Leu Val Ala Gin Lys
165 170 175
Arg Ala Asp Leu Thr Leu Asn Ser Ser Leu Ala He Leu Asn Tyr Leu
180 185 190
Asn Thr His Lys Asp Asn Pro Phe Lys He Ala Trp Glu Ser Lys Glu
195 200 205
Lys Asp Gly Gly Ala Ser Phe Val He Asn Lys His Gin Glu Lys Ala
210 215 220
Leu Glu Leu He Asn Gin Ala Met Gin Arg Leu He Asn Lys Gly Val 225 230 235 240
Leu Lys Arg Leu Gly Glu Gin Phe Phe Gly Lys Asp Val Ser Gin Pro 245 250 255
(2) INFORMATION FOR SEQ ID NO: 907:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1423 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 23...1372 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 907:
CCTAGTTTAT TAAGGAGTTT TT ATG GAA ACG ATT GAT TCG GTG GTG CGT TTG 52
Met Glu Thr He Asp Ser Val Val Arg Leu 1 5 10
TTA TCT AAT TTG GTG TGG GGG ATT CCC ATG CAA ATT TTA TTA GTA GGC 100 Leu Ser Asn Leu Val Trp Gly He Pro Met Gin He Leu Leu Val Gly 15 20 25
ACC GGC TTG TTT TTA ACC TTT TAT CTT AGG GGT TTG CAA TTC AGT AAG 148 Thr Gly Leu Phe Leu Thr Phe Tyr Leu Arg Gly Leu Gin Phe Ser Lys 30 35 40
ATT TTT TAT GCG ATC AAA ATC CTT TTT GAC AAA GAG TCC CAA TCT AAG 196 He Phe Tyr Ala He Lys He Leu Phe Asp Lys Glu Ser Gin Ser Lys 45 50 55
GGC GAC ATT TCA CAA TTT TCC GCT CTC ATG CTC TCT TTG GGG GCG ACT 244 Gly Asp He Ser Gin Phe Ser Ala Leu Met Leu Ser Leu Gly Ala Thr 60 65 70
GTA GGC ATT GGG AGT ATC GTA GGC GTA GCG ACC GCT ATT AGC ATC GCA 292 Val Gly He Gly Ser He Val Gly Val Ala Thr Ala He Ser He Ala 75 80 85 90
GGG CCA GGA GCG GTG TTT TGG ATG TGG GTT ACT GGG CTT GTT GGC ATG 340 Gly Pro Gly Ala Val Phe Trp Met Trp Val Thr Gly Leu Val Gly Met 95 100 105
GCG ACT AAG TAT TCT GAG GGG ATT TTA GCG GTG AAA TAC CGG GAA AAA 388 Ala Thr Lys Tyr Ser Glu Gly He Leu Ala Val Lys Tyr Arg Glu Lys 110 115 120
GGG GCG TTT GGA TAC AAC GGA GGG CCC ATG TAT TAC ATC AAA AAC GGT 436 Gly Ala Phe Gly Tyr Asn Gly Gly Pro Met Tyr Tyr He Lys Asn Gly 125 130 135
CTT AAC ATG CCC AAA CTC GCC ATG GCG TTT GCG ATT TTT ACG ATT ATT 484 Leu Asn Met Pro Lys Leu Ala Met Ala Phe Ala He Phe Thr He He 140 145 150
GCA AGC ATT GGC ACC GGT AAC ATG ACG CAA TCT AAT GCG GTT TCT TCC 532 Ala Ser He Gly Thr Gly Asn Met Thr Gin Ser Asn Ala Val Ser Ser 155 160 165 170
ATT TTA AGC GAA CAA GCG AAC CTG CCT AAT TGG GTT TCA GGT TTA TTG 580 He Leu Ser Glu Gin Ala Asn Leu Pro Asn Trp Val Ser Gly Leu Leu 175 180 185
CTC ACT CTT TTA ACC GCT TTC ATT GTC ATA GGG GGG ATC AAA TCC ATT 628 Leu Thr Leu Leu Thr Ala Phe He Val He Gly Gly He Lys Ser He 190 195 200
GGT AAA TTC ACT TCT TAC TTA GCT CCT GTT ATG GTG CTT TTA TAT TTG 676 Gly Lys Phe Thr Ser Tyr Leu Ala Pro Val Met Val Leu Leu Tyr Leu 205 210 215
ATC GCT ATT ATT TAT ATT ATT GTT AGC CAT TTT GAT TTA GCC CTT CAA 724 He Ala He He Tyr He He Val Ser His Phe Asp Leu Ala Leu Gin 220 225 230
GCG ATC AAA CTC ATT TTT GAA GAA GCC TTT AAC CCT AAA CCC GTT GTG 772 Ala He Lys Leu He Phe Glu Glu Ala Phe Asn Pro Lys Pro Val Val 235 240 245 250
GGC GGA GCG AGC GGC GCG TTG ATA GCG ACG ATG ATA AAA ACG GGC GTG 820 Gly Gly Ala Ser Gly Ala Leu He Ala Thr Met He Lys Thr Gly Val 255 260 265
GCT AGG GGG TTG TAT TCT AAT GAA GCG GGG TTA GGG AGC TCA GCC ATT 868 Ala Arg Gly Leu Tyr Ser Asn Glu Ala Gly Leu Gly Ser Ser Ala He 270 275 280
ATT GCC GCG AGC GCT CAA ACA CGC CAC CCG GTG CGC CAA GCC TTA GTG 916 He Ala Ala Ser Ala Gin Thr Arg His Pro Val Arg Gin Ala Leu Val 285 290 295
TCC ATG CTC CAA ACT TTT ATT GTA ACC TTA ATA GTG TGT TCG GCA ACA 964 Ser Met Leu Gin Thr Phe He Val Thr Leu He Val Cys Ser Ala Thr 300 305 310
GCG AGC GTG ATT TTA ATG GCT CCA GAA TAC AAC ACC TTG CTC CCT AAT 1012 Ala Ser Val He Leu Met Ala Pro Glu Tyr Asn Thr Leu Leu Pro Asn 315 320 325 330
GGG GAA AAA TTA AGC GCT AAT TTG CTC ACT CTA AAA AGC ACG GAG TAT 1060 Gly Glu Lys Leu Ser Ala Asn Leu Leu Thr Leu Lys Ser Thr Glu Tyr 335 340 345
TTT CTA GGC TCA TTA GGG ACG GTG GTG ATT TTT ACA ACC ATG ATC TTT 1108 Phe Leu Gly Ser Leu Gly Thr Val Val He Phe Thr Thr Met He Phe 350 355 360
TTT GCC TAC TCT ACG ATC ATT GGT TGG GCT TAT TAT GGG GAA AAA TGC 1156 Phe Ala Tyr Ser Thr He He Gly Trp Ala Tyr Tyr Gly Glu Lys Cys 365 370 375
ACT GAA TAC GCC TTT GGT GAA AAA AAA GTG AAA TAT TAC CGC TTG ATC 1204 Thr Glu Tyr Ala Phe Gly Glu Lys Lys Val Lys Tyr Tyr Arg Leu He 380 385 390
TTT TTA GCG AGT GTG ATG GTG GGG GCT ATG GCC AAA ATT GAT TTT GTG 1252 Phe Leu Ala Ser Val Met Val Gly Ala Met Ala Lys He Asp Phe Val 395 400 405 410
TGG AAT TTA GCG GAT CTT TCT AAC GGG CTT ATG GCT ATC CCT AAT TTA 1300 Trp Asn Leu Ala Asp Leu Ser Asn Gly Leu Met Ala He Pro Asn Leu 415 420 425
ATC GCT TTG ATT TTA TTG CAT AAA GTG GTT TAT TCT GAA ACT CGT TGG 1348 He Ala Leu He Leu Leu His Lys Val Val Tyr Ser Glu Thr Arg Trp 430 435 440
TAT TTT AGC AAG CAT TCT AAC AAG TAAAATGGCA TGTTAAAAAG GGCGAGTTTT 1402 Tyr Phe Ser Lys His Ser Asn Lys 445 450
GTAGAAGTGG ATACCGCTTC T 1423
(2) INFORMATION FOR SEQ ID NO: 908:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 450 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 908:
Met Glu Thr He Asp Ser Val Val Arg Leu Leu Ser Asn Leu Val Trp
1 5 10 15
Gly He Pro Met Gin He Leu Leu Val Gly Thr Gly Leu Phe Leu Thr
20 25 30
Phe Tyr Leu Arg Gly Leu Gin Phe Ser Lys He Phe Tyr Ala He Lys
35 40 45
He Leu Phe Asp Lys Glu Ser Gin Ser Lys Gly Asp He Ser Gin Phe
50 55 60
Ser Ala Leu Met Leu Ser Leu Gly Ala Thr Val Gly He Gly Ser He 65 70 75 80
Val Gly Val Ala Thr Ala He Ser He Ala Gly Pro Gly Ala Val Phe
85 90 95
Trp Met Trp Val Thr Gly Leu Val Gly Met Ala Thr Lys Tyr Ser Glu
100 105 110
Gly He Leu Ala Val Lys Tyr Arg Glu Lys Gly Ala Phe Gly Tyr Asn
115 120 125
Gly Gly Pro Met Tyr Tyr He Lys Asn Gly Leu Asn Met Pro Lys Leu
130 135 140
Ala Met Ala Phe Ala He Phe Thr He He Ala Ser He Gly Thr Gly 145 150 155 160
Asn Met Thr Gin Ser Asn Ala Val Ser Ser He Leu Ser Glu Gin Ala
165 170 175
Asn Leu Pro Asn Trp Val Ser Gly Leu Leu Leu Thr Leu Leu Thr Ala
180 185 190
Phe He Val He Gly Gly He Lys Ser He Gly Lys Phe Thr Ser Tyr
195 200 205
Leu Ala Pro Val Met Val Leu Leu Tyr Leu He Ala He He Tyr He
210 215 220
He Val Ser His Phe Asp Leu Ala Leu Gin Ala He Lys Leu He Phe 225 230 235 240
Glu Glu Ala Phe Asn Pro Lys Pro Val Val Gly Gly Ala Ser Gly Ala
245 250 255
Leu He Ala Thr Met He Lys Thr Gly Val Ala Arg Gly Leu Tyr Ser
260 265 270
Asn Glu Ala Gly Leu Gly Ser Ser Ala He He Ala Ala Ser Ala Gin
275 280 285
Thr Arg His Pro Val Arg Gin Ala Leu Val Ser Met Leu Gin Thr Phe
290 295 300
He Val Thr Leu He Val Cys Ser Ala Thr Ala Ser Val He Leu Met 305 310 315 320
Ala Pro Glu Tyr Asn Thr Leu Leu Pro Asn Gly Glu Lys Leu Ser Ala
325 330 335
Asn Leu Leu Thr Leu Lys Ser Thr Glu Tyr Phe Leu Gly Ser Leu Gly
340 345 350
Thr Val Val He Phe Thr Thr Met He Phe Phe Ala Tyr Ser Thr He
355 360 365
He Gly Trp Ala Tyr Tyr Gly Glu Lys Cys Thr Glu Tyr Ala Phe Gly
370 375 380
Glu Lys Lys Val Lys Tyr Tyr Arg Leu He Phe Leu Ala Ser Val Met 385 390 395 400
Val Gly Ala Met Ala Lys He Asp Phe Val Trp Asn Leu Ala Asp Leu
405 410 415
Ser Asn Gly Leu Met Ala He Pro Asn Leu He Ala Leu He Leu Leu
420 425 430
His Lys Val Val Tyr Ser Glu Thr Arg Trp Tyr Phe Ser Lys His Ser
435 440 445
Asn Lys 450
(2) INFORMATION FOR SEQ ID NO: 909:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 367 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...333 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 909:
TTTAAATTAA GCCCGAAATG GAATTTTAAA GGGGCTTGGT TTTTGAGC ATG AGC TTC 57
Met Ser Phe 1
AGG GTG TCT AAC ACC ACA CCA GGG CAT GAG AGT GGG GCT TTT TTA AAC 105 Arg Val Ser Asn Thr Thr Pro Gly His Glu Ser Gly Ala Phe Leu Asn 5 10 15
GCA GAA ATA AGC CCA GCA TTC CCA AAA GAA GTG CCG TTT GCG CCA TCG 153 Ala Glu He Ser Pro Ala Phe Pro Lys Glu Val Pro Phe Ala Pro Ser 20 25 30 35
TTT TTT TCT ATC ACG CAG ACC TTA TGC CCT AAC TTG TGC ATA GAA TAC 201 Phe Phe Ser He Thr Gin Thr Leu Cys Pro Asn Leu Cys He Glu Tyr 40 45 50
GCA CAA GAA AGC CCT ACA ATC CCA CCG CCT ATG ACC ACG ACC TCT TTT 249 Ala Gin Glu Ser Pro Thr He Pro Pro Pro Met Thr Thr Thr Ser Phe 55 60 65
TTC ATG CTG ATA GTC CCT TTA ATA AAT TAC TTA ATG GCT ATC GCT TCA 297 Phe Met Leu He Val Pro Leu He Asn Tyr Leu Met Ala He Ala Ser 70 75 80
ATT TCT ACT AAA GCG TCT TTA GGC AGT TTA GCC ACT TGAAAGGTCG CTCTGG 349 He Ser Thr Lys Ala Ser Leu Gly Ser Leu Ala Thr 85 90 95
CCGGATAAGG CTCTGTAA 367
(2) INFORMATION FOR SEQ ID NO: 910:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 95 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 910:
Met Ser Phe Arg Val Ser Asn Thr Thr Pro Gly His Glu Ser Gly Ala
1 5 10 15
Phe Leu Asn Ala Glu He Ser Pro Ala Phe Pro Lys Glu Val Pro Phe
20 25 30
Ala Pro Ser Phe Phe Ser He Thr Gin Thr Leu Cys Pro Asn Leu Cys
35 40 45
He Glu Tyr Ala Gin Glu Ser Pro Thr He Pro Pro Pro Met Thr Thr
50 55 60
Thr Ser Phe Phe Met Leu He Val Pro Leu He Asn Tyr Leu Met Ala
65 70 75 80
He Ala Ser He Ser Thr Lys Ala Ser Leu Gly Ser Leu Ala Thr 85 90 95
(2) INFORMATION FOR SEQ ID NO: 911:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 756 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 36...689 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 911:
GCGCACGCTG AATTTAGGGT TTTGAAAGGT TAAAA ATG AAA TTT AAA TTT TTG 53
Met Lys Phe Lys Phe Leu 1 5
AAT ATG GAT AAT GAA AGC GGT TTT ATT TTG ATT GAA AAA GAA TTG AAA 101 Asn Met Asp Asn Glu Ser Gly Phe He Leu He Glu Lys Glu Leu Lys 10 15 20 CGA TTA AAC ATT CTC GCT CAA GTC AAA GAA GAT TGC ATT GAA TTA AAA 149 Arg Leu Asn He Leu Ala Gin Val Lys Glu Asp Cys He Glu Leu Lys 25 30 35
GGC GAA AAC ACA GAA CAA GCG AGA ATT TAT CTT AAA ACG CTT TTT AAC 197 Gly Glu Asn Thr Glu Gin Ala Arg He Tyr Leu Lys Thr Leu Phe Asn 40 45 50
TCC AAT ATT GTA GAA TTA GAC GAT CAT CAA AAA AGT GCA AAC GCT TTA 245 Ser Asn He Val Glu Leu Asp Asp His Gin Lys Ser Ala Asn Ala Leu 55 60 65 70
ATA GAG CGC TTG AAA TCT TTA GAT TTA AAA ATT GCG GTG GCT GAA AGC 293 He Glu Arg Leu Lys Ser Leu Asp Leu Lys He Ala Val Ala Glu Ser 75 80 85
TGC TCT GGG GGG CTA TTA TCG CAT GCA TTC ACT TCC ATT AGC GGG GCT 341 Cys Ser Gly Gly Leu Leu Ser His Ala Phe Thr Ser He Ser Gly Ala 90 95 100
TCA GCG GTT TTT ATG GGG GGT ATT GTG TGC TAC AAT GAA GAG GTT AAG 389 Ser Ala Val Phe Met Gly Gly He Val Cys Tyr Asn Glu Glu Val Lys 105 110 115
CGC GAA TTA TTG AAG GTC AAT GCC ACG ACT TTA AAA GTC TTT GGG GTT 437 Arg Glu Leu Leu Lys Val Asn Ala Thr Thr Leu Lys Val Phe Gly Val 120 125 130
TAT AGC GAA GAA TGC GTG AAA GAA ATG CTA CTA GGC GTG TTT TTG AAT 485 Tyr Ser Glu Glu Cys Val Lys Glu Met Leu Leu Gly Val Phe Leu Asn 135 140 145 150
TTT AAA GTG GAT TTA GCG CTT GCG ATG AGT GGG GTG GCT GGC CCT AAT 533 Phe Lys Val Asp Leu Ala Leu Ala Met Ser Gly Val Ala Gly Pro Asn 155 160 165
GGG GGG AAC AAG GCT AAT CCT GTA GGC ACG ATT TAC ATT GGC GCG CAA 581 Gly Gly Asn Lys Ala Asn Pro Val Gly Thr He Tyr He Gly Ala Gin 170 175 180
AAG TTA GGA TCT CAA GCT TTA ATC GAT CGC TGT TTT TTT GAA GGG AAC 629 Lys Leu Gly Ser Gin Ala Leu He Asp Arg Cys Phe Phe Glu Gly Asn 185 190 195
AGA GAA AGC ATT CAA AAT AAA AGC GTA GAG CAT GCC TTA AAC ATG CTC 677 Arg Glu Ser He Gin Asn Lys Ser Val Glu His Ala Leu Asn Met Leu 200 205 210
GCT AGA ATG CTA TAAAACTACC TTAACGCACA AACGCTACCA AATTCTTTTT GAGCG 734
Ala Arg Met Leu
215
ACCTTAGCGA TGTAAGCGAT TT 756
(2) INFORMATION FOR SEQ ID NO: 912: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 218 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 912:
Met Lys Phe Lys Phe Leu Asn Met Asp Asn Glu Ser Gly Phe He Leu
1 5 10 15
He Glu Lys Glu Leu Lys Arg Leu Asn He Leu Ala Gin Val Lys Glu
20 25 30
Asp Cys He Glu Leu Lys Gly Glu Asn Thr Glu Gin Ala Arg He Tyr
35 40 45
Leu Lys Thr Leu Phe Asn Ser Asn He Val Glu Leu Asp Asp His Gin
50 55 60
Lys Ser Ala Asn Ala Leu He Glu Arg Leu Lys Ser Leu Asp Leu Lys 65 70 75 80
He Ala Val Ala Glu Ser Cys Ser Gly Gly Leu Leu Ser His Ala Phe
85 90 95
Thr Ser He Ser Gly Ala Ser Ala Val Phe Met Gly Gly He Val Cys
100 105 110
Tyr Asn Glu Glu Val Lys Arg Glu Leu Leu Lys Val Asn Ala Thr Thr
115 120 125
Leu Lys Val Phe Gly Val Tyr Ser Glu Glu Cys Val Lys Glu Met Leu
130 135 140
Leu Gly Val Phe Leu Asn Phe Lys Val Asp Leu Ala Leu Ala Met Ser 145 150 155 160
Gly Val Ala Gly Pro Asn Gly Gly Asn Lys Ala Asn Pro Val Gly Thr
165 170 175
He Tyr He Gly Ala Gin Lys Leu Gly Ser Gin Ala Leu He Asp Arg
180 185 190
Cys Phe Phe Glu Gly Asn Arg Glu Ser He Gin Asn Lys Ser Val Glu
195 200 205
His Ala Leu Asn Met Leu Ala Arg Met Leu 210 215
(2) INFORMATION FOR SEQ ID NO:913:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 681 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 28...657 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 913:
AAATTCTAAA AAAATAAAGG AAAATCA ATG AAA TTT TTG GAT CAA GAA AAA AGA 54
Met Lys Phe Leu Asp Gin Glu Lys Arg 1 5
AGA CAA TTA TTA AAC GAG CGC CAT TCT TGC AAG ATG TTT GAT AGC CAT 102 Arg Gin Leu Leu Asn Glu Arg His Ser Cys Lys Met Phe Asp Ser His 10 15 20 25
TAT GAG TTT TCT AGC ACA GAA TTA GAA GAA ATC GCT GAA ATC GCC AGG 150 Tyr Glu Phe Ser Ser Thr Glu Leu Glu Glu He Ala Glu He Ala Arg 30 35 40
CTA TCG CCA AGC TCT TAC AAC ACG CAG CCA TGG CAT TTT GTG ATG GTT 198 Leu Ser Pro Ser Ser Tyr Asn Thr Gin Pro Trp His Phe Val Met Val 45 50 55
ACT GAT AAG GAT TTA AAA AAA CAA ATT GCA GCG CAC AGC TAT TTC AAT 246 Thr Asp Lys Asp Leu Lys Lys Gin He Ala Ala His Ser Tyr Phe Asn 60 65 70
GAA GAG ATG ATT AAA AGC GCT TCA GCG TTA ATG GTG GTA TGC TCT TTA 294 Glu Glu Met He Lys Ser Ala Ser Ala Leu Met Val Val Cys Ser Leu 75 80 85
AGA CCC AGC GAG TTG TTA CCA CAC GGC CAC TAC ATG CAA AAT CTC TAT 342 Arg Pro Ser Glu Leu Leu Pro His Gly His Tyr Met Gin Asn Leu Tyr 90 95 100 105
CCG GAG TCT TAT AAA GTT AGA GTG ATC CCC TCT TTT GCT CAA ATG CTT 390 Pro Glu Ser Tyr Lys Val Arg Val He Pro Ser Phe Ala Gin Met Leu 110 115 120
GGC GTG AGA TTC AAC CAC AGC ATG CAA AGA TTA GAA AGC TAT ATT TTA 438 Gly Val Arg Phe Asn His Ser Met Gin Arg Leu Glu Ser Tyr He Leu 125 130 135
GAG CAA TGC TAT ATC GCT GTG GGG CAA ATT TGC ATG GGC GTG AGC TTA 486 Glu Gin Cys Tyr He Ala Val Gly Gin He Cys Met Gly Val Ser Leu 140 145 150
ATG GGA TTG GAT AGT TGC ATT ATT GGA GGC TTT GAT CCT TTA AAG GTG 534 Met Gly Leu Asp Ser Cys He He Gly Gly Phe Asp Pro Leu Lys Val 155 160 165
GGC GAA GTT TTA GAA GAG CGT ATC AAT AAG CCT AAA ATC GCA TGC TTG 582 Gly Glu Val Leu Glu Glu Arg He Asn Lys Pro Lys He Ala Cys Leu 170 175 180 185
ATC GCT TTG GGC AAG AGG GTG GCA GAA GCG AGT CAA AAA TCA AGA AAA 630 He Ala Leu Gly Lys Arg Val Ala Glu Ala Ser Gin Lys Ser Arg Lys 190 195 200
TCA AAA GTT GAT GCG ATT ACT TGG TTG TGATTAAACA AAATCAAAAA CTTT 681 Ser Lys Val Asp Ala He Thr Trp Leu 205 210
(2) INFORMATION FOR SEQ ID NO: 914:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 210 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 914:
Met Lys Phe Leu Asp Gin Glu Lys Arg Arg Gin Leu Leu Asn Glu Arg
1 5 10 15
His Ser Cys Lys Met Phe Asp Ser His Tyr Glu Phe Ser Ser Thr Glu
20 25 30
Leu Glu Glu He Ala Glu He Ala Arg Leu Ser Pro Ser Ser Tyr Asn
35 40 45
Thr Gin Pro Trp His Phe Val Met Val Thr Asp Lys Asp Leu Lys Lys
50 55 60
Gin He Ala Ala His Ser Tyr Phe Asn Glu Glu Met He Lys Ser Ala 65 70 75 80
Ser Ala Leu Met Val Val Cys Ser Leu Arg Pro Ser Glu Leu Leu Pro
85 90 95
His Gly His Tyr Met Gin Asn Leu Tyr Pro Glu Ser Tyr Lys Val Arg
100 105 110
Val He Pro Ser Phe Ala Gin Met Leu Gly Val Arg Phe Asn His Ser
115 120 125
Met Gin Arg Leu Glu Ser Tyr He Leu Glu Gin Cys Tyr He Ala Val
130 135 140
Gly Gin He Cys Met Gly Val Ser Leu Met Gly Leu Asp Ser Cys He 145 150 155 160
He Gly Gly Phe Asp Pro Leu Lys Val Gly Glu Val Leu Glu Glu Arg
165 170 175
He Asn Lys Pro Lys He Ala Cys Leu He Ala Leu Gly Lys Arg Val
180 185 190
Ala Glu Ala Ser Gin Lys Ser Arg Lys Ser Lys Val Asp Ala He Thr
195 200 205
Trp Leu 210
(2) INFORMATION FOR SEQ ID NO: 915:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1490 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 99...1439 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 915 :
CTTAAAGAAA ACATGCAAAA TTTGCAAAAT CAAGTTCAAA ACAAAGAGCA ATCCATCGCT 60
CAATTAGATG CACAAATCCA AGCTTTAAAG GGGATTCA ATG AGC GTT AAT TTT TTT 116
Met Ser Val Asn Phe Phe 1 5
AAG GGC ATT TTT AAT GAC AAT AGC AGG GCT GAA AAC CAC CAA GAC AAC 164 Lys Gly He Phe Asn Asp Asn Ser Arg Ala Glu Asn His Gin Asp Asn 10 15 20
CAC CAA AAC AAC CAT CAA GTG GGC TTA AAA GAG CGT TAC GAT TTG ATC 212 His Gin Asn Asn His Gin Val Gly Leu Lys Glu Arg Tyr Asp Leu He 25 30 35
GCT CGT ATT TTA AAC GCC AGA ATT GAA AAT GAA GGG CTA GAA GAA TAT 260 Ala Arg He Leu Asn Ala Arg He Glu Asn Glu Gly Leu Glu Glu Tyr 40 45 50
CAG AGC GTC TTG GAT AAC GAG TTT TTA GAG TTC GCT AGC GGC GTG GAT 308 Gin Ser Val Leu Asp Asn Glu Phe Leu Glu Phe Ala Ser Gly Val Asp 55 60 65 70
TCG CTC AAA GAA AAG GAA ATA GCG TTA CTG ACG CTC CAA GAA ATC CAA 356 Ser Leu Lys Glu Lys Glu He Ala Leu Leu Thr Leu Gin Glu He Gin 75 80 85
AAA GAA TTG CAA TTG GTA GCG AGC TAC CCT AGT TTG TTC CAA AAA ACC 404 Lys Glu Leu Gin Leu Val Ala Ser Tyr Pro Ser Leu Phe Gin Lys Thr 90 95 100
ATC GTT GCG GTG GGG GGA GGG TTT AGC GCG GGC AAA TCC ACT TTT TTA 452 He Val Ala Val Gly Gly Gly Phe Ser Ala Gly Lys Ser Thr Phe Leu 105 110 115
AAC AAC TTG TTG GGC TTG AAA TTA AAA CTC CCT GAA GAC ATG AAT CCC 500 Asn Asn Leu Leu Gly Leu Lys Leu Lys Leu Pro Glu Asp Met Asn Pro 120 125 130
ACC ACA GCT ATC CCC ACT TAT TGC TTA AAG GGT AAA AGA GAA GTT TTA 548 Thr Thr Ala He Pro Thr Tyr Cys Leu Lys Gly Lys Arg Glu Val Leu 135 140 145 150
ATG GGG TTT TCT CAA AAT GGG GGC ATG GTG GAA TTG CCA CAT CTC GCT 596 Met Gly Phe Ser Gin Asn Gly Gly Met Val Glu Leu Pro His Leu Ala 155 160 165
TTT GAC CAT CAG TTT TTA AAC TCC CTT GGC TTT AAT TTG AAA GAG ATC 644 Phe Asp His Gin Phe Leu Asn Ser Leu Gly Phe Asn Leu Lys Glu He 170 175 180
ATG CCT TTC ATG CTT TTA AGC GCT CCT AGC GTG CCT TTT GAA TTT TTA 692 Met Pro Phe Met Leu Leu Ser Ala Pro Ser Val Pro Phe Glu Phe Leu 185 190 195
TGC TTC ATA GAC ACG CCT GGT TTT AAC TCC GCC AAG CAA GGC TAT ACG 740 Cys Phe He Asp Thr Pro Gly Phe Asn Ser Ala Lys Gin Gly Tyr Thr 200 205 210
GGT GGG GAT AAA GAA GCC TCT AAA GAA TCC CTA AAA CAC GCC AAA CAC 788 Gly Gly Asp Lys Glu Ala Ser Lys Glu Ser Leu Lys His Ala Lys His 215 220 225 230
ATT CTG TGG CTC ATT AGT TGC GAG AGT GGG GAG ATT CAC GAA GAT GAT 836 He Leu Trp Leu He Ser Cys Glu Ser Gly Glu He His Glu Asp Asp 235 240 245
TTA GAA TAT TTG CAA GAA TTA TAC GAA GAA GGC AAG CAG GTT TTT ATC 884 Leu Glu Tyr Leu Gin Glu Leu Tyr Glu Glu Gly Lys Gin Val Phe He 250 255 260
GTA TTG AGT AGG GCT GAT AGG CGC ACA AAA AGG CAA TTA GAA GAA GTC 932 Val Leu Ser Arg Ala Asp Arg Arg Thr Lys Arg Gin Leu Glu Glu Val 265 270 275
GTT ATT AAA ATT AAA GAG ACT TTA AAA GAT AAT GGC ATT GAA TTT TTA 980 Val He Lys He Lys Glu Thr Leu Lys Asp Asn Gly He Glu Phe Leu 280 285 290
GGG ATT GGT GCT TAT AGT TCT ACA AGG TAT CAA GAA TAT AAA GAA TTC 1028 Gly He Gly Ala Tyr Ser Ser Thr Arg Tyr Gin Glu Tyr Lys Glu Phe 295 300 305 310
AGC GAA AAA AGC AAA GTT TTT AAC TCG CTT GAG GAA TTT CTA ATG AAG 1076 Ser Glu Lys Ser Lys Val Phe Asn Ser Leu Glu Glu Phe Leu Met Lys 315 320 325
TTA AAT CAA AGG AGC GAG AAA CAA AAC GAA ATT TTA GGA TAT TTA TAC 1124 Leu Asn Gin Arg Ser Glu Lys Gin Asn Glu He Leu Gly Tyr Leu Tyr 330 335 340
GAG GTG CAT TCC ATG TAT GAA AAG GCT ATT GAG CAA GAC GCT AAC CAA 1172 Glu Val His Ser Met Tyr Glu Lys Ala He Glu Gin Asp Ala Asn Gin 345 350 355
TTC AAA CGC TAC CAA AGC GAA TTG CAT TCT GTT AGA TTG GAT TTG ATG 1220 Phe Lys Arg Tyr Gin Ser Glu Leu His Ser Val Arg Leu Asp Leu Met 360 365 370
CAA AAA GGC TTT GAT GAT TTT AGC GAT AAA ATT TTT AGA AGA ATT GAG 1268 Gin Lys Gly Phe Asp Asp Phe Ser Asp Lys He Phe Arg Arg He Glu 375 380 385 390
AAT TTA GAA AAA GAA TTT TCC GAG CAA GAG CGA TCC AAA AGA GAG AGT 1316 Asn Leu Glu Lys Glu Phe Ser Glu Gin Glu Arg Ser Lys Arg Glu Ser 395 400 405
TTA GCG CGA TTG AAT GAA GTG ATT GAC TTG TTT AAA GAA GGT ATT GAT 1364 Leu Ala Arg Leu Asn Glu Val He Asp Leu Phe Lys Glu Gly He Asp 410 415 420
AAG GTT TTT GAT CGC GTG AGC GCT TTC ACT TGG GAA AAA TAC AAA GAA 1412 Lys Val Phe Asp Arg Val Ser Ala Phe Thr Trp Glu Lys Tyr Lys Glu 425 430 435
CAA AAT GAC GAT GAA GAG GAC GAT GAT TGAAGAAAAC TACAAAGAAG AGCGTTA 1466 Gin Asn Asp Asp Glu Glu Asp Asp Asp 440 445
CACCGAAAGG GTGAATCAAG GCGG 1490
(2) INFORMATION FOR SEQ ID NO : 916-
(l) SEQUENCE CHARACTERISTICS-
(A) LENGTH: 447 ammo acids
(B) TYPE: ammo acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(n) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION. SEQ ID NO : 916-
Met Ser Val Asn Phe Phe Lys Gly He Phe Asn Asp Asn Ser Arg Ala
1 5 10 15
Glu Asn His Gin Asp Asn His Gin Asn Asn His Gin Val Gly Leu Lys
20 25 30
Glu Arg Tyr Asp Leu He Ala Arg He Leu Asn Ala Arg He Glu Asn
35 40 45
Glu Gly Leu Glu Glu Tyr Gin Ser Val Leu Asp Asn Glu Phe Leu Glu
50 55 60
Phe Ala Ser Gly Val Asp Ser Leu Lys Glu Lys Glu He Ala Leu Leu 65 70 75 80
Thr Leu Gin Glu He Gin Lys Glu Leu Gin Leu Val Ala Ser Tyr Pro
85 90 95
Ser Leu Phe Gin Lys Thr He Val Ala Val Gly Gly Gly Phe Ser Ala
100 105 110
Gly Lys Ser Thr Phe Leu Asn Asn Leu Leu Gly Leu Lys Leu Lys Leu
115 120 125
Pro Glu Asp Met Asn Pro Thr Thr Ala He Pro Thr Tyr Cys Leu Lys
130 135 140
Gly Lys Arg Glu Val Leu Met Gly Phe Ser Gin Asn Gly Gly Met Val 145 150 155 160
Glu Leu Pro His Leu Ala Phe Asp His Gin Phe Leu Asn Ser Leu Gly
165 170 175
Phe Asn Leu Lys Glu He Met Pro Phe Met Leu Leu Ser Ala Pro Ser
180 185 190
Val Pro Phe Glu Phe Leu Cys Phe He Asp Thr Pro Gly Phe Asn Ser 195 200 205 Ala Lys Gin Gly Tyr Thr Gly Gly Asp Lys Glu Ala Ser Lys Glu Ser
210 215 220
Leu Lys His Ala Lys His He Leu Trp Leu He Ser Cys Glu Ser Gly 225 230 235 240
Glu He His Glu Asp Asp Leu Glu Tyr Leu Gin Glu Leu Tyr Glu Glu
245 250 255
Gly Lys Gin Val Phe He Val Leu Ser Arg Ala Asp Arg Arg Thr Lys
260 265 270
Arg Gin Leu Glu Glu Val Val He Lys He Lys Glu Thr Leu Lys Asp
275 280 285
Asn Gly He Glu Phe Leu Gly He Gly Ala Tyr Ser Ser Thr Arg Tyr
290 295 300
Gin Glu Tyr Lys Glu Phe Ser Glu Lys Ser Lys Val Phe Asn Ser Leu 305 310 315 320
Glu Glu Phe Leu Met Lys Leu Asn Gin Arg Ser Glu Lys Gin Asn Glu
325 330 335
He Leu Gly Tyr Leu Tyr Glu Val His Ser Met Tyr Glu Lys Ala He
340 345 350
Glu Gin Asp Ala Asn Gin Phe Lys Arg Tyr Gin Ser Glu Leu His Ser
355 360 365
Val Arg Leu Asp Leu Met Gin Lys Gly Phe Asp Asp Phe Ser Asp Lys
370 375 380
He Phe Arg Arg He Glu Asn Leu Glu Lys Glu Phe Ser Glu Gin Glu 385 390 395 400
Arg Ser Lys Arg Glu Ser Leu Ala Arg Leu Asn Glu Val He Asp Leu
405 410 415
Phe Lys Glu Gly He Asp Lys Val Phe Asp Arg Val Ser Ala Phe Thr
420 425 430
Trp Glu Lys Tyr Lys Glu Gin Asn Asp Asp Glu Glu Asp Asp Asp 435 440 445
(2) INFORMATION FOR SEQ ID NO: 917:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1718 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 28...1674 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 917:
ATTTTAGTTT TTAATTTTAA AGGATTG ATG ATG GTT TTA CGC ACA CAG ACA AAT 54
Met Met Val Leu Arg Thr Gin Thr Asn 1 5
TTT GTG GAG TTT TTA GAA CAG GTT TTA GAA GTT TTA AAA GAA GTG GAG 102 Phe Val Glu Phe Leu Glu Gin Val Leu Glu Val Leu Lys Glu Val Glu 10 15 20 25
ATC GAT AAA ACA GAA TGC TCC ACG CTT TTA GCA AGC GTT CAA AAA CAA 150 He Asp Lys Thr Glu Cys Ser Thr Leu Leu Ala Ser Val Gin Lys Gin 30 35 40
CAG CTA GTG ATA CCC GTT GTG GGG AAT TTT AGC GCA GGG AAA AGC ACG 198 Gin Leu Val He Pro Val Val Gly Asn Phe Ser Ala Gly Lys Ser Thr 45 50 55
CTA TTA AAC CGC TTT TTA GGC AGC AGC GTT TTG CCT ACC GGT ATC ACG 246 Leu Leu Asn Arg Phe Leu Gly Ser Ser Val Leu Pro Thr Gly He Thr 60 65 70
CCA GAG ACT TCT TTA GCC ACT GAG TTG CAC TAT AGC GCT AAG GAA CGC 294 Pro Glu Thr Ser Leu Ala Thr Glu Leu His Tyr Ser Ala Lys Glu Arg 75 80 85
ATA GAG GCT TTT TCA AAC AAT GAT GAA AAA ACA GAG AGT TTT GAA CTG 342 He Glu Ala Phe Ser Asn Asn Asp Glu Lys Thr Glu Ser Phe Glu Leu 90 95 100 105
AAT GAG CAA AGT TTT GAA GCG ATT AAA GAG AAT GCC ACG AAG TAT TCC 390 Asn Glu Gin Ser Phe Glu Ala He Lys Glu Asn Ala Thr Lys Tyr Ser 110 115 120
TAC CTT AAG GTT TAT TTG AAT AAT GAA GCT TTG AAA AAC AGC GCT CCT 438 Tyr Leu Lys Val Tyr Leu Asn Asn Glu Ala Leu Lys Asn Ser Ala Pro 125 130 135
TTA GTG TTT GTG GAT ATG CCA GGC TTT GAT AGC CCC ATT TCA AGC CAC 486 Leu Val Phe Val Asp Met Pro Gly Phe Asp Ser Pro He Ser Ser His 140 145 150
ACC CAT GCC ATT TTG GAA TAT TTA GAA AGG GGC GTG CAT TTT GTC ATT 534 Thr His Ala He Leu Glu Tyr Leu Glu Arg Gly Val His Phe Val He 155 160 165
CTC ACA AGC GTA GAA GAG GGC AAT CTC ACT AAA CGC ATG GTT AGG GAG 582 Leu Thr Ser Val Glu Glu Gly Asn Leu Thr Lys Arg Met Val Arg Glu 170 175 180 185
TTA AAA AAC CTT TTA GAG TTT GAC AAA GGC CTT AGC TTT ATT TTG AGT 630 Leu Lys Asn Leu Leu Glu Phe Asp Lys Gly Leu Ser Phe He Leu Ser 190 195 200
AAA ACG AAT TTA AGA ACG CCT TCG CAA GTG GGA GAA ATC TCT CAC TAC 678 Lys Thr Asn Leu Arg Thr Pro Ser Gin Val Gly Glu He Ser His Tyr 205 210 215
ATT CAA GAT CAA ATC CAG GAT CAC CTT GAT TTG ACA ACG CAC CTC ATC 726 He Gin Asp Gin He Gin Asp His Leu Asp Leu Thr Thr His Leu He 220 225 230 CAT TCC AAT AAA GAC AAT AAC GCC CTT TTA GAG GTA GCG GAT AAA ATA 774 His Ser Asn Lys Asp Asn Asn Ala Leu Leu Glu Val Ala Asp Lys He 235 240 245
GAC GCT GAA AAG CTT TTT AGC GCT TTG TAT TTG AAA CGA TTG AAG TTT 822 Asp Ala Glu Lys Leu Phe Ser Ala Leu Tyr Leu Lys Arg Leu Lys Phe 250 255 260 265
TTA AAT TCT AAG TTA CAA AAT AGC CTA AAA AGC GTG ATG GAA AGC TTT 870 Leu Asn Ser Lys Leu Gin Asn Ser Leu Lys Ser Val Met Glu Ser Phe 270 275 280
GAT TAT TCT AAA GAA AAG GCT TTA GAA GAA ATA CAA GCT TTG GAT TTG 918 Asp Tyr Ser Lys Glu Lys Ala Leu Glu Glu He Gin Ala Leu Asp Leu 285 290 295
GGC GTT AAA GAC ATT GAA AAA ACC TAT GAA AAA TTA AGG GCT AAT TTA 966 Gly Val Lys Asp He Glu Lys Thr Tyr Glu Lys Leu Arg Ala Asn Leu 300 305 310
GAA GAA GAA TAT TCT AGC GTG GCT GTG GGA TCG GTG GTT AAA AAA GTA 1014 Glu Glu Glu Tyr Ser Ser Val Ala Val Gly Ser Val Val Lys Lys Val 315 320 325
GTA GAA GAG GTT AGG GAT CAA AAA TCC TAT TTA GCC TCT TTA ATC AAC 1062 Val Glu Glu Val Arg Asp Gin Lys Ser Tyr Leu Ala Ser Leu He Asn 330 335 340 345
AAG CCT AAC GAG TTC AAT AGC GAA ATA GAA AGC ATC ATG CAA CAA AGC 1110 Lys Pro Asn Glu Phe Asn Ser Glu He Glu Ser He Met Gin Gin Ser 350 355 360
TTG ATC AAA AAC GCT AAA TTA GAG ATT GAA AAG ATC AAC CTT TCT TTT 1158 Leu He Lys Asn Ala Lys Leu Glu He Glu Lys He Asn Leu Ser Phe 365 370 375
TCA AAA GAT TTC CAT GCG GAA TTT GAA AGC CTG AAC AAG CTT TCT AGC 1206 Ser Lys Asp Phe His Ala Glu Phe Glu Ser Leu Asn Lys Leu Ser Ser 380 385 390
GAT CTG TCT GTG AAT TTA GAG CAT GGG ATT GAA TTA GGG ATC AAC GCT 1254 Asp Leu Ser Val Asn Leu Glu His Gly He Glu Leu Gly He Asn Ala 395 400 405
TTA AGC GTG ATT TTT TCC AAG AAT CCG GTT ACA AGG CCA TTC GCG CTG 1302 Leu Ser Val He Phe Ser Lys Asn Pro Val Thr Arg Pro Phe Ala Leu 410 415 420 425
ATT TTG CAA GGG TTA AAA TCT CTT TTA AAA GAT TTA CTG ACA TTG TTG 1350 He Leu Gin Gly Leu Lys Ser Leu Leu Lys Asp Leu Leu Thr Leu Leu 430 435 440
CCT AAT ATC ATC GCT TCA TTC TTT AGG AAT GAA GAA AAA GAG CGG GCG 1398 Pro Asn He He Ala Ser Phe Phe Arg Asn Glu Glu Lys Glu Arg Ala 445 450 455 AAA TTA GAA AAT CTG ATT GAA GTC AGA GTG ATT CCA GAA ATC CAA TAC 1446 Lys Leu Glu Asn Leu He Glu Val Arg Val He Pro Glu He Gin Tyr 460 465 470
AAG CTT AAA AAA GTT TTA CCG GGA TTG TTT AAT GAA GCT TTG CAA AAT 1494 Lys Leu Lys Lys Val Leu Pro Gly Leu Phe Asn Glu Ala Leu Gin Asn 475 480 485
TCC CTA AAA TCT CTA AAA GAT CGG TGC GAG CTA GAA ATC ACG CAT AAA 1542 Ser Leu Lys Ser Leu Lys Asp Arg Cys Glu Leu Glu He Thr His Lys 490 495 500 505
AAA CAA GAA ATC GCG CTC GCT CAA AAG GAA AAA GAA AAA CAC CTA AAC 1590 Lys Gin Glu He Ala Leu Ala Gin Lys Glu Lys Glu Lys His Leu Asn 510 515 520
GAT TTA GAA GAT CAA AAA CAA ATC TTA GAA AAT AAG ATC AAC GCT TTA 1638 Asp Leu Glu Asp Gin Lys Gin He Leu Glu Asn Lys He Asn Ala Leu 525 530 535
AGC GAT TTA GAA CAA CAA TAT TTA AAG GAT CAA CAA TGAACGAGCA AGAACT 1690 Ser Asp Leu Glu Gin Gin Tyr Leu Lys Asp Gin Gin 540 545
CATTCAAAAA AGCGCTTTAA TTGAAAAA 1718
(2) INFORMATION FOR SEQ ID NO: 918:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 549 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 918:
Met Met Val Leu Arg Thr Gin Thr Asn Phe Val Glu Phe Leu Glu Gin
1 5 10 15
Val Leu Glu Val Leu Lys Glu Val Glu He Asp Lys Thr Glu Cys Ser
20 25 30
Thr Leu Leu Ala Ser Val Gin Lys Gin Gin Leu Val He Pro Val Val
35 40 45
Gly Asn Phe Ser Ala Gly Lys Ser Thr Leu Leu Asn Arg Phe Leu Gly
50 55 60
Ser Ser Val Leu Pro Thr Gly He Thr Pro Glu Thr Ser Leu Ala Thr 65 70 75 80
Glu Leu His Tyr Ser Ala Lys Glu Arg He Glu Ala Phe Ser Asn Asn
85 90 95
Asp Glu Lys Thr Glu Ser Phe Glu Leu Asn Glu Gin Ser Phe Glu Ala
100 105 110
He Lys Glu Asn Ala Thr Lys Tyr Ser Tyr Leu Lys Val Tyr Leu Asn
115 120 125
Asn Glu Ala Leu Lys Asn Ser Ala Pro Leu Val Phe Val Asp Met Pro 130 135 140
Gly Phe Asp Ser Pro He Ser Ser His Thr His Ala He Leu Glu Tyr 145 150 155 160
Leu Glu Arg Gly Val His Phe Val He Leu Thr Ser Val Glu Glu Gly
165 170 175
Asn Leu Thr Lys Arg Met Val Arg Glu Leu Lys Asn Leu Leu Glu Phe
180 185 190
Asp Lys Gly Leu Ser Phe He Leu Ser Lys Thr Asn Leu Arg Thr Pro
195 200 205
Ser Gin Val Gly Glu He Ser His Tyr He Gin Asp Gin He Gin Asp
210 215 220
His Leu Asp Leu Thr Thr His Leu He His Ser Asn Lys Asp Asn Asn 225 230 235 240
Ala Leu Leu Glu Val Ala Asp Lys He Asp Ala Glu Lys Leu Phe Ser
245 250 255
Ala Leu Tyr Leu Lys Arg Leu Lys Phe Leu Asn Ser Lys Leu Gin Asn
260 265 270
Ser Leu Lys Ser Val Met Glu Ser Phe Asp Tyr Ser Lys Glu Lys Ala
275 280 285
Leu Glu Glu He Gin Ala Leu Asp Leu Gly Val Lys Asp He Glu Lys
290 295 300
Thr Tyr Glu Lys Leu Arg Ala Asn Leu Glu Glu Glu Tyr Ser Ser Val 305 310 315 320
Ala Val Gly Ser Val Val Lys Lys Val Val Glu Glu Val Arg Asp Gin
325 330 335
Lys Ser Tyr Leu Ala Ser Leu He Asn Lys Pro Asn Glu Phe Asn Ser
340 345 350
Glu He Glu Ser He Met Gin Gin Ser Leu He Lys Asn Ala Lys Leu
355 360 365
Glu He Glu Lys He Asn Leu Ser Phe Ser Lys Asp Phe His Ala Glu
370 375 380
Phe Glu Ser Leu Asn Lys Leu Ser Ser Asp Leu Ser Val Asn Leu Glu 385 . 390 395 400
His Gly He Glu Leu Gly He Asn Ala Leu Ser Val He Phe Ser Lys
405 410 415
Asn Pro Val Thr Arg Pro Phe Ala Leu He Leu Gin Gly Leu Lys Ser
420 425 430
Leu Leu Lys Asp Leu Leu Thr Leu Leu Pro Asn He He Ala Ser Phe
435 440 445
Phe Arg Asn Glu Glu Lys Glu Arg Ala Lys Leu Glu Asn Leu He Glu
450 455 460
Val Arg Val He Pro Glu He Gin Tyr Lys Leu Lys Lys Val Leu Pro 465 470 475 480
Gly Leu Phe Asn Glu Ala Leu Gin Asn Ser Leu Lys Ser Leu Lys Asp
485 490 495
Arg Cys Glu Leu Glu He Thr His Lys Lys Gin Glu He Ala Leu Ala
500 505 510
Gin Lys Glu Lys Glu Lys His Leu Asn Asp Leu Glu Asp Gin Lys Gin
515 520 525
He Leu Glu Asn Lys He Asn Ala Leu Ser Asp Leu Glu Gin Gin Tyr
530 535 540
Leu Lys Asp Gin Gin 545
(2) INFORMATION FOR SEQ ID NO: 919: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 360 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...348 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 919:
TCTTTTTCTT CATTCCTAAA GAATGAAGCG ATG ATA TTA GGC AAC AAT GTC AGT 54
Met He Leu Gly Asn Asn Val Ser 1 5
AAA TCT TTT AAA AGA GAT TTT AAC CCT TGC AAA ATC AGC GCG AAT GGC 102 Lys Ser Phe Lys Arg Asp Phe Asn Pro Cys Lys He Ser Ala Asn Gly 10 15 20
CTT GTA ACC GGA TTC TTG GAA AAA ATC ACG CTT AAA GCG TTG ATC CCT 150 Leu Val Thr Gly Phe Leu Glu Lys He Thr Leu Lys Ala Leu He Pro 25 30 35 40
AAT TCA ATC CCA TGC TCT AAA TTC ACA GAC AGA TCG CTA GAA AGC TTG 198 Asn Ser He Pro Cys Ser Lys Phe Thr Asp Arg Ser Leu Glu Ser Leu 45 50 55
TTC AGG CTT TCA AAT TCC GCA TGG AAA TCT TTT GAA AAA GAA AGG TTG 246 Phe Arg Leu Ser Asn Ser Ala Trp Lys Ser Phe Glu Lys Glu Arg Leu 60 65 70
ATC TTT TCA ATC TCT AAT TTA GCG TTT TTG ATC AAG CTT TGT TGC ATG 294 He Phe Ser He Ser Asn Leu Ala Phe Leu He Lys Leu Cys Cys Met 75 80 85
ATG CTT TCT ATT TCG CTA TTG AAC TCG TTA GGC TTG TTG ATT AAA GAG 342 Met Leu Ser He Ser Leu Leu Asn Ser Leu Gly Leu Leu He Lys Glu 90 95 100
GCT AAA TAGGATTTTT GA 360
Ala Lys
105
(2) INFORMATION FOR SEQ ID NO: 920
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 106 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(11) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION. SEQ ID NO: 920:
Met He Leu Gly Asn Asn Val Ser Lys Ser Phe Lys Arg Asp Phe Asn
1 5 10 15
Pro Cys Lys He Ser Ala Asn Gly Leu Val Thr Gly Phe Leu Glu Lys
20 25 30
He Thr Leu Lys Ala Leu He Pro Asn Ser He Pro Cys Ser Lys Phe
35 40 45
Thr Asp Arg Ser Leu Glu Ser Leu Phe Arg Leu Ser Asn Ser Ala Trp
50 55 60
Lys Ser Phe Glu Lys Glu Arg Leu He Phe Ser He Ser Asn Leu Ala 65 70 75 80
Phe Leu He Lys Leu Cys Cys Met Met Leu Ser He Ser Leu Leu Asn
85 90 95
Ser Leu Gly Leu Leu He Lys Glu Ala Lys 100 105
(2) INFORMATION FOR SEQ ID NO: 921
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3179 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 61...3120 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 921:
CAGCGTGGCA GTGGGGTCAT TGATAGGGTT AAAAGGCATG ATCAACAATT TAGGGGAGGA 60 ATG ATG CTC GCT TCC ATT ATT GAA TTT TCC TTA CGC CAA AGA GTG ATC 108 Met Met Leu Ala Ser He He Glu Phe Ser Leu Arg Gin Arg Val He 1 5 10 15
GTG ATT GTT GGT GCG ATT CTT ATT TTA TTT TTT GGG ACT TAT AGT TTT 156 Val He Val Gly Ala He Leu He Leu Phe Phe Gly Thr Tyr Ser Phe 20 25 30
ATC AAC ACT CCA GTG GAC GCT TTC CCG GAT ATT TCG CCC ACT CAA GTT 204 He Asn Thr Pro Val Asp Ala Phe Pro Asp He Ser Pro Thr Gin Val 35 40 45
AAA ATC ATT TTA AAA CTC CCC GGC TCT AGC CCT GAA GAA ATG GAA AAC 252 Lys He He Leu Lys Leu Pro Gly Ser Ser Pro Glu Glu Met Glu Asn 50 55 60
AAC ATC GTG CGC CCT TTA GAA TTG GAG CTT TTA GGC TTG AAA GGG CAA 300 Asn He Val Arg Pro Leu Glu Leu Glu Leu Leu Gly Leu Lys Gly Gin 65 70 75 80
AAA TCT TTA AGG AGT GTT TCA AAA TAT TCT ATT TCA GAT ATT ACG ATA 348 Lys Ser Leu Arg Ser Val Ser Lys Tyr Ser He Ser Asp He Thr He 85 90 95
GAT TTT GAT GAC AGC GTG GAT ATT TAT TTA GCG AGG AAT ATT GTC AAT 396 Asp Phe Asp Asp Ser Val Asp He Tyr Leu Ala Arg Asn He Val Asn 100 105 110
GAG CGC TTG AGC AGC GTG ATG AAA GAT TTA CCC GTG GGG GTT GAG GGG 444 Glu Arg Leu Ser Ser Val Met Lys Asp Leu Pro Val Gly Val Glu Gly 115 120 125
GGC ATG GCG CCC ATT GTT ACG CCG CTA TCA GAT ATC TTT ATG TTC ACT 492 Gly Met Ala Pro He Val Thr Pro Leu Ser Asp He Phe Met Phe Thr 130 135 140
ATT GAT GGC AAT ATC ACT GAG ATA GAA AAA CGA CAG CTT TTA GAT TTT 540 He Asp Gly Asn He Thr Glu He Glu Lys Arg Gin Leu Leu Asp Phe 145 150 155 160
GTG ATC CGC CCA CAA TTA AGA ATG ATT AGC GGC GTA GCA GAT GTC AAT 588 Val He Arg Pro Gin Leu Arg Met He Ser Gly Val Ala Asp Val Asn 165 170 175
TCC ATT GGA GGC TTT AGC AGA GCG TTT GTG ATC GTG CCG GAT TTT AAT 636 Ser He Gly Gly Phe Ser Arg Ala Phe Val He Val Pro Asp Phe Asn 180 185 190
GAC ATG GCA AGG CTT GGG GTG AGT ATT TCT GAT TTA GAA TCG GCT GTG 684 Asp Met Ala Arg Leu Gly Val Ser He Ser Asp Leu Glu Ser Ala Val 195 200 205
AGA GTG AAT TTA AGA AAC AGC GGA GCG GGG CGC GTG GAT AGA GAT GGC 732 Arg Val Asn Leu Arg Asn Ser Gly Ala Gly Arg Val Asp Arg Asp Gly 210 215 220
GAA ACC TTT TTA GTC AAA ATC CAA ACC GCT TCT TTG AGT TTA GAA GAC 780 Glu Thr Phe Leu Val Lys He Gin Thr Ala Ser Leu Ser Leu Glu Asp 225 230 235 240
ATT GGC AAA ATC ACC GTT TCC ACT AAT TTA GGG CAT TTG CAC ATT AAG 828 He Gly Lys He Thr Val Ser Thr Asn Leu Gly His Leu His He Lys 245 250 255
GAT TTT GCG AAA GTC ATC AGC CAG TCT CGC ACC CGT TTG GGG TTT GTT 876 Asp Phe Ala Lys Val He Ser Gin Ser Arg Thr Arg Leu Gly Phe Val 260 265 270 ACT AAA GAT GGC GTG GGC GAG ACC ACA GAA GGC TTG GTG CTT TCT TTA 924 Thr Lys Asp Gly Val Gly Glu Thr Thr Glu Gly Leu Val Leu Ser Leu 275 280 285
AAA GAC GCT AAC ACC AAA GAA ATC ATC ACT CAA GTG TAT CAA AAA CTA 972 Lys Asp Ala Asn Thr Lys Glu He He Thr Gin Val Tyr Gin Lys Leu 290 295 300
GAA GAA TTA AAA CCC TTT TTA CCG AAT GGC GTG TCC ATT AAT GTT TTT 1020 Glu Glu Leu Lys Pro Phe Leu Pro Asn Gly Val Ser He Asn Val Phe 305 310 315 320
TAT GAT CGC TCA GAA TTT ACG CAA AAA GCC ATT GCC ACC GTT TCT AAA 1068 Tyr Asp Arg Ser Glu Phe Thr Gin Lys Ala He Ala Thr Val Ser Lys 325 330 335
ACC CTC ATT GAA GCC GTT GTT TTA ATC ATC ATC ACG CTC TTT TTA TTT 1116 Thr Leu He Glu Ala Val Val Leu He He He Thr Leu Phe Leu Phe 340 345 350
TTA GGG AAT TTG AGG GCG AGC GTG GCT GTG GGG GTG ATT TTA CCT TTA 1164 Leu Gly Asn Leu Arg Ala Ser Val Ala Val Gly Val He Leu Pro Leu 355 360 365
AGC TTG TCC GTG GCG TTT ATT TTT ATC AAG TTT AGC GAT CTG ACT TTA 1212 Ser Leu Ser Val Ala Phe He Phe He Lys Phe Ser Asp Leu Thr Leu 370 375 380
AAT TTG ATG AGT TTA GGG GGA TTG GTT ATC GCT ATA GGC ATG CTC ATT 1260 Asn Leu Met Ser Leu Gly Gly Leu Val He Ala He Gly Met Leu He 385 390 395 400
GAC TCA GCC GTG GTG GTG GTG GAA AAC GCT TTT GAA AAA TTA AGC GCT 1308 Asp Ser Ala Val Val Val Val Glu Asn Ala Phe Glu Lys Leu Ser Ala 405 410 415
AAC ACT AAA ACC ACT AAA CTC CAT GCA ATC TAT CGT TCG TGT AAA GAA 1356 Asn Thr Lys Thr Thr Lys Leu His Ala He Tyr Arg Ser Cys Lys Glu 420 425 430
ATC GCT GTT TCA GTG GTG AGC GGG GTG GTG ATC ATC ATT GTG TTT TTT 1404 He Ala Val Ser Val Val Ser Gly Val Val He He He Val Phe Phe 435 440 445
GTG CCG ATT TTA ACC TTA CAG GGG TTA GAG GGT AAG ATG TTT AGG CCT 1452 Val Pro He Leu Thr Leu Gin Gly Leu Glu Gly Lys Met Phe Arg Pro 450 455 460
TTA GCG CAA AGC ATT GTG TAT GCG CTT TTA GGC ACT TTA GTT CTA TCT 1500 Leu Ala Gin Ser He Val Tyr Ala Leu Leu Gly Thr Leu Val Leu Ser 465 470 475 480
ATT ACA ATC ATT CCT GTA GTC AGC TCT CTT GTC TTA AAA GCC ACG CCC 1548 He Thr He He Pro Val Val Ser Ser Leu Val Leu Lys Ala Thr Pro 485 490 495 CAT AGC GAA ACC TTT TTA ACG AGG TTT TTA AAC AGA ATC TAC GCC CCT 1596 His Ser Glu Thr Phe Leu Thr Arg Phe Leu Asn Arg He Tyr Ala Pro 500 505 510
TTA TTG GAA TTT TTT GTG CAT AAC CCT AAA AAA GTG ATT TTA GGA GCG 1644 Leu Leu Glu Phe Phe Val His Asn Pro Lys Lys Val He Leu Gly Ala 515 520 525
TTT GTT TTT TTA ATC GCA AGC CTT TCT TTA TTC CCT TTT GTG GGG AAG 1692 Phe Val Phe Leu He Ala Ser Leu Ser Leu Phe Pro Phe Val Gly Lys 530 535 540
AAT TTC ATG CCC GTT TTA GAT GAG GGC GAT GTG GTT TTG AGC GTG GAA 1740 Asn Phe Met Pro Val Leu Asp Glu Gly Asp Val Val Leu Ser Val Glu 545 550 555 560
ACC ACC CCT TCT ATT TCT TTA GAT CAA TCT AGG GAT CTC ATG CTA AAC 1788 Thr Thr Pro Ser He Ser Leu Asp Gin Ser Arg Asp Leu Met Leu Asn 565 570 575
ATT GAG AGC GCG ATT AAA AAG CAT GTC AAG GAA GTT AAA AGC ATT GTC 1836 He Glu Ser Ala He Lys Lys His Val Lys Glu Val Lys Ser He Val 580 585 590
GCG CGC ACA GGG AGC GAT GAA TTG GGG CTG GAT TTA GGA GGT TTG AAT 1884 Ala Arg Thr Gly Ser Asp Glu Leu Gly Leu Asp Leu Gly Gly Leu Asn 595 600 605
CAA ACC GAT ACT TTT ATT TCT TTT ATT CCT AAA AAA GAA TGG AGC GTT 1932 Gin Thr Asp Thr Phe He Ser Phe He Pro Lys Lys Glu Trp Ser Val 610 615 620
AAA ACC AAA GAT GAA TTA TTA GAA AAA ATC ATG GAT TCT TTA AAA GAC 1980 Lys Thr Lys Asp Glu Leu Leu Glu Lys He Met Asp Ser Leu Lys Asp 625 630 635 640
TTT AAG GGG ATT AAC TTT TCT TTC ACC CAA CCC ATT GAA ATG AGA ATT 2028 Phe Lys Gly He Asn Phe Ser Phe Thr Gin Pro He Glu Met Arg He 645 650 655
TCT GAA ATG CTG ACA GGG GTT AGG GGG GAT TTA GCG GTT AAG ATT TTT 2076 Ser Glu Met Leu Thr Gly Val Arg Gly Asp Leu Ala Val Lys He Phe 660 665 670
GGA GAT GGT ATT AGC GAA TTG AAT GAA TTG AGT TTT CAA ATC GCG CAA 2124 Gly Asp Gly He Ser Glu Leu Asn Glu Leu Ser Phe Gin He Ala Gin 675 680 685
GCT CTA AAA GGG ATT AAA GGA TCT AGT GAA GTT TTA ACC ACG CTT AAT 2172 Ala Leu Lys Gly He Lys Gly Ser Ser Glu Val Leu Thr Thr Leu Asn 690 695 700
GAG GGC GTG AAT TAT TTG TAT GTA ACC CCT AAT AAA GAA TCG ATG GCG 2220 Glu Gly Val Asn Tyr Leu Tyr Val Thr Pro Asn Lys Glu Ser Met Ala 705 710 715 720 GAT GTG GGG ATC ACT AGC GAT GAA TTT TCC AAG TTT TTA AAA TCC GCT 2268 Asp Val Gly He Thr Ser Asp Glu Phe Ser Lys Phe Leu Lys Ser Ala 725 730 735
TTA GAG GGC TTG GTT GTA GAT GTG ATC CCT ACA GGG ATT TCA CGC ACG 2316 Leu Glu Gly Leu Val Val Asp Val He Pro Thr Gly He Ser Arg Thr 740 745 750
CCA GTG ATG ATC CGC CAA GAG AGC GAT TTT GCA AGC TCT ATC ACT AAA 2364 Pro Val Met He Arg Gin Glu Ser Asp Phe Ala Ser Ser He Thr Lys 755 760 765
ATC AAA AGT TTA GCC TTG ACT TCA AAA TAT GGC GTT TTA GTG CCT ATC 2412 He Lys Ser Leu Ala Leu Thr Ser Lys Tyr Gly Val Leu Val Pro He 770 775 780
ACT TCT ATC GCC AAA ATT GAA GAA GTG GAT GGC CCT GTT TCT GTT GTG 2460 Thr Ser He Ala Lys He Glu Glu Val Asp Gly Pro Val Ser Val Val 785 790 795 800
CGT GAA AAT TCA ATG CGC ATG AGC GTG GTT CGC AGT AAT GTG GTG GGG 2508 Arg Glu Asn Ser Met Arg Met Ser Val Val Arg Ser Asn Val Val Gly 805 810 815
CGC GAT TTG AAA TCT TTT GTA GAA GAG GCT AAA AAA GTG ATC GCT CAA 2556 Arg Asp Leu Lys Ser Phe Val Glu Glu Ala Lys Lys Val He Ala Gin 820 825 830
AAC ATC AAA CTC CCT CCC AGC TAC TAT ATC ACT TAT GGG GGG CAG TTT 2604 Asn He Lys Leu Pro Pro Ser Tyr Tyr He Thr Tyr Gly Gly Gin Phe 835 840 845
GAA AAC CAG CAA CGG GCC AAT AAA AGG CTC TCC ACC GTT ATC CCT TTA 2652 Glu Asn Gin Gin Arg Ala Asn Lys Arg Leu Ser Thr Val He Pro Leu 850 855 860
AGC ATC TTA GCG ATT TTT TTC ATT CTT TTT TTC ACT TTT AAA AGC ATT 2700 Ser He Leu Ala He Phe Phe He Leu Phe Phe Thr Phe Lys Ser He 865 870 875 880
CCT TTA GCC TTG CTC ATT CTT TTG AAT ATC CCT TTT GCG GTT ACC GGA 2748 Pro Leu Ala Leu Leu He Leu Leu Asn He Pro Phe Ala Val Thr Gly 885 890 895
GGC CTT ATT GCG TTG TTT GCG GTC GGG GAG TAT ATT TCA GTG CCA GCG 2796 Gly Leu He Ala Leu Phe Ala Val Gly Glu Tyr He Ser Val Pro Ala 900 905 910
AGC GTG GGC TTT ATC GCT CTT TTT GGG ATT GCG GTT TTA AAT GGC GTG 2844 Ser Val Gly Phe He Ala Leu Phe Gly He Ala Val Leu Asn Gly Val 915 920 925
GTG ATG ATA GGC TAT TTT AAA GAG CTT CTC TTG CAA GGG AAA AGC GTA 2892 Val Met He Gly Tyr Phe Lys Glu Leu Leu Leu Gin Gly Lys Ser Val 930 935 940 GAA GAA TGC GTT TTA TTG GGC GCT AAA AGG CGT TTG AGA CCG GTT TTA 2940 Glu Glu Cys Val Leu Leu Gly Ala Lys Arg Arg Leu Arg Pro Val Leu 945 950 955 960
ATG ACC GCT TGC ATT GCC GGT TTG GGT TTG CTC CCT TTA TTA TTT TCT 2988 Met Thr Ala Cys He Ala Gly Leu Gly Leu Leu Pro Leu Leu Phe Ser 965 970 975
CAT AGC GTG GGA TCA GAA GTC CAA AAA CCT TTA GCG ATC GTG GTG CTT 3036 His Ser Val Gly Ser Glu Val Gin Lys Pro Leu Ala He Val Val Leu 980 985 990
GGA GGC TTG GTT ACC TCA AGC GCT CTA ACC TTA CTC CTA CTG CCG CCA 3084 Gly Gly Leu Val Thr Ser Ser Ala Leu Thr Leu Leu Leu Leu Pro Pro 995 1000 1005
ATG TTT ATG CTC ATC GCT AAA AAG ATT AAA ATC GTT TGAGTTAAAG GATTTC 3136 Met Phe Met Leu He Ala Lys Lys He Lys He Val 1010 1015 1020
ACATGCTCGC TTTAGAAATT TATATTGATA TTTGTTTGAA AGA 3179
(2) INFORMATION FOR SEQ ID NO: 922:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1020 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 922:
Met Met Leu Ala Ser He He Glu Phe Ser Leu Arg Gin Arg Val He
1 5 10 15
Val He Val Gly Ala He Leu He Leu Phe Phe Gly Thr Tyr Ser Phe
20 25 30
He Asn Thr Pro Val Asp Ala Phe Pro Asp He Ser Pro Thr Gin Val
35 40 45
Lys He He Leu Lys Leu Pro Gly Ser Ser Pro Glu Glu Met Glu Asn
50 55 60
Asn He Val Arg Pro Leu Glu Leu Glu Leu Leu Gly Leu Lys Gly Gin 65 70 75 80
Lys Ser Leu Arg Ser Val Ser Lys Tyr Ser He Ser Asp He Thr He
85 90 95
Asp Phe Asp Asp Ser Val Asp He Tyr Leu Ala Arg Asn He Val Asn
100 105 110
Glu Arg Leu Ser Ser Val Met Lys Asp Leu Pro Val Gly Val Glu Gly
115 120 125
Gly Met Ala Pro He Val Thr Pro Leu Ser Asp He Phe Met Phe Thr
130 135 140
He Asp Gly Asn He Thr Glu He Glu Lys Arg Gin Leu Leu Asp Phe 145 150 155 160
Val He Arg Pro Gin Leu Arg Met He Ser Gly Val Ala Asp Val Asn 165 170 175
Ser He Gly Gly Phe Ser Arg Ala Phe Val He Val Pro Asp Phe Asn
180 185 190
Asp Met Ala Arg Leu Gly Val Ser He Ser Asp Leu Glu Ser Ala Val
195 200 205
Arg Val Asn Leu Arg Asn Ser Gly Ala Gly Arg Val Asp Arg Asp Gly
210 215 220
Glu Thr Phe Leu Val Lys He Gin Thr Ala Ser Leu Ser Leu Glu Asp 225 230 235 240
He Gly Lys He Thr Val Ser Thr Asn Leu Gly His Leu His He Lys
245 250 255
Asp Phe Ala Lys Val He Ser Gin Ser Arg Thr Arg Leu Gly Phe Val
260 265 270
Thr Lys Asp Gly Val Gly Glu Thr Thr Glu Gly Leu Val Leu Ser Leu
275 280 285
Lys Asp Ala Asn Thr Lys Glu He He Thr Gin Val Tyr Gin Lys Leu
290 295 300
Glu Glu Leu Lys Pro Phe Leu Pro Asn Gly Val Ser He Asn Val Phe 305 310 315 320
Tyr Asp Arg Ser Glu Phe Thr Gin Lys Ala He Ala Thr Val Ser Lys
325 330 335
Thr Leu He Glu Ala Val Val Leu He He He Thr Leu Phe Leu Phe
340 345 350
Leu Gly Asn Leu Arg Ala Ser Val Ala Val Gly Val He Leu Pro Leu
355 360 365
Ser Leu Ser Val Ala Phe He Phe He Lys Phe Ser Asp Leu Thr Leu
370 375 380
Asn Leu Met Ser Leu Gly Gly Leu Val He Ala He Gly Met Leu He 385 390 395 400
Asp Ser Ala Val Val Val Val Glu Asn Ala Phe Glu Lys Leu Ser Ala
405 410 415
Asn Thr Lys Thr Thr Lys Leu His Ala He Tyr Arg Ser Cys Lys Glu
420 425 430
He Ala Val Ser Val Val Ser Gly Val Val He He He Val Phe Phe
435 440 445
Val Pro He Leu Thr Leu Gin Gly Leu Glu Gly Lys Met Phe Arg Pro
450 455 460
Leu Ala Gin Ser He Val Tyr Ala Leu Leu Gly Thr Leu Val Leu Ser 465 470 475 480
He Thr He He Pro Val Val Ser Ser Leu Val Leu Lys Ala Thr Pro
485 490 495
His Ser Glu Thr Phe Leu Thr Arg Phe Leu Asn Arg He Tyr Ala Pro
500 505 510
Leu Leu Glu Phe Phe Val His Asn Pro Lys Lys Val He Leu Gly Ala
515 520 525
Phe Val Phe Leu He Ala Ser Leu Ser Leu Phe Pro Phe Val Gly Lys
530 535 540
Asn Phe Met Pro Val Leu Asp Glu Gly Asp Val Val Leu Ser Val Glu 545 550 555 560
Thr Thr Pro Ser He Ser Leu Asp Gin Ser Arg Asp Leu Met Leu Asn
565 570 575
He Glu Ser Ala He Lys Lys His Val Lys Glu Val Lys Ser He Val
580 585 590
Ala Arg Thr Gly Ser Asp Glu Leu Gly Leu Asp Leu Gly Gly Leu Asn 595 600 605 Gin Thr Asp Thr Phe He Ser Phe He Pro Lys Lys Glu Trp Ser Val
610 615 620
Lys Thr Lys Asp Glu Leu Leu Glu Lys He Met Asp Ser Leu Lys Asp 625 630 635 640
Phe Lys Gly He Asn Phe Ser Phe Thr Gin Pro He Glu Met Arg He
645 650 655
Ser Glu Met Leu Thr Gly Val Arg Gly Asp Leu Ala Val Lys He Phe
660 665 670
Gly Asp Gly He Ser Glu Leu Asn Glu Leu Ser Phe Gin He Ala Gin
675 680 685
Ala Leu Lys Gly He Lys Gly Ser Ser Glu Val Leu Thr Thr Leu Asn
690 695 700
Glu Gly Val Asn Tyr Leu Tyr Val Thr Pro Asn Lys Glu Ser Met Ala 705 710 715 720
Asp Val Gly He Thr Ser Asp Glu Phe Ser Lys Phe Leu Lys Ser Ala
725 730 735
Leu Glu Gly Leu Val Val Asp Val He Pro Thr Gly He Ser Arg Thr
740 745 750
Pro Val Met He Arg Gin Glu Ser Asp Phe Ala Ser Ser He Thr Lys
755 760 765
He Lys Ser Leu Ala Leu Thr Ser Lys Tyr Gly Val Leu Val Pro He
770 775 780
Thr Ser He Ala Lys He Glu Glu Val Asp Gly Pro Val Ser Val Val 785 790 795 800
Arg Glu Asn Ser Met Arg Met Ser Val Val Arg Ser Asn Val Val Gly
805 810 815
Arg Asp Leu Lys Ser Phe Val Glu Glu Ala Lys Lys Val He Ala Gin
820 825 830
Asn He Lys Leu Pro Pro Ser Tyr Tyr He Thr Tyr Gly Gly Gin Phe
835 840 845
Glu Asn Gin Gin Arg Ala Asn Lys Arg Leu Ser Thr Val He Pro Leu
850 855 860
Ser He Leu Ala He Phe Phe He Leu Phe Phe Thr Phe Lys Ser He 865 870 875 880
Pro Leu Ala Leu Leu He Leu Leu Asn He Pro Phe Ala Val Thr Gly
885 890 895
Gly Leu He Ala Leu Phe Ala Val Gly Glu Tyr He Ser Val Pro Ala
900 905 910
Ser Val Gly Phe He Ala Leu Phe Gly He Ala Val Leu Asn Gly Val
915 920 925
Val Met He Gly Tyr Phe Lys Glu Leu Leu Leu Gin Gly Lys Ser Val
930 935 940
Glu Glu Cys Val Leu Leu Gly Ala Lys Arg Arg Leu Arg Pro Val Leu 945 950 955 960
Met Thr Ala Cys He Ala Gly Leu Gly Leu Leu Pro Leu Leu Phe Ser
965 970 975
His Ser Val Gly Ser Glu Val Gin Lys Pro Leu Ala He Val Val Leu
980 985 990
Gly Gly Leu Val Thr Ser Ser Ala Leu Thr Leu Leu Leu Leu Pro Pro
995 1000 1005
Met Phe Met Leu He Ala Lys Lys He Lys He Val 1010 1015 1020
(2) INFORMATION FOR SEQ ID NO:923: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 720 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 33...638 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 923:
AAGCTTATAA AATCATCAAA AAGAGTGCTG AA ATG AAT GTT TTA ATC AGA TTG 53
Met Asn Val Leu He Arg Leu 1 5
TGC TTT ATT TTT TTG ATT GGG TTT TTT GGC GCG AAT AAA ACC CTA AAC 101 Cys Phe He Phe Leu He Gly Phe Phe Gly Ala Asn Lys Thr Leu Asn 10 15 20
GCA ACA GCC ATT CTT TCT CTT GAC TTT GGC TCT TTT TCC ATG CCA ATC 149 Ala Thr Ala He Leu Ser Leu Asp Phe Gly Ser Phe Ser Met Pro He 25 30 35
ACT GCC AAT TTC TCA GAT GGT GCG TTA AAT GTA TTC AAA TGG TTT GAA 197 Thr Ala Asn Phe Ser Asp Gly Ala Leu Asn Val Phe Lys Trp Phe Glu 40 45 50 55
AAA CAC CCA TCA GTG GGT GTT AAA GTT GGT CGG CTT GCA AAT CAA GAC 245 Lys His Pro Ser Val Gly Val Lys Val Gly Arg Leu Ala Asn Gin Asp 60 65 70
GAC ACT ATC TTT ACT CTA GTT TTC ATT GTG ATA GTT GTC GCA ATA ATT 293 Asp Thr He Phe Thr Leu Val Phe He Val He Val Val Ala He He 75 80 85
GCC CTT ATC GCT ATT TTT ATA AGG AGT ATA TTA CTA AAC ACA ATT TTT 341 Ala Leu He Ala He Phe He Arg Ser He Leu Leu Asn Thr He Phe 90 95 100
GTA GGA TCG CTC ATA GGA TCC TTA TGG TTG TAT ATG GTA GGG TTT TAT 389 Val Gly Ser Leu He Gly Ser Leu Trp Leu Tyr Met Val Gly Phe Tyr 105 110 115
TAT TTT TAT GGT GTT CCC TTT TTG AGT TAT TTG AGC GGT TGT TAT GAA 437 Tyr Phe Tyr Gly Val Pro Phe Leu Ser Tyr Leu Ser Gly Cys Tyr Glu 120 125 130 135
TCG TTT TCT TTC TCC GCA TGC TAT CCT CAT AGT TTG CAG CTA CTC CCC 485 Ser Phe Ser Phe Ser Ala Cys Tyr Pro His Ser Leu Gin Leu Leu Pro 140 145 150
ACC CTT ATG CAG TAT TCG CCC ATT TAC TCC ATA ATC AAA CTT CTT GCT 533 Thr Leu Met Gin Tyr Ser Pro He Tyr Ser He He Lys Leu Leu Ala 155 160 165
CAT TTT AAT ATA GAG ATC ACT TCT AAG ATT ATC ATT TCT CTT GTT TGG 581 His Phe Asn He Glu He Thr Ser Lys He He He Ser Leu Val Trp 170 175 180
GTG TGT ATA GGG CTG TAT TTT TTG TTA TTG CAA GCG TTT TTT AGT CTT 629 Val Cys He Gly Leu Tyr Phe Leu Leu Leu Gin Ala Phe Phe Ser Leu 185 190 195
ACA AAT TAT TAGTTGCAGA AAATTCAAGA AGGCAAAAAA TTATCTTTTT TCCTCGAAT 687
Thr Asn Tyr
200
CAATCATTAG GTTATTTTTT GGTTTTATGA TAG 720
(2) INFORMATION FOR SEQ ID NO: 924:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 202 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 924:
Met Asn Val Leu He Arg Leu Cys Phe He Phe Leu He Gly Phe Phe
1 5 10 15
Gly Ala Asn Lys Thr Leu Asn Ala Thr Ala He Leu Ser Leu Asp Phe
20 25 30
Gly Ser Phe Ser Met Pro He Thr Ala Asn Phe Ser Asp Gly Ala Leu
35 40 45
Asn Val Phe Lys Trp Phe Glu Lys His Pro Ser Val Gly Val Lys Val
50 55 60
Gly Arg Leu Ala Asn Gin Asp Asp Thr He Phe Thr Leu Val Phe He 65 70 75 80
Val He Val Val Ala He He Ala Leu He Ala He Phe He Arg Ser
85 90 95
He Leu Leu Asn Thr He Phe Val Gly Ser Leu He Gly Ser Leu Trp
100 105 110
Leu Tyr Met Val Gly Phe Tyr Tyr Phe Tyr Gly Val Pro Phe Leu Ser
115 120 125
Tyr Leu Ser Gly Cys Tyr Glu Ser Phe Ser Phe Ser Ala Cys Tyr Pro
130 135 140
His Ser Leu Gin Leu Leu Pro Thr Leu Met Gin Tyr Ser Pro He Tyr 145 150 155 160
Ser He He Lys Leu Leu Ala His Phe Asn He Glu He Thr Ser Lys
165 170 175
He He He Ser Leu Val Trp Val Cys He Gly Leu Tyr Phe Leu Leu 180 185 190
Leu Gin Ala Phe Phe Ser Leu Thr Asn Tyr 195 200
(2) INFORMATTON FOR SEQ ID NO: 925
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 310 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 65...280 (D) OTHER INFORMATION-
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 925:
AATTCAAGAA GGCAAAAAAT TATCTTTTTT CCTCGAATCA ATCATTAGGT TATTTTTTGG 60 TTTT ATG ATA GTT TCT TTT ATT GCC GTT CCA TGC TAC TAT GTT TTA TTG 109 Met He Val Ser Phe He Ala Val Pro Cys Tyr Tyr Val Leu Leu 1 5 10 15
GCG ATG GAA TAC CAA ATA GCC TAT GAA CAC CCA GGA GAA TTA ATA AGC 157 Ala Met Glu Tyr Gin He Ala Tyr Glu His Pro Gly Glu Leu He Ser 20 25 30
ACG ATT GGT TTT GTT GCG TTA GCA GTG CTT GTG TAT TAC TTA TGG GGT 205 Thr He Gly Phe Val Ala Leu Ala Val Leu Val Tyr Tyr Leu Trp Gly 35 40 45
AAA TGG GAG AAG TTG CTA TGG GGC GCA CCT TCC AAT CAA GAG CAA CAA 253 Lys Trp Glu Lys Leu Leu Trp Gly Ala Pro Ser Asn Gin Glu Gin Gin 50 55 60
CTC TCC AAT CAA GGC AAC CAA AAT CAA TGATTGTGAT TGATCGCTAG GTCAATC 307 Leu Ser Asn Gin Gly Asn Gin Asn Gin 65 70
TGA 310
(2) INFORMATION FOR SEQ ID NO: 926:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 72 ammo acids
Figure imgf001370_0001
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 926:
Met He Val Ser Phe He Ala Val Pro Cys Tyr Tyr Val Leu Leu Ala
1 5 10 15
Met Glu Tyr Gin He Ala Tyr Glu His Pro Gly Glu Leu He Ser Thr
20 25 30
He Gly Phe Val Ala Leu Ala Val Leu Val Tyr Tyr Leu Trp Gly Lys
35 40 45
Trp Glu Lys Leu Leu Trp Gly Ala Pro Ser Asn Gin Glu Gin Gin Leu
50 55 60
Ser Asn Gin Gly Asn Gin Asn Gin 65 70
(2) INFORMATION FOR SEQ ID NO: 927:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 311 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 60...287 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 927:
TAGGAAAAAG ACTATCATGC GAGCTATCCA AATTAGATCC GATCAAAAAC TACCCTTTA A 60
Me
TGG TTG TAT CAA TGA ACT GCA TCG GCT CTA AAT ACA AAC TCA TTG CCT 108 t Val Val Ser Met Asn Cys He Gly Ser Lys Tyr Lys Leu He Ala Ph 5 10 15
TTA TTC AAG AAA ATA TCC ATG CGG TTG TGG GGC AAC CTT TTG GGT GTG 156 e He Gin Glu Asn He His Ala Val Val Gly Gin Pro Phe Gly Cys As 20 25 30
ATT TTT TGC GAT CTG TTC GCT GGG ACG GGT ATC GTG GGG TGT GCG TAA 204 p Phe Leu Arg Ser Val Arg Trp Asp Gly Tyr Arg Gly Val Cys Val Ly 35 40 45
AGT GGT CTC TAG GTT CAA CAC TAA AAA ACA TTT TTT CAT TAG ACA GCG 252 s Trp Ser Leu Gly Ser Thr Leu Lys Asn He Phe Ser Leu Asp Ser Va 50 55 60 6
TGT TAA AAG CCA ATC AAG TTA TCC CTA AAG ATG CT TAACATGTTA AAATAAT 304 1 Leu Lys Ala Asn Gin Val He Pro Lys Asp Ala 70 75
CTCATAC 311
(2) INFORMATTON FOR SEQ ID NO: 928:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 76 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 928:
Met Val Val Ser Met Asn Cys He Gly Ser Lys Tyr Lys Leu He Ala
1 5 10 15
Phe He Gin Glu Asn He His Ala Val Val Gly Gin Pro Phe Gly Cys
20 25 30
Asp Phe Leu Arg Ser Val Arg Trp Asp Gly Tyr Arg Gly Val Cys Val
35 40 45
Lys Trp Ser Leu Gly Ser Thr Leu Lys Asn He Phe Ser Leu Asp Ser
50 55 60
Val Leu Lys Ala Asn Gin Val He Pro Lys Asp Ala 65 70 75
(2) INFORMATION FOR SEQ ID NO: 929:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 900 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...872 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 929:
AAAATTAAGT TGTTTGATCG CTTTTAAACG ATTTTTAAAA GGAAAAATTT ATG GAT 56
Met Asp
1
GAA ATT AAA ACG CTG TTA GTG GAT TTT TTT CCG CAG GCA AAG CAT TTT 104 Glu He Lys Thr Leu Leu Val Asp Phe Phe Pro Gin Ala Lys His Phe 5 10 15
GGG ATA ATC TTA ATC AAG GCT ATT GTT GTC TTT TGT ATA GGT TTT TAT 152 Gly He He Leu He Lys Ala He Val Val Phe Cys He Gly Phe Tyr 20 25 30
TTT TCA TTT TTC TTA CAA AAA AAA ACC ATG AAA TTT TTA TCC AAA AAG 200 Phe Ser Phe Phe Leu Gin Lys Lys Thr Met Lys Phe Leu Ser Lys Lys 35 40 45 50
GAT GAG ATT TTA GCG AAT TTT GTC GCA CAG GTT ACT TTT ATC TTA ATC 248 Asp Glu He Leu Ala Asn Phe Val Ala Gin Val Thr Phe He Leu He 55 60 65
CTT ATC ATC ACC ACA ATC ATT GCG CTC AGC ACG CTA GGC GTG CAA ACC 296 Leu He He Thr Thr He He Ala Leu Ser Thr Leu Gly Val Gin Thr 70 75 80
ACC TCT ATT ATC ACT GTT TTA GGA ACG GTA GGG ATT GCT GTG GCG TTG 344 Thr Ser He He Thr Val Leu Gly Thr Val Gly He Ala Val Ala Leu 85 90 95
GCT TTA AAA GAT TAT CTT TCA AGC ATT GCT GGA GGG ATA ATC CTT ATT 392 Ala Leu Lys Asp Tyr Leu Ser Ser He Ala Gly Gly He He Leu He 100 105 110
ATC TTG CAC CCT TTC AAA AAA GGA GAC ATC ATT GAA ATC TCT GGC CTA 440 He Leu His Pro Phe Lys Lys Gly Asp He He Glu He Ser Gly Leu 115 120 125 130
GAG GGC AAA GTA GAA GCG CTT AAT TTT TTT AAC ACT TCT TTA CGC TTG 488 Glu Gly Lys Val Glu Ala Leu Asn Phe Phe Asn Thr Ser Leu Arg Leu 135 140 145
CAT GAC GGA CGC TTG GCG GTT TTA CCC AAT AGA AGT GTC GCT AAT TCT 536 His Asp Gly Arg Leu Ala Val Leu Pro Asn Arg Ser Val Ala Asn Ser 150 155 160
AAT ATT ATC AAT AGC AAT AAC ACG GCG TGT CGG CGC ATT GAA TGG GTT 584 Asn He He Asn Ser Asn Asn Thr Ala Cys Arg Arg He Glu Trp Val 165 170 175
TGT GGG GTA GGG TAT GGG AGC GAT ATT GAA CTG GTG CAT AAG ACT ATA 632 Cys Gly Val Gly Tyr Gly Ser Asp He Glu Leu Val His Lys Thr He 180 185 190
AAA GAT GTT ATT GAT GCA ATG GAA AAA ATT GAT AAA AAC ATG CCC ACT 680 Lys Asp Val He Asp Ala Met Glu Lys He Asp Lys Asn Met Pro Thr 195 200 205 210
TTT ATT GGG ATC ACG GAT TTT GGA CAA AGT TCG CTG AAT TTC ACC ATT 728 Phe He Gly He Thr Asp Phe Gly Gin Ser Ser Leu Asn Phe Thr He 215 220 225
AGG GTT TGG GCA AAG ATT GAA GAC GGA ATC TTT AAT GTG CGC AGC GAA 776 Arg Val Trp Ala Lys He Glu Asp Gly He Phe Asn Val Arg Ser Glu 230 235 240 CTC ATT GAA CGC ATC AAA AAC GCC CTA GAC GCT AAC CAC ATT GAA ATC 824 Leu He Glu Arg He Lys Asn Ala Leu Asp Ala Asn His He Glu He 245 250 255
CCT TTC AAC AAG CTA GAT ATT GCT ATT AAA AAT CAA GAC TCT CCT AAA T 873 Pro Phe Asn Lys Leu Asp He Ala He Lys Asn Gin Asp Ser Pro Lys 260 265 270
GATTGGTGTG AGATGTATTG ATTGTAG 900
(2) INFORMATION FOR SEQ ID NO: 930:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 274 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 930:
Met Asp Glu He Lys Thr Leu Leu Val Asp Phe Phe Pro Gin Ala Lys
1 5 10 15
His Phe Gly He He Leu He Lys Ala He Val Val Phe Cys He Gly
20 25 30
Phe Tyr Phe Ser Phe Phe Leu Gin Lys Lys Thr Met Lys Phe Leu Ser
35 40 45
Lys Lys Asp Glu He Leu Ala Asn Phe Val Ala Gin Val Thr Phe He
50 55 60
Leu He Leu He He Thr Thr He He Ala Leu Ser Thr Leu Gly Val 65 70 75 80
Gin Thr Thr Ser He He Thr Val Leu Gly Thr Val Gly He Ala Val
85 90 95
Ala Leu Ala Leu Lys Asp Tyr Leu Ser Ser He Ala Gly Gly He He
100 105 110
Leu He He Leu His Pro Phe Lys Lys Gly Asp He He Glu He Ser
115 120 125
Gly Leu Glu Gly Lys Val Glu Ala Leu Asn Phe Phe Asn Thr Ser Leu
130 135 140
Arg Leu His Asp Gly Arg Leu Ala Val Leu Pro Asn Arg Ser Val Ala 145 150 155 160
Asn Ser Asn He He Asn Ser Asn Asn Thr Ala Cys Arg Arg He Glu
165 170 175
Trp Val Cys Gly Val Gly Tyr Gly Ser Asp He Glu Leu Val His Lys
180 185 190
Thr He Lys Asp Val He Asp Ala Met Glu Lys He Asp Lys Asn Met
195 200 205
Pro Thr Phe He Gly He Thr Asp Phe Gly Gin Ser Ser Leu Asn Phe
210 215 220
Thr He Arg Val Trp Ala Lys He Glu Asp Gly He Phe Asn Val Arg 225 230 235 240
Ser Glu Leu He Glu Arg He Lys Asn Ala Leu Asp Ala Asn His He
245 250 255
Glu He Pro Phe Asn Lys Leu Asp He Ala He Lys Asn Gin Asp Ser 260 265 270
Pro Lys
(2) INFORMATION FOR SEQ ID NO: 931:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 833 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...762 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 931:
TAGCGGTTGG GTAATTCACT CCAATCTCTA AGTGGCAGTG GCTATGAAAA C ATG CAA 57
Met Gin
1
TCA CTT GCT GGG GGA TTG AGT GGT AGA GCG TGG GGA GAA ATG TTG TGT 105 Ser Leu Ala Gly Gly Leu Ser Gly Arg Ala Trp Gly Glu Met Leu Cys 5 10 15
AAA ATG GTA AAC GAT AGT AAT TAT GAA AGC GAG CAA GCT CTT TTA GCA 153 Lys Met Val Asn Asp Ser Asn Tyr Glu Ser Glu Gin Ala Leu Leu Ala 20 25 30
ACA GGC AAT AGC TCA GAA GAG CAA AAA CGA AGA TTT TTG CTT AGA GTA 201 Thr Gly Asn Ser Ser Glu Glu Gin Lys Arg Arg Phe Leu Leu Arg Val 35 40 45 50
AAG AAA AAG GTT AAT GAT AAT AGG CAG TTA AAA AAG AAA CTT GAC CCA 249 Lys Lys Lys Val Asn Asp Asn Arg Gin Leu Lys Lys Lys Leu Asp Pro 55 60 65
TTT CTA AAA AGA CTT GAT GTC CTA CAA ACT GAG TTT GGT GTA ACT GAC 297 Phe Leu Lys Arg Leu Asp Val Leu Gin Thr Glu Phe Gly Val Thr Asp 70 75 80
CCT ACA GCT AAC CAT AAT AAG CAA GGG ATA CAT TAT TGC ACA GAA AAT 345 Pro Thr Ala Asn His Asn Lys Gin Gly He His Tyr Cys Thr Glu Asn 85 90 95
AAA AAG ACA GGT AAA TGC GAC CCT ATT GAT AAT GTA TTT AGG ACA ACT 393 Lys Lys Thr Gly Lys Cys Asp Pro He Asp Asn Val Phe Arg Thr Thr 100 105 110 CGC TTA GAT AAC GAA TTA GAA CAA GAA ATC CAA ACG CTC ACA CTT GAT 441 Arg Leu Asp Asn Glu Leu Glu Gin Glu He Gin Thr Leu Thr Leu Asp 115 120 125 130
TTA ACC AAA GCC CCC AAT AAA GAC GCT CAA AGC CAA GCC TAC GCA AAT 489 Leu Thr Lys Ala Pro Asn Lys Asp Ala Gin Ser Gin Ala Tyr Ala Asn 135 140 145
TTC AAT CAA AGG ATT AAA TTA CTT ACT CTA AAA TAT TTA AAA GAA ATT 537 Phe Asn Gin Arg He Lys Leu Leu Thr Leu Lys Tyr Leu Lys Glu He 150 155 160
ACC AAT CAA ATG CTC TTT TTA AAT CAA ACA ATG GCA ATG CAA AGC GAG 585 Thr Asn Gin Met Leu Phe Leu Asn Gin Thr Met Ala Met Gin Ser Glu 165 170 175
ATT ATG GCA GAT GAT TAT TTT AGG CAA AAT AAT GAT GGC TTT GGG AAA 633 He Met Ala Asp Asp Tyr Phe Arg Gin Asn Asn Asp Gly Phe Gly Lys 180 185 190
GAA GAA AAC CAT ATA GAC AAA CAA TTA ACG CAA AAA AGA ATA AAC GAA 681 Glu Glu Asn His He Asp Lys Gin Leu Thr Gin Lys Arg He Asn Glu 195 200 205 210
AGA GAA AGA GCC AGA ATA TAC TTT CAA AAC CCT AAT GTT AAA TTT GAC 729 Arg Glu Arg Ala Arg He Tyr Phe Gin Asn Pro Asn Val Lys Phe Asp 215 220 225
CAA TTT GGT TTT CCC ATT TTT AGT ATA TGG GAT TAAGGGTTTA GTGATGAGAG 782 Gin Phe Gly Phe Pro He Phe Ser He Trp Asp 230 235
ATAGAATAAG TATTTTTTTT CCAAACTATT CCTATTTTAG TGGTAGTGTT G 833
(2) INFORMATION FOR SEQ ID NO: 932:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 237 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 932:
Met Gin Ser Leu Ala Gly Gly Leu Ser Gly Arg Ala Trp Gly Glu Met
1 5 10 15
Leu Cys Lys Met Val Asn Asp Ser Asn Tyr Glu Ser Glu Gin Ala Leu
20 25 30
Leu Ala Thr Gly Asn Ser Ser Glu Glu Gin Lys Arg Arg Phe Leu Leu
35 40 45
Arg Val Lys Lys Lys Val Asn Asp Asn Arg Gin Leu Lys Lys Lys Leu
50 55 60
Asp Pro Phe Leu Lys Arg Leu Asp Val Leu Gin Thr Glu Phe Gly Val 65 70 75 80
Thr Asp Pro Thr Ala Asn His Asn Lys Gin Gly He His Tyr Cys Thr
85 90 95
Glu Asn Lys Lys Thr Gly Lys Cys Asp Pro He Asp Asn Val Phe Arg
100 105 110
Thr Thr Arg Leu Asp Asn Glu Leu Glu Gin Glu He Gin Thr Leu Thr
115 120 125
Leu Asp Leu Thr Lys Ala Pro Asn Lys Asp Ala Gin Ser Gin Ala Tyr
130 135 140
Ala Asn Phe Asn Gin Arg He Lys Leu Leu Thr Leu Lys Tyr Leu Lys 145 150 155 160
Glu He Thr Asn Gin Met Leu Phe Leu Asn Gin Thr Met Ala Met Gin
165 170 175
Ser Glu He Met Ala Asp Asp Tyr Phe Arg Gin Asn Asn Asp Gly Phe
180 185 190
Gly Lys Glu Glu Asn His He Asp Lys Gin Leu Thr Gin Lys Arg He
195 200 205
Asn Glu Arg Glu Arg Ala Arg He Tyr Phe Gin Asn Pro Asn Val Lys
210 215 220
Phe Asp Gin Phe Gly Phe Pro He Phe Ser He Trp Asp 225 230 235
(2) INFORMATION FOR SEQ ID NO:933:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 351 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 63...311 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:933:
TCAAACCAAA ACCAACACAA AATTTGCTAA ACTACAATCA AATCAATTTA GGGAGGATAA 60
AA ATG TCA TTT GCC CCT ATG TTA TTA GCT ACA ATC AAT AAC TCT ATT 107
Met Ser Phe Ala Pro Met Leu Leu Ala Thr He Asn Asn Ser He 1 5 10 15
GGC AAT AAA GAT AAG CAT GTG AGT TTA GAG TAT CTT ATA GGG CTT TTT 155 Gly Asn Lys Asp Lys His Val Ser Leu Glu Tyr Leu He Gly Leu Phe 20 25 30
ATG GAT AAA AAA ACA ACT AAT CTA AGC AAT ACT GAC AAG TAT ATT ATA 203 Met Asp Lys Lys Thr Thr Asn Leu Ser Asn Thr Asp Lys Tyr He He 35 40 45
GGC ACA ATT CAA ACA GAG GCA CTA GAG CAA GAA ATA GAA TGG TTT TCA 251 Gly Thr He Gin Thr Glu Ala Leu Glu Gin Glu He Glu Trp Phe Ser 50 55 60
CAA GAC TAT CAC ATT CCT ATG GAG AAT ATT TTA CAT GTC CTT TCT ATC 299 Gin Asp Tyr His He Pro Met Glu Asn He Leu His Val Leu Ser He 65 70 75
AAT CCC TAT CAA TGAAAAGAGC CTTAGTTTTA TCAAAAACAA CTTTCAAGCT 351
Asn Pro Tyr Gin
80
(2) INFORMATION FOR SEQ ID NO: 934:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 83 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ TD NO: 934:
Met Ser Phe Ala Pro Met Leu Leu Ala Thr He Asn Asn Ser He Gly
1 5 10 15
Asn Lys Asp Lys His Val Ser Leu Glu Tyr Leu He Gly Leu Phe Met
20 25 30
Asp Lys Lys Thr Thr Asn Leu Ser Asn Thr Asp Lys Tyr He He Gly
35 40 45
Thr He Gin Thr Glu Ala Leu Glu Gin Glu He Glu Trp Phe Ser Gin
50 55 60
Asp Tyr His He Pro Met Glu Asn He Leu His Val Leu Ser He Asn 65 70 75 80
Pro Tyr Gin
(2) INFORMATION FOR SEQ ID NO:935
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1934 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 67...1866 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 935: AAACATAGGG CAAATCTAGT TGGCACAAAA ACAGCTAGTC CTGTGCTTAT TAAAAACATA 60
GGGCAA ATG AAA CGC TCC CAC TTA GAA AAT GCC CTA AAT TAT GCT TTA 108 Met Lys Arg Ser His Leu Glu Asn Ala Leu Asn Tyr Ala Leu 1 5 10
GAA AAT AGC GAA ACA GCT TAC AAT GAA ATG TTT TTA GAA TGC GAT AAG 156 Glu Asn Ser Glu Thr Ala Tyr Asn Glu Met Phe Leu Glu Cys Asp Lys 15 20 25 30
CAA TTC ATC TTA GAG AGT TGG CTC AAT GAC TTT GAT TTG ACT AAA GAT 204 Gin Phe He Leu Glu Ser Trp Leu Asn Asp Phe Asp Leu Thr Lys Asp 35 40 45
TAT AAC GAG ACT ATG CAC TTA GTT TTT TCT ATC AAA GAT AAG CCA GAT 252 Tyr Asn Glu Thr Met His Leu Val Phe Ser He Lys Asp Lys Pro Asp 50 55 60
GAA GAG ACA ATG CAA GGG CTT TTA CAT TCT ACT TGG GAG AGC TTA AAA 300 Glu Glu Thr Met Gin Gly Leu Leu His Ser Thr Trp Glu Ser Leu Lys 65 70 75
ATA AGA TTG CCT GAA TAC AAG TTT GCC CTT GTG CCA CAC GCT CAT CAA 348 He Arg Leu Pro Glu Tyr Lys Phe Ala Leu Val Pro His Ala His Gin 80 85 90
GAC CAT GCC CAT ATC CAT TGT TTT ATC AAT AAG ACT AAT CAG CTC ACA 396 Asp His Ala His He His Cys Phe He Asn Lys Thr Asn Gin Leu Thr 95 100 105 110
CGA AGA AGA CTG CGT TTT AAG GGG CAT GAA GAT TGT AAA GAA TTT TTT 444 Arg Arg Arg Leu Arg Phe Lys Gly His Glu Asp Cys Lys Glu Phe Phe 115 120 125
AAT GAA TTA AGA AGT GAG TTT GCT TAT AGG TTG AAT GAC CAC TTA TTG 492 Asn Glu Leu Arg Ser Glu Phe Ala Tyr Arg Leu Asn Asp His Leu Leu 130 135 140
AGC GAA GAA TAC TTG TAT GTC AAT GAG CCA AAA CTT AAA GAG CTA GAC 540 Ser Glu Glu Tyr Leu Tyr Val Asn Glu Pro Lys Leu Lys Glu Leu Asp 145 150 155
AAT ATC AAA CAA CAA TTA CAA GAC TTG GAA AAA GAA GAA AAA GCC TTA 588 Asn He Lys Gin Gin Leu Gin Asp Leu Glu Lys Glu Glu Lys Ala Leu 160 165 170
GAA CAA ATC AAA TCC CCA CAA GAT GAG TGG GAC TTA AAC AAG GCT TTA 636 Glu Gin He Lys Ser Pro Gin Asp Glu Trp Asp Leu Asn Lys Ala Leu 175 180 185 190
CAA AGC GAG TAT TTA CAA GAA CTC AAA TAT AAA AAC AAA GCA AAA GCC 684 Gin Ser Glu Tyr Leu Gin Glu Leu Lys Tyr Lys Asn Lys Ala Lys Ala 195 200 205
CTA GAC ATT CAA AAT AAC CAC AGC ACC CCT TTA AAA CAA AAG ATT TCT 732 Leu Asp He Gin Asn Asn His Ser Thr Pro Leu Lys Gin Lys He Ser 210 215 220
GAA TTT AAA ATC GCT CTG TTT AAT CAC AAA GAC ACA AGC GAT GAT GAA 780 Glu Phe Lys He Ala Leu Phe Asn His Lys Asp Thr Ser Asp Asp Glu 225 230 235
AAA GAA CAG CTA GAT ATT GAC AGG ATA GAT AAG AGA AAA CCA GTA AGC 828 Lys Glu Gin Leu Asp He Asp Arg He Asp Lys Arg Lys Pro Val Ser 240 245 250
GAA CAC TTA AAA AAC ACT AAC AAA CAC GAG CTA TAC GAA CTC TTA GGC 876 Glu His Leu Lys Asn Thr Asn Lys His Glu Leu Tyr Glu Leu Leu Gly 255 260 265 270
TTT TAT CAA AAA GAA TTA GAT AAA AAA CAA AAC CAT TCA GCC TTT AAG 924 Phe Tyr Gin Lys Glu Leu Asp Lys Lys Gin Asn His Ser Ala Phe Lys 275 280 285
AAT TTT GCT ATT CTC AAT GGT TTA GAC AGA GAC TTT GAA AGA GAG ACT 972 Asn Phe Ala He Leu Asn Gly Leu Asp Arg Asp Phe Glu Arg Glu Thr 290 295 300
AAT GGC TAT TCT GTT TTA AAG AAA AAA GAA ATG CTT TTA AAT AAG CTT 1020 Asn Gly Tyr Ser Val Leu Lys Lys Lys Glu Met Leu Leu Asn Lys Leu 305 310 315
GAA CAC CTA GAC AAA CGC CTT TTA GAT AAA AAC TCA CAC TTA CTA TTA 1068 Glu His Leu Asp Lys Arg Leu Leu Asp Lys Asn Ser His Leu Leu Leu 320 325 330
GCC CAG CTA AGA AAT GAA GTT AAA ACC AAG CAA AAC ATC CAA TAC AAC 1116 Ala Gin Leu Arg Asn Glu Val Lys Thr Lys Gin Asn He Gin Tyr Asn 335 340 345 350
ACT CTA ACT AAT CCT ATT CTT TTA GCC AAA GCC TTA GAA CTT TCT AAA 1164 Thr Leu Thr Asn Pro He Leu Leu Ala Lys Ala Leu Glu Leu Ser Lys 355 360 365
GAT AAA CGC CCC ACT CTC AAA ACT TTT AAA AAC GCT TAT TTT AGT GCT 1212 Asp Lys Arg Pro Thr Leu Lys Thr Phe Lys Asn Ala Tyr Phe Ser Ala 370 375 380
AGA AAA TAT CAA TTC ATG CTA GAG AGC TTT AAA ACT AAG CAA AAT GAC 1260 Arg Lys Tyr Gin Phe Met Leu Glu Ser Phe Lys Thr Lys Gin Asn Asp 385 390 395
CCC ACT TAC AAG CTT AAT GAT AAC ACT TAT GAG CTA GTG AGT AAG CAA 1308 Pro Thr Tyr Lys Leu Asn Asp Asn Thr Tyr Glu Leu Val Ser Lys Gin 400 405 410
CTA CAA GAC TAT CAA AAC ACC ATG CTT TTA TTA GCC AAA GAG AGA TTA 1356 Leu Gin Asp Tyr Gin Asn Thr Met Leu Leu Leu Ala Lys Glu Arg Leu 415 420 425 430
CTT TTT TTA GAA CAA GAT TTA AAA CAA AAA GAA GAA GAG TTT GAA AGA 1404 Leu Phe Leu Glu Gin Asp Leu Lys Gin Lys Glu Glu Glu Phe Glu Arg 435 440 445
GCC AAA GAA CAT TAT GTG AAA TCT TCA AAA CAT TAT AGA GAA ACT TCA 1452 Ala Lys Glu His Tyr Val Lys Ser Ser Lys His Tyr Arg Glu Thr Ser 450 455 460
TTG TCT CCA AAA GAA AAA CAA GGC TTT CTC AAA CAA ATT AAA CAA TTT 1500 Leu Ser Pro Lys Glu Lys Gin Gly Phe Leu Lys Gin He Lys Gin Phe 465 470 475
TCT AAA ATT TCT AAG GAT ATT CTC TAT ACT TGT AAT GAG ATC ATA GGA 1548 Ser Lys He Ser Lys Asp He Leu Tyr Thr Cys Asn Glu He He Gly 480 485 490
GCT AAT AGG TTT TTA ACC CAC TAT GAC AAC CTA AAC CTT GAA AAA GTC 1596 Ala Asn Arg Phe Leu Thr His Tyr Asp Asn Leu Asn Leu Glu Lys Val 495 500 505 510
CTA GAA CAC GCT AAA GAT ACT AAG CTA GAG CAA AAA GAA ATT CAA GCT 1644 Leu Glu His Ala Lys Asp Thr Lys Leu Glu Gin Lys Glu He Gin Ala 515 520 525
ATC ACA AAA GAG CCT AAT AAC GAT GAG CCT TGG ATT GAG TTT GGT AAA 1692 He Thr Lys Glu Pro Asn Asn Asp Glu Pro Trp He Glu Phe Gly Lys 530 535 540
AAA GAA CAA GCT AGA GCT AAA GCA CAC TAT CAA GCT ATG CTA GAA AAA 1740 Lys Glu Gin Ala Arg Ala Lys Ala His Tyr Gin Ala Met Leu Glu Lys 545 550 555
GAA AAA GCT AAA GAA TTA GCT AAA CAA CAA GCT AAC ACC TTG CAC TCT 1788 Glu Lys Ala Lys Glu Leu Ala Lys Gin Gin Ala Asn Thr Leu His Ser 560 565 570
AAT GAG CTT GAT GAT GAC CCT AAA GCT CAT GCT GGA TTA AAA CAA AAT 1836 Asn Glu Leu Asp Asp Asp Pro Lys Ala His Ala Gly Leu Lys Gin Asn 575 580 585 590
GAC AAC ACA AAC TTT AAA GGG CGT AAT AGA TAATGCTCTC AAGCGATGAT TGC 1889 Asp Asn Thr Asn Phe Lys Gly Arg Asn Arg 595 600
CTTTAATGTT CTTAATAAAG AATATACCCT TTGAAAGGGG TTTAT 1934
(2) INFORMATION FOR SEQ ID NO: 936:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 600 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 936:
Met Lys Arg Ser His Leu Glu Asn Ala Leu Asn Tyr Ala Leu Glu Asn
1 5 10 15
Ser Glu Thr Ala Tyr Asn Glu Met Phe Leu Glu Cys Asp Lys Gin Phe
20 25 30
He Leu Glu Ser Trp Leu Asn Asp Phe Asp Leu Thr Lys Asp Tyr Asn
35 40 45
Glu Thr Met His Leu Val Phe Ser He Lys Asp Lys Pro Asp Glu Glu
50 55 60
Thr Met Gin Gly Leu Leu His Ser Thr Trp Glu Ser Leu Lys He Arg 65 70 75 80
Leu Pro Glu Tyr Lys Phe Ala Leu Val Pro His Ala His Gin Asp His
85 90 95
Ala His He His Cys Phe He Asn Lys Thr Asn Gin Leu Thr Arg Arg
100 105 110
Arg Leu Arg Phe Lys Gly His Glu Asp Cys Lys Glu Phe Phe Asn Glu
115 120 125
Leu Arg Ser Glu Phe Ala Tyr Arg Leu Asn Asp His Leu Leu Ser Glu
130 135 140
Glu Tyr Leu Tyr Val Asn Glu Pro Lys Leu Lys Glu Leu Asp Asn He 145 150 155 160
Lys Gin Gin Leu Gin Asp Leu Glu Lys Glu Glu Lys Ala Leu Glu Gin
165 170 175
He Lys Ser Pro Gin Asp Glu Trp Asp Leu Asn Lys Ala Leu Gin Ser
180 185 190
Glu Tyr Leu Gin Glu Leu Lys Tyr Lys Asn Lys Ala Lys Ala Leu Asp
195 200 205
He Gin Asn Asn His Ser Thr Pro Leu Lys Gin Lys He Ser Glu Phe
210 215 220
Lys He Ala Leu Phe Asn His Lys Asp Thr Ser Asp Asp Glu Lys Glu 225 230 235 240
Gin Leu Asp He Asp Arg He Asp Lys Arg Lys Pro Val Ser Glu His
245 250 255
Leu Lys Asn Thr Asn Lys His Glu Leu Tyr Glu Leu Leu Gly Phe Tyr
260 265 270
Gin Lys Glu Leu Asp Lys Lys Gin Asn His Ser Ala Phe Lys Asn Phe
275 280 285
Ala He Leu Asn Gly Leu Asp Arg Asp Phe Glu Arg Glu Thr Asn Gly
290 295 300
Tyr Ser Val Leu Lys Lys Lys Glu Met Leu Leu Asn Lys Leu Glu His 305 310 315 320
Leu Asp Lys Arg Leu Leu Asp Lys Asn Ser His Leu Leu Leu Ala Gin
325 330 335
Leu Arg Asn Glu Val Lys Thr Lys Gin Asn He Gin Tyr Asn Thr Leu
340 345 350
Thr Asn Pro He Leu Leu Ala Lys Ala Leu Glu Leu Ser Lys Asp Lys
355 360 365
Arg Pro Thr Leu Lys Thr Phe Lys Asn Ala Tyr Phe Ser Ala Arg Lys
370 375 380
Tyr Gin Phe Met Leu Glu Ser Phe Lys Thr Lys Gin Asn Asp Pro Thr 385 390 395 400
Tyr Lys Leu Asn Asp Asn Thr Tyr Glu Leu Val Ser Lys Gin Leu Gin
405 410 415
Asp Tyr Gin Asn Thr Met Leu Leu Leu Ala Lys Glu Arg Leu Leu Phe 420 425 430
Leu Glu Gin Asp Leu Lys Gin Lys Glu Glu Glu Phe Glu Arg Ala Lys
435 440 445
Glu His Tyr Val Lys Ser Ser Lys His Tyr Arg Glu Thr Ser Leu Ser
450 455 460
Pro Lys Glu Lys Gin Gly Phe Leu Lys Gin He Lys Gin Phe Ser Lys 465 470 475 480
He Ser Lys Asp He Leu Tyr Thr Cys Asn Glu He He Gly Ala Asn
485 490 495
Arg Phe Leu Thr His Tyr Asp Asn Leu Asn Leu Glu Lys Val Leu Glu
500 505 510
His Ala Lys Asp Thr Lys Leu Glu Gin Lys Glu He Gin Ala He Thr
515 520 525
Lys Glu Pro Asn Asn Asp Glu Pro Trp He Glu Phe Gly Lys Lys Glu
530 535 540
Gin Ala Arg Ala Lys Ala His Tyr Gin Ala Met Leu Glu Lys Glu Lys 545 550 555 560
Ala Lys Glu Leu Ala Lys Gin Gin Ala Asn Thr Leu His Ser Asn Glu
565 570 575
Leu Asp Asp Asp Pro Lys Ala His Ala Gly Leu Lys Gin Asn Asp Asn
580 585 590
Thr Asn Phe Lys Gly Arg Asn Arg 595 600
(2) INFORMATION FOR SEQ ID NO: 937:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 884 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...840 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 937:
GGAGAATAAA ATTACAATAA A ATG GCG TTA GAA AAA AGT TAT AGT AAA AAC 51
Met Ala Leu Glu Lys Ser Tyr Ser Lys Asn 1 5 10
TTT GAA AGC GAT GAG CTT TTT GAT TAT GAG ATC ATC AAG CCC AAA AAG 99 Phe Glu Ser Asp Glu Leu Phe Asp Tyr Glu He He Lys Pro Lys Lys 15 20 25
ACG CTT AAG ATA CAA TAC ACT TAT GCT AAA CGC TAC TAT AAA GAA GTA 147 Thr Leu Lys He Gin Tyr Thr Tyr Ala Lys Arg Tyr Tyr Lys Glu Val 30 35 40 GAA AAG TTT GCT AAA AAT TTA ACC CAA CTG ACA CAA GAA GAA TTC ATG 195 Glu Lys Phe Ala Lys Asn Leu Thr Gin Leu Thr Gin Glu Glu Phe Met 45 50 55
CGT TTA AGA GAG CCA CAA AAA CAA GTG GTC ATC AAA AAC ATA GGC AAT 243 Arg Leu Arg Glu Pro Gin Lys Gin Val Val He Lys Asn He Gly Asn 60 65 70
ATG ACA CGC CTG CAT TCA AAA AGA GCG ATG GAT TAT ATC GCT AAA CAT 291 Met Thr Arg Leu His Ser Lys Arg Ala Met Asp Tyr He Ala Lys His 75 80 85 90
GGT GAG CTA GTG AGA GAT GAA TTT TTT AAT GAA GTT AAT TAT AAT GAC 339 Gly Glu Leu Val Arg Asp Glu Phe Phe Asn Glu Val Asn Tyr Asn Asp 95 100 105
ATA GCA GAG CAA TGG AAT GAG CAA TTT GAA AAA TTA TTA GAA AAT AAG 387 He Ala Glu Gin Trp Asn Glu Gin Phe Glu Lys Leu Leu Glu Asn Lys 110 115 120
AGC CGT GTT AAA AAT TGC GCT TTA CAT CTA GTG TTT AGC ATT GAT GAA 435 Ser Arg Val Lys Asn Cys Ala Leu His Leu Val Phe Ser He Asp Glu 125 130 135
AAT TGT AAT GAA AAA AAT TTA AAA GCT TTG GAA TTA AGC GTG TAT CAA 483 Asn Cys Asn Glu Lys Asn Leu Lys Ala Leu Glu Leu Ser Val Tyr Gin 140 145 150
ACA CTC ACT AAC ACG CTA GGT TAT GAT TAT CCT TTT ATA ATG AAA CTC 531 Thr Leu Thr Asn Thr Leu Gly Tyr Asp Tyr Pro Phe He Met Lys Leu 155 160 165 170
CAT ACA CAC CAA AAC AAT CCG CAT GCG CAT GTG ATT ATC AAC AAA ACT 579 His Thr His Gin Asn Asn Pro His Ala His Val He He Asn Lys Thr 175 180 185
AAC AAA ATT ACC AAT AAG CAA CTA TGC TTT AAT TCT AAA GAC AGC TGT 627 Asn Lys lie Thr Asn Lys Gin Leu Cys Phe Asn Ser Lys Asp Ser Cys 190 195 200
AAA GAG TTT TAC CAC ACA CTA AGA GAA ACA TTT AAA GAT TAT TTA TTT 675 Lys Glu Phe Tyr His Thr Leu Arg Glu Thr Phe Lys Asp Tyr Leu Phe 205 210 215
GCT AAC TCA AAA GGC GAA TTG CAA TAT TCT AAC ACG CCT AAT ATT TAT 723 Ala Asn Ser Lys Gly Glu Leu Gin Tyr Ser Asn Thr Pro Asn He Tyr 220 225 230
AAG GCG ATT AAA GAC ATA GAA ACA GAG CTA GAT GCA CTA GAA AAC AGG 771 Lys Ala He Lys Asp He Glu Thr Glu Leu Asp Ala Leu Glu Asn Arg 235 240 245 250
CTA GAA ACA ATA AGA GTT TTA GGC ATG AAA ACT ATT TTT ATA AAG TTT 819 Leu Glu Thr He Arg Val Leu Gly Met Lys Thr He Phe He Lys Phe 255 260 265 TGG GTA GTG CAA CTT CTC AAA TAGAAAGTTT GAAAAAAAGA GAAAATGCCC TATT 874 Trp Val Val Gin Leu Leu Lys 270
TGATCATTTA 884
(2) INFORMATION FOR SEQ ID NO: 938:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 273 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 938:
Met Ala Leu Glu Lys Ser Tyr Ser Lys Asn Phe Glu Ser Asp Glu Leu
1 5 10 15
Phe Asp Tyr Glu He He Lys Pro Lys Lys Thr Leu Lys He Gin Tyr
20 25 30
Thr Tyr Ala Lys Arg Tyr Tyr Lys Glu Val Glu Lys Phe Ala Lys Asn
35 40 45
Leu Thr Gin Leu Thr Gin Glu Glu Phe Met Arg Leu Arg Glu Pro Gin
50 55 60
Lys Gin Val Val He Lys Asn He Gly Asn Met Thr Arg Leu His Ser 65 70 75 80
Lys Arg Ala Met Asp Tyr He Ala Lys His Gly Glu Leu Val Arg Asp
85 90 95
Glu Phe Phe Asn Glu Val Asn Tyr Asn Asp He Ala Glu Gin Trp Asn
100 105 110
Glu Gin Phe Glu Lys Leu Leu Glu Asn Lys Ser Arg Val Lys Asn Cys
115 120 125
Ala Leu His Leu Val Phe Ser He Asp Glu Asn Cys Asn Glu Lys Asn
130 135 140
Leu Lys Ala Leu Glu Leu Ser Val Tyr Gin Thr Leu Thr Asn Thr Leu 145 150 155 160
Gly Tyr Asp Tyr Pro Phe He Met Lys Leu His Thr His Gin Asn Asn
165 170 175
Pro His Ala His Val He He Asn Lys Thr Asn Lys He Thr Asn Lys
180 185 190
Gin Leu Cys Phe Asn Ser Lys Asp Ser Cys Lys Glu Phe Tyr His Thr
195 200 205
Leu Arg Glu Thr Phe Lys Asp Tyr Leu Phe Ala Asn Ser Lys Gly Glu
210 215 220
Leu Gin Tyr Ser Asn Thr Pro Asn He Tyr Lys Ala He Lys Asp He 225 230 235 240
Glu Thr Glu Leu Asp Ala Leu Glu Asn Arg Leu Glu Thr He Arg Val
245 250 255
Leu Gly Met Lys Thr He Phe He Lys Phe Trp Val Val Gin Leu Leu
260 265 270
Lys (2) INFORMATION FOR SEQ ID NO: 939:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 557 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...519 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 939:
ATAAACAACC ATGACAAACT AACGGACTTT AAGCAATACC AAACAGAC ATG AAA GAA 57
Met Lys Glu 1
TTA CTA GGG ATA GAA ATA GAT GAA GAG CTG GAT ACT AAA CGA CTT ATC 105 Leu Leu Gly He Glu He Asp Glu Glu Leu Asp Thr Lys Arg Leu He 5 10 15
CCT ACT TAT TCC AAA TTG TAT TCT TTA AAA AAA TAC TCT AAA AAA TTT 153 Pro Thr Tyr Ser Lys Leu Tyr Ser Leu Lys Lys Tyr Ser Lys Lys Phe 20 25 30 35
AAA AGA TTA CAA AGA AAA CAA AGC CGT AGG GTG TTA AAG TCT AAA CAA 201 Lys Arg Leu Gin Arg Lys Gin Ser Arg Arg Val Leu Lys Ser Lys Gin 40 45 50
AAC AAA ACC AAA TTA GGA GGT AAT TTT TAC AAA ACC CAA AAG AAA TTA 249 Asn Lys Thr Lys Leu Gly Gly Asn Phe Tyr Lys Thr Gin Lys Lys Leu 55 60 65
AAC CAA GCC TTT GAC AAG TCT AGT CAT CAA AAA ACA GAC AGA TAC CAT 297 Asn Gin Ala Phe Asp Lys Ser Ser His Gin Lys Thr Asp Arg Tyr His 70 75 80
AAA ATC ACA AGC GAA CTT TCA AAG CAA TTT GAA TTG ATA GTA GTT GAA 345 Lys He Thr Ser Glu Leu Ser Lys Gin Phe Glu Leu He Val Val Glu 85 90 95
GAT TTG CAA GTA AAA AAC ATG ACT AAA AGA GCT AAA CTC AAA AAT GTT 393 Asp Leu Gin Val Lys Asn Met Thr Lys Arg Ala Lys Leu Lys Asn Val 100 105 110 115
AAA CAA AAG AGT GGG CTT AAT CAA TCT ATT TTA AAC GCT TCA TTC TAT 441 Lys Gin Lys Ser Gly Leu Asn Gin Ser He Leu Asn Ala Ser Phe Tyr 120 125 130 CAA ATC ATC TCT TTT TTA GAC TAC AAA CAA CAG CAT AAT GGC AAA TTG 489 Gin He He Ser Phe Leu Asp Tyr Lys Gin Gin His Asn Gly Lys Leu 135 140 145
TTA GTG AAA GTT TCC CCC ACA ATA TAC GAG TAAAACTTGC CATTGTTGTG GGA 542 Leu Val Lys Val Ser Pro Thr He Tyr Glu 150 155
ATATCAACCA CAAGC 557
(2) INFORMATION FOR SEQ ID NO: 940:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 940:
Met Lys Glu Leu Leu Gly He Glu He Asp Glu Glu Leu Asp Thr Lys
1 5 10 15
Arg Leu He Pro Thr Tyr Ser Lys Leu Tyr Ser Leu Lys Lys Tyr Ser
20 25 30
Lys Lys Phe Lys Arg Leu Gin Arg Lys Gin Ser Arg Arg Val Leu Lys
35 40 45
Ser Lys Gin Asn Lys Thr Lys Leu Gly Gly Asn Phe Tyr Lys Thr Gin
50 55 60
Lys Lys Leu Asn Gin Ala Phe Asp Lys Ser Ser His Gin Lys Thr Asp 65 70 75 80
Arg Tyr His Lys He Thr Ser Glu Leu Ser Lys Gin Phe Glu Leu He
85 90 95
Val Val Glu Asp Leu Gin Val Lys Asn Met Thr Lys Arg Ala Lys Leu
100 105 110
Lys Asn Val Lys Gin Lys Ser Gly Leu Asn Gin Ser He Leu Asn Ala
115 120 125
Ser Phe Tyr Gin He He Ser Phe Leu Asp Tyr Lys Gin Gin His Asn
130 135 140
Gly Lys Leu Leu Val Lys Val Ser Pro Thr He Tyr Glu 145 150 155
(2) INFORMATION FOR SEQ ID NO: 941:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 889 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence (B) LOCATION: 52...843 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 941:
ATTACATTCT TTTTGATTTC TATTGAAAAA TTTAATATTA AGAGGACTTT T ATG AAA 57
Met Lys
1
AAA TCA AAT GAC AAT AAC GCA CTC GCC AGA AGT CAA AGG GAG TTG TTT 105 Lys Ser Asn Asp Asn Asn Ala Leu Ala Arg Ser Gin Arg Glu Leu Phe 5 10 15
GTA GGG ATT AGG GAT TTT ATT GTT TTT AAA TTT AAG CGT ATG GTT GTT 153 Val Gly He Arg Asp Phe He Val Phe Lys Phe Lys Arg Met Val Val 20 25 30
TTT AAC GGA GTA AGG GAT TTT ACA AAA ATG AGA TTT TTG TCC ATA GAA 201 Phe Asn Gly Val Arg Asp Phe Thr Lys Met Arg Phe Leu Ser He Glu 35 40 45 50
TTA GAA AAA TGC GAA AAT ATT AAA GAT TTG GAA AAA TTA TGT CAT ACA 249 Leu Glu Lys Cys Glu Asn He Lys Asp Leu Glu Lys Leu Cys His Thr 55 60 65
ATT TAT AAT CAA GGC ACA AAG CAT ATT TTG ATG ATG CGT GTA TTG TTT 297 He Tyr Asn Gin Gly Thr Lys His He Leu Met Met Arg Val Leu Phe 70 75 80
TTA TTC TTT GAC TAT TTC TGC AAG CAT TTG AAA GTT AAG CGA TTG AGA 345 Leu Phe Phe Asp Tyr Phe Cys Lys His Leu Lys Val Lys Arg Leu Arg 85 90 95
CTA CTC AAT GAA GAA ATG CTT GTG AAT TTT TTA TTT GAG TTA GCT AAA 393 Leu Leu Asn Glu Glu Met Leu Val Asn Phe Leu Phe Glu Leu Ala Lys 100 105 110
CAA AGA AAA ATT AAT TCA ATG GCA AAA TAT GTG ATG TAT ATT AGG CAA 441 Gin Arg Lys He Asn Ser Met Ala Lys Tyr Val Met Tyr He Arg Gin 115 120 125 130
TTT TTT GAT TAC TTG GAT AGG ACT AAA CAT TAT GAA TTT TAT TTT AGT 489 Phe Phe Asp Tyr Leu Asp Arg Thr Lys His Tyr Glu Phe Tyr Phe Ser 135 140 145
CTT AAA AAT ATA GCC TTT GCT AAA CAC AAG GAT AAT TTG CCT AAG CAT 537 Leu Lys Asn He Ala Phe Ala Lys His Lys Asp Asn Leu Pro Lys His 150 155 160
CTA AAT TCA AAA GAT TTA AAA TCT TTT ATA TAT ACT CTT ATA AAC TAT 585 Leu Asn Ser Lys Asp Leu Lys Ser Phe He Tyr Thr Leu He Asn Tyr 165 170 175 AGA ACT AGA AGC AGT TAT GAA AAG AGA AAT AAG TGT ATT TTG CTC TTG 633 Arg Thr Arg Ser Ser Tyr Glu Lys Arg Asn Lys Cys He Leu Leu Leu 180 185 190
ATT ATT TTG GGT GGT TTG AGA AAA TCT GAG GTT TTT AAT TTA GAA TTG 681 He He Leu Gly Gly Leu Arg Lys Ser Glu Val Phe Asn Leu Glu Leu 195 200 205 210
AGA AAT ATT GTT TTA GAG AAA GAG CAT TAT ATC TTG CTT ATA AAA GGC 729 Arg Asn He Val Leu Glu Lys Glu His Tyr He Leu Leu He Lys Gly 215 220 225
AAA AAC AAT AAA GAG CGA AAA GCG TTC ATT AAA ATC GCT CAA ACA GAT 777 Lys Asn Asn Lys Glu Arg Lys Ala Phe He Lys He Ala Gin Thr Asp 230 235 240
ATT GAC ACA CTC GCA CCG CTT ATC CGT ATC CTT TTG GAA AGT ATT GCT 825 He Asp Thr Leu Ala Pro Leu He Arg He Leu Leu Glu Ser He Ala 245 250 255
AAA AAT CTT TTA TCC CAC TAGCGCGAAA AACTCCGTCC TTTAGGGCGG AGATGTAA 881 Lys Asn Leu Leu Ser His 260
GCGTTTAG 889
(2) INFORMATION FOR SEQ ID NO:942:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 264 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 942:
Met Lys Lys Ser Asn Asp Asn Asn Ala Leu Ala Arg Ser Gin Arg Glu
1 5 10 15
Leu Phe Val Gly He Arg Asp Phe He Val Phe Lys Phe Lys Arg Met
20 25 30
Val Val Phe Asn Gly Val Arg Asp Phe Thr Lys Met Arg Phe Leu Ser
35 40 45
He Glu Leu Glu Lys Cys Glu Asn He Lys Asp Leu Glu Lys Leu Cys
50 55 60
His Thr He Tyr Asn Gin Gly Thr Lys His He Leu Met Met Arg Val 65 70 75 80
Leu Phe Leu Phe Phe Asp Tyr Phe Cys Lys His Leu Lys Val Lys Arg
85 90 95
Leu Arg Leu Leu Asn Glu Glu Met Leu Val Asn Phe Leu Phe Glu Leu
100 105 110
Ala Lys Gin Arg Lys He Asn Ser Met Ala Lys Tyr Val Met Tyr He
115 120 125
Arg Gin Phe Phe Asp Tyr Leu Asp Arg Thr Lys His Tyr Glu Phe Tyr 130 135 140
Phe Ser Leu Lys Asn He Ala Phe Ala Lys His Lys Asp Asn Leu Pro 145 150 155 160
Lys His Leu Asn Ser Lys Asp Leu Lys Ser Phe He Tyr Thr Leu He
165 170 175
Asn Tyr Arg Thr Arg Ser Ser Tyr Glu Lys Arg Asn Lys Cys He Leu
180 185 190
Leu Leu He He Leu Gly Gly Leu Arg Lys Ser Glu Val Phe Asn Leu
195 200 205
Glu Leu Arg Asn He Val Leu Glu Lys Glu His Tyr He Leu Leu He
210 215 220
Lys Gly Lys Asn Asn Lys Glu Arg Lys Ala Phe He Lys He Ala Gin 225 230 235 240
Thr Asp He Asp Thr Leu Ala Pro Leu He Arg He Leu Leu Glu Ser
245 250 255
He Ala Lys Asn Leu Leu Ser His 260
(2) INFORMATION FOR SEQ ID NO:943:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 546 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 75...530 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:943:
TTGACAGAGT GGATTTTTTT ATTTCTAACG CTATTATTTA TGGGCGTTCT GTCGTGGGGG 60
GATTTGCACC GTTT ATG CGA TTA AAA CCT AAG GGG TTA AAC AAC ATT TAC 110
Met Arg Leu Lys Pro Lys Gly Leu Asn Asn He Tyr 1 5 10
ACA GCC ACC GTG TTA GCG TTC GTC GTA GGG GCT CAA GAA GCG GCA AAA 158 Thr Ala Thr Val Leu Ala Phe Val Val Gly Ala Gin Glu Ala Ala Lys 15 20 25
CGC ATG CAA AAA ATA GGC GGT GGG GCG ATC GTG AGC TTA AGT TCT ACC 206 Arg Met Gin Lys He Gly Gly Gly Ala He Val Ser Leu Ser Ser Thr 30 35 40
GGG AAT CTA GTT TAT ATG CCT AAT TAC GCC GGG CAT GGC AAT TCC AAA 254 Gly Asn Leu Val Tyr Met Pro Asn Tyr Ala Gly His Gly Asn Ser Lys 45 50 55 60
AAC GCC GTA GAA ACC ATG GTC AAA TAC GCT GCC GTG GAT TTA GGC GAA 302 Asn Ala Val Glu Thr Met Val Lys Tyr Ala Ala Val Asp Leu Gly Glu 65 70 75
TTT AAC ATT AGA GTG AAT GCG GTT AGT GGC GGG CCT ATT GAT ACG GAC 350 Phe Asn He Arg Val Asn Ala Val Ser Gly Gly Pro He Asp Thr Asp 80 85 90
GCT TTG AAA GCC TTC CCT GAT TAT GTG GAG ATT AAA GAA AAA GTA GAA 398 Ala Leu Lys Ala Phe Pro Asp Tyr Val Glu He Lys Glu Lys Val Glu 95 100 105
GAG CAA TCG CCC CTA AAA CGC ATG GGC AAT CCT AAC GAT CTA GCC GGA 446 Glu Gin Ser Pro Leu Lys Arg Met Gly Asn Pro Asn Asp Leu Ala Gly 110 115 120
GCG GCT TAT TTT TTA TGC GAT GAG ACC CAA AGC GGT TGG CTT ACA GGG 494 Ala Ala Tyr Phe Leu Cys Asp Glu Thr Gin Ser Gly Trp Leu Thr Gly 125 130 135 140
CAA ACG ATC GTT GTA GAT GGC GGG ACT ACT TTT AAA TAAAGATATT TCTTGC 546 Gin Thr He Val Val Asp Gly Gly Thr Thr Phe Lys 145 150
546
(2) INFORMATION FOR SEQ ID NO: 944:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 152 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 944:
Met Arg Leu Lys Pro Lys Gly Leu Asn Asn He Tyr Thr Ala Thr Val
1 5 10 15
Leu Ala Phe Val Val Gly Ala Gin Glu Ala Ala Lys Arg Met Gin Lys
20 25 30
He Gly Gly Gly Ala He Val Ser Leu Ser Ser Thr Gly Asn Leu Val
35 40 45
Tyr Met Pro Asn Tyr Ala Gly His Gly Asn Ser Lys Asn Ala Val Glu
50 55 60
Thr Met Val Lys Tyr Ala Ala Val Asp Leu Gly Glu Phe Asn He Arg 65 70 75 80
Val Asn Ala Val Ser Gly Gly Pro He Asp Thr Asp Ala Leu Lys Ala
85 90 95
Phe Pro Asp Tyr Val Glu He Lys Glu Lys Val Glu Glu Gin Ser Pro
100 105 110
Leu Lys Arg Met Gly Asn Pro Asn Asp Leu Ala Gly Ala Ala Tyr Phe
115 120 125
Leu Cys Asp Glu Thr Gin Ser Gly Trp Leu Thr Gly Gin Thr He Val 130 135 140 Val Asp Gly Gly Thr Thr Phe Lys 145 150
(2) INFORMATION FOR SEQ ID NO:945:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 644 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 2...616 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 945:
A AGA TAT TTC TTG CAA AAC ATT ATC CAC ATC CAC CAA AAC AAA GAG TTG 49 Arg Tyr Phe Leu Gin Asn He He His He His Gin Asn Lys Glu Leu 1 5 10 15
CAA TTC ATT AAA AAA TGC TTG TTG GGC TAT TTT TTC GCC CCT TTG TGT 97 Gin Phe He Lys Lys Cys Leu Leu Gly Tyr Phe Phe Ala Pro Leu Cys 20 25 30
GGG GCT ATT CTG TTA GTG CTT TTT ATT GTT TCA AGC GGG GCA AAA TCG 145 Gly Ala He Leu Leu Val Leu Phe He Val Ser Ser Gly Ala Lys Ser 35 40 45
TTT CAA ATT TCT AAT CTC TTT AAC AAT CAA CTA GCC TAT ATC GTT TTG 193 Phe Gin He Ser Asn Leu Phe Asn Asn Gin Leu Ala Tyr He Val Leu 50 55 60
TTG TCT CTT TTT TTG TGC GCG CTT GGG TTT ATT GCC GGA GCG ATT GGT 241 Leu Ser Leu Phe Leu Cys Ala Leu Gly Phe He Ala Gly Ala He Gly 65 70 75 80
TTT TAT AGG CTT TCT AAA ATC ACA CGC CAT CTG AGT TTT TTT GAA AAT 289 Phe Tyr Arg Leu Ser Lys He Thr Arg His Leu Ser Phe Phe Glu Asn 85 90 95
TTC GCT TTC AGT TTT TTA GCG GTG ATT TTA TGC GCT ATT TTA AGC TAT 337 Phe Ala Phe Ser Phe Leu Ala Val He Leu Cys Ala He Leu Ser Tyr 100 105 110
CTT GTC CCT AAC GCC AGT AAC GCT CTT TCG CTA ATC GGT AAT GGC GTT 385 Leu Val Pro Asn Ala Ser Asn Ala Leu Ser Leu He Gly Asn Gly Val 115 120 125
TCT ATT TTT TAT TTG CAC AAA CTC TAT AGA GAA TTG AGC CTT TAC ACG 433 Ser He Phe Tyr Leu His Lys Leu Tyr Arg Glu Leu Ser Leu Tyr Thr 130 135 140
CAA GAA AGG TTT TTT TTA AGC GGG TTT AGG TTG TTG CTT TTT AGT TTC 481 Gin Glu Arg Phe Phe Leu Ser Gly Phe Arg Leu Leu Leu Phe Ser Phe 145 150 155 160
ATG CTG GCT CTT TTA GGG ATT TTA GTG CAA GCG TTA GTT ATC ATT TTT 529 Met Leu Ala Leu Leu Gly He Leu Val Gin Ala Leu Val He He Phe 165 170 175
TTA ACG ACC GCT GTG GTT TTA ATG TGT GTG GCG CTT GGT TTT TTG GCG 577 Leu Thr Thr Ala Val Val Leu Met Cys Val Ala Leu Gly Phe Leu Ala 180 185 190
CGC GCG TTT TTG AAT TTT TCA CAA GTC TTT TTG AAA GCA TGAAAGTTTT AA 628 Arg Ala Phe Leu Asn Phe Ser Gin Val Phe Leu Lys Ala 195 200 205
AACTCCTGCC TAATTT 644
(2) INFORMATION FOR SEQ ID NO: 946:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 205 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 946:
Arg Tyr Phe Leu Gin Asn He He His He His Gin Asn Lys Glu Leu
1 5 10 15
Gin Phe He Lys Lys Cys Leu Leu Gly Tyr Phe Phe Ala Pro Leu Cys
20 25 30
Gly Ala He Leu Leu Val Leu Phe He Val Ser Ser Gly Ala Lys Ser
35 40 45
Phe Gin He Ser Asn Leu Phe Asn Asn Gin Leu Ala Tyr He Val Leu
50 55 60
Leu Ser Leu Phe Leu Cys Ala Leu Gly Phe He Ala Gly Ala He Gly 65 70 75 80
Phe Tyr Arg Leu Ser Lys He Thr Arg His Leu Ser Phe Phe Glu Asn
85 90 95
Phe Ala Phe Ser Phe Leu Ala Val He Leu Cys Ala He Leu Ser Tyr
100 105 110
Leu Val Pro Asn Ala Ser Asn Ala Leu Ser Leu He Gly Asn Gly Val
115 120 125
Ser He Phe Tyr Leu His Lys Leu Tyr Arg Glu Leu Ser Leu Tyr Thr
130 135 140
Gin Glu Arg Phe Phe Leu Ser Gly Phe Arg Leu Leu Leu Phe Ser Phe 145 150 155 160
Met Leu Ala Leu Leu Gly He Leu Val Gin Ala Leu Val He He Phe 165 170 175 Leu Thr Thr Ala Val Val Leu Met Cys Val Ala Leu Gly Phe Leu Ala
180 185 190
Arg Ala Phe Leu Asn Phe Ser Gin Val Phe Leu Lys Ala 195 200 205
(2) INFORMATION FOR SEQ ID NO: 47:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 630 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 62...598 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 947:
GTGTGGCGCT TGGTTTTTTG GCGCGCGCGT TTTTGAATTT TTCACAAGTC TTTTTGAAAG 60
C ATG AAA GTT TTA AAA CTC CTG CCT AAT TTT TTA ACA ATT TTA CGC ATT 109 Met Lys Val Leu Lys Leu Leu Pro Asn Phe Leu Thr He Leu Arg He 1 5 10 15
GTC TTA TCC TTA TTT TTA TTA TTT TTA TTG TTA AAC ACG CGC ACT TAT 157 Val Leu Ser Leu Phe Leu Leu Phe Leu Leu Leu Asn Thr Arg Thr Tyr 20 25 30
TTT AGT TTT TTA ACC CCC TTT CAA ACC AAT ATG ATC TCT TCA TTG GTT 205 Phe Ser Phe Leu Thr Pro Phe Gin Thr Asn Met He Ser Ser Leu Val 35 40 45
TTT TTG TTT GCC GCG CTC ACG GAT TTA TTG GAC GGC TAC ATC GCT AGA 253 Phe Leu Phe Ala Ala Leu Thr Asp Leu Leu Asp Gly Tyr He Ala Arg 50 55 60
AGC TAT AAA GCC AAA TCG CGC TTT GGG GAA ATC TTT GAT CCT TTA GCG 301 Ser Tyr Lys Ala Lys Ser Arg Phe Gly Glu He Phe Asp Pro Leu Ala 65 70 75 80
GAT AAA ATC CTT ATT TTG AGC GCG TTT TTA GGG TTA GTT TAT TTG GAT 349 Asp Lys He Leu He Leu Ser Ala Phe Leu Gly Leu Val Tyr Leu Asp 85 90 95
CGT GTG AAT GCG TGG ATC CCG TTT GTG ATT TTA GGG CGT GAA TTT TTT 397 Arg Val Asn Ala Trp He Pro Phe Val He Leu Gly Arg Glu Phe Phe 100 105 110
ATT TCA GGG CTT AGA GTC TTA GCC GCT AAT GAG AAA AAG GAT ATT CCT 445 He Ser Gly Leu Arg Val Leu Ala Ala Asn Glu Lys Lys Asp He Pro 115 120 125
GTC AAT GCG TTA GGC AAG TAT AAA ACC GTT TCT CAA GTC GTG GCG ATT 493 Val Asn Ala Leu Gly Lys Tyr Lys Thr Val Ser Gin Val Val Ala He 130 135 140
GGT GCT TTA TTG GCT GAT GTA ACT TAC TCT TAT GCG CTT GTG GCT ATA 541 Gly Ala Leu Leu Ala Asp Val Thr Tyr Ser Tyr Ala Leu Val Ala He 145 150 155 160
GCG GTT TTT TTA ACC CTT TAT TCG GGG ATA GAT TAC ACC ATT AAA TAT 589 Ala Val Phe Leu Thr Leu Tyr Ser Gly He Asp Tyr Thr He Lys Tyr 165 170 175
TAT AAA TCT TAATATTTTA AAAGAAGTTT TTAGCGTTCT TT 630
Tyr Lys Ser
(2) INFORMATION FOR SEQ ID NO: 948:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 179 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 948:
Met Lys Val Leu Lys Leu Leu Pro Asn Phe Leu Thr He Leu Arg He
1 5 10 15
Val Leu Ser Leu Phe Leu Leu Phe Leu Leu Leu Asn Thr Arg Thr Tyr
20 25 30
Phe Ser Phe Leu Thr Pro Phe Gin Thr Asn Met He Ser Ser Leu Val
35 40 45
Phe Leu Phe Ala Ala Leu Thr Asp Leu Leu Asp Gly Tyr He Ala Arg
50 55 60
Ser Tyr Lys Ala Lys Ser Arg Phe Gly Glu He Phe Asp Pro Leu Ala 65 70 75 80
Asp Lys He Leu He Leu Ser Ala Phe Leu Gly Leu Val Tyr Leu Asp
85 90 95
Arg Val Asn Ala Trp He Pro Phe Val He Leu Gly Arg Glu Phe Phe
100 105 110
He Ser Gly Leu Arg Val Leu Ala Ala Asn Glu Lys Lys Asp He Pro
115 120 125
Val Asn Ala Leu Gly Lys Tyr Lys Thr Val Ser Gin Val Val Ala He
130 135 140
Gly Ala Leu Leu Ala Asp Val Thr Tyr Ser Tyr Ala Leu Val Ala He 145 150 155 160
Ala Val Phe Leu Thr Leu Tyr Ser Gly He Asp Tyr Thr He Lys Tyr
165 170 175
Tyr Lys Ser (2) INFORMATION FOR SEQ ID NO: 949:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 913 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...879 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 949:
ATAGAAGAAG AGTGAGAA ATG CAA GAT TTT ATT AAG ATT TTT ATT CAA GAG 51
Met Gin Asp Phe He Lys He Phe He Gin Glu 1 5 10
GTT GTC TCT ACT TTA GAA GGG TTA GTG GGT AAG GCT CCA AGC GTG GGA 99 Val Val Ser Thr Leu Glu Gly Leu Val Gly Lys Ala Pro Ser Val Gly 15 20 25
TTA GAA AAA GAA ATT TCT AGT AGC GAC GAA TCT TTT TTG AAA TTA ATC 147 Leu Glu Lys Glu He Ser Ser Ser Asp Glu Ser Phe Leu Lys Leu He 30 35 40
AGC ACG CCT TAT GCA AGA GTT GTG ATA AGC GCG ATT GAA AAA GAA GAG 195 Ser Thr Pro Tyr Ala Arg Val Val He Ser Ala He Glu Lys Glu Glu 45 50 55
AGC TCT ATT GAA TTA CTG GCT CCG GTA GTT TTA GTT ACC TCT TTA AGC 243 Ser Ser He Glu Leu Leu Ala Pro Val Val Leu Val Thr Ser Leu Ser 60 65 70 75
GAT TTG ATG CTA GGA GGT GAG GGA GCG AGT AAG GAA GAA ATG GAT AAT 291 Asp Leu Met Leu Gly Gly Glu Gly Ala Ser Lys Glu Glu Met Asp Asn 80 85 90
GAC GAT TTA GAC GCT TTT AAA GAA ATG GCT TCT AAT ATT TTT GGC GCG 339 Asp Asp Leu Asp Ala Phe Lys Glu Met Ala Ser Asn He Phe Gly Ala 95 100 105
ATC GCT ACA AGC TTG AAG TCT CAA GAA TTG CTC CCT AAA CTC AAT TTC 387 He Ala Thr Ser Leu Lys Ser Gin Glu Leu Leu Pro Lys Leu Asn Phe 110 115 120
ACC ACT ATA AAC GCT GAA ATC GCT AAA GAG CTT CCT AAA AAA GAA GAT 435 Thr Thr He Asn Ala Glu He Ala Lys Glu Leu Pro Lys Lys Glu Asp 125 130 135 TAC GCT AAA GCG ATG GTG TTT TCT TTT AAA ATG GAA GCC ATC AAA GAA 483 Tyr Ala Lys Ala Met Val Phe Ser Phe Lys Met Glu Ala He Lys Glu 140 145 150 155
AGC CAA ATC ATT TTA TTG ACT ACG GCG GCT TTT GAG GGC CAA TTT GAA 531 Ser Gin He He Leu Leu Thr Thr Ala Ala Phe Glu Gly Gin Phe Glu 160 165 170
AAA ACG CAT AAA GAA GAA AAA GAA GAA ACG ACA GAG GGC GTT GCT GAA 579 Lys Thr His Lys Glu Glu Lys Glu Glu Thr Thr Glu Gly Val Ala Glu 175 180 185
GAG GTT AAA ACC CAT GAT GCG TCT TTA GAA AAC ATA GAA ATC CGC AAT 627 Glu Val Lys Thr His Asp Ala Ser Leu Glu Asn He Glu He Arg Asn 190 195 200
ATC AGC ATG CTT TTA GAC GTG AAA TTG AAC GTT AAG GTG CGC ATC GGG 675 He Ser Met Leu Leu Asp Val Lys Leu Asn Val Lys Val Arg He Gly 205 210 215
CAA AAA AAA ATG ATT TTA AAA GAC GTG GTC TCT ATG GAT ATA GGG AGC 723 Gin Lys Lys Met He Leu Lys Asp Val Val Ser Met Asp He Gly Ser 220 225 230 235
GTG GTA GAG CTG GAT CAA TTG GTG AAT GAC CCT TTG GAA ATT CTT GTA 771 Val Val Glu Leu Asp Gin Leu Val Asn Asp Pro Leu Glu He Leu Val 240 245 250
GAT GAC AAG GTG ATC GCT AAG GGC GAA GTG GTG ATT GTG GAT GGG AAT 819 Asp Asp Lys Val He Ala Lys Gly Glu Val Val He Val Asp Gly Asn 255 260 265
TTT GGC ATT CAA ATC ACG GAT ATT GGC ACT AAA AAA GAA CGC TTA GAA 867 Phe Gly He Gin He Thr Asp He Gly Thr Lys Lys Glu Arg Leu Glu 270 275 280
CAA TTG AAA CAT TAAATCTTTT TATCATAAAA AGGAAAGGGA TATG 913
Gin Leu Lys His 285
(2) INFORMATION FOR SEQ ID NO: 950:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 287 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 950:
Met Gin Asp Phe He Lys He Phe He Gin Glu Val Val Ser Thr Leu 1 5 10 15 Glu Gly Leu Val Gly Lys Ala Pro Ser Val Gly Leu Glu Lys Glu He
20 25 30
Ser Ser Ser Asp Glu Ser Phe Leu Lys Leu He Ser Thr Pro Tyr Ala
35 40 45
Arg Val Val He Ser Ala He Glu Lys Glu Glu Ser Ser He Glu Leu
50 55 60
Leu Ala Pro Val Val Leu Val Thr Ser Leu Ser Asp Leu Met Leu Gly 65 70 75 80
Gly Glu Gly Ala Ser Lys Glu Glu Met Asp Asn Asp Asp Leu Asp Ala
85 90 95
Phe Lys Glu Met Ala Ser Asn He Phe Gly Ala He Ala Thr Ser Leu
100 105 110
Lys Ser Gin Glu Leu Leu Pro Lys Leu Asn Phe Thr Thr He Asn Ala
115 120 125
Glu He Ala Lys Glu Leu Pro Lys Lys Glu Asp Tyr Ala Lys Ala Met
130 135 140
Val Phe Ser Phe Lys Met Glu Ala He Lys Glu Ser Gin He He Leu 145 150 155 160
Leu Thr Thr Ala Ala Phe Glu Gly Gin Phe Glu Lys Thr His Lys Glu
165 170 175
Glu Lys Glu Glu Thr Thr Glu Gly Val Ala Glu Glu Val Lys Thr His
180 185 190
Asp Ala Ser Leu Glu Asn He Glu He Arg Asn He Ser Met Leu Leu
195 200 205
Asp Val Lys Leu Asn Val Lys Val Arg He Gly Gin Lys Lys Met He
210 215 220
Leu Lys Asp Val Val Ser Met Asp He Gly Ser Val Val Glu Leu Asp 225 230 235 240
Gin Leu Val Asn Asp Pro Leu Glu He Leu Val Asp Asp Lys Val He
245 250 255
Ala Lys Gly Glu Val Val He Val Asp Gly Asn Phe Gly He Gin He
260 265 270
Thr Asp He Gly Thr Lys Lys Glu Arg Leu Glu Gin Leu Lys His 275 280 285
(2) INFORMATION FOR SEQ ID NO: 951:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1111 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 13...1056 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 951: TAAAAAAGGA GA ATG ATG CAA GTT TAC CAC CTT TCA CAC ATT GAT TTA GAC 51 Met Met Gin Val Tyr His Leu Ser His He Asp Leu Asp
1 5 10
GGC TAT GCA TGC CAG CTT GTT TCA AAA CAA TTT TTT AAA AAT ATC CAA 99 Gly Tyr Ala Cys Gin Leu Val Ser Lys Gin Phe Phe Lys Asn He Gin 15 20 25
TGC TAT AAC GCT AAT TAC GGG CGT GAA GTC TCA GCG AGA ATT TAT GAG 147 Cys Tyr Asn Ala Asn Tyr Gly Arg Glu Val Ser Ala Arg He Tyr Glu 30 35 40 45
ATT TTA AAC GCA ATC GCT CAG TCT AAA GAG AGT GAA TTC CTT ATT TTG 195 He Leu Asn Ala He Ala Gin Ser Lys Glu Ser Glu Phe Leu He Leu 50 55 60
GTT AGC GAT TTG AAT CTG AAT TTG AAT GAA GCA GAG TAT TTG CAG GAT 243 Val Ser Asp Leu Asn Leu Asn Leu Asn Glu Ala Glu Tyr Leu Gin Asp 65 70 75
AAG ATC CAA GAA CAC CGC TTG CAA AAT AAA AAC ATT CAA ATC CAG CTT 291 Lys He Gin Glu His Arg Leu Gin Asn Lys Asn He Gin He Gin Leu 80 85 90
TTA GAT CAC CAT ATC AGC GGT AAG GAA GTG GCT GAG AGT TTC CAT TGG 339 Leu Asp His His He Ser Gly Lys Glu Val Ala Glu Ser Phe His Trp 95 100 105
TAT TTT TTA GAC ATT AAC CGT TGC GCG ACT AAA ATC GTG TAT GAA TTT 387 Tyr Phe Leu Asp He Asn Arg Cys Ala Thr Lys He Val Tyr Glu Phe 110 115 120 125
TTG AAA AAG CAT TAC GCT ATT TTA GAG CCA AAA AAC ACA ACA TGG CTA 435 Leu Lys Lys His Tyr Ala He Leu Glu Pro Lys Asn Thr Thr Trp Leu 130 135 140
GAG CCT TTA GTG GAA ATG GTC AAT TCT GTG GAT ATT TGG GAC ACG CAA 483 Glu Pro Leu Val Glu Met Val Asn Ser Val Asp He Trp Asp Thr Gin 145 150 155
GGT TAT GGC TTT GAA TTA GGC AAG GTG TGC ATG CGC ATG ATT AAC CAA 531 Gly Tyr Gly Phe Glu Leu Gly Lys Val Cys Met Arg Met He Asn Gin 160 165 170
AGC TCT GAA TTG AAT CGT TTC ATG TTT GAT GAT GAA AAC CGC AAC TAT 579 Ser Ser Glu Leu Asn Arg Phe Met Phe Asp Asp Glu Asn Arg Asn Tyr 175 180 185
AAA TTA AAG CTT TTA GAA GAA GTT AAA AAC TAT TTG TTT TTA GAA AAT 627 Lys Leu Lys Leu Leu Glu Glu Val Lys Asn Tyr Leu Phe Leu Glu Asn 190 195 200 205
GCC CCT GTA GCC TAT GAT AAC GAT TTG TTC AAA CTC AAA AAA ATC GCT 675 Ala Pro Val Ala Tyr Asp Asn Asp Leu Phe Lys Leu Lys Lys He Ala 210 215 220 TTA GGG GGC GAC CCT GAT GCA GAA ACG ATG GAC AAT ATC TCT TCA AAC 723 Leu Gly Gly Asp Pro Asp Ala Glu Thr Met Asp Asn He Ser Ser Asn 225 230 235
GCG CAA ACG CAT TTG CTC TCT TTA AAA AAG CAT GAT TGC AGC GTT TAT 771 Ala Gin Thr His Leu Leu Ser Leu Lys Lys His Asp Cys Ser Val Tyr 240 245 250
TAC CAG GAT AAA AAA GGG TTT TTA AGT TAT TCT ATG GGG GGC ATT AGC 819 Tyr Gin Asp Lys Lys Gly Phe Leu Ser Tyr Ser Met Gly Gly He Ser 255 260 265
GTG TTG GCT AAC CTT TTT TTA ACG CAA AAT CCG GAT TTT GAT TTT TAT 867 Val Leu Ala Asn Leu Phe Leu Thr Gin Asn Pro Asp Phe Asp Phe Tyr 270 275 280 285
ATG GAT GTG AAC GCT AAA GGG AAT GTG AGC TTA AGG GCG AAT GGG AAT 915 Met Asp Val Asn Ala Lys Gly Asn Val Ser Leu Arg Ala Asn Gly Asn 290 295 300
TGC GAT GTG TGC GAA CTC AGT CAA ATG TGT TTT AAT GGG GGT GGG CAT 963 Cys Asp Val Cys Glu Leu Ser Gin Met Cys Phe Asn Gly Gly Gly His 305 310 315
AGG AAT GCG AGC GGA GGC AAG ATT GAT GGT TTT AGG GAG AGT TTC AAT 1011 Arg Asn Ala Ser Gly Gly Lys He Asp Gly Phe Arg Glu Ser Phe Asn 320 325 330
TAT AGG GAT ATT AAA GAA CAA ATT GAA GAA ATC TTC AAC AAC GCT TAAAA 1061 Tyr Arg Asp He Lys Glu Gin He Glu Glu He Phe Asn Asn Ala 335 340 345
CTAAGCTGTT TAGAAAAAAC TAACAAAAAC TGAAAAGAGT TTAAAAGCTC 1111
(2) INFORMATION FOR SEQ ID NO: 952:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 348 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 952:
Met Met Gin Val Tyr His Leu Ser His He Asp Leu Asp Gly Tyr Ala
1 5 10 15
Cys Gin Leu Val Ser Lys Gin Phe Phe Lys Asn He Gin Cys Tyr Asn
20 25 30
Ala Asn Tyr Gly Arg Glu Val Ser Ala Arg He Tyr Glu He Leu Asn
35 40 45
Ala He Ala Gin Ser Lys Glu Ser Glu Phe Leu He Leu Val Ser Asp
50 55 60
Leu Asn Leu Asn Leu Asn Glu Ala Glu Tyr Leu Gin Asp Lys He Gin 65 70 75 80
Glu His Arg Leu Gin Asn Lys Asn He Gin He Gin Leu Leu Asp His
85 90 95
His He Ser Gly Lys Glu Val Ala Glu Ser Phe His Trp Tyr Phe Leu
100 105 110
Asp He Asn Arg Cys Ala Thr Lys He Val Tyr Glu Phe Leu Lys Lys
115 120 125
His Tyr Ala He Leu Glu Pro Lys Asn Thr Thr Trp Leu Glu Pro Leu
130 135 140
Val Glu Met Val Asn Ser Val Asp He Trp Asp Thr Gin Gly Tyr Gly 145 150 155 160
Phe Glu Leu Gly Lys Val Cys Met Arg Met He Asn Gin Ser Ser Glu
165 170 175
Leu Asn Arg Phe Met Phe Asp Asp Glu Asn Arg Asn Tyr Lys Leu Lys
180 185 190
Leu Leu Glu Glu Val Lys Asn Tyr Leu Phe Leu Glu Asn Ala Pro Val
195 200 205
Ala Tyr Asp Asn Asp Leu Phe Lys Leu Lys Lys He Ala Leu Gly Gly
210 215 220
Asp Pro Asp Ala Glu Thr Met Asp Asn He Ser Ser Asn Ala Gin Thr 225 230 235 240
His Leu Leu Ser Leu Lys Lys His Asp Cys Ser Val Tyr Tyr Gin Asp
245 250 255
Lys Lys Gly Phe Leu Ser Tyr Ser Met Gly Gly He Ser Val Leu Ala
260 265 270
Asn Leu Phe Leu Thr Gin Asn Pro Asp Phe Asp Phe Tyr Met Asp Val
275 280 285
Asn Ala Lys Gly Asn Val Ser Leu Arg Ala Asn Gly Asn Cys Asp Val
290 295 300
Cys Glu Leu Ser Gin Met Cys Phe Asn Gly Gly Gly His Arg Asn Ala 305 310 315 320
Ser Gly Gly Lys He Asp Gly Phe Arg Glu Ser Phe Asn Tyr Arg Asp
325 330 335
He Lys Glu Gin He Glu Glu He Phe Asn Asn Ala 340 345
(2) INFORMATION FOR SEQ ID NO:953:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2070 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 39...2024 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 953: ATAAAAAATG CGCTTAAAAC CATGAAAAAG GAGATGCG ATG CAA TTA GAC GAA GAT 56
Met Gin Leu Asp Glu Asp 1 5
TTA GAA TTC GCT AAA AAA ATC TTT AAC CCT AAC AGA GCG TTT GCC AAG 104 Leu Glu Phe Ala Lys Lys He Phe Asn Pro Asn Arg Ala Phe Ala Lys 10 15 20
CAA GCC AGG ATT AAA AAC ATG TGC GAA TAT AAA GAT TTA GTG CAT GAA 152 Gin Ala Arg He Lys Asn Met Cys Glu Tyr Lys Asp Leu Val His Glu 25 30 35
GCC AAT GAA GAT TAT GAA CAT TTT TGG GGC GAT TTA GCC AAG CAG AAA 200 Ala Asn Glu Asp Tyr Glu His Phe Trp Gly Asp Leu Ala Lys Gin Lys 40 45 50
CTC ACA TGG TTT AAA CCT TTT GAT AAG GTT TTA AAC AGC GAT AAC GCC 248 Leu Thr Trp Phe Lys Pro Phe Asp Lys Val Leu Asn Ser Asp Asn Ala 55 60 65 70
CCT TTT TTC AAA TGG TTT GAA AAC GGC AAA ATC AAT GTT TCT TAC AAT 296 Pro Phe Phe Lys Trp Phe Glu Asn Gly Lys He Asn Val Ser Tyr Asn 75 80 85
TGC ATA GAC AGG CAT TTA AAA GAC AAA AAA AAT AAA GTG GCG ATC ATT 344 Cys He Asp Arg His Leu Lys Asp Lys Lys Asn Lys Val Ala He He 90 95 100
TTT GAA GGG GAA ATG GGG GAT TAT AAT GTC ATC ACT TAC AGA AAA CTC 392 Phe Glu Gly Glu Met Gly Asp Tyr Asn Val He Thr Tyr Arg Lys Leu 105 110 115
CAC TCT GAA GTC AAT AAA ACA GCC AAC CTT TTA AAA AAC GAA TTC AAT 440 His Ser Glu Val Asn Lys Thr Ala Asn Leu Leu Lys Asn Glu Phe Asn 120 125 130
GTC AAA AAA GGC GAT AGG GTC ATT ATC TAT ATG CCC ATG ATT GTA GAA 488 Val Lys Lys Gly Asp Arg Val He He Tyr Met Pro Met He Val Glu 135 140 145 150
AGC GTT TAT ATG ATG CTC GCA TGC ACT AGG ATT GGA GCG ATC CAT AGC 536 Ser Val Tyr Met Met Leu Ala Cys Thr Arg He Gly Ala He His Ser 155 160 165
ATC GTT TTT GCT GGG TTT AGC CCT GAA GCC TTA AGG GAT AGG ATC AAC 584 He Val Phe Ala Gly Phe Ser Pro Glu Ala Leu Arg Asp Arg He Asn 170 175 180
GAC GCT CAA GCT AAA TTA GTT ATC ACA GCG GAT GGG ACT TTT AGA AAA 632 Asp Ala Gin Ala Lys Leu Val He Thr Ala Asp Gly Thr Phe Arg Lys 185 190 195
GGC AAA CCT TAC ATG CTC AAG CCA GCC CTT GAC AAG GCT CTA GAA AAT 680 Gly Lys Pro Tyr Met Leu Lys Pro Ala Leu Asp Lys Ala Leu Glu Asn 200 205 210 AAC GCC TGC CCT AGC GTG GAA AAA GCG CTC ATT GTG ATA CGA AAC GCC 728 Asn Ala Cys Pro Ser Val Glu Lys Ala Leu He Val He Arg Asn Ala 215 220 225 230
AAA GAG ATT GAC TAT GTG AGA GGG CGC GAT TTT GTC TAT AAT GAA ATG 776 Lys Glu He Asp Tyr Val Arg Gly Arg Asp Phe Val Tyr Asn Glu Met 235 240 245
GTC AAT TAC CAA TCC GAC AAA TGC GAA CCT GAA ATG ATG GAC TCT GAA 824 Val Asn Tyr Gin Ser Asp Lys Cys Glu Pro Glu Met Met Asp Ser Glu 250 255 260
GAT CCT TTA TTC TTG CTC TAT ACA AGC GGA TCA ACC GGA AAG CCT AAA 872 Asp Pro Leu Phe Leu Leu Tyr Thr Ser Gly Ser Thr Gly Lys Pro Lys 265 270 275
GGC GTT CAA CAC AGC AGT GCG GGG TAT TTG TTA TGG GCG CAA ATG ACG 920 Gly Val Gin His Ser Ser Ala Gly Tyr Leu Leu Trp Ala Gin Met Thr 280 285 290
ATG GAG TGG GTT TTT GAT ATT AGA GAT AAC GAT AAT TTT TGG TGC ACC 968 Met Glu Trp Val Phe Asp He Arg Asp Asn Asp Asn Phe Trp Cys Thr 295 300 305 310
GCC GAT ATT GGC TGG ATC ACA GGG CAC ACT TAT GTG GTT TAT GGA CCT 1016 Ala Asp He Gly Trp He Thr Gly His Thr Tyr Val Val Tyr Gly Pro 315 320 325
TTA GCT TGT GGG GCG ACG ACT TTG ATA CTA GAA GGC ACG ATG TCT TAT 1064 Leu Ala Cys Gly Ala Thr Thr Leu He Leu Glu Gly Thr Met Ser Tyr 330 335 340
CCG GAT TAT GGG AGA TGG TGG AGG ATG ATA GAA GAA TAC CGT GTG GAT 1112 Pro Asp Tyr Gly Arg Trp Trp Arg Met He Glu Glu Tyr Arg Val Asp 345 350 355
AAA TTC TAC ACT TCC CCT ACC GCT ATA AGA ATG TTG CAT GCC AAA GGT 1160 Lys Phe Tyr Thr Ser Pro Thr Ala He Arg Met Leu His Ala Lys Gly 360 365 370
GAA AAC GAA CCC TCA AAG TAT AAT TTA GAG TCG CTC AAA GTT TTA GGA 1208 Glu Asn Glu Pro Ser Lys Tyr Asn Leu Glu Ser Leu Lys Val Leu Gly 375 380 385 390
ACG GTG GGA GAG CCC ATT AAC CCT ACA GCA TGG AAA TGG TTT TAT GAA 1256 Thr Val Gly Glu Pro He Asn Pro Thr Ala Trp Lys Trp Phe Tyr Glu 395 400 405
AAA ATC GGC AAC TCA AAA TGC AGC ATC GTG GAT ACT TGG TGG CAG ACA 1304 Lys He Gly Asn Ser Lys Cys Ser He Val Asp Thr Trp Trp Gin Thr 410 415 420
GAA ACA GGC GGG CAC ATC ATC AGC CCT TTA CCG GGA GCT ACG CCT ATA 1352 Glu Thr Gly Gly His He He Ser Pro Leu Pro Gly Ala Thr Pro He 425 430 435 AGG GCC AGT TGC GCG ACT TTA CCT TTG CCT GGA ATC CAT GCG GAA GTT 1400 Arg Ala Ser Cys Ala Thr Leu Pro Leu Pro Gly He His Ala Glu Val 440 445 450
TTA AAC GAA GAC GGC ACT AAA ACA AAG CCT GGA GAG CAA GGG TTT TTA 1448 Leu Asn Glu Asp Gly Thr Lys Thr Lys Pro Gly Glu Gin Gly Phe Leu 455 460 465 470
TGC ATC ACT AAG CCA TGG CCT TCT ATG ATA AGA AAC ATT TGG GGC GAT 1496 Cys He Thr Lys Pro Trp Pro Ser Met He Arg Asn He Trp Gly Asp 475 480 485
GAA AAA CGA TAC ATT GAT AGC TAT TTT TCT CAG ATC AAG TTG AAT GGG 1544 Glu Lys Arg Tyr He Asp Ser Tyr Phe Ser Gin He Lys Leu Asn Gly 490 495 500
GAA TAT GTC TAC CTC TCT GGA GAT GGC GCT ATC GTG GAT GAA AAC GGA 1592 Glu Tyr Val Tyr Leu Ser Gly Asp Gly Ala He Val Asp Glu Asn Gly 505 510 515
TAC ATT ACT ATT ATT GGG CGC ACA GAT GAT ATT GTG AAT GTG AGT GGG 1640 Tyr He Thr He He Gly Arg Thr Asp Asp He Val Asn Val Ser Gly 520 525 530
CAT AGG ATT GGC ACG GCT GAA GTG GAG AGC GCT ATT TCC AAG CAT GAA 1688 His Arg He Gly Thr Ala Glu Val Glu Ser Ala He Ser Lys His Glu 535 540 545 550
ATG GTG GCT GAA TGC GCG GTG GTG GGT ATC CCT GAT GCG ATT AAA GGA 1736 Met Val Ala Glu Cys Ala Val Val Gly He Pro Asp Ala He Lys Gly 555 560 565
GAG GGC TTG TTT GCG TTT GTG GTG CTG TGC GAT GGG GCT AAA TGC AAT 1784 Glu Gly Leu Phe Ala Phe Val Val Leu Cys Asp Gly Ala Lys Cys Asn 570 575 580
CTT GGC GAG AGT TTA GAA TTG CTA AAA GAA ATG AAC CAT ATC TTA TCC 1832 Leu Gly Glu Ser Leu Glu Leu Leu Lys Glu Met Asn His He Leu Ser 585 590 595
ATT GAG ATT GGA AAG ATC GCG AAA TTA GAC AAT GTC ATG TAT GTG CCA 1880 He Glu He Gly Lys He Ala Lys Leu Asp Asn Val Met Tyr Val Pro 600 605 610
GGT TTG CCT AAA ACC AGG AGC GGG AAA ATC ATG AGA AGG CTT TTG AAA 1928 Gly Leu Pro Lys Thr Arg Ser Gly Lys He Met Arg Arg Leu Leu Lys 615 620 625 630
TCC ATC GCC AAA AAA GAG CCT ATC ACT CAA GAT TTA AGC ACG CTA GAA 1976 Ser He Ala Lys Lys Glu Pro He Thr Gin Asp Leu Ser Thr Leu Glu 635 640 645
GAT GTG AAT GTG GTT AAA GAA ATA ATG AGC ATC GCT CAA ATG GAG GAG T 2025 Asp Val Asn Val Val Lys Glu He Met Ser He Ala Gin Met Glu Glu 650 655 660 AAAATCTAAA AAATGCTTTT TAGCGTTTTT TAGCCAAATA ATAAG 2070
(2) INFORMATION FOR SEQ ID NO: 954:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 662 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 954:
Met Gin Leu Asp Glu Asp Leu Glu Phe Ala Lys Lys He Phe Asn Pro
1 5 10 15
Asn Arg Ala Phe Ala Lys Gin Ala Arg He Lys Asn Met Cys Glu Tyr
20 25 30
Lys Asp Leu Val His Glu Ala Asn Glu Asp Tyr Glu His Phe Trp Gly
35 40 45
Asp Leu Ala Lys Gin Lys Leu Thr Trp Phe Lys Pro Phe Asp Lys Val
50 55 60
Leu Asn Ser Asp Asn Ala Pro Phe Phe Lys Trp Phe Glu Asn Gly Lys 65 70 75 80
He Asn Val Ser Tyr Asn Cys He Asp Arg His Leu Lys Asp Lys Lys
85 90 95
Asn Lys Val Ala He He Phe Glu Gly Glu Met Gly Asp Tyr Asn Val
100 105 110
He Thr Tyr Arg Lys Leu His Ser Glu Val Asn Lys Thr Ala Asn Leu
115 120 125
Leu Lys Asn Glu Phe Asn Val Lys Lys Gly Asp Arg Val He He Tyr
130 135 140
Met Pro Met He Val Glu Ser Val Tyr Met Met Leu Ala Cys Thr Arg 145 150 155 160
He Gly Ala He His Ser He Val Phe Ala Gly Phe Ser Pro Glu Ala
165 170 175
Leu Arg Asp Arg He Asn Asp Ala Gin Ala Lys Leu Val He Thr Ala
180 185 190
Asp Gly Thr Phe Arg Lys Gly Lys Pro Tyr Met Leu Lys Pro Ala Leu
195 200 205
Asp Lys Ala Leu Glu Asn Asn Ala Cys Pro Ser Val Glu Lys Ala Leu
210 215 220
He Val He Arg Asn Ala Lys Glu He Asp Tyr Val Arg Gly Arg Asp 225 230 235 240
Phe Val Tyr Asn Glu Met Val Asn Tyr Gin Ser Asp Lys Cys Glu Pro
245 250 255
Glu Met Met Asp Ser Glu Asp Pro Leu Phe Leu Leu Tyr Thr Ser Gly
260 265 270
Ser Thr Gly Lys Pro Lys Gly Val Gin His Ser Ser Ala Gly Tyr Leu
275 280 285
Leu Trp Ala Gin Met Thr Met Glu Trp Val Phe Asp He Arg Asp Asn
290 295 300
Asp Asn Phe Trp Cys Thr Ala Asp He Gly Trp He Thr Gly His Thr 305 310 315 320
Tyr Val Val Tyr Gly Pro Leu Ala Cys Gly Ala Thr Thr Leu He Leu 325 330 335
Glu Gly Thr Met Ser Tyr Pro Asp Tyr Gly Arg Trp Trp Arg Met He
340 345 350
Glu Glu Tyr Arg Val Asp Lys Phe Tyr Thr Ser Pro Thr Ala He Arg
355 360 365
Met Leu His Ala Lys Gly Glu Asn Glu Pro Ser Lys Tyr Asn Leu Glu
370 375 380
Ser Leu Lys Val Leu Gly Thr Val Gly Glu Pro He Asn Pro Thr Ala 385 390 395 400
Trp Lys Trp Phe Tyr Glu Lys He Gly Asn Ser Lys Cys Ser He Val
405 410 415
Asp Thr Trp Trp Gin Thr Glu Thr Gly Gly His He He Ser Pro Leu
420 425 430
Pro Gly Ala Thr Pro He Arg Ala Ser Cys Ala Thr Leu Pro Leu Pro
435 440 445
Gly He His Ala Glu Val Leu Asn Glu Asp Gly Thr Lys Thr Lys Pro
450 455 460
Gly Glu Gin Gly Phe Leu Cys He Thr Lys Pro Trp Pro Ser Met He 465 470 475 480
Arg Asn He Trp Gly Asp Glu Lys Arg Tyr He Asp Ser Tyr Phe Ser
485 490 495
Gin He Lys Leu Asn Gly Glu Tyr Val Tyr Leu Ser Gly Asp Gly Ala
500 505 510
He Val Asp Glu Asn Gly Tyr He Thr He He Gly Arg Thr Asp Asp
515 520 525
He Val Asn Val Ser Gly His Arg He Gly Thr Ala Glu Val Glu Ser
530 535 540
Ala He Ser Lys His Glu Met Val Ala Glu Cys Ala Val Val Gly He 545 550 555 560
Pro Asp Ala He Lys Gly Glu Gly Leu Phe Ala Phe Val Val Leu Cys
565 570 575
Asp Gly Ala Lys Cys Asn Leu Gly Glu Ser Leu Glu Leu Leu Lys Glu
580 585 590
Met Asn His He Leu Ser He Glu He Gly Lys He Ala Lys Leu Asp
595 600 605
Asn Val Met Tyr Val Pro Gly Leu Pro Lys Thr Arg Ser Gly Lys He
610 615 620
Met Arg Arg Leu Leu Lys Ser He Ala Lys Lys Glu Pro He Thr Gin 625 630 635 640
Asp Leu Ser Thr Leu Glu Asp Val Asn Val Val Lys Glu He Met Ser
645 650 655
He Ala Gin Met Glu Glu 660
(2) INFORMATION FOR SEQ ID NO: 955:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 725 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...669 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 955:
TGAATGGCTT AATGAGCA ATG GAT AAA AAC CAA TAT CAC CGC CCC CAT AGA 51
Met Asp Lys Asn Gin Tyr His Arg Pro His Arg 1 5 10
GCA AGC CAA ACG GCT TTT AAT GAA AGG ATA GTC ATG TTA AAA ACG AAT 99 Ala Ser Gin Thr Ala Phe Asn Glu Arg He Val Met Leu Lys Thr Asn 15 20 25
CAA AAA AAT GTG CAT GCG TTT GAA ATT GAA AAG CAA GAG CCT GAA GCG 147 Gin Lys Asn Val His Ala Phe Glu He Glu Lys Gin Glu Pro Glu Ala 30 35 40
GTC ATA GGG TTT TTA GAA AAA AAC CAT GCC CTT TTG CAA TAT TTT CTT 195 Val He Gly Phe Leu Glu Lys Asn His Ala Leu Leu Gin Tyr Phe Leu 45 50 55
ATT ATT TTT AAA TAC GAT ATT GAA TCA GAA GTC AAA GCC GTT TTG CGC 243 He He Phe Lys Tyr Asp He Glu Ser Glu Val Lys Ala Val Leu Arg 60 65 70 75
AAA CAC CAG CTT TTG TTT TTA GAA ACG AAT CGC GTT TTA AAC GGA CGC 291 Lys His Gin Leu Leu Phe Leu Glu Thr Asn Arg Val Leu Asn Gly Arg 80 85 90
CAT ATC AAA ACC ATG CCT TTA AAA GAC GAA ACC GAT CAT CCA AAA CCC 339 His He Lys Thr Met Pro Leu Lys Asp Glu Thr Asp His Pro Lys Pro 95 100 105
AAT CAT TCT AAA ACA GAA CCT AAA ACA ACG ATT TAT GAG CGC CAT ATC 387 Asn His Ser Lys Thr Glu Pro Lys Thr Thr He Tyr Glu Arg His He 110 115 120
AGG AGT GGG GAA GAG ATT TAT AGC ACT AAT CAC CTT ATT TTT TTG GGT 435 Arg Ser Gly Glu Glu He Tyr Ser Thr Asn His Leu He Phe Leu Gly 125 130 135
AAT ATC CAT AAT GGA GCC AAG ATT ATT TCA GAG GGC TGT GTG TCT GTT 483 Asn He His Asn Gly Ala Lys He He Ser Glu Gly Cys Val Ser Val 140 145 150 155
TAT GGG GTT TGC GAA GGG GCG ATT GTG TGC TTT GGA GAG TGT TTG ATC 531 Tyr Gly Val Cys Glu Gly Ala He Val Cys Phe Gly Glu Cys Leu He 160 165 170
TTA AAA GAA GTC AAG AGC GCT CAA ATC GTT TTT CAA AAC AAA ATT TTG 579 Leu Lys Glu Val Lys Ser Ala Gin He Val Phe Gin Asn Lys He Leu 175 180 185 TCT CTA AAA GAG GTT GAA CCG CTT TTG GTA AAT AAA AAT ATT AAA ATA 627 Ser Leu Lys Glu Val Glu Pro Leu Leu Val Asn Lys Asn He Lys He 190 195 200
ATC ACT AAA AAT GAC GAT ATA CTA GAC ATA AAG GAA GTA TTA TGAAACAAA 678 He Thr Lys Asn Asp Asp He Leu Asp He Lys Glu Val Leu 205 210 215
CAACCATTAA CCACTCTGTG GAATTAGTAG GGATAGGCTT GCACAAG 725
(2) INFORMATION FOR SEQ ID NO: 956:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 217 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 956:
Met Asp Lys Asn Gin Tyr His Arg Pro His Arg Ala Ser Gin Thr Ala
1 5 10 15
Phe Asn Glu Arg He Val Met Leu Lys Thr Asn Gin Lys Asn Val His
20 25 30
Ala Phe Glu He Glu Lys Gin Glu Pro Glu Ala Val He Gly Phe Leu
35 40 45
Glu Lys Asn His Ala Leu Leu Gin Tyr Phe Leu He He Phe Lys Tyr
50 55 60
Asp He Glu Ser Glu Val Lys Ala Val Leu Arg Lys His Gin Leu Leu 65 70 75 80
Phe Leu Glu Thr Asn Arg Val Leu Asn Gly Arg His He Lys Thr Met
85 90 95
Pro Leu Lys Asp Glu Thr Asp His Pro Lys Pro Asn His Ser Lys Thr
100 105 110
Glu Pro Lys Thr Thr He Tyr Glu Arg His He Arg Ser Gly Glu Glu
115 120 125
He Tyr Ser Thr Asn His Leu He Phe Leu Gly Asn He His Asn Gly
130 135 140
Ala Lys He He Ser Glu Gly Cys Val Ser Val Tyr Gly Val Cys Glu 145 150 155 160
Gly Ala He Val Cys Phe Gly Glu Cys Leu He Leu Lys Glu Val Lys
165 170 175
Ser Ala Gin He Val Phe Gin Asn Lys He Leu Ser Leu Lys Glu Val
180 185 190
Glu Pro Leu Leu Val Asn Lys Asn He Lys He He Thr Lys Asn Asp
195 200 205
Asp He Leu Asp He Lys Glu Val Leu 210 215
(2) INFORMATION FOR SEQ ID NO: 957:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1121 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 64...1068 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 957:
AAAGAAAATC TGGTATTGGA TAAACCCAAG TCTTTAGAAG TGCCTTTGAC TAGGCCCGAA 60
ATC ATG GGG CTA GAA GAC AAG TGC CTT TTA TAT GAA ATT AAA GCT AAT 108 Met Gly Leu Glu Asp Lys Cys Leu Leu Tyr Glu He Lys Ala Asn 1 5 10 15
GAT TGG AGT TAT GCT AAT TTT TTC AAT GGC AAT AAA GCG TCT TTC AAA 156 Asp Trp Ser Tyr Ala Asn Phe Phe Asn Gly Asn Lys Ala Ser Phe Lys 20 25 30
CAA GAA GTG TGT GTT GAT ACG ATA AAA CCC TCA ATC ACT ATT TTA TCT 204 Gin Glu Val Cys Val Asp Thr He Lys Pro Ser He Thr He Leu Ser 35 40 45
CGA TCC CCA AGC ATC GCT TAT GGG GGG AGC GCG ATA GTC GTT TTT GAA 252 Arg Ser Pro Ser He Ala Tyr Gly Gly Ser Ala He Val Val Phe Glu 50 55 60
GCT TTG GAT AAG AAT TTG TCT CAA GCG TTT GTG CGC GTC AAA AAA AAG 300 Ala Leu Asp Lys Asn Leu Ser Gin Ala Phe Val Arg Val Lys Lys Lys 65 70 75
GAT TTT GAA GCT TTC AGG CTT TTA GAA TTC AAA CAG CGT AAT GTT TTT 348 Asp Phe Glu Ala Phe Arg Leu Leu Glu Phe Lys Gin Arg Asn Val Phe 80 85 90 95
ATC GCT CTA GTG CCT TGG TCT TAT AAA AAT AAG GAT TTT AAG GCG TTC 396 He Ala Leu Val Pro Trp Ser Tyr Lys Asn Lys Asp Phe Lys Ala Phe 100 105 110
ATT GTC GCT AAA GAT AAA GCC TAT AAC TTT AAT ACC GCC CCT TTA TTG 444 He Val Ala Lys Asp Lys Ala Tyr Asn Phe Asn Thr Ala Pro Leu Leu 115 120 125
TTC AAG CGA AAA ATC CAT CGT TTG AGG GAA AAA GAT ATA GAC TTA AGC 492 Phe Lys Arg Lys He His Arg Leu Arg Glu Lys Asp He Asp Leu Ser 130 135 140
GCC TTA AAA GAT AAG ATT GCA AAG CAA GAA AAA TTT CAA AAC GAC ACT 540 Ala Leu Lys Asp Lys He Ala Lys Gin Glu Lys Phe Gin Asn Asp Thr 145 150 155 GAA CAA GCT TTA TTA GAA AGA TTT TCC AAT GCG CGC CCA AAA GAT TTA 588 Glu Gin Ala Leu Leu Glu Arg Phe Ser Asn Ala Arg Pro Lys Asp Leu 160 165 170 175
GAA AAA ATC CAA AAG ATC GCT TTA GAG CAA GGG GAT TTT TAT AAG GAT 636 Glu Lys He Gin Lys He Ala Leu Glu Gin Gly Asp Phe Tyr Lys Asp 180 185 190
TTT TCT CAT TTT CAA GCG CTA AAA CCC TTG AAC GGG CCT TTT AAA ATG 684 Phe Ser His Phe Gin Ala Leu Lys Pro Leu Asn Gly Pro Phe Lys Met 195 200 205
GCA AGC AAT TTT TTA GAA AAT CGG CGT ATC TTA AAG AAT AAT CAG GTG 732 Ala Ser Asn Phe Leu Glu Asn Arg Arg He Leu Lys Asn Asn Gin Val 210 215 220
TTG TTT AAA TTC TTG CAT TTA GGG GTG GAT TTG ATA CCT GGC AAG GAT 780 Leu Phe Lys Phe Leu His Leu Gly Val Asp Leu He Pro Gly Lys Asp 225 230 235
TTA TCT TTA GCG TTT GAT TTG TCT GTG AAG AGG GTT TTT AAG GGG GAG 828 Leu Ser Leu Ala Phe Asp Leu Ser Val Lys Arg Val Phe Lys Gly Glu 240 245 250 255
TTC GAT TTT TAT GGT AAT AGT TTA ATC CAT TGC TAT GGG TTA GGT TTG 876 Phe Asp Phe Tyr Gly Asn Ser Leu He His Cys Tyr Gly Leu Gly Leu 260 265 270
TGC GTT TTT TTA GCC CAT TTA AAA GAT GAT AAA AGC GTG GGG AGT AGT 924 Cys Val Phe Leu Ala His Leu Lys Asp Asp Lys Ser Val Gly Ser Ser 275 280 285
GGT TTG AAA TTA GGG AGC GGG TTG CAT TTA GGG ATG CTT TTG CAA GGG 972 Gly Leu Lys Leu Gly Ser Gly Leu His Leu Gly Met Leu Leu Gin Gly 290 295 300
GTT TTT GTC CGG CCC AAT GAA TGG CTT AAT GAG CAA TGG ATA AAA ACC 1020 Val Phe Val Arg Pro Asn Glu Trp Leu Asn Glu Gin Trp He Lys Thr 305 310 315
AAT ATC ACC GCC CCC ATA GAG CAA GCC AAA CGG CTT TTA ATG AAA GGA T 1069 Asn He Thr Ala Pro He Glu Gin Ala Lys Arg Leu Leu Met Lys Gly 320 325 330 335
AGTCATGTTA AAAACGAATC AAAAAAATGT GCATGCGTTT GAAATTGAAA AG 1121
(2) INFORMATION FOR SEQ ID NO: 958:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 335 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 958:
Met Gly Leu Glu Asp Lys Cys Leu Leu Tyr Glu He Lys Ala Asn Asp
1 5 10 15
Trp Ser Tyr Ala Asn Phe Phe Asn Gly Asn Lys Ala Ser Phe Lys Gin
20 25 30
Glu Val Cys Val Asp Thr He Lys Pro Ser He Thr He Leu Ser Arg
35 40 45
Ser Pro Ser He Ala Tyr Gly Gly Ser Ala He Val Val Phe Glu Ala
50 55 60
Leu Asp Lys Asn Leu Ser Gin Ala Phe Val Arg Val Lys Lys Lys Asp 65 70 75 80
Phe Glu Ala Phe Arg Leu Leu Glu Phe Lys Gin Arg Asn Val Phe He
85 90 95
Ala Leu Val Pro Trp Ser Tyr Lys Asn Lys Asp Phe Lys Ala Phe He
100 105 110
Val Ala Lys Asp Lys Ala Tyr Asn Phe Asn Thr Ala Pro Leu Leu Phe
115 120 125
Lys Arg Lys He His Arg Leu Arg Glu Lys Asp He Asp Leu Ser Ala
130 135 140
Leu Lys Asp Lys He Ala Lys Gin Glu Lys Phe Gin Asn Asp Thr Glu 145 150 155 160
Gin Ala Leu Leu Glu Arg Phe Ser Asn Ala Arg Pro Lys Asp Leu Glu
165 170 175
Lys He Gin Lys He Ala Leu Glu Gin Gly Asp Phe Tyr Lys Asp Phe
180 185 190
Ser His Phe Gin Ala Leu Lys Pro Leu Asn Gly Pro Phe Lys Met Ala
195 200 205
Ser Asn Phe Leu Glu Asn Arg Arg He Leu Lys Asn Asn Gin Val Leu
210 215 220
Phe Lys Phe Leu His Leu Gly Val Asp Leu He Pro Gly Lys Asp Leu 225 230 235 240
Ser Leu Ala Phe Asp Leu Ser Val Lys Arg Val Phe Lys Gly Glu Phe
245 250 255
Asp Phe Tyr Gly Asn Ser Leu He His Cys Tyr Gly Leu Gly Leu Cys
260 265 270
Val Phe Leu Ala His Leu Lys Asp Asp Lys Ser Val Gly Ser Ser Gly
275 280 285
Leu Lys Leu Gly Ser Gly Leu His Leu Gly Met Leu Leu Gin Gly Val
290 295 300
Phe Val Arg Pro Asn Glu Trp Leu Asn Glu Gin Trp He Lys Thr Asn 305 310 315 320
He Thr Ala Pro He Glu Gin Ala Lys Arg Leu Leu Met Lys Gly 325 330 335
(2) INFORMATION FOR SEQ ID NO: 959:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1004 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA ( ix) FEATURE :
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 28...969 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 959:
TTACAACTAT TTATTGTAAA GGCTAAA ATG TTG AAA TTT AAA TAT GGT TTG ATT 54
Met Leu Lys Phe Lys Tyr Gly Leu He 1 5
TAT ATC GCG CTC ATA CTA GGA CTT CAA GCG ACA GAT TAT GAC AAT TTA 102 Tyr He Ala Leu He Leu Gly Leu Gin Ala Thr Asp Tyr Asp Asn Leu 10 15 20 25
GAA GAA GAA AAC CAA CAA TTA GAT GAA AAA ATA AAC CAT TTA AAG CAA 150 Glu Glu Glu Asn Gin Gin Leu Asp Glu Lys He Asn His Leu Lys Gin 30 35 40
CAG CTC ACC GAA AAA GGG GTT TCG CCC AAA GAG ATG GAT AAG GAT AAG 198 Gin Leu Thr Glu Lys Gly Val Ser Pro Lys Glu Met Asp Lys Asp Lys 45 50 55
TTT GAA GAA GAA TAC ATC AAT CGA TCT TAT CCT AAA ATT TCT TCC AAG 246 Phe Glu Glu Glu Tyr He Asn Arg Ser Tyr Pro Lys He Ser Ser Lys 60 65 70
AAA AAA GAG AAA TTG CTC AAA TCT TTT TCC ATA GCC GAT GAT AAG AGT 294 Lys Lys Glu Lys Leu Leu Lys Ser Phe Ser He Ala Asp Asp Lys Ser 75 80 85
GGG GTT TTT TTA GGG GGT GGG TAT GCT TAT GGG GAA CTT AAC TTG TCT 342 Gly Val Phe Leu Gly Gly Gly Tyr Ala Tyr Gly Glu Leu Asn Leu Ser 90 95 100 105
TAT CAA GGG GAA ATG TTA GAC AGA TAC GGC GCG AAT GCC CCT AGC GCG 390 Tyr Gin Gly Glu Met Leu Asp Arg Tyr Gly Ala Asn Ala Pro Ser Ala 110 115 120
TTT AAA AAC AAT ATC AAT ATT AAC GCT CCT GTT TCT ATG ATT AGC GCT 438 Phe Lys Asn Asn He Asn He Asn Ala Pro Val Ser Met He Ser Ala 125 130 135
AAA TTT GGG TAT CAA AAA TAC TTT GTG TCT TAT TTT GGG ACA CGA TTT 486 Lys Phe Gly Tyr Gin Lys Tyr Phe Val Ser Tyr Phe Gly Thr Arg Phe 140 145 150
TAT GGG GAT TTA TTG CTT GGG GGT GGG GCA TTA AAA GAG GAT GCA ATC 534 Tyr Gly Asp Leu Leu Leu Gly Gly Gly Ala Leu Lys Glu Asp Ala He 155 160 165
AAG CAG CCT GTA GGC TCG TTT ATT TAT GTT TTA GGG GCT GTC AAT ACC 582 Lys Gin Pro Val Gly Ser Phe He Tyr Val Leu Gly Ala Val Asn Thr 170 175 180 185
GAT TTA TTG TTT GAT ATG CCT TTA GAT TTT AAA ACT AAA AAG CAT TTT 630 Asp Leu Leu Phe Asp Met Pro Leu Asp Phe Lys Thr Lys Lys His Phe 190 195 200
TTA GGC GTT TAT GCG GGT TTT GGG ATA GGG CTT ATG CTC TAT CAA GAC 678 Leu Gly Val Tyr Ala Gly Phe Gly He Gly Leu Met Leu Tyr Gin Asp 205 210 215
AGG CCT AAT CAA AAC GGG AGG AAT TTA GTA GTG GGG GGC TAT TCA AGC 726 Arg Pro Asn Gin Asn Gly Arg Asn Leu Val Val Gly Gly Tyr Ser Ser 220 225 230
CCT AAT TTT TTA TGG AAA TCT TTG ATT GAA GTG GAT TAC ACT TTT AAT 774 Pro Asn Phe Leu Trp Lys Ser Leu He Glu Val Asp Tyr Thr Phe Asn 235 240 245
GTG GGC GTG AGT TTA ACG CTT TAT AGG AAA CAC CGT TTA GAG ATT GGC 822 Val Gly Val Ser Leu Thr Leu Tyr Arg Lys His Arg Leu Glu He Gly 250 255 260 265
ACA AAA TTG CCG ATT AGC TAT TTG AGA ATG GGA GTG GAA GAG GGA GCG 870 Thr Lys Leu Pro He Ser Tyr Leu Arg Met Gly Val Glu Glu Gly Ala 270 275 280
ATT TAT CAA AAT AAA GAA GAT GAT GAG CGT TTG TTG GTT TCG GCT AAC 918 He Tyr Gin Asn Lys Glu Asp Asp Glu Arg Leu Leu Val Ser Ala Asn 285 290 295
AAC CAG TTC AAG CGA TCC AGT TTT TTA TTA GTG AAT TAT GCG TTT ATT 966 Asn Gin Phe Lys Arg Ser Ser Phe Leu Leu Val Asn Tyr Ala Phe He 300 305 310
TTT TAAGGCTTGA TCTTGGAGTT AAGGTTTAAA ATTTT 1004
Phe
(2) INFORMATION FOR SEQ ID NO: 960:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 314 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 960:
Met Leu Lys Phe Lys Tyr Gly Leu He Tyr He Ala Leu He Leu Gly
1 5 10 15
Leu Gin Ala Thr Asp Tyr Asp Asn Leu Glu Glu Glu Asn Gin Gin Leu 20 25 30
Asp Glu Lys He Asn His Leu Lys Gin Gin Leu Thr Glu Lys Gly Val
35 40 45
Ser Pro Lys Glu Met Asp Lys Asp Lys Phe Glu Glu Glu Tyr He Asn
50 55 60
Arg Ser Tyr Pro Lys He Ser Ser Lys Lys Lys Glu Lys Leu Leu Lys 65 70 75 80
Ser Phe Ser He Ala Asp Asp Lys Ser Gly Val Phe Leu Gly Gly Gly
85 90 95
Tyr Ala Tyr Gly Glu Leu Asn Leu Ser Tyr Gin Gly Glu Met Leu Asp
100 105 110
Arg Tyr Gly Ala Asn Ala Pro Ser Ala Phe Lys Asn Asn He Asn He
115 120 125
Asn Ala Pro Val Ser Met He Ser Ala Lys Phe Gly Tyr Gin Lys Tyr
130 135 140
Phe Val Ser Tyr Phe Gly Thr Arg Phe Tyr Gly Asp Leu Leu Leu Gly 145 150 155 160
Gly Gly Ala Leu Lys Glu Asp Ala He Lys Gin Pro Val Gly Ser Phe
165 170 175
He Tyr Val Leu Gly Ala Val Asn Thr Asp Leu Leu Phe Asp Met Pro
180 185 190
Leu Asp Phe Lys Thr Lys Lys His Phe Leu Gly Val Tyr Ala Gly Phe
195 200 205
Gly He Gly Leu Met Leu Tyr Gin Asp Arg Pro Asn Gin Asn Gly Arg
210 215 220
Asn Leu Val Val Gly Gly Tyr Ser Ser Pro Asn Phe Leu Trp Lys Ser 225 230 235 240
Leu He Glu Val Asp Tyr Thr Phe Asn Val Gly Val Ser Leu Thr Leu
245 250 255
Tyr Arg Lys His Arg Leu Glu He Gly Thr Lys Leu Pro He Ser Tyr
260 265 270
Leu Arg Met Gly Val Glu Glu Gly Ala He Tyr Gin Asn Lys Glu Asp
275 280 285
Asp Glu Arg Leu Leu Val Ser Ala Asn Asn Gin Phe Lys Arg Ser Ser
290 295 300
Phe Leu Leu Val Asn Tyr Ala Phe He Phe 305 310
(2) INFORMATION FOR SEQ ID NO: 961:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 874 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 18...827 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 961:
AACACTTAGG ATTTTTA ATG AGC ATG CAA ACC GCC CCA ATT AAA AAG ATC 50
Met Ser Met Gin Thr Ala Pro He Lys Lys He 1 5 10
ACT CTC AAC CAC CTC CAA GCT AAA AAA AAT CAA GAA AAA ATC ATC GCT 98 Thr Leu Asn His Leu Gin Ala Lys Lys Asn Gin Glu Lys He He Ala 15 20 25
ATT ACC GCT TAT GAT GCG CTG TTC GCT CAA ATA TTT GAT CCG CTA GTG 146 He Thr Ala Tyr Asp Ala Leu Phe Ala Gin He Phe Asp Pro Leu Val 30 35 40
GAT GTG ATT TTA GTG GGC GAT AGT TTG AAT ATG AGT TTT TTC AAT CAA 194 Asp Val He Leu Val Gly Asp Ser Leu Asn Met Ser Phe Phe Asn Gin 45 50 55
AAC GAC ACT TTA AGC GCG AGT GTG GAA ATG ATG CTC TAT CAC ACC AAA 242 Asn Asp Thr Leu Ser Ala Ser Val Glu Met Met Leu Tyr His Thr Lys 60 65 70 75
GCC GTG TGC GCG GGC GCT AAG ACT CCT TTT ATC ATC ACA GAC ATG CCT 290 Ala Val Cys Ala Gly Ala Lys Thr Pro Phe He He Thr Asp Met Pro 80 85 90
TTT GGA AGC TAT AAA GAT GAA AAA ACA GCC CTA AAA AAC GCC ATT AGG 338 Phe Gly Ser Tyr Lys Asp Glu Lys Thr Ala Leu Lys Asn Ala He Arg 95 100 105
GTT TAT AAA GAA ACC CAA GCG AGC GCA ATC AAA TTA GAG GGG GGG AAA 386 Val Tyr Lys Glu Thr Gin Ala Ser Ala He Lys Leu Glu Gly Gly Lys 110 115 120
GAA AAA GCG AAA CTG GTT AAA ACG CTC ACT AAT GAG GGC GTT ATT GTG 434 Glu Lys Ala Lys Leu Val Lys Thr Leu Thr Asn Glu Gly Val He Val 125 130 135
GTA GGG CAT ATT GGC TTG ATG CCC CAA TTC GTG CGC CTT GAT GGA GGT 482 Val Gly His He Gly Leu Met Pro Gin Phe Val Arg Leu Asp Gly Gly 140 145 150 155
TAT AAG ATT AAG GGC AAA AAT GAA GAA CAA CAA AAA AAG CTT TTA GAA 530 Tyr Lys He Lys Gly Lys Asn Glu Glu Gin Gin Lys Lys Leu Leu Glu 160 165 170
GAC GCC TTG AGT TTA GAA GAA GCT GGG GTG GGT TTG TTG GTT TTA GAG 578 Asp Ala Leu Ser Leu Glu Glu Ala Gly Val Gly Leu Leu Val Leu Glu 175 180 185
GGT ATA ACC ACC CCT ATC GCT CAA AAA ATC ACG CAA AAA ATC AAA ATC 626 Gly He Thr Thr Pro He Ala Gin Lys He Thr Gin Lys He Lys He 190 195 200
CCC ACG ATC GGC ATA GGG AGC GGT AAA GAT TGC GAT GGG CAG ATT TTA 674 Pro Thr He Gly He Gly Ser Gly Lys Asp Cys Asp Gly Gin He Leu 205 210 215
GTG TGG AGC GAT ATG TTA GGC TTT TTT GAT AGC TTT AAG CCT AAA TTC 722 Val Trp Ser Asp Met Leu Gly Phe Phe Asp Ser Phe Lys Pro Lys Phe 220 225 230 235
GTG CGA GAA TAC CTT AAG GGG AAA GAA TTG ATT CAA AAC GCT ATC AAA 770 Val Arg Glu Tyr Leu Lys Gly Lys Glu Leu He Gin Asn Ala He Lys 240 245 250
CAA TAC GCT GAT GAT GTG AAA AAG GGA AAC TTC CCT AAC GAA TTA GAA 818 Gin Tyr Ala Asp Asp Val Lys Lys Gly Asn Phe Pro Asn Glu Leu Glu 255 260 265
AGT TAT CAT TAATGAAAGA ACGGATAGTC AATTTAGAAA CTTTGGATTT TGAAATT 874 Ser Tyr His 270
(2) INFORMATION FOR SEQ ID NO: 962:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 270 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 962:
Met Ser Met Gin Thr Ala Pro He Lys Lys He Thr Leu Asn His Leu
1 5 10 15
Gin Ala Lys Lys Asn Gin Glu Lys He He Ala He Thr Ala Tyr Asp
20 25 30
Ala Leu Phe Ala Gin He Phe Asp Pro Leu Val Asp Val He Leu Val
35 40 45
Gly Asp Ser Leu Asn Met Ser Phe Phe Asn Gin Asn Asp Thr Leu Ser
50 55 60
Ala Ser Val Glu Met Met Leu Tyr His Thr Lys Ala Val Cys Ala Gly 65 70 75 80
Ala Lys Thr Pro Phe He He Thr Asp Met Pro Phe Gly Ser Tyr Lys
85 90 95
Asp Glu Lys Thr Ala Leu Lys Asn Ala He Arg Val Tyr Lys Glu Thr
100 105 110
Gin Ala Ser Ala He Lys Leu Glu Gly Gly Lys Glu Lys Ala Lys Leu
115 120 125
Val Lys Thr Leu Thr Asn Glu Gly Val He Val Val Gly His He Gly
130 135 140
Leu Met Pro Gin Phe Val Arg Leu Asp Gly Gly Tyr Lys He Lys Gly 145 150 155 160
Lys Asn Glu Glu Gin Gin Lys Lys Leu Leu Glu Asp Ala Leu Ser Leu
165 170 175
Glu Glu Ala Gly Val Gly Leu Leu Val Leu Glu Gly He Thr Thr Pro 180 185 190
He Ala Gin Lys He Thr Gin Lys He Lys He Pro Thr He Gly He
195 200 205
Gly Ser Gly Lys Asp Cys Asp Gly Gin He Leu Val Trp Ser Asp Met
210 215 220
Leu Gly Phe Phe Asp Ser Phe Lys Pro Lys Phe Val Arg Glu Tyr Leu 225 230 235 240
Lys Gly Lys Glu Leu He Gin Asn Ala He Lys Gin Tyr Ala Asp Asp
245 250 255
Val Lys Lys Gly Asn Phe Pro Asn Glu Leu Glu Ser Tyr His 260 265 270
(2) INFORMATION FOR SEQ ID NO:963:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 568 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 41...520 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 963:
AATAACGATA AAATTTTAAA GGGTGTAAAA GTAGATTGTT ATG TTT GGC ATG GGC 55
Met Phe Gly Met Gly 1 5
TTT TTT GAA ATC CTT GTG GTG TTG GTT GTA GCG ATT ATT TTT TTA GGG 103 Phe Phe Glu He Leu Val Val Leu Val Val Ala He He Phe Leu Gly 10 15 20
CCA GAA AAA TTC CCC CAG GCT GTC GTG GAT GTG GTG AAG TTT TTT CGC 151 Pro Glu Lys Phe Pro Gin Ala Val Val Asp Val Val Lys Phe Phe Arg 25 30 35
GCG GTT AAA AAA ACG CTC AAT GAC GCT AAG GAC ACT TTA GAT AAA GAA 199 Ala Val Lys Lys Thr Leu Asn Asp Ala Lys Asp Thr Leu Asp Lys Glu 40 45 50
ATC AAT ATT GAA GAA ATC AAA AAA GAA ACC CTA GAG TAT CAA AAG CTC 247 He Asn He Glu Glu He Lys Lys Glu Thr Leu Glu Tyr Gin Lys Leu 55 60 65
TTT GAA AAC AAA GTG GAG AGT CTT AAG GGC GTT AAG ATT GAA GAA TTA 295 Phe Glu Asn Lys Val Glu Ser Leu Lys Gly Val Lys He Glu Glu Leu 70 75 80 85 GAA GAC GCT AAA GTG ACT GCA GAA AAT GAG ATT AAA AGC ATT CAG GAT 343 Glu Asp Ala Lys Val Thr Ala Glu Asn Glu He Lys Ser He Gin Asp 90 95 100
TTG ATG CAA GAT TAC CAA AAA AGC TTA GAA ACC AAC ACA ATC CCT AAC 391 Leu Met Gin Asp Tyr Gin Lys Ser Leu Glu Thr Asn Thr He Pro Asn 105 110 115
CAT TTA AAC GAA GAA GTT TCC AAT GAA GAA GCC TTA AAC AAA GAA GTT 439 His Leu Asn Glu Glu Val Ser Asn Glu Glu Ala Leu Asn Lys Glu Val 120 125 130
TCA AGC GAT GAA TCC CCT AAA GAA GTC CAA TTA GCA ACC GAT AAC AAC 487 Ser Ser Asp Glu Ser Pro Lys Glu Val Gin Leu Ala Thr Asp Asn Asn 135 140 145
ACC AAA GAA CAC GAC AAA GAA AAA GAG AAT GTT TGAAGATTTA AAACCGCATT 540 Thr Lys Glu His Asp Lys Glu Lys Glu Asn Val 150 155 160
TACAGGAATT AAGAAAGCGT TTGATGGT 568
(2) INFORMATION FOR SEQ ID NO: 964:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 160 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 964:
Met Phe Gly Met Gly Phe Phe Glu He Leu Val Val Leu Val Val Ala
1 5 10 15
He He Phe Leu Gly Pro Glu Lys Phe Pro Gin Ala Val Val Asp Val
20 25 30
Val Lys Phe Phe Arg Ala Val Lys Lys Thr Leu Asn Asp Ala Lys Asp
35 40 45
Thr Leu Asp Lys Glu He Asn He Glu Glu He Lys Lys Glu Thr Leu
50 55 60
Glu Tyr Gin Lys Leu Phe Glu Asn Lys Val Glu Ser Leu Lys Gly Val 65 70 75 80
Lys He Glu Glu Leu Glu Asp Ala Lys Val Thr Ala Glu Asn Glu He
85 90 95
Lys Ser He Gin Asp Leu Met Gin Asp Tyr Gin Lys Ser Leu Glu Thr
100 105 110
Asn Thr He Pro Asn His Leu Asn Glu Glu Val Ser Asn Glu Glu Ala
115 120 125
Leu Asn Lys Glu Val Ser Ser Asp Glu Ser Pro Lys Glu Val Gin Leu
130 135 140
Ala Thr Asp Asn Asn Thr Lys Glu His Asp Lys Glu Lys Glu Asn Val 145 150 155 160 (2) INFORMATION FOR SEQ ID NO: 965:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 359 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 46...324 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 965:
TAAAGGCGAG CAGTTAAAAG ATGAAATCGC TTGTAAAGAC ACTGA ATG CTT TAT GCA 57
Met Leu Tyr Ala 1
TCA AAA ACG AGT TTA TTT TTA CAA ATC AAA GGA AAG TTT ATG TTA AGA 105 Ser Lys Thr Ser Leu Phe Leu Gin He Lys Gly Lys Phe Met Leu Arg 5 10 15 20
ATT TTA ATC CCC TTG CTC ATT ATT GTG TGG GTT TTA TGG CGT TTG TTT 153 He Leu He Pro Leu Leu He He Val Trp Val Leu Trp Arg Leu Phe 25 30 35
TTG AGG CAA AAA CCC CCT AAA GAC AAC CAC TCT TAC ACG CAA CAA ACC 201 Leu Arg Gin Lys Pro Pro Lys Asp Asn His Ser Tyr Thr Gin Gin Thr 40 45 50
CCT AAA GAA TTA GAA GAT CAC ATG ATT GTA TGC TCT AAA TGC CAA ACC 249 Pro Lys Glu Leu Glu Asp His Met He Val Cys Ser Lys Cys Gin Thr 55 60 65
TAT GTC TCT AGC AAA GAC GCT ATT TAT AGC GGG GCG GTG GCG TAT TGC 297 Tyr Val Ser Ser Lys Asp Ala He Tyr Ser Gly Ala Val Ala Tyr Cys 70 75 80
AGT GAA ACC TGT TTG AAG GAT AAG AGG TAAATATGCT TATTTTAGGA CACCCTT 351 Ser Glu Thr Cys Leu Lys Asp Lys Arg 85 90
TAATCCCT 359
(2) INFORMATION FOR SEQ ID NO: 966:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 93 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 966:
Met Leu Tyr Ala Ser Lys Thr Ser Leu Phe Leu Gin He Lys Gly Lys
1 5 10 15
Phe Met Leu Arg He Leu He Pro Leu Leu He He Val Trp Val Leu
20 25 30
Trp Arg Leu Phe Leu Arg Gin Lys Pro Pro Lys Asp Asn His Ser Tyr
35 40 45
Thr Gin Gin Thr Pro Lys Glu Leu Glu Asp His Met He Val Cys Ser
50 55 60
Lys Cys Gin Thr Tyr Val Ser Ser Lys Asp Ala He Tyr Ser Gly Ala 65 70 75 80
Val Ala Tyr Cys Ser Glu Thr Cys Leu Lys Asp Lys Arg 85 90
(2) INFORMATION FOR SEQ ID NO: 967:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 814 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...765 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 967:
TAAAAAACAA CCAATAAATT AAGGATTATA ATG AAA CCA ACG AAC GAA CCT AAA 54
Met Lys Pro Thr Asn Glu Pro Lys 1 5
AAA CCT TTT TTT CAA AGT CCC ATT ATC CTT GCG GTT CTT GGA GGG ATT 102 Lys Pro Phe Phe Gin Ser Pro He He Leu Ala Val Leu Gly Gly He 10 15 20
TTA CTC ATC TTT TTT CTA CGC TCT TTC AAT TCT GAT GGC AGT TTT TCG 150 Leu Leu He Phe Phe Leu Arg Ser Phe Asn Ser Asp Gly Ser Phe Ser 25 30 35 40
GAC AAT TTC TTA GCT TCT AGC ACT AAA AAT GTG AGC TAC CAT GAA ATC 198 Asp Asn Phe Leu Ala Ser Ser Thr Lys Asn Val Ser Tyr His Glu He 45 50 55
AAA CAG CTC ATC AGC AAT AAT GAA GTG GAA AAT GTG AGT ATC GGT CAA 246 Lys Gin Leu He Ser Asn Asn Glu Val Glu Asn Val Ser He Gly Gin 60 65 70
ACT TTG ATC AAA GCC AGC CAT AAA GAG GGC AAC AAT CGT GTG ATC TAT 294 Thr Leu He Lys Ala Ser His Lys Glu Gly Asn Asn Arg Val He Tyr 75 80 85
ATC GCT AAA CGG GTG CCT GAT TTG ACC TTA GTG CCT TTG TTA GAC GAG 342 He Ala Lys Arg Val Pro Asp Leu Thr Leu Val Pro Leu Leu Asp Glu 90 95 100
AAA AAA ATC AAT TAT TCT GGT TTT AGC GAG TCT AAC TTT TTT ACG GAC 390 Lys Lys He Asn Tyr Ser Gly Phe Ser Glu Ser Asn Phe Phe Thr Asp 105 110 115 120
ATG CTA GGG TGG CTC ATG CCT ATT TTA GTG ATT TTA GGG CTA TGG ATG 438 Met Leu Gly Trp Leu Met Pro He Leu Val He Leu Gly Leu Trp Met 125 130 135
TTT ATG GCG AAC CGC ATG CAA AAA AAT ATG GGT GGG GGT ATT TTT GGC 486 Phe Met Ala Asn Arg Met Gin Lys Asn Met Gly Gly Gly He Phe Gly 140 145 150
ATG GGG AGC GCG AAA AAA CTC ATT AAC GCT GAA AAA CCC AAT GTG CGT 534 Met Gly Ser Ala Lys Lys Leu He Asn Ala Glu Lys Pro Asn Val Arg 155 160 165
TTT AAT GAC ATG GCA GGC AAT GAA GAA GCC AAA GAA GAA GTG GTA GAA 582 Phe Asn Asp Met Ala Gly Asn Glu Glu Ala Lys Glu Glu Val Val Glu 170 175 180
ATC GTA GAT TTC TTA AAA TAC CCT GAA CGA TAC GCC AAT TTA GGG GCT 630 He Val Asp Phe Leu Lys Tyr Pro Glu Arg Tyr Ala Asn Leu Gly Ala 185 190 195 200
AAA ATC CCT AAA GGC GTG TTA TTA GTA GGG CCT CCA GGA ACC GGT AAA 678 Lys He Pro Lys Gly Val Leu Leu Val Gly Pro Pro Gly Thr Gly Lys 205 210 215
ACC CTT TTA GCC AAA GCG GTA GCC GGC GAA CGC ATG TGC CGT TTT TCT 726 Thr Leu Leu Ala Lys Ala Val Ala Gly Glu Arg Met Cys Arg Phe Ser 220 225 230
CTA TGG GAG GGA GCA GTT TCA TTG AAA TGT TTG TGG GCT TAGGGGCAAG CA 777 Leu Trp Glu Gly Ala Val Ser Leu Lys Cys Leu Trp Ala 235 240 245
GGGTTAGGGA TTTATTTGAA ACCGCTAAAA AACAAGC 814
(2) INFORMATION FOR SEQ ID NO: 968:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 245 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:968:
Met Lys Pro Thr Asn Glu Pro Lys Lys Pro Phe Phe Gin Ser Pro He
1 5 10 15
He Leu Ala Val Leu Gly Gly He Leu Leu He Phe Phe Leu Arg Ser
20 25 30
Phe Asn Ser Asp Gly Ser Phe Ser Asp Asn Phe Leu Ala Ser Ser Thr
35 40 45
Lys Asn Val Ser Tyr His Glu He Lys Gin Leu He Ser Asn Asn Glu
50 55 60
Val Glu Asn Val Ser He Gly Gin Thr Leu He Lys Ala Ser His Lys 65 70 75 80
Glu Gly Asn Asn Arg Val He Tyr He Ala Lys Arg Val Pro Asp Leu
85 90 95
Thr Leu Val Pro Leu Leu Asp Glu Lys Lys He Asn Tyr Ser Gly Phe
100 105 110
Ser Glu Ser Asn Phe Phe Thr Asp Met Leu Gly Trp Leu Met Pro He
115 120 125
Leu Val He Leu Gly Leu Trp Met Phe Met Ala Asn Arg Met Gin Lys
130 135 140
Asn Met Gly Gly Gly He Phe Gly Met Gly Ser Ala Lys Lys Leu He 145 150 155 160
Asn Ala Glu Lys Pro Asn Val Arg Phe Asn Asp Met Ala Gly Asn Glu
165 170 175
Glu Ala Lys Glu Glu Val Val Glu He Val Asp Phe Leu Lys Tyr Pro
180 185 190
Glu Arg Tyr Ala Asn Leu Gly Ala Lys He Pro Lys Gly Val Leu Leu
195 200 205
Val Gly Pro Pro Gly Thr Gly Lys Thr Leu Leu Ala Lys Ala Val Ala
210 215 220
Gly Glu Arg Met Cys Arg Phe Ser Leu Trp Glu Gly Ala Val Ser Leu 225 230 235 240
Lys Cys Leu Trp Ala 245
(2) INFORMATION FOR SEQ ID NO: 969:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1137 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 59...1093 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 969: AAGTGTTTGT ATCGGTTTTA GTGATTTCTT GCCCTTGCGC TTTAGGATTG CTACGCCT 58
ATG AGC ATT TTA GTA GCG AAC CAG AAA GCG AGT TCT TTA GGG TTA TTT 106 Met Ser He Leu Val Ala Asn Gin Lys Ala Ser Ser Leu Gly Leu Phe 1 5 10 15
TTT AAA GAC GCT AAA AGT TTA GAA AAA GCA AGG CTA GTC AAT ACG ATC 154 Phe Lys Asp Ala Lys Ser Leu Glu Lys Ala Arg Leu Val Asn Thr He 20 25 30
GTT TTT GAT AAA ACC GGC ACG CTC ACT AAC GGC AAG CCT GTC GTT AAA 202 Val Phe Asp Lys Thr Gly Thr Leu Thr Asn Gly Lys Pro Val Val Lys 35 40 45
AGC GTT CAT TCT AAG ATA GAA TTA TTA GAG TTA TTG AGT TTA GCG CTC 250 Ser Val His Ser Lys He Glu Leu Leu Glu Leu Leu Ser Leu Ala Leu 50 55 60
AGT ATT GAA AAG AGT AGC GAA CAT GTC ATC GCT AAA GGG ATT GTA GAA 298 Ser He Glu Lys Ser Ser Glu His Val He Ala Lys Gly He Val Glu 65 70 75 80
TAC GCA AAA GAG CAT AAC GCT CCC TTA AAA GAA ATG AGC GGG GTT AAA 346 Tyr Ala Lys Glu His Asn Ala Pro Leu Lys Glu Met Ser Gly Val Lys 85 90 95
GTG AAA ACG GGT TTT GGC ATT AGT GCT AAA ACA GAT TAT CAA GGC ACT 394 Val Lys Thr Gly Phe Gly He Ser Ala Lys Thr Asp Tyr Gin Gly Thr 100 105 110
AAA GAG ATT ATT AAA GTA GGC AAC AGC GAG TTT TTT AAC CCT ATT AAC 442 Lys Glu He He Lys Val Gly Asn Ser Glu Phe Phe Asn Pro He Asn 115 120 125
ACG CTA GAA ATT AAA GAA AAC GGG ATT TTA GTG TTT GTT GGT AGA GCG 490 Thr Leu Glu He Lys Glu Asn Gly He Leu Val Phe Val Gly Arg Ala 130 135 140
ATC AGT GAA AAA GAA GAC GAG CTT TTA GGG GCG TTT GTT TTA GAA GAT 538 He Ser Glu Lys Glu Asp Glu Leu Leu Gly Ala Phe Val Leu Glu Asp 145 150 155 160
TTG CCC AAA AAA GGC GTG AAA GAG CAT ATC GCT CAA ATC AAA AAT TTA 586 Leu Pro Lys Lys Gly Val Lys Glu His He Ala Gin He Lys Asn Leu 165 170 175
GGC ATT AAC ACC TTT CTT TTA AGC GGA GAC AAT AGG GAG AAT GTC CAA 634 Gly He Asn Thr Phe Leu Leu Ser Gly Asp Asn Arg Glu Asn Val Gin 180 185 190
AAA TGC GCG TTT GAA TTA GGG ATT GAT GGT TAT ATC AGC AAC GCT AAA 682 Lys Cys Ala Phe Glu Leu Gly He Asp Gly Tyr He Ser Asn Ala Lys 195 200 205
CCA CAA GAC AAG CTC AAT AAG ATC AAA GAG CTT AAG GAA AAA GGG CAG 730 Pro Gin Asp Lys Leu Asn Lys He Lys Glu Leu Lys Glu Lys Gly Gin 210 215 220
ATC GTT ATG ATG GTG GGC GAT GGC TTG AAT GAC GCT CCT AGT CTT GCT 778 He Val Met Met Val Gly Asp Gly Leu Asn Asp Ala Pro Ser Leu Ala 225 230 235 240
ATG AGC GAT GTG GCG GTG GTG ATG GCT AAA GGG AGC GAT GTG AGC GTG 826 Met Ser Asp Val Ala Val Val Met Ala Lys Gly Ser Asp Val Ser Val 245 250 255
CAA GCA GCG GAC ATT GTG AGT TTT AAT AAC GAT ATT AAA TCG GTT TAT 874 Gin Ala Ala Asp He Val Ser Phe Asn Asn Asp He Lys Ser Val Tyr 260 265 ' 270
AGC GCG ATT AAA TTA AGC CAG GCG ACA ATT AAA AAT ATC AAA GAA AAT 922 Ser Ala He Lys Leu Ser Gin Ala Thr He Lys Asn He Lys Glu Asn 275 280 285
TTG TTT TGG GCT TTT TGT TAT AAT AGC GTG TTC ATC CCT TTA GCT TGT 970 Leu Phe Trp Ala Phe Cys Tyr Asn Ser Val Phe He Pro Leu Ala Cys 290 295 300
GGG GTT CTT TAT AAG GCT AAT CTC ATG TTA AGC CCG GCG ATA GCG GGT 1018 Gly Val Leu Tyr Lys Ala Asn Leu Met Leu Ser Pro Ala He Ala Gly 305 310 315 320
TTA GCG ATG AGT TTA AGC TCT GTG AGT GTG GTC TTA AAC TCC CAA AGG 1066 Leu Ala Met Ser Leu Ser Ser Val Ser Val Val Leu Asn Ser Gin Arg 325 330 335
CTA AGG AAT TTT AAA ATT AAG GAT CAT TGAATGAAAG CAACTTTTCA AGTGCCAAG 1122 Leu Arg Asn Phe Lys He Lys Asp His 340 345
CATTACTTGC AACCA 1137
(2) INFORMATION FOR SEQ ID NO: 970:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 345 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 970:
Met Ser He Leu Val Ala Asn Gin Lys Ala Ser Ser Leu Gly Leu Phe 1 5 10 15 Phe Lys Asp Ala Lys Ser Leu Glu Lys Ala Arg Leu Val Asn Thr He
20 25 30
Val Phe Asp Lys Thr Gly Thr Leu Thr Asn Gly Lys Pro Val Val Lys
35 40 45
Ser Val His Ser Lys He Glu Leu Leu Glu Leu Leu Ser Leu Ala Leu
50 55 60
Ser He Glu Lys Ser Ser Glu His Val He Ala Lys Gly He Val Glu 65 70 75 80
Tyr Ala Lys Glu His Asn Ala Pro Leu Lys Glu Met Ser Gly Val Lys
85 90 95
Val Lys Thr Gly Phe Gly He Ser Ala Lys Thr Asp Tyr Gin Gly Thr
100 105 110
Lys Glu He He Lys Val Gly Asn Ser Glu Phe Phe Asn Pro He Asn
115 120 125
Thr Leu Glu He Lys Glu Asn Gly He Leu Val Phe Val Gly Arg Ala
130 135 140
He Ser Glu Lys Glu Asp Glu Leu Leu Gly Ala Phe Val Leu Glu Asp 145 150 155 160
Leu Pro Lys Lys Gly Val Lys Glu His He Ala Gin He Lys Asn Leu
165 170 175
Gly He Asn Thr Phe Leu Leu Ser Gly Asp Asn Arg Glu Asn Val Gin
180 185 190
Lys Cys Ala Phe Glu Leu Gly He Asp Gly Tyr He Ser Asn Ala Lys
195 200 205
Pro Gin Asp Lys Leu Asn Lys He Lys Glu Leu Lys Glu Lys Gly Gin
210 215 220
He Val Met Met Val Gly Asp Gly Leu Asn Asp Ala Pro Ser Leu Ala 225 230 235 240
Met Ser Asp Val Ala Val Val Met Ala Lys Gly Ser Asp Val Ser Val
245 250 255
Gin Ala Ala Asp He Val Ser Phe Asn Asn Asp He Lys Ser Val Tyr
260 265 270
Ser Ala He Lys Leu Ser Gin Ala Thr He Lys Asn He Lys Glu Asn
275 280 285
Leu Phe Trp Ala Phe Cys Tyr Asn Ser Val Phe He Pro Leu Ala Cys
290 295 300
Gly Val Leu Tyr Lys Ala Asn Leu Met Leu Ser Pro Ala He Ala Gly 305 310 315 320
Leu Ala Met Ser Leu Ser Ser Val Ser Val Val Leu Asn Ser Gin Arg
325 330 335
Leu Arg Asn Phe Lys He Lys Asp His 340 345
(2) INFORMATION FOR SEQ ID NO: 971:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 575 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence (B) LOCATION: 25...537 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 971:
TCAATTCTAT TTAAAAGGTT TTTT ATG GAT ATT TTA AAA ACT CTT CAA AAA 51
Met Asp He Leu Lys Thr Leu Gin Lys 1 5
CAT TTG GGC GAT GTT GAA ACA AGC GAT TTT ACA ACC AAT GCG ATA GAA 99 His Leu Gly Asp Val Glu Thr Ser Asp Phe Thr Thr Asn Ala He Glu 10 15 20 25
AAA TCC CAA CAA ATC GCT AAA TTC AGT AGG GAC ATG AAA AAT ATA AAC 147 Lys Ser Gin Gin He Ala Lys Phe Ser Arg Asp Met Lys Asn He Asn 30 35 40
GAG AGC GTT GGA GCG TTA CAA GTC TTG CAA ATC GCT TGC AAA AAG CTT 195 Glu Ser Val Gly Ala Leu Gin Val Leu Gin He Ala Cys Lys Lys Leu 45 50 55
TTC AAT AAG AGC ATG GGT TTA GAA GAT AAA GAC GCT TTG CAA GCT TCT 243 Phe Asn Lys Ser Met Gly Leu Glu Asp Lys Asp Ala Leu Gin Ala Ser 60 65 70
ATC ATC AAA CAG GAA TTG CGA GAA ATT GTA GAA AAT TGC CAG TTT TTA 291 He He Lys Gin Glu Leu Arg Glu He Val Glu Asn Cys Gin Phe Leu 75 80 85
GCC TCC CCT TTG TTT GAC ACT CAG CTC AAC ATT GCA ATC AAT GAT GAA 339 Ala Ser Pro Leu Phe Asp Thr Gin Leu Asn He Ala He Asn Asp Glu 90 95 100 105
ATT TTT TCC ATG ATT GTG GTT AAT CCG TTG GAT TTA TTG GAA AAT GTG 387 He Phe Ser Met He Val Val Asn Pro Leu Asp Leu Leu Glu Asn Val 110 115 120
GGC GAG TTT CAA GCT TAT TTG GAA GAA AAA TTA AAC GAA ATT AAG GAA 435 Gly Glu Phe Gin Ala Tyr Leu Glu Glu Lys Leu Asn Glu He Lys Glu 125 130 135
TTA TTA GGT TAT TTG AGT GAA AGC CTT TCA AAC CCT AAA GCC TTC ATG 483 Leu Leu Gly Tyr Leu Ser Glu Ser Leu Ser Asn Pro Lys Ala Phe Met 140 145 150
CCA AGT TTT TCA AAT CAA AGC CTT AAA GAT TTA TTA AGC GAT AAT TTG 531 Pro Ser Phe Ser Asn Gin Ser Leu Lys Asp Leu Leu Ser Asp Asn Leu 155 160 165
AGG GCT TAGAATTCAG CTCTCTAGTT TAGAAAATTT GATTTTCC 575
Arg Ala
170 (2) INFORMATION FOR SEQ ID NO: 972:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 171 ammo acids
Figure imgf001427_0001
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 972 :
Met Asp He Leu Lys Thr Leu Gin Lys His Leu Gly Asp Val Glu Thr
1 5 10 15
Ser Asp Phe Thr Thr Asn Ala He Glu Lys Ser Gin Gin He Ala Lys
20 25 30
Phe Ser Arg Asp Met Lys Asn He Asn Glu Ser Val Gly Ala Leu Gin
35 40 45
Val Leu Gin He Ala Cys Lys Lys Leu Phe Asn Lys Ser Met Gly Leu
50 55 60
Glu Asp Lys Asp Ala Leu Gin Ala Ser He He Lys Gin Glu Leu Arg 65 70 75 80
Glu He Val Glu Asn Cys Gin Phe Leu Ala Ser Pro Leu Phe Asp Thr
85 90 95
Gin Leu Asn He Ala He Asn Asp Glu He Phe Ser Met He Val Val
100 105 110
Asn Pro Leu Asp Leu Leu Glu Asn Val Gly Glu Phe Gin Ala Tyr Leu
115 120 125
Glu Glu Lys Leu Asn Glu He Lys Glu Leu Leu Gly Tyr Leu Ser Glu
130 135 140
Ser Leu Ser Asn Pro Lys Ala Phe Met Pro Ser Phe Ser Asn Gin Ser 145 150 155 160
Leu Lys Asp Leu Leu Ser Asp Asn Leu Arg Ala 165 170
(2) INFORMATION FOR SEQ ID NO: 973:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1025 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...972 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 973: TTTTTAGCGA TTGTGTTCTT GCATGCATTG GGTTTAGCGT TGCTCTTT ATG GCC AAT 57 Met Ala Asn
1
AAC GCT TCG TTT TAT GCG GCG GCG TCT ATG GCC TAC ATG CTA GGG GCA 105 Asn Ala Ser Phe Tyr Ala Ala Ala Ser Met Ala Tyr Met Leu Gly Ala 5 10 15
AAG CAT GCG TTT GAT GCG GAT CAC ATC GCT TGC ATA GAT AAC ACC ATT 153 Lys His Ala Phe Asp Ala Asp His He Ala Cys He Asp Asn Thr He 20 25 30 35
AGA AAG CTC ACC CAA CAA GGC AAA AAC GCC TAT GGT GTG GGG TTT TAC 201 Arg Lys Leu Thr Gin Gin Gly Lys Asn Ala Tyr Gly Val Gly Phe Tyr 40 45 50
TTT TCT ATG GGG CAT TCA AGC GTG GTG ATT TTA ATG ACC ATC ATC AGC 249 Phe Ser Met Gly His Ser Ser Val Val He Leu Met Thr He He Ser 55 60 65
GCG TTT GCG ATC GCT TGG GCT AAA GAA CAC ACG CCG ATG CTA GAA GAA 297 Ala Phe Ala He Ala Trp Ala Lys Glu His Thr Pro Met Leu Glu Glu 70 75 80
ATA GGG GGG GTA GTG GGG ACT TTA GTT TCT GGG CTT TTT TTG CTC ATT 345 He Gly Gly Val Val Gly Thr Leu Val Ser Gly Leu Phe Leu Leu He 85 90 95
ATA GGG CTA TTG AAT GCG ATT ATT CTC TTG GAT TTA TTA AAA ATA TTC 393 He Gly Leu Leu Asn Ala He He Leu Leu Asp Leu Leu Lys He Phe 100 105 110 115
AAA AAA TCG CAC TCT AAT GAA AGC CTA AGC CAG CAA CAA AAT GAA GAG 441 Lys Lys Ser His Ser Asn Glu Ser Leu Ser Gin Gin Gin Asn Glu Glu 120 125 130
ATC GAG CGG CTC TTA ACG AGT AGG GGC TTG CTC AAC CGC TTT TTT AAA 489 He Glu Arg Leu Leu Thr Ser Arg Gly Leu Leu Asn Arg Phe Phe Lys 135 140 145
CCC TTG TTT AAT TTT GTC TCC AAG TCG TGG CAT ATT TAT CCT ATC GGT 537 Pro Leu Phe Asn Phe Val Ser Lys Ser Trp His He Tyr Pro He Gly 150 155 160
TTT CTT TTT GGG CTG GGT TTT GAT ACC GCT AGT GAA ATC GCG CTT TTG 585 Phe Leu Phe Gly Leu Gly Phe Asp Thr Ala Ser Glu He Ala Leu Leu 165 170 175
GCC CTC TCT AGC AGC GCG ATT AAA GTG AGT ATG GTG GGC ATG CTC TCT 633 Ala Leu Ser Ser Ser Ala He Lys Val Ser Met Val Gly Met Leu Ser 180 185 190 195
TTA CCC ATT CTT TTT GCC GCT GGC ATG AGT TTG TTT GAC ACT TTA GAT 681 Leu Pro He Leu Phe Ala Ala Gly Met Ser Leu Phe Asp Thr Leu Asp 200 205 210 GGG GCG TTC ATG CTC AAG GCG TAT GAC TGG GCG TTC AAA ACC CCT TTA 729 Gly Ala Phe Met Leu Lys Ala Tyr Asp Trp Ala Phe Lys Thr Pro Leu 215 220 225
AGA AAA ATC TAT TAC AAT ATC TCT ATC ACG GCC TTA AGC GTG TTT ATC 777 Arg Lys He Tyr Tyr Asn He Ser He Thr Ala Leu Ser Val Phe He 230 235 240
GCG CTC TTT ATT GGC TTG ATT GAG CTT TTT CAA GTC GTT AGC GAG AAA 825 Ala Leu Phe He Gly Leu He Glu Leu Phe Gin Val Val Ser Glu Lys 245 250 255
CTC CAT TTA AAA TTT GAA AAC CGC CTT TTA AGA GCC TTA CAA AGC CTG 873 Leu His Leu Lys Phe Glu Asn Arg Leu Leu Arg Ala Leu Gin Ser Leu 260 265 270 275
GAA TTT ACA GAC TTG GGC TAT TAC TTG GTG GGC TTA TTT GTA ATA GCG 921 Glu Phe Thr Asp Leu Gly Tyr Tyr Leu Val Gly Leu Phe Val He Ala 280 285 290
TTT CTA GGA TCG TTC TTT TTA TGG AAA ATC AAA TTT TCT AAA CTA GAG 969 Phe Leu Gly Ser Phe Phe Leu Trp Lys He Lys Phe Ser Lys Leu Glu 295 300 305
AGC TGAATTCTAA GCCCTCAAAT TATCGCTTAA TAAATCTTTA AGGCTTTGAT TTG 1025 Ser
(2) INFORMATION FOR SEQ ID NO : 974 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 308 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 974:
Met Ala Asn Asn Ala Ser Phe Tyr Ala Ala Ala Ser Met Ala Tyr Met
1 5 10 15
Leu Gly Ala Lys His Ala Phe Asp Ala Asp His He Ala Cys He Asp
20 25 30
Asn Thr He Arg Lys Leu Thr Gin Gin Gly Lys Asn Ala Tyr Gly Val
35 40 45
Gly Phe Tyr Phe Ser Met Gly His Ser Ser Val Val He Leu Met Thr
50 55 60
He He Ser Ala Phe Ala He Ala Trp Ala Lys Glu His Thr Pro Met 65 70 75 80
Leu Glu Glu He Gly Gly Val Val Gly Thr Leu Val Ser Gly Leu Phe
85 90 95
Leu Leu He He Gly Leu Leu Asn Ala He He Leu Leu Asp Leu Leu 100 105 110 Lys He Phe Lys Lys Ser His Ser Asn Glu Ser Leu Ser Gin Gin Gin
115 120 125
Asn Glu Glu He Glu Arg Leu Leu Thr Ser Arg Gly Leu Leu Asn Arg
130 135 140
Phe Phe Lys Pro Leu Phe Asn Phe Val Ser Lys Ser Trp His He Tyr 145 150 155 160
Pro He Gly Phe Leu Phe Gly Leu Gly Phe Asp Thr Ala Ser Glu He
165 170 175
Ala Leu Leu Ala Leu Ser Ser Ser Ala He Lys Val Ser Met Val Gly
180 185 190
Met Leu Ser Leu Pro He Leu Phe Ala Ala Gly Met Ser Leu Phe Asp
195 200 205
Thr Leu Asp Gly Ala Phe Met Leu Lys Ala Tyr Asp Trp Ala Phe Lys
210 215 220
Thr Pro Leu Arg Lys He Tyr Tyr Asn He Ser He Thr Ala Leu Ser 225 230 235 240
Val Phe He Ala Leu Phe He Gly Leu He Glu Leu Phe Gin Val Val
245 250 255
Ser Glu Lys Leu His Leu Lys Phe Glu Asn Arg Leu Leu Arg Ala Leu
260 265 270
Gin Ser Leu Glu Phe Thr Asp Leu Gly Tyr Tyr Leu Val Gly Leu Phe
275 280 285
Val He Ala Phe Leu Gly Ser Phe Phe Leu Trp Lys He Lys Phe Ser
290 295 300
Lys Leu Glu Ser 305
(2) INFORMATION FOR SEQ ID NO: 975:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1034 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 75...989 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 975:
TTGTGAAAGA AAATGAAGCG TTTTTTAAAA TCGGTATCAA AAACATCGCC GTGGCTGAAA 60 TTTCTTCGCC TTTA ATG GAG TTT TTA GGT TCA ATC GCT ATA GCG CTA GTG 110 Met Glu Phe Leu Gly Ser He Ala He Ala Leu Val 1 5 10
ATT TAT TTA GGG GGG AAT GAA GTG ATT AGA GGC CAT ATT AGC GTG GGG 158 He Tyr Leu Gly Gly Asn Glu Val He Arg Gly His He Ser Val Gly 15 20 25 GCG TTT TTT TCT TTC ATT ACG GCC CTT TTT ATG CTC TAT ACG CCG ATT 206 Ala Phe Phe Ser Phe He Thr Ala Leu Phe Met Leu Tyr Thr Pro He 30 35 40
AAA CGC TTA ACT AGG ATT GTT TCT AAT TTT CAA GAA GCC TTA GTC GCT 254 Lys Arg Leu Thr Arg He Val Ser Asn Phe Gin Glu Ala Leu Val Ala 45 50 55 60
AGC GAC AGG ATC CAT GAG ATT TTA GAA AGA GAG CCG GCT ATT GTT GAT 302 Ser Asp Arg He His Glu He Leu Glu Arg Glu Pro Ala He Val Asp 65 70 75
GGG GAA TTG ACG CTA AAT AAC GCC ATA CAC ACC ATA GAA TTT AAA AAG 350 Gly Glu Leu Thr Leu Asn Asn Ala He His Thr He Glu Phe Lys Lys 80 85 90
GTA TGG CTG GCT TAT ACG CTA GAC AAT CAA GAG CGT TAT GTT TTA AAC 398 Val Trp Leu Ala Tyr Thr Leu Asp Asn Gin Glu Arg Tyr Val Leu Asn 95 100 105
GAT ATT AGT TTG AAG TTC CAA CAA AAT GAA ATC ATC GCT TTA AAG GGC 446 Asp He Ser Leu Lys Phe Gin Gin Asn Glu He He Ala Leu Lys Gly 110 115 120
GAA AGC GGG AGC GGT AAA AGC TCA TTA GTG AAT CTG ATC TTA CGC CTT 494 Glu Ser Gly Ser Gly Lys Ser Ser Leu Val Asn Leu He Leu Arg Leu 125 130 135 140
TAT GAG CCA AGC AAA GGC GAA ATT TTC ATC AAC GAT CAA AAA ATA GAG 542 Tyr Glu Pro Ser Lys Gly Glu He Phe He Asn Asp Gin Lys He Glu 145 150 155
AGC ATC ACT CAA AAA TCC TTA AGA GAA AAG ATT AGC GTT GTC ACT CAA 590 Ser He Thr Gin Lys Ser Leu Arg Glu Lys He Ser Val Val Thr Gin 160 165 170
AGG GTG TTT ATT TTT AAC GGG AGC GTG GCT GAA AAT GTG GCG TAT GGT 638 Arg Val Phe He Phe Asn Gly Ser Val Ala Glu Asn Val Ala Tyr Gly 175 180 185
TTA GAA ATT GAT GAG GTA AAA ATC AAA GAA TGC CTA AAA AAA GCT CAA 686 Leu Glu He Asp Glu Val Lys He Lys Glu Cys Leu Lys Lys Ala Gin 190 195 200
GCC TTA GAT TTT GTT GAA AAA ATG CCT CAT GGG ATA GAG AGC GTT TTA 734 Ala Leu Asp Phe Val Glu Lys Met Pro His Gly He Glu Ser Val Leu 205 210 215 220
GAT GAA TTT GGC GCT AAT CTT AGC GGC GGC CAA CGC CAA AGA ATC GCC 782 Asp Glu Phe Gly Ala Asn Leu Ser Gly Gly Gin Arg Gin Arg He Ala 225 230 235
ATT GCA AGA GCT TTG TAT AAA GAC GTT CAA GTT TTA ATC TTT GAT GAA 830 He Ala Arg Ala Leu Tyr Lys Asp Val Gin Val Leu He Phe Asp Glu 240 245 250 GCC ACT TCC GCT TTA GAC AAT AAC ACA GAA GAG AGC GTT AAA CAA AGC 878 Ala Thr Ser Ala Leu Asp Asn Asn Thr Glu Glu Ser Val Lys Gin Ser 255 260 265
ATT TTA GAA TTG AAA CAA AAC CGC TTG ATC ATT CTT ATT TCG CAC AAC 926 He Leu Glu Leu Lys Gin Asn Arg Leu He He Leu He Ser His Asn 270 275 280
CCA AGC ACG CTA AAA TTA GCC ACT AAG CAT GTG AAA TTA GAG CAT GGG 974 Pro Ser Thr Leu Lys Leu Ala Thr Lys His Val Lys Leu Glu His Gly 285 290 295 300
CGT TTG ACA GAA TGC TAAGGGTTTT AAGCGTTGGT GTTGCTTTTA TTTTACTAGG G 1030 Arg Leu Thr Glu Cys 305
TGTC 1034
(2) INFORMATION FOR SEQ ID NO 976
(l) SEQUENCE CHARACTERISTICS
(A) LENGTH 305 ammo acids
(B) TYPE ammo acid
(C) STRANDEDNESS single
(D) TOPOLOGY linear
(ii) MOLECULE TYPE protein
(xi) SEQUENCE DESCRIPTION SEQ ID NO 976
Met Glu Phe Leu Gly Ser He Ala He Ala Leu Val He Tyr Leu Gly
1 5 10 15
Gly Asn Glu Val He Arg Gly His He Ser Val Gly Ala Phe Phe Ser
20 25 30
Phe He Thr Ala Leu Phe Met Leu Tyr Thr Pro He Lys Arg Leu Thr
35 40 45
Arg He Val Ser Asn Phe Gin Glu Ala Leu Val Ala Ser Asp Arg He
50 55 60
His Glu He Leu Glu Arg Glu Pro Ala He Val Asp Gly Glu Leu Thr 65 70 75 80
Leu Asn Asn Ala He His Thr He Glu Phe Lys Lys Val Trp Leu Ala
85 90 95
Tyr Thr Leu Asp Asn Gin Glu Arg Tyr Val Leu Asn Asp He Ser Leu
100 105 110
Lys Phe Gin Gin Asn Glu He He Ala Leu Lys Gly Glu Ser Gly Ser
115 120 125
Gly Lys Ser Ser Leu Val Asn Leu He Leu Arg Leu Tyr Glu Pro Ser
130 135 140
Lys Gly Glu He Phe He Asn Asp Gin Lys He Glu Ser He Thr Gin 145 150 155 160
Lys Ser Leu Arg Glu Lys He Ser Val Val Thr Gin Arg Val Phe He
165 170 175
Phe Asn Gly Ser Val Ala Glu Asn Val Ala Tyr Gly Leu Glu He Asp
180 185 190
Glu Val Lys He Lys Glu Cys Leu Lys Lys Ala Gin Ala Leu Asp Phe 195 200 205
Val Glu Lys Met Pro His Gly He Glu Ser Val Leu Asp Glu Phe Gly
210 215 220
Ala Asn Leu Ser Gly Gly Gin Arg Gin Arg He Ala He Ala Arg Ala 225 230 235 240
Leu Tyr Lys Asp Val Gin Val Leu He Phe Asp Glu Ala Thr Ser Ala
245 250 255
Leu Asp Asn Asn Thr Glu Glu Ser Val Lys Gin Ser He Leu Glu Leu
260 265 270
Lys Gin Asn Arg Leu He He Leu He Ser His Asn Pro Ser Thr Leu
275 280 285
Lys Leu Ala Thr Lys His Val Lys Leu Glu His Gly Arg Leu Thr Glu
290 295 300
Cys 305
(2) INFORMATION FOR SEQ ID NO 977
(l) SEQUENCE CHARACTERISTICS
(A) LENGTH 604 base pairs
(B) TYPE nucleic acid
(C) STRANDEDNESS single
(D) TOPOLOGY linear
(ii) MOLECULE TYPE Genomic DNA (ix) FEATURE
(A) NAME/KEY Coding Sequence
(B) LOCATION 99. 563 (D) OTHER INFORMATION
(xi) SEQUENCE DESCRIPTION SEQ ID NO 977
TTTTATTTTC TTCTTTAGTG GTGGCTTTAA GCACGGCTTG GGGGACTTAT TTAGTCAAGC 60 CCACTTTAGA TGAAATTTTT ATCAATAAAG ACACTCAC ATG CTC AAA ATC CTG CCT 116
Met Leu Lys He Leu Pro 1 5
TTT TTA GTG ATT TTG GCG TAT TTG GGT AAG AGT GGG GGC ATG TAT TTA 164 Phe Leu Val He Leu Ala Tyr Leu Gly Lys Ser Gly Gly Met Tyr Leu 10 15 20
GGC ACT TAT TTC ACC AAC TTC ATT GGG CTT GAT ATT GTC AAA AAA ATA 212 Gly Thr Tyr Phe Thr Asn Phe He Gly Leu Asp He Val Lys Lys He 25 30 35
CGC AAC ACT ATG CTA GAA AGC CTT CTT AAA ATG GAA ATG GAT TTT TTT 260 Arg Asn Thr Met Leu Glu Ser Leu Leu Lys Met Glu Met Asp Phe Phe 40 45 50
AAC AGG ACG AAA AAG GGC GAA TTG ATC GCA AGG ATC ACC AAT GAT ATA 308 Asn Arg Thr Lys Lys Gly Glu Leu He Ala Arg He Thr Asn Asp He 55 60 65 70 GGT TTG ATT AGA GCG AGT TTG TCC AAT TAC CTT TCA GAG AGC ATA AGA 356 Gly Leu He Arg Ala Ser Leu Ser Asn Tyr Leu Ser Glu Ser He Arg 75 80 85
GAG GGG CTA ACG ATT GTT GGG TTA GTG GGG GTG GTG ATC TAT CAA AGC 404 Glu Gly Leu Thr He Val Gly Leu Val Gly Val Val He Tyr Gin Ser 90 95 100
CCT AAA TTA GCG TTA GTG GGG TTA GTC ATC ATG CCG TTA GCT GCT ATT 452 Pro Lys Leu Ala Leu Val Gly Leu Val He Met Pro Leu Ala Ala He 105 110 115
CCT ATC AGT AAA ATC ATT CGT AAG GTT AAA AAA CTC GCT AAA TCC CAT 500 Pro He Ser Lys He He Arg Lys Val Lys Lys Leu Ala Lys Ser His 120 125 130
CAA GAG AGT AAC GCC AAA ATC ACC GCT CGT TTG AGT GAA GTT TTT AAC 548 Gin Glu Ser Asn Ala Lys He Thr Ala Arg Leu Ser Glu Val Phe Asn 135 140 145 150
AAC GTG GGA AGC GAT TAAAATCTCT AATGGCGAAA AATTAGAGCA TAAGGCTTTT G 604 Asn Val Gly Ser Asp 155
604
(2) INFORMATION FOR SEQ ID NO: 978:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 155 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 978:
Met Leu Lys He Leu Pro Phe Leu Val He Leu Ala Tyr Leu Gly Lys
1 5 10 15
Ser Gly Gly Met Tyr Leu Gly Thr Tyr Phe Thr Asn Phe He Gly Leu
20 25 30
Asp He Val Lys Lys He Arg Asn Thr Met Leu Glu Ser Leu Leu Lys
35 40 45
Met Glu Met Asp Phe Phe Asn Arg Thr Lys Lys Gly Glu Leu He Ala
50 55 60
Arg He Thr Asn Asp He Gly Leu He Arg Ala Ser Leu Ser Asn Tyr 65 70 75 80
Leu Ser Glu Ser He Arg Glu Gly Leu Thr He Val Gly Leu Val Gly
85 90 95
Val Val He Tyr Gin Ser Pro Lys Leu Ala Leu Val Gly Leu Val He
100 105 110
Met Pro Leu Ala Ala He Pro He Ser Lys He He Arg Lys Val Lys
115 120 125
Lys Leu Ala Lys Ser His Gin Glu Ser Asn Ala Lys He Thr Ala Arg 130 135 140
Leu Ser Glu Val Phe Asn Asn Val Gly Ser Asp 145 150 155
(2) INFORMATION FOR SEQ ID NO: 979:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 789 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...738 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 979:
TTTCGCTTAT CAAGTCCCCC TACCTCAATT TTA ATG CGC TTA GAT TAC GCC TTA 54
Met Arg Leu Asp Tyr Ala Leu 1 5
TTC AGT CAG CAT TTA GTA AAT AGC AGA GAA AAA GCT AAA GCG TTG GTT 102 Phe Ser Gin His Leu Val Asn Ser Arg Glu Lys Ala Lys Ala Leu Val 10 15 20
TTA AAA AAT CAG GTT TTA GTC AAT AAA ATG GTG GTT TCC AAA CCC TCT 150 Leu Lys Asn Gin Val Leu Val Asn Lys Met Val Val Ser Lys Pro Ser 25 30 35
TTT ATA GTG AAA GAG AAC GAT AAA ATT GAA CTC ATC GCT GAA AAA CTT 198 Phe He Val Lys Glu Asn Asp Lys He Glu Leu He Ala Glu Lys Leu 40 45 50 55
TTC GTT AGC AGG GCT GGG GAA AAA TTA GGG GCT TTT TTA GAA ACC CAT 246 Phe Val Ser Arg Ala Gly Glu Lys Leu Gly Ala Phe Leu Glu Thr His 60 65 70
TTC GTG GAT TTT AAG GGA AAG GTG GTT TTA GAT GTG GGA GCG AGC AAA 294 Phe Val Asp Phe Lys Gly Lys Val Val Leu Asp Val Gly Ala Ser Lys 75 80 85
GGG GGC TTT AGT CAA GTG GCT CTT TTA AAA GGG GCT AAA AGA GTG CTT 342 Gly Gly Phe Ser Gin Val Ala Leu Leu Lys Gly Ala Lys Arg Val Leu 90 95 100
TGC GTG GAT GTG GGG AAA ATG CAA TTA GAT GAA AGT TTG AAA CAA GAC 390 Cys Val Asp Val Gly Lys Met Gin Leu Asp Glu Ser Leu Lys Gin Asp 105 110 115 AAG CGC ATA GAA TGT TAC GAA GAA TGC GAT ATT AGA GGG TTT AAA ACG 438 Lys Arg He Glu Cys Tyr Glu Glu Cys Asp He Arg Gly Phe Lys Thr 120 125 130 135
CCA GAA ACA ATT GAT TTA GCG CTT TGC GAT GTG AGC TTT ATT TCT TTA 486 Pro Glu Thr He Asp Leu Ala Leu Cys Asp Val Ser Phe He Ser Leu 140 145 150
TAT TAT ATT TTA GAA GCG ATT TTG CCT TTA AGC GAT GAA TTT TTA ACA 534 Tyr Tyr He Leu Glu Ala He Leu Pro Leu Ser Asp Glu Phe Leu Thr 155 160 165
CTT TTC AAA CCG CAA TTT GAA GTG GGC AGA GGA ATA AAA CGC AAT AAA 582 Leu Phe Lys Pro Gin Phe Glu Val Gly Arg Gly He Lys Arg Asn Lys 170 175 180
AAA GGG GTG GTG GTG GAT AAA GAA GCC ATT TTG AAC GCT TTA GAA AAC 630 Lys Gly Val Val Val Asp Lys Glu Ala He Leu Asn Ala Leu Glu Asn 185 190 195
TTT AAA AAC CAT TTA AAA ACA AAG GAT TTT CAA ATC TTA AAG ATC CAA 678 Phe Lys Asn His Leu Lys Thr Lys Asp Phe Gin He Leu Lys He Gin 200 205 210 215
GAA AGC TTA GTG AAA GGG AAA AAC GGG AAT GTT GAA TTT TTT ATC CAT 726 Glu Ser Leu Val Lys Gly Lys Asn Gly Asn Val Glu Phe Phe He His 220 225 230
TTC AAG CGA GCC TAAAATTAAA AGCCTAGCTA TCGGTAAATT TGACGGCTTG CATTT 783 Phe Lys Arg Ala 235
AGGGCA 789
(2) INFORMATION FOR SEQ ID NO: 980:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 235 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 980:
Met Arg Leu Asp Tyr Ala Leu Phe Ser Gin His Leu Val Asn Ser Arg
1 5 10 15
Glu Lys Ala Lys Ala Leu Val Leu Lys Asn Gin Val Leu Val Asn Lys
20 25 30
Met Val Val Ser Lys Pro Ser Phe He Val Lys Glu Asn Asp Lys He
35 40 45
Glu Leu He Ala Glu Lys Leu Phe Val Ser Arg Ala Gly Glu Lys Leu
50 55 60
Gly Ala Phe Leu Glu Thr His Phe Val Asp Phe Lys Gly Lys Val Val 65 70 75 80
Leu Asp Val Gly Ala Ser Lys Gly Gly Phe Ser Gin Val Ala Leu Leu
85 90 95
Lys Gly Ala Lys Arg Val Leu Cys Val Asp Val Gly Lys Met Gin Leu
100 105 110
Asp Glu Ser Leu Lys Gin Asp Lys Arg He Glu Cys Tyr Glu Glu Cys
115 120 125
Asp He Arg Gly Phe Lys Thr Pro Glu Thr He Asp Leu Ala Leu Cys
130 135 140
Asp Val Ser Phe He Ser Leu Tyr Tyr He Leu Glu Ala He Leu Pro 145 150 155 160
Leu Ser Asp Glu Phe Leu Thr Leu Phe Lys Pro Gin Phe Glu Val Gly
165 170 175
Arg Gly He Lys Arg Asn Lys Lys Gly Val Val Val Asp Lys Glu Ala
180 185 190
He Leu Asn Ala Leu Glu Asn Phe Lys Asn His Leu Lys Thr Lys Asp
195 200 205
Phe Gin He Leu Lys He Gin Glu Ser Leu Val Lys Gly Lys Asn Gly
210 215 220
Asn Val Glu Phe Phe He His Phe Lys Arg Ala 225 230 235
(2) INFORMATION FOR SEQ ID NO: 981:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 906 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...858 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 981:
TGAAAGGGAA AAACGGGA ATG TTG AAT TTT TTA TCC ATT TCA AGC GAG CCT 51
Met Leu Asn Phe Leu Ser He Ser Ser Glu Pro 1 5 10
AAA ATT AAA AGC CTA GCT ATC GGT AAA TTT GAC GGC TTG CAT TTA GGG 99
Lys He Lys Ser Leu Ala He Gly Lys Phe Asp Gly Leu His Leu Gly
15 20 25
CAT CAA GCC CTT TTT AAA GAG TTA AAA GAT CCC AAA GCC CTT TTA ATC 147
His Gin Ala Leu Phe Lys Glu Leu Lys Asp Pro Lys Ala Leu Leu He 30 35 40
ATA GAA AAA AAA CAT TAC ACT AAA GGC TAT TTA ACC CCC CTA AAA TAC 195
He Glu Lys Lys His Tyr Thr Lys Gly Tyr Leu Thr Pro Leu Lys Tyr 45 « „ 50 55
CGC GCT AAA CTC GTG GGC ATG CCT TTA TTT TTT GTG TAT TTA GAA GAG 243 Arg Ala Lys Leu Val Gly Met Pro Leu Phe Phe Val Tyr Leu Glu Glu 60 65 70 75
ATT TCA CAA TTA AAC GCC CTA GAT TTT TTA GAT CTT TTA AAA AAG AAA 291 He Ser Gin Leu Asn Ala Leu Asp Phe Leu Asp Leu Leu Lys Lys Lys 80 85 90
TTT CCC CAT TTA GAA CGC CTG GTC GTG GGC TAT GAT TTC AGG TTT GGG 339 Phe Pro His Leu Glu Arg Leu Val Val Gly Tyr Asp Phe Arg Phe Gly 95 100 105
CAT GAG AGG CAA AAT GAC GCT TTA TTT TTA AAA GAG CGT TTT GAA AAA 387 His Glu Arg Gin Asn Asp Ala Leu Phe Leu Lys Glu Arg Phe Glu Lys 110 115 120
ACC ATT ATT GTG CCT GAA GTG AAA GTC CAA GAG ATT AGC GTG CAT TCT 435 Thr He He Val Pro Glu Val Lys Val Gin Glu He Ser Val His Ser 125 130 135
AAG ATG ATC AAA CTA GCC CTA AGT CAT GGC GAC TTA TCT TTA GCT AAC 483 Lys Met He Lys Leu Ala Leu Ser His Gly Asp Leu Ser Leu Ala Asn 140 145 150 155
AAG CTC TTA GGC AGA CCT TAT GAA GTG TGT GGG GAA GTC ATT AGT GAT 531 Lys Leu Leu Gly Arg Pro Tyr Glu Val Cys Gly Glu Val He Ser Asp 160 165 170
CAA GGT TTG GGG CAT AAA GAA TTA GCA CCC ACT TTA AAT ATA AAA ACT 579 Gin Gly Leu Gly His Lys Glu Leu Ala Pro Thr Leu Asn He Lys Thr 175 180 185
AAA GAT TTT ATC CTC CCT AGT TTT GGG GTG TAT GCG AGT TTA GTG AAA 627 Lys Asp Phe He Leu Pro Ser Phe Gly Val Tyr Ala Ser Leu Val Lys 190 195 200
ATA AAA GAT CCA ATT TAT CAA AAA AGC GTG AGT TTT ATA GGC AAT CGC 675 He Lys Asp Pro He Tyr Gin Lys Ser Val Ser Phe He Gly Asn Arg 205 210 215
TTA AGC ACG GAT CAA AAT TTC GCC ATA GAA TGC CAT GTC CTT GAT ACC 723 Leu Ser Thr Asp Gin Asn Phe Ala He Glu Cys His Val Leu Asp Thr 220 225 230 235
ATC ATA GAA AAC CCG CCC CAA GAA ATC GCT TTG CGT TGG GTT CAA AAA 771 He He Glu Asn Pro Pro Gin Glu He Ala Leu Arg Trp Val Gin Lys 240 245 250
ATA CGA GAC AAC ATG CGT TTT TCT TCT TTA AAA GAG CTT AAA AAT CAG 819 He Arg Asp Asn Met Arg Phe Ser Ser Leu Lys Glu Leu Lys Asn Gin 255 260 265
ATC CAA CAA GAC ATC TTA AGA GCC AAA GAG ATT TTG AGA TAATTTGTGT TA 870 He Gin Gin Asp He Leu Arg Ala Lys Glu He Leu Arg 270 275 280
AAATGACTCT CAAAAACCTT AAAAATGGAA AAATTT 906
(2) INFORMATION FOR SEQ ID NO: 982:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 280 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 982:
Met Leu Asn Phe Leu Ser He Ser Ser Glu Pro Lys He Lys Ser Leu
1 5 10 15
Ala He Gly Lys Phe Asp Gly Leu His Leu Gly His Gin Ala Leu Phe
20 25 30
Lys Glu Leu Lys Asp Pro Lys Ala Leu Leu He He Glu Lys Lys His
35 40 45
Tyr Thr Lys Gly Tyr Leu Thr Pro Leu Lys Tyr Arg Ala Lys Leu Val
50 55 60
Gly Met Pro Leu Phe Phe Val Tyr Leu Glu Glu He Ser Gin Leu Asn 65 70 75 80
Ala Leu Asp Phe Leu Asp Leu Leu Lys Lys Lys Phe Pro His Leu Glu
85 90 95
Arg Leu Val Val Gly Tyr Asp Phe Arg Phe Gly His Glu Arg Gin Asn
100 105 110
Asp Ala Leu Phe Leu Lys Glu Arg Phe Glu Lys Thr He He Val Pro
115 120 125
Glu Val Lys Val Gin Glu He Ser Val His Ser Lys Met He Lys Leu
130 135 140
Ala Leu Ser His Gly Asp Leu Ser Leu Ala Asn Lys Leu Leu Gly Arg 145 150 155 160
Pro Tyr Glu Val Cys Gly Glu Val He Ser Asp Gin Gly Leu Gly His
165 170 175
Lys Glu Leu Ala Pro Thr Leu Asn He Lys Thr Lys Asp Phe He Leu
180 185 190
Pro Ser Phe Gly Val Tyr Ala Ser Leu Val Lys He Lys Asp Pro He
195 200 205
Tyr Gin Lys Ser Val Ser Phe He Gly Asn Arg Leu Ser Thr Asp Gin
210 215 220
Asn Phe Ala He Glu Cys His Val Leu Asp Thr He He Glu Asn Pro 225 230 235 240
Pro Gin Glu He Ala Leu Arg Trp Val Gin Lys He Arg Asp Asn Met
245 250 255
Arg Phe Ser Ser Leu Lys Glu Leu Lys Asn Gin He Gin Gin Asp He
260 265 270
Leu Arg Ala Lys Glu He Leu Arg 275 280
(2) INFORMATION FOR SEQ ID NO:983: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2627 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 18...2582 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 983:
AAAGACATGT GCAACCG ATG AAA TCT AAA AAA CTT TAT TTG GCT TTA ATC 50
Met Lys Ser Lys Lys Leu Tyr Leu Ala Leu He 1 5 10
ATA GGG GTT TTA TTA GCG TTT TTA ACC CTA TCT TCA TGG CTG GGT AAT 98 He Gly Val Leu Leu Ala Phe Leu Thr Leu Ser Ser Trp Leu Gly Asn 15 20 25
AGC GGT TTA GTG GGG CGT TTT GGG GTG TGG TTT GCC GCA CTC AAT AAA 146 Ser Gly Leu Val Gly Arg Phe Gly Val Trp Phe Ala Ala Leu Asn Lys 30 35 40
AAA TAT TTT GGG CAT CTT TCA TTC ATT AAT TTA CCC TAT TTA GCA TGG 194 Lys Tyr Phe Gly His Leu Ser Phe He Asn Leu Pro Tyr Leu Ala Trp 45 50 55
GTT TTA TTC CTT TTA TAC AAG ACT AAA AAC CCT TTT ACA GAA ATC GTT 242 Val Leu Phe Leu Leu Tyr Lys Thr Lys Asn Pro Phe Thr Glu He Val 60 65 70 75
TTA GAA AAA ACT TTA GGG CAT CTA TTA GGC ATT TTA TCT TTG CTC TTT 290 Leu Glu Lys Thr Leu Gly His Leu Leu Gly He Leu Ser Leu Leu Phe 80 85 90
TTA CAA TCT AGC CTA TTA AAT CAA GGG GAA ATC GGC AAC AGC GCG CGT 338 Leu Gin Ser Ser Leu Leu Asn Gin Gly Glu He Gly Asn Ser Ala Arg 95 100 105
TTG TTT TTA CGC CCT TTT ATA GGG GAT TTT GGG CTT TAT GCG CTG ATA 386 Leu Phe Leu Arg Pro Phe He Gly Asp Phe Gly Leu Tyr Ala Leu He 110 115 120
ACG CTT ATG GTA GTT ATT TCT TAT TTG ATT CTA TTC AAA CTA CCC CCT 434 Thr Leu Met Val Val He Ser Tyr Leu He Leu Phe Lys Leu Pro Pro 125 130 135
AAA AGC GTT TTT TAT CCT TAT ATG AAC AAA ACA CAA AAC CTT TTA AAA 482 Lys Ser Val Phe Tyr Pro Tyr Met Asn Lys Thr Gin Asn Leu Leu Lys 140 145 150 155
GAG ATT TAC AAA CAA TGC TTA CAA GCC TTT AGC CCT AAT TTT AGC CCA 530 Glu He Tyr Lys Gin Cys Leu Gin Ala Phe Ser Pro Asn Phe Ser Pro 160 165 170
AAA AAA GAG GGT TTT GAA AAC ACC CCA TCA GAT ATT CAA AAA AAA GAA 578 Lys Lys Glu Gly Phe Glu Asn Thr Pro Ser Asp He Gin Lys Lys Glu 175 180 185
ACC AAA AAC GAC AAA GAA AAA GAA AAC CGC AAA GAA AAC CCT ATT AAT 626 Thr Lys Asn Asp Lys Glu Lys Glu Asn Arg Lys Glu Asn Pro He Asn 190 195 200
GAA AAC CAC AAA ACC CCT AAC GAA GAA CCG TTT TTA GCG ATC CCT ACC 674 Glu Asn His Lys Thr Pro Asn Glu Glu Pro Phe Leu Ala He Pro Thr 205 210 215
CCC TAT AAC ACG ACT TTA AAT GAT TCA GAG CCG CAA GAA GGC TTA GTC 722 Pro Tyr Asn Thr Thr Leu Asn Asp Ser Glu Pro Gin Glu Gly Leu Val 220 225 230 235
CAA ATT TCC TCC CAC CCC CCT ACC CAT TAC ACC ATT TAC CCT AAA AGA 770 Gin He Ser Ser His Pro Pro Thr His Tyr Thr He Tyr Pro Lys Arg 240 245 250
AAC CGA TTT GAT GAT TTG ACT AAC CCC ACT AAC CCC CCT TTA AAA GAA 818 Asn Arg Phe Asp Asp Leu Thr Asn Pro Thr Asn Pro Pro Leu Lys Glu 255 260 265
ATT AAA CAA GAA ACT AAA GAA AGA GAA CCC ACG CCT ACA AAA GAA ACT 866 He Lys Gin Glu Thr Lys Glu Arg Glu Pro Thr Pro Thr Lys Glu Thr 270 275 280
CTT ACG CCC ACC ACG CCC AAA CCT ATC ATG CCC ACA CTT GCA CCC ATA 914 Leu Thr Pro Thr Thr Pro Lys Pro He Met Pro Thr Leu Ala Pro He 285 290 295
ATA GAA AAT GAC AAC AAA ACA GAA AAC CAA AAA ACC CCC AAC CAC CCT 962 He Glu Asn Asp Asn Lys Thr Glu Asn Gin Lys Thr Pro Asn His Pro 300 305 310 315
AAA AAA GAA GAA AAC CCA CAA GAA AAC ACG CAA GAA GAA ATG ATA GAA 1010 Lys Lys Glu Glu Asn Pro Gin Glu Asn Thr Gin Glu Glu Met He Glu 320 325 330
GGA AGG ATA GAA GAA ATG ATA AAG GAA AAT CTA AAA AAA GAA GAA AAA 1058 Gly Arg He Glu Glu Met He Lys Glu Asn Leu Lys Lys Glu Glu Lys 335 340 345
GAA GTG CAA AAC GCT CCA AAC TTT AGC CCA GTA ACC CCC ACA AGC GCT 1106 Glu Val Gin Asn Ala Pro Asn Phe Ser Pro Val Thr Pro Thr Ser Ala 350 355 360
AAA AAA CCC GTT ATG GTT AAA GAA TTG AGC GAA AAT AAA GAG ATA TTA 1154 Lys Lys Pro Val Met Val Lys Glu Leu Ser Glu Asn Lys Glu He Leu 365 370 375
GAC GGA TTG GAT TAT GGC GAA GTG CAA AAA CCC AAA GAT TAT GAG CTT 1202 Asp Gly Leu Asp Tyr Gly Glu Val Gin Lys Pro Lys Asp Tyr Glu Leu 380 385 390 395
CCC ACC ACG CAA TTA TTG AAT GCG GTT TGT TTG AAA GAC ACT TCT TTA 1250 Pro Thr Thr Gin Leu Leu Asn Ala Val Cys Leu Lys Asp Thr Ser Leu 400 405 410
GAC GAA AAC GAG ATT GAC CAA AAA ATC CAG GAT CTA TTG AGC AAA CTG 1298 Asp Glu Asn Glu He Asp Gin Lys He Gin Asp Leu Leu Ser Lys Leu 415 420 425
CGC ACC TTT AAA ATT GAT GGC GAT ATT ATC CGC ACT TAT TCA GGC CCT 1346 Arg Thr Phe Lys He Asp Gly Asp He He Arg Thr Tyr Ser Gly Pro 430 435 440
ATT GTA ACC ACT TTT GAA TTC CGC CCA GCC CCT AAC GTT AAG GTG AGT 1394 He Val Thr Thr Phe Glu Phe Arg Pro Ala Pro Asn Val Lys Val Ser 445 450 455
CGT ATT TTA GGC TTG AGC GAT GAT TTA GCG ATG ACT TTA TGC GCT GAA 1442 Arg He Leu Gly Leu Ser Asp Asp Leu Ala Met Thr Leu Cys Ala Glu 460 465 470 475
TCC ATC CGC ATT CAA GCC CCT ATT AAG GGT AAA GAT GTC GTT GGC ATT 1490 Ser He Arg He Gin Ala Pro He Lys Gly Lys Asp Val Val Gly He 480 485 490
GAA ATC CCT AAC AGC CAA AGC CAA ATT ATT TAT TTA AGA GAA ATT CTA 1538 Glu He Pro Asn Ser Gin Ser Gin He He Tyr Leu Arg Glu He Leu 495 500 505
GAG AGC GAA TTG TTT CAA AAA TCC AGC TCG CCC TTA ACT CTA GCT TTA 1586 Glu Ser Glu Leu Phe Gin Lys Ser Ser Ser Pro Leu Thr Leu Ala Leu 510 515 520
GGC AAA GAC ATT GTG GGT AAC CCT TTC ATC ACG GAT TTA AAA AAG CTC 1634 Gly Lys Asp He Val Gly Asn Pro Phe He Thr Asp Leu Lys Lys Leu 525 530 535
CCC CAT TTG CTC ATC GCT GGC ACG ACA GGA AGC GGT AAG AGC GTG GGC 1682 Pro His Leu Leu He Ala Gly Thr Thr Gly Ser Gly Lys Ser Val Gly 540 545 550 555
GTG AAT GCG ATG ATT TTA TCC TTA CTT TAT AAA AAC CCT CCC GAT CAA 1730 Val Asn Ala Met He Leu Ser Leu Leu Tyr Lys Asn Pro Pro Asp Gin 560 565 570
CTC AAA TTA GTG ATG ATC GAT CCC AAA ATG GTA GAA TTT AGT ATT TAT 1778 Leu Lys Leu Val Met He Asp Pro Lys Met Val Glu Phe Ser He Tyr 575 580 585 GCG GAT ATC CCT CAT TTG CTC ACG CCC ATT ATC ACC GAC CCT AAA AAA 1826 Ala Asp He Pro His Leu Leu Thr Pro He He Thr Asp Pro Lys Lys 590 595 600
GCT ATT GGG GCT TTG CAA AGC GTG GCT AAA GAA ATG GAA CGC CGG TAT 1874 Ala He Gly Ala Leu Gin Ser Val Ala Lys Glu Met Glu Arg Arg Tyr 605 610 615
TCT TTA ATG AGC GAA TAC AAG GTT AAA ACC ATT GAT TCT TAT AAT GAA 1922 Ser Leu Met Ser Glu Tyr Lys Val Lys Thr He Asp Ser Tyr Asn Glu 620 625 630 635
CAA GCC CCA AGT AAC GGC GTT GAA GCG TTC CCC TAT TTG ATT GTG GTG 1970 Gin Ala Pro Ser Asn Gly Val Glu Ala Phe Pro Tyr Leu He Val Val 640 645 650
ATT GAT GAA TTA GCG GAT TTA ATG ATG ACA GGG GGC AAA GAA GCG GAG 2018 He Asp Glu Leu Ala Asp Leu Met Met Thr Gly Gly Lys Glu Ala Glu 655 660 665
TTT CCT ATC GCT AGA ATC GCT CAA ATG GGG CGC GCG AGC GGC TTA CAC 2066 Phe Pro He Ala Arg He Ala Gin Met Gly Arg Ala Ser Gly Leu His 670 675 680
CTC ATT GTA GCG ACC CAA CGC CCA AGC GTG GAT GTC GTA ACC GGC TTG 2114 Leu He Val Ala Thr Gin Arg Pro Ser Val Asp Val Val Thr Gly Leu 685 690 695
ATT AAA ACC AAC TTG CCT TCA AGG GTG AGT TTT AGG GTA GGC ACT AAG 2162 He Lys Thr Asn Leu Pro Ser Arg Val Ser Phe Arg Val Gly Thr Lys 700 705 710 715
ATT GAT TCT AAA GTG ATT TTA GAC ACT GAT GGG GCG CAA AGC TTG TTA 2210 He Asp Ser Lys Val He Leu Asp Thr Asp Gly Ala Gin Ser Leu Leu 720 725 730
GGA AGA GGC GAT ATG CTC TTT ACC CCC CCA GGA GCG AAC GGG TTA GTG 2258 Gly Arg Gly Asp Met Leu Phe Thr Pro Pro Gly Ala Asn Gly Leu Val 735 740 745
CGC TTG CAT GCC CCC TTT GCC ACT GAA GAT GAA ATC AAA AAA ATC GTG 2306 Arg Leu His Ala Pro Phe Ala Thr Glu Asp Glu He Lys Lys He Val 750 755 760
GAT TTT ATT AAA GCC CAA AAA GAA GTA CAA TAC GAT AAA GAT TTC TTG 2354 Asp Phe He Lys Ala Gin Lys Glu Val Gin Tyr Asp Lys Asp Phe Leu 765 770 775
CTA GAA GAA TCA CGC ATG CCT TTA GAC ACC CCT AAT TAT CAA GGC GAT 2402 Leu Glu Glu Ser Arg Met Pro Leu Asp Thr Pro Asn Tyr Gin Gly Asp 780 785 790 795
GAC ATT TTA GAA AGG GCT AAA GCG GTG ATT TTA GAA AAA AAG ATC ACT 2450 Asp He Leu Glu Arg Ala Lys Ala Val He Leu Glu Lys Lys He Thr 800 805 810 TCT ACG AGT TTT TTA CAA CGC CAA TTA AAA ATC GGC TAC AAC CAA GCC 2498 Ser Thr Ser Phe Leu Gin Arg Gin Leu Lys He Gly Tyr Asn Gin Ala 815 820 825
GCT ACC ATT ACT GAC GAA TTA GAA GCT CAA GGC TTT TTA TCC CCA AGA 2546 Ala Thr He Thr Asp Glu Leu Glu Ala Gin Gly Phe Leu Ser Pro Arg 830 835 840
AAC GCT AAA GGC AAC AGA GAG ATT TTG CAA AAC TTT TAGGCTTTGT TTTCAT 2598 Asn Ala Lys Gly Asn Arg Glu He Leu Gin Asn Phe 845 850 855
TGGATATTGG CAAACATTAT TTTTGATTT 2627
(2) INFORMATION FOR SEQ ID NO: 984:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 855 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 984:
Met Lys Ser Lys Lys Leu Tyr Leu Ala Leu He He Gly Val Leu Leu
1 5 10 15
Ala Phe Leu Thr Leu Ser Ser Trp Leu Gly Asn Ser Gly Leu Val Gly
20 25 30
Arg Phe Gly Val Trp Phe Ala Ala Leu Asn Lys Lys Tyr Phe Gly His
35 40 45
Leu Ser Phe He Asn Leu Pro Tyr Leu Ala Trp Val Leu Phe Leu Leu
50 55 60
Tyr Lys Thr Lys Asn Pro Phe Thr Glu He Val Leu Glu Lys Thr Leu 65 70 75 80
Gly His Leu Leu Gly He Leu Ser Leu Leu Phe Leu Gin Ser Ser Leu
85 90 95
Leu Asn Gin Gly Glu He Gly Asn Ser Ala Arg Leu Phe Leu Arg Pro
100 105 110
Phe He Gly Asp Phe Gly Leu Tyr Ala Leu He Thr Leu Met Val Val
115 120 125
He Ser Tyr Leu He Leu Phe Lys Leu Pro Pro Lys Ser Val Phe Tyr
130 135 140
Pro Tyr Met Asn Lys Thr Gin Asn Leu Leu Lys Glu He Tyr Lys Gin 145 150 155 160
Cys Leu Gin Ala Phe Ser Pro Asn Phe Ser Pro Lys Lys Glu Gly Phe
165 170 175
Glu Asn Thr Pro Ser Asp He Gin Lys Lys Glu Thr Lys Asn Asp Lys
180 185 190
Glu Lys Glu Asn Arg Lys Glu Asn Pro He Asn Glu Asn His Lys Thr
195 200 205
Pro Asn Glu Glu Pro Phe Leu Ala He Pro Thr Pro Tyr Asn Thr Thr
210 215 220
Leu Asn Asp Ser Glu Pro Gin Glu Gly Leu Val Gin He Ser Ser His 225 230 235 240
Pro Pro Thr His Tyr Thr He Tyr Pro Lys Arg Asn Arg Phe Asp Asp
245 250 255
Leu Thr Asn Pro Thr Asn Pro Pro Leu Lys Glu He Lys Gin Glu Thr
260 265 270
Lys Glu Arg Glu Pro Thr Pro Thr Lys Glu Thr Leu Thr Pro Thr Thr
275 280 285
Pro Lys Pro He Met Pro Thr Leu Ala Pro He He Glu Asn Asp Asn
290 295 300
Lys Thr Glu Asn Gin Lys Thr Pro Asn His Pro Lys Lys Glu Glu Asn 305 310 315 320
Pro Gin Glu Asn Thr Gin Glu Glu Met He Glu Gly Arg He Glu Glu
325 330 335
Met He Lys Glu Asn Leu Lys Lys Glu Glu Lys Glu Val Gin Asn Ala
340 345 350
Pro Asn Phe Ser Pro Val Thr Pro Thr Ser Ala Lys Lys Pro Val Met
355 360 365
Val Lys Glu Leu Ser Glu Asn Lys Glu He Leu Asp Gly Leu Asp Tyr
370 375 380
Gly Glu Val Gin Lys Pro Lys Asp Tyr Glu Leu Pro Thr Thr Gin Leu 385 390 395 400
Leu Asn Ala Val Cys Leu Lys Asp Thr Ser Leu Asp Glu Asn Glu He
405 410 415
Asp Gin Lys He Gin Asp Leu Leu Ser Lys Leu Arg Thr Phe Lys He
420 425 430
Asp Gly Asp He He Arg Thr Tyr Ser Gly Pro He Val Thr Thr Phe
435 440 445
Glu Phe Arg Pro Ala Pro Asn Val Lys Val Ser Arg He Leu Gly Leu
450 455 460
Ser Asp Asp Leu Ala Met Thr Leu Cys Ala Glu Ser He Arg He Gin 465 470 475 480
Ala Pro He Lys Gly Lys Asp Val Val Gly He Glu He Pro Asn Ser
485 490 495
Gin Ser Gin He He Tyr Leu Arg Glu He Leu Glu Ser Glu Leu Phe
500 505 510
Gin Lys Ser Ser Ser Pro Leu Thr Leu Ala Leu Gly Lys Asp He Val
515 520 525
Gly Asn Pro Phe He Thr Asp Leu Lys Lys Leu Pro His Leu Leu He
530 535 540
Ala Gly Thr Thr Gly Ser Gly Lys Ser Val Gly Val Asn Ala Met He 545 550 555 560
Leu Ser Leu Leu Tyr Lys Asn Pro Pro Asp Gin Leu Lys Leu Val Met
565 570 575
He Asp Pro Lys Met Val Glu Phe Ser He Tyr Ala Asp He Pro His
580 585 590
Leu Leu Thr Pro He He Thr Asp Pro Lys Lys Ala He Gly Ala Leu
595 600 605
Gin Ser Val Ala Lys Glu Met Glu Arg Arg Tyr Ser Leu Met Ser Glu
610 615 620
Tyr Lys Val Lys Thr He Asp Ser Tyr Asn Glu Gin Ala Pro Ser Asn 625 630 635 640
Gly Val Glu Ala Phe Pro Tyr Leu He Val Val He Asp Glu Leu Ala
645 650 655
Asp Leu Met Met Thr Gly Gly Lys Glu Ala Glu Phe Pro He Ala Arg 660 665 670 He Ala Gin Met Gly Arg Ala Ser Gly Leu His Leu He Val Ala Thr
675 680 685
Gin Arg Pro Ser Val Asp Val Val Thr Gly Leu He Lys Thr Asn Leu
690 695 700
Pro Ser Arg Val Ser Phe Arg Val Gly Thr Lys He Asp Ser Lys Val 705 710 715 720
He Leu Asp Thr Asp Gly Ala Gin Ser Leu Leu Gly Arg Gly Asp Met
725 730 735
Leu Phe Thr Pro Pro Gly Ala Asn Gly Leu Val Arg Leu His Ala Pro
740 745 750
Phe Ala Thr Glu Asp Glu He Lys Lys He Val Asp Phe He Lys Ala
755 760 765
Gin Lys Glu Val Gin Tyr Asp Lys Asp Phe Leu Leu Glu Glu Ser Arg
770 775 780
Met Pro Leu Asp Thr Pro Asn Tyr Gin Gly Asp Asp He Leu Glu Arg 785 790 795 800
Ala Lys Ala Val He Leu Glu Lys Lys He Thr Ser Thr Ser Phe Leu
805 810 815
Gin Arg Gin Leu Lys He Gly Tyr Asn Gin Ala Ala Thr He Thr Asp
820 825 830
Glu Leu Glu Ala Gin Gly Phe Leu Ser Pro Arg Asn Ala Lys Gly Asn
835 840 845
Arg Glu He Leu Gin Asn Phe 850 855
(2) INFORMATION FOR SEQ ID NO: 985:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1136 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1094 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 985:
AACCATAAAA ACGATACAAT AGCGGTATTT TAATAAAACA AGGAGTTTTA ATG AGA 56
Met Arg 1
GTT CAA TCT AAA GGT TTT GCT ATT TTT TCT AAA GAC GGG CAT TTC AAA 104 Val Gin Ser Lys Gly Phe Ala He Phe Ser Lys Asp Gly His Phe Lys 5 10 15
CCC CAT GAT TTT AGC CGC CAT GCT GTA GGC CCT AAA GAT GTG TTG ATT 152 Pro His Asp Phe Ser Arg His Ala Val Gly Pro Lys Asp Val Leu He 20 25 30 GAC ATT CTT TAT GCA GGG ATT TGT CAT AGC GAT ATT CAT AGC GCT TAT 200 Asp He Leu Tyr Ala Gly He Cys His Ser Asp He His Ser Ala Tyr 35 40 45 50
AGC GAA TGG AAA GAA GGC ATT TAC CCT ATG GTT CCT GGG CAT GAA ATT 248 Ser Glu Trp Lys Glu Gly He Tyr Pro Met Val Pro Gly His Glu He 55 60 65
GCT GGG GCC ATC AAA GAA GTG GGT AAG GAA GTT AAG AAA TTT AAG GTT 296 Ala Gly Ala He Lys Glu Val Gly Lys Glu Val Lys Lys Phe Lys Val 70 75 80
GGC GAT GTG GTG GGC GTG GGC TGT TTT GTC AAT TCA TGC AAA GCG TGT 344 Gly Asp Val Val Gly Val Gly Cys Phe Val Asn Ser Cys Lys Ala Cys 85 90 95
AAG CCC TGT AAA GAA CAC CAA GAG CAA TTT TGC GCC AAA GTG GTA TTC 392 Lys Pro Cys Lys Glu His Gin Glu Gin Phe Cys Ala Lys Val Val Phe 100 105 110
ACT TAC GAT TGT TTG GAT TAT TTC CAT GAC AAC GAA CCC CAC ATG GGC 440 Thr Tyr Asp Cys Leu Asp Tyr Phe His Asp Asn Glu Pro His Met Gly 115 120 125 130
GGA TAC TCT AAT AAT ATT GTA GTG GAT GAA AAC TAT GTG ATT AGC GTG 488 Gly Tyr Ser Asn Asn He Val Val Asp Glu Asn Tyr Val He Ser Val 135 140 145
GAT AAA AAC GCT CCT TTA GAA AAA GTA GCC CCC TTG CTT TGT GCG GGC 536 Asp Lys Asn Ala Pro Leu Glu Lys Val Ala Pro Leu Leu Cys Ala Gly 150 155 160
ATC ACC ACT TAT TCG CCC TTA AAA TTT TCT AAG GTT ACT AAA GGC ACA 584 He Thr Thr Tyr Ser Pro Leu Lys Phe Ser Lys Val Thr Lys Gly Thr 165 170 175
AAA GTT GGC GTC GCT GGG TTT GGC GGG CTA GGA AGC ATG GCG GTT AAA 632 Lys Val Gly Val Ala Gly Phe Gly Gly Leu Gly Ser Met Ala Val Lys 180 185 190
TAC GCT GTG GCT ATG GGG GCT GAA GTG AGC GTT TTT GCA AGA AAC GAA 680 Tyr Ala Val Ala Met Gly Ala Glu Val Ser Val Phe Ala Arg Asn Glu 195 200 205 210
CAC AAA AAG CAA GAC GCT TTG AGC ATG GGG GTT AAA CAT TTC TAC ACT 728 His Lys Lys Gin Asp Ala Leu Ser Met Gly Val Lys His Phe Tyr Thr 215 220 225
GAC CCC AAA CAA TGC AAA GAG GAA TTG GAC TTT ATC ATT TCA ACC ATT 776 Asp Pro Lys Gin Cys Lys Glu Glu Leu Asp Phe He He Ser Thr He 230 235 240
CCT ACC CAT TAT GAT TTA AAA GAC TAC CTC AAG CTC TTA ACT TAT AAT 824 Pro Thr His Tyr Asp Leu Lys Asp Tyr Leu Lys Leu Leu Thr Tyr Asn 245 250 255 GGC GAT CTA GCC CTT GTG GGA CTC CCC CCT GTA GAA ATC GCT CCA GCG 872 Gly Asp Leu Ala Leu Val Gly Leu Pro Pro Val Glu He Ala Pro Ala 260 265 270
CTT AGC GTT TTT GAT TTT ATC CAT TTA GGC AAT CGC AAG GTT TAT GGC 920 Leu Ser Val Phe Asp Phe He His Leu Gly Asn Arg Lys Val Tyr Gly 275 280 285 290
TCA TTG ATT GGG GGC ATT AAA GAA ACC CAA GAA ATG ATG GAT TTT TCT 968 Ser Leu He Gly Gly He Lys Glu Thr Gin Glu Met Met Asp Phe Ser 295 300 305
ATC AAA CAC AAT ATT TAC CCT GAA ATA GAT TTG ATC TTA GGC AAG GAT 1016 He Lys His Asn He Tyr Pro Glu He Asp Leu He Leu Gly Lys Asp 310 315 320
ATT GAC ACC GCT TAT CAT AAT CTA ACC CAT GGG AAA GCG AAA TTC CGC 1064 He Asp Thr Ala Tyr His Asn Leu Thr His Gly Lys Ala Lys Phe Arg 325 330 335
TAT GTG ATT GAT ATG AAA AAA TCG TTT GAT TAAAAGTTTT GGCTCTAGCT CTT 1117 Tyr Val He Asp Met Lys Lys Ser Phe Asp 340 345
TTTTAAGAGC TTGAGTTGG 1136
(2) INFORMATION FOR SEQ ID NO: 986:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 348 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:986:
Met Arg Val Gin Ser Lys Gly Phe Ala He Phe Ser Lys Asp Gly His
1 5 10 15
Phe Lys Pro His Asp Phe Ser Arg His Ala Val Gly Pro Lys Asp Val
20 25 30
Leu He Asp He Leu Tyr Ala Gly He Cys His Ser Asp He His Ser
35 40 45
Ala Tyr Ser Glu Trp Lys Glu Gly He Tyr Pro Met Val Pro Gly His
50 55 60
Glu He Ala Gly Ala He Lys Glu Val Gly Lys Glu Val Lys Lys Phe 65 70 75 80
Lys Val Gly Asp Val Val Gly Val Gly Cys Phe Val Asn Ser Cys Lys
85 90 95
Ala Cys Lys Pro Cys Lys Glu His Gin Glu Gin Phe Cys Ala Lys Val
100 105 110
Val Phe Thr Tyr Asp Cys Leu Asp Tyr Phe His Asp Asn Glu Pro His
115 120 125
Met Gly Gly Tyr Ser Asn Asn He Val Val Asp Glu Asn Tyr Val He 130 135 140
Ser Val Asp Lys Asn Ala Pro Leu Glu Lys Val Ala Pro Leu Leu Cys 145 150 155 160
Ala Gly He Thr Thr Tyr Ser Pro Leu Lys Phe Ser Lys Val Thr Lys
165 170 175
Gly Thr Lys Val Gly Val Ala Gly Phe Gly Gly Leu Gly Ser Met Ala
180 185 190
Val Lys Tyr Ala Val Ala Met Gly Ala Glu Val Ser Val Phe Ala Arg
195 200 205
Asn Glu His Lys Lys Gin Asp Ala Leu Ser Met Gly Val Lys His Phe
210 215 220
Tyr Thr Asp Pro Lys Gin Cys Lys Glu Glu Leu Asp Phe He He Ser 225 230 235 240
Thr He Pro Thr His Tyr Asp Leu Lys Asp Tyr Leu Lys Leu Leu Thr
245 250 255
Tyr Asn Gly Asp Leu Ala Leu Val Gly Leu Pro Pro Val Glu He Ala
260 265 270
Pro Ala Leu Ser Val Phe Asp Phe He His Leu Gly Asn Arg Lys Val
275 280 285
Tyr Gly Ser Leu He Gly Gly He Lys Glu Thr Gin Glu Met Met Asp
290 295 300
Phe Ser He Lys His Asn He Tyr Pro Glu He Asp Leu He Leu Gly 305 310 315 320
Lys Asp He Asp Thr Ala Tyr His Asn Leu Thr His Gly Lys Ala Lys
325 330 335
Phe Arg Tyr Val He Asp Met Lys Lys Ser Phe Asp 340 345
(2) INFORMATION FOR SEQ ID NO: 987:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1378 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25...1317 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 987:
TTAAAAAAGG GTGTTTAATT TTTT ATG ACT TCA GCT TCA AGC CAT TCT TTT 51
Met Thr Ser Ala Ser Ser His Ser Phe
1 5
AAA GAA CAA GAT TTT CAT ATT CCT ATC GCT TTT GCT TTT GAT AAG AAT 99 Lys Glu Gin Asp Phe His He Pro He Ala Phe Ala Phe Asp Lys Asn 10 15 20 25 TAC CTC ATT CCT GCG GGC GCG TGT CTT TAT TCC TTG CTA GAA AGC ATC 147 Tyr Leu He Pro Ala Gly Ala Cys Leu Tyr Ser Leu Leu Glu Ser He 30 35 40
GCT AAA GCC AAT AAA AAA ATC CGT TAC ACC CTA CAC GCT TTA GTG GTA 195 Ala Lys Ala Asn Lys Lys He Arg Tyr Thr Leu His Ala Leu Val Val 45 50 55
GGC TTG AAT GAA GAA GAT AAA GCA AAG CTT AAT CAA ATC ACA GAG CCT 243 Gly Leu Asn Glu Glu Asp Lys Ala Lys Leu Asn Gin He Thr Glu Pro 60 65 70
TTT AAA GAA TTT GCC GCT TTG GAA GTG AGA GAT ATT GAG TCT TTT TTA 291 Phe Lys Glu Phe Ala Ala Leu Glu Val Arg Asp He Glu Ser Phe Leu 75 80 85
GAC ACT ATC CCT AAC CCT TTT GAT GAG GAT TTC ACT AAG CGT TTT TCT 339 Asp Thr He Pro Asn Pro Phe Asp Glu Asp Phe Thr Lys Arg Phe Ser 90 95 100 105
AAA ATG GTG TTA GTG AAG TAT TTT TTG GCG GAT TTG TTC CCC AAA TAT 387 Lys Met Val Leu Val Lys Tyr Phe Leu Ala Asp Leu Phe Pro Lys Tyr 110 115 120
TCC AAA ATG GTG TGG AGC GAT GTG GAT GTC ATC TTT TGC AAT GAA TTT 435 Ser Lys Met Val Trp Ser Asp Val Asp Val He Phe Cys Asn Glu Phe 125 130 135
AGC GCT GAT TTC TTA AAC CTT GAA GAA AAT GAT GAG AAT TAT TTT TAT 483 Ser Ala Asp Phe Leu Asn Leu Glu Glu Asn Asp Glu Asn Tyr Phe Tyr 140 145 150
GGA GTT TTA GAA GTT GAA AAG CAC CAC ATG ATG GAA GGG TTT TTG TTT 531 Gly Val Leu Glu Val Glu Lys His His Met Met Glu Gly Phe Leu Phe 155 160 165
TGC AAT TTA GAT TAC CAG CGC AAG AAA AAT TTC ACC TTA AGA ATG CAT 579 Cys Asn Leu Asp Tyr Gin Arg Lys Lys Asn Phe Thr Leu Arg Met His 170 175 180 185
GAG CTT TTA AGG GGG AAT GAG GCT AAA GGG GAG TTG GAT TTC ACG AAA 627 Glu Leu Leu Arg Gly Asn Glu Ala Lys Gly Glu Leu Asp Phe Thr Lys 190 195 200
TGG TGT TGG CCT AAC ATG AAA GCT TTA GGG ATT GAA TAT TGC GTT TTC 675 Trp Cys Trp Pro Asn Met Lys Ala Leu Gly He Glu Tyr Cys Val Phe 205 210 215
CCT TAT TAT TAC ACC ATT AAA GAT TTT TCT AAC GCG TAT TTA AAC GAG 723 Pro Tyr Tyr Tyr Thr He Lys Asp Phe Ser Asn Ala Tyr Leu Asn Glu 220 225 230
AAT TAC AAG AAA ACC ATT TTA GAG GCA CGA GAA AAC CCT ACC ATT ATC 771 Asn Tyr Lys Lys Thr He Leu Glu Ala Arg Glu Asn Pro Thr He He 235 240 245 CAC TAT GAC GCT TGG TGG GGA GCG GTG AAG CCT TGG GAC TAT CCT TTT 819 His Tyr Asp Ala Trp Trp Gly Ala Val Lys Pro Trp Asp Tyr Pro Phe 250 255 260 265
GGT TTA AAA GCG GAT TTA TGG CTG AAC GCT TTG GCT AAA ACC CCT TTT 867 Gly Leu Lys Ala Asp Leu Trp Leu Asn Ala Leu Ala Lys Thr Pro Phe 270 275 280
ATG AGC GAT TGG ATT GAT TCG ATC GCT AGG GTG GAA ATA GGC AGC GAA 915 Met Ser Asp Trp He Asp Ser He Ala Arg Val Glu He Gly Ser Glu 285 290 295
AAA TGG CAT CGT TAC CAC AGC ATC GTT GCC TAT CAC TAC TAC TTT CCC 963 Lys Trp His Arg Tyr His Ser He Val Ala Tyr His Tyr Tyr Phe Pro 300 305 310
CTA TGG AAG ACT GAA GAG CAG ATC GCC CAT GAC GCA CTC AAG ACC TTT 1011 Leu Trp Lys Thr Glu Glu Gin He Ala His Asp Ala Leu Lys Thr Phe 315 320 325
TTA GAC CAT TAT TTT TCG TGC ATC CAT GCC GCA ATC AAG CAA GAA AAT 1059 Leu Asp His Tyr Phe Ser Cys He His Ala Ala He Lys Gin Glu Asn 330 335 340 345
CTC GGA ATG TTC TTG AAC CAC TAC TTC TCG CAT GCC CAT GCA GAG ATC 1107 Leu Gly Met Phe Leu Asn His Tyr Phe Ser His Ala His Ala Glu He 350 355 360
AAA GAA AAC TCC CTT GAA ATG TTC TTG AAC CAC TAC TTC TCG CAT GTT 1155 Lys Glu Asn Ser Leu Glu Met Phe Leu Asn His Tyr Phe Ser His Val 365 370 375
TAT AGG CTC CCT AAA AAA GCA CGG AAG AGA CTC TTT AGG GTG TTT GTC 1203 Tyr Arg Leu Pro Lys Lys Ala Arg Lys Arg Leu Phe Arg Val Phe Val 380 385 390
AAA CAC TGC ATC CTC ATA CCA CTC AAG AGC CTT GTG GGT AAG ACT CTA 1251 Lys His Cys He Leu He Pro Leu Lys Ser Leu Val Gly Lys Thr Leu 395 400 405
CGA CTC TTA AAA CTC CAT GCG CTA GCT AAA AAA ATC CTA ATC CAA CTC 1299 Arg Leu Leu Lys Leu His Ala Leu Ala Lys Lys He Leu He Gin Leu 410 415 420 425
AAG CTC TTA AAA AAG AGC TAGAGCCAAA ACTTTTAATC AAACGATTTT TTCATATC 1355 Lys Leu Leu Lys Lys Ser 430
AATCACATAG CGGAATTTCG CTT 137 £
(2) INFORMATION FOR SEQ ID NO: 988:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 431 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 988:
Met Thr Ser Ala Ser Ser His Ser Phe Lys Glu Gin Asp Phe His He
1 5 10 15
Pro He Ala Phe Ala Phe Asp Lys Asn Tyr Leu He Pro Ala Gly Ala
20 25 30
Cys Leu Tyr Ser Leu Leu Glu Ser He Ala Lys Ala Asn Lys Lys He
35 40 45
Arg Tyr Thr Leu His Ala Leu Val Val Gly Leu Asn Glu Glu Asp Lys
50 55 60
Ala Lys Leu Asn Gin He Thr Glu Pro Phe Lys Glu Phe Ala Ala Leu 65 70 75 80
Glu Val Arg Asp He Glu Ser Phe Leu Asp Thr He Pro Asn Pro Phe
85 90 95
Asp Glu Asp Phe Thr Lys Arg Phe Ser Lys Met Val Leu Val Lys Tyr
100 105 110
Phe Leu Ala Asp Leu Phe Pro Lys Tyr Ser Lys Met Val Trp Ser Asp
115 120 125
Val Asp Val He Phe Cys Asn Glu Phe Ser Ala Asp Phe Leu Asn Leu
130 135 140
Glu Glu Asn Asp Glu Asn Tyr Phe Tyr Gly Val Leu Glu Val Glu Lys 145 150 155 160
His His Met Met Glu Gly Phe Leu Phe Cys Asn Leu Asp Tyr Gin Arg
165 170 175
Lys Lys Asn Phe Thr Leu Arg Met His Glu Leu Leu Arg Gly Asn Glu
180 185 190
Ala Lys Gly Glu Leu Asp Phe Thr Lys Trp Cys Trp Pro Asn Met Lys
195 200 205
Ala Leu Gly He Glu Tyr Cys Val Phe Pro Tyr Tyr Tyr Thr He Lys
210 215 220
Asp Phe Ser Asn Ala Tyr Leu Asn Glu Asn Tyr Lys Lys Thr He Leu 225 230 235 240
Glu Ala Arg Glu Asn Pro Thr He He His Tyr Asp Ala Trp Trp Gly
245 250 255
Ala Val Lys Pro Trp Asp Tyr Pro Phe Gly Leu Lys Ala Asp Leu Trp
260 265 270
Leu Asn Ala Leu Ala Lys Thr Pro Phe Met Ser Asp Trp He Asp Ser
275 280 285
He Ala Arg Val Glu He Gly Ser Glu Lys Trp His Arg Tyr His Ser
290 295 300
He Val Ala Tyr His Tyr Tyr Phe Pro Leu Trp Lys Thr Glu Glu Gin 305 310 315 320
He Ala His Asp Ala Leu Lys Thr Phe Leu Asp His Tyr Phe Ser Cys
325 330 335
He His Ala Ala He Lys Gin Glu Asn Leu Gly Met Phe Leu Asn His
340 345 350
Tyr Phe Ser His Ala His Ala Glu He Lys Glu Asn Ser Leu Glu Met
355 360 365
Phe Leu Asn His Tyr Phe Ser His Val Tyr Arg Leu Pro Lys Lys Ala 370 375 380 Arg Lys Arg Leu Phe Arg Val Phe Val Lys His Cys He Leu He Pro 385 390 395 400
Leu Lys Ser Leu Val Gly Lys Thr Leu Arg Leu Leu Lys Leu His Ala
405 410 415
Leu Ala Lys Lys He Leu He Gin Leu Lys Leu Leu Lys Lys Ser 420 425 430
(2) INFORMATION FOR SEQ ID NO: 989:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 650 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 46...603 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 989:
GCTGAATTCA ATTTATTTTA TACGATATTA AGGAGACATA TTACC ATG TTT CAA ATT 57
Met Phe Gin He
1
AGA TGG CAT GCA CGA GCG GGT CAA GGT GCA ATC ACT GGC GCT AAA GGG 105 Arg Trp His Ala Arg Ala Gly Gin Gly Ala He Thr Gly Ala Lys Gly 5 10 15 20
TTG GCT GAT GTG ATT TCA AAA ACA GGC AAA GAA GTG CAA GCG TTC GCT 153 Leu Ala Asp Val He Ser Lys Thr Gly Lys Glu Val Gin Ala Phe Ala 25 30 35
TCT TAT GGT TCA GCT AAA AGG GGG GCT GCT ATG ATG GCT TAT AAC CGC 201 Ser Tyr Gly Ser Ala Lys Arg Gly Ala Ala Met Met Ala Tyr Asn Arg 40 45 50
GTT GAT GAT GAA CCT ATC TTA AAC CAT GAA CGC TTC ATG CAG CCT GAT 249 Val Asp Asp Glu Pro He Leu Asn His Glu Arg Phe Met Gin Pro Asp 55 60 65
TAT GTG CTG GTG ATT GAC CCT GGT TTG GTT TTC ATT GAA AAC ATC TTC 297 Tyr Val Leu Val He Asp Pro Gly Leu Val Phe He Glu Asn He Phe 70 75 80
GCC AAT GAA AAA GAA GAC ACG ACT TAT ATT ATC ACT AGC TAC CTT AAC 345 Ala Asn Glu Lys Glu Asp Thr Thr Tyr He He Thr Ser Tyr Leu Asn 85 90 95 100
AAA GAA GAA TTG TTT GAA AAA AAA CCT GAA TTA AAA ACC CGT AAG GTG 393 Lys Glu Glu Leu Phe Glu Lys Lys Pro Glu Leu Lys Thr Arg Lys Val 105 110 115
TTT TTA GTG GAT TGT TTA AAA ATC TCT ATG GAA ACC TTA AAA CGC CCC 441 Phe Leu Val Asp Cys Leu Lys He Ser Met Glu Thr Leu Lys Arg Pro 120 125 130
ATC CCT AAC ACG CCC ATG TTA GGG GCG TTA ATG AAA GTG TCT GGC ATG 489 He Pro Asn Thr Pro Met Leu Gly Ala Leu Met Lys Val Ser Gly Met 135 140 145
CTT GAA ATT GGG GCT TTT AAA GAA GCT TTT AAG AAA GTT TTA GGC AAA 537 Leu Glu He Gly Ala Phe Lys Glu Ala Phe Lys Lys Val Leu Gly Lys 150 155 160
AAA CTC ACG CAA GAA GTC ATT GAC GCT AAC ATG CTC GCT ATC CAA AGA 585 Lys Leu Thr Gin Glu Val He Asp Ala Asn Met Leu Ala He Gin Arg 165 170 175 180
GCT TAT GAA GAA GTT CAA TAACATTAAG GAACAAAGAT GAAAGATTGG AACGAATT 641 Ala Tyr Glu Glu Val Gin 185
TGAAATGGG 650
(2) INFORMATION FOR SEQ ID NO: 990:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 186 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 990:
Met Phe Gin He Arg Trp His Ala Arg Ala Gly Gin Gly Ala He Thr
1 5 10 15
Gly Ala Lys Gly Leu Ala Asp Val He Ser Lys Thr Gly Lys Glu Val
20 25 30
Gin Ala Phe Ala Ser Tyr Gly Ser Ala Lys Arg Gly Ala Ala Met Met
35 40 45
Ala Tyr Asn Arg Val Asp Asp Glu Pro He Leu Asn His Glu Arg Phe
50 55 60
Met Gin Pro Asp Tyr Val Leu Val He Asp Pro Gly Leu Val Phe He '65 70 75 80
Glu Asn He Phe Ala Asn Glu Lys Glu Asp Thr Thr Tyr He He Thr
85 90 95
Ser Tyr Leu Asn Lys Glu Glu Leu Phe Glu Lys Lys Pro Glu Leu Lys
100 105 110
Thr Arg Lys Val Phe Leu Val Asp Cys Leu Lys He Ser Met Glu Thr
115 120 125
Leu Lys Arg Pro He Pro Asn Thr Pro Met Leu Gly Ala Leu Met Lys 130 135 140 Val Ser Gly Met Leu Glu He Gly Ala Phe Lys Glu Ala Phe Lys Lys 145 150 155 160
Val Leu Gly Lys Lys Leu Thr Gin Glu Val He Asp Ala Asn Met Leu
165 170 175
Ala He Gin Arg Ala Tyr Glu Glu Val Gin
180 185
(2) INFORMATION FOR SEQ ID NO: 991:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1008 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...954 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 991:
TTGGGGATTT TAACTTTT ATG GAT TTT TGC TCT GGC ATT GGT GGA GGC CGT 51
Met Asp Phe Cys Ser Gly He Gly Gly Gly Arg 1 5 10
TTG GGC TTG GAG CAA TGC CAT TTA AAA TGC GTA GGG CAT GCA GAA ATC 99 Leu Gly Leu Glu Gin Cys His Leu Lys Cys Val Gly His Ala Glu He 15 20 25
AAT CAT GAA GCC CTT AGG ACT TAT GAA TTA TTT TTT AAA GAT ACC CAT 147 Asn His Glu Ala Leu Arg Thr Tyr Glu Leu Phe Phe Lys Asp Thr His 30 35 40
AAT TTT GGG GAT TTG ATG CGA ATC AAC CCT AAT GAT TTA CCC GAT TTT 195 Asn Phe Gly Asp Leu Met Arg He Asn Pro Asn Asp Leu Pro Asp Phe 45 50 55
GAT GCA CTC ATT AGC GGG TTT CCT TGT CAA GCT TTT TCT ATC AAT GGC 243 Asp Ala Leu He Ser Gly Phe Pro Cys Gin Ala Phe Ser He Asn Gly 60 65 70 75
AAA AGG AAG GGG CTT GAA GAT GAA AGA GGG ACG ATT ATT TAC GGG CTT 291 Lys Arg Lys Gly Leu Glu Asp Glu Arg Gly Thr He He Tyr Gly Leu 80 85 90
ATT CGC ATT TTA AAA GTT AAA CAG CCT GAA TGT TTC TTG CTT GAA AAT 339 He Arg He Leu Lys Val Lys Gin Pro Glu Cys Phe Leu Leu Glu Asn 95 100 105
GTT AAG GGC TTG ATC AAT CAT AAT AAA AAG GCA ACT TTT AAT ATT ATT 387 Val Lys Gly Leu He Asn His Asn Lys Lys Ala Thr Phe Asn He He 110 115 120
ATC AAA GCC CTA CAA GAA GTG GGT TAT ACA ACT TAT TAT AAA ATT TTA 435 He Lys Ala Leu Gin Glu Val Gly Tyr Thr Thr Tyr Tyr Lys He Leu 125 130 135
AAC AGC GCT GAT TTT CAA TTA GCC CAA AAT AGA GAA CGC CTT TAT ATC 483 Asn Ser Ala Asp Phe Gin Leu Ala Gin Asn Arg Glu Arg Leu Tyr He 140 145 150 155
GTA GGG TTT AGG AAG GAT TTA AAA CAC CCA TTT AAT TTC CCT TTA GGT 531 Val Gly Phe Arg Lys Asp Leu Lys His Pro Phe Asn Phe Pro Leu Gly 160 165 170
TTA GCC AAT GAT TAT TAT TTC AAG GAT TTT TTA GAC GCT GAT AAT GAA 579 Leu Ala Asn Asp Tyr Tyr Phe Lys Asp Phe Leu Asp Ala Asp Asn Glu 175 180 185
TGT TAT TTG GAT GTG AGT AAC GCT GCA TTT CAA AGA TAC TTG CAC AAC 627 Cys Tyr Leu Asp Val Ser Asn Ala Ala Phe Gin Arg Tyr Leu His Asn 190 195 200
CGA TAC AAC CAT AAC CGG GTT TCT TTA GAG GAT CTC TTA ACT TTA GAA 675 Arg Tyr Asn His Asn Arg Val Ser Leu Glu Asp Leu Leu Thr Leu Glu 205 210 215
AAC GCT GTT TTA GAC ACA AGA CAA TCT GAT TTA AGG TTG TAT TCT AAT 723 Asn Ala Val Leu Asp Thr Arg Gin Ser Asp Leu Arg Leu Tyr Ser Asn 220 225 230 235
GTT TTT CCT ACT TTA AGG ACT TCT CGG CAT GGC CTG TTT TAT ACC CAA 771 Val Phe Pro Thr Leu Arg Thr Ser Arg His Gly Leu Phe Tyr Thr Gin 240 245 250
AAA GGC AAA ATC AAA AGA TTA AAC GCT ATT GAA AGC TTG CTT TTG CAA 819 Lys Gly Lys He Lys Arg Leu Asn Ala He Glu Ser Leu Leu Leu Gin 255 260 265
GGA TTT CCT AGG GAT TTG ATC GCT AAG ATT AAA GAT AAT CCT AAC TTT 867 Gly Phe Pro Arg Asp Leu He Ala Lys He Lys Asp Asn Pro Asn Phe 270 275 280
AAA GCA AGC CAT TTG CTA TCC CAA GCG GGG AAT GCG ATG AGC GTG AAT 915 Lys Ala Ser His Leu Leu Ser Gin Ala Gly Asn Ala Met Ser Val Asn 285 290 295
GTG ATT GCT GCA ATC GCT AAA CAA ATG TTA AAG GCG ATT TAATAAGGGA GC 966 Val He Ala Ala He Ala Lys Gin Met Leu Lys Ala He 300 305 310
TTTAAGGGGA GAATGATTTC AAAATACCCC CTATCCCCTT AA 100£
(2) INFORMATION FOR SEQ ID NO:992: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 312 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 992:
Met Asp Phe Cys Ser Gly He Gly Gly Gly Arg Leu Gly Leu Glu Gin
1 5 10 15
Cys His Leu Lys Cys Val Gly His Ala Glu He Asn His Glu Ala Leu
20 25 30
Arg Thr Tyr Glu Leu Phe Phe Lys Asp Thr His Asn Phe Gly Asp Leu
35 40 45
Met Arg He Asn Pro Asn Asp Leu Pro Asp Phe Asp Ala Leu He Ser
50 55 60
Gly Phe Pro Cys Gin Ala Phe Ser He Asn Gly Lys Arg Lys Gly Leu 65 70 75 80
Glu Asp Glu Arg Gly Thr He He Tyr Gly Leu He Arg He Leu Lys
85 90 95
Val Lys Gin Pro Glu Cys Phe Leu Leu Glu Asn Val Lys Gly Leu He
100 105 110
Asn His Asn Lys Lys Ala Thr Phe Asn He He He Lys Ala Leu Gin
115 120 125
Glu Val Gly Tyr Thr Thr Tyr Tyr Lys He Leu Asn Ser Ala Asp Phe
130 135 140
Gin Leu Ala Gin Asn Arg Glu Arg Leu Tyr He Val Gly Phe Arg Lys 145 150 155 160
Asp Leu Lys His Pro Phe Asn Phe Pro Leu Gly Leu Ala Asn Asp Tyr
165 170 175
Tyr Phe Lys Asp Phe Leu Asp Ala Asp Asn Glu Cys Tyr Leu Asp Val
180 185 190
Ser Asn Ala Ala Phe Gin Arg Tyr Leu His Asn Arg Tyr Asn His Asn
195 200 205
Arg Val Ser Leu Glu Asp Leu Leu Thr Leu Glu Asn Ala Val Leu Asp
210 215 220
Thr Arg Gin Ser Asp Leu Arg Leu Tyr Ser Asn Val Phe Pro Thr Leu 225 230 235 240
Arg Thr Ser Arg His Gly Leu Phe Tyr Thr Gin Lys Gly Lys He Lys
245 250 255
Arg Leu Asn Ala He Glu Ser Leu Leu Leu Gin Gly Phe Pro Arg Asp
260 265 270
Leu He Ala Lys He Lys Asp Asn Pro Asn Phe Lys Ala Ser His Leu
275 280 285
Leu Ser Gin Ala Gly Asn Ala Met Ser Val Asn Val He Ala Ala He
290 295 300
Ala Lys Gin Met Leu Lys Ala He 305 310
(2) INFORMATION FOR SEQ ID NO:993:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1468 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 30...1436 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:993:
AAAATAAAAA TTATATTAAT CAAGGAGCG ATG AAA GCG ATG GAA GGT AAA ATC 53
Met Lys Ala Met Glu Gly Lys He 1 5
ATT CAG GTT TTA GGC CCT GTG GTA GAT GTG GAG TTT GAA TCC TAT CTG 101 He Gin Val Leu Gly Pro Val Val Asp Val Glu Phe Glu Ser Tyr Leu 10 15 20
CCG GCG ATT TTT GAA GCG TTA GAC ATT AAT TTT GAA GTC AAT GGT GTT 149 Pro Ala He Phe Glu Ala Leu Asp He Asn Phe Glu Val Asn Gly Val 25 30 35 40
CAA AAG TCT TTA GTT TTA GAG GTG GCA GCC CAT TTG GGC GGT AAT CGG 197 Gin Lys Ser Leu Val Leu Glu Val Ala Ala His Leu Gly Gly Asn Arg 45 50 55
GTG CGA GCG ATT GCT ATG GAT ATG ACA GAA GGC TTA GTG CGT AAC CAA 245 Val Arg Ala He Ala Met Asp Met Thr Glu Gly Leu Val Arg Asn Gin 60 65 70
GTG ATC AAG GCT CGC GGC AAA ATG ATT GAA GTG CCT GTG GGC GAA GAA 293 Val He Lys Ala Arg Gly Lys Met He Glu Val Pro Val Gly Glu Glu 75 80 85
GTA TTA GGG CGT ATT TTT AAT GTT GTG GGC GAG AGC ATT GAC AAT TTA 341 Val Leu Gly Arg He Phe Asn Val Val Gly Glu Ser He Asp Asn Leu 90 95 100
GAG CCG CTT AAG CCG TCC TTA ACT TGG CCC ATT CAC AGA AAA GCC CCT 389 Glu Pro Leu Lys Pro Ser Leu Thr Trp Pro He His Arg Lys Ala Pro 105 110 115 120
AGT TTT GAG CAG CAA AGC ACT AAA ACA GAA ATG TTT GAA ACT GGT ATT 437 Ser Phe Glu Gin Gin Ser Thr Lys Thr Glu Met Phe Glu Thr Gly He 125 130 135
AAA GTC ATT GAC TTA CTC GCG CCT TAT TCT AAG GGC GGT AAA GTA GGC 485 Lys Val He Asp Leu Leu Ala Pro Tyr Ser Lys Gly Gly Lys Val Gly 140 145 150 TTG TTT GGT GGG GCT GGC GTA GGC AAA ACG GTG ATC ATT ATG GAG CTT 533 Leu Phe Gly Gly Ala Gly Val Gly Lys Thr Val He He Met Glu Leu 155 160 165
ATC CAT AAT GTG GCT TAT AAG CAT AAC GGG TAT TCG GTG TTT GCA GGT 581 He His Asn Val Ala Tyr Lys His Asn Gly Tyr Ser Val Phe Ala Gly 170 175 180
GTG GGG GAG CGC ACC AGA GAG GGG AAT GAT CTG TAT TTT GAA ATG AAA 629 Val Gly Glu Arg Thr Arg Glu Gly Asn Asp Leu Tyr Phe Glu Met Lys 185 190 195 200
GAA GGG GGC GTT TTA GAC AAA GTC GCA CTG TGT TAT GGG CAA ATG AAT 677 Glu Gly Gly Val Leu Asp Lys Val Ala Leu Cys Tyr Gly Gin Met Asn 205 210 215
GAG CCA CCA GGC GCG AGG AAC CGC ATC GCA TTC ACC GGC TTG ACG ATG 725 Glu Pro Pro Gly Ala Arg Asn Arg He Ala Phe Thr Gly Leu Thr Met 220 225 230
GCG GAG TAT TTT CGT GAT GAA AAG GGC TTA GAT GTG TTG ATG TTT ATT 773 Ala Glu Tyr Phe Arg Asp Glu Lys Gly Leu Asp Val Leu Met Phe He 235 240 245
GAC AAC ATC TTT AGA TAC GCT CAA AGC GGT GCG GAA ATG AGC GCG CTA 821 Asp Asn He Phe Arg Tyr Ala Gin Ser Gly Ala Glu Met Ser Ala Leu 250 255 260
TTA GGC CGT ATC CCT TCA GCG GTG GGG TAT CAG CCC ACG CTA GCC GGG 869 Leu Gly Arg He Pro Ser Ala Val Gly Tyr Gin Pro Thr Leu Ala Gly 265 270 275 280
GAA ATG GGG AAA CTT CAA GAG CGT ATC GCT TCC ACT AAA AAT GGC TCT 917 Glu Met Gly Lys Leu Gin Glu Arg He Ala Ser Thr Lys Asn Gly Ser 285 290 295
ATC ACT TCC GTT CAA GCG GTG TAT GTG CCA GCA GAT GAC TTG ACT GAC 965 He Thr Ser Val Gin Ala Val Tyr Val Pro Ala Asp Asp Leu Thr Asp 300 305 310
CCA GCC CCT GCT TCG GTG TTT GCG CAT TTG GAT GCG ACT ACG GTG TTG 1013 Pro Ala Pro Ala Ser Val Phe Ala His Leu Asp Ala Thr Thr Val Leu 315 320 325
AAT AGA AAG ATC GCT GAA AAA GGG ATT TAT CCG GCG GTG GAT CCT TTG 1061 Asn Arg Lys He Ala Glu Lys Gly He Tyr Pro Ala Val Asp Pro Leu 330 335 340
GAT TCC ACT TCA AGG ATT TTA AGC CCT CAA ATG ATC GGT GAG AAA CAC 1109 Asp Ser Thr Ser Arg He Leu Ser Pro Gin Met He Gly Glu Lys His 345 350 355 360
TAT GAA GTC GCT ACC GGT ATC CAG CAG GTT TTA CAA AAA TAC AAG GAT 1157 Tyr Glu Val Ala Thr Gly He Gin Gin Val Leu Gin Lys Tyr Lys Asp 365 370 375 TTG CAA GAC ATT ATT GCG ATT TTG GGA TTA GAC GAA TTG AGC GAA GAG 1205 Leu Gin Asp He He Ala He Leu Gly Leu Asp Glu Leu Ser Glu Glu 380 385 390
GAT AAA AAA ACG GTT GAA AGG GCC AGA AAA ATT GAG AAG TTT TTA TCC 1253 Asp Lys Lys Thr Val Glu Arg Ala Arg Lys He Glu Lys Phe Leu Ser 395 400 405
CAG CCG TTC TTT GTG GCT GAA GTG TTT ACA GGA AGT CCT GGT AAA TAT 1301 Gin Pro Phe Phe Val Ala Glu Val Phe Thr Gly Ser Pro Gly Lys Tyr 410 415 420
GTA ACC CTT CAA GAG ACT TTA GAG GGC TTT GGA GGG ATT TTA GAG GGC 1349 Val Thr Leu Gin Glu Thr Leu Glu Gly Phe Gly Gly He Leu Glu Gly 425 430 435 440
AAA TAC GAT CAT ATT CCC GAG AAC GCG TTT TAT ATG GTG GGT AGC ATT 1397 Lys Tyr Asp His He Pro Glu Asn Ala Phe Tyr Met Val Gly Ser He 445 450 455
CAA GAG GTT TTA GAA AAA GCT AAA AAC ATG AAA AAT TCC TAAGGGTTTT GT 1448 Gin Glu Val Leu Glu Lys Ala Lys Asn Met Lys Asn Ser 460 465
GATGGCTTTG TTGAAAATTA 1468
(2) INFORMATION FOR SEQ ID NO: 994:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 469 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 994:
Met Lys Ala Met Glu Gly Lys He He Gin Val Leu Gly Pro Val Val
1 5 10 15
Asp Val Glu Phe Glu Ser Tyr Leu Pro Ala He Phe Glu Ala Leu Asp
20 25 30
He Asn Phe Glu Val Asn Gly Val Gin Lys Ser Leu Val Leu Glu Val
35 40 45
Ala Ala His Leu Gly Gly Asn Arg Val Arg Ala He Ala Met Asp Met
50 55 60
Thr Glu Gly Leu Val Arg Asn Gin Val He Lys Ala Arg Gly Lys Met 65 70 75 80
He Glu Val Pro Val Gly Glu Glu Val Leu Gly Arg He Phe Asn Val
85 90 95
Val Gly Glu Ser He Asp Asn Leu Glu Pro Leu Lys Pro Ser Leu Thr
100 105 110
Trp Pro He His Arg Lys Ala Pro Ser Phe Glu Gin Gin Ser Thr Lys
115 120 125
Thr Glu Met Phe Glu Thr Gly He Lys Val He Asp Leu Leu Ala Pro 130 135 140
Tyr Ser Lys Gly Gly Lys Val Gly Leu Phe Gly Gly Ala Gly Val Gly 145 150 155 160
Lys Thr Val He He Met Glu Leu He His Asn Val Ala Tyr Lys His
165 170 175
Asn Gly Tyr Ser Val Phe Ala Gly Val Gly Glu Arg Thr Arg Glu Gly
180 185 190
Asn Asp Leu Tyr Phe Glu Met Lys Glu Gly Gly Val Leu Asp Lys Val
195 200 205
Ala Leu Cys Tyr Gly Gin Met Asn Glu Pro Pro Gly Ala Arg Asn Arg
210 215 220
He Ala Phe Thr Gly Leu Thr Met Ala Glu Tyr Phe Arg Asp Glu Lys 225 230 235 240
Gly Leu Asp Val Leu Met Phe He Asp Asn He Phe Arg Tyr Ala Gin
245 250 255
Ser Gly Ala Glu Met Ser Ala Leu Leu Gly Arg He Pro Ser Ala Val
260 265 270
Gly Tyr Gin Pro Thr Leu Ala Gly Glu Met Gly Lys Leu Gin Glu Arg
275 280 285
He Ala Ser Thr Lys Asn Gly Ser He Thr Ser Val Gin Ala Val Tyr
290 295 300
Val Pro Ala Asp Asp Leu Thr Asp Pro Ala Pro Ala Ser Val Phe Ala 305 310 315 320
His Leu Asp Ala Thr Thr Val Leu Asn Arg Lys He Ala Glu Lys Gly
325 330 335
He Tyr Pro Ala Val Asp Pro Leu Asp Ser Thr Ser Arg He Leu Ser
340 345 350
Pro Gin Met He Gly Glu Lys His Tyr Glu Val Ala Thr Gly He Gin
355 360 365
Gin Val Leu Gin Lys Tyr Lys Asp Leu Gin Asp He He Ala He Leu
370 375 380
Gly Leu Asp Glu Leu Ser Glu Glu Asp Lys Lys Thr Val Glu Arg Ala 385 390 395 400
Arg Lys He Glu Lys Phe Leu Ser Gin Pro Phe Phe Val Ala Glu Val
405 410 415
Phe Thr Gly Ser Pro Gly Lys Tyr Val Thr Leu Gin Glu Thr Leu Glu
420 425 430
Gly Phe Gly Gly He Leu Glu Gly Lys Tyr Asp His He Pro Glu Asn
435 440 445
Ala Phe Tyr Met Val Gly Ser He Gin Glu Val Leu Glu Lys Ala Lys
450 455 460
Asn Met Lys Asn Ser 465
(2) INFORMATION FOR SEQ ID NO: 995:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2716 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 28...2649 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 995:
TAAGGAACGC TCTATTTTAG GATAATA ATG ATA ATG AAA CAA GAA CCC ACC ACC 54
Met He Met Lys Gin Glu Pro Thr Thr 1 5
TAC CAA CCA GAA GAG ATA GAA AAA AAG ATT TAT GAA ATT TGC TCT CAT 102 Tyr Gin Pro Glu Glu He Glu Lys Lys He Tyr Glu He Cys Ser His 10 15 20 25
AGG GGG TAT TTT GAA ATT GAT GGC AAT GAA GCG ATC CAA GAA AAA AAC 150 Arg Gly Tyr Phe Glu He Asp Gly Asn Glu Ala He Gin Glu Lys Asn 30 35 40
AAA CGA TTT TGC TTG ATG ATG CCC CCT CCT AAT GTG ACC GGT GTG TTG 198 Lys Arg Phe Cys Leu Met Met Pro Pro Pro Asn Val Thr Gly Val Leu 45 50 55
CAC ATA GGG CAT GCC CTG ACT TTA AGC TTG CAA GAT ATT TTA GCG CGT 246 His He Gly His Ala Leu Thr Leu Ser Leu Gin Asp He Leu Ala Arg 60 65 70
TAC AAA CGC ATG GAT GGG TAT AAG ACT TTG TAT CAG CCC GGG TTG GAT 294 Tyr Lys Arg Met Asp Gly Tyr Lys Thr Leu Tyr Gin Pro Gly Leu Asp 75 80 85
CAC GCT GGC ATT GCA ACG CAA AAT GTC GTG GAA AAG CAG CTT TTA AGT 342 His Ala Gly He Ala Thr Gin Asn Val Val Glu Lys Gin Leu Leu Ser 90 95 100 105
CAA GGG ATT AAA AAA GAA GAT TTA GGG CGT GAA GAG TTC ATT AAA AAA 390 Gin Gly He Lys Lys Glu Asp Leu Gly Arg Glu Glu Phe He Lys Lys 110 115 120
GTG TGG GAA TGG AAA GAA AAG AGC GGG GGA GCG ATT TTA GAG CAA ATG 438 Val Trp Glu Trp Lys Glu Lys Ser Gly Gly Ala He Leu Glu Gin Met 125 130 135
AAG CGT TTA GGC GTG AGC GCG GCC TTT TCT AGG ACT CGT TTC ACG ATG 486 Lys Arg Leu Gly Val Ser Ala Ala Phe Ser Arg Thr Arg Phe Thr Met 140 145 150
GAT AAG GGC TTG CAA AGA GCG GTC AAA TTG GCG TTT TTG AAA TGG TAT 534 Asp Lys Gly Leu Gin Arg Ala Val Lys Leu Ala Phe Leu Lys Trp Tyr 155 160 165
GAA AAA GGT CTC ATT ATT CAA GAT AAT TAC ATG GTG AAT TGG TGC ACT 582 Glu Lys Gly Leu He He Gin Asp Asn Tyr Met Val Asn Trp Cys Thr 170 175 180 185 AAA GAT GGG GCG TTG AGC GAT ATT GAA GTG GAG TAT GAA GAG CGT AAG 630 Lys Asp Gly Ala Leu Ser Asp He Glu Val Glu Tyr Glu Glu Arg Lys 190 195 200
GGG GCG TTG TAT TAT ATT AGA TAT TAT TTA GAA AAT CAA AAA GAT TAT 678 Gly Ala Leu Tyr Tyr He Arg Tyr Tyr Leu Glu Asn Gin Lys Asp Tyr 205 210 215
TTA GTG GTG GCT ACC ACA CGC CCT GAA ACC TTG TTT GGC GAT AGC GCG 726 Leu Val Val Ala Thr Thr Arg Pro Glu Thr Leu Phe Gly Asp Ser Ala 220 225 230
CTT ATG GTC AAT CCT AAC GAT GAG AGA TAC AAG CAT TTG GTG GGG CAA 774 Leu Met Val Asn Pro Asn Asp Glu Arg Tyr Lys His Leu Val Gly Gin 235 240 245
AAA GCG ATC TTG CCT TTA ATC CAT CGC ACA ATC CCT ATT ATC GCT GAT 822 Lys Ala He Leu Pro Leu He His Arg Thr He Pro He He Ala Asp 250 255 260 265
GAA CAT GTT GAA ATG GAG TTT GGC ACA GGG TGT GTG AAA GTA ACC CCT 870 Glu His Val Glu Met Glu Phe Gly Thr Gly Cys Val Lys Val Thr Pro 270 275 280
GGG CAT GAT TTT AAC GAT TAT GAA GTG GGC AAA CGC CAC CAT TTG GAA 918 Gly His Asp Phe Asn Asp Tyr Glu Val Gly Lys Arg His His Leu Glu 285 290 295
ACG ATT AAA ATC TTT GAT GAA AAG GGG ATT TTA AAC GCG CAT TGC GGG 966 Thr He Lys He Phe Asp Glu Lys Gly He Leu Asn Ala His Cys Gly 300 305 310
GAG TTT GAA AAT TTA GAA CGA TTA GAA GCT AGA GAT AAG GTC GTA GAA 1014 Glu Phe Glu Asn Leu Glu Arg Leu Glu Ala Arg Asp Lys Val Val Glu 315 320 325
AGA TTA AAA GAA AAC GCC CTA TTG GAA AAA ATA GAA GAA CAC ACG CAT 1062 Arg Leu Lys Glu Asn Ala Leu Leu Glu Lys He Glu Glu His Thr His 330 335 340 345
CAA GTG GGG CAT TGC TAT CGT TGT CAT AAT GTG GTA GAA CCT TAT GTG 1110 Gin Val Gly His Cys Tyr Arg Cys His Asn Val Val Glu Pro Tyr Val 350 355 360
TCT AAG CAA TGG TTT GTC AAG CCT GAA ATC GCT CAA AGT TCT ATT GAA 1158 Ser Lys Gin Trp Phe Val Lys Pro Glu He Ala Gin Ser Ser He Glu 365 370 375
AAA ATC CAA CAA GGT TTG GCG CGA TTC TAC CCT TCT AAT TGG ATC AAT 1206 Lys He Gin Gin Gly Leu Ala Arg Phe Tyr Pro Ser Asn Trp He Asn 380 385 390
AAT TAC AAC GCT TGG ATG AGG GAA TTA CGC CCT TGG TGT ATC AGC AGG 1254 Asn Tyr Asn Ala Trp Met Arg Glu Leu Arg Pro Trp Cys He Ser Arg 395 400 405 CAA TTG TTT TGG GGG CAT CAA ATA CCG GTA TTC ACT TGC GAG AAT AAC 1302 Gin Leu Phe Trp Gly His Gin He Pro Val Phe Thr Cys Glu Asn Asn 410 415 420 425
CAC CAG TTC GTA AGC TTA GAC ACC CCC TTA AGT TGC CCT ACT TGT AAG 1350 His Gin Phe Val Ser Leu Asp Thr Pro Leu Ser Cys Pro Thr Cys Lys 430 435 440
AGC GAA ACA CTA GAG CAA GAT AAG GAT GTG CTA GAC ACA TGG TTT AGT 1398 Ser Glu Thr Leu Glu Gin Asp Lys Asp Val Leu Asp Thr Trp Phe Ser 445 450 455
TCA GGG CTA TGG GCG TTT TCC ACT CTA GGG TGG GGG CAA GAA AAA AGC 1446 Ser Gly Leu Trp Ala Phe Ser Thr Leu Gly Trp Gly Gin Glu Lys Ser 460 465 470
GGT TTG TTT AAT GAA AGC GAT TTG AAA GAT TTC TAC CCT AAC ACA ACG 1494 Gly Leu Phe Asn Glu Ser Asp Leu Lys Asp Phe Tyr Pro Asn Thr Thr 475 480 485
CTC ATT ACT GGG TTT GAC ATC CTC TTT TTT TGG GTG GCT AGG ATG CTT 1542 Leu He Thr Gly Phe Asp He Leu Phe Phe Trp Val Ala Arg Met Leu 490 495 500 505
TTT TGC AGC GAA TCG CTT TTA GGC GAA TTG CCC TTT AAA GAT ATT TAC 1590 Phe Cys Ser Glu Ser Leu Leu Gly Glu Leu Pro Phe Lys Asp He Tyr 510 515 520
TTG CAC GCC TTA GTG AGA GAT GAA AAG GGT GAA AAA ATG AGC AAA TCT 1638 Leu His Ala Leu Val Arg Asp Glu Lys Gly Glu Lys Met Ser Lys Ser 525 530 535
AAG GGT AAT GTG ATC GAT CCT TTA GAG ATG ATA GAA AAA TAC GGC GCG 1686 Lys Gly Asn Val He Asp Pro Leu Glu Met He Glu Lys Tyr Gly Ala 540 545 550
GAT AGC TTG CGT TTC ACT TTA GCC AAT TTG TGC GCT ACG GGT AGG GAC 1734 Asp Ser Leu Arg Phe Thr Leu Ala Asn Leu Cys Ala Thr Gly Arg Asp 555 560 565
ATT AAG CTT TCC ACT ACG CAT TTA GAA AAT AAC AAG AAT TTC GCC AAC 1782 He Lys Leu Ser Thr Thr His Leu Glu Asn Asn Lys Asn Phe Ala Asn 570 575 580 585
AAG CTT TTT AAT GCG GCG AGT TAC TTG AAG CTC AAA CAA GAA TCT TTC 1830 Lys Leu Phe Asn Ala Ala Ser Tyr Leu Lys Leu Lys Gin Glu Ser Phe 590 595 600
AAA GAT AAA GAG CGT TTG AAT GAA TAC CAA ACG CCT TTG GGG CGT TAT 1878 Lys Asp Lys Glu Arg Leu Asn Glu Tyr Gin Thr Pro Leu Gly Arg Tyr 605 610 615
GCG AAA TCG CGC TTG AAT TCA GCG ACT AAA GAG GCG CGT AAC GCT TTA 1926 Ala Lys Ser Arg Leu Asn Ser Ala Thr Lys Glu Ala Arg Asn Ala Leu 620 625 630 GAT AAT TAT CGT TTT AAT GAC GCC ACG ACT TTG TTA TAC CGC TTT TTG 1974 Asp Asn Tyr Arg Phe Asn Asp Ala Thr Thr Leu Leu Tyr Arg Phe Leu 635 640 645
TGG GGG GAA TTT TGC GAC TGG TTC ATT GAA TTT TCT AAA GTG GAA AAT 2022 Trp Gly Glu Phe Cys Asp Trp Phe He Glu Phe Ser Lys Val Glu Asn 650 655 660 665
GAA GCG ATA GAC GAA TTA GGG AGC GTG TTA AAA GAG GCT TTA AAA CTC 2070 Glu Ala He Asp Glu Leu Gly Ser Val Leu Lys Glu Ala Leu Lys Leu 670 675 680
TTG CAC CCT TTC ATG CCC TTT ATC AGC GAG TCT TTA TAC CAC AAG CTC 2118 Leu His Pro Phe Met Pro Phe He Ser Glu Ser Leu Tyr His Lys Leu 685 690 695
AGC AAT ACG GAA CTA GAA AAC ACT GAA TCT ATC ATG GTC ATG CCT TAC 2166 Ser Asn Thr Glu Leu Glu Asn Thr Glu Ser He Met Val Met Pro Tyr 700 705 710
CCT AAA GAT TTG GCG CAA GAT GAA AAA TTA GAG CAT GAA TTT GAA GTG 2214 Pro Lys Asp Leu Ala Gin Asp Glu Lys Leu Glu His Glu Phe Glu Val 715 720 725
ATT AAA GAT TGC ATT GTG TCT TTA AGG CGT TTA AAA ATC ATG CTA GAA 2262 He Lys Asp Cys He Val Ser Leu Arg Arg Leu Lys He Met Leu Glu 730 735 740 745
ACC CCA CCG ATT GTT CTA AAA GAA GCG AGC GTG GGA TTA AGA GAA GCC 2310 Thr Pro Pro He Val Leu Lys Glu Ala Ser Val Gly Leu Arg Glu Ala 750 755 760
ATA GAA AAC ACA GAG CGT TTG CAA ACT TAC GCC CAA AAA TTA GCG AGG 2358 He Glu Asn Thr Glu Arg Leu Gin Thr Tyr Ala Gin Lys Leu Ala Arg 765 770 775
TTG GAA AAA GTC AGC GTG ATT AGT TCT AAG CCT TTA AAA AGC GTG AGC 2406 Leu Glu Lys Val Ser Val He Ser Ser Lys Pro Leu Lys Ser Val Ser 780 785 790
GAT GTG GGG GAA TTT TGC CAG ACT TAT GCG AAT TTA GAA AAT CTT GAT 2454 Asp Val Gly Glu Phe Cys Gin Thr Tyr Ala Asn Leu Glu Asn Leu Asp 795 800 805
TTA AGC CCG CTT GTT GCG CGT TTG AAA AAG CAG TTG GAA AAA TTG GAA 2502 Leu Ser Pro Leu Val Ala Arg Leu Lys Lys Gin Leu Glu Lys Leu Glu 810 815 820 825
AAA GAA AAA TTA AAA CTC AAT TTG CAC AAT GAA AAT TTT GTC AAA AAC 2550 Lys Glu Lys Leu Lys Leu Asn Leu His Asn Glu Asn Phe Val Lys Asn 830 835 840
GCG CCT AAA AGC GTG CTA GAA AAA GCT AAA GAG AGT TTA AAA ACG CTT 2598 Ala Pro Lys Ser Val Leu Glu Lys Ala Lys Glu Ser Leu Lys Thr Leu 845 850 855 TTA GAA AAA GAA AGT AAA ATT AAG CAA GAA TTG GAC TTG TTA GAA CAA 2646 Leu Glu Lys Glu Ser Lys He Lys Gin Glu Leu Asp Leu Leu Glu Gin 860 865 870
CCA TAATAAAAGG ATAGAAAATG TTTCAAGCGT TAAGCGATGG GTTTAAAAAC GCGCTC 2705 Pro
AATAAAATCC G 2716
(2) INFORMATION FOR SEQ ID NO: 996:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 874 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 996 :
Met He Met Lys Gin Glu Pro Thr Thr Tyr Gin Pro Glu Glu He Glu
1 5 10 15
Lys Lys He Tyr Glu He Cys Ser His Arg Gly Tyr Phe Glu He Asp
20 25 30
Gly Asn Glu Ala He Gin Glu Lys Asn Lys Arg Phe Cys Leu Met Met
35 40 45
Pro Pro Pro Asn Val Thr Gly Val Leu His He Gly His Ala Leu Thr
50 55 60
Leu Ser Leu Gin Asp He Leu Ala Arg Tyr Lys Arg Met Asp Gly Tyr 65 70 75 80
Lys Thr Leu Tyr Gin Pro Gly Leu Asp His Ala Gly He Ala Thr Gin
85 90 95
Asn Val Val Glu Lys Gin Leu Leu Ser Gin Gly He Lys Lys Glu Asp
100 105 110
Leu Gly Arg Glu Glu Phe He Lys Lys Val Trp Glu Trp Lys Glu Lys
115 120 125
Ser Gly Gly Ala He Leu Glu Gin Met Lys Arg Leu Gly Val Ser Ala
130 135 140
Ala Phe Ser Arg Thr Arg Phe Thr Met Asp Lys Gly Leu Gin Arg Ala 145 150 155 160
Val Lys Leu Ala Phe Leu Lys Trp Tyr Glu Lys Gly Leu He He Gin
165 170 175
Asp Asn Tyr Met Val Asn Trp Cys Thr Lys Asp Gly Ala Leu Ser Asp
180 185 190
He Glu Val Glu Tyr Glu Glu Arg Lys Gly Ala Leu Tyr Tyr He Arg
195 200 205
Tyr Tyr Leu Glu Asn Gin Lys Asp Tyr Leu Val Val Ala Thr Thr Arg
210 215 220
Pro Glu Thr Leu Phe Gly Asp Ser Ala Leu Met Val Asn Pro Asn Asp 225 230 235 240
Glu Arg Tyr Lys His Leu Val Gly Gin Lys Ala He Leu Pro Leu He
245 250 255
His Arg Thr He Pro He He Ala Asp Glu His Val Glu Met Glu Phe 260 265 270
Gly Thr Gly Cys Val Lys Val Thr Pro Gly His Asp Phe Asn Asp Tyr
275 280 285
Glu Val Gly Lys Arg His His Leu Glu Thr He Lys He Phe Asp Glu
290 295 300
Lys Gly He Leu Asn Ala His Cys Gly Glu Phe Glu Asn Leu Glu Arg 305 310 315 320
Leu Glu Ala Arg Asp Lys Val Val Glu Arg Leu Lys Glu Asn Ala Leu
325 330 335
Leu Glu Lys He Glu Glu His Thr His Gin Val Gly His Cys Tyr Arg
340 345 350
Cys His Asn Val Val Glu Pro Tyr Val Ser Lys Gin Trp Phe Val Lys
355 360 365
Pro Glu He Ala Gin Ser Ser He Glu Lys He Gin Gin Gly Leu Ala
370 375 380
Arg Phe Tyr Pro Ser Asn Trp He Asn Asn Tyr Asn Ala Trp Met Arg 385 390 395 400
Glu Leu Arg Pro Trp Cys He Ser Arg Gin Leu Phe Trp Gly His Gin
405 410 415
He Pro Val Phe Thr Cys Glu Asn Asn His Gin Phe Val Ser Leu Asp
420 425 430
Thr Pro Leu Ser Cys Pro Thr Cys Lys Ser Glu Thr Leu Glu Gin Asp
435 440 445
Lys Asp Val Leu Asp Thr Trp Phe Ser Ser Gly Leu Trp Ala Phe Ser
450 455 460
Thr Leu Gly Trp Gly Gin Glu Lys Ser Gly Leu Phe Asn Glu Ser Asp 465 470 475 480
Leu Lys Asp Phe Tyr Pro Asn Thr Thr Leu He Thr Gly Phe Asp He
485 490 495
Leu Phe Phe Trp Val Ala Arg Met Leu Phe Cys Ser Glu Ser Leu Leu
500 505 510
Gly Glu Leu Pro Phe Lys Asp He Tyr Leu His Ala Leu Val Arg Asp
515 520 525
Glu Lys Gly Glu Lys Met Ser Lys Ser Lys Gly Asn Val He Asp Pro
530 535 540
Leu Glu Met He Glu Lys Tyr Gly Ala Asp Ser Leu Arg Phe Thr Leu 545 550 555 560
Ala Asn Leu Cys Ala Thr Gly Arg Asp He Lys Leu Ser Thr Thr His
565 570 575
Leu Glu Asn Asn Lys Asn Phe Ala Asn Lys Leu Phe Asn Ala Ala Ser
580 585 590
Tyr Leu Lys Leu Lys Gin Glu Ser Phe Lys Asp Lys Glu Arg Leu Asn
595 600 605
Glu Tyr Gin Thr Pro Leu Gly Arg Tyr Ala Lys Ser Arg Leu Asn Ser
610 615 620
Ala Thr Lys Glu Ala Arg Asn Ala Leu Asp Asn Tyr Arg Phe Asn Asp 625 630 635 640
Ala Thr Thr Leu Leu Tyr Arg Phe Leu Trp Gly Glu Phe Cys Asp Trp
645 650 655
Phe He Glu Phe Ser Lys Val Glu Asn Glu Ala He Asp Glu Leu Gly
660 665 670
Ser Val Leu Lys Glu Ala Leu Lys Leu Leu His Pro Phe Met Pro Phe
675 680 685
He Ser Glu Ser Leu Tyr His Lys Leu Ser Asn Thr Glu Leu Glu Asn 690 695 700 Thr Glu Ser He Met Val Met Pro Tyr Pro Lys Asp Leu Ala Gin Asp 705 710 715 720
Glu Lys Leu Glu His Glu Phe Glu Val He Lys Asp Cys He Val Ser
725 730 735
Leu Arg Arg Leu Lys He Met Leu Glu Thr Pro Pro He Val Leu Lys
740 745 750
Glu Ala Ser Val Gly Leu Arg Glu Ala He Glu Asn Thr Glu Arg Leu
755 760 765
Gin Thr Tyr Ala Gin Lys Leu Ala Arg Leu Glu Lys Val Ser Val He
770 775 780
Ser Ser Lys Pro Leu Lys Ser Val Ser Asp Val Gly Glu Phe Cys Gin 785 790 795 800
Thr Tyr Ala Asn Leu Glu Asn Leu Asp Leu Ser Pro Leu Val Ala Arg
805 810 815
Leu Lys Lys Gin Leu Glu Lys Leu Glu Lys Glu Lys Leu Lys Leu Asn
820 825 830
Leu His Asn Glu Asn Phe Val Lys Asn Ala Pro Lys Ser Val Leu Glu
835 840 845
Lys Ala Lys Glu Ser Leu Lys Thr Leu Leu Glu Lys Glu Ser Lys He
850 855 860
Lys Gin Glu Leu Asp Leu Leu Glu Gin Pro 865 870
(2) INFORMATION FOR SEQ ID NO: 997:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 509 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 32...451 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 997:
GGGCTTTTTT AAACTTCTTC TTATATCTTT A ATG CTA GAA ATA GAC AAC CAA 52
Met Leu Glu He Asp Asn Gin 1 5
ACC CCG CTA GAA TCA GAC TTT TTA TTA TTA GAA AAA ATC GCA AAT GTT 100 Thr Pro Leu Glu Ser Asp Phe Leu Leu Leu Glu Lys He Ala Asn Val 10 15 20
TTA GCC CCC ACT CAA ATC ATT GAG CTT GTT TTG GTG AGC GAT GAA ACC 148 Leu Ala Pro Thr Gin He He Glu Leu Val Leu Val Ser Asp Glu Thr 25 30 35
ATT CGA GAA ATC AAC AAG GAT TTA AGG GGT TGC GAT TAC GCT ACC GAT 196 He Arg Glu He Asn Lys Asp Leu Arg Gly Cys Asp Tyr Ala Thr Asp 40 45 50 55
GTT TTG AGC TTC CCT TTA GAA GCC ATT CCT CAC ACC CCT TTA GGG AGC 244 Val Leu Ser Phe Pro Leu Glu Ala He Pro His Thr Pro Leu Gly Ser 60 65 70
GTG GTG ATT AAT GCG CCA TTA GCT CAA ACT AAC GCT CTG AAA TTA GGA 292 Val Val He Asn Ala Pro Leu Ala Gin Thr Asn Ala Leu Lys Leu Gly 75 80 85
CAT AGC TTA GAA AAT GAG ATC GCT CTT TTA TTC ATT CAT GGG GTG TTG 340 His Ser Leu Glu Asn Glu He Ala Leu Leu Phe He His Gly Val Leu 90 95 100
CAT TTG TTG GGC TAT GAC CAT GAA AAA GAT AAG GGC GAA CAA CGC CAA 388 His Leu Leu Gly Tyr Asp His Glu Lys Asp Lys Gly Glu Gin Arg Gin 105 110 115
AAA GAG AGC GAA CTC ATT AAA GCG TTT AAC TTG CCT TTG AGT TTG ATT 436 Lys Glu Ser Glu Leu He Lys Ala Phe Asn Leu Pro Leu Ser Leu He 120 125 130 135
GAA CGC ACA CAG GAT TAGGTTTAGA TACTCTACTA ATGCTGACAA ATAAAGCTTT T 492 Glu Arg Thr Gin Asp 140
AATTTTTAAG AATGGAA 509
(2) INFORMATION FOR SEQ ID NO: 998:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 140 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 998:
Met Leu Glu He Asp Asn Gin Thr Pro Leu Glu Ser Asp Phe Leu Leu
1 5 10 15
Leu Glu Lys He Ala Asn Val Leu Ala Pro Thr Gin He He Glu Leu
20 25 30
Val Leu Val Ser Asp Glu Thr He Arg Glu He Asn Lys Asp Leu Arg
35 40 45
Gly Cys Asp Tyr Ala Thr Asp Val Leu Ser Phe Pro Leu Glu Ala He
50 55 60
Pro His Thr Pro Leu Gly Ser Val Val He Asn Ala Pro Leu Ala Gin 65 70 75 80
Thr Asn Ala Leu Lys Leu Gly His Ser Leu Glu Asn Glu He Ala Leu
85 90 95
Leu Phe He His Gly Val Leu His Leu Leu Gly Tyr Asp His Glu Lys 100 105 110 sp Lys Gly Glu Gin Arg Gin Lys Glu Ser Glu Leu He Lys Ala Phe
115 120 125 sn Leu Pro Leu Ser Leu He Glu Arg Thr Gin Asp 130 135 140
(2) INFORMATION FOR SEQ ID NO: 999:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1038 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25...996 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 999:
AAAATTAAAA TAAAAAGGAT AGAA ATG AAT CAA GAA ATT TTA GAC GTG TTG 51
Met Asn Gin Glu He Leu Asp Val Leu 1 5
ATA GTG GGT GCA GGG CCT GGG GGC ATT GCC ACG GCC GTA GAA TGC GAA 99 He Val Gly Ala Gly Pro Gly Gly He Ala Thr Ala Val Glu Cys Glu 10 15 20 25
ATA GCC GGC GTT AAA AAA GTG CTT TTA TGC GAA AAA ACC GAA AGC CAT 147 He Ala Gly Val Lys Lys Val Leu Leu Cys Glu Lys Thr Glu Ser His 30 35 40
TCA GGC ATG TTA GAG AAG TTT TAT AAA GCC GGT AAA AGG ATT GAT AAA 195 Ser Gly Met Leu Glu Lys Phe Tyr Lys Ala Gly Lys Arg He Asp Lys 45 50 55
GAT TAT AAA AAG CAA GTC GTA GAG CTT AAA GGG CAT ATC CCT TTT AAA 243 Asp Tyr Lys Lys Gin Val Val Glu Leu Lys Gly His He Pro Phe Lys 60 65 70
GAC AGC TTT AAA GAA GAA ACT TTA GAG AAT TTC ACT AAC CTT TTA AAA 291 Asp Ser Phe Lys Glu Glu Thr Leu Glu Asn Phe Thr Asn Leu Leu Lys 75 80 85
GAG CAT CAC ATC ACG CCA AGC TAT AAA ACC GAT ATT GAG AGC GTG AAA 339 Glu His His He Thr Pro Ser Tyr Lys Thr Asp He Glu Ser Val Lys 90 95 100 105
AAA GAG GGC GAA TAC TTT AAA ATC ACC ACC ACT TCT AAC ACA ACC TAT 387 Lys Glu Gly Glu Tyr Phe Lys He Thr Thr Thr Ser Asn Thr Thr Tyr 110 115 120 CAT GCT AAA TTC GTG GTG GTT GCG ATC GGG AAA ATG GGC CAG CCA AAC 435 His Ala Lys Phe Val Val Val Ala He Gly 'Lys Met Gly Gin Pro Asn 125 130 135
CGC CCT ACT GCT TAT AAA ATC CCT GTT GCG CTC TCT AAA CAA GTG GTT 483 Arg Pro Thr Ala Tyr Lys He Pro Val Ala Leu Ser Lys Gin Val Val 140 145 150
TTT AGC ATC AAT GAT TGT AAG GAA AAT GAA AAA ACC CTT GTG ATC GGC 531 Phe Ser He Asn Asp Cys Lys Glu Asn Glu Lys Thr Leu Val He Gly 155 160 165
GGA GGC AAC TCA GCG GTG GAA TAC GCC ATT GCT TTG TGC AAA ACC ACC 579 Gly Gly Asn Ser Ala Val Glu Tyr Ala He Ala Leu Cys Lys Thr Thr 170 175 180 185
CCT ACC ACC CTC AAT TAC CGC AAA AAA GAA TTC AGC CGC ATC AAT GAA 627 Pro Thr Thr Leu Asn Tyr Arg Lys Lys Glu Phe Ser Arg He Asn Glu 190 195 200
GAC AAC GCT AAA AAC TTG CAA GAA GTC CTA AAC AAT AAC ACG CTT AAA 675 Asp Asn Ala Lys Asn Leu Gin Glu Val Leu Asn Asn Asn Thr Leu Lys 205 210 215
AGC AAG CTT GGA GTG GAT ATT GAA AGC CTA GAA GAA GAT AAC ACT CAG 723 Ser Lys Leu Gly Val Asp He Glu Ser Leu Glu Glu Asp Asn Thr Gin 220 225 230
ATT AAG GTT AAC TTC ACC GAT AAC ACG AGC GAA AGT TTT GAT CGT TTG 771 He Lys Val Asn Phe Thr Asp Asn Thr Ser Glu Ser Phe Asp Arg Leu 235 240 245
CTG TAT GCG ATC GGC GGC TCT ACC CCT TTA GAG TTT TTT AAA CGC TGT 819 Leu Tyr Ala He Gly Gly Ser Thr Pro Leu Glu Phe Phe Lys Arg Cys 250 255 260 265
TCT TTA GAG CTG GAT CCT AGC ACC AAT ATC CCT GTG GTG AAA GAA AAT 867 Ser Leu Glu Leu Asp Pro Ser Thr Asn He Pro Val Val Lys Glu Asn 270 275 280
TTA GAG AGC AAC AAT ATC CCT AAT TTG TTC ATC GTG GGC GAT ATT TTA 915 Leu Glu Ser Asn Asn He Pro Asn Leu Phe He Val Gly Asp He Leu 285 290 295
TTC AAA TCA GGG GCG AGC ATC GCT ACC GCT TTA AAC CAT GGC TAT GAT 963 Phe Lys Ser Gly Ala Ser He Ala Thr Ala Leu Asn His Gly Tyr Asp 300 305 310
GTT GCT ATA GAA ATC GCT AAA AGG TTG CAC TCT TAAAGCCGCT CACTCATCAA 1016 Val Ala He Glu He Ala Lys Arg Leu His Ser 315 320
ACGGCTTAGC CTTATACAAA AA 1038
(2) INFORMATION FOR SEQ ID NO: 1000: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 324 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1000:
Met Asn Gin Glu He Leu Asp Val Leu He Val Gly Ala Gly Pro Gly
1 5 10 15
Gly He Ala Thr Ala Val Glu Cys Glu He Ala Gly Val Lys Lys Val
20 25 30
Leu Leu Cys Glu Lys Thr Glu Ser His Ser Gly Met Leu Glu Lys Phe
35 40 45
Tyr Lys Ala Gly Lys Arg He Asp Lys Asp Tyr Lys Lys Gin Val Val
50 55 60
Glu Leu Lys Gly His He Pro Phe Lys Asp Ser Phe Lys Glu Glu Thr 65 70 75 80
Leu Glu Asn Phe Thr Asn Leu Leu Lys Glu His His He Thr Pro Ser
85 90 95
Tyr Lys Thr Asp He Glu Ser Val Lys Lys Glu Gly Glu Tyr Phe Lys
100 105 110
He Thr Thr Thr Ser Asn Thr Thr Tyr His Ala Lys Phe Val Val Val
115 120 125
Ala He Gly Lys Met Gly Gin Pro Asn Arg Pro Thr Ala Tyr Lys He
130 135 140
Pro Val Ala Leu Ser Lys Gin Val Val Phe Ser He Asn Asp Cys Lys 145 150 155 160
Glu Asn Glu Lys Thr Leu Val He Gly Gly Gly Asn Ser Ala Val Glu
165 170 175
Tyr Ala He Ala Leu Cys Lys Thr Thr Pro Thr Thr Leu Asn Tyr Arg
180 185 190
Lys Lys Glu Phe Ser Arg He Asn Glu Asp Asn Ala Lys Asn Leu Gin
195 200 205
Glu Val Leu Asn Asn Asn Thr Leu Lys Ser Lys Leu Gly Val Asp He
210 215 220
Glu Ser Leu Glu Glu Asp Asn Thr Gin He Lys Val Asn Phe Thr Asp 225 230 235 240
Asn Thr Ser Glu Ser Phe Asp Arg Leu Leu Tyr Ala He Gly Gly Ser
245 250 255
Thr Pro Leu Glu Phe Phe Lys Arg Cys Ser Leu Glu Leu Asp Pro Ser
260 265 270
Thr Asn He Pro Val Val Lys Glu Asn Leu Glu Ser Asn Asn He Pro
275 280 285
Asn Leu Phe He Val Gly Asp He Leu Phe Lys Ser Gly Ala Ser He
290 295 300
Ala Thr Ala Leu Asn His Gly Tyr Asp Val Ala He Glu He Ala Lys 305 310 315 320
Arg Leu His Ser
(2) INFORMATION FOR SEQ ID NO: 1001: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 704 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 20...670 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1001:
ATTTTTAAGG TATAGCGTT ATG GCA TTA GAT TGG GAT TTT ATG TTT CAC TCC 52
Met Ala Leu Asp Trp Asp Phe Met Phe His Ser 1 5 10
ATC CCT GCG TTT TTT AAG GGG TTA GAA CTC ACG CTT TAT ATT TCT TTC 100 He Pro Ala Phe Phe Lys Gly Leu Glu Leu Thr Leu Tyr He Ser Phe 15 20 25
TTT GGG ATT TTG CTC TCT CTT TTG GTG GGG TTT TTG TGC GCG ATC GTT 148 Phe Gly He Leu Leu Ser Leu Leu Val Gly Phe Leu Cys Ala He Val 30 35 40
TTG TAT TTT AAA ACG CGC TTT CTC TCT CCT GTT GTC TAT ATC TAT GGC 196 Leu Tyr Phe Lys Thr Arg Phe Leu Ser Pro Val Val Tyr He Tyr Gly 45 50 55
GAA ATC GCT AGG AAC ACG CCC CTG CTC ATC CAG CTT TTC TTT TTG TAT 244 Glu He Ala Arg Asn Thr Pro Leu Leu He Gin Leu Phe Phe Leu Tyr 60 65 70 75
TAC GGG TTG AAT GAA ATC GGT TTG AGC GCT TTA GAG TGC GCG ATT TTA 292 Tyr Gly Leu Asn Glu He Gly Leu Ser Ala Leu Glu Cys Ala He Leu 80 85 90
GCG TTA GGG TTT TTG GGT GGG GGG TAT ATG AGT CAA AGT TTT TTG CTT 340 Ala Leu Gly Phe Leu Gly Gly Gly Tyr Met Ser Gin Ser Phe Leu Leu 95 100 105
GGG TTT AAG AGC CTA GCT TCC ATT CAA AGA GAA AGC GCT TTG AGT TTG 388 Gly Phe Lys Ser Leu Ala Ser He Gin Arg Glu Ser Ala Leu Ser Leu 110 115 120
GGG TTT AGC CCT TTG AAA ATG ATG TAT TAT ATT ATT CTG CCT CAA AGT 436 Gly Phe Ser Pro Leu Lys Met Met Tyr Tyr He He Leu Pro Gin Ser 125 130 135
TTA AGC GTT TCT ATG CCT TCC ATA GGG GCG AAT GTG ATT TTT TTA CTC 484 Leu Ser Val Ser Met Pro Ser He Gly Ala Asn Val He Phe Leu Leu 140 145 150 155
AAA GAA ACT TCG GTG GTG GGC GCG ATA GCC CTA ACC GAT ATT ATG TTT 532 Lys Glu Thr Ser Val Val Gly Ala He Ala Leu Thr Asp He Met Phe 160 165 170
GTG GCG AAA GAT TTT ATT GGC ATT TAT TAT AAA ACG ACT GAA AGC CTT 580 Val Ala Lys Asp Phe He Gly He Tyr Tyr Lys Thr Thr Glu Ser Leu 175 180 185
TTG ATG TTA AGC CTC ACT TAT TTG ATC GCT TTA CTC CCT TTA AGC GTT 628 Leu Met Leu Ser Leu Thr Tyr Leu He Ala Leu Leu Pro Leu Ser Val 190 195 200
TTG TTT GTG ATC TTA GAG CGT TTC TTT AAA AAG AAA GTG GCT TAAAATGGG 679 Leu Phe Val He Leu Glu Arg Phe Phe Lys Lys Lys Val Ala 205 210 215
AGTTTTACTA GAATTAGACA ACCTT 704
(2) INFORMATION FOR SEQ ID NO: 1002:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 217 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1002:
Met Ala Leu Asp Trp Asp Phe Met Phe His Ser He Pro Ala Phe Phe
1 5 10 15
Lys Gly Leu Glu Leu Thr Leu Tyr He Ser Phe Phe Gly He Leu Leu
20 25 30
Ser Leu Leu Val Gly Phe Leu Cys Ala He Val Leu Tyr Phe Lys Thr
35 40 45
Arg Phe Leu Ser Pro Val Val Tyr He Tyr Gly Glu He Ala Arg Asn
50 55 60
Thr Pro Leu Leu He Gin Leu Phe Phe Leu Tyr Tyr Gly Leu Asn Glu 65 70 75 80
He Gly Leu Ser Ala Leu Glu Cys Ala He Leu Ala Leu Gly Phe Leu
85 90 95
Gly Gly Gly Tyr Met Ser Gin Ser Phe Leu Leu Gly Phe Lys Ser Leu
100 105 110
Ala Ser He Gin Arg Glu Ser Ala Leu Ser Leu Gly Phe Ser Pro Leu
115 120 125
Lys Met Met Tyr Tyr He He Leu Pro Gin Ser Leu Ser Val Ser Met
130 135 140
Pro Ser He Gly Ala Asn Val He Phe Leu Leu Lys Glu Thr Ser Val 145 150 155 160
Val Gly Ala He Ala Leu Thr Asp He Met Phe Val Ala Lys Asp Phe
165 170 175
He Gly He Tyr Tyr Lys Thr Thr Glu Ser Leu Leu Met Leu Ser Leu 180 185 190
Thr Tyr Leu He Ala Leu Leu Pro Leu Ser Val Leu Phe Val He Leu
195 200 205
Glu Arg Phe Phe Lys Lys Lys Val Ala 210 215
(2) INFORMATION FOR SEQ ID NO:1003:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 737 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...699 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1003:
AGCGTTTCTT TAAAAAGAAA GTGGCTTAAA ATG GGA GTT TTA CTA GAA TTA GAC 54
Met Gly Val Leu Leu Glu Leu Asp 1 5
AAC CTT AAG CGT TTG TTA GAA GGG TTT GAA ACC ACT CTT TTG ATC GCT 102 Asn Leu Lys Arg Leu Leu Glu Gly Phe Glu Thr Thr Leu Leu He Ala 10 15 20
CTT AGC TCT GCA ATG ATT TCA ATC ATT GTT GGA ATG CTT TTG GGG AGC 150 Leu Ser Ser Ala Met He Ser He He Val Gly Met Leu Leu Gly Ser 25 30 35 40
TTG ATG GCG TTT GGT TCT CAA ATA GTG GTT TTG GCG TGT CGT GTG TAT 198 Leu Met Ala Phe Gly Ser Gin He Val Val Leu Ala Cys Arg Val Tyr 45 50 55
TTA GAA AGC ATT CGC ATC ATC CCG CTT TTA GCA TGG CTT TTT ATT GTG 246 Leu Glu Ser He Arg He He Pro Leu Leu Ala Trp Leu Phe He Val 60 65 70
TAT TTC GGG TTA GCG AGC TGG TTT GAT TTG CAT ATT AGC GCG GTT TTG 294 Tyr Phe Gly Leu Ala Ser Trp Phe Asp Leu His He Ser Ala Val Leu 75 80 85
GCA AGC GTT ATT GTT TTT AGC TTG TGG GGT GGC GCT GAA ATG ATG GAT 342 Ala Ser Val He Val Phe Ser Leu Trp Gly Gly Ala Glu Met Met Asp 90 95 100
TTA ACT AGG GGG GTT TTA ACT TCC GTG AGC AAA CAC CAA ATA GAA AGC 390 Leu Thr Arg Gly Val Leu Thr Ser Val Ser Lys His Gin He Glu Ser 105 110 115 120
GCT CTG GCT TTA GGC TTA GAT TCA AAA AAG GTG ATT TTT AAT ATT ATT 438 Ala Leu Ala Leu Gly Leu Asp Ser Lys Lys Val He Phe Asn He He 125 130 135
TTC CCT CAA AGC TTT TTG TCT TTA TTG CCC TCA AGC CTT AAT TTG TTC 486 Phe Pro Gin Ser Phe Leu Ser Leu Leu Pro Ser Ser Leu Asn Leu Phe 140 145 150
ACG CGC ATG ATC AAA ACC ACG GCT TTA GTT TCT CTC ATT GGA GCG ATT 534 Thr Arg Met He Lys Thr Thr Ala Leu Val Ser Leu He Gly Ala He 155 160 165
GAT TTG CTA AAA GTG GGC CAG CAA ATC ATA GAG CTT AAC CTC TTA CGC 582 Asp Leu Leu Lys Val Gly Gin Gin He He Glu Leu Asn Leu Leu Arg 170 175 180
ATG CCT AAT GCG AGC TTT GTG GTT TAT GGC GTT ATC TTA ATG TTT TAT 630 Met Pro Asn Ala Ser Phe Val Val Tyr Gly Val He Leu Met Phe Tyr 185 190 195 200
TTT AGT TTA TGC TAT AGT TTG AGC CTG TAT AGT TCC TAT TTA GAA AAA 678 Phe Ser Leu Cys Tyr Ser Leu Ser Leu Tyr Ser Ser Tyr Leu Glu Lys 205 210 215
AAA TTC CAA CAC ATT AGA GGG TAAAATGAGC GTGATTTTAG AAACCAAAGG GTTA 733 Lys Phe Gin His He Arg Gly 220
AAAA 737
(2) INFORMATION FOR SEQ ID NO: 1004:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 223 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1004:
Met Gly Val Leu Leu Glu Leu Asp Asn Leu Lys Arg Leu Leu Glu Gly
1 5 10 15
Phe Glu Thr Thr Leu Leu He Ala Leu Ser Ser Ala Met He Ser He
20 25 30
He Val Gly Met Leu Leu Gly Ser Leu Met Ala Phe Gly Ser Gin He
35 40 45
Val Val Leu Ala Cys Arg Val Tyr Leu Glu Ser He Arg He He Pro
50 55 60
Leu Leu Ala Trp Leu Phe He Val Tyr Phe Gly Leu Ala Ser Trp Phe 65 70 75 80
Asp Leu His He Ser Ala Val Leu Ala Ser Val He Val Phe Ser Leu 85 90 95
Trp Gly Gly Ala Glu Met Met Asp Leu Thr Arg Gly Val Leu Thr Ser
100 105 110
Val Ser Lys His Gin He Glu Ser Ala Leu Ala Leu Gly Leu Asp Ser
115 120 125
Lys Lys Val He Phe Asn He He Phe Pro Gin Ser Phe Leu Ser Leu
130 135 140
Leu Pro Ser Ser Leu Asn Leu Phe Thr Arg Met He Lys Thr Thr Ala 145 150 155 160
Leu Val Ser Leu He Gly Ala He Asp Leu Leu Lys Val Gly Gin Gin
165 170 175
He He Glu Leu Asn Leu Leu Arg Met Pro Asn Ala Ser Phe Val Val
180 185 190
Tyr Gly Val He Leu Met Phe Tyr Phe Ser Leu Cys Tyr Ser Leu Ser
195 200 205
Leu Tyr Ser Ser Tyr Leu Glu Lys Lys Phe Gin His He Arg Gly 210 215 220
(2) INFORMATION FOR SEQ ID NO: 1005:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 807 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...774 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1005:
AAAAAAAATT CCAACACATT AGAGGGTAAA ATG AGC GTG ATT TTA GAA ACC AAA 54
Met Ser Val He Leu Glu Thr Lys 1 5
GGG TTA AAA AAA ACC TAT CAA AAC CAT TTG GTT TTA GAC GGC ATC AAT 102 Gly Leu Lys Lys Thr Tyr Gin Asn His Leu Val Leu Asp Gly He Asn 10 15 20
TTC ACT TTA AAT AAG GGT GAA GTG GCA GTG ATT TTA GGG CCT AGC GGG 150 Phe Thr Leu Asn Lys Gly Glu Val Ala Val He Leu Gly Pro Ser Gly 25 30 35 40
TGC GGG AAA AGC ACT TTT TTA AAA TGC CTA AAC GGG CTT GAA AAG ATT 198 Cys Gly Lys Ser Thr Phe Leu Lys Cys Leu Asn Gly Leu Glu Lys He 45 50 55
AAT GAA GGT GAA ATC CTT TTT GAA AAC ACT AAC CTT AAC AAT AAG GCC 246 Asn Glu Gly Glu He Leu Phe Glu Asn Thr Asn Leu Asn Asn Lys Ala 60 65 70
ACT AAC TGG AAT CAA ATG CGC CAA AAA ATA GGC ATG GTG TTT CAA AAT 294 Thr Asn Trp Asn Gin Met Arg Gin Lys He Gly Met Val Phe Gin Asn 75 80 85
TAT GAA TTG TTC CCG CAT TTA AAT GTG TTA GAT AAT ATC TTA CTC GCT 342 Tyr Glu Leu Phe Pro His Leu Asn Val Leu Asp Asn He Leu Leu Ala 90 95 100
CCT ATG AAA GTG CAA AAA CGA TCC AAA GAT GAG GTT ATT TCT CAA GCC 390 Pro Met Lys Val Gin Lys Arg Ser Lys Asp Glu Val He Ser Gin Ala 105 110 115 120
ATA GAG CTT TTA AAG CGA GTG GGT TTG GAG CAT AAA CAA CAA GCT TAC 438 He Glu Leu Leu Lys Arg Val Gly Leu Glu His Lys Gin Gin Ala Tyr 125 130 135
CCT AAA GAA TTG AGC GGC GGA CAA AAA CAA CGA GTA GCG ATC GTG CGC 486 Pro Lys Glu Leu Ser Gly Gly Gin Lys Gin Arg Val Ala He Val Arg 140 145 150
TCT TTA TGC ATG CGA CCA AAA ATC ATG CTT TTT GAT GAA GTA ACC GCC 534 Ser Leu Cys Met Arg Pro Lys He Met Leu Phe Asp Glu Val Thr Ala 155 160 165
TCT TTA GAC CCT GAA ATG GTT AAA GAA GTT TTA GAA GTG ATT TTA GAA 582 Ser Leu Asp Pro Glu Met Val Lys Glu Val Leu Glu Val He Leu Glu 170 175 180
TTA GCC ACA ACA GGC ATG AGC ATG GTG ATT GTA ACG CAT GAA ATG AAA 630 Leu Ala Thr Thr Gly Met Ser Met Val He Val Thr His Glu Met Lys 185 190 195 200
TTC GCG CAA AAA ATC GCT CAT AAA ATC GTG TTT TTT GAT AGC GGT AAA 678 Phe Ala Gin Lys He Ala His Lys He Val Phe Phe Asp Ser Gly Lys 205 210 215
ATC GCT GAA GAA AAC AAC GCT AAA GAA TTT TTT AAC CAC CCG AAA TCT 726 He Ala Glu Glu Asn Asn Ala Lys Glu Phe Phe Asn His Pro Lys Ser 220 225 230
CAA AGA GCG CAA AAA TTT TTA GAA ACT TTC CAT TTT TTA GGG AGC TGT T 775 Gin Arg Ala Gin Lys Phe Leu Glu Thr Phe His Phe Leu Gly Ser Cys 235 240 245
AAATAAAGTT TGCTAAAAAG ATGATTCTAA TT 807
(2) INFORMATION FOR SEQ ID NO: 1006:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 248 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1006:
Met Ser Val He Leu Glu Thr Lys Gly Leu Lys Lys Thr Tyr Gin Asn
1 5 10 15
His Leu Val Leu Asp Gly He Asn Phe Thr Leu Asn Lys Gly Glu Val
20 25 30
Ala Val He Leu Gly Pro Ser Gly Cys Gly Lys Ser Thr Phe Leu Lys
35 40 45
Cys Leu Asn Gly Leu Glu Lys He Asn Glu Gly Glu He Leu Phe Glu
50 55 60
Asn Thr Asn Leu Asn Asn Lys Ala Thr Asn Trp Asn Gin Met Arg Gin 65 70 75 80
Lys He Gly Met Val Phe Gin Asn Tyr Glu Leu Phe Pro His Leu Asn
85 90 95
Val Leu Asp Asn He Leu Leu Ala Pro Met Lys Val Gin Lys Arg Ser
100 105 110
Lys Asp Glu Val He Ser Gin Ala He Glu Leu Leu Lys Arg Val Gly
115 120 125
Leu Glu His Lys Gin Gin Ala Tyr Pro Lys Glu Leu Ser Gly Gly Gin
130 135 140
Lys Gin Arg Val Ala He Val Arg Ser Leu Cys Met Arg Pro Lys He 145 150 155 160
Met Leu Phe Asp Glu Val Thr Ala Ser Leu Asp Pro Glu Met Val Lys
165 170 175
Glu Val Leu Glu Val He Leu Glu Leu Ala Thr Thr Gly Met Ser Met
180 185 190
Val He Val Thr His Glu Met Lys Phe Ala Gin Lys He Ala His Lys
195 200 205
He Val Phe Phe Asp Ser Gly Lys He Ala Glu Glu Asn Asn Ala Lys
210 215 220
Glu Phe Phe Asn His Pro Lys Ser Gin Arg Ala Gin Lys Phe Leu Glu 225 230 235 240
Thr Phe His Phe Leu Gly Ser Cys 245
(2) INFORMATION FOR SEQ ID NO: 1007:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 589 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 13...561 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1007: AAGGATTTGT TG ATG AGT TAT TTT TAT AAG CAC TGT TTG AAA TTT TCG TTG 51 Met Ser Tyr Phe Tyr Lys His Cys Leu Lys Phe Ser Leu 1 5 10
GTT GGG TTG CTA GGG CTT TTG AGC GTT CAG CTT GAC GCT AGG AGT TTT 99 Val Gly Leu Leu Gly Leu Leu Ser Val Gin Leu Asp Ala Arg Ser Phe 15 20 25
GTT GAT GGG GAT TTA GAC ATT CAG AAA TTC AGC TAT GAA GAT TCT CTA 147 Val Asp Gly Asp Leu Asp He Gin Lys Phe Ser Tyr Glu Asp Ser Leu 30 35 40 45
CTT AAA AAG GGA GAC CCT AAT GGC GTG CAT AAA GTG CAG GTG CGA GAT 195 Leu Lys Lys Gly Asp Pro Asn Gly Val His Lys Val Gin Val Arg Asp 50 55 60
TAT AAA GGC AAA ATG CAA GAA GCT GAG ATC CAC TCA GAA ATA CGC ATT 243 Tyr Lys Gly Lys Met Gin Glu Ala Glu He His Ser Glu He Arg He 65 70 75
GCG CTT AAA CCG GGG GTT AAA AAA GAA GTT AAA AAA GGC AAG ATT TAT 291 Ala Leu Lys Pro Gly Val Lys Lys Glu Val Lys Lys Gly Lys He Tyr 80 85 90
AGC GCT CAA ATC AAT GAT GGC ATG TGC TAT GCT TTT AGA ATG CTC CAA 339 Ser Ala Gin He Asn Asp Gly Met Cys Tyr Ala Phe Arg Met Leu Gin 95 100 105
ACC GGC GAT AAT ACC ACA GGC CTT GAT TCT AAA GAG TTC CCC AAG CAA 387 Thr Gly Asp Asn Thr Thr Gly Leu Asp Ser Lys Glu Phe Pro Lys Gin 110 115 120 125
AGT CGT GAG AAA AAG GGC CGA GTG ATC ACT TTA ATC GGT AAA GGT GAA 435 Ser Arg Glu Lys Lys Gly Arg Val He Thr Leu He Gly Lys Gly Glu 130 135 140
GTG CCT TAT CTT ATT TTA GAA ACC GAT TGC CAA GTG GGT GAT ATT GCA 483 Val Pro Tyr Leu He Leu Glu Thr Asp Cys Gin Val Gly Asp He Ala 145 150 155
AAG ATC TCT TTG GTG GGT AAT TTT GAT GGC ACT GGG TTT CTT ACG GAA 531 Lys He Ser Leu Val Gly Asn Phe Asp Gly Thr Gly Phe Leu Thr Glu 160 165 170
TAT AAA TTC AAA GAC GCT AAA CCC ATT TAC TAGTCTTTAT TCTTCGCTTC ATT 584 Tyr Lys Phe Lys Asp Ala Lys Pro He Tyr 175 180
CTTAA 589
(2) INFORMATION FOR SEQ ID NO: 1008:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 183 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1008:
Met Ser Tyr Phe Tyr Lys His Cys Leu Lys Phe Ser Leu Val Gly Leu
1 5 10 15
Leu Gly Leu Leu Ser Val Gin Leu Asp Ala Arg Ser Phe Val Asp Gly
20 25 30
Asp Leu Asp He Gin Lys Phe Ser Tyr Glu Asp Ser Leu Leu Lys Lys
35 40 45
Gly Asp Pro Asn Gly Val His Lys Val Gin Val Arg Asp Tyr Lys Gly
50 55 60
Lys Met Gin Glu Ala Glu He His Ser Glu He Arg He Ala Leu Lys 65 70 75 80
Pro Gly Val Lys Lys Glu Val Lys Lys Gly Lys He Tyr Ser Ala Gin
85 90 95
He Asn Asp Gly Met Cys Tyr Ala Phe Arg Met Leu Gin Thr Gly Asp
100 105 110
Asn Thr Thr Gly Leu Asp Ser Lys Glu Phe Pro Lys Gin Ser Arg Glu
115 120 125
Lys Lys Gly Arg Val He Thr Leu He Gly Lys Gly Glu Val Pro Tyr
130 135 140
Leu He Leu Glu Thr Asp Cys Gin Val Gly Asp He Ala Lys He Ser 145 150 155 160
Leu Val Gly Asn Phe Asp Gly Thr Gly Phe Leu Thr Glu Tyr Lys Phe
165 170 175
Lys Asp Ala Lys Pro He Tyr 180
(2) INFORMATION FOR SEQ ID NO: 1009:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 925 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 30...875 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1009:
GGAATTTTTG GGTGCTACTC CTTTCTCGT ATG GGT ATC GCT TTT GCC CAC TCT 53
Met Gly He Ala Phe Ala His Ser 1 5 ATT TTT TGG TCC ATC ACG GCT TCT TTA GTC ATT CGT GTC GCG CCA AGA 101 He Phe Trp Ser He Thr Ala Ser Leu Val He Arg Val Ala Pro Arg 10 15 20
AAT AAA AAA CAA CAG GCC TTA GGG CTG TTA GCG TTA GGG AGT TCG TTA 149 Asn Lys Lys Gin Gin Ala Leu Gly Leu Leu Ala Leu Gly Ser Ser Leu 25 30 35 40
GCG ATG ATT TTA GGG TTG CCG CTT GGG AGG ATC ATT GGG CAA ATT CTA 197 Ala Met He Leu Gly Leu Pro Leu Gly Arg He He Gly Gin He Leu 45 50 55
GAT TGG CGT TCC ACT TTT GGC GTG ATC GGG GGC GTT GCG ACC CTT ATA 245 Asp Trp Arg Ser Thr Phe Gly Val He Gly Gly Val Ala Thr Leu He 60 65 70
GCG TTG CTT ATG TGG AAA TTG CTC CCG CAT CTA CCC AGT AGA AAC GCA 293 Ala Leu Leu Met Trp Lys Leu Leu Pro His Leu Pro Ser Arg Asn Ala 75 80 85
GGC ACG CTC GCA AGT GTC CCT GTA TTA ATG AAA CGG CCG CTT TTA ATG 341 Gly Thr Leu Ala Ser Val Pro Val Leu Met Lys Arg Pro Leu Leu Met 90 95 100
GGG ATT TAT TTG CTT GTG ATC ATG GTC ATC TCT GGG CAT TTC ACC ACT 389 Gly He Tyr Leu Leu Val He Met Val He Ser Gly His Phe Thr Thr 105 110 115 120
TAT AGT TAT ATT GAG CCT TTT ATC ATT CAA ATC AGC CAA TTT TCT CCT 437 Tyr Ser Tyr He Glu Pro Phe He He Gin He Ser Gin Phe Ser Pro 125 130 135
GAC ATT ACA ACG CTA ATG TTG TTT GTG TTT GGG TTA GCG GGC GTG GTG 485 Asp He Thr Thr Leu Met Leu Phe Val Phe Gly Leu Ala Gly Val Val 140 145 150
GGG AGT TTT TTG TTC GGC CGT TTG TAT GCA AAA AAT TCA AGA AAA TTT 533 Gly Ser Phe Leu Phe Gly Arg Leu Tyr Ala Lys Asn Ser Arg Lys Phe 155 160 165
ATC GCT TTT GCG ATG GTT TTA GTC ATT TGC CCG CAA CTC TTG CTT TTT 581 He Ala Phe Ala Met Val Leu Val He Cys Pro Gin Leu Leu Leu Phe 170 175 180
GTG TTT AAA AAC TTA GAG TGG GTG GTT TTC TTG CAA ATT TTC TTA TGG 629 Val Phe Lys Asn Leu Glu Trp Val Val Phe Leu Gin He Phe Leu Trp 185 190 195 200
GGG ATT GGG ATC ACT TCG CTT GGG ATT TCC TTG CAA ATG AGG GTG TTG 677 Gly He Gly He Thr Ser Leu Gly He Ser Leu Gin Met Arg Val Leu 205 210 215
CAG CTT GCG CCG GAT GCC ACG GAT GTT GCG AGT GCG ATT TAC TCA GGG 725 Gin Leu Ala Pro Asp Ala Thr Asp Val Ala Ser Ala He Tyr Ser Gly 220 225 230 AGC TAT AAT GTG GGG ATT GGA TCA GGA GCG CTG TTT GGC AGT ATT GTG 773 Ser Tyr Asn Val Gly He Gly Ser Gly Ala Leu Phe Gly Ser He Val 235 240 245
ATC CAC CAA CTA GGG CTA GGA TAT ATT GGC TTT GTG GGT GGG GCT TTA 821 He His Gin Leu Gly Leu Gly Tyr He Gly Phe Val Gly Gly Ala Leu 250 255 260
GGT TTG TTG GCG CTC TTT TGG CTT AGA TTC ATT ACG ATA AAG TTT AAA 869 Gly Leu Leu Ala Leu Phe Trp Leu Arg Phe He Thr He Lys Phe Lys 265 270 275 280
AAA ACA TAAAGAGCGT TAAAAGGATT AGCCCAATAA AGGAGAATCC CTTTCGCACT 925 Lys Thr
(2) INFORMATION FOR SEQ ID NO: 1010:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 282 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1010:
Met Gly He Ala Phe Ala His Ser He Phe Trp Ser He Thr Ala Ser
1 5 10 15
Leu Val He Arg Val Ala Pro Arg Asn Lys Lys Gin Gin Ala Leu Gly
20 25 30
Leu Leu Ala Leu Gly Ser Ser Leu Ala Met He Leu Gly Leu Pro Leu
35 40 45
Gly Arg He He Gly Gin He Leu Asp Trp Arg Ser Thr Phe Gly Val
50 55 60
He Gly Gly Val Ala Thr Leu He Ala Leu Leu Met Trp Lys Leu Leu 65 70 75 80
Pro His Leu Pro Ser Arg Asn Ala Gly Thr Leu Ala Ser Val Pro Val
85 90 95
Leu Met Lys Arg Pro Leu Leu Met Gly He Tyr Leu Leu Val He Met
100 105 110
Val He Ser Gly His Phe Thr Thr Tyr Ser Tyr He Glu Pro Phe He
115 120 125
He Gin He Ser Gin Phe Ser Pro Asp He Thr Thr Leu Met Leu Phe
130 135 140
Val Phe Gly Leu Ala Gly Val Val Gly Ser Phe Leu Phe Gly Arg Leu 145 150 155 160
Tyr Ala Lys Asn Ser Arg Lys Phe He Ala Phe Ala Met Val Leu Val
165 170 175
He Cys Pro Gin Leu Leu Leu Phe Val Phe Lys Asn Leu Glu Trp Val
180 185 190
Val Phe Leu Gin He Phe Leu Trp Gly He Gly He Thr Ser Leu Gly 195 200 205 He Ser Leu Gin Met Arg Val Leu Gin Leu Ala Pro Asp Ala Thr Asp
210 215 220
Val Ala Ser Ala He Tyr Ser Gly Ser Tyr Asn Val Gly He Gly Ser 225 230 235 240
Gly Ala Leu Phe Gly Ser He Val He His Gin Leu Gly Leu Gly Tyr
245 250 255
He Gly Phe Val Gly Gly Ala Leu Gly Leu Leu Ala Leu Phe Trp Leu
260 265 270
Arg Phe He Thr He Lys Phe Lys Lys Thr 275 280
(2) INFORMATION FOR SEQ ID NO: 1011:
(i) SEQUENCE CHARACTERTSTTCS :
(A) LENGTH: 1097 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 28...1065 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1011:
AAGCCCTTGA AATCATTGGA GAAAACG ATG AAG ACT TAT AAT GTC GCT ATT GTT 54
Met Lys Thr Tyr Asn Val Ala He Val 1 5
GGG GCC AGT GGG GCG GTA GGC CAA GAG CTG ATT AAG GGT TTA GAA AAT 102 Gly Ala Ser Gly Ala Val Gly Gin Glu Leu He Lys Gly Leu Glu Asn 10 15 20 25
TCT TTT TTC CCA ATT AAA AAA TTT GTC CCG CTC GCT AGC ACT AGG AGT 150 Ser Phe Phe Pro He Lys Lys Phe Val Pro Leu Ala Ser Thr Arg Ser 30 35 40
GCT GGT AAA AAG ATC AAA GCT TTC AAT AAA GAC TAT GAA ATT TTA GAA 198 Ala Gly Lys Lys He Lys Ala Phe Asn Lys Asp Tyr Glu He Leu Glu 45 50 55
ACC ACG CAT GAA GTT TTT GAA AGA GAA AAA ATA GAC ATC GCC TTT TTT 246 Thr Thr His Glu Val Phe Glu Arg Glu Lys He Asp He Ala Phe Phe 60 65 70
AGC GCT GGG GGG AGC GTG AGC GMA GAA TTT GCT ACA AGC GCT TCA AAA 294 Ser Ala Gly Gly Ser Val Ser Xaa Glu Phe Ala Thr Ser Ala Ser Lys 75 80 85
ACG GCC TTA GTG GTT GAT AAC ACG AGC TTT TTT AGA TTG AAT AAA GAT 342 Thr Ala Leu Val Val Asp Asn Thr Ser Phe Phe Arg Leu Asn Lys Asp 90 95 100 105
GTG CCT TTA GTC GTT CCT GAA ATC AAC GCT AAA GAA ATT TTT AAC GCT 390 Val Pro Leu Val Val Pro Glu He Asn Ala Lys Glu He Phe Asn Ala 110 115 120
CCC TTG AAT ATC ATC GCT AAC CCT AAT TGC TCC ACC ATT CAA ATG ACG 438 Pro Leu Asn He He Ala Asn Pro Asn Cys Ser Thr He Gin Met Thr 125 130 135
CAA ATC TTA AAC CCC TTA CAT CTC CAT TTT AAG ATA AAA AGC GTG ATT 486 Gin He Leu Asn Pro Leu His Leu His Phe Lys He Lys Ser Val He 140 145 150
GTT AGC ACC TAT CAA GCC GTG AGT GGG GCA GGG AAC AAG GGC ATA GAG 534 Val Ser Thr Tyr Gin Ala Val Ser Gly Ala Gly Asn Lys Gly He Glu 155 160 165
AGT TTA AAA AAT GAG TTA AAA ACC GCT TTA GAG TGT TTG GAA AAA GAC 582 Ser Leu Lys Asn Glu Leu Lys Thr Ala Leu Glu Cys Leu Glu Lys Asp 170 175 180 185
CCC ACT ATT GAT TTA AAC CAA GTC TTG CAA GCT GGG GCT TTC GCT TAT 630 Pro Thr He Asp Leu Asn Gin Val Leu Gin Ala Gly Ala Phe Ala Tyr 190 195 200
CCG ATC GCT TTC AAT GCG ATC GCT CAT ATT GAT ACT TTT AAG GAG AAT 678 Pro He Ala Phe Asn Ala He Ala His He Asp Thr Phe Lys Glu Asn 205 210 215
GGT TAC ACG AAA GAA GAG CTA AAA ATG CTG CAT GAA ACC CAT AAA ATC 726 Gly Tyr Thr Lys Glu Glu Leu Lys Met Leu His Glu Thr His Lys He 220 225 230
ATG GGC GTG GAT TTC CCT ATC AGC GCG ACT TGC GTG CGC GTG CCG GTA 774 Met Gly Val Asp Phe Pro He Ser Ala Thr Cys Val Arg Val Pro Val 235 240 245
TTG AGG AGC CAT AGC GAG AGT TTG AGT ATC GCT TTT GAA AAA GAA TTC 822 Leu Arg Ser His Ser Glu Ser Leu Ser He Ala Phe Glu Lys Glu Phe 250 255 260 265
GAT CTC AAA GAA GTC TAT GAA GTT TTA AAA AAC GCC CCT AGC GTG GCT 870 Asp Leu Lys Glu Val Tyr Glu Val Leu Lys Asn Ala Pro Ser Val Ala 270 275 280
GTT TGC GAT GAT CCC AGT CAT AAT CTT TAC CCC ACG CCC CTA AAA GCG 918 Val Cys Asp Asp Pro Ser His Asn Leu Tyr Pro Thr Pro Leu Lys Ala 285 290 295
AGC CAC ACG GAT AGC GTC TTT ATA GGG CGC TTG AGG AAG GAT TTG TTT 966 Ser His Thr Asp Ser Val Phe He Gly Arg Leu Arg Lys Asp Leu Phe 300 305 310 GAT AAG AAA ACT TTG CAT GGC TTT TGT GTG GCG GAT CAA TTG AGA GTG 1014
Asp Lys Lys Thr Leu His Gly Phe Cys Val Ala Asp Gin Leu Arg Val 315 320 325
GGG GCA GCC ACC AAC GCA CTC AAA ATC GCT CTG CAT TAC ATT AAG AAC 1062
Gly Ala Ala Thr Asn Ala Leu Lys He Ala Leu His Tyr He Lys Asn
330 335 340 345
GCT TGAGTTTATT CAAAGATAAC AAAGATGAAT GT 1097
Ala
(2) INFORMATION FOR SEQ ID NO: 1012:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 346 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTTON: SEQ TD NO: 1012:
Met Lys Thr Tyr Asn Val Ala He Val Gly Ala Ser Gly Ala Val Gly
1 5 10 15
Gin Glu Leu He Lys Gly Leu Glu Asn Ser Phe Phe Pro He Lys Lys
20 25 30
Phe Val Pro Leu Ala Ser Thr Arg Ser Ala Gly Lys Lys He Lys Ala
35 40 45
Phe Asn Lys Asp Tyr Glu He Leu Glu Thr Thr His Glu Val Phe Glu
50 55 60
Arg Glu Lys He Asp He Ala Phe Phe Ser Ala Gly Gly Ser Val Ser 65 70 75 80
Xaa Glu Phe Ala Thr Ser Ala Ser Lys Thr Ala Leu Val Val Asp Asn
85 90 95
Thr Ser Phe Phe Arg Leu Asn Lys Asp Val Pro Leu Val Val Pro Glu
100 105 110
He Asn Ala Lys Glu He Phe Asn Ala Pro Leu Asn He He Ala Asn
115 120 125
Pro Asn Cys Ser Thr He Gin Met Thr Gin He Leu Asn Pro Leu His
130 135 140
Leu His Phe Lys He Lys Ser Val He Val Ser Thr Tyr Gin Ala Val 145 150 155 160
Ser Gly Ala Gly Asn Lys Gly He Glu Ser Leu Lys Asn Glu Leu Lys
165 170 175
Thr Ala Leu Glu Cys Leu Glu Lys Asp Pro Thr He Asp Leu Asn Gin
180 185 190
Val Leu Gin Ala Gly Ala Phe Ala Tyr Pro He Ala Phe Asn Ala He
195 200 205
Ala His He Asp Thr Phe Lys Glu Asn Gly Tyr Thr Lys Glu Glu Leu
210 215 220
Lys Met Leu His Glu Thr His Lys He Met Gly Val Asp Phe Pro He 225 230 235 240 Ser Ala Thr Cys Val Arg Val Pro Val Leu Arg Ser His Ser Glu Ser
245 250 255
Leu Ser He Ala Phe Glu Lys Glu Phe Asp Leu Lys Glu Val Tyr Glu
260 265 270
Val Leu Lys Asn Ala Pro Ser Val Ala Val Cys Asp Asp Pro Ser His
275 280 285
Asn Leu Tyr Pro Thr Pro Leu Lys Ala Ser His Thr Asp Ser Val Phe
290 295 300
He Gly Arg Leu Arg Lys Asp Leu Phe Asp Lys Lys Thr Leu His Gly 305 310 315 320
Phe Cys Val Ala Asp Gin Leu Arg Val Gly Ala Ala Thr Asn Ala Leu
325 330 335
Lys He Ala Leu His Tyr He Lys Asn Ala 340 345
(2) INFORMATION FOR SEQ ID NO: 1013:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1395 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...1359 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1013:
TAAAATTTTA GCATACAAAT ACAAGGAAAT GGA ATG ATT ACC CCT AAA GTG TTG 54
Met He Thr Pro Lys Val Leu 1 5
AGC GGG TTT AAA GAC CGC TTG CCT AAA GAT GCG ATA CAA AAA GCC CAG 102 Ser Gly Phe Lys Asp Arg Leu Pro Lys Asp Ala He Gin Lys Ala Gin 10 15 20
TTG CTT GCG AAA GTT TCA GTC GTG TTT CAA AGT TTT GGT TTT GTG CCG 150 Leu Leu Ala Lys Val Ser Val Val Phe Gin Ser Phe Gly Phe Val Pro 25 30 35
ATT GAA ACC CCT CAT TTG GAA TAC GCT CAA ACG TTA TTG CCT GAT GCG 198 He Glu Thr Pro His Leu Glu Tyr Ala Gin Thr Leu Leu Pro Asp Ala 40 45 50 55
AGC AGT GAT ATT CAA AAA GAA ATT TAT CGT TTT AAA GAC CAT GGG GAT 246 Ser Ser Asp He Gin Lys Glu He Tyr Arg Phe Lys Asp His Gly Asp 60 65 70
AGA GAT GTG GCT TTA AGG TTT GAT TTG ACT GTG CCA TTA GCC CGC TTT 294 Arg Asp Val Ala Leu Arg Phe Asp Leu Thr Val Pro Leu Ala Arg Phe 75 80 85
GTC TCT TTG CAC CAC CAA ACG CTA GGC ATG CCC TTT AAA CGC TAC GCT 342 Val Ser Leu His His Gin Thr Leu Gly Met Pro Phe Lys Arg Tyr Ala 90 95 100
ATA GGC AAT GTC TTT AGG GGC GAA AGG GCG CAA AAA GGG CGT TAT AGG 390 He Gly Asn Val Phe Arg Gly Glu Arg Ala Gin Lys Gly Arg Tyr Arg 105 110 115
GAA TTT ACG CAA TGC GAT TTT GAT TTT ATA GGG AGC GAG AGT TTG GTG 438 Glu Phe Thr Gin Cys Asp Phe Asp Phe He Gly Ser Glu Ser Leu Val 120 125 130 135
TGC GAT GCT GAG ATC ATT CAA GTG ATT GTC GCT TCT TTA AAA GCC CTA 486 Cys Asp Ala Glu He He Gin Val He Val Ala Ser Leu Lys Ala Leu 140 145 150
GAT TTA GAA GAT TTT TGC GTC TCT ATC AAC CAC AGA AAA ATT TTG AAC 534 Asp Leu Glu Asp Phe Cys Val Ser He Asn His Arg Lys He Leu Asn 155 160 165
GGG ATA TGC GAA TAT TTT GGG ATC TCT CAA GTG AAT GAA GCG TTG CGC 582 Gly He Cys Glu Tyr Phe Gly He Ser Gin Val Asn Glu Ala Leu Arg 170 175 180
ATT GTG GAT AAA TTG GAA AAA ATT GGC TTG AAT GGG GTT GAA GAA GAA 630 He Val Asp Lys Leu Glu Lys He Gly Leu Asn Gly Val Glu Glu Glu 185 190 195
TTA AAA AAA GAG TGC GGT TTA AAT TCA AAC ACC ATT AAA GAG CTT TTA 678 Leu Lys Lys Glu Cys Gly Leu Asn Ser Asn Thr He Lys Glu Leu Leu 200 205 210 215
GAA TTA ATT CAA ATC AAA CAA AAC GAT TTA AGC CAT GCG GAA TTT TTT 726 Glu Leu He Gin He Lys Gin Asn Asp Leu Ser His Ala Glu Phe Phe 220 225 230
GAA AAA ATT GCT TAT TTG AAA GAC TAT AAT GAA AAT CTA AAA AAA GGC 774 Glu Lys He Ala Tyr Leu Lys Asp Tyr Asn Glu Asn Leu Lys Lys Gly 235 240 245
ATA CAG GAT TTA GAA AGG CTA TAC CAG TTG CTA GGG GAT TTG CAA ATT 822 He Gin Asp Leu Glu Arg Leu Tyr Gin Leu Leu Gly Asp Leu Gin He 250 255 260
TCT CAA AAC CTG TAT AAA ATT GAT TTT TCT ATC GCT AGG GGA TTA GGG 870 Ser Gin Asn Leu Tyr Lys He Asp Phe Ser He Ala Arg Gly Leu Gly 265 270 275
TAT TAT ACA GGG ATT GTG TAT GAA ACC ACG CTT AAT GAA ATG AAG TCT 918 Tyr Tyr Thr Gly He Val Tyr Glu Thr Thr Leu Asn Glu Met Lys Ser 280 285 290 295
I486- TTA GGG AGC GTG TGT TCA GGG GGG CGT TAT GAT CAT TTG ACT AAA AAT 966 Leu Gly Ser Val Cys Ser Gly Gly Arg Tyr Asp His Leu Thr Lys Asn 300 305 310
TTT TCT AAA GAG AAT TTA CAA GGG GTA GGG GCT TCT ATT GGG ATT GAT 1014 Phe Ser Lys Glu Asn Leu Gin Gly Val Gly Ala Ser He Gly He Asp 315 320 325
CGA TTG ATT GTG GCT TTG AGT GAA ATG CAA TTA TTA GAC GAG CGC TCC 1062 Arg Leu He Val Ala Leu Ser Glu Met Gin Leu Leu Asp Glu Arg Ser 330 335 340
ACC CAA GCC AAA GTT TTA ATC GCT TGC ATG CAT GAA GAG TAT TTT TCT 1110 Thr Gin Ala Lys Val Leu He Ala Cys Met His Glu Glu Tyr Phe Ser 345 350 355
TAC GCC AAC CGC TTA GCG GAG TCT TTA AGG CAA AGC GGG ATT TTT AGT 1158 Tyr Ala Asn Arg Leu Ala Glu Ser Leu Arg Gin Ser Gly He Phe Ser 360 365 370 375
GAA GTC TAT CCA GAA GCT CAA AAA ATC AAA AAA CCC TTT TCT TAT GCC 1206 Glu Val Tyr Pro Glu Ala Gin Lys He Lys Lys Pro Phe Ser Tyr Ala 380 385 390
AAC CAT AAA GGG CAT GAG TTT GTG GCT GTC ATT GGC GAA GAA GAA TTT 1254 Asn His Lys Gly His Glu Phe Val Ala Val He Gly Glu Glu Glu Phe 395 400 405
AAA AGC GAA ACT TTA AGC TTG AAA AAC ATG CAT TCA GGC ATG CAG TTG 1302 Lys Ser Glu Thr Leu Ser Leu Lys Asn Met His Ser Gly Met Gin Leu 410 415 420
AAT TGC TTG AGT TTT TTA AAA GCC CTT GAA ATC ATT GGA GAA AAC GAT 1350 Asn Cys Leu Ser Phe Leu Lys Ala Leu Glu He He Gly Glu Asn Asp 425 430 435
GAA GAC TTA TAATGTCGCT ATTGTTGGGG CCAGTGGGGC GGTAGG 1395
Glu Asp Leu
440
(2) INFORMATION FOR SEQ ID NO: 1014:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 442 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1014:
Met He Thr Pro Lys Val Leu Ser Gly Phe Lys Asp Arg Leu Pro Lys 1 5 10 15 Asp Ala He Gin Lys Ala Gin Leu Leu Ala Lys Val Ser Val Val Phe
20 25 30
Gin Ser Phe Gly Phe Val Pro He Glu Thr Pro His Leu Glu Tyr Ala
35 40 45
Gin Thr Leu Leu Pro Asp Ala Ser Ser Asp He Gin Lys Glu He Tyr
50 55 60
Arg Phe Lys Asp His Gly Asp Arg Asp Val Ala Leu Arg Phe Asp Leu 65 70 75 80
Thr Val Pro Leu Ala Arg Phe Val Ser Leu His His Gin Thr Leu Gly
85 90 95
Met Pro Phe Lys Arg Tyr Ala He Gly Asn Val Phe Arg Gly Glu Arg
100 105 110
Ala Gin Lys Gly Arg Tyr Arg Glu Phe Thr Gin Cys Asp Phe Asp Phe
115 120 125
He Gly Ser Glu Ser Leu Val Cys Asp Ala Glu He He Gin Val He
130 135 140
Val Ala Ser Leu Lys Ala Leu Asp Leu Glu Asp Phe Cys Val Ser He 145 150 155 160
Asn His Arg Lys He Leu Asn Gly He Cys Glu Tyr Phe Gly He Ser
165 170 175
Gin Val Asn Glu Ala Leu Arg He Val Asp Lys Leu Glu Lys He Gly
180 185 190
Leu Asn Gly Val Glu Glu Glu Leu Lys Lys Glu Cys Gly Leu Asn Ser
195 200 205
Asn Thr He Lys Glu Leu Leu Glu Leu He Gin He Lys Gin Asn Asp
210 215 220
Leu Ser His Ala Glu Phe Phe Glu Lys He Ala Tyr Leu Lys Asp Tyr 225 230 235 240
Asn Glu Asn Leu Lys Lys Gly He Gin Asp Leu Glu Arg Leu Tyr Gin
245 250 255
Leu Leu Gly Asp Leu Gin He Ser Gin Asn Leu Tyr Lys He Asp Phe
260 265 270
Ser He Ala Arg Gly Leu Gly Tyr Tyr Thr Gly He Val Tyr Glu Thr
275 280 285
Thr Leu Asn Glu Met Lys Ser Leu Gly Ser Val Cys Ser Gly Gly Arg
290 295 300
Tyr Asp His Leu Thr Lys Asn Phe Ser Lys Glu Asn Leu Gin Gly Val 305 310 315 320
Gly Ala Ser He Gly He Asp Arg Leu He Val Ala Leu Ser Glu Met
325 330 335
Gin Leu Leu Asp Glu Arg Ser Thr Gin Ala Lys Val Leu He Ala Cys
340 345 350
Met His Glu Glu Tyr Phe Ser Tyr Ala Asn Arg Leu Ala Glu Ser Leu
355 360 365
Arg Gin Ser Gly He Phe Ser Glu Val Tyr Pro Glu Ala Gin Lys He
370 375 380
Lys Lys Pro Phe Ser Tyr Ala Asn His Lys Gly His Glu Phe Val Ala 385 390 395 400
Val He Gly Glu Glu Glu Phe Lys Ser Glu Thr Leu Ser Leu Lys Asn
405 410 415
Met His Ser Gly Met Gin Leu Asn Cys Leu Ser Phe Leu Lys Ala Leu
420 425 430
Glu He He Gly Glu Asn Asp Glu Asp Leu 435 440 (2) INFORMATION FOR SEQ ID NO: 1015:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 639 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 70...597 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1015:
CACGCCCATA GGAGAAGTGG CAGAAGTTAT GCAGCTCTTA TTAAAGAAGG AAAAATTAAA 60 GCTTGGGGG ATG AGT GAG GCA GGG TTA TCT AGC ATC CAA AAA GCC CAT CAA 111 Met Ser Glu Ala Gly Leu Ser Ser He Gin Lys Ala His Gin 1 5 10
ATT TGC CCT TTA AGC GCG TTG CAG AGC GAA TAT TCC TTG TGG TGG CGC 159 He Cys Pro Leu Ser Ala Leu Gin Ser Glu Tyr Ser Leu Trp Trp Arg 15 20 25 30
GAA CCT GAA AAA GAG ATT TTA GGT TTT TTA GAA AAA GAA AAA ATT GGA 207 Glu Pro Glu Lys Glu He Leu Gly Phe Leu Glu Lys Glu Lys He Gly 35 40 45
TTT GTC GCT TTT TCG CCT TTG GGT AAG GGG TTT TTA GGC GCG AAA TTT 255 Phe Val Ala Phe Ser Pro Leu Gly Lys Gly Phe Leu Gly Ala Lys Phe 50 55 60
GAA AAA AAT GCC ACT TTC GCT AGT GAG GAT TTT AGA AGC GTT TCT CCT 303 Glu Lys Asn Ala Thr Phe Ala Ser Glu Asp Phe Arg Ser Val Ser Pro 65 70 75
AGG TTT AAT CAA GAA AAT CTA GCC AAA AAT TAC GCC TTG GTG GAA TTA 351 Arg Phe Asn Gin Glu Asn Leu Ala Lys Asn Tyr Ala Leu Val Glu Leu 80 85 90
ATC CAA GAT CAT GCA CAC GCT AAA GGC GTT ACA CCA GCC CAA CTG GCT 399 He Gin Asp His Ala His Ala Lys Gly Val Thr Pro Ala Gin Leu Ala 95 100 105 110
CTC TCA TGG ATT TTG CAC ACG CAA AAA ATC ATT GTC CCT CTC TTT GGC 447 Leu Ser Trp He Leu His Thr Gin Lys He He Val Pro Leu Phe Gly 115 120 125
ACC ACC AAA GAA TCT AGG CTC ATA GAA AAT ATA GGG GCT TTG CAG GTT 495 Thr Thr Lys Glu Ser Arg Leu He Glu Asn He Gly Ala Leu Gin Val 130 135 140 TCT TGG AGT CAA AAA GAA TTG GAG ATT TTC CAA AAA GAA TTG ACT GCA 543 Ser Trp Ser Gin Lys Glu Leu Glu He Phe Gin Lys Glu Leu Thr Ala 145 150 155
ATC AAA ATA GAA GGG GCC CGC TAC CCT GAA AGA ATC AAT GAA ATG GTG 591 He Lys He Glu Gly Ala Arg Tyr Pro Glu Arg He Asn Glu Met Val 160 165 170
AAT CAA TAAAAGTATT GGGTATTTAT AATTGCATTG GCTCTTTTAA AA 639
Asn Gin
175
(2) INFORMATION FOR SEQ ID NO: 1016:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 176 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1016:
Met Ser Glu Ala Gly Leu Ser Ser He Gin Lys Ala His Gin He Cys
1 5 10 15
Pro Leu Ser Ala Leu Gin Ser Glu Tyr Ser Leu Trp Trp Arg Glu Pro
20 25 30
Glu Lys Glu He Leu Gly Phe Leu Glu Lys Glu Lys He Gly Phe Val
35 40 45
Ala Phe Ser Pro Leu Gly Lys Gly Phe Leu Gly Ala Lys Phe Glu Lys
50 55 60
Asn Ala Thr Phe Ala Ser Glu Asp Phe Arg Ser Val Ser Pro Arg Phe 65 70 75 80
Asn Gin Glu Asn Leu Ala Lys Asn Tyr Ala Leu Val Glu Leu He Gin
85 90 95
Asp His Ala His Ala Lys Gly Val Thr Pro Ala Gin Leu Ala Leu Ser
100 105 110
Trp He Leu His Thr Gin Lys He He Val Pro Leu Phe Gly Thr Thr
115 120 125
Lys Glu Ser Arg Leu He Glu Asn He Gly Ala Leu Gin Val Ser Trp
130 135 140
Ser Gin Lys Glu Leu Glu He Phe Gin Lys Glu Leu Thr Ala He Lys 145 150 155 160
He Glu Gly Ala Arg Tyr Pro Glu Arg He Asn Glu Met Val Asn Gin 165 170 175
(2) INFORMATION FOR SEQ ID NO: 1017:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2133 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25...2088 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1017:
TAATTTAAAA AAGGAACATT AAAT ATG GAT TTT ATC ACC ATC AAT TCT AGT 51
Met Asp Phe He Thr He Asn Ser Ser 1 5
AAC AAA ACC GAA GAG TTC GCT CTC AAA CAA GTG GCC AAA CAA GCC ACC 99 Asn Lys Thr Glu Glu Phe Ala Leu Lys Gin Val Ala Lys Gin Ala Thr 10 15 20 25
AGC TCT CTT TTA TAC CGA TTA GGA AAA ACC ATT ATT TTA GCG AGC GTG 147 Ser Ser Leu Leu Tyr Arg Leu Gly Lys Thr He He Leu Ala Ser Val 30 35 40
TGC GTG GAA AGA GAG CCT GTG AGT GAA GAT TTT CTG CCT TTA GTG GTG 195 Cys Val Glu Arg Glu Pro Val Ser Glu Asp Phe Leu Pro Leu Val Val 45 50 55
CAG TTT TTA GAA AAA TCT TAT GCA GCC GGA AAG ATC CCG GGC GGT TTT 243 Gin Phe Leu Glu Lys Ser Tyr Ala Ala Gly Lys He Pro Gly Gly Phe 60 65 70
GTT AAA AGA GAA GGC AGG GCG CAA GAT TTT GAA ATC TTA ACC TCT AGG 291 Val Lys Arg Glu Gly Arg Ala Gin Asp Phe Glu He Leu Thr Ser Arg 75 80 85
CTC ATA GAC AGG ACT TTA CGC CCT TTA TTC CCT AAA GAC TAC CGC TAC 339 Leu He Asp Arg Thr Leu Arg Pro Leu Phe Pro Lys Asp Tyr Arg Tyr 90 95 100 105
CCT ACA CAG ATC ACT TTA ATG GTT TTA AGC CAT GAT ATT GAA AAT GAC 387 Pro Thr Gin He Thr Leu Met Val Leu Ser His Asp He Glu Asn Asp 110 115 120
TTG CAG GTT TCT GCT TTA AAC GCC GCT TCA GCC GCT CTC TTT TTG GCC 435 Leu Gin Val Ser Ala Leu Asn Ala Ala Ser Ala Ala Leu Phe Leu Ala 125 130 135
CAT ATC GCT CCT ATT AAA AGC GTG AGC GCT TGC AGG ATC GCT AGG ATG 483 His He Ala Pro He Lys Ser Val Ser Ala Cys Arg He Ala Arg Met 140 145 150
GAT AAC GAA TTT ATC ATT AAC CCT AGC GCA AGC CTT TTG AAT CAA TCC 531 Asp Asn Glu Phe He He Asn Pro Ser Ala Ser Leu Leu Asn Gin Ser 155 160 165 AGT TTG GAT TTG TTC GTG TCT GGA ACG AAA GAG AGT TTG AAC ATG ATA 579 Ser Leu Asp Leu Phe Val Ser Gly Thr Lys Glu Ser Leu Asn Met He 170 175 180 185
GAA ATG CGC TCT TTG GGG CAA AAA TTG AAC GCT TTA GAA GAG CCT TTA 627 Glu Met Arg Ser Leu Gly Gin Lys Leu Asn Ala Leu Glu Glu Pro Leu 190 195 200
ATG TTA GAA GCT TTA GAA TTG GCT CAA AAA AGT TTG GAA GAA ACT TGC 675 Met Leu Glu Ala Leu Glu Leu Ala Gin Lys Ser Leu Glu Glu Thr Cys 205 210 215
ACG CTT TAT GAA GAG ATT TTC ACG CCC CAC CAA AAC GAG CTG TTT TTC 723 Thr Leu Tyr Glu Glu He Phe Thr Pro His Gin Asn Glu Leu Phe Phe 220 225 230
AAA GAG AGC CAA GGA ATA GTC TTT AAT GAA AGG CTG TTA GAT TTA TTG 771 Lys Glu Ser Gin Gly He Val Phe Asn Glu Arg Leu Leu Asp Leu Leu 235 240 245
AAA AAT CAG TAT TTT GAT GAA ATC ATC AAA GGC ATT GAA AGT TCT GCT 819 Lys Asn Gin Tyr Phe Asp Glu He He Lys Gly He Glu Ser Ser Ala 250 255 260 265
TTG AGC GAG CGA GAA AAT GTT TTC AAT GAA ATT GCC AGA AAA ATC AGT 867 Leu Ser Glu Arg Glu Asn Val Phe Asn Glu He Ala Arg Lys He Ser 270 275 280
GAA GCC CAC TCA GAA TTC AGT TTA GAA GAA ATT GAA TTG TCT TTA GAA 915 Glu Ala His Ser Glu Phe Ser Leu Glu Glu He Glu Leu Ser Leu Glu 285 290 295
AAA GTG AAA AAG ACT GAG ATA AGA CGC ATG ATC ATT AAG GAT AAA ATC 963 Lys Val Lys Lys Thr Glu He Arg Arg Met He He Lys Asp Lys He 300 305 310
CGC CCG GAT AAG CGC GCG TTA GAA GAA GTG CGG CCC ATT TTG ATA GAG 1011 Arg Pro Asp Lys Arg Ala Leu Glu Glu Val Arg Pro He Leu He Glu 315 320 325
AGC GAT TTG CTC CCT ATG GCG CAT AGC TCC ATT TTA TTC ACT AGG GGG 1059 Ser Asp Leu Leu Pro Met Ala His Ser Ser He Leu Phe Thr Arg Gly 330 335 340 345
CAA ACT CAA AGC TTA GTG GTA GGG GTT TTA GGC ACG GAT AAT GAC GCT 1107 Gin Thr Gin Ser Leu Val Val Gly Val Leu Gly Thr Asp Asn Asp Ala 350 355 360
CAA ACC CAT GAG AGT TTG GAG CAT AAA GCT CCC ATT AAA GAG CGC TTC 1155 Gin Thr His Glu Ser Leu Glu His Lys Ala Pro He Lys Glu Arg Phe 365 370 375
ATG TTT CAT TAT AAT TTC CCT CCT TTC TGC GTG GGC GAA GCG AGT TCT 1203 Met Phe His Tyr Asn Phe Pro Pro Phe Cys Val Gly Glu Ala Ser Ser 380 385 390 ATT GGC GCG GCT TCA AGG CGT GAA TTA GGG CAT GGG AAT TTG GCT AAA 1251 He Gly Ala Ala Ser Arg Arg Glu Leu Gly His Gly Asn Leu Ala Lys 395 400 405
AGA GCC TTA GAA ACG AGC ATT AAA AAT AAA GAG CAG GTG ATA CGA TTG 1299 Arg Ala Leu Glu Thr Ser He Lys Asn Lys Glu Gin Val He Arg Leu 410 415 420 425
GTT TCT GAG ATT TTA GAA AGC AAT GGT TCA AGC TCA ATG GCG AGC GTG 1347 Val Ser Glu He Leu Glu Ser Asn Gly Ser Ser Ser Met Ala Ser Val 430 435 440
TGC GCA GGC TCT TTA GCC CTT TAT GCA AGC GGT GTG GAA ATT TAC GAT 1395 Cys Ala Gly Ser Leu Ala Leu Tyr Ala Ser Gly Val Glu He Tyr Asp 445 450 455
TTA GTC GCT GGG GTG GCT ATG GGC ATG GTG AGC GAA GGG CAA GAT CAC 1443 Leu Val Ala Gly Val Ala Met Gly Met Val Ser Glu Gly Gin Asp His 460 465 470
GCT ATT TTA AGC GAT ATT AGC GGC TTA GAA GAC GCA GAA GGC GAT ATG 1491 Ala He Leu Ser Asp He Ser Gly Leu Glu Asp Ala Glu Gly Asp Met 475 480 485
GAT TTT AAG ATT GCT GGG AAT TTA GAA GGC ATT ACG GCC ATG CAA ATG 1539 Asp Phe Lys He Ala Gly Asn Leu Glu Gly He Thr Ala Met Gin Met 490 495 500 505
GAT ACC AAA ATG AGC GGT ATC AAG CTA GAA ATT TTA TAC CAA GCC TTA 1587 Asp Thr Lys Met Ser Gly He Lys Leu Glu He Leu Tyr Gin Ala Leu 510 515 520
CTC CAA GCC AAA GAA GCA CGG AAA CAT ATT TTA AAA ATC ATG CAT GAA 1635 Leu Gin Ala Lys Glu Ala Arg Lys His He Leu Lys He Met His Glu 525 530 535
GCG AAA GAA AAG ATT GTG ATC AAT TTT TCC CAT TTG CCC ACA ACG GAG 1683 Ala Lys Glu Lys He Val He Asn Phe Ser His Leu Pro Thr Thr Glu 540 545 550
ATT TTT AAT GTC GCA CCC GAT AAA ATT GTA GAA ATT ATC GGT CAA GGG 1731 He Phe Asn Val Ala Pro Asp Lys He Val Glu He He Gly Gin Gly 555 560 565
GGG CGT GTG ATT AAA GAG ATA GTA GAA AAG TTT GAA GTT AAA ATT GAT 1779 Gly Arg Val He Lys Glu He Val Glu Lys Phe Glu Val Lys He Asp 570 575 580 585
TTG AAC AAA CCG AGC GGT GAA GTG AAA ATC ATG GGG AAT AAA GAG CGC 1827 Leu Asn Lys Pro Ser Gly Glu Val Lys He Met Gly Asn Lys Glu Arg 590 595 600
GTT TTA AAG ACT AAG GAA TTT ATT TTA AAC TAC TTG CAT TCT TTA GAT 1875 Val Leu Lys Thr Lys Glu Phe He Leu Asn Tyr Leu His Ser Leu Asp 605 610 615 CAA GAA TTG GAG CAA TAC GCT ATT GAT GAG GTA TTA GAA GCT CAA GTG 1923 Gin Glu Leu Glu Gin Tyr Ala He Asp Glu Val Leu Glu Ala Gin Val 620 625 630
AAA CGA ATC GTG GAT TTT GGG GCG TTT TTA AGC TTG CCT AAG GGG GGC 1971 Lys Arg He Val Asp Phe Gly Ala Phe Leu Ser Leu Pro Lys Gly Gly 635 640 645
GAA GGC TTG TTA AGA AAG CAA AAC ATG GAC AAG TGT CAA GTG GTT TTA 2019 Glu Gly Leu Leu Arg Lys Gin Asn Met Asp Lys Cys Gin Val Val Leu 650 655 660 665
AAA GAA GGC GAT AGC ATC AGG TGT AGG GTG ATT AGC TTC AAT AAG GGT 2067 Lys Glu Gly Asp Ser He Arg Cys Arg Val He Ser Phe Asn Lys Gly 670 675 680
AAA ATC GCT TTA GAT TTG GCT TAAAATTTTA AAAAGCGTTT TTTAAAAGCG TTTT 2122 Lys He Ala Leu Asp Leu Ala 685
TAAGCTAGTT T 2133
(2) INFORMATION FOR SEQ ID NO: 1018:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 688 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1018:
Met Asp Phe He Thr He Asn Ser Ser Asn Lys Thr Glu Glu Phe Ala
1 5 10 15
Leu Lys Gin Val Ala Lys Gin Ala Thr Ser Ser Leu Leu Tyr Arg Leu
20 25 30
Gly Lys Thr He He Leu Ala Ser Val Cys Val Glu Arg Glu Pro Val
35 40 45
Ser Glu Asp Phe Leu Pro Leu Val Val Gin Phe Leu Glu Lys Ser Tyr
50 55 60
Ala Ala Gly Lys He Pro Gly Gly Phe Val Lys Arg Glu Gly Arg Ala 65 70 75 80
Gin Asp Phe Glu He Leu Thr Ser Arg Leu He Asp Arg Thr Leu Arg
85 90 95
Pro Leu Phe Pro Lys Asp Tyr Arg Tyr Pro Thr Gin He Thr Leu Met
100 105 110
Val Leu Ser His Asp He Glu Asn Asp Leu Gin Val Ser Ala Leu Asn
115 120 125
Ala Ala Ser Ala Ala Leu Phe Leu Ala His He Ala Pro He Lys Ser
130 135 140
Val Ser Ala Cys Arg He Ala Arg Met Asp Asn Glu Phe He He Asn 145 150 155 160
Pro Ser Ala Ser Leu Leu Asn Gin Ser Ser Leu Asp Leu Phe Val Ser 165 170 175
Gly Thr Lys Glu Ser Leu Asn Met He Glu Met Arg Ser Leu Gly Gin
180 185 190
Lys Leu Asn Ala Leu Glu Glu Pro Leu Met Leu Glu Ala Leu Glu Leu
195 200 205
Ala Gin Lys Ser Leu Glu Glu Thr Cys Thr Leu Tyr Glu Glu He Phe
210 215 220
Thr Pro His Gin Asn Glu Leu Phe Phe Lys Glu Ser Gin Gly He Val 225 230 235 240
Phe Asn Glu Arg Leu Leu Asp Leu Leu Lys Asn Gin Tyr Phe Asp Glu
245 250 255
He He Lys Gly He Glu Ser Ser Ala Leu Ser Glu Arg Glu Asn Val
260 265 270
Phe Asn Glu He Ala Arg Lys He Ser Glu Ala His Ser Glu Phe Ser
275 280 285
Leu Glu Glu He Glu Leu Ser Leu Glu Lys Val Lys Lys Thr Glu He
290 295 300
Arg Arg Met He He Lys Asp Lys He Arg Pro Asp Lys Arg Ala Leu 305 310 315 320
Glu Glu Val Arg Pro He Leu He Glu Ser Asp Leu Leu Pro Met Ala
325 330 335
His Ser Ser He Leu Phe Thr Arg Gly Gin Thr Gin Ser Leu Val Val
340 345 350
Gly Val Leu Gly Thr Asp Asn Asp Ala Gin Thr His Glu Ser Leu Glu
355 360 365
His Lys Ala Pro He Lys Glu Arg Phe Met Phe His Tyr Asn Phe Pro
370 375 380
Pro Phe Cys Val Gly Glu Ala Ser Ser He Gly Ala Ala Ser Arg Arg 385 390 395 400
Glu Leu Gly His Gly Asn Leu Ala Lys Arg Ala Leu Glu Thr Ser He
405 410 415
Lys Asn Lys Glu Gin Val He Arg Leu Val Ser Glu He Leu Glu Ser
420 425 430
Asn Gly Ser Ser Ser Met Ala Ser Val Cys Ala Gly Ser Leu Ala Leu
435 440 445
Tyr Ala Ser Gly Val Glu He Tyr Asp Leu Val Ala Gly Val Ala Met
450 455 460
Gly Met Val Ser Glu Gly Gin Asp His Ala He Leu Ser Asp He Ser 465 470 475 480
Gly Leu Glu Asp Ala Glu Gly Asp Met Asp Phe Lys He Ala Gly Asn
485 490 495
Leu Glu Gly He Thr Ala Met Gin Met Asp Thr Lys Met Ser Gly He
500 505 510
Lys Leu Glu He Leu Tyr Gin Ala Leu Leu Gin Ala Lys Glu Ala Arg
515 520 525
Lys His He Leu Lys He Met His Glu Ala Lys Glu Lys He Val He
530 535 540
Asn Phe Ser His Leu Pro Thr Thr Glu He Phe Asn Val Ala Pro Asp 545 550 555 560
Lys He Val Glu He He Gly Gin Gly Gly Arg Val He Lys Glu He
565 570 575
Val Glu Lys Phe Glu Val Lys He Asp Leu Asn Lys Pro Ser Gly Glu
580 585 590
Val Lys He Met Gly Asn Lys Glu Arg Val Leu Lys Thr Lys Glu Phe 595 600 605 He Leu Asn Tyr Leu His Ser Leu Asp Gin Glu Leu Glu Gin Tyr Ala
610 615 620
He Asp Glu Val Leu Glu Ala Gin Val Lys Arg He Val Asp Phe Gly 625 630 635 640
Ala Phe Leu Ser Leu Pro Lys Gly Gly Glu Gly Leu Leu Arg Lys Gin
645 650 655
Asn Met Asp Lys Cys Gin Val Val Leu Lys Glu Gly Asp Ser He Arg
660 665 670
Cys Arg Val He Ser Phe Asn Lys Gly Lys He Ala Leu Asp Leu Ala 675 680 685
(2) INFORMATION FOR SEQ ID NO: 1019:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1340 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25...1296 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1019:
AAACAACCAC ATTGCAGGAA AGAC ATG AAA GAT AAC AAT AAC TAT AAT GTT 51
Met Lys Asp Asn Asn Asn Tyr Asn Val 1 5
TTA ATT GTG GGG AAT AAG GGG CGA GAG TAT GCT TTG GCT CAA AGG CTT 99 Leu He Val Gly Asn Lys Gly Arg Glu Tyr Ala Leu Ala Gin Arg Leu 10 15 20 25
CAG CAA GAT GAG CGA GTG AAT GCT TTG TAT TTT TGT TTG GGT AAT GGT 147 Gin Gin Asp Glu Arg Val Asn Ala Leu Tyr Phe Cys Leu Gly Asn Gly 30 35 40
GGC ACT CAA GAT TTA GGC GAG AAT CTG GAA TGC GAA CAT TAC GAG CAT 195 Gly Thr Gin Asp Leu Gly Glu Asn Leu Glu Cys Glu His Tyr Glu His 45 50 55
ATC GTG GAA TTA GCC CTG AAA AAA CAG ATC CAT TTA GCC ATC ATT TCA 243 He Val Glu Leu Ala Leu Lys Lys Gin He His Leu Ala He He Ser 60 65 70
GAA GAA GAG TTT TTG GTT TTA GGG CTT ACA GAA ATG CTA GAA AAA GCG 291 Glu Glu Glu Phe Leu Val Leu Gly Leu Thr Glu Met Leu Glu Lys Ala 75 80 85
GGG ATT TTA GTG TTT GGG GCT TCT AAA GAA GCG GCT AAG TTA GAG GCT 339 Gly He Leu Val Phe Gly Ala Ser Lys Glu Ala Ala Lys Leu Glu Ala 90 95 100 105
TCT AAA AGC TAT ATG AAA GCT TTT GTT AAA GAG TGT GGC ATC AAA AGT 387 Ser Lys Ser Tyr Met Lys Ala Phe Val Lys Glu Cys Gly He Lys Ser 110 115 120
GCG TCT TAC TTT GAA ACA AAC GAC TTA AAA GAA GCT TTG AGT TAC ATT 435 Ala Ser Tyr Phe Glu Thr Asn Asp Leu Lys Glu Ala Leu Ser Tyr He 125 130 135
CAA AAC GCT TCT TTC CCC TTA GTC ATT AAA GCG TTG AAT AAA AAC ACA 483 Gin Asn Ala Ser Phe Pro Leu Val He Lys Ala Leu Asn Lys Asn Thr 140 145 150
AGC ATT GTC TAT CAA GAA GAA GAA GCG ATA AAA ATC CTT GAA GAC GCT 531 Ser He Val Tyr Gin Glu Glu Glu Ala He Lys He Leu Glu Asp Ala 155 160 165
TTC AAA CAA AGC AAT GAG CCT GTG ATT ATA GAG CCT TTT TTA GAG GGA 579 Phe Lys Gin Ser Asn Glu Pro Val He He Glu Pro Phe Leu Glu Gly 170 175 180 185
TTT GAG CTT TCA GTT ACA GCG CTC ATA GCC AAT GAT GAT TTT ATC TTG 627 Phe Glu Leu Ser Val Thr Ala Leu He Ala Asn Asp Asp Phe He Leu 190 195 200
TTG CCC TTT TGC CAA AAC TAC AAA CGC TTA TTA GAG GGG GAT AAT GGG 675 Leu Pro Phe Cys Gin Asn Tyr Lys Arg Leu Leu Glu Gly Asp Asn Gly 205 210 215
GTC AAT ACG GGG GGT ATG GGG GCC ATC GCT CCT GCA AAC TTT TTC TCT 723 Val Asn Thr Gly Gly Met Gly Ala He Ala Pro Ala Asn Phe Phe Ser 220 225 230
AAT GAA TTA GAA GAG AAA ATA AAA AAT CAT ATC TTT AAA CCC ACT TTA 771 Asn Glu Leu Glu Glu Lys He Lys Asn His He Phe Lys Pro Thr Leu 235 240 245
GAG AAA CTT CAG GCT GAC AAC ACG CCT TTT AAA GGG GTT TTA CTC GCT 819 Glu Lys Leu Gin Ala Asp Asn Thr Pro Phe Lys Gly Val Leu Leu Ala 250 255 260 265
GAA ATT GTA ATC ATA GAA GAA AAA GGC GTT TTA GAG CCG TAT TTA TTG 867 Glu He Val He He Glu Glu Lys Gly Val Leu Glu Pro Tyr Leu Leu 270 275 280
GAT TTT AGC GTG CGT TTT AAA GAC ATT GAA TGC CAG ACG ATT TTA CCC 915 Asp Phe Ser Val Arg Phe Lys Asp He Glu Cys Gin Thr He Leu Pro 285 290 295
CTT TTA GAA AGC TCG CTT TTA GAT TTG TGT TTG GCC ACA GCC AAA GGG 963 Leu Leu Glu Ser Ser Leu Leu Asp Leu Cys Leu Ala Thr Ala Lys Gly 300 305 310 GAA TTA CAT TCT CTT GAA TTG GTG TTT TCT AAA GAA TTT GTG ATG AGT 1011 Glu Leu His Ser Leu Glu Leu Val Phe Ser Lys Glu Phe Val Met Ser 315 320 325
GTG GCG CTT GTT TCT AGG AAT TAC CCC ACT AGC TCT TCG CCC AAA CAA 1059 Val Ala Leu Val Ser Arg Asn Tyr Pro Thr Ser Ser Ser Pro Lys Gin 330 335 340 345
ACC CTT TAT ATT GAT CCG GTT GAT GAA AAA AAG GGT CAT TTG ATT TTA 1107 Thr Leu Tyr He Asp Pro Val Asp Glu Lys Lys Gly His Leu He Leu 350 355 360
GGG GAG GTG GAG CAG GAT AAT GGC GTG TTT GAA AGC AGT GGG GGG AGG 1155 Gly Glu Val Glu Gin Asp Asn Gly Val Phe Glu Ser Ser Gly Gly Arg 365 370 375
GTG ATC TTT GCC ATT GGT AGG GGA AAA TCC TTA TTA GAA GCC AGA AAC 1203 Val He Phe Ala He Gly Arg Gly Lys Ser Leu Leu Glu Ala Arg Asn 380 385 390
CAT GCT TAT GAA ATC GCT CAA AAG GTG CAT TTT GAA GGC ATG TTT TAT 1251 His Ala Tyr Glu He Ala Gin Lys Val His Phe Glu Gly Met Phe Tyr 395 400 405
CGC AAG GAT ATT GGT TTT AAG GTG TTA GAT TTG AAA GAA TAT TCT TAAAG 1301 Arg Lys Asp He Gly Phe Lys Val Leu Asp Leu Lys Glu Tyr Ser 410 415 420
GTTAAAGTTT AAGACAAACC AAAGAGTTTG TCTTGTTTG 1340
(2) INFORMATION FOR SEQ ID NO: 1020:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 424 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1020:
Met Lys Asp Asn Asn Asn Tyr Asn Val Leu He Val Gly Asn Lys Gly
1 5 10 15
Arg Glu Tyr Ala Leu Ala Gin Arg Leu Gin Gin Asp Glu Arg Val Asn
20 25 30
Ala Leu Tyr Phe Cys Leu Gly Asn Gly Gly Thr Gin Asp Leu Gly Glu
35 40 45
Asn Leu Glu Cys Glu His Tyr Glu His He Val Glu Leu Ala Leu Lys
50 55 60
Lys Gin He His Leu Ala He He Ser Glu Glu Glu Phe Leu Val Leu
65 70 75 80
Gly Leu Thr Glu Met Leu Glu Lys Ala Gly He Leu Val Phe Gly Ala
85 90 95
Ser Lys Glu Ala Ala Lys Leu Glu Ala Ser Lys Ser Tyr Met Lys Ala 100 105 110
Phe Val Lys Glu Cys Gly He Lys Ser Ala Ser Tyr Phe Glu Thr Asn
115 120 125
Asp Leu Lys Glu Ala Leu Ser Tyr He Gin Asn Ala Ser Phe Pro Leu
130 135 140
Val He Lys Ala Leu Asn Lys Asn Thr Ser He Val Tyr Gin Glu Glu 145 150 155 160
Glu Ala He Lys He Leu Glu Asp Ala Phe Lys Gin Ser Asn Glu Pro
165 170 175
Val He He Glu Pro Phe Leu Glu Gly Phe Glu Leu Ser Val Thr Ala
180 185 190
Leu He Ala Asn Asp Asp Phe He Leu Leu Pro Phe Cys Gin Asn Tyr
195 200 205
Lys Arg Leu Leu Glu Gly Asp Asn Gly Val Asn Thr Gly Gly Met Gly
210 215 220
Ala He Ala Pro Ala Asn Phe Phe Ser Asn Glu Leu Glu Glu Lys He 225 230 235 240
Lys Asn His He Phe Lys Pro Thr Leu Glu Lys Leu Gin Ala Asp Asn
245 250 255
Thr Pro Phe Lys Gly Val Leu Leu Ala Glu He Val He He Glu Glu
260 265 270
Lys Gly Val Leu Glu Pro Tyr Leu Leu Asp Phe Ser Val Arg Phe Lys
275 280 285
Asp He Glu Cys Gin Thr He Leu Pro Leu Leu Glu Ser Ser Leu Leu
290 295 300
Asp Leu Cys Leu Ala Thr Ala Lys Gly Glu Leu His Ser Leu Glu Leu 305 310 315 320
Val Phe Ser Lys Glu Phe Val Met Ser Val Ala Leu Val Ser Arg Asn
325 330 335
Tyr Pro Thr Ser Ser Ser Pro Lys Gin Thr Leu Tyr He Asp Pro Val
340 345 350
Asp Glu Lys Lys Gly His Leu He Leu Gly Glu Val Glu Gin Asp Asn
355 360 365
Gly Val Phe Glu Ser Ser Gly Gly Arg Val He Phe Ala He Gly Arg
370 375 380
Gly Lys Ser Leu Leu Glu Ala Arg Asn His Ala Tyr Glu He Ala Gin 385 390 395 400
Lys Val His Phe Glu Gly Met Phe Tyr Arg Lys Asp He Gly Phe Lys
405 410 415
Val Leu Asp Leu Lys Glu Tyr Ser 420
(2) INFORMATION FOR SEQ ID NO: 1021:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 827 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 17...769 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1021:
TTAAAATGAA GTGAAA ATG AGA GAA ATA AAT ATG ATT TTA TAC ATT CAT ATC 52
Met Arg Glu He Asn Met He Leu Tyr He His He 1 5 10
CCC TTT TGT GAA AAT AAA TGC GGC TAT TGC GCT TTC AAT TCC TAT GAA 100 Pro Phe Cys Glu Asn Lys Cys Gly Tyr Cys Ala Phe Asn Ser Tyr Glu 15 20 25
AAC AAG CAT GGG TTA AAA GAA GAA TAC ACT CAA GCG TTA TGC CTG GAT 148 Asn Lys His Gly Leu Lys Glu Glu Tyr Thr Gin Ala Leu Cys Leu Asp 30 35 40
TTA AAG CAT GCG TTA AGT CAA ACT GAC GAA CCA ATT GAA AGC GTT TTT 196 Leu Lys His Ala Leu Ser Gin Thr Asp Glu Pro He Glu Ser Val Phe 45 50 55 60
ATT GGT GGC GGC ACG CCT AAC ACT TTA AGC GTG AAG GCT TTT GAA AGG 244 He Gly Gly Gly Thr Pro Asn Thr Leu Ser Val Lys Ala Phe Glu Arg 65 70 75
ATT TTT GAA AGC ATT TAT CAA CAT GCG AGC TTG AGC TTG GAT TGT GAG 292 He Phe Glu Ser He Tyr Gin His Ala Ser Leu Ser Leu Asp Cys Glu 80 85 90
ATC ACC ACT GAA GCT AAC CCC GAA TTG ATT ACT AAA GCT TGG TGT CAA 340 He Thr Thr Glu Ala Asn Pro Glu Leu He Thr Lys Ala Trp Cys Gin 95 100 105
GGC TTA AAA GGT TTA GGG ATC AAC CGC TTG AGT TTA GGG GTG CAA AGT 388 Gly Leu Lys Gly Leu Gly He Asn Arg Leu Ser Leu Gly Val Gin Ser 110 115 120
TTT AGG GAA GAT AAA TTA TTG TTT TTA GAG CGC CAA CAT TCC AAA AAT 436 Phe Arg Glu Asp Lys Leu Leu Phe Leu Glu Arg Gin His Ser Lys Asn 125 130 135 140
ATC GCT CCT GCG ATA GAA ACT ATT TTA AAA AGC GGG ATT GAA AAT ATC 484 He Ala Pro Ala He Glu Thr He Leu Lys Ser Gly He Glu Asn He 145 150 155
AGC ATT GAT TTG ATT TAT AAC ACC CCA TTA GAC AAT GAA AAC TCT CTA 532 Ser He Asp Leu He Tyr Asn Thr Pro Leu Asp Asn Glu Asn Ser Leu 160 165 170
AAA GAA GAA TTA AAA CTC GCT AAA GAA CTC CCT ATC AAC CAC TTG AGC 580 Lys Glu Glu Leu Lys Leu Ala Lys Glu Leu Pro He Asn His Leu Ser 175 180 185
GCT TAC GCT TTG AGC GTT GAA AAA AAC ACG AAT TTA GAA AAA AAC GCC 628 Ala Tyr Ala Leu Ser Val Glu Lys Asn Thr Asn Leu Glu Lys Asn Ala 190 195 200
AAA AAA CCC TCA TGC GCT CAT TTT GAC AAT GTG GTG AGA GAG ATT TTA 676 Lys Lys Pro Ser Cys Ala His Phe Asp Asn Val Val Arg Glu He Leu 205 210 215 220
GAG GGC TTT TCT TTC AAG CAA TAC GAG TGT CTA ATT ACG CTA GAA ATT 724 Glu Gly Phe Ser Phe Lys Gin Tyr Glu Cys Leu He Thr Leu Glu He 225 230 235
ATC AAG TCA AAC ACA ACT TGG CTT ACT GGG GGG CTA AAG ATT ATT TAGGG 774 He Lys Ser Asn Thr Thr Trp Leu Thr Gly Gly Leu Lys He He 240 245 250
TGCGGGGCTG GGGCTGTGGG CTGCGTGGCG AATGAGCGCT TTTTTGCAAA AAA 827
(2) INFORMATION FOR SEQ ID NO: 1022:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 251 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1022:
Met Arg Glu He Asn Met He Leu Tyr He His He Pro Phe Cys Glu
1 5 10 15
Asn Lys Cys Gly Tyr Cys Ala Phe Asn Ser Tyr Glu Asn Lys His Gly
20 25 30
Leu Lys Glu Glu Tyr Thr Gin Ala Leu Cys Leu Asp Leu Lys His Ala
35 40 45
Leu Ser Gin Thr Asp Glu Pro He Glu Ser Val Phe He Gly Gly Gly
50 55 60
Thr Pro Asn Thr Leu Ser Val Lys Ala Phe Glu Arg He Phe Glu Ser 65 70 75 80
He Tyr Gin His Ala Ser Leu Ser Leu Asp Cys Glu He Thr Thr Glu
85 90 95
Ala Asn Pro Glu Leu He Thr Lys Ala Trp Cys Gin Gly Leu Lys Gly
100 105 110
Leu Gly He Asn Arg Leu Ser Leu Gly Val Gin Ser Phe Arg Glu Asp
115 120 125
Lys Leu Leu Phe Leu Glu Arg Gin His Ser Lys Asn He Ala Pro Ala
130 135 140
He Glu Thr He Leu Lys Ser Gly He Glu Asn He Ser He Asp Leu 145 150 155 160
He Tyr Asn Thr Pro Leu Asp Asn Glu Asn Ser Leu Lys Glu Glu Leu
165 170 175
Lys Leu Ala Lys Glu Leu Pro He Asn His Leu Ser Ala Tyr Ala Leu
180 185 190
Ser Val Glu Lys Asn Thr Asn Leu Glu Lys Asn Ala Lys Lys Pro Ser 195 200 205 Cys Ala His Phe Asp Asn Val Val Arg Glu He Leu Glu Gly Phe Ser
210 215 220
Phe Lys Gin Tyr Glu Cys Leu He Thr Leu Glu He He Lys Ser Asn
225 230 235 240
Thr Thr Trp Leu Thr Gly Gly Leu Lys He He
245 250
(2) INFORMATION FOR SEQ ID NO: 1023:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1291 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 76...1257 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1023:
GCCAAGTCAT TGCTTATTTC AAAAGAGAGG GGTATTTATA GGGTGTTAAT CGTTCAAAAA 60 TACGGCGGCA CGAGC ATG GGC AGC ATA GAA AGG ATC CAC AAT GTC GCT CAA 111 Met Gly Ser He Glu Arg He His Asn Val Ala Gin 1 5 10
AGG GTT TTA GAA AGC GTT ACA TTA GGG CAT CAA GTC GTG GTG GTG GTT 159 Arg Val Leu Glu Ser Val Thr Leu Gly His Gin Val Val Val Val Val 15 20 25
TCA GCG ATG AGC GGC GAA ACC GAC AGG CTT TTA GAA TTT GGC AAG AAT 207 Ser Ala Met Ser Gly Glu Thr Asp Arg Leu Leu Glu Phe Gly Lys Asn 30 35 40
TTT AGC CAT AAC CCT AAC AAG CGA GAG ATG GAC AGG ATT GTA AGC GTG 255 Phe Ser His Asn Pro Asn Lys Arg Glu Met Asp Arg He Val Ser Val 45 50 55 60
GGG GAA TTG GTT TCA AGT GCG GCT TTG AGC ATG GCG TTA GAA AGG TAT 303 Gly Glu Leu Val Ser Ser Ala Ala Leu Ser Met Ala Leu Glu Arg Tyr 65 70 75
GGG CAT AGA GCC ATT TCC TTG AGC GGG AAA GAA GCG GGC ATT TTA ACC 351 Gly His Arg Ala He Ser Leu Ser Gly Lys Glu Ala Gly He Leu Thr 80 85 90
AGC TCG CAT TTT CAA AAC GCC GTG ATC CAA TCC ATT GAC ACC AAA CGC 399 Ser Ser His Phe Gin Asn Ala Val He Gin Ser He Asp Thr Lys Arg 95 100 105 ATC ACA GAG CTT TTA GAA AAA AAC TAC ATT GTG GTG ATC GCT GGG TTT 447 He Thr Glu Leu Leu Glu Lys Asn Tyr He Val Val He Ala Gly Phe 110 115 120
CAA GGC GCT GAT ATT CAA GGT GAA ACA ACG ACT TTA GGG CGT GGG GGG 495 Gin Gly Ala Asp He Gin Gly Glu Thr Thr Thr Leu Gly Arg Gly Gly 125 130 135 140
AGC GAT TTG AGC GCG GTT GCT TTG GCC GGG GCT TTA AAA GCG CAT TTG 543 Ser Asp Leu Ser Ala Val Ala Leu Ala Gly Ala Leu Lys Ala His Leu 145 150 155
TGC GAA ATC TAT ACG GAT GTG GAT GGC GTT TAT ACC ACC GAT CCG CGC 591 Cys Glu He Tyr Thr Asp Val Asp Gly Val Tyr Thr Thr Asp Pro Arg 160 165 170
ATT GAA GAA AAG GCT CAA AAA ATC GCG CAA ATC AGC TAT GAT GAA ATG 639 He Glu Glu Lys Ala Gin Lys He Ala Gin He Ser Tyr Asp Glu Met 175 180 185
CTT GAA CTG GCT TCT ATG GGG GCT AAA GTT TTA TTA AAC CGC TCG GTG 687 Leu Glu Leu Ala Ser Met Gly Ala Lys Val Leu Leu Asn Arg Ser Val 190 195 200
GAA TTA GCC AAA AAG CTC AGC GTG AAG TTA GTG ACT CGC AAT TCG TTT 735 Glu Leu Ala Lys Lys Leu Ser Val Lys Leu Val Thr Arg Asn Ser Phe 205 210 215 220
AAC CAT AGC GAA GGC ACG CTC ATT GTG GCT GAA AAA GAC TTT AAA GGA 783 Asn His Ser Glu Gly Thr Leu He Val Ala Glu Lys Asp Phe Lys Gly 225 230 235
GAA CGC ATG GAA ACC CCT ATA GTG AGT GGG ATC GCA TTG GAT AAA AAT 831 Glu Arg Met Glu Thr Pro He Val Ser Gly He Ala Leu Asp Lys Asn 240 245 250
CAG GCT CGT GTG AGC ATG GAG GGC GTG GAA GAT CGG CCA GGC ATT GCC 879 Gin Ala Arg Val Ser Met Glu Gly Val Glu Asp Arg Pro Gly He Ala 255 260 265
GCT GAA ATC TTT GGC GCT TTA GCG GAG TAT CGC ATT AAC GTG GAT ATG 927 Ala Glu He Phe Gly Ala Leu Ala Glu Tyr Arg He Asn Val Asp Met 270 275 280
ATC GTC CAA ACG ATC GGC AGA GAC GGC AAA ACC GAT TTG GAT TTT ACG 975 He Val Gin Thr He Gly Arg Asp Gly Lys Thr Asp Leu Asp Phe Thr 285 290 295 300
ATC GTT AAA ACC CAA ATA GAA GAA ACC AAG CAA GCC TTA AAG CCT TTT 1023 He Val Lys Thr Gin He Glu Glu Thr Lys Gin Ala Leu Lys Pro Phe 305 310 315
TTA GCG CAA ATG GAT TCC ATT GAT TAT GAT GAA AAT ATC GCT AAA GTC 1071 Leu Ala Gin Met Asp Ser He Asp Tyr Asp Glu Asn He Ala Lys Val 320 325 330 TCC ATA GTG GGC GTG GGC ATG AAG TCG CAT TCT GGG GTG GCG AGT ATC 1119 Ser He Val Gly Val Gly Met Lys Ser His Ser Gly Val Ala Ser He 335 340 345
GCT TTT AAA GCC CTA GCC AAA GAC AAT ATC AAT ATC ATG ATG ATT TCT 1167 Ala Phe Lys Ala Leu Ala Lys Asp Asn He Asn He Met Met He Ser 350 355 360
ACA AGC GAG ATT AAA ATT TCG GTT TTG ATT GAC ATT AAA TAC GCT GAA 1215 Thr Ser Glu He Lys He Ser Val Leu He Asp He Lys Tyr Ala Glu 365 370 375 380
TTA GCT GTT AGA ACT TTG CAT GCG GTG TAT CAA TTA GAT CAA TGAAAAATT 1266 Leu Ala Val Arg Thr Leu His Ala Val Tyr Gin Leu Asp Gin 385 390
TCTACGATTG GATCAAGGAA TTTGT 1291
(2) INFORMATION FOR SEQ ID NO: 1024:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 394 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1024:
Met Gly Ser He Glu Arg He His Asn Val Ala Gin Arg Val Leu Glu
1 5 10 15
Ser Val Thr Leu Gly His Gin Val Val Val Val Val Ser Ala Met Ser
20 25 30
Gly Glu Thr Asp Arg Leu Leu Glu Phe Gly Lys Asn Phe Ser His Asn
35 40 45
Pro Asn Lys Arg Glu Met Asp Arg He Val Ser Val Gly Glu Leu Val
50 55 60
Ser Ser Ala Ala Leu Ser Met Ala Leu Glu Arg Tyr Gly His Arg Ala 65 70 75 80
He Ser Leu Ser Gly Lys Glu Ala Gly He Leu Thr Ser Ser His Phe
85 90 95
Gin Asn Ala Val He Gin Ser He Asp Thr Lys Arg He Thr Glu Leu
100 105 110
Leu Glu Lys Asn Tyr He Val Val He Ala Gly Phe Gin Gly Ala Asp
115 120 125
He Gin Gly Glu Thr Thr Thr Leu Gly Arg Gly Gly Ser Asp Leu Ser
130 135 140
Ala Val Ala Leu Ala Gly Ala Leu Lys Ala His Leu Cys Glu He Tyr 145 150 155 160
Thr Asp Val Asp Gly Val Tyr Thr Thr Asp Pro Arg He Glu Glu Lys
165 170 175
Ala Gin Lys He Ala Gin He Ser Tyr Asp Glu Met Leu Glu Leu Ala
180 185 190
Ser Met Gly Ala Lys Val Leu Leu Asn Arg Ser Val Glu Leu Ala Lys 195 200 205
Lys Leu Ser Val Lys Leu Val Thr Arg Asn Ser Phe Asn His Ser Glu
210 215 220
Gly Thr Leu He Val Ala Glu Lys Asp Phe Lys Gly Glu Arg Met Glu 225 230 235 240
Thr Pro He Val Ser Gly He Ala Leu Asp Lys Asn Gin Ala Arg Val
245 250 255
Ser Met Glu Gly Val Glu Asp Arg Pro Gly He Ala Ala Glu He Phe
260 265 270
Gly Ala Leu Ala Glu Tyr Arg He Asn Val Asp Met He Val Gin Thr
275 280 285
He Gly Arg Asp Gly Lys Thr Asp Leu Asp Phe Thr He Val Lys Thr
290 295 300
Gin He Glu Glu Thr Lys Gin Ala Leu Lys Pro Phe Leu Ala Gin Met 305 310 315 320
Asp Ser He Asp Tyr Asp Glu Asn He Ala Lys Val Ser He Val Gly
325 330 335
Val Gly Met Lys Ser His Ser Gly Val Ala Ser He Ala Phe Lys Ala
340 345 350
Leu Ala Lys Asp Asn He Asn He Met Met He Ser Thr Ser Glu He
355 360 365
Lys He Ser Val Leu He Asp He Lys Tyr Ala Glu Leu Ala Val Arg
370 375 380
Thr Leu His Ala Val Tyr Gin Leu Asp Gin 385 390
(2) INFORMATION FOR SEQ ID NO: 1025:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 706 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...663 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1025:
GGT TAC TCT GTG AAA AAC TCC AAC CGC CTT ATT TAT ACG GAC AAT CTT 48
Gly Tyr Ser Val Lys Asn Ser Asn Arg Leu He Tyr Thr Asp Asn Leu 1 5 10 15
GAA GAG AGC CTA GAA GAG ACT GCA AGC CTT TTT GAA CAC CAC ATT AAA 96
Glu Glu Ser Leu Glu Glu Thr Ala Ser Leu Phe Glu His His He Lys
20 25 30
TTC TAC ACG GAG ATT ATT GAA AAA GAC AAA AAG GTG ATC AAA ACT TTC 144
Phe Tyr Thr Glu He He Glu Lys Asp Lys Lys Val He Lys Thr Phe 35 40 45
AAC AAG GAT TTT AAA ATA GAG CAT GCC AAA GAA GTC ATT TCC AAA GCT 192 Asn Lys Asp Phe Lys He Glu His Ala Lys Glu Val He Ser Lys Ala 50 55 60
CAC CTA AAA CAC AGC GAA TTA AAC GCT TTT TTA ATC GCC GCT CCT AGT 240 His Leu Lys His Ser Glu Leu Asn Ala Phe Leu He Ala Ala Pro Ser 65 70 75 80
TAT GGT ATA GAA GCC CAA AAC GCG CTT TTA AAA ATC TTA GAA GAA CCC 288 Tyr Gly He Glu Ala Gin Asn Ala Leu Leu Lys He Leu Glu Glu Pro 85 90 95
CCG AAT AAC GTT TGT TTT ATC ATG TTC GCT AAA AGC CAA AAC CAT GTG 336 Pro Asn Asn Val Cys Phe He Met Phe Ala Lys Ser Gin Asn His Val 100 105 110
TTA GCC ACC ATT AAA TCC CGC CTA ATT AAA GAA GAC AAA CGC CAA AAA 384 Leu Ala Thr He Lys Ser Arg Leu He Lys Glu Asp Lys Arg Gin Lys 115 120 125
ATC CCC CTA AAA CCT TTA GAT TTG GAT TTA TCC AAG CTG GAT TTG AAA 432 He Pro Leu Lys Pro Leu Asp Leu Asp Leu Ser Lys Leu Asp Leu Lys 130 135 140
GAC ATT TAT GCG TTT TTA AAA AAT TTA GAC AAA GAA AAT TTT GAT TCC 480 Asp He Tyr Ala Phe Leu Lys Asn Leu Asp Lys Glu Asn Phe Asp Ser 145 150 155 160
AGA GAA AAT CAG AGG GAA AGG ATT GAA AGC CTG TTA GAG AGC GTT AAC 528 Arg Glu Asn Gin Arg Glu Arg He Glu Ser Leu Leu Glu Ser Val Asn 165 170 175
AGG CAT AAG ATC CCC TTA AAC GAG CAA GAA TTG CAA GCC TTT GAT TTA 576 Arg His Lys He Pro Leu Asn Glu Gin Glu Leu Gin Ala Phe Asp Leu 180 185 190
GCG ATC AAG GCT AAC AGC TCT TAT TAC AAG CTC AGC TAT AAT CTT TTA 624 Ala He Lys Ala Asn Ser Ser Tyr Tyr Lys Leu Ser Tyr Asn Leu Leu 195 200 205
CCC CTG CTT TTA AGC CTT TTA TCC AAA AAG AAA ACG CCA TGATTGTAAA AC 675 Pro Leu Leu Leu Ser Leu Leu Ser Lys Lys Lys Thr Pro 210 215 220
GCCTTAACCC TGATGCGCTC AAAAACGCTC T 706
(2) INFORMATION FOR SEQ ID NO: 1026:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 221 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1026:
Gly Tyr Ser Val Lys Asn Ser Asn Arg Leu He Tyr Thr Asp Asn Leu
1 5 10 15
Glu Glu Ser Leu Glu Glu Thr Ala Ser Leu Phe Glu His His He Lys
20 25 30
Phe Tyr Thr Glu He He Glu Lys Asp Lys Lys Val He Lys Thr Phe
35 40 45
Asn Lys Asp Phe Lys He Glu His Ala Lys Glu Val He Ser Lys Ala
50 55 60
His Leu Lys His Ser Glu Leu Asn Ala Phe Leu He Ala Ala Pro Ser 65 70 75 80
Tyr Gly He Glu Ala Gin Asn Ala Leu Leu Lys He Leu Glu Glu Pro
85 90 95
Pro Asn Asn Val Cys Phe He Met Phe Ala Lys Ser Gin Asn His Val
100 105 110
Leu Ala Thr He Lys Ser Arg Leu He Lys Glu Asp Lys Arg Gin Lys
115 120 125
He Pro Leu Lys Pro Leu Asp Leu Asp Leu Ser Lys Leu Asp Leu Lys
130 135 140
Asp He Tyr Ala Phe Leu Lys Asn Leu Asp Lys Glu Asn Phe Asp Ser 145 150 155 160
Arg Glu Asn Gin Arg Glu Arg He Glu Ser Leu Leu Glu Ser Val Asn
165 170 175
Arg His Lys He Pro Leu Asn Glu Gin Glu Leu Gin Ala Phe Asp Leu
180 185 190
Ala He Lys Ala Asn Ser Ser Tyr Tyr Lys Leu Ser Tyr Asn Leu Leu
195 200 205
Pro Leu Leu Leu Ser Leu Leu Ser Lys Lys Lys Thr Pro 210 215 220
(2) INFORMATION FOR SEQ ID NO: 1027:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1102 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 58...1059 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1027:
TATTCTCTCG CAATAATTGT TATTGTTATT GCGACAAAAC TTTTAGAAGG AGTTATT ATG 60
Met
1 GGA AGT ATC GGT AGT ATG GGC AAA CCT ATT GAA GGG TTT TTA GTG GCA 108 Gly Ser He Gly Ser Met Gly Lys Pro He Glu Gly Phe Leu Val Ala 5 10 15
GCC ATT CAG TTT CCT GTG CCA ATT GTC AAT AGC CGT AAG GAT ATT GAT 156 Ala He Gin Phe Pro Val Pro He Val Asn Ser Arg Lys Asp He Asp 20 25 30
CAC AAT ATT GAA AGC ATT ATT AGA ACC TTG CAT GCG ACT AAA GCG GGG 204 His Asn He Glu Ser He He Arg Thr Leu His Ala Thr Lys Ala Gly 35 40 45
TAT CCG GGA GTG GAG CTT ATC ATT TTC CCT GAG TAT AGC ACG CAA GGT 252 Tyr Pro Gly Val Glu Leu He He Phe Pro Glu Tyr Ser Thr Gin Gly 50 55 60 65
TTG AAT ACC GCT AAG TGG CTT AGC GAA GAG TTT TTA TTA GAT GTC CCG 300 Leu Asn Thr Ala Lys Trp Leu Ser Glu Glu Phe Leu Leu Asp Val Pro 70 75 80
GGT AAA GAG ACA GAG CTA TAC GCT AAG GCG TGT AAA GAG GCG AAA GTT 348 Gly Lys Glu Thr Glu Leu Tyr Ala Lys Ala Cys Lys Glu Ala Lys Val 85 90 95
TAT GGT GTT TTT TCA ATC ATG GAA CGC AAT CCT GAT TCT AAC AAA AAC 396 Tyr Gly Val Phe Ser He Met Glu Arg Asn Pro Asp Ser Asn Lys Asn 100 105 110
CCC TAC AAC ACC GCC ATT ATC ATT GAT CCG CAA GGT GAA ATC ATT TTA 444 Pro Tyr Asn Thr Ala He He He Asp Pro Gin Gly Glu He He Leu 115 120 125
AAA TAC CGC AAG CTA TTC CCA TGG AAT CCC ATT GAG CCA TGG TAT CCT 492 Lys Tyr Arg Lys Leu Phe Pro Trp Asn Pro He Glu Pro Trp Tyr Pro 130 135 140 145
GGG GAT TTA GGA ATG CCT GTG TGC GAG GGT CCG GGC GGA TCA AAA TTA 540 Gly Asp Leu Gly Met Pro Val Cys Glu Gly Pro Gly Gly Ser Lys Leu 150 155 160
GCC GTG TGC ATT TGC CAT GAC GGC ATG ATT CCA GAG CTC GCC AGA GAA 588 Ala Val Cys He Cys His Asp Gly Met He Pro Glu Leu Ala Arg Glu 165 170 175
GCG GCC TAT AAA GGG TGC AAT GTG TAT ATC CGC ATT TCA GGC TAT AGC 636 Ala Ala Tyr Lys Gly Cys Asn Val Tyr He Arg He Ser Gly Tyr Ser 180 185 190
ACT CAA GTC AAT GAT CAA TGG ATT TTG ACC AAC CGC TCC AAC GCA TGG 684 Thr Gin Val Asn Asp Gin Trp He Leu Thr Asn Arg Ser Asn Ala Trp 195 200 205
CAC AAT TTG ATG TAT ACC GTG AGC GTG AAT TTA GCC GGC TAT GAT AAT 732 His Asn Leu Met Tyr Thr Val Ser Val Asn Leu Ala Gly Tyr Asp Asn 210 215 220 225 GTC TTT TAC TAC TTT GGT GAG GGG CAA ATC TGT AAC TTT GAT GGC ACG 780 Val Phe Tyr Tyr Phe Gly Glu Gly Gin He Cys Asn Phe Asp Gly Thr 230 235 240
ACT CTT GTT CAA GGG CAC CGC AAC CCT TGG GAG ATT GTA ACC GGG GAA 828 Thr Leu Val Gin Gly His Arg Asn Pro Trp Glu He Val Thr Gly Glu 245 250 255
ATC TAT CCC AAA ATG GCA GAC AAC GCT CGC TTA AGC TGG GGA TTA GAA 876 He Tyr Pro Lys Met Ala Asp Asn Ala Arg Leu Ser Trp Gly Leu Glu 260 265 270
AAC AAC ATT TAC AAC CTA GGC CAT AGA GGG TAT GTG GCT AAA CCG GGC 924 Asn Asn He Tyr Asn Leu Gly His Arg Gly Tyr Val Ala Lys Pro Gly 275 280 285
GGA GAA CAT GAC GCA GGC TTA ACC TAT ATC AAA GAC TTA GCG GCC GGT 972 Gly Glu His Asp Ala Gly Leu Thr Tyr He Lys Asp Leu Ala Ala Gly 290 295 300 305
AAA TAC AAA TTG CCT TGG GAA GAT CAC ATG AAA ATC AAA GAC GGC TCT 1020 Lys Tyr Lys Leu Pro Trp Glu Asp His Met Lys He Lys Asp Gly Ser 310 315 320
ATT TAT GGC TAC CCT ACC ACC GGT GGG CGT TTT GGG AAA TAATCCCTAA CC 1071 He Tyr Gly Tyr Pro Thr Thr Gly Gly Arg Phe Gly Lys 325 330
TTGCATTTTT GCTAGAACCC GTTTTTAAGG G 1102
(2) INFORMATION FOR SEQ ID NO: 1028:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 334 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1028:
Met Gly Ser He Gly Ser Met Gly Lys Pro He Glu Gly Phe Leu Val
1 5 10 15
Ala Ala He Gin Phe Pro Val Pro He Val Asn Ser Arg Lys Asp He
20 25 30
Asp His Asn He Glu Ser He He Arg Thr Leu His Ala Thr Lys Ala
35 40 45
Gly Tyr Pro Gly Val Glu Leu He He Phe Pro Glu Tyr Ser Thr Gin
50 55 60
Gly Leu Asn Thr Ala Lys Trp Leu Ser Glu Glu Phe Leu Leu Asp Val
65 70 75 80
Pro Gly Lys Glu Thr Glu Leu Tyr Ala Lys Ala Cys Lys Glu Ala Lys
85 90 95
Val Tyr Gly Val Phe Ser He Met Glu Arg Asn Pro Asp Ser Asn Lys 100 105 110
Asn Pro Tyr Asn Thr Ala He He He Asp Pro Gin Gly Glu He He
115 120 125
Leu Lys Tyr Arg Lys Leu Phe Pro Trp Asn Pro He Glu Pro Trp Tyr
130 135 140
Pro Gly Asp Leu Gly Met Pro Val Cys Glu Gly Pro Gly Gly Ser Lys 145 150 155 160
Leu Ala Val Cys He Cys His Asp Gly Met He Pro Glu Leu Ala Arg
165 170 175
Glu Ala Ala Tyr Lys Gly Cys Asn Val Tyr He Arg He Ser Gly Tyr
180 185 190
Ser Thr Gin Val Asn Asp Gin Trp He Leu Thr Asn Arg Ser Asn Ala
195 200 205
Trp His Asn Leu Met Tyr Thr Val Ser Val Asn Leu Ala Gly Tyr Asp
210 215 220
Asn Val Phe Tyr Tyr Phe Gly Glu Gly Gin He Cys Asn Phe Asp Gly 225 230 235 240
Thr Thr Leu Val Gin Gly His Arg Asn Pro Trp Glu He Val Thr Gly
245 250 255
Glu He Tyr Pro Lys Met Ala Asp Asn Ala Arg Leu Ser Trp Gly Leu
260 265 270
Glu Asn Asn He Tyr Asn Leu Gly His Arg Gly Tyr Val Ala Lys Pro
275 280 285
Gly Gly Glu His Asp Ala Gly Leu Thr Tyr He Lys Asp Leu Ala Ala
290 295 300
Gly Lys Tyr Lys Leu Pro Trp Glu Asp His Met Lys He Lys Asp Gly 305 310 315 320
Ser He Tyr Gly Tyr Pro Thr Thr Gly Gly Arg Phe Gly Lys 325 330
(2) INFORMATION FOR SEQ ID NO: 1029:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1152 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...1095 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1029:
TAGCTATGGA TTTTCGCCGT ATTTGTGGTG GATAAAAGAA AAGGATCTTC A ATG ATT 57
Met He
1
GCT TAC ATT CTC AAA CGC TTG CTT TTG ATT ATC CCT ACT TTA TTA GCT 105 Ala Tyr He Leu Lys Arg Leu Leu Leu He He Pro Thr Leu Leu Ala 5 10 15
ATC ATG ACC ATT AAT TTC TTT TTG ATC CAA TCG GCT CCT GGA GGC CCT 153 He Met Thr He Asn Phe Phe Leu He Gin Ser Ala Pro Gly Gly Pro 20 25 30
ATA GAG CAG ATG ATG GCT AAA ATC AAT AAC ACG CAG TCC AAA GAG ATT 201 He Glu Gin Met Met Ala Lys He Asn Asn Thr Gin Ser Lys Glu He 35 40 45 50
CAA GGC GTT GTT AAA GAG CGT TCG TAT AGG GCG TCT CAA GGG TTG GAG 249 Gin Gly Val Val Lys Glu Arg Ser Tyr Arg Ala Ser Gin Gly Leu Glu 55 60 65
AGC GAT TTG TTA GAA AAT TTA AAA AAA CTC TAT GGT TTT GAC AAG CCC 297 Ser Asp Leu Leu Glu Asn Leu Lys Lys Leu Tyr Gly Phe Asp Lys Pro 70 75 80
ATA GGG GAG CGC TAC CTT CTC ATG CTC AAA AAA TAT CTG CAA TTT GAT 345 He Gly Glu Arg Tyr Leu Leu Met Leu Lys Lys Tyr Leu Gin Phe Asp 85 90 95
TTT GGG GAG AGC TTT TAT CGC CAG ATT AAA GTG ATA GAT TTG ATT AAG 393 Phe Gly Glu Ser Phe Tyr Arg Gin He Lys Val He Asp Leu He Lys 100 105 110
GAA AAA TTG CCC GTA TCC ATT TCG TTA GGG CTT TTT AGC ACG CTT TTG 441 Glu Lys Leu Pro Val Ser He Ser Leu Gly Leu Phe Ser Thr Leu Leu 115 120 125 130
ATT TAT CTT ATT TCT ATC CCT TTA GGG ATT TTC AAG GCC AAA CGC AAT 489 He Tyr Leu He Ser He Pro Leu Gly He Phe Lys Ala Lys Arg Asn 135 140 145
AAC GAG CCT TTA GAC GTG TTA AGC AGC GTG GTG ATC ATT GTC GCT AAC 537 Asn Glu Pro Leu Asp Val Leu Ser Ser Val Val He He Val Ala Asn 150 155 160
GCT ATC CCG GCC TTT TTG TTT GCG GTG GTG TTG ATC GTG TTT TTT GCT 585 Ala He Pro Ala Phe Leu Phe Ala Val Val Leu He Val Phe Phe Ala 165 170 175
GGA GGG AAT TAT TGG CAT TGG TTC CCT TTA AAG GGG CTA GTG AGC GAT 633 Gly Gly Asn Tyr Trp His Trp Phe Pro Leu Lys Gly Leu Val Ser Asp 180 185 190
AAT TTT GAA AGT TTG AGC GCG TTA GGT AAA ATC AAG GAT TAT TTA TGG 681 Asn Phe Glu Ser Leu Ser Ala Leu Gly Lys He Lys Asp Tyr Leu Trp 195 200 205 210
CAT ATC ACT TTG CCC GTT CTT TGC ATT TCT TTA GGG GGT TTT GCA AGC 729 His He Thr Leu Pro Val Leu Cys He Ser Leu Gly Gly Phe Ala Ser 215 220 225
CTT ACG CTT TTA GTG AAA AAC TCT TTT TTA GAT GAA ATG GGC AAG CTC 777 Leu Thr Leu Leu Val Lys Asn Ser Phe Leu Asp Glu Met Gly Lys Leu 230 235 240
TAT GTA CTG AGC GCT AAG GCT AAG GGT TGT TCA GTG GGG CGT ATT TTT 825 Tyr Val Leu Ser Ala Lys Ala Lys Gly Cys Ser Val Gly Arg He Phe 245 250 255
TAT GCG CAT GTG TTC CGT AAT GCG ATT TTA TTA GTG GTG GCG GGT TTC 873 Tyr Ala His Val Phe Arg Asn Ala He Leu Leu Val Val Ala Gly Phe 260 265 270
CCG CAA GCT TTT TTG GGC ATG TTC TTT AGC TCA AGT TTG TTG ATA GAG 921 Pro Gin Ala Phe Leu Gly Met Phe Phe Ser Ser Ser Leu Leu He Glu 275 280 285 290
ATT GTT TTT AGC CTA GAC GGG TTA GGG CTT TTA GGG TAT GAA AGC ATT 969 He Val Phe Ser Leu Asp Gly Leu Gly Leu Leu Gly Tyr Glu Ser He 295 300 305
GTG AGT AGG GAT TAT CCC GTT GTG TTT GGT TCG CTT TAT ATT TTC ACG 1017 Val Ser Arg Asp Tyr Pro Val Val Phe Gly Ser Leu Tyr He Phe Thr 310 315 320
CTT TTA GGT TTG GTA GCG AGT TTG ATA AGC GAT TTG CTC TGT GTG GTG 1065 Leu Leu Gly Leu Val Ala Ser Leu lie Ser Asp Leu Leu Cys Val Val 325 330 335
ATT GAC CCT AGG ATT GAT TTT GAA AAG CGT TGAGGGTAGG AATGAAAACT GAG 1118 He Asp Pro Arg He Asp Phe Glu Lys Arg 340 345
ATGAAATCTT CTTTAAAACT TTTTATGCGG CCTT 1152
(2) INFORMATION FOR SEQ ID NO: 1030:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 348 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1030:
Met He Ala Tyr He Leu Lys Arg Leu Leu Leu He He Pro Thr Leu
1 5 10 15
Leu Ala He Met Thr He Asn Phe Phe Leu He Gin Ser Ala Pro Gly
20 25 30
Gly Pro He Glu Gin Met Met Ala Lys He Asn Asn Thr Gin Ser Lys
35 40 45
Glu He Gin Gly Val Val Lys Glu Arg Ser Tyr Arg Ala Ser Gin Gly
50 55 60
Leu Glu Ser Asp Leu Leu Glu Asn Leu Lys Lys Leu Tyr Gly Phe Asp 65 70 75 80 Lys Pro He Gly Glu Arg Tyr Leu Leu Met Leu Lys Lys Tyr Leu Gin
85 90 95
Phe Asp Phe Gly Glu Ser Phe Tyr Arg Gin He Lys Val He Asp Leu
100 105 110
He Lys Glu Lys Leu Pro Val Ser He Ser Leu Gly Leu Phe Ser Thr
115 120 125
Leu Leu He Tyr Leu He Ser He Pro Leu Gly He Phe Lys Ala Lys
130 135 140
Arg Asn Asn Glu Pro Leu Asp Val Leu Ser Ser Val Val He He Val 145 150 155 160
Ala Asn Ala He Pro Ala Phe Leu Phe Ala Val Val Leu He Val Phe
165 170 175
Phe Ala Gly Gly Asn Tyr Trp His Trp Phe Pro Leu Lys Gly Leu Val
180 185 190
Ser Asp Asn Phe Glu Ser Leu Ser Ala Leu Gly Lys He Lys Asp Tyr
195 200 205
Leu Trp His He Thr Leu Pro Val Leu Cys He Ser Leu Gly Gly Phe
210 215 220
Ala Ser Leu Thr Leu Leu Val Lys Asn Ser Phe Leu Asp Glu Met Gly 225 230 235 240
Lys Leu Tyr Val Leu Ser Ala Lys Ala Lys Gly Cys Ser Val Gly Arg
245 250 255
He Phe Tyr Ala His Val Phe Arg Asn Ala He Leu Leu Val Val Ala
260 265 270
Gly Phe Pro Gin Ala Phe Leu Gly Met Phe Phe Ser Ser Ser Leu Leu
275 280 285
He Glu He Val Phe Ser Leu Asp Gly Leu Gly Leu Leu Gly Tyr Glu
290 295 300
Ser He Val Ser Arg Asp Tyr Pro Val Val Phe Gly Ser Leu Tyr He 305 310 315 320
Phe Thr Leu Leu Gly Leu Val Ala Ser Leu He Ser Asp Leu Leu Cys
325 330 335
Val Val He Asp Pro Arg He Asp Phe Glu Lys Arg 340 345
(2) INFORMATION FOR SEQ ID NO: 1031:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 662 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...618 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1031: TTAAAGGTCT AAACC ATG GAT ATT AAG GCA TGT TAT CAA AAC GCT AAA GCG 51 Met Asp He Lys Ala Cys Tyr Gin Asn Ala Lys Ala 1 5 10
TTA TTA GAG GGG CAT TTC TTG CTC AGC AGT GGG TTT CAT TCC AAT TAT 99 Leu Leu Glu Gly His Phe Leu Leu Ser Ser Gly Phe His Ser Asn Tyr 15 20 25
TAT TTG CAA TCC GCT AAA GTT TTA GAA GAT CCC AAA CTA GCC GAA CAA 147 Tyr Leu Gin Ser Ala Lys Val Leu Glu Asp Pro Lys Leu Ala Glu Gin 30 35 40
TTA GCG CTA GAA TTA GCC AAA CAA ATC CAA GAA GCT CAT TTG AAT ATT 195 Leu Ala Leu Glu Leu Ala Lys Gin He Gin Glu Ala His Leu Asn He 45 50 55 60
GAA TGC GTG TGC TCA CCG GCT ATT GGG GGG ATT TTG GCT GGG TAT GAG 243 Glu Cys Val Cys Ser Pro Ala He Gly Gly He Leu Ala Gly Tyr Glu 65 70 75
CTT GCA AGG GCT TTG GGC GTG CGT TTT ATC TTC ACC GAA AGG GTG GAT 291 Leu Ala Arg Ala Leu Gly Val Arg Phe He Phe Thr Glu Arg Val Asp 80 85 90
AAT ACC ATG GCG TTA AGG CGT GGC TTT GAA GTC AAA AAA AAC GAA AAA 339 Asn Thr Met Ala Leu Arg Arg Gly Phe Glu Val Lys Lys Asn Glu Lys 95 100 105
ATT TTA GTG TGT GAG GAC ATT ATC ACT ACG GGA AAA TCC GCT ATG GAA 387 He Leu Val Cys Glu Asp He He Thr Thr Gly Lys Ser Ala Met Glu 110 115 120
TGC GCT AAA GTT TTA GAA GAA AAG GGT GCT CAA ATC GTG GCT TTT GGT 435 Cys Ala Lys Val Leu Glu Glu Lys Gly Ala Gin He Val Ala Phe Gly 125 130 135 140
GCT TTA GCT AAT CGG GGC ATT TGC AAG CGT GCT CAT TCT CAT TTA AAA 483 Ala Leu Ala Asn Arg Gly He Cys Lys Arg Ala His Ser His Leu Lys 145 150 155
GCC CAA GAG GGA GCG TGT TTG CCT AGC CAT TTG CCC CTT TTT GCT TTA 531 Ala Gin Glu Gly Ala Cys Leu Pro Ser His Leu Pro Leu Phe Ala Leu 160 165 170
GAA GAT TTT GTT TTT GAC ATG CAC AAG CCT AGT TCT TGC CCT TTA TGC 579 Glu Asp Phe Val Phe Asp Met His Lys Pro Ser Ser Cys Pro Leu Cys 175 180 185
GCT ACT AGC GTT GCT ATA AAG CCA GGA AGT CGT GGC AAC TAAAAAAACA AA 630 Ala Thr Ser Val Ala He Lys Pro Gly Ser Arg Gly Asn 190 195 200
AAAAAATAAA ACCCCAAAAA AAAAGCAAGC GT 662
(2) INFORMATION FOR SEQ ID NO: 1032: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 201 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1032:
Met Asp He Lys Ala Cys Tyr Gin Asn Ala Lys Ala Leu Leu Glu Gly
1 5 10 15
His Phe Leu Leu Ser Ser Gly Phe His Ser Asn Tyr Tyr Leu Gin Ser
20 25 30
Ala Lys Val Leu Glu Asp Pro Lys Leu Ala Glu Gin Leu Ala Leu Glu
35 40 45
Leu Ala Lys Gin He Gin Glu Ala His Leu Asn He Glu Cys Val Cys
50 55 60
Ser Pro Ala He Gly Gly He Leu Ala Gly Tyr Glu Leu Ala Arg Ala 65 70 75 80
Leu Gly Val Arg Phe He Phe Thr Glu Arg Val Asp Asn Thr Met Ala
85 90 95
Leu Arg Arg Gly Phe Glu Val Lys Lys Asn Glu Lys He Leu Val Cys
100 105 110
Glu Asp He He Thr Thr Gly Lys Ser Ala Met Glu Cys Ala Lys Val
115 120 125
Leu Glu Glu Lys Gly Ala Gin He Val Ala Phe Gly Ala Leu Ala Asn
130 135 140
Arg Gly He Cys Lys Arg Ala His Ser His Leu Lys Ala Gin Glu Gly 145 150 155 160
Ala Cys Leu Pro Ser His Leu Pro Leu Phe Ala Leu Glu Asp Phe Val
165 170 175
Phe Asp Met His Lys Pro Ser Ser Cys Pro Leu Cys Ala Thr Ser Val
180 185 190
Ala He Lys Pro Gly Ser Arg Gly Asn 195 200
(2) INFORMATION FOR SEQ ID NO: 1033:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 401 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 40...384 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1033: CTCTCTTTGC GCCTAAAGGC CTTTATCACC GATATTTTT ATG ATT TAT ACC CCC 54
Met He Tyr Thr Pro 1 5
ATG CTT TAT ATA ATG ACT TAT GCG ATT TTA GGG AGC GCG AAG GAT TTT 102 Met Leu Tyr He Met Thr Tyr Ala He Leu Gly Ser Ala Lys Asp Phe 10 15 20
AGG GAA AAC CAG AGC GCG ATT TTT TTA TGC CTG CTT TTT TAC GCC CTA 150 Arg Glu Asn Gin Ser Ala He Phe Leu Cys Leu Leu Phe Tyr Ala Leu 25 30 35
ACA CAC AGC TTT TTT ATC GCT TTT AAA TCC CAA AGC CCT GGC ATG CGT 198 Thr His Ser Phe Phe He Ala Phe Lys Ser Gin Ser Pro Gly Met Arg 40 45 50
TAC GCT CGG TTT AAA TTA ATC AAA AAT AAT GGC GAA AAA GTG GGC TTT 246 Tyr Ala Arg Phe Lys Leu He Lys Asn Asn Gly Glu Lys Val Gly Phe 55 60 65
TTT TTA GCT TTG TGG CGC TTT GTT TTG TGG GTG TTG AGC ATG GGG TTA 294 Phe Leu Ala Leu Trp Arg Phe Val Leu Trp Val Leu Ser Met Gly Leu 70 75 80 85
CTC ATA GGG TTT GTT ACG CCT TTT ATT TTT AAG TTT TTT TTG CAT GAC 342 Leu He Gly Phe Val Thr Pro Phe He Phe Lys Phe Phe Leu His Asp 90 95 100
AAA CTC AGC GGC ACT CAT ATT GAA ACC ATC AAG GAG GCA ACA TGAAAAATT 393 Lys Leu Ser Gly Thr His He Glu Thr He Lys Glu Ala Thr 105 110 115
TAGTAATC 401
(2) INFORMATION FOR SEQ ID NO: 1034:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 115 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1034:
Met He Tyr Thr Pro Met Leu Tyr He Met Thr Tyr Ala He Leu Gly
1 5 10 15
Ser Ala Lys Asp Phe Arg Glu Asn Gin Ser Ala He Phe Leu Cys Leu
20 25 30
Leu Phe Tyr Ala Leu Thr His Ser Phe Phe He Ala Phe Lys Ser Gin
35 40 45
Ser Pro Gly Met Arg Tyr Ala Arg Phe Lys Leu He Lys Asn Asn Gly
50 55 60
Glu Lys Val Gly Phe Phe Leu Ala Leu Trp Arg Phe Val Leu Trp Val 65 70 75 80
Leu Ser Met Gly Leu Leu He Gly Phe Val Thr Pro Phe He Phe Lys
85 90 95
Phe Phe Leu His Asp Lys Leu Ser Gly Thr His He Glu Thr He Lys
100 105 110
Glu Ala Thr 115
(2) INFORMATION FOR SEQ ID NO: 1035:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 717 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 53...667 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1035:
AGCGGGGCTG GTATTTCAGC AGAAAGCGGG ATTAAAACCT TTAGAGACGC TG ATG GCT 58
Met Ala
1
TGT GGG AAA GGG CAT GAC ATC ATG GAA GTT GCC TCG CCT TAT GGC TGG 106 Cys Gly Lys Gly His Asp He Met Glu Val Ala Ser Pro Tyr Gly Trp 5 10 15
AAA AAG AAC CCG CAA AAG GTG TTG GAT TTT TAC AAC CAA AGG CGC CGA 154 Lys Lys Asn Pro Gin Lys Val Leu Asp Phe Tyr Asn Gin Arg Arg Arg 20 25 30
CAG CTT TTT GAA GTT TAT CCT AAC AAA GCC CAT AAG GCT TTA GCG GAA 202 Gin Leu Phe Glu Val Tyr Pro Asn Lys Ala His Lys Ala Leu Ala Glu 35 40 45 50
TTG GAA AAA CAC TAT CAA GTC AAT ATC ATC ACC CAA AAT GTA GAT GAT 250 Leu Glu Lys His Tyr Gin Val Asn He He Thr Gin Asn Val Asp Asp 55 60 65
TTG CAT GAA AGA GCG GGT TCT TCT CGC ATT TTG CAC TTG CAT GGG GAA 298 Leu His Glu Arg Ala Gly Ser Ser Arg He Leu His Leu His Gly Glu 70 75 80
TTA TTG AGC GTT CGC AGC GAG AAA GAT CCT AAT TTA GTT TAT AGG TGG 346 Leu Leu Ser Val Arg Ser Glu Lys Asp Pro Asn Leu Val Tyr Arg Trp 85 90 95 GAA AAG GAC TTG AAT TTA GGC GAC TTG GCC AAA GAC AAA TCG CAA TTA 394 Glu Lys Asp Leu Asn Leu Gly Asp Leu Ala Lys Asp Lys Ser Gin Leu 100 105 110
CGC CCT GAT ATT GTG TGG TTT GGC GAA GCG GTG CCT TTG CTT AAA GAA 442 Arg Pro Asp He Val Trp Phe Gly Glu Ala Val Pro Leu Leu Lys Glu 115 120 125 130
GCG ATT TCT TTA GTC AAA CAA GCG CAT CTT TTA ATC ATC ATT GGC ACT 490 Ala He Ser Leu Val Lys Gin Ala His Leu Leu He He He Gly Thr 135 140 145
TCT TTG CAA GTC TAT CCC GCC GCT AGC CTC TAC ACG CAT GCG CAT AAA 538 Ser Leu Gin Val Tyr Pro Ala Ala Ser Leu Tyr Thr His Ala His Lys 150 155 160
GAC GCT CTC ATT TAT TAC ATT GAC CCT AAG GCT AAA AAC GCC CAT TTA 586 Asp Ala Leu He Tyr Tyr He Asp Pro Lys Ala Lys Asn Ala His Leu 165 170 175
CCC CAG AAT GTC CAA TGC ATT AAT GAA AGC GCG GTG CAT GCC ATG CAA 634 Pro Gin Asn Val Gin Cys He Asn Glu Ser Ala Val His Ala Met Gin 180 185 190
GAT TTA ATG CCC AAA CTC ATA GAA ATG GCT TCT TAAGAAATGT TAAAATAATT 687 Asp Leu Met Pro Lys Leu He Glu Met Ala Ser 195 200 205
TTTATTTTTT CAGCTAACGA TTAGCAAAAA 717
(2) INFORMATION FOR SEQ ID NO: 1036:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 205 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1036:
Met Ala Cys Gly Lys Gly His Asp He Met Glu Val Ala Ser Pro Tyr
1 5 10 15
Gly Trp Lys Lys Asn Pro Gin Lys Val Leu Asp Phe Tyr Asn Gin Arg
20 25 30
Arg Arg Gin Leu Phe Glu Val Tyr Pro Asn Lys Ala His Lys Ala Leu
35 40 45
Ala Glu Leu Glu Lys His Tyr Gin Val Asn He He Thr Gin Asn Val
50 55 60
Asp Asp Leu His Glu Arg Ala Gly Ser Ser Arg He Leu His Leu His 65 70 75 80
Gly Glu Leu Leu Ser Val Arg Ser Glu Lys Asp Pro Asn Leu Val Tyr
85 90 95
Arg Trp Glu Lys Asp Leu Asn Leu Gly Asp Leu Ala Lys Asp Lys Ser 100 105 110
Gin Leu Arg Pro Asp He Val Trp Phe Gly Glu Ala Val Pro Leu Leu
115 120 125
Lys Glu Ala He Ser Leu Val Lys Gin Ala His Leu Leu He He He
130 135 140
Gly Thr Ser Leu Gin Val Tyr Pro Ala Ala Ser Leu Tyr Thr His Ala 145 150 155 160
His Lys Asp Ala Leu He Tyr Tyr He Asp Pro Lys Ala Lys Asn Ala
165 170 175
His Leu Pro Gin Asn Val Gin Cys He Asn Glu Ser Ala Val His Ala
180 185 190
Met Gin Asp Leu Met Pro Lys Leu He Glu Met Ala Ser 195 200 205
(2) INFORMATION FOR SEQ ID NO: 1037:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 468 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 23...421 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1037:
ATTAAAGGAG TTTGAGAGTC TG ATG CAA CAA GCC ACA GAA GCA TTG AAT CAC 52
Met Gin Gin Ala Thr Glu Ala Leu Asn His 1 5 10
CCC TAT TTT GGC GTT TTT GTT TTA TTG GTA TTC ACC TTT TGG GTG TTT 100 Pro Tyr Phe Gly Val Phe Val Leu Leu Val Phe Thr Phe Trp Val Phe 15 20 25
AAC TTA ACC TTA AGG ATC CAA AGG TTT TTA AGC CGT AAA ATG GCT CAA 148 Asn Leu Thr Leu Arg He Gin Arg Phe Leu Ser Arg Lys Met Ala Gin 30 35 40
AAA AAG GGC GAA AAG CTC AAG CTC GCT CCC TAT GAA TGC GGG CCT GTG 196 Lys Lys Gly Glu Lys Leu Lys Leu Ala Pro Tyr Glu Cys Gly Pro Val 45 50 55
GCT CTC AAA CAG CCT AAT AGG GTG TCG CAC CAT TTC TAT ATC ATG GCC 244 Ala Leu Lys Gin Pro Asn Arg Val Ser His His Phe Tyr He Met Ala 60 65 70
ATG CTT TTT ATT TTA TTT GAT GTA GAA ATC GTT TTC ATG TTC CCT TGG 292 Met Leu Phe He Leu Phe Asp Val Glu He Val Phe Met Phe Pro Trp 75 80 85 90
GCG ATT GGT TTT AAA AAA TTA GGC TTG TTT GGA CTC GTT GAA ATG CTA 340 Ala He Gly Phe Lys Lys Leu Gly Leu Phe Gly Leu Val Glu Met Leu 95 100 105
GGC TTT GTC TTC TTT TTA ACC ATT GGT TTT ATT TAC GCT TTA AAG CGA 388 Gly Phe Val Phe Phe Leu Thr He Gly Phe He Tyr Ala Leu Lys Arg 110 115 120
AAC GCT TTG AGC TGG CAA AAA TTA GAG GTG AAA TAATGCAACA AGCACCGGTT 441 Asn Ala Leu Ser Trp Gin Lys Leu Glu Val Lys 125 130
GTTCTAAGCA CTTTGGATAA ATTATTG 468
(2) INFORMATION FOR SEQ ID NO: 1038:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 133 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1038:
Met Gin Gin Ala Thr Glu Ala Leu Asn His Pro Tyr Phe Gly Val Phe
1 5 10 15
Val Leu Leu Val Phe Thr Phe Trp Val Phe Asn Leu Thr Leu Arg He
20 25 30
Gin Arg Phe Leu Ser Arg Lys Met Ala Gin Lys Lys Gly Glu Lys Leu
35 40 45
Lys Leu Ala Pro Tyr Glu Cys Gly Pro Val Ala Leu Lys Gin Pro Asn
50 55 60
Arg Val Ser His His Phe Tyr He Met Ala Met Leu Phe He Leu Phe 65 70 75 80
Asp Val Glu He Val Phe Met Phe Pro Trp Ala He Gly Phe Lys Lys
85 90 95
Leu Gly Leu Phe Gly Leu Val Glu Met Leu Gly Phe Val Phe Phe Leu
100 105 110
Thr He Gly Phe He Tyr Ala Leu Lys Arg Asn Ala Leu Ser Trp Gin
115 120 125
Lys Leu Glu Val Lys 130
(2) INFORMATION FOR SEQ ID NO: 1039:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 864 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear ( i i ) MOLECULE TYPE : Genomi c DNA ( ix) FEATURE :
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 37...831 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1039:
AGCGATCAAA CAAGACGCTC CCAAAAGGTT AGTGTG ATG GTA AGA AAA CAA TCC 54
Met Val Arg Lys Gin Ser
1 5
CCC TAT GAA GAT GTG CAA AAA CAA TCG CGC CAG CAT GAC CCC TAT AAA 102 Pro Tyr Glu Asp Val Gin Lys Gin Ser Arg Gin His Asp Pro Tyr Lys 10 15 20
ATC ATA GAA CCC ACC CCT AAA AAA TAT TTA GAG GGC AGC GCT TAT GAG 150 He He Glu Pro Thr Pro Lys Lys Tyr Leu Glu Gly Ser Ala Tyr Glu 25 30 35
GTC ATT TAC AAC CAC CTT TCT TAC AAA CAT GAG ATT TTA GAC AAA TAC 198 Val He Tyr Asn His Leu Ser Tyr Lys His Glu He Leu Asp Lys Tyr 40 45 50
ATA GAG ACT AAC ACG GCT GTG TTT TGG ATC AAA AAA GAC GAT ATT TTT 246 He Glu Thr Asn Thr Ala Val Phe Trp He Lys Lys Asp Asp He Phe 55 60 65 70
TCT GTC GCT ACG ATT TTA AGG CAT TTG GGT TAT GAG TGT TTG AGC GAA 294 Ser Val Ala Thr He Leu Arg His Leu Gly Tyr Glu Cys Leu Ser Glu 75 80 85
ATG AGC GCG ATA GAT TTG TGC GCT AAA AAA GGG CAT TTT GAA TTG TTT 342 Met Ser Ala He Asp Leu Cys Ala Lys Lys Gly His Phe Glu Leu Phe 90 95 100
TAT CAG TTC GTG GGC TTT AGC GAT AGC TGC AAG AAC CGC CGT AGG NTG 390 Tyr Gin Phe Val Gly Phe Ser Asp Ser Cys Lys Asn Arg Arg Arg Xaa 105 110 115
CGC GTG AAG TGC GTT TTG TTG CCT AAT GAG AGC GTG GAT TCT TTG AGT 438 Arg Val Lys Cys Val Leu Leu Pro Asn Glu Ser Val Asp Ser Leu Ser 120 125 130
TTT TTA TAC CGA TCG GCT AAT TGG AGC GAA AGG GAA GCG TAT GAC ATG 486 Phe Leu Tyr Arg Ser Ala Asn Trp Ser Glu Arg Glu Ala Tyr Asp Met 135 140 145 150
CTT GGT ATT GTG TTT GAC AAA CAC CCC TAT TTG AAA CGC CTT ATT ATG 534 Leu Gly He Val Phe Asp Lys His Pro Tyr Leu Lys Arg Leu He Met 155 160 165 CCG CAT GAT TGG GTA GGC CAC CCA TTA TTG CGC TCT TAC CCG CTC AAA 582 Pro His Asp Trp Val Gly His Pro Leu Leu Arg Ser Tyr Pro Leu Lys 170 175 180
GGC GAT GAA TTC GCC CAA TGG TAT GAA GTG GAT AAA ATT TTT GGT AAA 630 Gly Asp Glu Phe Ala Gin Trp Tyr Glu Val Asp Lys He Phe Gly Lys 185 190 195
GAA TAC CGA GAA GTG GTG GGT AAA GAG CAG AGA GAC AGC GCA AGA GTG 678 Glu Tyr Arg Glu Val Val Gly Lys Glu Gin Arg Asp Ser Ala Arg Val 200 205 210
GAT GAA AAA GAC ACT TTC AAT TTT GCA AAA ATT GGC TAT GAA CAG GGC 726 Asp Glu Lys Asp Thr Phe Asn Phe Ala Lys He Gly Tyr Glu Gin Gly 215 220 225 230
AAG GGC GAA GAA TTA AAA GAA GTA GAA GAA AAG CAT GCG TTT AAG AAA 774 Lys Gly Glu Glu Leu Lys Glu Val Glu Glu Lys His Ala Phe Lys Lys 235 240 245
ATC CCT TTT GTC AAA GAT TTG CAC AAA ATC GCC CCC ACT ATC TTA AAA 822 He Pro Phe Val Lys Asp Leu His Lys He Ala Pro Thr He Leu Lys 250 255 260
AAG AGG CTA TAAAATGGCT CAAAATTTCA CGAAACTCAA CCC 864
Lys Arg Leu 265
(2) INFORMATION FOR SEQ ID NO: 1040:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 265 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1040:
Met Val Arg Lys Gin Ser Pro Tyr Glu Asp Val Gin Lys Gin Ser Arg
1 5 10 15
Gin His Asp Pro Tyr Lys He He Glu Pro Thr Pro Lys Lys Tyr Leu
20 25 30
Glu Gly Ser Ala Tyr Glu Val He Tyr Asn His Leu Ser Tyr Lys His
35 40 45
Glu He Leu Asp Lys Tyr He Glu Thr Asn Thr Ala Val Phe Trp He
50 55 60
Lys Lys Asp Asp He Phe Ser Val Ala Thr He Leu Arg His Leu Gly 65 70 75 80
Tyr Glu Cys Leu Ser Glu Met Ser Ala He Asp Leu Cys Ala Lys Lys
85 90 95
Gly His Phe Glu Leu Phe Tyr Gin Phe Val Gly Phe Ser Asp Ser Cys 100 105 110 Lys Asn Arg Arg Arg Xaa Arg Val Lys Cys Val Leu Leu Pro Asn Glu
115 120 125
Ser Val Asp Ser Leu Ser Phe Leu Tyr Arg Ser Ala Asn Trp Ser Glu
130 135 140
Arg Glu Ala Tyr Asp Met Leu Gly He Val Phe Asp Lys His Pro Tyr 145 150 155 160
Leu Lys Arg Leu He Met Pro His Asp Trp Val Gly His Pro Leu Leu
165 170 175
Arg Ser Tyr Pro Leu Lys Gly Asp Glu Phe Ala Gin Trp Tyr Glu Val
180 185 190
Asp Lys He Phe Gly Lys Glu Tyr Arg Glu Val Val Gly Lys Glu Gin
195 200 205
Arg Asp Ser Ala Arg Val Asp Glu Lys Asp Thr Phe Asn Phe Ala Lys
210 215 220
He Gly Tyr Glu Gin Gly Lys Gly Glu Glu Leu Lys Glu Val Glu Glu 225 230 235 240
Lys His Ala Phe Lys Lys He Pro Phe Val Lys Asp Leu His Lys He
245 250 255
Ala Pro Thr He Leu Lys Lys Arg Leu 260 265
(2) INFORMATION FOR SEQ ID NO: 1041:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2623 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...2580 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1041:
TGACGCTTAC CCTTATATCC CTATTTTATC CCACTCTCAA GGAATTTC ATG ATC ACA 57
Met He Thr
1
ATG AAT ATC AAT GGC AAA ACG ATT GAA TGC CAA GAG GGA CAA AGC GTT 105 Met Asn He Asn Gly Lys Thr He Glu Cys Gin Glu Gly Gin Ser Val 5 10 15
TTA GAG GCT GCT AGG AGC GCT GGG ATC TAC ATC CCT ACC ATT TGC TAT 153 Leu Glu Ala Ala Arg Ser Ala Gly He Tyr He Pro Thr He Cys Tyr 20 25 30 35
TTA AGC GGT TGC TCG CCC ACA GTC GCA TGC AAA ATG TGC ATG GTT GAA 201 Leu Ser Gly Cys Ser Pro Thr Val Ala Cys Lys Met Cys Met Val Glu 40 45 50 ATG GAT GGC AAA CGG GTT TAT AGC TGC AAC ACG AAA GCC AAA AAC AAC 249 Met Asp Gly Lys Arg Val Tyr Ser Cys Asn Thr Lys Ala Lys Asn Asn 55 60 65
GCC ACC ATT CTC ACT AAC ACC CCA ACG CTC ATG GAT GAA AGA AAA AGC 297 Ala Thr He Leu Thr Asn Thr Pro Thr Leu Met Asp Glu Arg Lys Ser 70 75 80
ATC ATG CAA ACT TAT GAT GTC AAC CAC CCC CTA GAG TGT GGC GTG TGC 345 He Met Gin Thr Tyr Asp Val Asn His Pro Leu Glu Cys Gly Val Cys 85 90 95
GAT AAG AGT GGG GAG TGC GAA TTG CAA GAC ATG ACG CAT TTA ACC GGC 393 Asp Lys Ser Gly Glu Cys Glu Leu Gin Asp Met Thr His Leu Thr Gly 100 105 110 115
GTA GAG CAC CAA CCC TAT GCG GTG GCT GAT GAT TTT AAA GCA CTG GAT 441 Val Glu His Gin Pro Tyr Ala Val Ala Asp Asp Phe Lys Ala Leu Asp 120 125 130
TTT TGG GCA AAA GCC TTG TAT GAT CCT AAT TTG TGC ATC ATG TGT GAA 489 Phe Trp Ala Lys Ala Leu Tyr Asp Pro Asn Leu Cys He Met Cys Glu 135 140 145
AGG TGC GTA ACC ACT TGT AAG GAC AAT GTG GGC GAA AAC AAC CTT AAA 537 Arg Cys Val Thr Thr Cys Lys Asp Asn Val Gly Glu Asn Asn Leu Lys 150 155 160
GCC ACT AAA GCC GAC TTG CAT GCT CCG GAT AAA TTT AAA GAC AGC ATG 585 Ala Thr Lys Ala Asp Leu His Ala Pro Asp Lys Phe Lys Asp Ser Met 165 170 175
TCC AAA GAC GCT TTT AGC GTG TGG AGT CGT AAG CAA AAA GGC ATT ATT 633 Ser Lys Asp Ala Phe Ser Val Trp Ser Arg Lys Gin Lys Gly He He 180 185 190 195
TCT TTT GTG GGC AGC GTG CCT TGC TAT GAT TGC GGG GAA TGC ATT GCA 681 Ser Phe Val Gly Ser Val Pro Cys Tyr Asp Cys Gly Glu Cys He Ala 200 205 210
GTA TGC CCT GTG GGC GCT TTG AGC TAT AAA GAT TTC GCT TAC ACG GCT 729 Val Cys Pro Val Gly Ala Leu Ser Tyr Lys Asp Phe Ala Tyr Thr Ala 215 220 225
AAC GCA TGG GAG TTA AAA AAG ATC CAT TCT ACT TGT TCG CAT TGC TCG 777 Asn Ala Trp Glu Leu Lys Lys He His Ser Thr Cys Ser His Cys Ser 230 235 240
GCC GGG TGT TTG ATT TCT TAT GAT GTG CGC CAT TTT GAT ACT CTA GGC 825 Ala Gly Cys Leu He Ser Tyr Asp Val Arg His Phe Asp Thr Leu Gly 245 250 255
GAA GAA TCT AAA ATT TTT AGA GTG CTT AAT GAT TTT TAC CAT AAC CCT 873 Glu Glu Ser Lys He Phe Arg Val Leu Asn Asp Phe Tyr His Asn Pro 260 265 270 275 ATT TGT GGG GCA GGC CGT TTC GCT TTT GAT GTG AGC TCT AGC CCT AAA 921 He Cys Gly Ala Gly Arg Phe Ala Phe Asp Val Ser Ser Ser Pro Lys 280 285 290
GGC AGT GCT AAT CTT AAA GAA GCG CAA AAC GCC CTC AAA GAA TGC GAA 969 Gly Ser Ala Asn Leu Lys Glu Ala Gin Asn Ala Leu Lys Glu Cys Glu 295 300 305
GCG GTG CGA ATA GGT GGG GAT ATT ACG AAT GAA GAG GCG TTT TTA ATA 1017 Ala Val Arg He Gly Gly Asp He Thr Asn Glu Glu Ala Phe Leu He 310 315 320
GAG CGT TTA AGA AAA GAG CTT GAT TTT AAA ATC TAC AAT CAA GAA GCG 1065 Glu Arg Leu Arg Lys Glu Leu Asp Phe Lys He Tyr Asn Gin Glu Ala 325 330 335
TAT CGT TTC CAG CAA TTC TTA AAA GTA TTG GGC GAA ATT AAA CGC CCC 1113 Tyr Arg Phe Gin Gin Phe Leu Lys Val Leu Gly Glu He Lys Arg Pro 340 345 350 355
AGC GTT GAA GAG ATT AAA ACT TCT CAT TTA GTC GTT ACG ATA GGA TCT 1161 Ser Val Glu Glu He Lys Thr Ser His Leu Val Val Thr He Gly Ser 360 365 370
TCT ATC AAA ACA GAA AAC CCT TTG GTG CGC TAT GCC ATC AAT AAC GCT 1209 Ser He Lys Thr Glu Asn Pro Leu Val Arg Tyr Ala He Asn Asn Ala 375 380 385
CTC AAA CTC AAT AAA GCT TCT TTA ATC GCT ATG CAC CCT ATT AAG GAT 1257 Leu Lys Leu Asn Lys Ala Ser Leu He Ala Met His Pro He Lys Asp 390 395 400
AAC GCG CTA GCG AAT TTG TGC CGA AGC TCT TTT TGC ATC ACC CAT GAA 1305 Asn Ala Leu Ala Asn Leu Cys Arg Ser Ser Phe Cys He Thr His Glu 405 410 415
GTG GGG GCT GAA GAA ATC CTT TTA GGC ATG CTT TTA AAA ATG CTT AAC 1353 Val Gly Ala Glu Glu He Leu Leu Gly Met Leu Leu Lys Met Leu Asn 420 425 430 435
ATT GAA AGC GCG GCC CTA AAA AGC TTA GAA GAT TCC AAG CAA AAT ATT 1401 He Glu Ser Ala Ala Leu Lys Ser Leu Glu Asp Ser Lys Gin Asn He 440 445 450
GTA GAT GAA GCG GCT CTT AAA GCC TTA GAA GAA GAG CGA AAA AAA GCT 1449 Val Asp Glu Ala Ala Leu Lys Ala Leu Glu Glu Glu Arg Lys Lys Ala 455 460 465
TTA GAA CAA GCC GAG CAA GGG TGC AGT ATT GGA GAA AAT AAG GCA GAA 1497 Leu Glu Gin Ala Glu Gin Gly Cys Ser He Gly Glu Asn Lys Ala Glu 470 475 480
AAT CAA GAA GAG AAT AAA ACA GAA GCG ACT ACC CCA AAA GAA GAA AAT 1545 Asn Gin Glu Glu Asn Lys Thr Glu Ala Thr Thr Pro Lys Glu Glu Asn 485 490 495 CAA GAA GAA AAC AAG ACA GAG GTT AAA GAA GAA AAA ATT GAA GTC CCT 1593 Gin Glu Glu Asn Lys Thr Glu Val Lys Glu Glu Lys He Glu Val Pro 500 505 510 515
ACC AAA ACC ACT TAT TTG CTG CTT GAA GAA GCG GGC ATC AAT TTA GAA 1641 Thr Lys Thr Thr Tyr Leu Leu Leu Glu Glu Ala Gly He Asn Leu Glu 520 525 530
ACT TAT GAA AAA ATT CTG GCT CTT TTG CAA AAA TCA AAT AAC ACC CTG 1689 Thr Tyr Glu Lys He Leu Ala Leu Leu Gin Lys Ser Asn Asn Thr Leu 535 540 545
CTA GTG GTT GGC GAA GAA ATC TAT AGC CAT AAG CAA GCC CAC AAT ATC 1737 Leu Val Val Gly Glu Glu He Tyr Ser His Lys Gin Ala His Asn He 550 555 560
GCT AAA ATG TTG CGT TTG CTA GCC CAA AAA AGC GCT ATT AAA CTC ATT 1785 Ala Lys Met Leu Arg Leu Leu Ala Gin Lys Ser Ala He Lys Leu He 565 570 575
CTT ATC CCC CCA AGC GCC AAC GCT TTA GGC ATC GCT TCT ATT TGT CAA 1833 Leu He Pro Pro Ser Ala Asn Ala Leu Gly He Ala Ser He Cys Gin 580 585 590 595
TTG AGC GAA GAA ATT TTT GAA CAT GAA AAA ATT GTA GGC ATT CGC GCT 1881 Leu Ser Glu Glu He Phe Glu His Glu Lys He Val Gly He Arg Ala 600 605 610
CAA GGG GAT TTC ACT ATC AAT AGC GAT GAT AGG GTT TTT GGA AAA GAC 1929 Gin Gly Asp Phe Thr He Asn Ser Asp Asp Arg Val Phe Gly Lys Asp 615 620 625
GCT GCC AGC AAA GTG GAT TTT ATT TTA CCC AGT CTC AAC CAG CTA GAA 1977 Ala Ala Ser Lys Val Asp Phe He Leu Pro Ser Leu Asn Gin Leu Glu 630 635 640
GGC ACG ATC ACC AAT ATT GAA GGG CGT GTG TTG CCC TTA AAA CCG GCT 2025 Gly Thr He Thr Asn He Glu Gly Arg Val Leu Pro Leu Lys Pro Ala 645 650 655
TTG AGG TTT GAG GGC TAT GAT TTG AGC GAT ATT ATG CAA GGC TTT GGC 2073 Leu Arg Phe Glu Gly Tyr Asp Leu Ser Asp He Met Gin Gly Phe Gly 660 665 670 675
TTT GTG GAA GAA AAC CTC ATA GAA TGC ACC CAC AAA CTC CCT ACA GAA 2121 Phe Val Glu Glu Asn Leu He Glu Cys Thr His Lys Leu Pro Thr Glu 680 685 690
GCG GGC TTT AAA GCC ATA GAA TTT GAT TAT TTA ACC AAC TAT TTC GCT 2169 Ala Gly Phe Lys Ala He Glu Phe Asp Tyr Leu Thr Asn Tyr Phe Ala 695 700 705
AAC GAC AGA GTC AAC CAC AGA GGC TAT CTG CTA GGA ACA AGC CAT TTT 2217 Asn Asp Arg Val Asn His Arg Gly Tyr Leu Leu Gly Thr Ser His Phe 710 715 720 GAA AAG AGC GCT AAA GAA TGC GAA ACC ATA GAA TGC GAG CCT ATC AAG 2265 Glu Lys Ser Ala Lys Glu Cys Glu Thr He Glu Cys Glu Pro He Lys 725 730 735
CCT TTA AAA GAA AAA ATC GCT TTC AAC GCG TAT TTA AAA TAC CCA GAA 2313 Pro Leu Lys Glu Lys He Ala Phe Asn Ala Tyr Leu Lys Tyr Pro Glu 740 745 750 755
ACG CAA TTC AAT AAC GCT ACT AAT AAA AGC GAG AAT TTG CAA TTA AAA 2361 Thr Gin Phe Asn Asn Ala Thr Asn Lys Ser Glu Asn Leu Gin Leu Lys 760 765 770
GCC GGT GTC TAT GTG TCT AAA GCT TTC TTA AAG AAA TTG AAT AAA GAA 2409 Ala Gly Val Tyr Val Ser Lys Ala Phe Leu Lys Lys Leu Asn Lys Glu 775 780 785
GTG GGG CAA AAC ATC ACT TTA TCT AAA GAA GAA GAG GAA TTA ACA GGC 2457 Val Gly Gin Asn He Thr Leu Ser Lys Glu Glu Glu Glu Leu Thr Gly 790 795 800
GTT TTG TAT CTT GAT GAG AGC TTG GAT CAG GAA GTG TTT GTT ATC TCG 2505 Val Leu Tyr Leu Asp Glu Ser Leu Asp Gin Glu Val Phe Val He Ser 805 810 815
CCT TCT CTT TTG AAA AAC CAT TCT GGC TTT TTT AGA GAG GGC GTG TTT 2553 Pro Ser Leu Leu Lys Asn His Ser Gly Phe Phe Arg Glu Gly Val Phe 820 825 830 835
GAT AGC GTG GAT TTA AAG GAG CAA GCA TGAGCGCTTA TATCATTGAA ACCCTGA 2607 Asp Ser Val Asp Leu Lys Glu Gin Ala 840
TTAAAATTTT GATTTT 2623
(2) INFORMATION FOR SEQ ID NO: 1042:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 844 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1042:
Met He Thr Met Asn He Asn Gly Lys Thr He Glu Cys Gin Glu Gly
1 5 10 15
Gin Ser Val Leu Glu Ala Ala Arg Ser Ala Gly He Tyr He Pro Thr
20 25 30
He Cys Tyr Leu Ser Gly Cys Ser Pro Thr Val Ala Cys Lys Met Cys
35 40 45
Met Val Glu Met Asp Gly Lys Arg Val Tyr Ser Cys Asn Thr Lys Ala
50 55 60
Lys Asn Asn Ala Thr He Leu Thr Asn Thr Pro Thr Leu Met Asp Glu 65 70 75 80
Arg Lys Ser He Met Gin Thr Tyr Asp Val Asn His Pro Leu Glu Cys
85 90 95
Gly Val Cys Asp Lys Ser Gly Glu Cys Glu Leu Gin Asp Met Thr His
100 105 110
Leu Thr Gly Val Glu His Gin Pro Tyr Ala Val Ala Asp Asp Phe Lys
115 120 125
Ala Leu Asp Phe Trp Ala Lys Ala Leu Tyr Asp Pro Asn Leu Cys He
130 135 140
Met Cys Glu Arg Cys Val Thr Thr Cys Lys Asp Asn Val Gly Glu Asn 145 150 155 160
Asn Leu Lys Ala Thr Lys Ala Asp Leu His Ala Pro Asp Lys Phe Lys
165 170 175
Asp Ser Met Ser Lys Asp Ala Phe Ser Val Trp Ser Arg Lys Gin Lys
180 185 190
Gly He He Ser Phe Val Gly Ser Val Pro Cys Tyr Asp Cys Gly Glu
195 200 205
Cys He Ala Val Cys Pro Val Gly Ala Leu Ser Tyr Lys Asp Phe Ala
210 215 220
Tyr Thr Ala Asn Ala Trp Glu Leu Lys Lys He His Ser Thr Cys Ser 225 230 235 240
His Cys Ser Ala Gly Cys Leu He Ser Tyr Asp Val Arg His Phe Asp
245 250 255
Thr Leu Gly Glu Glu Ser Lys He Phe Arg Val Leu Asn Asp Phe Tyr
260 265 270
His Asn Pro He Cys Gly Ala Gly Arg Phe Ala Phe Asp Val Ser Ser
275 280 285
Ser Pro Lys Gly Ser Ala Asn Leu Lys Glu Ala Gin Asn Ala Leu Lys
290 295 300
Glu Cys Glu Ala Val Arg He Gly Gly Asp He Thr Asn Glu Glu Ala 305 310 315 320
Phe Leu He Glu Arg Leu Arg Lys Glu Leu Asp Phe Lys He Tyr Asn
325 330 335
Gin Glu Ala Tyr Arg Phe Gin Gin Phe Leu Lys Val Leu Gly Glu He
340 345 350
Lys Arg Pro Ser Val Glu Glu He Lys Thr Ser His Leu Val Val Thr
355 360 365
He Gly Ser Ser He Lys Thr Glu Asn Pro Leu Val Arg Tyr Ala He
370 375 380
Asn Asn Ala Leu Lys Leu Asn Lys Ala Ser Leu He Ala Met His Pro 385 390 395 400
He Lys Asp Asn Ala Leu Ala Asn Leu Cys Arg Ser Ser Phe Cys He
405 410 415
Thr His Glu Val Gly Ala Glu Glu He Leu Leu Gly Met Leu Leu Lys
420 425 430
Met Leu Asn He Glu Ser Ala Ala Leu Lys Ser Leu Glu Asp Ser Lys
435 440 445
Gin Asn He Val Asp Glu Ala Ala Leu Lys Ala Leu Glu Glu Glu Arg
450 455 460
Lys Lys Ala Leu Glu Gin Ala Glu Gin Gly Cys Ser He Gly Glu Asn 465 470 475 480
Lys Ala Glu Asn Gin Glu Glu Asn Lys Thr Glu Ala Thr Thr Pro Lys
485 490 495
Glu Glu Asn Gin Glu Glu Asn Lys Thr Glu Val Lys Glu Glu Lys He 500 505 510 Glu Val Pro Thr Lys Thr Thr Tyr Leu Leu Leu Glu Glu Ala Gly He
515 520 525
Asn Leu Glu Thr Tyr Glu Lys He Leu Ala Leu Leu Gin Lys Ser Asn
530 535 540
Asn Thr Leu Leu Val Val Gly Glu Glu He Tyr Ser His Lys Gin Ala 545 550 555 560
His Asn He Ala Lys Met Leu Arg Leu Leu Ala Gin Lys Ser Ala He
565 570 575
Lys Leu He Leu He Pro Pro Ser Ala Asn Ala Leu Gly He Ala Ser
580 585 590
He Cys Gin Leu Ser Glu Glu He Phe Glu His Glu Lys He Val Gly
595 600 605
He Arg Ala Gin Gly Asp Phe Thr He Asn Ser Asp Asp Arg Val Phe
610 615 620
Gly Lys Asp Ala Ala Ser Lys Val Asp Phe He Leu Pro Ser Leu Asn 625 630 635 640
Gin Leu Glu Gly Thr He Thr Asn He Glu Gly Arg Val Leu Pro Leu
645 650 655
Lys Pro Ala Leu Arg Phe Glu Gly Tyr Asp Leu Ser Asp He Met Gin
660 665 670
Gly Phe Gly Phe Val Glu Glu Asn Leu He Glu Cys Thr His Lys Leu
675 680 685
Pro Thr Glu Ala Gly Phe Lys Ala He Glu Phe Asp Tyr Leu Thr Asn
690 695 700
Tyr Phe Ala Asn Asp Arg Val Asn His Arg Gly Tyr Leu Leu Gly Thr 705 710 715 720
Ser His Phe Glu Lys Ser Ala Lys Glu Cys Glu Thr He Glu Cys Glu
725 730 735
Pro He Lys Pro Leu Lys Glu Lys He Ala Phe Asn Ala Tyr Leu Lys
740 745 750
Tyr Pro Glu Thr Gin Phe Asn Asn Ala Thr Asn Lys Ser Glu Asn Leu
755 760 765
Gin Leu Lys Ala Gly Val Tyr Val Ser Lys Ala Phe Leu Lys Lys Leu
770 775 780
Asn Lys Glu Val Gly Gin Asn He Thr Leu Ser Lys Glu Glu Glu Glu 785 790 795 800
Leu Thr Gly Val Leu Tyr Leu Asp Glu Ser Leu Asp Gin Glu Val Phe
805 810 815
Val He Ser Pro Ser Leu Leu Lys Asn His Ser Gly Phe Phe Arg Glu
820 825 830
Gly Val Phe Asp Ser Val Asp Leu Lys Glu Gin Ala 835 840
(2) INFORMATION FOR SEQ ID NO: 1043:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 378 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence (B) LOCATION: 44...343 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1043:
AAATCCATGG GAAAAATCAC ACGCAATTTA TAAAGGAATC TCT ATG ATA GGG TTA 55
Met He Gly Leu 1
AAC CAC TAT TTG ATT GTT TCA GGG TTG CTC TTT TGC ATT GGT TTA GCG 103 Asn His Tyr Leu He Val Ser Gly Leu Leu Phe Cys He Gly Leu Ala 5 10 15 20
GGC ATG CTG AAA CGC AAA AAC ATT CTG TTA CTC TTT TTT TCT ACA GAA 151 Gly Met Leu Lys Arg Lys Asn He Leu Leu Leu Phe Phe Ser Thr Glu 25 30 35
ATC ATG CTC AAT GCG ATC AAT ATC GGT TTT GTA GCG ATC TCT AAA TAC 199 He Met Leu Asn Ala He Asn He Gly Phe Val Ala He Ser Lys Tyr 40 45 50
ACG CAT AAT TTA GAC GGG CAG ATG TTT GCG CTC TTT ATT ATC TCT ATT 247 Thr His Asn Leu Asp Gly Gin Met Phe Ala Leu Phe He He Ser He 55 60 65
GCC GCT AGT GAG GTG GCT ATT GGT TTG GGC TTG GTG ATT TTG TGG TTT 295 Ala Ala Ser Glu Val Ala He Gly Leu Gly Leu Val He Leu Trp Phe 70 75 80
AAG AAA TTC AAA AGC TTA GAT ATT GAT TCT TTA AAC GCT ATG AAA GGT T 344 Lys Lys Phe Lys Ser Leu Asp He Asp Ser Leu Asn Ala Met Lys Gly 85 90 95 100
GAGCATGCAA TATTCTTCTT TGCTGTCAGT GGTG 37£
(2) INFORMATION FOR SEQ ID NO: 1044:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 100 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1044:
Met He Gly Leu Asn His Tyr Leu He Val Ser Gly Leu Leu Phe Cys
1 5 10 15
He Gly Leu Ala Gly Met Leu Lys Arg Lys Asn He Leu Leu Leu Phe
20 25 30
Phe Ser Thr Glu He Met Leu Asn Ala He Asn He Gly Phe Val Ala 35 40 45 He Ser Lys Tyr Thr His Asn Leu Asp Gly Gin Met Phe Ala Leu Phe
50 55 60
He He Ser He Ala Ala Ser Glu Val Ala He Gly Leu Gly Leu Val 65 70 75 80
He Leu Trp Phe Lys Lys Phe Lys Ser Leu Asp He Asp Ser Leu Asn
85 90 95
Ala Met Lys Gly 100
(2) INFORMATION FOR SEQ ID NO: 1045:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 663 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...627 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1045:
CCGGTGGAAT AAGTC ATG CAA GCA GTG ATT TTA GCG AAT GGG GAG TTT CCT 51 Met Gin Ala Val He Leu Ala Asn Gly Glu Phe Pro 1 5 10
AAA TCT CAA AAA TGC TTA GAC CTT TTA AAA AAC GCT CCC TTT TTA ATC 99 Lys Ser Gin Lys Cys Leu Asp Leu Leu Lys Asn Ala Pro Phe Leu He 15 20 25
GCA TGC GAT GGG GCT GTT ACC TCA TTA CAT GCG CTT CAA TTC AAA CCC 147 Ala Cys Asp Gly Ala Val Thr Ser Leu His Ala Leu Gin Phe Lys Pro 30 35 40
AGC GTT GTT ATA GGC GAT CTA GAT AGC ATT GAT TCG CAT TTG AAA GCT 195 Ser Val Val He Gly Asp Leu Asp Ser He Asp Ser His Leu Lys Ala 45 50 55 60
TTG TAT AAC CCT ATA CGC ATG AGT GAA CAA AAC AGC AAC GAT TTG TCC 243 Leu Tyr Asn Pro He Arg Met Ser Glu Gin Asn Ser Asn Asp Leu Ser 65 70 75
AAA GCC TTT TTT TAT GCT TTA AAT AAA GGC TGT GAT GAC TTT ATT TTT 291 Lys Ala Phe Phe Tyr Ala Leu Asn Lys Gly Cys Asp Asp Phe He Phe 80 85 90
TTA GGG TTG AAT GGC AAG CGA GAA GAT CAC GCT TTA GCG AAC ACT TTT 339 Leu Gly Leu Asn Gly Lys Arg Glu Asp His Ala Leu Ala Asn Thr Phe 95 100 105 TTA TTG TTG GAA TAT TTT AAA TTT TGC CAA AAA ATC CAA GCC ATA AGC 387 Leu Leu Leu Glu Tyr Phe Lys Phe Cys Gin Lys He Gin Ala He Ser 110 115 120
GAC TAT GGT CTT TTT AGG GTG TTA GAA ACC CCT TTC ACT TTG CCC AGT 435 Asp Tyr Gly Leu Phe Arg Val Leu Glu Thr Pro Phe Thr Leu Pro Ser 125 130 135 140
TTT AAA GGG GAA CAA ATC TCG CTT TTT AGC CTG GAT CTT AAA GCC CAA 483 Phe Lys Gly Glu Gin He Ser Leu Phe Ser Leu Asp Leu Lys Ala Gin 145 150 155
TTC ACT TCT AAA AAC CTC AAA TAC CCC TTA AAA AAC TTG CGT TTA AAA 531 Phe Thr Ser Lys Asn Leu Lys Tyr Pro Leu Lys Asn Leu Arg Leu Lys 160 165 170
ACG CTC TTT TCT GGC TCG CTC AAT GAA GCT ACA GAT AGT TAT TTT AGC 579 Thr Leu Phe Ser Gly Ser Leu Asn Glu Ala Thr Asp Ser Tyr Phe Ser 175 180 185
CTT AGC TCT ACA CCT AAA TCG GTG GTG TTG GTG TAT CAA AAA TTC TTA T 628 Leu Ser Ser Thr Pro Lys Ser Val Val Leu Val Tyr Gin Lys Phe Leu 190 195 200
AAGCGGGTTT TGTTAGGCAA GTTTTTGTCT GTATA 663
(2) INFORMATION FOR SEQ ID NO: 1046:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 204 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1046:
Met Gin Ala Val He Leu Ala Asn Gly Glu Phe Pro Lys Ser Gin Lys
1 5 10 15
Cys Leu Asp Leu Leu Lys Asn Ala Pro Phe Leu He Ala Cys Asp Gly
20 25 30
Ala Val Thr Ser Leu His Ala Leu Gin Phe Lys Pro Ser Val Val He
35 40 45
Gly Asp Leu Asp Ser He Asp Ser His Leu Lys Ala Leu Tyr Asn Pro
50 55 60
He Arg Met Ser Glu Gin Asn Ser Asn Asp Leu Ser Lys Ala Phe Phe 65 70 75 80
Tyr Ala Leu Asn Lys Gly Cys Asp Asp Phe He Phe Leu Gly Leu Asn
85 90 95
Gly Lys Arg Glu Asp His Ala Leu Ala Asn Thr Phe Leu Leu Leu Glu
100 105 110
Tyr Phe Lys Phe Cys Gin Lys He Gin Ala He Ser Asp Tyr Gly Leu
115 120 125
Phe Arg Val Leu Glu Thr Pro Phe Thr Leu Pro Ser Phe Lys Gly Glu 130 135 140
Gin He Ser Leu Phe Ser Leu Asp Leu Lys Ala Gin Phe Thr Ser Lys 145 150 155 160
Asn Leu Lys Tyr Pro Leu Lys Asn Leu Arg Leu Lys Thr Leu Phe Ser
165 170 175
Gly Ser Leu Asn Glu Ala Thr Asp Ser Tyr Phe Ser Leu Ser Ser Thr
180 185 190
Pro Lys Ser Val Val Leu Val Tyr Gin Lys Phe Leu 195 200
(2) INFORMATION FOR SEQ ID NO: 1047:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1106 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 17...1048 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1047:
AGTAAGGGGT TAGAGC ATG AAA GTT ATC AAA ACA GCA CCT TTG ATC CCA TCA 52
Met Lys Val He Lys Thr Ala Pro Leu He Pro Ser 1 5 10
GAA ATT AAG GTG CTA GAG AAA GAG GGC AAT CGG GTT AAG ATT TCT CTG 100 Glu He Lys Val Leu Glu Lys Glu Gly Asn Arg Val Lys He Ser Leu 15 20 25
GCT CCA TTT GAG TTT GGT TAC GCT GTT ACG CTC GCT CAT CCT ATT AGA 148 Ala Pro Phe Glu Phe Gly Tyr Ala Val Thr Leu Ala His Pro He Arg 30 35 40
AGG CTC TTG CTT TTA AGC TCT GTG GGG TAT GCT CCT GTA GGT TTA AAG 196 Arg Leu Leu Leu Leu Ser Ser Val Gly Tyr Ala Pro Val Gly Leu Lys 45 50 55 60
ATT GAA GGC GTC CAT CAT GAG TTT GAC TCT TTA AGG GGG GTT ACT GAA 244 He Glu Gly Val His His Glu Phe Asp Ser Leu Arg Gly Val Thr Glu 65 70 75
GAC GTG TCG CTT TTT ATC ATG AAT TTA AAA AAT ATC CGC TTT ATA GCC 292 Asp Val Ser Leu Phe He Met Asn Leu Lys Asn He Arg Phe He Ala 80 85 90
AAG GCG TTA GTG GGG CAG GAT AGC TCT TTA GAA AAC CAA TCG GTT GTG 340 Lys Ala Leu Val Gly Gin Asp Ser Ser Leu Glu Asn Gin Ser Val Val 95 100 105
GTG GAT TAT TCT TTT AAA GGG CCT ATG GAG CTT AGG GCT AGG GAT TTG 388 Val Asp Tyr Ser Phe Lys Gly Pro Met Glu Leu Arg Ala Arg Asp Leu 110 115 120
AAT TCT GAG CAG ATA GAA ATC GTC AAT CCG GAA ATG CCC CTA GCG ACA 436 Asn Ser Glu Gin He Glu He Val Asn Pro Glu Met Pro Leu Ala Thr 125 130 135 140
ATC AAT GAA GAC GCT CAA TTG AAT TTT TCG CTC ATT ATT TAT AAA GGA 484 He Asn Glu Asp Ala Gin Leu Asn Phe Ser Leu He He Tyr Lys Gly 145 150 155
ATG GGG TAT GTC CCA AGC GAA AAC ACA AGG GAA TTG ATG CCT GAG GGC 532 Met Gly Tyr Val Pro Ser Glu Asn Thr Arg Glu Leu Met Pro Glu Gly 160 165 170
TAC ATG CCG CTA GAC GGC TCT TTC ACG CCG ATT AAA AAG GTC GTT TAT 580 Tyr Met Pro Leu Asp Gly Ser Phe Thr Pro He Lys Lys Val Val Tyr 175 180 185
GAG ATT GAA AAC GTT CTG GTT GAG GGC GAT CCC AAC TAT GAA AAA ATC 628 Glu He Glu Asn Val Leu Val Glu Gly Asp Pro Asn Tyr Glu Lys He 190 195 200
ATT TTT GAT ATT GAA ACA GAC GGG CAG ATT GAC CCT TAT AAA GCG TTT 676 He Phe Asp He Glu Thr Asp Gly Gin He Asp Pro Tyr Lys Ala Phe 205 210 215 220
TTA TCA GCG GTG AAA GTG ATG AGC AAG CAA TTG GGT GTT TTT GGC GAA 724 Leu Ser Ala Val Lys Val Met Ser Lys Gin Leu Gly Val Phe Gly Glu 225 230 235
AGA CCC ATT GCT AAC ACG GAG TAT TCA GGC GAT TAC GCT CAA AGA GAT 772 Arg Pro He Ala Asn Thr Glu Tyr Ser Gly Asp Tyr Ala Gin Arg Asp 240 245 250
GAC GCT AAA GAC TTG AGC GCT AAG ATT GAA AGC ATG AAT TTG AGC GCT 820 Asp Ala Lys Asp Leu Ser Ala Lys He Glu Ser Met Asn Leu Ser Ala 255 260 265
AGG TGT TTT AAT TGC TTG GAT AAA ATC GGC ATC AAG TAT GTG GGC GAA 868 Arg Cys Phe Asn Cys Leu Asp Lys He Gly He Lys Tyr Val Gly Glu 270 275 280
CTC GTG TTG ATG AGC GAA GAA GAG CTT AAG GGC GTG AAA AAC ATG GGT 916 Leu Val Leu Met Ser Glu Glu Glu Leu Lys Gly Val Lys Asn Met Gly 285 290 295 300
AAA AAA TCC TAT GAT GAA ATC GCT GAA AAA TTG AAT GAT TTG GGC TAT 964 Lys Lys Ser Tyr Asp Glu He Ala Glu Lys Leu Asn Asp Leu Gly Tyr 305 310 315 CCG GTA GGC ACA GAA TTA AGC CCT GAA CAA AGA GAG AGT TTA AAG AAA 1012 Pro Val Gly Thr Glu Leu Ser Pro Glu Gin Arg Glu Ser Leu Lys Lys 320 325 330
AGA TTA GAA AAA TTA GAA GAT AAA GGA GGT AAC GAC TGATGAGACA CAAACA 1064 Arg Leu Glu Lys Leu Glu Asp Lys Gly Gly Asn Asp 335 340
CGGATACCGC AAGCTTGGGA GAACCAGCTC GCACAGAAAG GC 1106
(2) INFORMATION FOR SEQ ID NO: 1048:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 344 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1048:
Met Lys Val He Lys Thr Ala Pro Leu He Pro Ser Glu He Lys Val
1 5 10 15
Leu Glu Lys Glu Gly Asn Arg Val Lys He Ser Leu Ala Pro Phe Glu
20 25 30
Phe Gly Tyr Ala Val Thr Leu Ala His Pro He Arg Arg Leu Leu Leu
35 40 45
Leu Ser Ser Val Gly Tyr Ala Pro Val Gly Leu Lys He Glu Gly Val
50 55 60
His His Glu Phe Asp Ser Leu Arg Gly Val Thr Glu Asp Val Ser Leu 65 70 75 80
Phe He Met Asn Leu Lys Asn He Arg Phe He Ala Lys Ala Leu Val
85 90 95
Gly Gin Asp Ser Ser Leu Glu Asn Gin Ser Val Val Val Asp Tyr Ser
100 105 110
Phe Lys Gly Pro Met Glu Leu Arg Ala Arg Asp Leu Asn Ser Glu Gin
115 120 125
He Glu He Val Asn Pro Glu Met Pro Leu Ala Thr He Asn Glu Asp
130 135 140
Ala Gin Leu Asn Phe Ser Leu He He Tyr Lys Gly Met Gly Tyr Val 145 150 155 160
Pro Ser Glu Asn Thr Arg Glu Leu Met Pro Glu Gly Tyr Met Pro Leu
165 170 175
Asp Gly Ser Phe Thr Pro He Lys Lys Val Val Tyr Glu He Glu Asn
180 185 190
Val Leu Val Glu Gly Asp Pro Asn Tyr Glu Lys He He Phe Asp He
195 200 205
Glu Thr Asp Gly Gin He Asp Pro Tyr Lys Ala Phe Leu Ser Ala Val
210 215 220
Lys Val Met Ser Lys Gin Leu Gly Val Phe Gly Glu Arg Pro He Ala 225 230 235 240
Asn Thr Glu Tyr Ser Gly Asp Tyr Ala Gin Arg Asp Asp Ala Lys Asp
245 250 255
Leu Ser Ala Lys He Glu Ser Met Asn Leu Ser Ala Arg Cys Phe Asn 260 265 270
Cys Leu Asp Lys He Gly He Lys Tyr Val Gly Glu Leu Val Leu Met
275 280 285
Ser Glu Glu Glu Leu Lys Gly Val Lys Asn Met Gly Lys Lys Ser Tyr
290 295 300
Asp Glu He Ala Glu Lys Leu Asn Asp Leu Gly Tyr Pro Val Gly Thr 305 310 315 320
Glu Leu Ser Pro Glu Gin Arg Glu Ser Leu Lys Lys Arg Leu Glu Lys
325 330 335
Leu Glu Asp Lys Gly Gly Asn Asp 340
(2) INFORMATION FOR SEQ ID NO: 1049:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 423 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...375 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1049:
AGACAAGGAT AAAGC ATG GCA AGG ATT GCT GGT GTA GAT TTA CCA AAA AAG 51 Met Ala Arg He Ala Gly Val Asp Leu Pro Lys Lys 1 5 10
AAG AGA GTG GAG TAT GCC CTT ACC TAT ATT TAT GGG ATT GGG CTT AAG 99 Lys Arg Val Glu Tyr Ala Leu Thr Tyr He Tyr Gly He Gly Leu Lys 15 20 25
AGT TCC AGA GAG ATT TTA GAA GCG GTA GGC ATT TCT TTT GAC AAG CGC 147 Ser Ser Arg Glu He Leu Glu Ala Val Gly He Ser Phe Asp Lys Arg 30 35 40
GTG CAT GAA TTG AGC GAA GAT GAA GTG TCT AGC ATC GCT AAA AAA ATC 195 Val His Glu Leu Ser Glu Asp Glu Val Ser Ser He Ala Lys Lys He 45 50 55 60
CAA CAA AGC TAC CTA GTA GAG GGC GAT TTG CGT AAA AAA GTT CAA ATG 243 Gin Gin Ser Tyr Leu Val Glu Gly Asp Leu Arg Lys Lys Val Gin Met 65 70 75
GAT ATT AAA TCT TTA ATG GAC TTG GGG AAT TAT CGT GGG ATC AGG CAT 291 Asp He Lys Ser Leu Met Asp Leu Gly Asn Tyr Arg Gly He Arg His 80 85 90 CGT AAG GGT CTT CCT GTG AGA GGT CAA ACC ACT AAA AAT AAC GCT AGG 339 Arg Lys Gly Leu Pro Val Arg Gly Gin Thr Thr Lys Asn Asn Ala Arg 95 100 105
ACT CGT AAG GGT AAG AAA AAA ACC GTG GGT AGC AAG TAGCGAATAA GGAGAT 391 Thr Arg Lys Gly Lys Lys Lys Thr Val Gly Ser Lys 110 115 120
GATGATTTAA TGGCTAAGAG AAATGTAACG GC 423
(2) INFORMATION FOR SEQ ID NO: 1050:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 120 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1050:
Met Ala Arg He Ala Gly Val Asp Leu Pro Lys Lys Lys Arg Val Glu
1 5 10 15
Tyr Ala Leu Thr Tyr He Tyr Gly He Gly Leu Lys Ser Ser Arg Glu
20 25 30
He Leu Glu Ala Val Gly He Ser Phe Asp Lys Arg Val His Glu Leu
35 40 45
Ser Glu Asp Glu Val Ser Ser He Ala Lys Lys He Gin Gin Ser Tyr
50 55 60
Leu Val Glu Gly Asp Leu Arg Lys Lys Val Gin Met Asp He Lys Ser 65 70 75 80
Leu Met Asp Leu Gly Asn Tyr Arg Gly He Arg His Arg Lys Gly Leu
85 90 95
Pro Val Arg Gly Gin Thr Thr Lys Asn Asn Ala Arg Thr Arg Lys Gly
100 105 110
Lys Lys Lys Thr Val Gly Ser Lys 115 120
(2) INFORMATION FOR SEQ ID NO: 1051:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 649 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...621 (D) OTHER INFORMATTON: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1051:
AAGCGTAGGG TGTTTTTA ATG ATT TTT TAT AGA AAG GAA GCT ACA ATG AAC 51
Met He Phe Tyr Arg Lys Glu Ala Thr Met Asn 1 5 10
GCA TTG AAA AAA TTA AGT TTC TGC GCC TTG TTA TCC CTA GGC CTC TTC 99 Ala Leu Lys Lys Leu Ser Phe Cys Ala Leu Leu Ser Leu Gly Leu Phe 15 20 25
GCT CAA ACA GCG CAT GCT AAG CAT TTA AAG GGC ACG ATT AAC TAT CCT 147 Ala Gin Thr Ala His Ala Lys His Leu Lys Gly Thr He Asn Tyr Pro 30 35 40
GAT TGG CTT GAA ATC AAT TTT TTT GAC GAA AAA AAC CCG CCC AAT CAA 195 Asp Trp Leu Glu He Asn Phe Phe Asp Glu Lys Asn Pro Pro Asn Gin 45 50 55
TAT GTC GGA TCG GCT TCA ATT TCT GGT AAA AGG AAC GAT TTT TAC GCC 243 Tyr Val Gly Ser Ala Ser He Ser Gly Lys Arg Asn Asp Phe Tyr Ala 60 65 70 75
AAT TAC ATC CCC TAT GAT GAC CAA TTG CCC CCT GAA CAA AAC GCT GAA 291 Asn Tyr He Pro Tyr Asp Asp Gin Leu Pro Pro Glu Gin Asn Ala Glu 80 85 90
AAA ATC GCT CTT TTA AGG GCC AGA ATA AAC GCT TAC AGC ACT TTA GAG 339 Lys He Ala Leu Leu Arg Ala Arg He Asn Ala Tyr Ser Thr Leu Glu 95 100 105
AGC ATT TTA CTC ACT AAA ATG CAC AAT CGT ATT GTT AAG GTG CTT CAA 387 Ser He Leu Leu Thr Lys Met His Asn Arg He Val Lys Val Leu Gin 110 115 120
GTT AAA AAT AAT GTT ATC AGC CAT TTA TTC GGG CTT GTT GAT TTT TTA 435 Val Lys Asn Asn Val He Ser His Leu Phe Gly Leu Val Asp Phe Leu 125 130 135
ACC TCT AAA TCC ATT TTG GCT AAA AGG TTC GTG GAT ACC ACA AAT CAT 483 Thr Ser Lys Ser He Leu Ala Lys Arg Phe Val Asp Thr Thr Asn His 140 145 150 155
CGT GTG TAT GTC ATG GTG CAA TTC CCT TTC ATT CAG CCT GAA GAC TTG 531 Arg Val Tyr Val Met Val Gin Phe Pro Phe He Gin Pro Glu Asp Leu 160 165 170
ATC GCT TAC TTT AAA GCC AAA CGC ATC GAC CTT TCT TCA GCG AGC GCT 579 He Ala Tyr Phe Lys Ala Lys Arg He Asp Leu Ser Ser Ala Ser Ala 175 180 185
ACC CAT CTC AGC GCC CTT TTA AAT AAG GCG TTG TTC CAC CTC TAAGAGTTT 630 Thr His Leu Ser Ala Leu Leu Asn Lys Ala Leu Phe His Leu 190 195 200
GGGATTTAAG ATGCGGTTT 649 (2) INFORMATION FOR SEQ ID NO: 1052:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 201 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1052:
Met He Phe Tyr Arg Lys Glu Ala Thr Met Asn Ala Leu Lys Lys Leu
1 5 10 15
Ser Phe Cys Ala Leu Leu Ser Leu Gly Leu Phe Ala Gin Thr Ala His
20 25 30
Ala Lys His Leu Lys Gly Thr He Asn Tyr Pro Asp Trp Leu Glu He
35 40 45
Asn Phe Phe Asp Glu Lys Asn Pro Pro Asn Gin Tyr Val Gly Ser Ala
50 55 60
Ser He Ser Gly Lys Arg Asn Asp Phe Tyr Ala Asn Tyr He Pro Tyr 65 70 75 80
Asp Asp Gin Leu Pro Pro Glu Gin Asn Ala Glu Lys He Ala Leu Leu
85 90 95
Arg Ala Arg He Asn Ala Tyr Ser Thr Leu Glu Ser He Leu Leu Thr
100 105 110
Lys Met His Asn Arg He Val Lys Val Leu Gin Val Lys Asn Asn Val
115 120 125
He Ser His Leu Phe Gly Leu Val Asp Phe Leu Thr Ser Lys Ser He
130 135 140
Leu Ala Lys Arg Phe Val Asp Thr Thr Asn His Arg Val Tyr Val Met 145 150 155 160
Val Gin Phe Pro Phe He Gin Pro Glu Asp Leu He Ala Tyr Phe Lys
165 170 175
Ala Lys Arg He Asp Leu Ser Ser Ala Ser Ala Thr His Leu Ser Ala
180 185 190
Leu Leu Asn Lys Ala Leu Phe His Leu 195 200
(2) INFORMATION FOR SEQ ID NO: 1053:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 540 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...513 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1053:
GCATGCTCTT AGAGT ATG TCT GTA TCG CAT GTT GCT TTA ATC TTA AGG AAA 51 Met Ser Val Ser His Val Ala Leu He Leu Arg Lys 1 5 10
TTG TTT TAT CAT AGA CAA GGA GTT TTT ATG GGC GGT TTT TCA GTG GGA 99 Leu Phe Tyr His Arg Gin Gly Val Phe Met Gly Gly Phe Ser Val Gly 15 20 25
ATG TTG AAA GAT TAT GTG GAC ATA TTT GTT TTT GCG GTG CTT GGC GTG 147 Met Leu Lys Asp Tyr Val Asp He Phe Val Phe Ala Val Leu Gly Val 30 35 40
GCC AGT TTT TTA GCT TTG TGG TTT GCG ATT GAA AGG GTT ATT TTT TAT 195 Ala Ser Phe Leu Ala Leu Trp Phe Ala He Glu Arg Val He Phe Tyr 45 50 55 60
TCT AAA GTC GAT TTG AAA GCT TAT GAC GAT ATA GAT GCC CTG AAT TTG 243 Ser Lys Val Asp Leu Lys Ala Tyr Asp Asp He Asp Ala Leu Asn Leu 65 70 75
GAT TTA ACC AAG AAT CTA ACC ATT CTC TAT GTG ATT TTT TCT AAC GCG 291 Asp Leu Thr Lys Asn Leu Thr He Leu Tyr Val He Phe Ser Asn Ala 80 85 90
CCT TAT GTG GGC TTA TTA GGG ACG GTT TTA GGG ATT ATG GTG ATT TTC 339 Pro Tyr Val Gly Leu Leu Gly Thr Val Leu Gly He Met Val He Phe 95 100 105
TAT GAC ATG GGC GTG AGC GGC GGG ATG GAC GCT AAA ACG ATC ATG GTA 387 Tyr Asp Met Gly Val Ser Gly Gly Met Asp Ala Lys Thr He Met Val 110 115 120
GGT TTG TCT TTG GCT TTA AAA GCG ACC GCT CTA GGG CTT GCT GTG GCG 435 Gly Leu Ser Leu Ala Leu Lys Ala Thr Ala Leu Gly Leu Ala Val Ala 125 130 135 140
ATT CCC ACT TTG ATC GCT TAT AAT AGC TTG TTG AGA AAA TCC GAT GTT 483 He Pro Thr Leu He Ala Tyr Asn Ser Leu Leu Arg Lys Ser Asp Val 145 150 155
TTG AGC GAA AAA TTC AGG ATC ATG AAA AAA TGAAAAGCAT CAGAAGAGGC GAT 536 Leu Ser Glu Lys Phe Arg He Met Lys Lys 160 165
GGGC 540
(2) INFORMATION FOR SEQ ID NO: 1054:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 166 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1054:
Met Ser Val Ser His Val Ala Leu He Leu Arg Lys Leu Phe Tyr His
1 5 10 15
Arg Gin Gly Val Phe Met Gly Gly Phe Ser Val Gly Met Leu Lys Asp
20 25 30
Tyr Val Asp He Phe Val Phe Ala Val Leu Gly Val Ala Ser Phe Leu
35 40 45
Ala Leu Trp Phe Ala He Glu Arg Val He Phe Tyr Ser Lys Val Asp
50 55 60
Leu Lys Ala Tyr Asp Asp He Asp Ala Leu Asn Leu Asp Leu Thr Lys 65 70 75 80
Asn Leu Thr He Leu Tyr Val He Phe Ser Asn Ala Pro Tyr Val Gly
85 90 95
Leu Leu Gly Thr Val Leu Gly He Met Val He Phe Tyr Asp Met Gly
100 105 110
Val Ser Gly Gly Met Asp Ala Lys Thr He Met Val Gly Leu Ser Leu
115 120 125
Ala Leu Lys Ala Thr Ala Leu Gly Leu Ala Val Ala He Pro Thr Leu
130 135 140
He Ala Tyr Asn Ser Leu Leu Arg Lys Ser Asp Val Leu Ser Glu Lys 145 150 155 160
Phe Arg He Met Lys Lys 165
(2) INFORMATION FOR SEQ ID NO: 1055:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 777 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25...723 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1055:
TCATGAATTA AACCCTAGCG AACA ATG AAG CTT TTT GAC TAC GCT CCT TTG 51
Met Lys Leu Phe Asp Tyr Ala Pro Leu
1 5
AGT TTG GCT TGG CGG GAG TTT TTG CAA AGC GAA TTT AAA AAG CCT TAT 99 Ser Leu Ala Trp Arg Glu Phe Leu Gin Ser Glu Phe Lys Lys Pro Tyr 10 15 20 25
TTT TTA GAA ATA GAA AAA CGC TAC CTA GAA GCC CTA AAA ATC CCT AAA 147 Phe Leu Glu He Glu Lys Arg Tyr Leu Glu Ala Leu Lys He Pro Lys 30 35 40
ACC ATT TTC CCT AAA AGC TCT AAT CTG TTT TAT GCG CTC AAT CTA ACG 195 Thr He Phe Pro Lys Ser Ser Asn Leu Phe Tyr Ala Leu Asn Leu Thr 45 50 55
CCC CCT TGT GCG GTT AAA ATC ATC CTT TTA GGG CAA GAC CCC TAC CAT 243 Pro Pro Cys Ala Val Lys He He Leu Leu Gly Gin Asp Pro Tyr His 60 65 70
TCC ACC TAC CTA GAA AAT GAT CAA GAA TTG CCG GTG GCG ATG GGG TTG 291 Ser Thr Tyr Leu Glu Asn Asp Gin Glu Leu Pro Val Ala Met Gly Leu 75 80 85
AGC TTT AGC GTG GAA AAA AAC GCC CCT ATT CCT CCA AGT TTA AAA AAT 339 Ser Phe Ser Val Glu Lys Asn Ala Pro He Pro Pro Ser Leu Lys Asn 90 95 100 105
ATT TTT AAA GAA TTG CAT GCG AAT TTA GGC GTG CCT GTG CCT TGT TGT 387 He Phe Lys Glu Leu His Ala Asn Leu Gly Val Pro Val Pro Cys Cys 110 115 120
GGG GAT TTG AGC GCA TGG GCT AAA AGG GGC ATG CTC TTA TTG AAC GCC 435 Gly Asp Leu Ser Ala Trp Ala Lys Arg Gly Met Leu Leu Leu Asn Ala 125 130 135
ATT TTA AGC GTG GAA AAA AAC CAA GCC GCT TCG CAC CAA TAT ATT GGC 483 He Leu Ser Val Glu Lys Asn Gin Ala Ala Ser His Gin Tyr He Gly 140 145 150
TGG GAA GCT TTT AGC GAT CAA ATA CTG ATG CGC CTT TTT GAA ACG ACC 531 Trp Glu Ala Phe Ser Asp Gin He Leu Met Arg Leu Phe Glu Thr Thr 155 160 165
GCC CCT TTA ATC GTG GTG TTA CTA GGG AAA GTC GCC CAA AAA AAG ATC 579 Ala Pro Leu He Val Val Leu Leu Gly Lys Val Ala Gin Lys Lys He 170 175 180 185
GCG TTA ATC CCC AAA AAC AAA CAC ATC ATC ATC ACA GCC CCT CAC CCT 627 Ala Leu He Pro Lys Asn Lys His He He He Thr Ala Pro His Pro 190 195 200
AGC CCA CTA TCT AGG GGG TTT TTA GGG AGT GGG GTT TTT ACA AGC GTT 675 Ser Pro Leu Ser Arg Gly Phe Leu Gly Ser Gly Val Phe Thr Ser Val 205 210 215
CAA AAA GCT TAT AGA GAG GTT TAT CGC AAG GAT TTT GAT TTT AGT TTA T 724 Gin Lys Ala Tyr Arg Glu Val Tyr Arg Lys Asp Phe Asp Phe Ser Leu 220 225 230
GATTGATGCT TAATGAGACA GAACCCCTTA AGAATGCCTT TATTTAAGAG CAT 777
(2) INFORMATION FOR SEQ ID NO: 1056: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 233 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1056:
Met Lys Leu Phe Asp Tyr Ala Pro Leu Ser Leu Ala Trp Arg Glu Phe
1 5 10 15
Leu Gin Ser Glu Phe Lys Lys Pro Tyr Phe Leu Glu He Glu Lys Arg
20 25 30
Tyr Leu Glu Ala Leu Lys He Pro Lys Thr He Phe Pro Lys Ser Ser
35 40 45
Asn Leu Phe Tyr Ala Leu Asn Leu Thr Pro Pro Cys Ala Val Lys He
50 55 60
He Leu Leu Gly Gin Asp Pro Tyr His Ser Thr Tyr Leu Glu Asn Asp 65 70 75 80
Gin Glu Leu Pro Val Ala Met Gly Leu Ser Phe Ser Val Glu Lys Asn
85 90 95
Ala Pro He Pro Pro Ser Leu Lys Asn He Phe Lys Glu Leu His Ala
100 105 110
Asn Leu Gly Val Pro Val Pro Cys Cys Gly Asp Leu Ser Ala Trp Ala
115 120 125
Lys Arg Gly Met Leu Leu Leu Asn Ala He Leu Ser Val Glu Lys Asn
130 135 140
Gin Ala Ala Ser His Gin Tyr He Gly Trp Glu Ala Phe Ser Asp Gin 145 150 155 160
He Leu Met Arg Leu Phe Glu Thr Thr Ala Pro Leu He Val Val Leu
165 170 175
Leu Gly Lys Val Ala Gin Lys Lys He Ala Leu He Pro Lys Asn Lys
180 185 190
His He He He Thr Ala Pro His Pro Ser Pro Leu Ser Arg Gly Phe
195 200 205
Leu Gly Ser Gly Val Phe Thr Ser Val Gin Lys Ala Tyr Arg Glu Val
210 215 220
Tyr Arg Lys Asp Phe Asp Phe Ser Leu 225 230
(2) INFORMATION FOR SEQ ID NO: 1057:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1242 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...1179 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1057:
AAGAAGAAAT AAAAACTC ATG GGG TTT TTA TTT GAA AAA TCG TTA ATG AGT 51
Met Gly Phe Leu Phe Glu Lys Ser Leu Met Ser 1 5 10
TTT TTC GCT CAT CCA ATC AAA ATC CTT AAA ATC ATC AGT TTG ATT TTA 99 Phe Phe Ala His Pro He Lys He Leu Lys He He Ser Leu He Leu 15 20 25
AGT TTT TTG GTA AGC TTT TTG GTT GCT GAA AAC GCT CAT GAG CCA GAA 147 Ser Phe Leu Val Ser Phe Leu Val Ala Glu Asn Ala His Glu Pro Glu 30 35 40
GAA ATC AAG GCT AAA GTG GCT TAT GTG AAA ATC CCC CAA TTA GAA GAT 195 Glu He Lys Ala Lys Val Ala Tyr Val Lys He Pro Gin Leu Glu Asp 45 50 55
TTG GAA AAC AAC CCG GTT TAT ATC GGT CAA ATT ATA GGC GTA ACT TAT 243 Leu Glu Asn Asn Pro Val Tyr He Gly Gin He He Gly Val Thr Tyr 60 65 70 75
GAT TTA TTG CTG TTT GAC GCT GAG TTT TTG GAA GCC AAA ATC AAA GAC 291 Asp Leu Leu Leu Phe Asp Ala Glu Phe Leu Glu Ala Lys He Lys Asp 80 85 90
GGG TTG GAT AAA ACC CAA ATT GAG CTT TTA AAC AAG ATG CCT AAA TGG 339 Gly Leu Asp Lys Thr Gin He Glu Leu Leu Asn Lys Met Pro Lys Trp 95 100 105
AAA AAG GTG GAA AAA GAG CTT TTC AGA GCG ACT TAT TAT TAC AAG ATT 387 Lys Lys Val Glu Lys Glu Leu Phe Arg Ala Thr Tyr Tyr Tyr Lys He 110 115 120
AAG GGC ATA AAA GCG ATT ATT CCG TCC TTA GAA GTG AGC GCG TTT TCC 435 Lys Gly He Lys Ala He He Pro Ser Leu Glu Val Ser Ala Phe Ser 125 130 135
AAT AAA GAC AAA TAC ATA GAT CAT TCC ATA GCC CCA AAA GTT ACT TTG 483 Asn Lys Asp Lys Tyr He Asp His Ser He Ala Pro Lys Val Thr Leu 140 145 150 155
CAG GTA ACG GAT TTG TCC AAA AAC CCT CGT TAT GCG AAT GTC ATG GCT 531 Gin Val Thr Asp Leu Ser Lys Asn Pro Arg Tyr Ala Asn Val Met Ala 160 165 170
AAA GAT TTA CAA GTC TTG CAA TAC AAA ACC AAA GAT TAT GAC GAT AAA 579 Lys Asp Leu Gin Val Leu Gin Tyr Lys Thr Lys Asp Tyr Asp Asp Lys 175 180 185
AAC AAT ATT TTG GTG ATG GAA ATA GCG TTC AAA GAA GCC ACT TGG GAA 627 Asn Asn He Leu Val Met Glu He Ala Phe Lys Glu Ala Thr Trp Glu 190 195 200
GAT TTT CAC ATC AAA GAA GCG ATC AAG CAA GGG TTT GAT AAC GCC TCT 675 Asp Phe His He Lys Glu Ala He Lys Gin Gly Phe Asp Asn Ala Ser 205 210 215
TTA AAC CAG ATC AAG GCT AAA GAA GGG AGC GTT TTT TAT TAT TGC GTG 723 Leu Asn Gin He Lys Ala Lys Glu Gly Ser Val Phe Tyr Tyr Cys Val 220 225 230 235
TTG CCT AAG ACT ATT CAA AAC CTT TCT TTT GAT TAT TTC TCG CTT TCA 771 Leu Pro Lys Thr He Gin Asn Leu Ser Phe Asp Tyr Phe Ser Leu Ser 240 245 250
AAT AAG CAA TTT AAA ACC TTA TCT TTT TCA ACC ATT CCC ACT CAA GAC 819 Asn Lys Gin Phe Lys Thr Leu Ser Phe Ser Thr He Pro Thr Gin Asp 255 260 265
ACT ACC GGT ATT CAA AGC GAT CTC ATC CCT AAA AAC AAT TTT TTA GTC 867 Thr Thr Gly He Gin Ser Asp Leu He Pro Lys Asn Asn Phe Leu Val 270 275 280
TTT TCT AAT GTG GCG TTG CTC GCT TTG TGC GTG TTT TTC TTG GTG CTG 915 Phe Ser Asn Val Ala Leu Leu Ala Leu Cys Val Phe Phe Leu Val Leu 285 290 295
TTT TTC ATT TTT GGG CGC AAA CTC ATT TTT TTA GGG CTT GGG ATT TTG 963 Phe Phe He Phe Gly Arg Lys Leu He Phe Leu Gly Leu Gly He Leu 300 305 310 315
TGC TTA GGG TTT GTT TTG TAT CAC CTT TTA TTC ACG CAA AAA TCA GCC 1011 Cys Leu Gly Phe Val Leu Tyr His Leu Leu Phe Thr Gin Lys Ser Ala 320 325 330
CTA TTG CTC GCT CAT AAA AAA ATC CGC ATT CTG CCC ACG CAA AAT TCC 1059 Leu Leu Leu Ala His Lys Lys He Arg He Leu Pro Thr Gin Asn Ser 335 340 345
ACC ATT TTA GGG CTT TCT AAA AAT GAA ATG CCG ATT AAA ATC TTA GGC 1107 Thr He Leu Gly Leu Ser Lys Asn Glu Met Pro He Lys He Leu Gly 350 355 360
TCG CAT GAT GAT TAT TAT AAA ATC CTA ACG CCG CAT GAA CAA ATA GGA 1155 Ser His Asp Asp Tyr Tyr Lys He Leu Thr Pro His Glu Gin He Gly 365 370 375
TGG GTC AAA AAA GAT GAA GTC AAA TAAAAAGTCC AATCGTTTAA GAGCGATTTA 1209 Trp Val Lys Lys Asp Glu Val Lys 380 385
TAGAGCTTTA GTGATCGCTA TAGGACTAGC TGT 1242
(2) INFORMATION FOR SEQ ID NO: 1058:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 387 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1058:
Met Gly Phe Leu Phe Glu Lys Ser Leu Met Ser Phe Phe Ala His Pro
1 5 10 15
He Lys He Leu Lys He He Ser Leu He Leu Ser Phe Leu Val Ser
20 25 30
Phe Leu Val Ala Glu Asn Ala His Glu Pro Glu Glu He Lys Ala Lys
35 40 45
Val Ala Tyr Val Lys He Pro Gin Leu Glu Asp Leu Glu Asn Asn Pro
50 55 60
Val Tyr He Gly Gin He He Gly Val Thr Tyr Asp Leu Leu Leu Phe 65 70 75 80
Asp Ala Glu Phe Leu Glu Ala Lys He Lys Asp Gly Leu Asp Lys Thr
85 90 95
Gin He Glu Leu Leu Asn Lys Met Pro Lys Trp Lys Lys Val Glu Lys
100 105 110
Glu Leu Phe Arg Ala Thr Tyr Tyr Tyr Lys He Lys Gly He Lys Ala
115 120 125
He He Pro Ser Leu Glu Val Ser Ala Phe Ser Asn Lys Asp Lys Tyr
130 135 140
He Asp His Ser He Ala Pro Lys Val Thr Leu Gin Val Thr Asp Leu 145 150 155 160
Ser Lys Asn Pro Arg Tyr Ala Asn Val Met Ala Lys Asp Leu Gin Val
165 170 175
Leu Gin Tyr Lys Thr Lys Asp Tyr Asp Asp Lys Asn Asn He Leu Val
180 185 190
Met Glu He Ala Phe Lys Glu Ala Thr Trp Glu Asp Phe His He Lys
195 200 205
Glu Ala He Lys Gin Gly Phe Asp Asn Ala Ser Leu Asn Gin He Lys
210 215 220
Ala Lys Glu Gly Ser Val Phe Tyr Tyr Cys Val Leu Pro Lys Thr He 225 230 235 240
Gin Asn Leu Ser Phe Asp Tyr Phe Ser Leu Ser Asn Lys Gin Phe Lys
245 250 255
Thr Leu Ser Phe Ser Thr He Pro Thr Gin Asp Thr Thr Gly He Gin
260 265 270
Ser Asp Leu He Pro Lys Asn Asn Phe Leu Val Phe Ser Asn Val Ala
275 280 285
Leu Leu Ala Leu Cys Val Phe Phe Leu Val Leu Phe Phe He Phe Gly
290 295 300
Arg Lys Leu He Phe Leu Gly Leu Gly He Leu Cys Leu Gly Phe Val 305 310 315 320
Leu Tyr His Leu Leu Phe Thr Gin Lys Ser Ala Leu Leu Leu Ala His
325 330 335
Lys Lys He Arg He Leu Pro Thr Gin Asn Ser Thr He Leu Gly Leu
340 345 350
Ser Lys Asn Glu Met Pro He Lys He Leu Gly Ser His Asp Asp Tyr 355 360 365 Tyr Lys He Leu Thr Pro His Glu Gin He Gly Trp Val Lys Lys Asp
370 375 380
Glu Val Lys 385
(2) INFORMATION FOR SEQ ID NO: 1059:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1455 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...1395 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1059:
TTCTAATCTC AAAAATGGGT GGTGTTATTA ACA ATG ACA AAA CGA CTT TTT AAA 54
Met Thr Lys Arg Leu Phe Lys 1 5
GGG TTG TTA GCG GTT TCT CTT GCT GTG AGT TTG CAT GGT GGT GAA GTT 102 Gly Leu Leu Ala Val Ser Leu Ala Val Ser Leu His Gly Gly Glu Val 10 15 20
AAG GAA AAA AAG CCG GTT AAG CCG GTT AAA GAA GAT CCG CAA GAA TTA 150 Lys Glu Lys Lys Pro Val Lys Pro Val Lys Glu Asp Pro Gin Glu Leu 25 30 35
GCG GCT AAA AGG GTG GAA GCG TTC AGT CGT TTC TCT AAT GTG GTT TCA 198 Ala Ala Lys Arg Val Glu Ala Phe Ser Arg Phe Ser Asn Val Val Ser 40 45 50 55
GAA ATT GAA AAA AAA TAT GTG GAT AAA ATC AGC ATT TCT GAG ATC ATG 246 Glu He Glu Lys Lys Tyr Val Asp Lys He Ser He Ser Glu He Met 60 65 70
ACT AAA GCG ATT GAA GGC TTG CTC TCT AAT TTG GAC GCG CAT TCA GCG 294 Thr Lys Ala He Glu Gly Leu Leu Ser Asn Leu Asp Ala His Ser Ala 75 80 85
TAT TTG AAT GAA AAG AAG TTT AAG GAA TTT CAA GCC CAA ACC GAG GGC 342 Tyr Leu Asn Glu Lys Lys Phe Lys Glu Phe Gin Ala Gin Thr Glu Gly 90 95 100
GAA TTT GGG GGG CTT GGG ATC ACG GTG GGC ATG CGC GAT GGC GTT TTA 390 Glu Phe Gly Gly Leu Gly He Thr Val Gly Met Arg Asp Gly Val Leu 105 110 115 ACC GTT ATT GCC CCT TTA GAA GGC ACT CCA GCT TAC AAG GCT GGG GTT 438 Thr Val He Ala Pro Leu Glu Gly Thr Pro Ala Tyr Lys Ala Gly Val 120 125 130 135
AAG TCA GGC GAT AAC ATT TTA AAA ATC AAT AAC GAA AGC ACG CTG AGC 486 Lys Ser Gly Asp Asn He Leu Lys He Asn Asn Glu Ser Thr Leu Ser 140 145 150
ATG AGC ATT GAT GAT GCG ATC AAC CTC ATG CGC GGC AAG CCA AAA ACC 534 Met Ser He Asp Asp Ala He Asn Leu Met Arg Gly Lys Pro Lys Thr 155 160 165
CCT ATT CAG ATC ACC GTT GTA AGA AAA AAC GAG CCA AAA CCT TTA GTG 582 Pro He Gin He Thr Val Val Arg Lys Asn Glu Pro Lys Pro Leu Val 170 175 180
TTT AAC ATC ATT AGA GAC ATC ATT AAA CTC CCC TCT GTC TAT GTG AAA 630 Phe Asn He He Arg Asp He He Lys Leu Pro Ser Val Tyr Val Lys 185 190 195
AAG ATT AAA GAA ACC CCT TAT CTG TAT GTG AGA GTG AGT GGT TTT GAC 678 Lys He Lys Glu Thr Pro Tyr Leu Tyr Val Arg Val Ser Gly Phe Asp 200 205 210 215
AAG AAT GTT ACC AAA TCG GTT TTA GAA GGC TTA AAA GCT AAC CCT AAG 726 Lys Asn Val Thr Lys Ser Val Leu Glu Gly Leu Lys Ala Asn Pro Lys 220 225 230
GCT AAG GGG ATC GTG TTG GAT TTA AGG GGC AAT CCT GGA GGG CTA TTA 774 Ala Lys Gly He Val Leu Asp Leu Arg Gly Asn Pro Gly Gly Leu Leu 235 240 245
AAC CAA GCG GTG GGC TTG TCT AAC CTC TTC ATT AAA GAG GGG GTT TTA 822 Asn Gin Ala Val Gly Leu Ser Asn Leu Phe He Lys Glu Gly Val Leu 250 255 260
GTC TCT CAA AAA GGC AAA AAT AAA GAA GAA AAT TTA GAA TAC AAG GCT 870 Val Ser Gin Lys Gly Lys Asn Lys Glu Glu Asn Leu Glu Tyr Lys Ala 265 270 275
AAC GGC AGA GCC CCT TAT ACC AAT TTG CCT ATT GCG GTG TTA GTC AAT 918 Asn Gly Arg Ala Pro Tyr Thr Asn Leu Pro He Ala Val Leu Val Asn 280 285 290 295
GGC GGT TCA GCG AGC GCG AGC GAG ATC GTC GCA GGG GCA CTG CAA GAT 966 Gly Gly Ser Ala Ser Ala Ser Glu He Val Ala Gly Ala Leu Gin Asp 300 305 310
CAC AAA CGG GCC GTG ATT ATC GGT GAA AAA ACC TTT GGT AAG GGA AGC 1014 His Lys Arg Ala Val He He Gly Glu Lys Thr Phe Gly Lys Gly Ser 315 320 325
GTG CAG ATG CTG CTC CCT GTC AAT AAA GAC GAA GCC ATT AAA ATC ACA 1062 Val Gin Met Leu Leu Pro Val Asn Lys Asp Glu Ala He Lys He Thr 330 335 340 ACC GCA CGC TAC TAT TTG CCG AGC GGG CGT ACC ATT CAA GCT AAG GGG 1110 Thr Ala Arg Tyr Tyr Leu Pro Ser Gly Arg Thr He Gin Ala Lys Gly 345 350 355
ATC ACG CCT GAT ATT GTG ATT TAT CCG GGT AAA GTG CCA GAA AAT GAA 1158 He Thr Pro Asp He Val He Tyr Pro Gly Lys Val Pro Glu Asn Glu 360 365 370 375
AAC AAA TTC AGC TTG AAA GAA GCG GAT CTA AAA CAC CAT TTA GAG CAA 1206 Asn Lys Phe Ser Leu Lys Glu Ala Asp Leu Lys His His Leu Glu Gin 380 385 390
GAG CTT AAA AAG ATT GAT GAT AAA ACC CCC AAT TCC AAA GAG GCG GAT 1254 Glu Leu Lys Lys He Asp Asp Lys Thr Pro Asn Ser Lys Glu Ala Asp 395 400 405
AAA GAC AAG AAA AAC GAA GAG GAA AAA GAG ATT ACT CCT AAA ATG ATC 1302 Lys Asp Lys Lys Asn Glu Glu Glu Lys Glu He Thr Pro Lys Met He 410 415 420
AAC GAT GAT ATT CAG CTA AAA ACC GCT ATT GAC AGC TTG AAA ACC TGG 1350 Asn Asp Asp He Gin Leu Lys Thr Ala He Asp Ser Leu Lys Thr Trp 425 430 435
TCT ATC GTT GAT GAG AAA ATG GAT GAA AAA GCG CCT AAG AAG AAA TAAAA 1400 Ser He Val Asp Glu Lys Met Asp Glu Lys Ala Pro Lys Lys Lys 440 445 450
ACTCATGGGG TTTTTATTTG AAAAATCGTT AATGAGTTTT TTCGCTCATC CAATC 1455
(2) INFORMATION FOR SEQ ID NO: 1060:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 454 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1060:
Met Thr Lys Arg Leu Phe Lys Gly Leu Leu Ala Val Ser Leu Ala Val
1 5 10 15
Ser Leu His Gly Gly Glu Val Lys Glu Lys Lys Pro Val Lys Pro Val
20 25 30
Lys Glu Asp Pro Gin Glu Leu Ala Ala Lys Arg Val Glu Ala Phe Ser
35 40 45
Arg Phe Ser Asn Val Val Ser Glu He Glu Lys Lys Tyr Val Asp Lys
50 55 60
He Ser He Ser Glu He Met Thr Lys Ala He Glu Gly Leu Leu Ser 65 70 75 80
Asn Leu Asp Ala His Ser Ala Tyr Leu Asn Glu Lys Lys Phe Lys Glu
85 90 95
Phe Gin Ala Gin Thr Glu Gly Glu Phe Gly Gly Leu Gly He Thr Val 100 105 110
Gly Met Arg Asp Gly Val Leu Thr Val He Ala Pro Leu Glu Gly Thr
115 120 125
Pro Ala Tyr Lys Ala Gly Val Lys Ser Gly Asp Asn He Leu Lys He
130 135 140
Asn Asn Glu Ser Thr Leu Ser Met Ser He Asp Asp Ala He Asn Leu 145 150 155 160
Met Arg Gly Lys Pro Lys Thr Pro He Gin He Thr Val Val Arg Lys
165 170 175
Asn Glu Pro Lys Pro Leu Val Phe Asn He He Arg Asp He He Lys
180 185 190
Leu Pro Ser Val Tyr Val Lys Lys He Lys Glu Thr Pro Tyr Leu Tyr
195 200 205
Val Arg Val Ser Gly Phe Asp Lys Asn Val Thr Lys Ser Val Leu Glu
210 215 220
Gly Leu Lys Ala Asn Pro Lys Ala Lys Gly He Val Leu Asp Leu Arg 225 230 235 240
Gly Asn Pro Gly Gly Leu Leu Asn Gin Ala Val Gly Leu Ser Asn Leu
245 250 255
Phe He Lys Glu Gly Val Leu Val Ser Gin Lys Gly Lys Asn Lys Glu
260 265 270
Glu Asn Leu Glu Tyr Lys Ala Asn Gly Arg Ala Pro Tyr Thr Asn Leu
275 280 285
Pro He Ala Val Leu Val Asn Gly Gly Ser Ala Ser Ala Ser Glu He
290 295 300
Val Ala Gly Ala Leu Gin Asp His Lys Arg Ala Val He He Gly Glu 305 310 315 320
Lys Thr Phe Gly Lys Gly Ser Val Gin Met Leu Leu Pro Val Asn Lys
325 330 335
Asp Glu Ala He Lys He Thr Thr Ala Arg Tyr Tyr Leu Pro Ser Gly
340 345 350
Arg Thr He Gin Ala Lys Gly He Thr Pro Asp He Val He Tyr Pro
355 360 365
Gly Lys Val Pro Glu Asn Glu Asn Lys Phe Ser Leu Lys Glu Ala Asp
370 375 380
Leu Lys His His Leu Glu Gin Glu Leu Lys Lys He Asp Asp Lys Thr 385 390 395 400
Pro Asn Ser Lys Glu Ala Asp Lys Asp Lys Lys Asn Glu Glu Glu Lys
405 410 415
Glu He Thr Pro Lys Met He Asn Asp Asp He Gin Leu Lys Thr Ala
420 425 430
He Asp Ser Leu Lys Thr Trp Ser He Val Asp Glu Lys Met Asp Glu
435 440 445
Lys Ala Pro Lys Lys Lys 450
(2) INFORMATION FOR SEQ ID NO: 1061:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1150 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA ( ix) FEATURE :
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...1098 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1061:
GATATAAAAG GTTAGTTAAT C ATG GAT TTT TTA AAA GAA AAC TTA AAC ACT 51
Met Asp Phe Leu Lys Glu Asn Leu Asn Thr 1 5 10
ATC ATA GAG GGG GAT TGT TTA GAA AAA TTG AAA GAC TTT CCT AAC AGA 99 He He Glu Gly Asp Cys Leu Glu Lys Leu Lys Asp Phe Pro Asn Arg 15 20 25
AGC GTT GAT TTT ATC TTT GCT GAC CCC CCA TAT TTT ATG CAA ACA GAG 147 Ser Val Asp Phe He Phe Ala Asp Pro Pro Tyr Phe Met Gin Thr Glu 30 35 40
GGG GAA TTG AAG CGT TTT GAA GGC ACA AAA TTT CAA GGC GTT GAG GAT 195 Gly Glu Leu Lys Arg Phe Glu Gly Thr Lys Phe Gin Gly Val Glu Asp 45 50 55
TAT TGG GAT AAA TTT GGC TCT TTT AAG GAA TAC GAT GCC TTT TGT TTG 243 Tyr Trp Asp Lys Phe Gly Ser Phe Lys Glu Tyr Asp Ala Phe Cys Leu 60 65 70
GGT TGG TTG AAA GAA TGC CAA AGG ATT TTA AAA GAT AAT GGC AGT ATT 291 Gly Trp Leu Lys Glu Cys Gin Arg He Leu Lys Asp Asn Gly Ser He 75 80 85 90
TGT GTG ATA GGG AGT TTT CAA AAT ATT TTT AGA ATT GGT TTT CAT TTG 339 Cys Val He Gly Ser Phe Gin Asn He Phe Arg He Gly Phe His Leu 95 100 105
CAA AAT TTA GGG TTT TGG ATA CTC AAT GAT ATT ATT TGG CAC AAG AGT 387 Gin Asn Leu Gly Phe Trp He Leu Asn Asp He He Trp His Lys Ser 110 115 120
AAT CCG GTG CCT AAT TTT GCT GGC AAG AGA TTA TGC AAC GCC CAT GAG 435 Asn Pro Val Pro Asn Phe Ala Gly Lys Arg Leu Cys Asn Ala His Glu 125 130 135
ACG CTT ATT TGG TGT GCT AAA CAC AAA AAC AGC AAA GTT GCC TTT AAT 483 Thr Leu He Trp Cys Ala Lys His Lys Asn Ser Lys Val Ala Phe Asn 140 145 150
TAT AAA ACA ATG AAG TAC CTC AAT AAC GAC AAA CAA GAA AAA TCG GTT 531 Tyr Lys Thr Met Lys Tyr Leu Asn Asn Asp Lys Gin Glu Lys Ser Val 155 160 165 170
TGG CAA ATC CCT ATT TGC ATG GGT AAC GAA AGA CTA AAA GAT GCG CAA 579 Trp Gin He Pro He Cys Met Gly Asn Glu Arg Leu Lys Asp Ala Gin 175 180 185
GGT AAA AAA GTG CAT TCC ACG CAA AAA CCA GAA GCG CTT TTA AAA AAA 627 Gly Lys Lys Val His Ser Thr Gin Lys Pro Glu Ala Leu Leu Lys Lys 190 195 200
ATC ATT TTA AGC GCG ACT AAA CCT AAA GAT ATT ATT TTA GAT CCC TTT 675 He He Leu Ser Ala Thr Lys Pro Lys Asp He He Leu Asp Pro Phe 205 210 215
TTT GGC ACA GGC ACA ACA GGG GCT GTG GCT AAA TCC ATG AAC AGG TAT 723 Phe Gly Thr Gly Thr Thr Gly Ala Val Ala Lys Ser Met Asn Arg Tyr 220 225 230
TTT ATT GGT ATT GAA AAA GAT TCT TTT TAT ATT AAA GAA GCG GCA AAA 771 Phe He Gly He Glu Lys Asp Ser Phe Tyr He Lys Glu Ala Ala Lys 235 240 245 250
CGC CTG AAT AAC ACT AGG GAT AAA AGC GAT TTT ATC ACT AAT TTA GAT 819 Arg Leu Asn Asn Thr Arg Asp Lys Ser Asp Phe He Thr Asn Leu Asp 255 260 265
TTA GAA ACT AAA CCC CCA AAA ATA CCT ATG AGT CTT TTA ATT TCT AAA 867 Leu Glu Thr Lys Pro Pro Lys He Pro Met Ser Leu Leu He Ser Lys 270 275 280
CAA TTA TTA AAA ATC GGG GAT TTT TTA TAC TCA CCT AAC AAA GAA AAA 915 Gin Leu Leu Lys He Gly Asp Phe Leu Tyr Ser Pro Asn Lys Glu Lys 285 290 295
ATT TGT CAA GTT TTA GAA AAC GGA CAA GTG AGG GAT AAT GAA AAC TAT 963 He Cys Gin Val Leu Glu Asn Gly Gin Val Arg Asp Asn Glu Asn Tyr 300 305 310
GAA ACT TCT ATT CAT AAG ATG AGC GCT AAA TAT TTG AAT AAA ACC AAC 1011 Glu Thr Ser He His Lys Met Ser Ala Lys Tyr Leu Asn Lys Thr Asn 315 320 325 330
CAT AAT GGC TGG AAA TTT TTT TAT GCG TAT TAC CAA AAT CAA TTT TTA 1059 His Asn Gly Trp Lys Phe Phe Tyr Ala Tyr Tyr Gin Asn Gin Phe Leu 335 340 345
TTG CTA GAT GAA TTG CGT TAT ATC TGC CAA AAG GAC TCT TAATGGACTA TC 1110 Leu Leu Asp Glu Leu Arg Tyr He Cys Gin Lys Asp Ser 350 355
AAACCTTTAA CGAGATTTTT AATCGTTTTG TCTTTGGAAC 1150
(2) INFORMATION FOR SEQ ID NO: 1062:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 359 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1062:
Met Asp Phe Leu Lys Glu Asn Leu Asn Thr He He Glu Gly Asp Cys
1 5 10 15
Leu Glu Lys Leu Lys Asp Phe Pro Asn Arg Ser Val Asp Phe He Phe
20 25 30
Ala Asp Pro Pro Tyr Phe Met Gin Thr Glu Gly Glu Leu Lys Arg Phe
35 40 45
Glu Gly Thr Lys Phe Gin Gly Val Glu Asp Tyr Trp Asp Lys Phe Gly
50 55 60
Ser Phe Lys Glu Tyr Asp Ala Phe Cys Leu Gly Trp Leu Lys Glu Cys 65 70 75 80
Gin Arg He Leu Lys Asp Asn Gly Ser He Cys Val He Gly Ser Phe
85 90 95
Gin Asn He Phe Arg He Gly Phe His Leu Gin Asn Leu Gly Phe Trp
100 105 110
He Leu Asn Asp He He Trp His Lys Ser Asn Pro Val Pro Asn Phe
115 120 125
Ala Gly Lys Arg Leu Cys Asn Ala His Glu Thr Leu He Trp Cys Ala
130 135 140
Lys His Lys Asn Ser Lys Val Ala Phe Asn Tyr Lys Thr Met Lys Tyr 145 150 155 160
Leu Asn Asn Asp Lys Gin Glu Lys Ser Val Trp Gin He Pro He Cys
165 170 175
Met Gly Asn Glu Arg Leu Lys Asp Ala Gin Gly Lys Lys Val His Ser
180 185 190
Thr Gin Lys Pro Glu Ala Leu Leu Lys Lys He He Leu Ser Ala Thr
195 200 205
Lys Pro Lys Asp He He Leu Asp Pro Phe Phe Gly Thr Gly Thr Thr
210 215 220
Gly Ala Val Ala Lys Ser Met Asn Arg Tyr Phe He Gly He Glu Lys 225 230 235 240
Asp Ser Phe Tyr He Lys Glu Ala Ala Lys Arg Leu Asn Asn Thr Arg
245 250 255
Asp Lys Ser Asp Phe He Thr Asn Leu Asp Leu Glu Thr Lys Pro Pro
260 265 270
Lys He Pro Met Ser Leu Leu He Ser Lys Gin Leu Leu Lys He Gly
275 280 285
Asp Phe Leu Tyr Ser Pro Asn Lys Glu Lys He Cys Gin Val Leu Glu
290 295 300
Asn Gly Gin Val Arg Asp Asn Glu Asn Tyr Glu Thr Ser He His Lys 305 310 315 320
Met Ser Ala Lys Tyr Leu Asn Lys Thr Asn His Asn Gly Trp Lys Phe
325 330 335
Phe Tyr Ala Tyr Tyr Gin Asn Gin Phe Leu Leu Leu Asp Glu Leu Arg
340 345 350
Tyr He Cys Gin Lys Asp Ser 355
(2) INFORMATION FOR SEQ ID NO: 1063: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1536 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...1497 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1063:
TAGAAAAGAT CAAACAATTA TAAAAGGATA AAA ATG GAT CAT TTA AAG CAT TTG 54
Met Asp His Leu Lys His Leu 1 5
CAG CAA TTG CAA AAC ATT GAA AGG ATC GTG CTT TCA GGC ATT GTG TTG 102 Gin Gin Leu Gin Asn He Glu Arg He Val Leu Ser Gly He Val Leu 10 15 20
GCC AAT CAT AAG ATT GAA GAG GTC CAT AGC GTT TTA GAG CCT AGC GAT 150 Ala Asn His Lys He Glu Glu Val His Ser Val Leu Glu Pro Ser Asp 25 30 35
TTT TAC TAC CCG CCT AAC GGC TTA TTT TTT GAA ATC GCT TTA AAA CTG 198 Phe Tyr Tyr Pro Pro Asn Gly Leu Phe Phe Glu He Ala Leu Lys Leu 40 45 50 55
CAT GAA GAA GAT TGC CCC ATT GAT GAG AAT TTT ATC CGC CAA AAA ATG 246 His Glu Glu Asp Cys Pro He Asp Glu Asn Phe He Arg Gin Lys Met 60 65 70
CCT AAA GAC AAG CAG ATC AAA GAA GAA GAT CTA GTC GCT ATT TTT GCG 294 Pro Lys Asp Lys Gin He Lys Glu Glu Asp Leu Val Ala He Phe Ala 75 80 85
GCA AGC CCC ATA GAT AAT ATT GAA GCC TAT GTG GAA GAG ATT AAA AAC 342 Ala Ser Pro He Asp Asn He Glu Ala Tyr Val Glu Glu He Lys Asn 90 95 100
GCT TCC ATT AAA CGA AAA CTT TTT GGC TTG GCT AAC ACC ATT AGA GAG 390 Ala Ser He Lys Arg Lys Leu Phe Gly Leu Ala Asn Thr He Arg Glu 105 110 115
CAA GCC CTA GAA AGC GCG CAA AAA TCC AGC GAT ATT TTA GGC GCT GTG 438 Gin Ala Leu Glu Ser Ala Gin Lys Ser Ser Asp He Leu Gly Ala Val 120 125 130 135
GAG CGA GAA GTC TAT GCG TTA TTG AAT GGC AGC ACC ATA GAA GGC TTT 486 Glu Arg Glu Val Tyr Ala Leu Leu Asn Gly Ser Thr He Glu Gly Phe 140 145 150
AGG AAT ATT AAA GAA GTG CTT GAA AGC GCA ATG GAT CTC ATT ACA GAA 534 Arg Asn He Lys Glu Val Leu Glu Ser Ala Met Asp Leu He Thr Glu 155 160 165
AAC CAA AGA AAG GGG AGT TTG GAA GTT ACT GGC ATA CCG ACT GGC TTT 582 Asn Gin Arg Lys Gly Ser Leu Glu Val Thr Gly He Pro Thr Gly Phe 170 175 180
GTC CAA TTG GAT AAC TAT ACG AGC GGT TTC AAT AAG GGG AGT TTA GTC 630 Val Gin Leu Asp Asn Tyr Thr Ser Gly Phe Asn Lys Gly Ser Leu Val 185 190 195
ATT ATA GGG GCA AGG CCG TCT ATG GGT AAA ACT AGT TTG ATG ATG AAC 678 He He Gly Ala Arg Pro Ser Met Gly Lys Thr Ser Leu Met Met Asn 200 205 210 215
ATG GTC TTA AGC GCG CTC AAT GAC GAT AGG GGG GTA GCG GTT TTT AGT 726 Met Val Leu Ser Ala Leu Asn Asp Asp Arg Gly Val Ala Val Phe Ser 220 225 230
TTA GAA ATG TCC GCA GAG CAA CTC GCT TTA AGG GCG TTA TCG GAT CTC 774 Leu Glu Met Ser Ala Glu Gin Leu Ala Leu Arg Ala Leu Ser Asp Leu 235 240 245
ACC TCT ATT AAC ATG CAT GAT TTA GAA AGC GGG AGG CTT GAT GAT GAT 822 Thr Ser He Asn Met His Asp Leu Glu Ser Gly Arg Leu Asp Asp Asp 250 255 260
CAA TGG GAA AAT TTA GCC AAA TGC TTT GAT CAC CTT TCG CAA AAA AAA 870 Gin Trp Glu Asn Leu Ala Lys Cys Phe Asp His Leu Ser Gin Lys Lys 265 270 275
CTC TTT TTC TAC GAT AAA AGT TAT GTG AGG ATA GAG CAA ATC CGC TTG 918 Leu Phe Phe Tyr Asp Lys Ser Tyr Val Arg He Glu Gin He Arg Leu 280 285 290 295
CAA CTA CGA AAG CTT AAA TCC CAA CAC AAG GAA TTG GGT ATC GCT TTT 966 Gin Leu Arg Lys Leu Lys Ser Gin His Lys Glu Leu Gly He Ala Phe 300 305 310
ATT GAC TAT TTG CAG CTC ATG TCA GGG AGT AAA GCC ACT AAA GAG CGC 1014 He Asp Tyr Leu Gin Leu Met Ser Gly Ser Lys Ala Thr Lys Glu Arg 315 320 325
CAT GAG CAA ATC GCT GAA ATT TCA AGG GAG CTT AAA ACT TTA GCC AGA 1062 His Glu Gin He Ala Glu He Ser Arg Glu Leu Lys Thr Leu Ala Arg 330 335 340
GAA TTA GAA ATC CCT ATC ATA GCG TTA GTG CAA CTC AAC CGC AGC CTA 1110 Glu Leu Glu He Pro He He Ala Leu Val Gin Leu Asn Arg Ser Leu 345 350 355
GAA AAC CGA GAC GAT AAA CGG CCC ATT CTT TCG GAT ATC AAA GAC AGC 1158 Glu Asn Arg Asp Asp Lys Arg Pro He Leu Ser Asp He Lys Asp Ser 360 365 370 375
GGG GGG ATT GAA CAA GAC GCT GAT ATT GTT TTA TTT TTA TAT AGA GGC 1206 Gly Gly He Glu Gin Asp Ala Asp He Val Leu Phe Leu Tyr Arg Gly 380 385 390
TAT ATC TAT CAA ATG AGG GCT GAA GAC AAC AAA ATA GAC AAA CTC AAA 1254 Tyr He Tyr Gin Met Arg Ala Glu Asp Asn Lys He Asp Lys Leu Lys 395 400 405
AAA GAA GGT AAA ATT GAA GAG GCG CAA GAG TTG TAC TTA AAA GTT AAT 1302 Lys Glu Gly Lys He Glu Glu Ala Gin Glu Leu Tyr Leu Lys Val Asn 410 415 420
GAA GAA AGG CGT ATC CAC AAG CAA AAT GGC AGC ATT GAA GAG GCT GAA 1350 Glu Glu Arg Arg He His Lys Gin Asn Gly Ser He Glu Glu Ala Glu 425 430 435
ATC ATT GTG GCT AAA AAC AGG AAT GGG GCT ACA GGA ACG GTT TAT ACG 1398 He He Val Ala Lys Asn Arg Asn Gly Ala Thr Gly Thr Val Tyr Thr 440 445 450 455
CGC TTT AAC GCT CCT TTC ACG CGC TAT GAA GAC ATG CCC ATA GAT TCC 1446 Arg Phe Asn Ala Pro Phe Thr Arg Tyr Glu Asp Met Pro He Asp Ser 460 465 470
CAT TTA GAA GAA GGG CAA GAA ACT AAA GTG GAT TAT GAT ATA GTT ACA 1494 His Leu Glu Glu Gly Gin Glu Thr Lys Val Asp Tyr Asp He Val Thr 475 480 485
ACT TGAAAGACAA AACTTTTCAG GGGGCGTTTG AACTTCTTA 1536
Thr
(2) INFORMATION FOR SEQ ID NO: 1064:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 488 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1064:
Met Asp His Leu Lys His Leu Gin Gin Leu Gin Asn He Glu Arg He
1 5 10 15
Val Leu Ser Gly He Val Leu Ala Asn His Lys He Glu Glu Val His
20 25 30
Ser Val Leu Glu Pro Ser Asp Phe Tyr Tyr Pro Pro Asn Gly Leu Phe
35 40 45
Phe Glu He Ala Leu Lys Leu His Glu Glu Asp Cys Pro He Asp Glu 50 55 60
Asn Phe He Arg Gin Lys Met Pro Lys Asp Lys Gin He Lys Glu Glu 65 70 75 80
Asp Leu Val Ala He Phe Ala Ala Ser Pro He Asp Asn He Glu Ala
85 90 95
Tyr Val Glu Glu He Lys Asn Ala Ser He Lys Arg Lys Leu Phe Gly
100 105 110
Leu Ala Asn Thr He Arg Glu Gin Ala Leu Glu Ser Ala Gin Lys Ser
115 120 125
Ser Asp He Leu Gly Ala Val Glu Arg Glu Val Tyr Ala Leu Leu Asn
130 135 140
Gly Ser Thr He Glu Gly Phe Arg Asn He Lys Glu Val Leu Glu Ser 145 150 155 160
Ala Met Asp Leu He Thr Glu Asn Gin Arg Lys Gly Ser Leu Glu Val
165 170 175
Thr Gly He Pro Thr Gly Phe Val Gin Leu Asp Asn Tyr Thr Ser Gly
180 185 190
Phe Asn Lys Gly Ser Leu Val He He Gly Ala Arg Pro Ser Met Gly
195 200 205
Lys Thr Ser Leu Met Met Asn Met Val Leu Ser Ala Leu Asn Asp Asp
210 215 220
Arg Gly Val Ala Val Phe Ser Leu Glu Met Ser Ala Glu Gin Leu Ala 225 230 235 240
Leu Arg Ala Leu Ser Asp Leu Thr Ser He Asn Met His Asp Leu Glu
245 250 255
Ser Gly Arg Leu Asp Asp Asp Gin Trp Glu Asn Leu Ala Lys Cys Phe
260 265 270
Asp His Leu Ser Gin Lys Lys Leu Phe Phe Tyr Asp Lys Ser Tyr Val
275 280 285
Arg He Glu Gin He Arg Leu Gin Leu Arg Lys Leu Lys Ser Gin His
290 295 300
Lys Glu Leu Gly He Ala Phe He Asp Tyr Leu Gin Leu Met Ser Gly 305 310 315 320
Ser Lys Ala Thr Lys Glu Arg His Glu Gin He Ala Glu He Ser Arg
325 330 335
Glu Leu Lys Thr Leu Ala Arg Glu Leu Glu He Pro He He Ala Leu
340 345 350
Val Gin Leu Asn Arg Ser Leu Glu Asn Arg Asp Asp Lys Arg Pro He
355 360 365
Leu Ser Asp He Lys Asp Ser Gly Gly He Glu Gin Asp Ala Asp He
370 375 380
Val Leu Phe Leu Tyr Arg Gly Tyr He Tyr Gin Met Arg Ala Glu Asp 385 390 395 400
Asn Lys He Asp Lys Leu Lys Lys Glu Gly Lys He Glu Glu Ala Gin
405 410 415
Glu Leu Tyr Leu Lys Val Asn Glu Glu Arg Arg He His Lys Gin Asn
420 425 430
Gly Ser He Glu Glu Ala Glu He He Val Ala Lys Asn Arg Asn Gly
435 440 445
Ala Thr Gly Thr Val Tyr Thr Arg Phe Asn Ala Pro Phe Thr Arg Tyr
450 455 460
Glu Asp Met Pro He Asp Ser His Leu Glu Glu Gly Gin Glu Thr Lys 465 470 475 480
Val Asp Tyr Asp He Val Thr Thr 485 (2) INFORMATION FOR SEQ ID NO: 1065:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1246 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 98...1207 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1065:
GAAACGCATA AGGGGGTTGG CTATCGCTTT AACCCACTAT GAAAAAAAAT CCCTTAAACT 60 CTTTTTAGGG ATTTATTTAG GCTCTTCGTT TGTGTTG ATG CTA GTG ATT AGC GTT 115
Met Leu Val He Ser Val 1 5
TTA GCG TTT AAC TAT GAA AAA AAC GAA AAA ATC AAA ATG ATA CGC ATG 163 Leu Ala Phe Asn Tyr Glu Lys Asn Glu Lys He Lys Met He Arg Met 10 15 20
GAC ATG GAC AAA ATG GCT TCT AAG ATC GCT AGC GAA GTG ATT GCC TTG 211 Asp Met Asp Lys Met Ala Ser Lys He Ala Ser Glu Val He Ala Leu 25 30 35
CAC ATG CAA ACG CAT GGG GAT TAT CAA AAC GCT TTA AAC GCT CTC ATT 259 His Met Gin Thr His Gly Asp Tyr Gin Asn Ala Leu Asn Ala Leu He 40 45 50
TCA CGC TAT AAA GAC GCT TCC ATA GCC CTT TTT GAT AGT AAA AAG CGT 307 Ser Arg Tyr Lys Asp Ala Ser He Ala Leu Phe Asp Ser Lys Lys Arg 55 60 65 70
GTT TTG TAT TCT AAT ATC CCT GAA AGC GCC AAT TTG ATT AAA AAC CAT 355 Val Leu Tyr Ser Asn He Pro Glu Ser Ala Asn Leu He Lys Asn His 75 80 85
AAA GAA GCG GGC TTT TTT AGT TTT AGG GGA GAG TAT TAC CTA TTG AGC 403 Lys Glu Ala Gly Phe Phe Ser Phe Arg Gly Glu Tyr Tyr Leu Leu Ser 90 95 100
GAT GAA ACT TTC GCT CAC TTA GGC GTG GCT AAA ATG CTT TTT AAA AAT 451 Asp Glu Thr Phe Ala His Leu Gly Val Ala Lys Met Leu Phe Lys Asn 105 110 115
TCT AAA CCC CTT CAT TTT TCT TCT TTG TAT CGT AAC ATT GTT TTA GTG 499 Ser Lys Pro Leu His Phe Ser Ser Leu Tyr Arg Asn He Val Leu Val 120 125 130 TTT GTT GTA GCG TTT TTA TGC GTG ATA GGG GTT TCT GTG TTT TTG GGG 547 Phe Val Val Ala Phe Leu Cys Val He Gly Val Ser Val Phe Leu Gly 135 140 145 150
CGT TTG TTT TTA AAG CCC ATT AGG AAT GAA ATC ACC CGC ATT GAT CAT 595 Arg Leu Phe Leu Lys Pro He Arg Asn Glu He Thr Arg He Asp His 155 160 165
TTT TTA AAA AAC ACC ACG CAT GAA TTA AAC ACC CCC ATG AGC GCT TTA 643 Phe Leu Lys Asn Thr Thr His Glu Leu Asn Thr Pro Met Ser Ala Leu 170 175 180
GTC TTG TCT TTA AAA ACC TTA GAA GAC AAC CAA CAA CAC CGC CGC ATT 691 Val Leu Ser Leu Lys Thr Leu Glu Asp Asn Gin Gin His Arg Arg He 185 190 195
AAA ATC GCT ATC CAG CGC ATG AGT TTT TTA TAC CGC TCG CTC TCG TAT 739 Lys He Ala He Gin Arg Met Ser Phe Leu Tyr Arg Ser Leu Ser Tyr 200 205 210
TTA GTG ATG CAA GAT ATT GAG CGC GAA TCT TTT GTG CTT TTA GAT TTA 787 Leu Val Met Gin Asp He Glu Arg Glu Ser Phe Val Leu Leu Asp Leu 215 220 225 230
AAA GCC CTG ATT ATT AAA GAA AAC ACG CTT TTT AGC GAG ATG ATA GAC 835 Lys Ala Leu He He Lys Glu Asn Thr Leu Phe Ser Glu Met He Asp 235 240 245
TAC CAC AAG CTG GAA TTT AAA AGC GAT TTA GTG GAA GTG GAA CTT AAA 883 Tyr His Lys Leu Glu Phe Lys Ser Asp Leu Val Glu Val Glu Leu Lys 250 255 260
GCT AAA GAG CAG GAT TTC ATT TCG CTT TAT AGC AAT TTG CTC ATG AAC 931 Ala Lys Glu Gin Asp Phe He Ser Leu Tyr Ser Asn Leu Leu Met Asn 265 270 275
GCG ATC AAA TAC AGC GTC ATG AAT GGG TAT ATC CAC ATA GAG CTA ACG 979 Ala He Lys Tyr Ser Val Met Asn Gly Tyr He His He Glu Leu Thr 280 285 290
CAT GCG TTT TTG AAA GTG AAA AAT TTA GGG TAT GAA ATC CCT AAA GAC 1027 His Ala Phe Leu Lys Val Lys Asn Leu Gly Tyr Glu He Pro Lys Asp 295 300 305 310
AAG ATC ACA GAA TTA AGC GTT CGT TAT GTG CGT TTC AAT TCT GGC GTG 1075 Lys He Thr Glu Leu Ser Val Arg Tyr Val Arg Phe Asn Ser Gly Val 315 320 325
TTG GGT TAT GGT ATA GGG TTA GGT TTG GTG AAA AAA GTG TGC GAA AAG 1123 Leu Gly Tyr Gly He Gly Leu Gly Leu Val Lys Lys Val Cys Glu Lys 330 335 340
TAT AAA ATG CGT TTA GAA ATT CAT AGC GAA CCC TCT TTA AAG GGA TCG 1171 Tyr Lys Met Arg Leu Glu He His Ser Glu Pro Ser Leu Lys Gly Ser 345 350 355 TTT TAT GAA AAT TCG TTT TGC GTT CAA TTT CAA GGA TAAAGATGCT TTCAGT 1223 Phe Tyr Glu Asn Ser Phe Cys Val Gin Phe Gin Gly 360 365 370
GTATGAAAAA GTGAATGCTC TAG 1246
(2) INFORMATION FOR SEQ ID NO: 1066:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 370 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1066:
Met Leu Val He Ser Val Leu Ala Phe Asn Tyr Glu Lys Asn Glu Lys
1 5 10 15
He Lys Met He Arg Met Asp Met Asp Lys Met Ala Ser Lys He Ala
20 25 30
Ser Glu Val He Ala Leu His Met Gin Thr His Gly Asp Tyr Gin Asn
35 40 45
Ala Leu Asn Ala Leu He Ser Arg Tyr Lys Asp Ala Ser He Ala Leu
50 55 60
Phe Asp Ser Lys Lys Arg Val Leu Tyr Ser Asn He Pro Glu Ser Ala 65 70 75 80
Asn Leu He Lys Asn His Lys Glu Ala Gly Phe Phe Ser Phe Arg Gly
85 90 95
Glu Tyr Tyr Leu Leu Ser Asp Glu Thr Phe Ala His Leu Gly Val Ala
100 105 110
Lys Met Leu Phe Lys Asn Ser Lys Pro Leu His Phe Ser Ser Leu Tyr
115 120 125
Arg Asn He Val Leu Val Phe Val Val Ala Phe Leu Cys Val He Gly
130 135 140
Val Ser Val Phe Leu Gly Arg Leu Phe Leu Lys Pro He Arg Asn Glu 145 150 155 160
He Thr Arg He Asp His Phe Leu Lys Asn Thr Thr His Glu Leu Asn
165 170 175
Thr Pro Met Ser Ala Leu Val Leu Ser Leu Lys Thr Leu Glu Asp Asn
180 185 190
Gin Gin His Arg Arg He Lys He Ala He Gin Arg Met Ser Phe Leu
195 200 205
Tyr Arg Ser Leu Ser Tyr Leu Val Met Gin Asp He Glu Arg Glu Ser
210 215 220
Phe Val Leu Leu Asp Leu Lys Ala Leu He He Lys Glu Asn Thr Leu 225 230 235 240
Phe Ser Glu Met He Asp Tyr His Lys Leu Glu Phe Lys Ser Asp Leu
245 250 255
Val Glu Val Glu Leu Lys Ala Lys Glu Gin Asp Phe He Ser Leu Tyr
260 265 270
Ser Asn Leu Leu Met Asn Ala He Lys Tyr Ser Val Met Asn Gly Tyr
275 280 285
He His He Glu Leu Thr His Ala Phe Leu Lys Val Lys Asn Leu Gly 290 295 300
Tyr Glu He Pro Lys Asp Lys He Thr Glu Leu Ser Val Arg Tyr Val 305 310 315 320
Arg Phe Asn Ser Gly Val Leu Gly Tyr Gly He Gly Leu Gly Leu Val
325 330 335
Lys Lys Val Cys Glu Lys Tyr Lys Met Arg Leu Glu He His Ser Glu
340 345 350
Pro Ser Leu Lys Gly Ser Phe Tyr Glu Asn Ser Phe Cys Val Gin Phe
355 360 365
Gin Gly 370
(2) INFORMATION FOR SEQ ID NO: 1067:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 703 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 60...665 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1067: GGGATAGTTT TAAAAATGAT GAAGAGTTTT TAACATTTTC TTACGCTTGG ATTGATAAA 59
ATG CTG CCC AAA CTT AAA GAC ACA GGG AGT TTT TAT ATC TTT AAT ACC 107 Met Leu Pro Lys Leu Lys Asp Thr Gly Ser Phe Tyr He Phe Asn Thr 1 5 10 15
CCT TTT AAT TGC GCT TTA TTT TTA GCG TAT TTG CAC CAT AAA AAA GTG 155 Pro Phe Asn Cys Ala Leu Phe Leu Ala Tyr Leu His His Lys Lys Val 20 25 30
CAT TTT TTA AAT TTT ATC ACT TGG GTT AAA AAA GAT GGG TTT GCC AAC 203 His Phe Leu Asn Phe He Thr Trp Val Lys Lys Asp Gly Phe Ala Asn 35 40 45
GCC AAA AAG CGT TAT AAC CAC GCG CAA GAA AGC ATT TTA TTT TAT AGC 251 Ala Lys Lys Arg Tyr Asn His Ala Gin Glu Ser He Leu Phe Tyr Ser 50 55 60
ATG CAC AAG AAA AAC TAC ACC TTT AAT GCC GAT GAG ATT CGC ATC GCT 299 Met His Lys Lys Asn Tyr Thr Phe Asn Ala Asp Glu He Arg He Ala 65 70 75 80 TAT GAA TCC GCT GAA CGC ATC AAA CAT GCT CAA AGT AAG GGG ATT TTA 347 Tyr Glu Ser Ala Glu Arg He Lys His Ala Gin Ser Lys Gly He Leu 85 90 95
AAA AAT AAC AAA CGC TGG TTC CCT AAC CCT AAG GGC AAA TTA TGC CTT 395 Lys Asn Asn Lys Arg Trp Phe Pro Asn Pro Lys Gly Lys Leu Cys Leu 100 105 110
GAT GTG TGG GAA ATC ACT TCA CAA AGG CAT GTT GAA AAA GAG AAG GGT 443 Asp Val Trp Glu He Thr Ser Gin Arg His Val Glu Lys Glu Lys Gly 115 120 125
AAA ATC CTT AAG CCC AAA CAC CCC AGC ATC AAA CCT AAA GCG CTC ATT 491 Lys He Leu Lys Pro Lys His Pro Ser He Lys Pro Lys Ala Leu He 130 135 140
GAA CGC ATG ATA AAA GCT AGC TCT CAC AAA AAC GAT TTG ATT TTA GAT 539 Glu Arg Met He Lys Ala Ser Ser His Lys Asn Asp Leu He Leu Asp 145 150 155 160
TTG TTT AGC GGC AGT GGC ATG ACT AGC TTA GTG GCT AAA AGT TTG GAG 587 Leu Phe Ser Gly Ser Gly Met Thr Ser Leu Val Ala Lys Ser Leu Glu 165 170 175
CGT AAT TTT ATA GGG TGT GAG AGC CAT GCT GAA TAC GTG CAT GGG AGT 635 Arg Asn Phe He Gly Cys Glu Ser His Ala Glu Tyr Val His Gly Ser 180 185 190
TTG GAA ATG TTT AGG TAT AAT GAA TGC GAA TAAAAAAGGA TATTTGACAT GCCA 689 Leu Glu Met Phe Arg Tyr Asn Glu Cys Glu 195 200
AAATTAGAAA AAAT 703
(2) INFORMATION FOR SEQ ID NO: 1068:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 202 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1068:
Met Leu Pro Lys Leu Lys Asp Thr Gly Ser Phe Tyr He Phe Asn Thr
1 5 10 15
Pro Phe Asn Cys Ala Leu Phe Leu Ala Tyr Leu His His Lys Lys Val
20 25 30
His Phe Leu Asn Phe He Thr Trp Val Lys Lys Asp Gly Phe Ala Asn
35 40 45
Ala Lys Lys Arg Tyr Asn His Ala Gin Glu Ser He Leu Phe Tyr Ser
50 55 60
Met His Lys Lys Asn Tyr Thr Phe Asn Ala Asp Glu He Arg He Ala 65 70 75 80
Tyr Glu Ser Ala Glu Arg He Lys His Ala Gin Ser Lys Gly He Leu
85 90 95
Lys Asn Asn Lys Arg Trp Phe Pro Asn Pro Lys Gly Lys Leu Cys Leu
100 105 110
Asp Val Trp Glu He Thr Ser Gin Arg His Val Glu Lys Glu Lys Gly
115 120 125
Lys He Leu Lys Pro Lys His Pro Ser He Lys Pro Lys Ala Leu He
130 135 140
Glu Arg Met He Lys Ala Ser Ser His Lys Asn Asp Leu He Leu Asp 145 150 155 160
Leu Phe Ser Gly Ser Gly Met Thr Ser Leu Val Ala Lys Ser Leu Glu
165 170 175
Arg Asn Phe He Gly Cys Glu Ser His Ala Glu Tyr Val His Gly Ser
180 185 190
Leu Glu Met Phe Arg Tyr Asn Glu Cys Glu 195 200
(2) INFORMATION FOR SEQ ID NO: 1069:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1448 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 67...1404 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1069:
AGAGTTCTAG GGGCGTGGCG TATAAGTCAA GCGAATATTC TAGCGAAGAA AAACAAGAGG 60 AATAAC ATG AAC GAA ACG CTT TAT TGC AGT TTT TGC AAA AAA CCA GAA 108 Met Asn Glu Thr Leu Tyr Cys Ser Phe Cys Lys Lys Pro Glu 1 5 10
TCA AGA GAT CCC AAA AAA CGC CGC ATT ATT TTT GCG AGC AAT CTC AAT 156 Ser Arg Asp Pro Lys Lys Arg Arg He He Phe Ala Ser Asn Leu Asn 15 20 25 30
AAA GAT GTG TGC GTG TGC GAA TAT TGT ATA GAT GTG ATG CAT GGG GAA 204 Lys Asp Val Cys Val Cys Glu Tyr Cys He Asp Val Met His Gly Glu 35 40 45
TTG CAC AAA TAC GAC AAT TCT TTA TTG GCG CTC AAA AGA GAC CGA TTG 252 Leu His Lys Tyr Asp Asn Ser Leu Leu Ala Leu Lys Arg Asp Arg Leu 50 55 60
AGA AGA ATG GAA TCT AGC GCT TAT GAA GAA GAG TTT TTA CTC TCT TAC 300 Arg Arg Met Glu Ser Ser Ala Tyr Glu Glu Glu Phe Leu Leu Ser Tyr 65 70 75
ATT CCA GCC CCT AAA GAG CTT AAG GCG GTT TTA GAC AAT TAT GTG ATA 348 He Pro Ala Pro Lys Glu Leu Lys Ala Val Leu Asp Asn Tyr Val He 80 85 90
GGG CAA GAG CAG GCT AAA AAG GTT TTT TCC GTA GCC GTG TAT AAC CAT 396 Gly Gin Glu Gin Ala Lys Lys Val Phe Ser Val Ala Val Tyr Asn His 95 100 105 110
TAC AAA CGC TTA TCT TTT AAA GAA AAA CTC AAA AAA CAA GAC AAC CAA 444 Tyr Lys Arg Leu Ser Phe Lys Glu Lys Leu Lys Lys Gin Asp Asn Gin 115 120 125
GAC AGC AAT GTG GAG TTA GAG CAT TTA GAA GAA GTG GAG TTG AGC AAG 492 Asp Ser Asn Val Glu Leu Glu His Leu Glu Glu Val Glu Leu Ser Lys 130 135 140
TCT AAT ATT TTA CTA ATC GGC CCT ACA GGA TCA GGC AAA ACT TTA ATG 540 Ser Asn He Leu Leu He Gly Pro Thr Gly Ser Gly Lys Thr Leu Met 145 150 155
GCG CAA ACT CTG GCC AAG CAT TTG GAT ATT CCT ATC GCC ATT AGC GAT 588 Ala Gin Thr Leu Ala Lys His Leu Asp He Pro He Ala He Ser Asp 160 165 170
GCG ACT AGC TTG ACT GAA GCG GGC TAT GTG GGC GAA GAC GTG GAA AAT 636 Ala Thr Ser Leu Thr Glu Ala Gly Tyr Val Gly Glu Asp Val Glu Asn 175 180 185 190
ATT CTC ACA AGA TTG TTG CAA GCG AGC GAC TGG AAT GTC CAA AAA GCC 684 He Leu Thr Arg Leu Leu Gin Ala Ser Asp Trp Asn Val Gin Lys Ala 195 200 205
CAA AAA GGC ATT GTG TTT ATT GAT GAG ATT GAT AAA ATC AGC CGT TTG 732 Gin Lys Gly He Val Phe He Asp Glu He Asp Lys He Ser Arg Leu 210 215 220
TCA GAA AAC CGC TCT ATC ACT AGA GAT GTT TCT GGC GAG GGC GTT CAG 780 Ser Glu Asn Arg Ser He Thr Arg Asp Val Ser Gly Glu Gly Val Gin 225 230 235
CAA GCG TTG TTG AAA ATC GTT GAA GGT TCT TTA GTG AAT ATC CCC CCC 828 Gin Ala Leu Leu Lys He Val Glu Gly Ser Leu Val Asn He Pro Pro 240 245 250
AAA GGC GGC AGA AAG CAC CCT GAG GGC AAT TTC ATT CAA ATT GAC ACG 876 Lys Gly Gly Arg Lys His Pro Glu Gly Asn Phe He Gin He Asp Thr 255 260 265 270
AGC GAT ATT TTA TTC ATT TGT GCT GGA GCG TTT GAT GGG TTA GCT GAA 924 Ser Asp He Leu Phe He Cys Ala Gly Ala Phe Asp Gly Leu Ala Glu 275 280 285 ATC ATT AAA AAA CGC ACC ACG CAG AAT GTG TTG GGT TTC ACT CAA GAA 972 He He Lys Lys Arg Thr Thr Gin Asn Val Leu Gly Phe Thr Gin Glu 290 295 300
AAG ATG AGC AAA AAA GAG CAA GAA GCG ATC TTG CAT TTA GTC CAA ACC 1020 Lys Met Ser Lys Lys Glu Gin Glu Ala He Leu His Leu Val Gin Thr 305 310 315
CAT GAC CTG GTT ACT TAT GGG CTT ATC CCT GAG CTT ATT GGC CGT TTG 1068 His Asp Leu Val Thr Tyr Gly Leu He Pro Glu Leu He Gly Arg Leu 320 325 330
CCG GTT TTA AGC ACG CTA GAT AGC ATC AGT TTA GAA GCG ATG GTG GAT 1116 Pro Val Leu Ser Thr Leu Asp Ser He Ser Leu Glu Ala Met Val Asp 335 340 345 350
ATT TTA CAA AAA CCT AAA AAC GCT CTT ATC AAG CAA TAC CAG CAG CTT 1164 He Leu Gin Lys Pro Lys Asn Ala Leu He Lys Gin Tyr Gin Gin Leu 355 360 365
TTC AAA ATG GAT GAG GTG GAT TTG ATC TTT GAA GAA GAA GCC ATT AAA 1212 Phe Lys Met Asp Glu Val Asp Leu He Phe Glu Glu Glu Ala He Lys 370 375 380
GAA ATC GCT CAA CTC GCA TTA GAA AGA AAA ACC GGG GCT AGG GGC TTA 1260 Glu He Ala Gin Leu Ala Leu Glu Arg Lys Thr Gly Ala Arg Gly Leu 385 390 395
AGG GCG ATC ATT GAA GAT TTT TGT TTG GAT ATT ATG TTT GAT TTA CCC 1308 Arg Ala He He Glu Asp Phe Cys Leu Asp He Met Phe Asp Leu Pro 400 405 410
AAG CTT AAA GGA TCG GAA GTG CGT ATC ACT AAA GAT TGT GTT TTA AAA 1356 Lys Leu Lys Gly Ser Glu Val Arg He Thr Lys Asp Cys Val Leu Lys 415 420 425 430
CAG GCT GAA CCT TTG ATC ATT GCT AAA ACG CAT TCT AAA ATT CTT CCT T 1405 Gin Ala Glu Pro Leu He He Ala Lys Thr His Ser Lys He Leu Pro 435 440 445
AAGGAACACG CTTATAAATT TAACGATAAA GGATTAGAAA GGG 1448
(2) INFORMATION FOR SEQ ID NO: 1070:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 446 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1070: Met Asn Glu Thr Leu Tyr Cys Ser Phe Cys Lys Lys Pro Glu Ser Arg 1 5 10 15
Asp Pro Lys Lys Arg Arg He He Phe Ala Ser Asn Leu Asn Lys Asp
20 25 30
Val Cys Val Cys Glu Tyr Cys He Asp Val Met His Gly Glu Leu His
35 40 45
Lys Tyr Asp Asn Ser Leu Leu Ala Leu Lys Arg Asp Arg Leu Arg Arg
50 55 60
Met Glu Ser Ser Ala Tyr Glu Glu Glu Phe Leu Leu Ser Tyr He Pro 65 70 75 80
Ala Pro Lys Glu Leu Lys Ala Val Leu Asp Asn Tyr Val He Gly Gin
85 90 95
Glu Gin Ala Lys Lys Val Phe Ser Val Ala Val Tyr Asn His Tyr Lys
100 105 110
Arg Leu Ser Phe Lys Glu Lys Leu Lys Lys Gin Asp Asn Gin Asp Ser
115 120 125
Asn Val Glu Leu Glu His Leu Glu Glu Val Glu Leu Ser Lys Ser Asn
130 135 140
He Leu Leu He Gly Pro Thr Gly Ser Gly Lys Thr Leu Met Ala Gin 145 150 155 160
Thr Leu Ala Lys His Leu Asp He Pro He Ala He Ser Asp Ala Thr
165 170 175
Ser Leu Thr Glu Ala Gly Tyr Val Gly Glu Asp Val Glu Asn He Leu
180 185 190
Thr Arg Leu Leu Gin Ala Ser Asp Trp Asn Val Gin Lys Ala Gin Lys
195 200 205
Gly He Val Phe He Asp Glu He Asp Lys He Ser Arg Leu Ser Glu
210 215 220
Asn Arg Ser He Thr Arg Asp Val Ser Gly Glu Gly Val Gin Gin Ala 225 230 235 240
Leu Leu Lys He Val Glu Gly Ser Leu Val Asn He Pro Pro Lys Gly
245 250 255
Gly Arg Lys His Pro Glu Gly Asn Phe He Gin He Asp Thr Ser Asp
260 265 270
He Leu Phe He Cys Ala Gly Ala Phe Asp Gly Leu Ala Glu He He
275 280 285
Lys Lys Arg Thr Thr Gin Asn Val Leu Gly Phe Thr Gin Glu Lys Met
290 295 300
Ser Lys Lys Glu Gin Glu Ala He Leu His Leu Val Gin Thr His Asp 305 310 315 320
Leu Val Thr Tyr Gly Leu He Pro Glu Leu He Gly Arg Leu Pro Val
325 330 335
Leu Ser Thr Leu Asp Ser He Ser Leu Glu Ala Met Val Asp He Leu
340 345 350
Gin Lys Pro Lys Asn Ala Leu He Lys Gin Tyr Gin Gin Leu Phe Lys
355 360 365
Met Asp Glu Val Asp Leu He Phe Glu Glu Glu Ala He Lys Glu He
370 375 380
Ala Gin Leu Ala Leu Glu Arg Lys Thr Gly Ala Arg Gly Leu Arg Ala 385 390 395 400
He He Glu Asp Phe Cys Leu Asp He Met Phe Asp Leu Pro Lys Leu
405 410 415
Lys Gly Ser Glu Val Arg He Thr Lys Asp Cys Val Leu Lys Gin Ala
420 425 430
Glu Pro Leu He He Ala Lys Thr His Ser Lys He Leu Pro 435 440 445 (2) INFORMATION FOR SEQ ID NO: 1071:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 911 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...858 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1071:
AGTGTTAGAA AAAACTTTGC TTTGAAATTT GGC ATG AAA GCA GGC ATT ATT GGT 54
Met Lys Ala Gly He He Gly 1 5
TTA GGG CTT ATG GGG GGG AGT TTA GGG CTA GCC TTG CAA GAA TGG GGG 102 Leu Gly Leu Met Gly Gly Ser Leu Gly Leu Ala Leu Gin Glu Trp Gly 10 15 20
CGT TTT AAA AGC GTT ATA GGC TAT GAT CAT AAC GCT TTG CAT GCT AAA 150 Arg Phe Lys Ser Val He Gly Tyr Asp His Asn Ala Leu His Ala Lys 25 30 35
TTG GCT TTG ACT TTG GGG CTT GTA GAT GAA TGC GTG GGA TTT GAA AAG 198 Leu Ala Leu Thr Leu Gly Leu Val Asp Glu Cys Val Gly Phe Glu Lys 40 45 50 55
ATT TTA GAA TGC GAT GTG ATT TTT TTG GCC ATT CCG GTT GAG GGC ATC 246 He Leu Glu Cys Asp Val He Phe Leu Ala He Pro Val Glu Gly He 60 65 70
ATT GGA TGT CTG AAA AAA ATG ACC TCT ATC AAA AAA AGC GCG ACC ATT 294 He Gly Cys Leu Lys Lys Met Thr Ser He Lys Lys Ser Ala Thr He 75 80 85
ATT GAT TTA GGG GGC GCT AAA GCG CAA ATC ATT CGC AAT ATC CCT AAA 342 He Asp Leu Gly Gly Ala Lys Ala Gin He He Arg Asn He Pro Lys 90 95 100
AGC ATT CGT AAG AAT TTC ATC GCT GCG CAC CCC ATG TGC GGG ACA GAG 390 Ser He Arg Lys Asn Phe He Ala Ala His Pro Met Cys Gly Thr Glu 105 110 115
TTT TAT GGC CCT AAA GCG AGC GTT AAG GGG CTG TAT GAA AAC GCT CTA 438 Phe Tyr Gly Pro Lys Ala Ser Val Lys Gly Leu Tyr Glu Asn Ala Leu 120 125 130 135 GTG ATA TTG TGC GAT TTA GAA GAT TCA GGG ACT GAG CAA GTA GAG ATC 486 Val He Leu Cys Asp Leu Glu Asp Ser Gly Thr Glu Gin Val Glu He 140 145 150
GCT AAA GAA ATC TTT TTA GGC GTT AAA GCG CGC TTG ATT AAA ATG AAA 534 Ala Lys Glu He Phe Leu Gly Val Lys Ala Arg Leu He Lys Met Lys 155 160 165
TCC AAT GAG CAT GAC ACC CAT GTG GCT TAT ATC AGC CAT TTA CCC CAT 582 Ser Asn Glu His Asp Thr His Val Ala Tyr He Ser His Leu Pro His 170 175 180
GTT TTG AGC TAT GCG TTA GCC AAT AGC GTT TTA AAG CAA AAC GAC CCA 630 Val Leu Ser Tyr Ala Leu Ala Asn Ser Val Leu Lys Gin Asn Asp Pro 185 190 195
GAG ATG ATT TTA TCT TTA GCG GGT GGG GGT TTT AGG GAT ATG AGC CGT 678 Glu Met He Leu Ser Leu Ala Gly Gly Gly Phe Arg Asp Met Ser Arg 200 205 210 215
CTG TCC AAA AGC TCG CCT TTA ATG TGG AAA GAT ATT TTC AAA CAA AAC 726 Leu Ser Lys Ser Ser Pro Leu Met Trp Lys Asp He Phe Lys Gin Asn 220 225 230
CGA GAC AAT GTC TTA GAA GCG ATT AAA AAA TGC GAA AAA GAA ATC GTG 774 Arg Asp Asn Val Leu Glu Ala He Lys Lys Cys Glu Lys Glu He Val 235 240 245
CAA GCT AAG GCG TGG ATA GAA AAT AAC GAT TAT GAA AGC CTT GCA GAA 822 Gin Ala Lys Ala Trp He Glu Asn Asn Asp Tyr Glu Ser Leu Ala Glu 250 255 260
TGG ATG GCG CAA GCG AAC AAA CTC CAG GAG TTC ATG TAAAGTAAAA TGATGT 874 Trp Met Ala Gin Ala Asn Lys Leu Gin Glu Phe Met 265 270 275
AAAATAATTT AAAATTTTTT ATATTGTTGT TTTTAGG 911
(2) INFORMATION FOR SEQ ID NO-.1072:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 275 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1072:
Met Lys Ala Gly He He Gly Leu Gly Leu Met Gly Gly Ser Leu Gly
1 5 10 15
Leu Ala Leu Gin Glu Trp Gly Arg Phe Lys Ser Val He Gly Tyr Asp
20 25 30
His Asn Ala Leu His Ala Lys Leu Ala Leu Thr Leu Gly Leu Val Asp 35 40 45
Glu Cys Val Gly Phe Glu Lys He Leu Glu Cys Asp Val He Phe Leu
50 55 60
Ala He Pro Val Glu Gly He He Gly Cys Leu Lys Lys Met Thr Ser 65 70 75 80
He Lys Lys Ser Ala Thr He He Asp Leu Gly Gly Ala Lys Ala Gin
85 90 95
He He Arg Asn He Pro Lys Ser He Arg Lys Asn Phe He Ala Ala
100 105 110
His Pro Met Cys Gly Thr Glu Phe Tyr Gly Pro Lys Ala Ser Val Lys
115 120 125
Gly Leu Tyr Glu Asn Ala Leu Val He Leu Cys Asp Leu Glu Asp Ser
130 135 140
Gly Thr Glu Gin Val Glu He Ala Lys Glu He Phe Leu Gly Val Lys 145 150 155 160
Ala Arg Leu He Lys Met Lys Ser Asn Glu His Asp Thr His Val Ala
165 170 175
Tyr He Ser His Leu Pro His Val Leu Ser Tyr Ala Leu Ala Asn Ser
180 185 190
Val Leu Lys Gin Asn Asp Pro Glu Met He Leu Ser Leu Ala Gly Gly
195 200 205
Gly Phe Arg Asp Met Ser Arg Leu Ser Lys Ser Ser Pro Leu Met Trp
210 215 220
Lys Asp He Phe Lys Gin Asn Arg Asp Asn Val Leu Glu Ala He Lys 225 230 235 240
Lys Cys Glu Lys Glu He Val Gin Ala Lys Ala Trp He Glu Asn Asn
245 250 255
Asp Tyr Glu Ser Leu Ala Glu Trp Met Ala Gin Ala Asn Lys Leu Gin
260 265 270
Glu Phe Met 275
(2) INFORMATION FOR SEQ ID NO: 1073:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 304 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 73...267 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1073:
AATAATTTAA AATTTTTTAT ATTGTTGTTT TTAGGGGTGC GAGGAGCGAA ATGGGGTATT 60 TGGATTGTTT TT ATG GAT TAT AGG CTG TTT CAT ATG GAT AGC ATG GAT TTA 111 Met Asp Tyr Arg Leu Phe His Met Asp Ser Met Asp Leu 1 5 10 CCC AGC AAC CAG CAA ACA ACC ATA AGA GAT TAT CTT AAA CCC GGA TCT 159 Pro Ser Asn Gin Gin Thr Thr He Arg Asp Tyr Leu Lys Pro Gly Ser 15 20 25
ATT GTT GTG TTT GCC ATA ATT GTA ATA ATA ATT TCA TCT CAT TTC TCC 207 He Val Val Phe Ala He He Val He He He Ser Ser His Phe Ser 30 35 40 45
AAC GCC TAT AAA ACC CTT ATC GCT TCT AAT AAA AAA CCA GTT TTA AGC 255 Asn Ala Tyr Lys Thr Leu He Ala Ser Asn Lys Lys Pro Val Leu Ser 50 55 60
CAT TTA GAA ATT TGATTTCTTA AACCTTTTTA TCAAAAATAC CGGTGTT 304
His Leu Glu He 65
(2) INFORMATION FOR SEQ ID NO: 1074:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1074:
Met Asp Tyr Arg Leu Phe His Met Asp Ser Met Asp Leu Pro Ser Asn
1 5 10 15
Gin Gin Thr Thr He Arg Asp Tyr Leu Lys Pro Gly Ser He Val Val
20 25 30
Phe Ala He He Val He He He Ser Ser His Phe Ser Asn Ala Tyr
35 40 45
Lys Thr Leu He Ala Ser Asn Lys Lys Pro Val Leu Ser His Leu Glu
50 55 60
He 65
(2) INFORMATION FOR SEQ ID NO: 1075:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 271 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...237 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1075:
AGCTCACATT TTAGAAAAAT ATTTAAAGGG AGA ATG ATG CAA AAT AGC GTT AAA 54
Met Met Gin Asn Ser Val Lys 1 5
AAA TTA GAA TAT GAA GAG CGT TTC AAT GAC GCT CTT TTG AAA TTA CAA 102 Lys Leu Glu Tyr Glu Glu Arg Phe Asn Asp Ala Leu Leu Lys Leu Gin 10 15 20
GCA TGC CAA GAA GAA AAG CAG GTA ACG AGT TGT TTG AAA TGC GAG CAG 150 Ala Cys Gin Glu Glu Lys Gin Val Thr Ser Cys Leu Lys Cys Glu Gin 25 30 35
GTT TTG AAT TGC AAG ATC CGC AAC AGC TAT GTG GAT GCG GCT TAT GAG 198 Val Leu Asn Cys Lys He Arg Asn Ser Tyr Val Asp Ala Ala Tyr Glu 40 45 50 55
AGC ATG AGT TTA GGC GAA CGG GGC GGG TTT GAT TTC AAT TAAATGGGAT TA 249 Ser Met Ser Leu Gly Glu Arg Gly Gly Phe Asp Phe Asn 60 65
AAATGGCTAG TAATACTACC TT 271
(2) INFORMATION FOR SEQ ID NO: 1076:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 68 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1076:
Met Met Gin Asn Ser Val Lys Lys Leu Glu Tyr Glu Glu Arg Phe Asn
1 5 10 15
Asp Ala Leu Leu Lys Leu Gin Ala Cys Gin Glu Glu Lys Gin Val Thr
20 25 30
Ser Cys Leu Lys Cys Glu Gin Val Leu Asn Cys Lys He Arg Asn Ser
35 40 45
Tyr Val Asp Ala Ala Tyr Glu Ser Met Ser Leu Gly Glu Arg Gly Gly
50 55 60
Phe Asp Phe Asn 65
(2) INFORMATION FOR SEQ ID NO: 1077:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 572 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 27...524 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1077:
AAAAGGCTTT TTAAAAGGAC ACACCA ATG AGC GAA CCA TTA GAA ACA TTA GAC 53
Met Ser Glu Pro Leu Glu Thr Leu Asp
1 5
AAG GAT AAA CAA GCT ATG AGT GAA GCA ATT AAA AAA GAT ATT GAA AAA 101 Lys Asp Lys Gin Ala Met Ser Glu Ala He Lys Lys Asp He Glu Lys 10 15 20 25
GAC AAA GAA AAC CTC GCA CGA GTC AAA GCA GAC AAA AAA GTC AAA GCC 149 Asp Lys Glu Asn Leu Ala Arg Val Lys Ala Asp Lys Lys Val Lys Ala 30 35 40
GAT GAA AGT GAA AAA GGC TAC GAA AAA GAC GAT GAC AAA AAA GCC GAG 197 Asp Glu Ser Glu Lys Gly Tyr Glu Lys Asp Asp Asp Lys Lys Ala Glu 45 50 55
AAT CTT GAC AAA GAA ATC GCT AAA GAC AAA GCT AGC CCT AAC GAT AAT 245 Asn Leu Asp Lys Glu He Ala Lys Asp Lys Ala Ser Pro Asn Asp Asn 60 65 70
GAG CTT TAT GAA GAG GAC GAT AGA GTT AAA CGA GAC AAA GAA AGA GAC 293 Glu Leu Tyr Glu Glu Asp Asp Arg Val Lys Arg Asp Lys Glu Arg Asp 75 80 85
GAT GCC TTG CGT GAT AAA GAA AAA GCC AAA GAT GAC GCA TGC ATG GTA 341 Asp Ala Leu Arg Asp Lys Glu Lys Ala Lys Asp Asp Ala Cys Met Val 90 95 100 105
AGA GCG GAC GAT GAC ACC ATA GAG GAC GAT GAG GAA TAT GGT GAT GAT 389 Arg Ala Asp Asp Asp Thr He Glu Asp Asp Glu Glu Tyr Gly Asp Asp 110 115 120
GAT AAG TTA AGA GAC GAA ATA CTC GGT GTT ATG GAG GAG TTA TGC GAT 437 Asp Lys Leu Arg Asp Glu He Leu Gly Val Met Glu Glu Leu Cys Asp 125 130 135
ACC CTT AAT GAT AAC CTT AAC TTC AAA AAA GTC GTC TGT ATG GGC GGT 485 Thr Leu Asn Asp Asn Leu Asn Phe Lys Lys Val Val Cys Met Gly Gly 140 145 150
AAG GTT TCA ATT GCG TTC AAA TTT CTA ATT TTT TGC TCT TAATCTTTTA GA 536 Lys Val Ser He Ala Phe Lys Phe Leu He Phe Cys Ser 155 160 165
AAAAATTCAA ACTCTAAGGA TCTATCTTTT CGTTAG 572
(2) INFORMATION FOR SEQ ID NO: 1078:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 166 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1078:
Met Ser Glu Pro Leu Glu Thr Leu Asp Lys Asp Lys Gin Ala Met Ser
1 5 10 15
Glu Ala He Lys Lys Asp He Glu Lys Asp Lys Glu Asn Leu Ala Arg
20 25 30
Val Lys Ala Asp Lys Lys Val Lys Ala Asp Glu Ser Glu Lys Gly Tyr
35 40 45
Glu Lys Asp Asp Asp Lys Lys Ala Glu Asn Leu Asp Lys Glu He Ala
50 55 60
Lys Asp Lys Ala Ser Pro Asn Asp Asn Glu Leu Tyr Glu Glu Asp Asp 65 70 75 80
Arg Val Lys Arg Asp Lys Glu Arg Asp Asp Ala Leu Arg Asp Lys Glu
85 90 95
Lys Ala Lys Asp Asp Ala Cys Met Val Arg Ala Asp Asp Asp Thr He
100 105 110
Glu Asp Asp Glu Glu Tyr Gly Asp Asp Asp Lys Leu Arg Asp Glu He
115 120 125
Leu Gly Val Met Glu Glu Leu Cys Asp Thr Leu Asn Asp Asn Leu Asn
130 135 140
Phe Lys Lys Val Val Cys Met Gly Gly Lys Val Ser He Ala Phe Lys 145 150 155 160
Phe Leu He Phe Cys Ser 165
(2) INFORMATION FOR SEQ ID NO: 1079:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2327 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 82...2283 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1079:
CAACAAGCTA ACCAAAGCAT TGAAGAAGCT TTACAGAATG TCCCGGGTCT GCAAATTAGG 60 AATGCGACAG GTGTAGGGGC T ATG CCT ACT ATC CAA ATC CGT GGC TTT GGA 111
Met Pro Thr He Gin He Arg Gly Phe Gly 1 5 10
GCT GGG GGT TCA GGG CAT AGC GAT GCG ACG CTG ATG TTA GTC AAT GGT 159 Ala Gly Gly Ser Gly His Ser Asp Ala Thr Leu Met Leu Val Asn Gly 15 20 25
ATT CCT GTT TAT ATG GCC CCC TAC GCT CAC ATT GAG CTA GAC ATT TTC 207 He Pro Val Tyr Met Ala Pro Tyr Ala His He Glu Leu Asp He Phe 30 35 40
CCC GTT ACC TTT CAA GCC ATT GAT CGC ATT GAT GTG ATC AAG GGT GGA 255 Pro Val Thr Phe Gin Ala He Asp Arg He Asp Val He Lys Gly Gly 45 50 55
GGC AGC GTG CAA TAC GGG CCT AAC ACT TAT GGG GGT ATT GTC AAT ATC 303 Gly Ser Val Gin Tyr Gly Pro Asn Thr Tyr Gly Gly He Val Asn He 60 65 70
ATC ACT AAG CCT ATC CCT AAT CAA TGG GAA AAC CAA GCG GCT GAA AGG 351 He Thr Lys Pro He Pro Asn Gin Trp Glu Asn Gin Ala Ala Glu Arg 75 80 85 90
ATC ACT TAT TGG GCT AAG GCT AGA AAC GCT GGG TTT GCC GCT CCT CCT 399 He Thr Tyr Trp Ala Lys Ala Arg Asn Ala Gly Phe Ala Ala Pro Pro 95 100 105
GAT AAA ACC GGC GAT CCT TCT TTC ATC AAG TCT TTA GGC AAC AAC CTC 447 Asp Lys Thr Gly Asp Pro Ser Phe He Lys Ser Leu Gly Asn Asn Leu 110 115 120
CTC TAT AAC ACT TAT GTG AGG AGT GGA GGG ATG ATC AAT AAG CAT GTG 495 Leu Tyr Asn Thr Tyr Val Arg Ser Gly Gly Met He Asn Lys His Val 125 130 135
GGT ATC CAA GCG CAA GCT AAC TGG GTT AGA GGA CAA GGC TTT AGG GAC 543 Gly He Gin Ala Gin Ala Asn Trp Val Arg Gly Gin Gly Phe Arg Asp 140 145 150
AAT AGC CCC TCT AAC ATT TCA AAC TAT TGG CTA GAT GGA GTC TAT GAC 591 Asn Ser Pro Ser Asn He Ser Asn Tyr Trp Leu Asp Gly Val Tyr Asp 155 160 165 170
ATC AAT GAA AAC AAT GGG ATT AAA GCC TAT TAC CAA TAC TAC GAT TTT 639 He Asn Glu Asn Asn Gly He Lys Ala Tyr Tyr Gin Tyr Tyr Asp Phe 175 180 185
GCT ATC GCT CAA CCA GGA TCA CTC AGC GAG CAA GAT TAC AAA ATA AAC 687 Ala He Ala Gin Pro Gly Ser Leu Ser Glu Gin Asp Tyr Lys He Asn 190 195 200 CGC TTC GCT AAT TTG CGC CCC TTA AAC CAA AAA GGC GGG CGT TCA CAA 735 Arg Phe Ala Asn Leu Arg Pro Leu Asn Gin Lys Gly Gly Arg Ser Gin 205 210 215
CGC TTT GGG GCT GTG TAT GAA AAC CGC TTC GGG GAT TTA GAC AAA GTG 783 Arg Phe Gly Ala Val Tyr Glu Asn Arg Phe Gly Asp Leu Asp Lys Val 220 225 230
GGC GGG ACT TTT AGC TTC ACT TAC TAT GGG CAG TTG ATG ACT AGG GAT 831 Gly Gly Thr Phe Ser Phe Thr Tyr Tyr Gly Gin Leu Met Thr Arg Asp 235 240 245 250
TTT CAA GTG AGC TCT AGC TAC AAT AGC GCT AAC ATG GTT ACT TGT TTT 879 Phe Gin Val Ser Ser Ser Tyr Asn Ser Ala Asn Met Val Thr Cys Phe 255 260 265
AGC GAA GCG GCA TGC AGG GCG GCA GGA CTT CCG GCA GGG TAT AAC TTG 927 Ser Glu Ala Ala Cys Arg Ala Ala Gly Leu Pro Ala Gly Tyr Asn Leu 270 275 280
GCT GTG CCT TAT TAT GCC ACT AAC TAC AAT GGC TGG GCA GAA GTA GAA 975 Ala Val Pro Tyr Tyr Ala Thr Asn Tyr Asn Gly Trp Ala Glu Val Glu 285 290 295
AAC CCT GTG CGC TCC ATT AAC AAC GCT TTT GAG CCT AAA GTG AAT TTG 1023 Asn Pro Val Arg Ser He Asn Asn Ala Phe Glu Pro Lys Val Asn Leu 300 305 310
ATC GTC AAT ACC GGG AAA GTC AAG CAA ACC TTT ATC ATG GGC TTG CGT 1071 He Val Asn Thr Gly Lys Val Lys Gin Thr Phe He Met Gly Leu Arg 315 320 325 330
TTC ATG ACC ACC ACT TTT TTA CAG CGC CAA TAC TTA AAC ACC AAT GAA 1119 Phe Met Thr Thr Thr Phe Leu Gin Arg Gin Tyr Leu Asn Thr Asn Glu 335 340 345
TGC GCC ACC AAA ACG AGC GGT GAG GGG GCA GGA TTC TTG TGT GAG GGC 1167 Cys Ala Thr Lys Thr Ser Gly Glu Gly Ala Gly Phe Leu Cys Glu Gly 350 355 360
GCT AAT GTG ATG AGC GGT TGG AAA CCT CAC ATC AAG CAT GGC GTT TAT 1215 Ala Asn Val Met Ser Gly Trp Lys Pro His He Lys His Gly Val Tyr 365 370 375
AGA AAC TGG AAT AAC TGG CGT AAC AAT TAC ACA GCG GTT TAT TTG AGC 1263 Arg Asn Trp Asn Asn Trp Arg Asn Asn Tyr Thr Ala Val Tyr Leu Ser 380 385 390
GAT CGC ATT GAA GCT TGG GAT GGG CGC TTT TTC ATC GTG CCT GGT TTG 1311 Asp Arg He Glu Ala Trp Asp Gly Arg Phe Phe He Val Pro Gly Leu 395 400 405 410
CGC TAC GCT TTT GTG CAA TAC AAC AAC GAA AAT GCG TCT AAC TGG ATG 1359 Arg Tyr Ala Phe Val Gin Tyr Asn Asn Glu Asn Ala Ser Asn Trp Met 415 420 425 CAA ATC CCT GAG AAG GAT TTA AGA AAA ATC AAG CAC ATG AAC AAT TGG 1407 Gin He Pro Glu Lys Asp Leu Arg Lys He Lys His Met Asn Asn Trp 430 435 440
ATG CCC TCA ACC AAC ATT GGC TTT ATC CCC GTG CAA GGC GAT CAC AAT 1455 Met Pro Ser Thr Asn He Gly Phe He Pro Val Gin Gly Asp His Asn 445 450 455
GTG CTT ACC TAC TTT AAC TAC CAA CGC TCT TTC GTC CCG CCT CAA TTA 1503 Val Leu Thr Tyr Phe Asn Tyr Gin Arg Ser Phe Val Pro Pro Gin Leu 460 465 470
GAC GTT TTG AGC TAT GGA GGA GCG GAG TAT TTT ACC CAG CAC TTT GAC 1551 Asp Val Leu Ser Tyr Gly Gly Ala Glu Tyr Phe Thr Gin His Phe Asp 475 480 485 490
ACG GTG GAA GCA GGA GCG CGC TAC ACC TAT AAG GAT AAA TTC AGC TTC 1599 Thr Val Glu Ala Gly Ala Arg Tyr Thr Tyr Lys Asp Lys Phe Ser Phe 495 500 505
AAT GCG GAC TAC TTC AGG ATT TGG GCG CGC GAT TTT GCC ACC GGG CAG 1647 Asn Ala Asp Tyr Phe Arg He Trp Ala Arg Asp Phe Ala Thr Gly Gin 510 515 520
TAT TCA GTC TAT ACA AGC GGT CCC ATG AAG GGT AAT GTG CGC CCC ATT 1695 Tyr Ser Val Tyr Thr Ser Gly Pro Met Lys Gly Asn Val Arg Pro He 525 530 535
AAT GGC TAT TCT CAA GGC GTG GAG CTG GAA TTG TAT TAC AGG CCT ATT 1743 Asn Gly Tyr Ser Gin Gly Val Glu Leu Glu Leu Tyr Tyr Arg Pro He 540 545 550
AGA GGG TTG CAA TTC CAT GCC GCT TTC AAC TAC ATT GAC ACT CGT GTA 1791 Arg Gly Leu Gin Phe His Ala Ala Phe Asn Tyr He Asp Thr Arg Val 555 560 565 570
ACC AGC CAT GGC CCT TTA ACC GAC TTG AAC GGG GAT GTG CTA AAA GGG 1839 Thr Ser His Gly Pro Leu Thr Asp Leu Asn Gly Asp Val Leu Lys Gly 575 580 585
ACT AGC TAT AAC AAG CAT TTC CCT TTT GTA AGC CCT TTC CAA TTC ATT 1887 Thr Ser Tyr Asn Lys His Phe Pro Phe Val Ser Pro Phe Gin Phe He 590 595 600
CTT GAC GCT CGT TAC AAT TGG CGT AAA ACC ACC ATC GGT ATT TCT AGC 1935 Leu Asp Ala Arg Tyr Asn Trp Arg Lys Thr Thr He Gly He Ser Ser 605 610 615
TAT TTT TAC AGC CGT GCT TAT AGC GGG ATT AGC AAC AGT GCA GCA GGA 1983 Tyr Phe Tyr Ser Arg Ala Tyr Ser Gly He Ser Asn Ser Ala Ala Gly 620 625 630
GGC TAT TAT GGG ATG CAA TAT TAT AGT GGG GGG AAC AAC TAT GAA AGC 2031 Gly Tyr Tyr Gly Met Gin Tyr Tyr Ser Gly Gly Asn Asn Tyr Glu Ser 635 640 645 650 GTT CTT AAT AGC GGT TAT CAA TGC GAA GCT TGG TGT ATG ACC CAA CAT 2079 Val Leu Asn Ser Gly Tyr Gin Cys Glu Ala Trp Cys Met Thr Gin His 655 660 665
GAA GGG CTC TTG CCT TGG TAT TGG GTG TGG AAT ATC CAA GTG AGC CAA 2127 Glu Gly Leu Leu Pro Trp Tyr Trp Val Trp Asn He Gin Val Ser Gin 670 675 680
ATT TTC TGG GAA AAC GGG AGA CAC AGA GTT ACA GGA AGC TTG CAA ATC 2175 He Phe Trp Glu Asn Gly Arg His Arg Val Thr Gly Ser Leu Gin He 685 690 695
AAT AAT ATC TTC AAC ATG AAG TAT TAT TTT ACA GGG ATT GGC TCT AGC 2223 Asn Asn He Phe Asn Met Lys Tyr Tyr Phe Thr Gly He Gly Ser Ser 700 705 710
CCT GCA GGC TTG CAA CCT GCG CCT GGA AGA TCG GTT ACA GCG TAT TTG 2271 Pro Ala Gly Leu Gin Pro Ala Pro Gly Arg Ser Val Thr Ala Tyr Leu 715 720 725 730
AAC TAC ACT TTC TAAAGGCTTT AAAAAGGAGG GGGTTATTGC GCGATGATGA GCCG 2327 Asn Tyr Thr Phe
(2) INFORMATION FOR SEQ ID NO: 1080:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 734 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1080:
Met Pro Thr He Gin He Arg Gly Phe Gly Ala Gly Gly Ser Gly His
1 5 10 15
Ser Asp Ala Thr Leu Met Leu Val Asn Gly He Pro Val Tyr Met Ala
20 25 30
Pro Tyr Ala His He Glu Leu Asp He Phe Pro Val Thr Phe Gin Ala
35 40 45
He Asp Arg He Asp Val He Lys Gly Gly Gly Ser Val Gin Tyr Gly
50 55 60
Pro Asn Thr Tyr Gly Gly He Val Asn He He Thr Lys Pro He Pro 65 70 75 80
Asn Gin Trp Glu Asn Gin Ala Ala Glu Arg He Thr Tyr Trp Ala Lys
85 90 95
Ala Arg Asn Ala Gly Phe Ala Ala Pro Pro Asp Lys Thr Gly Asp Pro
100 105 110
Ser Phe He Lys Ser Leu Gly Asn Asn Leu Leu Tyr Asn Thr Tyr Val
115 120 125
Arg Ser Gly Gly Met He Asn Lys His Val Gly He Gin Ala Gin Ala 130 135 140 Asn Trp Val Arg Gly Gin Gly Phe Arg Asp Asn Ser Pro Ser Asn He 145 150 155 160
Ser Asn Tyr Trp Leu Asp Gly Val Tyr Asp He Asn Glu Asn Asn Gly
165 170 175
He Lys Ala Tyr Tyr Gin Tyr Tyr Asp Phe Ala He Ala Gin Pro Gly
180 185 190
Ser Leu Ser Glu Gin Asp Tyr Lys He Asn Arg Phe Ala Asn Leu Arg
195 200 205
Pro Leu Asn Gin Lys Gly Gly Arg Ser Gin Arg Phe Gly Ala Val Tyr
210 215 220
Glu Asn Arg Phe Gly Asp Leu Asp Lys Val Gly Gly Thr Phe Ser Phe 225 230 235 240
Thr Tyr Tyr Gly Gin Leu Met Thr Arg Asp Phe Gin Val Ser Ser Ser
245 250 255
Tyr Asn Ser Ala Asn Met Val Thr Cys Phe Ser Glu Ala Ala Cys Arg
260 265 270
Ala Ala Gly Leu Pro Ala Gly Tyr Asn Leu Ala Val Pro Tyr Tyr Ala
275 280 285
Thr Asn Tyr Asn Gly Trp Ala Glu Val Glu Asn Pro Val Arg Ser He
290 295 300
Asn Asn Ala Phe Glu Pro Lys Val Asn Leu He Val Asn Thr Gly Lys 305 310 315 320
Val Lys Gin Thr Phe He Met Gly Leu Arg Phe Met Thr Thr Thr Phe
325 330 335
Leu Gin Arg Gin Tyr Leu Asn Thr Asn Glu Cys Ala Thr Lys Thr Ser
340 345 350
Gly Glu Gly Ala Gly Phe Leu Cys Glu Gly Ala Asn Val Met Ser Gly
355 360 365
Trp Lys Pro His He Lys His Gly Val Tyr Arg Asn Trp Asn Asn Trp
370 375 380
Arg Asn Asn Tyr Thr Ala Val Tyr Leu Ser Asp Arg He Glu Ala Trp 385 390 395 400
Asp Gly Arg Phe Phe He Val Pro Gly Leu Arg Tyr Ala Phe Val Gin
405 410 415
Tyr Asn Asn Glu Asn Ala Ser Asn Trp Met Gin He Pro Glu Lys Asp
420 425 430
Leu Arg Lys He Lys His Met Asn Asn Trp Met Pro Ser Thr Asn He
435 440 445
Gly Phe He Pro Val Gin Gly Asp His Asn Val Leu Thr Tyr Phe Asn
450 455 460
Tyr Gin Arg Ser Phe Val Pro Pro Gin Leu Asp Val Leu Ser Tyr Gly 465 470 475 480
Gly Ala Glu Tyr Phe Thr Gin His Phe Asp Thr Val Glu Ala Gly Ala
485 490 495
Arg Tyr Thr Tyr Lys Asp Lys Phe Ser Phe Asn Ala Asp Tyr Phe Arg
500 505 510
He Trp Ala Arg Asp Phe Ala Thr Gly Gin Tyr Ser Val Tyr Thr Ser
515 520 525
Gly Pro Met Lys Gly Asn Val Arg Pro He Asn Gly Tyr Ser Gin Gly
530 535 540
Val Glu Leu Glu Leu Tyr Tyr Arg Pro He Arg Gly Leu Gin Phe His 545 550 555 560
Ala Ala Phe Asn Tyr He Asp Thr Arg Val Thr Ser His Gly Pro Leu
565 570 575
Thr Asp Leu Asn Gly Asp Val Leu Lys Gly Thr Ser Tyr Asn Lys His 580 585 590
Phe Pro Phe Val Ser Pro Phe Gin Phe He Leu Asp Ala Arg Tyr Asn
595 600 605
Trp Arg Lys Thr Thr He Gly He Ser Ser Tyr Phe Tyr Ser Arg Ala
610 615 620
Tyr Ser Gly He Ser Asn Ser Ala Ala Gly Gly Tyr Tyr Gly Met Gin 625 630 635 640
Tyr Tyr Ser Gly Gly Asn Asn Tyr Glu Ser Val Leu Asn Ser Gly Tyr
645 650 655
Gin Cys Glu Ala Trp Cys Met Thr Gin His Glu Gly Leu Leu Pro Trp
660 665 670
Tyr Trp Val Trp Asn He Gin Val Ser Gin He Phe Trp Glu Asn Gly
675 680 685
Arg His Arg Val Thr Gly Ser Leu Gin He Asn Asn He Phe Asn Met
690 695 700
Lys Tyr Tyr Phe Thr Gly He Gly Ser Ser Pro Ala Gly Leu Gin Pro 705 710 715 720
Ala Pro Gly Arg Ser Val Thr Ala Tyr Leu Asn Tyr Thr Phe 725 730
(2) INFORMATION FOR SEQ ID NO: 1081:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 232 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 37...204 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1081:
CATTGGAGAT TGTGCCATTC TTTGATTTTA TCTAAA ATG TCT TTA GGG GCA GTG 54
Met Ser Leu Gly Ala Val 1 5
ATT AAG CTT ATT TTT TGT TAT AAA TTA GAG GGG GTA ATA TTA GAT TTA 102 He Lys Leu He Phe Cys Tyr Lys Leu Glu Gly Val He Leu Asp Leu 10 15 20
AAG CGC ATC AAT TTC AAA TCC TAT TAT CCC AAT AAT AAA AAT GCA TTA 150 Lys Arg He Asn Phe Lys Ser Tyr Tyr Pro Asn Asn Lys Asn Ala Leu 25 30 35
TTT ATC AAC AAT AAA AAA ATC CAT TAT CTA GTG CCT CAA AGG TTC ATA 198 Phe He Asn Asn Lys Lys He His Tyr Leu Val Pro Gin Arg Phe He 40 45 50 TTG CTT TAAACTTGCT ATGGACGATT AGAAATCG 232
Leu Leu
55
(2) INFORMATION FOR SEQ ID NO: 1082:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 56 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1082:
Met Ser Leu Gly Ala Val He Lys Leu He Phe Cys Tyr Lys Leu Glu
1 5 10 15
Gly Val He Leu Asp Leu Lys Arg He Asn Phe Lys Ser Tyr Tyr Pro
20 25 30
Asn Asn Lys Asn Ala Leu Phe He Asn Asn Lys Lys He His Tyr Leu
35 40 45
Val Pro Gin Arg Phe He Leu Leu 50 55
(2) INFORMATION FOR SEQ ID NO: 1083:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1142 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 24...1094 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1083:
CTTTTAGAAT TAGGGCTTAA AGC ATG AAA GCT AGT ATT TAT GAT TTC ACT CTA 53
Met Lys Ala Ser He Tyr Asp Phe Thr Leu 1 5 10
AAG GAA TTG AGC CAG CTT TTA AAA CCA AGC TTT AGG GCT AAA CAG CTT 101 Lys Glu Leu Ser Gin Leu Leu Lys Pro Ser Phe Arg Ala Lys Gin Leu 15 20 25
TAT TTG TGG CTC TAT GCG AAG TAT AAA ACA AGC TTT AAG GAC ATG CAA 149 Tyr Leu Trp Leu Tyr Ala Lys Tyr Lys Thr Ser Phe Lys Asp Met Gin 30 35 40
AAT AAT TTT TCA AAA GAT TTT ATC GCT TAT TTG GAG CGA GAA TTT GCT 197 Asn Asn Phe Ser Lys Asp Phe He Ala Tyr Leu Glu Arg Glu Phe Ala 45 50 55
TTG CGC ACG ATA GAA ATC ACG CAT GTG AGG GAG AGC GTT GAT GGC TCT 245 Leu Arg Thr He Glu He Thr His Val Arg Glu Ser Val Asp Gly Ser 60 65 70
AAA AAA TAC CTT TTT AAA TCT TTA AGA GAC AAC CAC ACT TTT GAA GCG 293 Lys Lys Tyr Leu Phe Lys Ser Leu Arg Asp Asn His Thr Phe Glu Ala 75 80 85 90
GTG TTG TTG AAA ATG AAG GAT AAA AAG ATT GAT GCA GAA ACG AAC GCT 341 Val Leu Leu Lys Met Lys Asp Lys Lys He Asp Ala Glu Thr Asn Ala 95 100 105
ATT TTA GAG AGG GAA AAA TAC ACC GTA TGC GTG TCT TGT CAA ATC GGC 389 He Leu Glu Arg Glu Lys Tyr Thr Val Cys Val Ser Cys Gin He Gly 110 115 120
TGT CAA GTG GGT TGC TCG TTT TGT TTC ACT CAA AAA GGC GGT TTT GTA 437 Cys Gin Val Gly Cys Ser Phe Cys Phe Thr Gin Lys Gly Gly Phe Val 125 130 135
AGG AAC TTA AAA GCG AGC GAG ATC ATC CAA CAA GCC CTA CTC ATT AAA 485 Arg Asn Leu Lys Ala Ser Glu He He Gin Gin Ala Leu Leu He Lys 140 145 150
GAA GAC AAC AAC CTC CCC CTT GAA AAA GCG CTC AAC ATT GTT TTT ATG 533 Glu Asp Asn Asn Leu Pro Leu Glu Lys Ala Leu Asn He Val Phe Met 155 160 165 170
GGA ATG GGC GAG CCT TTA AAC AAT TTA GAT GAG GTG TGT AAA GCG ATT 581 Gly Met Gly Glu Pro Leu Asn Asn Leu Asp Glu Val Cys Lys Ala He 175 180 185
GAG ATT TTC AAT ACC GGC ATG CAA ATT TCG CCT AAA AGA ATC ACC ATT 629 Glu He Phe Asn Thr Gly Met Gin He Ser Pro Lys Arg He Thr He 190 195 200
TCC ACT AGC GGC GTA GCC GAT AAA ATC CCT ATT TTA GCG GGC AAA AAC 677 Ser Thr Ser Gly Val Ala Asp Lys He Pro He Leu Ala Gly Lys Asn 205 210 215
TTA GGC GTG CAA TTA GCC ATA TCC TTA CAC GCC GTA GAT GAC AAA ACG 725 Leu Gly Val Gin Leu Ala He Ser Leu His Ala Val Asp Asp Lys Thr 220 225 230
CGC TCG TCT TTA ATG CCC TTG AAT AAA AAA TAC AAT ATT GAA TGC GTT 773 Arg Ser Ser Leu Met Pro Leu Asn Lys Lys Tyr Asn He Glu Cys Val 235 240 245 250
TTG AAT GAA GTG AGG AAA TGG CCT TTA GAG CAG CGC AAA AGA GTG ATG 821
A581- Leu Asn Glu Val Arg Lys Trp Pro Leu Glu Gin Arg Lys Arg Val Met 255 260 265
TTT GAA TAC CTT TTG ATC AAA GAT TTG AAC GAT AGC CTA GAC TGC GCT 869 Phe Glu Tyr Leu Leu He Lys Asp Leu Asn Asp Ser Leu Asp Cys Ala 270 275 280
AAA AAA CTT TTA AAA CTT TTA AAC GGC ATT AAA TCC AAA GTG AAT TTG 917 Lys Lys Leu Leu Lys Leu Leu Asn Gly He Lys Ser Lys Val Asn Leu 285 290 295
ATC TTA TTC AAC CCG CAT GAA GGC TCT AAG TTT GAA CGC CCT AGC TTA 965 He Leu Phe Asn Pro His Glu Gly Ser Lys Phe Glu Arg Pro Ser Leu 300 305 310
GAG AAC GCT AGA ATG TTT GCG GAT TTT TTA AAC TCT AAA GGC TTA TTA 1013 Glu Asn Ala Arg Met Phe Ala Asp Phe Leu Asn Ser Lys Gly Leu Leu 315 320 325 330
TGC ACC ATT AGA GAG TCT AAA GCC TTG GAT ATT GAA GCG GCT TGC GGG 1061 Cys Thr He Arg Glu Ser Lys Ala Leu Asp He Glu Ala Ala Cys Gly 335 340 345
CAG TTG AGG GAG AAA AAA CTC TCT CAG CAA ATT TGAAAACTTT TTTGTGGTGT 1114 Gin Leu Arg Glu Lys Lys Leu Ser Gin Gin He 350 355
TTGTCTTTTT TCTAATGGGG GGTGTTGG 1142
(2) INFORMATION FOR SEQ ID NO: 1084:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 357 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1084:
Met Lys Ala Ser He Tyr Asp Phe Thr Leu Lys Glu Leu Ser Gin Leu
1 5 10 15
Leu Lys Pro Ser Phe Arg Ala Lys Gin Leu Tyr Leu Trp Leu Tyr Ala
20 25 30
Lys Tyr Lys Thr Ser Phe Lys Asp Met Gin Asn Asn Phe Ser Lys Asp
35 40 45
Phe He Ala Tyr Leu Glu Arg Glu Phe Ala Leu Arg Thr He Glu He
50 55 60
Thr His Val Arg Glu Ser Val Asp Gly Ser Lys Lys Tyr Leu Phe Lys 65 70 75 80
Ser Leu Arg Asp Asn His Thr Phe Glu Ala Val Leu Leu Lys Met Lys
85 90 95
Asp Lys Lys He Asp Ala Glu Thr Asn Ala He Leu Glu Arg Glu Lys 100 105 110 Tyr Thr Val Cys Val Ser Cys Gin He Gly Cys Gin Val Gly Cys Ser
115 120 125
Phe Cys Phe Thr Gin Lys Gly Gly Phe Val Arg Asn Leu Lys Ala Ser
130 135 140
Glu He He Gin Gin Ala Leu Leu He Lys Glu Asp Asn Asn Leu Pro 145 150 155 160
Leu Glu Lys Ala Leu Asn He Val Phe Met Gly Met Gly Glu Pro Leu
165 170 175
Asn Asn Leu Asp Glu Val Cys Lys Ala He Glu He Phe Asn Thr Gly
180 185 190
Met Gin He Ser Pro Lys Arg He Thr He Ser Thr Ser Gly Val Ala
195 200 205
Asp Lys He Pro He Leu Ala Gly Lys Asn Leu Gly Val Gin Leu Ala
210 215 220
He Ser Leu His Ala Val Asp Asp Lys Thr Arg Ser Ser Leu Met Pro 225 230 235 240
Leu Asn Lys Lys Tyr Asn He Glu Cys Val Leu Asn Glu Val Arg Lys
245 250 255
Trp Pro Leu Glu Gin Arg Lys Arg Val Met Phe Glu Tyr Leu Leu He
260 265 270
Lys Asp Leu Asn Asp Ser Leu Asp Cys Ala Lys Lys Leu Leu Lys Leu
275 280 285
Leu Asn Gly He Lys Ser Lys Val Asn Leu He Leu Phe Asn Pro His
290 295 300
Glu Gly Ser Lys Phe Glu Arg Pro Ser Leu Glu Asn Ala Arg Met Phe 305 310 315 320
Ala Asp Phe Leu Asn Ser Lys Gly Leu Leu Cys Thr He Arg Glu Ser
325 330 335
Lys Ala Leu Asp He Glu Ala Ala Cys Gly Gin Leu Arg Glu Lys Lys
340 345 350
Leu Ser Gin Gin He 355
(2) INFORMATION FOR SEQ ID NO: 1085:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 990 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...987 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1085:
ATG CCC ATT CTT TTT GAT TGT AAC GCT ATT GCT TCA CAA GTT TTA AAA 4£ Met Pro He Leu Phe Asp Cys Asn Ala He Ala Ser Gin Val Leu Lys 1 5 10 15 GAT GAA GCG AGC GCG CTT TTA GAA AGC GTT GGA CAA TTC CAA AAA CCC 96 Asp Glu Ala Ser Ala Leu Leu Glu Ser Val Gly Gin Phe Gin Lys Pro 20 25 30
AAC GAT TTA GAA GCG ATT GTC AAA CTC ATT TTA AAA AGC CAA GAA AAT 144 Asn Asp Leu Glu Ala He Val Lys Leu He Leu Lys Ser Gin Glu Asn 35 40 45
GGG GGT AAG CTT GTG ATA GTG GGT GTG GGT AAG AGC GCT TTA GTG GCG 192 Gly Gly Lys Leu Val He Val Gly Val Gly Lys Ser Ala Leu Val Ala 50 55 60
CAA AAA ATC GTT GCT TCC ATG CTA AGC ACC GGT AAC AGG AGC GCG TTT 240 Gin Lys He Val Ala Ser Met Leu Ser Thr Gly Asn Arg Ser Ala Phe 65 70 75 80
TTA CAC CCC ACA GAA GCC ATG CAT GGG GAT TTG GGC ATG GTG GAA AAA 288 Leu His Pro Thr Glu Ala Met His Gly Asp Leu Gly Met Val Glu Lys 85 90 95
AAC GAT GTG GTT TTA ATG ATT AGC TAT GGG GGC GAG TCT TTA GAA TTA 336 Asn Asp Val Val Leu Met He Ser Tyr Gly Gly Glu Ser Leu Glu Leu 100 105 110
TTG AAT CTG GTG AGC CAT TTA AAA CGC TTG AGC CAT AAA ATC ATC ACT 384 Leu Asn Leu Val Ser His Leu Lys Arg Leu Ser His Lys He He Thr 115 120 125
TTC ACT AAA AGC CCT AAT AGC TCG CTC TCT AAA CTC GGC GAT TAT TAT 432 Phe Thr Lys Ser Pro Asn Ser Ser Leu Ser Lys Leu Gly Asp Tyr Tyr 130 135 140
TTG AGC TTG AAA ATT CAA AAA GAA GCT TGC CCG ATT AAC ACC GCT CCA 480 Leu Ser Leu Lys He Gin Lys Glu Ala Cys Pro He Asn Thr Ala Pro 145 150 155 160
ACG ACT TCT ACC ACC CTA ACT CTA GCG TTA GGC GAT GTT TTA ATG GCA 528 Thr Thr Ser Thr Thr Leu Thr Leu Ala Leu Gly Asp Val Leu Met Ala 165 170 175
TGC TTG ATG CGA GCG AAA AAC TTT AGC CAA GAA GAT TTT GCC TCC TTT 576 Cys Leu Met Arg Ala Lys Asn Phe Ser Gin Glu Asp Phe Ala Ser Phe 180 185 190
CAT CCG GGC GGG CTT TTA GGC AAA AAA CTT TTT GTC AAG GTT AAA GAT 624 His Pro Gly Gly Leu Leu Gly Lys Lys Leu Phe Val Lys Val Lys Asp 195 200 205
TTA CTG CAA ACC ACG AAC CTC CCC CTA ATC GCT CCT AGC ACA AGT TTT 672 Leu Leu Gin Thr Thr Asn Leu Pro Leu He Ala Pro Ser Thr Ser Phe 210 215 220
AAA GAC GCG CTC ATA GAA ATG AGT GAA AAA CGC TTA GGC AGC GCG ATT 720 Lys Asp Ala Leu He Glu Met Ser Glu Lys Arg Leu Gly Ser Ala He 225 230 235 240 TTA GTC AAT GAA GCT AAC GAG CTT GTG GGG GTG TTA AGC GAT GGC GAT 768 Leu Val Asn Glu Ala Asn Glu Leu Val Gly Val Leu Ser Asp Gly Asp 245 250 255
GTC CGT AGG GCG TTA TTA AAA GGG GTG AGT TTA AAG AGC GAA GTG AGG 816 Val Arg Arg Ala Leu Leu Lys Gly Val Ser Leu Lys Ser Glu Val Arg 260 265 270
CAT TTT GCC ACT TTA AAA CCT AAA AGC TTT AAG AAT TTA GAC GCT CTT 864 His Phe Ala Thr Leu Lys Pro Lys Ser Phe Lys Asn Leu Asp Ala Leu 275 280 285
CTT TTA GAA GCG TTG GAA TTT TTA GAG CGC CAT AAG ATC CAG CTT TTA 912 Leu Leu Glu Ala Leu Glu Phe Leu Glu Arg His Lys He Gin Leu Leu 290 295 300
GTG TGC GTA GAT GAT CAT AAT AAG GTT TTA GGG GTC TTG CAC TTG CAC 960 Val Cys Val Asp Asp His Asn Lys Val Leu Gly Val Leu His Leu His 305 310 315 320
CAA CTT TTA GAA TTA GGG CTT AAA GCA TGA 990
Gin Leu Leu Glu Leu Gly Leu Lys Ala 325
(2) INFORMATION FOR SEQ ID NO: 1086:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 329 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1086:
Met Pro He Leu Phe Asp Cys Asn Ala He Ala Ser Gin Val Leu Lys
1 5 10 15
Asp Glu Ala Ser Ala Leu Leu Glu Ser Val Gly Gin Phe Gin Lys Pro
20 25 30
Asn Asp Leu Glu Ala He Val Lys Leu He Leu Lys Ser Gin Glu Asn
35 40 45
Gly Gly Lys Leu Val He Val Gly Val Gly Lys Ser Ala Leu Val Ala
50 55 60
Gin Lys He Val Ala Ser Met Leu Ser Thr Gly Asn Arg Ser Ala Phe 65 70 75 80
Leu His Pro Thr Glu Ala Met His Gly Asp Leu Gly Met Val Glu Lys
85 90 95
Asn Asp Val Val Leu Met He Ser Tyr Gly Gly Glu Ser Leu Glu Leu
100 105 110
Leu Asn Leu Val Ser His Leu Lys Arg Leu Ser His Lys He He Thr
115 120 125
Phe Thr Lys Ser Pro Asn Ser Ser Leu Ser Lys Leu Gly Asp Tyr Tyr 130 135 140 Leu Ser Leu Lys He Gin Lys Glu Ala Cys Pro He Asn Thr Ala Pro 145 150 155 160
Thr Thr Ser Thr Thr Leu Thr Leu Ala Leu Gly Asp Val Leu Met Ala
165 170 175
Cys Leu Met Arg Ala Lys Asn Phe Ser Gin Glu Asp Phe Ala Ser Phe
180 185 190
His Pro Gly Gly Leu Leu Gly Lys Lys Leu Phe Val Lys Val Lys Asp
195 200 205
Leu Leu Gin Thr Thr Asn Leu Pro Leu He Ala Pro Ser Thr Ser Phe
210 215 220
Lys Asp Ala Leu He Glu Met Ser Glu Lys Arg Leu Gly Ser Ala He 225 230 235 240
Leu Val Asn Glu Ala Asn Glu Leu Val Gly Val Leu Ser Asp Gly Asp
245 250 255
Val Arg Arg Ala Leu Leu Lys Gly Val Ser Leu Lys Ser Glu Val Arg
260 265 270
His Phe Ala Thr Leu Lys Pro Lys Ser Phe Lys Asn Leu Asp Ala Leu
275 280 285
Leu Leu Glu Ala Leu Glu Phe Leu Glu Arg His Lys He Gin Leu Leu
290 295 300
Val Cys Val Asp Asp His Asn Lys Val Leu Gly Val Leu His Leu His 305 310 315 320
Gin Leu Leu Glu Leu Gly Leu Lys Ala 325
(2) INFORMATION FOR SEQ ID NO: 1087:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 991 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 64...93 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1087:
TCCTTAAAAC ATTTGAAGTA TAATACACTC TTATGGTCAA ATATTAAGTT TAGGAAAAGC 60 TGC ATG TGG AGT TTC ATT CAA AAA ATC TTT AAG GCT TTA ATC ATC GCA 108 Met Trp Ser Phe He Gin Lys He Phe Lys Ala Leu He He Ala 1 5 10 15
CCT TTA GAT TTT ATC ACG AAG TAT TTC AAG TCG TTT GTG CTG TTA CTC 156 Pro Leu Asp Phe He Thr Lys Tyr Phe Lys Ser Phe Val Leu Leu Leu 20 25 30
ATT GTA TTA GTC TTT TTT AGC GCT AAA GAA AGC GCG CCA AGC GCC CCG 204 He Val Leu Val Phe Phe Ser Ala Lys Glu Ser Ala Pro Ser Ala Pro 35 40 45
CCT AAT CTC GCT AAA CTC TAT TTA AAT GGG GCG ATT TTT AGC ACC GAG 252 Pro Asn Leu Ala Lys Leu Tyr Leu Asn Gly Ala He Phe Ser Thr Glu 50 55 60
GAT TTT GAC AAA GAA GTG GAT AAA ATC CTA AAA ACC CCT AGC ATT AAG 300 Asp Phe Asp Lys Glu Val Asp Lys He Leu Lys Thr Pro Ser He Lys 65 70 75
GGC GTT TTG CTT TTG ATT GAC TCT CCT GGT GGG GCG GTG TCA GCG AGC 348 Gly Val Leu Leu Leu He Asp Ser Pro Gly Gly Ala Val Ser Ala Ser 80 85 90 95
GTG GAA TTG AGC GAA AAA ATC GCT GAT TTG AAG CAA AAA ATG CCC GTT 396 Val Glu Leu Ser Glu Lys He Ala Asp Leu Lys Gin Lys Met Pro Val 100 105 110
TTA GCG TAT GCT AGG GGG GTT ATG GCG AGC GGG AGC TAT TAT GCG GGC 444 Leu Ala Tyr Ala Arg Gly Val Met Ala Ser Gly Ser Tyr Tyr Ala Gly 115 120 125
ATG CAA GCG AGC GAA GTT TAT GCC TCT AAA GCG AGT TTG ATA GGA TCC 492 Met Gin Ala Ser Glu Val Tyr Ala Ser Lys Ala Ser Leu He Gly Ser 130 135 140
ATT GGG GTG ATT TTT TCA GGT GCG AAT GTG GAA AAT TTG CTC AAT AAA 540 He Gly Val He Phe Ser Gly Ala Asn Val Glu Asn Leu Leu Asn Lys 145 150 155
GTC GGC GTA GCC ACT CAA GGC GTG CAT GCG GGC GAA TAC AAA GAA ATA 588 Val Gly Val Ala Thr Gin Gly Val His Ala Gly Glu Tyr Lys Glu He 160 165 170 175
GGC ACT TTC ACC AGA GCG TGG AAA CCC AAC GAA AAA GAT TTT TTG CAA 636 Gly Thr Phe Thr Arg Ala Trp Lys Pro Asn Glu Lys Asp Phe Leu Gin 180 185 190
AAT TTA GTC AAT GAG CAA TAC CAA ATG TTT GTG AAT GAT GTC GCA AAA 684 Asn Leu Val Asn Glu Gin Tyr Gin Met Phe Val Asn Asp Val Ala Lys 195 200 205
GCT AGG AAA TTA GAC GCT AAG GAT TAT AAG GAT TTT GCT GAA GGG AAG 732 Ala Arg Lys Leu Asp Ala Lys Asp Tyr Lys Asp Phe Ala Glu Gly Lys 210 215 220
GTC TTT AGC GCT CAA AAG GCT CTG AAA TTA AAA CTC ATT GAT AAA ATC 780 Val Phe Ser Ala Gin Lys Ala Leu Lys Leu Lys Leu He Asp Lys He 225 230 235
AGC ACG ATT AAG CAA GCG CAA GAT CGC TTA ATG GAA TTG AGT AAG GTT 828 Ser Thr He Lys Gin Ala Gin Asp Arg Leu Met Glu Leu Ser Lys Val 240 245 250 255
AAA AAA GCT TAT TGG CTA GAA AAA AGC CCT ATG GAG CGC TTC ATT GAA 876 Lys Lys Ala Tyr Trp Leu Glu Lys Ser Pro Met Glu Arg Phe He Glu 260 265 270
AAA GCC ACG CAA TCA GCC ACA AAC ATC ATC ACA CAA GCC TTT GGC TAT 924 Lys Ala Thr Gin Ser Ala Thr Asn He He Thr Gin Ala Phe Gly Tyr 275 280 285
CAA TTA TTA ATG AGA TAAAGATGTT AGAATTTATT TTAAAAATTC AAGCTAGAGA C 980 Gin Leu Leu Met Arg 290
TCTAAAGGCT T 991
(2) INFORMATION FOR SEQ ID NO: 1088:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 292 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1088:
Met Trp Ser Phe He Gin Lys He Phe Lys Ala Leu He He Ala Pro
1 5 10 15
Leu Asp Phe He Thr Lys Tyr Phe Lys Ser Phe Val Leu Leu Leu He
20 25 30
Val Leu Val Phe Phe Ser Ala Lys Glu Ser Ala Pro Ser Ala Pro Pro
35 40 45
Asn Leu Ala Lys Leu Tyr Leu Asn Gly Ala He Phe Ser Thr Glu Asp
50 55 60
Phe Asp Lys Glu Val Asp Lys He Leu Lys Thr Pro Ser He Lys Gly 65 70 75 80
Val Leu Leu Leu He Asp Ser Pro Gly Gly Ala Val Ser Ala Ser Val
85 90 95
Glu Leu Ser Glu Lys He Ala Asp Leu Lys Gin Lys Met Pro Val Leu
100 105 110
Ala Tyr Ala Arg Gly Val Met Ala Ser Gly Ser Tyr Tyr Ala Gly Met
115 120 125
Gin Ala Ser Glu Val Tyr Ala Ser Lys Ala Ser Leu He Gly Ser He
130 135 140
Gly Val He Phe Ser Gly Ala Asn Val Glu Asn Leu Leu Asn Lys Val 145 150 155 160
Gly Val Ala Thr Gin Gly Val His Ala Gly Glu Tyr Lys Glu He Gly
165 170 175
Thr Phe Thr Arg Ala Trp Lys Pro Asn Glu Lys Asp Phe Leu Gin Asn
180 185 190
Leu Val Asn Glu Gin Tyr Gin Met Phe Val Asn Asp Val Ala Lys Ala
195 200 205
Arg Lys Leu Asp Ala Lys Asp Tyr Lys Asp Phe Ala Glu Gly Lys Val
210 215 220
Phe Ser Ala Gin Lys Ala Leu Lys Leu Lys Leu He Asp Lys He Ser 225 230 235 240 Thr He Lys Gin Ala Gin Asp Arg Leu Met Glu Leu Ser Lys Val Lys
245 250 255
Lys Ala Tyr Trp Leu Glu Lys Ser Pro Met Glu Arg Phe He Glu Lys
260 265 270
Ala Thr Gin Ser Ala Thr Asn He He Thr Gin Ala Phe Gly Tyr Gin
275 280 285
Leu Leu Met Arg 290
(2) INFORMATION FOR SEQ ID NO: 1089:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1114 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 37...1050 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1089:
CGAGCTATCA CAACAAATCA ATTTGTAGGA ACAAGC ATG TTT TTT AAA ACT TAT 54
Met Phe Phe Lys Thr Tyr 1 5
CAA AAA TTA TTG GGT GCG AGC TGT TTG ACG TTG TAT TTA GCG GGC TGT 102 Gin Lys Leu Leu Gly Ala Ser Cys Leu Thr Leu Tyr Leu Ala Gly Cys 10 15 20
GGG AGT GAT AGT AGC GAG CCA TTG GTG GGA ATT GAA AAA AAT AGC TTC 150 Gly Ser Asp Ser Ser Glu Pro Leu Val Gly He Glu Lys Asn Ser Phe 25 30 35
AAT TCT ACC GTG AAA ATC ATT TCT AAA ACC GAC AAC ATA GAA ATC CAA 198 Asn Ser Thr Val Lys He He Ser Lys Thr Asp Asn He Glu He Gin 40 45 50
GAC TTG AAG CTC AAT CGT GGC AAT TGT GAG CAT GAT CAA AAT TTC TTG 246 Asp Leu Lys Leu Asn Arg Gly Asn Cys Glu His Asp Gin Asn Phe Leu 55 60 65 70
GTA AAG TTA ATC CAA GAA ACA GCC AAT ACA TAC CTG TTT GCA TCA GAA 294 Val Lys Leu He Gin Glu Thr Ala Asn Thr Tyr Leu Phe Ala Ser Glu 75 80 85
AAA GAA AAA GCG ATC AAA AAC CAC CAA GCA AAA ATC GCA AGA CTT CAA 342 Lys Glu Lys Ala He Lys Asn His Gin Ala Lys He Ala Arg Leu Gin 90 95 100 AAA GAT TTA GAA GAA CTC ACA CAG CAT GTG CAA CAA TCC AAT AAT CTT 390 Lys Asp Leu Glu Glu Leu Thr Gin His Val Gin Gin Ser Asn Asn Leu 105 110 115
GAT AAA TTG TTA GAA AAT GGA GGA CTA TTC GTT AGT GGC CAT GAT TAT 438 Asp Lys Leu Leu Glu Asn Gly Gly Leu Phe Val Ser Gly His Asp Tyr 120 125 130
AAA TAT ACA AAA GAT GAT AAC CCA ATA TAT GTT GTT AAG AGG ATG CTT 486 Lys Tyr Thr Lys Asp Asp Asn Pro He Tyr Val Val Lys Arg Met Leu 135 140 145 150
GAT AAC CTT GAT AGC TAT AAA TAT GAA TCA GAC GAC GTG CTA GAC GTG 534 Asp Asn Leu Asp Ser Tyr Lys Tyr Glu Ser Asp Asp Val Leu Asp Val 155 160 165
CCA TAT GAG AAG CTA TTG GAA ATA AGC ATT GCT ATT GAA GAC ACT AAA 582 Pro Tyr Glu Lys Leu Leu Glu He Ser He Ala He Glu Asp Thr Lys 170 175 180
AAC CCC AAA GAC TAC CCT TAT ATC AAC CTT AAA GAA CTC AAA AAA TTA 630 Asn Pro Lys Asp Tyr Pro Tyr He Asn Leu Lys Glu Leu Lys Lys Leu 185 190 195
ATA GAT AGT ATT ATT GAT GAT CAT GGT TAT ATG GCC GAT GGC TTT TTG 678 He Asp Ser He He Asp Asp His Gly Tyr Met Ala Asp Gly Phe Leu 200 205 210
AAT GAA TAT TCT AAT AGG GTA TCA AAA AAA GGT CTC CAA ATC CTT GCT 726 Asn Glu Tyr Ser Asn Arg Val Ser Lys Lys Gly Leu Gin He Leu Ala 215 220 225 230
AAA CTA AAA TCC ATG TGG CCT AGC GTA GGG AAA TTT TAT TTC GCC TCT 774 Lys Leu Lys Ser Met Trp Pro Ser Val Gly Lys Phe Tyr Phe Ala Ser 235 240 245
TTG AAA GAG GCT ATC CCA AGG CAT GCC AAA GAA GTT ACT GAC AAG ATG 822 Leu Lys Glu Ala He Pro Arg His Ala Lys Glu Val Thr Asp Lys Met 250 255 260
ATT AGC TCT GAA GAA AAA TCT ATC AAA GCC AAT CAA GTC AAA CTC ACT 870 He Ser Ser Glu Glu Lys Ser He Lys Ala Asn Gin Val Lys Leu Thr 265 270 275
GAA GCG AAG CAA GAT ATT GAC AAA ATG GAA AAA ATC ATT AAA GAT TTA 918 Glu Ala Lys Gin Asp He Asp Lys Met Glu Lys He He Lys Asp Leu 280 285 290
GAA AGC AAG AAA AAC ACC TTA TCA GTG TAT TTA AAA TTT GGA GAA AGT 966 Glu Ser Lys Lys Asn Thr Leu Ser Val Tyr Leu Lys Phe Gly Glu Ser 295 300 305 310
TTC ACA GCG CAT TAT AAG TGT CAA AAT CTC ATA GAA GTT GGA GTC AAA 1014 Phe Thr Ala His Tyr Lys Cys Gin Asn Leu He Glu Val Gly Val Lys 315 320 325 ACC GAT AAA GGC TCC TGG ACT TTC AAC TTT AAC AGA TAAATCAGGC AAATAT 1066 Thr Asp Lys Gly Ser Trp Thr Phe Asn Phe Asn Arg 330 335
GGACAATAGC ACAGACAGAG CAAAAATCCT TATAGAAGAG CTTAAAAT 1114
(2) INFORMATION FOR SEQ ID NO: 1090:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 338 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1090:
Met Phe Phe Lys Thr Tyr Gin Lys Leu Leu Gly Ala Ser Cys Leu Thr
1 5 10 15
Leu Tyr Leu Ala Gly Cys Gly Ser Asp Ser Ser Glu Pro Leu Val Gly
20 25 30
He Glu Lys Asn Ser Phe Asn Ser Thr Val Lys He He Ser Lys Thr
35 40 45
Asp Asn He Glu He Gin Asp Leu Lys Leu Asn Arg Gly Asn Cys Glu
50 55 60
His Asp Gin Asn Phe Leu Val Lys Leu He Gin Glu Thr Ala Asn Thr 65 70 75 80
Tyr Leu Phe Ala Ser Glu Lys Glu Lys Ala He Lys Asn His Gin Ala
85 90 95
Lys He Ala Arg Leu Gin Lys Asp Leu Glu Glu Leu Thr Gin His Val
100 105 110
Gin Gin Ser Asn Asn Leu Asp Lys Leu Leu Glu Asn Gly Gly Leu Phe
115 120 125
Val Ser Gly His Asp Tyr Lys Tyr Thr Lys Asp Asp Asn Pro He Tyr
130 135 140
Val Val Lys Arg Met Leu Asp Asn Leu Asp Ser Tyr Lys Tyr Glu Ser 145 150 155 160
Asp Asp Val Leu Asp Val Pro Tyr Glu Lys Leu Leu Glu He Ser He
165 170 175
Ala He Glu Asp Thr Lys Asn Pro Lys Asp Tyr Pro Tyr He Asn Leu
180 185 190
Lys Glu Leu Lys Lys Leu He Asp Ser He He Asp Asp His Gly Tyr
195 200 205
Met Ala Asp Gly Phe Leu Asn Glu Tyr Ser Asn Arg Val Ser Lys Lys
210 215 220
Gly Leu Gin He Leu Ala Lys Leu Lys Ser Met Trp Pro Ser Val Gly 225 230 235 240
Lys Phe Tyr Phe Ala Ser Leu Lys Glu Ala He Pro Arg His Ala Lys
245 250 255
Glu Val Thr Asp Lys Met He Ser Ser Glu Glu Lys Ser He Lys Ala
260 265 270
Asn Gin Val Lys Leu Thr Glu Ala Lys Gin Asp He Asp Lys Met Glu
275 280 285
Lys He He Lys Asp Leu Glu Ser Lys Lys Asn Thr Leu Ser Val Tyr 290 295 300
Leu Lys Phe Gly Glu Ser Phe Thr Ala His Tyr Lys Cys Gin Asn Leu 305 310 315 320
He Glu Val Gly Val Lys Thr Asp Lys Gly Ser Trp Thr Phe Asn Phe
325 330 335
Asn Arg
(2) INFORMATION FOR SEQ ID NO: 1091:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 847 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 94...807 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1091:
GCCTTGACGC ATGTTTTTGA AGTTTATCCT AAAGTCAATA TTTTTTTAAA AATCCTTCAC 60 AAAGAAGGGG CTTACCACAA GCTTATTTCT CGC ATG TGT TTG GTC AAA GAC AAG 114
Met Cys Leu Val Lys Asp Lys 1 5
CTC AAA GAC ATT ATC AGC GTC AAA AGC GCG CTT TCT TTT TCG TTA AAA 162 Leu Lys Asp He He Ser Val Lys Ser Ala Leu Ser Phe Ser Leu Lys 10 15 20
GGG GAT TTT GAC TGC CCT TTA GAA GAA AAC TCG CTC TTT AAA GCC CTC 210 Gly Asp Phe Asp Cys Pro Leu Glu Glu Asn Ser Leu Phe Lys Ala Leu 25 30 35
CAA ATT TTA AAG AAT TTT TTA AAA TCA AAA AAT TTC TCT CAT TCT GTC 258 Gin He Leu Lys Asn Phe Leu Lys Ser Lys Asn Phe Ser His Ser Val 40 45 50 55
ATC AAA TCC CTA GAC ACC CTA GCG ATT GAA GTG GAA AAA AAC ATC CCC 306 He Lys Ser Leu Asp Thr Leu Ala He Glu Val Glu Lys Asn He Pro 60 65 70
ACT CAA GCC GGA TTA GGC GGT GGG AGC ACT GAT GCT GGG GGG CTA TTG 354 Thr Gin Ala Gly Leu Gly Gly Gly Ser Thr Asp Ala Gly Gly Leu Leu 75 80 85
TAT CAT TTA AAT CAG ATT TTT GAC TGG CGT TTG AGT TTA GAA GAG CTT 402 Tyr His Leu Asn Gin He Phe Asp Trp Arg Leu Ser Leu Glu Glu Leu 90 95 100 TAT AGC ATG GGA TCT TTA GTG GGC GCG GAC ACC AAT TTT TTC ATC TCG 450 Tyr Ser Met Gly Ser Leu Val Gly Ala Asp Thr Asn Phe Phe He Ser 105 110 115
CAA TAC AAA AGC ACT AAC GCC ACT TCT TAT GGC GAA GTC ATT GAA AAT 498 Gin Tyr Lys Ser Thr Asn Ala Thr Ser Tyr Gly Glu Val He Glu Asn 120 125 130 135
TTT GAA GAA GAG CCT TTA GAA AAT CGC CTA GAA ATC TAT GCA CCA AAT 546 Phe Glu Glu Glu Pro Leu Glu Asn Arg Leu Glu He Tyr Ala Pro Asn 140 145 150
CAT GTT TTT TGC AGC ACC AAA GCC GTT TAT CAA GCT TAT AAG CCT GAA 594 His Val Phe Cys Ser Thr Lys Ala Val Tyr Gin Ala Tyr Lys Pro Glu 155 160 165
ACT TGT TTT TCT CAA GCT AAA GAA TGG CTT AAA AAG CCG AGT TTG GAA 642 Thr Cys Phe Ser Gin Ala Lys Glu Trp Leu Lys Lys Pro Ser Leu Glu 170 175 180
TGC CTA AAA ACT TAT GAT AGA AAC GGA TTA AAC GAC CTT TTA AAG CCG 690 Cys Leu Lys Thr Tyr Asp Arg Asn Gly Leu Asn Asp Leu Leu Lys Pro 185 190 195
GCT TTA CTC ACT AAC CAA GCC TTA AAA GAT ATA GAA AGC GAA CTA GGC 738 Ala Leu Leu Thr Asn Gin Ala Leu Lys Asp He Glu Ser Glu Leu Gly 200 205 210 215
AAG GAG TGG TTT TTT AGC GGG AGC GGG AGC GCG TTT TTT AGG CTA AAG 786 Lys Glu Trp Phe Phe Ser Gly Ser Gly Ser Ala Phe Phe Arg Leu Lys 220 225 230
CCT ATG CAA AAA GGG GGC GAA TGAAACTCAT TGCCAGCAAC AAAAAAGCCT ATTT 841 Pro Met Gin Lys Gly Gly Glu 235
TGACTA 847
(2) INFORMATION FOR SEQ ID NO: 1092:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 238 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1092:
Met Cys Leu Val Lys Asp Lys Leu Lys Asp He He Ser Val Lys Ser
1 5 10 15
Ala Leu Ser Phe Ser Leu Lys Gly Asp Phe Asp Cys Pro Leu Glu Glu
20 25 30
Asn Ser Leu Phe Lys Ala Leu Gin He Leu Lys Asn Phe Leu Lys Ser 35 40 45
Lys Asn Phe Ser His Ser Val He Lys Ser Leu Asp Thr Leu Ala He
50 55 60
Glu Val Glu Lys Asn He Pro Thr Gin Ala Gly Leu Gly Gly Gly Ser 65 70 75 80
Thr Asp Ala Gly Gly Leu Leu Tyr His Leu Asn Gin He Phe Asp Trp
85 90 95
Arg Leu Ser Leu Glu Glu Leu Tyr Ser Met Gly Ser Leu Val Gly Ala
100 105 110
Asp Thr Asn Phe Phe He Ser Gin Tyr Lys Ser Thr Asn Ala Thr Ser
115 120 125
Tyr Gly Glu Val He Glu Asn Phe Glu Glu Glu Pro Leu Glu Asn Arg
130 135 140
Leu Glu He Tyr Ala Pro Asn His Val Phe Cys Ser Thr Lys Ala Val 145 150 155 160
Tyr Gin Ala Tyr Lys Pro Glu Thr Cys Phe Ser Gin Ala Lys Glu Trp
165 170 175
Leu Lys Lys Pro Ser Leu Glu Cys Leu Lys Thr Tyr Asp Arg Asn Gly
180 185 190
Leu Asn Asp Leu Leu Lys Pro Ala Leu Leu Thr Asn Gin Ala Leu Lys
195 200 205
Asp He Glu Ser Glu Leu Gly Lys Glu Trp Phe Phe Ser Gly Ser Gly
210 215 220
Ser Ala Phe Phe Arg Leu Lys Pro Met Gin Lys Gly Gly Glu 225 230 235
(2) INFORMATION FOR SEQ ID NO: 1093:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1092 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 28...1047 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1093:
TGAAACTATC CATTTAAAGG TGTGAAA ATG GCA AAT TTA GAA AAT TTA GAC TGG 54
Met Ala Asn Leu Glu Asn Leu Asp Trp 1 5 l
AAA AAT TTA GGC TTT AGC TAC ATT AAA ACG GAT TTT CGC TTC ATC GCC 102 Lys Asn Leu Gly Phe Ser Tyr He Lys Thr Asp Phe Arg Phe He Ala 10 15 20 25
ACT TAT AAA AAC GGC TCT TGG TCG CAA GGC GGA TTG GTG AGC GAA AAC 150 Thr Tyr Lys Asn Gly Ser Trp Ser Gin Gly Gly Leu Val Ser Glu Asn 30 35 40
ATG TTA CAA CTC AGC GAA GGC TCG CCG GTC TTG CAC TAC GGG CAG GCT 198 Met Leu Gin Leu Ser Glu Gly Ser Pro Val Leu His Tyr Gly Gin Ala 45 50 55
TGT TTT GAA GGC TTG AAG GCT TAC CGC TCT CAA AAG GGG AAA GCT TTA 246 Cys Phe Glu Gly Leu Lys Ala Tyr Arg Ser Gin Lys Gly Lys Ala Leu 60 65 70
CTC TTT CGC CCT TTA GAA AAC GCC AAA CGC TTG CAA ACT TCA TGC GAA 294 Leu Phe Arg Pro Leu Glu Asn Ala Lys Arg Leu Gin Thr Ser Cys Glu 75 80 85
AGA CTG CTC ATG CCC AAA GTG AGC GAA GAG CTG TTT TTA AGG GCA TGC 342 Arg Leu Leu Met Pro Lys Val Ser Glu Glu Leu Phe Leu Arg Ala Cys 90 95 100 105
GCT GAA GTG GTG AAA GCG AAT CAA AAA TGG CTC GCT CCT TAT AAA AGC 390 Ala Glu Val Val Lys Ala Asn Gin Lys Trp Leu Ala Pro Tyr Lys Ser 110 115 120
GGG GCG AGT TTG TAT TTG CGC CCT TTT GTC ATA GGC GTA GGG GAT AAT 438 Gly Ala Ser Leu Tyr Leu Arg Pro Phe Val He Gly Val Gly Asp Asn 125 130 135
TTG GGG GTG AAG CCG GCT AAT GAA TAC CTT TTT ATC GTG TTT TGT GCG 486 Leu Gly Val Lys Pro Ala Asn Glu Tyr Leu Phe He Val Phe Cys Ala 140 145 150
CCT GTG GGG GCG TAT TTT AAG GGG GGT ATA GAA AAA GGG GGG GCT AGG 534 Pro Val Gly Ala Tyr Phe Lys Gly Gly He Glu Lys Gly Gly Ala Arg 155 160 165
TTT ATC ACT ACG ATT TTT GAT AGG GCC GCG CCT AAA GGC ACC GGT GGG 582 Phe He Thr Thr He Phe Asp Arg Ala Ala Pro Lys Gly Thr Gly Gly 170 175 180 185
GTG AAA GTG GGA GGG AAT TAC GCT GCA AGC CTG TTA GCC CAT AAA ATG 630 Val Lys Val Gly Gly Asn Tyr Ala Ala Ser Leu Leu Ala His Lys Met 190 195 200
GCC ACA GAG CAA GGC TAT GAT GAT TGC ATT TAT TTA GAC CCT ACT ACG 678 Ala Thr Glu Gin Gly Tyr Asp Asp Cys He Tyr Leu Asp Pro Thr Thr 205 210 215
CAC ACT AAA ATT GAA GAA GTG GGG GCG GCG AAT TTT TTT GGC ATC ACG 726 His Thr Lys He Glu Glu Val Gly Ala Ala Asn Phe Phe Gly He Thr 220 225 230
CAT GAT GAT GCC TTT ATC ACC CCG CAT TCG CCA AGC ATT CTG CCA AGC 774 His Asp Asp Ala Phe He Thr Pro His Ser Pro Ser He Leu Pro Ser 235 240 245
ATT ACC AAA AAA AGC TTG ATG GTT TTG GCT AAA GAA TAT TTG AAC CTC 822 He Thr Lys Lys Ser Leu Met Val Leu Ala Lys Glu Tyr Leu Asn Leu 250 255 260 265
AAA GTA GAA GAG AGG GAA ATC CTA ATG GAT GAG TTG GAT GCG TTT AAA 870 Lys Val Glu Glu Arg Glu He Leu Met Asp Glu Leu Asp Ala Phe Lys 270 275 280
GAA GCT GGA GCG TGC GGG ACA GCT GCG ATC ATT ACG CCC ATT AAA GAA 918 Glu Ala Gly Ala Cys Gly Thr Ala Ala He He Thr Pro He Lys Glu 285 290 295
ATC GTG CAC AAC AAC AAG TCT TAT TTT TTT GAA GCG CCG GGC CAT ATT 966 He Val His Asn Asn Lys Ser Tyr Phe Phe Glu Ala Pro Gly His He 300 305 310
ACT AAA CGA CTC TAT GAT TTG CTT TTA TCC ATC CAA CAA GGC GAA CAA 1014 Thr Lys Arg Leu Tyr Asp Leu Leu Leu Ser He Gin Gin Gly Glu Gin 315 320 325
GAA GCC CCC AAA GAT TGG ATT TTT GAA GTT GGC TAAAAGGTTA AAATTTATAG 1067 Glu Ala Pro Lys Asp Trp He Phe Glu Val Gly 330 335 340
CTGTATGCCG CATAAAATAA GGGCG 1092
(2) INFORMATION FOR SEQ ID NO: 1094:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1094:
Met Ala Asn Leu Glu Asn Leu Asp Trp Lys Asn Leu Gly Phe Ser Tyr
1 5 10 15
He Lys Thr Asp Phe Arg Phe He Ala Thr Tyr Lys Asn Gly Ser Trp
20 25 30
Ser Gin Gly Gly Leu Val Ser Glu Asn Met Leu Gin Leu Ser Glu Gly
35 40 45
Ser Pro Val Leu His Tyr Gly Gin Ala Cys Phe Glu Gly Leu Lys Ala
50 55 60
Tyr Arg Ser Gin Lys Gly Lys Ala Leu Leu Phe Arg Pro Leu Glu Asn 65 70 75 80
Ala Lys Arg Leu Gin Thr Ser Cys Glu Arg Leu Leu Met Pro Lys Val
85 90 95
Ser Glu Glu Leu Phe Leu Arg Ala Cys Ala Glu Val Val Lys Ala Asn
100 105 110
Gin Lys Trp Leu Ala Pro Tyr Lys Ser Gly Ala Ser Leu Tyr Leu Arg
115 120 125
Pro Phe Val He Gly Val Gly Asp Asn Leu Gly Val Lys Pro Ala Asn 130 135 140 Glu Tyr Leu Phe He Val Phe Cys Ala Pro Val Gly Ala Tyr Phe Lys 145 150 155 160
Gly Gly He Glu Lys Gly Gly Ala Arg Phe He Thr Thr He Phe Asp
165 170 175
Arg Ala Ala Pro Lys Gly Thr Gly Gly Val Lys Val Gly Gly Asn Tyr
180 185 190
Ala Ala Ser Leu Leu Ala His Lys Met Ala Thr Glu Gin Gly Tyr Asp
195 200 205
Asp Cys He Tyr Leu Asp Pro Thr Thr His Thr Lys He Glu Glu Val
210 215 220
Gly Ala Ala Asn Phe Phe Gly He Thr His Asp Asp Ala Phe He Thr 225 230 235 240
Pro His Ser Pro Ser He Leu Pro Ser He Thr Lys Lys Ser Leu Met
245 250 255
Val Leu Ala Lys Glu Tyr Leu Asn Leu Lys Val Glu Glu Arg Glu He
260 265 270
Leu Met Asp Glu Leu Asp Ala Phe Lys Glu Ala Gly Ala Cys Gly Thr
275 280 285
Ala Ala He He Thr Pro He Lys Glu He Val His Asn Asn Lys Ser
290 295 300
Tyr Phe Phe Glu Ala Pro Gly His He Thr Lys Arg Leu Tyr Asp Leu 305 310 315 320
Leu Leu Ser He Gin Gin Gly Glu Gin Glu Ala Pro Lys Asp Trp He
325 330 335
Phe Glu Val Gly 340
(2) INFORMATION FOR SEQ ID NO: 1095:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2111 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...2067 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1095:
GTTATAATTT TATTTTTTAA AAGGATACCC ATG AAT AAA GTT CAA TCT ATT GAT 54
Met Asn Lys Val Gin Ser He Asp 1 5
CCT TTA ATC GCT GAT AAG TTC AAC AAC GAG TTA AGA AGT TAT AAC CTA 102 Pro Leu He Ala Asp Lys Phe Asn Asn Glu Leu Arg Ser Tyr Asn Leu 10 15 20
GAA TAC AAA CTA GAG CAA GAA AGC CTG AAT AAA GAA ATT GAT GAA GCT 150 Glu Tyr Lys Leu Glu Gin Glu Ser Leu Asn Lys Glu He Asp Glu Ala 25 30 35 40
TTA AAA AAT TAC GCT TCT AAA AAT GGG GGT TTA GGG GGT AAC CGC CCT 198 Leu Lys Asn Tyr Ala Ser Lys Asn Gly Gly Leu Gly Gly Asn Arg Pro 45 50 55
GAT GTG AAA CTT TTA TTA AAC ACA CAA GAC CCC AAC AGA AGA GTC CCT 246 Asp Val Lys Leu Leu Leu Asn Thr Gin Asp Pro Asn Arg Arg Val Pro 60 65 70
ATT TTA ATA GAA TAC AAA GGG CTA AAA GAT AAG CTC ATT AAA TTA GAC 294 He Leu He Glu Tyr Lys Gly Leu Lys Asp Lys Leu He Lys Leu Asp 75 80 85
AAA AAC AAA CTG GTA GAA AAC TTT AAA AAC CAT GAG CCT CAT TAT AAA 342 Lys Asn Lys Leu Val Glu Asn Phe Lys Asn His Glu Pro His Tyr Lys 90 95 100
AAC ATT AGA GAA TAC GCC CTA AAT GGG GCT TTG CAT TAC GCT AAT GCG 390 Asn He Arg Glu Tyr Ala Leu Asn Gly Ala Leu His Tyr Ala Asn Ala 105 110 115 120
ATT TTA CAC CAC ACG AGC TAC ACT GAA TGC ATC GCC ATA GGC ATT ACA 438 He Leu His His Thr Ser Tyr Thr Glu Cys He Ala He Gly He Thr 125 130 135
GGC TAT AAA GAC AAT AAG GGC GGC ATA TGC TCT CAA ATC GCT GTC TAT 486 Gly Tyr Lys Asp Asn Lys Gly Gly He Cys Ser Gin He Ala Val Tyr 140 145 150
TAT GTG AAT AAA AGC AAT CTA GGC ATG GGG ATA GAT GTT TCA AAA GGC 534 Tyr Val Asn Lys Ser Asn Leu Gly Met Gly He Asp Val Ser Lys Gly 155 160 165
GAG CAA GGT TAT AGC GAT CTC TCC TTT TTA AGC CGT AAG CAT TTT AAC 582 Glu Gin Gly Tyr Ser Asp Leu Ser Phe Leu Ser Arg Lys His Phe Asn 170 175 180
GAC TTT ATT AAA CGA GTA GAC ACC CTT TCT TTA AGC GAT GAA GAT TTA 630 Asp Phe He Lys Arg Val Asp Thr Leu Ser Leu Ser Asp Glu Asp Leu 185 190 195 200
GAG CGC ATT AGA GAA AAG AAA AAC CAA GAA ATA GAA GAC TGC TTA ATG 678 Glu Arg He Arg Glu Lys Lys Asn Gin Glu He Glu Asp Cys Leu Met 205 210 215
CGG CTC AAC AAC AAT ATT TAC AAC AAA GAA AAG AAT TTT TTA AGC GAA 726 Arg Leu Asn Asn Asn He Tyr Asn Lys Glu Lys Asn Phe Leu Ser Glu 220 225 230
CAC AAT CGG GTA TAT TTA GTG ATT GCG AGC ATT ATC GCT AAT TTA GGC 774 His Asn Arg Val Tyr Leu Val He Ala Ser He He Ala Asn Leu Gly 235 240 245 ATC CCT AAT TTG GTA ACC CCC CTA AAC AAA GAA GAT CTA AAA TCC AGC 822 He Pro Asn Leu Val Thr Pro Leu Asn Lys Glu Asp Leu Lys Ser Ser 250 255 260
GAT GAG GTC CAT CAA AGA GAT GGC GAC ATC ATG CTC AGA AAA ATC CAA 870 Asp Glu Val His Gin Arg Asp Gly Asp He Met Leu Arg Lys He Gin 265 270 275 280
TCC TTT TTA GAG AAT AAG GAT TTG TCT CCA GAG AAA AGG CAA AGC ATT 918 Ser Phe Leu Glu Asn Lys Asp Leu Ser Pro Glu Lys Arg Gin Ser He 285 290 295
ATT TCT TCA TTA GAG ACT TTA TTA AGA AAC GAA AAC AAC AAC AAA GCC 966 He Ser Ser Leu Glu Thr Leu Leu Arg Asn Glu Asn Asn Asn Lys Ala 300 305 310
ACT AAT GGC GAA AGC TGT TTG AAG CGT TGT TTT AGT GAG ATT GTG GAT 1014 Thr Asn Gly Glu Ser Cys Leu Lys Arg Cys Phe Ser Glu He Val Asp 315 320 325
AGT TTG GGC ATT TAT TAT AAA ATC GGT CTT AGC ACG GAT TTT ACC GGT 1062 Ser Leu Gly He Tyr Tyr Lys He Gly Leu Ser Thr Asp Phe Thr Gly 330 335 340
AAA TTG TTC AAT GAA ATG TAT CGC TGG CTG GGT TTC ACG AAA GAC CAA 1110 Lys Leu Phe Asn Glu Met Tyr Arg Trp Leu Gly Phe Thr Lys Asp Gin 345 350 355 360
TTA AAC GAT GTG GTG CTC ACA CCC CCT TAT GTC GCC ACG CTT TTA GCT 1158 Leu Asn Asp Val Val Leu Thr Pro Pro Tyr Val Ala Thr Leu Leu Ala 365 370 375
AGA CTT TCT AAA GTC AAT AAG GAT AGT TTC GTG TGG GAT TTT GCC ACC 1206 Arg Leu Ser Lys Val Asn Lys Asp Ser Phe Val Trp Asp Phe Ala Thr 380 385 390
GGA AGC GCT GGG CTA TTA GTC GCA AGC ATG AAT TTG ATG ATA GAA GAC 1254 Gly Ser Ala Gly Leu Leu Val Ala Ser Met Asn Leu Met He Glu Asp 395 400 405
GCT AAA AAG CGT ATC ACT AGT CCA GAG GAA TTA GAG CAA AAA ATC GCC 1302 Ala Lys Lys Arg He Thr Ser Pro Glu Glu Leu Glu Gin Lys He Ala 410 415 420
CAC ATT AAA GCC AAG CAA CTT TTA GGG ATA GAA ATC TTA TCG GAT ATC 1350 His He Lys Ala Lys Gin Leu Leu Gly He Glu He Leu Ser Asp He 425 430 435 440
CAT ACT TTA GCG GTG TTA AAC ATG ATT TTA ATG GGC GAT GGG AGC AGT 1398 His Thr Leu Ala Val Leu Asn Met He Leu Met Gly Asp Gly Ser Ser 445 450 455
CAA ATC TTA AAC CAA GAC GGC TTG AGC GGT TTT GAT GGC AAA GTC AAT 1446 Gin He Leu Asn Gin Asp Gly Leu Ser Gly Phe Asp Gly Lys Val Asn 460 465 470 AAC GAA GCG TTT AAG GCT AAT GCC TTT GTT TTA AAC CCG CCT TAT TCC 1494 Asn Glu Ala Phe Lys Ala Asn Ala Phe Val Leu Asn Pro Pro Tyr Ser 475 480 485
GCT AGC GGT AAT GGC ATG GTG TTT GTG GAG CAG GCT TTA GAA AAA ATG 1542 Ala Ser Gly Asn Gly Met Val Phe Val Glu Gin Ala Leu Glu Lys Met 490 495 500
CAA AGC GGT TAT GCG AGC GTG ATC ATC CAA TCA AGC GCC GGC AGT GGT 1590 Gin Ser Gly Tyr Ala Ser Val He He Gin Ser Ser Ala Gly Ser Gly 505 510 515 520
AAA GCC AAA GAA TAC AAT GTA AGG ATT TTG GAA AAA CAC ACG CTT TTA 1638 Lys Ala Lys Glu Tyr Asn Val Arg He Leu Glu Lys His Thr Leu Leu 525 530 535
GCG AGC ATT AAA ATG CCT TTA GAT TTA TTC ATC GGT AAA AGC AGC GTT 1686 Ala Ser He Lys Met Pro Leu Asp Leu Phe He Gly Lys Ser Ser Val 540 545 550
CAA ACC CAT ATC TAT GTT TTT AGG GTC AAT GAA AAG CAT GAC GCT AAG 1734 Gin Thr His He Tyr Val Phe Arg Val Asn Glu Lys His Asp Ala Lys 555 560 565
CAA AGG GTG AAA TTT ATT AAT TTC AGT AAC GAC GGC TAC GCT AGA GCG 1782 Gin Arg Val Lys Phe He Asn Phe Ser Asn Asp Gly Tyr Ala Arg Ala 570 575 580
AAT CGC AAA AAA GCC AAA GCC AGC CAC AAT TTA AAA GAC ACG CAT AAC 1830 Asn Arg Lys Lys Ala Lys Ala Ser His Asn Leu Lys Asp Thr His Asn 585 590 595 600
GCC AAA GAG CGC TAC AAC GAA GTC GTG GAT TTA GTC CAT ATT GGC CAA 1878 Ala Lys Glu Arg Tyr Asn Glu Val Val Asp Leu Val His He Gly Gin 605 610 615
TCA TGT TTG AAA TTT CTA AGC GAA GAT GAC TAT TAT GAA AAC ACC ATA 1926 Ser Cys Leu Lys Phe Leu Ser Glu Asp Asp Tyr Tyr Glu Asn Thr He 620 625 630
GAT CCC AAA AAC GGG AGC GAT TGG AAC CAA AAC AAA CCC ACT GAC ACC 1974 Asp Pro Lys Asn Gly Ser Asp Trp Asn Gin Asn Lys Pro Thr Asp Thr 635 640 645
AAA CCC GAA TTA GAG GAT TTT AAA AGA ACG ATA GCC GAT TAC CTT TCT 2022 Lys Pro Glu Leu Glu Asp Phe Lys Arg Thr He Ala Asp Tyr Leu Ser 650 655 660
TAT GAA GTA AGC TTG ATT TTA AAA AAC CAA ATG CCC CCA AAG CGA TAGGC 2072 Tyr Glu Val Ser Leu He Leu Lys Asn Gin Met Pro Pro Lys Arg 665 670 675
CCCCTTAATA GCCAACTCAA CGCTATTAAG TGGGGCGAG 2111
(2) INFORMATION FOR SEQ ID NO: 1096: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 679 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1096:
Met Asn Lys Val Gin Ser He Asp Pro Leu He Ala Asp Lys Phe Asn
1 5 10 15
Asn Glu Leu Arg Ser Tyr Asn Leu Glu Tyr Lys Leu Glu Gin Glu Ser
20 25 30
Leu Asn Lys Glu He Asp Glu Ala Leu Lys Asn Tyr Ala Ser Lys Asn
35 40 45
Gly Gly Leu Gly Gly Asn Arg Pro Asp Val Lys Leu Leu Leu Asn Thr
50 55 60
Gin Asp Pro Asn Arg Arg Val Pro He Leu He Glu Tyr Lys Gly Leu 65 70 75 80
Lys Asp Lys Leu He Lys Leu Asp Lys Asn Lys Leu Val Glu Asn Phe
85 90 95
Lys Asn His Glu Pro His Tyr Lys Asn He Arg Glu Tyr Ala Leu Asn
100 105 110
Gly Ala Leu His Tyr Ala Asn Ala He Leu His His Thr Ser Tyr Thr
115 120 125
Glu Cys He Ala He Gly He Thr Gly Tyr Lys Asp Asn Lys Gly Gly
130 135 140
He Cys Ser Gin He Ala Val Tyr Tyr Val Asn Lys Ser Asn Leu Gly 145 150 155 160
Met Gly He Asp Val Ser Lys Gly Glu Gin Gly Tyr Ser Asp Leu Ser
165 170 175
Phe Leu Ser Arg Lys His Phe Asn Asp Phe He Lys Arg Val Asp Thr
180 185 190
Leu Ser Leu Ser Asp Glu Asp Leu Glu Arg He Arg Glu Lys Lys Asn
195 200 205
Gin Glu He Glu Asp Cys Leu Met Arg Leu Asn Asn Asn He Tyr Asn
210 215 220
Lys Glu Lys Asn Phe Leu Ser Glu His Asn Arg Val Tyr Leu Val He 225 230 235 240
Ala Ser He He Ala Asn Leu Gly He Pro Asn Leu Val Thr Pro Leu
245 250 255
Asn Lys Glu Asp Leu Lys Ser Ser Asp Glu Val His Gin Arg Asp Gly
260 265 270
Asp He Met Leu Arg Lys He Gin Ser Phe Leu Glu Asn Lys Asp Leu
275 280 285
Ser Pro Glu Lys Arg Gin Ser He He Ser Ser Leu Glu Thr Leu Leu
290 295 300
Arg Asn Glu Asn Asn Asn Lys Ala Thr Asn Gly Glu Ser Cys Leu Lys 305 310 315 320
Arg Cys Phe Ser Glu He Val Asp Ser Leu Gly He Tyr Tyr Lys He
325 330 335
Gly Leu Ser Thr Asp Phe Thr Gly Lys Leu Phe Asn Glu Met Tyr Arg
340 345 350
Trp Leu Gly Phe Thr Lys Asp Gin Leu Asn Asp Val Val Leu Thr Pro 355 360 365
Pro Tyr Val Ala Thr Leu Leu Ala Arg Leu Ser Lys Val Asn Lys Asp
370 375 380
Ser Phe Val Trp Asp Phe Ala Thr Gly Ser Ala Gly Leu Leu Val Ala 385 390 395 400
Ser Met Asn Leu Met He Glu Asp Ala Lys Lys Arg He Thr Ser Pro
405 410 415
Glu Glu Leu Glu Gin Lys He Ala His He Lys Ala Lys Gin Leu Leu
420 425 430
Gly He Glu He Leu Ser Asp He His Thr Leu Ala Val Leu Asn Met
435 440 445
He Leu Met Gly Asp Gly Ser Ser Gin He Leu Asn Gin Asp Gly Leu
450 455 460
Ser Gly Phe Asp Gly Lys Val Asn Asn Glu Ala Phe Lys Ala Asn Ala 465 470 475 480
Phe Val Leu Asn Pro Pro Tyr Ser Ala Ser Gly Asn Gly Met Val Phe
485 490 495
Val Glu Gin Ala Leu Glu Lys Met Gin Ser Gly Tyr Ala Ser Val He
500 505 510
He Gin Ser Ser Ala Gly Ser Gly Lys Ala Lys Glu Tyr Asn Val Arg
515 520 525
He Leu Glu Lys His Thr Leu Leu Ala Ser He Lys Met Pro Leu Asp
530 535 540
Leu Phe He Gly Lys Ser Ser Val Gin Thr His He Tyr Val Phe Arg 545 550 555 560
Val Asn Glu Lys His Asp Ala Lys Gin Arg Val Lys Phe He Asn Phe
565 570 575
Ser Asn Asp Gly Tyr Ala Arg Ala Asn Arg Lys Lys Ala Lys Ala Ser
580 585 590
His Asn Leu Lys Asp Thr His Asn Ala Lys Glu Arg Tyr Asn Glu Val
595 600 605
Val Asp Leu Val His He Gly Gin Ser Cys Leu Lys Phe Leu Ser Glu
610 615 620
Asp Asp Tyr Tyr Glu Asn Thr He Asp Pro Lys Asn Gly Ser Asp Trp 625 630 635 640
Asn Gin Asn Lys Pro Thr Asp Thr Lys Pro Glu Leu Glu Asp Phe Lys
645 650 655
Arg Thr He Ala Asp Tyr Leu Ser Tyr Glu Val Ser Leu He Leu Lys
660 665 670
Asn Gin Met Pro Pro Lys Arg 675
(2) INFORMATION FOR SEQ ID NO: 1097:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 644 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25...597 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1097:
ATCCTTTGAT TTCAAAGGCT TAAA ATG TAT GTG GTG TTA GAA GGC GTT GAT 51
Met Tyr Val Val Leu Glu Gly Val Asp 1 5
GGC GCG GGC AAA AGC ACT CAA GTA GAA TTA TTA AAA GAC CGG TTT AAA 99 Gly Ala Gly Lys Ser Thr Gin Val Glu Leu Leu Lys Asp Arg Phe Lys 10 15 20 25
AAC GCC CTT TTT ACC AAA GAG CCA GGG GGG ACG AGA ATG GGC GAG AGT 147 Asn Ala Leu Phe Thr Lys Glu Pro Gly Gly Thr Arg Met Gly Glu Ser 30 35 40
TTA AGG CGT ATC GCT TTG AAT GAA AAC ATT AGC GAA TTG GCT AGA GCG 195 Leu Arg Arg He Ala Leu Asn Glu Asn He Ser Glu Leu Ala Arg Ala 45 50 55
TTT TTA TTC TTA AGC GAT AGG GCT GAG CAT ACA GAA AGC GTG ATA AAA 243 Phe Leu Phe Leu Ser Asp Arg Ala Glu His Thr Glu Ser Val He Lys 60 65 70
CCG GCA TTG AAA GAA AAA AAG CTC ATC ATT AGC GAC AGG AGC TTG ATC 291 Pro Ala Leu Lys Glu Lys Lys Leu He He Ser Asp Arg Ser Leu He 75 80 85
TCT GGC ATG GCT TAT AGC CAA TTT TCA AGC TTA GAA TTA AAC CTG CTT 339 Ser Gly Met Ala Tyr Ser Gin Phe Ser Ser Leu Glu Leu Asn Leu Leu 90 95 100 105
GCC ACC CAA AGC GTC TTG CCT GCA AAA ATC ATT CTT TTA CTC ATA GAC 387 Ala Thr Gin Ser Val Leu Pro Ala Lys He He Leu Leu Leu He Asp 110 115 120
AAA GAG GGC TTA AAA CAG CGC TTA AGC CTT AAA AGT TTA GAT AAA ATA 435 Lys Glu Gly Leu Lys Gin Arg Leu Ser Leu Lys Ser Leu Asp Lys He 125 130 135
GAA AAC CAA GGC ATA GAA AAA TTA CTT CAT ATC CAG CAA AAG CTC AAA 483 Glu Asn Gin Gly He Glu Lys Leu Leu His He Gin Gin Lys Leu Lys 140 145 150
ACC CAC GCT TAT GCG TTA CAA GAA AAA TTT GGG TGC GAA GTT TTG GAA 531 Thr His Ala Tyr Ala Leu Gin Glu Lys Phe Gly Cys Glu Val Leu Glu 155 160 165
TTA GAC GCT AAA GAA AGC GTT AAA AAC TTG CAC GAA AAA ATC GCC GCT 579 Leu Asp Ala Lys Glu Ser Val Lys Asn Leu His Glu Lys He Ala Ala 170 175 180 185
TTT ATA AAA TGC GCT GTT TAACCTGTTT GAAGCTTTCT TTTAAGCCTC TTTGCCCA 635 Phe He Lys Cys Ala Val 190
AATTGCTTG 644
(2) INFORMATION FOR SEQ ID NO: 1098:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 191 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1098:
Met Tyr Val Val Leu Glu Gly Val Asp Gly Ala Gly Lys Ser Thr Gin
1 5 10 15
Val Glu Leu Leu Lys Asp Arg Phe Lys Asn Ala Leu Phe Thr Lys Glu
20 25 30
Pro Gly Gly Thr Arg Met Gly Glu Ser Leu Arg Arg He Ala Leu Asn
35 40 45
Glu Asn He Ser Glu Leu Ala Arg Ala Phe Leu Phe Leu Ser Asp Arg
50 55 60
Ala Glu His Thr Glu Ser Val He Lys Pro Ala Leu Lys Glu Lys Lys 65 70 75 80
Leu He He Ser Asp Arg Ser Leu He Ser Gly Met Ala Tyr Ser Gin
85 90 95
Phe Ser Ser Leu Glu Leu Asn Leu Leu Ala Thr Gin Ser Val Leu Pro
100 105 110
Ala Lys He He Leu Leu Leu He Asp Lys Glu Gly Leu Lys Gin Arg
115 120 125
Leu Ser Leu Lys Ser Leu Asp Lys He Glu Asn Gin Gly He Glu Lys
130 135 140
Leu Leu His He Gin Gin Lys Leu Lys Thr His Ala Tyr Ala Leu Gin 145 150 155 160
Glu Lys Phe Gly Cys Glu Val Leu Glu Leu Asp Ala Lys Glu Ser Val
165 170 175
Lys Asn Leu His Glu Lys He Ala Ala Phe He Lys Cys Ala Val 180 185 190
(2) INFORMATION FOR SEQ ID NO: 1099:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 620 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 23...583 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1099:
GCAAATCTTA TAAAGGACAT TC ATG AAA TTG GTT TTA GGC ATC AGT GGA GCG 52
Met Lys Leu Val Leu Gly He Ser Gly Ala 1 5 10
AGC GGG ATA CCC CTA GCC TTG CGG TTT TTA GAA AAA TTA CCC AAA GAA 100 Ser Gly He Pro Leu Ala Leu Arg Phe Leu Glu Lys Leu Pro Lys Glu 15 20 25
ATT GAA GTT TTT GTC GTG GCG TCT AAA AAC GCG CAT GTC GTG GCG TTA 148 He Glu Val Phe Val Val Ala Ser Lys Asn Ala His Val Val Ala Leu 30 35 40
GAA GAA TCT AAT ATT AAC CTT AAA AAC GCC ATG AAA GAT TTA CGG CCT 196 Glu Glu Ser Asn He Asn Leu Lys Asn Ala Met Lys Asp Leu Arg Pro 45 50 55
AGT GGT ACT TTT TTC AAC GAG CAA GAC ATC CAT GCG AGC ATC GCT TCA 244 Ser Gly Thr Phe Phe Asn Glu Gin Asp He His Ala Ser He Ala Ser 60 65 70
GGG AGT TAT GGT ATC CAT AAA ATG GCG ATC ATT CCA GCG AGC ATG GAC 292 Gly Ser Tyr Gly He His Lys Met Ala He He Pro Ala Ser Met Asp 75 80 85 90
ATG GTG GCT AAA ATC GCG CAT GGC TTT GGG GGG GAT TTG ATT TCT AGG 340 Met Val Ala Lys He Ala His Gly Phe Gly Gly Asp Leu He Ser Arg 95 100 105
AGT GCG TCT GTG ATG CTT AAA GAA AAG CGC CCC TTA CTC ATT GCC CCT 388 Ser Ala Ser Val Met Leu Lys Glu Lys Arg Pro Leu Leu He Ala Pro 110 115 120
AGA GAA ATG CCT TTA AGC GCT ATC ATG TTA GAA AAT TTG CTC AAA CTC 436 Arg Glu Met Pro Leu Ser Ala He Met Leu Glu Asn Leu Leu Lys Leu 125 130 135
TCC CAT TCT AAT GCA ATC ATT GCG CCG CCG ATG ATG ACT TAT TAC ACC 484 Ser His Ser Asn Ala He He Ala Pro Pro Met Met Thr Tyr Tyr Thr 140 145 150
CAG AGC AAG ACT TTA GAA GCG ATG CAA GAT TTT TTA GTG GGG AAG TGG 532 Gin Ser Lys Thr Leu Glu Ala Met Gin Asp Phe Leu Val Gly Lys Trp 155 160 165 170
TTT GAC AGC TTA GGG ATA GAA AAT GAC TTA TAC CCA CGA TGG GGA ATG 580 Phe Asp Ser Leu Gly He Glu Asn Asp Leu Tyr Pro Arg Trp Gly Met 175 180 185
AAC TGATGCAAAA AATCGGCATT TACCCGGGCA CTTTTGA 620 Asn
(2) INFORMATION FOR SEQ ID NO: 1100:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 187 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1100:
Met Lys Leu Val Leu Gly He Ser Gly Ala Ser Gly He Pro Leu Ala
1 5 10 15
Leu Arg Phe Leu Glu Lys Leu Pro Lys Glu He Glu Val Phe Val Val
20 25 30
Ala Ser Lys Asn Ala His Val Val Ala Leu Glu Glu Ser Asn He Asn
35 40 45
Leu Lys Asn Ala Met Lys Asp Leu Arg Pro Ser Gly Thr Phe Phe Asn
50 55 60
Glu Gin Asp He His Ala Ser He Ala Ser Gly Ser Tyr Gly He His 65 70 75 80
Lys Met Ala He He Pro Ala Ser Met Asp Met Val Ala Lys He Ala
85 90 95
His Gly Phe Gly Gly Asp Leu He Ser Arg Ser Ala Ser Val Met Leu
100 105 110
Lys Glu Lys Arg Pro Leu Leu He Ala Pro Arg Glu Met Pro Leu Ser
115 120 125
Ala He Met Leu Glu Asn Leu Leu Lys Leu Ser His Ser Asn Ala He
130 135 140
He Ala Pro Pro Met Met Thr Tyr Tyr Thr Gin Ser Lys Thr Leu Glu 145 150 155 160
Ala Met Gin Asp Phe Leu Val Gly Lys Trp Phe Asp Ser Leu Gly He
165 170 175
Glu Asn Asp Leu Tyr Pro Arg Trp Gly Met Asn 180 185
(2) INFORMATION FOR SEQ ID NO: 1101:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 341 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 13...309 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1101:
CTCCCTGAAG CG ATG CTC GCA TGG ATG TCT TGC TCG TTG AAA AAA GTA CCA 51 Met Leu Ala Trp Met Ser Cys Ser Leu Lys Lys Val Pro 1 5 10
CTA GGC CGT AAA TCT TTC ATG GCG TTT TTA AGG TTA ATA TTA GAT TCT 99 Leu Gly Arg Lys Ser Phe Met Ala Phe Leu Arg Leu He Leu Asp Ser 15 20 25
TCT AAC GCC ACG ACA TGC GCG TTT TTA GAC GCC ACG ACA AAA ACT TCA 147 Ser Asn Ala Thr Thr Cys Ala Phe Leu Asp Ala Thr Thr Lys Thr Ser 30 35 40 45
ATT TCT TTG GGT AAT TTT TCT AAA AAC CGC AAG GCT AGG GGT ATC CCG 195 He Ser Leu Gly Asn Phe Ser Lys Asn Arg Lys Ala Arg Gly He Pro 50 55 60
CTC GCT CCA CTG ATG CCT AAA ACC AAT TTC ATG AAT GTC CTT TAT AAG 243 Leu Ala Pro Leu Met Pro Lys Thr Asn Phe Met Asn Val Leu Tyr Lys 65 70 75
ATT TGC GCT TTA GAG CTG CTC AAC ACT TTT GCT TTG AGT ATT TTA TTG 291 He Cys Ala Leu Glu Leu Leu Asn Thr Phe Ala Leu Ser He Leu Leu 80 85 90
CTT TCT AAA TTT TTC GCT TGAATGATTT GATTAAGCGC GCCATTTTCT AG 341
Leu Ser Lys Phe Phe Ala 95
(2) INFORMATION FOR SEQ ID NO: 1102:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 99 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1102:
Met Leu Ala Trp Met Ser Cys Ser Leu Lys Lys Val Pro Leu Gly Arg
1 5 10 15
Lys Ser Phe Met Ala Phe Leu Arg Leu He Leu Asp Ser Ser Asn Ala
20 25 30
Thr Thr Cys Ala Phe Leu Asp Ala Thr Thr Lys Thr Ser He Ser Leu
35 40 45
Gly Asn Phe Ser Lys Asn Arg Lys Ala Arg Gly He Pro Leu Ala Pro
50 55 60
Leu Met Pro Lys Thr Asn Phe Met Asn Val Leu Tyr Lys He Cys Ala 65 70 75 80
Leu Glu Leu Leu Asn Thr Phe Ala Leu Ser He Leu Leu Leu Ser Lys
85 90 95
Phe Phe Ala
(2) INFORMATION FOR SEQ ID NO: 1103:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 858 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 85...822 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1103:
GGCGCCAACG ATTTTAATGA TCCTCATTGT GATTTTAGTG GTTGTCAAGC CTTTTTAAAG 60 ACAAGCCATG AAAAAAGAAA AGTC ATG AAA AAA GAA AAG CAT CTC AAG CAA 111
Met Lys Lys Glu Lys His Leu Lys Gin 1 5
GAA AAA ATC ATC AAC ATG TTT GAT GAT ATA GCC AGC TCT TAC GAT CAA 159 Glu Lys He He Asn Met Phe Asp Asp He Ala Ser Ser Tyr Asp Gin 10 15 20 25
GCC AAC CGC TTG ATG AGT TTT GGC TTA GAC GTT AAA TGG CGA GAA AGG 207 Ala Asn Arg Leu Met Ser Phe Gly Leu Asp Val Lys Trp Arg Glu Arg 30 35 40
GCT TGC GAG CAT GCG TTT TTA TTT TTA GAA AAC AAG AAA GCG TTA AGG 255 Ala Cys Glu His Ala Phe Leu Phe Leu Glu Asn Lys Lys Ala Leu Arg 45 50 55
CTT GTG GAT GTG GCA TGC GGG ACG GGG GAT ATG CTT GTG GCT TGG CAA 303 Leu Val Asp Val Ala Cys Gly Thr Gly Asp Met Leu Val Ala Trp Gin 60 65 70
AAA AGC GCT CTC AAT TGC GGT ATA GAG TTT AAG GAA TGT TTG GGG ATT 351 Lys Ser Ala Leu Asn Cys Gly He Glu Phe Lys Glu Cys Leu Gly He 75 80 85
GAC CCC TCT AAT AAC ATG CTT GAA TTA GCC ATC AAA AAA TGT GAA GAG 399 Asp Pro Ser Asn Asn Met Leu Glu Leu Ala He Lys Lys Cys Glu Glu 90 95 100 105
CTT GAA AAC AAA GCT TCT TTC ATC CAA GCT CAA GCC AAA GAT TTA AAA 447 Leu Glu Asn Lys Ala Ser Phe He Gin Ala Gin Ala Lys Asp Leu Lys 110 115 120
GGC GTT GAA AAT AAC AGC GTG GAT ATC CTC TCT ATT GCG TAT GGC TTG 495 Gly Val Glu Asn Asn Ser Val Asp He Leu Ser He Ala Tyr Gly Leu 125 130 135
CGT AAT GTC GTG GAA AGA CAA GAG GCC TTA AAA GAG TTT TTT AGG GTG 543 Arg Asn Val Val Glu Arg Gin Glu Ala Leu Lys Glu Phe Phe Arg Val 140 145 150
TTA AAA CCC AGG GGC GTT TTA GTG ATT TTA GAA TTT TTA AAA AAA GAC 591 Leu Lys Pro Arg Gly Val Leu Val He Leu Glu Phe Leu Lys Lys Asp 155 160 165
AAC CCC ACA TGG CTG GAT AAA ATC TCA GGG TTT TAC ACG AAT AAG GTT 639 Asn Pro Thr Trp Leu Asp Lys He Ser Gly Phe Tyr Thr Asn Lys Val 170 175 180 185
TTG CCT TTA GTG GGA GGG GCT ATC AGT AAG AAT TAT GGT GCT TAT TCT 687 Leu Pro Leu Val Gly Gly Ala He Ser Lys Asn Tyr Gly Ala Tyr Ser 190 195 200
TAT TTA CCG CAA TCC ATT GAG GGG TTT TTG AGT TTA GAG GGT TTG AAG 735 Tyr Leu Pro Gin Ser He Glu Gly Phe Leu Ser Leu Glu Gly Leu Lys 205 210 215
CAT GAA TTA AGA AAC GCA GGG TTT GAG ATT TTA AGG ACT GAA GAT TCT 783 His Glu Leu Arg Asn Ala Gly Phe Glu He Leu Arg Thr Glu Asp Ser 220 225 230
ATC GCT CAA ATT TCA ACG ACC ATG CTT GTT AAA AAA AAC TAAAGGAATG TT 834 He Ala Gin He Ser Thr Thr Met Leu Val Lys Lys Asn 235 240 245
ATGCAAGATG AATTATTTGA AACC 85£
(2) INFORMATION FOR SEQ ID NO: 1104:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 246 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1104:
Met Lys Lys Glu Lys His Leu Lys Gin Glu Lys He He Asn Met Phe
1 5 10 15
Asp Asp He Ala Ser Ser Tyr Asp Gin Ala Asn Arg Leu Met Ser Phe
20 25 30
Gly Leu Asp Val Lys Trp Arg Glu Arg Ala Cys Glu His Ala Phe Leu 35 40 45 Phe Leu Glu Asn Lys Lys Ala Leu Arg Leu Val Asp Val Ala Cys Gly
50 55 60
Thr Gly Asp Met Leu Val Ala Trp Gin Lys Ser Ala Leu Asn Cys Gly 65 70 75 80
He Glu Phe Lys Glu Cys Leu Gly He Asp Pro Ser Asn Asn Met Leu
85 90 95
Glu Leu Ala He Lys Lys Cys Glu Glu Leu Glu Asn Lys Ala Ser Phe
100 105 110
He Gin Ala Gin Ala Lys Asp Leu Lys Gly Val Glu Asn Asn Ser Val
115 120 125
Asp He Leu Ser He Ala Tyr Gly Leu Arg Asn Val Val Glu Arg Gin
130 135 140
Glu Ala Leu Lys Glu Phe Phe Arg Val Leu Lys Pro Arg Gly Val Leu 145 150 155 160
Val He Leu Glu Phe Leu Lys Lys Asp Asn Pro Thr Trp Leu Asp Lys
165 170 175
He Ser Gly Phe Tyr Thr Asn Lys Val Leu Pro Leu Val Gly Gly Ala
180 185 190
He Ser Lys Asn Tyr Gly Ala Tyr Ser Tyr Leu Pro Gin Ser He Glu
195 200 205
Gly Phe Leu Ser Leu Glu Gly Leu Lys His Glu Leu Arg Asn Ala Gly
210 215 220
Phe Glu He Leu Arg Thr Glu Asp Ser He Ala Gin He Ser Thr Thr 225 230 235 240
Met Leu Val Lys Lys Asn 245
(2) INFORMATION FOR SEQ ID NO: 1105:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1443 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 57...1403 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1105:
AAAATTCTGA GATTTTATAT ATTTTATATT TATCGTTAGG TTTTAGGTTT AAAGTT ATG 59
Met 1
GGG AGG AAT CAA GGA GCT TAT TTG GAT CCG TCT GAA TCG ATT TTG ATG 107 Gly Arg Asn Gin Gly Ala Tyr Leu Asp Pro Ser Glu Ser He Leu Met 5 10 15
TTG ATG GTT GCT TTT TTA TTG GTG CTG TTG AAC GCT TTT TTT GTG CTT 155 Leu Met Val Ala Phe Leu Leu Val Leu Leu Asn Ala Phe Phe Val Leu 20 25 30
TCA GAG TTT GCC CTT GTG AAA GTG CGT AAA ACC CGC TTA GAA GAG CTG 203 Ser Glu Phe Ala Leu Val Lys Val Arg Lys Thr Arg Leu Glu Glu Leu 35 40 45
GTT AAA ATC GGT AAT TCC AAC GCT AAA CTC GCT TTA AAG ATG AGT CAA 251 Val Lys He Gly Asn Ser Asn Ala Lys Leu Ala Leu Lys Met Ser Gin 50 55 60 65
AGA CTA GAC ACT TAT TTG AGC GCG ACG CAG TTA GGC ATC ACC CTT TCT 299 Arg Leu Asp Thr Tyr Leu Ser Ala Thr Gin Leu Gly He Thr Leu Ser 70 75 80
TCA TTA GCT TTA GGC TGG GTG GGT GAG CCC GCT ATC GCA AAA TTG TTA 347 Ser Leu Ala Leu Gly Trp Val Gly Glu Pro Ala He Ala Lys Leu Leu 85 90 95
GCC GCG CTG TTT GAG TCT ATG GAT TTG AGA GAA AAT CCT ATT TTT ATC 395 Ala Ala Leu Phe Glu Ser Met Asp Leu Arg Glu Asn Pro He Phe He 100 105 110
CAT TCA ATG AGC GTG GTC ATA GCG TTT TTA AGC ATC ACT TTT TTG CAT 443 His Ser Met Ser Val Val He Ala Phe Leu Ser He Thr Phe Leu His 115 120 125
GTC GTG TTG GGC GAG ATT GTG CCT AAA TCT TTA GCG ATC GCT AAA TCT 491 Val Val Leu Gly Glu He Val Pro Lys Ser Leu Ala He Ala Lys Ser 130 135 140 145
GAA AAA GCC ACC CTT TTT GCC GCA CGC CCT TTG CAT GTG TTT TGG GTG 539 Glu Lys Ala Thr Leu Phe Ala Ala Arg Pro Leu His Val Phe Trp Val 150 155 160
GTG TTT TAT CCG GTG GTG CGT TTG TTT GAT GTG ATC GCT CAT TTT TTT 587 Val Phe Tyr Pro Val Val Arg Leu Phe Asp Val He Ala His Phe Phe 165 170 175
TTG AAA AAG ATG GGC ATC AAT CCT AAA GAG CAT GAC GGC ACG CAT TCT 635 Leu Lys Lys Met Gly He Asn Pro Lys Glu His Asp Gly Thr His Ser 180 185 190
GAA GAA GAG TTA AAA ATC ATT GTG GGC GAG AGT TTG AGA GAG GGC ATT 683 Glu Glu Glu Leu Lys He He Val Gly Glu Ser Leu Arg Glu Gly He 195 200 205
ATT GAT TCA GTG GAG GGC GAA ATC ATT AAA AAC GCA GTG GAT TTT TCT 731 He Asp Ser Val Glu Gly Glu He He Lys Asn Ala Val Asp Phe Ser 210 215 220 225
GAC ACG AGC GCT AAA GAA ATC ATG ACC CCA CGA AAA GAC ATG GTG TGT 779 Asp Thr Ser Ala Lys Glu He Met Thr Pro Arg Lys Asp Met Val Cys 230 235 240 TTG GAT GAA GAA AAC AGC TAT GAA GAA AAT ATA GAC ATT GTT TTA AAA 827 Leu Asp Glu Glu Asn Ser Tyr Glu Glu Asn He Asp He Val Leu Lys 245 250 255
GGC CAT TTC ACG CGC TAC CCT TAT TGC AAG GGT TCT AAG GAT AAC ATT 875 Gly His Phe Thr Arg Tyr Pro Tyr Cys Lys Gly Ser Lys Asp Asn He 260 265 270
ATC GGC ATG GTG CAT ATT AGG GAT TTG CTT TCG CGC TCT ATT TTT ACC 923 He Gly Met Val His He Arg Asp Leu Leu Ser Arg Ser He Phe Thr 275 280 285
CCC AAA ATG CAT GAT TTC AAT CAA ATC GTT AGG AAA ATG ATC ATC GTC 971 Pro Lys Met His Asp Phe Asn Gin He Val Arg Lys Met He He Val 290 295 300 305
CCC GAA AGC GCT TCC ATT TCT CAA ATC CTT ATT AAA ATG AAA AAA GAG 1019 Pro Glu Ser Ala Ser He Ser Gin He Leu He Lys Met Lys Lys Glu 310 315 320
CAA ATC CAT ACC GCT TTG GTG ATT GAT GAA TAC GGC GGC ACA GCC GGG 1067 Gin He His Thr Ala Leu Val He Asp Glu Tyr Gly Gly Thr Ala Gly 325 330 335
TTG CTC ACT ATG GAA GAC ATC ATT GAA GAG ATC ATG GGC GAG ATT AGC 1115 Leu Leu Thr Met Glu Asp He He Glu Glu He Met Gly Glu He Ser 340 345 350
GAC GAA TAC GAC TTA AAA CAA GAG GGC ATA AAC AAG CTT GAA GAG GGC 1163 Asp Glu Tyr Asp Leu Lys Gin Glu Gly He Asn Lys Leu Glu Glu Gly 355 360 365
GTG TTT GAA TTA GAG GGC ATG CTG GAT TTA GAG AGC GTA GAA GAA GCG 1211 Val Phe Glu Leu Glu Gly Met Leu Asp Leu Glu Ser Val Glu Glu Ala 370 375 380 385
CTT CAC ATT GAA TTT GAT AAA GAA TGC GAG CAG GTA ACG CTT GGG GGC 1259 Leu His He Glu Phe Asp Lys Glu Cys Glu Gin Val Thr Leu Gly Gly 390 395 400
TAT GTT TTT AGC TTG TTA GAG CGC ATG CCT ATG GAG GGA GAT ACA ATC 1307 Tyr Val Phe Ser Leu Leu Glu Arg Met Pro Met Glu Gly Asp Thr He 405 410 415
GTT TCG CAT GGG TAT TCT TTT GAA GTC TTA AGC GTG GAT GGG GCT AGG 1355 Val Ser His Gly Tyr Ser Phe Glu Val Leu Ser Val Asp Gly Ala Arg 420 425 430
ATA AAA CGC TTA AAA GCG GTT AAA CAA GAT CAG GGA GAA AAT GAA GCA T 1404 He Lys Arg Leu Lys Ala Val Lys Gin Asp Gin Gly Glu Asn Glu Ala 435 440 445
GAAAAAAACA ACCCTCTTTG TATTGGGCTT ATTATTTAA 1443
(2) INFORMATION FOR SEQ ID NO: 1106: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 449 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1106:
Met Gly Arg Asn Gin Gly Ala Tyr Leu Asp Pro Ser Glu Ser He Leu
1 5 10 15
Met Leu Met Val Ala Phe Leu Leu Val Leu Leu Asn Ala Phe Phe Val
20 25 30
Leu Ser Glu Phe Ala Leu Val Lys Val Arg Lys Thr Arg Leu Glu Glu
35 40 45
Leu Val Lys He Gly Asn Ser Asn Ala Lys Leu Ala Leu Lys Met Ser
50 55 60
Gin Arg Leu Asp Thr Tyr Leu Ser Ala Thr Gin Leu Gly He Thr Leu 65 70 75 80
Ser Ser Leu Ala Leu Gly Trp Val Gly Glu Pro Ala He Ala Lys Leu
85 90 95
Leu Ala Ala Leu Phe Glu Ser Met Asp Leu Arg Glu Asn Pro He Phe
100 105 110
He His Ser Met Ser Val Val He Ala Phe Leu Ser He Thr Phe Leu
115 120 125
His Val Val Leu Gly Glu He Val Pro Lys Ser Leu Ala He Ala Lys
130 135 140
Ser Glu Lys Ala Thr Leu Phe Ala Ala Arg Pro Leu His Val Phe Trp 145 150 155 160
Val Val Phe Tyr Pro Val Val Arg Leu Phe Asp Val He Ala His Phe
165 170 175
Phe Leu Lys Lys Met Gly He Asn Pro Lys Glu His Asp Gly Thr His
180 185 190
Ser Glu Glu Glu Leu Lys He He Val Gly Glu Ser Leu Arg Glu Gly
195 200 205
He He Asp Ser Val Glu Gly Glu He He Lys Asn Ala Val Asp Phe
210 215 220
Ser Asp Thr Ser Ala Lys Glu He Met Thr Pro Arg Lys Asp Met Val 225 230 235 240
Cys Leu Asp Glu Glu Asn Ser Tyr Glu Glu Asn He Asp He Val Leu
245 250 255
Lys Gly His Phe Thr Arg Tyr Pro Tyr Cys Lys Gly Ser Lys Asp Asn
260 265 270
He He Gly Met Val His He Arg Asp Leu Leu Ser Arg Ser He Phe
275 280 285
Thr Pro Lys Met His Asp Phe Asn Gin He Val Arg Lys Met He He
290 295 300
Val Pro Glu Ser Ala Ser He Ser Gin He Leu He Lys Met Lys Lys 305 310 315 320
Glu Gin He His Thr Ala Leu Val He Asp Glu Tyr Gly Gly Thr Ala
325 330 335
Gly Leu Leu Thr Met Glu Asp He He Glu Glu He Met Gly Glu He
340 345 350
Ser Asp Glu Tyr Asp Leu Lys Gin Glu Gly He Asn Lys Leu Glu Glu 355 360 365
Gly Val Phe Glu Leu Glu Gly Met Leu Asp Leu Glu Ser Val Glu Glu
370 375 380
Ala Leu His He Glu Phe Asp Lys Glu Cys Glu Gin Val Thr Leu Gly 385 390 395 400
Gly Tyr Val Phe Ser Leu Leu Glu Arg Met Pro Met Glu Gly Asp Thr
405 410 415
He Val Ser His Gly Tyr Ser Phe Glu Val Leu Ser Val Asp Gly Ala
420 425 430
Arg He Lys Arg Leu Lys Ala Val Lys Gin Asp Gin Gly Glu Asn Glu
435 440 445
Ala
(2) INFORMATION FOR SEQ ID NO: 1107
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 394 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 47...367 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1107:
AGAAACCGGC ACGGTTACCA ACCAAGCGGT AACAATCTTT TTTAAA ATG GAG CGT 55
Met Glu Arg 1
TTG ATC ACT TCT TCT TTA TAC ACT TTT TTA AGC GAC TTT TTT TCT TTC 103 Leu He Thr Ser Ser Leu Tyr Thr Phe Leu Ser Asp Phe Phe Ser Phe 5 10 15
TTT TTC AAT TCC AAA GCG ATG GCG GTG TTC TTG CTT TTT TTT AAG CTC 151 Phe Phe Asn Ser Lys Ala Met Ala Val Phe Leu Leu Phe Phe Lys Leu 20 25 30 35
TCT AGC ATG AGC GAT TTT TCT TTC AAA TTG GCT TTA TCA AAG CGC TCT 199 Ser Ser Met Ser Asp Phe Ser Phe Lys Leu Ala Leu Ser Lys Arg Ser 40 45 50
AAA AAG CCT TCA ATT TCT TCT AAA TCT TCC CCA AAG TGC GCG GCT ACA 247 Lys Lys Pro Ser He Ser Ser Lys Ser Ser Pro Lys Cys Ala Ala Thr 55 60 65
ATG TTG TCT CTG ATT CTA GCA AAA CGC CTT CTT GAT TGC TCT CTT AAG 295 Met Leu Ser Leu He Leu Ala Lys Arg Leu Leu Asp Cys Ser Leu Lys 70 75 80
CGC TCC CTT AAA AAG CCC ACC CCA AAC ACC GCG CCC ACC ACA ATA TGC 343 Arg Ser Leu Lys Lys Pro Thr Pro Asn Thr Ala Pro Thr Thr He Cys 85 90 95
GTA GAG CTT ACG GGC AAG CCT AAT TGAGAGGCTA AAAGCACGGT GATGACT 394 Val Glu Leu Thr Gly Lys Pro Asn 100 105
(2) INFORMATION FOR SEQ ID NO: 1108:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 107 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1108:
Met Glu Arg Leu He Thr Ser Ser Leu Tyr Thr Phe Leu Ser Asp Phe
1 5 10 15
Phe Ser Phe Phe Phe Asn Ser Lys Ala Met Ala Val Phe Leu Leu Phe
20 25 30
Phe Lys Leu Ser Ser Met Ser Asp Phe Ser Phe Lys Leu Ala Leu Ser
35 40 45
Lys Arg Ser Lys Lys Pro Ser He Ser Ser Lys Ser Ser Pro Lys Cys
50 55 60
Ala Ala Thr Met Leu Ser Leu He Leu Ala Lys Arg Leu Leu Asp Cys 65 70 75 80
Ser Leu Lys Arg Ser Leu Lys Lys Pro Thr Pro Asn Thr Ala Pro Thr
85 90 95
Thr He Cys Val Glu Leu Thr Gly Lys Pro Asn 100 105
(2) INFORMATION FOR SEQ ID NO: 1109:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 342 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...321 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1109:
TCAAATTCTT CATAA ATG ATT TTT TCT TTT AAA AGG ACT TCT TTT TGC GTG 51 Met He Phe Ser Phe Lys Arg Thr Ser Phe Cys Val 1 5 10
AGC GTG CCG GTT TTG TCT ATA AAG ATT TTT TTC ACT TTA GCC AGA GTT 99 Ser Val Pro Val Leu Ser He Lys He Phe Phe Thr Leu Ala Arg Val 15 20 25
TCT AAA AAC AAC GCT TCT TTA AAC ACG ATC AAA GGG TTT TTA AAC ACC 147 Ser Lys Asn Asn Ala Ser Leu Asn Thr He Lys Gly Phe Leu Asn Thr 30 35 40
CCT ATC ACT AAC GCA ATG GGC GTA GCC AGA GCG AAC GCG CAA GGG CAG 195 Pro He Thr Asn Ala Met Gly Val Ala Arg Ala Asn Ala Gin Gly Gin 45 50 55 60
CTG ATG ACT AGC ACG CTA ATA CAC ACC ATT AAG GCT TTT TCA AAA TTA 243 Leu Met Thr Ser Thr Leu He His Thr He Lys Ala Phe Ser Lys Leu 65 70 75
CCC CCC AAA CCA AAT TGC CAT AAC AAA AAG CTT ACA AAG GCT AAA AAC 291 Pro Pro Lys Pro Asn Cys His Asn Lys Lys Leu Thr Lys Ala Lys Asn 80 85 90
AAC ACC GCT TTA GAA AAA ATA TCC GCA ATT TGATTCGCGC TACTCTCAAT T 342 Asn Thr Ala Leu Glu Lys He Ser Ala He 95 100
(2) INFORMATION FOR SEQ ID NO: 1110:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 102 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1110:
Met He Phe Ser Phe Lys Arg Thr Ser Phe Cys Val Ser Val Pro Val
1 5 10 15
Leu Ser He Lys He Phe Phe Thr Leu Ala Arg Val Ser Lys Asn Asn
20 25 30
Ala Ser Leu Asn Thr He Lys Gly Phe Leu Asn Thr Pro He Thr Asn
35 40 45
Ala Met Gly Val Ala Arg Ala Asn Ala Gin Gly Gin Leu Met Thr Ser
50 55 60
Thr Leu He His Thr He Lys Ala Phe Ser Lys Leu Pro Pro Lys Pro 65 70 75 80
Asn Cys His Asn Lys Lys Leu Thr Lys Ala Lys Asn Asn Thr Ala Leu 85 90 95 Glu Lys He Ser Ala He 100
(2) INFORMATION FOR SEQ ID NO: 1111:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1108 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...1062 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1111:
AGGCTTTTTG CTCTTGCCTT TTTTCCCATC ATG AGA CTT TAT GAG AGT TTA TTA 54
Met Arg Leu Tyr Glu Ser Leu Leu 1 5
GAA ATG TGC TTG AAT AAG GCA TGG GAG CAT CAA ACC CTA GCC TTA GAA 102 Glu Met Cys Leu Asn Lys Ala Trp Glu His Gin Thr Leu Ala Leu Glu 10 15 20
AAC CCA AGC GTA GCT TGC ATG GTG TTG GAT AAA AAC CAT GAG ATC TTG 150 Asn Pro Ser Val Ala Cys Met Val Leu Asp Lys Asn His Glu He Leu 25 30 35 40
AGT TTA GAA ACC CAC AAA AAA GCC AAA ACC CCG CAT GCA GAA GTC TTA 198 Ser Leu Glu Thr His Lys Lys Ala Lys Thr Pro His Ala Glu Val Leu 45 50 55
GCC GCC CAA TCA GCG CTA AAG ATT TTA CGC CCC AGT TTG AAA AAC GAT 246 Ala Ala Gin Ser Ala Leu Lys He Leu Arg Pro Ser Leu Lys Asn Asp 60 65 70
TTA GAA AAG TTA GAA GAC CCT AAA ACT TTA AGC GAT TTT TTA AAA ACG 294 Leu Glu Lys Leu Glu Asp Pro Lys Thr Leu Ser Asp Phe Leu Lys Thr 75 80 85
CAC CAC GAT AAC GCT TTT ACA GAC TGC GTT TTT TTA ATC ACC TTA GAG 342 His His Asp Asn Ala Phe Thr Asp Cys Val Phe Leu He Thr Leu Glu 90 95 100
CCA TGC AAT TCT TAT GGC AAA ACC CCG GCT TGT AGC GAA TTG TTA GAA 390 Pro Cys Asn Ser Tyr Gly Lys Thr Pro Ala Cys Ser Glu Leu Leu Glu 105 110 115 120
ATT TTA AAG CCT AAA AGA GTG GTC ATT GCC ACA GAA GAA AAC GAA GCT 438 He Leu Lys Pro Lys Arg Val Val He Ala Thr Glu Glu Asn Glu Ala 125 130 135
AAA AAA GGG GGT TTA GCA AGG CTA CAA AAG GCT CGT ATT GAA ACA ATA 486 Lys Lys Gly Gly Leu Ala Arg Leu Gin Lys Ala Arg He Glu Thr He 140 145 150
ATT TGC CAC AAT TTA GAA AAC AAA GCT AAA GAC TTG CTC TTG CCT TTT 534 He Cys His Asn Leu Glu Asn Lys Ala Lys Asp Leu Leu Leu Pro Phe 155 160 165
AGG GTA ATG GAA CAA AAG GGG CGT TTT AAT TTG TTC AAA CTC GCT TTA 582 Arg Val Met Glu Gin Lys Gly Arg Phe Asn Leu Phe Lys Leu Ala Leu 170 175 180
AGA ATG AAT GGG GAT TAC CAT CAT GGC AAG ATC ACC GGG CAA AAA AGC 630 Arg Met Asn Gly Asp Tyr His His Gly Lys He Thr Gly Gin Lys Ser 185 190 195 200
GTT ATT TTC ACG CAC AAC CAG CGA GCA ATA TGC GAC ACG CTT ATT GTT 678 Val He Phe Thr His Asn Gin Arg Ala He Cys Asp Thr Leu He Val 205 210 215
TCT GGG AAA ACC ATA AGA ACG GAC AAC CCC TTA TTG GAC GCT CGC TTT 726 Ser Gly Lys Thr He Arg Thr Asp Asn Pro Leu Leu Asp Ala Arg Phe 220 225 230
TGC GAC AGC TTT TAT CAA AAT AAA AAC CCC AAT ATC GCT ATT TTA TCC 774 Cys Asp Ser Phe Tyr Gin Asn Lys Asn Pro Asn He Ala He Leu Ser 235 240 245
AAG CGC TCA ATT GAC CCT AAT TCA AAA GTT TTT TCT GCG CCT AAT CGT 822 Lys Arg Ser He Asp Pro Asn Ser Lys Val Phe Ser Ala Pro Asn Arg 250 255 260
TTA GTT AAC ACT TTC CAT GAC CCC AAA GAT TTA CCC CTA GAG AAG GGG 870 Leu Val Asn Thr Phe His Asp Pro Lys Asp Leu Pro Leu Glu Lys Gly 265 270 275 280
TTT AAT TTC ATT GAA GGG GGG TGG GAA TTG TTT GAG AGC TTG AGG GAT 918 Phe Asn Phe He Glu Gly Gly Trp Glu Leu Phe Glu Ser Leu Arg Asp 285 290 295
AAA ATA GAC GCG TTG CTT TTG CAT TCG CAT GCG TCT ATG ATT GGC GAA 966 Lys He Asp Ala Leu Leu Leu His Ser His Ala Ser Met He Gly Glu 300 305 310
GCG TTT AAG GCA CTC GCT CTA AAA ACC CCT TTT AAA GGA CGG TTG TTG 1014 Ala Phe Lys Ala Leu Ala Leu Lys Thr Pro Phe Lys Gly Arg Leu Leu 315 320 325
CAT GCG CAA ATC TTA GAA AAT GAA GCC CTT TTA TGG ATA GAA AAC TCT T 1063 His Ala Gin He Leu Glu Asn Glu Ala Leu Leu Trp He Glu Asn Ser 330 335 340 AAGATTATAC CAGCCTTTGA ACGCTTATTC TTACAACAGC GATTC 1108
(2) INFORMATION FOR SEQ ID NO: 1112:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 344 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1112:
Met Arg Leu Tyr Glu Ser Leu Leu Glu Met Cys Leu Asn Lys Ala Trp
1 5 10 15
Glu His Gin Thr Leu Ala Leu Glu Asn Pro Ser Val Ala Cys Met Val
20 25 30
Leu Asp Lys Asn His Glu He Leu Ser Leu Glu Thr His Lys Lys Ala
35 40 45
Lys Thr Pro His Ala Glu Val Leu Ala Ala Gin Ser Ala Leu Lys He
50 55 60
Leu Arg Pro Ser Leu Lys Asn Asp Leu Glu Lys Leu Glu Asp Pro Lys 65 70 75 80
Thr Leu Ser Asp Phe Leu Lys Thr His His Asp Asn Ala Phe Thr Asp
85 90 95
Cys Val Phe Leu He Thr Leu Glu Pro Cys Asn Ser Tyr Gly Lys Thr
100 105 110
Pro Ala Cys Ser Glu Leu Leu Glu He Leu Lys Pro Lys Arg Val Val
115 120 125
He Ala Thr Glu Glu Asn Glu Ala Lys Lys Gly Gly Leu Ala Arg Leu
130 135 140
Gin Lys Ala Arg He Glu Thr He He Cys His Asn Leu Glu Asn Lys 145 150 155 160
Ala Lys Asp Leu Leu Leu Pro Phe Arg Val Met Glu Gin Lys Gly Arg
165 170 175
Phe Asn Leu Phe Lys Leu Ala Leu Arg Met Asn Gly Asp Tyr His His
180 185 190
Gly Lys He Thr Gly Gin Lys Ser Val He Phe Thr His Asn Gin Arg
195 200 205
Ala He Cys Asp Thr Leu He Val Ser Gly Lys Thr He Arg Thr Asp
210 215 220
Asn Pro Leu Leu Asp Ala Arg Phe Cys Asp Ser Phe Tyr Gin Asn Lys 225 230 235 240
Asn Pro Asn He Ala He Leu Ser Lys Arg Ser He Asp Pro Asn Ser
245 250 255
Lys Val Phe Ser Ala Pro Asn Arg Leu Val Asn Thr Phe His Asp Pro
260 265 270
Lys Asp Leu Pro Leu Glu Lys Gly Phe Asn Phe He Glu Gly Gly Trp
275 280 285
Glu Leu Phe Glu Ser Leu Arg Asp Lys He Asp Ala Leu Leu Leu His
290 295 300
Ser His Ala Ser Met He Gly Glu Ala Phe Lys Ala Leu Ala Leu Lys 305 310 315 320
Thr Pro Phe Lys Gly Arg Leu Leu His Ala Gin He Leu Glu Asn Glu 325 330 335
Ala Leu Leu Trp He Glu Asn Ser 340
(2) INFORMATION FOR SEQ ID NO: 1113:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 823 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 14...799 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1113:
ACGATTTTAA AAA ATG GCT AGA AGT TTC AAG CAT TCT CAA TAT CCT AAA 49 Met Ala Arg Ser Phe Lys His Ser Gin Tyr Pro Lys 1 5 10
ATT TTT AAG CCA CTA TAC CCT AAC AAC TTA ACG CTT TCA CTT AAA AAG 97 He Phe Lys Pro Leu Tyr Pro Asn Asn Leu Thr Leu Ser Leu Lys Lys 15 20 25
CAA CAT GTT ATA ATG ATC GCT ATT TTA TTT GAA AGG GTA TTT ATG GAA 145 Gin His Val He Met He Ala He Leu Phe Glu Arg Val Phe Met Glu 30 35 40
AGC GTT TTA AAT TTC CTA ACC AAT ATC AAT GTG ATT TTC ACC CTT TTG 193 Ser Val Leu Asn Phe Leu Thr Asn He Asn Val He Phe Thr Leu Leu 45 50 55 60
GGC TAT TTG ATT GGG GGG ATT CCT TTT GGC TAT GCG TTA ATG AAA ATC 241 Gly Tyr Leu He Gly Gly He Pro Phe Gly Tyr Ala Leu Met Lys He 65 70 75
TTT TAC GGC ATG GAT ATT ACT AAA ATC GGA TCG GGG GGC ATT GGC GCA 289 Phe Tyr Gly Met Asp He Thr Lys He Gly Ser Gly Gly He Gly Ala 80 85 90
ACG AAT GTC TTG CGT GCT TTA CAA AGT AAG GGC GTG AGT AAC GCT AAA 337 Thr Asn Val Leu Arg Ala Leu Gin Ser Lys Gly Val Ser Asn Ala Lys 95 100 105
CAA ATG GCC CTA TTA GTT TTA ATC TTG GAT CTC TTC AAA GGC ATG TTT 385 Gin Met Ala Leu Leu Val Leu He Leu Asp Leu Phe Lys Gly Met Phe 110 115 120 GCA GTA TTT TTG AGC AAA TTG TTT GGG TTG GAT TAT AGT TTG CAA TGG 433 Ala Val Phe Leu Ser Lys Leu Phe Gly Leu Asp Tyr Ser Leu Gin Trp 125 130 135 140
ATG GTC GCT ATC GCT AGC ATT TTA GGG CAT TGC TAT TCG CCT TTT TTG 481 Met Val Ala He Ala Ser He Leu Gly His Cys Tyr Ser Pro Phe Leu 145 150 155
AAT TTC AAT GGA GGT AAG GGC GTT TCT ACG ATC ATG GGC TCT GTG GTG 529 Asn Phe Asn Gly Gly Lys Gly Val Ser Thr He Met Gly Ser Val Val 160 165 170
TTG CTC ATC CCT ATT GAA AGT CTC ATC GGC TTA ACG GTG TGG TTT TTT 577 Leu Leu He Pro He Glu Ser Leu He Gly Leu Thr Val Trp Phe Phe 175 180 185
GTG GGT AAG GTG CTT AAA ATC TCT TCA CTC GCT AGC ATT CTA GGG GTA 625 Val Gly Lys Val Leu Lys He Ser Ser Leu Ala Ser He Leu Gly Val 190 195 200
GGC ACA GCG ACT GTT CTT ATC TTT TTT GTG CCT TAT ATG CAT ATC CCA 673 Gly Thr Ala Thr Val Leu He Phe Phe Val Pro Tyr Met His He Pro 205 210 215 220
GAC AGC GTC AAT ATC CTT AAA GAA GTC GGC ACG CAA ACG CCG ATG GTG 721 Asp Ser Val Asn He Leu Lys Glu Val Gly Thr Gin Thr Pro Met Val 225 230 235
CTT ATT TTT ATT TTC ACC CTT ATC AAG CAT GCG GGT AAT ATT TTT AAT 769 Leu He Phe He Phe Thr Leu He Lys His Ala Gly Asn He Phe Asn 240 245 250
TTA TTG GCC GGC AAG GAA AAG AAA GTC TTA TGAAAACTAA ACAAGGCGTT CAT 822 Leu Leu Ala Gly Lys Glu Lys Lys Val Leu 255 260
A 823
(2) INFORMATION FOR SEQ ID NO: 1114:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 262 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1114:
Met Ala Arg Ser Phe Lys His Ser Gin Tyr Pro Lys He Phe Lys Pro
1 5 10 15
Leu Tyr Pro Asn Asn Leu Thr Leu Ser Leu Lys Lys Gin His Val He
20 25 30
Met He Ala He Leu Phe Glu Arg Val Phe Met Glu Ser Val Leu Asn 35 40 45
Phe Leu Thr Asn He Asn Val He Phe Thr Leu Leu Gly Tyr Leu He
50 55 60
Gly Gly He Pro Phe Gly Tyr Ala Leu Met Lys He Phe Tyr Gly Met 65 70 75 80
Asp He Thr Lys He Gly Ser Gly Gly He Gly Ala Thr Asn Val Leu
85 90 95
Arg Ala Leu Gin Ser Lys Gly Val Ser Asn Ala Lys Gin Met Ala Leu
100 105 110
Leu Val Leu He Leu Asp Leu Phe Lys Gly Met Phe Ala Val Phe Leu
115 120 125
Ser Lys Leu Phe Gly Leu Asp Tyr Ser Leu Gin Trp Met Val Ala He
130 135 140
Ala Ser He Leu Gly His Cys Tyr Ser Pro Phe Leu Asn Phe Asn Gly 145 150 155 160
Gly Lys Gly Val Ser Thr He Met Gly Ser Val Val Leu Leu He Pro
165 170 175
He Glu Ser Leu He Gly Leu Thr Val Trp Phe Phe Val Gly Lys Val
180 185 190
Leu Lys He Ser Ser Leu Ala Ser He Leu Gly Val Gly Thr Ala Thr
195 200 205
Val Leu He Phe Phe Val Pro Tyr Met His He Pro Asp Ser Val Asn
210 215 220
He Leu Lys Glu Val Gly Thr Gin Thr Pro Met Val Leu He Phe He 225 230 235 240
Phe Thr Leu He Lys His Ala Gly Asn He Phe Asn Leu Leu Ala Gly
245 250 255
Lys Glu Lys Lys Val Leu 260
(2) INFORMATION FOR SEQ ID NO: 1115:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 404 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...381 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1115:
TTTATTGGCC GGCAAGGAAA AGAAAGTCTT ATG AAA ACT AAA CAA GGC GTT CAT 54
Met Lys Thr Lys Gin Gly Val His 1 5
ATC CAT AAC TTG GTG TTT GAG GCG ATT TTG GGG ATT TTA GAA TTT GAA 102 He His Asn Leu Val Phe Glu Ala He Leu Gly He Leu Glu Phe Glu 10 15 20
CGC TTA AAA CCC CAA AAA ATA AGC GTG AAT TTG GAT CTT TTC TAC ACG 150 Arg Leu Lys Pro Gin Lys He Ser Val Asn Leu Asp Leu Phe Tyr Thr 25 30 35 40
CAA TTA CCC AAT AAG GTT TAT TTA GAC TAC ATG GAA ATT CAA GAG CTT 198 Gin Leu Pro Asn Lys Val Tyr Leu Asp Tyr Met Glu He Gin Glu Leu 45 50 55
ATT CAA AAG ATG ATG CAA GAA AAC CAA TAC CTT CTC ATT GAA GAC GCC 246 He Gin Lys Met Met Gin Glu Asn Gin Tyr Leu Leu He Glu Asp Ala 60 65 70
CTG AAA GAT TTG AGC CAT GCT TTA AAA ACG CGC TAC AAG GAG ATC ACT 294 Leu Lys Asp Leu Ser His Ala Leu Lys Thr Arg Tyr Lys Glu He Thr 75 80 85
GAA CTT TAT TTA AAA ATC AGC AAG TTA GAG ATT TCT CCC AAT TCT CAA 342 Glu Leu Tyr Leu Lys He Ser Lys Leu Glu He Ser Pro Asn Ser Gin 90 95 100
GTG GGA GCG AGC GTG AAA ATC CGC TAT GAA AGC AAT CTT TAGCCTCTTT TT 393 Val Gly Ala Ser Val Lys He Arg Tyr Glu Ser Asn Leu 105 110 115
CCTTCTTATT G 404
(2) INFORMATION FOR SEQ ID NO: 1116:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 117 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1116:
Met Lys Thr Lys Gin Gly Val His He His Asn Leu Val Phe Glu Ala
1 5 10 15
He Leu Gly He Leu Glu Phe Glu Arg Leu Lys Pro Gin Lys He Ser
20 25 30
Val Asn Leu Asp Leu Phe Tyr Thr Gin Leu Pro Asn Lys Val Tyr Leu
35 40 45
Asp Tyr Met Glu He Gin Glu Leu He Gin Lys Met Met Gin Glu Asn
50 55 60
Gin Tyr Leu Leu He Glu Asp Ala Leu Lys Asp Leu Ser His Ala Leu 65 70 75 80
Lys Thr Arg Tyr Lys Glu He Thr Glu Leu Tyr Leu Lys He Ser Lys
85 90 95
Leu Glu He Ser Pro Asn Ser Gin Val Gly Ala Ser Val Lys He Arg
100 105 110
Tyr Glu Ser Asn Leu 115
(2) INFORMATION FOR SEQ ID NO: 1117:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1227 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...1209 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1117:
TAAAATAACG CTTATTTTAA ACTCTCAAAA AAGGAATCAA ACGCACTCAT C ATG GCT 57
Met Ala 1
AAA GAA ACG CTT GAA ATA ACC CCG GAT CTT TTG AAA AAC CCT TAT CAA 105 Lys Glu Thr Leu Glu He Thr Pro Asp Leu Leu Lys Asn Pro Tyr Gin 5 10 15
AAA ATC ATC AAT GCG AGC GCG AGC GTT TTT GAT GAA AAG CAT GGG CGA 153 Lys He He Asn Ala Ser Ala Ser Val Phe Asp Glu Lys His Gly Arg 20 25 30
TCG TTT TTT AGC ACG CAA TTT TAT GAA AAA ATT GAA CCT TAT TTA AAA 201 Ser Phe Phe Ser Thr Gin Phe Tyr Glu Lys He Glu Pro Tyr Leu Lys 35 40 45 50
GAA GTT TTA ACC CAT CCC ATT GAT TTA GAA TGC GAT CTA AAC ACC GCT 249 Glu Val Leu Thr His Pro He Asp Leu Glu Cys Asp Leu Asn Thr Ala 55 60 65
AAA AAA AAG AAC CGC TTA ACC CCT TTA AAA CAG CTT TTT AAA GCG TGT 297 Lys Lys Lys Asn Arg Leu Thr Pro Leu Lys Gin Leu Phe Lys Ala Cys 70 75 80
TTT AAC ACC GAA GAA ATT TTG ATT GTG AAT AAT AAC ACC AGC GCG ATT 345 Phe Asn Thr Glu Glu He Leu He Val Asn Asn Asn Thr Ser Ala He 85 90 95
TTC CTC ATC GCT AAC GCT TTA GCG CAA GAA AAA GAA ATC ATT GTT TCT 393 Phe Leu He Ala Asn Ala Leu Ala Gin Glu Lys Glu He He Val Ser 100 105 110
TAT GGC GAA TTA GTG GGG GGG GAT TTT AAC CTT AAA GAT ATT TTA TTA 441 Tyr Gly Glu Leu Val Gly Gly Asp Phe Asn Leu Lys Asp He Leu Leu 115 120 125 130
AAT AGT GGG GCT AGG CTG CAT TTA GTG GGG AAT ATT AAT CGC GCT TAT 489 Asn Ser Gly Ala Arg Leu His Leu Val Gly Asn He Asn Arg Ala Tyr 135 140 145
TTA AGG GAT TAC CGC TTA GCC TTG AAT GAA AAC AGC AAA ATA CTC TTT 537 Leu Arg Asp Tyr Arg Leu Ala Leu Asn Glu Asn Ser Lys He Leu Phe 150 155 160
AAA ACC CAC AAC CCC CAT TTT AAA AAA GAC ACG CCC TTT AAA GAT TTA 585 Lys Thr His Asn Pro His Phe Lys Lys Asp Thr Pro Phe Lys Asp Leu 165 170 175
CAA ACT CTT GCT AAA GAG CAT GAT CTC ATT GAT TAT TAC AAT TTA GGG 633 Gin Thr Leu Ala Lys Glu His Asp Leu He Asp Tyr Tyr Asn Leu Gly 180 185 190
GAT GTG GAT TTG TCA AAC AGA GTG GCT TTG GAA GAA ATT TTA GCC CTA 681 Asp Val Asp Leu Ser Asn Arg Val Ala Leu Glu Glu He Leu Ala Leu 195 200 205 210
AAA CCA TCG CTT TTA AGC TTT AGC GCG GAT AAA TTC TTT AAC AGT GCG 729 Lys Pro Ser Leu Leu Ser Phe Ser Ala Asp Lys Phe Phe Asn Ser Ala 215 220 225
CAA GCG GGC ATT ATT ATG GGG CAA AAA GAA CGG GTT GAA GCG TTA AAA 777 Gin Ala Gly He He Met Gly Gin Lys Glu Arg Val Glu Ala Leu Lys 230 235 240
AAC CAC CCC CTT TAT AGA GTT TTA AGG GTG GGT AAA ATC ACG CTC ACC 825 Asn His Pro Leu Tyr Arg Val Leu Arg Val Gly Lys He Thr Leu Thr 245 250 255
TTG CTT TTT TGC AGC CTA AAA GCA TGG ATA AAT CAT CAA GAA GAC ATT 873 Leu Leu Phe Cys Ser Leu Lys Ala Trp He Asn His Gin Glu Asp He 260 265 270
ACA ATC CAT GCG TTA TTG AAC CAA ACT AAA GAC GCA TTA TTG CAA AAA 921 Thr He His Ala Leu Leu Asn Gin Thr Lys Asp Ala Leu Leu Gin Lys 275 280 285 290
GCC CTC AAA CTC TAC GCT CTT TTA AAG CCT TTA GAA TTG AAT GTG AGC 969 Ala Leu Lys Leu Tyr Ala Leu Leu Lys Pro Leu Glu Leu Asn Val Ser 295 300 305
ATA GCC TCT AGC TTT TCT AAA ATA GGG AAT TTG TTT GGT AGG GAA TTA 1017 He Ala Ser Ser Phe Ser Lys He Gly Asn Leu Phe Gly Arg Glu Leu 310 315 320
GAA TCC TTT TGC GTG AAA ATC CAG CCC AAA AAC ACC CGT GCT TTA AAT 1065 Glu Ser Phe Cys Val Lys He Gin Pro Lys Asn Thr Arg Ala Leu Asn 325 330 335
AGT GAG AAA CTT TAT TTA AAG CTT TTC CAA AAA GGC GTT ATC GCA AGG 1113 Ser Glu Lys Leu Tyr Leu Lys Leu Phe Gin Lys Gly Val He Ala Arg 340 345 350
ATT TCA TGC GAA TTC GTG TGC TTT GAA GTC TTT AGC TTG AAT GAA AAA 1161 He Ser Cys Glu Phe Val Cys Phe Glu Val Phe Ser Leu Asn Glu Lys 355 360 365 370
GAT TTT GAA AAA ATC GCT CTG GTT TTA GAA GAA ATT CTT AAT AAA GCT T 1210 Asp Phe Glu Lys He Ala Leu Val Leu Glu Glu He Leu Asn Lys Ala 375 380 385
AAAAATTCGC TATAATA 1227
(2) INFORMATION FOR SEQ ID NO: 1118:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 386 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1118:
Met Ala Lys Glu Thr Leu Glu He Thr Pro Asp Leu Leu Lys Asn Pro
1 5 10 15
Tyr Gin Lys He He Asn Ala Ser Ala Ser Val Phe Asp Glu Lys His
20 25 30
Gly Arg Ser Phe Phe Ser Thr Gin Phe Tyr Glu Lys He Glu Pro Tyr
35 40 45
Leu Lys Glu Val Leu Thr His Pro He Asp Leu Glu Cys Asp Leu Asn
50 55 60
Thr Ala Lys Lys Lys Asn Arg Leu Thr Pro Leu Lys Gin Leu Phe Lys 65 70 75 80
Ala Cys Phe Asn Thr Glu Glu He Leu He Val Asn Asn Asn Thr Ser
85 90 95
Ala He Phe Leu He Ala Asn Ala Leu Ala Gin Glu Lys Glu He He
100 105 110
Val Ser Tyr Gly Glu Leu Val Gly Gly Asp Phe Asn Leu Lys Asp He
115 120 125
Leu Leu Asn Ser Gly Ala Arg Leu His Leu Val Gly Asn He Asn Arg
130 135 140
Ala Tyr Leu Arg Asp Tyr Arg Leu Ala Leu Asn Glu Asn Ser Lys He 145 150 155 160
Leu Phe Lys Thr His Asn Pro His Phe Lys Lys Asp Thr Pro Phe Lys
165 170 175
Asp Leu Gin Thr Leu Ala Lys Glu His Asp Leu He Asp Tyr Tyr Asn
180 185 190
Leu Gly Asp Val Asp Leu Ser Asn Arg Val Ala Leu Glu Glu He Leu
195 200 205
Ala Leu Lys Pro Ser Leu Leu Ser Phe Ser Ala Asp Lys Phe Phe Asn
210 215 220
Ser Ala Gin Ala Gly He He Met Gly Gin Lys Glu Arg Val Glu Ala 225 230 235 240 Leu Lys Asn His Pro Leu Tyr Arg Val Leu Arg Val Gly Lys He Thr
245 250 255
Leu Thr Leu Leu Phe Cys Ser Leu Lys Ala Trp He Asn His Gin Glu
260 265 270
Asp He Thr He His Ala Leu Leu Asn Gin Thr Lys Asp Ala Leu Leu
275 280 285
Gin Lys Ala Leu Lys Leu Tyr Ala Leu Leu Lys Pro Leu Glu Leu Asn
290 295 300
Val Ser He Ala Ser Ser Phe Ser Lys He Gly Asn Leu Phe Gly Arg 305 310 315 320
Glu Leu Glu Ser Phe Cys Val Lys He Gin Pro Lys Asn Thr Arg Ala
325 330 335
Leu Asn Ser Glu Lys Leu Tyr Leu Lys Leu Phe Gin Lys Gly Val He
340 345 350
Ala Arg He Ser Cys Glu Phe Val Cys Phe Glu Val Phe Ser Leu Asn
355 360 365
Glu Lys Asp Phe Glu Lys He Ala Leu Val Leu Glu Glu He Leu Asn
370 375 380
Lys Ala 385
(2) INFORMATION FOR SEQ ID NO: 1119:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1238 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 13...1197 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1119:
AGGAACTTAA GA ATG GAA AAA ATC AGC GAT CTT ATA GAA TGC ATT GCG TAT 51 Met Glu Lys He Ser Asp Leu He Glu Cys He Ala Tyr 1 5 10
GAA AAA AAT TTG CCT AAA GAG ATG ATT TCA AAA GTG ATT CAA GGC TGT 99 Glu Lys Asn Leu Pro Lys Glu Met He Ser Lys Val He Gin Gly Cys 15 20 25
TTG TTA AAA ATG GCG CAA AAT GAG TTA GAC CCC CTA GCA CGC TAC TTG 147 Leu Leu Lys Met Ala Gin Asn Glu Leu Asp Pro Leu Ala Arg Tyr Leu 30 35 40 45
GTG GTT GAA GAA AAC AAG CAG CTC CAG CTT ATC CAG TTG GTA GAA GTT 195 Val Val Glu Glu Asn Lys Gin Leu Gin Leu He Gin Leu Val Glu Val 50 55 60 TTA GAA GAT GGT GAT GAA AGA TTG GTT AAC GAC CCT TCT AAA TAC ATC 243 Leu Glu Asp Gly Asp Glu Arg Leu Val Asn Asp Pro Ser Lys Tyr He 65 70 75
AGC CTG TCT AAA GCC AAA GAA ATG GAT CCA AGC GTT AAG ATT AAA GAC 291 Ser Leu Ser Lys Ala Lys Glu Met Asp Pro Ser Val Lys He Lys Asp 80 85 90
GAA TTG TCC TAT AGC TTG AGT TTG GAG AGC ATG AAA CAA GGA GCG ATC 339 Glu Leu Ser Tyr Ser Leu Ser Leu Glu Ser Met Lys Gin Gly Ala He 95 100 105
AAC CGC CTT TTT AAA GAT TTG CAA TAC CAG TTA GAA AAA GCG TTA GAA 387 Asn Arg Leu Phe Lys Asp Leu Gin Tyr Gin Leu Glu Lys Ala Leu Glu 110 115 120 125
GAC AGC CAC TTT GAA GCG TTT CAA AAG CGT CTT AAC AGC GTT TTA ATG 435 Asp Ser His Phe Glu Ala Phe Gin Lys Arg Leu Asn Ser Val Leu Met 130 135 140
GGG CAA GTG ATT TTA GTG GAT CAC AAC CAA AAC ACC TTT ATT GAG ATT 483 Gly Gin Val He Leu Val Asp His Asn Gin Asn Thr Phe He Glu He 145 150 155
GAG CAG CAA TTT CAG GGC GTT CTT TCC ATG CGC CAT CGC ATC AAG GGC 531 Glu Gin Gin Phe Gin Gly Val Leu Ser Met Arg His Arg He Lys Gly 160 165 170
GAG AGT TTT AAA GTG GGC GAT AGC ATT AAA GCG GTT TTA ACG CAA GTC 579 Glu Ser Phe Lys Val Gly Asp Ser He Lys Ala Val Leu Thr Gin Val 175 180 185
AAA CGC ACG AAA AAA GGC TTA TTA TTA GAG CTG AGC CGC ACC ACC CCT 627 Lys Arg Thr Lys Lys Gly Leu Leu Leu Glu Leu Ser Arg Thr Thr Pro 190 195 200 205
AAA ATG CTT GAA GCT TTG TTG GAA TTG GAA GTC CCT GAA ATT AAA GAC 675 Lys Met Leu Glu Ala Leu Leu Glu Leu Glu Val Pro Glu He Lys Asp 210 215 220
AAA GAA ATT GAA ATC ATC CAT TGT GCG CGA ATC CCA GGC AAC AGA GCG 723 Lys Glu He Glu He He His Cys Ala Arg He Pro Gly Asn Arg Ala 225 230 235
AAA GTG AGC TTT TTT TCC CAT AAC GCT AGG ATT GAC CCC ATA GGC GCG 771 Lys Val Ser Phe Phe Ser His Asn Ala Arg He Asp Pro He Gly Ala 240 245 250
GCT GTG GGG GTT AAG GGC GTG CGC ATT AAT GCG ATC AGT AAC GAA TTG 819 Ala Val Gly Val Lys Gly Val Arg He Asn Ala He Ser Asn Glu Leu 255 260 265
AAT AAA GAA AAC ATT GAT TGC ATA GAA TAT TCT AAT GTG CCT GAA ATT 867 Asn Lys Glu Asn He Asp Cys He Glu Tyr Ser Asn Val Pro Glu He 270 275 280 285 TAC ATC ACT CTC GCA CTC GCT CCA GCC AAA ATT TTA AGC GTT GAA ATC 915 Tyr He Thr Leu Ala Leu Ala Pro Ala Lys He Leu Ser Val Glu He 290 295 300
AAA AAA ATC CCT ATA GAA GAA TTG AAT GCT GAA GAA AAA GAA TCC ATT 963 Lys Lys He Pro He Glu Glu Leu Asn Ala Glu Glu Lys Glu Ser He 305 310 315
CAA GAG CGT TTT ATC GTC AAT AAC CAT TTG CAA AAG GCT AAA GTG CGT 1011 Gin Glu Arg Phe He Val Asn Asn His Leu Gin Lys Ala Lys Val Arg 320 325 330
TTA TTG GAC ATT GAA AAA TCT AAG GCT ATC GGT AAG GGC GGG GTG AAT 1059 Leu Leu Asp He Glu Lys Ser Lys Ala He Gly Lys Gly Gly Val Asn 335 340 345
GTG TGC TTA GCG TCC ATG CTT ACA GGC TAT CAC ATA GAG TTT GAA ACC 1107 Val Cys Leu Ala Ser Met Leu Thr Gly Tyr His He Glu Phe Glu Thr 350 355 360 365
ATT CCT AGC GTG AAA GAA AAC GCA GAA AAT GAA AGC GAA AAA GAA ACG 1155 He Pro Ser Val Lys Glu Asn Ala Glu Asn Glu Ser Glu Lys Glu Thr 370 375 380
CCA AAA GTG GGG GTA GAA GCT TTA GAG TCT TTG TTT AAG AAT TAAGGGTAT 1206 Pro Lys Val Gly Val Glu Ala Leu Glu Ser Leu Phe Lys Asn 385 390 395
CTAAAATTCA ATCTCTAAAA AAGCTTTTAA CT 1238
(2) INFORMATION FOR SEQ ID NO: 1120:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 395 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1120:
Met Glu Lys He Ser Asp Leu He Glu Cys He Ala Tyr Glu Lys Asn
1 5 10 15
Leu Pro Lys Glu Met He Ser Lys Val He Gin Gly Cys Leu Leu Lys
20 25 30
Met Ala Gin Asn Glu Leu Asp Pro Leu Ala Arg Tyr Leu Val Val Glu
35 40 45
Glu Asn Lys Gin Leu Gin Leu He Gin Leu Val Glu Val Leu Glu Asp
50 55 60
Gly Asp Glu Arg Leu Val Asn Asp Pro Ser Lys Tyr He Ser Leu Ser 65 70 75 80
Lys Ala Lys Glu Met Asp Pro Ser Val Lys He Lys Asp Glu Leu Ser
85 90 95
Tyr Ser Leu Ser Leu Glu Ser Met Lys Gin Gly Ala He Asn Arg Leu 100 105 110
Phe Lys Asp Leu Gin Tyr Gin Leu Glu Lys Ala Leu Glu Asp Ser His
115 120 125
Phe Glu Ala Phe Gin Lys Arg Leu Asn Ser Val Leu Met Gly Gin Val
130 135 140
He Leu Val Asp His Asn Gin Asn Thr Phe He Glu He Glu Gin Gin 145 150 155 160
Phe Gin Gly Val Leu Ser Met Arg His Arg He Lys Gly Glu Ser Phe
165 170 175
Lys Val Gly Asp Ser He Lys Ala Val Leu Thr Gin Val Lys Arg Thr
180 185 190
Lys Lys Gly Leu Leu Leu Glu Leu Ser Arg Thr Thr Pro Lys Met Leu
195 200 205
Glu Ala Leu Leu Glu Leu Glu Val Pro Glu He Lys Asp Lys Glu He
210 215 220
Glu He He His Cys Ala Arg He Pro Gly Asn Arg Ala Lys Val Ser 225 230 235 240
Phe Phe Ser His Asn Ala Arg He Asp Pro He Gly Ala Ala Val Gly
245 250 255
Val Lys Gly Val Arg He Asn Ala He Ser Asn Glu Leu Asn Lys Glu
260 265 270
Asn He Asp Cys He Glu Tyr Ser Asn Val Pro Glu He Tyr He Thr
275 280 285
Leu Ala Leu Ala Pro Ala Lys He Leu Ser Val Glu He Lys Lys He
290 295 300
Pro He Glu Glu Leu Asn Ala Glu Glu Lys Glu Ser He Gin Glu Arg 305 310 315 320
Phe He Val Asn Asn His Leu Gin Lys Ala Lys Val Arg Leu Leu Asp
325 330 335
He Glu Lys Ser Lys Ala He Gly Lys Gly Gly Val Asn Val Cys Leu
340 345 350
Ala Ser Met Leu Thr Gly Tyr His He Glu Phe Glu Thr He Pro Ser
355 360 365
Val Lys Glu Asn Ala Glu Asn Glu Ser Glu Lys Glu Thr Pro Lys Val
370 375 380
Gly Val Glu Ala Leu Glu Ser Leu Phe Lys Asn 385 390 395
(2) INFORMATION FOR SEQ ID NO: 1121:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3903 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 21...3857 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1121:
AAGCGATGTA AGGAATTAAC ATG GAT TAT AAA AAA TTA GAT TTA CCC AAC 50
Met Asp Tyr Lys Lys Leu Asp Leu Pro Asn 1 5 10
ACA AAC TAC CCA AAT CAA GAG CAA CTG AAA GCT TTT GAA ACC GCT TTT 98 Thr Asn Tyr Pro Asn Gin Glu Gin Leu Lys Ala Phe Glu Thr Ala Phe 15 20 25
GAC GCC TTT TTA GAA ACC AAC CAA CAA GAA AAT GAA AAT CAC CAA AAC 146 Asp Ala Phe Leu Glu Thr Asn Gin Gin Glu Asn Glu Asn His Gin Asn 30 35 40
GAC GCT TTT AAT GAT TTA TTG AAA GGC GTT TTT AAA TAC AAG GTT AAG 194 Asp Ala Phe Asn Asp Leu Leu Lys Gly Val Phe Lys Tyr Lys Val Lys 45 50 55
CCC ACC AAA AAA ATA GAC AGC ACT ATT CTT AAT GAA AAT AAC GAA GTG 242 Pro Thr Lys Lys He Asp Ser Thr He Leu Asn Glu Asn Asn Glu Val 60 65 70
GAG GTG ATC ATT GAA TTT AAA GCC CTT AAA AAC CCC AAC GAA TTT ATT 290 Glu Val He He Glu Phe Lys Ala Leu Lys Asn Pro Asn Glu Phe He 75 80 85 90
AAA AAG GGC GAT TTG AAT GTT AAA GCC TTT CAT GAA AGC CTT TTG TCT 338 Lys Lys Gly Asp Leu Asn Val Lys Ala Phe His Glu Ser Leu Leu Ser 95 100 105
TAT CTC ACA GAA AGA AAA GAG GGT AAT AAC AAC CTT AAG CAT CTT ATC 386 Tyr Leu Thr Glu Arg Lys Glu Gly Asn Asn Asn Leu Lys His Leu He 110 115 120
TTA GCC ACT ATT AAA GAG CTT TAT ATC ATT GAT GCA AAC GAA TTT GAG 434 Leu Ala Thr He Lys Glu Leu Tyr He He Asp Ala Asn Glu Phe Glu 125 130 135
GTT TTT AAT AAA GAT AAA GAA ATT GAA AAC GCC TTT AAA AAT TGC CAC 482 Val Phe Asn Lys Asp Lys Glu He Glu Asn Ala Phe Lys Asn Cys His 140 145 150
GAT AGA AAG GGT AAC GAT ACA CGC ACA AAA GCG TTT TAT GAT GCT TGC 530 Asp Arg Lys Gly Asn Asp Thr Arg Thr Lys Ala Phe Tyr Asp Ala Cys 155 160 165 170
CAA AAG CGC CTT AAT GAG TTT GAT CGT TCT TTG AAA TAC CAC TAT ATC 578 Gin Lys Arg Leu Asn Glu Phe Asp Arg Ser Leu Lys Tyr His Tyr He 175 180 185
CCC CTC AAA AAA GAA AAT TTA GCC CTA ATC TAT CAA GCC CTA AGC CCT 626 Pro Leu Lys Lys Glu Asn Leu Ala Leu He Tyr Gin Ala Leu Ser Pro 190 195 200
AAT TTT TTG CTC AAA ATT CCA AAA TAT TCT GAC GCT AAC ACG CTT AAC 674 Asn Phe Leu Leu Lys He Pro Lys Tyr Ser Asp Ala Asn Thr Leu Asn 205 210 215
AAA GAT TTT TAT GAA GAA TTG CTT TAC ATT TTA GGG TTA GAA GAG CAA 722 Lys Asp Phe Tyr Glu Glu Leu Leu Tyr He Leu Gly Leu Glu Glu Gin 220 225 230
AAT GAC AAA GGG AAA ATT TTA ATC AAG CCC AGC CGC ACC CAA AAT TCC 770 Asn Asp Lys Gly Lys He Leu He Lys Pro Ser Arg Thr Gin Asn Ser 235 240 245 250
CTA AGC GAT GCT TTA AAA AAG GAA TAC AAA AAT TTA GAC GAT GAA GAA 818 Leu Ser Asp Ala Leu Lys Lys Glu Tyr Lys Asn Leu Asp Asp Glu Glu 255 260 265
GTC ATG GCG TTG CTC ATC GCT TGG AAT AAC CGC ATC TTG TTT TTA CGG 866 Val Met Ala Leu Leu He Ala Trp Asn Asn Arg He Leu Phe Leu Arg 270 275 280
CTT TTA GAA AGC CTT TTA ATT TCT TTT AAG CAT TTT GAA AAT CCT TTC 914 Leu Leu Glu Ser Leu Leu He Ser Phe Lys His Phe Glu Asn Pro Phe 285 290 295
TTA ACC ACA GAA AAC TTT GAA AAT TTC AAC GAT TTA AAC ACG CTC TTT 962 Leu Thr Thr Glu Asn Phe Glu Asn Phe Asn Asp Leu Asn Thr Leu Phe 300 305 310
TTT GAA GTC CTA GCC AAG AAA AAC AGC GAG CGC TTA CCA GAA ATT AAA 1010 Phe Glu Val Leu Ala Lys Lys Asn Ser Glu Arg Leu Pro Glu He Lys 315 320 325 330
GAA GAC AAG ATT TTA GAA AAA ATC CCT TAT TTG AAT TCC AGT TTG TTT 1058 Glu Asp Lys He Leu Glu Lys He Pro Tyr Leu Asn Ser Ser Leu Phe 335 340 345
GAT AAA ACG CCT TTA GAA TTA AAG GGG CAT GAA ATC AAG CTT TTA GAC 1106 Asp Lys Thr Pro Leu Glu Leu Lys Gly His Glu He Lys Leu Leu Asp 350 355 360
AAT AAA AAG CTA GAA ATC TAT AAA AAT TCC GTT CTC AAA AAA CAT AAA 1154 Asn Lys Lys Leu Glu He Tyr Lys Asn Ser Val Leu Lys Lys His Lys 365 370 375
GAT TAT CAA AAA GAA AAA CCT TTG CCC TTG CTA AAA TAC CTT TTT AAA 1202 Asp Tyr Gin Lys Glu Lys Pro Leu Pro Leu Leu Lys Tyr Leu Phe Lys 380 385 390
TTT TTG CGT CTT TAT AAA TTC ACC ACC ACC CCT AAA GAC ATT AAA GAT 1250 Phe Leu Arg Leu Tyr Lys Phe Thr Thr Thr Pro Lys Asp He Lys Asp 395 400 405 410
AAT ACC GAT ACC AGC GAA AGC CGT TTG ATT AAC CCT AGC GTT TTA GGG 1298 Asn Thr Asp Thr Ser Glu Ser Arg Leu He Asn Pro Ser Val Leu Gly 415 420 425 CTT GTT TTT GAA AAA CTC AAC GGC TAT AAA GAG GGG AGC TTT TAT ACC 1346 Leu Val Phe Glu Lys Leu Asn Gly Tyr Lys Glu Gly Ser Phe Tyr Thr 430 435 440
CCA AGC TTT ATC ACA AGC TAC ATG TGC AAA GAG AGC ATC ACG CCC ATC 1394 Pro Ser Phe He Thr Ser Tyr Met Cys Lys Glu Ser He Thr Pro He 445 450 455
GTG TTG GAT AAA TTC AAC GCC ATT TAT CAG TGG GAC TGC GAA AAT CTA 1442 Val Leu Asp Lys Phe Asn Ala He Tyr Gin Trp Asp Cys Glu Asn Leu 460 465 470
AAA GCG TTG CGA GGA GAA ATA GAC AGA AAT TTT TCA AAT GAA AAA GCT 1490 Lys Ala Leu Arg Gly Glu He Asp Arg Asn Phe Ser Asn Glu Lys Ala 475 480 485 490
AAA GAA TAC CTA AAC ACG CTT TTA ACC TTG CGT ATT TGC GAT CCG GCG 1538 Lys Glu Tyr Leu Asn Thr Leu Leu Thr Leu Arg He Cys Asp Pro Ala 495 500 505
GTG GGG AGC GGG CAT TTC TTG GTT TCA GCG CTC AAT GAA ATG GTG CGG 1586 Val Gly Ser Gly His Phe Leu Val Ser Ala Leu Asn Glu Met Val Arg 510 515 520
GTT GCT TAT GAG CTA GGA CTT ATT GCT TCC TTG TAT CGC TAC GAT CTT 1634 Val Ala Tyr Glu Leu Gly Leu He Ala Ser Leu Tyr Arg Tyr Asp Leu 525 530 535
AAA TTA GAA AAC GAT GAA ATC ATC ATT CAC CAC ACG CCA ACG GGT GAA 1682 Lys Leu Glu Asn Asp Glu He He He His His Thr Pro Thr Gly Glu 540 545 550
ATC TTT AAC TAC ATA AAA CCA GAT AGC GAA AAC GAC CCC CAC CAC CAC 1730 He Phe Asn Tyr He Lys Pro Asp Ser Glu Asn Asp Pro His His His 555 560 565 570
ATC CAA AAA GAA CTT TTT AAT CTT AAA AAA TCC ATT ATT GAA AAC TGC 1778 He Gin Lys Glu Leu Phe Asn Leu Lys Lys Ser He He Glu Asn Cys 575 580 585
CTT TTT GGC GTG GAT ATT AAC CCC AAT TCT TGC GAA ATC ACC AAG CTC 1826 Leu Phe Gly Val Asp He Asn Pro Asn Ser Cys Glu He Thr Lys Leu 590 595 600
AGG CTA TGG ATA GAG CTT TTA AAA TAC AGC TAT TAT ATT TTT GAA AAG 1874 Arg Leu Trp He Glu Leu Leu Lys Tyr Ser Tyr Tyr He Phe Glu Lys 605 610 615
GGC AAG AAC ACT AAC GCG CTT GAA ACC CTC CCC AAC ATT GAT ATT AAC 1922 Gly Lys Asn Thr Asn Ala Leu Glu Thr Leu Pro Asn He Asp He Asn 620 625 630
ATT AAG TGC GCT AAT TCG CTC ATT TCT AGG TTT GCC CTC AAA GAT AAA 1970 He Lys Cys Ala Asn Ser Leu He Ser Arg Phe Ala Leu Lys Asp Lys 635 640 645 650 GCC TTG TTA AAA AGC GAA AAA AAT AAA AAC CTA GAA TAC TCT ATC GCT 2018 Ala Leu Leu Lys Ser Glu Lys Asn Lys Asn Leu Glu Tyr Ser He Ala 655 660 665
GAA TAC AAA GAA CTC GTT AAA ATC TAT AAA GAC CCT AAA ATC TTA GAA 2066 Glu Tyr Lys Glu Leu Val Lys He Tyr Lys Asp Pro Lys He Leu Glu 670 675 680
ACC CTA ACG CAC CCC ATA AAA GAC TCT AAC GCC GTT AGA AAA TAC GCT 2114 Thr Leu Thr His Pro He Lys Asp Ser Asn Ala Val Arg Lys Tyr Ala 685 690 695
AAA GAA CGC CTT TAT CAA GAA CTA AAA CAA AAT CCT AAC AAA GAT TTT 2162 Lys Glu Arg Leu Tyr Gin Glu Leu Lys Gin Asn Pro Asn Lys Asp Phe 700 705 710
AAA AAG GCT CTC AAT GAT AGG ATA GAG AAA ATT AAA AAA GCT TTT AAA 2210 Lys Lys Ala Leu Asn Asp Arg He Glu Lys He Lys Lys Ala Phe Lys 715 720 725 730
CTC ACT TTA AAC CCC CCT CCA AAA GAA TTA AAA TTT AAA AAA TTT TTA 2258 Leu Thr Leu Asn Pro Pro Pro Lys Glu Leu Lys Phe Lys Lys Phe Leu 735 740 745
AAA GAG CAT TTA GAA CTC TAT GGC AAG AGT ATC TTA GAA GAG GCA AAC 2306 Lys Glu His Leu Glu Leu Tyr Gly Lys Ser He Leu Glu Glu Ala Asn 750 755 760
TAC AAC GGC TTA GAA TTG GAA GCC CTA GCA TTA GAA AAG CAA ATG GCG 2354 Tyr Asn Gly Leu Glu Leu Glu Ala Leu Ala Leu Glu Lys Gin Met Ala 765 770 775
AAT CTT TTT TTT GAT TAT AGA CCC TAC CCC AAA CTA GAC AAA TCG GAT 2402 Asn Leu Phe Phe Asp Tyr Arg Pro Tyr Pro Lys Leu Asp Lys Ser Asp 780 785 790
AAA GTA GTA GGA CTA GAA CAT TTT AAC CGC TAT GTC CTA ACA TCT TAT 2450 Lys Val Val Gly Leu Glu His Phe Asn Arg Tyr Val Leu Thr Ser Tyr 795 800 805 810
AAA GAT TTA CAA GAT GAA AAC GAA CGC TAC GCT AAC GCT CTT GAA TGG 2498 Lys Asp Leu Gin Asp Glu Asn Glu Arg Tyr Ala Asn Ala Leu Glu Trp 815 820 825
CGC TTT GAA TTC CCT GAA GTT TTA GAT GAT GAG GGG GAT TTT TCA GGC 2546 Arg Phe Glu Phe Pro Glu Val Leu Asp Asp Glu Gly Asp Phe Ser Gly 830 835 840
TTT GAT TGC ATC ATT GGG AAT CCA CCT TAT ATC CGC CAA GAA CAC ATC 2594 Phe Asp Cys He He Gly Asn Pro Pro Tyr He Arg Gin Glu His He 845 850 855
AAA GAC TTA AAG CCT TTA TTA GAA AAG CAA TAC CAA GAT TTC TAT AAC 2642 Lys Asp Leu Lys Pro Leu Leu Glu Lys Gin Tyr Gin Asp Phe Tyr Asn 860 865 870 AGC ACC GCT GAC ATT TAC ACC TAC TTT TTT GCC CTG GCT TTC CAC CTT 2690 Ser Thr Ala Asp He Tyr Thr Tyr Phe Phe Ala Leu Ala Phe His Leu 875 880 885 890
TTA AAA GAA AAG GGG TTT AGC GCT TTC ATC ACT TCT AAC AAA TAT ACG 2738 Leu Lys Glu Lys Gly Phe Ser Ala Phe He Thr Ser Asn Lys Tyr Thr 895 900 905
CGA GCC AAA TAC GGC GCT AAA TTG AGG GAA TGG CTG CTC AAA AAA ACC 2786 Arg Ala Lys Tyr Gly Ala Lys Leu Arg Glu Trp Leu Leu Lys Lys Thr 910 915 920
ACC ATC GTC AGC TAC ATG GAA CTA AAC GCC TTA AAA GTC TTT GAG AGC 2834 Thr He Val Ser Tyr Met Glu Leu Asn Ala Leu Lys Val Phe Glu Ser 925 930 935
GCT GCA GTG GAT ACC AGC ATC ATT CAT TTC ATC AAA CAA ACG CCC TCT 2882 Ala Ala Val Asp Thr Ser He He His Phe He Lys Gin Thr Pro Ser 940 945 950
AAA GAG AGC GAA TTT AAA TAT TAC GAA CCC ACC CCA AAC GAT AAA GAC 2930 Lys Glu Ser Glu Phe Lys Tyr Tyr Glu Pro Thr Pro Asn Asp Lys Asp 955 960 965 970
GAT TTG AAA AGC ACC CCA CAC CTT TTG ATG AAA CAA AAC GTG CTT TCA 2978 Asp Leu Lys Ser Thr Pro His Leu Leu Met Lys Gin Asn Val Leu Ser 975 980 985
ACA GAA AGC TTT ATT TTT GCC AAC GCC ACG CTT TTA GAT TTG AGG GAC 3026 Thr Glu Ser Phe He Phe Ala Asn Ala Thr Leu Leu Asp Leu Arg Asp 990 995 1000
AAA ATA GAG AGT GTT GGC ACC CCG CTT AAA GAC TGG GAC ATT CAA ATC 3074 Lys He Glu Ser Val Gly Thr Pro Leu Lys Asp Trp Asp He Gin He 1005 1010 1015
AAT TAT GGG ATA AAA ACC GGC GCG AAC GAA GCC TTT ATC ATT CCC ACT 3122 Asn Tyr Gly He Lys Thr Gly Ala Asn Glu Ala Phe He He Pro Thr 1020 1025 1030
GAA AAA AGA GAA GAG ATC TTA AAC GCT TGC AAG ACG CAA GAA GAA AGG 3170 Glu Lys Arg Glu Glu He Leu Asn Ala Cys Lys Thr Gin Glu Glu Arg 1035 1040 1045 1050
GAG CGC ACA GAG AGG CTT ATT AAG CCT ATT TTA AGA GGG AAA GAC ATT 3218 Glu Arg Thr Glu Arg Leu He Lys Pro He Leu Arg Gly Lys Asp He 1055 1060 1065
AAA AGG TAT TCT TAT GAG TGG GCG CAT TTG TGG GTT ATC AAC ACC CAT 3266 Lys Arg Tyr Ser Tyr Glu Trp Ala His Leu Trp Val He Asn Thr His 1070 1075 1080
AAC GGC TAC ACT TCT TCT CTC AAA TCC AAA ATC CCT CCC ATT GAT ATA 3314 Asn Gly Tyr Thr Ser Ser Leu Lys Ser Lys He Pro Pro He Asp He 1085 1090 1095 GAA AAA TAC CCC GCA ATT AAA GCG CAT TTA GAC GCT CAT TAC GAC ACT 3362 Glu Lys Tyr Pro Ala He Lys Ala His Leu Asp Ala His Tyr Asp Thr 1100 1105 1110
ATT GCA ACA CGA TGC GAT CAA GGA GAC ACC CCC TAT CAC TTA AGG AAT 3410 He Ala Thr Arg Cys Asp Gin Gly Asp Thr Pro Tyr His Leu Arg Asn 1115 1120 1125 1130
TGC GCG TAT TTA GAG GAT TTT GAA AAA GAG AAA ATT GTG TGG GCA AGT 3458 Cys Ala Tyr Leu Glu Asp Phe Glu Lys Glu Lys He Val Trp Ala Ser 1135 1140 1145
GTG GGA TTT GTT GAA TAT TGT ATG ATC CCA G&A TTA TTG ATA CTT GAT 3506 Val Gly Phe Val Glu Tyr Cys Met He Pro Gly Leu Leu He Leu Asp 1150 1155 1160
ACA AAT TAT TTT TTT GAA GTC AGT AAA TTT GGC AAT ACA AAA AAC TAT 3554 Thr Asn Tyr Phe Phe Glu Val Ser Lys Phe Gly Asn Thr Lys Asn Tyr 1165 1170 1175
TTG CTT GGA CTT TTA AAT TCA AAA TTG CTA ACT TTT TGG TTA AAA GCT 3602 Leu Leu Gly Leu Leu Asn Ser Lys Leu Leu Thr Phe Trp Leu Lys Ala 1180 1185 1190
AAA AAT ACA CCA TTA GGC GAT ATG GGA GCT TAT AGA AAT TAT AAG TAT 3650 Lys Asn Thr Pro Leu Gly Asp Met Gly Ala Tyr Arg Asn Tyr Lys Tyr 1195 1200 1205 1210
AAT ATT ATG GAG TTA CCG ATG GTA AAA ATA ACG GCA AAA AAT AAA AAA 3698 Asn He Met Glu Leu Pro Met Val Lys He Thr Ala Lys Asn Lys Lys 1215 1220 1225
ATC GCC GAT AAA ATC ATC GCT TTA GTG GAT AAA ATC CTA CAA GCA AAA 3746 He Ala Asp Lys He He Ala Leu Val Asp Lys He Leu Gin Ala Lys 1230 1235 1240
GAA AAA GAC CCT AAA GCC AAC ACC CAA AAG TTA GAA AAA GAA ATT GAC 3794 Glu Lys Asp Pro Lys Ala Asn Thr Gin Lys Leu Glu Lys Glu He Asp 1245 1250 1255
GCC TTA GTC TAT CAG CTC TAC CAC CTC ACC GAT GAA GAA ATT AAG ATC 3842 Ala Leu Val Tyr Gin Leu Tyr His Leu Thr Asp Glu Glu He Lys He 1260 1265 1270
ATT GAA GAG GGG CAG TGAATGGAAA AGTTATTTGA AAAGATATTG CATGAAATGA G 3898 He Glu Glu Gly Gin 1275 1
ATCAA 3903
(2) INFORMATION FOR SEQ ID NO:1122:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1279 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1122:
Met Asp Tyr Lys Lys Leu Asp Leu Pro Asn Thr Asn Tyr Pro Asn Gin
1 5 10 15
Glu Gin Leu Lys Ala Phe Glu Thr Ala Phe Asp Ala Phe Leu Glu Thr
20 25 30
Asn Gin Gin Glu Asn Glu Asn His Gin Asn Asp Ala Phe Asn Asp Leu
35 40 45
Leu Lys Gly Val Phe Lys Tyr Lys Val Lys Pro Thr Lys Lys He Asp
50 55 60
Ser Thr He Leu Asn Glu Asn Asn Glu Val Glu Val He He Glu Phe 65 70 75 80
Lys Ala Leu Lys Asn Pro Asn Glu Phe He Lys Lys Gly Asp Leu Asn
85 90 95
Val Lys Ala Phe His Glu Ser Leu Leu Ser Tyr Leu Thr Glu Arg Lys
100 105 110
Glu Gly Asn Asn Asn Leu Lys His Leu He Leu Ala Thr He Lys Glu
115 120 125
Leu Tyr He He Asp Ala Asn Glu Phe Glu Val Phe Asn Lys Asp Lys
130 135 140
Glu He Glu Asn Ala Phe Lys Asn Cys His Asp Arg Lys Gly Asn Asp 145 150 155 160
Thr Arg Thr Lys Ala Phe Tyr Asp Ala Cys Gin Lys Arg Leu Asn Glu
165 170 175
Phe Asp Arg Ser Leu Lys Tyr His Tyr He Pro Leu Lys Lys Glu Asn
180 185 190
Leu Ala Leu He Tyr Gin Ala Leu Ser Pro Asn Phe Leu Leu Lys He
195 200 205
Pro Lys Tyr Ser Asp Ala Asn Thr Leu Asn Lys Asp Phe Tyr Glu Glu
210 215 220
Leu Leu Tyr He Leu Gly Leu Glu Glu Gin Asn Asp Lys Gly Lys He 225 230 235 240
Leu He Lys Pro Ser Arg Thr Gin Asn Ser Leu Ser Asp Ala Leu Lys
245 250 255
Lys Glu Tyr Lys Asn Leu Asp Asp Glu Glu Val Met Ala Leu Leu He
260 265 270
Ala Trp Asn Asn Arg He Leu Phe Leu Arg Leu Leu Glu Ser Leu Leu
275 280 285
He Ser Phe Lys His Phe Glu Asn Pro Phe Leu Thr Thr Glu Asn Phe
290 295 300
Glu Asn Phe Asn Asp Leu Asn Thr Leu Phe Phe Glu Val Leu Ala Lys 305 310 315 320
Lys Asn Ser Glu Arg Leu Pro Glu He Lys Glu Asp Lys He Leu Glu
325 330 335
Lys He Pro Tyr Leu Asn Ser Ser Leu Phe Asp Lys Thr Pro Leu Glu
340 345 350
Leu Lys Gly His Glu He Lys Leu Leu Asp Asn Lys Lys Leu Glu He
355 360 365
Tyr Lys Asn Ser Val Leu Lys Lys His Lys Asp Tyr Gin Lys Glu Lys 370 375 380 Pro Leu Pro Leu Leu Lys Tyr Leu Phe Lys Phe Leu Arg Leu Tyr Lys 385 390 395 400
Phe Thr Thr Thr Pro Lys Asp He Lys Asp Asn Thr Asp Thr Ser Glu
405 410 415
Ser Arg Leu He Asn Pro Ser Val Leu Gly Leu Val Phe Glu Lys Leu
420 425 430
Asn Gly Tyr Lys Glu Gly Ser Phe Tyr Thr Pro Ser Phe He Thr Ser
435 440 445
Tyr Met Cys Lys Glu Ser He Thr Pro He Val Leu Asp Lys Phe Asn
450 455 460
Ala He Tyr Gin Trp Asp Cys Glu Asn Leu Lys Ala Leu Arg Gly Glu 465 470 475 480
He Asp Arg Asn Phe Ser Asn Glu Lys Ala Lys Glu Tyr Leu Asn Thr
485 490 495
Leu Leu Thr Leu Arg He Cys Asp Pro Ala Val Gly Ser Gly His Phe
500 505 510
Leu Val Ser Ala Leu Asn Glu Met Val Arg Val Ala Tyr Glu Leu Gly
515 520 525
Leu He Ala Ser Leu Tyr Arg Tyr Asp Leu Lys Leu Glu Asn Asp Glu
530 535 540
He He He His His Thr Pro Thr Gly Glu He Phe Asn Tyr He Lys 545 550 555 560
Pro Asp Ser Glu Asn Asp Pro His His His He Gin Lys Glu Leu Phe
565 570 575
Asn Leu Lys Lys Ser He He Glu Asn Cys Leu Phe Gly Val Asp He
580 585 590
Asn Pro Asn Ser Cys Glu He Thr Lys Leu Arg Leu Trp He Glu Leu
595 600 605
Leu Lys Tyr Ser Tyr Tyr He Phe Glu Lys Gly Lys Asn Thr Asn Ala
610 615 620
Leu Glu Thr Leu Pro Asn He Asp He Asn He Lys Cys Ala Asn Ser 625 630 635 640
Leu He Ser Arg Phe Ala Leu Lys Asp Lys Ala Leu Leu Lys Ser Glu
645 650 655
Lys Asn Lys Asn Leu Glu Tyr Ser He Ala Glu Tyr Lys Glu Leu Val
660 665 670
Lys He Tyr Lys Asp Pro Lys He Leu Glu Thr Leu Thr His Pro He
675 680 685
Lys Asp Ser Asn Ala Val Arg Lys Tyr Ala Lys Glu Arg Leu Tyr Gin
690 695 700
Glu Leu Lys Gin Asn Pro Asn Lys Asp Phe Lys Lys Ala Leu Asn Asp 705 710 715 720
Arg He Glu Lys He Lys Lys Ala Phe Lys Leu Thr Leu Asn Pro Pro
725 730 735
Pro Lys Glu Leu Lys Phe Lys Lys Phe Leu Lys Glu His Leu Glu Leu
740 745 750
Tyr Gly Lys Ser He Leu Glu Glu Ala Asn Tyr Asn Gly Leu Glu Leu
755 760 765
Glu Ala Leu Ala Leu Glu Lys Gin Met Ala Asn Leu Phe Phe Asp Tyr
770 775 780
Arg Pro Tyr Pro Lys Leu Asp Lys Ser Asp Lys Val Val Gly Leu Glu 785 790 795 800
His Phe Asn Arg Tyr Val Leu Thr Ser Tyr Lys Asp Leu Gin Asp Glu
805 810 815
Asn Glu Arg Tyr Ala Asn Ala Leu Glu Trp Arg Phe Glu Phe Pro Glu 820 825 830
Val Leu Asp Asp Glu Gly Asp Phe Ser Gly Phe Asp Cys He He Gly
835 840 845
Asn Pro Pro Tyr He Arg Gin Glu His He Lys Asp Leu Lys Pro Leu
850 855 860
Leu Glu Lys Gin Tyr Gin Asp Phe Tyr Asn Ser Thr Ala Asp He Tyr 865 870 875 880
Thr Tyr Phe Phe Ala Leu Ala Phe His Leu Leu Lys Glu Lys Gly Phe
885 890 895
Ser Ala Phe He Thr Ser Asn Lys Tyr Thr Arg Ala Lys Tyr Gly Ala
900 905 910
Lys Leu Arg Glu Trp Leu Leu Lys Lys Thr Thr He Val Ser Tyr Met
915 920 925
Glu Leu Asn Ala Leu Lys Val Phe Glu Ser Ala Ala Val Asp Thr Ser
930 935 940
He He His Phe He Lys Gin Thr Pro Ser Lys Glu Ser Glu Phe Lys 945 950 955 960
Tyr Tyr Glu Pro Thr Pro Asn Asp Lys Asp Asp Leu Lys Ser Thr Pro
965 970 975
His Leu Leu Met Lys Gin Asn Val Leu Ser Thr Glu Ser Phe He Phe
980 985 990
Ala Asn Ala Thr Leu Leu Asp Leu Arg Asp Lys He Glu Ser Val Gly
995 1000 1005
Thr Pro Leu Lys Asp Trp Asp He Gin He Asn Tyr Gly He Lys Thr
1010 1015 1020
Gly Ala Asn Glu Ala Phe He He Pro Thr Glu Lys Arg Glu Glu He 025 1030 1035 1040
Leu Asn Ala Cys Lys Thr Gin Glu Glu Arg Glu Arg Thr Glu Arg Leu
1045 1050 1055
He Lys Pro He Leu Arg Gly Lys Asp He Lys Arg Tyr Ser Tyr Glu
1060 1065 1070
Trp Ala His Leu Trp Val He Asn Thr His Asn Gly Tyr Thr Ser Ser
1075 1080 1085
Leu Lys Ser Lys He Pro Pro He Asp He Glu Lys Tyr Pro Ala He
1090 1095 1100
Lys Ala His Leu Asp Ala His Tyr Asp Thr He Ala Thr Arg Cys Asp 105 1110 1115 1120
Gin Gly Asp Thr Pro Tyr His Leu Arg Asn Cys Ala Tyr Leu Glu Asp
1125 1130 1135
Phe Glu Lys Glu Lys He Val Trp Ala Ser Val Gly Phe Val Glu Tyr
1140 1145 1150
Cys Met He Pro Gly Leu Leu He Leu Asp Thr Asn Tyr Phe Phe Glu
1155 1160 1165
Val Ser Lys Phe Gly Asn Thr Lys Asn Tyr Leu Leu Gly Leu Leu Asn
1170 1175 1180
Ser Lys Leu Leu Thr Phe Trp Leu Lys Ala Lys Asn Thr Pro Leu Gly 185 1190 1195 1200
Asp Met Gly Ala Tyr Arg Asn Tyr Lys Tyr Asn He Met Glu Leu Pro
1205 1210 1215
Met Val Lys He Thr Ala Lys Asn Lys Lys He Ala Asp Lys He He
1220 1225 1230
Ala Leu Val Asp Lys He Leu Gin Ala Lys Glu Lys Asp Pro Lys Ala
1235 1240 1245
Asn Thr Gin Lys Leu Glu Lys Glu He Asp Ala Leu Val Tyr Gin Leu 1250 1255 1260 Tyr His Leu Thr Asp Glu Glu He Lys He He Glu Glu Gly Gin 265 1270 1275 1
(2) INFORMATION FOR SEQ ID NO: 1123:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1415 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 28...1377 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1123:
TTTAAATCAT TTAAAAAAAG GATAGAG ATG CAA AAT AAA GAA ATT GGT GAA GAA 54
Met Gin Asn Lys Glu He Gly Glu Glu 1 5
AAA AGC GTT AAT GAA AAA AAT GTA GAG GTT TTT AAT CGT TAT TTT CCC 102 Lys Ser Val Asn Glu Lys Asn Val Glu Val Phe Asn Arg Tyr Phe Pro 10 15 20 25
GGT TGC TTG AGT ATA GAA AAT GAT AAC AAG CTC ACG CTG GAT ACA GGA 150 Gly Cys Leu Ser He Glu Asn Asp Asn Lys Leu Thr Leu Asp Thr Gly 30 35 40
AAA TTA AAA GCG TTA CTA GGG GAT TTT AGC GAG ATA AAA GAA GAG GGC 198 Lys Leu Lys Ala Leu Leu Gly Asp Phe Ser Glu He Lys Glu Glu Gly 45 50 55
TAT GGG TTG GAT TTT GTG GGT AAG AAA ATC GCC TTA AAC CAA GCT TTT 246 Tyr Gly Leu Asp Phe Val Gly Lys Lys He Ala Leu Asn Gin Ala Phe 60 65 70
AAG AAA AAT CAT AAG ATT TTA AAG CCC TTA AAC GAA TCC ACT AGC AAG 294 Lys Lys Asn His Lys He Leu Lys Pro Leu Asn Glu Ser Thr Ser Lys 75 80 85
CAC GTT CTC ATC AAG GGC GAT AAT TTA GAC GCT CTC AAA ATC TTA AAA 342 His Val Leu He Lys Gly Asp Asn Leu Asp Ala Leu Lys He Leu Lys 90 95 100 105
CAA AGC TAT AGT GAA AAA ATC AAA ATG ATT TAC ATT GAC CCG CCT TAC 390 Gin Ser Tyr Ser Glu Lys He Lys Met He Tyr He Asp Pro Pro Tyr 110 115 120
AAC ACG AAA AAC GAG AAT TTT ATC TAT GGC GAT GAT TTC TCG CAA TCC 438 Asn Thr Lys Asn Glu Asn Phe He Tyr Gly Asp Asp Phe Ser Gin Ser 125 130 135
AAT GAA GAG GTT TTA AAA ACA TTG GAT TAT TCT AAA GAA AAA TTG GAT 486 Asn Glu Glu Val Leu Lys Thr Leu Asp Tyr Ser Lys Glu Lys Leu Asp 140 145 150
TAC ATC AAG AAC CTT TTT GGG TCA AAA TGC CAT AGC GGG TGG CTT AGT 534 Tyr He Lys Asn Leu Phe Gly Ser Lys Cys His Ser Gly Trp Leu Ser 155 160 165
TTC ATG TAT CCC AGA TTG TTG CTC GCT AAA GAT TTG CTC AAA CAA GAC 582 Phe Met Tyr Pro Arg Leu Leu Leu Ala Lys Asp Leu Leu Lys Gin Asp 170 175 180 185
GGC GTG ATT TTC ATT TCT ATT GAC GAT AAC GAA TGC GCT CAA CTC AAA 630 Gly Val He Phe He Ser He Asp Asp Asn Glu Cys Ala Gin Leu Lys 190 195 200
CTT TTA TGC GAT GAA ATT TTT GGG GAG GGG AAT TTT GTG GCG TGT TTA 678 Leu Leu Cys Asp Glu He Phe Gly Glu Gly Asn Phe Val Ala Cys Leu 205 210 215
AAA TGG AAA AAG AAA AAA CAA CCA AGT TTT TTA TCA AAA GTA GCC GTA 726 Lys Trp Lys Lys Lys Lys Gin Pro Ser Phe Leu Ser Lys Val Ala Val 220 225 230
ATA TTA GAA TAT ATT TTA GTA TAT GCA AAA GAT TTT AGT CTA ATT GAT 774 He Leu Glu Tyr He Leu Val Tyr Ala Lys Asp Phe Ser Leu He Asp 235 240 245
AAG TTA GGT TTA GAT AAT GTA TCT GAT AGC GAT AAA CCT ATC ATT AAT 822 Lys Leu Gly Leu Asp Asn Val Ser Asp Ser Asp Lys Pro He He Asn 250 255 260 265
ACC TCT AAT AAT TTA TCA AAA AGA TAT TTT AAA AAA GGT ATT AGG GTT 870 Thr Ser Asn Asn Leu Ser Lys Arg Tyr Phe Lys Lys Gly He Arg Val 270 275 280
AAA TCT GAT TTA AAT TTT ATA AAG AGT GGA AAG TAT CAA AAT AAG ACA 918 Lys Ser Asp Leu Asn Phe He Lys Ser Gly Lys Tyr Gin Asn Lys Thr 285 290 295
ATG ACG ATT GAA TTT ATG AAT GAT ATT TTT ATT GAA AAT GGC AGA ACT 966 Met Thr He Glu Phe Met Asn Asp He Phe He Glu Asn Gly Arg Thr 300 305 310
AAA AAT GAT TTT GAA TGT ATA GGT AAA TTT AGA ACA GGA CAA GAA AAT 1014 Lys Asn Asp Phe Glu Cys He Gly Lys Phe Arg Thr Gly Gin Glu Asn 315 320 325
ATT AAT GAA TTT ATT GAA AAA GAT TTA ATT TTT ATA ACA AAA AAT TTA 1062 He Asn Glu Phe He Glu Lys Asp Leu He Phe He Thr Lys Asn Leu 330 335 340 345 GGG ATT AGA AGA GAT TTA TTA GAA GAA GAG CAA TCA AAT AAA AAA ACA 1110 Gly He Arg Arg Asp Leu Leu Glu Glu Glu Gin Ser Asn Lys Lys Thr 350 355 360
ATT ACA GAT TTA TTA ACA GAA TGG GGA CAA AAT CAA GAT GCT ACT AAT 1158 He Thr Asp Leu Leu Thr Glu Trp Gly Gin Asn Gin Asp Ala Thr Asn 365 370 375
GAA TTA AAT ATT TTA TTT AAT AAT TCT AGC GAT GAA AGT ATT TTT TCA 1206 Glu Leu Asn He Leu Phe Asn Asn Ser Ser Asp Glu Ser He Phe Ser 380 385 390
AAT CCT AAA CCT ACA AAA CTC ATC AAC CGA TTG ATT GAA TTA TCC ACC 1254 Asn Pro Lys Pro Thr Lys Leu He Asn Arg Leu He Glu Leu Ser Thr 395 400 405
AAC GAG GGC GAC ATC ATC TTA GAC TTT TTT GCC GGG AGC GGG ACA ACC 1302 Asn Glu Gly Asp He He Leu Asp Phe Phe Ala Gly Ser Gly Thr Thr 410 415 420 425
GCG CAT GCG GTG TTA GAG AGT AAT AAG AGC GAT TAT CAA AAA TTA AGT 1350 Ala His Ala Val Leu Glu Ser Asn Lys Ser Asp Tyr Gin Lys Leu Ser 430 435 440
GAG GGG GGG GGG GGT TAT TTA ATG GTT TGAACGCCGC ATTTAAAGAA AGGCGCT 1404 Glu Gly Gly Gly Gly Tyr Leu Met Val 445 450
TCATTCTCGT C 1415
(2) INFORMATION FOR SEQ ID NO: 1124:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 450 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1124:
Met Gin Asn Lys Glu He Gly Glu Glu Lys Ser Val Asn Glu Lys Asn
1 5 10 15
Val Glu Val Phe Asn Arg Tyr Phe Pro Gly Cys Leu Ser He Glu Asn
20 25 30
Asp Asn Lys Leu Thr Leu Asp Thr Gly Lys Leu Lys Ala Leu Leu Gly
35 40 45
Asp Phe Ser Glu He Lys Glu Glu Gly Tyr Gly Leu Asp Phe Val Gly
50 55 60
Lys Lys He Ala Leu Asn Gin Ala Phe Lys Lys Asn His Lys He Leu 65 70 75 80
Lys Pro Leu Asn Glu Ser Thr Ser Lys His Val Leu He Lys Gly Asp
85 90 95
Asn Leu Asp Ala Leu Lys He Leu Lys Gin Ser Tyr Ser Glu Lys He 100 105 110
Lys Met He Tyr He Asp Pro Pro Tyr Asn Thr Lys Asn Glu Asn Phe
115 120 125
He Tyr Gly Asp Asp Phe Ser Gin Ser Asn Glu Glu Val Leu Lys Thr
130 135 140
Leu Asp Tyr Ser Lys Glu Lys Leu Asp Tyr He Lys Asn Leu Phe Gly 145 150 155 160
Ser Lys Cys His Ser Gly Trp Leu Ser Phe Met Tyr Pro Arg Leu Leu
165 170 175
Leu Ala Lys Asp Leu Leu Lys Gin Asp Gly Val He Phe He Ser He
180 185 190
Asp Asp Asn Glu Cys Ala Gin Leu Lys Leu Leu Cys Asp Glu He Phe
195 200 205
Gly Glu Gly Asn Phe Val Ala Cys Leu Lys Trp Lys Lys Lys Lys Gin
210 215 220
Pro Ser Phe Leu Ser Lys Val Ala Val He Leu Glu Tyr He Leu Val 225 230 235 240
Tyr Ala Lys Asp Phe Ser Leu He Asp Lys Leu Gly Leu Asp Asn Val
245 250 255
Ser Asp Ser Asp Lys Pro He He Asn Thr Ser Asn Asn Leu Ser Lys
260 265 270
Arg Tyr Phe Lys Lys Gly He Arg Val Lys Ser Asp Leu Asn Phe He
275 280 285
Lys Ser Gly Lys Tyr Gin Asn Lys Thr Met Thr He Glu Phe Met Asn
290 295 300
Asp He Phe He Glu Asn Gly Arg Thr Lys Asn Asp Phe Glu Cys He 305 310 315 320
Gly Lys Phe Arg Thr Gly Gin Glu Asn He Asn Glu Phe He Glu Lys
325 330 335
Asp Leu He Phe He Thr Lys Asn Leu Gly He Arg Arg Asp Leu Leu
340 345 350
Glu Glu Glu Gin Ser Asn Lys Lys Thr He Thr Asp Leu Leu Thr Glu
355 360 365
Trp Gly Gin Asn Gin Asp Ala Thr Asn Glu Leu Asn He Leu Phe Asn
370 375 380
Asn Ser Ser Asp Glu Ser He Phe Ser Asn Pro Lys Pro Thr Lys Leu 385 390 395 400
He Asn Arg Leu He Glu Leu Ser Thr Asn Glu Gly Asp He He Leu
405 410 415
Asp Phe Phe Ala Gly Ser Gly Thr Thr Ala His Ala Val Leu Glu Ser
420 425 430
Asn Lys Ser Asp Tyr Gin Lys Leu Ser Glu Gly Gly Gly Gly Tyr Leu
435 440 445
Met Val 450
(2) INFORMATION FOR SEQ ID NO: 1125:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1389 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA ( ix) FEATURE :
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 55...1344 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1125:
GAAACGAATA AAAATTTTCC TTCACAACAT TTAAATGCAT TAAAATACAT TGAA ATG 57
Met
1
CTT TTT TAT ATG AAA AAT TTA GAG CGC AAA AAA TTG CAA TTT GGC GCT 105 Leu Phe Tyr Met Lys Asn Leu Glu Arg Lys Lys Leu Gin Phe Gly Ala 5 10 15
AAA ATC GCA TGC CCC AAT AAT AAC GAG CGC TTG AAA GCG TTT ATC GCT 153 Lys He Ala Cys Pro Asn Asn Asn Glu Arg Leu Lys Ala Phe He Ala 20 25 30
TCT TTA CCC TTT AAA CTC ACA CGC GAT CAA CAA AAC GCC ATT AAA GAA 201 Ser Leu Pro Phe Lys Leu Thr Arg Asp Gin Gin Asn Ala He Lys Glu 35 40 45
ATC CAA AAC GAT CTC ACT AGC TCC ATA GCG TGC AAG CGT TTG ATT ATA 249 He Gin Asn Asp Leu Thr Ser Ser He Ala Cys Lys Arg Leu He He 50 55 60 65
GGC GAT GTG GGG TGC GGG AAA ACG ATG GTG ATT TTA GCG AGC ATG GTA 297 Gly Asp Val Gly Cys Gly Lys Thr Met Val He Leu Ala Ser Met Val 70 75 80
TTA ACT TAC CCA AAT AAA ACC CTT TTA ATG GCG CCC ACT TCC ATT CTC 345 Leu Thr Tyr Pro Asn Lys Thr Leu Leu Met Ala Pro Thr Ser He Leu 85 90 95
GCT AAA CAG CTT TAT AAC GAA GCC TTA AAA TTT TTA CCC CCT TAT TTT 393 Ala Lys Gin Leu Tyr Asn Glu Ala Leu Lys Phe Leu Pro Pro Tyr Phe 100 105 110
GAA GTG GAA TTG CTG CTC GGC GGG AGT TAC AAG AAG CGA TCC AAT CAT 441 Glu Val Glu Leu Leu Leu Gly Gly Ser Tyr Lys Lys Arg Ser Asn His 115 120 125
TTG TTT GAA ACA ATC ACG CAT GTG GTT ATC GGC ACG CAA GCG TTG TTG 489 Leu Phe Glu Thr He Thr His Val Val He Gly Thr Gin Ala Leu Leu 130 135 140 145
TTT GAT AAG CGC GAT TTG AAT GAA TTC GCT CTA GTG ATC ACT GAT GAA 537 Phe Asp Lys Arg Asp Leu Asn Glu Phe Ala Leu Val He Thr Asp Glu 150 155 160
CAG CAC CGA TTT GGC ACC AAG CAG CGC TAC CAA TTA GAA AAA ATG GCA 585 Gin His Arg Phe Gly Thr Lys Gin Arg Tyr Gin Leu Glu Lys Met Ala 165 170 175
AGC AGT AAG GGT AAT AAA CCC CAT TCT TTG CAA TTT TCC GCT ACC CCC 633 Ser Ser Lys Gly Asn Lys Pro His Ser Leu Gin Phe Ser Ala Thr Pro 180 185 190
ATT CCT CGC ACG CTC GCC CTA GCC AAA AGC GCG TTT GTG AAA ACG ACC 681 He Pro Arg Thr Leu Ala Leu Ala Lys Ser Ala Phe Val Lys Thr Thr 195 200 205
ATG ATT AGA GAA ATC CCT TAT CCT AAA GAG ATT GAA ACT CTA GTC TTG 729 Met He Arg Glu He Pro Tyr Pro Lys Glu He Glu Thr Leu Val Leu 210 215 220 225
CAT AAA AGA GAT TTT AAA ATA GTG ATG GAG AAA ATC AGC GAA GAA ATC 777 His Lys Arg Asp Phe Lys He Val Met Glu Lys He Ser Glu Glu He 230 235 240
GCT AAA AAC CAT CAA GTC ATT GTC GTC TAT CCG CTG GTG AAT GAG AGC 825 Ala Lys Asn His Gin Val He Val Val Tyr Pro Leu Val Asn Glu Ser 245 250 255
GAA AAA ATC CCG TAT TTA TCG CTC AGT GAG GGG GCG AGT TTT TGG CAA 873 Glu Lys He Pro Tyr Leu Ser Leu Ser Glu Gly Ala Ser Phe Trp Gin 260 265 270
AAA CGC TTT AAA AAG GTT TAT ACC ACT TCA GGG CAA GAT AAA AAT AAA 921 Lys Arg Phe Lys Lys Val Tyr Thr Thr Ser Gly Gin Asp Lys Asn Lys 275 280 285
GAA GAA GTG ATT GAA GAA TTT AGA GAA TCC GGG AGC ATT CTT TTA GCG 969 Glu Glu Val He Glu Glu Phe Arg Glu Ser Gly Ser He Leu Leu Ala 290 295 300 305
ACT ACG CTC ATT GAG GTG GGC ATT TCT TTA CCA CGA TTG AGC GTG ATG 1017 Thr Thr Leu He Glu Val Gly He Ser Leu Pro Arg Leu Ser Val Met 310 315 320
GTG ATT TTA GCG CCC GAA AGG TTA GGC TTA GCG ACT TTA CAC CAG TTA 1065 Val He Leu Ala Pro Glu Arg Leu Gly Leu Ala Thr Leu His Gin Leu 325 330 335
AGG GGG CGC GTT TCT CGT AAC GGC TTG AAA GGC TAT TGT TTT TTA TGC 1113 Arg Gly Arg Val Ser Arg Asn Gly Leu Lys Gly Tyr Cys Phe Leu Cys 340 345 350
ACG ATC CAA GAA GAA AAC GAA CGA TTA GAA AAG TTT GCT GAT GAA TTG 1161 Thr He Gin Glu Glu Asn Glu Arg Leu Glu Lys Phe Ala Asp Glu Leu 355 360 365
GAC GGC TTT AAA ATC GCT GAA TTG GAT TTA GAA TAC AGA AAA AGC GGG 1209 Asp Gly Phe Lys He Ala Glu Leu Asp Leu Glu Tyr Arg Lys Ser Gly 370 375 380 385 GAT TTA CTC CAG GGA GGG GAG CAG AGC GGG AAT AGT TTT GAA TAC ATT 1257 Asp Leu Leu Gin Gly Gly Glu Gin Ser Gly Asn Ser Phe Glu Tyr He 390 395 400
GAC TTA GCC AAA GAT GAA AAC ATT ATC GCT GAA GTG AAA CGG GAT TTT 1305 Asp Leu Ala Lys Asp Glu Asn He He Ala Glu Val Lys Arg Asp Phe 405 410 415
TTA AAG GCC GCT AGC GTT TCA CGG GGA ACA TTT GAA AAT TGAAAATTAA GG 1356 Leu Lys Ala Ala Ser Val Ser Arg Gly Thr Phe Glu Asn 420 425 430
CAGAATTGGG TAATTTAAAT CATTTAAAAA AAG 1389
(2) INFORMATION FOR SEQ ID NO: 1126:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 430 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1126:
Met Leu Phe Tyr Met Lys Asn Leu Glu Arg Lys Lys Leu Gin Phe Gly
1 5 10 15
Ala Lys He Ala Cys Pro Asn Asn Asn Glu Arg Leu Lys Ala Phe He
20 25 30
Ala Ser Leu Pro Phe Lys Leu Thr Arg Asp Gin Gin Asn Ala He Lys
35 40 45
Glu He Gin Asn Asp Leu Thr Ser Ser He Ala Cys Lys Arg Leu He
50 55 60
He Gly Asp Val Gly Cys Gly Lys Thr Met Val He Leu Ala Ser Met 65 70 75 80
Val Leu Thr Tyr Pro Asn Lys Thr Leu Leu Met Ala Pro Thr Ser He
85 90 95
Leu Ala Lys Gin Leu Tyr Asn Glu Ala Leu Lys Phe Leu Pro Pro Tyr
100 105 110
Phe Glu Val Glu Leu Leu Leu Gly Gly Ser Tyr Lys Lys Arg Ser Asn
115 120 125
His Leu Phe Glu Thr He Thr His Val Val He Gly Thr Gin Ala Leu
130 135 140
Leu Phe Asp Lys Arg Asp Leu Asn Glu Phe Ala Leu Val He Thr Asp 145 150 155 160
Glu Gin His Arg Phe Gly Thr Lys Gin Arg Tyr Gin Leu Glu Lys Met
165 170 175
Ala Ser Ser Lys Gly Asn Lys Pro His Ser Leu Gin Phe Ser Ala Thr
180 185 190
Pro He Pro Arg Thr Leu Ala Leu Ala Lys Ser Ala Phe Val Lys Thr
195 200 205
Thr Met He Arg Glu He Pro Tyr Pro Lys Glu He Glu Thr Leu Val
210 215 220
Leu His Lys Arg Asp Phe Lys He Val Met Glu Lys He Ser Glu Glu 225 230 235 240
He Ala Lys Asn His Gin Val He Val Val Tyr Pro Leu Val Asn Glu
245 250 255
Ser Glu Lys He Pro Tyr Leu Ser Leu Ser Glu Gly Ala Ser Phe Trp
260 265 270
Gin Lys Arg Phe Lys Lys Val Tyr Thr Thr Ser Gly Gin Asp Lys Asn
275 280 285
Lys Glu Glu Val He Glu Glu Phe Arg Glu Ser Gly Ser He Leu Leu
290 295 300
Ala Thr Thr Leu He Glu Val Gly He Ser Leu Pro Arg Leu Ser Val 305 310 315 320
Met Val He Leu Ala Pro Glu Arg Leu Gly Leu Ala Thr Leu His Gin
325 330 335
Leu Arg Gly Arg Val Ser Arg Asn Gly Leu Lys Gly Tyr Cys Phe Leu
340 345 350
Cys Thr He Gin Glu Glu Asn Glu Arg Leu Glu Lys Phe Ala Asp Glu
355 360 365
Leu Asp Gly Phe Lys He Ala Glu Leu Asp Leu Glu Tyr Arg Lys Ser
370 375 380
Gly Asp Leu Leu Gin Gly Gly Glu Gin Ser Gly Asn Ser Phe Glu Tyr 385 390 395 400
He Asp Leu Ala Lys Asp Glu Asn He He Ala Glu Val Lys Arg Asp
405 410 415
Phe Leu Lys Ala Ala Ser Val Ser Arg Gly Thr Phe Glu Asn 420 425 430
(2) INFORMATION FOR SEQ ID NO: 1127:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1463 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 47...1417 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1127:
TGTTATTCCT TTTTCATTCA CCACTTATTC ACGCTATAAT AACGCC ATG GAT ACC 55
Met Asp Thr 1
AAC AAC AAT ATT GAA AAA GAA ATC TTG GCG CTA GTC AAA CAA AAT CCT 103 Asn Asn Asn He Glu Lys Glu He Leu Ala Leu Val Lys Gin Asn Pro 5 10 15
AAA GTT AGT CTC ATA GAG TAT GAA AAT TAC TTT AGC CAA CTC AAA TAC 151 Lys Val Ser Leu He Glu Tyr Glu Asn Tyr Phe Ser Gin Leu Lys Tyr 20 25 30 35
AAC CCT AAC GCA AGC AAG AGC GAT ATT GCC TTT TTT TAT GCC CCC AAC 199 Asn Pro Asn Ala Ser Lys Ser Asp He Ala Phe Phe Tyr Ala Pro Asn 40 45 50
CAA GTC TTA TGC ACC ACG ATT ACA GCT AAA TAC GGC GCG TTG CTT AAA 247 Gin Val Leu Cys Thr Thr He Thr Ala Lys Tyr Gly Ala Leu Leu Lys 55 60 65
GAA ATT TTA AGC CAG AAT AAA GTC GGC ATG CAT TTA GCC CAC AGC GTG 295 Glu He Leu Ser Gin Asn Lys Val Gly Met His Leu Ala His Ser Val 70 75 80
GAT GTG CGT ATT GAA GTA GCG CCT AAA ATC CAA ATT AAC GCC CAA TCT 343 Asp Val Arg He Glu Val Ala Pro Lys He Gin He Asn Ala Gin Ser 85 90 95
AAT ATC AAT TAC AAA GCC ATA AAA ACG AGC GTC AAA GAC TCT TAC ACT 391 Asn He Asn Tyr Lys Ala He Lys Thr Ser Val Lys Asp Ser Tyr Thr 100 105 110 115
TTT GAA AAT TTT GTC GTA GGC TCA TGC AAT AAC ACC GTT TAT GAA ATC 439 Phe Glu Asn Phe Val Val Gly Ser Cys Asn Asn Thr Val Tyr Glu He 120 125 130
GCT AAA AAA GTC GCC CAA AGC GAT ACC CCC CCT TAT AAC CCG GTG CTT 487 Ala Lys Lys Val Ala Gin Ser Asp Thr Pro Pro Tyr Asn Pro Val Leu 135 140 145
TTT TAT GGC GGC ACA GGG TTA GGC AAA ACG CAC ATT TTA AAC GCT ATC 535 Phe Tyr Gly Gly Thr Gly Leu Gly Lys Thr His He Leu Asn Ala He 150 155 160
GGC AAC CAT GCC CTA GAA AAG CAT AAA AAA GTC GTG TTA GTC ACT TCA 583 Gly Asn His Ala Leu Glu Lys His Lys Lys Val Val Leu Val Thr Ser 165 170 175
GAA GAC TTT TTG ACA GAC TTT TTA AAG CAT TTA GAC AAC AAA ACC ATG 631 Glu Asp Phe Leu Thr Asp Phe Leu Lys His Leu Asp Asn Lys Thr Met 180 185 190 195
GAT TCT TTT AAA GCA AAA TAC CGC CAT TGC GAC TTT TTC TTG TTA GAT 679 Asp Ser Phe Lys Ala Lys Tyr Arg His Cys Asp Phe Phe Leu Leu Asp 200 205 210
GAC GCT CAA TTT TTG CAA GGA AAA CCC AAG CTA GAA GAA GAA TTT TTC 727 Asp Ala Gin Phe Leu Gin Gly Lys Pro Lys Leu Glu Glu Glu Phe Phe 215 220 225
CAC ACC TTT AAC GAA TTG CAC GCC AAC AGC AAA CAA ATC GTA TTG ATT 775 His Thr Phe Asn Glu Leu His Ala Asn Ser Lys Gin He Val Leu He 230 235 240
TCA GAC CGA TCG CCT AAA AAC ATC GCC GGC TTA GAA GAT CGC TTA AAA 823 Ser Asp Arg Ser Pro Lys Asn He Ala Gly Leu Glu Asp Arg Leu Lys 245 250 255
TCG CGC TTT GAA TGG GGG ATA ACC GCT AAA GTC ATG CCC CCT GAT TTA 871 Ser Arg Phe Glu Trp Gly He Thr Ala Lys Val Met Pro Pro Asp Leu 260 265 270 275
GAA ACC AAA CTT TCC ATT GTC AAA CAA AAA TGC CAG CTC AAT CAA ATC 919 Glu Thr Lys Leu Ser He Val Lys Gin Lys Cys Gin Leu Asn Gin He 280 285 290
ACT TTG CCT GAA GAG GTG ATG GAA TAC ATC GCC CAA CAC ATC AGC GAC 967 Thr Leu Pro Glu Glu Val Met Glu Tyr He Ala Gin His He Ser Asp 295 300 305
AAT ATC CGC CAA ATG GAA GGC GCG ATC ATT AAA ATC AGC GTG AAC GCG 1015 Asn He Arg Gin Met Glu Gly Ala He He Lys He Ser Val Asn Ala 310 315 320
AAC TTG ATG AAC GCT TCC ATT GAT TTG AAC CTC GCT AAA ACC GTT TTA 1063 Asn Leu Met Asn Ala Ser He Asp Leu Asn Leu Ala Lys Thr Val Leu 325 330 335
GAA GAT TTG CAA AAA GAT CAT GCT GAA GGT TCA AGC TTG GAA AAT ATC 1111 Glu Asp Leu Gin Lys Asp His Ala Glu Gly Ser Ser Leu Glu Asn He 340 345 350 355
CTA CTC GCT GTC GCG CAA AGC CTG AAT CTC AAA TCC AGC GAA ATC AAA 1159 Leu Leu Ala Val Ala Gin Ser Leu Asn Leu Lys Ser Ser Glu He Lys 360 365 370
GTC TCT TCG CGC CAA AAA AAT GTC GCT TTG GCG AGG AAA TTA GTC GTG 1207 Val Ser Ser Arg Gin Lys Asn Val Ala Leu Ala Arg Lys Leu Val Val 375 380 385
TAT TTC GCC AGG CTT TAT ACC CCT AAC CCC ACG CTC TCG CTC GCT CAA 1255 Tyr Phe Ala Arg Leu Tyr Thr Pro Asn Pro Thr Leu Ser Leu Ala Gin 390 395 400
TTT TTG GAT TTA AAG GAT CAT TCA AGC ATT TCT AAA ATG TAT TCT GGC 1303 Phe Leu Asp Leu Lys Asp His Ser Ser He Ser Lys Met Tyr Ser Gly 405 410 415
GTT AAA AAA ATG CTT GAA GAA GAA AAA AGC CCT TTT GTC TTA AGC CTT 1351 Val Lys Lys Met Leu Glu Glu Glu Lys Ser Pro Phe Val Leu Ser Leu 420 425 430 435
AGA GAA GAA ATC AAA AAC CGC TTG AAC GAA TTG AAC GAC AAA AAA ACC 1399 Arg Glu Glu He Lys Asn Arg Leu Asn Glu Leu Asn Asp Lys Lys Thr 440 445 450
GCT TTC AAT TCA AGT GAA TGAAAAAAGG CTTATGAAAA AGCGTTTCAT TCACTTCT 1455 Ala Phe Asn Ser Ser Glu 455 TTTCAAAT 1463
(2) INFORMATION FOR SEQ ID NO: 1128:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 457 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1128:
Met Asp Thr Asn Asn Asn He Glu Lys Glu He Leu Ala Leu Val Lys
1 5 10 15
Gin Asn Pro Lys Val Ser Leu He Glu Tyr Glu Asn Tyr Phe Ser Gin
20 25 30
Leu Lys Tyr Asn Pro Asn Ala Ser Lys Ser Asp He Ala Phe Phe Tyr
35 40 45
Ala Pro Asn Gin Val Leu Cys Thr Thr He Thr Ala Lys Tyr Gly Ala
50 55 60
Leu Leu Lys Glu He Leu Ser Gin Asn Lys Val Gly Met His Leu Ala 65 70 75 80
His Ser Val Asp Val Arg He Glu Val Ala Pro Lys He Gin He Asn
85 90 95
Ala Gin Ser Asn He Asn Tyr Lys Ala He Lys Thr Ser Val Lys Asp
100 105 110
Ser Tyr Thr Phe Glu Asn Phe Val Val Gly Ser Cys Asn Asn Thr Val
115 120 125
Tyr Glu He Ala Lys Lys Val Ala Gin Ser Asp Thr Pro Pro Tyr Asn
130 135 140
Pro Val Leu Phe Tyr Gly Gly Thr Gly Leu Gly Lys Thr His He Leu 145 150 155 160
Asn Ala He Gly Asn His Ala Leu Glu Lys His Lys Lys Val Val Leu
165 170 175
Val Thr Ser Glu Asp Phe Leu Thr Asp Phe Leu Lys His Leu Asp Asn
180 185 190
Lys Thr Met Asp Ser Phe Lys Ala Lys Tyr Arg His Cys Asp Phe Phe
195 200 205
Leu Leu Asp Asp Ala Gin Phe Leu Gin Gly Lys Pro Lys Leu Glu Glu
210 215 220
Glu Phe Phe His Thr Phe Asn Glu Leu His Ala Asn Ser Lys Gin He 225 230 235 240
Val Leu He Ser Asp Arg Ser Pro Lys Asn He Ala Gly Leu Glu Asp
245 250 255
Arg Leu Lys Ser Arg Phe Glu Trp Gly He Thr Ala Lys Val Met Pro
260 265 270
Pro Asp Leu Glu Thr Lys Leu Ser He Val Lys Gin Lys Cys Gin Leu
275 280 285
Asn Gin He Thr Leu Pro Glu Glu Val Met Glu Tyr He Ala Gin His
290 295 300
He Ser Asp Asn He Arg Gin Met Glu Gly Ala He He Lys He Ser 305 310 315 320
Val Asn Ala Asn Leu Met Asn Ala Ser He Asp Leu Asn Leu Ala Lys 325 330 335
Thr Val Leu Glu Asp Leu Gin Lys Asp His Ala Glu Gly Ser Ser Leu
340 345 350
Glu Asn He Leu Leu Ala Val Ala Gin Ser Leu Asn Leu Lys Ser Ser
355 360 365
Glu He Lys Val Ser Ser Arg Gin Lys Asn Val Ala Leu Ala Arg Lys
370 375 380
Leu Val Val Tyr Phe Ala Arg Leu Tyr Thr Pro Asn Pro Thr Leu Ser 385 390 395 400
Leu Ala Gin Phe Leu Asp Leu Lys Asp His Ser Ser He Ser Lys Met
405 410 415
Tyr Ser Gly Val Lys Lys Met Leu Glu Glu Glu Lys Ser Pro Phe Val
420 425 430
Leu Ser Leu Arg Glu Glu He Lys Asn Arg Leu Asn Glu Leu Asn Asp
435 440 445
Lys Lys Thr Ala Phe Asn Ser Ser Glu 450 455
(2) INFORMATION FOR SEQ ID NO: 1129:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1324 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25...1260 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1129:
TGGCTAAAGC GTAAGGAGAG TTTA ATG GCA GAG ATA AAA AAA GCG AAA AAT 51
Met Ala Glu He Lys Lys Ala Lys Asn 1 5
TTA GGC GAA TGG CTG GAC ATG CGT CTT GGC ACT AAC AAG CTT GTT AAA 99 Leu Gly Glu Trp Leu Asp Met Arg Leu Gly Thr Asn Lys Leu Val Lys 10 15 20 25
GTG CTA ATG ACA GAA TAT TGG ATC CCT AAA AAC ATC AAT TTT TTA TGG 147 Val Leu Met Thr Glu Tyr Trp He Pro Lys Asn He Asn Phe Leu Trp 30 35 40
GCG ATG GGG GTG ATT TTA TTA ACC CTT TTT GGC GTG CTT GTG GTC TCA 195 Ala Met Gly Val He Leu Leu Thr Leu Phe Gly Val Leu Val Val Ser 45 50 55
GGG ATT TTC TTG CTC ATG TAT TAC AAG CCT GAT GCG AAA ATG GCG TTT 243 Gly He Phe Leu Leu Met Tyr Tyr Lys Pro Asp Ala Lys Met Ala Phe 60 65 70
GAT AGC GTG AAT TTC ACC ATC ATG CAA GAA GTG GCT TAT GGC TGG CTT 291 Asp Ser Val Asn Phe Thr He Met Gin Glu Val Ala Tyr Gly Trp Leu 75 80 85
TGG CGC CAC ATG CAT GCC ACG GCA GCG AGC ATG ATT TTT GTC ATC ATT 339 Trp Arg His Met His Ala Thr Ala Ala Ser Met He Phe Val He He 90 95 100 105
TAT ATC CAC ATG TTT GTT GGC ATC TAT TAT GGC TCT TAC AAA AAG GGT 387 Tyr He His Met Phe Val Gly He Tyr Tyr Gly Ser Tyr Lys Lys Gly 110 115 120
CGT GAG ATG ATT TGG ATT AGC GGG ATG ATT TTG TTT GTG GTC TTT AGC 435 Arg Glu Met He Trp He Ser Gly Met He Leu Phe Val Val Phe Ser 125 130 135
GCG GAA GCC TTT AGC GGG TAT ATG CTG CCT TGG GGG CAG ATG AGT TAT 483 Ala Glu Ala Phe Ser Gly Tyr Met Leu Pro Trp Gly Gin Met Ser Tyr 140 145 150
TGG GCC GCA GCG GTT ATC ACG AAT TTA TTT GGA GGC ATT CCT TTC ATT 531 Trp Ala Ala Ala Val He Thr Asn Leu Phe Gly Gly He Pro Phe He 155 160 165
GGG GCT GAT GTG GTG GAG TGG ATT AGA GGC AAT TAT GTT GTG GCG GAT 579 Gly Ala Asp Val Val Glu Trp He Arg Gly Asn Tyr Val Val Ala Asp 170 175 180 185
TCC ACT TTA ACG CGC TTT TTC ATG CTC CAT GTG TTT TTA CTG CCC ATT 627 Ser Thr Leu Thr Arg Phe Phe Met Leu His Val Phe Leu Leu Pro He 190 195 200
GCG ATC ATT CTA CTT GTT GGG GTG CAT TTT TAT TCT TTA CGC ATC CCG 675 Ala He He Leu Leu Val Gly Val His Phe Tyr Ser Leu Arg He Pro 205 210 215
CAT GTC AAT AAC CAA GAA GGC GAA GAG ATT GAC TTT GAA TTA GAA GAG 723 His Val Asn Asn Gin Glu Gly Glu Glu He Asp Phe Glu Leu Glu Glu 220 225 230
AAG AAA TTC ATT GAA GGC AAG AAA AAA GAA TCC AAA GTC ATT CCT TTT 771 Lys Lys Phe He Glu Gly Lys Lys Lys Glu Ser Lys Val He Pro Phe 235 240 245
TGG CCG GTG TTC TTG TCT AAA GAT ATT TTT GTG GTT TGC GCG TTC ATG 819 Trp Pro Val Phe Leu Ser Lys Asp He Phe Val Val Cys Ala Phe Met 250 255 260 265
GTC TTT TTC TTT TAC TTG GTG TGT TAC CAC TAT GAT TTT GCG ATG GAT 867 Val Phe Phe Phe Tyr Leu Val Cys Tyr His Tyr Asp Phe Ala Met Asp 270 275 280
CCT ATC AAC TTT GAA AGG GCT AAC AGC CTT AAA ACG CCG CCT CAC ATT 915 Pro He Asn Phe Glu Arg Ala Asn Ser Leu Lys Thr Pro Pro His He 285 290 295
TAC CCT GAA TGG TAT TTC TTA TGG AGC TAT GAA GTC TTA AGA GGC TTT 963 Tyr Pro Glu Trp Tyr Phe Leu Trp Ser Tyr Glu Val Leu Arg Gly Phe 300 305 310
TTC TTT AGC GCT GAT TTA GGG CTA ATG GCC TTT GGC GTG GCG CAA GTG 1011 Phe Phe Ser Ala Asp Leu Gly Leu Met Ala Phe Gly Val Ala Gin Val 315 320 325
ATT TTC TTT TTG CTA CCC TTC TTG GAT CGA AGT CCA GTG GTC GCT CCT 1059 He Phe Phe Leu Leu Pro Phe Leu Asp Arg Ser Pro Val Val Ala Pro 330 335 340 345
GCG CAC AAA CGG CCG GCG TTT ATG GTG TGG TTT TGG CTT GTA ATC ATT 1107 Ala His Lys Arg Pro Ala Phe Met Val Trp Phe Trp Leu Val He He 350 355 360
GAT ATG ATT GTT TTA ACG ATC TAT GGT AAA TTG CCT CCG CTT GGG ATT 1155 Asp Met He Val Leu Thr He Tyr Gly Lys Leu Pro Pro Leu Gly He 365 370 375
GGT AAA TAC ATT GGC TTA GCG GGT TCA ATC ACT TTT TTG GCC CTT TTC 1203 Gly Lys Tyr He Gly Leu Ala Gly Ser He Thr Phe Leu Ala Leu Phe 380 385 390
TTT GTG GTA TTG CCC ATC ATC ACT ATC GCT GAG AGC AAG AAA CAA GGG 1251 Phe Val Val Leu Pro He He Thr He Ala Glu Ser Lys Lys Gin Gly 395 400 405
GGT GTT AGA TGAAAGAGTT TAAGATTCTA ATCATCCTCA TTGTGGTGGT AGGCGTGAT 1309
Gly Val Arg
410
TTATTATGGG GTTGA 1324
(2) INFORMATION FOR SEQ ID NO: 1130:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 412 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1130:
Met Ala Glu He Lys Lys Ala Lys Asn Leu Gly Glu Trp Leu Asp Met
1 5 10 15
Arg Leu Gly Thr Asn Lys Leu Val Lys Val Leu Met Thr Glu Tyr Trp
20 25 30
He Pro Lys Asn He Asn Phe Leu Trp Ala Met Gly Val He Leu Leu 35 40 45 Thr Leu Phe Gly Val Leu Val Val Ser Gly He Phe Leu Leu Met Tyr
50 55 60
Tyr Lys Pro Asp Ala Lys Met Ala Phe Asp Ser Val Asn Phe Thr He 65 70 75 80
Met Gin Glu Val Ala Tyr Gly Trp Leu Trp Arg His Met His Ala Thr
85 90 95
Ala Ala Ser Met He Phe Val He He Tyr He His Met Phe Val Gly
100 105 110
He Tyr Tyr Gly Ser Tyr Lys Lys Gly Arg Glu Met He Trp He Ser
115 120 125
Gly Met He Leu Phe Val Val Phe Ser Ala Glu Ala Phe Ser Gly Tyr
130 135 140
Met Leu Pro Trp Gly Gin Met Ser Tyr Trp Ala Ala Ala Val He Thr 145 150 155 160
Asn Leu Phe Gly Gly He Pro Phe He Gly Ala Asp Val Val Glu Trp
165 170 175
He Arg Gly Asn Tyr Val Val Ala Asp Ser Thr Leu Thr Arg Phe Phe
180 185 190
Met Leu His Val Phe Leu Leu Pro He Ala He He Leu Leu Val Gly
195 200 205
Val His Phe Tyr Ser Leu Arg He Pro His Val Asn Asn Gin Glu Gly
210 215 220
Glu Glu He Asp Phe Glu Leu Glu Glu Lys Lys Phe He Glu Gly Lys 225 230 235 240
Lys Lys Glu Ser Lys Val He Pro Phe Trp Pro Val Phe Leu Ser Lys
245 250 255
Asp He Phe Val Val Cys Ala Phe Met Val Phe Phe Phe Tyr Leu Val
260 265 270
Cys Tyr His Tyr Asp Phe Ala Met Asp Pro He Asn Phe Glu Arg Ala
275 280 285
Asn Ser Leu Lys Thr Pro Pro His He Tyr Pro Glu Trp Tyr Phe Leu
290 295 300
Trp Ser Tyr Glu Val Leu Arg Gly Phe Phe Phe Ser Ala Asp Leu Gly 305 310 315 320
Leu Met Ala Phe Gly Val Ala Gin Val He Phe Phe Leu Leu Pro Phe
325 330 335
Leu Asp Arg Ser Pro Val Val Ala Pro Ala His Lys Arg Pro Ala Phe
340 345 350
Met Val Trp Phe Trp Leu Val He He Asp Met He Val Leu Thr He
355 360 365
Tyr Gly Lys Leu Pro Pro Leu Gly He Gly Lys Tyr He Gly Leu Ala
370 375 380
Gly Ser He Thr Phe Leu Ala Leu Phe Phe Val Val Leu Pro He He 385 390 395 400
Thr He Ala Glu Ser Lys Lys Gin Gly Gly Val Arg 405 410
(2) INFORMATION FOR SEQ ID NO: 1131:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 462 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...429 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1131:
CAATAAAGAA AGGAGCATCA G ATG GCA ATC TTT GAT AAC AAT AAT AAA TCG 51
Met Ala He Phe Asp Asn Asn Asn Lys Ser 1 5 10
GCT AAT GCA AAA ACA GGA CCA GCG ACT ATC ATC GCT CAA GGC ACA AAA 99 Ala Asn Ala Lys Thr Gly Pro Ala Thr He He Ala Gin Gly Thr Lys 15 20 25
ATA AAG GGG GAG CTT CAT TTA GAT TAC CAT TTG CAC GTA GAT GGC GAA 147 He Lys Gly Glu Leu His Leu Asp Tyr His Leu His Val Asp Gly Glu 30 35 40
TTA GAA GGG GTG GTG CAT TCT AAA AGC ACG GTG GTG ATC GGG CAA ACC 195 Leu Glu Gly Val Val His Ser Lys Ser Thr Val Val He Gly Gin Thr 45 50 55
GGC TCG GTA GTG GGT GAG ATT TTT ACT AAT AAA TTA GTG GTC AGT GGC 243 Gly Ser Val Val Gly Glu He Phe Thr Asn Lys Leu Val Val Ser Gly 60 65 70
AAG TTC ACT GGC ACG GTG GAG GCG GAA GTG GTA GAA ATC ATG CCT TTA 291 Lys Phe Thr Gly Thr Val Glu Ala Glu Val Val Glu He Met Pro Leu 75 80 85 90
GGG CAC CTT GAT GGC AAA ATC TCT AGC CAA GAG CTT GTG GTG GAA AGA 339 Gly His Leu Asp Gly Lys He Ser Ser Gin Glu Leu Val Val Glu Arg 95 100 105
AAG GGG ATT TTG ATT GGG GAA ACT CGC CCT AAG AAT ATT CAA GGG GGG 387 Lys Gly He Leu He Gly Glu Thr Arg Pro Lys Asn He Gin Gly Gly 110 115 120
GCG TTG TTA ATC AAT GAG CAA GAA AAG AAA ATT GAA AAT AAA TAGGGAATG 438 Ala Leu Leu He Asn Glu Gin Glu Lys Lys He Glu Asn Lys 125 130 135
ATCCAATCCA GCCTTTATAG AGCC 462
(2) INFORMATION FOR SEQ ID NO: 1132:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 136 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1132:
Met Ala He Phe Asp Asn Asn Asn Lys Ser Ala Asn Ala Lys Thr Gly
1 5 10 15
Pro Ala Thr He He Ala Gin Gly Thr Lys He Lys Gly Glu Leu His
20 25 30
Leu Asp Tyr His Leu His Val Asp Gly Glu Leu Glu Gly Val Val His
35 40 45
Ser Lys Ser Thr Val Val He Gly Gin Thr Gly Ser Val Val Gly Glu
50 55 60
He Phe Thr Asn Lys Leu Val Val Ser Gly Lys Phe Thr Gly Thr Val 65 70 75 80
Glu Ala Glu Val Val Glu He Met Pro Leu Gly His Leu Asp Gly Lys
85 90 95
He Ser Ser Gin Glu Leu Val Val Glu Arg Lys Gly He Leu He Gly
100 105 110
Glu Thr Arg Pro Lys Asn He Gin Gly Gly Ala Leu Leu He Asn Glu
115 120 125
Gin Glu Lys Lys He Glu Asn Lys 130 135
(2) INFORMATION FOR SEQ ID NO: 1133:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 391 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 13...348 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1133:
TAAGGAGTTT CT ATG GAT TGG GGT CGG GTC GTT CAT GTG CTG TTC AGC CTT 51 Met Asp Trp Gly Arg Val Val His Val Leu Phe Ser Leu 1 5 10
ATT TCT TTA ACC ACC ATT GCA GGG TTT TTG TAT GAG CCT AAT ACG GTG 99 He Ser Leu Thr Thr He Ala Gly Phe Leu Tyr Glu Pro Asn Thr Val 15 20 25
GTG TTG TTT GTA GCG TTA GCT TTA AAC CTT ATT TCT GTT ACG CTT AAA 147 Val Leu Phe Val Ala Leu Ala Leu Asn Leu He Ser Val Thr Leu Lys 30 35 40 45 ATT GGG GTG ATC AAG CGT TTC GCT TCA GAG CTA TTG GCC AGC TCT TTA 195 He Gly Val He Lys Arg Phe Ala Ser Glu Leu Leu Ala Ser Ser Leu 50 55 60
GCC ACC GTA TTG CAT CTC ATA CCG GCA TTT GTG TTT TTA CAG ATT TTA 243 Ala Thr Val Leu His Leu He Pro Ala Phe Val Phe Leu Gin He Leu 65 70 75
AAT AAT TTG GTT ACC GCT TAC ATG CTC ATG ATC GGG GCG TTG ATT AGC 291 Asn Asn Leu Val Thr Ala Tyr Met Leu Met He Gly Ala Leu He Ser 80 85 90
AAC GCT TTC AGT CTC ATC TTT TTG TTG ATT GAA AGC GTT GTA ACG AGC 339 Asn Ala Phe Ser Leu He Phe Leu Leu He Glu Ser Val Val Thr Ser 95 100 105
GAA ACG GAT TAAGGGGTAG TGATGGATTT TATCAATATA GAAAAAAAAT GGC 391
Glu Thr Asp
110
(2) INFORMATION FOR SEQ ID NO: 1134:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 112 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1134:
Met Asp Trp Gly Arg Val Val His Val Leu Phe Ser Leu He Ser Leu
1 5 10 15
Thr Thr He Ala Gly Phe Leu Tyr Glu Pro Asn Thr Val Val Leu Phe
20 25 30
Val Ala Leu Ala Leu Asn Leu He Ser Val Thr Leu Lys He Gly Val
35 40 45
He Lys Arg Phe Ala Ser Glu Leu Leu Ala Ser Ser Leu Ala Thr Val
50 55 60
Leu His Leu He Pro Ala Phe Val Phe Leu Gin He Leu Asn Asn Leu 65 70 75 80
Val Thr Ala Tyr Met Leu Met He Gly Ala Leu He Ser Asn Ala Phe
85 90 95
Ser Leu He Phe Leu Leu He Glu Ser Val Val Thr Ser Glu Thr Asp 100 105 110
(2) INFORMATION FOR SEQ ID NO: 1135:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1035 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25... 93 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1135:
GAATAAAAGA GCTTAGGAGG TTTT ATG GAA TTA TTC AAA CGA ACT AGA ATC 51
Met Glu Leu Phe Lys Arg Thr Arg He
1 5
TTA AGC TTC ATG CGT TAT TCC AAT TAT GGG GTG ATC GTT TCA GCA ATT 99 Leu Ser Phe Met Arg Tyr Ser Asn Tyr Gly Val He Val Ser Ala He 10 15 20 25
TTA GCG CTT CTA GCG TTG GGG CTT TTG TTT TTC AAA GGG TTT TCT TTA 147 Leu Ala Leu Leu Ala Leu Gly Leu Leu Phe Phe Lys Gly Phe Ser Leu 30 35 40
GGG ATT GAT TTT GCG GGG GGG AGT TTG GTG CAA GTG CGC TAC ACT CAA 195 Gly He Asp Phe Ala Gly Gly Ser Leu Val Gin Val Arg Tyr Thr Gin 45 50 55
AAC GCC CCC ATT AAA GAA GTG CGC GAT CTG TTT GAA AAA GAA GCT CGC 243 Asn Ala Pro He Lys Glu Val Arg Asp Leu Phe Glu Lys Glu Ala Arg 60 65 70
TTC AAA GGC GTG CAA GTG AGC GAA TTT GGC TCT AAA GAA GAA ATT TTA 291 Phe Lys Gly Val Gin Val Ser Glu Phe Gly Ser Lys Glu Glu He Leu 75 80 85
ATC AAA TTC CCT TTT GTA GAA ACG GCT GAA AAT GAA GAT CTG AAC GCT 339 He Lys Phe Pro Phe Val Glu Thr Ala Glu Asn Glu Asp Leu Asn Ala 90 95 100 105
ATC GTG GCC AAC ATT CTA AAA CCC AGC GGC GAT TTT GAA ATC CGT AAA 387 He Val Ala Asn He Leu Lys Pro Ser Gly Asp Phe Glu He Arg Lys 110 115 120
TTT GAC ACC GTG GGC CCT AGA GTG GGG AGC GAA TTG AAA GAG AAA GGC 435 Phe Asp Thr Val Gly Pro Arg Val Gly Ser Glu Leu Lys Glu Lys Gly 125 130 135
ATT TTG TCG CTG ATT TTA GCA TTA ATA GCG ATC ATG GTT TAT GTG AGT 483 He Leu Ser Leu He Leu Ala Leu He Ala He Met Val Tyr Val Ser 140 145 150
TTC CGC TAT GAA TGG CGT TTT GCT TTA GCG AGC GTC ATT GCG CTT GTG 531 Phe Arg Tyr Glu Trp Arg Phe Ala Leu Ala Ser Val He Ala Leu Val 155 160 165 CAT GAT GTG ATT TTA GTG GCA AGC TCG GTG ATT GTT TTT AAG ATT GAT 579 His Asp Val He Leu Val Ala Ser Ser Val He Val Phe Lys He Asp 170 175 180 185
ATG AAT TTG GAA GTG ATT GCG GCC TTG CTC ACC TTG ATT GGG TAT TCC 627 Met Asn Leu Glu Val He Ala Ala Leu Leu Thr Leu He Gly Tyr Ser 190 195 200
ATT AAT GAT ACG ATC ATT ATT TTT GAC AGG ATC AGA GAA GAG ATG CTY 675 He Asn Asp Thr He He He Phe Asp Arg He Arg Glu Glu Met Xaa 205 210 215
TCT CAA AAA ACC AAA AAC GCC ACT CAA GCC ATT GAT GAA GCC ATT TCT 723 Ser Gin Lys Thr Lys Asn Ala Thr Gin Ala He Asp Glu Ala He Ser 220 225 230
AGC ACG CTC ACG CGC ACG CTT TTA ACT TCT TTA ACC GTG TTT TTT GTG 771 Ser Thr Leu Thr Arg Thr Leu Leu Thr Ser Leu Thr Val Phe Phe Val 235 240 245
GTG TTG ATT TTG TGC GTG TTT GGG AGT AAG ATC ATC ATT GGC TTT TCA 819 Val Leu He Leu Cys Val Phe Gly Ser Lys He He He Gly Phe Ser 250 255 260 265
TTG CCC ATG TTA ATA GGC ACG ATT GTA GGG ACT TAT AGC TCT ATT TTC 867 Leu Pro Met Leu He Gly Thr He Val Gly Thr Tyr Ser Ser He Phe 270 275 280
ATC GCC CCT AAA GTG GCG TTA TTG TTA GGC TTT GAT ATG GAT AAA TAT 915 He Ala Pro Lys Val Ala Leu Leu Leu Gly Phe Asp Met Asp Lys Tyr 285 290 295
TAT GAG AAT GAG ACT AGA AAA ATT AAA AAA GCT CAA GAG AAA GAA AAA 963 Tyr Glu Asn Glu Thr Arg Lys He Lys Lys Ala Gin Glu Lys Glu Lys 300 305 310
ATG CGC CGT TTG TAT GAG AGC GGT CAA GTT TAAGGAGTTT CTATGGATTG GGG 1016 Met Arg Arg Leu Tyr Glu Ser Gly Gin Val 315 320
TCGGGTCGTT CATGTGCTG 1035
(2) INFORMATION FOR SEQ ID NO: 1136:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 323 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1136: Met Glu Leu Phe Lys Arg Thr Arg He Leu Ser Phe Met Arg Tyr Ser 1 5 10 15
Asn Tyr Gly Val He Val Ser Ala He Leu Ala Leu Leu Ala Leu Gly
20 25 30
Leu Leu Phe Phe Lys Gly Phe Ser Leu Gly He Asp Phe Ala Gly Gly
35 40 45
Ser Leu Val Gin Val Arg Tyr Thr Gin Asn Ala Pro He Lys Glu Val
50 55 60
Arg Asp Leu Phe Glu Lys Glu Ala Arg Phe Lys Gly Val Gin Val Ser 65 70 75 80
Glu Phe Gly Ser Lys Glu Glu He Leu He Lys Phe Pro Phe Val Glu
85 90 95
Thr Ala Glu Asn Glu Asp Leu Asn Ala He Val Ala Asn He Leu Lys
100 105 110
Pro Ser Gly Asp Phe Glu He Arg Lys Phe Asp Thr Val Gly Pro Arg
115 120 125
Val Gly Ser Glu Leu Lys Glu Lys Gly He Leu Ser Leu He Leu Ala
130 135 140
Leu He Ala He Met Val Tyr Val Ser Phe Arg Tyr Glu Trp Arg Phe 145 150 155 160
Ala Leu Ala Ser Val He Ala Leu Val His Asp Val He Leu Val Ala
165 170 175
Ser Ser Val He Val Phe Lys He Asp Met Asn Leu Glu Val He Ala
180 185 190
Ala Leu Leu Thr Leu He Gly Tyr Ser He Asn Asp Thr He He He
195 200 205
Phe Asp Arg He Arg Glu Glu Met Xaa Ser Gin Lys Thr Lys Asn Ala
210 215 220
Thr Gin Ala He Asp Glu Ala He Ser Ser Thr Leu Thr Arg Thr Leu 225 230 235 240
Leu Thr Ser Leu Thr Val Phe Phe Val Val Leu He Leu Cys Val Phe
245 250 255
Gly Ser Lys He He He Gly Phe Ser Leu Pro Met Leu He Gly Thr
260 265 270
He Val Gly Thr Tyr Ser Ser He Phe He Ala Pro Lys Val Ala Leu
275 280 285
Leu Leu Gly Phe Asp Met Asp Lys Tyr Tyr Glu Asn Glu Thr Arg Lys
290 295 300
He Lys Lys Ala Gin Glu Lys Glu Lys Met Arg Arg Leu Tyr Glu Ser 305 310 315 320
Gly Gin Val
(2) INFORMATION FOR SEQ ID NO: 1137:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 670 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 56...634 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1137:
TAGCTATTTC TTTAAAGCCG CTCTTTTGTC TAGCGCAAAT AAATACAAAG CCCCT ATG 58
Met 1
ATC CCA GAA ATC AAA GAT CCG AGT AAA ATC GCA ATT TTT GCC ACT TCC 106 He Pro Glu He Lys Asp Pro Ser Lys He Ala He Phe Ala Thr Ser 5 10 15
ATA GCG TCT TTA TGC TCG CTC GTG AAG GCC AGA TTA GAA ATA AAC ATA 154 He Ala Ser Leu Cys Ser Leu Val Lys Ala Arg Leu Glu He Asn He 20 25 30
GAC ATG GTA AAG CCA ATC CCT GCT AAA AGC CCA GCC CCT AAA ATA TGC 202 Asp Met Val Lys Pro He Pro Ala Lys Ser Pro Ala Pro Lys He Cys 35 40 45
CAC CAG CTG ATG CCT TTA GGG CGT GCG GTG ATT TTA AGC TTT TCG CTT 250 His Gin Leu Met Pro Leu Gly Arg Ala Val He Leu Ser Phe Ser Leu 50 55 60 ' 65
ATA AAA GTG ATT AAG AAA ATC CCT AAA GGT TTG CCC AAG CAA AGC CCT 298 He Lys Val He Lys Lys He Pro Lys Gly Leu Pro Lys Gin Ser Pro 70 75 80
AAA ATA ACC CCT AAA AGC ACC TTA TCC ACT TCT AAA TTG ATG CTA GAA 346 Lys He Thr Pro Lys Ser Thr Leu Ser Thr Ser Lys Leu Met Leu Glu 85 90 95
TCA ACG CTC ACC CCA GCG TTT GCA AAC GCG AAT AAG GGC ATG ATG AAA 394 Ser Thr Leu Thr Pro Ala Phe Ala Asn Ala Asn Lys Gly Met Met Lys 100 105 110
TAC CCG CTA ATG GGG GCT AGA AAA TGC TCC AAT CTT TCT AAG GGG CTT 442 Tyr Pro Leu Met Gly Ala Arg Lys Cys Ser Asn Leu Ser Lys Gly Leu 115 120 125
TGT AAG GCG CTC GCT TTT TCT TCA ATA GAA TGC AAG ATT TCT TGC TGC 490 Cys Lys Ala Leu Ala Phe Ser Ser He Glu Cys Lys He Ser Cys Cys 130 135 140 145
TCT TTA CTC AAA AGC GCT CCT GAA CTC GTT TCT GCG TAT CGT TTG CCT 538 Ser Leu Leu Lys Ser Ala Pro Glu Leu Val Ser Ala Tyr Arg Leu Pro 150 155 160
AGT TCC AAA AGC TCT ACA TTT TTA GAA TCT TTA GGG ATC TTC ACC GGT 586 Ser Ser Lys Ser Ser Thr Phe Leu Glu Ser Leu Gly He Phe Thr Gly 165 170 175
ATC ATA AAA GCT AGA ATC ACT GCA GCA ATC GTC GCA TGG ATA CCG CTT T 635 He He Lys Ala Arg He Thr Ala Ala He Val Ala Trp He Pro Leu 180 185 190
GATGCACGCA AAACCAAAGC AACACCCCTA AAAGC 670
(2) INFORMATION FOR SEQ ID NO: 1138:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 193 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1138:
Met He Pro Glu He Lys Asp Pro Ser Lys He Ala He Phe Ala Thr
1 5 10 15
Ser He Ala Ser Leu Cys Ser Leu Val Lys Ala Arg Leu Glu He Asn
20 25 30
He Asp Met Val Lys Pro He Pro Ala Lys Ser Pro Ala Pro Lys He
35 40 45
Cys His Gin Leu Met Pro Leu Gly Arg Ala Val He Leu Ser Phe Ser
50 55 60
Leu He Lys Val He Lys Lys He Pro Lys Gly Leu Pro Lys Gin Ser 65 70 75 80
Pro Lys He Thr Pro Lys Ser Thr Leu Ser Thr Ser Lys Leu Met Leu
85 90 95
Glu Ser Thr Leu Thr Pro Ala Phe Ala Asn Ala Asn Lys Gly Met Met
100 105 110
Lys Tyr Pro Leu Met Gly Ala Arg Lys Cys Ser Asn Leu Ser Lys Gly
115 120 125
Leu Cys Lys Ala Leu Ala Phe Ser Ser He Glu Cys Lys He Ser Cys
130 135 140
Cys Ser Leu Leu Lys Ser Ala Pro Glu Leu Val Ser Ala Tyr Arg Leu 145 150 155 160
Pro Ser Ser Lys Ser Ser Thr Phe Leu Glu Ser Leu Gly He Phe Thr
165 170 175
Gly He He Lys Ala Arg He Thr Ala Ala He Val Ala Trp He Pro
180 185 190
Leu
(2) INFORMATION FOR SEQ ID NO: 1139:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1427 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...1365 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1139:
AAGACTGCTT GAAAAATT ATG GGT CTG AAA TTA AAA ATT TTA AGG TTG TCT 51
Met Gly Leu Lys Leu Lys He Leu Arg Leu Ser 1 5 10
ATG AAT CTC AAA AAA ACA GAA AAC GCG CTC AGT TTG ACG CTT AAG AAC 99 Met Asn Leu Lys Lys Thr Glu Asn Ala Leu Ser Leu Thr Leu Lys Asn 15 20 25
TTC ATT AAA AGC GAG TCT TTT GGA GGG ATT TTC CTC TTT TTA AAC GCT 147 Phe He Lys Ser Glu Ser Phe Gly Gly He Phe Leu Phe Leu Asn Ala 30 35 40
GTT TTA GCG ATG GTG GTG GCT AAT TCG TTT TTA AAA GAA AGT TAT TTT 195 Val Leu Ala Met Val Val Ala Asn Ser Phe Leu Lys Glu Ser Tyr Phe 45 50 55
GCA CTA TGG CAC ACC CCT TTT GGG TTT CAA ATA GGG GAT TTT TTC ATC 243 Ala Leu Trp His Thr Pro Phe Gly Phe Gin He Gly Asp Phe Phe He 60 65 70 75
GGC TTT AGT TTG CAC AAC TGG ATT GAT GAT GTC TTA ATG GCG TTA TTC 291 Gly Phe Ser Leu His Asn Trp He Asp Asp Val Leu Met Ala Leu Phe 80 85 90
TTT TTA ATG ATA GGC TTA GAA ATC AAA CGA GAA TTG TTG TTT GGG GAA 339 Phe Leu Met He Gly Leu Glu He Lys Arg Glu Leu Leu Phe Gly Glu 95 100 105
TTA TCC AGT TTC AAA AAA GCT TCT TTT CCT GTG ATT GCG GCC ATA GGG 387 Leu Ser Ser Phe Lys Lys Ala Ser Phe Pro Val He Ala Ala He Gly 110 115 120
GGC ATG ATA GCC CCA GGA TTG ATT TAT TTT TTT CTT AAC GCT AAC ACG 435 Gly Met He Ala Pro Gly Leu He Tyr Phe Phe Leu Asn Ala Asn Thr 125 130 135
CCT TCC CAG CAT GGT TTT GGG ATC CCT ATG GCG ACG GAT ATT GCG TTC 483 Pro Ser Gin His Gly Phe Gly He Pro Met Ala Thr Asp He Ala Phe 140 145 150 155
GCT TTA GGC GTG ATC ATG CTT TTA GGC AAG AGG GTG CCA ACC GCT TTA 531 Ala Leu Gly Val He Met Leu Leu Gly Lys Arg Val Pro Thr Ala Leu 160 165 170
AAG GTT TTT TTA ATC ACT CTA GCG GTG GCT GAT GAC TTG GGG GCT ATT 579 Lys Val Phe Leu He Thr Leu Ala Val Ala Asp Asp Leu Gly Ala He 175 180 185 GTG GTG ATC GCG CTC TTT TAT ACC ACG AAT TTA AAA TTC GCA TGG CTT 627 Val Val He Ala Leu Phe Tyr Thr Thr Asn Leu Lys Phe Ala Trp Leu 190 195 200
TTA GGG GCT TTA GGG GTG GTT CTT GTT TTA GCC GTA TTA AAC CGC CTG 675 Leu Gly Ala Leu Gly Val Val Leu Val Leu Ala Val Leu Asn Arg Leu 205 210 215
AAT ATG CGC TCG CTC ATC CCT TAC TTG CTT TTA GGG GTG TTG CTT TGG 723 Asn Met Arg Ser Leu He Pro Tyr Leu Leu Leu Gly Val Leu Leu Trp 220 225 230 235
TTT TGC GTG CAT CAA AGC GGT ATC CAT GCG ACG ATT GCT GCA GTG ATT 771 Phe Cys Val His Gin Ser Gly He His Ala Thr He Ala Ala Val He 240 245 250
CTA GCT TTT ATG ATA CCG GTG AAG ATC CCT AAA GAT TCT AAA AAT GTA 819 Leu Ala Phe Met He Pro Val Lys He Pro Lys Asp Ser Lys Asn Val 255 260 265
GAG CTT TTG GAA CTA GGC AAA CGA TAC GCA GAA ACG AGT TCA GGA GCG 867 Glu Leu Leu Glu Leu Gly Lys Arg Tyr Ala Glu Thr Ser Ser Gly Ala 270 275 280
CTT TTG AGT AAA GAG CAG CAA GAA ATC TTG CAT TCT ATT GAA GAA AAA 915 Leu Leu Ser Lys Glu Gin Gin Glu He Leu His Ser He Glu Glu Lys 285 290 295
GCG AGC GCC TTA CAA AGC CCC TTA GAA AGA TTG GAG CAT TTT CTA GCC 963 Ala Ser Ala Leu Gin Ser Pro Leu Glu Arg Leu Glu His Phe Leu Ala 300 305 310 315
CCC ATT AGC GGG TAT TTC ATC ATG CCC TTA TTC GCG TTT GCA AAC GCT 1011 Pro He Ser Gly Tyr Phe He Met Pro Leu Phe Ala Phe Ala Asn Ala 320 325 330
GGG GTG AGC GTT GAT TCT AGC ATC AAT TTA GAA GTG GAT AAG GTG CTT 1059 Gly Val Ser Val Asp Ser Ser He Asn Leu Glu Val Asp Lys Val Leu 335 340 345
TTA GGG GTT ATT TTA GGG CTT TGC TTG GGC AAA CCT TTA GGG ATT TTC 1107 Leu Gly Val He Leu Gly Leu Cys Leu Gly Lys Pro Leu Gly He Phe 350 355 360
TTA ATC ACT TTT ATA AGC GAA AAG CTT AAA ATC ACC GCA CGC CCT AAA 1155 Leu He Thr Phe He Ser Glu Lys Leu Lys He Thr Ala Arg Pro Lys 365 370 375
GGC ATC AGC TGG TGG CAT ATT TTA GGG GCT GGG CTT TTA GCA GGG ATT 1203 Gly He Ser Trp Trp His He Leu Gly Ala Gly Leu Leu Ala Gly He 380 385 390 395
GGC TTT ACC ATG TCT ATG TTT ATT TCT AAT CTG GCC TTC ACG AGC GAG 1251 Gly Phe Thr Met Ser Met Phe He Ser Asn Leu Ala Phe Thr Ser Glu 400 405 410 CAT AAA GAC GCT ATG GAA GTG GCA AAA ATT GCG ATT TTA CTC GGA TCT 1299 His Lys Asp Ala Met Glu Val Ala Lys He Ala He Leu Leu Gly Ser 415 420 425
TTG ATT TCT GGG ATC ATA GGG GCT TTG TAT TTA TTT GCG CTA GAC AAA 1347 Leu He Ser Gly He He Gly Ala Leu Tyr Leu Phe Ala Leu Asp Lys 430 435 440
AGA GCG GCT TTA AAG AAA TAGCTAAAAA TGCTATAATT TGAGATTAAA ACATCTTT 1403 Arg Ala Ala Leu Lys Lys 445
TAAGGAAATT AAATGGGACA AATT 1427
(2) INFORMATION FOR SEQ ID NO: 1140:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 449 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1140:
Met Gly Leu Lys Leu Lys He Leu Arg Leu Ser Met Asn Leu Lys Lys
1 5 10 15
Thr Glu Asn Ala Leu Ser Leu Thr Leu Lys Asn Phe He Lys Ser Glu
20 25 30
Ser Phe Gly Gly He Phe Leu Phe Leu Asn Ala Val Leu Ala Met Val
35 40 45
Val Ala Asn Ser Phe Leu Lys Glu Ser Tyr Phe Ala Leu Trp His Thr
50 55 60
Pro Phe Gly Phe Gin He Gly Asp Phe Phe He Gly Phe Ser Leu His 65 70 75 80
Asn Trp He Asp Asp Val Leu Met Ala Leu Phe Phe Leu Met He Gly
85 90 95
Leu Glu He Lys Arg Glu Leu Leu Phe Gly Glu Leu Ser Ser Phe Lys
100 105 110
Lys Ala Ser Phe Pro Val He Ala Ala He Gly Gly Met He Ala Pro
115 120 125
Gly Leu He Tyr Phe Phe Leu Asn Ala Asn Thr Pro Ser Gin His Gly
130 135 140
Phe Gly He Pro Met Ala Thr Asp He Ala Phe Ala Leu Gly Val He 145 150 155 160
Met Leu Leu Gly Lys Arg Val Pro Thr Ala Leu Lys Val Phe Leu He
165 170 175
Thr Leu Ala Val Ala Asp Asp Leu Gly Ala He Val Val He Ala Leu
180 185 190
Phe Tyr Thr Thr Asn Leu Lys Phe Ala Trp Leu Leu Gly Ala Leu Gly
195 200 205
Val Val Leu Val Leu Ala Val Leu Asn Arg Leu Asn Met Arg Ser Leu
210 215 220
He Pro Tyr Leu Leu Leu Gly Val Leu Leu Trp Phe Cys Val His Gin 225 230 235 240
Ser Gly He His Ala Thr He Ala Ala Val He Leu Ala Phe Met He
245 250 255
Pro Val Lys He Pro Lys Asp Ser Lys Asn Val Glu Leu Leu Glu Leu
260 265 270
Gly Lys Arg Tyr Ala Glu Thr Ser Ser Gly Ala Leu Leu Ser Lys Glu
275 280 285
Gin Gin Glu He Leu His Ser He Glu Glu Lys Ala Ser Ala Leu Gin
290 295 300
Ser Pro Leu Glu Arg Leu Glu His Phe Leu Ala Pro He Ser Gly Tyr 305 310 315 320
Phe He Met Pro Leu Phe Ala Phe Ala Asn Ala Gly Val Ser Val Asp
325 330 335
Ser Ser He Asn Leu Glu Val Asp Lys Val Leu Leu Gly Val He Leu
340 345 350
Gly Leu Cys Leu Gly Lys Pro Leu Gly He Phe Leu He Thr Phe He
355 360 365
Ser Glu Lys Leu Lys He Thr Ala Arg Pro Lys Gly He Ser Trp Trp
370 375 380
His He Leu Gly Ala Gly Leu Leu Ala Gly He Gly Phe Thr Met Ser 385 390 395 400
Met Phe He Ser Asn Leu Ala Phe Thr Ser Glu His Lys Asp Ala Met
405 410 415
Glu Val Ala Lys He Ala He Leu Leu Gly Ser Leu He Ser Gly He
420 425 430
He Gly Ala Leu Tyr Leu Phe Ala Leu Asp Lys Arg Ala Ala Leu Lys
435 440 445
Lys
(2) INFORMATION FOR SEQ ID NO: 1141:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1903 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 13...1857 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1141:
AAGTAAGTGC TT ATG GAT AAT AGG AAT ATT GAT CCT TAC TTC AAC CCA GAG 51 Met Asp Asn Arg Asn He Asp Pro Tyr Phe Asn Pro Glu 1 5 10
CAA TTT TTA GAA ACC CAA AAA TAC AAA GGC ACG GTT ACA GCA TTA ATC 99 Gin Phe Leu Glu Thr Gin Lys Tyr Lys Gly Thr Val Thr Ala Leu He 15 20 25
TTT TTA TTG CTT TTT TTT ATT TTT TTA ATG GTG GCT TTT AAA AAA GCT 147 Phe Leu Leu Leu Phe Phe He Phe Leu Met Val Ala Phe Lys Lys Ala 30 35 40 45
TTT TTT GCC CAA GCC AAC ATG CCT AAT CTA GTG ATG AGC AAA CAA GAC 195 Phe Phe Ala Gin Ala Asn Met Pro Asn Leu Val Met Ser Lys Gin Asp 50 55 60
ACT GCG GCT AGG GGG ACT ATC TAT AGT CAA GAC AAC TAC AGC CTA GCC 243 Thr Ala Ala Arg Gly Thr He Tyr Ser Gin Asp Asn Tyr Ser Leu Ala 65 70 75
ACT TCA CAA ACC CTT TTC AAA CTG GGC TTT GAT ACA AGG TTT TTA AAC 291 Thr Ser Gin Thr Leu Phe Lys Leu Gly Phe Asp Thr Arg Phe Leu Asn 80 85 90
CCG GAT AAA GAA GAT TTT TTC ATT GAT TTC CTT TCT ATT TAT AGC AAT 339 Pro Asp Lys Glu Asp Phe Phe He Asp Phe Leu Ser He Tyr Ser Asn 95 100 105
ATC CCT AAA AAG TCC TTA AAA GAC GCC ATC AAT ACA AAA GGC TAT ATC 387 He Pro Lys Lys Ser Leu Lys Asp Ala He Asn Thr Lys Gly Tyr He 110 115 120 125
ATT CTA GCC TAT GAT CTC ACG CCC AAT ATG GCT GCT AAT ATT AGA GAC 435 He Leu Ala Tyr Asp Leu Thr Pro Asn Met Ala Ala Asn He Arg Asp 130 135 140
TTA AAT AAG AAA TTT TTA GCC TTT GGG GTT TTT CAA AAT TTC AAA GAC 483 Leu Asn Lys Lys Phe Leu Ala Phe Gly Val Phe Gin Asn Phe Lys Asp 145 150 155
GCG CAC GAT AAG GTG TGG CAA AAG CAA GGG CTA AAC ATT GAA GTG AGC 531 Ala His Asp Lys Val Trp Gin Lys Gin Gly Leu Asn He Glu Val Ser 160 165 170
GGC GTT TCT AGG CAT TAC CCT TAT CAA AAT AGC CTA GAG CCA ATC ATT 579 Gly Val Ser Arg His Tyr Pro Tyr Gin Asn Ser Leu Glu Pro He He 175 180 185
GGC TAT GTG CAA AAA CAA GAA GAA GAC AAG CTC ACT TTA ACT ACC GGT 627 Gly Tyr Val Gin Lys Gin Glu Glu Asp Lys Leu Thr Leu Thr Thr Gly 190 195 200 205
AAA AAA GGC GTT GAA AAA TCT CAA GAT CAC TTG CTT AAA GCC CAA CAA 675 Lys Lys Gly Val Glu Lys Ser Gin Asp His Leu Leu Lys Ala Gin Gin 210 215 220
AAT GGC ATA AGA ACA GGC AAA AGA GAC GTG AGT TTT AAC TTT ATC CAA 723 Asn Gly He Arg Thr Gly Lys Arg Asp Val Ser Phe Asn Phe He Gin 225 230 235
AAC CAC TCT TAT ACA GAG GTT GAA CGC CTT GAT GGC TAT GAG GTG TAT 771 Asn His Ser Tyr Thr Glu Val Glu Arg Leu Asp Gly Tyr Glu Val Tyr 240 245 250
TTG AGC GTT CCT TTA AAA CTC CAA AGA GAA ATT GAA ACC CTA TTG GAT 819 Leu Ser Val Pro Leu Lys Leu Gin Arg Glu He Glu Thr Leu Leu Asp 255 260 265
AAA ACT AAA GAC AAA CTC AAG GCT AAA GAA ATC CTA GTG GGT ATC ATT 867 Lys Thr Lys Asp Lys Leu Lys Ala Lys Glu He Leu Val Gly He He 270 275 280 285
AAC CCT AAA AGC GGG GAA ATT TTA TCG CTA GCT TCA AGC AAG CGC TTC 915 Asn Pro Lys Ser Gly Glu He Leu Ser Leu Ala Ser Ser Lys Arg Phe 290 295 300
AAT CCT AAT GCG ATT AAA ACC AGC GAT TAT GAA AGC TTG AAT TTG AGC 963 Asn Pro Asn Ala He Lys Thr Ser Asp Tyr Glu Ser Leu Asn Leu Ser 305 310 315
GTT GCT GAA AAG GTT TTT GAG CCA GGC AGC ACG ATC AAA CCC ATT GTT 1011 Val Ala Glu Lys Val Phe Glu Pro Gly Ser Thr He Lys Pro He Val 320 325 330
TAT TCC TTG CTG TTA GAC AAG AAT TTG ATC AAC CCC AAA GAA CGC ATT 1059 Tyr Ser Leu Leu Leu Asp Lys Asn Leu He Asn Pro Lys Glu Arg He 335 340 345
GAT TTA AAC CAT GGC TAT TAC CAA TTA GGA AAA TAC ACC ATT AAA GAC 1107 Asp Leu Asn His Gly Tyr Tyr Gin Leu Gly Lys Tyr Thr He Lys Asp 350 355 360 365
GAC TTT ATC CCC AGT AAA AAA GCC GTT GTG GAA GAC ATT TTG ATC CAA 1155 Asp Phe He Pro Ser Lys Lys Ala Val Val Glu Asp He Leu He Gin 370 375 380
TCT AGC AAT GTG GGC ATG ATA AAA ATC AGT AAA AAC TTA AAC CCA AAG 1203 Ser Ser Asn Val Gly Met He Lys He Ser Lys Asn Leu Asn Pro Lys 385 390 395
GAT TTC TAT AAT GGG CTT TTA GGC TAT GGA TTT TCT CAA AAA ACC GGC 1251 Asp Phe Tyr Asn Gly Leu Leu Gly Tyr Gly Phe Ser Gin Lys Thr Gly 400 405 410
ATT GAT TTA TCT CTA GAA GCC ACA GGA AAG ATC CCT CCT TTG TCC GCT 1299 He Asp Leu Ser Leu Glu Ala Thr Gly Lys He Pro Pro Leu Ser Ala 415 420 425
TTC AAG CGT GAA GTG TTA AAG GGG AGC GTT TCT TAT GGC TAT GGG CTG 1347 Phe Lys Arg Glu Val Leu Lys Gly Ser Val Ser Tyr Gly Tyr Gly Leu 430 435 440 445
AAC GCG ACT TTT TTG CAG CTT TTA AGG GCT TAT GCG GTG TTT TCT AAT 1395 Asn Ala Thr Phe Leu Gin Leu Leu Arg Ala Tyr Ala Val Phe Ser Asn 450 455 460 GAA GGC AAA TTG ACT ACC CCC TAT TTA GTG CAA CGA GAA ACC GCC CCT 1443 Glu Gly Lys Leu Thr Thr Pro Tyr Leu Val Gin Arg Glu Thr Ala Pro 465 470 475
AAT GGC GAT ATT TAC ATC CCT AGC CCC AAA CCC ACC TTT CAA GTC ATT 1491 Asn Gly Asp He Tyr He Pro Ser Pro Lys Pro Thr Phe Gin Val He 480 485 490
AGC CCA AAA AGC GCT AGG AAA ATG AAA GAA ACC TTA ATC AAA GTA GTG 1539 Ser Pro Lys Ser Ala Arg Lys Met Lys Glu Thr Leu He Lys Val Val 495 500 505
CGT TAT GGC ACA GGC AAA AAC GCT CAA TTT GAA GGG CTA TAC ATA GGG 1587 Arg Tyr Gly Thr Gly Lys Asn Ala Gin Phe Glu Gly Leu Tyr He Gly 510 515 520 525
GGC AAA ACC GGC ACG GCT AGG GTC GCT AAA AAC GGG AGT TAT AGC GCG 1635 Gly Lys Thr Gly Thr Ala Arg Val Ala Lys Asn Gly Ser Tyr Ser Ala 530 535 540
CAG TCC TAC AAC AGC TCT TTT TTT GGG TTT GCT GAA GAT GAA AGG CAG 1683 Gin Ser Tyr Asn Ser Ser Phe Phe Gly Phe Ala Glu Asp Glu Arg Gin 545 550 555
GTT TTT ACT ATC GGC GTG GTT ATC TTA GGT TCG CAT GGC AAG GAA GAA 1731 Val Phe Thr He Gly Val Val He Leu Gly Ser His Gly Lys Glu Glu 560 565 570
TAT TAC GCC AGC AAG ATT GCA GCC CCC ATT TTT AAA GAA ATC ACC GAA 1779 Tyr Tyr Ala Ser Lys He Ala Ala Pro He Phe Lys Glu He Thr Glu 575 580 585
ATT TTA GTG CGT TAC AAT TAT CTA TCG CCC TCT ATT GCG ATT CAA AAC 1827 He Leu Val Arg Tyr Asn Tyr Leu Ser Pro Ser He Ala He Gin Asn 590 595 600 605
GCG CTC GAG AAA AAC CGC TTT AAG ATA AAA TAAAAGGCTC TTTTCAACCC AAA 1880 Ala Leu Glu Lys Asn Arg Phe Lys He Lys 610 615
CTCCAAAAAA GGAGTCTTAA GTT 1903
(2) INFORMATION FOR SEQ ID NO: 1142:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 615 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1142: Met Asp Asn Arg Asn He Asp Pro Tyr Phe Asn Pro Glu Gin Phe Leu 1 5 10 15
Glu Thr Gin Lys Tyr Lys Gly Thr Val Thr Ala Leu He Phe Leu Leu
20 25 30
Leu Phe Phe He Phe Leu Met Val Ala Phe Lys Lys Ala Phe Phe Ala
35 40 45
Gin Ala Asn Met Pro Asn Leu Val Met Ser Lys Gin Asp Thr Ala Ala
50 55 60
Arg Gly Thr He Tyr Ser Gin Asp Asn Tyr Ser Leu Ala Thr Ser Gin 65 70 75 80
Thr Leu Phe Lys Leu Gly Phe Asp Thr Arg Phe Leu Asn Pro Asp Lys
85 90 95
Glu Asp Phe Phe He Asp Phe Leu Ser He Tyr Ser Asn He Pro Lys
100 105 110
Lys Ser Leu Lys Asp Ala He Asn Thr Lys Gly Tyr He He Leu Ala
115 120 125
Tyr Asp Leu Thr Pro Asn Met Ala Ala Asn He Arg Asp Leu Asn Lys
130 135 140
Lys Phe Leu Ala Phe Gly Val Phe Gin Asn Phe Lys Asp Ala His Asp 145 150 155 160
Lys Val Trp Gin Lys Gin Gly Leu Asn He Glu Val Ser Gly Val Ser
165 170 175
Arg His Tyr Pro Tyr Gin Asn Ser Leu Glu Pro He He Gly Tyr Val
180 185 190
Gin Lys Gin Glu Glu Asp Lys Leu Thr Leu Thr Thr Gly Lys Lys Gly
195 200 205
Val Glu Lys Ser Gin Asp His Leu Leu Lys Ala Gin Gin Asn Gly He
210 215 220
Arg Thr Gly Lys Arg Asp Val Ser Phe Asn Phe He Gin Asn His Ser 225 230 235 240
Tyr Thr Glu Val Glu Arg Leu Asp Gly Tyr Glu Val Tyr Leu Ser Val
245 250 255
Pro Leu Lys Leu Gin Arg Glu He Glu Thr Leu Leu Asp Lys Thr Lys
260 265 270
Asp Lys Leu Lys Ala Lys Glu He Leu Val Gly He He Asn Pro Lys
275 280 285
Ser Gly Glu He Leu Ser Leu Ala Ser Ser Lys Arg Phe Asn Pro Asn
290 295 300
Ala He Lys Thr Ser Asp Tyr Glu Ser Leu Asn Leu Ser Val Ala Glu 305 310 315 320
Lys Val Phe Glu Pro Gly Ser Thr He Lys Pro He Val Tyr Ser Leu
325 330 335
Leu Leu Asp Lys Asn Leu He Asn Pro Lys Glu Arg He Asp Leu Asn
340 345 350
His Gly Tyr Tyr Gin Leu Gly Lys Tyr Thr He Lys Asp Asp Phe He
355 360 365
Pro Ser Lys Lys Ala Val Val Glu Asp He Leu He Gin Ser Ser Asn
370 375 380
Val Gly Met He Lys He Ser Lys Asn Leu Asn Pro Lys Asp Phe Tyr 385 390 395 400
Asn Gly Leu Leu Gly Tyr Gly Phe Ser Gin Lys Thr Gly He Asp Leu
405 410 415
Ser Leu Glu Ala Thr Gly Lys He Pro Pro Leu Ser Ala Phe Lys Arg
420 425 430
Glu Val Leu Lys Gly Ser Val Ser Tyr Gly Tyr Gly Leu Asn Ala Thr 435 440 445 Phe Leu Gin Leu Leu Arg Ala Tyr Ala Val Phe Ser Asn Glu Gly Lys
450 455 460
Leu Thr Thr Pro Tyr Leu Val Gin Arg Glu Thr Ala Pro Asn Gly Asp 465 470 475 480
He Tyr He Pro Ser Pro Lys Pro Thr Phe Gin Val He Ser Pro Lys
485 490 495
Ser Ala Arg Lys Met Lys Glu Thr Leu He Lys Val Val Arg Tyr Gly
500 505 510
Thr Gly Lys Asn Ala Gin Phe Glu Gly Leu Tyr He Gly Gly Lys Thr
515 520 525
Gly Thr Ala Arg Val Ala Lys Asn Gly Ser Tyr Ser Ala Gin Ser Tyr
530 535 540
Asn Ser Ser Phe Phe Gly Phe Ala Glu Asp Glu Arg Gin Val Phe Thr 545 550 555 560
He Gly Val Val He Leu Gly Ser His Gly Lys Glu Glu Tyr Tyr Ala
565 570 575
Ser Lys He Ala Ala Pro He Phe Lys Glu He Thr Glu He Leu Val
580 585 590
Arg Tyr Asn Tyr Leu Ser Pro Ser He Ala He Gin Asn Ala Leu Glu
595 600 605
Lys Asn Arg Phe Lys He Lys 610 615
(2) INFORMATION FOR SEQ ID NO: 1143:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 719 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...678 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1143:
TAGCCTAGTC TTTATCGC ATG CTA TTT AAT GGG CTA TGC TTA TTT GAA CAG 51
Met Leu Phe Asn Gly Leu Cys Leu Phe Glu Gin 1 5 10
GCA AGT TTG TGC TTT AGA AAA GCG AGC GTT TCA ATG AAA AAG CTC AAA 99 Ala Ser Leu Cys Phe Arg Lys Ala Ser Val Ser Met Lys Lys Leu Lys 15 20 25
GGT CTT TTT TTG ATC CTG CTC TTA TGG GTC TAT CCT TTA AGG AGT GAG 147 Gly Leu Phe Leu He Leu Leu Leu Trp Val Tyr Pro Leu Arg Ser Glu 30 35 40
CCA ATC AAT GAG GGA GCA TAC ATT TTA GAA GAG ATT GGC GAT GTG CTT 195 Pro He Asn Glu Gly Ala Tyr He Leu Glu Glu He Gly Asp Val Leu 45 50 55
AGG TTT TTG CCT ATT TTT GTA GGC ACG GTC AGT TTG GCG ATG CGC GAT 243 Arg Phe Leu Pro He Phe Val Gly Thr Val Ser Leu Ala Met Arg Asp 60 65 70 75
TAT AGA GGG TTA GGG GAA TTA GCG GTC GGC ACA TTG GTT ACT CAA GGC 291 Tyr Arg Gly Leu Gly Glu Leu Ala Val Gly Thr Leu Val Thr Gin Gly 80 85 90
GTG ATT TAT GGC CTT AAA GGA GCT TTT AGC AAC GCC CAT AAA GAT GGG 339 Val He Tyr Gly Leu Lys Gly Ala Phe Ser Asn Ala His Lys Asp Gly 95 100 105
GCT AGA GTG GAA TTT GCT AAA CGC CCG TGC TGT AAT TCT TGG AGA GGC 387 Ala Arg Val Glu Phe Ala Lys Arg Pro Cys Cys Asn Ser Trp Arg Gly 110 115 120
ATG CCA AGC GGG CAT GCT GGG GGG GTG TTT AGC GCG GCT GGG TTT GTG 435 Met Pro Ser Gly His Ala Gly Gly Val Phe Ser Ala Ala Gly Phe Val 125 130 135
TAT TAC CGC TAT GGG TGG AAG CCG GCT CTT CCT GTG ATC GCT CTT GCA 483 Tyr Tyr Arg Tyr Gly Trp Lys Pro Ala Leu Pro Val He Ala Leu Ala 140 145 150 155
ATC CTC ACT GAC GCT AGC AGA GTG GTG GCA AGA CAA CAC ACG ATC TTG 531 He Leu Thr Asp Ala Ser Arg Val Val Ala Arg Gin His Thr He Leu 160 165 170
CAA GTT ACG ATC GGC AGC CTT ATC GCA TGG GGG TTT GCT TAT TTA TTC 579 Gin Val Thr He Gly Ser Leu He Ala Trp Gly Phe Ala Tyr Leu Phe 175 180 185
ACT TCA CGC TAC AAA CCC AAG CAA TGG ATG CTC TAT CCT GAA ATT TCT 627 Thr Ser Arg Tyr Lys Pro Lys Gin Trp Met Leu Tyr Pro Glu He Ser 190 195 200
AGC GAT TTT AAG GGC AGT AGC CGC TAT GGG GTG AGC TTT TCT TAT CAA 675 Ser Asp Phe Lys Gly Ser Ser Arg Tyr Gly Val Ser Phe Ser Tyr Gin 205 210 215
TGG TAAAGGGATA AAGTGCTAAA AAAATTATTA TTCATTGCAC T 719
Trp
220
(2) INFORMATION FOR SEQ ID NO: 1144
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 220 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1144:
Met Leu Phe Asn Gly Leu Cys Leu Phe Glu Gin Ala Ser Leu Cys Phe
1 5 10 15
Arg Lys Ala Ser Val Ser Met Lys Lys Leu Lys Gly Leu Phe Leu He
20 25 30
Leu Leu Leu Trp Val Tyr Pro Leu Arg Ser Glu Pro He Asn Glu Gly
35 40 45
Ala Tyr He Leu Glu Glu He Gly Asp Val Leu Arg Phe Leu Pro He
50 55 60
Phe Val Gly Thr Val Ser Leu Ala Met Arg Asp Tyr Arg Gly Leu Gly 65 70 75 80
Glu Leu Ala Val Gly Thr Leu Val Thr Gin Gly Val He Tyr Gly Leu
85 90 95
Lys Gly Ala Phe Ser Asn Ala His Lys Asp Gly Ala Arg Val Glu Phe
100 105 110
Ala Lys Arg Pro Cys Cys Asn Ser Trp Arg Gly Met Pro Ser Gly His
115 120 125
Ala Gly Gly Val Phe Ser Ala Ala Gly Phe Val Tyr Tyr Arg Tyr Gly
130 135 140
Trp Lys Pro Ala Leu Pro Val He Ala Leu Ala He Leu Thr Asp Ala 145 150 155 160
Ser Arg Val Val Ala Arg Gin His Thr He Leu Gin Val Thr He Gly
165 170 175
Ser Leu He Ala Trp Gly Phe Ala Tyr Leu Phe Thr Ser Arg Tyr Lys
180 185 190
Pro Lys Gin Trp Met Leu Tyr Pro Glu He Ser Ser Asp Phe Lys Gly
195 200 205
Ser Ser Arg Tyr Gly Val Ser Phe Ser Tyr Gin Trp 210 215 220
(2) INFORMATION FOR SEQ ID NO: 1145:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1087 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...1053 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1145:
TTCAAGCAAA AACACCACCC AAATATAAAG ATA ATG ATT TTA AGC ATT GAA AGT 54
Met He Leu Ser He Glu Ser 1 5 TCT TGC GAT GAC AGC TCT TTA GCC CTT ACA AGA ATA GAG GAC GCT CAA 102 Ser Cys Asp Asp Ser Ser Leu Ala Leu Thr Arg He Glu Asp Ala Gin 10 15 20
CTC ATC GCT CAT TTT AAA ATC TCT CAA GAA AAG CAC CAT AGT TCT TAT 150 Leu He Ala His Phe Lys He Ser Gin Glu Lys His His Ser Ser Tyr 25 30 35
GGG GGC GTT GTG CCT GAG CTT GCA TCA CGT TTG CAT GCT GAG AAT TTG 198 Gly Gly Val Val Pro Glu Leu Ala Ser Arg Leu His Ala Glu Asn Leu 40 45 50 55
CCG CTT TTA TTA GAA CGC ATT AAA ATA AGC TTG AAT AAG GAT TTT TCC 246 Pro Leu Leu Leu Glu Arg He Lys He Ser Leu Asn Lys Asp Phe Ser 60 65 70
AAA ATT AAA GCC ATC GCT ATC ACT AAT CAG CCA GGT TTG AGC GTT ACT 294 Lys He Lys Ala He Ala He Thr Asn Gin Pro Gly Leu Ser Val Thr 75 80 85
TTA ATA GAA GGT TTG ATG ATG GCA AAA GCC TTG AGC TTG TCT TTG AAT 342 Leu He Glu Gly Leu Met Met Ala Lys Ala Leu Ser Leu Ser Leu Asn 90 95 100
TTG CCC TTG ATT TTA GAA GAT CAT TTG AGA GGG CAT GTG TAT TCG CTC 390 Leu Pro Leu He Leu Glu Asp His Leu Arg Gly His Val Tyr Ser Leu 105 110 115
TTT ATC AAT GAA AAA CAA ACC TGC ATG CCT TTA AGC GTG CTC TTA GTC 438 Phe He Asn Glu Lys Gin Thr Cys Met Pro Leu Ser Val Leu Leu Val 120 125 130 135
TCT GGG GGG CAT TCT TTG ATT TTA GAG GCT AGA GAT TAT GAG AAT ATT 486 Ser Gly Gly His Ser Leu He Leu Glu Ala Arg Asp Tyr Glu Asn He 140 145 150
AAA ATC GTT GCC ACG AGT TTA GAC GAT AGC TTT GGG GAG AGT TTT GAT 534 Lys He Val Ala Thr Ser Leu Asp Asp Ser Phe Gly Glu Ser Phe Asp 155 160 165
AAG GTT TCC AAA ATG CTT GAT TTA GGC TAT CCA GGA GGC CCT ATA GTG 582 Lys Val Ser Lys Met Leu Asp Leu Gly Tyr Pro Gly Gly Pro He Val 170 175 180
GAA AAA TTA GCC CTT GAT TAT AGG CAC CCA AAC GAG CCT TTA ATG TTC 630 Glu Lys Leu Ala Leu Asp Tyr Arg His Pro Asn Glu Pro Leu Met Phe 185 190 195
CCT ATC CCT TTA AAA AAC AGC CCG AAT CTG GCT TTT AGT TTT TCA GGT 678 Pro He Pro Leu Lys Asn Ser Pro Asn Leu Ala Phe Ser Phe Ser Gly 200 205 210 215
TTA AAA AAT GCG GTG CGT TTG GAG GTT GAA AAA AAC GCC CCC AAC TTG 726 Leu Lys Asn Ala Val Arg Leu Glu Val Glu Lys Asn Ala Pro Asn Leu 220 225 230 AAT GAA GCG ATC AAA CAA AAG ATT GGC TAT CAT TTT CAA AGT GCA GCG 774 Asn Glu Ala He Lys Gin Lys He Gly Tyr His Phe Gin Ser Ala Ala 235 240 245
ATT GAG CAT TTA ATC CAG CAG ACT AAA CGC TAT TTT AAA ATC AAA CGC 822 He Glu His Leu He Gin Gin Thr Lys Arg Tyr Phe Lys He Lys Arg 250 255 260
CCT AAA ATT TTT GGC ATT GTG GGG GGA GCG AGC CAA AAT TTG GCT TTA 870 Pro Lys He Phe Gly He Val Gly Gly Ala Ser Gin Asn Leu Ala Leu 265 270 275
AGA AAG GCG TTT GAA AAT TTG TGC GAT GCG TTT GAT TGC AAG CTT GTT 918 Arg Lys Ala Phe Glu Asn Leu Cys Asp Ala Phe Asp Cys Lys Leu Val 280 285 290 295
TTA GCC CCT TTA GAA TTT TGC AGC GAC AAT GCC GCC ATG ATA GGG CGA 966 Leu Ala Pro Leu Glu Phe Cys Ser Asp Asn Ala Ala Met He Gly Arg 300 305 310
TCC AGC CTA GAA GCT TAT CAA AAA AAG CGC TTT GTC CCT TTA GAA AAG 1014 Ser Ser Leu Glu Ala Tyr Gin Lys Lys Arg Phe Val Pro Leu Glu Lys 315 320 325
GCT AAC ATT TCG CCA AGA ACG CTG TTA AAA AGT TTT GAG TGAATGGATA CA 1065 Ala Asn He Ser Pro Arg Thr Leu Leu Lys Ser Phe Glu 330 335 340
AAAAGAAAGC GCATGATAAA AC 1087
(2) INFORMATION FOR SEQ ID NO: 1146:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1146:
Met He Leu Ser He Glu Ser Ser Cys Asp Asp Ser Ser Leu Ala Leu
1 5 10 15
Thr Arg He Glu Asp Ala Gin Leu He Ala His Phe Lys He Ser Gin
20 25 30
Glu Lys His His Ser Ser Tyr Gly Gly Val Val Pro Glu Leu Ala Ser
35 40 45
Arg Leu His Ala Glu Asn Leu Pro Leu Leu Leu Glu Arg He Lys He
50 55 60
Ser Leu Asn Lys Asp Phe Ser Lys He Lys Ala He Ala He Thr Asn 65 70 75 80
Gin Pro Gly Leu Ser Val Thr Leu He Glu Gly Leu Met Met Ala Lys
85 90 95
Ala Leu Ser Leu Ser Leu Asn Leu Pro Leu He Leu Glu Asp His Leu 100 105 110
Arg Gly His Val Tyr Ser Leu Phe He Asn Glu Lys Gin Thr Cys Met
115 120 125
Pro Leu Ser Val Leu Leu Val Ser Gly Gly His Ser Leu He Leu Glu
130 135 140
Ala Arg Asp Tyr Glu Asn He Lys He Val Ala Thr Ser Leu Asp Asp 145 150 155 160
Ser Phe Gly Glu Ser Phe Asp Lys Val Ser Lys Met Leu Asp Leu Gly
165 170 175
Tyr Pro Gly Gly Pro He Val Glu Lys Leu Ala Leu Asp Tyr Arg His
180 185 190
Pro Asn Glu Pro Leu Met Phe Pro He Pro Leu Lys Asn Ser Pro Asn
195 200 205
Leu Ala Phe Ser Phe Ser Gly Leu Lys Asn Ala Val Arg Leu Glu Val
210 215 220
Glu Lys Asn Ala Pro Asn Leu Asn Glu Ala He Lys Gin Lys He Gly 225 230 235 240
Tyr His Phe Gin Ser Ala Ala He Glu His Leu He Gin Gin Thr Lys
245 250 255
Arg Tyr Phe Lys He Lys Arg Pro Lys He Phe Gly He Val Gly Gly
260 265 270
Ala Ser Gin Asn Leu Ala Leu Arg Lys Ala Phe Glu Asn Leu Cys Asp
275 280 285
Ala Phe Asp Cys Lys Leu Val Leu Ala Pro Leu Glu Phe Cys Ser Asp
290 295 300
Asn Ala Ala Met He Gly Arg Ser Ser Leu Glu Ala Tyr Gin Lys Lys 305 310 315 320
Arg Phe Val Pro Leu Glu Lys Ala Asn He Ser Pro Arg Thr Leu Leu
325 330 335
Lys Ser Phe Glu 340
(2) INFORMATION FOR SEQ ID NO: 1147:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 547 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...498 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1147:
TGCACTTGTT ATGATGAAGA TGGCGCACTA AGA ATG AAT GAA GAC TTG ACA AAT 54
Met Asn Glu Asp Leu Thr Asn 1 5 TCA ACA GAA TAT AAA AGA TAT GGC CAT GAT TAC GCC AAA TAC CCA AGA 102 Ser Thr Glu Tyr Lys Arg Tyr Gly His Asp Tyr Ala Lys Tyr Pro Arg 10 15 20
AGA ATC GCT GAA GAA TTG CAA CAT TAT GGG GGC AAT AGT TTT GCG AAT 150 Arg He Ala Glu Glu Leu Gin His Tyr Gly Gly Asn Ser Phe Ala Asn 25 30 35
TTT TTT AGA GAT GAA GGG GTC TTA TAC AAA GAG ATT TTG TGC GAT GCG 198 Phe Phe Arg Asp Glu Gly Val Leu Tyr Lys Glu He Leu Cys Asp Ala 40 45 50 55
TGC GAT CAT TTA AAG GTT AAT TAC AAT GAA GAA TCT GCA ACC TCT TTG 246 Cys Asp His Leu Lys Val Asn Tyr Asn Glu Glu Ser Ala Thr Ser Leu 60 65 70
ATT GAG CAA AAC ATG CTT TCT AAA CTC TTG AAA GAT AGT TTA GAA AAA 294 He Glu Gin Asn Met Leu Ser Lys Leu Leu Lys Asp Ser Leu Glu Lys 75 80 85
ATG AGT AGG AGA GAG ATT AAA GAA CTT TGC AAT GAA TTG GGC ATG ACA 342 Met Ser Arg Arg Glu He Lys Glu Leu Cys Asn Glu Leu Gly Met Thr 90 95 100
AAT ATT GAT AAA GTG ATT GGT GAA AAC AAA CAA GTC CTA ATC GCA TCT 390 Asn He Asp Lys Val He Gly Glu Asn Lys Gin Val Leu He Ala Ser 105 110 115
ACT TTA ACG CTG TTT AAA GCG GGT GGC TCT CAT TCT TAT GCG TTG GCT 438 Thr Leu Thr Leu Phe Lys Ala Gly Gly Ser His Ser Tyr Ala Leu Ala 120 125 130 135
GTA TCT GTT GCA GAT GCA ATG GTA AGA CAA ACT CTA GGG CAT GTT ATG 486 Val Ser Val Ala Asp Ala Met Val Arg Gin Thr Leu Gly His Val Met 140 145 150
TGG TGG GTA AAG TAGCACTTAA AAAAACTTTG GGCGTTTTGG CTGGCCCTAT TGGTT 543 Trp Trp Val Lys 155
GGGT 547
(2) INFORMATION FOR SEQ ID NO: 1148:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 155 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1148: Met Asn Glu Asp Leu Thr Asn Ser Thr Glu Tyr Lys Arg Tyr Gly His 1 5 10 15
Asp Tyr Ala Lys Tyr Pro Arg Arg He Ala Glu Glu Leu Gin His Tyr
20 25 30
Gly Gly Asn Ser Phe Ala Asn Phe Phe Arg Asp Glu Gly Val Leu Tyr
35 40 45
Lys Glu He Leu Cys Asp Ala Cys Asp His Leu Lys Val Asn Tyr Asn
50 55 60
Glu Glu Ser Ala Thr Ser Leu He Glu Gin Asn Met Leu Ser Lys Leu 65 70 75 80
Leu Lys Asp Ser Leu Glu Lys Met Ser Arg Arg Glu He Lys Glu Leu
85 90 95
Cys Asn Glu Leu Gly Met Thr Asn He Asp Lys Val He Gly Glu Asn
100 105 110
Lys Gin Val Leu He Ala Ser Thr Leu Thr Leu Phe Lys Ala Gly Gly
115 120 125
Ser His Ser Tyr Ala Leu Ala Val Ser Val Ala Asp Ala Met Val Arg
130 135 140
Gin Thr Leu Gly His Val Met Trp Trp Val Lys 145 150 155
(2) INFORMATION FOR SEQ ID NO: 1149:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 523 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...486 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1149:
TTATTTTAAA GGAATTTC ATG CAA ATC ATA GAA GGG AAA TTG CAA TTA CAA 51
Met Gin He He Glu Gly Lys Leu Gin Leu Gin 1 5 10
GGG AAT GAA AGA GTC GCT ATT TTA ACA TCG CGC TTC AAT CAT ATC ATC 99 Gly Asn Glu Arg Val Ala He Leu Thr Ser Arg Phe Asn His He He 15 20 25
ACA GAC AGA TTG CAA GAA GGG GCG ATG GAC TGC TTT AAA AGG CAT GGG 147 Thr Asp Arg Leu Gin Glu Gly Ala Met Asp Cys Phe Lys Arg His Gly 30 35 40
GGC GAT GAG GAT CTT TTA GAC ATC GTG CTG GTG CCT GGG GCT TAT GAA 195 Gly Asp Glu Asp Leu Leu Asp He Val Leu Val Pro Gly Ala Tyr Glu 45 50 55 TTG CCT TTT ATT TTA GAC AAA TTA TTA GAG AGC GAA AAA TAC GAT GGC 243 Leu Pro Phe He Leu Asp Lys Leu Leu Glu Ser Glu Lys Tyr Asp Gly 60 65 70 75
GTG TGC GTT TTG GGA GCG ATC ATT AGA GGG GGG ACT CCG CAT TTT GAT 291 Val Cys Val Leu Gly Ala He He Arg Gly Gly Thr Pro His Phe Asp 80 85 90
TAT GTG AGC GCG GAA GCG ACT AAG GGT ATT GCC CAT GCG ATG CTT AAA 339 Tyr Val Ser Ala Glu Ala Thr Lys Gly He Ala His Ala Met Leu Lys 95 100 105
TAC AGC ATG CCG GTA AGC TTT GGC GTG CTG ACC ACG GAC AAT ATT GAA 387 Tyr Ser Met Pro Val Ser Phe Gly Val Leu Thr Thr Asp Asn He Glu 110 115 120
CAA GCG ATT GAA AGA GCG GGC AGT AAA GCC GGC AAT AAG GGC TTT GAA 435 Gin Ala He Glu Arg Ala Gly Ser Lys Ala Gly Asn Lys Gly Phe Glu 125 130 135
GCG ATG AGC ACC CTC ATT GAA TTG TTG AGC TTG TGC CAA ACT CTC AAG 483 Ala Met Ser Thr Leu He Glu Leu Leu Ser Leu Cys Gin Thr Leu Lys 140 145 150 155
GGT TAAAATGGCG ACACGAACTC AAGCCAGGGG GGCTGTG 523
Gly
(2) INFORMATION FOR SEQ ID NO: 1150:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 156 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1150:
Met Gin He He Glu Gly Lys Leu Gin Leu Gin Gly Asn Glu Arg Val
1 5 10 15
Ala He Leu Thr Ser Arg Phe Asn His He He Thr Asp Arg Leu Gin
20 25 30
Glu Gly Ala Met Asp Cys Phe Lys Arg His Gly Gly Asp Glu Asp Leu
35 40 45
Leu Asp He Val Leu Val Pro Gly Ala Tyr Glu Leu Pro Phe He Leu
50 55 60
Asp Lys Leu Leu Glu Ser Glu Lys Tyr Asp Gly Val Cys Val Leu Gly 65 70 75 80
Ala He He Arg Gly Gly Thr Pro His Phe Asp Tyr Val Ser Ala Glu
85 90 95
Ala Thr Lys Gly He Ala His Ala Met Leu Lys Tyr Ser Met Pro Val 100 105 110 Ser Phe Gly Val Leu Thr Thr Asp Asn He Glu Gin Ala He Glu Arg
115 120 125
Ala Gly Ser Lys Ala Gly Asn Lys Gly Phe Glu Ala Met Ser Thr Leu
130 135 140
He Glu Leu Leu Ser Leu Cys Gin Thr Leu Lys Gly
145 150 155
(2) INFORMATION FOR SEQ ID NO: 1151:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1724 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...1656 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1151:
TATTATTAAG GATACAAA ATG GCA AAA GAA ATC AAA TTT TCA GAT AGC GCG 51
Met Ala Lys Glu He Lys Phe Ser Asp Ser Ala 1 5 10
AGA AAC CTT TTA TTT GAA GGC GTG AGA CAA CTC CAT GAC GCT GTT AAA 99 Arg Asn Leu Leu Phe Glu Gly Val Arg Gin Leu His Asp Ala Val Lys 15 20 25
GTA ACC ATG GGG CCA AGA GGC AGG AAC GTG TTG ATC CAA AAA AGC TAT 147 Val Thr Met Gly Pro Arg Gly Arg Asn Val Leu He Gin Lys Ser Tyr 30 35 40
GGC GCT CCA AGC ATC ACT AAA GAT GGC GTG AGC GTG GCT AAA GAG ATT 195 Gly Ala Pro Ser He Thr Lys Asp Gly Val Ser Val Ala Lys Glu He 45 50 55
GAA TTA AGT TGC CCG GTA GCT AAC ATG GGC GCT CAA CTC GTT AAA GAA 243 Glu Leu Ser Cys Pro Val Ala Asn Met Gly Ala Gin Leu Val Lys Glu 60 65 70 75
GTA GCG AGC AAA ACC GCT GAT GCT GCC GGC GAT GGC ACG ACC ACA GCG 291 Val Ala Ser Lys Thr Ala Asp Ala Ala Gly Asp Gly Thr Thr Thr Ala 80 85 90
ACC GTG CTG GCT TAT AGC ATT TTT AAA GAA GGT TTG AGG AAC ATC ACG 339 Thr Val Leu Ala Tyr Ser He Phe Lys Glu Gly Leu Arg Asn He Thr 95 100 105
GCT GGG GCT AAC CCT ATT GAA GTG AAA CGA GGC ATG GAT AAA GCC GCT 387 Ala Gly Ala Asn Pro He Glu Val Lys Arg Gly Met Asp Lys Ala Ala 110 115 120
GAA GCC ATT ATT AAT GAG CTT AAA AAA GCG AGC AAA AAA GTG GGC GGT 435 Glu Ala He He Asn Glu Leu Lys Lys Ala Ser Lys Lys Val Gly Gly 125 130 135
AAA GAA GAA ATC ACC CAA GTG GCG ACC ATT TCT GCA AAC TCC GAT CAC 483 Lys Glu Glu He Thr Gin Val Ala Thr He Ser Ala Asn Ser Asp His 140 145 150 155
AAT ATC GGG AAA CTC ATC GCT GAC GCT ATG GAA AAA GTG GGT AAA GAC 531 Asn He Gly Lys Leu He Ala Asp Ala Met Glu Lys Val Gly Lys Asp 160 165 170
GGC GTG ATC ACC GTT GAA GAA GCT AAG GGC ATT GAA GAT GAA CTA GAT 579 Gly Val He Thr Val Glu Glu Ala Lys Gly He Glu Asp Glu Leu Asp 175 180 185
GTT GTA GAA GGC ATG CAA TTT GAT AGA GGC TAC CTC TCC CCT TAT TTT 627 Val Val Glu Gly Met Gin Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe 190 195 200
GTA ACA AAC GCT GAG AAA ATG ACC GCT CAA TTG GAT AAC GCT TAC ATC 675 Val Thr Asn Ala Glu Lys Met Thr Ala Gin Leu Asp Asn Ala Tyr He 205 210 215
CTT TTA ACG GAT AAA AAA ATC TCT AGC ATG AAA GAC ATT CTC CCG CTA 723 Leu Leu Thr Asp Lys Lys He Ser Ser Met Lys Asp He Leu Pro Leu 220 225 230 235
CTA GAA AAA ACC ATG AAA GAG GGC AAA CCG CTT TTA ATC ATC GCT GAA 771 Leu Glu Lys Thr Met Lys Glu Gly Lys Pro Leu Leu He He Ala Glu 240 245 250
GAC ATT GAG GGC GAA GCT TTA ACG ACT CTA GTG GTG AAT AAA TTA AGA 819 Asp He Glu Gly Glu Ala Leu Thr Thr Leu Val Val Asn Lys Leu Arg 255 260 265
GGC GTG TTG AAT ATC GCA GCG GTT AAA GCT CCA GGC TTT GGG GAC AGA 867 Gly Val Leu Asn He Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg 270 275 280
AGA AAA GAA ATG CTC AAA GAC ATC GCT ATT TTA ACC GGC GGT CAA GTT 915 Arg Lys Glu Met Leu Lys Asp He Ala He Leu Thr Gly Gly Gin Val 285 290 295
ATT AGC GAA GAA TTG GGC TTG AGT CTA GAA AAC GCT GAA GTG GAG TTT 963 He Ser Glu Glu Leu Gly Leu Ser Leu Glu Asn Ala Glu Val Glu Phe 300 305 310 315
TTA GGC AAA GCC GGA AGG ATT GTG ATT GAC AAA GAC AAC ACC ACG ATC 1011 Leu Gly Lys Ala Gly Arg He Val He Asp Lys Asp Asn Thr Thr He 320 325 330 GTA GAT GGC AAA GGC CAT AGC CAT GAT GTC AAA GAC AGA GTC GCG CAA 1059 Val Asp Gly Lys Gly His Ser His Asp Val Lys Asp Arg Val Ala Gin 335 340 345
ATC AAA ACC CAA ATT GCA AGC ACG ACA AGC GAT TAT GAC AAA GAA AAA 1107 He Lys Thr Gin He Ala Ser Thr Thr Ser Asp Tyr Asp Lys Glu Lys 350 355 360
TTG CAA GAA AGG TTG GCT AAA CTC TCT GGC GGT GTG GCT GTG ATT AAA 1155 Leu Gin Glu Arg Leu Ala Lys Leu Ser Gly Gly Val Ala Val He Lys 365 370 375
GTG GGC GCT GCG AGT GAA GTG GAA ATG AAA GAG AAA AAA GAC CGG GTT 1203 Val Gly Ala Ala Ser Glu Val Glu Met Lys Glu Lys Lys Asp Arg Val 380 385 390 395
GAT GAT GCG TTG AGC GCG ACT AAA GCG GCT GTT GAA GAA GGT ATT GTG 1251 Asp Asp Ala Leu Ser Ala Thr Lys Ala Ala Val Glu Glu Gly He Val 400 405 410
ATT GGC GGC GGT GCG GCT CTC ATT CGC GCG GCT CAA AAA GTG CAT TTG 1299 He Gly Gly Gly Ala Ala Leu He Arg Ala Ala Gin Lys Val His Leu 415 420 425
AAT TTG CAC GAT GAT GAA AAA GTG GGC TAT GAA ATC ATC ATG CGC GCC 1347 Asn Leu His Asp Asp Glu Lys Val Gly Tyr Glu He He Met Arg Ala 430 435 440
ATT AAA GCC CCA TTA GCT CAA ATC GCT ATC AAT GCC GGT TAT GAT GGC 1395 He Lys Ala Pro Leu Ala Gin He Ala He Asn Ala Gly Tyr Asp Gly 445 450 455
GGT GTG GTC GTG AAT GAA GTA GAA AAA CAC GAA GGG CAT TTT GGT TTT 1443 Gly Val Val Val Asn Glu Val Glu Lys His Glu Gly His Phe Gly Phe 460 465 470 475
AAC GCT AGC AAT GGC AAG TAT GTG GAT ATG TTT AAA GAA GGC ATT ATT 1491 Asn Ala Ser Asn Gly Lys Tyr Val Asp Met Phe Lys Glu Gly He He 480 485 490
GAC CCC TTA AAA GTA GAA AGG ATC GCT TTA CAA AAT GCG GTT TCG GTT 1539 Asp Pro Leu Lys Val Glu Arg He Ala Leu Gin Asn Ala Val Ser Val 495 500 505
TCA AGC CTG CTT TTA ACC ACA GAA GCC ACC GTG CAT GAA ATC AAA GAA 1587 Ser Ser Leu Leu Leu Thr Thr Glu Ala Thr Val His Glu He Lys Glu 510 515 520
GAA AAA GCG GCC CCA GCA ATG CCT GAT ATG GGT GGC ATG GGC GGT ATG 1635 Glu Lys Ala Ala Pro Ala Met Pro Asp Met Gly Gly Met Gly Gly Met 525 530 535
GGA GGC ATG GGT GGC ATG ATG TAAGCCCCCT TGCTTTTTAG TATCATCTGC TTTT 1690 Gly Gly Met Gly Gly Met Met 540 545 AAAATCCCCT AAAATCCCCC CTTTCTAAAA TCTC 1724
(2) INFORMATION FOR SEQ ID NO:1152:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 546 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1152:
Met Ala Lys Glu He Lys Phe Ser Asp Ser Ala Arg Asn Leu Leu Phe
1 5 10 15
Glu Gly Val Arg Gin Leu His Asp Ala Val Lys Val Thr Met Gly Pro
20 25 30
Arg Gly Arg Asn Val Leu He Gin Lys Ser Tyr Gly Ala Pro Ser He
35 40 45
Thr Lys Asp Gly Val Ser Val Ala Lys Glu He Glu Leu Ser Cys Pro
50 55 60
Val Ala Asn Met Gly Ala Gin Leu Val Lys Glu Val Ala Ser Lys Thr 65 70 75 80
Ala Asp Ala Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala Tyr
85 90 95
Ser He Phe Lys Glu Gly Leu Arg Asn He Thr Ala Gly Ala Asn Pro
100 105 110
He Glu Val Lys Arg Gly Met Asp Lys Ala Ala Glu Ala He He Asn
115 120 125
Glu Leu Lys Lys Ala Ser Lys Lys Val Gly Gly Lys Glu Glu He Thr
130 135 140
Gin Val Ala Thr He Ser Ala Asn Ser Asp His Asn He Gly Lys Leu 145 150 155 160
He Ala Asp Ala Met Glu Lys Val Gly Lys Asp Gly Val He Thr Val
165 170 175
Glu Glu Ala Lys Gly He Glu Asp Glu Leu Asp Val Val Glu Gly Met
180 185 190
Gin Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe Val Thr Asn Ala Glu
195 200 205
Lys Met Thr Ala Gin Leu Asp Asn Ala Tyr He Leu Leu Thr Asp Lys
210 215 220
Lys He Ser Ser Met Lys Asp He Leu Pro Leu Leu Glu Lys Thr Met 225 230 235 240
Lys Glu Gly Lys Pro Leu Leu He He Ala Glu Asp He Glu Gly Glu
245 250 255
Ala Leu Thr Thr Leu Val Val Asn Lys Leu Arg Gly Val Leu Asn He
260 265 270
Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Glu Met Leu
275 280 285
Lys Asp He Ala He Leu Thr Gly Gly Gin Val He Ser Glu Glu Leu
290 295 300
Gly Leu Ser Leu Glu Asn Ala Glu Val Glu Phe Leu Gly Lys Ala Gly 305 310 315 320
Arg He Val He Asp Lys Asp Asn Thr Thr He Val Asp Gly Lys Gly 325 330 335
His Ser His Asp Val Lys Asp Arg Val Ala Gin He Lys Thr Gin He
340 345 350
Ala Ser Thr Thr Ser Asp Tyr Asp Lys Glu Lys Leu Gin Glu Arg Leu
355 360 365
Ala Lys Leu Ser Gly Gly Val Ala Val He Lys Val Gly Ala Ala Ser
370 375 380
Glu Val Glu Met Lys Glu Lys Lys Asp Arg Val Asp Asp Ala Leu Ser 385 390 395 400
Ala Thr Lys Ala Ala Val Glu Glu Gly He Val He Gly Gly Gly Ala
405 410 415
Ala Leu He Arg Ala Ala Gin Lys Val His Leu Asn Leu His Asp Asp
420 425 430
Glu Lys Val Gly Tyr Glu He He Met Arg Ala He Lys Ala Pro Leu
435 440 445
Ala Gin He Ala He Asn Ala Gly Tyr Asp Gly Gly Val Val Val Asn
450 455 460
Glu Val Glu Lys His Glu Gly His Phe Gly Phe Asn Ala Ser Asn Gly 465 470 475 480
Lys Tyr Val Asp Met Phe Lys Glu Gly He He Asp Pro Leu Lys Val
485 490 495
Glu Arg He Ala Leu Gin Asn Ala Val Ser Val Ser Ser Leu Leu Leu
500 505 510
Thr Thr Glu Ala Thr Val His Glu He Lys Glu Glu Lys Ala Ala Pro
515 520 525
Ala Met Pro Asp Met Gly Gly Met Gly Gly Met Gly Gly Met Gly Gly
530 535 540
Met Met 545
(2) INFORMATION FOR SEQ ID NO: 1153:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 881 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 14...838 (D) OTHER INFORMATTON:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1153:
CTAGGAGGTT TGC ATG CAA GAG TTT TTA GGT TTT GGT GTG GTG GGG AAT 49 Met Gin Glu Phe Leu Gly Phe Gly Val Val Gly Asn 1 5 10
TTT GCA GGG CAT TTG GAG CAA GCA GGA GAG AGT CAT AGT TTT ATT AAC 97 Phe Ala Gly His Leu Glu Gin Ala Gly Glu Ser His Ser Phe He Asn 15 20 25
ATG AAA AGC GAA GAA AAG GAC GCC CCT AAG GGG CTA TTC CCT TTT TAT 145 Met Lys Ser Glu Glu Lys Asp Ala Pro Lys Gly Leu Phe Pro Phe Tyr 30 35 40
ATC CCC TAT GAA AAT TGT TAT TTG GGG CGT TGT TGC ATT GAT AAC CAT 193 He Pro Tyr Glu Asn Cys Tyr Leu Gly Arg Cys Cys He Asp Asn His 45 50 55 60
AAG ATT ATT TTG CCT AGT GAT CTA GAT TTA AGG GTG CAA GCA GAG CCA 241 Lys He He Leu Pro Ser Asp Leu Asp Leu Arg Val Gin Ala Glu Pro 65 70 75
GAA ATC GCT TTA GAA TGC GAT GTT AAA TAC GAT GAA AAA CAT TTG GTT 289 Glu He Ala Leu Glu Cys Asp Val Lys Tyr Asp Glu Lys His Leu Val 80 85 90
GCA AAG CTC GTG CCT AAT TTT TTC ATG GCG TTT AAT GAC GCT TCT GTG 337 Ala Lys Leu Val Pro Asn Phe Phe Met Ala Phe Asn Asp Ala Ser Val 95 100 105
CGC AAT TTA GAC GCC GCA AAA CTC TCC CAA AAA AAG AAT TTT TCA CCG 385 Arg Asn Leu Asp Ala Ala Lys Leu Ser Gin Lys Lys Asn Phe Ser Pro 110 115 120
GCT TCT AAA GGT ATA GGG CAG AAA TTG CCC ATT GAC AGG TTT GTT TAT 433 Ala Ser Lys Gly He Gly Gin Lys Leu Pro He Asp Arg Phe Val Tyr 125 130 135 140
GGG GGG GTG TGT AAC AAT TTC TCT ATC GCG TCT TTT TTG AAA TAC AAT 481 Gly Gly Val Cys Asn Asn Phe Ser He Ala Ser Phe Leu Lys Tyr Asn 145 150 155
AAT GTT TGG CAC ATT TAT GGG GAA AAC AGC AAA TTG CTC AAA TAC GAG 529 Asn Val Trp His He Tyr Gly Glu Asn Ser Lys Leu Leu Lys Tyr Glu 160 165 170
TTT TTT TAT CAA AAG CTT TTA GAT TGG ATT AAA GAC CAA TTA AAC CAC 577 Phe Phe Tyr Gin Lys Leu Leu Asp Trp He Lys Asp Gin Leu Asn His 175 180 185
CAA CAA GAT GGC GAC TCT TTA GAG GCT CTA AGA CCT TTT TTA GAG CGC 625 Gin Gin Asp Gly Asp Ser Leu Glu Ala Leu Arg Pro Phe Leu Glu Arg 190 195 200
CAT AAT TTC CCC ACT AAA ATG ATT TTT GCA ATA GGG GCT ACC CCT TAT 673 His Asn Phe Pro Thr Lys Met He Phe Ala He Gly Ala Thr Pro Tyr 205 210 215 220
ATG CCT TTT GCG CAA GAG CAT TTT TTG CAA AAA GGC GAT GAG GTG GTG 721 Met Pro Phe Ala Gin Glu His Phe Leu Gin Lys Gly Asp Glu Val Val 225 230 235
ATC GTT GCT TAC AAC CAT TTA CAA TAC AGT TTT GAA AAG ATT CAA AAC 769 He Val Ala Tyr Asn His Leu Gin Tyr Ser Phe Glu Lys He Gin Asn 240 245 250
CTC TTA GAA GAG GAC GCC CTA CAA GCC AAA GAA CAC GCT AAT CTT TCT 817 Leu Leu Glu Glu Asp Ala Leu Gin Ala Lys Glu His Ala Asn Leu Ser 255 260 265
TAT GTC TAT CAA ATC GTA GAA TAGTAAGGCT TTTACACTCT TTGGCTTTGC TTTT 872 Tyr Val Tyr Gin He Val Glu 270 275
TTACCCTTT 881
(2) INFORMATION FOR SEQ ID NO: 1154:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 275 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1154:
Met Gin Glu Phe Leu Gly Phe Gly Val Val Gly Asn Phe Ala Gly His
1 5 10 15
Leu Glu Gin Ala Gly Glu Ser His Ser Phe He Asn Met Lys Ser Glu
20 25 30
Glu Lys Asp Ala Pro Lys Gly Leu Phe Pro Phe Tyr He Pro Tyr Glu
35 40 45
Asn Cys Tyr Leu Gly Arg Cys Cys He Asp Asn His Lys He He Leu
50 55 60
Pro Ser Asp Leu Asp Leu Arg Val Gin Ala Glu Pro Glu He Ala Leu 65 70 75 80
Glu Cys Asp Val Lys Tyr Asp Glu Lys His Leu Val Ala Lys Leu Val
85 90 95
Pro Asn Phe Phe Met Ala Phe Asn Asp Ala Ser Val Arg Asn Leu Asp
100 105 110
Ala Ala Lys Leu Ser Gin Lys Lys Asn Phe Ser Pro Ala Ser Lys Gly
115 120 125
He Gly Gin Lys Leu Pro He Asp Arg Phe Val Tyr Gly Gly Val Cys
130 135 140
Asn Asn Phe Ser He Ala Ser Phe Leu Lys Tyr Asn Asn Val Trp His 145 150 155 160
He Tyr Gly Glu Asn Ser Lys Leu Leu Lys Tyr Glu Phe Phe Tyr Gin
165 170 175
Lys Leu Leu Asp Trp He Lys Asp Gin Leu Asn His Gin Gin Asp Gly
180 185 190
Asp Ser Leu Glu Ala Leu Arg Pro Phe Leu Glu Arg His Asn Phe Pro
195 200 205
Thr Lys Met He Phe Ala He Gly Ala Thr Pro Tyr Met Pro Phe Ala
210 215 220
Gin Glu His Phe Leu Gin Lys Gly Asp Glu Val Val He Val Ala Tyr 225 230 235 240 sn His Leu Gin Tyr Ser Phe Glu Lys He Gin Asn Leu Leu Glu Glu
245 250 255 sp Ala Leu Gin Ala Lys Glu His Ala Asn Leu Ser Tyr Val Tyr Gin
260 265 270
He Val Glu 275
(2) INFORMATION FOR SEQ ID NO: 1155:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 337 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 40...300 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1155:
AGCTCCGAGT TTAGCGAATT GGTTTATGGG AATTTTTTA ATG ATT ATC CTG TCA 54
Met He He Leu Ser 1 5
GCG AGC GTG AAG AAT TTG CGT GAA ATT TCG GTT AAA GAA AAA TTT TTA 102 Ala Ser Val Lys Asn Leu Arg Glu He Ser Val Lys Glu Lys Phe Leu 10 15 20
TGG CTG AAC GCT AAG TCT TAT TTG ATT TCT GTT TTT GCG CCT TTT ATC 150 Trp Leu Asn Ala Lys Ser Tyr Leu He Ser Val Phe Ala Pro Phe He 25 30 35
TTG CTC CCT TGG ATT GAT TTG TTG AGC GCT TTT TTA TTG TAT TTA GGG 198 Leu Leu Pro Trp He Asp Leu Leu Ser Ala Phe Leu Leu Tyr Leu Gly 40 45 50
TTT TTA GCG CTC TTT AGC GTG CTG GAA TTT TTT GAT GAA GAC ATT GCA 246 Phe Leu Ala Leu Phe Ser Val Leu Glu Phe Phe Asp Glu Asp He Ala 55 60 65
GAT ATT ATC GTG GCT AAA AGC AAA ATA AAG ACT AAA ACC AAA TGT TAT 294 Asp He He Val Ala Lys Ser Lys He Lys Thr Lys Thr Lys Cys Tyr 70 75 80 85
AGA GCG TAGAATGTTA GAAAAGCTTT TAAGCGCTAT CAAACAA 337
Arg Ala (2) INFORMATION FOR SEQ ID NO: 1156:
(i) SEQUENCE' CHARACTERISTICS :
(A) LENGTH: 87 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1156:
Met He He Leu Ser Ala Ser Val Lys Asn Leu Arg Glu He Ser Val
1 5 10 15
Lys Glu Lys Phe Leu Trp Leu Asn Ala Lys Ser Tyr Leu He Ser Val
20 25 30
Phe Ala Pro Phe He Leu Leu Pro Trp He Asp Leu Leu Ser Ala Phe
35 40 45
Leu Leu Tyr Leu Gly Phe Leu Ala Leu Phe Ser Val Leu Glu Phe Phe
50 55 60
Asp Glu Asp He Ala Asp He He Val Ala Lys Ser Lys He Lys Thr 65 70 75 80
Lys Thr Lys Cys Tyr Arg Ala 85
(2) INFORMATION FOR SEQ ID NO: 1157:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1044 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 15...977 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1157:
ATTAAGGGGA AGTC ATG GCT GAT AGT TTA GCG GGC ATT GAT CAA GTT ACG 50 Met Ala Asp Ser Leu Ala Gly He Asp Gin Val Thr 1 5 10
AGT TTG CAT AAA AAT AAC GAG TTA CAA TTG TTG TGT TTC AGG CTG GGT 98 Ser Leu His Lys Asn Asn Glu Leu Gin Leu Leu Cys Phe Arg Leu Gly 15 20 25
AAA AAC AAG GAT TTG TAT GCG GTC AAT GTT TTT AAG ATC CGT GAA GTG 146 Lys Asn Lys Asp Leu Tyr Ala Val Asn Val Phe Lys He Arg Glu Val 30 35 40 GTG AAA TAC CAT GGC AAT CTC ACC ATC ATT AGC CAC GAA AAC AAT TCG 194 Val Lys Tyr His Gly Asn Leu Thr He He Ser His Glu Asn Asn Ser 45 50 55 60
CTC GTT GAG GGG CTA ATC ATT ATA AGA GAA CTC ACC ATT CCC TTG ATT 242 Leu Val Glu Gly Leu He He He Arg Glu Leu Thr He Pro Leu He 65 70 75
GAT ATG AAA AAA TGG TTT TAT TAT GAC AGC CAA AAC AAA AAC AAG GAT 290 Asp Met Lys Lys Trp Phe Tyr Tyr Asp Ser Gin Asn Lys Asn Lys Asp 80 85 90
TTA CGC CCT TAT AGG ATA GAA AAA GAA AAA GGC GAA GAT GAT ATT GTT 338 Leu Arg Pro Tyr Arg He Glu Lys Glu Lys Gly Glu Asp Asp He Val 95 100 105
ATG ATT TGT GAG TTT TCT CGC TGG ACT ATA GGG GTT AGG ATC TAT GAA 386 Met He Cys Glu Phe Ser Arg Trp Thr He Gly Val Arg He Tyr Glu 110 115 120
GCG GAT AGG ATT TTG AGC AAG AAA TGG ACT GAA ATG GAG CAA AGC GCT 434 Ala Asp Arg He Leu Ser Lys Lys Trp Thr Glu Met Glu Gin Ser Ala 125 130 135 140
GGG CTA GGG GGA TCT GCA GGC AAT AAC AAA CTC GTG AGC CGC ACG CGC 482 Gly Leu Gly Gly Ser Ala Gly Asn Asn Lys Leu Val Ser Arg Thr Arg 145 150 155
TAT TTT GAT GGG CGC TTG GTG CAA GTG GTG GAT ATT GAA AAA ATG CTT 530 Tyr Phe Asp Gly Arg Leu Val Gin Val Val Asp He Glu Lys Met Leu 160 165 170
ATA GAC GTG TTC CCT TGG ATT GAA GAT GAA AAA CAC AAC GAT TTA GAG 578 He Asp Val Phe Pro Trp He Glu Asp Glu Lys His Asn Asp Leu Glu 175 180 185
ACG CTT TCT AAA ATC CAT TCT AAC CAA TGC GTT TTG CTT GCT GAT GAC 626 Thr Leu Ser Lys He His Ser Asn Gin Cys Val Leu Leu Ala Asp Asp 190 195 200
TCC CCA AGC GTT TTG AAA ACC ATG CAA ATG ATT TTA GAC AAG CTG GGC 674 Ser Pro Ser Val Leu Lys Thr Met Gin Met He Leu Asp Lys Leu Gly 205 210 215 220
GTC AAG CAT ATA GAT TTT ATC AAT GGT AAA ACC TTA CTA GAG CAT TTA 722 Val Lys His He Asp Phe He Asn Gly Lys Thr Leu Leu Glu His Leu 225 230 235
TTC AAC CCC ACA ACC GAT GTG AGT AAT ATT GGC CTG ATT ATT ACC GAT 770 Phe Asn Pro Thr Thr Asp Val Ser Asn He Gly Leu He He Thr Asp 240 245 250
TTG GAA ATG CCA GAG GCG AGC GGT TTT GAA GTG ATC AAG CAG GTT AAA 818 Leu Glu Met Pro Glu Ala Ser Gly Phe Glu Val He Lys Gin Val Lys 255 260 265 AAC AAT CCT TTG ACT TCA AAA ATC CCT ATC GTG GTC AAT TCT TCT ATG 866 Asn Asn Pro Leu Thr Ser Lys He Pro He Val Val Asn Ser Ser Met 270 275 280
AGC GGG AGT TCT AAT GAA GAC ATG GCC AGG AGT TTG AAG GCC GAT GAT 914 Ser Gly Ser Ser Asn Glu Asp Met Ala Arg Ser Leu Lys Ala Asp Asp 285 290 295 300
TTC ATT TCC AAG TCT AAC CCC AAA GAC ATC CAG CGA GTG GTT AAG CAA 962 Phe He Ser Lys Ser Asn Pro Lys Asp He Gin Arg Val Val Lys Gin 305 310 315
TTT TTG GAA TTA GCA TGAAAAAATA CAGCACTATC CCCACCCCTT GCTACGTGTT A 1018 Phe Leu Glu Leu Ala 320
GAGAGCGAAC GCTTAGAAAA AAACGC 1044
(2) INFORMATION FOR SEQ ID NO: 1158:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 321 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1158:
Met Ala Asp Ser Leu Ala Gly He Asp Gin Val Thr Ser Leu His Lys
1 5 10 15
Asn Asn Glu Leu Gin Leu Leu Cys Phe Arg Leu Gly Lys Asn Lys Asp
20 25 30
Leu Tyr Ala Val Asn Val Phe Lys He Arg Glu Val Val Lys Tyr His
35 40 45
Gly Asn Leu Thr He He Ser His Glu Asn Asn Ser Leu Val Glu Gly
50 55 60
Leu He He He Arg Glu Leu Thr He Pro Leu He Asp Met Lys Lys 65 70 75 80
Trp Phe Tyr Tyr Asp Ser Gin Asn Lys Asn Lys Asp Leu Arg Pro Tyr
85 90 95
Arg He Glu Lys Glu Lys Gly Glu Asp Asp He Val Met He Cys Glu
100 105 110
Phe Ser Arg Trp Thr He Gly Val Arg He Tyr Glu Ala Asp Arg He
115 120 125
Leu Ser Lys Lys Trp Thr Glu Met Glu Gin Ser Ala Gly Leu Gly Gly
130 135 140
Ser Ala Gly Asn Asn Lys Leu Val Ser Arg Thr Arg Tyr Phe Asp Gly 145 150 155 160
Arg Leu Val Gin Val Val Asp He Glu Lys Met Leu He Asp Val Phe
165 170 175
Pro Trp He Glu Asp Glu Lys His Asn Asp Leu Glu Thr Leu Ser Lys
180 185 190
He His Ser Asn Gin Cys Val Leu Leu Ala Asp Asp Ser Pro Ser Val 195 200 205
Leu Lys Thr Met Gin Met He Leu Asp Lys Leu Gly Val Lys His He 210 215 220
Asp Phe He Asn Gly Lys Thr Leu Leu Glu His Leu Phe Asn Pro Thr
225 230 235 240
Thr Asp Val Ser Asn He Gly Leu He He Thr Asp Leu Glu Met Pro 245 250 255
Glu Ala Ser Gly Phe Glu Val He Lys Gin Val Lys Asn Asn Pro Leu 260 265 270
Thr Ser Lys He Pro He Val Val Asn Ser Ser Met Ser Gly Ser Ser 275 280 285
Asn Glu Asp Met Ala Arg Ser Leu Lys Ala Asp Asp Phe He Ser Lys 290 295 300
Ser Asn Pro Lys Asp He Gin Arg Val Val Lys Gin Phe Leu Glu Leu
305 310 315 320
Ala
(2) INFORMATION FOR SEQ ID NO: 1159:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 633 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 37...618 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1159:
ATCTGCTTAA ACACAAAAAA GAGTAAAATA ACACGC ATG AAA AAA TTC TTA TTT 54
Met Lys Lys Phe Leu Phe 1 5
AAA CAA AAA TTT TGT GAA AGC CTG CCC AAA AGC TTT TCT AAA ACT TTG 102 Lys Gin Lys Phe Cys Glu Ser Leu Pro Lys Ser Phe Ser Lys Thr Leu 10 15 20
TTA GCG CTC AGT TTG GGC TTG ATT TTA TTA GGC ATT TTT GCG CCT TTC 150 Leu Ala Leu Ser Leu Gly Leu He Leu Leu Gly He Phe Ala Pro Phe 25 30 35
CCT AAA GTC CCT AAA CAG CCT AGC GTG CCT TTA ATG TTT CAT TTC ACC 198 Pro Lys Val Pro Lys Gin Pro Ser Val Pro Leu Met Phe His Phe Thr 40 45 50
GAG CAT TAT GCG CGC TTT ATC CCT ACG ATT TTA TCT GTG GCG ATT CCC 246 Glu His Tyr Ala Arg Phe He Pro Thr He Leu Ser Val Ala He Pro 55 60 65 70
TTA ATC CAA AGA GAT GCG GTA GGG CTT TTT CAA GTC GCT AAC GCT TCT 294 Leu He Gin Arg Asp Ala Val Gly Leu Phe Gin Val Ala Asn Ala Ser 75 80 85
ATC GCT ACA ACC CTT CTC ACG CAC ACC ACC AAA AGA GCC TTA AAC CAT 342 He Ala Thr Thr Leu Leu Thr His Thr Thr Lys Arg Ala Leu Asn His 90 95 100
GTA ACA ATC AAC GAT CAG CGT TTG GGC GAG CGC CCT TAT GGA GGT AAT 390 Val Thr He Asn Asp Gin Arg Leu Gly Glu Arg Pro Tyr Gly Gly Asn 105 110 115
TTC AAC ATG CCA AGC GGG CAT TCG TCT ATG GTG GGT TTG GCG GTG GCG 438 Phe Asn Met Pro Ser Gly His Ser Ser Met Val Gly Leu Ala Val Ala 120 125 130
TTT TTA ATG CGC CGC TAT TCT TTT AAA AAA TAC TTT TGG CTC TTG CCC 486 Phe Leu Met Arg Arg Tyr Ser Phe Lys Lys Tyr Phe Trp Leu Leu Pro 135 140 145 150
CTA GTC CCT TTG ACC ATG CTC GCT CGC ATT TAT TTA GAC ATG CAC ACC 534 Leu Val Pro Leu Thr Met Leu Ala Arg He Tyr Leu Asp Met His Thr 155 160 165
ATT GGC GCG GTG CTG ACC GGG CTT GGC GTT GGA ATG TTG TGC GTA ASC 582 He Gly Ala Val Leu Thr Gly Leu Gly Val Gly Met Leu Cys Val Xaa 170 175 180
TTT TTA CAA GCC CCA AAA AGC CTT AAT CAA AAG CTT TAGTTTCTGT TTTTA 633 Phe Leu Gin Ala Pro Lys Ser Leu Asn Gin Lys Leu 185 190
(2) INFORMATION FOR SEQ ID NO: 1160:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 194 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1160:
Met Lys Lys Phe Leu Phe Lys Gin Lys Phe Cys Glu Ser Leu Pro Lys
1 5 10 15
Ser Phe Ser Lys Thr Leu Leu Ala Leu Ser Leu Gly Leu He Leu Leu
20 25 30
Gly He Phe Ala Pro Phe Pro Lys Val Pro Lys Gin Pro Ser Val Pro
35 40 45
Leu Met Phe His Phe Thr Glu His Tyr Ala Arg Phe He Pro Thr He 50 55 60 Leu Ser Val Ala He Pro Leu He Gin Arg Asp Ala Val Gly Leu Phe 65 70 75 80
Gin Val Ala Asn Ala Ser He Ala Thr Thr Leu Leu Thr His Thr Thr
85 90 95
Lys Arg Ala Leu Asn His Val Thr He Asn Asp Gin Arg Leu Gly Glu
100 105 110
Arg Pro Tyr Gly Gly Asn Phe Asn Met Pro Ser Gly His Ser Ser Met
115 120 125
Val Gly Leu Ala Val Ala Phe Leu Met Arg Arg Tyr Ser Phe Lys Lys
130 135 140
Tyr Phe Trp Leu Leu Pro Leu Val Pro Leu Thr Met Leu Ala Arg He 145 150 155 160
Tyr Leu Asp Met His Thr He Gly Ala Val Leu Thr Gly Leu Gly Val
165 170 175
Gly Met Leu Cys Val Xaa Phe Leu Gin Ala Pro Lys Ser Leu Asn Gin
180 185 190
Lys Leu
(2) INFORMATION FOR SEQ ID NO: 1161:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1091 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 57...1040 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1161:
ATTTCCCTTT GTTCGTTTAT GTTTATAAAG AAAGCAACCA GGTCAGTTTT ATCGCC ATG 59
Met
1
ATG GTT GTG GTG CTT TTT TGC GTT AAT GGC GCT CTT TTT TTG GCG TTA 107 Met Val Val Val Leu Phe Cys Val Asn Gly Ala Leu Phe Leu Ala Leu 5 10 15
GGC TTG ATC TCT GCT TCT TTG ATG CGT TGG AGT GCG ATA GTT TTT AGC 155 Gly Leu He Ser Ala Ser Leu Met Arg Trp Ser Ala He Val Phe Ser 20 25 30
CTG CTC AAT TCC GTT GCT TTC TAT TTC ATT AGC GCT TAT AAG GTG TTT 203 Leu Leu Asn Ser Val Ala Phe Tyr Phe He Ser Ala Tyr Lys Val Phe 35 40 45
TTA AAT AAG AGC ATG ATG GGT AAT GTC TTA AAC ACC AAC ACG CAT GAA 251 Leu Asn Lys Ser Met Met Gly Asn Val Leu Asn Thr Asn Thr His Glu 50 55 60 65
GTT TTA GGC TTT TTG AGC GTC AAA TTA TTC GTT TTT ATC GTT GTT TTT 299 Val Leu Gly Phe Leu Ser Val Lys Leu Phe Val Phe He Val Val Phe 70 75 80
GGG GTG TTG CCT GGC TAT GTC ATC TAT AAA ATC CCC CTT AAA AAT TCT 347 Gly Val Leu Pro Gly Tyr Val He Tyr Lys He Pro Leu Lys Asn Ser 85 90 95
TCT AAA AAA GCG CCC TTT TTA GCG ATC TTG GCG TTA GTG TTT ATC TTT 395 Ser Lys Lys Ala Pro Phe Leu Ala He Leu Ala Leu Val Phe He Phe 100 105 110
ATC GCT AGC GCT TTA GCT AAC ACT AAA AAT TGG CTG TGG TTT GAC AAG 443 He Ala Ser Ala Leu Ala Asn Thr Lys Asn Trp Leu Trp Phe Asp Lys 115 120 125
CAT GCG AAA TTC ATA GGG GGC TTA ATT TTG CCC TTC GCT TAT AGC GTG 491 His Ala Lys Phe He Gly Gly Leu He Leu Pro Phe Ala Tyr Ser Val 130 135 140 145
AAC GCT TTT AGA GTG AGC GCT CTC AAA TTT TTC GCC CCC ACC ATC AAG 539 Asn Ala Phe Arg Val Ser Ala Leu Lys Phe Phe Ala Pro Thr He Lys 150 155 160
CCG CTC CCT CTT TTT TCA CCC AAT CAT TCC CAT TCG TTT GTG GTG CTA 587 Pro Leu Pro Leu Phe Ser Pro Asn His Ser His Ser Phe Val Val Leu 165 170 175
GTC ATT GGC GAA AGC GCT AGG AAA CAT AAT TAC GCC CTT TAT GGC TAT 635 Val He Gly Glu Ser Ala Arg Lys His Asn Tyr Ala Leu Tyr Gly Tyr 180 185 190
CAA AAA CCC ACC ACC CCA AGA CTA AGC AAG CGT TTA GCC GAT AAT GAA 683 Gin Lys Pro Thr Thr Pro Arg Leu Ser Lys Arg Leu Ala Asp Asn Glu 195 200 205
CTC ACT CTT TTC AAC GCC ACT TCT TGC GCC ACT TAC ACG ACA GCG AGT 731 Leu Thr Leu Phe Asn Ala Thr Ser Cys Ala Thr Tyr Thr Thr Ala Ser 210 215 220 225
TTG GAA TGC ATT TTA GAT TCT TCT TTT AAA AAC AAC GCT TAT GAA AAT 779 Leu Glu Cys He Leu Asp Ser Ser Phe Lys Asn Asn Ala Tyr Glu Asn 230 235 240
TTG CCA ACT TAC TTG ACT AAA GCC GGT ATC AAA GTC TTT TGG TAT AGC 827 Leu Pro Thr Tyr Leu Thr Lys Ala Gly He Lys Val Phe Trp Tyr Ser 245 250 255
GCG AAC GAC GGC GAA AAG AAT GTT AAG GTT ACA AGC TAT CTT AAA AAC 875 Ala Asn Asp Gly Glu Lys Asn Val Lys Val Thr Ser Tyr Leu Lys Asn 260 265 270 TAT GAA TTG ATT CAA AAA TGC CCC AAT TGT GAA GCG ATC GCT CCT TAT 923 Tyr Glu Leu He Gin Lys Cys Pro Asn Cys Glu Ala He Ala Pro Tyr 275 280 285
GAT GAA TCT TTA CTT TAT AAT TTG CCT GAC CTT TTA AAA GAA CAC TCT 971 Asp Glu Ser Leu Leu Tyr Asn Leu Pro Asp Leu Leu Lys Glu His Ser 290 295 300 305
AAT GAA AAT GTC TTG CTC ATC TTA CAC TTG CAG GCT CGC ATG GCC CAA 1019 Asn Glu Asn Val Leu Leu He Leu His Leu Gin Ala Arg Met Ala Gin 310 315 320
ACT ACG ACA ACA AAG TGC CTT TAAATTTTAG GGTGTTTAAG CCTTATTGCT CAAG 1074 Thr Thr Thr Thr Lys Cys Leu 325
CGCTGATCTG TCTTCTT 1091
(2) INFORMATION FOR SEQ ID NO: 1162:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 328 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1162:
Met Met Val Val Val Leu Phe Cys Val Asn Gly Ala Leu Phe Leu Ala
1 5 10 15
Leu Gly Leu He Ser Ala Ser Leu Met Arg Trp Ser Ala He Val Phe
20 25 30
Ser Leu Leu Asn Ser Val Ala Phe Tyr Phe He Ser Ala Tyr Lys Val
35 40 45
Phe Leu Asn Lys Ser Met Met Gly Asn Val Leu Asn Thr Asn Thr His
50 55 60
Glu Val Leu Gly Phe Leu Ser Val Lys Leu Phe Val Phe He Val Val 65 70 75 80
Phe Gly Val Leu Pro Gly Tyr Val He Tyr Lys He Pro Leu Lys Asn
85 90 95
Ser Ser Lys Lys Ala Pro Phe Leu Ala He Leu Ala Leu Val Phe He
100 105 110
Phe He Ala Ser Ala Leu Ala Asn Thr Lys Asn Trp Leu Trp Phe Asp
115 120 125
Lys His Ala Lys Phe He Gly Gly Leu He Leu Pro Phe Ala Tyr Ser
130 135 140
Val Asn Ala Phe Arg Val Ser Ala Leu Lys Phe Phe Ala Pro Thr He 145 150 155 160
Lys Pro Leu Pro Leu Phe Ser Pro Asn His Ser His Ser Phe Val Val
165 170 175
Leu Val He Gly Glu Ser Ala Arg Lys His Asn Tyr Ala Leu Tyr Gly
180 185 190
Tyr Gin Lys Pro Thr Thr Pro Arg Leu Ser Lys Arg Leu Ala Asp Asn 195 200 205
Glu Leu Thr Leu Phe Asn Ala Thr Ser Cys Ala Thr Tyr Thr Thr Ala
210 215 220
Ser Leu Glu Cys He Leu Asp Ser Ser Phe Lys Asn Asn Ala Tyr Glu 225 230 235 240
Asn Leu Pro Thr Tyr Leu Thr Lys Ala Gly He Lys Val Phe Trp Tyr
245 250 255
Ser Ala Asn Asp Gly Glu Lys Asn Val Lys Val Thr Ser Tyr Leu Lys
260 265 270
Asn Tyr Glu Leu He Gin Lys Cys Pro Asn Cys Glu Ala He Ala Pro
275 280 285
Tyr Asp Glu Ser Leu Leu Tyr Asn Leu Pro Asp Leu Leu Lys Glu His
290 295 300
Ser Asn Glu Asn Val Leu Leu He Leu His Leu Gin Ala Arg Met Ala 305 310 315 320
Gin Thr Thr Thr Thr Lys Cys Leu 325
(2) INFORMATION FOR SEQ ID NO:1163:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1879 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...1827 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1163:
AAGCTCAAAG ATAAAGCGCT ACAATCTCGC TTAGAAAAAG GACACAAA ATG CTA TTG 57
Met Leu Leu 1
AAT TAC GAT TTT TTA GAA TTT GTT GAT GAG CCG AAA AGA AAC ACT TCT 105 Asn Tyr Asp Phe Leu Glu Phe Val Asp Glu Pro Lys Arg Asn Thr Ser 5 10 15
TTG ACA GCA TCT ATT GAT AAA GCG TTA GCG GAC AGG AAG TTA GCT AGA 153 Leu Thr Ala Ser He Asp Lys Ala Leu Ala Asp Arg Lys Leu Ala Arg 20 25 30 35
CAA AAT AAA CCT AGC GTT AGG GTG CTT GGT AAG GCG ATG CCC TTA AGC 201 Gin Asn Lys Pro Ser Val Arg Val Leu Gly Lys Ala Met Pro Leu Ser 40 45 50
AAG TTT TTA GAT GCT GTT GGC GAT GAA ATC TCA CGA CTT AAA TAT GAT 249 Lys Phe Leu Asp Ala Val Gly Asp Glu He Ser Arg Leu Lys Tyr Asp 55 60 65
ATG AGC CAC AAG ACT ATT AAA GGC TCT ACA ATT GAG AGT TCT AAT CTT 297 Met Ser His Lys Thr He Lys Gly Ser Thr He Glu Ser Ser Asn Leu 70 75 80
ATC AGC ATT TAT AAA AAG ATT GCG AGC GGA CTA CCT TTT GGG ACT ATC 345 He Ser He Tyr Lys Lys He Ala Ser Gly Leu Pro Phe Gly Thr He 85 90 95
TCG GCG TTT AGA CCT TTT AAA GAC GCT TTT TAT AAA GAC TTT ACC GAA 393 Ser Ala Phe Arg Pro Phe Lys Asp Ala Phe Tyr Lys Asp Phe Thr Glu 100 105 110 115
AAA GAA CAA AAC GCT CTA ATC TAT GCT TAT AAG AGC GGA GCA GAC CCT 441 Lys Glu Gin Asn Ala Leu He Tyr Ala Tyr Lys Ser Gly Ala Asp Pro 120 125 130
AAA AAT GCG GAC ATA ATA GCC AAA TAT TGG TTA AGT CAA TCT GTG GAT 489 Lys Asn Ala Asp He He Ala Lys Tyr Trp Leu Ser Gin Ser Val Asp 135 140 145
TTA GAC CCA TAC GAC CCT ATT AAA GTT GTA GAT TTC TTT CAC CCA CAA 537 Leu Asp Pro Tyr Asp Pro He Lys Val Val Asp Phe Phe His Pro Gin 150 155 160
CCT GAA AAT GGT AAA GAG ACT ACA AAA TTT AAG AAC TAC AAA GAT AGG 585 Pro Glu Asn Gly Lys Glu Thr Thr Lys Phe Lys Asn Tyr Lys Asp Arg 165 170 175
ATT GAG AAC ATT TAT GCG ACA CTC TAT AAC ACA TTG GGT AGG GGT TAT 633 He Glu Asn He Tyr Ala Thr Leu Tyr Asn Thr Leu Gly Arg Gly Tyr 180 185 190 195
GTG GAT AAA TTT TTT AAA AAA GAA GCC ACA ATG AGG GAC TTT ATG TCT 681 Val Asp Lys Phe Phe Lys Lys Glu Ala Thr Met Arg Asp Phe Met Ser 200 205 210
AGC GAT AAA TTT GTT GAG AGA TAC CGC TAC ACT AGA AAA GAA AAT ATG 729 Ser Asp Lys Phe Val Glu Arg Tyr Arg Tyr Thr Arg Lys Glu Asn Met 215 220 225
GCA AGG ACA CAA GCA TTA AAA GAC ATA ATG AAT ATT GAC AGA GAT TTC 777 Ala Arg Thr Gin Ala Leu Lys Asp He Met Asn He Asp Arg Asp Phe 230 235 240
ATT GGT TAT ATT GAA GTG TTA GGG TAT TGG AAA GAC AAC CCT AAA GAC 825 He Gly Tyr He Glu Val Leu Gly Tyr Trp Lys Asp Asn Pro Lys Asp 245 250 255
AAT ATC TTA CCA GAC AAA GAG GTT AGC TTT TTT GTA TTC CAA AAC GAA 873 Asn He Leu Pro Asp Lys Glu Val Ser Phe Phe Val Phe Gin Asn Glu 260 265 270 275
CCT AGT AGC ACA TTT GAT TTG AAA AAC CAC TTA TTG ATA TGG GGT AAA 921 Pro Ser Ser Thr Phe Asp Leu Lys Asn His Leu Leu He Trp Gly Lys 280 285 290
CAA TTC AGA CAA GTA GCG ATT TGC TAT GGC GGA CAA TTG ATT GCT AAT 969 Gin Phe Arg Gin Val Ala He Cys Tyr Gly Gly Gin Leu He Ala Asn 295 300 305
AAG AAT AAG ACT TAT AGG ATA GAT TTG ATA AGT TGC AGA CCT GAT AAT 1017 Lys Asn Lys Thr Tyr Arg He Asp Leu He Ser Cys Arg Pro Asp Asn 310 315 320
TTT GGT GAG GTT TGG GCT AAA TTC ACA GGG ATT AAA TTT TCA GTT CCT 1065 Phe Gly Glu Val Trp Ala Lys Phe Thr Gly He Lys Phe Ser Val Pro 325 330 335
AGC GAC TTA CCA CAA GCT CTC ACA CGC ATA AAT GAC AGC GTT TAT ACT 1113 Ser Asp Leu Pro Gin Ala Leu Thr Arg He Asn Asp Ser Val Tyr Thr 340 345 350 355
TTT CTC TCT AGG AAT AAA GAG GGT ATC GGT CTT AAT AAA CTC GCT CTC 1161 Phe Leu Ser Arg Asn Lys Glu Gly He Gly Leu Asn Lys Leu Ala Leu 360 365 370
AAT AAA GTC GTT AAG ACA GAA TTA AAA GCG ACT TGT ATG CCC TAT GAT 1209 Asn Lys Val Val Lys Thr Glu Leu Lys Ala Thr Cys Met Pro Tyr Asp 375 380 385
TAC TCT AAA TTG GGT ATA GAG ACT ATT GGC GAG GAC ATT AGA AGC AAT 1257 Tyr Ser Lys Leu Gly He Glu Thr He Gly Glu Asp He Arg Ser Asn 390 395 400
ATT AAA GCA TTA CAG AAA ATG TCT CGT GGG TAT GGA CAC CCT AAA GAG 1305 He Lys Ala Leu Gin Lys Met Ser Arg Gly Tyr Gly His Pro Lys Glu 405 410 415
TTC TTT TTG GAC GCA ATG ATA AAA AAA CAG GAA AAT GCG ATT AAA CGC 1353 Phe Phe Leu Asp Ala Met He Lys Lys Gin Glu Asn Ala He Lys Arg 420 425 430 435
ATA GAA GCA CGA AAA TGT GCG GTA AGC GAT GAC TTC AAA CAA GGT ATG 1401 He Glu Ala Arg Lys Cys Ala Val Ser Asp Asp Phe Lys Gin Gly Met 440 445 450
AAA CGA AAC ATT AAA GTT AAT AAC CTT GTT AAA GCT ATG CGA CAA GGC 1449 Lys Arg Asn He Lys Val Asn Asn Leu Val Lys Ala Met Arg Gin Gly 455 460 465
AAA AAA GTG AGT AGG ACA TTG ATT GCT AAA GTG CTT GCT AAC ACC ATA 1497 Lys Lys Val Ser Arg Thr Leu He Ala Lys Val Leu Ala Asn Thr He 470 475 480
GAC ACC GAT GCG GGT TAT TGC TTC ATT TCG CCG ACA GAT TTA GCG ACA 1545 Asp Thr Asp Ala Gly Tyr Cys Phe He Ser Pro Thr Asp Leu Ala Thr 485 490 495 CAA CTT GGC AAC ATC AGC CCT AGA CTA TCT AAA AGC ATA GTT ACC GCC 1593 Gin Leu Gly Asn He Ser Pro Arg Leu Ser Lys Ser He Val Thr Ala 500 505 510 515
ATA GAG CAA GCA GAG GGC GTG AGA CTG AAT TAT GCG TTG ATT GAC AAA 1641 He Glu Gin Ala Glu Gly Val Arg Leu Asn Tyr Ala Leu He Asp Lys 520 525 530
ATC ACC TAT AAC TCA CTC CAC AAT ATC TTA AGT TTC ATT TTT GAT ATT 1689 He Thr Tyr Asn Ser Leu His Asn He Leu Ser Phe He Phe Asp He 535 540 545
GAT AAC CCT TTA AGC GAC CAA GTG TTT GAG AGA TTA GTC ATT GAA GTC 1737 Asp Asn Pro Leu Ser Asp Gin Val Phe Glu Arg Leu Val He Glu Val 550 555 560
CCA AGA GAA GCA CTT AAA AAT GTG AAG TTG CCA CAA ATC AAA AAT GTA 1785 Pro Arg Glu Ala Leu Lys Asn Val Lys Leu Pro Gin He Lys Asn Val 565 570 575
TTG ACT TCT CAA ATC TTT GAT GGC GCT TAC CAC TTT AAA AGT TAAACCATG 1836 Leu Thr Ser Gin He Phe Asp Gly Ala Tyr His Phe Lys Ser 580 585 590
CTCTTTATCA GCGCAACTAA CACGAATGCC GGAAAAACCA CAT 1879
(2) INFORMATION FOR SEQ ID NO: 1164:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 593 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1164:
Met Leu Leu Asn Tyr Asp Phe Leu Glu Phe Val Asp Glu Pro Lys Arg
1 5 10 15
Asn Thr Ser Leu Thr Ala Ser He Asp Lys Ala Leu Ala Asp Arg Lys
20 25 30
Leu Ala Arg Gin Asn Lys Pro Ser Val Arg Val Leu Gly Lys Ala Met
35 40 45
Pro Leu Ser Lys Phe Leu Asp Ala Val Gly Asp Glu He Ser Arg Leu
50 55 60
Lys Tyr Asp Met Ser His Lys Thr He Lys Gly Ser Thr He Glu Ser 65 70 75 80
Ser Asn Leu He Ser He Tyr Lys Lys He Ala Ser Gly Leu Pro Phe
85 90 95
Gly Thr He Ser Ala Phe Arg Pro Phe Lys Asp Ala Phe Tyr Lys Asp
100 105 110
Phe Thr Glu Lys Glu Gin Asn Ala Leu He Tyr Ala Tyr Lys Ser Gly
115 120 125
Ala Asp Pro Lys Asn Ala Asp He He Ala Lys Tyr Trp Leu Ser Gin 130 135 140
Ser Val Asp Leu Asp Pro Tyr Asp Pro He Lys Val Val Asp Phe Phe 145 150 155 160
His Pro Gin Pro Glu Asn Gly Lys Glu Thr Thr Lys Phe Lys Asn Tyr
165 170 175
Lys Asp Arg He Glu Asn He Tyr Ala Thr Leu Tyr Asn Thr Leu Gly
180 185 190
Arg Gly Tyr Val Asp Lys Phe Phe Lys Lys Glu Ala Thr Met Arg Asp
195 200 205
Phe Met Ser Ser Asp Lys Phe Val Glu Arg Tyr Arg Tyr Thr Arg Lys
210 215 220
Glu Asn Met Ala Arg Thr Gin Ala Leu Lys Asp He Met Asn He Asp 225 230 235 240
Arg Asp Phe He Gly Tyr He Glu Val Leu Gly Tyr Trp Lys Asp Asn
245 250 255
Pro Lys Asp Asn He Leu Pro Asp Lys Glu Val Ser Phe Phe Val Phe
260 265 270
Gin Asn Glu Pro Ser Ser Thr Phe Asp Leu Lys Asn His Leu Leu He
275 280 285
Trp Gly Lys Gin Phe Arg Gin Val Ala He Cys Tyr Gly Gly Gin Leu
290 295 300
He Ala Asn Lys Asn Lys Thr Tyr Arg He Asp Leu He Ser Cys Arg 305 310 315 320
Pro Asp Asn Phe Gly Glu Val Trp Ala Lys Phe Thr Gly He Lys Phe
325 330 335
Ser Val Pro Ser Asp Leu Pro Gin Ala Leu Thr Arg He Asn Asp Ser
340 345 350
Val Tyr Thr Phe Leu Ser Arg Asn Lys Glu Gly He Gly Leu Asn Lys
355 360 365
Leu Ala Leu Asn Lys Val Val Lys Thr Glu Leu Lys Ala Thr Cys Met
370 375 380
Pro Tyr Asp Tyr Ser Lys Leu Gly He Glu Thr He Gly Glu Asp He 385 390 395 400
Arg Ser Asn He Lys Ala Leu Gin Lys Met Ser Arg Gly Tyr Gly His
405 410 415
Pro Lys Glu Phe Phe Leu Asp Ala Met He Lys Lys Gin Glu Asn Ala
420 425 430
He Lys Arg He Glu Ala Arg Lys Cys Ala Val Ser Asp Asp Phe Lys
435 440 445
Gin Gly Met Lys Arg Asn He Lys Val Asn Asn Leu Val Lys Ala Met
450 455 460
Arg Gin Gly Lys Lys Val Ser Arg Thr Leu He Ala Lys Val Leu Ala 465 470 475 480
Asn Thr He Asp Thr Asp Ala Gly Tyr Cys Phe He Ser Pro Thr Asp
485 490 495
Leu Ala Thr Gin Leu Gly Asn He Ser Pro Arg Leu Ser Lys Ser He
500 505 510
Val Thr Ala He Glu Gin Ala Glu Gly Val Arg Leu Asn Tyr Ala Leu
515 520 525
He Asp Lys He Thr Tyr Asn Ser Leu His Asn He Leu Ser Phe He
530 535 540
Phe Asp He Asp Asn Pro Leu Ser Asp Gin Val Phe Glu Arg Leu Val 545 550 555 560
He Glu Val Pro Arg Glu Ala Leu Lys Asn Val Lys Leu Pro Gin He 565 570 575 Lys Asn Val Leu Thr Ser Gin He Phe Asp Gly Ala Tyr His Phe Lys
580 585 590
Ser
(2) INFORMATION FOR SEQ ID NO: 1165
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1063 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 13...1014 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1165:
TGCTAAATTG TG ATG TTT CAC AAA GCC CTT ATT ACC TTT ATC GTT CTA TGG 51 Met Phe His Lys Ala Leu He Thr Phe He Val Leu Trp 1 5 10
TTT TTT TTG AAT GGC TTA GGG GCT TAT GAT TTC AAG CAT TGT CAA GCG 99 Phe Phe Leu Asn Gly Leu Gly Ala Tyr Asp Phe Lys His Cys Gin Ala 15 20 25
TTT TTT AAA AAA GCG AGC CTT CAA AAA GGA GGC GTG GCT TTA AAA GAA 147 Phe Phe Lys Lys Ala Ser Leu Gin Lys Gly Gly Val Ala Leu Lys Glu 30 35 40 45
TTG CCT AAA GCC GTG TAT TTG TAT TAT TCC AAA ACC TAT CCC AAA CAC 195 Leu Pro Lys Gly Val Tyr Leu Tyr Tyr Ser Lys Thr Tyr Pro Lys His 50 55 60
GCC AAA GTC ATC AAA TCC GAT CCC TTT GTA GGG TTG TAT TTG TTG CAA 243 Ala Lys Val He Lys Ser Asp Pro Phe Val Gly Leu Tyr Leu Leu Gin 65 70 75
AGC GCA CCA AGC GAG TAT GTT TAT ACC TTA AGG GAT TTA GAC AAA GAC 291 Ser Ala Pro Ser Glu Tyr Val Tyr Thr Leu Arg Asp Leu Asp Lys Asp 80 85 90
GCC CTT ATA AGG CCA ATG GCT AGC ATA GGG GAT AAA GAA GCC CTA GAA 339 Ala Leu He Arg Pro Met Ala Ser He Gly Asp Lys Glu Ala Leu Glu 95 100 105
ACG CGA TTA TTG GTG GGG CAA AGA GGC TAT GAG CGC TAC GCT CAA ATT 387 Thr Arg Leu Leu Val Gly Gin Arg Gly Tyr Glu Arg Tyr Ala Gin He 110 115 120 125 TCG CAA AAG ACT CAA AAA AAT GGC GTT ATC AGC AAT ATT TGC TAT CAA 435 Ser Gin Lys Thr Gin Lys Asn Gly Val He Ser Asn He Cys Tyr Gin 130 135 140
ATG TTA GGG CTA GGG GTA GGG GGG AAT GGC TTT ATA GAA ACG AAA TTT 483 Met Leu Gly Leu Gly Val Gly Gly Asn Gly Phe He Glu Thr Lys Phe 145 150 155
ATC AAG CGC TTT TTA AAC CAG CAA GAG CCT TAT TAT GGG GAT ATT GGG 531 He Lys Arg Phe Leu Asn Gin Gin Glu Pro Tyr Tyr Gly Asp He Gly 160 165 170
GTG CGT TTA GAA GAA CAT CAT AAG CGT TTA GTG GTA GTG CAA TTT GAT 579 Val Arg Leu Glu Glu His His Lys Arg Leu Val Val Val Gin Phe Asp 175 180 185
CCA TTT TTC CCT AAA AAC CCT TTT TTA AAA AAC GAT GAA ATC CTA GCG 627 Pro Phe Phe Pro Lys Asn Pro Phe Leu Lys Asn Asp Glu He Leu Ala 190 195 200 205
ATC AAC CAT CAA AAG ATC CAC TCA TTA GCG GAG TTT GAA TGG GTG GTG 675 He Asn His Gin Lys He His Ser Leu Ala Glu Phe Glu Trp Val Val 210 215 220
AGC AAT CTT AAA TAC CAA AGC CTT GCA AAA GTG GAA ATC AAA CGA AAC 723 Ser Asn Leu Lys Tyr Gin Ser Leu Ala Lys Val Glu He Lys Arg Asn 225 230 235
CAT AAA GTC AAA GAA GTA ACG CTC AAA GTC AAT AAG CGT TAT GGG GGG 771 His Lys Val Lys Glu Val Thr Leu Lys Val Asn Lys Arg Tyr Gly Gly 240 245 250
TTT TTA CTC AAA GAC ACT TTT TTA GAG CGC TAT GGC ATC GCT TTA GAT 819 Phe Leu Leu Lys Asp Thr Phe Leu Glu Arg Tyr Gly He Ala Leu Asp 255 260 265
GAG CGT TTT ATT ATC ACT AAA ATA GGC GCT CAT TTG CCC AAA GGC TTG 867 Glu Arg Phe He He Thr Lys He Gly Ala His Leu Pro Lys Gly Leu 270 275 280 285
GAT TTT TTA AAG CTT GGG GAT AGG ATT TTA TGG GTG AAT TAT AAA AGC 915 Asp Phe Leu Lys Leu Gly Asp Arg He Leu Trp Val Asn Tyr Lys Ser 290 295 300
GTG GCG TCC AAC CCA AAG GCT TTA AGA GAA GCG TTA AGC GCG CCT AAA 963 Val Ala Ser Asn Pro Lys Ala Leu Arg Glu Ala Leu Ser Ala Pro Lys 305 310 315
ATT GAA TTA TTA GTC TTG CGT AAA GGC TTT GAA TTT TAC ATT AAA GTC 1011 He Glu Leu Leu Val Leu Arg Lys Gly Phe Glu Phe Tyr He Lys Val 320 325 330
CGT TGAAGTATTG ATGAAAAATG ACGCTTATGA AATTATTCTT TCTTGGTTT 1063
Arg (2) INFORMATION FOR SEQ ID NO: 1166:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 334 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1166:
Met Phe His Lys Ala Leu He Thr Phe He Val Leu Trp Phe Phe Leu
1 5 10 15
Asn Gly Leu Gly Ala Tyr Asp Phe Lys His Cys Gin Ala Phe Phe Lys
20 25 30
Lys Ala Ser Leu Gin Lys Gly Gly Val Ala Leu Lys Glu Leu Pro Lys
35 40 45
Gly Val Tyr Leu Tyr Tyr Ser Lys Thr Tyr Pro Lys His Ala Lys Val
50 55 60
He Lys Ser Asp Pro Phe Val Gly Leu Tyr Leu Leu Gin Ser Ala Pro 65 70 75 80
Ser Glu Tyr Val Tyr Thr Leu Arg Asp Leu Asp Lys Asp Ala Leu He
85 90 95
Arg Pro Met Ala Ser He Gly Asp Lys Glu Ala Leu Glu Thr Arg Leu
100 105 110
Leu Val Gly Gin Arg Gly Tyr Glu Arg Tyr Ala Gin He Ser Gin Lys
115 120 125
Thr Gin Lys Asn Gly Val He Ser Asn He Cys Tyr Gin Met Leu Gly
130 135 140
Leu Gly Val Gly Gly Asn Gly Phe He Glu Thr Lys Phe He Lys Arg 145 150 155 160
Phe Leu Asn Gin Gin Glu Pro Tyr Tyr Gly Asp He Gly Val Arg Leu
165 170 175
Glu Glu His His Lys Arg Leu Val Val Val Gin Phe Asp Pro Phe Phe
180 185 190
Pro Lys Asn Pro Phe Leu Lys Asn Asp Glu He Leu Ala He Asn His
195 200 205
Gin Lys He His Ser Leu Ala Glu Phe Glu Trp Val Val Ser Asn Leu
210 215 220
Lys Tyr Gin Ser Leu Ala Lys Val Glu He Lys Arg Asn His Lys Val 225 230 235 240
Lys Glu Val Thr Leu Lys Val Asn Lys Arg Tyr Gly Gly Phe Leu Leu
245 250 255
Lys Asp Thr Phe Leu Glu Arg Tyr Gly He Ala Leu Asp Glu Arg Phe
260 265 270
He He Thr Lys He Gly Ala His Leu Pro Lys Gly Leu Asp Phe Leu
275 280 285
Lys Leu Gly Asp Arg He Leu Trp Val Asn Tyr Lys Ser Val Ala Ser
290 295 300
Asn Pro Lys Ala Leu Arg Glu Ala Leu Ser Ala Pro Lys He Glu Leu 305 310 315 320
Leu Val Leu Arg Lys Gly Phe Glu Phe Tyr He Lys Val Arg 325 330 (2) INFORMATION FOR SEQ ID NO: 1167:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1133 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...1104 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1167:
TAGTCTTGCG TAAAGGCTTT GAATTTTACA TTAAAGTCCG TTGAAGTATT G ATG AAA 57
Met Lys
1
AAT GAC GCT TAT GAA ATT ATT CTT TCT TGG TTT ATC ACG CCT CTC ACG 105 Asn Asp Ala Tyr Glu He He Leu Ser Trp Phe He Thr Pro Leu Thr 5 10 15
GCG ATT TTA GGG CGT TTC GCT GAA TTT TTT CTC TAC ACT TTG CAT GCG 153 Ala He Leu Gly Arg Phe Ala Glu Phe Phe Leu Tyr Thr Leu His Ala 20 25 30
CAA TTG GTG TTT AAT AGC GTG GTC GCT TTG GCG TTC ATG CTC TTT GCT 201 Gin Leu Val Phe Asn Ser Val Val Ala Leu Ala Phe Met Leu Phe Ala 35 40 45 50
TAT AGG AGT TTG AAA GAA CAG AAT TTC TTC AGC GCT AGC GCG CTA ACA 249 Tyr Arg Ser Leu Lys Glu Gin Asn Phe Phe Ser Ala Ser Ala Leu Thr 55 60 65
GAA GCG TTA TTG TTT GTG GGG TTT TTT GCA CTT TTC AAC TAC GCT TTA 297 Glu Ala Leu Leu Phe Val Gly Phe Phe Ala Leu Phe Asn Tyr Ala Leu 70 75 80
AAA AAT CCC ATG CAT TTT TAT GAA TTT TTC CAA AAC GCT ATT TTT ATT 345 Lys Asn Pro Met His Phe Tyr Glu Phe Phe Gin Asn Ala He Phe He 85 90 95
GCG CCT AAC ATG ATC GCG CAA AGC CTC TCT CAA AGC TTG AGT AAC TTT 393 Ala Pro Asn Met He Ala Gin Ser Leu Ser Gin Ser Leu Ser Asn Phe 100 105 110
TCT GAC CAT GCG CTT TCT TTA GAT TTT ATC TTT AAT CAT GGT TTT TAT 441 Ser Asp His Ala Leu Ser Leu Asp Phe He Phe Asn His Gly Phe Tyr 115 120 125 130 GCC CTT AGT TTC ATC AGC GAT TTG AGC CAT AAT GAA ATG TCT GTG TGG 489 Ala Leu Ser Phe He Ser Asp Leu Ser His Asn Glu Met Ser Val Trp 135 140 145
CTT TTT TTA AGC GTT TTG CAA GGG CTT TTT TTG AGC GTG CTG TTT GCA 537 Leu Phe Leu Ser Val Leu Gin Gly Leu Phe Leu Ser Val Leu Phe Ala 150 155 160
ATC ATC ATT TTA GTG TAT TTA GAA GTG CAT GTG TGG TGC TCT TTA GGG 585 He He He Leu Val Tyr Leu Glu Val His Val Trp Cys Ser Leu Gly 165 170 175
GTG CTG TTT TTA GCG TTT GGG TTT TTT AAA ACC TGG AGG AGC GTT GTG 633 Val Leu Phe Leu Ala Phe Gly Phe Phe Lys Thr Trp Arg Ser Val Val 180 185 190
GTT ATA TGC CTA AAA AAG TGC TTC GCT CTT GGG TTT TAC AAG CCT TTT 681 Val He Cys Leu Lys Lys Cys Phe Ala Leu Gly Phe Tyr Lys Pro Phe 195 200 205 210
TTG TTG TTG GTA GGG TTT TTG AAT GTG TCG GTT ACT AAG GCT TTA ATA 729 Leu Leu Leu Val Gly Phe Leu Asn Val Ser Val Thr Lys Ala Leu He 215 220 225
GAC GCT CAT ATG CAA GAA AAA CAA GAC TTA AGC CTT TTA TTG GTG GTA 777 Asp Ala His Met Gin Glu Lys Gin Asp Leu Ser Leu Leu Leu Val Val 230 235 240
GCG TTA TTT TTG TGT TGC GTT TTT ATC ATC GGC GTG CCT TTT TTC ATC 825 Ala Leu Phe Leu Cys Cys Val Phe He He Gly Val Pro Phe Phe He 245 250 255
AAC GCT TTG TTT AGG GTG CAA AAC AGC CTT AAA GAA ACT TAC AAA CTC 873 Asn Ala Leu Phe Arg Val Gin Asn Ser Leu Lys Glu Thr Tyr Lys Leu 260 265 270
GCC ACC AAT TTG AGT GCC AAC CTC AGC CAA AAC GCC CTT AAT TCC TTA 921 Ala Thr Asn Leu Ser Ala Asn Leu Ser Gin Asn Ala Leu Asn Ser Leu 275 280 285 290
CAA TAC ATC ACG ACC CCA CCC GCT TCT TCT AGC GTT TCT TCT TCT ATG 969 Gin Tyr He Thr Thr Pro Pro Ala Ser Ser Ser Val Ser Ser Ser Met 295 300 305
AGT GAA AGC GTC TCT AAA GAA AAA GAA ACG CAT TCC CCC ACA TTT AAG 1017 Ser Glu Ser Val Ser Lys Glu Lys Glu Thr His Ser Pro Thr Phe Lys 310 315 320
GTA GAA ACC ACT CAA TTA GAT GTA AAA ATC CCA AAT TTC AAG CAA AAA 1065 Val Glu Thr Thr Gin Leu Asp Val Lys He Pro Asn Phe Lys Gin Lys 325 330 335
AAG GTT AAA AAG GAT ACA ATA AAT ACA AAA AAT GAA ATT TAAATAAATA GG 1116 Lys Val Lys Lys Asp Thr He Asn Thr Lys Asn Glu He 340 345 350 AATTTAATGA GAATTTT 1133
(2) INFORMATION FOR SEQ ID NO: 1168:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 351 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1168:
Met Lys Asn Asp Ala Tyr Glu He He Leu Ser Trp Phe He Thr Pro
1 5 10 15
Leu Thr Ala He Leu Gly Arg Phe Ala Glu Phe Phe Leu Tyr Thr Leu
20 25 30
His Ala Gin Leu Val Phe Asn Ser Val Val Ala Leu Ala Phe Met Leu
35 40 45
Phe Ala Tyr Arg Ser Leu Lys Glu Gin Asn Phe Phe Ser Ala Ser Ala
50 55 60
Leu Thr Glu Ala Leu Leu Phe Val Gly Phe Phe Ala Leu Phe Asn Tyr 65 70 75 80
Ala Leu Lys Asn Pro Met His Phe Tyr Glu Phe Phe Gin Asn Ala He
85 90 95
Phe He Ala Pro Asn Met He Ala Gin Ser Leu Ser Gin Ser Leu Ser
100 105 110
Asn Phe Ser Asp His Ala Leu Ser Leu Asp Phe He Phe Asn His Gly
115 120 125
Phe Tyr Ala Leu Ser Phe He Ser Asp Leu Ser His Asn Glu Met Ser
130 135 140
Val Trp Leu Phe Leu Ser Val Leu Gin Gly Leu Phe Leu Ser Val Leu 145 150 155 160
Phe Ala He He He Leu Val Tyr Leu Glu Val His Val Trp Cys Ser
165 170 175
Leu Gly Val Leu Phe Leu Ala Phe Gly Phe Phe Lys Thr Trp Arg Ser
180 185 190
Val Val Val He Cys Leu Lys Lys Cys Phe Ala Leu Gly Phe Tyr Lys
195 200 205
Pro Phe Leu Leu Leu Val Gly Phe Leu Asn Val Ser Val Thr Lys Ala
210 215 220
Leu He Asp Ala His Met Gin Glu Lys Gin Asp Leu Ser Leu Leu Leu 225 230 235 240
Val Val Ala Leu Phe Leu Cys Cys Val Phe He He Gly Val Pro Phe
245 250 255
Phe He Asn Ala Leu Phe Arg Val Gin Asn Ser Leu Lys Glu Thr Tyr
260 265 270
Lys Leu Ala Thr Asn Leu Ser Ala Asn Leu Ser Gin Asn Ala Leu Asn
275 280 285
Ser Leu Gin Tyr He Thr Thr Pro Pro Ala Ser Ser Ser Val Ser Ser
290 295 300
Ser Met Ser Glu Ser Val Ser Lys Glu Lys Glu Thr His Ser Pro Thr 305 310 315 320
Phe Lys Val Glu Thr Thr Gin Leu Asp Val Lys He Pro Asn Phe Lys 325 330 335 in Lys Lys Val Lys Lys Asp Thr He Asn Thr Lys Asn Glu He 340 345 350
(2) INFORMATION FOR SEQ ID NO: 1169:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 777 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 50...748 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1169:
CCCTTTCAAA CAAGGCCCTA AAAATTACGA AGAAAACCTG ATTTTCCCC ATG GAT AAC 58
Met Asp Asn
1
CCT AAA GGC ATT GAT GGT TTT ACT AAC CTT AAA GAA AAA GAC ATC GCC 106 Pro Lys Gly He Asp Gly Phe Thr Asn Leu Lys Glu Lys Asp He Ala 5 10 15
ACT AAT GAA AAT AAG CTT TTA CGC ACC ATT ACA GCG GAT AAA ATG ATA 154 Thr Asn Glu Asn Lys Leu Leu Arg Thr He Thr Ala Asp Lys Met He 20 25 30 35
CCC GCC TTT CTC ATC ACG CCT ATT TCT AGC CAG ATC GCT GGT AAA GTC 202 Pro Ala Phe Leu He Thr Pro He Ser Ser Gin He Ala Gly Lys Val 40 45 50
ATC GCG CAG GTG GAG AGC GAT ATT TTT GCT CAC ATG GGC AAG GCC GTC 250 He Ala Gin Val Glu Ser Asp He Phe Ala His Met Gly Lys Ala Val 55 60 65
TTA ATC CCC AAA GGC TCT AAA GTC ATA GGT TAT TAC AGC AAC AAT AAC 298 Leu He Pro Lys Gly Ser Lys Val He Gly Tyr Tyr Ser Asn Asn Asn 70 75 80
AAA ATG GGC GAA TAC CGC TTG GAT ATT GTA TGG AGC CGC ATC ATC ACT 346 Lys Met Gly Glu Tyr Arg Leu Asp He Val Trp Ser Arg He He Thr 85 90 95
CCC CAT GGC ATC AAT ATC ATG CTC ACT AAC GCT AAA GGG GCG GAC ATT 394 Pro His Gly He Asn He Met Leu Thr Asn Ala Lys Gly Ala Asp He 100 105 110 115 AAA GGC TAT AAC GGC TTG GTG GGG GAA TTG ATT GAA AGG AAT TTC CAG 442 Lys Gly Tyr Asn Gly Leu Val Gly Glu Leu He Glu Arg Asn Phe Gin 120 125 130
CGC TAT GGC GTG CCG TTA CTG CTT TCT ACT CTC ACT AAC GGC CTA TTG 490 Arg Tyr Gly Val Pro Leu Leu Leu Ser Thr Leu Thr Asn Gly Leu Leu 135 140 145
ATT GGG ATC ACT TCG GCT TTA AAC AAC AGA GGC AAT AAA GAA GGA GCC 538 He Gly He Thr Ser Ala Leu Asn Asn Arg Gly Asn Lys Glu Gly Ala 150 155 160
ACC AAT TTC TTT GGG GAT TAT CTT TTA ATG CAA TTG ATG AGG CAA AGC 586 Thr Asn Phe Phe Gly Asp Tyr Leu Leu Met Gin Leu Met Arg Gin Ser 165 170 175
GGC ATG GGG ATC AAT CAA GTA GTC AAT CAA ATT TTA AGA GAT AAG AGC 634 Gly Met Gly He Asn Gin Val Val Asn Gin He Leu Arg Asp Lys Ser 180 185 190 195
AAA ATC GCT CCT ATT GTG GTG ATT AGA GAA GGG AGT AGG GTC TTC ATT 682 Lys He Ala Pro He Val Val He Arg Glu Gly Ser Arg Val Phe He 200 205 210
TCG CCC AAT ACT GAC ATC TTT TTC CCT ATA CCC AGA GAG AAT GAA GTC 730 Ser Pro Asn Thr Asp He Phe Phe Pro He Pro Arg Glu Asn Glu Val 215 220 225
ATC GCT GAG TTT TTG AAG TGACTCAAAA ATCCCCAATT AAAAACGCT 777
He Ala Glu Phe Leu Lys 230
(2) INFORMATION FOR SEQ ID NO: 1170:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 233 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1170:
Met Asp Asn Pro Lys Gly He Asp Gly Phe Thr Asn Leu Lys Glu Lys
1 5 10 15
Asp He Ala Thr Asn Glu Asn Lys Leu Leu Arg Thr He Thr Ala Asp
20 25 30
Lys Met He Pro Ala Phe Leu He Thr Pro He Ser Ser Gin He Ala
35 40 45
Gly Lys Val He Ala Gin Val Glu Ser Asp He Phe Ala His Met Gly
50 55 60
Lys Ala Val Leu He Pro Lys Gly Ser Lys Val He Gly Tyr Tyr Ser 65 70 75 80 Asn Asn Asn Lys Met Gly Glu Tyr Arg Leu Asp He Val Trp Ser Arg
85 90 95
He He Thr Pro His Gly He Asn He Met Leu Thr Asn Ala Lys Gly
100 105 110
Ala Asp He Lys Gly Tyr Asn Gly Leu Val Gly Glu Leu He Glu Arg
115 120 125
Asn Phe Gin Arg Tyr Gly Val Pro Leu Leu Leu Ser Thr Leu Thr Asn
130 135 140
Gly Leu Leu He Gly He Thr Ser Ala Leu Asn Asn Arg Gly Asn Lys 145 150 155 160
Glu Gly Ala Thr Asn Phe Phe Gly Asp Tyr Leu Leu Met Gin Leu Met
165 170 175
Arg Gin Ser Gly Met Gly He Asn Gin Val Val Asn Gin He Leu Arg
180 185 190
Asp Lys Ser Lys He Ala Pro He Val Val He Arg Glu Gly Ser Arg
195 200 205
Val Phe He Ser Pro Asn Thr Asp He Phe Phe Pro He Pro Arg Glu
210 215 220
Asn Glu Val He Ala Glu Phe Leu Lys 225 230
(2) INFORMATION FOR SEQ ID NO: 1171:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1229 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 27...1169 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1171:
AAATAAAATT CAATAAAGGA AAAATA ATG AAA GAA AAA ATC GCT TTA ATC ACC 53
Met Lys Glu Lys He Ala Leu He Thr 1 5
GGG GTT ACC GGG CAA GAC GGG AGC TAT CTG GCT GAA TAC TTG CTG AAT 101 Gly Val Thr Gly Gin Asp Gly Ser Tyr Leu Ala Glu Tyr Leu Leu Asn 10 15 20 25
TTG GGT TAT GAA GTG CAT GGG TTA AAA AGG CGC TCT TCT AGC ATC AAC 149 Leu Gly Tyr Glu Val His Gly Leu Lys Arg Arg Ser Ser Ser He Asn 30 35 40
ACT TCT AGG ATC GAT CAT CTG TAT GAA GAT TTG CAT AGC GAT CAT AAA 197 Thr Ser Arg He Asp His Leu Tyr Glu Asp Leu His Ser Asp His Lys 45 50 55 AGG CGT TTT TTC TTA CAC TAT GGG GAT ATG ACC GAT AGC TCT AAT CTT 245 Arg Arg Phe Phe Leu His Tyr Gly Asp Met Thr Asp Ser Ser Asn Leu 60 65 70
ATC CAT TTA ATC GCT ACC ACT AAG CCT ACA GAG ATT TAT AAT TTA GCC 293 He His Leu He Ala Thr Thr Lys Pro Thr Glu He Tyr Asn Leu Ala 75 80 85
GCT CAA AGC CAT GTA AAA GTC TCT TTT GAA ACC CCC GAA TAC ACC GCT 341 Ala Gin Ser His Val Lys Val Ser Phe Glu Thr Pro Glu Tyr Thr Ala 90 95 100 105
AAC GCT GAT GGT ATT GGC ACG CTA AGG ATT TTA GAA GCC ATG CGG ATT 389 Asn Ala Asp Gly He Gly Thr Leu Arg He Leu Glu Ala Met Arg He 110 115 120
TTA GGA TTA GAA AAG AAA ACG CGC TTT TAT CAA GCC AGC ACG AGC GAA 437 Leu Gly Leu Glu Lys Lys Thr Arg Phe Tyr Gin Ala Ser Thr Ser Glu 125 130 135
TTG TAT GGC GAA GTC TTA GAA ACC CCG CAA AAT GAA AAC ACC CCC TTT 485 Leu Tyr Gly Glu Val Leu Glu Thr Pro Gin Asn Glu Asn Thr Pro Phe 140 145 150
AAC CCA CGA AGC CCC TAT GCG GTC GCT AAA ATG TAT GCC TTT TAC ATC 533 Asn Pro Arg Ser Pro Tyr Ala Val Ala Lys Met Tyr Ala Phe Tyr He 155 160 165
ACC AAA AAT TAC AGA GAG GCC TAT AAC TTG TTT GCG GTT AAT GGC ATT 581 Thr Lys Asn Tyr Arg Glu Ala Tyr Asn Leu Phe Ala Val Asn Gly He 170 175 180 185
CTT TTT AAC CAT GAG AGC AGG GTA AGG GGC GAA ACT TTT GTA ACC CGT 629 Leu Phe Asn His Glu Ser Arg Val Arg Gly Glu Thr Phe Val Thr Arg 190 195 200
AAA ATC ACA CGA GCC GCT AGC GCG ATA GCG TAT AAC TTA ACG GAT TGC 677 Lys He Thr Arg Ala Ala Ser Ala He Ala Tyr Asn Leu Thr Asp Cys 205 210 215
TTG TAT TTA GGG AAT TTA GAC GCT AAA AGA GAC TGG GGG CAT GCC AAA 725 Leu Tyr Leu Gly Asn Leu Asp Ala Lys Arg Asp Trp Gly His Ala Lys 220 225 230
GAT TAC GTG AAA ATG ATG CAT TTA ATG CTC CAA GCG CCC ATC CCA CAA 773 Asp Tyr Val Lys Met Met His Leu Met Leu Gin Ala Pro He Pro Gin 235 240 245
GAT TAT GTG ATC GCC ACA GGA AAG ACC ACA AGC GTG CGC GAT TTT GTG 821 Asp Tyr Val He Ala Thr Gly Lys Thr Thr Ser Val Arg Asp Phe Val 250 255 260 265
AAA ATG AGC TTT GAA TTT ATC GGT ATC AAT TTA GAA TTT CAA AAT ACA 869 Lys Met Ser Phe Glu Phe He Gly He Asn Leu Glu Phe Gin Asn Thr 270 275 280 GGG ATT AAA GAA ATC GGT TTG ATT AAA AGC GTT GAT GAA AAA AGA GCG 917 Gly He Lys Glu He Gly Leu He Lys Ser Val Asp Glu Lys Arg Ala 285 290 295
AAC GCT TTA AAA TTG AAC TTA AGC CAT TTA AAA AAA GGC CAA ATC GTG 965 Asn Ala Leu Lys Leu Asn Leu Ser His Leu Lys Lys Gly Gin He Val 300 305 310
GTG CGC ATA GAC GAG CGT TAT TTC AGG CCT ACC GAA GTG GAT TTG CTT 1013 Val Arg He Asp Glu Arg Tyr Phe Arg Pro Thr Glu Val Asp Leu Leu 315 320 325
TTA GGC GAT CCC ACT AAG GCA GAG AAA GAG CTA GAC TGG GTT AGG GAA 1061 Leu Gly Asp Pro Thr Lys Ala Glu Lys Glu Leu Asp Trp Val Arg Glu 330 335 340 345
TAC GAT TTA AAA GAG TTG GTT AAG GAC ATG TTA GAA TAC GAT TTA AAA 1109 Tyr Asp Leu Lys Glu Leu Val Lys Asp Met Leu Glu Tyr Asp Leu Lys 350 355 360
GAA TGC CAA AAA AAC CTT TAC TTG CAA GAT GGG GGT TAT ATT TTA AGG 1157 Glu Cys Gin Lys Asn Leu Tyr Leu Gin Asp Gly Gly Tyr He Leu Arg 365 370 375
AAT TTT TAT GAA TGAGATTATT TTAATCACTG GTGCCTATGG CATGGTGGGG CAGAA 1214 Asn Phe Tyr Glu 380
CACGGCGTTG TATTT 1229
(2) INFORMATION FOR SEQ ID NO:1172:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 381 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1172:
Met Lys Glu Lys He Ala Leu He Thr Gly Val Thr Gly Gin Asp Gly
1 5 10 15
Ser Tyr Leu Ala Glu Tyr Leu Leu Asn Leu Gly Tyr Glu Val His Gly
20 25 30
Leu Lys Arg Arg Ser Ser Ser He Asn Thr Ser Arg He Asp His Leu
35 40 45
Tyr Glu Asp Leu His Ser Asp His Lys Arg Arg Phe Phe Leu His Tyr
50 55 60
Gly Asp Met Thr Asp Ser Ser Asn Leu He His Leu He Ala Thr Thr 65 70 75 80
Lys Pro Thr Glu He Tyr Asn Leu Ala Ala Gin Ser His Val Lys Val
85 90 95
Ser Phe Glu Thr Pro Glu Tyr Thr Ala Asn Ala Asp Gly He Gly Thr 100 105 110
Leu Arg He Leu Glu Ala Met Arg He Leu Gly Leu Glu Lys Lys Thr
115 120 125
Arg Phe Tyr Gin Ala Ser Thr Ser Glu Leu Tyr Gly Glu Val Leu Glu
130 135 140
Thr Pro Gin Asn Glu Asn Thr Pro Phe Asn Pro Arg Ser Pro Tyr Ala 145 150 155 160
Val Ala Lys Met Tyr Ala Phe Tyr He Thr Lys Asn Tyr Arg Glu Ala
165 170 175
Tyr Asn Leu Phe Ala Val Asn Gly He Leu Phe Asn His Glu Ser Arg
180 185 190
Val Arg Gly Glu Thr Phe Val Thr Arg Lys He Thr Arg Ala Ala Ser
195 200 205
Ala He Ala Tyr Asn Leu Thr Asp Cys Leu Tyr Leu Gly Asn Leu Asp
210 215 220
Ala Lys Arg Asp Trp Gly His Ala Lys Asp Tyr Val Lys Met Met His 225 230 235 240
Leu Met Leu Gin Ala Pro He Pro Gin Asp Tyr Val He Ala Thr Gly
245 250 255
Lys Thr Thr Ser Val Arg Asp Phe Val Lys Met Ser Phe Glu Phe He
260 265 270
Gly He Asn Leu Glu Phe Gin Asn Thr Gly He Lys Glu He Gly Leu
275 280 285
He Lys Ser Val Asp Glu Lys Arg Ala Asn Ala Leu Lys Leu Asn Leu
290 295 300
Ser His Leu Lys Lys Gly Gin He Val Val Arg He Asp Glu Arg Tyr 305 310 315 320
Phe Arg Pro Thr Glu Val Asp Leu Leu Leu Gly Asp Pro Thr Lys Ala
325 330 335
Glu Lys Glu Leu Asp Trp Val Arg Glu Tyr Asp Leu Lys Glu Leu Val
340 345 350
Lys Asp Met Leu Glu Tyr Asp Leu Lys Glu Cys Gin Lys Asn Leu Tyr
355 360 365
Leu Gin Asp Gly Gly Tyr He Leu Arg Asn Phe Tyr Glu 370 375 380
(2) INFORMATION FOR SEQ ID NO: 1173:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1116 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 70...1065 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1173: CAGCAGTATC CCTATCGGTC AAGCCTTAAT GGCGTATTTC AACCCTACAA TCATCAAAAA 60 AGGATAAAA ATG GAT AGC GTA ACT CTA GCA TGC GGG AAC GGA GGG AAA GAA 111 Met Asp Ser Val Thr Leu Ala Cys Gly Asn Gly Gly Lys Glu 1 5 10
ACA AAC GCT TTG ATT GAG CGA GTC TTT ATG CCC TAT TTA AAA GAA TGG 159 Thr Asn Ala Leu He Glu Arg Val Phe Met Pro Tyr Leu Lys Glu Trp 15 20 25 30
ATT GTT GCA TTT GAT GAA GAC GCC CCT AAA TTT GAA GCT AGT GGG GAA 207 He Val Ala Phe Asp Glu Asp Ala Pro Lys Phe Glu Ala Ser Gly Glu 35 40 45
TAT TGC GTG AGC ACG GAT AGT TTT GTC ATC ACG CCC TTA ATT TTT AAT 255 Tyr Cys Val Ser Thr Asp Ser Phe Val He Thr Pro Leu He Phe Asn 50 55 60
GGG GGC GAT ATA GGC AAG CTT TGC GTT TGC GGG AGT GCG AAT GAT GTG 303 Gly Gly Asp He Gly Lys Leu Cys Val Cys Gly Ser Ala Asn Asp Val 65 70 75
AGC GTG CAA GGG GGC GAA CCT TTG TAT TTG AAT ATG GGT TTT ATT TTA 351 Ser Val Gin Gly Gly Glu Pro Leu Tyr Leu Asn Met Gly Phe He Leu 80 85 90
GAA GAA GGC TTA GAA ATT TCT CTT TTA AAA CAA ATT TTA CAA TCC ATA 399 Glu Glu Gly Leu Glu He Ser Leu Leu Lys Gin He Leu Gin Ser He 95 100 105 110
CAA AAA GAA TTG TTT AAA GCC AAC CTG AAA CTC CTC TCC CTA GAC ACT 447 Gin Lys Glu Leu Phe Lys Ala Asn Leu Lys Leu Leu Ser Leu Asp Thr 115 120 125
AAA GTC GTG CCA AAG GGG AGC GTG GAT AAG CTT TTT ATC AAC ACA ACC 495 Lys Val Val Pro Lys Gly Ser Val Asp Lys Leu Phe He Asn Thr Thr 130 135 140
TGC ATT GGT AAA ATC ATC AAG CCA GGG ATT TCT TCG TAC CAT TTA CAA 543 Cys He Gly Lys He He Lys Pro Gly He Ser Ser Tyr His Leu Gin 145 150 155
CAA GGG CAA GCC ATT ATC CTA AGC GAC ACT ATC GCC AAT CAT GGG GCA 591 Gin Gly Gin Ala He He Leu Ser Asp Thr He Ala Asn His Gly Ala 160 165 170
AGC TTA TTT GCG ATG CGT AAT GAA ATC AAG CTT AAA ACG AAT CTA GAA 639 Ser Leu Phe Ala Met Arg Asn Glu He Lys Leu Lys Thr Asn Leu Glu 175 180 185 190
AGC GAT TGC CAA CTG CTC TAT CCC TTA TTA AAA CCC CTA TTT TTA AGC 687 Ser Asp Cys Gin Leu Leu Tyr Pro Leu Leu Lys Pro Leu Phe Leu Ser 195 200 205
GAT CTC AAA ATT GAT GCT TTA AGA GAT GCG ACT AGG GGC GGG TTA GCG 735 Asp Leu Lys He Asp Ala Leu Arg Asp Ala Thr Arg Gly Gly Leu Ala 210 215 220
AGC GTG CTG AAC GAA TGG GCG AAC AGC TCT AGA GTG AAA ATC GTT ATA 783 Ser Val Leu Asn Glu Trp Ala Asn Ser Ser Arg Val Lys He Val He 225 230 235
GAA GAA GAA AAA ATC CCC TTA AAA GAA GAA ACG AAA GGG ATT TGT GAG 831 Glu Glu Glu Lys He Pro Leu Lys Glu Glu Thr Lys Gly He Cys Glu 240 245 250
ATT TTA GGG TTA GAA CCC TAC GCG CTA GCC AAT GAG GGG GTG TTT GTT 879 He Leu Gly Leu Glu Pro Tyr Ala Leu Ala Asn Glu Gly Val Phe Val 255 260 265 270
TTA GCG CTC AAT CAA AAA GAC GCC CCT AAA GCC TTA GAA ATT TTA AAA 927 Leu Ala Leu Asn Gin Lys Asp Ala Pro Lys Ala Leu Glu He Leu Lys 275 280 285
AGT AAC GAA AAA GCT AAA AAC GCT TGC GTG ATT GGC AAA GTG TTT GAA 975 Ser Asn Glu Lys Ala Lys Asn Ala Cys Val He Gly Lys Val Phe Glu 290 295 300
AAC CCT TAT CCT AGC GTG GTT TTA AAG AAC GCA TGG GGT TTT GAA AGG 1023 Asn Pro Tyr Pro Ser Val Val Leu Lys Asn Ala Trp Gly Phe Glu Arg 305 310 315
ATT TTA GAG GTG CCA GAG GGC GAA TTA TTG CCT AGG ATT TGT TAACACGCC 1074 He Leu Glu Val Pro Glu Gly Glu Leu Leu Pro Arg He Cys 320 325 330
GTCATTTTTT AATCGTTTTA AGCCTGCCCT AAAAATGGTT TA 1116
(2) INFORMATION FOR SEQ ID NO: 1174:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 332 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1174:
Met Asp Ser Val Thr Leu Ala Cys Gly Asn Gly Gly Lys Glu Thr Asn
1 5 10 15
Ala Leu He Glu Arg Val Phe Met Pro Tyr Leu Lys Glu Trp He Val
20 25 30
Ala Phe Asp Glu Asp Ala Pro Lys Phe Glu Ala Ser Gly Glu Tyr Cys
35 40 45
Val Ser Thr Asp Ser Phe Val He Thr Pro Leu He Phe Asn Gly Gly
50 55 60
Asp He Gly Lys Leu Cys Val Cys Gly Ser Ala Asn Asp Val Ser Val 65 70 75 80
Gin Gly Gly Glu Pro Leu Tyr Leu Asn Met Gly Phe He Leu Glu Glu 85 90 95
Gly Leu Glu He Ser Leu Leu Lys Gin He Leu Gin Ser He Gin Lys
100 105 110
Glu Leu Phe Lys Ala Asn Leu Lys Leu Leu Ser Leu Asp Thr Lys Val
115 120 125
Val Pro Lys Gly Ser Val Asp Lys Leu Phe He Asn Thr Thr Cys He
130 135 140
Gly Lys He He Lys Pro Gly He Ser Ser Tyr His Leu Gin Gin Gly 145 150 155 160
Gin Ala He He Leu Ser Asp Thr He Ala Asn His Gly Ala Ser Leu
165 170 175
Phe Ala Met Arg Asn Glu He Lys Leu Lys Thr Asn Leu Glu Ser Asp
180 185 190
Cys Gin Leu Leu Tyr Pro Leu Leu Lys Pro Leu Phe Leu Ser Asp Leu
195 200 205
Lys He Asp Ala Leu Arg Asp Ala Thr Arg Gly Gly Leu Ala Ser Val
210 215 220
Leu Asn Glu Trp Ala Asn Ser Ser Arg Val Lys He Val He Glu Glu 225 230 235 240
Glu Lys He Pro Leu Lys Glu Glu Thr Lys Gly He Cys Glu He Leu
245 250 255
Gly Leu Glu Pro Tyr Ala Leu Ala Asn Glu Gly Val Phe Val Leu Ala
260 265 270
Leu Asn Gin Lys Asp Ala Pro Lys Ala Leu Glu He Leu Lys Ser Asn
275 280 285
Glu Lys Ala Lys Asn Ala Cys Val He Gly Lys Val Phe Glu Asn Pro
290 295 300
Tyr Pro Ser Val Val Leu Lys Asn Ala Trp Gly Phe Glu Arg He Leu 305 310 315 320
Glu Val Pro Glu Gly Glu Leu Leu Pro Arg He Cys 325 330
(2) INFORMATION FOR SEQ ID NO: 1175:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1033 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...1005 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1175:
AAAAGGATAT TTTGA ATG AAA AGA ATG TTA GCG GAG TTT GAA AAA ATC CAA 51 Met Lys Arg Met Leu Ala Glu Phe Glu Lys He Gin 1 5 10 GCG ATT CTA ATG GCT TTC CCC CAT GAG TTT AGC GAC TGG GCG TAT TGT 99 Ala He Leu Met Ala Phe Pro His Glu Phe Ser Asp Trp Ala Tyr Cys 15 20 25
ATC AAA GAG GCT AGG GAA AGT TTT TTA AAC ATC ATT CAA ACC ATA GCC 147 He Lys Glu Ala Arg Glu Ser Phe Leu Asn He He Gin Thr He Ala 30 35 40
AAA CAC GCT AAA GTG CTA GTG TGC GTC CAC ACT AAC GAT ATT ATC GGT 195 Lys His Ala Lys Val Leu Val Cys Val His Thr Asn Asp He He Gly 45 50 55 60
TAT GAA ACG CTT AAA AAC TTA CCC GGT GTA GAG ATC GCA AGG ATT GAC 243 Tyr Glu Thr Leu Lys Asn Leu Pro Gly Val Glu He Ala Arg He Asp 65 70 75
ACT AAC GAC ACA TGG GCT AGG GAT TTT GGA GCG ATC AGC GTT GAA AAT 291 Thr Asn Asp Thr Trp Ala Arg Asp Phe Gly Ala He Ser Val Glu Asn 80 85 90
CAT GGC GTT TTA GAG TGC TTG GAT TTT GGC TTT AAT GGC TGG GGG TTA 339 His Gly Val Leu Glu Cys Leu Asp Phe Gly Phe Asn Gly Trp Gly Leu 95 100 105
AAA TAC CCG TCC AAT TTA GAC AAT CAA GTG AAT TTC AAA CTC AAA AGT 387 Lys Tyr Pro Ser Asn Leu Asp Asn Gin Val Asn Phe Lys Leu Lys Ser 110 115 120
TTA GGG TTT TTA AAA CAC CCT TTA AAA ACG ATG CCC TAT ATT TTA GAG 435 Leu Gly Phe Leu Lys His Pro Leu Lys Thr Met Pro Tyr He Leu Glu 125 130 135 140
GGC GGG AGT ATA GAA AGC GAT GGG GCT GGG AGC GTT TTA ACC AAC ACC 483 Gly Gly Ser He Glu Ser Asp Gly Ala Gly Ser Val Leu Thr Asn Thr 145 150 155
CAA TGC CTG TTA GAA AAA AAT CGT AAC CCC CAT TTG AAT CAA AAT GGA 531 Gin Cys Leu Leu Glu Lys Asn Arg Asn Pro His Leu Asn Gin Asn Gly 160 165 170
ATA GAA AAC ATG CTT AAA AAG GAA TTA GGG GCT AAA CAA GTG CTG TGG 579 He Glu Asn Met Leu Lys Lys Glu Leu Gly Ala Lys Gin Val Leu Trp 175 180 185
TAT TCT TAT GGC TAT CTC AAA GGC GAT GAT ACC GAT AGC CAT ACC GAC 627 Tyr Ser Tyr Gly Tyr Leu Lys Gly Asp Asp Thr Asp Ser His Thr Asp 190 195 200
ACG CTC GCT CGT TTT TTA GAT AAA GAC ACC ATT GTT TAT AGC ACA TGC 675 Thr Leu Ala Arg Phe Leu Asp Lys Asp Thr He Val Tyr Ser Thr Cys 205 210 215 220
GAA GAT GAA AAC GAT GAG CAC TAC ACA GCC TTA AAA AAA ATG CAA GAA 723 Glu Asp Glu Asn Asp Glu His Tyr Thr Ala Leu Lys Lys Met Gin Glu 225 230 235 GAA TTA AAA ACC TTT AAA AAA CTA GAC GGC ACG CCC TAT AAA CTC ATC 771 Glu Leu Lys Thr Phe Lys Lys Leu Asp Gly Thr Pro Tyr Lys Leu He 240 245 250
CCC CTA GAA ATC CCT AAA GCC ATT TTT GAT GAA AAC CAA CAA CGC TTG 819 Pro Leu Glu He Pro Lys Ala He Phe Asp Glu Asn Gin Gin Arg Leu 255 260 265
CCG GCA ACT TAT GTG AAT TTT TTA TTG TGC AAT AAC GCT TTA ATC GTG 867 Pro Ala Thr Tyr Val Asn Phe Leu Leu Cys Asn Asn Ala Leu He Val 270 275 280
CCC ACT TAC AAC GAC CCT AAA GAC GCG CTC ATT TTA GAA ACC TTG AAA 915 Pro Thr Tyr Asn Asp Pro Lys Asp Ala Leu He Leu Glu Thr Leu Lys 285 290 295 300
CAA CAC ACG CCC TTA GAA GTG ATA GGG GTT GAT TGC AAC ACC TTA ATC 963 Gin His Thr Pro Leu Glu Val He Gly Val Asp Cys Asn Thr Leu He 305 310 315
AAA CAG CAT GGA AGT TTG CAT TGT GTA ACG ATG CAA CTT TAT TGAACAAAA 1014 Lys Gin His Gly Ser Leu His Cys Val Thr Met Gin Leu Tyr 320 325 330
TCACGCTTTT TGGCGTGGT 1033
(2) INFORMATION FOR SEQ ID NO: 1176:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 330 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1176:
Met Lys Arg Met Leu Ala Glu Phe Glu Lys He Gin Ala He Leu Met
1 5 10 15
Ala Phe Pro His Glu Phe Ser Asp Trp Ala Tyr Cys He Lys Glu Ala
20 25 30
Arg Glu Ser Phe Leu Asn He He Gin Thr He Ala Lys His Ala Lys
35 40 45
Val Leu Val Cys Val His Thr Asn Asp He He Gly Tyr Glu Thr Leu
50 55 60
Lys Asn Leu Pro Gly Val Glu He Ala Arg He Asp Thr Asn Asp Thr 65 70 75 80
Trp Ala Arg Asp Phe Gly Ala He Ser Val Glu Asn His Gly Val Leu
85 90 95
Glu Cys Leu Asp Phe Gly Phe Asn Gly Trp Gly Leu Lys Tyr Pro Ser
100 105 110
Asn Leu Asp Asn Gin Val Asn Phe Lys Leu Lys Ser Leu Gly Phe Leu
115 120 125
Lys His Pro Leu Lys Thr Met Pro Tyr He Leu Glu Gly Gly Ser He 130 135 140
Glu Ser Asp Gly Ala Gly Ser Val Leu Thr Asn Thr Gin Cys Leu Leu 145 150 155 160
Glu Lys Asn Arg Asn Pro His Leu Asn Gin Asn Gly He Glu Asn Met
165 170 175
Leu Lys Lys Glu Leu Gly Ala Lys Gin Val Leu Trp Tyr Ser Tyr Gly
180 185 190
Tyr Leu Lys Gly Asp Asp Thr Asp Ser His Thr Asp Thr Leu Ala Arg
195 200 205
Phe Leu Asp Lys Asp Thr He Val Tyr Ser Thr Cys Glu Asp Glu Asn
210 215 220
Asp Glu His Tyr Thr Ala Leu Lys Lys Met Gin Glu Glu Leu Lys Thr 225 230 235 240
Phe Lys Lys Leu Asp Gly Thr Pro Tyr Lys Leu He Pro Leu Glu He
245 250 255
Pro Lys Ala He Phe Asp Glu Asn Gin Gin Arg Leu Pro Ala Thr Tyr
260 265 270
Val Asn Phe Leu Leu Cys Asn Asn Ala Leu He Val Pro Thr Tyr Asn
275 280 285
Asp Pro Lys Asp Ala Leu He Leu Glu Thr Leu Lys Gin His Thr Pro
290 295 300
Leu Glu Val He Gly Val Asp Cys Asn Thr Leu He Lys Gin His Gly 305 310 315 320
Ser Leu His Cys Val Thr Met Gin Leu Tyr 325 330
(2) INFORMATION FOR SEQ ID NO: 1177:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 449 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 73...408 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1177:
GCTTTATTCA AAAGAGCGAG GAGTTTAGAC GAGATAGCAA AATCATCAAT CTTTATCGCC 60 TTTCAACGCC TA ATG TTT GCA GTG CAT GCT GCG ATG ATT ACG ACA TTA AAG 111 Met Phe Ala Val His Ala Ala Met He Thr Thr Leu Lys 1 5 10
AAA GAA GTT TTC TTT CTT TAC CTT TAT ATC AAA TCA CTC AAA ATC CCG 159 Lys Glu Val Phe Phe Leu Tyr Leu Tyr He Lys Ser Leu Lys He Pro 15 20 25
ATT CCT ACT ACA CTG AAA TAC ATG ATT TCT TTA GGC AAA ATC AGA GAA 207 He Pro Thr Thr Leu Lys Tyr Met He Ser Leu Gly Lys He Arg Glu 30 35 40 45
TTA GAT GTT TTA GCA AAT CTT GCT AAA CTT TGC CCT ACT TGT CAT AGG 255 Leu Asp Val Leu Ala Asn Leu Ala Lys Leu Cys Pro Thr Cys His Arg 50 55 60
GCT TTA AAA AAA GGA TCT AGC GAA GAG GAG TTT CAA AAA CGC TTG ATT 303 Ala Leu Lys Lys Gly Ser Ser Glu Glu Glu Phe Gin Lys Arg Leu He 65 70 75
AGA AAC ATT CTC AAT CGC AAT AAA GAC AAT TTA GAG TTT GCG CAA TTG 351 Arg Asn He Leu Asn Arg Asn Lys Asp Asn Leu Glu Phe Ala Gin Leu 80 85 90
CGT TTT GAA ACC GAT GAT TTT TCA ACG CTT ATT GAT CGT ATT TGT GAA 399 Arg Phe Glu Thr Asp Asp Phe Ser Thr Leu He Asp Arg He Cys Glu 95 100 105
AGC TTG AAA TGAATTATAA AATTTTAGAT TTATTTTGTG GGGCTGGGGG T 449
Ser Leu Lys
110
(2) INFORMATION FOR SEQ ID NO: 1178:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 112 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1178:
Met Phe Ala Val His Ala Ala Met He Thr Thr Leu Lys Lys Glu Val
1 5 10 15
Phe Phe Leu Tyr Leu Tyr He Lys Ser Leu Lys He Pro He Pro Thr
20 25 30
Thr Leu Lys Tyr Met He Ser Leu Gly Lys He Arg Glu Leu Asp Val
35 40 45
Leu Ala Asn Leu Ala Lys Leu Cys Pro Thr Cys His Arg Ala Leu Lys
50 55 60
Lys Gly Ser Ser Glu Glu Glu Phe Gin Lys Arg Leu He Arg Asn He 65 70 75 80
Leu Asn Arg Asn Lys Asp Asn Leu Glu Phe Ala Gin Leu Arg Phe Glu
85 90 95
Thr Asp Asp Phe Ser Thr Leu He Asp Arg He Cys Glu Ser Leu Lys 100 105 110
(2) INFORMATION FOR SEQ ID NO: 1179:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 420 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25...375 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1179:
AAAAACTGGA ATCAAGGGGT TAAA ATG TTT TCT CAT GAA GTT TAT TTG GAG 51
Met Phe Ser His Glu Val Tyr Leu Glu 1 5
GGT TGC ACC CTT GAA TTA AGA AAG ATT TGC GAT GAT TTT GAA AAA AAT 99 Gly Cys Thr Leu Glu Leu Arg Lys He Cys Asp Asp Phe Glu Lys Asn 10 15 20 25
GCC ATG CAA GAT GAT TTA GGG CAG AAA CTC AGG AGT GAT GTG CTA GAG 147 Ala Met Gin Asp Asp Leu Gly Gin Lys Leu Arg Ser Asp Val Leu Glu 30 35 40
GAC ATG CTA AAA ATC GCG CAT GAT TTA GAA AAT TTA GAA GAT GAC ACC 195 Asp Met Leu Lys He Ala His Asp Leu Glu Asn Leu Glu Asp Asp Thr 45 50 55
CAA TAC CAA AGA AGA ATA ATT GAC GAG CAA ATT GAA GAA GCC AAA TCT 243 Gin Tyr Gin Arg Arg He He Asp Glu Gin He Glu Glu Ala Lys Ser 60 65 70
TTG ATG AGG CAA ATT GAT ATG AAT TTC CAT CCA TCA AGC GAG ATC GAT 291 Leu Met Arg Gin He Asp Met Asn Phe His Pro Ser Ser Glu He Asp 75 80 85
AGG CTT ATG CGT GAA GCC AAA GAG CAT GAA AGA GAA GCT AGT AAA AGA 339 Arg Leu Met Arg Glu Ala Lys Glu His Glu Arg Glu Ala Ser Lys Arg 90 95 100 105
TAT GAT GAG TAT CTT AAA TCT AAG GAT AAA AAT GAT TGATGTGAAT GGTTTA 391 Tyr Asp Glu Tyr Leu Lys Ser Lys Asp Lys Asn Asp 110 115
TTAAAAGAAC TGGATGATGC CTTAGATAA 420
(2) INFORMATION FOR SEQ ID NO: 1180:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 117 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1180:
Met Phe Ser His Glu Val Tyr Leu Glu Gly Cys Thr Leu Glu Leu Arg
1 5 10 15
Lys He Cys Asp Asp Phe Glu Lys Asn Ala Met Gin Asp Asp Leu Gly
20 25 30
Gin Lys Leu Arg Ser Asp Val Leu Glu Asp Met Leu Lys He Ala His
35 40 45
Asp Leu Glu Asn Leu Glu Asp Asp Thr Gin Tyr Gin Arg Arg He He
50 55 60
Asp Glu Gin He Glu Glu Ala Lys Ser Leu Met Arg Gin He Asp Met 65 70 75 80
Asn Phe His Pro Ser Ser Glu He Asp Arg Leu Met Arg Glu Ala Lys
85 90 95
Glu His Glu Arg Glu Ala Ser Lys Arg Tyr Asp Glu Tyr Leu Lys Ser
100 105 110
Lys Asp Lys Asn Asp 115
(2) INFORMATION FOR SEQ ID NO: 1181:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 651 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 43...627 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1181:
GGTGCGTTTG TTGTAAAAAT TTTTGTTTGG AAGGAAAAGG CA ATG CTA GGA CTT 54
Met Leu Gly Leu
1
GTA TTG TTA TAT GTT GGG ATT GTT TTA ATC AGC AAT GGG ATT TGC GGG 102 Val Leu Leu Tyr Val Gly He Val Leu He Ser Asn Gly He Cys Gly 5 10 15 20
TTA ACC AAA GTC GAT CCT AAA AGC ACT GCG GTG ATG AAC TTT TTT GTG 150 Leu Thr Lys Val Asp Pro Lys Ser Thr Ala Val Met Asn Phe Phe Val 25 30 35
GGC GGA CTT TCC ATT ATT TGT AAT ATA GTT GTC ATC ACT TAT TCT GCA 198 Gly Gly Leu Ser He He Cys Asn He Val Val He Thr Tyr Ser Ala 40 45 50
CTC CAC CCT ACA GCC CCT GTA GAA GGT GCT GAA GAT ATT GCT CAA GTA 246 Leu His Pro Thr Ala Pro Val Glu Gly Ala Glu Asp He Ala Gin Val 55 60 65
TCG CAC CAT TTG ACT AGT TTC TAT GGA CCA GCG ACT GGG TTA TTG TTT 294 Ser His His Leu Thr Ser Phe Tyr Gly Pro Ala Thr Gly Leu Leu Phe 70 75 80
GGT TTC ACC TAC TTG TAT GCG GCT ATC AAC CAC ACT TTT GGT TTG GAT 342 Gly Phe Thr Tyr Leu Tyr Ala Ala He Asn His Thr Phe Gly Leu Asp 85 90 95 100
TGG AGG CCC TAC TCT TGG TAT AGC TTA TTC GTA GCG ATC AAC ACG ATT 390 Trp Arg Pro Tyr Ser Trp Tyr Ser Leu Phe Val Ala He Asn Thr He 105 110 115
CCT GCT GCG ATT TTA TCC CAC TAT AGC GAT ATG CTT GAT GAC CAC AAA 438 Pro Ala Ala He Leu Ser His Tyr Ser Asp Met Leu Asp Asp His Lys 120 125 130
GTG TTA GGC ATC ACT GAA GGC GAT TGG TGG GCG ATC ATT TGG TTG GCT 486 Val Leu Gly He Thr Glu Gly Asp Trp Trp Ala He He Trp Leu Ala 135 140 145
TGG GGT GTT TTG TGG CTT ACC GCT TTC ATT GAA AAC ATC TTG AAA ATC 534 Trp Gly Val Leu Trp Leu Thr Ala Phe He Glu Asn He Leu Lys He 150 155 160
CCT TTA GGG AAA TTC ACT CCA TGG CTT GCT ATC ATT GAG GGT ATT TTA 582 Pro Leu Gly Lys Phe Thr Pro Trp Leu Ala He He Glu Gly He Leu 165 170 175 180
ACC GCT TGG ATC CCT GCT TGG TTG CTC TTT ATC CAA CAC TGG GTG TGAGA 632 Thr Ala Trp He Pro Ala Trp Leu Leu Phe He Gin His Trp Val 185 190 195
TGATCATAGA GCGTTTAGT 651
(2) INFORMATION FOR SEQ ID NO: 1182:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 195 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
( ii ) MOLECULE TYPE : protein
(xi ) SEQUENCE DESCRIPTION : SEQ ID NO : 1182 :
Met Leu Gly Leu Val Leu Leu Tyr Val Gly He Val Leu He Ser Asn 1 5 10 15 Gly He Cys Gly Leu Thr Lys Val Asp Pro Lys Ser Thr Ala Val Met
20 25 30
Asn Phe Phe Val Gly Gly Leu Ser He He Cys Asn He Val Val He
35 40 45
Thr Tyr Ser Ala Leu His Pro Thr Ala Pro Val Glu Gly Ala Glu Asp
50 55 60
He Ala Gin Val Ser His His Leu Thr Ser Phe Tyr Gly Pro Ala Thr 65 70 75 80
Gly Leu Leu Phe Gly Phe Thr Tyr Leu Tyr Ala Ala He Asn His Thr
85 90 95
Phe Gly Leu Asp Trp Arg Pro Tyr Ser Trp Tyr Ser Leu Phe Val Ala
100 105 110
He Asn Thr He Pro Ala Ala He Leu Ser His Tyr Ser Asp Met Leu
115 120 125
Asp Asp His Lys Val Leu Gly He Thr Glu Gly Asp Trp Trp Ala He
130 135 140
He Trp Leu Ala Trp Gly Val Leu Trp Leu Thr Ala Phe He Glu Asn 145 150 155 160
He Leu Lys He Pro Leu Gly Lys Phe Thr Pro Trp Leu Ala He He
165 170 175
Glu Gly He Leu Thr Ala Trp He Pro Ala Trp Leu Leu Phe He Gin
180 185 190
His Trp Val 195
(2) INFORMATION FOR SEQ ID NO:1183:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 526 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 48...482 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1183:
GAAGGGCATT TGTGCTAAAA ACCACTAAAA AAAGCCTGTT GGTTTTT ATG GGG GTT 56
Met Gly Val
1
TTT TTC CTT ATT TTT GGC GTG GAT CAA GCG ATT AAA TAC GCT ATT TTA 104 Phe Phe Leu He Phe Gly Val Asp Gin Ala He Lys Tyr Ala He Leu 5 10 15
GAA GGG TTT CGC TAT GAA AGT TTG ATG ATA GAT ATT GTT TTA GTG TTC 152 Glu Gly Phe Arg Tyr Glu Ser Leu Met He Asp He Val Leu Val Phe 20 25 30 35 AAT AAA GGC GTG GCG TTT TCC TTG CTC AGT TTT TTA GAG GGG GGT TTG 200 Asn Lys Gly Val Ala Phe Ser Leu Leu Ser Phe Leu Glu Gly Gly Leu 40 45 50
AAA TAC TTG CAA ATC CTT TTG ATT TTA GGG CTT TTT ATC TTT TTA ATG 248 Lys Tyr Leu Gin He Leu Leu He Leu Gly Leu Phe lie Phe Leu Met 55 60 65
CGC CAA AGG GAG CTT TTT AAA AAC CAT GCG ATA GAG TTT GGC ATG GTG 296 Arg Gin Arg Glu Leu Phe Lys Asn His Ala He Glu Phe Gly Met Val 70 75 80
TTT GGC GCC GGG GTT TCT AAT GTT TTA GAC CGG TTT GTG CAT GGG GGC 344 Phe Gly Ala Gly Val Ser Asn Val Leu Asp Arg Phe Val His Gly Gly 85 90 95
GTG GTG GAT TAT GTG TAT TAT CAT TAT GGC TTT GAT TTT GCC ATT TTT 392 Val Val Asp Tyr Val Tyr Tyr His Tyr Gly Phe Asp Phe Ala He Phe 100 105 110 115
AAT TTC GCT GAT GTC ATG ATA GAT GTG GGC GTG GGC GTT TTA TTG TTG 440 Asn Phe Ala Asp Val Met He Asp Val Gly Val Gly Val Leu Leu Leu 120 125 130
AAA CAA TTC TTT TTT AAG CAA AAA CAA AAC AAA ATT AAG GCA TAATCACTC 491 Lys Gin Phe Phe Phe Lys Gin Lys Gin Asn Lys He Lys Ala 135 140 145
TTTTTAAAAT GAAAGGTCGC GTAGCTCAGT TGGTA 526
(2) INFORMATION FOR SEQ ID NO: 1184:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 145 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1184:
Met Gly Val Phe Phe Leu He Phe Gly Val Asp Gin Ala He Lys Tyr
1 5 10 15
Ala He Leu Glu Gly Phe Arg Tyr Glu Ser Leu Met He Asp He Val
20 25 30
Leu Val Phe Asn Lys Gly Val Ala Phe Ser Leu Leu Ser Phe Leu Glu
35 40 45
Gly Gly Leu Lys Tyr Leu Gin He Leu Leu He Leu Gly Leu Phe He
50 55 60
Phe Leu Met Arg Gin Arg Glu Leu Phe Lys Asn His Ala He Glu Phe 65 70 75 80
Gly Met Val Phe Gly Ala Gly Val Ser Asn Val Leu Asp Arg Phe Val
85 90 95
His Gly Gly Val Val Asp Tyr Val Tyr Tyr His Tyr Gly Phe Asp Phe 100 105 110
Ala He Phe Asn Phe Ala Asp Val Met He Asp Val Gly Val Gly Val
115 120 125
Leu Leu Leu Lys Gin Phe Phe Phe Lys Gin Lys Gin Asn Lys He Lys
130 135 140
Ala 145
(2) INFORMATION FOR SEQ ID NO:1185:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1392 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...1356 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1185:
TTTAAAAGGT ATTTTATAAC G ATG AAA ATT TTT GGG ACT GAT GGC GTG AGG 51
Met Lys He Phe Gly Thr Asp Gly Val Arg 1 5 10
GGT AAA GCA GGG GTG AAA CTC ACC CCC ATG TTT GTG ATG CGT TTA GGC 99 Gly Lys Ala Gly Val Lys Leu Thr Pro Met Phe Val Met Arg Leu Gly 15 20 25
ATT GCT GCC GGA TTG TAT TTT AAA AAA CAT TCT CAA ACG AAT AAA ATT 147 He Ala Ala Gly Leu Tyr Phe Lys Lys His Ser Gin Thr Asn Lys He 30 35 40
CTA ATC GGT AAA GAC ACC AGA AAA AGC GGC TAT ATG GTA GAA AAC GCT 195 Leu He Gly Lys Asp Thr Arg Lys Ser Gly Tyr Met Val Glu Asn Ala 45 50 55
TTA GTG AGC GCT CTA ACT TCC ATA GGC TAT AAT GTG ATT CAA ATA GGG 243 Leu Val Ser Ala Leu Thr Ser He Gly Tyr Asn Val He Gin He Gly 60 65 70
CCT ATG CCC ACC CCT GCG ATT GCG TTT TTA ACT GAA GAC ATG CGC TGT 291 Pro Met Pro Thr Pro Ala He Ala Phe Leu Thr Glu Asp Met Arg Cys 75 80 85 90
GAT GCG GGT ATT ATG ATA AGC GCG AGC CAC AAC CCT TTT GAA GAT AAT 339 Asp Ala Gly He Met He Ser Ala Ser His Asn Pro Phe Glu Asp Asn 95 100 105 GGC ATT AAG TTT TTC AAT TCT TAT GGC TAT AAG CTT AAA GAA GAA GAA 387 Gly He Lys Phe Phe Asn Ser Tyr Gly Tyr Lys Leu Lys Glu Glu Glu 110 115 120
GAA AAA GCG ATT GAA GAA ATC TTT CAT GAT GAA GAA TTA CTG CAT TCT 435 Glu Lys Ala He Glu Glu He Phe His Asp Glu Glu Leu Leu His Ser 125 130 135
AGC TAT AAA GTG GGT GAG AGC GTC GGT AGC GCT AAA AGG ATA GAC GAT 483 Ser Tyr Lys Val Gly Glu Ser Val Gly Ser Ala Lys Arg He Asp Asp 140 145 150
GTC ATA GGG CGC TAT ATT GCA CAT TTA AAA CAC TCT TTC CCC AAA CAT 531 Val He Gly Arg Tyr He Ala His Leu Lys His Ser Phe Pro Lys His 155 160 165 170
TTG AAT TTA CAG AGT TTA AGG ATC GTG CTA GAT ACG GCT AAT GGC GCG 579 Leu Asn Leu Gin Ser Leu Arg He Val Leu Asp Thr Ala Asn Gly Ala 175 180 185
GCT TAT AAG GTG GCT CCG GTC GTT TTT AGC GAG CTT GGG GCT GAT GTG 627 Ala Tyr Lys Val Ala Pro Val Val Phe Ser Glu Leu Gly Ala Asp Val 190 195 200
TTA GTG ATT AAT GAT GAG CCT AAC GGG TGT AAC ATT AAT GAT CAA TGC 675 Leu Val He Asn Asp Glu Pro Asn Gly Cys Asn He Asn Asp Gin Cys 205 210 215
GGG GCT TTA CAC CCC AAC CAA TTA AGC CAG GAA GTG AAA AAA TAC CGC 723 Gly Ala Leu His Pro Asn Gin Leu Ser Gin Glu Val Lys Lys Tyr Arg 220 225 230
GCA GAT TTA GGC TTT GCT TTT GAT GGC GAT GCT GAC AGG CTA GTG GTG 771 Ala Asp Leu Gly Phe Ala Phe Asp Gly Asp Ala Asp Arg Leu Val Val 235 240 245 250
GTG GAT AAT TTA GGG AAT ATC GTG CAT GGG GAT AAG CTT TTA GGG GTG 819 Val Asp Asn Leu Gly Asn He Val His Gly Asp Lys Leu Leu Gly Val 255 260 265
TTA GGG GTT TAT CAA AAA TCT AAA AAC GCC CTT TCT TCT CAA GCG GTT 867 Leu Gly Val Tyr Gin Lys Ser Lys Asn Ala Leu Ser Ser Gin Ala Val 270 275 280
GTC GCC ACA AAC ATG AGC AAT TTA GCC CTT AAA GAA TAT TTA AAA TCC 915 Val Ala Thr Asn Met Ser Asn Leu Ala Leu Lys Glu Tyr Leu Lys Ser 285 290 295
CAA GAT TTG GAA TTG AAG CAT TGC GCG ATT GGG GAT AAG TTT GTG AGC 963 Gin Asp Leu Glu Leu Lys His Cys Ala He Gly Asp Lys Phe Val Ser 300 305 310
GAA TGC ATG CAA TTG AAT AAA GCC AAT TTT GGA GGC GAG CAA AGC GGG 1011 Glu Cys Met Gin Leu Asn Lys Ala Asn Phe Gly Gly Glu Gin Ser Gly 315 320 325 330 CAT ATC ATT TTT AGC GAT TAC GCT AAA ACA GGC GAT GGT TTG GTG TGC 1059 His He He Phe Ser Asp Tyr Ala Lys Thr Gly Asp Gly Leu Val Cys 335 340 345
GCT TTG CAA GTG AGC GCG TTA GTG TTA GAA AGC AAG CAG GTA AGC TCT 1107 Ala Leu Gin Val Ser Ala Leu Val Leu Glu Ser Lys Gin Val Ser Ser 350 355 360
GTT GCG TTA AAC CCC TTT GAA TTA TAC CCC CAA AGC CTA GTG AAT TTG 1155 Val Ala Leu Asn Pro Phe Glu Leu Tyr Pro Gin Ser Leu Val Asn Leu 365 370 375
AAT GTC CAA AAA AAG CCC CCT TTA GAA AGC CTG AAA GGT TAT AGC GCT 1203 Asn Val Gin Lys Lys Pro Pro Leu Glu Ser Leu Lys Gly Tyr Ser Ala 380 385 390
CTT TTA AAA GAA TTA GAC AAG CTA GAA ATC CGC CAT TTG ATC CGT TAT 1251 Leu Leu Lys Glu Leu Asp Lys Leu Glu He Arg His Leu He Arg Tyr 395 400 405 410
AGC GGC ACT GAA AAC AAA TTG CGA ATC CTT TTA GAA GCT AAA GAT GAA 1299 Ser Gly Thr Glu Asn Lys Leu Arg He Leu Leu Glu Ala Lys Asp Glu 415 420 425
AAG CTT TTA GAA TCC AAA ATG CAA GAA TTA AAA GAG TTT TTT GAA GGG 1347 Lys Leu Leu Glu Ser Lys Met Gin Glu Leu Lys Glu Phe Phe Glu Gly 430 435 440
CAT TTG TGC TAAAAACCAC TAAAAAAAGC CTGTTGGTTT TTATGG 1392
His Leu Cys 445
(2) INFORMATION FOR SEQ ID NO: 1186:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 445 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1186:
Met Lys He Phe Gly Thr Asp Gly Val Arg Gly Lys Ala Gly Val Lys
1 5 10 15
Leu Thr Pro Met Phe Val Met Arg Leu Gly He Ala Ala Gly Leu Tyr
20 25 30
Phe Lys Lys His Ser Gin Thr Asn Lys He Leu He Gly Lys Asp Thr
35 40 45
Arg Lys Ser Gly Tyr Met Val Glu Asn Ala Leu Val Ser Ala Leu Thr
50 55 60
Ser He Gly Tyr Asn Val He Gin He Gly Pro Met Pro Thr Pro Ala 65 70 75 80 He Ala Phe Leu Thr Glu Asp Met Arg Cys Asp Ala Gly He Met He
85 90 95
Ser Ala Ser His Asn Pro Phe Glu Asp Asn Gly He Lys Phe Phe Asn
100 105 110
Ser Tyr Gly Tyr Lys Leu Lys Glu Glu Glu Glu Lys Ala He Glu Glu
115 120 125
He Phe His Asp Glu Glu Leu Leu His Ser Ser Tyr Lys Val Gly Glu
130 135 140
Ser Val Gly Ser Ala Lys Arg He Asp Asp Val He Gly Arg Tyr He 145 150 155 160
Ala His Leu Lys His Ser Phe Pro Lys His Leu Asn Leu Gin Ser Leu
165 170 175
Arg He Val Leu Asp Thr Ala Asn Gly Ala Ala Tyr Lys Val Ala Pro
180 185 190
Val Val Phe Ser Glu Leu Gly Ala Asp Val Leu Val He Asn Asp Glu
195 200 205
Pro Asn Gly Cys Asn He Asn Asp Gin Cys Gly Ala Leu His Pro Asn
210 215 220
Gin Leu Ser Gin Glu Val Lys Lys Tyr Arg Ala Asp Leu Gly Phe Ala 225 230 235 240
Phe Asp Gly Asp Ala Asp Arg Leu Val Val Val Asp Asn Leu Gly Asn
245 250 255
He Val His Gly Asp Lys Leu Leu Gly Val Leu Gly Val Tyr Gin Lys
260 265 270
Ser Lys Asn Ala Leu Ser Ser Gin Ala Val Val Ala Thr Asn Met Ser
275 280 285
Asn Leu Ala Leu Lys Glu Tyr Leu Lys Ser Gin Asp Leu Glu Leu Lys
290 295 300
His Cys Ala He Gly Asp Lys Phe Val Ser Glu Cys Met Gin Leu Asn 305 310 315 320
Lys Ala Asn Phe Gly Gly Glu Gin Ser Gly His He He Phe Ser Asp
325 330 335
Tyr Ala Lys Thr Gly Asp Gly Leu Val Cys Ala Leu Gin Val Ser Ala
340 345 350
Leu Val Leu Glu Ser Lys Gin Val Ser Ser Val Ala Leu Asn Pro Phe
355 360 365
Glu Leu Tyr Pro Gin Ser Leu Val Asn Leu Asn Val Gin Lys Lys Pro
370 375 380
Pro Leu Glu Ser Leu Lys Gly Tyr Ser Ala Leu Leu Lys Glu Leu Asp 385 390 395 400
Lys Leu Glu He Arg His Leu He Arg Tyr Ser Gly Thr Glu Asn Lys
405 410 415
Leu Arg He Leu Leu Glu Ala Lys Asp Glu Lys Leu Leu Glu Ser Lys
420 425 430
Met Gin Glu Leu Lys Glu Phe Phe Glu Gly His Leu Cys 435 440 445
(2) INFORMATION FOR SEQ ID NO: 1187:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 483 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...441 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1187:
TTTTATCAAA GGATTCTT ATG ACA AAG ACC GCT AAA GTC AAT GAC ATC GTT 51
Met Thr Lys Thr Ala Lys Val Asn Asp He Val 1 5 10
CGT GAT TGG GTC GTT TTA GAC GCC AAA GAC AAG GTT TTT GGC CGC TTG 99 Arg Asp Trp Val Val Leu Asp Ala Lys Asp Lys Val Phe Gly Arg Leu 15 20 25
ATC ACT GAA ATC GCT GTG CTT TTA AGA GGG AAA CAC CGC CCT TTT TAC 147 He Thr Glu He Ala Val Leu Leu Arg Gly Lys His Arg Pro Phe Tyr 30 35 40
ACC CCT AAT GTG GAT TGT GGG GAT TTT GTG GTG GTT ATC AAC GCT AAT 195 Thr Pro Asn Val Asp Cys Gly Asp Phe Val Val Val He Asn Ala Asn 45 50 55
AAG GTT AAA TTT TCA GGC ATG AAA TTA GAG GAT AAA GAG TAT TTT ACC 243 Lys Val Lys Phe Ser Gly Met Lys Leu Glu Asp Lys Glu Tyr Phe Thr 60 65 70 75
CAT TCA GGC TAT TTT GGC AGC ACT AAG AGC AAG ACT CTC CAA GAA ATG 291 His Ser Gly Tyr Phe Gly Ser Thr Lys Ser Lys Thr Leu Gin Glu Met 80 85 90
CTA GAA AAA GCC CCT GAA AAG CTC TAC CAC TTA GCC GTT AGG GGC ATG 339 Leu Glu Lys Ala Pro Glu Lys Leu Tyr His Leu Ala Val Arg Gly Met 95 100 105
CTC CCT AAA ACG AAA TTA GGG AAA GCG ATG ATT AAA AAA CTC AAA GTT 387 Leu Pro Lys Thr Lys Leu Gly Lys Ala Met He Lys Lys Leu Lys Val 110 115 120
TAT CGT GAT GAT AAG CAC CCT CAC ACC GCA CAA ACT AGC AAA AAG GAC 435 Tyr Arg Asp Asp Lys His Pro His Thr Ala Gin Thr Ser Lys Lys Asp 125 130 135
GCT AAA TGAGAAAAAT CTATGCTACC GGTAAAAGAA AAACCGCTAT CG 483
Ala Lys
140
(2) INFORMATION FOR SEQ ID NO: HE (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 141 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1188:
Met Thr Lys Thr Ala Lys Val Asn Asp He Val Arg Asp Trp Val Val
1 5 10 15
Leu Asp Ala Lys Asp Lys Val Phe Gly Arg Leu He Thr Glu He Ala
20 25 30
Val Leu Leu Arg Gly Lys His Arg Pro Phe Tyr Thr Pro Asn Val Asp
35 40 45
Cys Gly Asp Phe Val Val Val He Asn Ala Asn Lys Val Lys Phe Ser
50 55 60
Gly Met Lys Leu Glu Asp Lys Glu Tyr Phe Thr His Ser Gly Tyr Phe 65 70 75 80
Gly Ser Thr Lys Ser Lys Thr Leu Gin Glu Met Leu Glu Lys Ala Pro
85 90 95
Glu Lys Leu Tyr His Leu Ala Val Arg Gly Met Leu Pro Lys Thr Lys
100 105 110
Leu Gly Lys Ala Met He Lys Lys Leu Lys Val Tyr Arg Asp Asp Lys
115 120 125
His Pro His Thr Ala Gin Thr Ser Lys Lys Asp Ala Lys 130 135 140
(2) INFORMATION FOR SEQ ID NO: 1189:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2107 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...2058 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1189:
TAGCTTTTGA TTGAAAGC ATG GGT TCT TAC TTT ATG GAG TGT CCA ATG AAA 51
Met Gly Ser Tyr Phe Met Glu Cys Pro Met Lys 1 5 10
AAG AAA GCT AAC GAA GAA AAA GCC CAA AAA AGA GCT AAA ACA GAA GCC 99 Lys Lys Ala Asn Glu Glu Lys Ala Gin Lys Arg Ala Lys Thr Glu Ala 15 20 25 AAA GCA GAA GCC ACA CAA GAA AAT AAA ACT AAA GAA AAC AAT AAA GCC 147 Lys Ala Glu Ala Thr Gin Glu Asn Lys Thr Lys Glu Asn Asn Lys Ala 30 35 40
AAA GAA AGC AAA ATT AAA GAA AGC AAA ATC AAA GAA GCT AAA GCG AAA 195 Lys Glu Ser Lys He Lys Glu Ser Lys He Lys Glu Ala Lys Ala Lys 45 50 55
GAA CCT ATT CCT GTT AAA AAG CTT AGT TTT AAT GAA GCG TTA GAA GAA 243 Glu Pro He Pro Val Lys Lys Leu Ser Phe Asn Glu Ala Leu Glu Glu 60 65 70 75
TTG TTC GCT AAT TCC TTA AGC GAT TGC GTT TCT TAT GAG TCC ATC ATT 291 Leu Phe Ala Asn Ser Leu Ser Asp Cys Val Ser Tyr Glu Ser He He 80 85 90
CAA ATC AGC GCG AAA GTC CCC ACT CTA GCC CAA ATC AAA AAA ATC AAA 339 Gin He Ser Ala Lys Val Pro Thr Leu Ala Gin He Lys Lys He Lys 95 100 105
GAA TTG TGC CAA AAA TAC CAA AAG AAA TTA GTC AGC TCT TCA GAA TAC 387 Glu Leu Cys Gin Lys Tyr Gin Lys Lys Leu Val Ser Ser Ser Glu Tyr 110 115 120
GCT AAA AAA CTC AAT GCG ATT GAC AAG ATT AAA AAA ACC GAA GAA AAG 435 Ala Lys Lys Leu Asn Ala He Asp Lys He Lys Lys Thr Glu Glu Lys 125 130 135
CAA AAA GTT TTA GAT GAA GAA TTA GAA GAT GGC TAT GAC TTT TTG AAA 483 Gin Lys Val Leu Asp Glu Glu Leu Glu Asp Gly Tyr Asp Phe Leu Lys 140 145 150 155
GAA AAG GAT TTT TTA GAG TGG AGC AGA AGC GAT AGC CCA GTG CGC ATG 531 Glu Lys Asp Phe Leu Glu Trp Ser Arg Ser Asp Ser Pro Val Arg Met 160 165 170
TAT TTG CGC GAA ATG GGG GAT ATA AAA CTT TTA AGC AAA GAT GAA GAG 579 Tyr Leu Arg Glu Met Gly Asp He Lys Leu Leu Ser Lys Asp Glu Glu 175 180 185
ATT GAA TTG AGC AAG CAA ATC CGC TTG GGT GAA GAC ATT ATT TTA GAC 627 He Glu Leu Ser Lys Gin He Arg Leu Gly Glu Asp He He Leu Asp 190 195 200
GCG ATC TGC TCG GTG CCG TAT TTG ATT GAT TTT ATC TAT GCG TAT AAA 675 Ala He Cys Ser Val Pro Tyr Leu He Asp Phe He Tyr Ala Tyr Lys 205 210 215
GAC GCT TTA ATC AAT CGT GAA AGA AGG GTT AAA GAG CTT TTC AGG AGC 723 Asp Ala Leu He Asn Arg Glu Arg Arg Val Lys Glu Leu Phe Arg Ser 220 225 230 235
TTT GAT GAT GAC GAT GAA AAT AGC GTG AGC GAT TCT AAA AAA GAT GAA 771 Phe Asp Asp Asp Asp Glu Asn Ser Val Ser Asp Ser Lys Lys Asp Glu 240 245 250 GAC AAC GAA GAA GAT GAA GAA AAC GAA GAA AGG AAA AAA GTC GTT TCT 819 Asp Asn Glu Glu Asp Glu Glu Asn Glu Glu Arg Lys Lys Val Val Ser 255 260 265
GAA AAA GAC AAG AAG CGT GTA GAA AAG GTT CAA GAA AGC TTT AAA GCC 867 Glu Lys Asp Lys Lys Arg Val Glu Lys Val Gin Glu Ser Phe Lys Ala 270 275 280
CTA GAC AAG GCT AAA AAA GAA TGG CTT AAA GCC CTT GAA GCC CCC ATA 915 Leu Asp Lys Ala Lys Lys Glu Trp Leu Lys Ala Leu Glu Ala Pro He 285 290 295
GAT GAA AGA GAA GAC GAA TTG GTG CGT TCA TTG ACC CTA GCT TAC AAA 963 Asp Glu Arg Glu Asp Glu Leu Val Arg Ser Leu Thr Leu Ala Tyr Lys 300 305 310 315
CGC CAA ACA CTC AAA GAC AGA CTC TAT GAT TTA GAA CCT ACC AGC AAA 1011 Arg Gin Thr Leu Lys Asp Arg Leu Tyr Asp Leu Glu Pro Thr Ser Lys 320 325 330
CTG ATT AAT GAA TTA GTC AAA ACG ATG GAA ACC ACT TTA AAA AGC GGC 1059 Leu He Asn Glu Leu Val Lys Thr Met Glu Thr Thr Leu Lys Ser Gly 335 340 345
GAT GGG TTT GAA AAA GAG TTG AAA CGC TTG GAA TAC AAA CTG CCC TTA 1107 Asp Gly Phe Glu Lys Glu Leu Lys Arg Leu Glu Tyr Lys Leu Pro Leu 350 355 360
TTC AAT GAC ACT CTC ATC GCA AAC CAT AAA AAA ATC CTT GCC AAT ATC 1155 Phe Asn Asp Thr Leu He Ala Asn His Lys Lys He Leu Ala Asn He 365 370 375
ACT AAC ATG ACT AAA GAA GAT ATT ATC GCT CAA GTG CCA GAA GCG ACT 1203 Thr Asn Met Thr Lys Glu Asp He He Ala Gin Val Pro Glu Ala Thr 380 385 390 395
ATG GTG AGC GTG TAT ATG GAT CTT AAA AAG CTT TTT TTG ACT AAA GAA 1251 Met Val Ser Val Tyr Met Asp Leu Lys Lys Leu Phe Leu Thr Lys Glu 400 405 410
GCG AGC GAA GAA GGC TTT GAT CTA GCC CCC AAC AAG CTA AAA GAA ATT 1299 Ala Ser Glu Glu Gly Phe Asp Leu Ala Pro Asn Lys Leu Lys Glu He 415 420 425
TTA GAG CAA ATC AAA AGA GGG AAG TTG ATT TCC GAT CGC GCT AAA AAC 1347 Leu Glu Gin He Lys Arg Gly Lys Leu He Ser Asp Arg Ala Lys Asn 430 435 440
AAA ATG GCT AAA TCC AAT TTA AGG TTG GTG GTG AGC ATC GCT AAA CGA 1395 Lys Met Ala Lys Ser Asn Leu Arg Leu Val Val Ser He Ala Lys Arg 445 450 455
TTC ACG AGC AGA GGC TTA CCA TTC TTG GAT TTG ATT CAA GAG GGC AAT 1443 Phe Thr Ser Arg Gly Leu Pro Phe Leu Asp Leu He Gin Glu Gly Asn 460 465 470 475 ATT GGC TTG ATG AAA GCG GTG GAT AAG TTT GAG CAT GAA AAG GGC TTC 1491 He Gly Leu Met Lys Ala Val Asp Lys Phe Glu His Glu Lys Gly Phe 480 485 490
AAG TTT TCT ACC TAT GCG ACC TGG TGG ATC AAA CAA GCT ATC AGC AGA 1539 Lys Phe Ser Thr Tyr Ala Thr Trp Trp He Lys Gin Ala He Ser Arg 495 500 505
GCC ATA GCC GAT CAG GCC CGC ACT ATC CGC ATC CCC ATT CAC ATG ATT 1587 Ala He Ala Asp Gin Ala Arg Thr He Arg He Pro He His Met He 510 515 520
GAT ACG ATT AAT CGC ATC AAT AAA GTC ATG CGC AAA CAC ATT CAA GAA 1635 Asp Thr He Asn Arg He Asn Lys Val Met Arg Lys His He Gin Glu 525 530 535
AAC GGC AAA GAG CCT GAT TTA GAA GTG GTG GCT GAA GAA GTG GGG CTT 1683 Asn Gly Lys Glu Pro Asp Leu Glu Val Val Ala Glu Glu Val Gly Leu 540 545 550 555
TCG TTA GAT AAA GTG AAG AAT GTG ATT AAG GTG ACT AAA GAG CCT ATC 1731 Ser Leu Asp Lys Val Lys Asn Val He Lys Val Thr Lys Glu Pro He 560 565 570
AGT TTG GAA ACC CCA GTC GGC AAT GAT GAT GAT GGC AAG TTT GGG GAT 1779 Ser Leu Glu Thr Pro Val Gly Asn Asp Asp Asp Gly Lys Phe Gly Asp 575 580 585
TTC GTG GAA GAT AAG AAT ATC GTC AGC TCC ATT GAT CAC ATC ATG CGA 1827 Phe Val Glu Asp Lys Asn He Val Ser Ser He Asp His He Met Arg 590 595 600
GAA GAT TTG AAA GCA CAA ATT GAA AGC GTT TTG GAT CAG TTG AAT GAG 1875 Glu Asp Leu Lys Ala Gin He Glu Ser Val Leu Asp Gin Leu Asn Glu 605 610 615
CGA GAA AAA GCG GTG ATC CGC ATG CGT TTT GGG CTT TTA GAC GAT GAA 1923 Arg Glu Lys Ala Val He Arg Met Arg Phe Gly Leu Leu Asp Asp Glu 620 625 630 635
AGC GAT CGA ACT TTA GAA GAA ATT GGC AAG GAA TTG AAT GTT ACT AGA 1971 Ser Asp Arg Thr Leu Glu Glu He Gly Lys Glu Leu Asn Val Thr Arg 640 645 650
GAA AGG GTG CGC CAG ATT GAA AGC TCT GCG ATT AAA AAA TTG AGA AGC 2019 Glu Arg Val Arg Gin He Glu Ser Ser Ala He Lys Lys Leu Arg Ser 655 660 665
CCG CAG TAC GGG CGC ATT TTA AGA AAC TAT TTG CGC ATT TGATGTTAAG GT 2070 Pro Gin Tyr Gly Arg He Leu Arg Asn Tyr Leu Arg He 670 675 680
TTCTCTAAAG CATGCGTTAT TTTCTTGTAG TTTTCTT 2107
(2) INFORMATION FOR SEQ ID NO: 1190: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 680 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1190:
Met Gly Ser Tyr Phe Met Glu Cys Pro Met Lys Lys Lys Ala Asn Glu
1 5 10 15
Glu Lys Ala Gin Lys Arg Ala Lys Thr Glu Ala Lys Ala Glu Ala Thr
20 25 30
Gin Glu Asn Lys Thr Lys Glu Asn Asn Lys Ala Lys Glu Ser Lys He
35 40 45
Lys Glu Ser Lys He Lys Glu Ala Lys Ala Lys Glu Pro He Pro Val
50 55 60
Lys Lys Leu Ser Phe Asn Glu Ala Leu Glu Glu Leu Phe Ala Asn Ser 65 70 75 80
Leu Ser Asp Cys Val Ser Tyr Glu Ser He He Gin He Ser Ala Lys
85 90 95
Val Pro Thr Leu Ala Gin He Lys Lys He Lys Glu Leu Cys Gin Lys
100 105 110
Tyr Gin Lys Lys Leu Val Ser Ser Ser Glu Tyr Ala Lys Lys Leu Asn
115 120 125
Ala He Asp Lys He Lys Lys Thr Glu Glu Lys Gin Lys Val Leu Asp
130 135 140
Glu Glu Leu Glu Asp Gly Tyr Asp Phe Leu Lys Glu Lys Asp Phe Leu 145 150 155 160
Glu Trp Ser Arg Ser Asp Ser Pro Val Arg Met Tyr Leu Arg Glu Met
165 170 175
Gly Asp He Lys Leu Leu Ser Lys Asp Glu Glu He Glu Leu Ser Lys
180 185 190
Gin He Arg Leu Gly Glu Asp He He Leu Asp Ala He Cys Ser Val
195 200 205
Pro Tyr Leu He Asp Phe He Tyr Ala Tyr Lys Asp Ala Leu He Asn
210 215 220
Arg Glu Arg Arg Val Lys Glu Leu Phe Arg Ser Phe Asp Asp Asp Asp 225 230 235 240
Glu Asn Ser Val Ser Asp Ser Lys Lys Asp Glu Asp Asn Glu Glu Asp
245 250 255
Glu Glu Asn Glu Glu Arg Lys Lys Val Val Ser Glu Lys Asp Lys Lys
260 265 270
Arg Val Glu Lys Val Gin Glu Ser Phe Lys Ala Leu Asp Lys Ala Lys
275 280 285
Lys Glu Trp Leu Lys Ala Leu Glu Ala Pro He Asp Glu Arg Glu Asp
290 295 300
Glu Leu Val Arg Ser Leu Thr Leu Ala Tyr Lys Arg Gin Thr Leu Lys 305 310 315 320
Asp Arg Leu Tyr Asp Leu Glu Pro Thr Ser Lys Leu He Asn Glu Leu
325 330 335
Val Lys Thr Met Glu Thr Thr Leu Lys Ser Gly Asp Gly Phe Glu Lys
340 345 350
Glu Leu Lys Arg Leu Glu Tyr Lys Leu Pro Leu Phe Asn Asp Thr Leu 355 360 365
He Ala Asn His Lys Lys He Leu Ala Asn He Thr Asn Met Thr Lys
370 375 380
Glu Asp He He Ala Gin Val Pro Glu Ala Thr Met Val Ser Val Tyr 385 390 395 400
Met Asp Leu Lys Lys Leu Phe Leu Thr Lys Glu Ala Ser Glu Glu Gly
405 410 415
Phe Asp Leu Ala Pro Asn Lys Leu Lys Glu He Leu Glu Gin He Lys
420 425 430
Arg Gly Lys Leu He Ser Asp Arg Ala Lys Asn Lys Met Ala Lys Ser
435 440 445
Asn Leu Arg Leu Val Val Ser He Ala Lys Arg Phe Thr Ser Arg Gly
450 455 460
Leu Pro Phe Leu Asp Leu He Gin Glu Gly Asn He Gly Leu Met Lys 465 470 475 480
Ala Val Asp Lys Phe Glu His Glu Lys Gly Phe Lys Phe Ser Thr Tyr
485 490 495
Ala Thr Trp Trp He Lys Gin Ala He Ser Arg Ala He Ala Asp Gin
500 505 510
Ala Arg Thr He Arg He Pro He His Met He Asp Thr He Asn Arg
515 520 525
He Asn Lys Val Met Arg Lys His He Gin Glu Asn Gly Lys Glu Pro
530 535 540
Asp Leu Glu Val Val Ala Glu Glu Val Gly Leu Ser Leu Asp Lys Val 545 550 555 560
Lys Asn Val He Lys Val Thr Lys Glu Pro He Ser Leu Glu Thr Pro
565 570 575
Val Gly Asn Asp Asp Asp Gly Lys Phe Gly Asp Phe Val Glu Asp Lys
580 585 590
Asn He Val Ser Ser He Asp His He Met Arg Glu Asp Leu Lys Ala
595 600 605
Gin He Glu Ser Val Leu Asp Gin Leu Asn Glu Arg Glu Lys Ala Val
610 615 620
He Arg Met Arg Phe Gly Leu Leu Asp Asp Glu Ser Asp Arg Thr Leu 625 630 635 640
Glu Glu He Gly Lys Glu Leu Asn Val Thr Arg Glu Arg Val Arg Gin
645 650 655
He Glu Ser Ser Ala He Lys Lys Leu Arg Ser Pro Gin Tyr Gly Arg
660 665 670
He Leu Arg Asn Tyr Leu Arg He 675 680
(2) INFORMATION FOR SEQ ID NO: 1191:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 745 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25...717 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1191:
AAGAATACGT GTGATTGGGA GAAA ATG GTG CAA AAA ATT GGC ATT TTA GGG 51
Met Val Gin Lys He Gly He Leu Gly 1 5
GCG ATG AGA GAA GAA ATA ACC CCT ATA CTA GAA TTG TTT GGC GTG GAT 99 Ala Met Arg Glu Glu He Thr Pro He Leu Glu Leu Phe Gly Val Asp 10 15 20 25
TTT GAA GAG ATC CCT TTA GGG GGG AAT GTC TTC CAT AAA GGC GTT TAT 147 Phe Glu Glu He Pro Leu Gly Gly Asn Val Phe His Lys Gly Val Tyr 30 35 40
CAC AAC AAG GAA ATC ATT GTC GCT TAT AGC AAG ATT GGC AAG GTG CAT 195 His Asn Lys Glu He He Val Ala Tyr Ser Lys He Gly Lys Val His 45 50 55
TCC ACT TTA ACC ACA ACG AGC ATG ATT TTA GCG TTT GGC GTT CAA AAG 243 Ser Thr Leu Thr Thr Thr Ser Met He Leu Ala Phe Gly Val Gin Lys 60 65 70
GTG CTT TTT AGC GGG GTG GCT GGA AGC TTA GTT AAA GAT TTA AAA ATC 291 Val Leu Phe Ser Gly Val Ala Gly Ser Leu Val Lys Asp Leu Lys He 75 80 85
AAT GAT TTA CTA GTG GCT ATT CAA TTA GTC CAG CAT GAT GTG GAT TTG 339 Asn Asp Leu Leu Val Ala He Gin Leu Val Gin His Asp Val Asp Leu 90 95 100 105
AGC GCG TTT GAT CAC CCT TTA GGG TTC ATC CCA GAA AGC GCG ATT TTT 387 Ser Ala Phe Asp His Pro Leu Gly Phe He Pro Glu Ser Ala He Phe 110 115 120
ATT GAA ACG AGC GAA AGT TTG AAC GCT TTG GCT AAA GAA GTC GCT AAT 435 He Glu Thr Ser Glu Ser Leu Asn Ala Leu Ala Lys Glu Val Ala Asn 125 130 135
GAA CAG CAT ATC GTG CTC AAA GAA GGC GTC ATC GCA TCA GGC GAT CAG 483 Glu Gin His He Val Leu Lys Glu Gly Val He Ala Ser Gly Asp Gin 140 145 150
TTT GTG CAT AGC AAA GAA AGG AAA GAG TTT TTA GTT AGC GAG TTT AAA 531 Phe Val His Ser Lys Glu Arg Lys Glu Phe Leu Val Ser Glu Phe Lys 155 160 165
GCG AGC GCG GTG GAA ATG GAG GGG GCG AGC GTG GCG TTT GTG TGC CAA 579 Ala Ser Ala Val Glu Met Glu Gly Ala Ser Val Ala Phe Val Cys Gin 170 175 180 185
AAA TTT GGC GTG CCA TGC TGT GTG TTA AGG AGC ATT AGC GAT AAC GCT 627 Lys Phe Gly Val Pro Cys Cys Val Leu Arg Ser He Ser Asp Asn Ala 190 195 200
GAT GAG GAA GCT AAC ATG AGC TTT GAT GCG TTT TTA GAA AAA AGC GCT 675 Asp Glu Glu Ala Asn Met Ser Phe Asp Ala Phe Leu Glu Lys Ser Ala 205 210 215
CAA ACT TCA GCG AAA TTC TTA AAA AGC ATG GTG GAT GAG CTT TAGGGTTTG 726 Gin Thr Ser Ala Lys Phe Leu Lys Ser Met Val Asp Glu Leu 220 225 230
TTTTTATAGA GGGGTGGAA 745
(2) INFORMATION FOR SEQ ID NO: 1192:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 231 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1192:
Met Val Gin Lys He Gly He Leu Gly Ala Met Arg Glu Glu He Thr
1 5 10 15
Pro He Leu Glu Leu Phe Gly Val Asp Phe Glu Glu He Pro Leu Gly
20 25 30
Gly Asn Val Phe His Lys Gly Val Tyr His Asn Lys Glu He He Val
35 40 45
Ala Tyr Ser Lys He Gly Lys Val His Ser Thr Leu Thr Thr Thr Ser
50 55 60
Met He Leu Ala Phe Gly Val Gin Lys Val Leu Phe Ser Gly Val Ala 65 70 75 80
Gly Ser Leu Val Lys Asp Leu Lys He Asn Asp Leu Leu Val Ala He
85 90 95
Gin Leu Val Gin His Asp Val Asp Leu Ser Ala Phe Asp His Pro Leu
100 105 110
Gly Phe He Pro Glu Ser Ala He Phe He Glu Thr Ser Glu Ser Leu
115 120 125
Asn Ala Leu Ala Lys Glu Val Ala Asn Glu Gin His He Val Leu Lys
130 135 140
Glu Gly Val He Ala Ser Gly Asp Gin Phe Val His Ser Lys Glu Arg 145 150 155 160
Lys Glu Phe Leu Val Ser Glu Phe Lys Ala Ser Ala Val Glu Met Glu
165 170 175
Gly Ala Ser Val Ala Phe Val Cys Gin Lys Phe Gly Val Pro Cys Cys
180 185 190
Val Leu Arg Ser He Ser Asp Asn Ala Asp Glu Glu Ala Asn Met Ser
195 200 205
Phe Asp Ala Phe Leu Glu Lys Ser Ala Gin Thr Ser Ala Lys Phe Leu
210 215 220
Lys Ser Met Val Asp Glu Leu 225 230 (2) INFORMATION FOR SEQ ID NO: 1193:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1986 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 56...1945 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1193:
GGTGTCCTTA AACAGCAGGG TGAAAGAGAT TTTAAAAGAA AGCGCTCTGC ATTCT ATG 58
Met 1
CAA GAT AGT TTG CAT TTT AAG GTT AAT GAA GTG CAA GGG GTT TTA GAA 106 Gin Asp Ser Leu His Phe Lys Val Asn Glu Val Gin Gly Val Leu Glu 5 10 15
AAC ACT TAT ACG AGC ATG GGC ATT GTT AAA GAA ATG CTC CCT AAA GAC 154 Asn Thr Tyr Thr Ser Met Gly He Val Lys Glu Met Leu Pro Lys Asp 20 25 30
ACC AAA AGA GAA ATC AAA ATC GGC TTG TTA AAA AAC TTC ATT TTA GCC 202 Thr Lys Arg Glu He Lys He Gly Leu Leu Lys Asn Phe He Leu Ala 35 40 45
AAT TCG CAT GTC GCT GGG GTG AGC ATG TTT TTT AAA GGC AGA GAA GAT 250 Asn Ser His Val Ala Gly Val Ser Met Phe Phe Lys Gly Arg Glu Asp 50 55 60 65
TTA AGA TTA ACG CTT TTA AGG GAT AAC AAT ACG ATT AAG CTA GTG GAA 298 Leu Arg Leu Thr Leu Leu Arg Asp Asn Asn Thr He Lys Leu Val Glu 70 75 80
AAT CCG TCA TTA GAG AAT AGC CCT TTA GCG CAA AAA GCG ATG AAA AAT 346 Asn Pro Ser Leu Glu Asn Ser Pro Leu Ala Gin Lys Ala Met Lys Asn 85 90 95
AAA GAA ATT TCT AAA AGT TTG GGT TAT TAT AGG AAA ATG CCT AAT GGG 394 Lys Glu He Ser Lys Ser Leu Gly Tyr Tyr Arg Lys Met Pro Asn Gly 100 105 110
GCG GAA GTT TAT GGG GTG GAT ATT CTT TTA CCT TTA TTG AAT GAG AAC 442 Ala Glu Val Tyr Gly Val Asp He Leu Leu Pro Leu Leu Asn Glu Asn 115 120 125 GCT CAA GAG GTT GTA GGG GCT TTG ATG ATT TTT ATT TCC ATT GAC AGC 490 Ala Gin Glu Val Val Gly Ala Leu Met He Phe He Ser He Asp Ser 130 135 140 145
TTC AGC AAT GAA ATC ACT AAA AAC AGG AGC GAT TTA TTT TTA ATT GGC 538 Phe Ser Asn Glu He Thr Lys Asn Arg Ser Asp Leu Phe Leu He Gly 150 155 160
ACT AAA GGT AAA GTG CTT TTG AGC GCG AAT AAG AGT TTG CAA GAC AAA 586 Thr Lys Gly Lys Val Leu Leu Ser Ala Asn Lys Ser Leu Gin Asp Lys 165 170 175
CCT ATC GCA GAA ATT TAT AAG AGC GTG CCT AAA GCC ACC AAC GAA GTG 634 Pro He Ala Glu He Tyr Lys Ser Val Pro Lys Ala Thr Asn Glu Val 180 185 190
ATG GCT ATT TTA GAA AAC GGC TCT AAA GCG ACT TTA GAA TAC TTA GAT 682 Met Ala He Leu Glu Asn Gly Ser Lys Ala Thr Leu Glu Tyr Leu Asp 195 200 205
CCC TTT AGC CAT AAG GAA AAT TTT TTA GCC GTT GAA ACC TTT AAA ATG 730 Pro Phe Ser His Lys Glu Asn Phe Leu Ala Val Glu Thr Phe Lys Met 210 215 220 225
CTA GGC AAA ACA GAA AGT AAA GAC AAT CTT AAT TGG ATG ATC GCT TTA 778 Leu Gly Lys Thr Glu Ser Lys Asp Asn Leu Asn Trp Met He Ala Leu 230 235 240
ATC ATT GAA AAA GAC AAG GTC TAT GAG CAA GTA GGC TCG GTG CGT TTT 826 He He Glu Lys Asp Lys Val Tyr Glu Gin Val Gly Ser Val Arg Phe 245 250 255
GTG GTG ATC ATA GCG AGC GCA ATC ATG GTG TTA GCC TTG ATT ATA GCG 874 Val Val He He Ala Ser Ala He Met Val Leu Ala Leu He He Ala 260 265 270
ATC ACT CTC TTA ATG CGA GCG ATC GTG AGC AGT CGT TTG GAA GCC GTT 922 He Thr Leu Leu Met Arg Ala He Val Ser Ser Arg Leu Glu Ala Val 275 280 285
TCT AGC ACC TTG TCT CAT TTC TTT AAA TTA TTG AAC AAT CAA GCC AAT 970 Ser Ser Thr Leu Ser His Phe Phe Lys Leu Leu Asn Asn Gin Ala Asn 290 295 300 305
TCT AGC GGT ATT AAA TTG ATT GAA GCG AAA TCC AAT GAC GAG TTA GGC 1018 Ser Ser Gly He Lys Leu He Glu Ala Lys Ser Asn Asp Glu Leu Gly 310 315 320
CGC ATG CAA ACA GCG ATC AAT AAA AAT ATC TTG CAA ACC CAA AAA ATC 1066 Arg Met Gin Thr Ala He Asn Lys Asn He Leu Gin Thr Gin Lys He 325 330 335
ATG CAA GAA GAC AGG CAA GCC GTC CAA GAC ACC ATT AAA GTG GTT TCA 1114 Met Gin Glu Asp Arg Gin Ala Val Gin Asp Thr He Lys Val Val Ser 340 345 350 GAT GTG AAA GCA GGG AAT TTT GCG GTG CGC ATC ACG GCT GAG CCC GCA 1162 Asp Val Lys Ala Gly Asn Phe Ala Val Arg He Thr Ala Glu Pro Ala 355 360 365
AGC CCT GAT TTG AAA GAA TTG AGG GAC GCG CTA AAT GGG ATC ATG GAT 1210 Ser Pro Asp Leu Lys Glu Leu Arg Asp Ala Leu Asn Gly He Met Asp 370 375 380 385
TAT TTG CAA GAA AGC GTA GGG ACT CAC ATG CCA AGC ATT TTC AAA ATC 1258 Tyr Leu Gin Glu Ser Val Gly Thr His Met Pro Ser He Phe Lys He 390 395 400
TTT GAA AGC TAT TCT GGT TTG GAT TTT AGA GGC CGG ATC CAA AAC GCT 1306 Phe Glu Ser Tyr Ser Gly Leu Asp Phe Arg Gly Arg He Gin Asn Ala 405 410 415
TCG GGT AGG GTG GAA CTG GTT ACT AAC GCT TTA GGG CAA GAA ATC CAA 1354 Ser Gly Arg Val Glu Leu Val Thr Asn Ala Leu Gly Gin Glu He Gin 420 425 430
AAA ATG CTA GAA ACT TCG TCT AAT TTT GCC AAA GAT TTA GCG AAC GAT 1402 Lys Met Leu Glu Thr Ser Ser Asn Phe Ala Lys Asp Leu Ala Asn Asp 435 440 445
AGC GCG AAT TTA AAA GAG TGC GTG CAA AAT TTA GAA AAA GCT TCA AAC 1450 Ser Ala Asn Leu Lys Glu Cys Val Gin Asn Leu Glu Lys Ala Ser Asn 450 455 460 465
TCC CAA CAC AAA AGC TTG ATG GAA ACT TCC AAA ACG ATA GAA AAT ATC 1498 Ser Gin His Lys Ser Leu Met Glu Thr Ser Lys Thr He Glu Asn He 470 475 480
ACC ACT TCC ATT CAA GGC GTG AGC TCT CAA AGT GAA GCC ATG ATT GAA 1546 Thr Thr Ser He Gin Gly Val Ser Ser Gin Ser Glu Ala Met He Glu 485 490 495
CAA GGG CAA GAC ATT AAA AGC ATT GTA GAA ATC ATT AGA GAT ATT GCT 1594 Gin Gly Gin Asp He Lys Ser He Val Glu He He Arg Asp He Ala 500 505 510
GAT CAA ACC AAT CTT TTA GCC TTA AAC GCC GCT ATT GAA GCC GCA AGG 1642 Asp Gin Thr Asn Leu Leu Ala Leu Asn Ala Ala He Glu Ala Ala Arg 515 520 525
GCC GGC GAG CAT GGC AGA GGC TTT GCG GTG GTG GCT GAT GAG GTA AGA 1690 Ala Gly Glu His Gly Arg Gly Phe Ala Val Val Ala Asp Glu Val Arg 530 535 540 545
AAG CTC GCT GAA AGG ACG CAA AAA TCG CTC AGC GAG ATT GAA GCC AAT 1738 Lys Leu Ala Glu Arg Thr Gin Lys Ser Leu Ser Glu He Glu Ala Asn 550 555 560
ATC AAT ATT TTA GTG CAA AGC ATT TCA GAC ACG AGC GAA AGC ATT AAA 1786 He Asn He Leu Val Gin Ser He Ser Asp Thr Ser Glu Ser He Lys 565 570 575 AAC CAG GTT AAA GAA GTG GAA GAA ATC AAC GCT TCT ATT GAA GCC TTA 1834 Asn Gin Val Lys Glu Val Glu Glu He Asn Ala Ser He Glu Ala Leu 580 585 590
AGA TCG GTT ACT GAG GGC AAT CTA AAA ATC GCT AGC GAT TCT TTA GAA 1882 Arg Ser Val Thr Glu Gly Asn Leu Lys He Ala Ser Asp Ser Leu Glu 595 600 605
ATC AGT CAA GAA ATT GAC AAA GTT TCT AAC GAT ATT TTA GAA GAT GTG 1930 He Ser Gin Glu He Asp Lys Val Ser Asn Asp He Leu Glu Asp Val 610 615 620 625
AAT AAA AAG CAG TTT TAATGCTCAT TCATATTTGC TGCTCAGTGG ATAACCTCTA T 1986 Asn Lys Lys Gin Phe 630
1986
(2) INFORMATION FOR SEQ ID NO: 1194:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 630 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1194:
Met Gin Asp Ser Leu His Phe Lys Val Asn Glu Val Gin Gly Val Leu
1 5 10 15
Glu Asn Thr Tyr Thr Ser Met Gly He Val Lys Glu Met Leu Pro Lys
20 25 30
Asp Thr Lys Arg Glu He Lys He Gly Leu Leu Lys Asn Phe He Leu
35 40 45
Ala Asn Ser His Val Ala Gly Val Ser Met Phe Phe Lys Gly Arg Glu
50 55 60
Asp Leu Arg Leu Thr Leu Leu Arg Asp Asn Asn Thr He Lys Leu Val 65 70 75 80
Glu Asn Pro Ser Leu Glu Asn Ser Pro Leu Ala Gin Lys Ala Met Lys
85 90 95
Asn Lys Glu He Ser Lys Ser Leu Gly Tyr Tyr Arg Lys Met Pro Asn
100 105 110
Gly Ala Glu Val Tyr Gly Val Asp He Leu Leu Pro Leu Leu Asn Glu
115 120 125
Asn Ala Gin Glu Val Val Gly Ala Leu Met He Phe He Ser He Asp
130 135 140
Ser Phe Ser Asn Glu He Thr Lys Asn Arg Ser Asp Leu Phe Leu He 145 150 155 160
Gly Thr Lys Gly Lys Val Leu Leu Ser Ala Asn Lys Ser Leu Gin Asp
165 170 175
Lys Pro He Ala Glu He Tyr Lys Ser Val Pro Lys Ala Thr Asn Glu
180 185 190
Val Met Ala He Leu Glu Asn Gly Ser Lys Ala Thr Leu Glu Tyr Leu 195 200 205
Asp Pro Phe Ser His Lys Glu Asn Phe Leu Ala Val Glu Thr Phe Lys
210 215 220
Met Leu Gly Lys Thr Glu Ser Lys Asp Asn Leu Asn Trp Met He Ala 225 230 235 240
Leu He He Glu Lys Asp Lys Val Tyr Glu Gin Val Gly Ser Val Arg
245 250 255
Phe Val Val He He Ala Ser Ala He Met Val Leu Ala Leu He He
260 265 270
Ala He Thr Leu Leu Met Arg Ala He Val Ser Ser Arg Leu Glu Ala
275 280 285
Val Ser Ser Thr Leu Ser His Phe Phe Lys Leu Leu Asn Asn Gin Ala
290 295 300
Asn Ser Ser Gly He Lys Leu He Glu Ala Lys Ser Asn Asp Glu Leu 305 310 315 320
Gly Arg Met Gin Thr Ala He Asn Lys Asn He Leu Gin Thr Gin Lys
325 330 335
He Met Gin Glu Asp Arg Gin Ala Val Gin Asp Thr He Lys Val Val
340 345 350
Ser Asp Val Lys Ala Gly Asn Phe Ala Val Arg He Thr Ala Glu Pro
355 360 365
Ala Ser Pro Asp Leu Lys Glu Leu Arg Asp Ala Leu Asn Gly He Met
370 375 380
Asp Tyr Leu Gin Glu Ser Val Gly Thr His Met Pro Ser He Phe Lys 385 390 395 400
He Phe Glu Ser Tyr Ser Gly Leu Asp Phe Arg Gly Arg He Gin Asn
405 410 415
Ala Ser Gly Arg Val Glu Leu Val Thr Asn Ala Leu Gly Gin Glu He
420 425 430
Gin Lys Met Leu Glu Thr Ser Ser Asn Phe Ala Lys Asp Leu Ala Asn
435 440 445
Asp Ser Ala Asn Leu Lys Glu Cys Val Gin Asn Leu Glu Lys Ala Ser
450 455 460
Asn Ser Gin His Lys Ser Leu Met Glu Thr Ser Lys Thr He Glu Asn 465 470 475 480
He Thr Thr Ser He Gin Gly Val Ser Ser Gin Ser Glu Ala Met He
485 490 495
Glu Gin Gly Gin Asp He Lys Ser He Val Glu He He Arg Asp He
500 505 510
Ala Asp Gin Thr Asn Leu Leu Ala Leu Asn Ala Ala He Glu Ala Ala
515 520 525
Arg Ala Gly Glu His Gly Arg Gly Phe Ala Val Val Ala Asp Glu Val
530 535 540
Arg Lys Leu Ala Glu Arg Thr Gin Lys Ser Leu Ser Glu He Glu Ala 545 550 555 560
Asn He Asn He Leu Val Gin Ser He Ser Asp Thr Ser Glu Ser He
565 570 575
Lys Asn Gin Val Lys Glu Val Glu Glu He Asn Ala Ser He Glu Ala
580 585 590
Leu Arg Ser Val Thr Glu Gly Asn Leu Lys He Ala Ser Asp Ser Leu
595 600 605
Glu He Ser Gin Glu He Asp Lys Val Ser Asn Asp He Leu Glu Asp
610 615 620
Val Asn Lys Lys Gin Phe 625 630 (2) INFORMATION FOR SEQ ID NO: 1195:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1758 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 8...1702 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1195:
GAGATAA ATG ATG TTT TCT TCA ATG TTT GCT TCG TTG GGG ACT CGT ATC 49 Met Met Phe Ser Ser Met Phe Ala Ser Leu Gly Thr Arg He 1 5 10
ATG CTG GTC GTG TTA GCC GCT CTT TTA GGT TTA GGG GGG CTT TTT ATT 97 Met Leu Val Val Leu Ala Ala Leu Leu Gly Leu Gly Gly Leu Phe He 15 20 25 30
GGT TTT GTA AAG GTT ATG CAA AAA GAT GTG TTA GCG CAA CTC ATG GAG 145 Gly Phe Val Lys Val Met Gin Lys Asp Val Leu Ala Gin Leu Met Glu 35 40 45
CAT TTA GAA ACC GGG CAA TAC AAA AAG CGT GAA AAA ACG CTC GCT TAC 193 His Leu Glu Thr Gly Gin Tyr Lys Lys Arg Glu Lys Thr Leu Ala Tyr 50 55 60
ATG ACA AAA ATT ATT GAA CAG GGC ATT CAT GAG TAT TAC AAA AAT TTT 241 Met Thr Lys He He Glu Gin Gly He His Glu Tyr Tyr Lys Asn Phe 65 70 75
GAC AAT GCT ACT GCA AGA AAA ATG GCG TTA GAT TAT TTC AAA CGC ATC 289 Asp Asn Ala Thr Ala Arg Lys Met Ala Leu Asp Tyr Phe Lys Arg He 80 85 90
AAC GAC GAT AAG GGC ATG ATT TAT ATG GTG GTG GTG GAT AAA AAC GGG 337 Asn Asp Asp Lys Gly Met He Tyr Met Val Val Val Asp Lys Asn Gly 95 100 105 110
GTG GTA TTG TTT GAT CCG GTC AAT CCT AAA ACC GTA GNC CAA TCA GGG 385 Val Val Leu Phe Asp Pro Val Asn Pro Lys Thr Val Xaa Gin Ser Gly 115 120 125
CTT GAC GCT CAG AGC GTT GAT GGG GTG TAT TAT GTT AGG GGG TAT TTG 433 Leu Asp Ala Gin Ser Val Asp Gly Val Tyr Tyr Val Arg Gly Tyr Leu 130 135 140 GAG GCG GCC AAA AAA GGG GGA GGC TAC ACT TAT TAT AAA ATG CCT AAA 481 Glu Ala Ala Lys Lys Gly Gly Gly Tyr Thr Tyr Tyr Lys Met Pro Lys 145 150 155
TAC GAT GGA GGC GTA CCG GAG AAA AAA TTC GCC TAC TCG CAT TAT GAT 529 Tyr Asp Gly Gly Val Pro Glu Lys Lys Phe Ala Tyr Ser His Tyr Asp 160 165 170
GAA GTT TCT CAA ATG GTG ATC GCA ACG ACT TCC TAT TAC ACT GAC ATT 577 Glu Val Ser Gin Met Val He Ala Thr Thr Ser Tyr Tyr Thr Asp He 175 180 185 190
AAC ACA GAA AAT AAA GCG ATC AAA GAA GGC GTG AAT AAG GTT TTT GAT 625 Asn Thr Glu Asn Lys Ala He Lys Glu Gly Val Asn Lys Val Phe Asp 195 200 205
GAA AAC ACC ACG AAA TTA TTC CTT TGG ATA CTG ACA GCG ACG ATA GCG 673 Glu Asn Thr Thr Lys Leu Phe Leu Trp He Leu Thr Ala Thr He Ala 210 215 220
CTA GTG GTT TTG ACG CTC ATA TAC GCT AAA TTA AGG ATC GTG AAA CGC 721 Leu Val Val Leu Thr Leu He Tyr Ala Lys Leu Arg He Val Lys Arg 225 230 235
ATT GAT GAA CTG GTC CTT AAA ATC AAC GCT TTT AGC CGT GGG GAT AAG 769 He Asp Glu Leu Val Leu Lys He Asn Ala Phe Ser Arg Gly Asp Lys 240 245 250
GAT TTG AGA GCC AAA ATT GAT GTG GGT GAT CGC AAC GAT GAA ATC TCG 817 Asp Leu Arg Ala Lys He Asp Val Gly Asp Arg Asn Asp Glu He Ser 255 260 265 270
CAA GTG GGC CGT GGG ATC AAT TTG TTT GTG GAA AAC GCC CGC TTG ATT 865 Gin Val Gly Arg Gly He Asn Leu Phe Val Glu Asn Ala Arg Leu He 275 280 285
ATG GAA GAG ATT AAA GGG ATT TCC ACC CTC AAT AAA ACT TCA ATG GAT 913 Met Glu Glu He Lys Gly He Ser Thr Leu Asn Lys Thr Ser Met Asp 290 295 300
AAA TTA GTC CAA ATC ACG CAA GAA ACC CAA AAG AGC ATG AAA GAT TCC 961 Lys Leu Val Gin He Thr Gin Glu Thr Gin Lys Ser Met Lys Asp Ser 305 310 315
TCA ACC ACC CTA AAT TCC GTG AAA AAT AAA GCC ACT GAT ATA GCG AGC 1009 Ser Thr Thr Leu Asn Ser Val Lys Asn Lys Ala Thr Asp He Ala Ser 320 325 330
ATG ATG AAT GCT TCC ATA GAG CAA TCT CAA GGG TTA AGG AAG CGT TTG 1057 Met Met Asn Ala Ser He Glu Gin Ser Gin Gly Leu Arg Lys Arg Leu 335 340 345 350
ATT GAA ACG CAA GGG CTG GTC AAA GAG AGC AAG GAT GCG ATC GGG GAT 1105 He Glu Thr Gin Gly Leu Val Lys Glu Ser Lys Asp Ala He Gly Asp 355 360 365 TTA TTT TCT CAA ATC ACA GAG AGC GCG CAC ACT GAA GAG GAA CTC TCT 1153 Leu Phe Ser Gin He Thr Glu Ser Ala His Thr Glu Glu Glu Leu Ser 370 375 380
AGC AAA GTG GAG CAG CTA AGC CGT AAC GCT GAT GAT GTC AAA TCC ATT 1201 Ser Lys Val Glu Gin Leu Ser Arg Asn Ala Asp Asp Val Lys Ser He 385 390 395
CTG GAT ATT ATC AAT GAT ATT GCC GAT CAA ACG AAT TTA TTA GCC CTA 1249 Leu Asp He He Asn Asp He Ala Asp Gin Thr Asn Leu Leu Ala Leu 400 405 410
AAC GCT GCT ATT GAA GCC GCA AGG GCT GGC GAG CAT GGC AGA GGC TTT 1297 Asn Ala Ala He Glu Ala Ala Arg Ala Gly Glu His Gly Arg Gly Phe 415 420 425 430
GCG GTG GTG GCT GAT GAA GTT AGG AAT TTA GCC GGG CGC ACT CAA AAG 1345 Ala Val Val Ala Asp Glu Val Arg Asn Leu Ala Gly Arg Thr Gin Lys 435 440 445
TCT TTA GCC GAA ATC AAT TCC ACT ATC ATG GTG ATT GTC CAA GAA ATC 1393 Ser Leu Ala Glu He Asn Ser Thr He Met Val He Val Gin Glu He 450 455 460
AAT GCC GTG AGT TCG CAA ATG AAT CTC AAT TCG CAA AAA ATG GAG CGT 1441 Asn Ala Val Ser Ser Gin Met Asn Leu Asn Ser Gin Lys Met Glu Arg 465 470 475
TTG AGC GAT ATG AGT AAA AGC GTG CAA GAA ACT TAC GAA AAA ATG AGT 1489 Leu Ser Asp Met Ser Lys Ser Val Gin Glu Thr Tyr Glu Lys Met Ser 480 485 490
TCT AAT TTA AGC TCA GTC GTG TCA GAC AGC AAT CAA AGC ATG GAC GAT 1537 Ser Asn Leu Ser Ser Val Val Ser Asp Ser Asn Gin Ser Met Asp Asp 495 500 505 510
TAC GCC AAA TCC GGA CAC CAA ATT GAA GTT ATG GTA AGC GAT TTT GCA 1585 Tyr Ala Lys Ser Gly His Gin He Glu Val Met Val Ser Asp Phe Ala 515 520 525
GAG GTG GAA AAA GTG GCT TCT AAG ACT TTA GCG GAT TCT TCA GAT ATT 1633 Glu Val Glu Lys Val Ala Ser Lys Thr Leu Ala Asp Ser Ser Asp He 530 535 540
TTA AAC ATC GCT ACG CAT GTG AGT GGA ACG ACC ATG AAT TTA GAC AAA 1681 Leu Asn He Ala Thr His Val Ser Gly Thr Thr Met Asn Leu Asp Lys 545 550 555
CAA GTG AAT TTG TTT AAA ACT TAATCAGGGG GAGTTTATTA AAAAAGGGTT GGAT 1736 Gin Val Asn Leu Phe Lys Thr 560 565
TGTTAAAAGT TTCTGTGATC AC 1758
(2) INFORMATION FOR SEQ ID NO: 1196: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 565 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1196:
Met Met Phe Ser Ser Met Phe Ala Ser Leu Gly Thr Arg He Met Leu
1 5 10 15
Val Val Leu Ala Ala Leu Leu Gly Leu Gly Gly Leu Phe He Gly Phe
20 25 30
Val Lys Val Met Gin Lys Asp Val Leu Ala Gin Leu Met Glu His Leu
35 40 45
Glu Thr Gly Gin Tyr Lys Lys Arg Glu Lys Thr Leu Ala Tyr Met Thr
50 55 60
Lys He He Glu Gin Gly He His Glu Tyr Tyr Lys Asn Phe Asp Asn 65 70 75 80
Ala Thr Ala Arg Lys Met Ala Leu Asp Tyr Phe Lys Arg He Asn Asp
85 90 95
Asp Lys Gly Met He Tyr Met Val Val Val Asp Lys Asn Gly Val Val
100 105 110
Leu Phe Asp Pro Val Asn Pro Lys Thr Val Xaa Gin Ser Gly Leu Asp
115 120 125
Ala Gin Ser Val Asp Gly Val Tyr Tyr Val Arg Gly Tyr Leu Glu Ala
130 135 140
Ala Lys Lys Gly Gly Gly Tyr Thr Tyr Tyr Lys Met Pro Lys Tyr Asp 145 150 155 160
Gly Gly Val Pro Glu Lys Lys Phe Ala Tyr Ser His Tyr Asp Glu Val
165 170 175
Ser Gin Met Val He Ala Thr Thr Ser Tyr Tyr Thr Asp He Asn Thr
180 185 190
Glu Asn Lys Ala He Lys Glu Gly Val Asn Lys Val Phe Asp Glu Asn
195 200 205
Thr Thr Lys Leu Phe Leu Trp He Leu Thr Ala Thr He Ala Leu Val
210 215 220
Val Leu Thr Leu He Tyr Ala Lys Leu Arg He Val Lys Arg He Asp 225 230 235 240
Glu Leu Val Leu Lys He Asn Ala Phe Ser Arg Gly Asp Lys Asp Leu
245 250 255
Arg Ala Lys He Asp Val Gly Asp Arg Asn Asp Glu He Ser Gin Val
260 265 270
Gly Arg Gly He Asn Leu Phe Val Glu Asn Ala Arg Leu He Met Glu
275 280 285
Glu He Lys Gly He Ser Thr Leu Asn Lys Thr Ser Met Asp Lys Leu
290 295 300
Val Gin He Thr Gin Glu Thr Gin Lys Ser Met Lys Asp Ser Ser Thr 305 310 315 320
Thr Leu Asn Ser Val Lys Asn Lys Ala Thr Asp He Ala Ser Met Met
325 330 335
Asn Ala Ser He Glu Gin Ser Gin Gly Leu Arg Lys Arg Leu He Glu
340 345 350
Thr Gin Gly Leu Val Lys Glu Ser Lys Asp Ala He Gly Asp Leu Phe 355 360 365
Ser Gin He Thr Glu Ser Ala His Thr Glu Glu Glu Leu Ser Ser Lys
370 375 380
Val Glu Gin Leu Ser Arg Asn Ala Asp Asp Val Lys Ser He Leu Asp 385 390 395 400
He He Asn Asp He Ala Asp Gin Thr Asn Leu Leu Ala Leu Asn Ala
405 410 415
Ala He Glu Ala Ala Arg Ala Gly Glu His Gly Arg Gly Phe Ala Val
420 425 430
Val Ala Asp Glu Val Arg Asn Leu Ala Gly Arg Thr Gin Lys Ser Leu
435 440 445
Ala Glu He Asn Ser Thr He Met Val He Val Gin Glu He Asn Ala
450 455 460
Val Ser Ser Gin Met Asn Leu Asn Ser Gin Lys Met Glu Arg Leu Ser 465 470 475 480
Asp Met Ser Lys Ser Val Gin Glu Thr Tyr Glu Lys Met Ser Ser Asn
485 490 495
Leu Ser Ser Val Val Ser Asp Ser Asn Gin Ser Met Asp Asp Tyr Ala
500 505 510
Lys Ser Gly His Gin He Glu Val Met Val Ser Asp Phe Ala Glu Val
515 520 525
Glu Lys Val Ala Ser Lys Thr Leu Ala Asp Ser Ser Asp He Leu Asn
530 535 540
He Ala Thr His Val Ser Gly Thr Thr Met Asn Leu Asp Lys Gin Val 545 550 555 560
Asn Leu Phe Lys Thr 565
(2) INFORMATION FOR SEQ ID NO: 1197:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 525 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...474 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1197:
CCTTGAGATT GCTCT ATG GAA GCA TTC ATC ATG CTC GCT ATA TCA GTG GCT 51 Met Glu Ala Phe He Met Leu Ala He Ser Val Ala 1 5 10
TTA TTT TTC ACG GAA TTT AGG GTG GTT GAG GAA TCT TTC ATG CTC TTT 99 Leu Phe Phe Thr Glu Phe Arg Val Val Glu Glu Ser Phe Met Leu Phe 15 20 25 TGG GTT TCT TGC GTG ATT TGG ACT AAT TTA TCC ATT GAA GTT TTA TTG 147 Trp Val Ser Cys Val He Trp Thr Asn Leu Ser He Glu Val Leu Leu 30 35 40
AGG GTG GAA ATC CCT TTA ATC TCT TCC ATA ATC AAG CGG GCG TTT TCC 195 Arg Val Glu He Pro Leu He Ser Ser He He Lys Arg Ala Phe Ser 45 50 55 60
ACA AAC AAA TTG ATC CCA CGG CCC ACT TGC GAG ATT TCA TCG TTG CGA 243 Thr Asn Lys Leu He Pro Arg Pro Thr Cys Glu He Ser Ser Leu Arg 65 70 75
TCA CCC ACA TCA ATT TTG GCT CTC AAA TCC TTA TCC CCA CGG CTA AAA 291 Ser Pro Thr Ser He Leu Ala Leu Lys Ser Leu Ser Pro Arg Leu Lys 80 85 90
GCG TTG ATT TTA AGG ACC AGT TCA TCA ATG CGT TTC ACG ATC CTT AAT 339 Ala Leu He Leu Arg Thr Ser Ser Ser Met Arg Phe Thr He Leu Asn 95 100 105
TTA GCG TAT ATG AGC GTC AAA ACC ACT AGC GCT ATC GTC GCT GTC AGT 387 Leu Ala Tyr Met Ser Val Lys Thr Thr Ser Ala He Val Ala Val Ser 110 115 120
ATC CAA AGG AAT AAT TTC GTG GTG TTT TCA TCA AAA ACC TTA TTC ACG 435 He Gin Arg Asn Asn Phe Val Val Phe Ser Ser Lys Thr Leu Phe Thr 125 130 135 140
CCT TCT TTG ATC GCT TTA TTT TCT GTG TTA ATG TCA GTG TAATAGGAAG TC 486 Pro Ser Leu He Ala Leu Phe Ser Val Leu Met Ser Val 145 150
GTTGCGATCA CCATTTGAGA AACTTCATCA TAATGCGAG 525
(2) INFORMATION FOR SEQ ID NO: 1198:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 153 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1198:
Met Glu Ala Phe He Met Leu Ala He Ser Val Ala Leu Phe Phe Thr
1 5 10 15
Glu Phe Arg Val Val Glu Glu Ser Phe Met Leu Phe Trp Val Ser Cys
20 25 30
Val He Trp Thr Asn Leu Ser He Glu Val Leu Leu Arg Val Glu He
35 40 45
Pro Leu He Ser Ser He He Lys Arg Ala Phe Ser Thr Asn Lys Leu
50 55 60
He Pro Arg Pro Thr Cys Glu He Ser Ser Leu Arg Ser Pro Thr Ser 65 70 75 80
He Leu Ala Leu Lys Ser Leu Ser Pro Arg Leu Lys Ala Leu He Leu
85 90 95
Arg Thr Ser Ser Ser Met Arg Phe Thr He Leu Asn Leu Ala Tyr Met
100 105 110
Ser Val Lys Thr Thr Ser Ala He Val Ala Val Ser He Gin Arg Asn
115 120 125
Asn Phe Val Val Phe Ser Ser Lys Thr Leu Phe Thr Pro Ser Leu He
130 135 140
Ala Leu Phe Ser Val Leu Met Ser Val 145 150
(2) INFORMATION FOR SEQ ID NO: 1199:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1209 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25...1164 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1199:
AAATTCAATA AAAAAGGAAA AACC ATG CGC ATG CAA ACC AAA TTA ATC CAT 51
Met Arg Met Gin Thr Lys Leu He His
1 5
GGG GGC ATT AGT GAG GAC GCA ACA ACG GGG GCG GTG AGC GTG CCT ATT 99 Gly Gly He Ser Glu Asp Ala Thr Thr Gly Ala Val Ser Val Pro He 10 15 20 25
TAT CAA ACT TCC ACC TAC CGC CAA GAC GCC ATA GGC CGC CAT AAG GGC 147 Tyr Gin Thr Ser Thr Tyr Arg Gin Asp Ala He Gly Arg His Lys Gly 30 35 40
TAT GAA TAC TCT CGC TCA GGC AAC CCC ACG CGC TTT GCT TTA GAA GAA 195 Tyr Glu Tyr Ser Arg Ser Gly Asn Pro Thr Arg Phe Ala Leu Glu Glu 45 50 55
CTC ATC GCT GAT TTA GAA GGG GGG GTT AAG GGG TTT GCT TTT GCC TCT 243 Leu He Ala Asp Leu Glu Gly Gly Val Lys Gly Phe Ala Phe Ala Ser 60 65 70
GGA TTA GCT GGA ATC CAC GCC GTT TTT TCC CTC TTG CAA TCA GGC GAT 291 Gly Leu Ala Gly He His Ala Val Phe Ser Leu Leu Gin Ser Gly Asp 75 80 85 CAT GTG TTA TTG GGC GAT GAT GTT TAT GGG GGG ACT TTC CGC TTG TTT 339 His Val Leu Leu Gly Asp Asp Val Tyr Gly Gly Thr Phe Arg Leu Phe 90 95 100 105
AAT CAA GTG CTT GTC AAA AAC GGG CTT TCT TGC ACC ATT ATA GAC ACT 387 Asn Gin Val Leu Val Lys Asn Gly Leu Ser Cys Thr He He Asp Thr 110 115 120
AGC GAT ATA TCC CAA ATT AAA AAG GCT ATC AAG CCC AAC ACC AAA GCC 435 Ser Asp He Ser Gin He Lys Lys Ala He Lys Pro Asn Thr Lys Ala 125 130 135
CTT TAT TTA GAA ACC CCT AGT AAC CCC TTG CTT AAA ATC ACG GAT TTA 483 Leu Tyr Leu Glu Thr Pro Ser Asn Pro Leu Leu Lys He Thr Asp Leu 140 145 150
GCG CAA TGC GCT AGT GTC GCT AAA GAT CAT GGT TTG CTC ACT ATC GTG 531 Ala Gin Cys Ala Ser Val Ala Lys Asp His Gly Leu Leu Thr He Val 155 160 165
GAT AAC ACC TTT GCC ACC CCC TAT TAT CAA AAC CCG CTT CTT TTG GGA 579 Asp Asn Thr Phe Ala Thr Pro Tyr Tyr Gin Asn Pro Leu Leu Leu Gly 170 175 180 185
GCG GAC ATT GTG GCA CAT AGC GGC ACC AAA TAC TTA GGC GGG CAT AGC 627 Ala Asp He Val Ala His Ser Gly Thr Lys Tyr Leu Gly Gly His Ser 190 195 200
GAT GTG GTC GCC GGG CTT GTA ACC ACT AAT AAT GAA GCG CTA GCC CAA 675 Asp Val Val Ala Gly Leu Val Thr Thr Asn Asn Glu Ala Leu Ala Gin 205 210 215
GAG ATC GCT TTT TTC CAA AAC GCT ATC GGT GGG GTT TTA GGC CCT CAA 723 Glu He Ala Phe Phe Gin Asn Ala He Gly Gly Val Leu Gly Pro Gin 220 225 230
GAC AGC TGG CTG TTG CAA AGA GGG ATT AAA ACG CTG GGA TTG CGC ATG 771 Asp Ser Trp Leu Leu Gin Arg Gly He Lys Thr Leu Gly Leu Arg Met 235 240 245
GAA GCC CAT CAA AAA AAC GCT CTT TGT GTG GCT GAG TTT TTA GAA AAA 819 Glu Ala His Gin Lys Asn Ala Leu Cys Val Ala Glu Phe Leu Glu Lys 250 255 260 265
CAC CCT AAA GTG GAA AGG GTT TAT TAC CCG GGC CTT CCC ACT CAC CCT 867 His Pro Lys Val Glu Arg Val Tyr Tyr Pro Gly Leu Pro Thr His Pro 270 275 280
AAT TAC GAA CTA GCT AAA AAA CAG ATG CGT GGC TTT AGC GGG ATG CTC 915 Asn Tyr Glu Leu Ala Lys Lys Gin Met Arg Gly Phe Ser Gly Met Leu 285 290 295
TCT TTC ACT CTC AAA AAT GAT AGC GAG GCG GTT GCT TTT GTA GAA AGC 963 Ser Phe Thr Leu Lys Asn Asp Ser Glu Ala Val Ala Phe Val Glu Ser 300 305 310 CTT AAA CTA TTC ATT TTA GGC GAG AGT TTG GGC GGG GTG GAA AGT TTG 1011 Leu Lys Leu Phe He Leu Gly Glu Ser Leu Gly Gly Val Glu Ser Leu 315 320 325
GTG GGG ATT CCG GCA TTT ATG ACC CAT GCG TGC ATC CCT AAA ACG CAA 1059 Val Gly He Pro Ala Phe Met Thr His Ala Cys He Pro Lys Thr Gin 330 335 340 345
CGA GAA GCT GCT GGG ATT AGA GAT GGC CTG GTG CGC TTG TCT GTA GGG 1107 Arg Glu Ala Ala Gly He Arg Asp Gly Leu Val Arg Leu Ser Val Gly 350 355 360
ATT GAG CAT GAA CAG GAT TTG TTA GAA GAT TTA GAG CAA GCG TTC GCT 1155 He Glu His Glu Gin Asp Leu Leu Glu Asp Leu Glu Gin Ala Phe Ala 365 370 375
AAA ATA GGC TAAAGTTTCA TTACAATTTA TGAATAAAGG AGTTAAAAAC ATGAA 1209 Lys He Gly 380
(2) INFORMATION FOR SEQ ID NO: 1200:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 380 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1200:
Met Arg Met Gin Thr Lys Leu He His Gly Gly He Ser Glu Asp Ala
1 5 10 15
Thr Thr Gly Ala Val Ser Val Pro He Tyr Gin Thr Ser Thr Tyr Arg
20 25 30
Gin Asp Ala He Gly Arg His Lys Gly Tyr Glu Tyr Ser Arg Ser Gly
35 40 45
Asn Pro Thr Arg Phe Ala Leu Glu Glu Leu He Ala Asp Leu Glu Gly
50 55 60
Gly Val Lys Gly Phe Ala Phe Ala Ser Gly Leu Ala Gly He His Ala 65 70 75 80
Val Phe Ser Leu Leu Gin Ser Gly Asp His Val Leu Leu Gly Asp Asp
85 90 95
Val Tyr Gly Gly Thr Phe Arg Leu Phe Asn Gin Val Leu Val Lys Asn
100 105 110
Gly Leu Ser Cys Thr He He Asp Thr Ser Asp He Ser Gin He Lys
115 120 125
Lys Ala He Lys Pro Asn Thr Lys Ala Leu Tyr Leu Glu Thr Pro Ser
130 135 140
Asn Pro Leu Leu Lys He Thr Asp Leu Ala Gin Cys Ala Ser Val Ala 145 150 155 160
Lys Asp His Gly Leu Leu Thr He Val Asp Asn Thr Phe Ala Thr Pro 165 170 175 Tyr Tyr Gin Asn Pro Leu Leu Leu Gly Ala Asp He Val Ala His Ser
180 185 190
Gly Thr Lys Tyr Leu Gly Gly His Ser Asp Val Val Ala Gly Leu Val
195 200 205
Thr Thr Asn Asn Glu Ala Leu Ala Gin Glu He Ala Phe Phe Gin Asn
210 215 220
Ala He Gly Gly Val Leu Gly Pro Gin Asp Ser Trp Leu Leu Gin Arg 225 230 235 240
Gly He Lys Thr Leu Gly Leu Arg Met Glu Ala His Gin Lys Asn Ala
245 250 255
Leu Cys Val Ala Glu Phe Leu Glu Lys His Pro Lys Val Glu Arg Val
260 265 270
Tyr Tyr Pro Gly Leu Pro Thr His Pro Asn Tyr Glu Leu Ala Lys Lys
275 280 285
Gin Met Arg Gly Phe Ser Gly Met Leu Ser Phe Thr Leu Lys Asn Asp
290 295 300
Ser Glu Ala Val Ala Phe Val Glu Ser Leu Lys Leu Phe He Leu Gly 305 310 315 320
Glu Ser Leu Gly Gly Val Glu Ser Leu Val Gly He Pro Ala Phe Met
325 330 335
Thr His Ala Cys He Pro Lys Thr Gin Arg Glu Ala Ala Gly He Arg
340 345 350
Asp Gly Leu Val Arg Leu Ser Val Gly He Glu His Glu Gin Asp Leu
355 360 365
Leu Glu Asp Leu Glu Gin Ala Phe Ala Lys He Gly 370 375 380
(2) INFORMATION FOR SEQ ID NO: 1201:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 912 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 46...873 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1201:
ATTATTAACT TTTTATGCTA TAATGCGAGG GTTCTTTCAT CAAGA ATG GTG ATT GAC 57
Met Val He Asp 1
GAG ATT TTT CAA ATA ATG ATG TTA AGA AGA ATT AAA GTA GGT TCT AAT 105 Glu He Phe Gin He Met Met Leu Arg Arg He Lys Val Gly Ser Asn 5 10 15 20
TTG AAT AAA AAA GAG AGT TTG TTA GAT GCG TTT GTT AAA ACC TAT CTG 153 Leu Asn Lys Lys Glu Ser Leu Leu Asp Ala Phe Val Lys Thr Tyr Leu 25 30 35
CAG ATT TTA GAA CCC ATT AGT TCT AAA CGC TTA AAA GAG TTG GCG GAC 201 Gin He Leu Glu Pro He Ser Ser Lys Arg Leu Lys Glu Leu Ala Asp 40 45 50
TTG AAA ATA TCT TGC GCG ACG ATC AGG AAT TAT TTT CAA ATC CTT TCT 249 Leu Lys He Ser Cys Ala Thr He Arg Asn Tyr Phe Gin He Leu Ser 55 60 65
AAA GAG GGC ATG CTT TAT CAA GCC CAT TCT AGT GGC GCT AGA TTG CCC 297 Lys Glu Gly Met Leu Tyr Gin Ala His Ser Ser Gly Ala Arg Leu Pro 70 75 80
ACT TTT AAG GCG TTT GAA AAC TAT TGG CAA AAG TCG TTG CGC TTT GAA 345 Thr Phe Lys Ala Phe Glu Asn Tyr Trp Gin Lys Ser Leu Arg Phe Glu 85 90 95 100
ACT TTA AAG GTG AAT GAA AAA CGC CTA AAA AGC GCG AGT GAA AAT TTT 393 Thr Leu Lys Val Asn Glu Lys Arg Leu Lys Ser Ala Ser Glu Asn Phe 105 110 115
GGG CTT TTC ACG CTG TTA AAA AAA CCC AGT TTG GAG CGT TTA GAA AGA 441 Gly Leu Phe Thr Leu Leu Lys Lys Pro Ser Leu Glu Arg Leu Glu Arg 120 125 130
GTC ATT GAG TGC GAA AAA CGC TTT TTG ATT TTG GAC TTT TTG GCG TTT 489 Val He Glu Cys Glu Lys Arg Phe Leu He Leu Asp Phe Leu Ala Phe 135 140 145
TCT TGC GCA CTG GGT TAC AGC GTT AAA ATG GAA AAG TTT TTA TTA GAG 537 Ser Cys Ala Leu Gly Tyr Ser Val Lys Met Glu Lys Phe Leu Leu Glu 150 155 160
CTT GTG GGC AGA AGC GTT AAA GAA GTG CGC TCA ATC GCT GCT TCT TTC 585 Leu Val Gly Arg Ser Val Lys Glu Val Arg Ser He Ala Ala Ser Phe 165 170 175 180
AAT GCG TTG AGT TTG GCC AGG CAA TTA GAG CGT TTG GAG TAT TCC AAC 633 Asn Ala Leu Ser Leu Ala Arg Gin Leu Glu Arg Leu Glu Tyr Ser Asn 185 190 195
ACA CAA ATC ACA CGC TTT AAT CTG ATG GGG TTA AAA ACG CTT TTA AAC 681 Thr Gin He Thr Arg Phe Asn Leu Met Gly Leu Lys Thr Leu Leu Asn 200 205 210
AGC CCT TTA TTT TTT GAC ATT TTA GGG GGT AAG GTT TTA GAG CGT TTG 729 Ser Pro Leu Phe Phe Asp He Leu Gly Gly Lys Val Leu Glu Arg Leu 215 220 225
AGT AAG GGT TTG CAT TTT ATA GAG CCT GAT TGC ATG CTA GTA ACA CGC 777 Ser Lys Gly Leu His Phe He Glu Pro Asp Cys Met Leu Val Thr Arg 230 235 240 CCT GTA GAA TTT CAA AAC AAG CGG ATG CAA CTG CTT TGC GTG GGG AAA 825 Pro Val Glu Phe Gin Asn Lys Arg Met Gin Leu Leu Cys Val Gly Lys 245 250 255 260
CTA GAA TGC GAT TAT GAA GGG TTT TTT CAA ACG ATT TCT GAG GAG GAA T 874 Leu Glu Cys Asp Tyr Glu Gly Phe Phe Gin Thr He Ser Glu Glu Glu 265 270 275
AATGAAAGAT GAACACAACC AAGAACACGA TCATTTAA 912
(2) INFORMATION FOR SEQ ID NO: 1202:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 276 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1202:
Met Val He Asp Glu He Phe Gin He Met Met Leu Arg Arg He Lys
1 5 10 15
Val Gly Ser Asn Leu Asn Lys Lys Glu Ser Leu Leu Asp Ala Phe Val
20 25 30
Lys Thr Tyr Leu Gin He Leu Glu Pro He Ser Ser Lys Arg Leu Lys
35 40 45
Glu Leu Ala Asp Leu Lys He Ser Cys Ala Thr He Arg Asn Tyr Phe
50 55 60
Gin He Leu Ser Lys Glu Gly Met Leu Tyr Gin Ala His Ser Ser Gly 65 70 75 80
Ala Arg Leu Pro Thr Phe Lys Ala Phe Glu Asn Tyr Trp Gin Lys Ser
85 90 95
Leu Arg Phe Glu Thr Leu Lys Val Asn Glu Lys Arg Leu Lys Ser Ala
100 105 110
Ser Glu Asn Phe Gly Leu Phe Thr Leu Leu Lys Lys Pro Ser Leu Glu
115 120 125
Arg Leu Glu Arg Val He Glu Cys Glu Lys Arg Phe Leu He Leu Asp
130 135 140
Phe Leu Ala Phe Ser Cys Ala Leu Gly Tyr Ser Val Lys Met Glu Lys 145 150 155 160
Phe Leu Leu Glu Leu Val Gly Arg Ser Val Lys Glu Val Arg Ser He
165 170 175
Ala Ala Ser Phe Asn Ala Leu Ser Leu Ala Arg Gin Leu Glu Arg Leu
180 185 190
Glu Tyr Ser Asn Thr Gin He Thr Arg Phe Asn Leu Met Gly Leu Lys
195 200 205
Thr Leu Leu Asn Ser Pro Leu Phe Phe Asp He Leu Gly Gly Lys Val
210 215 220
Leu Glu Arg Leu Ser Lys Gly Leu His Phe He Glu Pro Asp Cys Met 225 230 235 240
Leu Val Thr Arg Pro Val Glu Phe Gin Asn Lys Arg Met Gin Leu Leu
245 250 255
Cys Val Gly Lys Leu Glu Cys Asp Tyr Glu Gly Phe Phe Gin Thr He 260 265 270 Ser Glu Glu Glu 275
(2) INFORMATION FOR SEQ ID NO: 1203:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 720 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 50...685 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1203:
AGCTCTTTAC TATTTTATTA TCTATCTTTT ATTAAAAAAA CTTGTTATC ATG ATA AAC 58
Met He Asn
1
ATG AAC ACA CAC ACA AGA GGC ATT GAC AGC AAT CTG ATT CAT TCG CTC 106 Met Asn Thr His Thr Arg Gly He Asp Ser Asn Leu He His Ser Leu 5 10 15
CAA AGC ATT TCA TTA TCC ATG TTT AGA AAG GGT TTT TTT GGG CTT TAT 154 Gin Ser He Ser Leu Ser Met Phe Arg Lys Gly Phe Phe Gly Leu Tyr 20 25 30 35
CAA GGC TCT ATT TCA GCA CGC ATT GGC GCA AAT CAA TTT GTG ATC AAC 202 Gin Gly Ser He Ser Ala Arg He Gly Ala Asn Gin Phe Val He Asn 40 45 50
AAA AGA AAC GCT GTT TTT GAT CAA TTG AAT GAA AAC ACC TTA CTG GTT 250 Lys Arg Asn Ala Val Phe Asp Gin Leu Asn Glu Asn Thr Leu Leu Val 55 60 65
TTG CAT GAC AAA ATA GAT TAC CGC TGG AAA GAA GCG AGC TTG GAT TCG 298 Leu His Asp Lys He Asp Tyr Arg Trp Lys Glu Ala Ser Leu Asp Ser 70 75 80
CCC ATT CAT GCG AGC GTG TAT AGG GAG TTT TTG GAC GCT AAA TTC ATC 346 Pro He His Ala Ser Val Tyr Arg Glu Phe Leu Asp Ala Lys Phe He 85 90 95
GCT TAC GCG CGC CCT CCT TAT AGT TTG GCG TAT TCC TTG CGC CAC AAC 394 Ala Tyr Ala Arg Pro Pro Tyr Ser Leu Ala Tyr Ser Leu Arg His Asn 100 105 110 115 CGA TTG CTC CCT AGA GAT TAT TTA GGG TAT CGT TCT TTG GGC GAA GAA 442 Arg Leu Leu Pro Arg Asp Tyr Leu Gly Tyr Arg Ser Leu Gly Glu Glu 120 125 130
ATT TCC ATT TTT AAC CCC AAA GAC TAT GAC AGC TGG CAA GAA AGA GCG 490 He Ser He Phe Asn Pro Lys Asp Tyr Asp Ser Trp Gin Glu Arg Ala 135 140 145
GAT ACA GAA ATT TTA CGC CAA CTG CAA GAG AGC AAA AAA TAT TTT GTT 538 Asp Thr Glu He Leu Arg Gin Leu Gin Glu Ser Lys Lys Tyr Phe Val 150 155 160
TTC ATT AAG GGG TGT GGG ATT TTT GCC TAC CAC AGA GAG CTT TCT AAA 586 Phe He Lys Gly Cys Gly He Phe Ala Tyr His Arg Glu Leu Ser Lys 165 170 175
CTC ATG GAA GTT TTT GAT TTG ATT GAA AAC TCA TGC AAG GTT TTA CGA 634 Leu Met Glu Val Phe Asp Leu He Glu Asn Ser Cys Lys Val Leu Arg 180 185 190 195
TTG GGC GAT TTA ATG GAT TAT TGC TAT AAT GAT GAT CCA CGA TTG AGC 682 Leu Gly Asp Leu Met Asp Tyr Cys Tyr Asn Asp Asp Pro Arg Leu Ser 200 205 210
GTG TAAAAAGCTA AAAAGGATAA AACATGACCA TCAAC 720
Val
(2) INFORMATION FOR SEQ ID NO: 1204:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 212 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1204:
Met He Asn Met Asn Thr His Thr Arg Gly He Asp Ser Asn Leu He
1 5 10 15
His Ser Leu Gin Ser He Ser Leu Ser Met Phe Arg Lys Gly Phe Phe
20 25 30
Gly Leu Tyr Gin Gly Ser He Ser Ala Arg He Gly Ala Asn Gin Phe
35 40 45
Val He Asn Lys Arg Asn Ala Val Phe Asp Gin Leu Asn Glu Asn Thr
50 55 60
Leu Leu Val Leu His Asp Lys He Asp Tyr Arg Trp Lys Glu Ala Ser 65 70 75 80
Leu Asp Ser Pro He His Ala Ser Val Tyr Arg Glu Phe Leu Asp Ala
85 90 95
Lys Phe He Ala Tyr Ala Arg Pro Pro Tyr Ser Leu Ala Tyr Ser Leu 100 105 110 Arg His Asn Arg Leu Leu Pro Arg Asp Tyr Leu Gly Tyr Arg Ser Leu
115 120 125
Gly Glu Glu He Ser He Phe Asn Pro Lys Asp Tyr Asp Ser Trp Gin
130 135 140
Glu Arg Ala Asp Thr Glu He Leu Arg Gin Leu Gin Glu Ser Lys Lys 145 150 155 160
Tyr Phe Val Phe He Lys Gly Cys Gly He Phe Ala Tyr His Arg Glu
165 170 175
Leu Ser Lys Leu Met Glu Val Phe Asp Leu He Glu Asn Ser Cys Lys
180 185 190
Val Leu Arg Leu Gly Asp Leu Met Asp Tyr Cys Tyr Asn Asp Asp Pro
195 200 205
Arg Leu Ser Val 210
(2) INFORMATION FOR SEQ ID NO: 1205:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2498 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 91...2445 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1205:
GGGAGTTTTG TGCGATATAT CAAGTTTTTC AAAGAGTTGA ACAATAAAAA TGTGAATCTG 60 GTTGGGGGCA AGAACGCTAG TATTGGTGAA ATG TTT CAA GAA TTA GTG CCT ATT 114
Met Phe Gin Glu Leu Val Pro He 1 5
GGT ATT AAA GTG CCT GAT GGC TTT GCG ATC ACC AGC GAA GCG TAT TGG 162 Gly He Lys Val Pro Asp Gly Phe Ala He Thr Ser Glu Ala Tyr Trp 10 15 20
TAT CTT TTA GAG CAA GGA GGG GCT AAA CAA AAA ATC ATA GAG CTT TTA 210 Tyr Leu Leu Glu Gin Gly Gly Ala Lys Gin Lys He He Glu Leu Leu 25 30 35 40
GAA AAT GTT GAT GCC ACC GAA ATT GAT GTG TTA AAA ATC CGC TCC AAA 258 Glu Asn Val Asp Ala Thr Glu He Asp Val Leu Lys He Arg Ser Lys 45 50 55
CAA ATC AGA GAG CTT ATT TTT GGC ACG CCT TTT CCT AGC GAT TTG AGA 306 Gin He Arg Glu Leu He Phe Gly Thr Pro Phe Pro Ser Asp Leu Arg 60 65 70 GAT GAG ATT TTT CAA GCT TAT GAG ATT TTA AGC CAG CAA TAC CAC ATG 354 Asp Glu He Phe Gin Ala Tyr Glu He Leu Ser Gin Gin Tyr His Met 75 80 85
AAA GAA GCC GAT GTG GCT GTA AGG AGT TCC GCT ACT GCA GAA GAT TTG 402 Lys Glu Ala Asp Val Ala Val Arg Ser Ser Ala Thr Ala Glu Asp Leu 90 95 100
CCG GAC GCT TCT TTT GCC GGG CAG CAA GAC ACT TAT TTA AAC ATT AAG 450 Pro Asp Ala Ser Phe Ala Gly Gin Gin Asp Thr Tyr Leu Asn He Lys 105 110 115 120
GGT AAA ACC GAA TTG ATC CAC TAT ATC AAA TCC TGT TTA GCG TCG CTT 498 Gly Lys Thr Glu Leu He His Tyr He Lys Ser Cys Leu Ala Ser Leu 125 130 135
TTT ACC GAT AGA GCG ATT AGC TAT AGG GCG AGT CGT GGG TTT GAT CAT 546 Phe Thr Asp Arg Ala He Ser Tyr Arg Ala Ser Arg Gly Phe Asp His 140 145 150
TTA AAA GTC GCG CTC AGC GTG GGG GTG CAA AAA ATG GTG CGA GCG GAT 594 Leu Lys Val Ala Leu Ser Val Gly Val Gin Lys Met Val Arg Ala Asp 155 160 165
AAA GGC AGC GCG GGC GTG ATG TTT TCT ATT GAC ACC GAA ACC GGT TTT 642 Lys Gly Ser Ala Gly Val Met Phe Ser He Asp Thr Glu Thr Gly Phe 170 175 180
AAA GAC GCG GTG TTT ATC ACT TCA GCG TGG GGG TTA GGC GAA AAT GTG 690 Lys Asp Ala Val Phe He Thr Ser Ala Trp Gly Leu Gly Glu Asn Val 185 190 195 200
GTG GGT GGC ACG ATA AAC CCT GAT GAA TTT TAT GTG TTT AAG CCC ACT 738 Val Gly Gly Thr He Asn Pro Asp Glu Phe Tyr Val Phe Lys Pro Thr 205 210 215
TTA GAG CAA AAC AAA CGC CCC ATT ATC AAA CGC CAA CTC GGC AAT AAA 786 Leu Glu Gin Asn Lys Arg Pro He He Lys Arg Gin Leu Gly Asn Lys 220 225 230
ACG CAA AAA ATG GTC TAT GCC CCA AGG GGT AGC GAA CAC CCC ACC AGA 834 Thr Gin Lys Met Val Tyr Ala Pro Arg Gly Ser Glu His Pro Thr Arg 235 240 245
AAC ATT AAA ACC ACC AAA AAA GAA TGG CAA TCC TTT TCA TTG AGC GAT 882 Asn He Lys Thr Thr Lys Lys Glu Trp Gin Ser Phe Ser Leu Ser Asp 250 255 260
GAA GAC GTG CTG ATT TTA GCC AAA TAC GCC ATT GAA ATT GAA AAA CAC 930 Glu Asp Val Leu He Leu Ala Lys Tyr Ala He Glu He Glu Lys His 265 270 275 280
TAC TCT AAA GAA GCC AAA CAA TAC CGC CCT ATG GAT ATA GAA TGG GCT 978 Tyr Ser Lys Glu Ala Lys Gin Tyr Arg Pro Met Asp He Glu Trp Ala 285 290 295 AAA GAT GGC GAG AGC GGG GAA ATC TTT ATC GTT CAA GCG CGC CCA GAA 1026 Lys Asp Gly Glu Ser Gly Glu He Phe He Val Gin Ala Arg Pro Glu 300 305 310
ACC GTT CAA AGC CAA AAA AGT AAA GAA GAA AGT CAA GTC TTT GAA AAA 1074 Thr Val Gin Ser Gin Lys Ser Lys Glu Glu Ser Gin Val Phe Glu Lys 315 320 325
TTC AAA TTC AAA AAC CCT AAC GAA AAG AAA GAG ATT ATC TTA CAA GGC 1122 Phe Lys Phe Lys Asn Pro Asn Glu Lys Lys Glu He He Leu Gin Gly 330 335 340
AGA GCG ATT GGG AGT AAA ATT GGC TCA GGA AAA GTG CGC ATC ATC AAT 1170 Arg Ala He Gly Ser Lys He Gly Ser Gly Lys Val Arg He He Asn 345 350 355 360
GAT TTG GAG CAC ATG AAT TCT TTT AAA GAG GGC GAA ATT TTA GTT ACG 1218 Asp Leu Glu His Met Asn Ser Phe Lys Glu Gly Glu He Leu Val Thr 365 370 375
GAT AAC ACC GAT CCG GAC TGG GAG CCT TGC ATG AAA AAA GCG AGC GCG 1266 Asp Asn Thr Asp Pro Asp Trp Glu Pro Cys Met Lys Lys Ala Ser Ala 380 385 390
GTT ATC ACT AAT CGT GGA GGG CGC ACT TGC CAT GCC GCT ATT GTG GCG 1314 Val He Thr Asn Arg Gly Gly Arg Thr Cys His Ala Ala He Val Ala 395 400 405
AGA GAA ATT GGC GTG CCA GCT ATC GTT GGG GTG AGC GGG GCG ACT GAT 1362 Arg Glu He Gly Val Pro Ala He Val Gly Val Ser Gly Ala Thr Asp 410 415 420
AGC CTT TAT ACC GGC ATG GAA ATC ACG GTT TCT TGC GCT GAG GGC GAA 1410 Ser Leu Tyr Thr Gly Met Glu He Thr Val Ser Cys Ala Glu Gly Glu 425 430 435 440
GAG GGC TAT GTG TAT GCG GGC ATT TAT GAG CAT GAA ATT GAA AGG GTG 1458 Glu Gly Tyr Val Tyr Ala Gly He Tyr Glu His Glu He Glu Arg Val 445 450 455
GAG CTT TCT AAC ATG CAA GAA ACT CAA ACA AAA ATT TAC ATC AAT ATT 1506 Glu Leu Ser Asn Met Gin Glu Thr Gin Thr Lys He Tyr He Asn He 460 465 470
GGA AAC CCT GAA AAA GCC TTT GGC TTT TCT CAA CTC CCT AAT CAC GGC 1554 Gly Asn Pro Glu Lys Ala Phe Gly Phe Ser Gin Leu Pro Asn His Gly 475 480 485
GTA GGG CTA GCC AGG ATG GAA ATG ATT ATT TTA AAT CAA ATC AAA GCC 1602 Val Gly Leu Ala Arg Met Glu Met He He Leu Asn Gin He Lys Ala 490 495 500
CAC CCT TTA GCT TTA GTG GAT TTG CAC CAC AAA AAA AGC GTG AAA GAA 1650 His Pro Leu Ala Leu Val Asp Leu His His Lys Lys Ser Val Lys Glu 505 510 515 520 AAA AAT GAA ATT GAA AAC CTC ATG GCA GGC TAT GCT AAC CCT AAA GAT 1698 Lys Asn Glu He Glu Asn Leu Met Ala Gly Tyr Ala Asn Pro Lys Asp 525 530 535
TTT TTT GTG AAA AAA ATC GCT GAA GGC ATT GGC ATG ATC AGT GCA GCG 1746 Phe Phe Val Lys Lys He Ala Glu Gly He Gly Met He Ser Ala Ala 540 545 550
TTT TAC CCT AAA CCT GTC ATT GTG AGA ACG AGC GAT TTC AAA TCC AAT 1794 Phe Tyr Pro Lys Pro Val He Val Arg Thr Ser Asp Phe Lys Ser Asn 555 560 565
GAA TAC ATG CGC ATG CTT GGC GGC TCT AGC TAT GAG CCT AAT GAA GAA 1842 Glu Tyr Met Arg Met Leu Gly Gly Ser Ser Tyr Glu Pro Asn Glu Glu 570 575 580
AAC CCC ATG CTT GGC TAT AGG GGG GCT AGT CGG TAT TAT TCA GAG AGC 1890 Asn Pro Met Leu Gly Tyr Arg Gly Ala Ser Arg Tyr Tyr Ser Glu Ser 585 590 595 600
TAT AAT GAA GCG TTT TCG TGG GAG TGT GAA GCC TTA GCG TTA GTG AGG 1938 Tyr Asn Glu Ala Phe Ser Trp Glu Cys Glu Ala Leu Ala Leu Val Arg 605 610 615
GAA GAA ATG GGA TTA ACC AAC ATG AAA GTG ATG ATC CCT TTT TTG CGA 1986 Glu Glu Met Gly Leu Thr Asn Met Lys Val Met He Pro Phe Leu Arg 620 625 630
ACC ATT GAA GAG GGT AAA AAA GTC CTA GAA ATC TTA AGA AAA AAC AAT 2034 Thr He Glu Glu Gly Lys Lys Val Leu Glu He Leu Arg Lys Asn Asn 635 640 645
TTA GAA TCC GGT AAA AAC GGG CTT GAA ATT TAT ATC ATG TGC GAA TTG 2082 Leu Glu Ser Gly Lys Asn Gly Leu Glu He Tyr He Met Cys Glu Leu 650 655 660
CCG GTG AAT GTC ATT TTG GCT GAT GAT TTC TTG AGC TTG TTT GAT GGC 2130 Pro Val Asn Val He Leu Ala Asp Asp Phe Leu Ser Leu Phe Asp Gly 665 670 675 680
TTT TCT ATT GGA TCA AAC GAT TTA ACC CAG CTC ACT TTA GGC GTG GAT 2178 Phe Ser He Gly Ser Asn Asp Leu Thr Gin Leu Thr Leu Gly Val Asp 685 690 695
AGA GAC AGC GAA TTG GTC AGC CAT GTC TTT GAT GAA AGG AAT GAA GCG 2226 Arg Asp Ser Glu Leu Val Ser His Val Phe Asp Glu Arg Asn Glu Ala 700 705 710
ATG CTA AAG ATG TTT AAA AAA GCG ATT GAA GCT TGC AAA AGG CAC AAC 2274 Met Leu Lys Met Phe Lys Lys Ala He Glu Ala Cys Lys Arg His Asn 715 720 725
AAA TAT TGC GGG ATT TGC GGG CAA GCC CCA AGC GAT TAC CCT GAA GTA 2322 Lys Tyr Cys Gly He Cys Gly Gin Ala Pro Ser Asp Tyr Pro Glu Val 730 735 740 ACA GAG TTT TTA GTC AAA GAG GGC ATC ACT TCC ATT TCT TTA AAC CCT 2370 Thr Glu Phe Leu Val Lys Glu Gly He Thr Ser He Ser Leu Asn Pro 745 750 755 760
GAT AGC GTG ATC CCC ACT TGG AAC GCT GTA GCC AAG TTA GAA AAA GAA 2418 Asp Ser Val He Pro Thr Trp Asn Ala Val Ala Lys Leu Glu Lys Glu 765 770 775
CTA AAA GAA CAT GGC TTA ACT GAA CAT TGATAATAAA TAAATCAATC TAACTTG 2472 Leu Lys Glu His Gly Leu Thr Glu His 780 785
AGTGGATTTT TCGTATTAGT TTCCAT 2498
(2) INFORMATION FOR SEQ ID NO: 1206:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 785 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1206:
Met Phe Gin Glu Leu Val Pro He Gly He Lys Val Pro Asp Gly Phe
1 5 10 15
Ala He Thr Ser Glu Ala Tyr Trp Tyr Leu Leu Glu Gin Gly Gly Ala
20 25 30
Lys Gin Lys He He Glu Leu Leu Glu Asn Val Asp Ala Thr Glu He
35 40 45
Asp Val Leu Lys He Arg Ser Lys Gin He Arg Glu Leu He Phe Gly
50 55 60
Thr Pro Phe Pro Ser Asp Leu Arg Asp Glu He Phe Gin Ala Tyr Glu 65 70 75 80
He Leu Ser Gin Gin Tyr His Met Lys Glu Ala Asp Val Ala Val Arg
85 90 95
Ser Ser Ala Thr Ala Glu Asp Leu Pro Asp Ala Ser Phe Ala Gly Gin
100 105 110
Gin Asp Thr Tyr Leu Asn He Lys Gly Lys Thr Glu Leu He His Tyr
115 120 125
He Lys Ser Cys Leu Ala Ser Leu Phe Thr Asp Arg Ala He Ser Tyr
130 135 140
Arg Ala Ser Arg Gly Phe Asp His Leu Lys Val Ala Leu Ser Val Gly 145 150 155 160
Val Gin Lys Met Val Arg Ala Asp Lys Gly Ser Ala Gly Val Met Phe
165 170 175
Ser He Asp Thr Glu Thr Gly Phe Lys Asp Ala Val Phe He Thr Ser
180 185 190
Ala Trp Gly Leu Gly Glu Asn Val Val Gly Gly Thr He Asn Pro Asp
195 200 205
Glu Phe Tyr Val Phe Lys Pro Thr Leu Glu Gin Asn Lys Arg Pro He
210 215 220
He Lys Arg Gin Leu Gly Asn Lys Thr Gin Lys Met Val Tyr Ala Pro 225 230 235 240
Arg Gly Ser Glu His Pro Thr Arg Asn He Lys Thr Thr Lys Lys Glu
245 250 255
Trp Gin Ser Phe Ser Leu Ser Asp Glu Asp Val Leu He Leu Ala Lys
260 265 270
Tyr Ala He Glu He Glu Lys His Tyr Ser Lys Glu Ala Lys Gin Tyr
275 280 285
Arg Pro Met Asp He Glu Trp Ala Lys Asp Gly Glu Ser Gly Glu He
290 295 300
Phe He Val Gin Ala Arg Pro Glu Thr Val Gin Ser Gin Lys Ser Lys 305 310 315 320
Glu Glu Ser Gin Val Phe Glu Lys Phe Lys Phe Lys Asn Pro Asn Glu
325 330 335
Lys Lys Glu He He Leu Gin Gly Arg Ala He Gly Ser Lys He Gly
340 345 350
Ser Gly Lys Val Arg He He Asn Asp Leu Glu His Met Asn Ser Phe
355 360 365
Lys Glu Gly Glu He Leu Val Thr Asp Asn Thr Asp Pro Asp Trp Glu
370 375 380
Pro Cys Met Lys Lys Ala Ser Ala Val He Thr Asn Arg Gly Gly Arg 385 390 395 400
Thr Cys His Ala Ala He Val Ala Arg Glu He Gly Val Pro Ala He
405 410 415
Val Gly Val Ser Gly Ala Thr Asp Ser Leu Tyr Thr Gly Met Glu He
420 425 430
Thr Val Ser Cys Ala Glu Gly Glu Glu Gly Tyr Val Tyr Ala Gly He
435 440 445
Tyr Glu His Glu He Glu Arg Val Glu Leu Ser Asn Met Gin Glu Thr
450 455 460
Gin Thr Lys He Tyr He Asn He Gly Asn Pro Glu Lys Ala Phe Gly 465 470 475 480
Phe Ser Gin Leu Pro Asn His Gly Val Gly Leu Ala Arg Met Glu Met
485 490 495
He He Leu Asn Gin He Lys Ala His Pro Leu Ala Leu Val Asp Leu
500 505 510
His His Lys Lys Ser Val Lys Glu Lys Asn Glu He Glu Asn Leu Met
515 520 525
Ala Gly Tyr Ala Asn Pro Lys Asp Phe Phe Val Lys Lys He Ala Glu
530 535 540
Gly He Gly Met He Ser Ala Ala Phe Tyr Pro Lys Pro Val He Val 545 550 555 560
Arg Thr Ser Asp Phe Lys Ser Asn Glu Tyr Met Arg Met Leu Gly Gly
565 570 575
Ser Ser Tyr Glu Pro Asn Glu Glu Asn Pro Met Leu Gly Tyr Arg Gly
580 585 590
Ala Ser Arg Tyr Tyr Ser Glu Ser Tyr Asn Glu Ala Phe Ser Trp Glu
595 600 605
Cys Glu Ala Leu Ala Leu Val Arg Glu Glu Met Gly Leu Thr Asn Met
610 615 620
Lys Val Met He Pro Phe Leu Arg Thr He Glu Glu Gly Lys Lys Val 625 630 635 640
Leu Glu He Leu Arg Lys Asn Asn Leu Glu Ser Gly Lys Asn Gly Leu
645 650 655
Glu He Tyr He Met Cys Glu Leu Pro Val Asn Val He Leu Ala Asp 660 665 670 Asp Phe Leu Ser Leu Phe Asp Gly Phe Ser He Gly Ser Asn Asp Leu
675 680 685
Thr Gin Leu Thr Leu Gly Val Asp Arg Asp Ser Glu Leu Val Ser His
690 695 700
Val Phe Asp Glu Arg Asn Glu Ala Met Leu Lys Met Phe Lys Lys Ala 705 710 715 720
He Glu Ala Cys Lys Arg His Asn Lys Tyr Cys Gly He Cys Gly Gin
725 730 735
Ala Pro Ser Asp Tyr Pro Glu Val Thr Glu Phe Leu Val Lys Glu Gly
740 745 750
He Thr Ser He Ser Leu Asn Pro Asp Ser Val He Pro Thr Trp Asn
755 760 765
Ala Val Ala Lys Leu Glu Lys Glu Leu Lys Glu His Gly Leu Thr Glu
770 775 780
His 785
(2) INFORMATION FOR SEQ ID NO: 1207:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 565 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 61...483 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1207:
GTTATCTTTA ATCAATCAGA TGATAGAATT TATCTTTTAT TTTTGAATTG GGAGCATTTG 60 ATG AAA AAA TTA GCG GTT TCT TTA TTA TTT ACA GGG ACT TTT TTG GGG 108 Met Lys Lys Leu Ala Val Ser Leu Leu Phe Thr Gly Thr Phe Leu Gly 1 5 10 15
CTT TTT TTG AAT GCG AGC GAT TTT AAG AGC ATG GAT GAC AAG CAA CTA 156 Leu Phe Leu Asn Ala Ser Asp Phe Lys Ser Met Asp Asp Lys Gin Leu 20 25 30
TTA GAG CAA GCA GGG AAA GTT GCT CCT AGC GAA GTC CCT GAG TTT CGC 204 Leu Glu Gin Ala Gly Lys Val Ala Pro Ser Glu Val Pro Glu Phe Arg 35 40 45
GCG GAA GTC AAT AAG CGA TTA GCA GTG ATG AAA GAA GAA GAT CGT AAA 252 Ala Glu Val Asn Lys Arg Leu Ala Val Met Lys Glu Glu Asp Arg Lys 50 55 60
AAT TAT AAA GCG GAT TTT AAG AAA GCG ATG GAT AAG AAT TTA GCT TCT 300 Asn Tyr Lys Ala Asp Phe Lys Lys Ala Met Asp Lys Asn Leu Ala Ser 65 70 75 80
TTA AGC CAA GAA GAT CGC AAC AAG CGT AAA AAA GAA ATT CTT GAA GCG 348 Leu Ser Gin Glu Asp Arg Asn Lys Arg Lys Lys Glu He Leu Glu Ala 85 90 95
ATT GCT AAC AAA AAG AAA ACA ATG ACC ATG AAA GAA TAT CGT GAA GAA 396 He Ala Asn Lys Lys Lys Thr Met Thr Met Lys Glu Tyr Arg Glu Glu 100 105 110
GGG TTG GAT TTG CAT GAT TGC GCA TGC GAA GGC CCT TTT CAT GAT CAT 444 Gly Leu Asp Leu His Asp Cys Ala Cys Glu Gly Pro Phe His Asp His 115 120 125
GAG AGA AAA AAA GGG AAA AAA CCA AGC CAT CAT AAG CAT TAGCGCTTAG GG 495 Glu Arg Lys Lys Gly Lys Lys Pro Ser His His Lys His 130 135 140
TGTGCTAACT TTTTTTGATT TTTGTGAAAC CACGCCGTAA GTCCCTAGCT TTTGGCTGTG 555 GGGATTAAGG 565
(2) INFORMATION FOR SEQ ID NO: 1208:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 141 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1208:
Met Lys Lys Leu Ala Val Ser Leu Leu Phe Thr Gly Thr Phe Leu Gly
1 5 10 15
Leu Phe Leu Asn Ala Ser Asp Phe Lys Ser Met Asp Asp Lys Gin Leu
20 25 30
Leu Glu Gin Ala Gly Lys Val Ala Pro Ser Glu Val Pro Glu Phe Arg
35 40 45
Ala Glu Val Asn Lys Arg Leu Ala Val Met Lys Glu Glu Asp Arg Lys
50 55 60
Asn Tyr Lys Ala Asp Phe Lys Lys Ala Met Asp Lys Asn Leu Ala Ser 65 70 75 80
Leu Ser Gin Glu Asp Arg Asn Lys Arg Lys Lys Glu He Leu Glu Ala
85 90 95
He Ala Asn Lys Lys Lys Thr Met Thr Met Lys Glu Tyr Arg Glu Glu
100 105 110
Gly Leu Asp Leu His Asp Cys Ala Cys Glu Gly Pro Phe His Asp His
115 120 125
Glu Arg Lys Lys Gly Lys Lys Pro Ser His His Lys His 130 135 140
(2) INFORMATION FOR SEQ ID NO: 1209:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 558 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...506 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1209:
GGGTGCATGG GCCTCAAAAA GTCGCTATCA TTCTCTACTA AAGGATAAGA ATG GAA 56
Met Glu
1
AAA TTA GAA GTA GGG CAA TTA GCC CCT GAT TTT AGA TTG AAA AAC AGC 104 Lys Leu Glu Val Gly Gin Leu Ala Pro Asp Phe Arg Leu Lys Asn Ser 5 10 15
GAT GGC GTG GAA ATT TCT TTA AAA GAT TTG CTC CAT AAA AAA GTG GTG 152 Asp Gly Val Glu He Ser Leu Lys Asp Leu Leu His Lys Lys Val Val 20 25 30
CTG TAT TTC TAC CCT AAA GAC AAC ACC CCC GGA TGC ACT CTA GAA GCC 200 Leu Tyr Phe Tyr Pro Lys Asp Asn Thr Pro Gly Cys Thr Leu Glu Ala 35 40 45 50
AAA GAC TTT AGC GCT CTA TTT AGT GAA TTT GAA AAG AAA AAC GCT GTT 248 Lys Asp Phe Ser Ala Leu Phe Ser Glu Phe Glu Lys Lys Asn Ala Val 55 60 65
GTC GTA GGC ATA AGC CCT GAT AAC GCG CAA TCG CAT CAA AAA TTT ATC 296 Val Val Gly He Ser Pro Asp Asn Ala Gin Ser His Gin Lys Phe He 70 75 80
AGC CAA TGC TCT TTG AAT GTG ATT TTG CTC TGC GAT GAA GAT AAA AAA 344 Ser Gin Cys Ser Leu Asn Val He Leu Leu Cys Asp Glu Asp Lys Lys 85 90 95
GCC GCC AAT CTT TAC AAA GCT TAT GGC AAA CGC ATG CTT TAT GGG AAG 392 Ala Ala Asn Leu Tyr Lys Ala Tyr Gly Lys Arg Met Leu Tyr Gly Lys 100 105 110
GAG CAT TTG GGG ATT ATC CGC TCC ACC TTC ATT ATC AAC ACG CAA GGC 440 Glu His Leu Gly He He Arg Ser Thr Phe He He Asn Thr Gin Gly 115 120 125 130
GTT TTA GAA AAA TGT TTC TAC AAT GTC AAA GCG AAA GGC CAT GCT CAA 488 Val Leu Glu Lys Cys Phe Tyr Asn Val Lys Ala Lys Gly His Ala Gin 135 140 145 AAG GTT TTA GAG AGT TTG TAGTTTAACT TTCTAACTTT CGCCCATTTT AATTTGAG 544 Lys Val Leu Glu Ser Leu 150
ATTTTTTAGC CATT 558
(2) INFORMATION FOR SEQ ID NO: 1210:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 152 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1210:
Met Glu Lys Leu Glu Val Gly Gin Leu Ala Pro Asp Phe Arg Leu Lys
1 5 10 15
Asn Ser Asp Gly Val Glu He Ser Leu Lys Asp Leu Leu His Lys Lys
20 25 30
Val Val Leu Tyr Phe Tyr Pro Lys Asp Asn Thr Pro Gly Cys Thr Leu
35 40 45
Glu Ala Lys Asp Phe Ser Ala Leu Phe Ser Glu Phe Glu Lys Lys Asn
50 55 60
Ala Val Val Val Gly He Ser Pro Asp Asn Ala Gin Ser His Gin Lys 65 70 75 80
Phe He Ser Gin Cys Ser Leu Asn Val He Leu Leu Cys Asp Glu Asp
85 90 95
Lys Lys Ala Ala Asn Leu Tyr Lys Ala Tyr Gly Lys Arg Met Leu Tyr
100 105 110
Gly Lys Glu His Leu Gly He He Arg Ser Thr Phe He He Asn Thr
115 120 125
Gin Gly Val Leu Glu Lys Cys Phe Tyr Asn Val Lys Ala Lys Gly His
130 135 140
Ala Gin Lys Val Leu Glu Ser Leu 145 150
(2) INFORMATION FOR SEQ ID NO: 1211:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 700 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...651 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1211:
ACTTAGAAGG GGTGATTT ATG AGT AAA GAG CTT ATT TTA AAG CGC ATT AAA 51
Met Ser Lys Glu Leu He Leu Lys Arg He Lys 1 5 10
GAA GCC AGA GCC AAG CAT GCC ATT CAG GGA GCG AAC CCT ATT TAT AGG 99 Glu Ala Arg Ala Lys His Ala He Gin Gly Ala Asn Pro He Tyr Arg 15 20 25
AAT ATC ATT AAA GTG GAG TTT GAG GAC TTG GTG GAA GAA TAC AAG CAT 147 Asn He He Lys Val Glu Phe Glu Asp Leu Val Glu Glu Tyr Lys His 30 35 40
TTC CAA GTG TTG AAT AAA GCT GAA GTC ATT GAA AGC GCT AAA GAA AAT 195 Phe Gin Val Leu Asn Lys Ala Glu Val He Glu Ser Ala Lys Glu Asn 45 50 55
TTA GAG CAA GCC ATT TTA AAG GCT TTA GAA AAT TTT AAA AGC AAA AAA 243 Leu Glu Gin Ala He Leu Lys Ala Leu Glu Asn Phe Lys Ser Lys Lys 60 65 70 75
ATC TTA CAC TCC ACA GAT TTG AAT TTG AAT TTT GAA GCG TTT AAG GAT 291 He Leu His Ser Thr Asp Leu Asn Leu Asn Phe Glu Ala Phe Lys Asp 80 85 90
TTT ACT TTA CAG CCT TAT GAT AAA GAA ATT GAA GCG ATG CGT GAA GAG 339 Phe Thr Leu Gin Pro Tyr Asp Lys Glu He Glu Ala Met Arg Glu Glu 95 100 105
TTG TTT GAG ATT GAT ACG GCT TTA TTG CAT GGG GTT TGT GGG ATT TCA 387 Leu Phe Glu He Asp Thr Ala Leu Leu His Gly Val Cys Gly He Ser 110 115 120
AGC TTG GGC ATG ATT GGG GCG GTC TCT TCG CAT GCA AGC CCG CGA TTG 435 Ser Leu Gly Met He Gly Ala Val Ser Ser His Ala Ser Pro Arg Leu 125 130 135
CTT TCG CTC ATC ACC CTT AAT TGC ATC ATC TTA TTG AAA AAA GAA TCC 483 Leu Ser Leu He Thr Leu Asn Cys He He Leu Leu Lys Lys Glu Ser 140 145 150 155
ATT GTG CGC AAT TTG AGT GAA GGC ATG CAA GCT TTA AAA AAC CAA AGC 531 He Val Arg Asn Leu Ser Glu Gly Met Gin Ala Leu Lys Asn Gin Ser 160 165 170
CAA AAC GGT GCA TTA CCC ACA AAC ATG CTC CTT ATT GGC GGG CCT AGC 579 Gin Asn Gly Ala Leu Pro Thr Asn Met Leu Leu He Gly Gly Pro Ser 175 180 185
CGG ACA GCT GAC ATT GAA TTA AAA ACC GTT TTT GGG GTG CAT GGG CCT 627 Arg Thr Ala Asp He Glu Leu Lys Thr Val Phe Gly Val His Gly Pro 190 195 200
CAA AAA GTC GCT ATC ATT CTC TAC TAAAGGATAA GAATGGAAAA ATTAGAAGTA 681 Gin Lys Val Ala He He Leu Tyr 205 210
GGGCAATTAG CCCCTGATT 700
(2) INFORMATION FOR SEQ ID NO: 1212:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 211 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1212:
Met Ser Lys Glu Leu He Leu Lys Arg He Lys Glu Ala Arg Ala Lys
1 5 10 15
His Ala He Gin Gly Ala Asn Pro He Tyr Arg Asn He He Lys Val
20 25 30
Glu Phe Glu Asp Leu Val Glu Glu Tyr Lys His Phe Gin Val Leu Asn
35 40 45
Lys Ala Glu Val He Glu Ser Ala Lys Glu Asn Leu Glu Gin Ala He
50 55 60
Leu Lys Ala Leu Glu Asn Phe Lys Ser Lys Lys He Leu His Ser Thr 65 70 75 80
Asp Leu Asn Leu Asn Phe Glu Ala Phe Lys Asp Phe Thr Leu Gin Pro
85 90 95
Tyr Asp Lys Glu He Glu Ala Met Arg Glu Glu Leu Phe Glu He Asp
100 105 110
Thr Ala Leu Leu His Gly Val Cys Gly He Ser Ser Leu Gly Met He
115 120 125
Gly Ala Val Ser Ser His Ala Ser Pro Arg Leu Leu Ser Leu He Thr
130 135 140
Leu Asn Cys He He Leu Leu Lys Lys Glu Ser He Val Arg Asn Leu 145 150 155 160
Ser Glu Gly Met Gin Ala Leu Lys Asn Gin Ser Gin Asn Gly Ala Leu
165 170 175
Pro Thr Asn Met Leu Leu He Gly Gly Pro Ser Arg Thr Ala Asp He
180 185 190
Glu Leu Lys Thr Val Phe Gly Val His Gly Pro Gin Lys Val Ala He
195 200 205
He Leu Tyr 210
(2) INFORMATION FOR SEQ ID NO:1213:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 571 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA ( ix) FEATURE :
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...531 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1213:
TATTCCAATA ACGACTACCC TATTATTTTG CCTAGCGGTT CATGCACAGG G ATG ATG 57
Met Met
1
CGG CAT GAT TAT TTG GAA TTG TTT GAA GGG CAT GCG GAA TTC AAC ATG 105 Arg His Asp Tyr Leu Glu Leu Phe Glu Gly His Ala Glu Phe Asn Met 5 10 15
GTT AAA GAT TTT TGC TCT AGG GTG TAT GAA TTG AGC GAA TTT TTG GAT 153 Val Lys Asp Phe Cys Ser Arg Val Tyr Glu Leu Ser Glu Phe Leu Asp 20 25 30
AAA AAA TTG CAA GTC AAA TAT GAA GAT AAG GGC GAA CCC CTT AAA ATC 201 Lys Lys Leu Gin Val Lys Tyr Glu Asp Lys Gly Glu Pro Leu Lys He 35 40 45 50
ACA TGG CAT TCT AAT TGC CAT GCC TTA AGG GTG GCT AAA GTG ATT GAC 249 Thr Trp His Ser Asn Cys His Ala Leu Arg Val Ala Lys Val He Asp 55 60 65
TCG GCG AAA AAC CTC ATC AGA CAG CTT AAA AAT GTG GAA CTC ATT GAA 297 Ser Ala Lys Asn Leu He Arg Gin Leu Lys Asn Val Glu Leu He Glu 70 75 80
TTG GAA AAA GAA GAA GAA TGC TGC GGG TTT GGG GGG ACT TTT TCG GTT 345 Leu Glu Lys Glu Glu Glu Cys Cys Gly Phe Gly Gly Thr Phe Ser Val 85 90 95
AAA GAG CCT GAA ATT TCA GCG GTT ATG GTT AAA GAA AAG ATT AAA AAC 393 Lys Glu Pro Glu He Ser Ala Val Met Val Lys Glu Lys He Lys Asn 100 105 110
ATA GAA AGC CGT CAA GTG GAT GTG ATT GTT TCA GCG GAT GCT GGG TGT 441 He Glu Ser Arg Gin Val Asp Val He Val Ser Ala Asp Ala Gly Cys 115 120 125 130
TTG ATG AAT ATC AGC ACC GCT ATG CAA AAA ATG GGC TCT TTG ACA AAA 489 Leu Met Asn He Ser Thr Ala Met Gin Lys Met Gly Ser Leu Thr Lys 135 140 145
CCC ATG CAT TTT TAT GAC TTT TTA GCC TCA AGA CTT GGG CTT TAACATTAA 540 Pro Met His Phe Tyr Asp Phe Leu Ala Ser Arg Leu Gly Leu 150 155 160
AGAATTATTT TAAGGAATGA TCATGGAAAA A 571 (2) INFORMATION FOR SEQ ID NO: 1214:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 160 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1214:
Met Met Arg His Asp Tyr Leu Glu Leu Phe Glu Gly His Ala Glu Phe
1 5 10 15
Asn Met Val Lys Asp Phe Cys Ser Arg Val Tyr Glu Leu Ser Glu Phe
20 25 30
Leu Asp Lys Lys Leu Gin Val Lys Tyr Glu Asp Lys Gly Glu Pro Leu
35 40 45
Lys He Thr Trp His Ser Asn Cys His Ala Leu Arg Val Ala Lys Val
50 55 60
He Asp Ser Ala Lys Asn Leu He Arg Gin Leu Lys Asn Val Glu Leu 65 70 75 80
He Glu Leu Glu Lys Glu Glu Glu Cys Cys Gly Phe Gly Gly Thr Phe
85 90 95
Ser Val Lys Glu Pro Glu He Ser Ala Val Met Val Lys Glu Lys He
100 105 110
Lys Asn He Glu Ser Arg Gin Val Asp Val He Val Ser Ala Asp Ala
115 120 125
Gly Cys Leu Met Asn He Ser Thr Ala Met Gin Lys Met Gly Ser Leu
130 135 140
Thr Lys Pro Met His Phe Tyr Asp Phe Leu Ala Ser Arg Leu Gly Leu 145 150 155 160
(2) INFORMATION FOR SEQ ID NO: 1215:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 759 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 70...714 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1215:
AACCTGCAAA CCCACCTTAA AAGGCTCAAA AGAAGTGAGT TTGTGGGCCA AAAAAAGGAA 60 TTAGAGGGC ATG GGG AGG TTT TCT TTA AAA GAA ATT TTA ATG CTC AGC CTT 111 Met Gly Arg Phe Ser Leu Lys Glu He Leu Met Leu Ser Leu 1 5 10
ACC TTA TTG GCT TTA CTG GGT TGG ATT TTT GGC AAA CCT TTA GGC TTG 159 Thr Leu Leu Ala Leu Leu Gly Trp He Phe Gly Lys Pro Leu Gly Leu 15 20 25 30
CAT GCG AGT GCG ACG GCT TTG ATT GTC ATG GTT TTA ATG GCG TTT TGT 207 His Ala Ser Ala Thr Ala Leu He Val Met Val Leu Met Ala Phe Cys 35 40 45
AAG ATT GTA AGC TAT GAA GAC ATC ATT AAA AAC AAG AGC GCG TTC AAT 255 Lys He Val Ser Tyr Glu Asp He He Lys Asn Lys Ser Ala Phe Asn 50 55 60
ATT TTT TTA TTG CTT GGA TCG CTG CTC ACG ATG GCT GGC GGG CTT AAA 303 He Phe Leu Leu Leu Gly Ser Leu Leu Thr Met Ala Gly Gly Leu Lys 65 70 75
AAT GTA GGG TTT TTA AAT TTT ATC GGC AAT GCG GCT CAA AAT TTT TTA 351 Asn Val Gly Phe Leu Asn Phe He Gly Asn Ala Ala Gin Asn Phe Leu 80 85 90
GAG CAT GCT CAC TTG GAT CCG TTA ATA GCG GTC TTG TTT ATT GTA GCC 399 Glu His Ala His Leu Asp Pro Leu He Ala Val Leu Phe He Val Ala 95 100 105 110
CTC TTT TAT CTG TCG CAT TAT TTT TTC GCA AGC ATC ACC GCT CAT GTG 447 Leu Phe Tyr Leu Ser His Tyr Phe Phe Ala Ser He Thr Ala His Val 115 120 125
AGC GCG TTA TTC GCG CTT TTT GTA GGG ATT GGT TCG CAC ATT CAA GGG 495 Ser Ala Leu Phe Ala Leu Phe Val Gly He Gly Ser His He Gin Gly 130 135 140
GTC AAT TTG CAA GAA TTG AGC TTG TTT TTA ATG TTT TCT TTA GGG ATT 543 Val Asn Leu Gin Glu Leu Ser Leu Phe Leu Met Phe Ser Leu Gly He 145 150 155
ATG GGG ATT TTA ACG CCC TAT GGC ACA GGC CCA TCC ACC ATT TAT TAC 591 Met Gly He Leu Thr Pro Tyr Gly Thr Gly Pro Ser Thr He Tyr Tyr 160 165 170
GGG AGC GGG TAT ATT CAA AGC AAG GAT TTT TGG AAA TGG GGG TTT ATT 639 Gly Ser Gly Tyr He Gin Ser Lys Asp Phe Trp Lys Trp Gly Phe He 175 180 185 190
TTT GGC TTT TTG TAT TTA ATC GTG TTT TTA AGC GTG TGC ACA CCT TGG 687 Phe Gly Phe Leu Tyr Leu He Val Phe Leu Ser Val Cys Thr Pro Trp 195 200 205
GTC AAA TTC ATC GCT TAT AGG TGG TTG TAGCTGGAAA CTTTACACAA CGCCCTT 741 Val Lys Phe He Ala Tyr Arg Trp Leu 210 215
TTAAAATGGT ATGAAGAA 755 (2) INFORMATION FOR SEQ ID NO: 1216:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 215 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1216:
Met Gly Arg Phe Ser Leu Lys Glu He Leu Met Leu Ser Leu Thr Leu
1 5 10 15
Leu Ala Leu Leu Gly Trp He Phe Gly Lys Pro Leu Gly Leu His Ala
20 25 30
Ser Ala Thr Ala Leu He Val Met Val Leu Met Ala Phe Cys Lys He
35 40 45
Val Ser Tyr Glu Asp He He Lys Asn Lys Ser Ala Phe Asn He Phe
50 55 60
Leu Leu Leu Gly Ser Leu Leu Thr Met Ala Gly Gly Leu Lys Asn Val 65 70 75 80
Gly Phe Leu Asn Phe He Gly Asn Ala Ala Gin Asn Phe Leu Glu His
85 90 95
Ala His Leu Asp Pro Leu He Ala Val Leu Phe He Val Ala Leu Phe
100 105 110
Tyr Leu Ser His Tyr Phe Phe Ala Ser He Thr Ala His Val Ser Ala
115 120 125
Leu Phe Ala Leu Phe Val Gly He Gly Ser His He Gin Gly Val Asn
130 135 140
Leu Gin Glu Leu Ser Leu Phe Leu Met Phe Ser Leu Gly He Met Gly 145 150 155 160
He Leu Thr Pro Tyr Gly Thr Gly Pro Ser Thr He Tyr Tyr Gly Ser
165 170 175
Gly Tyr He Gin Ser Lys Asp Phe Trp Lys Trp Gly Phe He Phe Gly
180 185 190
Phe Leu Tyr Leu He Val Phe Leu Ser Val Cys Thr Pro Trp Val Lys
195 200 205
Phe He Ala Tyr Arg Trp Leu 210 215
(2) INFORMATION FOR SEQ ID NO: 1217:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 357 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 37...309 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1217:
AAGGGGGGAT TTATATCGGT AAAGAGTTGT TTAAGC ATG GCT AGT GGC CTT TTT 54
Met Ala Ser Gly Leu Phe
1 5
GAA AAC GAT GGA ATC AAA GAC AAC AAA GCG CGA GAT TTT TTC TAT AGC 102 Glu Asn Asp Gly He Lys Asp Asn Lys Ala Arg Asp Phe Phe Tyr Ser 10 15 20
CAT AGC TCC CTA ATT GTC TTT TTC CTT TTA CTG CTT GGG TTT GGG TAT 150 His Ser Ser Leu He Val Phe Phe Leu Leu Leu Leu Gly Phe Gly Tyr 25 30 35
TAT TTA GGG AAG TTG CTT TTT GGG GGC TCT TCT TTA GAA GTT TAT TTG 198 Tyr Leu Gly Lys Leu Leu Phe Gly Gly Ser Ser Leu Glu Val Tyr Leu 40 45 50
GAT TTA AGA GAC AAG CAT GAA CGA TTG CAG CAA GAA ATC ACC GAA TTG 246 Asp Leu Arg Asp Lys His Glu Arg Leu Gin Gin Glu He Thr Glu Leu 55 60 65 70
CAA AGC AAG AAT GTG CGC TTG CAA AAG CGT TTG TTT GAG TTG AAG GAA 294 Gin Ser Lys Asn Val Arg Leu Gin Lys Arg Leu Phe Glu Leu Lys Glu 75 80 85
TTA CGG CCT AGA GAT TAGATTTAAG GAAAATGGTA GTGTTAAAAA AGATGATAGG T 350 Leu Arg Pro Arg Asp 90
TTGGTGG 357
(2) INFORMATION FOR SEQ ID NO: 1218:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 91 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1218:
Met Ala Ser Gly Leu Phe Glu Asn Asp Gly He Lys Asp Asn Lys Ala
1 5 10 15
Arg Asp Phe Phe Tyr Ser His Ser Ser Leu He Val Phe Phe Leu Leu
20 25 30
Leu Leu Gly Phe Gly Tyr Tyr Leu Gly Lys Leu Leu Phe Gly Gly Ser
35 40 45
Ser Leu Glu Val Tyr Leu Asp Leu Arg Asp Lys His Glu Arg Leu Gin 50 55 60 Gin Glu He Thr Glu Leu Gin Ser Lys Asn Val Arg Leu Gin Lys Arg 65 70 75 80
Leu Phe Glu Leu Lys Glu Leu Arg Pro Arg Asp 85 90
(2) INFORMATION FOR SEQ ID NO: 1219:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 678 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...675 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1219:
ATG ATA GAA GTT TTA ATG ATA GAA GAT GAT ATA GAA TTA GCC GAG TTT 48 Met He Glu Val Leu Met He Glu Asp Asp He Glu Leu Ala Glu Phe 1 5 10 15
TTG AGC GAG TTT TTG CTC CAA CAT GGC ATT CAT GTA ACC AAT TAC GAT 96 Leu Ser Glu Phe Leu Leu Gin His Gly He His Val Thr Asn Tyr Asp 20 25 30
GAG CCA TAT ACC GGC ATT AGT GCG GCT AAC ACA CAA AAT TAT GAT TTG 144 Glu Pro Tyr Thr Gly He Ser Ala Ala Asn Thr Gin Asn Tyr Asp Leu 35 40 45
TTG TTG TTG GAT TTG ACT TTG CCT AAT TTA GAC GGG CTT GAA GTG TGT 192 Leu Leu Leu Asp Leu Thr Leu Pro Asn Leu Asp Gly Leu Glu Val Cys 50 55 60
AGG CGC ATC TCC AAA CAA AAA CAT ATC CCT ATT ATT ATT TCT TCA GCG 240 Arg Arg He Ser Lys Gin Lys His He Pro He He He Ser Ser Ala 65 70 75 80
AGA AGT GAT GTG GAA GAT AAG ATT AAA GCA CTA GAT TAT GGG GCT GAT 288 Arg Ser Asp Val Glu Asp Lys He Lys Ala Leu Asp Tyr Gly Ala Asp 85 90 95
GAT TAC CTC CCT AAA CCC TAT GAT CCT AAA GAA TTA TTA GCT CGC ATC 336 Asp Tyr Leu Pro Lys Pro Tyr Asp Pro Lys Glu Leu Leu Ala Arg He 100 105 110
CAA TCG CTA CTC AGG CGT TCT CAT AAA AAA GAA GAA GTG AGT GAG CCA 384 Gin Ser Leu Leu Arg Arg Ser His Lys Lys Glu Glu Val Ser Glu Pro 115 120 125 GGC GAT GCG AAT ATC TTT AGG GTG GAT AAG GAT AGC CGA GAA GTG TAT 432 Gly Asp Ala Asn He Phe Arg Val Asp Lys Asp Ser Arg Glu Val Tyr 130 135 140
ATG CAT GAA AAA AAG CTG GAC TTA ACT AGG GCT GAA TAT GAA ATC CTT 480 Met His Glu Lys Lys Leu Asp Leu Thr Arg Ala Glu Tyr Glu He Leu 145 150 155 160
TCG CTT CTC ATT AGC AAA AAA GGT TAT GTG TTT AGC CGT GAA AGC ATT 528 Ser Leu Leu He Ser Lys Lys Gly Tyr Val Phe Ser Arg Glu Ser He 165 170 175
GCG ATT GAG AGC GAG AGC ATC AAC CCT GAA AGC TCT AAT AAA AGC ATT 576 Ala He Glu Ser Glu Ser He Asn Pro Glu Ser Ser Asn Lys Ser He 180 185 190
GAT GTG ATC ATT GGC CGT TTG CGA TCT AAG ATT GAA AAA AAT CCT AAA 624 Asp Val He He Gly Arg Leu Arg Ser Lys He Glu Lys Asn Pro Lys 195 200 205
CAA CCG CAA TAC ATC ATC TCT GTT AGA GGG ATT GGT TAT AAA TTA GAA 672 Gin Pro Gin Tyr He He Ser Val Arg Gly He Gly Tyr Lys Leu Glu 210 215 220
TAC TGA 678
Tyr
225
(2) INFORMATION FOR SEQ ID NO: 1220:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 225 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1220:
Met He Glu Val Leu Met He Glu Asp Asp He Glu Leu Ala Glu Phe
1 5 10 15
Leu Ser Glu Phe Leu Leu Gin His Gly He His Val Thr Asn Tyr Asp
20 25 30
Glu Pro Tyr Thr Gly He Ser Ala Ala Asn Thr Gin Asn Tyr Asp Leu
35 40 45
Leu Leu Leu Asp Leu Thr Leu Pro Asn Leu Asp Gly Leu Glu Val Cys
50 55 60
Arg Arg He Ser Lys Gin Lys His He Pro He He He Ser Ser Ala 65 70 75 80
Arg Ser Asp Val Glu Asp Lys He Lys Ala Leu Asp Tyr Gly Ala Asp
85 90 95
Asp Tyr Leu Pro Lys Pro Tyr Asp Pro Lys Glu Leu Leu Ala Arg He 100 105 110 Gin Ser Leu Leu Arg Arg Ser His Lys Lys Glu Glu Val Ser Glu Pro
115 120 125
Gly Asp Ala Asn He Phe Arg Val Asp Lys Asp Ser Arg Glu Val Tyr
130 135 140
Met His Glu Lys Lys Leu Asp Leu Thr Arg Ala Glu Tyr Glu He Leu 145 150 155 160
Ser Leu Leu He Ser Lys Lys Gly Tyr Val Phe Ser Arg Glu Ser He
165 170 175
Ala He Glu Ser Glu Ser He Asn Pro Glu Ser Ser Asn Lys Ser He
180 185 190
Asp Val He He Gly Arg Leu Arg Ser Lys He Glu Lys Asn Pro Lys
195 200 205
Gin Pro Gin Tyr He He Ser Val Arg Gly He Gly Tyr Lys Leu Glu
210 215 220
Tyr 225
(2) INFORMATION FOR SEQ ID NO: 1221:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1134 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 72...1082 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1221:
AAGTCTATGC CACGATCAAT GGTTTCCCTT TCAATTCACA GCTCAAACTT TTAGAAGAAC 60 ATATTGATAA A ATG GCA GAA TTA GAG CCG GAC GCT TTT ATT ATC GCT GCG 110 Met Ala Glu Leu Glu Pro Asp Ala Phe He He Ala Ala 1 5 10
CCT GGT GTG GTG AAA CTC GCT TTA AAA ATC GCC CCG CAT ATC CCT ATC 158 Pro Gly Val Val Lys Leu Ala Leu Lys He Ala Pro His He Pro He 15 20 25
CAT TTA TCC ACG CAA GCG AAT GTC TTA AAT TTG CTA GAT GCA CAA GTG 206 His Leu Ser Thr Gin Ala Asn Val Leu Asn Leu Leu Asp Ala Gin Val 30 35 40 45
TTT TAT GAT TTA GGG GTT AAA CGC ATC GTG TGC GCG AGG GAA TTG AGC 254 Phe Tyr Asp Leu Gly Val Lys Arg He Val Cys Ala Arg Glu Leu Ser 50 55 60
CTG AAT GAT GCG ATT GAG ATT AAA AAA GCC TTA CCT AAT TTA GAA TTA 302 Leu Asn Asp Ala He Glu He Lys Lys Ala Leu Pro Asn Leu Glu Leu 65 70 75
GAA ATC TTT GTG CAT GGG AGC ATG TGC TTT GCC TTT TCA GGG CGC TGC 350 Glu He Phe Val His Gly Ser Met Cys Phe Ala Phe Ser Gly Arg Cys 80 85 90
TTG ATT TCG GCC TTA CAA AAG GGG CGC GTG CCT AAT AGA GGG AGT TGC 398 Leu He Ser Ala Leu Gin Lys Gly Arg Val Pro Asn Arg Gly Ser Cys 95 100 105
GCG AAT GAT TGC CGG TTT GAT TAT GAA TAT TAC GTG AAA AAC CCT GAT 446 Ala Asn Asp Cys Arg Phe Asp Tyr Glu Tyr Tyr Val Lys Asn Pro Asp 110 115 120 125
AAT GGC GTG ATG ATG AGA CTG GTT GAA GAA GAG GGC GTA GGC ACG CAT 494 Asn Gly Val Met Met Arg Leu Val Glu Glu Glu Gly Val Gly Thr His 130 135 140
ATT TTT AAC GCT AAG GAT TTG AAC CTC TCT GGC CAT ATC GCT GAA ATT 542 He Phe Asn Ala Lys Asp Leu Asn Leu Ser Gly His He Ala Glu He 145 150 155
TTA AGT TCC AAC GCC ATT AGC GCG CTT AAG ATT GAA GGG CGC ACC AAG 590 Leu Ser Ser Asn Ala He Ser Ala Leu Lys He Glu Gly Arg Thr Lys 160 165 170
TCC AGT TAC TAC GCC GCG CAA ACC ACG CGC ATC TAT CGT TTA GCG GTT 638 Ser Ser Tyr Tyr Ala Ala Gin Thr Thr Arg He Tyr Arg Leu Ala Val 175 180 185
GAT GAT TTT TAC CAT AAC ACC TTA AAG CCG AGT TTT TAT GCC AGC GAA 686 Asp Asp Phe Tyr His Asn Thr Leu Lys Pro Ser Phe Tyr Ala Ser Glu 190 195 200 205
TTG AAC ACG CTT AAA AAC AGG GGT TTT ACG GAC GGC TAT TTG ATG CGA 734 Leu Asn Thr Leu Lys Asn Arg Gly Phe Thr Asp Gly Tyr Leu Met Arg 210 215 220
AGG CCT TTT GAA AGG TTG GAT ACT CAA AAC CAC CAA ACA GCC ATT AGC 782 Arg Pro Phe Glu Arg Leu Asp Thr Gin Asn His Gin Thr Ala He Ser 225 230 235
GAA GGG GAT TTT CAA GTC AAT GGC GAA ATA ACC GAA GAC GGG CGT TTT 830 Glu Gly Asp Phe Gin Val Asn Gly Glu He Thr Glu Asp Gly Arg Phe 240 245 250
TTT GCA TGC AAA TTC ACC ACT ACC ACT AAC ACC GCT TAT GAA ATC ATC 878 Phe Ala Cys Lys Phe Thr Thr Thr Thr Asn Thr Ala Tyr Glu He He 255 260 265
GCT CCC AAA AAT GCG GCT ATC ACG CCC ATA GTC AAT GAA ATT GGC AAG 926 Ala Pro Lys Asn Ala Ala He Thr Pro He Val Asn Glu He Gly Lys 270 275 280 285
ATT TAC ACC TTT GAA AAA CGC TCT TAT TTA GTG CTG TAT AAA ATC CTT 974 He Tyr Thr Phe Glu Lys Arg Ser Tyr Leu Val Leu Tyr Lys He Leu 290 295 300
TTA GAA AAT AAC ACC GAG CTA GAA ACT ATC CAT AGC GGG AAC GTG AAT 1022 Leu Glu Asn Asn Thr Glu Leu Glu Thr He His Ser Gly Asn Val Asn 305 310 315
TTA GTG CGA CTG CCC GCA CCC TTA CCG GCT TTT AGT TTT TTA CGC ACC 1070 Leu Val Arg Leu Pro Ala Pro Leu Pro Ala Phe Ser Phe Leu Arg Thr 320 325 330
CAA GTC AGA GTC TAAAAATGGC GTTTAGAGAT TAGGTATTGA AAATGATTAA GAGAA 1127 Gin Val Arg Val 335
ACGCATG 1134
(2) INFORMATION FOR SEQ ID NO: 1222:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 337 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1222:
Met Ala Glu Leu Glu Pro Asp Ala Phe He He Ala Ala Pro Gly Val
1 5 10 15
Val Lys Leu Ala Leu Lys He Ala Pro His He Pro He His Leu Ser
20 25 30
Thr Gin Ala Asn Val Leu Asn Leu Leu Asp Ala Gin Val Phe Tyr Asp
35 40 45
Leu Gly Val Lys Arg He Val Cys Ala Arg Glu Leu Ser Leu Asn Asp
50 55 60
Ala He Glu He Lys Lys Ala Leu Pro Asn Leu Glu Leu Glu He Phe 65 70 75 80
Val His Gly Ser Met Cys Phe Ala Phe Ser Gly Arg Cys Leu He Ser
85 90 95
Ala Leu Gin Lys Gly Arg Val Pro Asn Arg Gly Ser Cys Ala Asn Asp
100 105 110
Cys Arg Phe Asp Tyr Glu Tyr Tyr Val Lys Asn Pro Asp Asn Gly Val
115 120 125
Met Met Arg Leu Val Glu Glu Glu Gly Val Gly Thr His He Phe Asn
130 135 140
Ala Lys Asp Leu Asn Leu Ser Gly His He Ala Glu He Leu Ser Ser 145 150 155 160
Asn Ala He Ser Ala Leu Lys He Glu Gly Arg Thr Lys Ser Ser Tyr
165 170 175
Tyr Ala Ala Gin Thr Thr Arg He Tyr Arg Leu Ala Val Asp Asp Phe
180 185 190
Tyr His Asn Thr Leu Lys Pro Ser Phe Tyr Ala Ser Glu Leu Asn Thr 195 200 205 Leu Lys Asn Arg Gly Phe Thr Asp Gly Tyr Leu Met Arg Arg Pro Phe
210 215 220
Glu Arg Leu Asp Thr Gin Asn His Gin Thr Ala He Ser Glu Gly Asp 225 230 235 240
Phe Gin Val Asn Gly Glu He Thr Glu Asp Gly Arg Phe Phe Ala Cys
245 250 255
Lys Phe Thr Thr Thr Thr Asn Thr Ala Tyr Glu He He Ala Pro Lys
260 265 270
Asn Ala Ala He Thr Pro He Val Asn Glu He Gly Lys He Tyr Thr
275 280 285
Phe Glu Lys Arg Ser Tyr Leu Val Leu Tyr Lys He Leu Leu Glu Asn
290 295 300
Asn Thr Glu Leu Glu Thr He His Ser Gly Asn Val Asn Leu Val Arg 305 310 315 320
Leu Pro Ala Pro Leu Pro Ala Phe Ser Phe Leu Arg Thr Gin Val Arg
325 330 335
Val
(2) INFORMATION FOR SEQ ID NO:1223
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1123 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...1038 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1223:
ATTAAGGAGA AATAAGAA ATG TTA CAA CCC CCT AAA ATT GTC GCT GAA TTG 51
Met Leu Gin Pro Pro Lys He Val Ala Glu Leu 1 5 10
AGC GCT AAT CAT AAC CAG GAT TTA AAC CTA GCC AAA GAA AGC CTT CAT 99 Ser Ala Asn His Asn Gin Asp Leu Asn Leu Ala Lys Glu Ser Leu His 15 20 25
GCC ATT AAG GAA AGC GGT GCG GAT TTT GTC AAG CTC CAA ACC TAC ACG 147 Ala He Lys Glu Ser Gly Ala Asp Phe Val Lys Leu Gin Thr Tyr Thr 30 35 40
CCA AGC TGC ATG ACT TTA AAC TCT AAA GAA GAT CCT TTC ATC ATT CAA 195 Pro Ser Cys Met Thr Leu Asn Ser Lys Glu Asp Pro Phe He He Gin 45 50 55
GGC ACT TTA TGG GAT AAA GAA AAT TTG TAT GAA TTG TAT CAA AAG GCT 243 Gly Thr Leu Trp Asp Lys Glu Asn Leu Tyr Glu Leu Tyr Gin Lys Ala 60 65 70 75
TCT ACC CCC CTA GAA TGG CAT GCT GAA TTG TTT GAG TTG GCT AGA AAG 291 Ser Thr Pro Leu Glu Trp His Ala Glu Leu Phe Glu Leu Ala Arg Lys 80 85 90
CTT GAT TTA GGC ATT TTT AGC TCG CCT TTT AGT TCA CAA GCT TTA GAG 339 Leu Asp Leu Gly He Phe Ser Ser Pro Phe Ser Ser Gin Ala Leu Glu 95 100 105
CTT TTA GAG AGC CTA AAT TGC CCC ATG TAT AAA ATC GCT AGT TTT GAA 387 Leu Leu Glu Ser Leu Asn Cys Pro Met Tyr Lys He Ala Ser Phe Glu 110 115 120
ATC GTT GAT TTG GAC TTG ATT GAA AAG GCC GCT CGC ACA CAA AAG CCC 435 He Val Asp Leu Asp Leu He Glu Lys Ala Ala Arg Thr Gin Lys Pro 125 130 135
ATT ATC CTT TCT AGC GGT ATC GCT ACA CAC ACC GAA TTG CAA GAC GCT 483 He He Leu Ser Ser Gly He Ala Thr His Thr Glu Leu Gin Asp Ala 140 145 150 155
ATC TCA TTG TGC AGA AGA GTG AAT AAT TTT GAC ATC ACC CTT TTA AAA 531 He Ser Leu Cys Arg Arg Val Asn Asn Phe Asp He Thr Leu Leu Lys 160 165 170
TGC GTG AGC GCT TAT CCC AGT AAA ATA GAA GAC GCT AAC TTG TTG AGC 579 Cys Val Ser Ala Tyr Pro Ser Lys He Glu Asp Ala Asn Leu Leu Ser 175 180 185
ATG GTT AAA TTA GGC GAA ATC TTT GGC GTT AAA TTT GGC TTG AGC GAT 627 Met Val Lys Leu Gly Glu He Phe Gly Val Lys Phe Gly Leu Ser Asp 190 195 200
CAC ACG ATT GGC TCT CTT TGC CCC ATT TTA GCC ACC ACT TTA GGA GCG 675 His Thr He Gly Ser Leu Cys Pro He Leu Ala Thr Thr Leu Gly Ala 205 210 215
AGC ATG ATA GAA AAG CAT TTC ATT TTA AAC AAA TCC TTA CAA ACC CCA 723 Ser Met He Glu Lys His Phe He Leu Asn Lys Ser Leu Gin Thr Pro 220 225 230 235
GAC AGC GCT TTT AGC ATG GAT TTT AAC GGA TTT AAA AGC ATG GTT GAA 771 Asp Ser Ala Phe Ser Met Asp Phe Asn Gly Phe Lys Ser Met Val Glu 240 245 250
GCC ATC AAG CAA AGC GTT TTA GCC TTA GGC GAA GAA GAG CCA AGA ATC 819 Ala He Lys Gin Ser Val Leu Ala Leu Gly Glu Glu Glu Pro Arg He 255 260 265
AAT CCA AAG ACT TTA GAA AAG CGA AGA TTT TTT GCA CGC TCT TTA TTT 867 Asn Pro Lys Thr Leu Glu Lys Arg Arg Phe Phe Ala Arg Ser Leu Phe 270 275 280 GTT ATT AAG GAT ATT CAA AAA GGC GAA GCA TTG ACT GAA AAC AAT ATC 915 Val He Lys Asp He Gin Lys Gly Glu Ala Leu Thr Glu Asn Asn He 285 290 295
AAA GCC TTA CGC CCC AAC CTT GGC TTA CAC CCT AAA TTT TAT AAA GAA 963 Lys Ala Leu Arg Pro Asn Leu Gly Leu His Pro Lys Phe Tyr Lys Glu 300 305 310 315
ATT TTA GGC CAA AAA GCA TCA AAA TTC TTA AAA GCC AAC ACC CCC TTA 1011 He Leu Gly Gin Lys Ala Ser Lys Phe Leu Lys Ala Asn Thr Pro Leu 320 325 330
AGC GCT GAT GAT ATA GAA CGC TCA TTG TAGGTTCGTT TTGATCAAAA AATGGGG 1065 Ser Ala Asp Asp He Glu Arg Ser Leu 335 340
TTTTTAATTT TGTTTTATGG TTTTAGATTT GATTTTAAAC TCATTTTCTT TATTTTAA 1123
(2) INFORMATION FOR SEQ ID NO: 1224:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1224:
Met Leu Gin Pro Pro Lys He Val Ala Glu Leu Ser Ala Asn His Asn
1 5 10 15
Gin Asp Leu Asn Leu Ala Lys Glu Ser Leu His Ala He Lys Glu Ser
20 25 30
Gly Ala Asp Phe Val Lys Leu Gin Thr Tyr Thr Pro Ser Cys Met Thr
35 40 45
Leu Asn Ser Lys Glu Asp Pro Phe He He Gin Gly Thr Leu Trp Asp
50 55 60
Lys Glu Asn Leu Tyr Glu Leu Tyr Gin Lys Ala Ser Thr Pro Leu Glu 65 70 75 80
Trp His Ala Glu Leu Phe Glu Leu Ala Arg Lys Leu Asp Leu Gly He
85 90 95
Phe Ser Ser Pro Phe Ser Ser Gin Ala Leu Glu Leu Leu Glu Ser Leu
100 105 110
Asn Cys Pro Met Tyr Lys He Ala Ser Phe Glu He Val Asp Leu Asp
115 120 125
Leu He Glu Lys Ala Ala Arg Thr Gin Lys Pro He He Leu Ser Ser
130 135 140
Gly He Ala Thr His Thr Glu Leu Gin Asp Ala He Ser Leu Cys Arg 145 150 155 160
Arg Val Asn Asn Phe Asp He Thr Leu Leu Lys Cys Val Ser Ala Tyr
165 170 175
Pro Ser Lys He Glu Asp Ala Asn Leu Leu Ser Met Val Lys Leu Gly
180 185 190
Glu He Phe Gly Val Lys Phe Gly Leu Ser Asp His Thr He Gly Ser 195 200 205
Leu Cys Pro He Leu Ala Thr Thr Leu Gly Ala Ser Met He Glu Lys
210 215 220
His Phe He Leu Asn Lys Ser Leu Gin Thr Pro Asp Ser Ala Phe Ser 225 230 235 240
Met Asp Phe Asn Gly Phe Lys Ser Met Val Glu Ala He Lys Gin Ser
245 250 255
Val Leu Ala Leu Gly Glu Glu Glu Pro Arg He Asn Pro Lys Thr Leu
260 265 270
Glu Lys Arg Arg Phe Phe Ala Arg Ser Leu Phe Val He Lys Asp He
275 280 285
Gin Lys Gly Glu Ala Leu Thr Glu Asn Asn He Lys Ala Leu Arg Pro
290 295 300
Asn Leu Gly Leu His Pro Lys Phe Tyr Lys Glu He Leu Gly Gin Lys 305 310 315 320
Ala Ser Lys Phe Leu Lys Ala Asn Thr Pro Leu Ser Ala Asp Asp He
325 330 335
Glu Arg Ser Leu 340
(2) INFORMATION FOR SEQ ID NO: 1225:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1234 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...1197 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1225:
GTGAATGAGC GTTTGGAGGT GCTGTTGGAA ATG GTT TTG ATG CGG TTT GAA GAG 54
Met Val Leu Met Arg Phe Glu Glu 1 5
CCC GAT CCT GGA AGA GCT ATC AGA ACC TTT CAG AGC GTG AAT GAC AGA 102 Pro Asp Pro Gly Arg Ala He Arg Thr Phe Gin Ser Val Asn Asp Arg 10 15 20
GGC GTG CCT CTC CTC TTG CTA GAC AAA CTA AAA TCC CTT CTC ATC TAT 15.0 Gly Val Pro Leu Leu Leu Leu Asp Lys Leu Lys Ser Leu Leu He Tyr 25 30 35 40
TAC TCC AAC ATT TTT TGC GAT GGG AAA AGG GGG CTA GAC CAA TTT ATC 198 Tyr Ser Asn He Phe Cys Asp Gly Lys Arg Gly Leu Asp Gin Phe He 45 50 55 ATC GAT CAT TTT GGG GAG ATC TTT AAG ATC TTT GCC AAG ATT AAA AAG 246 He Asp His Phe Gly Glu He Phe Lys He Phe Ala Lys He Lys Lys 60 65 70
AGC GAC CAC ATC TCC AGC GTT GGA GGC TTT GAT GAA GGC GAT ATC TTC 294 Ser Asp His He Ser Ser Val Gly Gly Phe Asp Glu Gly Asp He Phe 75 80 85
CGC TAC CAC GCA GGG AGC CAA AAA TTT GAT GGA ATC GAG TTT TTA GGG 342 Arg Tyr His Ala Gly Ser Gin Lys Phe Asp Gly He Glu Phe Leu Gly 90 95 100
CAC TAC GAA GCA AGC ACG GAC AAA ACC TAC GAG AAA CTC AAA GAT GAA 390 His Tyr Glu Ala Ser Thr Asp Lys Thr Tyr Glu Lys Leu Lys Asp Glu 105 110 115 120
CTA AAA AAA ATC AAA AAA AGC AAA TTG AAA AGT TTC ATC CAA TCC TAT 438 Leu Lys Lys He Lys Lys Ser Lys Leu Lys Ser Phe He Gin Ser Tyr 125 130 135
GTC AGC GAT TTG AAA AAT TTC TAT CAG GCT TTT CTT GAT CTA TTG AGC 486 Val Ser Asp Leu Lys Asn Phe Tyr Gin Ala Phe Leu Asp Leu Leu Ser 140 145 150
GAG ATT GAC ACC AAC CCA ACC ACC TTT AAG GTC ATG CTC ATC AAC AAG 534 Glu He Asp Thr Asn Pro Thr Thr Phe Lys Val Met Leu He Asn Lys 155 160 165
ATC GAC TCG TCT TTT TTC AAT TCG CTC ATC CGC CTG AAA ATC AAC AAC 582 He Asp Ser Ser Phe Phe Asn Ser Leu He Arg Leu Lys He Asn Asn 170 175 180
GAA CTA GAC GAT GAA ACG CTG AAA CTC TTT GCC AAA ACC GAT ATT GTG 630 Glu Leu Asp Asp Glu Thr Leu Lys Leu Phe Ala Lys Thr Asp He Val 185 190 195 200
CTT TTC AAA GCT ACT AGA GAT AGG CCA GGA ACG GAC AAC CTG ATT AAT 678 Leu Phe Lys Ala Thr Arg Asp Arg Pro Gly Thr Asp Asn Leu He Asn 205 210 215
GCG TAT CTT AAA AAG GGC AAA GAG GGA TTG AAG AGC GAG ATG ATT GCT 726 Ala Tyr Leu Lys Lys Gly Lys Glu Gly Leu Lys Ser Glu Met He Ala 220 225 230
CAA TGC AGA AAT GAT ATA GGG CTG GCT TTT TGG CAG TCT GTA AAC AAC 774 Gin Cys Arg Asn Asp He Gly Leu Ala Phe Trp Gin Ser Val Asn Asn 235 240 245
GCA TCC AAC TCA TCA TGC TTC CAC TAT ATC TTC TTT GAA AAG AAC TGC 822 Ala Ser Asn Ser Ser Cys Phe His Tyr He Phe Phe Glu Lys Asn Cys 250 255 260
CAG GAG ATG GGT CTT GCC GAT CTC AAA AAA TTG ATC CCT AGG AAG CAA 870 Gin Glu Met Gly Leu Ala Asp Leu Lys Lys Leu He Pro Arg Lys Gin 265 270 275 280 TTC TCC CAA GAA AAA GAA CAC ATC ATC CCC ATC AAT TTA TTA AAA CAG 918 Phe Ser Gin Glu Lys Glu His He He Pro He Asn Leu Leu Lys Gin 285 290 295
GAA TCC AAC AAT AAG ATC AGA GAT CTT GGT TTT GAA GAC AAA AAA GAT 966 Glu Ser Asn Asn Lys He Arg Asp Leu Gly Phe Glu Asp Lys Lys Asp 300 305 310
CTT GAA GAC TAC ATT GAC ACA TAC GGC AAC CTC ATC TCC CTG GAA AAA 1014 Leu Glu Asp Tyr He Asp Thr Tyr Gly Asn Leu He Ser Leu Glu Lys 315 320 325
TCG CTC AAT CGT AAG GCA AGC GAT AAG GAT CTG TAT GGA AAA GAT GAA 1062 Ser Leu Asn Arg Lys Ala Ser Asp Lys Asp Leu Tyr Gly Lys Asp Glu 330 335 340
ATC TAT AAA AGT AGT GAG ATC CCT TTC AAC AGG CGC TTT GAT ACA AAA 1110 He Tyr Lys Ser Ser Glu He Pro Phe Asn Arg Arg Phe Asp Thr Lys 345 350 355 360
AAC TTC AAT AAG AAG GCA TTG GTA AAA AGA AAT GAA GAA ATG CGA GAA 1158 Asn Phe Asn Lys Lys Ala Leu Val Lys Arg Asn Glu Glu Met Arg Glu 365 370 375
TGG CTG ATC GAC ACC TTT TTT AAG GAT TTC GCC GCC CAC TAAAGAGAGT GA 1209 Trp Leu He Asp Thr Phe Phe Lys Asp Phe Ala Ala His 380 385
GATTAAAAGA GAGTGATCGC ACTCA 1234
(2) INFORMATION FOR SEQ ID NO: 1226:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 389 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1226:
Met Val Leu Met Arg Phe Glu Glu Pro Asp Pro Gly Arg Ala He Arg
1 5 10 15
Thr Phe Gin Ser Val Asn Asp Arg Gly Val Pro Leu Leu Leu Leu Asp
20 25 30
Lys Leu Lys Ser Leu Leu He Tyr Tyr Ser Asn He Phe Cys Asp Gly
35 40 45
Lys Arg Gly Leu Asp Gin Phe He He Asp His Phe Gly Glu He Phe
50 55 60
Lys He Phe Ala Lys He Lys Lys Ser Asp His He Ser Ser Val Gly 65 70 75 80
Gly Phe Asp Glu Gly Asp He Phe Arg Tyr His Ala Gly Ser Gin Lys
85 90 95
Phe Asp Gly He Glu Phe Leu Gly His Tyr Glu Ala Ser Thr Asp Lys 100 105 110
Thr Tyr Glu Lys Leu Lys Asp Glu Leu Lys Lys He Lys Lys Ser Lys
115 120 125
Leu Lys Ser Phe He Gin Ser Tyr Val Ser Asp Leu Lys Asn Phe Tyr
130 135 140
Gin Ala Phe Leu Asp Leu Leu Ser Glu He Asp Thr Asn Pro Thr Thr 145 150 155 160
Phe Lys Val Met Leu He Asn Lys He Asp Ser Ser Phe Phe Asn Ser
165 170 175
Leu He Arg Leu Lys He Asn Asn Glu Leu Asp Asp Glu Thr Leu Lys
180 185 190
Leu Phe Ala Lys Thr Asp He Val Leu Phe Lys Ala Thr Arg Asp Arg
195 200 205
Pro Gly Thr Asp Asn Leu He Asn Ala Tyr Leu Lys Lys' Gly Lys Glu
210 215 220
Gly Leu Lys Ser Glu Met He Ala Gin Cys Arg Asn Asp He Gly Leu 225 230 235 240
Ala Phe Trp Gin Ser Val Asn Asn Ala Ser Asn Ser Ser Cys Phe His
245 250 255
Tyr He Phe Phe Glu Lys Asn Cys Gin Glu Met Gly Leu Ala Asp Leu
260 265 270
Lys Lys Leu He Pro Arg Lys Gin Phe Ser Gin Glu Lys Glu His He
275 280 285
He Pro He Asn Leu Leu Lys Gin Glu Ser Asn Asn Lys He Arg Asp
290 295 300
Leu Gly Phe Glu Asp Lys Lys Asp Leu Glu Asp Tyr He Asp Thr Tyr 305 310 315 320
Gly Asn Leu He Ser Leu Glu Lys Ser Leu Asn Arg Lys Ala Ser Asp
325 330 335
Lys Asp Leu Tyr Gly Lys Asp Glu He Tyr Lys Ser Ser Glu He Pro
340 345 350
Phe Asn Arg Arg Phe Asp Thr Lys Asn Phe Asn Lys Lys Ala Leu Val
355 360 365
Lys Arg Asn Glu Glu Met Arg Glu Trp Leu He Asp Thr Phe Phe Lys
370 375 380
Asp Phe Ala Ala His 385
(2) INFORMATION FOR SEQ ID NO: 1227:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 889 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...840 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1227:
TTATAAAGGA AAATC ATG GGA TTT TTA AAA GGT AAA AAA GGG CTT ATT GTA 51 Met Gly Phe Leu Lys Gly Lys Lys Gly Leu He Val 1 5 10
GGG GTG GCG AAC AAT AAA TCC ATC GCT TAT GGG ATC GCT CAA TCT TGT 99 Gly Val Ala Asn Asn Lys Ser He Ala Tyr Gly He Ala Gin Ser Cys 15 20 25
TTC AAT CAA GGG GCT ACT TTG GCT TTC ACT TAT TTG AAT GAG AGT TTA 147 Phe Asn Gin Gly Ala Thr Leu Ala Phe Thr Tyr Leu Asn Glu Ser Leu 30 35 40
GAA AAG CGC GTA AGG CCT ATC GCG CAG GAA TTG AAT AGC CCC TAT GTG 195 Glu Lys Arg Val Arg Pro He Ala Gin Glu Leu Asn Ser Pro Tyr Val 45 50 55 60
TAT GAA TTG GAT GTG AGC AAA GAA GAG CAT TTC AAG TCG CTA TAC AAT 243 Tyr Glu Leu Asp Val Ser Lys Glu Glu His Phe Lys Ser Leu Tyr Asn 65 70 75
AGC GTT AAA AAG GAT TTA GGC TCA TTG GAT TTT ATT GTT CAT AGC GTG 291 Ser Val Lys Lys Asp Leu Gly Ser Leu Asp Phe He Val His Ser Val 80 85 90
GCC TTT GCC CCT AAA GAG GCT TTA GAG GGG AGC TTG TTG GAA ACT TCT 339 Ala Phe Ala Pro Lys Glu Ala Leu Glu Gly Ser Leu Leu Glu Thr Ser 95 100 105
AAA AGC GCG TTT AAC ACC GCT ATG GAA ATT TCT GTT TAT TCT TTA ATA 387 Lys Ser Ala Phe Asn Thr Ala Met Glu He Ser Val Tyr Ser Leu He 110 115 120
GAG CTG ACA AAC ACC CTA AAA CCT TTA TTG AAT AAC GGA GCG TCT GTT 435 Glu Leu Thr Asn Thr Leu Lys Pro Leu Leu Asn Asn Gly Ala Ser Val 125 130 135 140
TTG ACT CTA AGC TAT TTG GGT AGC ACC AAA TAC ATG GCG CAT TAC AAT 483 Leu Thr Leu Ser Tyr Leu Gly Ser Thr Lys Tyr Met Ala His Tyr Asn 145 150 155
GTG ATG GGG TTG GCT AAA GCG GCC CTA GAG AGT GCG GTG CGT TAT TTA 531 Val Met Gly Leu Ala Lys Ala Ala Leu Glu Ser Ala Val Arg Tyr Leu 160 165 170
GCG GTG GAT TTA GGC AAA CAC CAT ATA AGA GTG AAT GCC CTA TCG GCC 579 Ala Val Asp Leu Gly Lys His His He Arg Val Asn Ala Leu Ser Ala 175 180 185
GGG CCT ATT AGG ACG CTC GCT TCT AGC GGG ATC GCT GAT TTT AGA ATG 627 Gly Pro He Arg Thr Leu Ala Ser Ser Gly He Ala Asp Phe Arg Met 190 195 200
ATT TTA AAA TGG AAT GAA ATC AAC GCC CCT TTA AGA AAA AAT GTG AGT 675 He Leu Lys Trp Asn Glu He Asn Ala Pro Leu Arg Lys Asn Val Ser 205 210 215 220
TTA GAA GAA GTG GGC AAT GCC GGG ATG TAT TTG CTC TCT AGT TTG TCT 723 Leu Glu Glu Val Gly Asn Ala Gly Met Tyr Leu Leu Ser Ser Leu Ser 225 230 235
AGC GGG GTG AGT GGG GAA GTG CAT TTT GTG GAT GCT GGC TAT CAT GTT 771 Ser Gly Val Ser Gly Glu Val His Phe Val Asp Ala Gly Tyr His Val 240 245 250
ATG GGC ATG GGG GCT GTG GAA GAA AAA GAT AAT AAA GCT ACG CTA CTG 819 Met Gly Met Gly Ala Val Glu Glu Lys Asp Asn Lys Ala Thr Leu Leu 255 260 265
TGG GAT TTG CAT AAA GAA CAA TAAGGGGTAT TGATGAAATT AAGCGAATTG TTAA 874 Trp Asp Leu His Lys Glu Gin 270 275
ACGCCTATTC TATTG 889
(2) INFORMATION FOR SEQ ID NO: 1228:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 275 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1228:
Met Gly Phe Leu Lys Gly Lys Lys Gly Leu He Val Gly Val Ala Asn
1 5 10 15
Asn Lys Ser He Ala Tyr Gly He Ala Gin Ser Cys Phe Asn Gin Gly
20 25 30
Ala Thr Leu Ala Phe Thr Tyr Leu Asn Glu Ser Leu Glu Lys Arg Val
35 40 45
Arg Pro He Ala Gin Glu Leu Asn Ser Pro Tyr Val Tyr Glu Leu Asp
50 55 60
Val Ser Lys Glu Glu His Phe Lys Ser Leu Tyr Asn Ser Val Lys Lys 65 70 75 80
Asp Leu Gly Ser Leu Asp Phe He Val His Ser Val Ala Phe Ala Pro
85 90 95
Lys Glu Ala Leu Glu Gly Ser Leu Leu Glu Thr Ser Lys Ser Ala Phe
100 105 110
Asn Thr Ala Met Glu He Ser Val Tyr Ser Leu He Glu Leu Thr Asn
115 120 125
Thr Leu Lys Pro Leu Leu Asn Asn Gly Ala Ser Val Leu Thr Leu Ser
130 135 140
Tyr Leu Gly Ser Thr Lys Tyr Met Ala His Tyr Asn Val Met Gly Leu 145 150 155 160
Ala Lys Ala Ala Leu Glu Ser Ala Val Arg Tyr Leu Ala Val Asp Leu 165 170 175 Gly Lys His His He Arg Val Asn Ala Leu Ser Ala Gly Pro He Arg
180 185 190
Thr Leu Ala Ser Ser Gly He Ala Asp Phe Arg Met He Leu Lys Trp
195 200 205
Asn Glu He Asn Ala Pro Leu Arg Lys Asn Val Ser Leu Glu Glu Val
210 215 220
Gly Asn Ala Gly Met Tyr Leu Leu Ser Ser Leu Ser Ser Gly Val Ser 225 230 235 240
Gly Glu Val His Phe Val Asp Ala Gly Tyr His Val Met Gly Met Gly
245 250 255
Ala Val Glu Glu Lys Asp Asn Lys Ala Thr Leu Leu Trp Asp Leu His
260 265 270
Lys Glu Gin 275
(2) INFORMATION FOR SEQ ID NO: 1229:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1760 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 33...1688 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1229:
TGGAGGTTAG TAATTTTAAA GGGTAAAATA AA ATG GAA AAT CAT TCG CAT GCC 53
Met Glu Asn His Ser His Ala
1 5
AAT ACG CAT ACC GAT ACG CGC ACC GAT GAT AAA AGC ACT AAG ATC GTG 101 Asn Thr His Thr Asp Thr Arg Thr Asp Asp Lys Ser Thr Lys He Val 10 15 20
CGC TTG TTG GGG TTA ATA GGG GGA GCG TTA ATC GCG CTT GTT ATC TAC 149 Arg Leu Leu Gly Leu He Gly Gly Ala Leu He Ala Leu Val He Tyr 25 30 35
TAT GCG CTC AAT TCT CAA ATG CCT CAT ATT GTA GAA GAA ATC CCC AAG 197 Tyr Ala Leu Asn Ser Gin Met Pro His He Val Glu Glu He Pro Lys 40 45 50 55
CTC AGT TCT TTG AAT TAT AAG GCG ATG CCT GTT GTG GCA GGG GTG GCT 245 Leu Ser Ser Leu Asn Tyr Lys Ala Met Pro Val Val Ala Gly Val Ala 60 65 70
GTT TTA ATG GGG ATA TGG TGG ATG ACT GAA GCC ATT GAC TTG CCC GCA 293 Val Leu Met Gly He Trp Trp Met Thr Glu Ala He Asp Leu Pro Ala 75 80 85
ACC GCG CTT TTA CCT TTG GTG CTT TTT AGC GTC TTT AGC GTG GAT CAA 341 Thr Ala Leu Leu Pro Leu Val Leu Phe Ser Val Phe Ser Val Asp Gin 90 95 100
TTC GCT AGC GTC AGC TCT TCT TAC GCA TCG CCG ATC ATC TTT CTT TTT 389 Phe Ala Ser Val Ser Ser Ser Tyr Ala Ser Pro He He Phe Leu Phe 105 110 115
ATG GGA GGG TTT ATT TTA GCC CTA AGC ATG CAA AAA TGG AAT TTG CAC 437 Met Gly Gly Phe He Leu Ala Leu Ser Met Gin Lys Trp Asn Leu His 120 125 130 135
ACG CGC ATC GCT TTA AGC ATT ATT TTA TTA GTA GGC ACA AGC CCT AGG 485 Thr Arg He Ala Leu Ser He He Leu Leu Val Gly Thr Ser Pro Arg 140 145 150
AGG TTG ATT TTA GGT TTC ATG ATG GCT ACA GGC TTT CTG TCT ATG TGG 533 Arg Leu He Leu Gly Phe Met Met Ala Thr Gly Phe Leu Ser Met Trp 155 160 165
GTG AGC AAT ACC GCA ACG GCG GTG ATG ATG CTC CCT GTT GGC ATG AGC 581 Val Ser Asn Thr Ala Thr Ala Val Met Met Leu Pro Val Gly Met Ser 170 175 180
GTT TTG CAA TTA GTC GCT AAA CTG GTG GGC AAA GAA GAC GCC TCT AAT 629 Val Leu Gin Leu Val Ala Lys Leu Val Gly Lys Glu Asp Ala Ser Asn 185 190 195
TCA TGG CAT CAA AAA GAA GAA ATC ACC AAA GCG CAT GGG GGT ATT ATG 677 Ser Trp His Gin Lys Glu Glu He Thr Lys Ala His Gly Gly He Met 200 205 210 215
AGT AAT ATC GTG CAT AAG GGT AAA GAT ATT ACT CAA GTC ATT CAA GAA 725 Ser Asn He Val His Lys Gly Lys Asp He Thr Gin Val He Gin Glu 220 225 230
AAG ACT ACT ATC TAT CGC ACG AAT TTC AGT ATT TGC TTG ATG CTT GGC 773 Lys Thr Thr He Tyr Arg Thr Asn Phe Ser He Cys Leu Met Leu Gly 235 240 245
ATC GCT TAT GCG GCT TCT ATT GGC TCT TTA GGC ACT TTG ATT GGC ACG 821 He Ala Tyr Ala Ala Ser He Gly Ser Leu Gly Thr Leu He Gly Thr 250 255 260
CCG CCT AAC GCT TTA TTG GCC GGC TAT ATG AAA ACC GCT TTC AAT ATT 869 Pro Pro Asn Ala Leu Leu Ala Gly Tyr Met Lys Thr Ala Phe Asn He 265 270 275
GAA ATT GAT TTC GCT CAG TGG ATG GTG TTT GGG ACG CCG TTA GCC TTT 917 Glu He Asp Phe Ala Gin Trp Met Val Phe Gly Thr Pro Leu Ala Phe 280 285 290 295 ATC ATG CTC ATT TTA GCG TGG CTC TTG CTC ACT TAT GTG ATT TTC CCT 965 He Met Leu He Leu Ala Trp Leu Leu Leu Thr Tyr Val He Phe Pro 300 305 310
TTA AAG ATT AAA GAA ATC CCA GGG GGT AAG GAA GTC ATT AGG GTA GAG 1013 Leu Lys He Lys Glu He Pro Gly Gly Lys Glu Val He Arg Val Glu 315 320 325
TTA AAA AAA TTA GGC CGT TTG AGT CAG GCG GAA ATC TCT GTG GGG ATT 1061 Leu Lys Lys Leu Gly Arg Leu Ser Gin Ala Glu He Ser Val Gly He 330 335 340
ATT TTT ATT TTA GCG TCT TTA GGG TGG ATT TTT TTA GGC GTA ATG TTA 1109 He Phe He Leu Ala Ser Leu Gly Trp He Phe Leu Gly Val Met Leu 345 350 355
AAA TCT TGG GGC GTT AAG ATA GAT AAA ATT GAT TCA GTG ATC GCT ATG 1157 Lys Ser Trp Gly Val Lys He Asp Lys He Asp Ser Val He Ala Met 360 365 370 375
GGG GTT TCT GCG CTT TTA TTC ATT TTG CCC GCT AAC CAT CAG GGC GAT 1205 Gly Val Ser Ala Leu Leu Phe He Leu Pro Ala Asn His Gin Gly Asp 380 385 390
AGG CTC ATT GAT TGG GGT GTT GCT AAA AAA CTC CCT TGG GAT GTG TTG 1253 Arg Leu He Asp Trp Gly Val Ala Lys Lys Leu Pro Trp Asp Val Leu 395 400 405
CTT TTA TTT GGC GGC GGG TTA GCC TTG AGC GCG CAA TTT TCT AAA ACC 1301 Leu Leu Phe Gly Gly Gly Leu Ala Leu Ser Ala Gin Phe Ser Lys Thr 410 415 420
GGG TTG AGT TTG TGG ATC GGG CAT TTA GTC TCT GGC TTT TCG CAT TTA 1349 Gly Leu Ser Leu Trp He Gly His Leu Val Ser Gly Phe Ser His Leu 425 430 435
CCG ATT TTA TTC ATC ATT GTC ATG GTT ACT TTA ATG GTC ATT TTC TTA 1397 Pro He Leu Phe He He Val Met Val Thr Leu Met Val He Phe Leu 440 445 450 455
ACC GAA ATC ACT TCT AAC ACC GCC ACC GCT GCC GCA TTT TTA CCG GTG 1445 Thr Glu He Thr Ser Asn Thr Ala Thr Ala Ala Ala Phe Leu Pro Val 460 465 470
ATT GGA GGG GTT GCG ATG GGC ATG GGT TAT GAA AAC CAT CAG AGC TTG 1493 He Gly Gly Val Ala Met Gly Met Gly Tyr Glu Asn His Gin Ser Leu 475 480 485
TTA TTG ACC ATT CCT GTA GCC TTG AGT GCG ACT TGC GCG TTC ATG CTC 1541 Leu Leu Thr He Pro Val Ala Leu Ser Ala Thr Cys Ala Phe Met Leu 490 495 500
CCT GTG GTC ACC CCA CCG AAT GCA ATA GCT TAT GGC TCT GGG TAT GTT 1589 Pro Val Val Thr Pro Pro Asn Ala He Ala Tyr Gly Ser Gly Tyr Val 505 510 515 AAA ATA ACG GAC ATG ATT AAA GCC GGT TTG TGG CTT AAT CTG GTA GGA 1637
Lys He Thr Asp Met He Lys Ala Gly Leu Trp Leu Asn Leu Val Gly
520 525 530 535
GTT GTT TTG ATT AGC ACG TTT AGC TAT TTT TTG GTT TCG TTA ATA TTT 1685
Val Val Leu He Ser Thr Phe Ser Tyr Phe Leu Val Ser Leu He Phe
540 545 550
AAT TGATTAAGGA AAAAAGTGAA AGAAGAGTTA TTTAAAGAAA AATCTCGTTA CATTAC 1744 Asn
AGGGTTTGTT TTAATC 1760
(2) INFORMATION FOR SEQ ID NO: 1230:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 552 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1230:
Met Glu Asn His Ser His Ala Asn Thr His Thr Asp Thr Arg Thr Asp
1 5 10 15
Asp Lys Ser Thr Lys He Val Arg Leu Leu Gly Leu He Gly Gly Ala
20 25 30
Leu He Ala Leu Val He Tyr Tyr Ala Leu Asn Ser Gin Met Pro His
35 40 45
He Val Glu Glu He Pro Lys Leu Ser Ser Leu Asn Tyr Lys Ala Met
50 55 60
Pro Val Val Ala Gly Val Ala Val Leu Met Gly He Trp Trp Met Thr 65 70 75 80
Glu Ala He Asp Leu Pro Ala Thr Ala Leu Leu Pro Leu Val Leu Phe
85 90 95
Ser Val Phe Ser Val Asp Gin Phe Ala Ser Val Ser Ser Ser Tyr Ala
100 105 110
Ser Pro He He Phe Leu Phe Met Gly Gly Phe He Leu Ala Leu Ser
115 120 125
Met Gin Lys Trp Asn Leu His Thr Arg He Ala Leu Ser He He Leu
130 135 140
Leu Val Gly Thr Ser Pro Arg Arg Leu He Leu Gly Phe Met Met Ala 145 150 155 160
Thr Gly Phe Leu Ser Met Trp Val Ser Asn Thr Ala Thr Ala Val Met
165 170 175
Met Leu Pro Val Gly Met Ser Val Leu Gin Leu Val Ala Lys Leu Val
180 185 190
Gly Lys Glu Asp Ala Ser Asn Ser Trp His Gin Lys Glu Glu He Thr
195 200 205
Lys Ala His Gly Gly He Met Ser Asn He Val His Lys Gly Lys Asp
210 215 220
He Thr Gin Val He Gin Glu Lys Thr Thr He Tyr Arg Thr Asn Phe 225 230 235 240
Ser He Cys Leu Met Leu Gly He Ala Tyr Ala Ala Ser He Gly Ser
245 250 255
Leu Gly Thr Leu He Gly Thr Pro Pro Asn Ala Leu Leu Ala Gly Tyr
260 265 270
Met Lys Thr Ala Phe Asn He Glu He Asp Phe Ala Gin Trp Met Val
275 280 285
Phe Gly Thr Pro Leu Ala Phe He Met Leu He Leu Ala Trp Leu Leu
290 295 300
Leu Thr Tyr Val He Phe Pro Leu Lys He Lys Glu He Pro Gly Gly 305 310 315 320
Lys Glu Val He Arg Val Glu Leu Lys Lys Leu Gly Arg Leu Ser Gin
325 330 335
Ala Glu He Ser Val Gly He He Phe He Leu Ala Ser Leu Gly Trp
340 345 350
He Phe Leu Gly Val Met Leu Lys Ser Trp Gly Val Lys He Asp Lys
355 360 365
He Asp Ser Val He Ala Met Gly Val Ser Ala Leu Leu Phe He Leu
370 375 380
Pro Ala Asn His Gin Gly Asp Arg Leu He Asp Trp Gly Val Ala Lys 385 390 395 400
Lys Leu Pro Trp Asp Val Leu Leu Leu Phe Gly Gly Gly Leu Ala Leu
405 410 415
Ser Ala Gin Phe Ser Lys Thr Gly Leu Ser Leu Trp He Gly His Leu
420 425 430
Val Ser Gly Phe Ser His Leu Pro He Leu Phe He He Val Met Val
435 440 445
Thr Leu Met Val He Phe Leu Thr Glu He Thr Ser Asn Thr Ala Thr
450 455 460
Ala Ala Ala Phe Leu Pro Val He Gly Gly Val Ala Met Gly Met Gly 465 470 475 480
Tyr Glu Asn His Gin Ser Leu Leu Leu Thr He Pro Val Ala Leu Ser
485 490 495
Ala Thr Cys Ala Phe Met Leu Pro Val Val Thr Pro Pro Asn Ala He
500 505 510
Ala Tyr Gly Ser Gly Tyr Val Lys He Thr Asp Met He Lys Ala Gly
515 520 525
Leu Trp Leu Asn Leu Val Gly Val Val Leu He Ser Thr Phe Ser Tyr
530 535 540
Phe Leu Val Ser Leu He Phe Asn 545 550
(2) INFORMATION FOR SEQ ID NO: 1231:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 661 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 53...592 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1231:
GGTGGCGGCG TATTTTAACG GGCGTCCTAT AGAATGCGCT CTTATTAGCG CC ATG GTC 58
Met Val
1
ATG GCT AGT GTT ATC GCT TAT CAA AAA GCG CAC CAT AGC GAA GCC ATT 106 Met Ala Ser Val He Ala Tyr Gin Lys Ala His His Ser Glu Ala He 5 10 15
TTA CCC TTT TTG TAT CCG GGC GTT GGG TTT TTT GCG CTT TTT GGG GTT 154 Leu Pro Phe Leu Tyr Pro Gly Val Gly Phe Phe Ala Leu Phe Gly Val 20 25 30
TAT AAG GAT TTT GGT GCA GTA GCG ATC ATT TGG CTT TTA GTC GTG GTG 202 Tyr Lys Asp Phe Gly Ala Val Ala He He Trp Leu Leu Val Val Val 35 40 45 50
GTT GCA AGC GAT GTG GGG GCG TTT TTT GGA GGC AAG CTT TTA GGC AAA 250 Val Ala Ser Asp Val Gly Ala Phe Phe Gly Gly Lys Leu Leu Gly Lys 55 60 65
ACC CCT TTC ACG CCC ACT TCG CCG AAT AAA ACC TTA GAG GGC GCG TTG 298 Thr Pro Phe Thr Pro Thr Ser Pro Asn Lys Thr Leu Glu Gly Ala Leu 70 75 80
ATT GGC GTG GTT TTG GCG AGC GTT TTA GGA TCG TTT GTG GGC ATG GGG 346 He Gly Val Val Leu Ala Ser Val Leu Gly Ser Phe Val Gly Met Gly 85 90 95
AAA TTG AGC GGA GGC TTT TTT ATG GCG CTC TTT TTT AGT TTT TTA ATC 394 Lys Leu Ser Gly Gly Phe Phe Met Ala Leu Phe Phe Ser Phe Leu He 100 105 110
GCT CTT GTG GCG GTG TTT GGG GAT TTG TAT GAA AGC TAT TTG AAA AGA 442 Ala Leu Val Ala Val Phe Gly Asp Leu Tyr Glu Ser Tyr Leu Lys Arg 115 120 125 130
AAG GTC GGT ATC AAA GAT AGC GGT AAG ATT TTA CCC GGG CAT GGG GGC 490 Lys Val Gly He Lys Asp Ser Gly Lys He Leu Pro Gly His Gly Gly 135 140 145
GTT TTA GAC CGG TTG GAT TCC ATG CTT TTT GGG GCT TTA GGC TTG CAT 538 Val Leu Asp Arg Leu Asp Ser Met Leu Phe Gly Ala Leu Gly Leu His 150 155 160
GCG CTG TTG TAT TTT TTA GAA ATT TGG AAA GAA ACG GCG GTG TTT TTA 586 Ala Leu Leu Tyr Phe Leu Glu He Trp Lys Glu Thr Ala Val Phe Leu 165 170 175
GGG GAT TGAATGGTTG TTTTAGGAAG CACCGGCTCT ATTGGGAAAA ACGCCCTAAA AA 644 Gly Asp 180
TCGCAAAAAA ATTTGGC 661
(2) INFORMATION FOR SEQ ID NO: 1232:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 180 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1232:
Met Val Met Ala Ser Val He Ala Tyr Gin Lys Ala His His Ser Glu
1 5 10 15
Ala He Leu Pro Phe Leu Tyr Pro Gly Val Gly Phe Phe Ala Leu Phe
20 25 30
Gly Val Tyr Lys Asp Phe Gly Ala Val Ala He He Trp Leu Leu Val
35 40 45
Val Val Val Ala Ser Asp Val Gly Ala Phe Phe Gly Gly Lys Leu Leu
50 55 60
Gly Lys Thr Pro Phe Thr Pro Thr Ser Pro Asn Lys Thr Leu Glu Gly 65 70 75 80
Ala Leu He Gly Val Val Leu Ala Ser Val Leu Gly Ser Phe Val Gly
85 90 95
Met Gly Lys Leu Ser Gly Gly Phe Phe Met Ala Leu Phe Phe Ser Phe
100 105 110
Leu He Ala Leu Val Ala Val Phe Gly Asp Leu Tyr Glu Ser Tyr Leu
115 120 125
Lys Arg Lys Val Gly He Lys Asp Ser Gly Lys He Leu Pro Gly His
130 135 140
Gly Gly Val Leu Asp Arg Leu Asp Ser Met Leu Phe Gly Ala Leu Gly 145 150 155 160
Leu His Ala Leu Leu Tyr Phe Leu Glu He Trp Lys Glu Thr Ala Val
165 170 175
Phe Leu Gly Asp 180
(2) INFORMATION FOR SEQ ID NO: 1233:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1157 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 12...1115 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1233:
TAGGGGATTG A ATG GTT GTT TTA GGA AGC ACC GGC TCT ATT GGG AAA AAC 50 Met Val Val Leu Gly Ser Thr Gly Ser He Gly Lys Asn 1 5 10
GCC CTA AAA ATC GCA AAA AAA TTT GGC ATA GAA ATA GAG GCC TTA AGC 98 Ala Leu Lys He Ala Lys Lys Phe Gly He Glu He Glu Ala Leu Ser 15 20 25
TGT GGG AAA AAT ATC GCT TTA ATC AAT GAA CAA ATC CAA GTT TTC AAA 146 Cys Gly Lys Asn He Ala Leu He Asn Glu Gin He Gin Val Phe Lys 30 35 40 45
CCC AAG AAA GTG GCG ATT TTA GAT CCT AGC GAT TTG AAT GAT TTA GAG 194 Pro Lys Lys Val Ala He Leu Asp Pro Ser Asp Leu Asn Asp Leu Glu 50 55 60
CCT TTG GGT GCG GAA GTG TTT GTG GGG TTA GAG GGC ATT GAT GCG ATG 242 Pro Leu Gly Ala Glu Val Phe Val Gly Leu Glu Gly He Asp Ala Met 65 70 75
ATA GAA GAG TGC ACC TCA AAT TTA GTC CTT AAC GCC ATT GTG GGC GTG 290 He Glu Glu Cys Thr Ser Asn Leu Val Leu Asn Ala He Val Gly Val 80 85 90
GCA GGA TTG AAA GCG AGC TTT AAA AGC TTA CAA AGG AAT AAA AAA CTG 338 Ala Gly Leu Lys Ala Ser Phe Lys Ser Leu Gin Arg Asn Lys Lys Leu 95 100 105
GCC CTA GCG AAT AAA GAA AGC TTA GTG AGC GCG GGG CAT TTA TTA GAC 386 Ala Leu Ala Asn Lys Glu Ser Leu Val Ser Ala Gly His Leu Leu Asp 110 115 120 125
ATT TCA CAA ATC ACG CCC ATT GAT AGC GAG CAT TTT GGT TTG TGG GCG 434 He Ser Gin He Thr Pro He Asp Ser Glu His Phe Gly Leu Trp Ala 130 135 140
TTG TTG CAA AAC AAG ACT TTA AAG CCT AAA TCC TTA ATC ATT AGC GCG 482 Leu Leu Gin Asn Lys Thr Leu Lys Pro Lys Ser Leu He He Ser Ala 145 150 155
AGT GGG GGG GCT TTC AGG GAC ACG CCT TTA GAA TTT ATT CCT ATT CAA 530 Ser Gly Gly Ala Phe Arg Asp Thr Pro Leu Glu Phe He Pro He Gin 160 165 170
AAC GCG CAA AAT GCG CTC AAG CAC CCT AAT TGG AGC ATG GGA TCT AAA 578 Asn Ala Gin Asn Ala Leu Lys His Pro Asn Trp Ser Met Gly Ser Lys 175 180 185
ATC ACC ATT GAT TCA GCG AGC ATG GTC AAT AAG CTT TTT GAA ATC CTA 626 He Thr He Asp Ser Ala Ser Met Val Asn Lys Leu Phe Glu He Leu 190 195 200 205
GAA ACT TAT TGG CTT TTT GGC GCG TCT TTA AAG ATT GAT GCG CTG ATT 674 Glu Thr Tyr Trp Leu Phe Gly Ala Ser Leu Lys He Asp Ala Leu He 210 215 220
GAA AGG AGT TCT ATC GTG CAT GCT TTG GTG GAG TTT GAA GAC AAC TCT 722 Glu Arg Ser Ser He Val His Ala Leu Val Glu Phe Glu Asp Asn Ser 225 230 235
ATC ATC GCG CAT TTA GCG AGC GCA GAT ATG CAA TTA CCC ATA AGC TAT 770 He He Ala His Leu Ala Ser Ala Asp Met Gin Leu Pro He Ser Tyr 240 245 250
GCG ATC GAT CCG AAG TTG GCC TCT TTG AGC GCG TCT ATC AAG CCC TTA 818 Ala He Asp Pro Lys Leu Ala Ser Leu Ser Ala Ser He Lys Pro Leu 255 260 265
GAT CTA TAC GCT TTA AGC GCG ATT AAA TTT GAA CCC ATT AGC ATG GAG 866 Asp Leu Tyr Ala Leu Ser Ala He Lys Phe Glu Pro He Ser Met Glu 270 275 280 285
CGC TAC ACT TTG TGG TGT TAT AAA GAC TTA CTG CTA GAA AAC CCT AAG 914 Arg Tyr Thr Leu Trp Cys Tyr Lys Asp Leu Leu Leu Glu Asn Pro Lys 290 295 300
CTT GGC GTG GTG CTG AAT GCG AGC AAT GAA GTG GCG ATG GAG AAG TTT 962 Leu Gly Val Val Leu Asn Ala Ser Asn Glu Val Ala Met Glu Lys Phe 305 310 315
TTA AAC AAA GAG ATC GCT TTT GGT GGC CTT ATC CAA ACC ATT TCT CAA 1010 Leu Asn Lys Glu He Ala Phe Gly Gly Leu He Gin Thr He Ser Gin 320 325 330
GCC TTA GAA TCA TAC GAT AAA ATG CCT TTC AAG CTC TCT AGT TTA GAA 1058 Ala Leu Glu Ser Tyr Asp Lys Met Pro Phe Lys Leu Ser Ser Leu Glu 335 340 345
GAA GTG CTG GAA TTA GAC AAA GAA GTT AGG GAG CGT TTT AAA AAT GTA 1106 Glu Val Leu Glu Leu Asp Lys Glu Val Arg Glu Arg Phe Lys Asn Val 350 355 360 365
GCG GGA GTG TAGTATAATA AGATTTTGCT TCTAATAGCG TTTTATTTCA AT 1157
Ala Gly Val
(2) INFORMATION FOR SEQ ID NO: 1234
;i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 368 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1234:
Met Val Val Leu Gly Ser Thr Gly Ser He Gly Lys Asn Ala Leu Lys
1 5 10 15
He Ala Lys Lys Phe Gly He Glu He Glu Ala Leu Ser Cys Gly Lys
20 25 30
Asn He Ala Leu He Asn Glu Gin He Gin Val Phe Lys Pro Lys Lys
35 40 45
Val Ala He Leu Asp Pro Ser Asp Leu Asn Asp Leu Glu Pro Leu Gly
50 55 60
Ala Glu Val Phe Val Gly Leu Glu Gly He Asp Ala Met He Glu Glu 65 70 75 80
Cys Thr Ser Asn Leu Val Leu Asn Ala He Val Gly Val Ala Gly Leu
85 90 95
Lys Ala Ser Phe Lys Ser Leu Gin Arg Asn Lys Lys Leu Ala Leu Ala
100 105 110
Asn Lys Glu Ser Leu Val Ser Ala Gly His Leu Leu Asp He Ser Gin
115 120 125
He Thr Pro He Asp Ser Glu His Phe Gly Leu Trp Ala Leu Leu Gin
130 135 140
Asn Lys Thr Leu Lys Pro Lys Ser Leu He He Ser Ala Ser Gly Gly 145 150 155 160
Ala Phe Arg Asp Thr Pro Leu Glu Phe He Pro He Gin Asn Ala Gin
165 170 175
Asn Ala Leu Lys His Pro Asn Trp Ser Met Gly Ser Lys He Thr He
180 185 190
Asp Ser Ala Ser Met Val Asn Lys Leu Phe Glu He Leu Glu Thr Tyr
195 200 205
Trp Leu Phe Gly Ala Ser Leu Lys He Asp Ala Leu He Glu Arg Ser
210 215 220
Ser He Val His Ala Leu Val Glu Phe Glu Asp Asn Ser He He Ala 225 230 235 240
His Leu Ala Ser Ala Asp Met Gin Leu Pro He Ser Tyr Ala He Asp
245 250 255
Pro Lys Leu Ala Ser Leu Ser Ala Ser He Lys Pro Leu Asp Leu Tyr
260 265 270
Ala Leu Ser Ala He Lys Phe Glu Pro He Ser Met Glu Arg Tyr Thr
275 280 285
Leu Trp Cys Tyr Lys Asp Leu Leu Leu Glu Asn Pro Lys Leu Gly Val
290 295 300
Val Leu Asn Ala Ser Asn Glu Val Ala Met Glu Lys Phe Leu Asn Lys 305 310 315 320
Glu He Ala Phe Gly Gly Leu He Gin Thr He Ser Gin Ala Leu Glu
325 330 335
Ser Tyr Asp Lys Met Pro Phe Lys Leu Ser Ser Leu Glu Glu Val Leu
340 345 350
Glu Leu Asp Lys Glu Val Arg Glu Arg Phe Lys Asn Val Ala Gly Val 355 360 365
(2) INFORMATION FOR SEQ ID NO: 1235:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1025 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...999 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1235:
AAAGAATATA AAGGAATCAA A ATG GCA AAA CAT GAT TTA GTG GGT TCG GTT 51
Met Ala Lys His Asp Leu Val Gly Ser Val 1 5 10
CTC TGG GAC GCA TAT TCT AAA GAA GTT CAA AGG CGC ATG GAC AAC CCC 99 Leu Trp Asp Ala Tyr Ser Lys Glu Val Gin Arg Arg Met Asp Asn Pro 15 20 25
ACG CAT TTA GGG GTC ATC ACC GAA GAG CAG GCT AAA GCC AAA AAC GCT 147 Thr His Leu Gly Val He Thr Glu Glu Gin Ala Lys Ala Lys Asn Ala 30 35 40
AAG CTC ATT GTG GCG GAT TAT GGC GCA GAG GCA TGC GGT GAT GCG GTG 195 Lys Leu He Val Ala Asp Tyr Gly Ala Glu Ala Cys Gly Asp Ala Val 45 50 55
AGG TTG TAT TGG CTT GTA GAT GAA AGC ACG GAT AGA ATT GTT GAC GCG 243 Arg Leu Tyr Trp Leu Val Asp Glu Ser Thr Asp Arg He Val Asp Ala 60 65 70
AAG TTT AAA AGC TTT GGT TGC GGA ACA GCG ATC GCA AGC TCA GAC ATG 291 Lys Phe Lys Ser Phe Gly Cys Gly Thr Ala He Ala Ser Ser Asp Met 75 80 85 90
ATG GTA GAG TTG TGC TTG AAT AAA AGA GTC CAA GAT GCG GTA AAA ATC 339 Met Val Glu Leu Cys Leu Asn Lys Arg Val Gin Asp Ala Val Lys He 95 100 105
ACG AAT TTA GAT GTG GAA AGA GGC TTG AGA GAC GAT CCG GAC ACG CCG 387 Thr Asn Leu Asp Val Glu Arg Gly Leu Arg Asp Asp Pro Asp Thr Pro 110 115 120
GCG GTG CCT GGG CAA AAA ATG CAC TGC TCG GTG ATG GCG TAT GAT GTG 435 Ala Val Pro Gly Gin Lys Met His Cys Ser Val Met Ala Tyr Asp Val 125 130 135
ATC AAA AAA GCT GCC GGC ATG TAT TTG GGG AAA AAC GCT GAA GAT TTT 483 He Lys Lys Ala Ala Gly Met Tyr Leu Gly Lys Asn Ala Glu Asp Phe 140 145 150 GAA GAA GAA ATC ATC GTG TGC GAG TGC GCT AGG GTG AGT TTA GGT ACG 531 Glu Glu Glu He He Val Cys Glu Cys Ala Arg Val Ser Leu Gly Thr 155 160 165 170
ATT AAA GAA GTG ATT AAG CTC AAT GAT TTA AAA AGC GTT GAA GAA ATC 579 He Lys Glu Val He Lys Leu Asn Asp Leu Lys Ser Val Glu Glu He 175 180 185
ACT AAC TAC ACC AAA GCC GGT GCT TTT TGT AAA AGC TGT GTG AGG CCT 627 Thr Asn Tyr Thr Lys Ala Gly Ala Phe Cys Lys Ser Cys Val Arg Pro 190 195 200
GGA GGG CAT GAA AAA AGG GAT TAT TAC TTG GTG GAT ATT CTT AAA GAA 675 Gly Gly His Glu Lys Arg Asp Tyr Tyr Leu Val Asp He Leu Lys Glu 205 210 215
GTG CGC GAA GAA ATG GAA GCT GAA AAA CTT AAA GCG ACC GCT AAT AAA 723 Val Arg Glu Glu Met Glu Ala Glu Lys Leu Lys Ala Thr Ala Asn Lys 220 225 230
TCC CAA AGC GGA GAA TTG GCT TTC AGG GAA ATG ACT ATG GTT CAA AAG 771 Ser Gin Ser Gly Glu Leu Ala Phe Arg Glu Met Thr Met Val Gin Lys 235 240 245 250
ATT AAA GCG GTG GAT AAA GTC ATT GAT GAA AAT ATC CGC CCG ATG CTT 819 He Lys Ala Val Asp Lys Val He Asp Glu Asn He Arg Pro Met Leu 255 260 265
ATG ATG GAT GGA GGG GAT TTA GAG ATT TTA GAC ATT AAA GAA AGC GAT 867 Met Met Asp Gly Gly Asp Leu Glu He Leu Asp He Lys Glu Ser Asp 270 275 280
GAT TAC ATT GAT GTG TAT ATC CGC TAC ATG GGG GCA TGT GAT GGG TGC 915 Asp Tyr He Asp Val Tyr He Arg Tyr Met Gly Ala Cys Asp Gly Cys 285 290 295
ATG AGC GCG ACT ACC GGG ACT TTA TTT GCC ATT GAA AAC GCT TTG CAG 963 Met Ser Ala Thr Thr Gly Thr Leu Phe Ala He Glu Asn Ala Leu Gin 300 305 310
GAA TTA TTG GAT CGC AGT ATC AGG GTG TTA CCG ATT TGAACTTTTT AGGGGG 1015 Glu Leu Leu Asp Arg Ser He Arg Val Leu Pro He 315 320 325
TGGAGGCCTT 1025
(2) INFORMATION FOR SEQ ID NO: 1236:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 326 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRTPTION: SEQ ID NO: 1236:
Met Ala Lys His Asp Leu Val Gly Ser Val Leu Trp Asp Ala Tyr Ser
1 5 10 15
Lys Glu Val Gin Arg Arg Met Asp Asn Pro Thr His Leu Gly Val He
20 25 30
Thr Glu Glu Gin Ala Lys Ala Lys Asn Ala Lys Leu He Val Ala Asp
35 40 45
Tyr Gly Ala Glu Ala Cys Gly Asp Ala Val Arg Leu Tyr Trp Leu Val
50 55 60
Asp Glu Ser Thr Asp Arg He Val Asp Ala Lys Phe Lys Ser Phe Gly 65 70 75 80
Cys Gly Thr Ala He Ala Ser Ser Asp Met Met Val Glu Leu Cys Leu
85 90 95
Asn Lys Arg Val Gin Asp Ala Val Lys He Thr Asn Leu Asp Val Glu
100 105 110
Arg Gly Leu Arg Asp Asp Pro Asp Thr Pro Ala Val Pro Gly Gin Lys
115 120 125
Met His Cys Ser Val Met Ala Tyr Asp Val He Lys Lys Ala Ala Gly
130 135 140
Met Tyr Leu Gly Lys Asn Ala Glu Asp Phe Glu Glu Glu He He Val 145 150 155 160
Cys Glu Cys Ala Arg Val Ser Leu Gly Thr He Lys Glu Val He Lys
165 170 175
Leu Asn Asp Leu Lys Ser Val Glu Glu He Thr Asn Tyr Thr Lys Ala
180 185 190
Gly Ala Phe Cys Lys Ser Cys Val Arg Pro Gly Gly His Glu Lys Arg
195 200 205
Asp Tyr Tyr Leu Val Asp He Leu Lys Glu Val Arg Glu Glu Met Glu
210 215 220
Ala Glu Lys Leu Lys Ala Thr Ala Asn Lys Ser Gin Ser Gly Glu Leu 225 230 235 240
Ala Phe Arg Glu Met Thr Met Val Gin Lys He Lys Ala Val Asp Lys
245 250 255
Val He Asp Glu Asn He Arg Pro Met Leu Met Met Asp Gly Gly Asp
260 265 270
Leu Glu He Leu Asp He Lys Glu Ser Asp Asp Tyr He Asp Val Tyr
275 280 285
He Arg Tyr Met Gly Ala Cys Asp Gly Cys Met Ser Ala Thr Thr Gly
290 295 300
Thr Leu Phe Ala He Glu Asn Ala Leu Gin Glu Leu Leu Asp Arg Ser 305 310 315 320
He Arg Val Leu Pro He 325
(2) INFORMATION FOR SEQ ID NO: 1237:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 414 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA ( ix) FEATURE :
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...375 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1237:
TAGAGCTTGA TTTTTTTA ATG TTA ATA ATG GAT TGG AAA TTA AAA GTG GTG 51
Met Leu He Met Asp Trp Lys Leu Lys Val Val 1 5 10
AAA GAA ATC ATC ACC ATT ACC GCC ACA ACC GCC ACA ATG GGG ATC TTA 99 Lys Glu He He Thr He Thr Ala Thr Thr Ala Thr Met Gly He Leu 15 20 25
ACC ACA TAT TCA TTA AAC ACT AAT ATG AGC ACC ATT AAA GAA AAG CCG 147 Thr Thr Tyr Ser Leu Asn Thr Asn Met Ser Thr He Lys Glu Lys Pro 30 35 40
GCA AAA AAA GTA GAA AGC CTT GTT TTA GCC CCG GAT TTT GCG TTA ATG 195 Ala Lys Lys Val Glu Ser Leu Val Leu Ala Pro Asp Phe Ala Leu Met 45 50 55
ATA GAC TGC CCC ACT AAA GCG CAC CCT GTC ATT CCC CCC AAA AGC CCT 243 He Asp Cys Pro Thr Lys Ala His Pro Val He Pro Pro Lys Ser Pro 60 65 70 75
GAG ATG ATA TTC CCC AAG CCT TGC GCT TTA GTT TCT TTA TTT TTA TCG 291 Glu Met He Phe Pro Lys Pro Cys Ala Leu Val Ser Leu Phe Leu Ser 80 85 90
CTC ACG CCG TCT TTT AAA ATC ACA TCT AAA GTT TTA GCC GTC AAT AAG 339 Leu Thr Pro Ser Phe Lys He Thr Ser Lys Val Leu Ala Val Asn Lys 95 100 105
CTT TCT ATC GTT CCC ACT AGT GCT AAA GAA AGA GCG TAAGGCAACA ACTCTA 391 Leu Ser He Val Pro Thr Ser Ala Lys Glu Arg Ala 110 115
TCATTATTTT AAAATCCAAA TTT 414
(2) INFORMATION FOR SEQ ID NO: 1238:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 119 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1238: Met Leu He Met Asp Trp Lys Leu Lys Val Val Lys Glu He He Thr
1 5 10 15
He Thr Ala Thr Thr Ala Thr Met Gly He Leu Thr Thr Tyr Ser Leu
20 25 30
Asn Thr Asn Met Ser Thr He Lys Glu Lys Pro Ala Lys Lys Val Glu
35 40 45
Ser Leu Val Leu Ala Pro Asp Phe Ala Leu Met He Asp Cys Pro Thr
50 55 60
Lys Ala His Pro Val He Pro Pro Lys Ser Pro Glu Met He Phe Pro 65 70 75 80
Lys Pro Cys Ala Leu Val Ser Leu Phe Leu Ser Leu Thr Pro Ser Phe
85 90 95
Lys He Thr Ser Lys Val Leu Ala Val Asn Lys Leu Ser He Val Pro
100 105 110
Thr Ser Ala Lys Glu Arg Ala 115
(2) INFORMATION FOR SEQ ID NO: 1239:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 686 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...660 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1239:
TATAAGGTTG CTCTC ATG AAA AAA CCC TAT AGG AAG ATT TCT GAT TAT GCG 51 Met Lys Lys Pro Tyr Arg Lys He Ser Asp Tyr Ala 1 5 10
ATC GTG GGT GGT TTG AGC GCG TTA GTG ATG GTG AGC ATT GTG GGG TGT 99 He Val Gly Gly Leu Ser Ala Leu Val Met Val Ser He Val Gly Cys 15 20 25
AAG AGC AAT GCT GAT GAC AAA CCA AAA GAG CAA AGC TCT TTA AGT CAA 147 Lys Ser Asn Ala Asp Asp Lys Pro Lys Glu Gin Ser Ser Leu Ser Gin 30 35 40
AGC GTT CAA AAA GGC GCG TTT GTG ATT TTA GAA GAG CAA AAG GAT AAA 195 Ser Val Gin Lys Gly Ala Phe Val He Leu Glu Glu Gin Lys Asp Lys 45 50 55 60
TCT TAC AAG GTT GTT GAA GAA TAC CCC AGC TCA AGA ACC CAC ATT ATA 243 Ser Tyr Lys Val Val Glu Glu Tyr Pro Ser Ser Arg Thr His He He 65 70 75 GTG CGC GAT TTG CAA GGC AAT GAA CGC GTG TTA AGC AAT GAA GAG ATT 291 Val Arg Asp Leu Gin Gly Asn Glu Arg Val Leu Ser Asn Glu Glu He 80 85 90
CAA AAG CTC ATC AAA GAA GAA GAA GCT AAA ATT GAT AAC GGC ACG AGC 339 Gin Lys Leu He Lys Glu Glu Glu Ala Lys He Asp Asn Gly Thr Ser 95 100 105
AAG CTT GTC CAG CCT AAT AAT GGA GGG AGT AAT GAA GGC TCA GGC TTT 387 Lys Leu Val Gin Pro Asn Asn Gly Gly Ser Asn Glu Gly Ser Gly Phe 110 115 120
GGC TTG GGG AGC GCG ATT TTA GGG AGC GCG GCG GGG GCG ATT TTA GGG 435 Gly Leu Gly Ser Ala He Leu Gly Ser Ala Ala Gly Ala He Leu Gly 125 130 135 140
AGT TAT ATT GGT AAT AAG CTT TTC AAT AAC CCT AAT TAC CAG CAA AAC 483 Ser Tyr He Gly Asn Lys Leu Phe Asn Asn Pro Asn Tyr Gin Gin Asn 145 150 155
GCC CAA CGG ACC TAC AAA TCC CCA CAA GCT TAC CAA CGC TCT CAA AAT 531 Ala Gin Arg Thr Tyr Lys Ser Pro Gin Ala Tyr Gin Arg Ser Gin Asn 160 165 170
TCC TTT TCT AAA AGT GCG CCC AGT GCT TCA AGC ATG GGC GGA GCG AGT 579 Ser Phe Ser Lys Ser Ala Pro Ser Ala Ser Ser Met Gly Gly Ala Ser 175 180 185
AAG GGA CAG AGC GGG TTT TTT GGC TCT AGT AGG CCT ACT AGT TCA CCG 627 Lys Gly Gin Ser Gly Phe Phe Gly Ser Ser Arg Pro Thr Ser Ser Pro 190 195 200
GCG GTA AGC TCT GGG ACA AGG GGC TTT AAC TCA TAATTTAATT GATTCAAGGC 680 Ala Val Ser Ser Gly Thr Arg Gly Phe Asn Ser 205 210 215
TAAAAA 686
(2) INFORMATION FOR SEQ ID NO: 1240:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 215 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1240:
Met Lys Lys Pro Tyr Arg Lys He Ser Asp Tyr Ala He Val Gly Gly
1 5 10 15
Leu Ser Ala Leu Val Met Val Ser He Val Gly Cys Lys Ser Asn Ala
20 25 30
Asp Asp Lys Pro Lys Glu Gin Ser Ser Leu Ser Gin Ser Val Gin Lys 35 40 45
Gly Ala Phe Val He Leu Glu Glu Gin Lys Asp Lys Ser Tyr Lys Val
50 55 60
Val Glu Glu Tyr Pro Ser Ser Arg Thr His He He Val Arg Asp Leu 65 70 75 80
Gin Gly Asn Glu Arg Val Leu Ser Asn Glu Glu He Gin Lys Leu He
85 90 95
Lys Glu Glu Glu Ala Lys He Asp Asn Gly Thr Ser Lys Leu Val Gin
100 105 110
Pro Asn Asn Gly Gly Ser Asn Glu Gly Ser Gly Phe Gly Leu Gly Ser
115 120 125
Ala He Leu Gly Ser Ala Ala Gly Ala He Leu Gly Ser Tyr He Gly
130 135 140
Asn Lys Leu Phe Asn Asn Pro Asn Tyr Gin Gin Asn Ala Gin Arg Thr 145 150 155 160
Tyr Lys Ser Pro Gin Ala Tyr Gin Arg Ser Gin Asn Ser Phe Ser Lys
165 170 175
Ser Ala Pro Ser Ala Ser Ser Met Gly Gly Ala Ser Lys Gly Gin Ser
180 185 190
Gly Phe Phe Gly Ser Ser Arg Pro Thr Ser Ser Pro Ala Val Ser Ser
195 200 205
Gly Thr Arg Gly Phe Asn Ser 210 215
(2) INFORMATION FOR SEQ ID NO: 1241:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1407 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...1362 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1241:
TTATAGGACT TTTTA ATG GAG TTA GAA ACT CAT TTG TCA AAA TAT TTC ACC 51 Met Glu Leu Glu Thr His Leu Ser Lys Tyr Phe Thr 1 5 10
CTA GCC TTT ACG CAT AAA AGC ATG AGC TTA GAA ATG CGA GAA AAA CTC 99 Leu Ala Phe Thr His Lys Ser Met Ser Leu Glu Met Arg Glu Lys Leu 15 20 25
GCT ATT AAT TCG AAT GCA ACG CTT AAA GAA TTT TTA CAA ACC ATT AAA 147 Ala He Asn Ser Asn Ala Thr Leu Lys Glu Phe Leu Gin Thr He Lys 30 35 40 AAC CAT TGC CCT AAC ATC AAA GAG TGC ATG GTG TTA TCC ACA TGC AAT 195 Asn His Cys Pro Asn He Lys Glu Cys Met Val Leu Ser Thr Cys Asn 45 50 55 60
CGC TTT GAA ATC TAT GCG AGC CTA AAA CAC GGC GCT AAT ACT AAT GAA 243 Arg Phe Glu He Tyr Ala Ser Leu Lys His Gly Ala Asn Thr Asn Glu 65 70 75
CAA AAA AAC GCA CTA TTA AAG ATT TTG GCT CAA AAT AAA AAA ATG AGC 291 Gin Lys Asn Ala Leu Leu Lys He Leu Ala Gin Asn Lys Lys Met Ser 80 85 90
GTG TCT GAT TTA GAA AAA TGC GTT TTA ATG AAC ACT GAT GAA AGC GCA 339 Val Ser Asp Leu Glu Lys Cys Val Leu Met Asn Thr Asp Glu Ser Ala 95 100 105
GTC CAT CAT GTC TTT AGC GTG TGC AGC AGT TTG GAT AGC TTG GTG GTT 387 Val His His Val Phe Ser Val Cys Ser Ser Leu Asp Ser Leu Val Val 110 115 120
GGG GAA ACT CAA ATC ACA GGG CAG ATG AAA AAC GCT TAT AAA TTC GCT 435 Gly Glu Thr Gin He Thr Gly Gin Met Lys Asn Ala Tyr Lys Phe Ala 125 130 135 140
TTT GAA GAG AAA TTT TGC TCT AAA GAT TTA ACC CGA TTG CTC CAT TTT 483 Phe Glu Glu Lys Phe Cys Ser Lys Asp Leu Thr Arg Leu Leu His Phe 145 150 155
GCT TTC AAA TGC GCC GCT AAA GTG CGC AAT TTA ACC GGC ATT TCC AAG 531 Ala Phe Lys Cys Ala Ala Lys Val Arg Asn Leu Thr Gly He Ser Lys 160 165 170
CAA GGG GTT TCC ATC TCT TCA GTG GCG GTC AAA GAA GCG CTT AAT ATT 579 Gin Gly Val Ser He Ser Ser Val Ala Val Lys Glu Ala Leu Asn He 175 180 185
TTT GAA AAA GAA AGG ATT AAG GAT AAA AAA GCC CTT GTG ATA GGG CTT 627 Phe Glu Lys Glu Arg He Lys Asp Lys Lys Ala Leu Val He Gly Leu 190 195 200
GGC GAG ATG GCT CAA TTA GTC ATC AAG CAC CTT TTA AAC AAG CAA TTT 675 Gly Glu Met Ala Gin Leu Val He Lys His Leu Leu Asn Lys Gin Phe 205 210 215 220
GAA GCG CTT ATC TTA GGG CGT AAT GCG GCT AAA TTT GAA GAT TTC ATC 723 Glu Ala Leu He Leu Gly Arg Asn Ala Ala Lys Phe Glu Asp Phe He 225 230 235
AAA GAA TTA GAA GAA CCT AAA AAA GTA AGC TTT CAA AAT ATA GAA AAT 771 Lys Glu Leu Glu Glu Pro Lys Lys Val Ser Phe Gin Asn He Glu Asn 240 245 250
TTA AAC GCT TAT ATC AAT GAA TAC GAA CTG CTT TTT TGC GCC ACT TCT 819 Leu Asn Ala Tyr He Asn Glu Tyr Glu Leu Leu Phe Cys Ala Thr Ser 255 260 265 TCG CCG CAT TTT ATC GTG CAA AAT CGC ATG TTA AAA GAA ACG ATT TTC 867 Ser Pro His Phe He Val Gin Asn Arg Met Leu Lys Glu Thr He Phe 270 275 280
AGG CGT TTT TGG TTT GAT TTA GCC GTG CCA CGG AAT ATT GAA AAG CCG 915 Arg Arg Phe Trp Phe Asp Leu Ala Val Pro Arg Asn He Glu Lys Pro 285 290 295 300
GTA TTG GAT AAT ATT TTC TTA TAC AGC GTT GAT GAT TTA GAG CCT ATG 963 Val Leu Asp Asn He Phe Leu Tyr Ser Val Asp Asp Leu Glu Pro Met 305 310 315
GTG AGA GAA AAT GTG GAA AAC AGG CAA GAG AGC AGA ATG AGA GCT TAT 1011 Val Arg Glu Asn Val Glu Asn Arg Gin Glu Ser Arg Met Arg Ala Tyr 320 325 330
GAG ATT GTA GGG CTT GCC ACA ATG GAG TTT TAC CAA TGG ATT CAA AGT 1059 Glu He Val Gly Leu Ala Thr Met Glu Phe Tyr Gin Trp He Gin Ser 335 340 345
TTA GAA GTA GAG CCT GTG ATT AAG GAT TTA AGG GAA TTG GCT AGG ATT 1107 Leu Glu Val Glu Pro Val He Lys Asp Leu Arg Glu Leu Ala Arg He 350 355 360
TCA GCC CAA AAA GAA TTG CAA AAA GCG CTT AAA AAA CGC TAT GTG CCT 1155 Ser Ala Gin Lys Glu Leu Gin Lys Ala Leu Lys Lys Arg Tyr Val Pro 365 370 375 380
AAA GAA TAC GAA AAC AAC ATT GAA AAG ATC TTG CAC AAC GCT TTC AAC 1203 Lys Glu Tyr Glu Asn Asn He Glu Lys He Leu His Asn Ala Phe Asn 385 390 395
ACT TTT TTG CAT AAC CCT ACC ATC GCC TTA AAA AAG AAC GCT CAA AAA 1251 Thr Phe Leu His Asn Pro Thr He Ala Leu Lys Lys Asn Ala Gin Lys 400 405 410
GAA GAA TCC GAT GTG CTT GTG GGT GCG ATT AAA AAC TTG TTT AAT TTA 1299 Glu Glu Ser Asp Val Leu Val Gly Ala He Lys Asn Leu Phe Asn Leu 415 420 425
GAC AAA TCT AAC GCT AAC CAT GCC CAG AAT TTG AAT CTC TAT AAA TGC 1347 Asp Lys Ser Asn Ala Asn His Ala Gin Asn Leu Asn Leu Tyr Lys Cys 430 435 440
GAA TAT TAC GAG GAA TAATGCATGC TATTTTCAAA ACTCTTTGCC CCCACTCTCA A 1403
Glu Tyr Tyr Glu Glu
445
AGAA 1407
(2) INFORMATION FOR SEQ ID NO: 1242:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 449 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1242:
Met Glu Leu Glu Thr His Leu Ser Lys Tyr Phe Thr Leu Ala Phe Thr
1 5 10 15
His Lys Ser Met Ser Leu Glu Met Arg Glu Lys Leu Ala He Asn Ser
20 25 30
Asn Ala Thr Leu Lys Glu Phe Leu Gin Thr He Lys Asn His Cys Pro
35 40 45
Asn He Lys Glu Cys Met Val Leu Ser Thr Cys Asn Arg Phe Glu He
50 55 60
Tyr Ala Ser Leu Lys His Gly Ala Asn Thr Asn Glu Gin Lys Asn Ala 65 70 75 80
Leu Leu Lys He Leu Ala Gin Asn Lys Lys Met Ser Val Ser Asp Leu
85 90 95
Glu Lys Cys Val Leu Met Asn Thr Asp Glu Ser Ala Val His His Val
100 105 110
Phe Ser Val Cys Ser Ser Leu Asp Ser Leu Val Val Gly Glu Thr Gin
115 120 125
He Thr Gly Gin Met Lys Asn Ala Tyr Lys Phe Ala Phe Glu Glu Lys
130 135 140
Phe Cys Ser Lys Asp Leu Thr Arg Leu Leu His Phe Ala Phe Lys Cys 145 150 155 160
Ala Ala Lys Val Arg Asn Leu Thr Gly He Ser Lys Gin Gly Val Ser
165 170 175
He Ser Ser Val Ala Val Lys Glu Ala Leu Asn He Phe Glu Lys Glu
180 185 190
Arg He Lys Asp Lys Lys Ala Leu Val He Gly Leu Gly Glu Met Ala
195 200 205
Gin Leu Val He Lys His Leu Leu Asn Lys Gin Phe Glu Ala Leu He
210 215 220
Leu Gly Arg Asn Ala Ala Lys Phe Glu Asp Phe He Lys Glu Leu Glu 225 230 235 240
Glu Pro Lys Lys Val Ser Phe Gin Asn He Glu Asn Leu Asn Ala Tyr
245 250 255
He Asn Glu Tyr Glu Leu Leu Phe Cys Ala Thr Ser Ser Pro His Phe
260 265 270
He Val Gin Asn Arg Met Leu Lys Glu Thr He Phe Arg Arg Phe Trp
275 280 285
Phe Asp Leu Ala Val Pro Arg Asn He Glu Lys Pro Val Leu Asp Asn
290 295 300
He Phe Leu Tyr Ser Val Asp Asp Leu Glu Pro Met Val Arg Glu Asn 305 310 315 320
Val Glu Asn Arg Gin Glu Ser Arg Met Arg Ala Tyr Glu He Val Gly
325 330 335
Leu Ala Thr Met Glu Phe Tyr Gin Trp He Gin Ser Leu Glu Val Glu
340 345 350
Pro Val He Lys Asp Leu Arg Glu Leu Ala Arg He Ser Ala Gin Lys
355 360 365
Glu Leu Gin Lys Ala Leu Lys Lys Arg Tyr Val Pro Lys Glu Tyr Glu 370 375 380 Asn Asn He Glu Lys He Leu His Asn Ala Phe Asn Thr Phe Leu His 385 390 395 400
Asn Pro Thr He Ala Leu Lys Lys Asn Ala Gin Lys Glu Glu Ser Asp
405 410 415
Val Leu Val Gly Ala He Lys Asn Leu Phe Asn Leu Asp Lys Ser Asn
420 425 430
Ala Asn His Ala Gin Asn Leu Asn Leu Tyr Lys Cys Glu Tyr Tyr Glu
435 440 445
Glu
(2) INFORMATION FOR SEQ ID NO: 1243
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1202 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 40...1125 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1243:
AAAACCCAAA CGCCGTTAAA ATATTTAAAA AGGAAATTC ATG CCC ATT GAT TTG 54
Met Pro He Asp Leu 1 5
AAC GAA CAT TTA AAA AAG AAA AAT TCT CAA AGA GAA ACC CCC ACG CCT 102 Asn Glu His Leu Lys Lys Lys Asn Ser Gin Arg Glu Thr Pro Thr Pro 10 15 20
AAT ACG CCT AAT AAT GGG GGG CGT TTC ATC CCG CCG TCT AAT TCT TTT 150 Asn Thr Pro Asn Asn Gly Gly Arg Phe He Pro Pro Ser Asn Ser Phe 25 30 35
AAT TCT AAA AAA CTA TCG GTT TTA ATT GTC ATT GTC CTT TTA GGC GTT 198 Asn Ser Lys Lys Leu Ser Val Leu He Val He Val Leu Leu Gly Val 40 45 50
ATC GCT TTT TTG GCC AAG CCT TTT GAA GTG ATT AGC TCA GGA GAA ATT 246 He Ala Phe Leu Ala Lys Pro Phe Glu Val He Ser Ser Gly Glu He 55 60 65
GGC ATT AAA ATC ACC GCC GGG AAA TAC GAA CCC ACC CCC TTA CAG CCA 294 Gly He Lys He Thr Ala Gly Lys Tyr Glu Pro Thr Pro Leu Gin Pro 70 75 80 85
GGG ATC CAC TTC TTT GTG CCT ATC ATT CAA GAC ATT CTC ATT GTG GAT 342 Gly He His Phe Phe Val Pro He He Gin Asp He Leu He Val Asp 90 95 100
ACA AGG ATT AGG AAT ATC AAT TTT TCA CGC ACC GAA GAC ATG GGC GTG 390 Thr Arg He Arg Asn He Asn Phe Ser Arg Thr Glu Asp Met Gly Val 105 110 115
GCG GGT AAA AAC CAA GGG ATT TTT AGA AAC GAC GCT ATT AAT GTG ATG 438 Ala Gly Lys Asn Gin Gly He Phe Arg Asn Asp Ala He Asn Val Met 120 125 130
GAT AGT AGG GGT TTG ACC GTT TCT ATT GAA CTC ACC GTG CAA TAC CGC 486 Asp Ser Arg Gly Leu Thr Val Ser He Glu Leu Thr Val Gin Tyr Arg 135 140 145
TTA AAC CCC CAA ACC ACC CCC CAA ACG ATC GCT ACT TAT GGC TTG TCT 534 Leu Asn Pro Gin Thr Thr Pro Gin Thr He Ala Thr Tyr Gly Leu Ser 150 155 160 165
TGG GAG CAA AAA ATC ATC AAC CCT GTG GTG CGC GAT GTG GTG CGC TCT 582 Trp Glu Gin Lys He He Asn Pro Val Val Arg Asp Val Val Arg Ser 170 175 180
GTC GTG GGG CGC TAT CCG GCT GAA GAT TTA CCC ATT AAG CGC AAT GAA 630 Val Val Gly Arg Tyr Pro Ala Glu Asp Leu Pro He Lys Arg Asn Glu 185 190 195
ATC GCC GCT CTT ATT AAT AGC GGT ATC AAT AAA GAA GTT TCT AAG CTC 678 He Ala Ala Leu He Asn Ser Gly He Asn Lys Glu Val Ser Lys Leu 200 205 210
CCT AAC ACC CCT GTG GAA TTA AGC TCT ATC CAA TTG AGA GAA ATC GTC 726 Pro Asn Thr Pro Val Glu Leu Ser Ser He Gin Leu Arg Glu He Val 215 220 225
TTG CCC GCT AAG ATT AAA GAG CAA ATA GAA AAA GTC CAA ATC GCG CGC 774 Leu Pro Ala Lys He Lys Glu Gin He Glu Lys Val Gin He Ala Arg 230 235 240 245
CAA GAA TCA GAA AGG GTG AAA TAC GAG GTG GAG CGC TCC AAG CAA GAA 822 Gin Glu Ser Glu Arg Val Lys Tyr Glu Val Glu Arg Ser Lys Gin Glu 250 255 260
GCT CAA AAA CAA GCC GCT CTG GCT AAA GGG GAA GCG GAC GCT AAC AGG 870 Ala Gin Lys Gin Ala Ala Leu Ala Lys Gly Glu Ala Asp Ala Asn Arg 265 270 275
ATT AAG GCT CAG GGC GTG GCT GAT GCG ATT GTG ATT GAG GCT AAG GCA 918 He Lys Ala Gin Gly Val Ala Asp Ala He Val He Glu Ala Lys Ala 280 285 290
AAA TCT CAA GCT AAT TTA AGC ATT TCG CAA AGC TTG AGC GAC AAG CTT 966 Lys Ser Gin Ala Asn Leu Ser He Ser Gin Ser Leu Ser Asp Lys Leu 295 300 305 TTA AGA CTG CGC CAA ATT GAA GTT CAA GGC CAG TTT AAT GAA GCG TTA 1014 Leu Arg Leu Arg Gin He Glu Val Gin Gly Gin Phe Asn Glu Ala Leu 310 315 320 325
AAA ACG AAC AAT AAC GCT CAA ATC ATG CTC ACT CCA GGT GGG GCT GTG 1062 Lys Thr Asn Asn Asn Ala Gin He Met Leu Thr Pro Gly Gly Ala Val 330 335 340
CCT AAT ATT TGG ATT GAC ACT AAA AGC AAG GTT AAA TCT AGT ATT GCC 1110 Pro Asn He Trp He Asp Thr Lys Ser Lys Val Lys Ser Ser He Ala 345 350 355
GAG ACT AAA GAG CCT TAAAAACGCA TGGCATCTCT TGCCTTTATC CAAGCTTTTT T 1166 Glu Thr Lys Glu Pro 360
GGAGTCTTTT AAGGGATTTT TAAGTCAAGC GACTCT 1202
(2) INFORMATION FOR SEQ ID NO: 1244:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 362 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1244:
Met Pro He Asp Leu Asn Glu His Leu Lys Lys Lys Asn Ser Gin Arg
1 5 10 15
Glu Thr Pro Thr Pro Asn Thr Pro Asn Asn Gly Gly Arg Phe He Pro
20 25 30
Pro Ser Asn Ser Phe Asn Ser Lys Lys Leu Ser Val Leu He Val He
35 40 45
Val Leu Leu Gly Val He Ala Phe Leu Ala Lys Pro Phe Glu Val He
50 55 60
Ser Ser Gly Glu He Gly He Lys He Thr Ala Gly Lys Tyr Glu Pro 65 70 75 80
Thr Pro Leu Gin Pro Gly He His Phe Phe Val Pro He He Gin Asp
85 90 95
He Leu He Val Asp Thr Arg He Arg Asn He Asn Phe Ser Arg Thr
100 105 110
Glu Asp Met Gly Val Ala Gly Lys Asn Gin Gly He Phe Arg Asn Asp
115 120 125
Ala He Asn Val Met Asp Ser Arg Gly Leu Thr Val Ser He Glu Leu
130 135 140
Thr Val Gin Tyr Arg Leu Asn Pro Gin Thr Thr Pro Gin Thr He Ala 145 150 155 160
Thr Tyr Gly Leu Ser Trp Glu Gin Lys He He Asn Pro Val Val Arg
165 170 175
Asp Val Val Arg Ser Val Val Gly Arg Tyr Pro Ala Glu Asp Leu Pro
180 185 190
He Lys Arg Asn Glu He Ala Ala Leu He Asn Ser Gly He Asn Lys 195 200 205
Glu Val Ser Lys Leu Pro Asn Thr Pro Val Glu Leu Ser Ser He Gin
210 215 220
Leu Arg Glu He Val Leu Pro Ala Lys He Lys Glu Gin He Glu Lys 225 230 235 240
Val Gin He Ala Arg Gin Glu Ser Glu Arg Val Lys Tyr Glu Val Glu
245 250 255
Arg Ser Lys Gin Glu Ala Gin Lys Gin Ala Ala Leu Ala Lys Gly Glu
260 265 270
Ala Asp Ala Asn Arg He Lys Ala Gin Gly Val Ala Asp Ala He Val
275 280 285
He Glu Ala Lys Ala Lys Ser Gin Ala Asn Leu Ser He Ser Gin Ser
290 295 300
Leu Ser Asp Lys Leu Leu Arg Leu Arg Gin He Glu Val Gin Gly Gin 305 310 315 320
Phe Asn Glu Ala Leu Lys Thr Asn Asn Asn Ala Gin He Met Leu Thr
325 330 335
Pro Gly Gly Ala Val Pro Asn He Trp He Asp Thr Lys Ser Lys Val
340 345 350
Lys Ser Ser He Ala Glu Thr Lys Glu Pro 355 360
(2) INFORMATION FOR SEQ ID NO: 1245:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 630 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 23...559 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1245:
GACTAAAGAG CCTTAAAAAC GC ATG GCA TCT CTT GCC TTT ATC CAA GCT TTT 52
Met Ala Ser Leu Ala Phe He Gin Ala Phe 1 5 10
TTG GAG TCT TTT AAG GGA TTT TTA AGT CAA GCG ACT CTA ATC AGC GTT 100 Leu Glu Ser Phe Lys Gly Phe Leu Ser Gin Ala Thr Leu He Ser Val 15 20 25
TTA ATA GCG AGC GTT TTA ATC CTT TTT TGC GCG ATT TTG CTC CTT TTG 148 Leu He Ala Ser Val Leu He Leu Phe Cys Ala He Leu Leu Leu Leu 30 35 40
GCT CTG CTT TTG AGA AAC CGC TTA GCT AGC TAT ATA GCA ACA GCA GCT 196 Ala Leu Leu Leu Arg Asn Arg Leu Ala Ser Tyr He Ala Thr Ala Ala 45 50 55
TTT TTG GGT GCG TTT TTA AGC ATG CCT TTT GTT TTG AAC ATT TTA CTC 244 Phe Leu Gly Ala Phe Leu Ser Met Pro Phe Val Leu Asn He Leu Leu 60 65 70
ACT CAA GCG ATT TAC CCC ATA GAA ACA CGC ATC TTA CAC GCT AAC CCT 292 Thr Gin Ala He Tyr Pro He Glu Thr Arg He Leu His Ala Asn Pro 75 80 85 90
TTA AGT TAC AGC AAC GCC TTT TCT TTG CAA GTG GGA GTC AAA AAC CAT 340 Leu Ser Tyr Ser Asn Ala Phe Ser Leu Gin Val Gly Val Lys Asn His 95 100 105
TCC AAA TTT ACT CTA AAC AAA TGC GTT TTA CGC CTA GAA GTG CTT AAA 388 Ser Lys Phe Thr Leu Asn Lys Cys Val Leu Arg Leu Glu Val Leu Lys 110 115 120
AAC CCT CAC AAT TTT GTA GAA GAG CAT GCT TTT AAA TGG TTT GTC AAA 436 Asn Pro His Asn Phe Val Glu Glu His Ala Phe Lys Trp Phe Val Lys 125 130 135
AAA AGC TAT GAA AAA ATT TTT AAA GAA AAG ATT TTG CCC AAA GAA TCT 484 Lys Ser Tyr Glu Lys He Phe Lys Glu Lys He Leu Pro Lys Glu Ser 140 145 150
AAG GTC TTT TCA TTC TTT ATT GAC AAC TAC CCT TAT TCA AAA ACG GCC 532 Lys Val Phe Ser Phe Phe He Asp Asn Tyr Pro Tyr Ser Lys Thr Ala 155 160 165 170
CCT TAT CAA GTT TCT TTG TTT TGT TTA TAAAAAACTA AAAGATAACG CCCAAGA 586 Pro Tyr Gin Val Ser Leu Phe Cys Leu 175
TAACATTCAT TAAAAAGCGA TTAAAAACGC TTAAAGGCAT AGAT 630
(2) INFORMATION FOR SEQ ID NO: 1246:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 179 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1246:
Met Ala Ser Leu Ala Phe He Gin Ala Phe Leu Glu Ser Phe Lys Gly
1 5 10 15
Phe Leu Ser Gin Ala Thr Leu He Ser Val Leu He Ala Ser Val Leu
20 25 30
He Leu Phe Cys Ala He Leu Leu Leu Leu Ala Leu Leu Leu Arg Asn
35 40 45
Arg Leu Ala Ser Tyr He Ala Thr Ala Ala Phe Leu Gly Ala Phe Leu 50 55 60
Ser Met Pro Phe Val Leu Asn He Leu Leu Thr Gin Ala He Tyr Pro 65 70 75 80
He Glu Thr Arg He Leu His Ala Asn Pro Leu Ser Tyr Ser Asn Ala
85 90 95
Phe Ser Leu Gin Val Gly Val Lys Asn His Ser Lys Phe Thr Leu Asn
100 105 110
Lys Cys Val Leu Arg Leu Glu Val Leu Lys Asn Pro His Asn Phe Val
115 120 125
Glu Glu His Ala Phe Lys Trp Phe Val Lys Lys Ser Tyr Glu Lys He
130 135 140
Phe Lys Glu Lys He Leu Pro Lys Glu Ser Lys Val Phe Ser Phe Phe 145 150 155 160
He Asp Asn Tyr Pro Tyr Ser Lys Thr Ala Pro Tyr Gin Val Ser Leu
165 170 175
Phe Cys Leu
(2) INFORMATION FOR SEQ ID NO: 1247
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1350 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 47...1273 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1247:
AAAGAATGAT CTTAAAAGGG CAAACCACAT TTATTAAGGA GAATGC ATG CAA GAA 55
Met Gin Glu 1
ATC ATA GGA GCG TCT TTA GTT TTT TTG TGC AAT GAA AAG TGC GAA GTG 103 He He Gly Ala Ser Leu Val Phe Leu Cys Asn Glu Lys Cys Glu Val 5 10 15
TTA GAA GAT TAT GGC GTA GTC TTT GAT GAA AAG ATT GTT GAA ATA GGC 151 Leu Glu Asp Tyr Gly Val Val Phe Asp Glu Lys He Val Glu He Gly 20 25 30 35
GAT TAT CAA AGT TTA ACG CTT AAA TAC CCT CAC TTA AAG GCG CAG TTT 199 Asp Tyr Gin Ser Leu Thr Leu Lys Tyr Pro His Leu Lys Ala Gin Phe 40 45 50
TTT GAA AAT TCC GTT CTG TTG CCC GCT TTT ATC AAC GCG CAC ACC CAT 247 Phe Glu Asn Ser Val Leu Leu Pro Ala Phe He Asn Ala His Thr His 55 60 65
TTT GAA TTT TCC AAC AAC AAG GCG AGT TTT GAT TAC GGG AGT TTT TCT 295 Phe Glu Phe Ser Asn Asn Lys Ala Ser Phe Asp Tyr Gly Ser Phe Ser 70 75 80
GGC TGG TTA GGG AGC GTG TTA AAC AAT GGG GGG GCG ATT TTA GAA AAT 343 Gly Trp Leu Gly Ser Val Leu Asn Asn Gly Gly Ala He Leu Glu Asn 85 90 95
TGC CAA GGG GCT ATT CAA AAC GCT ATC AGC ACG CAA TTA AAA AGC GGG 391 Cys Gin Gly Ala He Gin Asn Ala He Ser Thr Gin Leu Lys Ser Gly 100 105 110 115
GTG GGG AGC GTG GGA GCG ATT TCT AAC CAC CTG ATA GAA GTT AAT TTG 439 Val Gly Ser Val Gly Ala He Ser Asn His Leu He Glu Val Asn Leu 120 125 130
TTA AAA GAA AGC CCT TTG AAT GCT GTC GTG TTT TTA GAG TTT TTA GGG 487 Leu Lys Glu Ser Pro Leu Asn Ala Val Val Phe Leu Glu Phe Leu Gly 135 140 145
AGC AGT TAT TCT TTA GAA AAA TTA AAA GCG TTT GAG GCC AAA TTT AAG 535 Ser Ser Tyr Ser Leu Glu Lys Leu Lys Ala Phe Glu Ala Lys Phe Lys 150 155 160
GAA TTA AAA GAT TTA GAA GAT AAA AAA CTT AAA GCG GCT CTC GCT GTG 583 Glu Leu Lys Asp Leu Glu Asp Lys Lys Leu Lys Ala Ala Leu Ala Val 165 170 175
CAT GCC CCT TAT TCG GTC CAA AAA GAC ATG GCT TTG AGC GTC ATC CAA 631 His Ala Pro Tyr Ser Val Gin Lys Asp Met Ala Leu Ser Val He Gin 180 185 190 195
TTA GCC AAA GAT TCA CAA AGC CTG CTT TCT ACG CAT TTT TTA GAA TCG 679 Leu Ala Lys Asp Ser Gin Ser Leu Leu Ser Thr His Phe Leu Glu Ser 200 205 210
CTT GAA GAA TTA GAA TGG GTA GAA AAC TCT AAA GGG TGG TTT GAA AAT 727 Leu Glu Glu Leu Glu Trp Val Glu Asn Ser Lys Gly Trp Phe Glu Asn 215 220 225
TTT TAC CAG CAT TTT TTA AAG GAG TCT CAT TTC AAA TCG CTC TAT AAG 775 Phe Tyr Gin His Phe Leu Lys Glu Ser His Phe Lys Ser Leu Tyr Lys 230 235 240
GGC GCG AAC GAT TAC ATT GAC ATG TTT AAA GAC ACG CAC ACT TTA TTC 823 Gly Ala Asn Asp Tyr He Asp Met Phe Lys Asp Thr His Thr Leu Phe 245 250 255
GTG CAT AAC CAG TTC GCT TCT TTA GAA GCG TTA AAA AGG ATT AAA TCT 871 Val His Asn Gin Phe Ala Ser Leu Glu Ala Leu Lys Arg He Lys Ser 260 265 270 275
CAA GTC AAA AAC GCT TTT TTA ATC ACA TGC CCC TTT TCT AAC CGC CTA 919 Gin Val Lys Asn Ala Phe Leu He Thr Cys Pro Phe Ser Asn Arg Leu 280 285 290
TTG AGC GGG CAA GCG TTG GAT TTA GAA AGA ACT AAA GAA GCC GGT TTG 967 Leu Ser Gly Gin Ala Leu Asp Leu Glu Arg Thr Lys Glu Ala Gly Leu 295 300 305
AGC GTG AGC GTG GCC ACT GAT GGC TTG AGT TCT AAC ATT TCG CTG AGC 1015 Ser Val Ser Val Ala Thr Asp Gly Leu Ser Ser Asn He Ser Leu Ser 310 315 320
CTT TTA GAC GAA TTA AGA GCG TTT TTG CTC ACC CAT AAC ATG CCG TTA 1063 Leu Leu Asp Glu Leu Arg Ala Phe Leu Leu Thr His Asn Met Pro Leu 325 330 335
TTA GAA TTA GCT AAA ATA GCC CTT TTA GGG GCG ACT AGG CAT GGG GCT 1111 Leu Glu Leu Ala Lys He Ala Leu Leu Gly Ala Thr Arg His Gly Ala 340 345 350 355
AAA GCT TTA GCT TTG AAT AAT GGC GAG ATA GAA GCC AAC AAA AGG GCG 1159 Lys Ala Leu Ala Leu Asn Asn Gly Glu He Glu Ala Asn Lys Arg Ala 360 365 370
GAT TTG AGC GTG TTT GGT TTT AAT GAA AAA TTC ACT AAA GAG CAA GCG 1207 Asp Leu Ser Val Phe Gly Phe Asn Glu Lys Phe Thr Lys Glu Gin Ala 375 380 385
ATT TTG CAA TTT TTA TTG CAT GCT AAA GAA GTG GAG TGC TTG TTT TTA 1255 He Leu Gin Phe Leu Leu His Ala Lys Glu Val Glu Cys Leu Phe Leu 390 395 400
GGG GGG AAA AGG GTG ATC TAATTTGTTT TAAAGACAGA ATGCGTTAAA ATGAGAAA 1311 Gly Gly Lys Arg Val He 405
TCTAAATCAA TTAAGGAAAG AGTCAATGAA ACTAGTTTT 1350
(2) INFORMATION FOR SEQ ID NO: 1248:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 409 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1248:
Met Gin Glu He He Gly Ala Ser Leu Val Phe Leu Cys Asn Glu Lys
1 5 10 15
Cys Glu Val Leu Glu Asp Tyr Gly Val Val Phe Asp Glu Lys He Val
20 25 30
Glu He Gly Asp Tyr Gin Ser Leu Thr Leu Lys Tyr Pro His Leu Lys
35 40 45 Ala Gin Phe Phe Glu Asn Ser Val Leu Leu Pro Ala Phe He Asn Ala
50 55 60
His Thr His Phe Glu Phe Ser Asn Asn Lys Ala Ser Phe Asp Tyr Gly 65 70 75 80
Ser Phe Ser Gly Trp Leu Gly Ser Val Leu Asn Asn Gly Gly Ala He
85 90 95
Leu Glu Asn Cys Gin Gly Ala He Gin Asn Ala He Ser Thr Gin Leu
100 105 110
Lys Ser Gly Val Gly Ser Val Gly Ala He Ser Asn His Leu He Glu
115 120 125
Val Asn Leu Leu Lys Glu Ser Pro Leu Asn Ala Val Val Phe Leu Glu
130 135 140
Phe Leu Gly Ser Ser Tyr Ser Leu Glu Lys Leu Lys Ala Phe Glu Ala 145 150 155 160
Lys Phe Lys Glu Leu Lys Asp Leu Glu Asp Lys Lys Leu Lys Ala Ala
165 170 175
Leu Ala Val His Ala Pro Tyr Ser Val Gin Lys Asp Met Ala Leu Ser
180 185 190
Val He Gin Leu Ala Lys Asp Ser Gin Ser Leu Leu Ser Thr His Phe
195 200 205
Leu Glu Ser Leu Glu Glu Leu Glu Trp Val Glu Asn Ser Lys Gly Trp
210 215 220
Phe Glu Asn Phe Tyr Gin His Phe Leu Lys Glu Ser His Phe Lys Ser 225 230 235 240
Leu Tyr Lys Gly Ala Asn Asp Tyr He Asp Met Phe Lys Asp Thr His
245 250 255
Thr Leu Phe Val His Asn Gin Phe Ala Ser Leu Glu Ala Leu Lys Arg
260 265 270
He Lys Ser Gin Val Lys Asn Ala Phe Leu He Thr Cys Pro Phe Ser
275 280 285
Asn Arg Leu Leu Ser Gly Gin Ala Leu Asp Leu Glu Arg Thr Lys Glu
290 295 300
Ala Gly Leu Ser Val Ser Val Ala Thr Asp Gly Leu Ser Ser Asn He 305 310 315 320
Ser Leu Ser Leu Leu Asp Glu Leu Arg Ala Phe Leu Leu Thr His Asn
325 330 335
Met Pro Leu Leu Glu Leu Ala Lys He Ala Leu Leu Gly Ala Thr Arg
340 345 350
His Gly Ala Lys Ala Leu Ala Leu Asn Asn Gly Glu He Glu Ala Asn
355 360 365
Lys Arg Ala Asp Leu Ser Val Phe Gly Phe Asn Glu Lys Phe Thr Lys
370 375 380
Glu Gin Ala He Leu Gin Phe Leu Leu His Ala Lys Glu Val Glu Cys 385 390 395 400
Leu Phe Leu Gly Gly Lys Arg Val He 405
(2) INFORMATION FOR SEQ ID NO: 1249:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1356 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...1320 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1249:
AGGCTTGTAT TGAAAGTTTA TATTGAAACC ATG GGT TGT GCC ATG AAT TCT AGG 54
Met Gly Cys Ala Met Asn Ser Arg 1 5
GAT AGT GAG CAT TTA TTG AGC GAG CTG TCC AAA CTA GAC TAT AAA GAG 102 Asp Ser Glu His Leu Leu Ser Glu Leu Ser Lys Leu Asp Tyr Lys Glu 10 15 20
ACC AAT GAC CCT AAA ACA GCG GAT TTG ATT TTA ATC AAC ACT TGC AGC 150 Thr Asn Asp Pro Lys Thr Ala Asp Leu He Leu He Asn Thr Cys Ser 25 30 35 40
GTG CGC GAA AAG CCT GAA CGA AAA TTG TTT TCA GAA ATC GGT CAA TTC 198 Val Arg Glu Lys Pro Glu Arg Lys Leu Phe Ser Glu He Gly Gin Phe 45 50 55
GCT AAA ATC AAA AAA CCC AAC GCC AAA ATC GGG GTT TGC GGG TGC ACT 246 Ala Lys He Lys Lys Pro Asn Ala Lys He Gly Val Cys Gly Cys Thr 60 65 70
GCA AGC CAC ATG GGA GCG GAT ATT TTG AAA AAA GCC CCA AGC GTG AGC 294 Ala Ser His Met Gly Ala Asp He Leu Lys Lys Ala Pro Ser Val Ser 75 80 85
TTT GTG TTA GGG GCT AGG AAT GTG TCT AAA ATC TCT CAA GTG ATC CAT 342 Phe Val Leu Gly Ala Arg Asn Val Ser Lys He Ser Gin Val He His 90 95 100
AAA GAA AAA GCG GTT GAA GTG GCG ATT GAT TAT GAT GAA AGC GCG TAT 390 Lys Glu Lys Ala Val Glu Val Ala He Asp Tyr Asp Glu Ser Ala Tyr 105 110 115 120
GCG TTT GAA TTT TTT GAA AAA AAG GCT CAA ATC CGA TCG TTG CTA AAT 438 Ala Phe Glu Phe Phe Glu Lys Lys Ala Gin He Arg Ser Leu Leu Asn 125 130 135
ATC TCT ATA GGG TGC GAT AAG AAA TGC GCT TAT TGC ATC GTC CCG CAC 486 He Ser He Gly Cys Asp Lys Lys Cys Ala Tyr Cys He Val Pro His 140 145 150
ACT AGG GGG AAA GAA ATT TCT ATC CCT ATG GAT TTG ATT TTA AAA GAA 534 Thr Arg Gly Lys Glu He Ser He Pro Met Asp Leu He Leu Lys Glu 155 160 165 GCT GAG AAA TTA GCG AAT AAC GGC ACC AAA GAG CTT ATG CTT TTA GGG 582 Ala Glu Lys Leu Ala Asn Asn Gly Thr Lys Glu Leu Met Leu Leu Gly 170 175 180
CAG AAT GTG AAT AAT TAC GGC GCG CGT TTC AGC AGC GAG CAT GCG AAA 630 Gin Asn Val Asn Asn Tyr Gly Ala Arg Phe Ser Ser Glu His Ala Lys 185 190 195 200
GTG GAT TTT AGC GAT TTG TTG GAT AAA TTG AGC GAA ATC CAG GGG ATT 678 Val Asp Phe Ser Asp Leu Leu Asp Lys Leu Ser Glu He Gin Gly He 205 210 215
GAA AGG ATA CGA TTC ACT TCG CCT CAC CCC TTG CAC ATG AAT GAT GGA 726 Glu Arg He Arg Phe Thr Ser Pro His Pro Leu His Met Asn Asp Gly 220 225 230
TTT TTA GAG CGT TTT GCC AAA AAC CCT AAA GTG TGC AAG AGT ATC CAC 774 Phe Leu Glu Arg Phe Ala Lys Asn Pro Lys Val Cys Lys Ser He His 235 240 245
ATG CCT TTA CAG AGC GGA TCT AGC GCG GTG TTA AAG ATG ATG CGA AGG 822 Met Pro Leu Gin Ser Gly Ser Ser Ala Val Leu Lys Met Met Arg Arg 250 255 260
GGT TAT AGT AAG GAG TGG TTT TTA AAT AGG GTG GAG AGG TTA AAA GCT 870 Gly Tyr Ser Lys Glu Trp Phe Leu Asn Arg Val Glu Arg Leu Lys Ala 265 270 275 280
TTA GTG CCT GAA GTG GGC ATT AGC ACG GAT ATT ATC GTA GGC TTC CCT 918 Leu Val Pro Glu Val Gly He Ser Thr Asp He He Val Gly Phe Pro 285 290 295
AAT GAG AGC GAT AAG GAT TTT GAA GAC ACA ATG GAG GTG CTA GAA AAA 966 Asn Glu Ser Asp Lys Asp Phe Glu Asp Thr Met Glu Val Leu Glu Lys 300 305 310
GTG CGC TTT GAC ACG CTC TAT AGT TTC ATT TAT TCC CCA CGC CCT TTC 1014 Val Arg Phe Asp Thr Leu Tyr Ser Phe He Tyr Ser Pro Arg Pro Phe 315 320 325
ACT GAA GCG GGA GCT TGG AAG GAA AGA GTG CCG TTA GAA GTT TCA TCT 1062 Thr Glu Ala Gly Ala Trp Lys Glu Arg Val Pro Leu Glu Val Ser Ser 330 335 340
TCA AGG TTG GAG AGG TTG CAA AAC AGG CAC AAA GAA ATT TTA GAA GAA 1110 Ser Arg Leu Glu Arg Leu Gin Asn Arg His Lys Glu He Leu Glu Glu 345 350 355 360
AAA GCC AAG CTA GAA GTG GGC AAA ACG CAT GTG GTG TTG GTG GAA AAC 1158 Lys Ala Lys Leu Glu Val Gly Lys Thr His Val Val Leu Val Glu Asn 365 370 375
AGG CGT GAA ATG GAT AAT CAA ATC GTG GGT TTT GAA GGG CGT AGC GAT 1206 Arg Arg Glu Met Asp Asn Gin He Val Gly Phe Glu Gly Arg Ser Asp 380 385 390 ACG GGG AAA TTC ATT GAA GTA ACT TGT AAG GAA AAA AGA AAC CCG GGC 1254 Thr Gly Lys Phe He Glu Val Thr Cys Lys Glu Lys Arg Asn Pro Gly 395 4.00 405
GAG CTT GTA AAA GTG GAG ATT ATT TCT CAT TCC AAA GGG CGC TTG ATG 1302 Glu Leu Val Lys Val Glu He He Ser His Ser Lys Gly Arg Leu Met 410 415 420
GCG GCC ACT AAA GGC AAC TAATAAAAAT AACCAATGAA AAAGCGGGTT TAAAGG 1356 Ala Ala Thr Lys Gly Asn 425 430
(2) INFORMATION FOR SEQ ID NO: 1250:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 430 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1250:
Met Gly Cys Ala Met Asn Ser Arg Asp Ser Glu His Leu Leu Ser Glu
1 5 10 15
Leu Ser Lys Leu Asp Tyr Lys Glu Thr Asn Asp Pro Lys Thr Ala Asp
20 25 30
Leu He Leu He Asn Thr Cys Ser Val Arg Glu Lys Pro Glu Arg Lys
35 40 45
Leu Phe Ser Glu He Gly Gin Phe Ala Lys He Lys Lys Pro Asn Ala
50 55 60
Lys He Gly Val Cys Gly Cys Thr Ala Ser His Met Gly Ala Asp He 65 70 75 80
Leu Lys Lys Ala Pro Ser Val Ser Phe Val Leu Gly Ala Arg Asn Val
85 90 95
Ser Lys He Ser Gin Val He His Lys Glu Lys Ala Val Glu Val Ala
100 105 110
He Asp Tyr Asp Glu Ser Ala Tyr Ala Phe Glu Phe Phe Glu Lys Lys
115 120 125
Ala Gin He Arg Ser Leu Leu Asn. He Ser He Gly Cys Asp Lys Lys
130 135 140
Cys Ala Tyr Cys He Val Pro His Thr Arg Gly Lys Glu He Ser He 145 150 155 160
Pro Met Asp Leu He Leu Lys Glu Ala Glu Lys Leu Ala Asn Asn Gly
165 170 175
Thr Lys Glu Leu Met Leu Leu Gly Gin Asn Val Asn Asn Tyr Gly Ala
180 185 190
Arg Phe Ser Ser Glu His Ala Lys Val Asp Phe Ser Asp Leu Leu Asp
195 200 205
Lys Leu Ser Glu He Gin Gly He Glu Arg He Arg Phe Thr Ser Pro
210 215 220
His Pro Leu His Met Asn Asp Gly Phe Leu Glu Arg Phe Ala Lys Asn 225 230 235 240 Pro Lys Val Cys Lys Ser He His Met Pro Leu Gin Ser Gly Ser Ser
245 250 255
Ala Val Leu Lys Met Met Arg Arg Gly Tyr Ser Lys Glu Trp Phe Leu
260 265 270
Asn Arg Val Glu Arg Leu Lys Ala Leu Val Pro Glu Val Gly He Ser
275 280 285
Thr Asp He He Val Gly Phe Pro Asn Glu Ser Asp Lys Asp Phe Glu
290 295 300
Asp Thr Met Glu Val Leu Glu Lys Val Arg Phe Asp Thr Leu Tyr Ser 305 310 315 320
Phe He Tyr Ser Pro Arg Pro Phe Thr Glu Ala Gly Ala Trp Lys Glu
325 330 335
Arg Val Pro Leu Glu Val Ser Ser Ser Arg Leu Glu Arg Leu Gin Asn
340 345 350
Arg His Lys Glu He Leu Glu Glu Lys Ala Lys Leu Glu Val Gly Lys
355 360 365
Thr His Val Val Leu Val Glu Asn Arg Arg Glu Met Asp Asn Gin He
370 375 380
Val Gly Phe Glu Gly Arg Ser Asp Thr Gly Lys Phe He Glu Val Thr 385 390 395 400
Cys Lys Glu Lys Arg Asn Pro Gly Glu Leu Val Lys Val Glu He He
405 410 415
Ser His Ser Lys Gly Arg Leu Met Ala Ala Thr Lys Gly Asn 420 425 430
(2) INFORMATION FOR SEQ ID NO: 1251:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1530 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 50...1501 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1251:
ATTGAAATAC AAATACGAAA GCTTAAAAGA GCAAGATTAA AGGCTAGCA ATG GCT AAA 58
Met Ala Lys
1
ATC ACA ACC GTG ATT GAT ATA GGC TCT AAT TCA GTG CGT TTG GCT GTC 106 He Thr Thr Val He Asp He Gly Ser Asn Ser Val Arg Leu Ala Val 5 10 15
TTT AAA AAG ACG AGC CAG TTT GGG TTT TAC TTG CTT TTT GAG ACT AAG 154 Phe Lys Lys Thr Ser Gin Phe Gly Phe Tyr Leu Leu Phe Glu Thr Lys 20 25 30 35 TCT AAG GTT AGG ATT TCA GAG GGC TGT TAT GCG TTT AAT GGA ATC TTG 202 Ser Lys Val Arg He Ser Glu Gly Cys Tyr Ala Phe Asn Gly He Leu 40 45 50
CAA GAA ATC CCC ATG CAA CGA GCC GTT AAA GCC TTG AGC GAA TTT AAA 250 Gin Glu He Pro Met Gin Arg Ala Val Lys Ala Leu Ser Glu Phe Lys 55 60 65
GAA ATC GCT CTC AAA TAC AAA AGC AAA AAA ATC CTG TGC GTG GCG ACC 298 Glu He Ala Leu Lys Tyr Lys Ser Lys Lys He Leu Cys Val Ala Thr 70 75 80
TCA GCG GTG CGC GAT GCC CCT AAT CGG CTG GAG TTT GTA GCG AGG GTG 346 Ser Ala Val Arg Asp Ala Pro Asn Arg Leu Glu Phe Val Ala Arg Val 85 90 95
AAA AAG GCT TGC GGT TTG CAA ATC AAA ATC ATT GAT GGG CAA AAA GAA 394 Lys Lys Ala Cys Gly Leu Gin He Lys He He Asp Gly Gin Lys Glu 100 105 110 115
GCG CTC TAT GGC GGG ATT GCG TGC GCG AAT TTG TTG CAT AAA AAT TCA 442 Ala Leu Tyr Gly Gly He Ala Cys Ala Asn Leu Leu His Lys Asn Ser 120 125 130
GGG ATC ACG ATA GAT ATT GGA GGG GGT AGC ACC GAG TGC GCG TTG ATT 490 Gly He Thr He Asp He Gly Gly Gly Ser Thr Glu Cys Ala Leu He 135 140 145
GAA AAA GGC AAG ATT AAG GAC TTA ATC TCG CTT GAT GTT GGG ACG ATT 538 Glu Lys Gly Lys He Lys Asp Leu He Ser Leu Asp Val Gly Thr He 150 155 160
CGC ATT AAA GAA ATG TTT TTA GAC AAA GAC TTA GAG GTC AAA TTG GCT 586 Arg He Lys Glu Met Phe Leu Asp Lys Asp Leu Glu Val Lys Leu Ala 165 170 175
AAA GCC TTT ATC CAA AAA GAA GTC TCT AAA CTG CCC TTT AAA CAC AAA 634 Lys Ala Phe He Gin Lys Glu Val Ser Lys Leu Pro Phe Lys His Lys 180 185 190 195
AAC GCC TTT GGG GTG GGG GGG ACG ATC AGA GCG TTG AGT AAG GTA TTG 682 Asn Ala Phe Gly Val Gly Gly Thr He Arg Ala Leu Ser Lys Val Leu 200 205 210
ATG AAA CGC TTT TGT TAC CCT ATT GAT TCT TTG CAT GGC TAT GAA ATA 730 Met Lys Arg Phe Cys Tyr Pro He Asp Ser Leu His Gly Tyr Glu He 215 220 225
GAT GCA CAT AAA AAT TTA GCG TTC ATT GAA AAA ATC GTC ATG CTC AAA 778 Asp Ala His Lys Asn Leu Ala Phe He Glu Lys He Val Met Leu Lys 230 235 240
GAA GAT CAA TTA CGG CTT TTA GGG GTG AAT GAA GAG CGT TTG GAT AGC 826 Glu Asp Gin Leu Arg Leu Leu Gly Val Asn Glu Glu Arg Leu Asp Ser 245 250 255 ATC AGG AGC GGG GCG TTG ATT TTA TCA GTC GTT TTG GAG CAT TTA AAA 874 He Arg Ser Gly Ala Leu He Leu Ser Val Val Leu Glu His Leu Lys 260 265 270 275
ACT TCT TTA ATG ATC ACT AGT GGG GTG GGG GTG AGA GAA GGC GTG TTT 922 Thr Ser Leu Met He Thr Ser Gly Val Gly Val Arg Glu Gly Val Phe 280 285 290
TTG AGC GAT TTA TTG CGC CAT CAT TAC CAT AAA TTC CCC CCC AAT ATC 970 Leu Ser Asp Leu Leu Arg His His Tyr His Lys Phe Pro Pro Asn He 295 300 305
AAC CCC TCT CTC ATC TCT TTA AAA GAT CGC TTT TTG CCC CAT GAA AAG 1018 Asn Pro Ser Leu He Ser Leu Lys Asp Arg Phe Leu Pro His Glu Lys 310 315 320
CAC AGC CAA AAG GTC AAA AAA GAA TGC GTG AAA TTG TTT GAA GCC TTA 1066 His Ser Gin Lys Val Lys Lys Glu Cys Val Lys Leu Phe Glu Ala Leu 325 330 335
TCG CCT TTG CAT AAA ATA GAT GAA AAA TAC CTT TTC CAT TTA AAG ATT 1114 Ser Pro Leu His Lys He Asp Glu Lys Tyr Leu Phe His Leu Lys He 340 345 350 355
GCG GGG GAA TTA GCG AGC ATG GGT AAG ATT TTA AGC GTC TAT TTA GCC 1162 Ala Gly Glu Leu Ala Ser Met Gly Lys He Leu Ser Val Tyr Leu Ala 360 365 370
CAC AAG CAC AGC GCG TAT TTT ATT TTA AAC GCT TTG AGT TAT GGC TTT 1210 His Lys His Ser Ala Tyr Phe He Leu Asn Ala Leu Ser Tyr Gly Phe 375 380 385
AGC CAC CAG GAT AGA GCG ATC ATT TGC TTA TTA GCC CAA TTC AGC CAT 1258 Ser His Gin Asp Arg Ala He He Cys Leu Leu Ala Gin Phe Ser His 390 395 400
AAA AAA ATC CCT AAA GAC AAC GCT ATC GCC CAC ATG AGC GCG ATG ATG 1306 Lys Lys He Pro Lys Asp Asn Ala He Ala His Met Ser Ala Met Met 405 410 415
CCA AGC CTT TTA ACC TTA CAA TGG CTG AGT TTT ATC CTT TCT TTA GCC 1354 Pro Ser Leu Leu Thr Leu Gin Trp Leu Ser Phe He Leu Ser Leu Ala 420 425 430 435
GAA AAT TTG TGC CTA ACA GAC AGC CAT CAT TTA AAA TAC ACG CTA GAA 1402 Glu Asn Leu Cys Leu Thr Asp Ser His His Leu Lys Tyr Thr Leu Glu 440 445 450
AAA AAC AAG CTT GTG ATC CAT TCT AAT GAC ACG CTT TAC TTG GCT AAA 1450 Lys Asn Lys Leu Val He His Ser Asn Asp Thr Leu Tyr Leu Ala Lys 455 460 465
GAA ATG CTC CCC AAA CTC GTT AAG CCC ATT CCT TTG ACG ATA GAG TTT 1498 Glu Met Leu Pro Lys Leu Val Lys Pro He Pro Leu Thr He Glu Phe 470 475 480 GCT TGAAAATAGC GATTGTCAGG CTTTCAGCG 1530
Ala
(2) INFORMATION FOR SEQ ID NO: 1252:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 484 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1252:
Met Ala Lys He Thr Thr Val He Asp He Gly Ser Asn Ser Val Arg
1 5 10 15
Leu Ala Val Phe Lys Lys Thr Ser Gin Phe Gly Phe Tyr Leu Leu Phe
20 25 30
Glu Thr Lys Ser Lys Val Arg He Ser Glu Gly Cys Tyr Ala Phe Asn
35 40 45
Gly He Leu Gin Glu He Pro Met Gin Arg Ala Val Lys Ala Leu Ser
50 55 60
Glu Phe Lys Glu He Ala Leu Lys Tyr Lys Ser Lys Lys He Leu Cys 65 70 75 80
Val Ala Thr Ser Ala Val Arg Asp Ala Pro Asn Arg Leu Glu Phe Val
85 90 95
Ala Arg Val Lys Lys Ala Cys Gly Leu Gin He Lys He He Asp Gly
100 105 110
Gin Lys Glu Ala Leu Tyr Gly Gly He Ala Cys Ala Asn Leu Leu His
115 120 125
Lys Asn Ser Gly He Thr He Asp He Gly Gly Gly Ser Thr Glu Cys
130 135 140
Ala Leu He Glu Lys Gly Lys He Lys Asp Leu He Ser Leu Asp Val 145 150 155 160
Gly Thr He Arg He Lys Glu Met Phe Leu Asp Lys Asp Leu Glu Val
165 170 175
Lys Leu Ala Lys Ala Phe He Gin Lys Glu Val Ser Lys Leu Pro Phe
180 185 190
Lys His Lys Asn Ala Phe Gly Val Gly Gly Thr He Arg Ala Leu Ser
195 200 205
Lys Val Leu Met Lys Arg Phe Cys Tyr Pro He Asp Ser Leu His Gly
210 215 220
Tyr Glu He Asp Ala His Lys Asn Leu Ala Phe He Glu Lys He Val 225 230 235 240
Met Leu Lys Glu Asp Gin Leu Arg Leu Leu Gly Val Asn Glu Glu Arg
245 250 255
Leu Asp Ser He Arg Ser Gly Ala Leu He Leu Ser Val Val Leu Glu
260 265 270
His Leu Lys Thr Ser Leu Met He Thr Ser Gly Val Gly Val Arg Glu
275 280 285
Gly Val Phe Leu Ser Asp Leu Leu Arg His His Tyr His Lys Phe Pro 290 295 300 Pro Asn He Asn Pro Ser Leu He Ser Leu Lys Asp Arg Phe Leu Pro 305 310 315 320
His Glu Lys His Ser Gin Lys Val Lys Lys Glu Cys Val Lys Leu Phe
325 330 335
Glu Ala Leu Ser Pro Leu His Lys He Asp Glu Lys Tyr Leu Phe His
340 345 350
Leu Lys He Ala Gly Glu Leu Ala Ser Met Gly Lys He Leu Ser Val
355 360 365
Tyr Leu Ala His Lys His Ser Ala Tyr Phe He Leu Asn Ala Leu Ser
370 375 380
Tyr Gly Phe Ser His Gin Asp Arg Ala He He Cys Leu Leu Ala Gin 385 390 395 400
Phe Ser His Lys Lys He Pro Lys Asp Asn Ala He Ala His Met Ser
405 410 415
Ala Met Met Pro Ser Leu Leu Thr Leu Gin Trp Leu Ser Phe He Leu
420 425 430
Ser Leu Ala Glu Asn Leu Cys Leu Thr Asp Ser His His Leu Lys Tyr
435 440 445
Thr Leu Glu Lys Asn Lys Leu Val He His Ser Asn Asp Thr Leu Tyr
450 455 460
Leu Ala Lys Glu Met Leu Pro Lys Leu Val Lys Pro He Pro Leu Thr 465 470 475 480
He Glu Phe Ala
(2) INFORMATION FOR SEQ ID NO:1253
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1130 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 60...1073 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1253: GCCAACTACC ATAAAAAGGA TTTTTCTATC CAAAATATAG AGCCTAAAAA AATTAAAGA 59
ATG CGT TTT AAA CAT CTT AAA GGA AAA AGA ATG ACT TAC AAA GAA CGA 107 Met Arg Phe Lys His Leu Lys Gly Lys Arg Met Thr Tyr Lys Glu Arg 1 5 10 15
CTC ATA CAC GAA AAA ATA TTG AAA CAA GAC GAC AAG GGT TTT AAA ACA 155 Leu He His Glu Lys He Leu Lys Gin Asp Asp Lys Gly Phe Lys Thr 20 25 30 GAA CTG CGC ATT TTG AGT ATT TTT ATC GTG GAA TCT TTA GTG AAT ATT 203 Glu Leu Arg He Leu Ser He Phe He Val Glu Ser Leu Val Asn He 35 40 45
TTG GGG TTT ATT TTA GCT AAA ATG CCC CAT TCG TGG TTT TTA AGG TGC 251 Leu Gly Phe He Leu Ala Lys Met Pro His Ser Trp Phe Leu Arg Cys 50 55 60
ATT AAA GCG GTG GCG TGG CTC ATG AAA ACT TTT GAT AAG TGC CGT TAT 299 He Lys Ala Val Ala Trp Leu Met Lys Thr Phe Asp Lys Cys Arg Tyr 65 70 75 80
TTT GAC GCT AAG GCC AAT TTG GAT TTT GTG TTT GGG GAT TCT AAA AGC 347 Phe Asp Ala Lys Ala Asn Leu Asp Phe Val Phe Gly Asp Ser Lys Ser 85 90 95
GAA GAA GAG AAA AAA AGG ATC ATT AAA AAG GGT TAT GAA AAT TTT GCT 395 Glu Glu Glu Lys Lys Arg He He Lys Lys Gly Tyr Glu Asn Phe Ala 100 105 110
TTC ATT ATT TTA GAA ACT ATT AGA GTG ATC TTT ATC CCT AAA GAT GAA 443 Phe He He Leu Glu Thr He Arg Val He Phe He Pro Lys Asp Glu 115 120 125
TAC GAC GCT CGT TTC ACG CTC ATC AAT GAA GAA AAT GTG TGG AAA TCT 491 Tyr Asp Ala Arg Phe Thr Leu He Asn Glu Glu Asn Val Trp Lys Ser 130 135 140
TTA AAC AAG GAA GGC CAA GCG ATC ACT TTA TGC ATG CAT TTT GGC TAT 539 Leu Asn Lys Glu Gly Gin Ala He Thr Leu Cys Met His Phe Gly Tyr 145 150 155 160
TGG GAA GCG GTA GGC ACG ACT TTA GCG CAA TAT TAT GAA AAT TAT GGT 587 Trp Glu Ala Val Gly Thr Thr Leu Ala Gin Tyr Tyr Glu Asn Tyr Gly 165 170 175
AGG GGG TGT TTG GGG CGT TTG ACT AAA TTT GCC CCT ATC AAT CAC ATG 635 Arg Gly Cys Leu Gly Arg Leu Thr Lys Phe Ala Pro He Asn His Met 180 185 190
ATT ATG AGT AGG CGA GAG GCG TTT GGG GTG CGT TTT GTC AAT AAA ATA 683 He Met Ser Arg Arg Glu Ala Phe Gly Val Arg Phe Val Asn Lys He 195 200 205
GGG GCG ATG AAA GAA CTC ATT AAA ATG TAT AAT CAA GGC AAT GGT CTG 731 Gly Ala Met Lys Glu Leu He Lys Met Tyr Asn Gin Gly Asn Gly Leu 210 215 220
GTG GGG ATT TTA GTG GAT CAA AAT GTC GTG CCT AAA GAT GGG GTG GTG 779 Val Gly He Leu Val Asp Gin Asn Val Val Pro Lys Asp Gly Val Val 225 230 235 240
GTG AAA TTC TTT GAT AGA GAC GCT ACG CAC ACC ACG ATC GCT TCT ATT 827 Val Lys Phe Phe Asp Arg Asp Ala Thr His Thr Thr He Ala Ser He 245 250 255 TTG TCG CGC CGT TAT AAT ATA GAT ATT CAG CCG GTA TTC ATT GAT TTT 875 Leu Ser Arg Arg Tyr Asn He Asp He Gin Pro Val Phe He Asp Phe 260 265 270
AAT GAC GAT TAT TCG CAT TAT ACA GCG ACC TAT TAT CCG AGT ATC CGC 923 Asn Asp Asp Tyr Ser His Tyr Thr Ala Thr Tyr Tyr Pro Ser He Arg 275 280 285
TCT CAA ATC ACC GAT AAC GCG CAA AAC GAT ATT TTA GAA TGC ACG CAA 971 Ser Gin He Thr Asp Asn Ala Gin Asn Asp He Leu Glu Cys Thr Gin 290 295 300
GCC CAA GCG AGT TTG TGC GAA GAG GTG ATT AGA AAC CAC CCG GAA AGT 1019 Ala Gin Ala Ser Leu Cys Glu Glu Val He Arg Asn His Pro Glu Ser 305 310 315 320
TAT TTT TGG TTC CAT AGG CGT TTT AAA AGC ACC CAC CCT GAG ATT TAT 1067 Tyr Phe Trp Phe His Arg Arg Phe Lys Ser Thr His Pro Glu He Tyr 325 330 335
CAA AGA TAGGGTTTTG TTTTAATCAA AAATTAAAAA CTAAAGCCTT ATTTTTAAGA AAA 1126 Gin Arg
CTTT 1130
(2) INFORMATION FOR SEQ ID NO: 1254:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 338 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1254:
Met Arg Phe Lys His Leu Lys Gly Lys Arg Met Thr Tyr Lys Glu Arg
1 5 10 15
Leu He His Glu Lys He Leu Lys Gin Asp Asp Lys Gly Phe Lys Thr
20 25 30
Glu Leu Arg He Leu Ser He Phe He Val Glu Ser Leu Val Asn He
35 40 45
Leu Gly Phe He Leu Ala Lys Met Pro His Ser Trp Phe Leu Arg Cys
50 55 60
He Lys Ala Val Ala Trp Leu Met Lys Thr Phe Asp Lys Cys Arg Tyr 65 70 75 80
Phe Asp Ala Lys Ala Asn Leu Asp Phe Val Phe Gly Asp Ser Lys Ser
85 90 95
Glu Glu Glu Lys Lys Arg He He Lys Lys Gly Tyr Glu Asn Phe Ala
100 105 110
Phe He He Leu Glu Thr He Arg Val He Phe He Pro Lys Asp Glu
115 120 125
Tyr Asp Ala Arg Phe Thr Leu He Asn Glu Glu Asn Val Trp Lys Ser 130 135 140
Leu Asn Lys Glu Gly Gin Ala He Thr Leu Cys Met His Phe Gly Tyr 145 150 155 160
Trp Glu Ala Val Gly Thr Thr Leu Ala Gin Tyr Tyr Glu Asn Tyr Gly
165 170 175
Arg Gly Cys Leu Gly Arg Leu Thr Lys Phe Ala Pro He Asn His Met
180 185 190
He Met Ser Arg Arg Glu Ala Phe Gly Val Arg Phe Val Asn Lys He
195 200 205
Gly Ala Met Lys Glu Leu He Lys Met Tyr Asn Gin Gly Asn Gly Leu
210 215 220
Val Gly He Leu Val Asp Gin Asn Val Val Pro Lys Asp Gly Val Val 225 230 235 240
Val Lys Phe Phe Asp Arg Asp Ala Thr His Thr Thr He Ala Ser He
245 250 255
Leu Ser Arg Arg Tyr Asn He Asp He Gin Pro Val Phe He Asp Phe
260 265 270
Asn Asp Asp Tyr Ser His Tyr Thr Ala Thr Tyr Tyr Pro Ser He Arg
275 280 285
Ser Gin He Thr Asp Asn Ala Gin Asn Asp He Leu Glu Cys Thr Gin
290 295 300
Ala Gin Ala Ser Leu Cys Glu Glu Val He Arg Asn His Pro Glu Ser 305 310 315 320
Tyr Phe Trp Phe His Arg Arg Phe Lys Ser Thr His Pro Glu He Tyr
325 330 335
Gin Arg
(2) INFORMATION FOR SEQ ID NO: 1255:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8748 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...8694 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1255:
AGAGGGTAGC ATTTA ATG AAA AAG TTT AAA AAG AAA CCA AAA AGT ATC AAA 51 Met Lys Lys Phe Lys Lys Lys Pro Lys Ser He Lys 1 5 10
CGA TCG CAT CAA AAT CAA AAA ACA ATC TTA AAG CGT CCT TTA TGG CTT 99 Arg Ser His Gin Asn Gin Lys Thr He Leu Lys Arg Pro Leu Trp Leu 15 20 25 ATG CCT TTA CTC ATC AGC GGG TTT GCT AGT GGG GTG TAT GCG AAT AAT 147 Met Pro Leu Leu He Ser Gly Phe Ala Ser Gly Val Tyr Ala Asn Asn 30 35 40
CTG TGG GAT TTG TTA AAC CCA AAA GTG GGG GGT GAG TAT GTG CAT TGG 195 Leu Trp Asp Leu Leu Asn Pro Lys Val Gly Gly Glu Tyr Val His Trp 45 50 55 60
GTT AAG GGC AGT CAG TAT TGT GCA TGG TGG GAA TTT GCT GGG TGT TTA 243 Val Lys Gly Ser Gin Tyr Cys Ala Trp Trp Glu Phe Ala Gly Cys Leu 65 70 75
AAG AAT GTA TGG GGG GCA AAT CAT AAA GGC TAT GAT GCT GGA AAC GCC 291 Lys Asn Val Trp Gly Ala Asn His Lys Gly Tyr Asp Ala Gly Asn Ala 80 85 90
GCT AAC TAT TTG TCT TCT CAA AAC TAT CAA GCT ATT TCG GTG GGT AGT 339 Ala Asn Tyr Leu Ser Ser Gin Asn Tyr Gin Ala He Ser Val Gly Ser 95 100 105
GGG AAT GAA ACG GGG ACT TAT AGT TTA AGC GGT TTT ACC AAT TAT GTT 387 Gly Asn Glu Thr Gly Thr Tyr Ser Leu Ser Gly Phe Thr Asn Tyr Val 110 115 120
GGG GGC AAT CTC ACG ATC AAT CTA GGC AAT AGC GTT GTT TTA GAT TTA 435 Gly Gly Asn Leu Thr He Asn Leu Gly Asn Ser Val Val Leu Asp Leu 125 130 135 140
AGC GGT TCT AAT AGT TTC ACT TCG TAT CAA GGT TAT AAT CAA GGC AAA 483 Ser Gly Ser Asn Ser Phe Thr Ser Tyr Gin Gly Tyr Asn Gin Gly Lys 145 150 155
GAT GAT GTA ACA TTT ACG GTT GGC GCA ATC AAT TTA AAC GGC ACT TTA 531 Asp Asp Val Thr Phe Thr Val Gly Ala He Asn Leu Asn Gly Thr Leu 160 165 170
GAA GTG GGT AAT CGT GTG GGA TCG GGA GCT GGC ACG CAC ACC GGC ACA 579 Glu Val Gly Asn Arg Val Gly Ser Gly Ala Gly Thr His Thr Gly Thr 175 180 185
GCC ACT TTA AAC TTG AAC GCT AAT AAG GTC AAT ATC AAT TCC AAT ATC 627 Ala Thr Leu Asn Leu Asn Ala Asn Lys Val Asn He Asn Ser Asn He 190 195 200
AAC GCG TAT AAA ACT TCG CAA GTG AAT ATA GGC AAC GCT AAC AGC GTT 675 Asn Ala Tyr Lys Thr Ser Gin Val Asn He Gly Asn Ala Asn Ser Val 205 210 215 220
ATT ACC ATT GGT TCG GTT TCT TTG AGT GGG GAT GTT TGC AGT TCT TTA 723 He Thr He Gly Ser Val Ser Leu Ser Gly Asp Val Cys Ser Ser Leu 225 230 235
GCT AGC GTT GGG ATA GGG GCT AAT TGC TCC ACT TCT GGG CCT AGC TAT 771 Ala Ser Val Gly He Gly Ala Asn Cys Ser Thr Ser Gly Pro Ser Tyr 240 245 250 TCT TTT AAA GGG ACG ACT AAC GCT ACT AAC ACG GCG TTT AGT AAT GCA 819 Ser Phe Lys Gly Thr Thr Asn Ala Thr Asn Thr Ala Phe Ser Asn Ala 255 260 265
AGC GGC AGT TTC ACT TTT GAA GAG AAC GCC ACT TTT AGC GGG GCG AAA 867 Ser Gly Ser Phe Thr Phe Glu Glu Asn Ala Thr Phe Ser Gly Ala Lys 270 275 280
TGG AAT GGG GGG ACT TAT ACC TTT AAT AAA GAG TTT AGC GCT ACC AAT 915 Trp Asn Gly Gly Thr Tyr Thr Phe Asn Lys Glu Phe Ser Ala Thr Asn 285 290 295 300
AAC ACC GCC TTT AGT AGC GGT AGT TTT AAT TTT AAA GGT GTA AGC TCT 963 Asn Thr Ala Phe Ser Ser Gly Ser Phe Asn Phe Lys Gly Val Ser Ser 305 310 315
TTT AAT GGT ACT TCG TTT AGT AAC GCT TCT TAT ACT TTT GAC AAT CAA 1011 Phe Asn Gly Thr Ser Phe Ser Asn Ala Ser Tyr Thr Phe Asp Asn Gin 320 325 330
GCC ACT TTC CAA AAC AGC TCC TTT AAT GGG GGG ACT TTT ACT TTT AAT 1059 Ala Thr Phe Gin Asn Ser Ser Phe Asn Gly Gly Thr Phe Thr Phe Asn 335 340 345
AAC CAA ACT AAT CCA ACT AAC AAC GCT CAG CAC CCC CAA ATT CAA AAC 1107 Asn Gin Thr Asn Pro Thr Asn Asn Ala Gin His Pro Gin He Gin Asn 350 355 360
AGC TCT TTT AGT GGT AAC GCT ACC ACT CTT AAG GGC TTT GTG AAT TTC 1155 Ser Ser Phe Ser Gly Asn Ala Thr Thr Leu Lys Gly Phe Val Asn Phe 365 370 375 380
CAG CAA GCC TTT AAC AAT TCA AAC CAC CAA CTA ACG ATC CAA AAC GCT 1203 Gin Gin Ala Phe Asn Asn Ser Asn His Gin Leu Thr He Gin Asn Ala 385 390 395
TCC TTT AAT AAC GCC ACT TTT AAC AAT ACC GGT AAA ATC ACT ATA GAA 1251 Ser Phe Asn Asn Ala Thr Phe Asn Asn Thr Gly Lys He Thr He Glu 400 405 410
AAA GAT GCG AGT TTT AAT AAC ACG ACA TTC AAC ACT TCT GTT GAT ACA 1299 Lys Asp Ala Ser Phe Asn Asn Thr Thr Phe Asn Thr Ser Val Asp Thr 415 420 425
AAC AAC ATG AGT GTT ACC GGT GGC GTT ACT TTA AGC GGT AAA AAT GAC 1347 Asn Asn Met Ser Val Thr Gly Gly Val Thr Leu Ser Gly Lys Asn Asp 430 435 440
TTG AAA AAT GGC TCA ACC CTT GAT TTT GGG AGT TCT AAA ATC ACT CTC 1395 Leu Lys Asn Gly Ser Thr Leu Asp Phe Gly Ser Ser Lys He Thr Leu 445 450 455 460
GCT CAA GGG ACG ACT TTC AAC CTC ACA AGT TTA GGC AGT GAG AAG AGC 1443 Ala Gin Gly Thr Thr Phe Asn Leu Thr Ser Leu Gly Ser Glu Lys Ser 465 470 475 GTA ACG ATT TTA AAT TCT AGC GGT GGG ATC ACT TAT AGT AAC CTT TTA 1491 Val Thr He Leu Asn Ser Ser Gly Gly He Thr Tyr Ser Asn Leu Leu 480 485 490
AAC CAT GCA ATC AAC GGC TTG ACA AGT GCC TTA AAA ACG AAC GAA AGC 1539 Asn His Ala He Asn Gly Leu Thr Ser Ala Leu Lys Thr Asn Glu Ser 495 500 505
CTT TCA AAT CCG CAA AGT TTC GCT CAA GGT TTG TGG GAT ATA ATC ACT 1587 Leu Ser Asn Pro Gin Ser Phe Ala Gin Gly Leu Trp Asp He He Thr 510 515 520
TAC AAT GGG GTT ACC GGG CAG CTT TTG AAT GAA AAC GCT GCA ACA TCT 1635 Tyr Asn Gly Val Thr Gly Gin Leu Leu Asn Glu Asn Ala Ala Thr Ser 525 530 535 540
AAA CCC ACT GAC TCT TCG CCC TCT AAA TCC TCT ACA AAC TCT ACG CAA 1683 Lys Pro Thr Asp Ser Ser Pro Ser Lys Ser Ser Thr Asn Ser Thr Gin 545 550 555
GTC TAT CAA GTG GGT TAC AAA ATA GGG GAT ACT ATC TAC AAA CTG CAA 1731 Val Tyr Gin Val Gly Tyr Lys He Gly Asp Thr He Tyr Lys Leu Gin 560 565 570
GAA ACT TTC AGC CAC AAT TCC ATT ATT ATT CAG GCT TTA GAG AGC GGG 1779 Glu Thr Phe Ser His Asn Ser He He He Gin Ala Leu Glu Ser Gly 575 580 585
ACT TAC ACG CCA CCC CCT GTC ATT AAC GGC TCC AAA TTT GAC TTA TCC 1827 Thr Tyr Thr Pro Pro Pro Val He Asn Gly Ser Lys Phe Asp Leu Ser 590 595 600
GCT TCA AAT TAT ATC AAT GCT GAC ATG CCT TGG TAT GAC CAT AAA TAT 1875 Ala Ser Asn Tyr He Asn Ala Asp Met Pro Trp Tyr Asp His Lys Tyr 605 610 615 620
TAC ATC CCT AAA TCC CAA AAT TTT ACA GAG AGC GGG ACT TAT TAC TTG 1923 Tyr He Pro Lys Ser Gin Asn Phe Thr Glu Ser Gly Thr Tyr Tyr Leu 625 630 635
CCG AGC GTC CAA ATA TGG GGG AGC TAC ACT AAC TCG TTT AAA CAA ACT 1971 Pro Ser Val Gin He Trp Gly Ser Tyr Thr Asn Ser Phe Lys Gin Thr 640 645 650
TTT AGC GCA AAT GGT AGT AAT CTG GTG ATT GGG TAT AAC TCA ACA TGG 2019 Phe Ser Ala Asn Gly Ser Asn Leu Val He Gly Tyr Asn Ser Thr Trp 655 660 665
ACT GAT CAT AAT GTC TCT TCT AGC GGC ACG GTG TCT TTT GGG GAC ACT 2067 Thr Asp His Asn Val Ser Ser Ser Gly Thr Val Ser Phe Gly Asp Thr 670 675 680
TCA GGG AGC GCT CTT AAT GGG CAT TGC GGA CCT TGG CCG TAT TAC CAA 2115 Ser Gly Ser Ala Leu Asn Gly His Cys Gly Pro Trp Pro Tyr Tyr Gin 685 690 695 700 TGC ACA GGC ACG ACT AAC GGC ACT TAT AGC GCC TAT CAT GTG TAT ATC 2163 Cys Thr Gly Thr Thr Asn Gly Thr Tyr Ser Ala Tyr His Val Tyr He 705 710 715
ACA GCG AAT CTG CGT TCT GGC AAT CGT ATA GGC ACC GGT GGG GCA GCT 2211 Thr Ala Asn Leu Arg Ser Gly Asn Arg He Gly Thr Gly Gly Ala Ala 720 725 730
AAT CTA ATC TTT AAT GGG GTA GAT AGT ATC AAT ATC GCT AAC GCT ACC 2259 Asn Leu He Phe Asn Gly Val Asp Ser He Asn He Ala Asn Ala Thr 735 740 745
ATC ACG CAA CAT AAC GCC GGA ATC TAT TCA AGC TCT ATG ACT TTT TCC 2307 He Thr Gin His Asn Ala Gly He Tyr Ser Ser Ser Met Thr Phe Ser 750 755 760
ACG CAA AGC ATG GAT AAT TCG CAG AAT TTG AAT GGT CTA AAT TCT AAC 2355 Thr Gin Ser Met Asp Asn Ser Gin Asn Leu Asn Gly Leu Asn Ser Asn 765 770 775 780
GGC AAA CTT TCG GTG TAT GGC ACC ACT TTC ACT AAC GAA GCT AAA GAT 2403 Gly Lys Leu Ser Val Tyr Gly Thr Thr Phe Thr Asn Glu Ala Lys Asp 785 790 795
GGG AAA TTC ATT TTC AAT GCA GGG CAA GCG GTT TTT GAA AAC ACC AAC 2451 Gly Lys Phe He Phe Asn Ala Gly Gin Ala Val Phe Glu Asn Thr Asn 800 805 810
TTT AAT GGA GGG AGT TAC CAA TTC AGC GGC GAT AGC TTG AAT TTT TCA 2499 Phe Asn Gly Gly Ser Tyr Gin Phe Ser Gly Asp Ser Leu Asn Phe Ser 815 820 825
AAC AAC AAC CAG TTC AAT AGC GGT TCG TTT GAA ATT AGC GCA AAA AAC 2547 Asn Asn Asn Gin Phe Asn Ser Gly Ser Phe Glu He Ser Ala Lys Asn 830 835 840
GCT TCG TTC AAT AAC GCT AAC TTT AAC AAC AGC GCT TCT TTT AAT TTC 2595 Ala Ser Phe Asn Asn Ala Asn Phe Asn Asn Ser Ala Ser Phe Asn Phe 845 850 855 860
AAT AAT TCT AAC GCG ACC ACT TCG TTT GTG GGG GAT TTC ACT AAC GCT 2643 Asn Asn Ser Asn Ala Thr Thr Ser Phe Val Gly Asp Phe Thr Asn Ala 865 870 875
AAT TCA AAT TTG CAA ATC GCC GGG AAC GCT GTT TTT GGG AAC TCT ACT 2691 Asn Ser Asn Leu Gin He Ala Gly Asn Ala Val Phe Gly Asn Ser Thr 880 885 890
AAT GGC TCT CAA AAT ACC GCT AAT TTT AAT AAT ACC GGC TCT GTT AAT 2739 Asn Gly Ser Gin Asn Thr Ala Asn Phe Asn Asn Thr Gly Ser Val Asn 895 900 905
ATT TCA GGG AAT GCA ACC TTT GAT AAT GTG GTG TTT AAT GGC CCT ACG 2787 He Ser Gly Asn Ala Thr Phe Asp Asn Val Val Phe Asn Gly Pro Thr 910 915 920 AAC ACG AGC GTG AAA GGG CAG GTT ACT TTA AAT AAC ATC ACT TTA AAA 2835 Asn Thr Ser Val Lys Gly Gin Val Thr Leu Asn Asn He Thr Leu Lys 925 930 935 940
AAC CTG AAC GCC CCT TTG TCT TTT GGC GAT GGG ACG ATT ACT TTT AAC 2883 Asn Leu Asn Ala Pro Leu Ser Phe Gly Asp Gly Thr He Thr Phe Asn 945 950 955
GCT CAT TCG GTG ATT AAT ATT GCT GAA TCT ATC ACT AAT GGC AAC CCT 2931 Ala His Ser Val He Asn He Ala Glu Ser He Thr Asn Gly Asn Pro 960 965 970
ATC ACT CTT GTA AGC TCT TCT AAA GAA ATT GAA TAC AAC AAC GCT TTC 2979 He Thr Leu Val Ser Ser Ser Lys Glu He Glu Tyr Asn Asn Ala Phe 975 980 985
AGT AAA AAT CTA TGG CAG CTC ATC AAC TAC CAA GGG CAT GGG GCA AGC 3027 Ser Lys Asn Leu Trp Gin Leu He Asn Tyr Gin Gly His Gly Ala Ser 990 995 1000
AGT GAA AAG CTC GTC TCT AGC GCG GGT AAT GGC GTT TAT GAT GTG GTG 3075 Ser Glu Lys Leu Val Ser Ser Ala Gly Asn Gly Val Tyr Asp Val Val 1005 1010 1015 1020
TAT TCT TTC AAT AAC CAA ACC TAC AAT TTC CAA GAG GTT TTT TCA CAA 3123 Tyr Ser Phe Asn Asn Gin Thr Tyr Asn Phe Gin Glu Val Phe Ser Gin 1025 1030 1035
AAC AGC ATT TCT ATC CGG CGT TTG GGC GTT AAC ATG GTG TTT GAT TAT 3171 Asn Ser He Ser He Arg Arg Leu Gly Val Asn Met Val Phe Asp Tyr 1040 1045 1050
GTG GAT ATG GAA AAA TCG GAT CAT TTA TAT TAT CAA AAC GCT CTC GGT 3219 Val Asp Met Glu Lys Ser Asp His Leu Tyr Tyr Gin Asn Ala Leu Gly 1055 1060 1065
TTT ATG ACC TAC ATG CCT AAT AGC TAT AAC AAT AAT TTA GGG AAT GCA 3267 Phe Met Thr Tyr Met Pro Asn Ser Tyr Asn Asn Asn Leu Gly Asn Ala 1070 1075 1080
AAC AAC ACC ATT TAC TAT TAC GAC AAG AGC ATT GAT TTT TAT GCG AGC 3315 Asn Asn Thr He Tyr Tyr Tyr Asp Lys Ser He Asp Phe Tyr Ala Ser 1085 1090 1095 1100
GGG AAA ACT CTA TTC ACT AAA GCG GAA TTT TCT CAA ACA TTC ACC GGG 3363 Gly Lys Thr Leu Phe Thr Lys Ala Glu Phe Ser Gin Thr Phe Thr Gly 1105 1110 1115
CAA AAC AGC GCG ATC GTT TTT GGG GCT AAA AGC ATA TGG ACG AGC TTA 3411 Gin Asn Ser Ala He Val Phe Gly Ala Lys Ser He Trp Thr Ser Leu 1120 1125 1130
AGC GAT GCA CCG CAG TCT AAC ACC ATC ATT CGC TTT GGG GAC AAT AAG 3459 Ser Asp Ala Pro Gin Ser Asn Thr He He Arg Phe Gly Asp Asn Lys 1135 1140 1145 GGA GCA GGG AGT AAT GAT GCG AGC GGG CAT TGC TGG AAT TTG CAA TGC 3507 Gly Ala Gly Ser Asn Asp Ala Ser Gly His Cys Trp Asn Leu Gin Cys 1150 1155 1160
ATA GGC TTT ATT ACA GGG CAT TAT GAA GCG CAA AAG ATT TAC ATC ACC 3555 He Gly Phe He Thr Gly His Tyr Glu Ala Gin Lys He Tyr He Thr 1165 1170 1175 1180
GGT AGC ATT GAA AGC GGG AAT CGC ATT TCT AGC GGT GGG GGC GCG AGC 3603 Gly Ser He Glu Ser Gly Asn Arg He Ser Ser Gly Gly Gly Ala Ser 1185 1190 1195
CTT AAT TTT AAC GGG CTT CAA GGC ATT CTT TTA ACG AAC GCG ACT TTG 3651 Leu Asn Phe Asn Gly Leu Gin Gly He Leu Leu Thr Asn Ala Thr Leu 1200 1205 1210
TAT AAC CGC GCC GCT GGC ACG CAA AGC TCG TCT ATG AAT TTT ATC TCT 3699 Tyr Asn Arg Ala Ala Gly Thr Gin Ser Ser Ser Met Asn Phe He Ser 1215 1220 1225
AAC AGC GCG AAC ATT CAG GCT CAA AAC TCC TAT TTT ATA GAC GAT ACC 3747 Asn Ser Ala Asn He Gin Ala Gin Asn Ser Tyr Phe He Asp Asp Thr 1230 1235 1240
GCA CAA AAT GGC GGT AAC CCT AAT TTC AGT TTC AAC GCT TTG AAT CTG 3795 Ala Gin Asn Gly Gly Asn Pro Asn Phe Ser Phe Asn Ala Leu Asn Leu 1245 1250 1255 1260
GAT TTT TCT AAC AGC TCT TTT AGA GGC TAT GTG GGG AAA ACG CAA TCT 3843 Asp Phe Ser Asn Ser Ser Phe Arg Gly Tyr Val Gly Lys Thr Gin Ser 1265 1270 1275
GTT TTT AAA TTC AAT GCC AAG AAT GCG ATC AGT TTC ACC AAC AGC ACG 3891 Val Phe Lys Phe Asn Ala Lys Asn Ala He Ser Phe Thr Asn Ser Thr 1280 1285 1290
AAT TTA AGC TCT GGT TTG TAT CAA ATG CAA GCT AAA AGC GTG TTG TTT 3939 Asn Leu Ser Ser Gly Leu Tyr Gin Met Gin Ala Lys Ser Val Leu Phe 1295 1300 1305
GAC AAT TCC AAT TTA AGC GTT TCA GTG GGG ACA AGC AGT ATT AAA GCC 3987 Asp Asn Ser Asn Leu Ser Val Ser Val Gly Thr Ser Ser He Lys Ala 1310 1315 1320
AAT GCG ATC AAT CTT TCT CAA AAT GCC TCT ATT AAT GCG AGC AAC CAT 4035 Asn Ala He Asn Leu Ser Gin Asn Ala Ser He Asn Ala Ser Asn His 1325 1330 1335 1340
TCA ACC TTA GAA CTT CAA GGC GAT TTG AAT GTG AAC GAC ACC AGC TCG 4083 Ser Thr Leu Glu Leu Gin Gly Asp Leu Asn Val Asn Asp Thr Ser Ser 1345 1350 1355
CTC AAC CTC AAC CAA AGC ACG ATT AAT GTT TCC AAT AAC GCC ACG ATC 4131 Leu Asn Leu Asn Gin Ser Thr He Asn Val Ser Asn Asn Ala Thr He 1360 1365 1370 AAC GAT TAT GCG AGC TTG ATT GCG AGT AAT GGC TCT CAC CTT AAT TTT 4179 Asn Asp Tyr Ala Ser Leu He Ala Ser Asn Gly Ser His Leu Asn Phe 1375 1380 1385
AAC GGG GCG GTT AAT TTC AAT TCA GCG AAT ATT ACT ACG AGT TTG AAT 4227 Asn Gly Ala Val Asn Phe Asn Ser Ala Asn He Thr Thr Ser Leu Asn 1390 1395 1400
AAT TCC TCT ATC GTG TTT AAG GGG GCG GTC TCT TTA GGA GGG CAG TTT 4275 Asn Ser Ser He Val Phe Lys Gly Ala Val Ser Leu Gly Gly Gin Phe 1405 1410 1415 1420
AAT TTA AGC AAT AAC TCT TCT TTA GAT TTC CAA GGC TCT AGC GCT ATC 4323 Asn Leu Ser Asn Asn Ser Ser Leu Asp Phe Gin Gly Ser Ser Ala He 1425 1430 1435
ACC TCT AAC ACG GCG TTT AAT TTC TAT GAT AAC GCT TTT TCT CAA AGC 4371 Thr Ser Asn Thr Ala Phe Asn Phe Tyr Asp Asn Ala Phe Ser Gin Ser 1440 1445 1450
CCC ATC ACT TTC CAT CAA GCC CTT GAC ATT AAA GCG CCC TTA AGT TTG 4419 Pro He Thr Phe His Gin Ala Leu Asp He Lys Ala Pro Leu Ser Leu 1455 1460 1465
GGA GGC AAC CTT TTA AAC CCT AAC AAC AGC AGC GTG CTG GAT TTA AAA 4467 Gly Gly Asn Leu Leu Asn Pro Asn Asn Ser Ser Val Leu Asp Leu Lys 1470 1475 1480
AAC AGC CAG CTT GTT TTT GGC GAT CAA GGG AGT TTG AAT ATC GCT AAC 4515 Asn Ser Gin Leu Val Phe Gly Asp Gin Gly Ser Leu Asn He Ala Asn 1485 1490 1495 1500
ATT GAT TTA CTA AGC GAT CTA AAT GAT AAT AAA AAT CGT GTG TAT AAC 4563 He Asp Leu Leu Ser Asp Leu Asn Asp Asn Lys Asn Arg Val Tyr Asn 1505 1510 1515
ATC ATT CAA GCG GAC ATG AAT AGT AAT TGG TAT GAG CGT ATC AGC TTC 4611 He He Gin Ala Asp Met Asn Ser Asn Trp Tyr Glu Arg He Ser Phe 1520 1525 1530
TTT GGC ATG CAC ATC AAT GAC GGG ATT TAT GAT GCT AAA AAC CAA ACT 4659 Phe Gly Met His He Asn Asp Gly He Tyr Asp Ala Lys Asn Gin Thr 1535 1540 1545
TAT AGT TTC ACT AAC CCC CTT AAT AAC GCC CTA AAA ATC ACC GAG AGC 4707 Tyr Ser Phe Thr Asn Pro Leu Asn Asn Ala Leu Lys He Thr Glu Ser 1550 1555 1560
TTT AAA GAC AAC CAA CTA AGC GTT ACG CTC TCT CAA ATC CCG GGT ATT 4755 Phe Lys Asp Asn Gin Leu Ser Val Thr Leu Ser Gin He Pro Gly He 1565 1570 1575 1580
AAA AAC ACG CTC TAT AAC ATT GGC TCT GAA ATT TTT AAC TAC CAA AAA 4803 Lys Asn Thr Leu Tyr Asn He Gly Ser Glu He Phe Asn Tyr Gin Lys 1585 1590 1595 GTT TAT AAC AAC GCT AAT GGC GTG TAT TCT TAT AGC GAT GAT GCA CAA 4851 Val Tyr Asn Asn Ala Asn Gly Val Tyr Ser Tyr Ser Asp Asp Ala Gin 1600 1605 1610
GGC GTG TTT TAT CTC ACA AGC AAC GTG AAA GGC TAT TAC AAC CCT AAC 4899 Gly Val Phe Tyr Leu Thr Ser Asn Val Lys Gly Tyr Tyr Asn Pro Asn 1615 1620 1625
CAA TCC TAT CAA GCC AGC GGC AGT AAC AAC ACC ACG AAA AAT AAT AAT 4947 Gin Ser Tyr Gin Ala Ser Gly Ser Asn Asn Thr Thr Lys Asn Asn Asn 1630 1635 1640
CTA ACC TCT GAA TCT TCT ATC ATC TCG CAA ACC TAT AAC GCG CAA GGC 4995 Leu Thr Ser Glu Ser Ser He He Ser Gin Thr Tyr Asn Ala Gin Gly 1645 1650 1655 1660
AAC CCT ATT AGC GCG TTG CAC ATC TAT AAC AAG GGC TAT AAT TTC AAC 5043 Asn Pro He Ser Ala Leu His He Tyr Asn Lys Gly Tyr Asn Phe Asn 1665 1670 1675
AAT ATC AAA GCG TTA GGG CAA ATG GCT CTC AAA CTC TAC CCT GAA ATC 5091 Asn He Lys Ala Leu Gly Gin Met Ala Leu Lys Leu Tyr Pro Glu He 1680 1685 1690
AAA AAG GTA TTA GGG AAT GAT TTT TCG CCC TCA AGT TTG AAC GCT TTA 5139 Lys Lys Val Leu Gly Asn Asp Phe Ser Pro Ser Ser Leu Asn Ala Leu 1695 1700 1705
AAC TCT AAT GCG CTA AAC CAA CTT ACC AAA CTC ATC ACG CCT AAC GAC 5187 Asn Ser Asn Ala Leu Asn Gin Leu Thr Lys Leu He Thr Pro Asn Asp 1710 1715 1720
TGG AAA AAC ATT AAC GAG TTG ATT GAT AAC GCA AAC AAT TCG GTG GTG 5235 Trp Lys Asn He Asn Glu Leu He Asp Asn Ala Asn Asn Ser Val Val 1725 1730 1735 1740
CAA AAT TTC AAT AAC GGC ACT TTG ATT GTG GGA GCG ACT CAA ATA GGG 5283 Gin Asn Phe Asn Asn Gly Thr Leu He Val Gly Ala Thr Gin He Gly 1745 1750 1755
CAA ACA GAC ACC AAT AGC GCG GTT GTT TTT GGG GGC TTG GGC TAT CAA 5331 Gin Thr Asp Thr Asn Ser Ala Val Val Phe Gly Gly Leu Gly Tyr Gin 1760 1765 1770
ACA CCT TGT GAT TAT ACT GAT ATT GTG TGC CAA AAA TTT AGA GGC ACT 5379 Thr Pro Cys Asp Tyr Thr Asp He Val Cys Gin Lys Phe Arg Gly Thr 1775 1780 1785
TAT TTA GGA CAG CTT TTA GAG TCC AGC TCG GCT GAT TTG GGC TAT ATT 5427 Tyr Leu Gly Gin Leu Leu Glu Ser Ser Ser Ala Asp Leu Gly Tyr He 1790 1795 1800
GAC ACG ACT TTT AAC GCT AAA GAA ATT TAT CTT ACC GGC ACT TTA GGG 5475 Asp Thr Thr Phe Asn Ala Lys Glu He Tyr Leu Thr Gly Thr Leu Gly 1805 1810 1815 1820 AGC GGG AAC GCA TGG GGG ACT GGG GGG AGC GCG AGC GTA ACT TTT AAC 5523 Ser Gly Asn Ala Trp Gly Thr Gly Gly Ser Ala Ser Val Thr Phe Asn 1825 1830 1835
AGC CAA ACT TCG CTC ATT CTC AAT CAG GCT AAT ATC GTA AGC TCG CAA 5571 Ser Gin Thr Ser Leu He Leu Asn Gin Ala Asn He Val Ser Ser Gin 1840 1845 1850
ACC GAT GGG ATC TTT AGC ATG CTG GGT CAA GAG GGT ATT AAT AAG GTT 5619 Thr Asp Gly He Phe Ser Met Leu Gly Gin Glu Gly He Asn Lys Val 1855 1860 1865
TTC AAT CAA GCC GGG CTC GCT AAT ATT TTG GGC GAA GTG GCG GTG CAA 5667 Phe Asn Gin Ala Gly Leu Ala Asn He Leu Gly Glu Val Ala Val Gin 1870 1875 1880
TCC ATC AAC AAA GCC GGG GGA TTA GGG AAT TTG ATA GTA AAT ACG CTA 5715 Ser He Asn Lys Ala Gly Gly Leu Gly Asn Leu He Val Asn Thr Leu 1885 1890 1895 1900
GGG AGT AAT AGC GTG ATT GGG GGG TAT TTA ACG CCT GAA CAA AAA AAT 5763 Gly Ser Asn Ser Val He Gly Gly Tyr Leu Thr Pro Glu Gin Lys Asn 1905 1910 1915
CAA ACC CTA AGC CAG CTT TTA GGG CAG AAT AAC TTT GAT AAT CTC ATG 5811 Gin Thr Leu Ser Gin Leu Leu Gly Gin Asn Asn Phe Asp Asn Leu Met 1920 1925 1930
AAC GAT AGC GGT TTG AAT ACG GCG ATT AAG GAT TTG ATC AGA CAA AAA 5859 Asn Asp Ser Gly Leu Asn Thr Ala He Lys Asp Leu He Arg Gin Lys 1935 1940 1945
TTA GGC TTT TGG ACC GGG CTA GTG GGG GGA TTA GCC GGA CTA GGG GGC 5907 Leu Gly Phe Trp Thr Gly Leu Val Gly Gly Leu Ala Gly Leu Gly Gly 1950 1955 1960
ATT GAT TTG CAA AAC CCT GAA AAG CTT ATA GGC AGC ATG TCA ATC AAT 5955 He Asp Leu Gin Asn Pro Glu Lys Leu He Gly Ser Met Ser He Asn 1965 1970 1975 1980
GAT TTA TTG AGT AAA AAA GGG TTG TTC AAT CAG ATC ACC GGC TTT ATT 6003 Asp Leu Leu Ser Lys Lys Gly Leu Phe Asn Gin He Thr Gly Phe He 1985 1990 1995
TCC GCT AAC GAT ATA GGG CAA GTC ATA AGC GTA ATG TTG CAA GAT ATT 6051 Ser Ala Asn Asp He Gly Gin Val He Ser Val Met Leu Gin Asp He 2000 2005 2010
GTC AAA CCG AGC AAC GCT TTA AAA AAC GAT GTA GCG GCT TTA GGC AAG 6099 Val Lys Pro Ser Asn Ala Leu Lys Asn Asp Val Ala Ala Leu Gly Lys 2015 2020 2025
CAA ATG ATT GGC GAA TTT TTA GGC CAA GAC ACG CTC AAT TCT TTA GAA 6147 Gin Met He Gly Glu Phe Leu Gly Gin Asp Thr Leu Asn Ser Leu Glu 2030 2035 2040 AGC TTG TTG CAA AAC CAG CAG ATT AAA AGC GTT TTA GAC AAA GTC CTA 6195 Ser Leu Leu Gin Asn Gin Gin He Lys Ser Val Leu Asp Lys Val Leu 2045 2050 2055 2060
GCG GCT AAA GGT TTA GGG CCT ATT TAT GAA CAA GGC TTG GGG GAT TTG 6243 Ala Ala Lys Gly Leu Gly Pro He Tyr Glu Gin Gly Leu Gly Asp Leu 2065 2070 2075
ATA CCT AAT CTT GGT AAA AAA GGG CTT TTC GCT CCT TAT GGC TTG AGT 6291 He Pro Asn Leu Gly Lys Lys Gly Leu Phe Ala Pro Tyr Gly Leu Ser 2080 2085 2090
CAA GTG TGG CAA AAA GGG GAT TTT AGT TTC AAC GCA CAA GGC AAT GTT 6339 Gin Val Trp Gin Lys Gly Asp Phe Ser Phe Asn Ala Gin Gly Asn Val 2095 2100 2105
TTT GTG CAA AAT TCC ACT TTC TCT AAC GCC AAT GGA GGC ACG CTC TCT 6387 Phe Val Gin Asn Ser Thr Phe Ser Asn Ala Asn Gly Gly Thr Leu Ser 2110 2115 2120
TTT AAC GCA GGA AAT TCG CTC ATT TTT GCC GGA AAC AAT CAT ATT GCA 6435 Phe Asn Ala Gly Asn Ser Leu He Phe Ala Gly Asn Asn His He Ala 2125 2130 2135 2140
TTC ACT AAC CAC GCT GGA ACT CTT CAA TTA TTG TCC GAT CAA GTT TCT 6483 Phe Thr Asn His Ala Gly Thr Leu Gin Leu Leu Ser Asp Gin Val Ser 2145 2150 2155
AAC ATT AAC ATC ACC ACG CTT AAC GCT AGC AAC GGC CTT AAG ATT AAC 6531 Asn He Asn He Thr Thr Leu Asn Ala Ser Asn Gly Leu Lys He Asn 2160 2165 2170
GCC GCT AAT AAC AAT GTT TCT GTG TCT CAA GGC AAT CTG TTT GTC AGC 6579 Ala Ala Asn Asn Asn Val Ser Val Ser Gin Gly Asn Leu Phe Val Ser 2175 2180 2185
GCT AGC TGC GCG CAA CAA AGC GAT CCA ACT ACA GCT AAT ATT GCA AAC 6627 Ala Ser Cys Ala Gin Gin Ser Asp Pro Thr Thr Ala Asn He Ala Asn 2190 2195 2200
CCT TGC GCG CTT AGC GCC CAA AGC ACG AAT GGC GCT TCT TCT AAT AAT 6675 Pro Cys Ala Leu Ser Ala Gin Ser Thr Asn Gly Ala Ser Ser Asn Asn 2205 2210 2215 2220
GCG TCA AAT AAC GCG CCA ATC GCC TTG AGT AAT AAC GAT GAA AGC TTG 6723 Ala Ser Asn Asn Ala Pro He Ala Leu Ser Asn Asn Asp Glu Ser Leu 2225 2230 2235
ATG GTT GCG GCG AAT GAT TTC AAT TTT TCA GGC AAT ATT TAC GCT AAT 6771 Met Val Ala Ala Asn Asp Phe Asn Phe Ser Gly Asn He Tyr Ala Asn 2240 2245 2250
GGG GTG GTT GAT TTT TCA AAG ATT AAA GGC TCT GCA AAC ATT AAA AAC 6819 Gly Val Val Asp Phe Ser Lys He Lys Gly Ser Ala Asn He Lys Asn 2255 2260 2265 CTG TAT CTT TAC AAT AAC GCT CAA TTC CAA GCC AAC AAT CTC ACT ATT 6867 Leu Tyr Leu Tyr Asn Asn Ala Gin Phe Gin Ala Asn Asn Leu Thr He 2270 2275 2280
TCC AAT CAA GCG GTG TTA GAA AAA AAC GCC AGC TTT GTA ACG AAT AAT 6915 Ser Asn Gin Ala Val Leu Glu Lys Asn Ala Ser Phe Val Thr Asn Asn 2285 2290 2295 2300
TTA AAC ATT CAA GGA GCG TTT AAC AAC AAC GCC ACG CAA AAA ATA GAG 6963 Leu Asn He Gin Gly Ala Phe Asn Asn Asn Ala Thr Gin Lys He Glu 2305 2310 2315
GTG CTT CAA AAT TTA GTG ATC GCT TCA AAC GCT TCT TTA AGC ACC GGG 7011 Val Leu Gin Asn Leu Val He Ala Ser Asn Ala Ser Leu Ser Thr Gly 2320 2325 2330
ATT TAT GGG TTA GAA GTA GGG GGG GCT TTG AAT AAT TCT GGA GCG ATC 7059 He Tyr Gly Leu Glu Val Gly Gly Ala Leu Asn Asn Ser Gly Ala He 2335 2340 2345
CAT TTT AAT TTA GAA AAT ACC CAA ACG CCA ACG CCG CTC ATT CAA GCA 7107 His Phe Asn Leu Glu Asn Thr Gin Thr Pro Thr Pro Leu He Gin Ala 2350 2355 2360
GAG GGG ATC ATT AAC CTC AAC ACC ACC CAA ACG CCT TTT ATG AAT GTC 7155 Glu Gly He He Asn Leu Asn Thr Thr Gin Thr Pro Phe Met Asn Val 2365 2370 2375 2380
AAT AAC AGC ATG GCC AAT AAT ACG ACT TAC ACT TTA TTA AAA AGC AGC 7203 Asn Asn Ser Met Ala Asn Asn Thr Thr Tyr Thr Leu Leu Lys Ser Ser 2385 2390 2395
CGT TAC ATT GAT TAC AAT ATC AAC CCC AAC AGC TTG CAA TCG TAT TTG 7251 Arg Tyr He Asp Tyr Asn He Asn Pro Asn Ser Leu Gin Ser Tyr Leu 2400 2405 2410
AAT CTC TAC ACT TTA ATC AAT ATC AAC GGG AAC CAC ATA GAG GAA AAA 7299 Asn Leu Tyr Thr Leu He Asn He Asn Gly Asn His He Glu Glu Lys 2415 2420 2425
AAC GGC GCA TTG ACT TAT TTG GGC CAA CGG GTT TTG TTG CAA GAT AAG 7347 Asn Gly Ala Leu Thr Tyr Leu Gly Gin Arg Val Leu Leu Gin Asp Lys 2430 2435 2440
GGG TTA TTG TTA AGC GTA GCG CTG CCC AAC TCA AAC AAC GCT TCT CAA 7395 Gly Leu Leu Leu Ser Val Ala Leu Pro Asn Ser Asn Asn Ala Ser Gin 2445 2450 2455 2460
AAC AAC ATT TTA AGC CTT TCT GTC CTT TAT AAC CAA GTT AAA ATG TCT 7443 Asn Asn He Leu Ser Leu Ser Val Leu Tyr Asn Gin Val Lys Met Ser 2465 2470 2475
TGC GGC GAT AAA GCG ATG GAT TTT ACC CCC CCT ACC TTA CAA GAT TAC 7491 Cys Gly Asp Lys Ala Met Asp Phe Thr Pro Pro Thr Leu Gin Asp Tyr 2480 2485 2490 ATT GTG GGC ATT CAA GGG CAA AGC GCG CTC AAT CAA ATT GAA GCT GTT 7539 He Val Gly He Gin Gly Gin Ser Ala Leu Asn Gin He Glu Ala Val 2495 2500 2505
GGG GGG AAC GCT ATC AAG TGG CTT TCA ACA TTG ATG ATG GAG ACT AAA 7587 Gly Gly Asn Ala He Lys Trp Leu Ser Thr Leu Met Met Glu Thr Lys 2510 2515 2520
GAA AAC CCG TTT TTT GCG CCG ATT TAT TTA AAA AAC CAC TCT TTG AAT 7635 Glu Asn Pro Phe Phe Ala Pro He Tyr Leu Lys Asn His Ser Leu Asn 2525 2530 2535 2540
GAA ATC TTA GGC GTA ACA AAA GAT CTT CAA AAC ACC GCA AGC TTG ATT 7683 Glu He Leu Gly Val Thr Lys Asp Leu Gin Asn Thr Ala Ser Leu He 2545 2550 2555
TCT AAC CCT AAT TTT AGA GAT AAC GCT ACC AAT CTT TTA GAA TTG GCG 7731 Ser Asn Pro Asn Phe Arg Asp Asn Ala Thr Asn Leu Leu Glu Leu Ala 2560 2565 2570
AGT TAC ACC CAA CAA ACC AGC CGT TTA ACA AAA CTC TCT GAT TTT AGA 7779 Ser Tyr Thr Gin Gin Thr Ser Arg Leu Thr Lys Leu Ser Asp Phe Arg 2575 2580 2585
TCT AGA GAG GGA GAG TCT GAT TTT TCT TTG TTA GAG CTT AAA AAC AAG 7827 Ser Arg Glu Gly Glu Ser Asp Phe Ser Leu Leu Glu Leu Lys Asn Lys 2590 2595 2600
CGT TTT AGC GAT CCT AAT CCA GAG GTT TTT GTC AAA TAC TCT CAA CTT 7875 Arg Phe Ser Asp Pro Asn Pro Glu Val Phe Val Lys Tyr Ser Gin Leu 2605 2610 2615 2620
AGC AAA CAC CCA AAT AAC CTT TGG GTT CAA GGG GTG GGA GGA GCG AGC 7923 Ser Lys His Pro Asn Asn Leu Trp Val Gin Gly Val Gly Gly Ala Ser 2625 2630 2635
TTT ATT TCT GGG GGC AAT GGC ACG CTT TAT GGC TTG AAT GCG GGC TAT 7971 Phe He Ser Gly Gly Asn Gly Thr Leu Tyr Gly Leu Asn Ala Gly Tyr 2640 2645 2650
GAC AGG TTG GTT AAA AAT GTG ATC CTT GGG GGT TAT GTG GCT TAT GGC 8019 Asp Arg Leu Val Lys Asn Val He Leu Gly Gly Tyr Val Ala Tyr Gly 2655 2660 2665
TAT AGC GAC TTT AAT GGG AAC ATC ATG CAT TCT TTG GGT AAT AAT GTG 8067 Tyr Ser Asp Phe Asn Gly Asn He Met His Ser Leu Gly Asn Asn Val 2670 2675 2680
GAT GTG GGG ATG TAT GCG AGG GCT TTT TTA AAA AGG AAC GAA TTC ACT 8115 Asp Val Gly Met Tyr Ala Arg Ala Phe Leu Lys Arg Asn Glu Phe Thr 2685 2690 2695 2700
TTG AGC GCG AAT GAA ACT TAT GGA GGC AAT GCA ACT AGT ATC AAT TCT 8163 Leu Ser Ala Asn Glu Thr Tyr Gly Gly Asn Ala Thr Ser He Asn Ser 2705 2710 2715 TCT AAT TCT TTG CTC TCT GTG TTG AAC CAA CGC TAC AAC TAC AAC ACC 8211 Ser Asn Ser Leu Leu Ser Val Leu Asn Gin Arg Tyr Asn Tyr Asn Thr 2720 2725 2730
TGG ACA ACG AGC GTG AAC GGG AAT TAC GGC TAT GAT TTC ATG TTC AAA 8259 Trp Thr Thr Ser Val Asn Gly Asn Tyr Gly Tyr Asp Phe Met Phe Lys 2735 2740 2745
CAA AAA AGC GTG GTG CTA AAA CCT CAA GTG GGT TTG AGC TAT CAT TTC 8307 Gin Lys Ser Val Val Leu Lys Pro Gin Val Gly Leu Ser Tyr His Phe 2750 2755 2760
ATA GGT CTA AGT GGG ATG AAA GGC AAT GAT GCC GCT TAC AAA CAA TTC 8355 He Gly Leu Ser Gly Met Lys Gly Asn Asp Ala Ala Tyr Lys Gin Phe 2765 2770 2775 2780
CTC ATG CAT TCA AAC CCC TCT AAC GAA TCG GTT TTA ACG CTC AAC ATG 8403 Leu Met His Ser Asn Pro Ser Asn Glu Ser Val Leu Thr Leu Asn Met 2785 2790 2795
GGG TTG GAG AGC CGT AAA TAT TTT GGT AAA AAT TCC TAT TAT TTT GTA 8451 Gly Leu Glu Ser Arg Lys Tyr Phe Gly Lys Asn Ser Tyr Tyr Phe Val 2800 2805 2810
ACG GCG AGA CTA GGT AGG GAT CTT TTG ATC AAA TCT AAA GGC AGC AAT 8499 Thr Ala Arg Leu Gly Arg Asp Leu Leu He Lys Ser Lys Gly Ser Asn 2815 2820 2825
ACG GTG CGT TTT GTG GGC GAA AAC ACT TTA TTG TAT CGC AAG GGG GAA 8547 Thr Val Arg Phe Val Gly Glu Asn Thr Leu Leu Tyr Arg Lys Gly Glu 2830 2835 2840
GTT TTT AAC ACT TTT GCG AGC GTG ATT ACA GGG GGC GAA ATG CAT TTG 8595 Val Phe Asn Thr Phe Ala Ser Val He Thr Gly Gly Glu Met His Leu 2845 2850 2855 2860
TGG CGT TTG GTG TAT GTG AAT GCG GGG GTG GGG CTT AAG ATG GGC TTG 8643 Trp Arg Leu Val Tyr Val Asn Ala Gly Val Gly Leu Lys Met Gly Leu 2865 2870 2875
CAA TAC CAA GAT ATT AAT ATA ACC GGG AAT GTG GGC ATG CGA GTG GCG 8691 Gin Tyr Gin Asp He Asn He Thr Gly Asn Val Gly Met Arg Val Ala 2880 2885 2890
TTT TAGCTTTTTT GCTATAATGC TTCGTTCAAA TTTTATGGTT AGGTTTTTCT ATGT 874£ Phe
(2) INFORMATION FOR SEQ ID NO: 1256:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2893 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1256:
Met Lys Lys Phe Lys Lys Lys Pro Lys Ser He Lys Arg Ser His Gin
1 5 10 15
Asn Gin Lys Thr He Leu Lys Arg Pro Leu Trp Leu Met Pro Leu Leu
20 25 30
He Ser Gly Phe Ala Ser Gly Val Tyr Ala Asn Asn Leu Trp Asp Leu
35 40 45
Leu Asn Pro Lys Val Gly Gly Glu Tyr Val His Trp Val Lys Gly Ser
50 55 60
Gin Tyr Cys Ala Trp Trp Glu Phe Ala Gly Cys Leu Lys Asn Val Trp 65 70 75 80
Gly Ala Asn His Lys Gly Tyr Asp Ala Gly Asn Ala Ala Asn Tyr Leu
85 90 95
Ser Ser Gin Asn Tyr Gin Ala He Ser Val Gly Ser Gly Asn Glu Thr
100 105 110
Gly Thr Tyr Ser Leu Ser Gly Phe Thr Asn Tyr Val Gly Gly Asn Leu
115 120 125
Thr He Asn Leu Gly Asn Ser Val Val Leu Asp Leu Ser Gly Ser Asn
130 135 140
Ser Phe Thr Ser Tyr Gin Gly Tyr Asn Gin Gly Lys Asp Asp Val Thr 145 150 155 160
Phe Thr Val Gly Ala He Asn Leu Asn Gly Thr Leu Glu Val Gly Asn
165 170 175
Arg Val Gly Ser Gly Ala Gly Thr His Thr Gly Thr Ala Thr Leu Asn
180 185 190
Leu Asn Ala Asn Lys Val Asn He Asn Ser Asn He Asn Ala Tyr Lys
195 200 205
Thr Ser Gin Val Asn He Gly Asn Ala Asn Ser Val He Thr He Gly
210 215 220
Ser Val Ser Leu Ser Gly Asp Val Cys Ser Ser Leu Ala Ser Val Gly 225 230 235 240
He Gly Ala Asn Cys Ser Thr Ser Gly Pro Ser Tyr Ser Phe Lys Gly
245 250 255
Thr Thr Asn Ala Thr Asn Thr Ala Phe Ser Asn Ala Ser Gly Ser Phe
260 265 270
Thr Phe Glu Glu Asn Ala Thr Phe Ser Gly Ala Lys Trp Asn Gly Gly
275 280 285
Thr Tyr Thr Phe Asn Lys Glu Phe Ser Ala Thr Asn Asn Thr Ala Phe
290 295 300
Ser Ser Gly Ser Phe Asn Phe Lys Gly Val Ser Ser Phe Asn Gly Thr 305 310 315 320
Ser Phe Ser Asn Ala Ser Tyr Thr Phe Asp Asn Gin Ala Thr Phe Gin
325 330 335
Asn Ser Ser Phe Asn Gly Gly Thr Phe Thr Phe Asn Asn Gin Thr Asn
340 345 350
Pro Thr Asn Asn Ala Gin His Pro Gin He Gin Asn Ser Ser Phe Ser
355 360 365
Gly Asn Ala Thr Thr Leu Lys Gly Phe Val Asn Phe Gin Gin Ala Phe
370 375 380
Asn Asn Ser Asn His Gin Leu Thr He Gin Asn Ala Ser Phe Asn Asn 385 390 395 400
Ala Thr Phe Asn Asn Thr Gly Lys He Thr He Glu Lys Asp Ala Ser
405 410 415
Phe Asn Asn Thr Thr Phe Asn Thr Ser Val Asp Thr Asn Asn Met Ser
420 425 430
Val Thr Gly Gly Val Thr Leu Ser Gly Lys Asn Asp Leu Lys Asn Gly
435 440 445
Ser Thr Leu Asp Phe Gly Ser Ser Lys He Thr Leu Ala Gin Gly Thr
450 455 460
Thr Phe Asn Leu Thr Ser Leu Gly Ser Glu Lys Ser Val Thr He Leu 465 470 475 480
Asn Ser Ser Gly Gly He Thr Tyr Ser Asn Leu Leu Asn His Ala He
485 490 495
Asn Gly Leu Thr Ser Ala Leu Lys Thr Asn Glu Ser Leu Ser Asn Pro
500 505 510
Gin Ser Phe Ala Gin Gly Leu Trp Asp He He Thr Tyr Asn Gly Val
515 520 525
Thr Gly Gin Leu Leu Asn Glu Asn Ala Ala Thr Ser Lys Pro Thr Asp
530 535 540
Ser Ser Pro Ser Lys Ser Ser Thr Asn Ser Thr Gin Val Tyr Gin Val 545 550 555 560
Gly Tyr Lys He Gly Asp Thr He Tyr Lys Leu Gin Glu Thr Phe Ser
565 570 575
His Asn Ser He He He Gin Ala Leu Glu Ser Gly Thr Tyr Thr Pro
580 585 590
Pro Pro Val He Asn Gly Ser Lys Phe Asp Leu Ser Ala Ser Asn Tyr
595 600 605
He Asn Ala Asp Met Pro Trp Tyr Asp His Lys Tyr Tyr He Pro Lys
610 615 620
Ser Gin Asn Phe Thr Glu Ser Gly Thr Tyr Tyr Leu Pro Ser Val Gin 625 630 635 640
He Trp Gly Ser Tyr Thr Asn Ser Phe Lys Gin Thr Phe Ser Ala Asn
645 650 655
Gly Ser Asn Leu Val He Gly Tyr Asn Ser Thr Trp Thr Asp His Asn
660 665 670
Val Ser Ser Ser Gly Thr Val Ser Phe Gly Asp Thr Ser Gly Ser Ala
675 680 685
Leu Asn Gly His Cys Gly Pro Trp Pro Tyr Tyr Gin Cys Thr Gly Thr
690 695 700
Thr Asn Gly Thr Tyr Ser Ala Tyr His Val Tyr He Thr Ala Asn Leu 705 710 715 720
Arg Ser Gly Asn Arg He Gly Thr Gly Gly Ala Ala Asn Leu He Phe
725 730 735
Asn Gly Val Asp Ser He Asn He Ala Asn Ala Thr He Thr Gin His
740 745 750
Asn Ala Gly He Tyr Ser Ser Ser Met Thr Phe Ser Thr Gin Ser Met
755 760 765
Asp Asn Ser Gin Asn Leu Asn Gly Leu Asn Ser Asn Gly Lys Leu Ser
770 775 780
Val Tyr Gly Thr Thr Phe Thr Asn Glu Ala Lys Asp Gly Lys Phe He 785 790 795 800
Phe Asn Ala Gly Gin Ala Val Phe Glu Asn Thr Asn Phe Asn Gly Gly
805 810 815
Ser Tyr Gin Phe Ser Gly Asp Ser Leu Asn Phe Ser Asn Asn Asn Gin 820 825 830 Phe Asn Ser Gly Ser Phe Glu He Ser Ala Lys Asn Ala Ser Phe Asn
835 840 845
Asn Ala Asn Phe Asn Asn Ser Ala Ser Phe Asn Phe Asn Asn Ser Asn
850 855 860
Ala Thr Thr Ser Phe Val Gly Asp Phe Thr Asn Ala Asn Ser Asn Leu 865 870 875 880
Gin He Ala Gly Asn Ala Val Phe Gly Asn Ser Thr Asn Gly Ser Gin
885 890 895
Asn Thr Ala Asn Phe Asn Asn Thr Gly Ser Val Asn He Ser Gly Asn
900 905 910
Ala Thr Phe Asp Asn Val Val Phe Asn Gly Pro Thr Asn Thr Ser Val
915 920 925
Lys Gly Gin Val Thr Leu Asn Asn He Thr Leu Lys Asn Leu Asn Ala
930 935 940
Pro Leu Ser Phe Gly Asp Gly Thr He Thr Phe Asn Ala His Ser Val 945 950 955 960
He Asn He Ala Glu Ser He Thr Asn Gly Asn Pro He Thr Leu Val
965 970 975
Ser Ser Ser Lys Glu He Glu Tyr Asn Asn Ala Phe Ser Lys Asn Leu
980 985 990
Trp Gin Leu He Asn Tyr Gin Gly His Gly Ala Ser Ser Glu Lys Leu
995 1000 1005
Val Ser Ser Ala Gly Asn Gly Val Tyr Asp Val Val Tyr Ser Phe Asn
1010 1015 1020
Asn Gin Thr Tyr Asn Phe Gin Glu Val Phe Ser Gin Asn Ser He Ser 025 1030 1035 1040
He Arg Arg Leu Gly Val Asn Met Val Phe Asp Tyr Val Asp Met Glu
1045 1050 1055
Lys Ser Asp His Leu Tyr Tyr Gin Asn Ala Leu Gly Phe Met Thr Tyr
1060 1065 1070
Met Pro Asn Ser Tyr Asn Asn Asn Leu Gly Asn Ala Asn Asn Thr He
1075 1080 1085
Tyr Tyr Tyr Asp Lys Ser He Asp Phe Tyr Ala Ser Gly Lys Thr Leu
1090 1095 1100
Phe Thr Lys Ala Glu Phe Ser Gin Thr Phe Thr Gly Gin Asn Ser Ala 105 1110 1115 1120
He Val Phe Gly Ala Lys Ser He Trp Thr Ser Leu Ser Asp Ala Pro
1125 1130 1135
Gin Ser Asn Thr He He Arg Phe Gly Asp Asn Lys Gly Ala Gly Ser
1140 1145 1150
Asn Asp Ala Ser Gly His Cys Trp Asn Leu Gin Cys He Gly Phe He
1155 1160 1165
Thr Gly His Tyr Glu Ala Gin Lys He Tyr He Thr Gly Ser He Glu
1170 1175 1180
Ser Gly Asn Arg He Ser Ser Gly Gly Gly Ala Ser Leu Asn Phe Asn 185 1190 1195 1200
Gly Leu Gin Gly He Leu Leu Thr Asn Ala Thr Leu Tyr Asn Arg Ala
1205 1210 1215
Ala Gly Thr Gin Ser Ser Ser Met Asn Phe He Ser Asn Ser Ala Asn
1220 1225 1230
He Gin Ala Gin Asn Ser Tyr Phe He Asp Asp Thr Ala Gin Asn Gly
1235 1240 1245
Gly Asn Pro Asn Phe Ser Phe Asn Ala Leu Asn Leu Asp Phe Ser Asn
1250 1255 1260
Ser Ser Phe Arg Gly Tyr Val Gly Lys Thr Gin Ser Val Phe Lys Phe 265 1270 1275 1280
Asn Ala Lys Asn Ala He Ser Phe Thr Asn Ser Thr Asn Leu Ser Ser
1285 1290 1295
Gly Leu Tyr Gin Met Gin Ala Lys Ser Val Leu Phe Asp Asn Ser Asn
1300 1305 1310
Leu Ser Val Ser Val Gly Thr Ser Ser He Lys Ala Asn Ala He Asn
1315 1320 1325
Leu Ser Gin Asn Ala Ser He Asn Ala Ser Asn His Ser Thr Leu Glu
1330 1335 1340
Leu Gin Gly Asp Leu Asn Val Asn Asp Thr Ser Ser Leu Asn Leu Asn 345 1350 1355 1360
Gin Ser Thr He Asn Val Ser Asn Asn Ala Thr He Asn Asp Tyr Ala
1365 1370 1375
Ser Leu He Ala Ser Asn Gly Ser His Leu Asn Phe Asn Gly Ala Val
1380 1385 1390
Asn Phe Asn Ser Ala Asn He Thr Thr Ser Leu Asn Asn Ser Ser He
1395 1400 1405
Val Phe Lys Gly Ala Val Ser Leu Gly Gly Gin Phe Asn Leu Ser Asn
1410 1415 1420
Asn Ser Ser Leu Asp Phe Gin Gly Ser Ser Ala He Thr Ser Asn Thr 425 1430 1435 1440
Ala Phe Asn Phe Tyr Asp Asn Ala Phe Ser Gin Ser Pro He Thr Phe
1445 1450 1455
His Gin Ala Leu Asp He Lys Ala Pro Leu Ser Leu Gly Gly Asn Leu
1460 1465 1470
Leu Asn Pro Asn Asn Ser Ser Val Leu Asp Leu Lys Asn Ser Gin Leu
1475 1480 1485
Val Phe Gly Asp Gin Gly Ser Leu Asn He Ala Asn He Asp Leu Leu
1490 1495 1500
Ser Asp Leu Asn Asp Asn Lys Asn Arg Val Tyr Asn He He Gin Ala 505 1510 1515 1520
Asp Met Asn Ser Asn Trp Tyr Glu Arg He Ser Phe Phe Gly Met His
1525 1530 1535
He Asn Asp Gly He Tyr Asp Ala Lys Asn Gin Thr Tyr Ser Phe Thr
1540 1545 1550
Asn Pro Leu Asn Asn Ala Leu Lys He Thr Glu Ser Phe Lys Asp Asn
1555 1560 1565
Gin Leu Ser Val Thr Leu Ser Gin He Pro Gly He Lys Asn Thr Leu
1570 1575 1580
Tyr Asn He Gly Ser Glu He Phe Asn Tyr Gin Lys Val Tyr Asn Asn 585 1590 1595 1600
Ala Asn Gly Val Tyr Ser Tyr Ser Asp Asp Ala Gin Gly Val Phe Tyr
1605 1610 1615
Leu Thr Ser Asn Val Lys Gly Tyr Tyr Asn Pro Asn Gin Ser Tyr Gin
1620 1625 1630
Ala Ser Gly Ser Asn Asn Thr Thr Lys Asn Asn Asn Leu Thr Ser Glu
1635 1640 1645
Ser Ser He He Ser Gin Thr Tyr Asn Ala Gin Gly Asn Pro He Ser
1650 1655 1660
Ala Leu His He Tyr Asn Lys Gly Tyr Asn Phe Asn Asn He Lys Ala 665 1670 1675 1680
Leu Gly Gin Met Ala Leu Lys Leu Tyr Pro Glu He Lys Lys Val Leu
1685 1690 1695
Gly Asn Asp Phe Ser Pro Ser Ser Leu Asn Ala Leu Asn Ser Asn Ala 1700 1705 1710 Leu Asn Gin Leu Thr Lys Leu He Thr Pro Asn Asp Trp Lys Asn He
1715 1720 1725
Asn Glu Leu He Asp Asn Ala Asn Asn Ser Val Val Gin Asn Phe Asn
1730 1735 1740
Asn Gly Thr Leu He Val Gly Ala Thr Gin He Gly Gin Thr Asp Thr 745 1750 1755 1760
Asn Ser Ala Val Val Phe Gly Gly Leu Gly Tyr Gin Thr Pro Cys Asp
1765 1770 1775
Tyr Thr Asp He Val Cys Gin Lys Phe Arg Gly Thr Tyr Leu Gly Gin
1780 1785 1790
Leu Leu Glu Ser Ser Ser Ala Asp Leu Gly Tyr He Asp Thr Thr Phe
1795 1800 1805
Asn Ala Lys Glu He Tyr Leu Thr Gly Thr Leu Gly Ser Gly Asn Ala
1810 1815 1820
Trp Gly Thr Gly Gly Ser Ala Ser Val Thr Phe Asn Ser Gin Thr Ser 825 1830 1835 1840
Leu He Leu Asn Gin Ala Asn He Val Ser Ser Gin Thr Asp Gly He
1845 1850 1855
Phe Ser Met Leu Gly Gin Glu Gly He Asn Lys Val Phe Asn Gin Ala
1860 1865 1870
Gly Leu Ala Asn He Leu Gly Glu Val Ala Val Gin Ser He Asn Lys
1875 1880 1885
Ala Gly Gly Leu Gly Asn Leu He Val Asn Thr Leu Gly Ser Asn Ser
1890 1895 1900
Val He Gly Gly Tyr Leu Thr Pro Glu Gin Lys Asn Gin Thr Leu Ser 905 1910 1915 1920
Gin Leu Leu Gly Gin Asn Asn Phe Asp Asn Leu Met Asn Asp Ser Gly
1925 1930 1935
Leu Asn Thr Ala He Lys Asp Leu He Arg Gin Lys Leu Gly Phe Trp
1940 1945 1950
Thr Gly Leu Val Gly Gly Leu Ala Gly Leu Gly Gly He Asp Leu Gin
1955 1960 1965
Asn Pro Glu Lys Leu He Gly Ser Met Ser He Asn Asp Leu Leu Ser
1970 1975 1980
Lys Lys Gly Leu Phe Asn Gin He Thr Gly Phe He Ser Ala Asn Asp 985 1990 1995 2000
He Gly Gin Val He Ser Val Met Leu Gin Asp He Val Lys Pro Ser
2005 2010 2015
Asn Ala Leu Lys Asn Asp Val Ala Ala Leu Gly Lys Gin Met He Gly
2020 2025 2030
Glu Phe Leu Gly Gin Asp Thr Leu Asn Ser Leu Glu Ser Leu Leu Gin
2035 2040 2045
Asn Gin Gin He Lys Ser Val Leu Asp Lys Val Leu Ala Ala Lys Gly
2050 2055 2060
Leu Gly Pro He Tyr Glu Gin Gly Leu Gly Asp Leu He Pro Asn Leu 065 2070 2075 2080
Gly Lys Lys Gly Leu Phe Ala Pro Tyr Gly Leu Ser Gin Val Trp Gin
2085 2090 2095
Lys Gly Asp Phe Ser Phe Asn Ala Gin Gly Asn Val Phe Val Gin Asn
2100 2105 2110
Ser Thr Phe Ser Asn Ala Asn Gly Gly Thr Leu Ser Phe Asn Ala Gly
2115 2120 2125
Asn Ser Leu He Phe Ala Gly Asn Asn His He Ala Phe Thr Asn His
2130 2135 2140
Ala Gly Thr Leu Gin Leu Leu Ser Asp Gin Val Ser Asn He Asn He 145 2150 2155 2160
Thr Thr Leu Asn Ala Ser Asn Gly Leu Lys He Asn Ala Ala Asn Asn
2165 2170 2175
Asn Val Ser Val Ser Gin Gly Asn Leu Phe Val Ser Ala Ser Cys Ala
2180 2185 2190
Gin Gin Ser Asp Pro Thr Thr Ala Asn He Ala Asn Pro Cys Ala Leu
2195 2200 2205
Ser Ala Gin Ser Thr Asn Gly Ala Ser Ser Asn Asn Ala Ser Asn Asn
2210 2215 2220
Ala Pro He Ala Leu Ser Asn Asn Asp Glu Ser Leu Met Val Ala Ala 225 2230 2235 2240
Asn Asp Phe Asn Phe Ser Gly Asn He Tyr Ala Asn Gly Val Val Asp
2245 2250 2255
Phe Ser Lys He Lys Gly Ser Ala Asn He Lys Asn Leu Tyr Leu Tyr
2260 2265 2270
Asn Asn Ala Gin Phe Gin Ala Asn Asn Leu Thr He Ser Asn Gin Ala
2275 2280 2285
Val Leu Glu Lys Asn Ala Ser Phe Val Thr Asn Asn Leu Asn He Gin
2290 2295 2300
Gly Ala Phe Asn Asn Asn Ala Thr Gin Lys He Glu Val Leu Gin Asn 305 2310 2315 2320
Leu Val He Ala Ser Asn Ala Ser Leu Ser Thr Gly He Tyr Gly Leu
2325 2330 2335
Glu Val Gly Gly Ala Leu Asn Asn Ser Gly Ala He His Phe Asn Leu
2340 2345 2350
Glu Asn Thr Gin Thr Pro Thr Pro Leu He Gin Ala Glu Gly He He
2355 2360 2365
Asn Leu Asn Thr Thr Gin Thr Pro Phe Met Asn Val Asn Asn Ser Met
2370 2375 2380
Ala Asn Asn Thr Thr Tyr Thr Leu Leu Lys Ser Ser Arg Tyr He Asp 385 2390 2395 2400
Tyr Asn He Asn Pro Asn Ser Leu Gin Ser Tyr Leu Asn Leu Tyr Thr
2405 2410 2415
Leu He Asn He Asn Gly Asn His He Glu Glu Lys Asn Gly Ala Leu
2420 2425 2430
Thr Tyr Leu Gly Gin Arg Val Leu Leu Gin Asp Lys Gly Leu Leu Leu
2435 2440 2445
Ser Val Ala Leu Pro Asn Ser Asn Asn Ala Ser Gin Asn Asn He Leu
2450 2455 2460
Ser Leu Ser Val Leu Tyr Asn Gin Val Lys Met Ser Cys Gly Asp Lys 465 2470 2475 2480
Ala Met Asp Phe Thr Pro Pro Thr Leu Gin Asp Tyr He Val Gly He
2485 2490 2495
Gin Gly Gin Ser Ala Leu Asn Gin He Glu Ala Val Gly Gly Asn Ala
2500 2505 2510
He Lys Trp Leu Ser Thr Leu Met Met Glu Thr Lys Glu Asn Pro Phe
2515 2520 2525
Phe Ala Pro He Tyr Leu Lys Asn His Ser Leu Asn Glu He Leu Gly
2530 2535 2540
Val Thr Lys Asp Leu Gin Asn Thr Ala Ser Leu He Ser Asn Pro Asn 545 2550 2555 2560
Phe Arg Asp Asn Ala Thr Asn Leu Leu Glu Leu Ala Ser Tyr Thr Gin
2565 2570 2575
Gin Thr Ser Arg Leu Thr Lys Leu Ser Asp Phe Arg Ser Arg Glu Gly 2580 2585 2590 Glu Ser Asp Phe Ser Leu Leu Glu Leu Lys Asn Lys Arg Phe Ser Asp
2595 2600 2605
Pro Asn Pro Glu Val Phe Val Lys Tyr Ser Gin Leu Ser Lys His Pro
2610 2615 2620
Asn Asn Leu Trp Val Gin Gly Val Gly Gly Ala Ser Phe He Ser Gly 625 2630 2635 2640
Gly Asn Gly Thr Leu Tyr Gly Leu Asn Ala Gly Tyr Asp Arg Leu Val
2645 2650 2655
Lys Asn Val He Leu Gly Gly Tyr Val Ala Tyr Gly Tyr Ser Asp Phe
2660 2665 2670
Asn Gly Asn He Met His Ser Leu Gly Asn Asn Val Asp Val Gly Met
2675 2680 2685
Tyr Ala Arg Ala Phe Leu Lys Arg Asn Glu Phe Thr Leu Ser Ala Asn
2690 2695 2700
Glu Thr Tyr Gly Gly Asn Ala Thr Ser He Asn Ser Ser Asn Ser Leu 705 2710 2715 2720
Leu Ser Val Leu Asn Gin Arg Tyr Asn Tyr Asn Thr Trp Thr Thr Ser
2725 2730 2735
Val Asn Gly Asn Tyr Gly Tyr Asp Phe Met Phe Lys Gin Lys Ser Val
2740 2745 2750
Val Leu Lys Pro Gin Val Gly Leu Ser Tyr His Phe He Gly Leu Ser
2755 2760 2765
Gly Met Lys Gly Asn Asp Ala Ala Tyr Lys Gin Phe Leu Met His Ser
2770 2775 2780
Asn Pro Ser Asn Glu Ser Val Leu Thr Leu Asn Met Gly Leu Glu Ser 785 2790 2795 2800
Arg Lys Tyr Phe Gly Lys Asn Ser Tyr Tyr Phe Val Thr Ala Arg Leu
2805 2810 2815
Gly Arg Asp Leu Leu He Lys Ser Lys Gly Ser Asn Thr Val Arg Phe
2820 2825 2830
Val Gly Glu Asn Thr Leu Leu Tyr Arg Lys Gly Glu Val Phe Asn Thr
2835 2840 2845
Phe Ala Ser Val He Thr Gly Gly Glu Met His Leu Trp Arg Leu Val
2850 2855 2860
Tyr Val Asn Ala Gly Val Gly Leu Lys Met Gly Leu Gin Tyr Gin Asp 865 2870 2875 2880
He Asn He Thr Gly Asn Val Gly Met Arg Val Ala Phe 2885 2890
(2) INFORMATION FOR SEQ ID NO: 1257:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1075 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 32...1048 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1257:
GGAACTCTCA TCAAAAAACA AGGAACATAA T ATG AGA CAT GGA GAT ATT AGT 52
Met Arg His Gly Asp He Ser 1 5
AGC AGC CCA GAT ACT GTG GGT GTA GCG GTA GTT AAT TAT AAG ATG CCT 100 Ser Ser Pro Asp Thr Val Gly Val Ala Val Val Asn Tyr Lys Met Pro 10 15 20
AGA CTC CAC ACT AAG AAT GAG GTG TTG GAA AAT TGT CGC AAT ATC GCT 148 Arg Leu His Thr Lys Asn Glu Val Leu Glu Asn Cys Arg Asn He Ala 25 30 35
AAG GTG ATT GGT GGG GTC AAA CAG GGT TTG CCT GGG TTG GAT CTG ATT 196 Lys Val He Gly Gly Val Lys Gin Gly Leu Pro Gly Leu Asp Leu He 40 45 50 55
ATT TTC CCT GAA TAC AGC ACG CAT GGG ATT ATG TAT GAC AGA CAA GAA 244 He Phe Pro Glu Tyr Ser Thr His Gly He Met Tyr Asp Arg Gin Glu 60 65 70
ATG TTT GAT ACA GCC GCA AGC GTT CCT GGA GAA GAA ACC GCG ATC TTT 292 Met Phe Asp Thr Ala Ala Ser Val Pro Gly Glu Glu Thr Ala He Phe 75 80 85
GCT GAA GCT TGT AAG AAA AAC AAG GTT TGG GGA GTG TTC TCT TTG ACA 340 Ala Glu Ala Cys Lys Lys Asn Lys Val Trp Gly Val Phe Ser Leu Thr 90 95 100
GGG GAA AAA CAC GAG CAA GCC AAA AAG AAT CCC TAT AAC ACT TTG ATT 388 Gly Glu Lys His Glu Gin Ala Lys Lys Asn Pro Tyr Asn Thr Leu He 105 110 115
CTT GTC AAT GAT AAG GGT GAG ATC GTG CAA AAA TAC CGC AAA ATC TTG 436 Leu Val Asn Asp Lys Gly Glu He Val Gin Lys Tyr Arg Lys He Leu 120 125 130 135
CCT TGG TGC CCT ATT GAA TGT TGG TAT CCT GGG GAT AAA ACT TAT GTG 484 Pro Trp Cys Pro He Glu Cys Trp Tyr Pro Gly Asp Lys Thr Tyr Val 140 145 150
GTT GAT GGG CCT AAG GGC TTG AAA GTT TCT TTG ATT ATT TGC GAT GAT 532 Val Asp Gly Pro Lys Gly Leu Lys Val Ser Leu He He Cys Asp Asp 155 160 165
GGA AAC TAC CCT GAA ATT TGG CGC GAT TGC GCG ATG CGT GGG GCA GAA 580 Gly Asn Tyr Pro Glu He Trp Arg Asp Cys Ala Met Arg Gly Ala Glu 170 175 180
CTC ATT GTG CGC TGT CAA GGT TAC ATG TAT CCG GCT AAG GAG CAA CAA 628 Leu He Val Arg Cys Gin Gly Tyr Met Tyr Pro Ala Lys Glu Gin Gin 185 190 195
ATT GCA ATA GTA AAA GCT ATG GCG TGG GCC AAT CAA TGT TAT GTA GCG 676 He Ala He Val Lys Ala Met Ala Trp Ala Asn Gin Cys Tyr Val Ala 200 205 210 215
GTA GCG AAT GCG ACC GGT TTT GAT GGG GTG TAT TCC TAT TTT GGG CAT 724 Val Ala Asn Ala Thr Gly Phe Asp Gly Val Tyr Ser Tyr Phe Gly His 220 225 230
TCT AGC ATT ATT GGT TTT GAC GGG CAT ACT TTG GGC GAA TGC GGG GAA 772 Ser Ser He He Gly Phe Asp Gly His Thr Leu Gly Glu Cys Gly Glu 235 240 245
GAA GAA AAT GGT CTT CAA TAC GCT CAA CTT TCT GTG CAA CAA ATC CGT 820 Glu Glu Asn Gly Leu Gin Tyr Ala Gin Leu Ser Val Gin Gin He Arg 250 255 260
GAT GCG AGA AAA TAC GAC CAA AGC CAA AAC CAA CTC TTC AAA CTC TTG 868 Asp Ala Arg Lys Tyr Asp Gin Ser Gin Asn Gin Leu Phe Lys Leu Leu 265 270 275
CAC AGA GGT TAT AGT GGG GTT TTT GCT AGT GGC GAT GGG GAT AAG GGT 916 His Arg Gly Tyr Ser Gly Val Phe Ala Ser Gly Asp Gly Asp Lys Gly 280 285 290 295
GTG GCG GAA TGC CCT TTT GAG TTC TAT AAA ACT TGG GTG AAT GAC CCC 964 Val Ala Glu Cys Pro Phe Glu Phe Tyr Lys Thr Trp Val Asn Asp Pro 300 305 310
AAA AAA GCT CAA GAA AAT GTA GAA AAA ATC ACC CGC CCA AGC GTG GGT 1012 Lys Lys Ala Gin Glu Asn Val Glu Lys He Thr Arg Pro Ser Val Gly 315 320 325
GTG GCC GCT TGT CCT GTG GGC GAT TTG CCC ACG AAA TAAAGGGCAA AAGGAG 1064 Val Ala Ala Cys Pro Val Gly Asp Leu Pro Thr Lys 330 335
GAGGGGGGGG G 1075
(2) INFORMATION FOR SEQ ID NO: 1258:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 339 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1258:
Met Arg His Gly Asp He Ser Ser Ser Pro Asp Thr Val Gly Val Ala
1 5 10 15
Val Val Asn Tyr Lys Met Pro Arg Leu His Thr Lys Asn Glu Val Leu
20 25 30
Glu Asn Cys Arg Asn He Ala Lys Val He Gly Gly Val Lys Gin Gly 35 40 45 Leu Pro Gly Leu Asp Leu He He Phe Pro Glu Tyr Ser Thr His Gly
50 55 60
He Met Tyr Asp Arg Gin Glu Met Phe Asp Thr Ala Ala Ser Val Pro 65 70 75 80
Gly Glu Glu Thr Ala He Phe Ala Glu Ala Cys Lys Lys Asn Lys Val
85 90 95
Trp Gly Val Phe Ser Leu Thr Gly Glu Lys His Glu Gin Ala Lys Lys
100 105 110
Asn Pro Tyr Asn Thr Leu He Leu Val Asn Asp Lys Gly Glu He Val
115 120 125
Gin Lys Tyr Arg Lys He Leu Pro Trp Cys Pro He Glu Cys Trp Tyr
130 135 140
Pro Gly Asp Lys Thr Tyr Val Val Asp Gly Pro Lys Gly Leu Lys Val 145 150 155 160
Ser Leu He He Cys Asp Asp Gly Asn Tyr Pro Glu He Trp Arg Asp
165 170 175
Cys Ala Met Arg Gly Ala Glu Leu He Val Arg Cys Gin Gly Tyr Met
180 185 190
Tyr Pro Ala Lys Glu Gin Gin He Ala He Val Lys Ala Met Ala Trp
195 200 205
Ala Asn Gin Cys Tyr Val Ala Val Ala Asn Ala Thr Gly Phe Asp Gly
210 215 220
Val Tyr Ser Tyr Phe Gly His Ser Ser He He Gly Phe Asp Gly His 225 230 235 240
Thr Leu Gly Glu Cys Gly Glu Glu Glu Asn Gly Leu Gin Tyr Ala Gin
245 250 255
Leu Ser Val Gin Gin He Arg Asp Ala Arg Lys Tyr Asp Gin Ser Gin
260 265 270
Asn Gin Leu Phe Lys Leu Leu His Arg Gly Tyr Ser Gly Val Phe Ala
275 280 285
Ser Gly Asp Gly Asp Lys Gly Val Ala Glu Cys Pro Phe Glu Phe Tyr
290 295 300
Lys Thr Trp Val Asn Asp Pro Lys Lys Ala Gin Glu Asn Val Glu Lys 305 310 315 320
He Thr Arg Pro Ser Val Gly Val Ala Ala Cys Pro Val Gly Asp Leu
325 330 335
Pro Thr Lys
(2) INFORMATION FOR SEQ ID NO: 1259:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1722 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 40...1686 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1259:
ATCTCTATCC TTTATAGAAT TTGTTGTGGA GACTGGCTT ATG AAT AAT GTT TTT 54
Met Asn Asn Val Phe 1 5
GTT AAG GGT TTG TTT TTT TTT CTT TTA TTG TTT GGG TTT TTT TTG AAA 102 Val Lys Gly Leu Phe Phe Phe Leu Leu Leu Phe Gly Phe Phe Leu Lys 10 15 20
GCT TCA GAA AGC CCA AAC GCT ACT CTT AAT CCA TCT AAA GAA AAT GTT 150 Ala Ser Glu Ser Pro Asn Ala Thr Leu Asn Pro Ser Lys Glu Asn Val 25 30 35
TCT GTT GAA GAG CAA AAG CGT TTT GGA GGC GTT TTA GTT TTT GCA AGA 198 Ser Val Glu Glu Gin Lys Arg Phe Gly Gly Val Leu Val Phe Ala Arg 40 45 50
GGC GCT GAT GGC TCG AGC ATG GAT CCT GCT TTA GTG ACT GAT GGC GAA 246 Gly Ala Asp Gly Ser Ser Met Asp Pro Ala Leu Val Thr Asp Gly Glu 55 60 65
AGC TAT GTA GCA ACG GGC AAT ATT TAT GAC ACG CTC GTG CAA TTC AGA 294 Ser Tyr Val Ala Thr Gly Asn He Tyr Asp Thr Leu Val Gin Phe Arg 70 75 80 85
TAC GGC ACC ACA GAA GTT GAA CCC GCC TTA GCG ACA AGC TGG GAC ATA 342 Tyr Gly Thr Thr Glu Val Glu Pro Ala Leu Ala Thr Ser Trp Asp He 90 95 100
TCC CCA GAT GGT CTT GTA TAT ACC TTT CAT TTA CGC AAA GGG GTT TAT 390 Ser Pro Asp Gly Leu Val Tyr Thr Phe His Leu Arg Lys Gly Val Tyr 105 110 115
TTC CAC CAA ACG AAG TAT TGG AAT AAA AAA GTA GAG TTT AGC GCT AAA 438 Phe His Gin Thr Lys Tyr Trp Asn Lys Lys Val Glu Phe Ser Ala Lys 120 125 130
GAT GTG CTG TTT TCG TTT GAA CGC CAG ATG GAT AAA GCT AAA CGA TAT 486 Asp Val Leu Phe Ser Phe Glu Arg Gin Met Asp Lys Ala Lys Arg Tyr
135 140 145
TAT AGC CCG GGG GCT AAA AGC TAT AAG TAT TGG GAA GGC ATG GGC ATG 534 Tyr Ser Pro Gly Ala Lys Ser Tyr Lys Tyr Trp Glu Gly Met Gly Met 150 155 160 165
TCT CAT ATT ATT AAG AGC ATT GAA GCT TTA GAT GAC TAT ACC ATT AGA 582 Ser His He He Lys Ser He Glu Ala Leu Asp Asp Tyr Thr He Arg 170 175 180
TTC ACA CTT AAT GGG CCA GAA GCC CCG TTT TTA GCG AAT TTG GGC ATG 630 Phe Thr Leu Asn Gly Pro Glu Ala Pro Phe Leu Ala Asn Leu Gly Met 185 190 195
GAC TTT TTG AGC ATT TTG AGT AAG GAT TAC GCT GAT TAC CTG GCT CAA 678 Asp Phe Leu Ser He Leu Ser Lys Asp Tyr Ala Asp Tyr Leu Ala Gin 200 205 210
AAT AAT AAA AAA GAC GAG TTG GCT AAA AAA CCT ATT GGG ACA GGG CCT 726 Asn Asn Lys Lys Asp Glu Leu Ala Lys Lys Pro He Gly Thr Gly Pro 215 220 225
TTC AAA TTC TTT TTG TGG AAT AAA GAT GAA AAA ATC ATT CTT TTA AAA 774 Phe Lys Phe Phe Leu Trp Asn Lys Asp Glu Lys He He Leu Leu Lys 230 235 240 245
AAT CAA GAT TAT TGG GGG CCT AAA GCG TAT TTG GAT AAG GTG GTG GTG 822 Asn Gin Asp Tyr Trp Gly Pro Lys Ala Tyr Leu Asp Lys Val Val Val 250 255 260
CGC ACC ATT CCT AAT TCT TCC ACT CGC GCT TTA GCG TTG CGC ACC GGC 870 Arg Thr He Pro Asn Ser Ser Thr Arg Ala Leu Ala Leu Arg Thr Gly 265 270 275
GAA ATC ATG CTC ATG ACT GGG CCT AAT CTC AAT GAA GTG GAG CAA TTA 918 Glu He Met Leu Met Thr Gly Pro Asn Leu Asn Glu Val Glu Gin Leu 280 285 290
GAA AAA GTC CCT AAT ATC GTG GTG GAC AAA AGT GCT GGG TTG TTG GCG 966 Glu Lys Val Pro Asn He Val Val Asp Lys Ser Ala Gly Leu Leu Ala 295 300 305
AGT TGG CTT TCG TTG AAC ACG CAA AAA AAG TAT TTT GAC AAC CCT TTG 1014 Ser Trp Leu Ser Leu Asn Thr Gin Lys Lys Tyr Phe Asp Asn Pro Leu 310 315 320 325
GTG CGT TTG GCT ATC AAT CAT GCG ATC AAT GCA GAT GAT TAC ATC AAA 1062 Val Arg Leu Ala He Asn His Ala He Asn Ala Asp Asp Tyr He Lys 330 335 340
GTG CTT TAT GAA GGC TTT GCT CAA AAA ATG GTC AAT CCT TTC CCG CCC 1110 Val Leu Tyr Glu Gly Phe Ala Gin Lys Met Val Asn Pro Phe Pro Pro 345 350 355
ACC ATA TGG GGT TAT AAC TAC AAT ATC AAA CCC TAT GAA TAC GAT TTG 1158 Thr He Trp Gly Tyr Asn Tyr Asn He Lys Pro Tyr Glu Tyr Asp Leu 360 365 370
AAA AAG GCT AAG GAG TTG TTG AAA CAA GCG GGC TAT CCT AAC GGC TTT 1206 Lys Lys Ala Lys Glu Leu Leu Lys Gin Ala Gly Tyr Pro Asn Gly Phe 375 380 385
AAA ACC ACT ATT TTT ACC ACT GCC ACT CGT AAC CCA AAA GGA GCG GTG 1254 Lys Thr Thr He Phe Thr Thr Ala Thr Arg Asn Pro Lys Gly Ala Val 390 395 400 405
TTC ATA CAG GCG AGC CTG GCT AAA ATT GGC ATT GAT GTG AAA ATT GAA 1302 Phe He Gin Ala Ser Leu Ala Lys He Gly He Asp Val Lys He Glu 410 415 420 GTG TAT GAG TGG GGG GCT TAT TTG AAA AGA ACG GGT CTG GGC GAA CAT 1350 Val Tyr Glu Trp Gly Ala Tyr Leu Lys Arg Thr Gly Leu Gly Glu His 425 430 435
GAA ATG GCG TTT TCA GGC TGG ATG GCA GAC ATT GCG GAT CCG GAT AAT 1398 Glu Met Ala Phe Ser Gly Trp Met Ala Asp He Ala Asp Pro Asp Asn 440 445 450
TTC TTA TAC ACC TTA TGG AGC GAG CAA GCC GCC TCA GCT ATA CCC ACT 1446 Phe Leu Tyr Thr Leu Trp Ser Glu Gin Ala Ala Ser Ala He Pro Thr 455 460 465
CAA AAC CAT TCC TTT TAT AAA AAT AAG GAG TTT TCC AAT CTG CTC ATA 1494 Gin Asn His Ser Phe Tyr Lys Asn Lys Glu Phe Ser Asn Leu Leu He 470 475 480 485
AAG GCT AAA CGC GTT TCG GAT CAA AAA GAG AGG GAA GCC CTT TAT TTA 1542 Lys Ala Lys Arg Val Ser Asp Gin Lys Glu Arg Glu Ala Leu Tyr Leu 490 495 500
AAG GCA CAA GAA ATT ATC CAT AAA GAC GCG CCT TAT GTG CCT TTA GCC 1590 Lys Ala Gin Glu He He His Lys Asp Ala Pro Tyr Val Pro Leu Ala 505 510 515
TAT CCT TAT TCG GTG GTG CCG CAT TTG TCT AAA GTT AAG GGT TAT AAA 1638 Tyr Pro Tyr Ser Val Val Pro His Leu Ser Lys Val Lys Gly Tyr Lys 520 525 530
ACG ACC GGA GTG AGC GTG AAT CGC TTC TTT AAG GTG TAT TTA GAA AAA T 1687 Thr Thr Gly Val Ser Val Asn Arg Phe Phe Lys Val Tyr Leu Glu Lys 535 540 545
AAAAGGGGTT GCATGCTGAG TTTTATCATT AAGCG 1722
(2) INFORMATION FOR SEQ ID NO: 1260:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 549 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1260:
Met Asn Asn Val Phe Val Lys Gly Leu Phe Phe Phe Leu Leu Leu Phe
1 5 10 15
Gly Phe Phe Leu Lys Ala Ser Glu Ser Pro Asn Ala Thr Leu Asn Pro
20 25 30
Ser Lys Glu Asn Val Ser Val Glu Glu Gin Lys Arg Phe Gly Gly Val
35 40 45
Leu Val Phe Ala Arg Gly Ala Asp Gly Ser Ser Met Asp Pro Ala Leu
50 55 60
Val Thr Asp Gly Glu Ser Tyr Val Ala Thr Gly Asn He Tyr Asp Thr 65 70 75 80
Leu Val Gin Phe Arg Tyr Gly Thr Thr Glu Val Glu Pro Ala Leu Ala
85 90 95
Thr Ser Trp Asp He Ser Pro Asp Gly Leu Val Tyr Thr Phe His Leu
100 105 110
Arg Lys Gly Val Tyr Phe His Gin Thr Lys Tyr Trp Asn Lys Lys Val
115 120 125
Glu Phe Ser Ala Lys Asp Val Leu Phe Ser Phe Glu Arg Gin Met Asp
130 135 140
Lys Ala Lys Arg Tyr Tyr Ser Pro Gly Ala Lys Ser Tyr Lys Tyr Trp 145 150 155 160
Glu Gly Met Gly Met Ser His He He Lys Ser He Glu Ala Leu Asp
165 170 175
Asp Tyr Thr He Arg Phe Thr Leu Asn Gly Pro Glu Ala Pro Phe Leu
180 185 190
Ala Asn Leu Gly Met Asp Phe Leu Ser He Leu Ser Lys Asp Tyr Ala
195 200 205
Asp Tyr Leu Ala Gin Asn Asn Lys Lys Asp Glu Leu Ala Lys Lys Pro
210 215 220
He Gly Thr Gly Pro Phe Lys Phe Phe Leu Trp Asn Lys Asp Glu Lys 225 230 235 240
He He Leu Leu Lys Asn Gin Asp Tyr Trp Gly Pro Lys Ala Tyr Leu
245 250 255
Asp Lys Val Val Val Arg Thr He Pro Asn Ser Ser Thr Arg Ala Leu
260 265 270
Ala Leu Arg Thr Gly Glu He Met Leu Met Thr Gly Pro Asn Leu Asn
275 280 285
Glu Val Glu Gin Leu Glu Lys Val Pro Asn He Val Val Asp Lys Ser
290 295 300
Ala Gly Leu Leu Ala Ser Trp Leu Ser Leu Asn Thr Gin Lys Lys Tyr 305 310 315 320
Phe Asp Asn Pro Leu Val Arg Leu Ala He Asn His Ala He Asn Ala
325 330 335
Asp Asp Tyr He Lys Val Leu Tyr Glu Gly Phe Ala Gin Lys Met Val
340 345 350
Asn Pro Phe Pro Pro Thr He Trp Gly Tyr Asn Tyr Asn He Lys Pro
355 360 365
Tyr Glu Tyr Asp Leu Lys Lys Ala Lys Glu Leu Leu Lys Gin Ala Gly
370 375 380
Tyr Pro Asn Gly Phe Lys Thr Thr He Phe Thr Thr Ala Thr Arg Asn 385 390 395 400
Pro Lys Gly Ala Val Phe He Gin Ala Ser Leu Ala Lys He Gly He
405 410 415
Asp Val Lys He Glu Val Tyr Glu Trp Gly Ala Tyr Leu Lys Arg Thr
420 425 430
Gly Leu Gly Glu His Glu Met Ala Phe Ser Gly Trp Met Ala Asp He
435 440 445
Ala Asp Pro Asp Asn Phe Leu Tyr Thr Leu Trp Ser Glu Gin Ala Ala
450 455 460
Ser Ala He Pro Thr Gin Asn His Ser Phe Tyr Lys Asn Lys Glu Phe 465 470 475 480
Ser Asn Leu Leu He Lys Ala Lys Arg Val Ser Asp Gin Lys Glu Arg
485 490 495
Glu Ala Leu Tyr Leu Lys Ala Gin Glu He He His Lys Asp Ala Pro 500 505 510 Tyr Val Pro Leu Ala Tyr Pro Tyr Ser Val Val Pro His Leu Ser Lys
515 520 525
Val Lys Gly Tyr Lys Thr Thr Gly Val Ser Val Asn Arg Phe Phe Lys
530 535 540
Val Tyr Leu Glu Lys 545
(2) INFORMATION FOR SEQ ID NO: 1261:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1080 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...1050 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1261:
GCGTGAATCG CTTCTTTAAG GTGTATTTAG AAAAATAAAA GGGGTTGC ATG CTG AGT 57
Met Leu Ser 1
TTT ATC ATT AAG CGT ATT TTG TGG GCG ATC CCC ACG CTG TTT GGA GTG 105 Phe He He Lys Arg He Leu Trp Ala He Pro Thr Leu Phe Gly Val 5 10 15
AGT ATC ATT GTG TTT ATG ATG GTG CAT TTA GTG CCA GGA GAT CCG GCA 153 Ser He He Val Phe Met Met Val His Leu Val Pro Gly Asp Pro Ala 20 25 30 35
TTA GTG ATT TTA GGT GAA AAG GCC AAT CAA GCC GCT ATT GAT GCT TTA 201 Leu Val He Leu Gly Glu Lys Ala Asn Gin Ala Ala He Asp Ala Leu 40 45 50
AGA GAG CAA TTT GGA TTG AAT AAG CCC TTG ATA GAG CAG TAT TTT TTC 249 Arg Glu Gin Phe Gly Leu Asn Lys Pro Leu He Glu Gin Tyr Phe Phe 55 60 65
TTT ATC AAT AAT GTG TTG CAT GGC AAT TTT GGC ACT TCT ATC ATG ACC 297 Phe He Asn Asn Val Leu His Gly Asn Phe Gly Thr Ser He Met Thr 70 75 80
GGT GAG CCT GTG ATG CAT GAG TTT TGG CAA CGC TTC CCG GCC ACG GTG 345 Gly Glu Pro Val Met His Glu Phe Trp Gin Arg Phe Pro Ala Thr Val 85 90 95
GAA TTA GCT TTG ATC GCT CTG TTT ATG GCT CTT GTT TTG GGT ATT AGC 393 Glu Leu Ala Leu He Ala Leu Phe Met Ala Leu Val Leu Gly He Ser 100 105 110 115
GTT GGC GTG TTA GCT GCG ATC AAA CGC TAT AGC GTG TTT GAT TAT TCC 441 Val Gly Val Leu Ala Ala He Lys Arg Tyr Ser Val Phe Asp Tyr Ser 120 125 130
AGC ATG ACT TTT GCT TTA GCC GGG ATT TCT ATG CCG GTG TTT TGG CTA 489 Ser Met Thr Phe Ala Leu Ala Gly He Ser Met Pro Val Phe Trp Leu 135 140 145
GGG CTC ATG CTG ATT TAT ATC TTT AGC GTG CAA TTG GGG TGG TTG CCT 537 Gly Leu Met Leu He Tyr He Phe Ser Val Gin Leu Gly Trp Leu Pro 150 155 160
GTT TTT GGG CGT TTG AGC GAT GTG TAT TAT TTA GAT GGC CCC ACA GGT 585 Val Phe Gly Arg Leu Ser Asp Val Tyr Tyr Leu Asp Gly Pro Thr Gly 165 170 175
CTT TAT TTG ATA GAC AGC CTG ATC GCA AGG GAT TAT GGG GCG TTT ATG 633 Leu Tyr Leu He Asp Ser Leu He Ala Arg Asp Tyr Gly Ala Phe Met 180 185 190 195
GAT ACG ATC AAG CAC TTG ATT TTG CCT AGC ATT GTG TTA GCC ACG GTT 681 Asp Thr He Lys His Leu He Leu Pro Ser He Val Leu Ala Thr Val 200 205 210
TCT ACC GCT GTT ATT GCC AGA ATG ACT CGC GCG AGC ATG GCA GAA GTG 729 Ser Thr Ala Val He Ala Arg Met Thr Arg Ala Ser Met Ala Glu Val 215 220 225
TCT AAA GAA GAT TAT GTG CGT ACC GCT AAA GCT AAG GGG TGT AGC TCC 777 Ser Lys Glu Asp Tyr Val Arg Thr Ala Lys Ala Lys Gly Cys Ser Ser 230 235 240
TTT AGG GTG ATT TTT GTG CAC ACT TTG CGT AAT GCT TTA ATC CCT GTA 825 Phe Arg Val He Phe Val His Thr Leu Arg Asn Ala Leu He Pro Val 245 250 255
ACG ACT ATC GCA GGC TTG ATG TTG GCC GGG CTT TTA GGG GGG AGC ATG 873 Thr Thr He Ala Gly Leu Met Leu Ala Gly Leu Leu Gly Gly Ser Met 260 265 270 275
ATA ACT GAA ACG GTT TTC TCA TGG CCT GGG ATT GGT AAG TGG ATT GTT 921 He Thr Glu Thr Val Phe Ser Trp Pro Gly He Gly Lys Trp He Val 280 285 290
AAT GCG CTC AAC CAG CGC GAT TTC CCG ATT ATC CAG TCC ATG TCT TTG 969 Asn Ala Leu Asn Gin Arg Asp Phe Pro He He Gin Ser Met Ser Leu 295 300 305
ATT ATT GCC ATG ATG TAT ATT GGG GCT AAT CTC TTA GTG GAT ATT TTA 1017 He He Ala Met Met Tyr He Gly Ala Asn Leu Leu Val Asp He Leu 310 315 320 TAC GCT TTT ATT GAT CCT AGA ATA AGG TTG TCA TAATGGAGTC TTTTAGAGAG 1070 Tyr Ala Phe He Asp Pro Arg He Arg Leu Ser 325 330
TTTATCCAAC 1080
(2) INFORMATION FOR SEQ ID NO:1262:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 334 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1262:
Met Leu Ser Phe He He Lys Arg He Leu Trp Ala He Pro Thr Leu
1 5 10 15
Phe Gly Val Ser He He Val Phe Met Met Val His Leu Val Pro Gly
20 25 30
Asp Pro Ala Leu Val He Leu Gly Glu Lys Ala Asn Gin Ala Ala He
35 40 45
Asp Ala Leu Arg Glu Gin Phe Gly Leu Asn Lys Pro Leu He Glu Gin
50 55 60
Tyr Phe Phe Phe He Asn Asn Val Leu His Gly Asn Phe Gly Thr Ser 65 70 75 80
He Met Thr Gly Glu Pro Val Met His Glu Phe Trp Gin Arg Phe Pro
85 90 95
Ala Thr Val Glu Leu Ala Leu He Ala Leu Phe Met Ala Leu Val Leu
100 105 110
Gly He Ser Val Gly Val Leu Ala Ala He Lys Arg Tyr Ser Val Phe
115 120 125
Asp Tyr Ser Ser Met Thr Phe Ala Leu Ala Gly He Ser Met Pro Val
130 135 140
Phe Trp Leu Gly Leu Met Leu He Tyr He Phe Ser Val Gin Leu Gly 145 150 155 160
Trp Leu Pro Val Phe Gly Arg Leu Ser Asp Val Tyr Tyr Leu Asp Gly
165 170 175
Pro Thr Gly Leu Tyr Leu He Asp Ser Leu He Ala Arg Asp Tyr Gly
180 185 190
Ala Phe Met Asp Thr He Lys His Leu He Leu Pro Ser He Val Leu
195 200 205
Ala Thr Val Ser Thr Ala Val He Ala Arg Met Thr Arg Ala Ser Met
210 215 220
Ala Glu Val Ser Lys Glu Asp Tyr Val Arg Thr Ala Lys Ala Lys Gly 225 230 235 240
Cys Ser Ser Phe Arg Val He Phe Val His Thr Leu Arg Asn Ala Leu
245 250 255
He Pro Val Thr Thr He Ala Gly Leu Met Leu Ala Gly Leu Leu Gly
260 265 270
Gly Ser Met He Thr Glu Thr Val Phe Ser Trp Pro Gly He Gly Lys
275 280 285
Trp He Val Asn Ala Leu Asn Gin Arg Asp Phe Pro He He Gin Ser 290 295 300
Met Ser Leu He He Ala Met Met Tyr He Gly Ala Asn Leu Leu Val 305 310 315 320
Asp He Leu Tyr Ala Phe He Asp Pro Arg He Arg Leu Ser 325 330
(2) INFORMATION FOR SEQ ID NO: 1263:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 955 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 32...892 (D) OTHER INFORMATTON:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1263:
ATCCTAAACG CACCTCTTAA AAGGAGCTTG C ATG ATT TTA GAA GTT AAA GAT 52
Met He Leu Glu Val Lys Asp 1 5
TTA AAA ACT TAT TTT TTC ACC GAT AAG GGC GTG AAT AAA GCA GTG GAT 100 Leu Lys Thr Tyr Phe Phe Thr Asp Lys Gly Val Asn Lys Ala Val Asp 10 15 20
GGT GTG AGT TTT GGT TTG AAA AAG TCT CAA ACG CTC TGC ATT GTA GGG 148 Gly Val Ser Phe Gly Leu Lys Lys Ser Gin Thr Leu Cys He Val Gly 25 30 35
GAG AGC GGG AGC GGG AAA AGC ATC ACT TCG CTC TCT ATT TTA GGG TTG 196 Glu Ser Gly Ser Gly Lys Ser He Thr Ser Leu Ser He Leu Gly Leu 40 45 50 55
ATT GAA AAA CCG GGT CAA ATT GTG GGA GGG AGC ATT CAA TTT TTA GGG 244 He Glu Lys Pro Gly Gin He Val Gly Gly Ser He Gin Phe Leu Gly 60 65 70
CAG GAT TTG TTG CAA CTC AAA GAA AAG CAG ATG CAA AAA GAA ATT AGG 292 Gin Asp Leu Leu Gin Leu Lys Glu Lys Gin Met Gin Lys Glu He Arg 75 80 85
GGT AAA AAA ATT GGC ATG ATC TTT CAA GAG CCT ATG ACA AGC CTA AAC 340 Gly Lys Lys He Gly Met He Phe Gin Glu Pro Met Thr Ser Leu Asn 90 95 100
CCT TCC TAC ACG GTG GGG TTT CAA ATC AAT GAA GTG TTG AAA ATC CAC 388 Pro Ser Tyr Thr Val Gly Phe Gin He Asn Glu Val Leu Lys He His 105 110 115
CAC CCT AAC CTC AAT AAA AAA GAA CGC TTA GAA AGG GTG GTT TAT GAA 436 His Pro Asn Leu Asn Lys Lys Glu Arg Leu Glu Arg Val Val Tyr Glu 120 125 130 135
TTA GAG CGT GTG GGC ATT CCC CAT GCA GGG GAT AAA TAC CAC GAA TAC 484 Leu Glu Arg Val Gly He Pro His Ala Gly Asp Lys Tyr His Glu Tyr 140 145 150
CCT TTC AAT CTC AGC GGG GGG CAG CGC CAA AGG GTG ATG ATC GCT ATG 532 Pro Phe Asn Leu Ser Gly Gly Gin Arg Gin Arg Val Met He Ala Met 155 160 165
GCT ATG GTG TGT GAG CCT GAA ATC TTG ATC GCT GAT GAG CCT ACG ACA 580 Ala Met Val Cys Glu Pro Glu He Leu He Ala Asp Glu Pro Thr Thr 170 175 180
GCC TTA GAT GTA ACC ATT CAA GCG CAA ATT TTA GAA TTG ATG AAA GAA 628 Ala Leu Asp Val Thr He Gin Ala Gin He Leu Glu Leu Met Lys Glu 185 190 195
TTG CAA CAA AAA AAA GGC ACT TCT ATT TTG TTT ATC ACC CAT GAT TTA 676 Leu Gin Gin Lys Lys Gly Thr Ser He Leu Phe He Thr His Asp Leu 200 205 210 215
GGC GTG GTG GCG CAA ATC GCT GAT GAA GTG GTG GTG ATG TAT AAA GGG 724 Gly Val Val Ala Gin He Ala Asp Glu Val Val Val Met Tyr Lys Gly 220 225 230
CAT GTG GTG GAG CAA GCG AGT GCG AAA GAG CTT TTT GCT GAT CCA AGA 772 His Val Val Glu Gin Ala Ser Ala Lys Glu Leu Phe Ala Asp Pro Arg 235 240 245
CAC CCT TAT ACG AAA GCT CTT TTA AGC GCG ATC CCT AAA CCG GGC AAA 820 His Pro Tyr Thr Lys Ala Leu Leu Ser Ala He Pro Lys Pro Gly Lys 250 255 260
GAA TAC CGC AAA AAA CGC TTA GAG ACC GTG GAT GAA AAT GTG GAT TAT 868 Glu Tyr Arg Lys Lys Arg Leu Glu Thr Val Asp Glu Asn Val Asp Tyr 265 270 275
TTG AGT TTT CAA AAG GAG TTG CGA TGAAGCTCTT AGAAATTAAA GAATTGAAAA 922 Leu Ser Phe Gin Lys Glu Leu Arg 280 285
AATCCTATGC GATAGACAGG GGGTTATTCA AGC 955
(2) INFORMATION FOR SEQ ID NO: 1264:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 287 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1264:
Met He Leu Glu Val Lys Asp Leu Lys Thr Tyr Phe Phe Thr Asp Lys
1 5 10 15
Gly Val Asn Lys Ala Val Asp Gly Val Ser Phe Gly Leu Lys Lys Ser
20 25 30
Gin Thr Leu Cys He Val Gly Glu Ser Gly Ser Gly Lys Ser He Thr
35 40 45
Ser Leu Ser He Leu Gly Leu He Glu Lys Pro Gly Gin He Val Gly
50 55 60
Gly Ser He Gin Phe Leu Gly Gin Asp Leu Leu Gin Leu Lys Glu Lys 65 70 75 80
Gin Met Gin Lys Glu He Arg Gly Lys Lys He Gly Met He Phe Gin
85 90 95
Glu Pro Met Thr Ser Leu Asn Pro Ser Tyr Thr Val Gly Phe Gin He
100 105 110
Asn Glu Val Leu Lys He His His Pro Asn Leu Asn Lys Lys Glu Arg
115 120 125
Leu Glu Arg Val Val Tyr Glu Leu Glu Arg Val Gly He Pro His Ala
130 135 140
Gly Asp Lys Tyr His Glu Tyr Pro Phe Asn Leu Ser Gly Gly Gin Arg 145 150 155 160
Gin Arg Val Met He Ala Met Ala Met Val Cys Glu Pro Glu He Leu
165 170 175
He Ala Asp Glu Pro Thr Thr Ala Leu Asp Val Thr He Gin Ala Gin
180 185 190
He Leu Glu Leu Met Lys Glu Leu Gin Gin Lys Lys Gly Thr Ser He
195 200 205
Leu Phe He Thr His Asp Leu Gly Val Val Ala Gin He Ala Asp Glu
210 215 220
Val Val Val Met Tyr Lys Gly His Val Val Glu Gin Ala Ser Ala Lys 225 230 235 240
Glu Leu Phe Ala Asp Pro Arg His Pro Tyr Thr Lys Ala Leu Leu Ser
245 250 255
Ala He Pro Lys Pro Gly Lys Glu Tyr Arg Lys Lys Arg Leu Glu Thr
260 265 270
Val Asp Glu Asn Val Asp Tyr Leu Ser Phe Gin Lys Glu Leu Arg 275 280 285
(2) INFORMATION FOR SEQ ID NO: 1265:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 894 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 37...840 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1265:
AAATGTGGAT TATTTGAGTT TTCAAAAGGA GTTGCG ATG AAG CTC TTA GAA ATT 54
Met Lys Leu Leu Glu He 1 5
AAA GAA TTG AAA AAA TCC TAT GCG ATA GAC AGG GGG TTA TTC AAG CCT 102 Lys Glu Leu Lys Lys Ser Tyr Ala He Asp Arg Gly Leu Phe Lys Pro 10 15 20
AAA AGA GTG ATC CAT GCG CTC AAT GGG ATC AGT TTT GAA GTG GAA CAA 150 Lys Arg Val He His Ala Leu Asn Gly He Ser Phe Glu Val Glu Gin 25 30 35
AAT GAA GTT TTG AGC ATT GTG GGG GAG AGC GGT TGC GGG AAA AGC ACG 198 Asn Glu Val Leu Ser He Val Gly Glu Ser Gly Cys Gly Lys Ser Thr 40 45 50
ACA GCC AAA ATT TTA GCC GGG ATT GAA AGG CAA GAT AGC GGG GCG ATT 246 Thr Ala Lys He Leu Ala Gly He Glu Arg Gin Asp Ser Gly Ala He 55 60 65 70
TAT TTC AAT GGT AAG CGC CAT TTG CAT TTT AGC AAA CAG GAT TGG TTT 294 Tyr Phe Asn Gly Lys Arg His Leu His Phe Ser Lys Gin Asp Trp Phe 75 80 85
GAT TAC CGC AAA AAG GTG CAA ATG ATT TTT CAA GAC CCT TAT TCT AGC 342 Asp Tyr Arg Lys Lys Val Gin Met He Phe Gin Asp Pro Tyr Ser Ser 90 95 100
CTA AAC CCT CGG TGG AAA GTG GGC GAG ATC ATC GCT GAA CCC TTG CTT 390 Leu Asn Pro Arg Trp Lys Val Gly Glu He He Ala Glu Pro Leu Leu 105 110 115
TTA AAT TCT CAT TTT TCA AAA AAA GAA ATC AAA ACA AAA GTG CTA GAG 438 Leu Asn Ser His Phe Ser Lys Lys Glu He Lys Thr Lys Val Leu Glu 120 125 130
ATC ATG CAA AAA GTG GGC TTG AAA TTA GAA TGG ATC GAT CGT TAC CCC 486 He Met Gin Lys Val Gly Leu Lys Leu Glu Trp He Asp Arg Tyr Pro 135 140 145 150
CAC CAA TTT TCA GGC GGT CAA AGG CAA CGA ATC GGC ATT GCT AGG GCG 534 His Gin Phe Ser Gly Gly Gin Arg Gin Arg He Gly He Ala Arg Ala 155 160 165
CTC ATT TTG CAT CCT AGC GTG GTG ATT TGC GAT GAG CCT GTG TCT GCG 582 Leu He Leu His Pro Ser Val Val He Cys Asp Glu Pro Val Ser Ala 170 175 180
CTA GAC GTG TCC ATT CAA GCG CAA GTG TTG AAT TTG CTC TTG GAT TTG 630 Leu Asp Val Ser He Gin Ala Gin Val Leu Asn Leu Leu Leu Asp Leu 185 190 195
CAA AAA GAA ATG GGG CTG ACT TAT ATT TTT ATC AGC CAT GAT TTA GGG 678 Gin Lys Glu Met Gly Leu Thr Tyr He Phe He Ser His Asp Leu Gly 200 205 210
GTG GTG GAG CAT ATA AGC GAT AAA ATC ATC GTA ATG AAT CAG GGG CAA 726 Val Val Glu His He Ser Asp Lys He He Val Met Asn Gin Gly Gin 215 220 225 230
ATC GTA GAA ACG GGG GAT GTG GAT AGC GTG ATA AGC GCT CCA AAG CAC 774 He Val Glu Thr Gly Asp Val Asp Ser Val He Ser Ala Pro Lys His 235 240 245
CCT TAT ACG CAG AAA TTA CTC AAT GCG GTG CCG CAT TTG GAA AAA TCC 822 Pro Tyr Thr Gin Lys Leu Leu Asn Ala Val Pro His Leu Glu Lys Ser 250 255 260
ATG CAA AGA TTT GCC AAA TAAAAGAAAG GATTTTTAAG CTGTGTTTGT AGATAGCG 878 Met Gin Arg Phe Ala Lys 265
TGGAAATTAT CATCGC 894
(2) INFORMATION FOR SEQ ID NO: 1266:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 268 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1266:
Met Lys Leu Leu Glu He Lys Glu Leu Lys Lys Ser Tyr Ala He Asp
1 5 10 15
Arg Gly Leu Phe Lys Pro Lys Arg Val He His Ala Leu Asn Gly He
20 25 30
Ser Phe Glu Val Glu Gin Asn Glu Val Leu Ser He Val Gly Glu Ser
35 40 45
Gly Cys Gly Lys Ser Thr Thr Ala Lys He Leu Ala Gly He Glu Arg
50 55 60
Gin Asp Ser Gly Ala He Tyr Phe Asn Gly Lys Arg His Leu His Phe 65 70 75 80
Ser Lys Gin Asp Trp Phe Asp Tyr Arg Lys Lys Val Gin Met He Phe
85 90 95
Gin Asp Pro Tyr Ser Ser Leu Asn Pro Arg Trp Lys Val Gly Glu He
100 105 110
He Ala Glu Pro Leu Leu Leu Asn Ser His Phe Ser Lys Lys Glu He
115 120 125
Lys Thr Lys Val Leu Glu He Met Gin Lys Val Gly Leu Lys Leu Glu
130 135 140
Trp He Asp Arg Tyr Pro His Gin Phe Ser Gly Gly Gin Arg Gin Arg 145 150 155 160
He Gly He Ala Arg Ala Leu He Leu His Pro Ser Val Val He Cys
165 170 175
Asp Glu Pro Val Ser Ala Leu Asp Val Ser He Gin Ala Gin Val Leu
180 185 190
Asn Leu Leu Leu Asp Leu Gin Lys Glu Met Gly Leu Thr Tyr He Phe
195 200 205
He Ser His Asp Leu Gly Val Val Glu His He Ser Asp Lys He He
210 215 220
Val Met Asn Gin Gly Gin He Val Glu Thr Gly Asp Val Asp Ser Val 225 230 235 240
He Ser Ala Pro Lys His Pro Tyr Thr Gin Lys Leu Leu Asn Ala Val
245 250 255
Pro His Leu Glu Lys Ser Met Gin Arg Phe Ala Lys 260 265
(2) INFORMATION FOR SEQ ID NO: 1267:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1141 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 62...1087 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1267:
TTAAGCTGTG TTTGTAGATA GCGTGGAAAT TATCATCGCT TCGGGTAAGG GGGGGCCTGG 60 A ATG GTG AGT TTT AGG CGA GAA AAA TTT GTC ATC AAA GGA GGC CCT GAT 109 Met Val Ser Phe Arg Arg Glu Lys Phe Val He Lys Gly Gly Pro Asp 1 5 10 15
GGG GGC GAT GGA GGC GAT GGA GGC GAT GTG TAT TTT GAA GTG GAT AAC 157 Gly Gly Asp Gly Gly Asp Gly Gly Asp Val Tyr Phe Glu Val Asp Asn 20 25 30
AAT ACC GAC ACT CTA GCG AGT TTT AGA GGC ACC AAA CAC CAT AAG GCT 205 Asn Thr Asp Thr Leu Ala Ser Phe Arg Gly Thr Lys His His Lys Ala 35 40 45
AAA AAT GGG GCT CCA GGA GGT ACA CGA AAT TGC GCG GGC AAA AAG GGC 253 Lys Asn Gly Ala Pro Gly Gly Thr Arg Asn Cys Ala Gly Lys Lys Gly 50 55 60
GAA GAC AAG ATC ATT GTC GTG CCA CCA GGA ACG CAG GTT TTT GTA GGT 301 Glu Asp Lys He He Val Val Pro Pro Gly Thr Gin Val Phe Val Gly 65 70 75 80 GAT GAG TTG TGG CTT GAT TTA GTG GAA CCT AAA GAA AGG GTG TTA GCC 349 Asp Glu Leu Trp Leu Asp Leu Val Glu Pro Lys Glu Arg Val Leu Ala 85 90 95
TTA AAA GGG GGC AAG GGG GGG TTA GGG AAT GCA CAT TTT AAA AGC GCG 397 Leu Lys Gly Gly Lys Gly Gly Leu Gly Asn Ala His Phe Lys Ser Ala 100 105 110
ACT AAA CAA CAA CCC ACT TAC GCG CAA AAA GGC TTA GAG GGG GTT GAA 445 Thr Lys Gin Gin Pro Thr Tyr Ala Gin Lys Gly Leu Glu Gly Val Glu 115 120 125
AAA TGC GTG CGT TTG GAA TTA AAA CTC ATC GCT GAT ATA GGG TTA GTG 493 Lys Cys Val Arg Leu Glu Leu Lys Leu He Ala Asp He Gly Leu Val 130 135 140
GGC TTC CCT AAT GCG GGT AAA TCC ACG CTC ATT TCC ACC ATC TCT AAC 541 Gly Phe Pro Asn Ala Gly Lys Ser Thr Leu He Ser Thr He Ser Asn 145 150 155 160
GCT AAG CCT AAA ATC GCT AAC TAT GAA TTT ACG ACT CTA GTG CCT AAT 589 Ala Lys Pro Lys He Ala Asn Tyr Glu Phe Thr Thr Leu Val Pro Asn 165 170 175
TTA GGG GTT GTG AGC GTG GAT GAA AAA AGC GGA TTT CTA ATG GCG GAT 637 Leu Gly Val Val Ser Val Asp Glu Lys Ser Gly Phe Leu Met Ala Asp 180 185 190
ATT CCT GGC ATT ATT GAA GGG GCT AGC GAG GGA AAG GGC TTA GGG ATT 685 He Pro Gly He He Glu Gly Ala Ser Glu Gly Lys Gly Leu Gly He 195 200 205
AGC TTT TTA AAG CAT ATT GAA CGC ACC AAA GTT CTA GCT TTT GTT TTA 733 Ser Phe Leu Lys His He Glu Arg Thr Lys Val Leu Ala Phe Val Leu 210 215 220
GAC GCT TCC AGG CTG GAT TTA GGC ATT AAA GAG CAA TAC CAA CGC TTG 781 Asp Ala Ser Arg Leu Asp Leu Gly He Lys Glu Gin Tyr Gin Arg Leu 225 230 235 240
AGG TTG GAG TTG GAA AAA TTT TCA TCC GCT TTG GCC AAT AAG CCT TTT 829 Arg Leu Glu Leu Glu Lys Phe Ser Ser Ala Leu Ala Asn Lys Pro Phe 245 250 255
GGG GTG TTG CTC AAT AAA TGC GAT GTT GTA GAA AAC ATT GAT GAG ATG 877 Gly Val Leu Leu Asn Lys Cys Asp Val Val Glu Asn He Asp Glu Met 260 265 270
ACT AAG GAT TTT TGT GCC TTT TTA AAT TTG GGA GCG CAG AAA TTA AAC 925 Thr Lys Asp Phe Cys Ala Phe Leu Asn Leu Gly Ala Gin Lys Leu Asn 275 280 285
GAG TTT GGT TTA GAG CCG TAT TTA GGG TTT TTG CAC CCC CAT TTA ACC 973 Glu Phe Gly Leu Glu Pro Tyr Leu Gly Phe Leu His Pro His Leu Thr 290 295 300 AAT GAT TTT GAA AAT AAC CCT AAT GAG CAA TCA GCG CTC TTT GTC TTG 1021 Asn Asp Phe Glu Asn Asn Pro Asn Glu Gin Ser Ala Leu Phe Val Leu 305 310 315 320
CCC CTT TCA GCG GTT AGC GCT CTT AAT GTG CAT GCA CTC AAA TTT GTG 1069 Pro Leu Ser Ala Val Ser Ala Leu Asn Val His Ala Leu Lys Phe Val 325 330 335
TTG TTG GAA GCG TTA CCC TAAAACGCTA TTTTTAAAAT AATCCATTAA AATAAAGG 1125 Leu Leu Glu Ala Leu Pro 340
CGAGGAATGA AAAGAT 1141
(2) INFORMATION FOR SEQ ID NO: 1268:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 342 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1268:
Met Val Ser Phe Arg Arg Glu Lys Phe Val He Lys Gly Gly Pro Asp
1 5 10 15
Gly Gly Asp Gly Gly Asp Gly Gly Asp Val Tyr Phe Glu Val Asp Asn
20 25 30
Asn Thr Asp Thr Leu Ala Ser Phe Arg Gly Thr Lys His His Lys Ala
35 40 45
Lys Asn Gly Ala Pro Gly Gly Thr Arg Asn Cys Ala Gly Lys Lys Gly
50 55 60
Glu Asp Lys He He Val Val Pro Pro Gly Thr Gin Val Phe Val Gly 65 70 75 80
Asp Glu Leu Trp Leu Asp Leu Val Glu Pro Lys Glu Arg Val Leu Ala
85 90 95
Leu Lys Gly Gly Lys Gly Gly Leu Gly Asn Ala His Phe Lys Ser Ala
100 105 110
Thr Lys Gin Gin Pro Thr Tyr Ala Gin Lys Gly Leu Glu Gly Val Glu
115 120 125
Lys Cys Val Arg Leu Glu Leu Lys Leu He Ala Asp He Gly Leu Val
130 135 140
Gly Phe Pro Asn Ala Gly Lys Ser Thr Leu He Ser Thr He Ser Asn 145 150 155 160
Ala Lys Pro Lys He Ala Asn Tyr Glu Phe Thr Thr Leu Val Pro Asn
165 170 175
Leu Gly Val Val Ser Val Asp Glu Lys Ser Gly Phe Leu Met Ala Asp
180 185 190
He Pro Gly He He Glu Gly Ala Ser Glu Gly Lys Gly Leu Gly He
195 200 205
Ser Phe Leu Lys His He Glu Arg Thr Lys Val Leu Ala Phe Val Leu
210 215 220
Asp Ala Ser Arg Leu Asp Leu Gly He Lys Glu Gin Tyr Gin Arg Leu 225 230 235 240
Arg Leu Glu Leu Glu Lys Phe Ser Ser Ala Leu Ala Asn Lys Pro Phe
245 250 255
Gly Val Leu Leu Asn Lys Cys Asp Val Val Glu Asn He Asp Glu Met
260 265 270
Thr Lys Asp Phe Cys Ala Phe Leu Asn Leu Gly Ala Gin Lys Leu Asn
275 280 285
Glu Phe Gly Leu Glu Pro Tyr Leu Gly Phe Leu His Pro His Leu Thr
290 295 300
Asn Asp Phe Glu Asn Asn Pro Asn Glu Gin Ser Ala Leu Phe Val Leu 305 310 315 320
Pro Leu Ser Ala Val Ser Ala Leu Asn Val His Ala Leu Lys Phe Val
325 330 335
Leu Leu Glu Ala Leu Pro 340
(2) INFORMATION FOR SEQ ID NO: 1269:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 621 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...567 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1269:
ATTAAAGGAT AATGA ATG AAA AAA ATG GTT TTG GTA TCG GTT TTA CTA GCA 51 Met Lys Lys Met Val Leu Val Ser Val Leu Leu Ala 1 5 10
GGG TTT TTG CAA GCG GTG AAT TTG GAT TTA TCT TCG GCT AAG CTA ACA 99 Gly Phe Leu Gin Ala Val Asn Leu Asp Leu Ser Ser Ala Lys Leu Thr 15 20 25
TGG ACA GCC TTT AAA ACT AAG GCT AAA ACA CCA GTA AAT GGG AGT TTT 147 Trp Thr Ala Phe Lys Thr Lys Ala Lys Thr Pro Val Asn Gly Ser Phe 30 35 40
GAA AGC ATC ACC TAT AAA TTG GGT AAA TCT CAA GAT AGT TTA AAA ACC 195 Glu Ser He Thr Tyr Lys Leu Gly Lys Ser Gin Asp Ser Leu Lys Thr 45 50 55 60
CTT TTA GAG GGA GCG AGC GCG AGC ATG GAT AGC TTG AAA GTC AAT TTA 243 Leu Leu Glu Gly Ala Ser Ala Ser Met Asp Ser Leu Lys Val Asn Leu 65 70 75 GGC GAT GAA TTG AAA AAC AAA AAT GTG AAA GAA GCT TTT TTC GCT CTT 291 Gly Asp Glu Leu Lys Asn Lys Asn Val Lys Glu Ala Phe Phe Ala Leu 80 85 90
TTT AAA AAC ACT AAC ATC AAA GTA ACT TTC AGG AAT GTG ATA GAA GGC 339 Phe Lys Asn Thr Asn He Lys Val Thr Phe Arg Asn Val He Glu Gly 95 100 105
GAT CAT GCA GGT TCT CTT ACG GCT TAT GTG AGA ATG AAT GAA AAG CTG 387 Asp His Ala Gly Ser Leu Thr Ala Tyr Val Arg Met Asn Glu Lys Leu 110 115 120
GTG AAA GTG CCT ATG CAA TAC ACG ATT GCT GAG GAT AAG ATC GTG GTT 435 Val Lys Val Pro Met Gin Tyr Thr He Ala Glu Asp Lys He Val Val 125 130 135 140
AAA GGG GTT TTG GAT TTA TTG AAT TTT GGC TTG AAA AAC GAA TTA GCG 483 Lys Gly Val Leu Asp Leu Leu Asn Phe Gly Leu Lys Asn Glu Leu Ala 145 150 155
AGC TTG GCC AAA CGA TGC GAA AGC TTT CAT GAG GGC TTG ACT TGG TCG 531 Ser Leu Ala Lys Arg Cys Glu Ser Phe His Glu Gly Leu Thr Trp Ser 160 165 170
CAA GTG GAA ATC CAA TTT GAA AGC ATG ATC AAG GGA TAATGTAAAA TCATGG 583 Gin Val Glu He Gin Phe Glu Ser Met He Lys Gly 175 180
AGTTGTTGCA CAGCATTAAT GATTTCAATG AAGCTAAG 621
(2) INFORMATION FOR SEQ ID NO: 1270:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 184 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1270:
Met Lys Lys Met Val Leu Val Ser Val Leu Leu Ala Gly Phe Leu Gin
1 5 10 15
Ala Val Asn Leu Asp Leu Ser Ser Ala Lys Leu Thr Trp Thr Ala Phe
20 25 30
Lys Thr Lys Ala Lys Thr Pro Val Asn Gly Ser Phe Glu Ser He Thr
35 40 45
Tyr Lys Leu Gly Lys Ser Gin Asp Ser Leu Lys Thr Leu Leu Glu Gly
50 55 60
Ala Ser Ala Ser Met Asp Ser Leu Lys Val Asn Leu Gly Asp Glu Leu 65 70 75 80
Lys Asn Lys Asn Val Lys Glu Ala Phe Phe Ala Leu Phe Lys Asn Thr
85 90 95
Asn He Lys Val Thr Phe Arg Asn Val He Glu Gly Asp His Ala Gly 100 105 110
Ser Leu Thr Ala Tyr Val Arg Met Asn Glu Lys Leu Val Lys Val Pro
115 120 125
Met Gin Tyr Thr He Ala Glu Asp Lys He Val Val Lys Gly Val Leu
130 135 140
Asp Leu Leu Asn Phe Gly Leu Lys Asn Glu Leu Ala Ser Leu Ala Lys 145 150 155 160
Arg Cys Glu Ser Phe His Glu Gly Leu Thr Trp Ser Gin Val Glu He
165 170 175
Gin Phe Glu Ser Met He Lys Gly 180
(2) INFORMATION FOR SEQ ID NO: 1271:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1406 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...1338 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1271:
CAAGTGGAAA TCCAATTTGA AAGCATGATC AAGGGATAAT GTAAAATC ATG GAG TTG 57
Met Glu Leu 1
TTG CAC AGC ATT AAT GAT TTC AAT GAA GCT AAG CAG GTG ATC GCT GGG 105 Leu His Ser He Asn Asp Phe Asn Glu Ala Lys Gin Val He Ala Gly 5 10 15
GGG GTC AAT TCA CCT GTT AGG GCG TTT AAG AGC GTT AAA GGC ACT CCC 153 Gly Val Asn Ser Pro Val Arg Ala Phe Lys Ser Val Lys Gly Thr Pro 20 25 30 35
CCC TTT ATT TTA AAA GGC AAG GGG GCG TAT CTT TAT GAT GTG GAT AAC 201 Pro Phe He Leu Lys Gly Lys Gly Ala Tyr Leu Tyr Asp Val Asp Asn 40 45 50
AAC CAT TAT ATA GAT TTT GTG CAA AGC TGG GGG CCT TTG ATT TTT GGG 249 Asn His Tyr He Asp Phe Val Gin Ser Trp Gly Pro Leu He Phe Gly 55 60 65
CAT GCT GAT GAA GAG ATT GAA GAA AAT ATT ATT AAT GCA TTA AAA AAA 297 His Ala Asp Glu Glu He Glu Glu Asn He He Asn Ala Leu Lys Lys 70 75 80 GGC ACT TCT TTT GGC GCT CCC ACA GAA TTA GAA ACC ACT TTA GCT AAG 345 Gly Thr Ser Phe Gly Ala Pro Thr Glu Leu Glu Thr Thr Leu Ala Lys 85 90 95
GAA ATC ATT TCT TGT TAT GAA GGC TTA GAT AAG GTG CGT TTA GTC AGT 393 Glu He He Ser Cys Tyr Glu Gly Leu Asp Lys Val Arg Leu Val Ser 100 105 110 115
AGC GGC ACA GAA GCG ACC ATG AGC GCG ATA CGA CTC GCT AGA GCT TAT 441 Ser Gly Thr Glu Ala Thr Met Ser Ala He Arg Leu Ala Arg Ala Tyr 120 125 130
AGC CAA AAA GAT GAT TTG ATC AAG TTT GAA GGG TGC TAC CAT GGG CAT 489 Ser Gin Lys Asp Asp Leu He Lys Phe Glu Gly Cys Tyr His Gly His 135 140 145
AGC GAC TCC TTA TTG GTG AAA GCG GGT AGC GGG TGT GCT ACT TTT GGA 537 Ser Asp Ser Leu Leu Val Lys Ala Gly Ser Gly Cys Ala Thr Phe Gly 150 155 160
TCG CCT TCT TCT TTA GGC GTG CCG AAC GAT TTT AGC AAA CAC ACT CTA 585 Ser Pro Ser Ser Leu Gly Val Pro Asn Asp Phe Ser Lys His Thr Leu 165 170 175
GTG GCT CGT TAT AAC GAT TTA AAC TCC ACA GAA GAA TGC TTT AAA AAA 633 Val Ala Arg Tyr Asn Asp Leu Asn Ser Thr Glu Glu Cys Phe Lys Lys 180 185 190 195
GGC AAT GTG GGT TGC GTC ATC ATT GAA CCC ATT GCC GGG AAT ATG GGG 681 Gly Asn Val Gly Cys Val He He Glu Pro He Ala Gly Asn Met Gly 200 205 210
TTA GTG CCG GCT CAA AAA GAG TTT TTA TTG GGC TTA AAG GCC TTG TGT 729 Leu Val Pro Ala Gin Lys Glu Phe Leu Leu Gly Leu Lys Ala Leu Cys 215 220 225
GAA AAA TAC CAA GCG GTG CTG ATT TTA GAT GAA GTG ATG AGC GGT TTT 777 Glu Lys Tyr Gin Ala Val Leu He Leu Asp Glu Val Met Ser Gly Phe 230 235 240
AGA GCG AGC TTG AGC GGT TCG CAA GAA TTT TAT GGC GTG GTG CCG GAT 825 Arg Ala Ser Leu Ser Gly Ser Gin Glu Phe Tyr Gly Val Val Pro Asp 245 250 255
TTG GTA ACC TTT GGT AAG GTG ATA GGT GCT GGG CTT CCT TTG GCG TGT 873 Leu Val Thr Phe Gly Lys Val He Gly Ala Gly Leu Pro Leu Ala Cys 260 265 270 275
TTT GGA GGG CGT GCA GAA ATT ATG GAC TTG CTT TCG CCC ATT GGA AGC 921 Phe Gly Gly Arg Ala Glu He Met Asp Leu Leu Ser Pro He Gly Ser 280 285 290
GTG TAT CAA GCA GGC ACT TTG AGC GGT AAC CCC CTA GCG GTG TGC GCG 969 Val Tyr Gin Ala Gly Thr Leu Ser Gly Asn Pro Leu Ala Val Cys Ala 295 300 305 GGG TTG AGT GCG CTT TAT AAA ATC AAA AGA GAC AAA ACC CTT TAT ACT 1017 Gly Leu Ser Ala Leu Tyr Lys He Lys Arg Asp Lys Thr Leu Tyr Thr 310 315 320
CGC TTA GAC GCT TTA GCT ATT CGT TTG ACT CAA GGC TTA CAA AAG AGC 1065 Arg Leu Asp Ala Leu Ala He Arg Leu Thr Gin Gly Leu Gin Lys Ser 325 330 335
GCT CAA AAC TAT AAC ATC GCT TTA GAG ACG CTT AAC ATG GGG AGC ATG 1113 Ala Gin Asn Tyr Asn He Ala Leu Glu Thr Leu Asn Met Gly Ser Met 340 345 350 355
TTT GGC TTT TTC TTT AAC GAA AAT GCG GTG CAC GAT TTT GAT GAC GCT 1161 Phe Gly Phe Phe Phe Asn Glu Asn Ala Val His Asp Phe Asp Asp Ala 360 365 370
TTA AAA AGC GAT ACG GAG ATG TTT GCA AAA TTC CAC CAA AAA ATG CTC 1209 Leu Lys Ser Asp Thr Glu Met Phe Ala Lys Phe His Gin Lys Met Leu 375 380 385
TTT AAG GGC GTG TAT TTG GCG TGC TCA AGC TTT GAA ACC GGC TTT ATT 1257 Phe Lys Gly Val Tyr Leu Ala Cys Ser Ser Phe Glu Thr Gly Phe He 390 395 400
TGT GAG CCT ATG ACT GAA GAG ATG ATT GAT TTA ACG ATC GCA AAA GCC 1305 Cys Glu Pro Met Thr Glu Glu Met He Asp Leu Thr He Ala Lys Ala 405 410 415
GAT GAA AGT TTT GAT GAA ATC ATA AAA GGT GTG TGAATTTTTT GAAAAAGCCA 1358 Asp Glu Ser Phe Asp Glu He He Lys Gly Val 420 425 430
AAGTATTATA AATTCATAGA GGGGGCGAAT TATTTGAGCT TGGGGCTT 1406
(2) INFORMATION FOR SEQ ID NO: 1272:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 430 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1272:
Met Glu Leu Leu His Ser He Asn Asp Phe Asn Glu Ala Lys Gin Val
1 5 10 15
He Ala Gly Gly Val Asn Ser Pro Val Arg Ala Phe Lys Ser Val Lys
20 25 30
Gly Thr Pro Pro Phe He Leu Lys Gly Lys Gly Ala Tyr Leu Tyr Asp
35 40 45
Val Asp Asn Asn His Tyr He Asp Phe Val Gin Ser Trp Gly Pro Leu
50 55 60
He Phe Gly His Ala Asp Glu Glu He Glu Glu Asn He He Asn Ala 65 70 75 80
Leu Lys Lys Gly Thr Ser Phe Gly Ala Pro Thr Glu Leu Glu Thr Thr
85 90 95
Leu Ala Lys Glu He He Ser Cys Tyr Glu Gly Leu Asp Lys Val Arg
100 105 110
Leu Val Ser Ser Gly Thr Glu Ala Thr Met Ser Ala He Arg Leu Ala
115 120 125
Arg Ala Tyr Ser Gin Lys Asp Asp Leu He Lys Phe Glu Gly Cys Tyr
130 135 140
His Gly His Ser Asp Ser Leu Leu Val Lys Ala Gly Ser Gly Cys Ala 145 150 155 160
Thr Phe Gly Ser Pro Ser Ser Leu Gly Val Pro Asn Asp Phe Ser Lys
165 170 175
His Thr Leu Val Ala Arg Tyr Asn Asp Leu Asn Ser Thr Glu Glu Cys
180 185 190
Phe Lys Lys Gly Asn Val Gly Cys Val He He Glu Pro He Ala Gly
195 200 205
Asn Met Gly Leu Val Pro Ala Gin Lys Glu Phe Leu Leu Gly Leu Lys
210 215 220
Ala Leu Cys Glu Lys Tyr Gin Ala Val Leu He Leu Asp Glu Val Met 225 230 235 240
Ser Gly Phe Arg Ala Ser Leu Ser Gly Ser Gin Glu Phe Tyr Gly Val
245 250 255
Val Pro Asp Leu Val Thr Phe Gly Lys Val He Gly Ala Gly Leu Pro
260 265 270
Leu Ala Cys Phe Gly Gly Arg Ala Glu He Met Asp Leu Leu Ser Pro
275 280 285
He Gly Ser Val Tyr Gin Ala Gly Thr Leu Ser Gly Asn Pro Leu Ala
290 295 300
Val Cys Ala Gly Leu Ser Ala Leu Tyr Lys He Lys Arg Asp Lys Thr 305 310 315 320
Leu Tyr Thr Arg Leu Asp Ala Leu Ala He Arg Leu Thr Gin Gly Leu
325 330 335
Gin Lys Ser Ala Gin Asn Tyr Asn He Ala Leu Glu Thr Leu Asn Met
340 345 350
Gly Ser Met Phe Gly Phe Phe Phe Asn Glu Asn Ala Val His Asp Phe
355 360 365
Asp Asp Ala Leu Lys Ser Asp Thr Glu Met Phe Ala Lys Phe His Gin
370 375 380
Lys Met Leu Phe Lys Gly Val Tyr Leu Ala Cys Ser Ser Phe Glu Thr 385 390 395 400
Gly Phe He Cys Glu Pro Met Thr Glu Glu Met He Asp Leu Thr He
405 410 415
Ala Lys Ala Asp Glu Ser Phe Asp Glu He He Lys Gly Val 420 425 430
(2) INFORMATION FOR SEQ ID NO:1273:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1100 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 90...1052 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1273:
AAGGCGCAAA ACTTAGCCAA AAAAGAGATG GACGCACTAG ATTCTCATCT GTTAGCGTTT 60 TTAAATCAAA ATGCAAATGC CATTCACTG ATG CCC AAA ATC CCT ATC ACG CTC 113
Met Pro Lys He Pro He Thr Leu 1 5
ATC ACC GGT TTT TTA GGC AGC GGT AAA ACG AGT TTT TTG AGC GAA TAT 161 He Thr Gly Phe Leu Gly Ser Gly Lys Thr Ser Phe Leu Ser Glu Tyr 10 15 20
TTA AAC CAA ACA GAT CAC CAA GGC GTC GCT CTT ATC ATC AAT GAA ATC 209 Leu Asn Gin Thr Asp His Gin Gly Val Ala Leu He He Asn Glu He 25 30 35 40
GGT CAA GCC GCT TTG GAT CAG CGC ATC TTA AGC GTT CAA TAT TGC GGT 257 Gly Gin Ala Ala Leu Asp Gin Arg He Leu Ser Val Gin Tyr Cys Gly 45 50 55
GAA AAA ATG CTC TAT CTT AAC GCA GGG TGC GTG TGT TGC AAC AAA CGC 305 Glu Lys Met Leu Tyr Leu Asn Ala Gly Cys Val Cys Cys Asn Lys Arg 60 65 70
TTG GAT TTA GTG GAG TCT CTA AAA GCC ACG CTC AAC AAC TAT GAA TGG 353 Leu Asp Leu Val Glu Ser Leu Lys Ala Thr Leu Asn Asn Tyr Glu Trp 75 80 85
CAC GGC GAA ATT CTA AGG CGC ATC ATC ATT GAA ACT ACC GGT TTA GCC 401 His Gly Glu He Leu Arg Arg He He He Glu Thr Thr Gly Leu Ala 90 95 100
AAC CCG GCA CCG ATT TTA TGG ACG ATT TTG AGC GAC ACT TTT TTA GGA 449 Asn Pro Ala Pro He Leu Trp Thr He Leu Ser Asp Thr Phe Leu Gly 105 110 115 120
GTG CAT TTT GAG ATT CAA AGC GTG GTG GCT TGC GTG GAT GCA TTG AAT 497 Val His Phe Glu He Gin Ser Val Val Ala Cys Val Asp Ala Leu Asn 125 130 135
GCT AGA GAG CAT TTA ACC AAC AAT GAA GCT AAA GAG CAA ATC GTT TTT 545 Ala Arg Glu His Leu Thr Asn Asn Glu Ala Lys Glu Gin He Val Phe 140 145 150
GCT GAT AGC GTT TTA TTG ACC AAA ACG GAT TTA CAA AAC GAT AGC GCG 593 Ala Asp Ser Val Leu Leu Thr Lys Thr Asp Leu Gin Asn Asp Ser Ala 155 160 165 GCT TTA ACA AAA CTA AAA GAG AGG ATA CAA GCC CTT AAC CCT AGT GCA 641 Ala Leu Thr Lys Leu Lys Glu Arg He Gin Ala Leu Asn Pro Ser Ala 170 175 180
GAA ATT TTT GAC AAG AGG GCG ATA GAC TAT GAG AGC CTC TTT TCA CGC 689 Glu He Phe Asp Lys Arg Ala He Asp Tyr Glu Ser Leu Phe Ser Arg 185 190 195 200
AAA AAT AGG GCG CGA AAT TTT ATG CCA AGA ATG CCA AAA GAT TCG CAC 737 Lys Asn Arg Ala Arg Asn Phe Met Pro Arg Met Pro Lys Asp Ser His 205 210 215
TCG CAA GGC TTT GAG ACT TTA AGC ATT AAT TTT GAA GGC ACG ATG GAG 785 Ser Gin Gly Phe Glu Thr Leu Ser He Asn Phe Glu Gly Thr Met Glu 220 225 230
TGG AGC GCG TTT GGG ATT TGG CTG AGT TTG TTA TTG CAT CAA TAC GGC 833 Trp Ser Ala Phe Gly He Trp Leu Ser Leu Leu Leu His Gin Tyr Gly 235 240 245
ACA CAG ATT TTA CGC ATC AAG GGG ATT ATT GAC ATT GGA AGC GGC TTT 881 Thr Gin He Leu Arg He Lys Gly He He Asp He Gly Ser Gly Phe 250 255 260
TTG GTG AGT ATT AAC GGC GTG ATG CAT GTC ATT TAC CCG CCT AAG CAT 929 Leu Val Ser He Asn Gly Val Met His Val He Tyr Pro Pro Lys His 265 270 275 280
ATT TTA AAG GAT CAA AAC GGC TCT AAC CTC GTT TTT ATC ATG CGC CAT 977 He Leu Lys Asp Gin Asn Gly Ser Asn Leu Val Phe He Met Arg His 285 290 295
TTA GAG CGT GAA AAA ATC TTA AAT TCC TTA AAG GGT TTT AAG GAT TTT 1025 Leu Glu Arg Glu Lys He Leu Asn Ser Leu Lys Gly Phe Lys Asp Phe 300 305 310
CTC GGC ATC AAG GGT TTT GAA ACC CAA TAATTTTTCT ATTTATGGAT AGCTGTT 1079 Leu Gly He Lys Gly Phe Glu Thr Gin 315 320
TGCATTTTGA TGGGGAAAAG A 1100
(2) INFORMATION FOR SEQ ID NO: 1274:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 321 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1274: Met Pro Lys He Pro He Thr Leu He Thr Gly Phe Leu Gly Ser Gly 1 5 10 15
Lys Thr Ser Phe Leu Ser Glu Tyr Leu Asn Gin Thr Asp His Gin Gly
20 25 30
Val Ala Leu He He Asn Glu He Gly Gin Ala Ala Leu Asp Gin Arg
35 40 45
He Leu Ser Val Gin Tyr Cys Gly Glu Lys Met Leu Tyr Leu Asn Ala
50 55 60
Gly Cys Val Cys Cys Asn Lys Arg Leu Asp Leu Val Glu Ser Leu Lys 65 70 75 80
Ala Thr Leu Asn Asn Tyr Glu Trp His Gly Glu He Leu Arg Arg He
85 90 95
He He Glu Thr Thr Gly Leu Ala Asn Pro Ala Pro He Leu Trp Thr
100 105 110
He Leu Ser Asp Thr Phe Leu Gly Val His Phe Glu He Gin Ser Val
115 120 125
Val Ala Cys Val Asp Ala Leu Asn Ala Arg Glu His Leu Thr Asn Asn
130 135 140
Glu Ala Lys Glu Gin He Val Phe Ala Asp Ser Val Leu Leu Thr Lys 145 150 155 160
Thr Asp Leu Gin Asn Asp Ser Ala Ala Leu Thr Lys Leu Lys Glu Arg
165 170 175
He Gin Ala Leu Asn Pro Ser Ala Glu He Phe Asp Lys Arg Ala He
180 185 190
Asp Tyr Glu Ser Leu Phe Ser Arg Lys Asn Arg Ala Arg Asn Phe Met
195 200 205
Pro Arg Met Pro Lys Asp Ser His Ser Gin Gly Phe Glu Thr Leu Ser
210 215 220
He Asn Phe Glu Gly Thr Met Glu Trp Ser Ala Phe Gly He Trp Leu 225 230 235 240
Ser Leu Leu Leu His Gin Tyr Gly Thr Gin He Leu Arg He Lys Gly
245 250 255
He He Asp He Gly Ser Gly Phe Leu Val Ser He Asn Gly Val Met
260 265 270
His Val He Tyr Pro Pro Lys His He Leu Lys Asp Gin Asn Gly Ser
275 280 285
Asn Leu Val Phe He Met Arg His Leu Glu Arg Glu Lys He Leu Asn
290 295 300
Ser Leu Lys Gly Phe Lys Asp Phe Leu Gly He Lys Gly Phe Glu Thr 305 310 315 320
Gin
(2) INFORMATION FOR SEQ ID NO: 1275:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1713 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 26...1648 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1275:
TTAAAAGCAA ACAAGAAAGT TAAGC ATG CAC ACT CTC ATT AAG GGC ATT TTA 52
Met His Thr Leu He Lys Gly He Leu 1 5
GAA GAG ATT TTA GAA GAA GAA GTC ATT GTT GAA TAC CCT AAA GAC AGA 100 Glu Glu He Leu Glu Glu Glu Val He Val Glu Tyr Pro Lys Asp Arg 10 15 20 25
GAG CAT GGG CAT TAC GCT ACG CCC ATT GCT TTC AAT CTC GCC AAA GTT 148 Glu His Gly His Tyr Ala Thr Pro He Ala Phe Asn Leu Ala Lys Val 30 35 40
TTT AAA AAA TCG CCC TTA GCC ATC GCT GAA GAG TTA GCC CTT AAA ATC 196 Phe Lys Lys Ser Pro Leu Ala He Ala Glu Glu Leu Ala Leu Lys He 45 50 55
AGC ACG CAT GAA AAA ACT CAA GGG CTT TTT GAC AGC GTA GTG GCT TGT 244 Ser Thr His Glu Lys Thr Gin Gly Leu Phe Asp Ser Val Val Ala Cys 60 65 70
AAG GGC TAT ATC AAT TTC ACG CTT TCT TTA GAT TTT TTG GAG CGT TTC 292 Lys Gly Tyr He Asn Phe Thr Leu Ser Leu Asp Phe Leu Glu Arg Phe 75 80 85
ACC CAA AAA GCT TTG GAA TTG AAA GAA AAA TTT GGC TCT CAA GTT AAA 340 Thr Gin Lys Ala Leu Glu Leu Lys Glu Lys Phe Gly Ser Gin Val Lys 90 95 100 105
AGC GAA CGT TCT CAA AAA ATC TTT TTA GAA TTT GTG AGC GCT AAC CCC 388 Ser Glu Arg Ser Gin Lys He Phe Leu Glu Phe Val Ser Ala Asn Pro 110 115 120
ACA GGG CCT TTA CAC ATA GGG CAT GCT AGA GGG GCG GTG TTT GGC GAT 436 Thr Gly Pro Leu His He Gly His Ala Arg Gly Ala Val Phe Gly Asp 125 130 135
AGT TTG GCT AAA ATC GCT CGC TTT TTA GGG CAT GAA GTT TTA TGC GAG 484 Ser Leu Ala Lys He Ala Arg Phe Leu Gly His Glu Val Leu Cys Glu 140 145 150
TAT TAT GTC AAT GAC ATG GGA TCT CAA ATC CGC TTG TTA GGG CTT TCT 532 Tyr Tyr Val Asn Asp Met Gly Ser Gin He Arg Leu Leu Gly Leu Ser 155 160 165
GTA TGG CTC GCT TAC AGA GAA CAT GTT TTA AAA GAA AGC GTA ACT TAC 580 Val Trp Leu Ala Tyr Arg Glu His Val Leu Lys Glu Ser Val Thr Tyr 170 175 180 185
CCA GAA GTC TTT TAC AAA GGC GAA TAC ATC ATT GAA ATC GCT AAA AAG 628 Pro Glu Val Phe Tyr Lys Gly Glu Tyr He He Glu He Ala Lys Lys 190 195 200
GCG AAC AAC GAT TTA GAA CCA AGC CTT TTA AAA GAA AAC GAA GAA ACG 676 Ala Asn Asn Asp Leu Glu Pro Ser Leu Leu Lys Glu Asn Glu Glu Thr 205 210 215
ATT ATT GAA GTT TTA AGC GGC TAT GCT AGG GAT CTA ATG CTT TTA GAA 724 He He Glu Val Leu Ser Gly Tyr Ala Arg Asp Leu Met Leu Leu Glu 220 225 230
ATT AAA GAT AAT TTA GAC GCT TTA GGC ATT CAT TTT GAT TCC TAT GCG 772 He Lys Asp Asn Leu Asp Ala Leu Gly He His Phe Asp Ser Tyr Ala 235 240 245
AGC GAA AAA GAA GTT TTT AAA CAT AAA GAT GCG GTG TTT GAA CAA TTA 820 Ser Glu Lys Glu Val Phe Lys His Lys Asp Ala Val Phe Glu Gin Leu 250 255 260 265
GAA AAA GCG AAC GCC CTT TAT GAA AAG GAT TCT AAA ATC TGG CTC AAA 868 Glu Lys Ala Asn Ala Leu Tyr Glu Lys Asp Ser Lys He Trp Leu Lys 270 275 280
TCT TCA CTC TAC CAG GAT GAA AGC GAT CGG GTG CTC ATT AAA GAA GAT 916 Ser Ser Leu Tyr Gin Asp Glu Ser Asp Arg Val Leu He Lys Glu Asp 285 290 295
AAA AGC TAC ACT TAT TTA GCG GGC GAT ATT GTC TAT CAT GAT GAA AAA 964 Lys Ser Tyr Thr Tyr Leu Ala Gly Asp He Val Tyr His Asp Glu Lys 300 305 310
TTC AAG CAA GAT TAT ACC AAA TAC ATC AAC ATT TGG GGG GCA GAC CAC 1012 Phe Lys Gin Asp Tyr Thr Lys Tyr He Asn He Trp Gly Ala Asp His 315 320 325
CAC GGC TAT ATC GCT AGA GTG AAA GCC AGC CTT GAG TTT TTG GGC TAT 1060 His Gly Tyr He Ala Arg Val Lys Ala Ser Leu Glu Phe Leu Gly Tyr 330 335 340 345
GAT TCC AAC AAG CTT GAA GTC TTG CTC GCT CAA ATG GTG CGC TTG CTC 1108 Asp Ser Asn Lys Leu Glu Val Leu Leu Ala Gin Met Val Arg Leu Leu 350 355 360
AAA GAT AAC GAG CCT TAC AAG ATG AGT AAA AGA GCG GGT AAT TTT ATT 1156 Lys Asp Asn Glu Pro Tyr Lys Met Ser Lys Arg Ala Gly Asn Phe He 365 370 375
TTG ATT AAA GAT GTG GTT GAT GAT GTG GGT AAG GAC GCT TTG AGG TTT 1204 Leu He Lys Asp Val Val Asp Asp Val Gly Lys Asp Ala Leu Arg Phe 380 385 390
ATT TTT TTG AGC AAA CGG CTT GAC ACT CAT TTA GAA TTT GAT GTC AAT 1252 He Phe Leu Ser Lys Arg Leu Asp Thr His Leu Glu Phe Asp Val Asn 395 400 405 ACT TTA AAA AAG CAA GAC AGC TCA AAC CCC ATT TAC TAT ATC CAT TAC 1300 Thr Leu Lys Lys Gin Asp Ser Ser Asn Pro He Tyr Tyr He His Tyr 410 415 420 425
GCT AAT TCG CGC ATC CAC ACC ATG CTA GAA AAA TCG CCC TTT TCT AAA 1348 Ala Asn Ser Arg He His Thr Met Leu Glu Lys Ser Pro Phe Ser Lys 430 435 440
GAA GAG GTT TTG CAA ACC CCT TTA ACC AAT TTA AAC GCT GAA GAA AAA 1396 Glu Glu Val Leu Gin Thr Pro Leu Thr Asn Leu Asn Ala Glu Glu Lys 445 450 455
TAC TTG CTT TTT AGC GCT TTA AGC TTG CCT AAA GCA ATT GAA TCC TCT 1444 Tyr Leu Leu Phe Ser Ala Leu Ser Leu Pro Lys Ala He Glu Ser Ser 460 465 470
TTT GAA GAA TAC GGC TTG CAA AAA ATG TGC GAA TAC GCA AAA ACC CTC 1492 Phe Glu Glu Tyr Gly Leu Gin Lys Met Cys Glu Tyr Ala Lys Thr Leu 475 480 485
GCA TCA GAA TTC CAC CGC TTC TAT AAC GCT GGC AAA ATC TTA GAC ACC 1540 Ala Ser Glu Phe His Arg Phe Tyr Asn Ala Gly Lys He Leu Asp Thr 490 495 500 505
CCT AAA GCT AAA GAG CTT TTA AAA ATT TGT TTA ATA GTA AGC TTG AGC 1588 Pro Lys Ala Lys Glu Leu Leu Lys He Cys Leu He Val Ser Leu Ser 510 515 520
TTA AGC AAC GCT TTT AAA CTT TTA GGC ATA GAG ATA AAG ACC AAA ATT 1636 Leu Ser Asn Ala Phe Lys Leu Leu Gly He Glu He Lys Thr Lys He 525 530 535
TCC GCT AGA GAT TAAGCCAATA TTTAATTTTT TGTTATAACA TTCCCCTTAT TTTTT 1693 Ser Ala Arg Asp 540
GAAACTAAGG AGAATATTAT 1713
(2) INFORMATION FOR SEQ ID NO: 1276:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 541 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1276:
Met His Thr Leu He Lys Gly He Leu Glu Glu He Leu Glu Glu Glu
1 5 10 15
Val He Val Glu Tyr Pro Lys Asp Arg Glu His Gly His Tyr Ala Thr
20 25 30
Pro He Ala Phe Asn Leu Ala Lys Val Phe Lys Lys Ser Pro Leu Ala 35 40 45
He Ala Glu Glu Leu Ala Leu Lys He Ser Thr His Glu Lys Thr Gin
50 55 60
Gly Leu Phe Asp Ser Val Val Ala Cys Lys Gly Tyr He Asn Phe Thr 65 70 75 80
Leu Ser Leu Asp Phe Leu Glu Arg Phe Thr Gin Lys Ala Leu Glu Leu
85 90 95
Lys Glu Lys Phe Gly Ser Gin Val Lys Ser Glu Arg Ser Gin Lys He
100 105 110
Phe Leu Glu Phe Val Ser Ala Asn Pro Thr Gly Pro Leu His He Gly
115 120 125
His Ala Arg Gly Ala Val Phe Gly Asp Ser Leu Ala Lys He Ala Arg
130 135 140
Phe Leu Gly His Glu Val Leu Cys Glu Tyr Tyr Val Asn Asp Met Gly 145 150 155 160
Ser Gin He Arg Leu Leu Gly Leu Ser Val Trp Leu Ala Tyr Arg Glu
165 170 175
His Val Leu Lys Glu Ser Val Thr Tyr Pro Glu Val Phe Tyr Lys Gly
180 185 190
Glu Tyr He He Glu He Ala Lys Lys Ala Asn Asn Asp Leu Glu Pro
195 200 205
Ser Leu Leu Lys Glu Asn Glu Glu Thr He He Glu Val Leu Ser Gly
210 215 220
Tyr Ala Arg Asp Leu Met Leu Leu Glu He Lys Asp Asn Leu Asp Ala 225 230 235 240
Leu Gly He His Phe Asp Ser Tyr Ala Ser Glu Lys Glu Val Phe Lys
245 250 255
His Lys Asp Ala Val Phe Glu Gin Leu Glu Lys Ala Asn Ala Leu Tyr
260 265 270
Glu Lys Asp Ser Lys He Trp Leu Lys Ser Ser Leu Tyr Gin Asp Glu
275 280 285
Ser Asp Arg Val Leu He Lys Glu Asp Lys Ser Tyr Thr Tyr Leu Ala
290 295 300
Gly Asp He Val Tyr His Asp Glu Lys Phe Lys Gin Asp Tyr Thr Lys 305 310 315 320
Tyr He Asn He Trp Gly Ala Asp His His Gly Tyr He Ala Arg Val
325 330 335
Lys Ala Ser Leu Glu Phe Leu Gly Tyr Asp Ser Asn Lys Leu Glu Val
340 345 350
Leu Leu Ala Gin Met Val Arg Leu Leu Lys Asp Asn Glu Pro Tyr Lys
355 360 365
Met Ser Lys Arg Ala Gly Asn Phe He Leu He Lys Asp Val Val Asp
370 375 380
Asp Val Gly Lys Asp Ala Leu Arg Phe He Phe Leu Ser Lys Arg Leu 385 390 395 400
Asp Thr His Leu Glu Phe Asp Val Asn Thr Leu Lys Lys Gin Asp Ser
405 410 415
Ser Asn Pro He Tyr Tyr He His Tyr Ala Asn Ser Arg He His Thr
420 425 430
Met Leu Glu Lys Ser Pro Phe Ser Lys Glu Glu Val Leu Gin Thr Pro
435 440 445
Leu Thr Asn Leu Asn Ala Glu Glu Lys Tyr Leu Leu Phe Ser Ala Leu
450 455 460
Ser Leu Pro Lys Ala He Glu Ser Ser Phe Glu Glu Tyr Gly Leu Gin 465 470 475 480 Lys Met Cys Glu Tyr Ala Lys Thr Leu Ala Ser Glu Phe His Arg Phe
485 490 495
Tyr Asn Ala Gly Lys He Leu Asp Thr Pro Lys Ala Lys Glu Leu Leu
500 505 510
Lys He Cys Leu He Val Ser Leu Ser Leu Ser Asn Ala Phe Lys Leu
515 520 525
Leu Gly He Glu He Lys Thr Lys He Ser Ala Arg Asp 530 535 540
(2) INFORMATION FOR SEQ ID NO: 1277:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 896 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 38...835 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1277:
AAGTAGAGAT TATATTACCT AGATAGTGAA TCAACGA ATG AAA AGC CAC TTC CAA 55
Met Lys Ser His Phe Gin 1 5
TAC AGC ACG CTA GAA AAT ATC CCT AAA GCC TTT GAC ATT CTC AAA GAC 103 Tyr Ser Thr Leu Glu Asn He Pro Lys Ala Phe Asp He Leu Lys Asp 10 15 20
CCC CCT AAA AAA CTC TAT TGT GTG GGC GAT ACC AAG CTT TTG GAC ACG 151 Pro Pro Lys Lys Leu Tyr Cys Val Gly Asp Thr Lys Leu Leu Asp Thr 25 30 35
CCT TTA AAA GTG GCG ATC ATA GGC ACA AGA AGA CCC ACC CCT TAC AGC 199 Pro Leu Lys Val Ala He He Gly Thr Arg Arg Pro Thr Pro Tyr Ser 40 45 50
AAG CAA CAC ACG ATC ACT CTA GCT AGA GAG CTT GCT AAA AAT GGC GCG 247 Lys Gin His Thr He Thr Leu Ala Arg Glu Leu Ala Lys Asn Gly Ala 55 60 65 70
GTT ATT GTG AGT GGG GGA GCG TTA GGC GTG GAT ATT ATC GCT CAA GAA 295 Val He Val Ser Gly Gly Ala Leu Gly Val Asp He He Ala Gin Glu 75 80 85
AAC GCC TTG CCA AAA ACG ATC ATG CTT TCG CCT TGC AGT TTG GAT TTC 343 Asn Ala Leu Pro Lys Thr He Met Leu Ser Pro Cys Ser Leu Asp Phe 90 95 100 ATC TAT CCT ACG AAC AAT CAT AAA GTG ATC CAA GAA ATC GCG CAA AAC 391 He Tyr Pro Thr Asn Asn His Lys Val He Gin Glu He Ala Gin Asn 105 110 115
GGC TTG ATT TTA AGC GAA TAT GAA AAG GAT TTC ATG CCC ATT AAA GGC 439 Gly Leu He Leu Ser Glu Tyr Glu Lys Asp Phe Met Pro He Lys Gly 120 125 130
TCT TTT TTG GCG AGA AAC CGC CTG GTG ATC GCT TTA AGC GAT GTG GTG 487 Ser Phe Leu Ala Arg Asn Arg Leu Val He Ala Leu Ser Asp Val Val 135 140 145 150
ATT ATC CCC CAA GCG GAT TTA AAA AGC GGC TCT ATG AGC AGC GCG AGA 535 He He Pro Gin Ala Asp Leu Lys Ser Gly Ser Met Ser Ser Ala Arg 155 160 165
TTA GCC CAG AAA TAC CAA AAG CCT TTA TTT GTT TTA CCC CAA CGC CTG 583 Leu Ala Gin Lys Tyr Gin Lys Pro Leu Phe Val Leu Pro Gin Arg Leu 170 175 180
AAT GAG AGC GAT GGC ACT AAT GAG CTT TTA GAA AAA GGG CAG GCT CAA 631 Asn Glu Ser Asp Gly Thr Asn Glu Leu Leu Glu Lys Gly Gin Ala Gin 185 190 195
GGG ATA TTT AAT ATT CAA AAT TTT ATA AAC ACC CTT TTA AAA GAC TAC 679 Gly He Phe Asn He Gin Asn Phe He Asn Thr Leu Leu Lys Asp Tyr 200 205 210
CAT TTA AAA GAA ATG CCT GAA ATG GAA GAT GAA TTT TTA GAA TAT TGT 727 His Leu Lys Glu Met Pro Glu Met Glu Asp Glu Phe Leu Glu Tyr Cys 215 220 225 230
GCC AAA AAC CCG AGC TAT GAA GAA GCG TAT CTC AAA TTT GGG GAT AAG 775 Ala Lys Asn Pro Ser Tyr Glu Glu Ala Tyr Leu Lys Phe Gly Asp Lys 235 240 245
CTT TTA GAA TAC GAG CTG TTG GGT AAG ATC AAG CGC ATC AAT CAC ATT 823 Leu Leu Glu Tyr Glu Leu Leu Gly Lys He Lys Arg He Asn His He 250 255 260
GTG GTG TTA GCG TGATTTTGGC ATGCGATGTG GGGTTAAAAC GCATTGGCAT CGCTG 880 Val Val Leu Ala 265
CGCTTTTAAA TGGCGT 896
(2) INFORMATION FOR SEQ ID NO: 1278:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 266 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1278:
Met Lys Ser His Phe Gin Tyr Ser Thr Leu Glu Asn He Pro Lys Ala
1 5 10 15
Phe Asp He Leu Lys Asp Pro Pro Lys Lys Leu Tyr Cys Val Gly Asp
20 25 30
Thr Lys Leu Leu Asp Thr Pro Leu Lys Val Ala He He Gly Thr Arg
35 40 45
Arg Pro Thr Pro Tyr Ser Lys Gin His Thr He Thr Leu Ala Arg Glu
50 55 60
Leu Ala Lys Asn Gly Ala Val He Val Ser Gly Gly Ala Leu Gly Val 65 70 75 80
Asp He He Ala Gin Glu Asn Ala Leu Pro Lys Thr He Met Leu Ser
85 90 95
Pro Cys Ser Leu Asp Phe He Tyr Pro Thr Asn Asn His Lys Val He
100 105 110
Gin Glu He Ala Gin Asn Gly Leu He Leu Ser Glu Tyr Glu Lys Asp
115 120 125
Phe Met Pro He Lys Gly Ser Phe Leu Ala Arg Asn Arg Leu Val He
130 135 140
Ala Leu Ser Asp Val Val He He Pro Gin Ala Asp Leu Lys Ser Gly 145 150 155 160
Ser Met Ser Ser Ala Arg Leu Ala Gin Lys Tyr Gin Lys Pro Leu Phe
165 170 175
Val Leu Pro Gin Arg Leu Asn Glu Ser Asp Gly Thr Asn Glu Leu Leu
180 185 190
Glu Lys Gly Gin Ala Gin Gly He Phe Asn He Gin Asn Phe He Asn
195 200 205
Thr Leu Leu Lys Asp Tyr His Leu Lys Glu Met Pro Glu Met Glu Asp
210 215 220
Glu Phe Leu Glu Tyr Cys Ala Lys Asn Pro Ser Tyr Glu Glu Ala Tyr 225 230 235 240
Leu Lys Phe Gly Asp Lys Leu Leu Glu Tyr Glu Leu Leu Gly Lys He
245 250 255
Lys Arg He Asn His He Val Val Leu Ala 260 265
(2) INFORMATION FOR SEQ ID NO:1279:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 459 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 82...408 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1279:
TAAAACGCTC TTGAAAGGGT GAGCGTGGAA TTTGCTTCTA TCGTTTGGTT GCTCATAGTC 60 AATATTCTGA TTTTTATTCT C ATG CTG GTG GAT AAA AAT TCG GCT GAT CAA 111
Met Leu Val Asp Lys Asn Ser Ala Asp Gin 1 5 10
AAA ATG TGG CGT ATT CCT GAA AAA GCT TTG TGG GTT TTA TCG CTC CTT 159 Lys Met Trp Arg He Pro Glu Lys Ala Leu Trp Val Leu Ser Leu Leu 15 20 25
GGC GGG TCT GTC GGG TTT TTG GTC GCT ATG GTT GTG TCC CAC CAT AAG 207 Gly Gly Ser Val Gly Phe Leu Val Ala Met Val Val Ser His His Lys 30 35 40
ATC TTA AAG CCT GAG TTT AAA TAC GGC GTT TCG CTC ATT TAC TTG ATA 255 He Leu Lys Pro Glu Phe Lys Tyr Gly Val Ser Leu He Tyr Leu He 45 50 55
GAG AGC ACA ATC CTT TAC TTT GTC AGC AAA GAT CTT TCT TGG ATA GTA 303 Glu Ser Thr He Leu Tyr Phe Val Ser Lys Asp Leu Ser Trp He Val 60 65 70
GCG CTA ACG ATA TTC TCA CTA TCT TTG ATA CTG GTA GCG TTT AAG ATC 351 Ala Leu Thr He Phe Ser Leu Ser Leu He Leu Val Ala Phe Lys He 75 80 85 90
TTC CTC CTT AAA GAC AAC CCT AAC AAA CGC TTC AAA AAC AAC AAG AGG 399 Phe Leu Leu Lys Asp Asn Pro Asn Lys Arg Phe Lys Asn Asn Lys Arg 95 100 105
GAT AAA AAA TAATGTCTTA TTTTTTTAAA ATCATTCTGG GCACAAGCGT GATCGTGGG 457 Asp Lys Lys
GG 459
(2) INFORMATION FOR SEQ ID NO: 1280:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 109 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1280:
Met Leu Val Asp Lys Asn Ser Ala Asp Gin Lys Met Trp Arg He Pro
1 5 10 15
Glu Lys Ala Leu Trp Val Leu Ser Leu Leu Gly Gly Ser Val Gly Phe
20 25 30
Leu Val Ala Met Val Val Ser His His Lys He Leu Lys Pro Glu Phe 35 40 45 Lys Tyr Gly Val Ser Leu He Tyr Leu He Glu Ser Thr He Leu Tyr
50 55 60
Phe Val Ser Lys Asp Leu Ser Trp He Val Ala Leu Thr He Phe Ser 65 70 75 80
Leu Ser Leu He Leu Val Ala Phe Lys He Phe Leu Leu Lys Asp Asn
85 90 95
Pro Asn Lys Arg Phe Lys Asn Asn Lys Arg Asp Lys Lys 100 105
(2) INFORMATION FOR SEQ ID NO: 1281:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 399 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 47...379 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1281:
AGCGCTCAAA TCATTTATTG GTTATCAAAA TATTTTAGGA GTGAGT ATG GAA AAT 55
Met Glu Asn
1
GAT GTT AAA GAA GAT CTA GAG CAA GCA AGA CCA AAG TTA GAG CCA GAA 103 Asp Val Lys Glu Asp Leu Glu Gin Ala Arg Pro Lys Leu Glu Pro Glu 5 10 15
AAG CAA AAG CAA GAG CCA GAG GAA CAG AAA CAA GAA AAA CAA GAC AAA 151 Lys Gin Lys Gin Glu Pro Glu Glu Gin Lys Gin Glu Lys Gin Asp Lys 20 25 30 35
CAA GAG CAG AAG CCA AAG CAA GAA AAA GAA GAG TCA AAG AGC AAG GAA 199 Gin Glu Gin Lys Pro Lys Gin Glu Lys Glu Glu Ser Lys Ser Lys Glu 40 45 50
CAA GAA GAA AAC AAA AAA CAA AAG AGA TCT AGC TAT ATT TTT TGG GGA 247 Gin Glu Glu Asn Lys Lys Gin Lys Arg Ser Ser Tyr He Phe Trp Gly 55 60 65
TGT ATT ATT GGT TTG TGT ATA GTT GTT ATT ATT GCC AAA ATT ATT GCG 295 Cys He He Gly Leu Cys He Val Val He He Ala Lys He He Ala 70 75 80
TTT GGC GGA TCT AGT GAG GAG GCA AAA GCA GAC AAA CCA AAA AAC TCT 343 Phe Gly Gly Ser Ser Glu Glu Ala Lys Ala Asp Lys Pro Lys Asn Ser 85 90 95 TTA AGT ATG CTG AAA AAC TTT TAC CTA CCG ATA TTA TAAAAGATAA TCTTAA 395 Leu Ser Met Leu Lys Asn Phe Tyr Leu Pro He Leu 100 105 110
TAAC 399
(2) INFORMATION FOR SEQ ID NO: 1282:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 111 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1282:
Met Glu Asn Asp Val Lys Glu Asp Leu Glu Gin Ala Arg Pro Lys Leu
1 5 10 15
Glu Pro Glu Lys Gin Lys Gin Glu Pro Glu Glu Gin Lys Gin Glu Lys
20 25 30
Gin Asp Lys Gin Glu Gin Lys Pro Lys Gin Glu Lys Glu Glu Ser Lys
35 40 45
Ser Lys Glu Gin Glu Glu Asn Lys Lys Gin Lys Arg Ser Ser Tyr He
50 55 60
Phe Trp Gly Cys He He Gly Leu Cys He Val Val He He Ala Lys 65 70 75 80
He He Ala Phe Gly Gly Ser Ser Glu Glu Ala Lys Ala Asp Lys Pro
85 90 95
Lys Asn Ser Leu Ser Met Leu Lys Asn Phe Tyr Leu Pro He Leu 100 105 110
(2) INFORMATION FOR SEQ ID NO: 1283:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1627 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 21...1568 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1283:
TCTAAATCCT AATCTAACAA ATG AAA CAA AAG CTT AAA GCT CAA ATC AAA 50
Met Lys Gin Lys Leu Lys Ala Gin He Lys 1 5 10 GAG CGC ATG GCT TCT ATC GCT TAT AAT GAA AAA GGG TTT CCT AGC CCC 98 Glu Arg Met Ala Ser He Ala Tyr Asn Glu Lys Gly Phe Pro Ser Pro 15 20 25
TTT TTA TTT AAA GAC TTG AAA AAA GCC GCG CTC AAA ATC ATA GAA GCC 146 Phe Leu Phe Lys Asp Leu Lys Lys Ala Ala Leu Lys He He Glu Ala 30 35 40
ATG CGC ACA AAC ACA GAA ATT TTA GTG GTG GGC GAT TAT GAC GCT GAC 194 Met Arg Thr Asn Thr Glu He Leu Val Val Gly Asp Tyr Asp Ala Asp 45 50 55
GGC GTG ATT AGC TCT GCT ATC ATG GCA AAA TTT TTT GAA AGC CTG AAC 242 Gly Val He Ser Ser Ala He Met Ala Lys Phe Phe Glu Ser Leu Asn 60 65 70
TAT AAG CAT GTC CGC ATT GCA ATC CCT AAT CGC TTC ATG GAT GGC TAT 290 Tyr Lys His Val Arg He Ala He Pro Asn Arg Phe Met Asp Gly Tyr 75 80 85 90
GGG ATT TCT AAA AAA TTT TTA GAA AAA CAC CAC GCC CCT TTA ATC ATC 338 Gly He Ser Lys Lys Phe Leu Glu Lys His His Ala Pro Leu He He 95 100 105
ACG GTG GAT AAC GGG ATT AAC GCC TTT GAA GCC GCG CGA TTT TGT AAA 386 Thr Val Asp Asn Gly He Asn Ala Phe Glu Ala Ala Arg Phe Cys Lys 110 115 120
GAA AAA AAT TAC ACC CTT ATC ATC ACA GAT CAC CAT TGC TTA CAC CAT 434 Glu Lys Asn Tyr Thr Leu He He Thr Asp His His Cys Leu His His 125 130 135
GAT GAA GTC CCA GAC GCT TAT GCG GTG ATC AAC CCC AAG CAA CCG GAT 482 Asp Glu Val Pro Asp Ala Tyr Ala Val He Asn Pro Lys Gin Pro Asp 140 145 150
TGT GAT TTT ATC CAA AAG GAA GTG TGC GGG GCG TTG GTA GCG TTT TAT 530 Cys Asp Phe He Gin Lys Glu Val Cys Gly Ala Leu Val Ala Phe Tyr 155 160 165 170
TTG TGC TAT GGG ATC CAT CAG CTT TTA GGA AAA GAA AAA AGC CAT TCT 578 Leu Cys Tyr Gly He His Gin Leu Leu Gly Lys Glu Lys Ser His Ser 175 180 185
AGT GAG TTA TTA TGT TTA GCG GGC GTG GCG ACT ATT GCT GAC ATG ATG 626 Ser Glu Leu Leu Cys Leu Ala Gly Val Ala Thr He Ala Asp Met Met 190 195 200
CCT TTG ACT TTT TTT AAC CGC TTT TTA GTT TCT AAA GCC TTG TAT TTT 674 Pro Leu Thr Phe Phe Asn Arg Phe Leu Val Ser Lys Ala Leu Tyr Phe 205 210 215
TTG CAA AAA GAA TCC TTA GGG GCG ATG GGT TTT TTG CGC CAA AGA GAA 722 Leu Gin Lys Glu Ser Leu Gly Ala Met Gly Phe Leu Arg Gin Arg Glu 220 225 230 GTT TTT AGA AAA CGC TCT TTA AAA GCG AGT GAT ATT TCT TTT AAT ATC 770 Val Phe Arg Lys Arg Ser Leu Lys Ala Ser Asp He Ser Phe Asn He 235 240 245 250
GCC CCC TTA ATC AAC TCC GCA GGG CGC ATG CAA GAT GCG AAA ATG GCT 818 Ala Pro Leu He Asn Ser Ala Gly Arg Met Gin Asp Ala Lys Met Ala 255 260 265
TTA GAT TTT TTA AGC GCG AAT AAT TCT CAA GAT GGC TAT TCT TTG TAT 866 Leu Asp Phe Leu Ser Ala Asn Asn Ser Gin Asp Gly Tyr Ser Leu Tyr 270 275 280
GAA CGC TTG AAA GCA TGC AAT TTG AAG CGT AAA ATG ATC CAA CAG CAG 914 Glu Arg Leu Lys Ala Cys Asn Leu Lys Arg Lys Met He Gin Gin Gin 285 290 295
GTT TTT GAA GAA GCT TTT AAG CAT GCG ATG GTT GGA GAA AAA ATT ATC 962 Val Phe Glu Glu Ala Phe Lys His Ala Met Val Gly Glu Lys He He 300 305 310
GTC GCT TTT AAG GAC AAT TGG CAT GAG GGA GTG CTG GGG ATT GTG GCT 1010 Val Ala Phe Lys Asp Asn Trp His Glu Gly Val Leu Gly He Val Ala 315 320 325 330
TCA AAA TTA GTG GAA GCC ACT CAA AAG CCA AGC CTG GTT TTT ACC TTT 1058 Ser Lys Leu Val Glu Ala Thr Gin Lys Pro Ser Leu Val Phe Thr Phe 335 340 345
AAA GAA GGG GTG TAT AAA GGG AGC GCT CGT AGC TCT TCA AAC ATT GAC 1106 Lys Glu Gly Val Tyr Lys Gly Ser Ala Arg Ser Ser Ser Asn He Asp 350 355 360
TTG ATT GAC GCT TTG AAT GGG GTT TCT TCT TTA TTG CTC GGC TAT GGA 1154 Leu He Asp Ala Leu Asn Gly Val Ser Ser Leu Leu Leu Gly Tyr Gly 365 370 375
GGG CAT AGG CAA GCT TGC GGG TTG AGC GTT GAA AAA AAC AAT ATC ATC 1202 Gly His Arg Gin Ala Cys Gly Leu Ser Val Glu Lys Asn Asn He He 380 385 390
TCG CTC TTT GAA ACT TTA GAA AAT TTT GAT TTT AAA GTT TTA CCT TTT 1250 Ser Leu Phe Glu Thr Leu Glu Asn Phe Asp Phe Lys Val Leu Pro Phe 395 400 405 410
TGT AAA ACA GAG CCC CCT TTA ACG CTC AAA TTA AAA GAC ATT GAC AGA 1298 Cys Lys Thr Glu Pro Pro Leu Thr Leu Lys Leu Lys Asp He Asp Arg 415 420 425
GAG CTT TTA GAG ATT ATA GAA ATG GGC GAA CCT TAT GGG CAA GAA AAC 1346 Glu Leu Leu Glu He He Glu Met Gly Glu Pro Tyr Gly Gin Glu Asn 430 435 440
CCT GAA CCC CTA TTC CAA GCA AAA AAT TTA GAA GTC ATA GAA GAA AAA 1394 Pro Glu Pro Leu Phe Gin Ala Lys Asn Leu Glu Val He Glu Glu Lys 445 450 455 ATC ATT AAA GAA AGC CAC CAG GTT TTG CGT TTT AAG GAT AAA GAA TGC 1442 He He Lys Glu Ser His Gin Val Leu Arg Phe Lys Asp Lys Glu Cys 460 465 470
GTC AAA GAG GCT ATT TAT TTT AGC GCT GAG CGG TTT TTG AAA GCG GGC 1490 Val Lys Glu Ala He Tyr Phe Ser Ala Glu Arg Phe Leu Lys Ala Gly 475 480 485 490
GAA AAG GTG AGC GTG CTT TTT AGC GTG GAA TTA GAT GAG TGT TCT AAT 1538 Glu Lys Val Ser Val Leu Phe Ser Val Glu Leu Asp Glu Cys Ser Asn 495 500 505
GAG CCT AAA ATG TTT GTT AAA AGT TTG TTG TAGTGCCTTT TGTTGAAGAA GAA 1591 Glu Pro Lys Met Phe Val Lys Ser Leu Leu 510 515
TTTGAAATTT TAAAACCCAC CAAAGCCTTG TTTTTT 1627
(2) INFORMATION FOR SEQ ID NO: 1284:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 516 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1284:
Met Lys Gin Lys Leu Lys Ala Gin He Lys Glu Arg Met Ala Ser He
1 5 10 15
Ala Tyr Asn Glu Lys Gly Phe Pro Ser Pro Phe Leu Phe Lys Asp Leu
20 25 30
Lys Lys Ala Ala Leu Lys He He Glu Ala Met Arg Thr Asn Thr Glu
35 40 45
He Leu Val Val Gly Asp Tyr Asp Ala Asp Gly Val He Ser Ser Ala
50 55 60
He Met Ala Lys Phe Phe Glu Ser Leu Asn Tyr Lys His Val Arg He 65 70 75 80
Ala He Pro Asn Arg Phe Met Asp Gly Tyr Gly He Ser Lys Lys Phe
85 90 95
Leu Glu Lys His His Ala Pro Leu He He Thr Val Asp Asn Gly He
100 105 110
Asn Ala Phe Glu Ala Ala Arg Phe Cys Lys Glu Lys Asn Tyr Thr Leu
115 120 125
He He Thr Asp His His Cys Leu His His Asp Glu Val Pro Asp Ala
130 135 140
Tyr Ala Val He Asn Pro Lys Gin Pro Asp Cys Asp Phe He Gin Lys 145 150 155 160
Glu Val Cys Gly Ala Leu Val Ala Phe Tyr Leu Cys Tyr Gly He His
165 170 175
Gin Leu Leu Gly Lys Glu Lys Ser His Ser Ser Glu Leu Leu Cys Leu
180 185 190
Ala Gly Val Ala Thr He Ala Asp Met Met Pro Leu Thr Phe Phe Asn 195 200 205
Arg Phe Leu Val Ser Lys Ala Leu Tyr Phe Leu Gin Lys Glu Ser Leu
210 215 220
Gly Ala Met Gly Phe Leu Arg Gin Arg Glu Val Phe Arg Lys Arg Ser 225 230 235 240
Leu Lys Ala Ser Asp He Ser Phe Asn He Ala Pro Leu He Asn Ser
245 250 255
Ala Gly Arg Met Gin Asp Ala Lys Met Ala Leu Asp Phe Leu Ser Ala
260 265 270
Asn Asn Ser Gin Asp Gly Tyr Ser Leu Tyr Glu Arg Leu Lys Ala Cys
275 280 285
Asn Leu Lys Arg Lys Met He Gin Gin Gin Val Phe Glu Glu Ala Phe
290 295 300
Lys His Ala Met Val Gly Glu Lys He He Val Ala Phe Lys Asp Asn 305 310 315 320
Trp His Glu Gly Val Leu Gly He Val Ala Ser Lys Leu Val Glu Ala
325 330 335
Thr Gin Lys Pro Ser Leu Val Phe Thr Phe Lys Glu Gly Val Tyr Lys
340 345 350
Gly Ser Ala Arg Ser Ser Ser Asn He Asp Leu He Asp Ala Leu Asn
355 360 365
Gly Val Ser Ser Leu Leu Leu Gly Tyr Gly Gly His Arg Gin Ala Cys
370 375 380
Gly Leu Ser Val Glu Lys Asn Asn He He Ser Leu Phe Glu Thr Leu 385 390 395 400
Glu Asn Phe Asp Phe Lys Val Leu Pro Phe Cys Lys Thr Glu Pro Pro
405 410 415
Leu Thr Leu Lys Leu Lys Asp He Asp Arg Glu Leu Leu Glu He He
420 425 430
Glu Met Gly Glu Pro Tyr Gly Gin Glu Asn Pro Glu Pro Leu Phe Gin
435 440 445
Ala Lys Asn Leu Glu Val He Glu Glu Lys He He Lys Glu Ser His
450 455 460
Gin Val Leu Arg Phe Lys Asp Lys Glu Cys Val Lys Glu Ala He Tyr 465 470 475 480
Phe Ser Ala Glu Arg Phe Leu Lys Ala Gly Glu Lys Val Ser Val Leu
485 490 495
Phe Ser Val Glu Leu Asp Glu Cys Ser Asn Glu Pro Lys Met Phe Val
500 505 510
Lys Ser Leu Leu 515
(2) INFORMATION FOR SEQ ID NO: 1285:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 961 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...908 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1285:
ATCGTAATGA AATAATCACC ACCCCTATAA GCTTTGTAGC GACGGCTAAC ATG CTT 56
Met Leu
1
TTA GAA AGC GGT TAT ACA CCC GTA TTT GCT GGA ATT AAA AAC GAT GGC 104 Leu Glu Ser Gly Tyr Thr Pro Val Phe Ala Gly He Lys Asn Asp Gly 5 10 15
AAT ATA GAT GAA TTA GCC CTA GAA AAG CTC ATT AAC GAA AGA ACC AAA 152 Asn He Asp Glu Leu Ala Leu Glu Lys Leu He Asn Glu Arg Thr Lys 20 25 30
GCC ATA GTG AGC GTG GAT TAT GCC GGT AAA AGC GTG GAA GTA GAA AGC 200 Ala He Val Ser Val Asp Tyr Ala Gly Lys Ser Val Glu Val Glu Ser 35 40 45 50
GTT CAA AAG CTT TGC AAA AAG CAT TCT TTG AGC TTT CTT TCT GAC AGC 248 Val Gin Lys Leu Cys Lys Lys His Ser Leu Ser Phe Leu Ser Asp Ser 55 60 65
TCG CAT GCT CTA GGA AGC GAG TAT CAA AAC AAA AAA GTA GGA GGC TTT 296 Ser His Ala Leu Gly Ser Glu Tyr Gin Asn Lys Lys Val Gly Gly Phe 70 75 80
GCG TTA GCG AGC GTG TTT AGT TTC CAT GCC ATT AAG CCC ATC ACT ACG 344 Ala Leu Ala Ser Val Phe Ser Phe His Ala He Lys Pro He Thr Thr 85 90 95
GCT GAA GGG GGA GCG GTC GTT ACT AAC GAT AGC GAA TTG CAT GAA AAA 392 Ala Glu Gly Gly Ala Val Val Thr Asn Asp Ser Glu Leu His Glu Lys 100 105 110
ATG AAA TTG TTT CGC TCT CAT GGC ATG CTC AAA AAA GAT TTT TTT GAA 440 Met Lys Leu Phe Arg Ser His Gly Met Leu Lys Lys Asp Phe Phe Glu 115 120 125 130
GGC GAA GTC AAA AGC ATA GGG CAT AAC TTC CGC TTG AAT GAA ATC CAA 488 Gly Glu Val Lys Ser He Gly His Asn Phe Arg Leu Asn Glu He Gin 135 140 145
AGC GCT TTG GGT TTG AGC CAG CTT AAA AAA GCC CCC TTT TTA ATG CAA 536 Ser Ala Leu Gly Leu Ser Gin Leu Lys Lys Ala Pro Phe Leu Met Gin 150 155 160
AAA AGA GAA GAA GCC GCT CTA ACC TAT GAC AGG ATT TTT AAA GAT AAC 584 Lys Arg Glu Glu Ala Ala Leu Thr Tyr Asp Arg He Phe Lys Asp Asn 165 170 175
CCT TAT TTC ACC CCT TTA CAC CCC TTG TTA AAA GAT AAA AGC TCT AAC 632 Pro Tyr Phe Thr Pro Leu His Pro Leu Leu Lys Asp Lys Ser Ser Asn 180 185 190
CAC CTT TAT CCT ATT TTA ATG CAC CAA AAA TTT TTT ACA TGC AAA AAA 680 His Leu Tyr Pro He Leu Met His Gin Lys Phe Phe Thr Cys Lys Lys 195 200 205 210
CTC ATT TTA GAA AGT TTG CAC AAG CGT GGC ATT TTA GCC CAA GTG CAT 728 Leu He Leu Glu Ser Leu His Lys Arg Gly He Leu Ala Gin Val His 215 220 225
TAC AAG CCC ATT TAT CAA TAC CAA TTG TAC CAA CAG CTC TTC AAT ACA 776 Tyr Lys Pro He Tyr Gin Tyr Gin Leu Tyr Gin Gin Leu Phe Asn Thr 230 235 240
GCC CCA TTA AAA AGC GCA GAG GAT TTC TAT CAC GCT GAA ATT TCC TTG 824 Ala Pro Leu Lys Ser Ala Glu Asp Phe Tyr His Ala Glu He Ser Leu 245 250 255
CCT TGT CAT GCG AAT TTA AAT TTA GAG AGC GTT CAA AAC ATC GCT CAT 872 Pro Cys His Ala Asn Leu Asn Leu Glu Ser Val Gin Asn He Ala His 260 265 270
AGC GTT TTA AAA ACT TTT GAG AGT TTT AAA ATA GAA TGAGTTTCAT TTAGGG 924 Ser Val Leu Lys Thr Phe Glu Ser Phe Lys He Glu 275 280 285
CTTCAAATCT TAATCATTAA GAATGGTGCG GAAGAAA 961
(2) INFORMATION FOR SEQ ID NO: 1286:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 286 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1286:
Met Leu Leu Glu Ser Gly Tyr Thr Pro Val Phe Ala Gly He Lys Asn
1 5 10 15
Asp Gly Asn He Asp Glu Leu Ala Leu Glu Lys Leu He Asn Glu Arg
20 25 30
Thr Lys Ala He Val Ser Val Asp Tyr Ala Gly Lys Ser Val Glu Val
35 40 45
Glu Ser Val Gin Lys Leu Cys Lys Lys His Ser Leu Ser Phe Leu Ser
50 55 60
Asp Ser Ser His Ala Leu Gly Ser Glu Tyr Gin Asn Lys Lys Val Gly 65 70 75 80
Gly Phe Ala Leu Ala Ser Val Phe Ser Phe His Ala He Lys Pro He
85 90 95
Thr Thr Ala Glu Gly Gly Ala Val Val Thr Asn Asp Ser Glu Leu His 100 105 110 Glu Lys Met Lys Leu Phe Arg Ser His Gly Met Leu Lys Lys Asp Phe
115 120 125
Phe Glu Gly Glu Val Lys Ser He Gly His Asn Phe Arg Leu Asn Glu
130 135 140
He Gin Ser Ala Leu Gly Leu Ser Gin Leu Lys Lys Ala Pro Phe Leu 145 150 155 160
Met Gin Lys Arg Glu Glu Ala Ala Leu Thr Tyr Asp Arg He Phe Lys
165 170 175
Asp Asn Pro Tyr Phe Thr Pro Leu His Pro Leu Leu Lys Asp Lys Ser
180 185 190
Ser Asn His Leu Tyr Pro He Leu Met His Gin Lys Phe Phe Thr Cys
195 200 205
Lys Lys Leu He Leu Glu Ser Leu His Lys Arg Gly He Leu Ala Gin
210 215 220
Val His Tyr Lys Pro He Tyr Gin Tyr Gin Leu Tyr Gin Gin Leu Phe 225 230 235 240
Asn Thr Ala Pro Leu Lys Ser Ala Glu Asp Phe Tyr His Ala Glu He
245 250 255
Ser Leu Pro Cys His Ala Asn Leu Asn Leu Glu Ser Val Gin Asn He
260 265 270
Ala His Ser Val Leu Lys Thr Phe Glu Ser Phe Lys He Glu 275 280 285
(2) INFORMATION FOR SEQ ID NO: 1287:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 728 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 15...692 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1287:
TAATGGGCCT TTGA ATG CGT TTT GTC TAT CAC CCT TTA GCC AAA GAG CCT 50 Met Arg Phe Val Tyr His Pro Leu Ala Lys Glu Pro 1 5 10
GTT TTA AAA ATA GAA GGC GAG AGT TAT ACG CAT TTA TAC CGA TCA AGG 98 Val Leu Lys He Glu Gly Glu Ser Tyr Thr His Leu Tyr Arg Ser Arg 15 20 25
CGT GTC AAA AGT GCG AGT CGT TTG GAT TTG AGA AAT TTA AAA GAC GGC 146 Arg Val Lys Ser Ala Ser Arg Leu Asp Leu Arg Asn Leu Lys Asp Gly 30 35 40
TTT TTA TAC ACC TAT GAG CAT GCA GAA ATC ACT AAA AAA CAC GCC CTT 194 Phe Leu Tyr Thr Tyr Glu His Ala Glu He Thr Lys Lys His Ala Leu 45 50 55 60
TTA AAG CTA GTG GGC GCG CGA TTA TTA GAG GTT ATG GCC AGT AAA AAA 242 Leu Lys Leu Val Gly Ala Arg Leu Leu Glu Val Met Ala Ser Lys Lys 65 70 75
ACG CAT TTG ATT TTA AGC GTG ATT GAA ATC AAA AAC ATT GAA AAA ATC 290 Thr His Leu He Leu Ser Val He Glu He Lys Asn He Glu Lys He 80 85 90
CTA CCC TTT TTA AAT CAG TTA GGC GTG AGC AAG TTG AGT TTA TTC TAT 338 Leu Pro Phe Leu Asn Gin Leu Gly Val Ser Lys Leu Ser Leu Phe Tyr 95 100 105
GCG GAT TTT AGC CAA CGC AAT GAA AAA ATA GAC ATC GCT AAA TTA GAG 386 Ala Asp Phe Ser Gin Arg Asn Glu Lys He Asp He Ala Lys Leu Glu 110 115 120
CGC TTT CAA AAG ATT TTG ATC CAT TCT TGC GAG CAG TGT GGT AGG AGT 434 Arg Phe Gin Lys He Leu He His Ser Cys Glu Gin Cys Gly Arg Ser 125 130 135 140
GCT TTA ATG GAA TTG GAA GTG TTT TCA AAC ACT AAA GAG GCG CTA AAA 482 Ala Leu Met Glu Leu Glu Val Phe Ser Asn Thr Lys Glu Ala Leu Lys 145 150 155
GCC TAT CCT AAG GCG AGC GTT TTG GAT TTT AAG GGC GAA ACC TTA NCC 530 Ala Tyr Pro Lys Ala Ser Val Leu Asp Phe Lys Gly Glu Thr Leu Xaa 160 165 170
GCA AGC GCG GAT TTT GAA AAG GGC GTT ATC ATA GGG CCT GAG GGG GGC 578 Ala Ser Ala Asp Phe Glu Lys Gly Val He He Gly Pro Glu Gly Gly 175 180 185
TTT AGC GAA CCA GAA AGA GGG TAT TTT AAA GAG CGT GAA ATT TAT CGC 626 Phe Ser Glu Pro Glu Arg Gly Tyr Phe Lys Glu Arg Glu He Tyr Arg 190 195 200
ATC CCG TTA GAT ATG GTG CTA AAG TCT GAG AGT GCA TGC GTG TTT GTA 674 He Pro Leu Asp Met Val Leu Lys Ser Glu Ser Ala Cys Val Phe Val 205 210 215 220
GCG AGT ATC GCA CAA GTT TAGGGGGTTA TTGGGGATTT TAAATCCTAA AAAATC 72! Ala Ser He Ala Gin Val 225
(2) INFORMATION FOR SEQ ID NO: 1288
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 226 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1288:
Met Arg Phe Val Tyr His Pro Leu Ala Lys Glu Pro Val Leu Lys He
1 5 10 15
Glu Gly Glu Ser Tyr Thr His Leu Tyr Arg Ser Arg Arg Val Lys Ser
20 25 30
Ala Ser Arg Leu Asp Leu Arg Asn Leu Lys Asp Gly Phe Leu Tyr Thr
35 40 45
Tyr Glu His Ala Glu He Thr Lys Lys His Ala Leu Leu Lys Leu Val
50 55 60
Gly Ala Arg Leu Leu Glu Val Met Ala Ser Lys Lys Thr His Leu He 65 70 75 80
Leu Ser Val He Glu He Lys Asn He Glu Lys He Leu Pro Phe Leu
85 90 95
Asn Gin Leu Gly Val Ser Lys Leu Ser Leu Phe Tyr Ala Asp Phe Ser
100 105 110
Gin Arg Asn Glu Lys He Asp He Ala Lys Leu Glu Arg Phe Gin Lys
115 120 125
He Leu He His Ser Cys Glu Gin Cys Gly Arg Ser Ala Leu Met Glu
130 135 140
Leu Glu Val Phe Ser Asn Thr Lys Glu Ala Leu Lys Ala Tyr Pro Lys 145 150 155 160
Ala Ser Val Leu Asp Phe Lys Gly Glu Thr Leu Xaa Ala Ser Ala Asp
165 170 175
Phe Glu Lys Gly Val He He Gly Pro Glu Gly Gly Phe Ser Glu Pro
180 185 190
Glu Arg Gly Tyr Phe Lys Glu Arg Glu He Tyr Arg He Pro Leu Asp
195 200 205
Met Val Leu Lys Ser Glu Ser Ala Cys Val Phe Val Ala Ser He Ala
210 215 220
Gin Val 225
(2) INFORMATION FOR SEQ ID NO: 1289:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 888 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 13...840 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1289: TTATGAAATT GA ATG ACC CTT TCG CAA GCC CTA AAC AAA GCC AAA AAA GGA 51 Met Thr Leu Ser Gin Ala Leu Asn Lys Ala Lys Lys Gly 1 5 10
TTA TCG CAA AAA GGT TTT AGG GGG GGC TTA GAA TCT GAA ATT TTA TTA 99 Leu Ser Gin Lys Gly Phe Arg Gly Gly Leu Glu Ser Glu He Leu Leu 15 20 25
GGC TTT GTC TTG CAA AAA GAA AGG GTT TTT TTG CAC ACG CAT GCC TAT 147 Gly Phe Val Leu Gin Lys Glu Arg Val Phe Leu His Thr His Ala Tyr 30 35 40 45
TTA GAG TTA AAC CAC GAA GAA GAG GTG CGT TTT TTT GAA TTG GTA GAA 195 Leu Glu Leu Asn His Glu Glu Glu Val Arg Phe Phe Glu Leu Val Glu 50 55 60
AAG CGC TTG AAT AAC TGC CCC ATA GAG TAT TTA TTA GAA AGC TGT GAT 243 Lys Arg Leu Asn Asn Cys Pro He Glu Tyr Leu Leu Glu Ser Cys Asp 65 70 75
TTT TAT GGG CGC TCT TTT TTT GTG AAT GAG CAT GTT TTA ATC CCA CGA 291 Phe Tyr Gly Arg Ser Phe Phe Val Asn Glu His Val Leu He Pro Arg 80 85 90
CCT GAA ACC GAG ATT TTG GTC CAA AAA GCC CTT GAT ATT ATT TCT CAA 339 Pro Glu Thr Glu He Leu Val Gin Lys Ala Leu Asp He He Ser Gin 95 100 105
TAC CAT TTA AAA GAG ATA GGC GAA ATC GGC ATA GGG AGC GGA TGC GTG 387 Tyr His Leu Lys Glu He Gly Glu He Gly He Gly Ser Gly Cys Val 110 115 120 125
TCT GTG AGT TTG GCT TTA GAA AAC CCT AAT CTC TCT ATT TAT GCG AGC 435 Ser Val Ser Leu Ala Leu Glu Asn Pro Asn Leu Ser He Tyr Ala Ser 130 135 140
GAT ATT TCA CCA AAC GCT TTA GAA GTG GCG TCC AAA AAT ATT GAG CAC 483 Asp He Ser Pro Asn Ala Leu Glu Val Ala Ser Lys Asn He Glu His 145 150 155
TTT TGT CTA AAA GAG CGT GTT TTT TTA AAA CAA ACA CGC CTT TGG GAT 531 Phe Cys Leu Lys Glu Arg Val Phe Leu Lys Gin Thr Arg Leu Trp Asp 160 165 170
CAT ATG CCC ATG ATA GAA ATG CTT GTC TCT AAC CCG CCC TAT ATC GCT 579 His Met Pro Met He Glu Met Leu Val Ser Asn Pro Pro Tyr He Ala 175 180 185
AGA AAT TAT CCT TTG GAA AAA TCC GTC CTC AAA GAA CCG CAT GAA GCC 627 Arg Asn Tyr Pro Leu Glu Lys Ser Val Leu Lys Glu Pro His Glu Ala 190 195 200 205
CTT TTT GGG GGG GTT AAA GGC GAT GAG ATC TTA AAA GAA ATC GTT TTT 675 Leu Phe Gly Gly Val Lys Gly Asp Glu He Leu Lys Glu He Val Phe 210 215 220 TTA GCC GCT AAA TTA AAA ATC CCT TTT TTG GTT TGT GAA ATG GGG TAT 723 Leu Ala Ala Lys Leu Lys He Pro Phe Leu Val Cys Glu Met Gly Tyr 225 230 235
GAC CAG TTG AAA AGC TTG AAA GAA TGC TTG GAA TTT TGC GGT TAT GAT 771 Asp Gin Leu Lys Ser Leu Lys Glu Cys Leu Glu Phe Cys Gly Tyr Asp 240 245 250
GCA GAG TTT TAC AAG GAT TTG AGC GGC TTT GAT AGA GGG TTT GTG GGC 819 Ala Glu Phe Tyr Lys Asp Leu Ser Gly Phe Asp Arg Gly Phe Val Gly 255 260 265
GTT TTA AAA AGT TTT TTA AGA TAAATTAAAA CTTAATTACC CTTTTAGTGT TACA 874 Val Leu Lys Ser Phe Leu Arg 270 275
ATAAAAACAC TTAA 88i
(2) INFORMATION FOR SEQ ID NO: 1290:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 276 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1290:
Met Thr Leu Ser Gin Ala Leu Asn Lys Ala Lys Lys Gly Leu Ser Gin
1 5 10 15
Lys Gly Phe Arg Gly Gly Leu Glu Ser Glu He Leu Leu Gly Phe Val
20 25 30
Leu Gin Lys Glu Arg Val Phe Leu His Thr His Ala Tyr Leu Glu Leu
35 40 45
Asn His Glu Glu Glu Val Arg Phe Phe Glu Leu Val Glu Lys Arg Leu
50 55 60
Asn Asn Cys Pro He Glu Tyr Leu Leu Glu Ser Cys Asp Phe Tyr Gly 65 70 75 80
Arg Ser Phe Phe Val Asn Glu His Val Leu He Pro Arg Pro Glu Thr
85 90 95
Glu He Leu Val Gin Lys Ala Leu Asp He He Ser Gin Tyr His Leu
100 105 110
Lys Glu He Gly Glu He Gly He Gly Ser Gly Cys Val Ser Val Ser
115 120 125
Leu Ala Leu Glu Asn Pro Asn Leu Ser He Tyr Ala Ser Asp He Ser
130 135 140
Pro Asn Ala Leu Glu Val Ala Ser Lys Asn He Glu His Phe Cys Leu 145 150 155 160
Lys Glu Arg Val Phe Leu Lys Gin Thr Arg Leu Trp Asp His Met Pro
165 170 175
Met He Glu Met Leu Val Ser Asn Pro Pro Tyr He Ala Arg Asn Tyr
180 185 190
Pro Leu Glu Lys Ser Val Leu Lys Glu Pro His Glu Ala Leu Phe Gly 195 200 205
Gly Val Lys Gly Asp Glu He Leu Lys Glu He Val Phe Leu Ala Ala
210 215 220
Lys Leu Lys He Pro Phe Leu Val Cys Glu Met Gly Tyr Asp Gin Leu 225 230 235 240
Lys Ser Leu Lys Glu Cys Leu Glu Phe Cys Gly Tyr Asp Ala Glu Phe
245 250 255
Tyr Lys Asp Leu Ser Gly Phe Asp Arg Gly Phe Val Gly Val Leu Lys
260 265 270
Ser Phe Leu Arg 275
(2) INFORMATION FOR SEQ ID NO: 1291:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1026 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 28...960 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1291:
CCATCTCAAA ATAAGGACGC CTAAATC ATG GCA GAA AAA ACA GCT AAC GAT TTA 54
Met Ala Glu Lys Thr Ala Asn Asp Leu 1 5
AAA CTA AGT GAG ATA GAA CTC GTG GAT TTT CGT ATT TAT GGC ATG CAA 102 Lys Leu Ser Glu He Glu Leu Val Asp Phe Arg He Tyr Gly Met Gin 10 15 20 25
GAG GGC GTC CCT TAT GAG GGG ATT TAT GGT ATC AAT GTG GCT AAA GTC 150 Glu Gly Val Pro Tyr Glu Gly He Tyr Gly He Asn Val Ala Lys Val 30 35 40
CAA GAA ATC ATC CCC ATG CCC ACC CTT TTT GAA TAC CCC ACG AAT TTG 198 Gin Glu He He Pro Met Pro Thr Leu Phe Glu Tyr Pro Thr Asn Leu 45 50 55
GAT TAC ATT ATC GGC GTG TTT GAT TTG CGC TCC ATA ATC ATT CCG CTT 246 Asp Tyr He He Gly Val Phe Asp Leu Arg Ser He He He Pro Leu 60 65 70
ATA GAC TTG GCT AAA TGG ATA GGG ATT ATC CCA GAT AAA AGC AAG GAA 294 He Asp Leu Ala Lys Trp He Gly He He Pro Asp Lys Ser Lys Glu 75 80 85 AAC GAA AAA ATC GTC ATT ATC ACT GAA TTT AAC AAC GTT AAA ATG GGT 342 Asn Glu Lys He Val He He Thr Glu Phe Asn Asn Val Lys Met Gly 90 95 100 105
TTT TTA GTC CAT TCG GCT AGG CGT ATC AGG CGC ATT AGC TGG AAA GAT 390 Phe Leu Val His Ser Ala Arg Arg He Arg Arg He Ser Trp Lys Asp 110 115 120
GTG GAG CCT GCA TCC TTT AGC GCC TCT AAT AGC ATC AAT AAA GAA AAT 438 Val Glu Pro Ala Ser Phe Ser Ala Ser Asn Ser He Asn Lys Glu Asn 125 130 135
ATT ACC GGC ACG ACA CGC ATT GAA AAC GAC AAA ACC CTG CTC ATT TTG 486 He Thr Gly Thr Thr Arg He Glu Asn Asp Lys Thr Leu Leu He Leu 140 145 150
GAT TTA GAA AGC ATT TTA GAC GAT TTA AAG CTT AAT GAA GAC GCT AAA 534 Asp Leu Glu Ser He Leu Asp Asp Leu Lys Leu Asn Glu Asp Ala Lys 155 160 165
AAC GCT AAA GAT ACC CAT AAA GAG CGT TTT GAA GGC GAA GTG TTG TTT 582 Asn Ala Lys Asp Thr His Lys Glu Arg Phe Glu Gly Glu Val Leu Phe 170 175 180 185
TTA GAC GAT AGC AAG ACC GCA AGA AAA ACC TTA AAA AAC CAT TTG AGC 630 Leu Asp Asp Ser Lys Thr Ala Arg Lys Thr Leu Lys Asn His Leu Ser 190 195 200
AAA TTA GGT TTT AGC ATC ACT GAA GCT GTG GAT GGG GAA GAC GGG TTG 678 Lys Leu Gly Phe Ser He Thr Glu Ala Val Asp Gly Glu Asp Gly Leu 205 210 215
AAC AAA TTA GAA ATG TTA TTC AAA AAA TAC GGG GAC GAT TTG AGA AAG 726 Asn Lys Leu Glu Met Leu Phe Lys Lys Tyr Gly Asp Asp Leu Arg Lys 220 225 230
CAT TTG AAA TTC ATT ATT TCA GAT GTT GAA ATG CCT AAA ATG GAT GGC 774 His Leu Lys Phe He He Ser Asp Val Glu Met Pro Lys Met Asp Gly 235 240 245
TAT CAT TTC TTA TTC AAG CTC CAA AAA GAC CCT AGG TTT GCT TAT ATT 822 Tyr His Phe Leu Phe Lys Leu Gin Lys Asp Pro Arg Phe Ala Tyr He 250 255 260 265
CCT GTG ATT TTT AAT TCT TCT ATT TGC GAT AAT TAC AGC GCT GAA AGG 870 Pro Val He Phe Asn Ser Ser He Cys Asp Asn Tyr Ser Ala Glu Arg 270 275 280
GCT AAA GAA ATG GGG GCT GTA GCG TAT TTA GTC AAG TTT GAC GCA GAA 918 Ala Lys Glu Met Gly Ala Val Ala Tyr Leu Val Lys Phe Asp Ala Glu 285 290 295
AAA TTC ACC GAA GAA ATT TCT AAG ATT TTA GAC AAG AAT GCG TAATTCTTT 969 Lys Phe Thr Glu Glu He Ser Lys He Leu Asp Lys Asn Ala 300 305 310 TTATAAAATT GTAAAATACT CTTATCTCAA ACGCTAAAAA GGGGTTTTAA ATGGATG 1026 (2) INFORMATION FOR SEQ ID NO: 1292:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 311 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1292:
Met Ala Glu Lys Thr Ala Asn Asp Leu Lys Leu Ser Glu He Glu Leu
1 5 10 15
Val Asp Phe Arg He Tyr Gly Met Gin Glu Gly Val Pro Tyr Glu Gly
20 25 30
He Tyr Gly He Asn Val Ala Lys Val Gin Glu He He Pro Met Pro
35 40 45
Thr Leu Phe Glu Tyr Pro Thr Asn Leu Asp Tyr He He Gly Val Phe
50 55 60
Asp Leu Arg Ser He He He Pro Leu He Asp Leu Ala Lys Trp He 65 70 75 80
Gly He He Pro Asp Lys Ser Lys Glu Asn Glu Lys He Val He He
85 90 95
Thr Glu Phe Asn Asn Val Lys Met Gly Phe Leu Val His Ser Ala Arg
100 105 110
Arg He Arg Arg He Ser Trp Lys Asp Val Glu Pro Ala Ser Phe Ser
115 120 125
Ala Ser Asn Ser He Asn Lys Glu Asn He Thr Gly Thr Thr Arg He
130 135 140
Glu Asn Asp Lys Thr Leu Leu He Leu Asp Leu Glu Ser He Leu Asp 145 150 155 160
Asp Leu Lys Leu Asn Glu Asp Ala Lys Asn Ala Lys Asp Thr His Lys
165 170 175
Glu Arg Phe Glu Gly Glu Val Leu Phe Leu Asp Asp Ser Lys Thr Ala
180 185 190
Arg Lys Thr Leu Lys Asn His Leu Ser Lys Leu Gly Phe Ser He Thr
195 200 205
Glu Ala Val Asp Gly Glu Asp Gly Leu Asn Lys Leu Glu Met Leu Phe
210 215 220
Lys Lys Tyr Gly Asp Asp Leu Arg Lys His Leu Lys Phe He He Ser 225 230 235 240
Asp Val Glu Met Pro Lys Met Asp Gly Tyr His Phe Leu Phe Lys Leu
245 250 255
Gin Lys Asp Pro Arg Phe Ala Tyr He Pro Val He Phe Asn Ser Ser
260 265 270
He Cys Asp Asn Tyr Ser Ala Glu Arg Ala Lys Glu Met Gly Ala Val
275 280 285
Ala Tyr Leu Val Lys Phe Asp Ala Glu Lys Phe Thr Glu Glu He Ser
290 295 300
Lys He Leu Asp Lys Asn Ala 305 310 (2) INFORMATION FOR SEQ ID NO:1293:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 753 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 32...697 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1293:
GATCTATAAA GAATAGCCAT AAAGAAGAAT T ATG TTA GAT TAT CGC CAA AAA 52
Met Leu Asp Tyr Arg Gin Lys 1 5
ATT GAT GCT CTC ATC ACC AAA ATA GAA AAG GCT CGC ACC GCC TAT TCA 100 He Asp Ala Leu He Thr Lys He Glu Lys Ala Arg Thr Ala Tyr Ser 10 15 20
AGG CAC CAC ATT GTC AAA ATC GTG GCT GTT TCA AAA AAC GCT TCC CCA 148 Arg His His He Val Lys He Val Ala Val Ser Lys Asn Ala Ser Pro 25 30 35
GAA GCT ATC CAA CAT TAT TAT AAC TGC TCT CAA AGG GCT TTT GGA GAA 196 Glu Ala He Gin His Tyr Tyr Asn Cys Ser Gin Arg Ala Phe Gly Glu 40 45 50 55
AAT AAA GTT CAA GAT TTA AAA ACT AAA ATG CAT TCT TTA GAG CAT TTG 244 Asn Lys Val Gin Asp Leu Lys Thr Lys Met His Ser Leu Glu His Leu 60 65 70
CCC CTT GAA TGG CAC ATG ATA GGC TCT TTA CAA GAA AAT AAA ATC AAT 292 Pro Leu Glu Trp His Met He Gly Ser Leu Gin Glu Asn Lys He Asn 75 80 85
GCG CTT TTG AGT TTA AAG CCC GCT CTT TTG CAT TCT TTA GAC TCT TTA 340 Ala Leu Leu Ser Leu Lys Pro Ala Leu Leu His Ser Leu Asp Ser Leu 90 95 100
AAA CTC GCT TTG AAA ATA GAA AAG CGT TGC GAA ATA TTG GGC GTC AAT 388 Lys Leu Ala Leu Lys He Glu Lys Arg Cys Glu He Leu Gly Val Asn 105 110 115
TTA AAC GCT CTT TTA CAG GTT AAT AGC GCG TAT GAG GAA AGT AAA AGC 436 Leu Asn Ala Leu Leu Gin Val Asn Ser Ala Tyr Glu Glu Ser Lys Ser 120 125 130 135 GGG GTG GTG CCT GAA GAA GCG CTA GAA ATT TAT TCT CAA ATC AGT GAA 484 Gly Val Val Pro Glu Glu Ala Leu Glu He Tyr Ser Gin He Ser Glu 140 145 150
ACT TGC AAG CAC CTC AAG CTT AAG GGG CTT ATG TGT ATA GGG GCA CAC 532 Thr Cys Lys His Leu Lys Leu Lys Gly Leu Met Cys He Gly Ala His 155 160 165
ACA GAT GAT GAA AAG GAA ATT GAA AAA TCC TTT ATC ACC ACC AAA AAG 580 Thr Asp Asp Glu Lys Glu He Glu Lys Ser Phe He Thr Thr Lys Lys 170 175 180
CTT TTT GAC CAA ATA AAG AAT GCG AGC GTT CTT TCA ATG GGC ATG AGT 628 Leu Phe Asp Gin He Lys Asn Ala Ser Val Leu Ser Met Gly Met Ser 185 190 195
GAT GAT TTT GAA TTA GCG ATT GCT TGC GGG GCG AAT CTT TTA AGG ATT 676 Asp Asp Phe Glu Leu Ala He Ala Cys Gly Ala Asn Leu Leu Arg He 200 205 210 215
GGC TCT TTT TTG TTC AAA GAG TAAGATGCTA GAAACTTATG CACTTAAAAA TGGG 731 Gly Ser Phe Leu Phe Lys Glu 220
GCTGTTTTTA TCTCTGATGC GC 753
(2) INFORMATION FOR SEQ ID NO: 1294:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 222 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1294:
Met Leu Asp Tyr Arg Gin Lys He Asp Ala Leu He Thr Lys He Glu
1 5 10 15
Lys Ala Arg Thr Ala Tyr Ser Arg His His He Val Lys He Val Ala
20 25 30
Val Ser Lys Asn Ala Ser Pro Glu Ala He Gin His Tyr Tyr Asn Cys
35 40 45
Ser Gin Arg Ala Phe Gly Glu Asn Lys Val Gin Asp Leu Lys Thr Lys
50 55 60
Met His Ser Leu Glu His Leu Pro Leu Glu Trp His Met He Gly Ser 65 70 75 80
Leu Gin Glu Asn Lys He Asn Ala Leu Leu Ser Leu Lys Pro Ala Leu
85 90 95
Leu His Ser Leu Asp Ser Leu Lys Leu Ala Leu Lys He Glu Lys Arg
100 105 110
Cys Glu He Leu Gly Val Asn Leu Asn Ala Leu Leu Gin Val Asn Ser
115 120 125
Ala Tyr Glu Glu Ser Lys Ser Gly Val Val Pro Glu Glu Ala Leu Glu 130 135 140
He Tyr Ser Gin He Ser Glu Thr Cys Lys His Leu Lys Leu Lys Gly 145 150 155 160
Leu Met Cys He Gly Ala His Thr Asp Asp Glu Lys Glu He Glu Lys
165 170 175
Ser Phe He Thr Thr Lys Lys Leu Phe Asp Gin He Lys Asn Ala Ser
180 185 190
Val Leu Ser Met Gly Met Ser Asp Asp Phe Glu Leu Ala He Ala Cys
195 200 205
Gly Ala Asn Leu Leu Arg He Gly Ser Phe Leu Phe Lys Glu 210 215 220
(2) INFORMATION FOR SEQ ID NO: 1295:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1633 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...1593 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1295:
TTTTAAATTC AAAGGATAAA A ATG TAT CAA GTA GCC ATT TGC GAC CCC ATC 51
Met Tyr Gin Val Ala He Cys Asp Pro He 1 5 10
CAT GCT AAA GGC ATT CAA ATT TTA GAA GCT CAA AAA GAC ATT GTC TTG 99 His Ala Lys Gly He Gin He Leu Glu Ala Gin Lys Asp He Val Leu 15 20 25
CAT GAT TAT TCC AAA TGC CCT AAA AAG GAG CTT TTA GAA AAA CTC ACT 147 His Asp Tyr Ser Lys Cys Pro Lys Lys Glu Leu Leu Glu Lys Leu Thr 30 35 40
CCC ATG GAT GCG CTC ATC ACT CGC AGC ATG ACC CCT ATC ACA AGC GAT 195 Pro Met Asp Ala Leu He Thr Arg Ser Met Thr Pro He Thr Ser Asp 45 50 55
TTT TTA AAG CCC TTA ACC CAC TTA AAA TCC ATC GTG AGA GCG GGC GTG 243 Phe Leu Lys Pro Leu Thr His Leu Lys Ser He Val Arg Ala Gly Val 60 65 70
GGA GTG GAT AAT ATT GAT TTA GAA AGC TGC TCT CAA AAA GGG ATT GTA 291 Gly Val Asp Asn He Asp Leu Glu Ser Cys Ser Gin Lys Gly He Val 75 80 85 90 GTG ATG AAT ATC CCT ACC GCT AAC ACG ATT GCC GCT GTG GAA TTG ACC 339 Val Met Asn He Pro Thr Ala Asn Thr He Ala Ala Val Glu Leu Thr 95 100 105
ATG GCG CAT TTG ATC AAT GCA GTG CGT TCG TTC CCT TGT GCA AAC GAT 387 Met Ala His Leu He Asn Ala Val Arg Ser Phe Pro Cys Ala Asn Asp 110 115 120
CAA ATC AAA CAC CAA AGG TTA TGG AAA AGA GAA GAT TGG TAT GGC ACG 435 Gin He Lys His Gin Arg Leu Trp Lys Arg Glu Asp Trp Tyr Gly Thr 125 130 135
GAA TTG AAA AAT AAA AAG CTG GGC ATC ATT GGT TTT GGG AAT ATT GGC 483 Glu Leu Lys Asn Lys Lys Leu Gly He He Gly Phe Gly Asn He Gly 140 145 150
TCT AGG GTG GGC ATT AGA GCA AAA GCC TTT GAA ATG GAA GTT CTA GCC 531 Ser Arg Val Gly He Arg Ala Lys Ala Phe Glu Met Glu Val Leu Ala 155 160 165 170
TAT GAT CCT TAT ATC CCT TCT TCA AAA GCC ACT GAT TTA GGA GTC ATT 579 Tyr Asp Pro Tyr He Pro Ser Ser Lys Ala Thr Asp Leu Gly Val He 175 180 185
TAC ACG AAA AAT TTT GAA GAC ATT TTG CAA TGC GAT ATG ATC ACT ATC 627 Tyr Thr Lys Asn Phe Glu Asp He Leu Gin Cys Asp Met He Thr He 190 195 200
CAC ACC CCT AAA AAT AAA GAA ACG ATT AAC ATG ATA GGT GCT AAA GAG 675 His Thr Pro Lys Asn Lys Glu Thr He Asn Met He Gly Ala Lys Glu 205 210 215
ATT GAG CGC ATG AAA AAA GGG GTT ATT TTG CTC AAT TGC GCT AGG GGT 723 He Glu Arg Met Lys Lys Gly Val He Leu Leu Asn Cys Ala Arg Gly 220 225 230
GGG CTT TAT AAT GAA GAC GCT CTT TAT GAG GCT TTA GAA ACC AAA AAA 771 Gly Leu Tyr Asn Glu Asp Ala Leu Tyr Glu Ala Leu Glu Thr Lys Lys 235 240 245 250
GTG CGT TGG CTT GGC ATT GAT GTC TTT TCT AAA GAG CCT GGC ATT CAC 819 Val Arg Trp Leu Gly He Asp Val Phe Ser Lys Glu Pro Gly He His 255 260 265
AAC AAG CTT TTA GAC TTG CCC AAT GTT TAT GCG ACC CCC CAT ATT GGC 867 Asn Lys Leu Leu Asp Leu Pro Asn Val Tyr Ala Thr Pro His He Gly 270 275 280
GCA AAC ACT TTA GAA TCC CAA GAA GAA ATT TCC AAA CAA GCC GCT CAA 915 Ala Asn Thr Leu Glu Ser Gin Glu Glu He Ser Lys Gin Ala Ala Gin 285 290 295
GGG GTT ATG GAA TCT TTA AGG GGT TCA AGC CAC CCG CAT GCT TTG AAT 963 Gly Val Met Glu Ser Leu Arg Gly Ser Ser His Pro His Ala Leu Asn 300 305 310 TTA CCC ATG CAA GCT TTT GAC GCG AGC GCA AAA GCC TAC TTG AAT TTA 1011 Leu Pro Met Gin Ala Phe Asp Ala Ser Ala Lys Ala Tyr Leu Asn Leu 315 320 325 330
GCG CAA AAA TTG GGT TAT TTT TCC AGT CAA ATC CAT AAG GGC GTG TGC 1059 Ala Gin Lys Leu Gly Tyr Phe Ser Ser Gin He His Lys Gly Val Cys 335 340 345
CAA AAA ATT GAG CTC AGT CTT TGT GGG GAG ATC AAC CAA TTT AAA GAC 1107 Gin Lys He Glu Leu Ser Leu Cys Gly Glu He Asn Gin Phe Lys Asp 350 355 360
GCT CTT GTA GCC TTT ATG TTA GTG GGG GTG TTA AAA CCT GTT GTA GGG 1155 Ala Leu Val Ala Phe Met Leu Val Gly Val Leu Lys Pro Val Val Gly 365 370 375
GAT AAA ATC AAT TAC ATT AAC GCC CCC TTT GTG GCC AAA GAA AGA GGT 1203 Asp Lys He Asn Tyr He Asn Ala Pro Phe Val Ala Lys Glu Arg Gly 380 385 390
ATT GAG ATT AAG GTT AGC CTT AAA GAA AGC GCT TCG CCC TAT AAG AAC 1251 He Glu He Lys Val Ser Leu Lys Glu Ser Ala Ser Pro Tyr Lys Asn 395 400 405 410
ATG CTC TCT TTA ACC CTC AAT GCG GCT AAT GGC ACA ATC AGC GTG AGC 1299 Met Leu Ser Leu Thr Leu Asn Ala Ala Asn Gly Thr He Ser Val Ser 415 420 425
GGC ACG GTG TTT GAA GAA GAT ATT TTA AAA CTC ACT GAG ATT GAT GGG 1347 Gly Thr Val Phe Glu Glu Asp He Leu Lys Leu Thr Glu He Asp Gly 430 435 440
TTT CAT ATT GAT ATA GAG CCA AAG GGT AAA ATG CTT TTA TTC AGG AAT 1395 Phe His He Asp He Glu Pro Lys Gly Lys Met Leu Leu Phe Arg Asn 445 450 455
ACG GAT ATT CCA GGC GTT ATT GGG AGT GTG GGG AAT GCG TTC GCT AGG 1443 Thr Asp He Pro Gly Val He Gly Ser Val Gly Asn Ala Phe Ala Arg 460 465 470
CAT GGC ATT AAC ATC GCT GAT TTT CGT TTG GGG CGT AAC ACG CAA AAA 1491 His Gly He Asn He Ala Asp Phe Arg Leu Gly Arg Asn Thr Gin Lys 475 480 485 490
GAA GCC CTA GCA CTC ATT ATT GTA GAT GAA GAA GTT TCT TTG GAA GTT 1539 Glu Ala Leu Ala Leu He He Val Asp Glu Glu Val Ser Leu Glu Val 495 500 505
TTA GAA GAG CTT AAA AAC ATT CCT GCG TGC TTA AGC GTT CAT TAT GTG 1587 Leu Glu Glu Leu Lys Asn He Pro Ala Cys Leu Ser Val His Tyr Val 510 515 520
GTT ATT TAAGGTAGTT GGATGCGAGA TTTTTTAAAA CTTTTAAAAA 1633
Val He (2) INFORMATION FOR SEQ ID NO: 1296:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 524 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1296:
Met Tyr Gin Val Ala He Cys Asp Pro He His Ala Lys Gly He Gin
1 5 10 15
He Leu Glu Ala Gin Lys Asp He Val Leu His Asp Tyr Ser Lys Cys
20 25 30
Pro Lys Lys Glu Leu Leu Glu Lys Leu Thr Pro Met Asp Ala Leu He
35 40 45
Thr Arg Ser Met Thr Pro He Thr Ser Asp Phe Leu Lys Pro Leu Thr
50 55 60
His Leu Lys Ser He Val Arg Ala Gly Val Gly Val Asp Asn He Asp 65 70 75 80
Leu Glu Ser Cys Ser Gin Lys Gly He Val Val Met Asn He Pro Thr
85 90 95
Ala Asn Thr He Ala Ala Val Glu Leu Thr Met Ala His Leu He Asn
100 105 110
Ala Val Arg Ser Phe Pro Cys Ala Asn Asp Gin He Lys His Gin Arg
115 120 125
Leu Trp Lys Arg Glu Asp Trp Tyr Gly Thr Glu Leu Lys Asn Lys Lys
130 135 140
Leu Gly He He Gly Phe Gly Asn He Gly Ser Arg Val Gly He Arg 145 150 155 160
Ala Lys Ala Phe Glu Met Glu Val Leu Ala Tyr Asp Pro Tyr He Pro
165 170 175
Ser Ser Lys Ala Thr Asp Leu Gly Val He Tyr Thr Lys Asn Phe Glu
180 185 190
Asp He Leu Gin Cys Asp Met He Thr He His Thr Pro Lys Asn Lys
195 200 205
Glu Thr He Asn Met He Gly Ala Lys Glu He Glu Arg Met Lys Lys
210 215 220
Gly Val He Leu Leu Asn Cys Ala Arg Gly Gly Leu Tyr Asn Glu Asp 225 230 235 240
Ala Leu Tyr Glu Ala Leu Glu Thr Lys Lys Val Arg Trp Leu Gly He
245 250 255
Asp Val Phe Ser Lys Glu Pro Gly He His Asn Lys Leu Leu Asp Leu
260 265 270
Pro Asn Val Tyr Ala Thr Pro His He Gly Ala Asn Thr Leu Glu Ser
275 280 285
Gin Glu Glu He Ser Lys Gin Ala Ala Gin Gly Val Met Glu Ser Leu
290 295 300
Arg Gly Ser Ser His Pro His Ala Leu Asn Leu Pro Met Gin Ala Phe 305 310 315 320
Asp Ala Ser Ala Lys Ala Tyr Leu Asn Leu Ala Gin Lys Leu Gly Tyr 325 330 335 Phe Ser Ser Gin He His Lys Gly Val Cys Gin Lys He Glu Leu Ser
340 345 350
Leu Cys Gly Glu He Asn Gin Phe Lys Asp Ala Leu Val Ala Phe Met
355 360 365
Leu Val Gly Val Leu Lys Pro Val Val Gly Asp Lys He Asn Tyr He
370 375 380
Asn Ala Pro Phe Val Ala Lys Glu Arg Gly He Glu He Lys Val Ser 385 390 395 400
Leu Lys Glu Ser Ala Ser Pro Tyr Lys Asn Met Leu Ser Leu Thr Leu
405 410 415
Asn Ala Ala Asn Gly Thr He Ser Val Ser Gly Thr Val Phe Glu Glu
420 425 430
Asp He Leu Lys Leu Thr Glu He Asp Gly Phe His He Asp He Glu
435 440 445
Pro Lys Gly Lys Met Leu Leu Phe Arg Asn Thr Asp He Pro Gly Val
450 455 460
He Gly Ser Val Gly Asn Ala Phe Ala Arg His Gly He Asn He Ala 465 470 475 480
Asp Phe Arg Leu Gly Arg Asn Thr Gin Lys Glu Ala Leu Ala Leu He
485 490 495
He Val Asp Glu Glu Val Ser Leu Glu Val Leu Glu Glu Leu Lys Asn
500 505 510
He Pro Ala Cys Leu Ser Val His Tyr Val Val He 515 520
(2) INFORMATION FOR SEQ ID NO: 1297:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1748 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...1701 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1297:
GGATTTTATA TTTATTTTAT AGTAAGGCAG TCA ATG AGC AAG ATA GCA GAT GAT 54
Met Ser Lys He Ala Asp Asp 1 5
CAG AAC TTT AAT GAC GAG GAG GAA AAC TTC GCA AAA CTC TTT AAA AAA 102 Gin Asn Phe Asn Asp Glu Glu Glu Asn Phe Ala Lys Leu Phe Lys Lys 10 15 20
GAA TTA GAA AAA GAA GAA ACC CTA GAA AAA GGC ACT ATC AAA GAA GGG 150 Glu Leu Glu Lys Glu Glu Thr Leu Glu Lys Gly Thr He Lys Glu Gly 25 30 35 CTA GTC GTT TCC ATC AAT GAG AAT GAT GGT TAT GCC ATG GTG AGC GTG 198 Leu Val Val Ser He Asn Glu Asn Asp Gly Tyr Ala Met Val Ser Val 40 45 50 55
GGC GGT AAG ACA GAA GGC CGT TTG GCT TTG AAT GAG ATC ACC GAT GAA 246 Gly Gly Lys Thr Glu Gly Arg Leu Ala Leu Asn Glu He Thr Asp Glu 60 65 70
AAG GGG CAG TTG CTG TAT CAA AAA AAT GAC CCC ATT ATC GTG CAT GTG 294 Lys Gly Gin Leu Leu Tyr Gin Lys Asn Asp Pro He He Val His Val 75 80 85
TCC GAA AAA GGT GAA CAC CCT AGC GTT TCC TAC AAA AAG GCC ATT TCC 342 Ser Glu Lys Gly Glu His Pro Ser Val Ser Tyr Lys Lys Ala He Ser 90 95 100
CAA CAA AAG ATT CAA GCT AAA ATT GAA GAA TTA GGC GAA AAC TAT GAA 390 Gin Gin Lys He Gin Ala Lys He Glu Glu Leu Gly Glu Asn Tyr Glu 105 110 115
AAC GCC ATT ATT GAA GGC AAG ATT GTA GGC AAG AAT AAA GGG GGT TAT 438 Asn Ala He He Glu Gly Lys He Val Gly Lys Asn Lys Gly Gly Tyr 120 125 130 135
ATC GTG GAG TCT CAA GGC GTG GAG TAT TTC CTC TCC CGC TCG CAC TCT 486 He Val Glu Ser Gin Gly Val Glu Tyr Phe Leu Ser Arg Ser His Ser 140 145 150
TCT TTA AAG AAT GAC GCA AAC CAT ATC GGC AAA CGC GTT AAA GCG TGC 534 Ser Leu Lys Asn Asp Ala Asn His He Gly Lys Arg Val Lys Ala Cys 155 160 165
ATC ATT CGT GTG GAT AAG GAA AAC CAT TCT ATC AAT ATT TCT CGC AAA 582 He He Arg Val Asp Lys Glu Asn His Ser He Asn He Ser Arg Lys 170 175 180
CGA TTC TTT GAA GTC AAT GAC AAA CGA CAA CTT GAG GTT TCT AAG GAA 630 Arg Phe Phe Glu Val Asn Asp Lys Arg Gin Leu Glu Val Ser Lys Glu 185 190 195
TTG TTA GAA GCC ACA GAG CCG GTG TTA GGG GTT GTG CGC CAG ATC ACC 678 Leu Leu Glu Ala Thr Glu Pro Val Leu Gly Val Val Arg Gin He Thr 200 205 210 215
CCT TTT GGC ATT TTT GTA GAA GCT AAG GGG ATT GAG GGC TTG GTC CAT 726 Pro Phe Gly He Phe Val Glu Ala Lys Gly He Glu Gly Leu Val His 220 225 230
TAT TCT GAA ATC AGC CAT AAG GGA CCA GTC AAT CCT GAA AAA TAC TAC 774 Tyr Ser Glu He Ser His Lys Gly Pro Val Asn Pro Glu Lys Tyr Tyr 235 240 245
AAA GAG GGC GAT GAA GTC TAT GTC AAA GCC ATC GCT TAT GAT GCA GAA 822 Lys Glu Gly Asp Glu Val Tyr Val Lys Ala He Ala Tyr Asp Ala Glu 250 255 260 AAA AGA CGC CTT TCA CTC TCC ATA AAA GCG ACT ATA GAA GAC CCA TGG 870 Lys Arg Arg Leu Ser Leu Ser He Lys Ala Thr He Glu Asp Pro Trp 265 270 275
GAA GAG ATT CAA GAC AAG CTA AAA CCC GGA TAC GCC ATT AAG GTA GTG 918 Glu Glu He Gin Asp Lys Leu Lys Pro Gly Tyr Ala He Lys Val Val 280 285 290 295
GTG AGC AAC ATT GAA CAT TAT GGG GTG TTT GTG GAT ATT GGT AAT GAT 966 Val Ser Asn He Glu His Tyr Gly Val Phe Val Asp He Gly Asn Asp 300 305 310
ATT GAA GGC TTT TTG CAT GTT TCT GAA ATC TCT TGG GAT AAA AAT GTC 1014 He Glu Gly Phe Leu His Val Ser Glu He Ser Trp Asp Lys Asn Val 315 320 325
AGC CAC CCT AAC AAT TAC TTG AGC GTG GGG CAA GAG ATT GAT GTG AAA 1062 Ser His Pro Asn Asn Tyr Leu Ser Val Gly Gin Glu He Asp Val Lys 330 335 340
ATC ATT GAC ATT GAT CCA AAA AAT CGC CGC TTA AGG GTT TCT TTA AAG 1110 He He Asp He Asp Pro Lys Asn Arg Arg Leu Arg Val Ser Leu Lys 345 350 355
CAA CTC ACT AAC AGG CCT TTT GAT GTT TTT GAA TCT AAA CAC CAA GTG 1158 Gin Leu Thr Asn Arg Pro Phe Asp Val Phe Glu Ser Lys His Gin Val 360 365 370 375
GGG GAT GTT TTA GAA GGC AAA GTG GCG ACT TTA ACG GAT TTT GGG GCG 1206 Gly Asp Val Leu Glu Gly Lys Val Ala Thr Leu Thr Asp Phe Gly Ala 380 385 390
TTT TTA AAT CTG GGT GGG GTG GAT GGT TTG CTC CAC AAT CAC GAC GCT 1254 Phe Leu Asn Leu Gly Gly Val Asp Gly Leu Leu His Asn His Asp Ala 395 400 405
TTT TGG GAT AAA GAT AAA AAA TGC AAA GAC CAC TAT AAA ATT GGC GAT 1302 Phe Trp Asp Lys Asp Lys Lys Cys Lys Asp His Tyr Lys He Gly Asp 410 415 420
GTG ATC AAA GTG AAA ATC CTT AAA ATC AAC AAA AAA GAT AAA AAG ATT 1350 Val He Lys Val Lys He Leu Lys He Asn Lys Lys Asp Lys Lys He 425 430 435
TCT TTG AGC GCG AAG CAC TTG GTG ACT TCC CCT ACA GAA GAA TTC GCT 1398 Ser Leu Ser Ala Lys His Leu Val Thr Ser Pro Thr Glu Glu Phe Ala 440 445 450 455
CAA AAG CAT AAA ACA GAC AGC GTG ATT CAA GGC AAA GTG GTG AGT ATT 1446 Gin Lys His Lys Thr Asp Ser Val He Gin Gly Lys Val Val Ser He 460 465 470
AAG GAT TTT GGC GTT TTC ATT AAT GCT GAT GGC ATT GAT GTG CTG ATC 1494 Lys Asp Phe Gly Val Phe He Asn Ala Asp Gly He Asp Val Leu He 475 480 485 AAA AAT GAA GAT TTG AAC CCC TTG AAA AAA GAT GAA ATT AAA ATA GGC 1542 Lys Asn Glu Asp Leu Asn Pro Leu Lys Lys Asp Glu He Lys He Gly 490 495 500
CAA GAA ATC ACA TGC GTG GTG GTT GCG ATT GAA AAA TCT AAC AAC AAG ■ 1590 Gin Glu He Thr Cys Val Val Val Ala He Glu Lys Ser Asn Asn Lys 505 510 515
GTG CGT GCT TCT GTG CAT AGG TTA GAG CGC AAA AAA GAA AAA GAA GAA 1638 Val Arg Ala Ser Val His Arg Leu Glu Arg Lys Lys Glu Lys Glu Glu 520 525 530 535
TTG CAA GCT TTT AAC ACG AGC GAT GAT AAA ATG ACT TTA GGG GAT ATT 1686 Leu Gin Ala Phe Asn Thr Ser Asp Asp Lys Met Thr Leu Gly Asp He 540 545 550
CTT AAA GAA AAA CTC TAAAGAGTGA TTTTAAAAGC ATGAGAATGG CATGAGATTT A 1742 Leu Lys Glu Lys Leu 555
AGGGTG 174?
(2) INFORMATION FOR SEQ ID NO: 1298:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 556 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1298:
Met Ser Lys He Ala Asp Asp Gin Asn Phe Asn Asp Glu Glu Glu Asn
1 5 10 15
Phe Ala Lys Leu Phe Lys Lys Glu Leu Glu Lys Glu Glu Thr Leu Glu
20 25 30
Lys Gly Thr He Lys Glu Gly Leu Val Val Ser He Asn Glu Asn Asp
35 40 45
Gly Tyr Ala Met Val Ser Val Gly Gly Lys Thr Glu Gly Arg Leu Ala
50 55 60
Leu Asn Glu He Thr Asp Glu Lys Gly Gin Leu Leu Tyr Gin Lys Asn 65 70 75 80
Asp Pro He He Val His Val Ser Glu Lys Gly Glu His Pro Ser Val
85 90 95
Ser Tyr Lys Lys Ala He Ser Gin Gin Lys He Gin Ala Lys He Glu
100 105 110
Glu Leu Gly Glu Asn Tyr Glu Asn Ala He He Glu Gly Lys He Val
115 120 125
Gly Lys Asn Lys Gly Gly Tyr He Val Glu Ser Gin Gly Val Glu Tyr
130 135 140
Phe Leu Ser Arg Ser His Ser Ser Leu Lys Asn Asp Ala Asn His He 145 150 155 160
Gly Lys Arg Val Lys Ala Cys He He Arg Val Asp Lys Glu Asn His 165 170 175
Ser He Asn He Ser Arg Lys Arg Phe Phe Glu Val Asn Asp Lys Arg
180 185 190
Gin Leu Glu Val Ser Lys Glu Leu Leu Glu Ala Thr Glu Pro Val Leu
195 200 205
Gly Val Val Arg Gin He Thr Pro Phe Gly He Phe Val Glu Ala Lys
210 215 220
Gly He Glu Gly Leu Val His Tyr Ser Glu He Ser His Lys Gly Pro 225 230 235 240
Val Asn Pro Glu Lys Tyr Tyr Lys Glu Gly Asp Glu Val Tyr Val Lys
245 250 255
Ala He Ala Tyr Asp Ala Glu Lys Arg Arg Leu Ser Leu Ser He Lys
260 265 270
Ala Thr He Glu Asp Pro Trp Glu Glu He Gin Asp Lys Leu Lys Pro
275 280 285
Gly Tyr Ala He Lys Val Val Val Ser Asn He Glu His Tyr Gly Val
290 295 300
Phe Val Asp He Gly Asn Asp He Glu Gly Phe Leu His Val Ser Glu 305 310 315 320
He Ser Trp Asp Lys Asn Val Ser His Pro Asn Asn Tyr Leu Ser Val
325 330 335
Gly Gin Glu He Asp Val Lys He He Asp He Asp Pro Lys Asn Arg
340 345 350
Arg Leu Arg Val Ser Leu Lys Gin Leu Thr Asn Arg Pro Phe Asp Val
355 360 365
Phe Glu Ser Lys His Gin Val Gly Asp Val Leu Glu Gly Lys Val Ala
370 375 380
Thr Leu Thr Asp Phe Gly Ala Phe Leu Asn Leu Gly Gly Val Asp Gly 385 390 395 400
Leu Leu His Asn His Asp Ala Phe Trp Asp Lys Asp Lys Lys Cys Lys
405 410 415
Asp His Tyr Lys He Gly Asp Val He Lys Val Lys He Leu Lys He
420 425 430
Asn Lys Lys Asp Lys Lys He Ser Leu Ser Ala Lys His Leu Val Thr
435 440 445
Ser Pro Thr Glu Glu Phe Ala Gin Lys His Lys Thr Asp Ser Val He
450 455 460
Gin Gly Lys Val Val Ser He Lys Asp. Phe Gly Val Phe He Asn Ala 465 470 475 480
Asp Gly He Asp Val Leu He Lys Asn Glu Asp Leu Asn Pro Leu Lys
485 490 495
Lys Asp Glu He Lys He Gly Gin Glu He Thr Cys Val Val Val Ala
500 505 510
He Glu Lys Ser Asn Asn Lys Val Arg Ala Ser Val His Arg Leu Glu
515 520 525
Arg Lys Lys Glu Lys Glu Glu Leu Gin Ala Phe Asn Thr Ser Asp Asp
530 535 540
Lys Met Thr Leu Gly Asp He Leu Lys Glu Lys Leu 545 550 555
(2) INFORMATION FOR SEQ ID NO: 1299:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1636 base pairs
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 43...1584 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1299:
AATTTGGGGT GTTTTAACAG AATGCAAGCT TGAAGGAGAA TC ATG TCC ATT TCA 54
Met Ser He Ser
1
CGC AGA AGT ATC CTA ACA AAA ATC CCA ATC GCG CTC GCT AGC GCT AAT 102 Arg Arg Ser He Leu Thr Lys He Pro He Ala Leu Ala Ser Ala Asn 5 10 15 20
GTT TTG AAA GCT GTT GGT GTT TTT GAA AAA GTA GAA TCC ATT CCG CAT 150 Val Leu Lys Ala Val Gly Val Phe Glu Lys Val Glu Ser He Pro His 25 30 35
GCA ACG CAT TTT GGC CCC TTT ATC GCA AAG GTT CAA AAT GGA GTG ATT 198 Ala Thr His Phe Gly Pro Phe He Ala Lys Val Gin Asn Gly Val He 40 45 50
AAA GAT ATT GTC CCC CAA AAA AGC GAT TAT AAC CCT ACT ATG ATG TTA 246 Lys Asp He Val Pro Gin Lys Ser Asp Tyr Asn Pro Thr Met Met Leu 55 60 65
AAA GCG ATG GTT GAT AGG GTG TAT TCA GAT AGT AGG GTG AAG TAT CCT 294 Lys Ala Met Val Asp Arg Val Tyr Ser Asp Ser Arg Val Lys Tyr Pro 70 75 80
TGC GTG CGC AAG AGC TTC TTA GAA AAC AAA AAA AAC CAC AAA GAA TTG 342 Cys Val Arg Lys Ser Phe Leu Glu Asn Lys Lys Asn His Lys Glu Leu 85 90 95 100
CGC GGG AGA GAA GAG TTT GTG CGT GTG AGT TGG GAT GTG GCG TTG GAT 390 Arg Gly Arg Glu Glu Phe Val Arg Val Ser Trp Asp Val Ala Leu Asp 105 110 115
TTA GCG GCT AAA AAG CTT AAA GAA ATC CCT AAA GAA AAC ATT TAT AAT 438 Leu Ala Ala Lys Lys Leu Lys Glu He Pro Lys Glu Asn He Tyr Asn 120 125 130
GCC AGT TAT GGT GGC TGG GGG CAT GCG GGC AGC TTG CAT CGT TGC CAT 486 Ala Ser Tyr Gly Gly Trp Gly His Ala Gly Ser Leu His Arg Cys His 135 140 145
CAT TTA GCA TGG CGT TTT TTT AAC ACG ACT TTA GGA GGG GCT ATT GGC 534 His Leu Ala Trp Arg Phe Phe Asn Thr Thr Leu Gly Gly Ala He Gly 150 155 160
ACT GAT GGG GAA TAT AGT AAT GGC GCG GCC GCA AGA ATA AAC CCT ATG 582 Thr Asp Gly Glu Tyr Ser Asn Gly Ala Ala Ala Arg He Asn Pro Met 165 170 175 180
ATT GTA GGG GAT ATG GAA GTT TAT TCG CAA CAA ACC ACG CAT GAA GAG 630 He Val Gly Asp Met Glu Val Tyr Ser Gin Gin Thr Thr His Glu Glu 185 190 195
ATG ATT AAA AAT TGT AAG GTG TAT GTC ATG TGG GGG GCG GAT TTA CTC 678 Met He Lys Asn Cys Lys Val Tyr Val Met Trp Gly Ala Asp Leu Leu 200 205 210
AAG TGC AAC CGC ATT GAT TAT TTT GTG CCA AAC CAT GTC AAT GAC AGC 726 Lys Cys Asn Arg He Asp Tyr Phe Val Pro Asn His Val Asn Asp Ser 215 220 225
TAC TAC CCC AAG TAT AAA AGA GCT GGT ATT AAA TTC ATT AGT ATC GAT 774 Tyr Tyr Pro Lys Tyr Lys Arg Ala Gly He Lys Phe He Ser He Asp 230 235 240
CCC ATT TAT ACC GAA ACC GCT CAA GCC TTT AGT GCT GAA TGG ATA CCC 822 Pro He Tyr Thr Glu Thr Ala Gin Ala Phe Ser Ala Glu Trp He Pro 245 250 255 260
ATT CGC CCT AAC ACT GAT GTA GCG TTA ATG CTA GGC ATG ATG CAT TAT 870 He Arg Pro Asn Thr Asp Val Ala Leu Met Leu Gly Met Met His Tyr 265 270 275
CTT TAT ACG AGC AAT CAA TAT GAT AAA GCG TTT ATC GCT AAA TAC ACT 918 Leu Tyr Thr Ser Asn Gin Tyr Asp Lys Ala Phe He Ala Lys Tyr Thr 280 285 290
GAT GGT TTT GAT AAA TTT TTA CCC TAT TTG CTA GGA GAG AGC GAT AAT 966 Asp Gly Phe Asp Lys Phe Leu Pro Tyr Leu Leu Gly Glu Ser Asp Asn 295 300 305
GCG CCT AAG ACT TTA GAA TGG GCG TCT CAA ATC ACT GGA GTG AGC GCA 1014 Ala Pro Lys Thr Leu Glu Trp Ala Ser Gin He Thr Gly Val Ser Ala 310 315 320
GAA AAA ATC AAA GAA TTA GCG GAT TTG TTT GTT TCT AAA CGC ACT TTT 1062 Glu Lys He Lys Glu Leu Ala Asp Leu Phe Val Ser Lys Arg Thr Phe 325 330 335 340
TTA GCG GGT AAT TGG GCC ATG CAA AGA GCT CAG TAT GGC GAG CAA CCG 1110 Leu Ala Gly Asn Trp Ala Met Gin Arg Ala Gin Tyr Gly Glu Gin Pro 345 350 355
GAT TGG GCG TTA ATT GTT TTA GCT AGC ATG ATT GGT CAA GTG GGC TTA 1158 Asp Trp Ala Leu He Val Leu Ala Ser Met He Gly Gin Val Gly Leu 360 365 370 TCG GGT GGG GGC TTT GGC TTT TCT ATG CAT TAT GGA GGG AAC GCT CAA 1206 Ser Gly Gly Gly Phe Gly Phe Ser Met His Tyr Gly Gly Asn Ala Gin 375 380 385
GCA AGC TCA GGG GCA AGA ATT GTT CCT ATG ATT TCA CAA GGG CAT AAT 1254 Ala Ser Ser Gly Ala Arg He Val Pro Met He Ser Gin Gly His Asn 390 395 400
TCT GTA AAA AGC GTT ATT CCA GCA TCT AGG GTT TCT GAA GCG ATT TTA 1302 Ser Val Lys Ser Val He Pro Ala Ser Arg Val Ser Glu Ala He Leu 405 410 415 420
AAT CCG GAT AAA GAA ATT GAT TTT ATG GGC AAA AAA CTC AAA TTG CCT 1350 Asn Pro Asp Lys Glu He Asp Phe Met Gly Lys Lys Leu Lys Leu Pro 425 430 435
AAA ATC AAA ATG ATT TAT AAT TGT GGG GCG GAT TTA TTA GGG CAT GAA 1398 Lys He Lys Met He Tyr Asn Cys Gly Ala Asp Leu Leu Gly His Glu 440 445 450
ACT GAT ACA AAC GAG CTG ATT CGC GCT TTA AGG ACC TTA GAT TGC GTG 1446 Thr Asp Thr Asn Glu Leu He Arg Ala Leu Arg Thr Leu Asp Cys Val 455 460 465
ATC GTG CAT GAG CCT TGG TGG CGC CTA CGG CAA AAT TTG CTG ATA TTG 1494 He Val His Glu Pro Trp Trp Arg Leu Arg Gin Asn Leu Leu He Leu 470 475 480
TCT TTG CTT CCA CTA GCA CTG TGG AAA GAG ATG ATA TTG CTT TTG GAG 1542 Ser Leu Leu Pro Leu Ala Leu Trp Lys Glu Met He Leu Leu Leu Glu 485 490 495 500
GGA GTT ATT CTA AGA ATG TGG TTT ATG CCA TGC GTA AGG TGG TAGAGCCTG 1593 Gly Val He Leu Arg Met Trp Phe Met Pro Cys Val Arg Trp 505 510
TTTATGAATC TAAAGACGAT TATGAGATTT TCAGACAGCT TGC 1636
(2) INFORMATION FOR SEQ ID NO: 1300:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 514 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1300:
Met Ser He Ser Arg Arg Ser He Leu Thr Lys He Pro He Ala Leu
1 5 10 15
Ala Ser Ala Asn Val Leu Lys Ala Val Gly Val Phe Glu Lys Val Glu
20 25 30
Ser He Pro His Ala Thr His Phe Gly Pro Phe He Ala Lys Val Gin 35 40 45
Asn Gly Val He Lys Asp He Val Pro Gin Lys Ser Asp Tyr Asn Pro
50 55 60
Thr Met Met Leu Lys Ala Met Val Asp Arg Val Tyr Ser Asp Ser Arg 65 70 75 80
Val Lys Tyr Pro Cys Val Arg Lys Ser Phe Leu Glu Asn Lys Lys Asn
85 90 95
His Lys Glu Leu Arg Gly Arg Glu Glu Phe Val Arg Val Ser Trp Asp
100 105 110
Val Ala Leu Asp Leu Ala Ala Lys Lys Leu Lys Glu He Pro Lys Glu
115 120 125
Asn He Tyr Asn Ala Ser Tyr Gly Gly Trp Gly His Ala Gly Ser Leu
130 135 140
His Arg Cys His His Leu Ala Trp Arg Phe Phe Asn Thr Thr Leu Gly 145 150 155 160
Gly Ala He Gly Thr Asp Gly Glu Tyr Ser Asn Gly Ala Ala Ala Arg
165 170 175
He Asn Pro Met He Val Gly Asp Met Glu Val Tyr Ser Gin Gin Thr
180 185 190
Thr His Glu Glu Met He Lys Asn Cys Lys Val Tyr Val Met Trp Gly
195 200 205
Ala Asp Leu Leu Lys Cys Asn Arg He Asp Tyr Phe Val Pro Asn His
210 215 220
Val Asn Asp Ser Tyr Tyr Pro Lys Tyr Lys Arg Ala Gly He Lys Phe 225 230 235 240
He Ser He Asp Pro He Tyr Thr Glu Thr Ala Gin Ala Phe Ser Ala
245 250 255
Glu Trp He Pro He Arg Pro Asn Thr Asp Val Ala Leu Met Leu Gly
260 265 270
Met Met His Tyr Leu Tyr Thr Ser Asn Gin Tyr Asp Lys Ala Phe He
275 280 285
Ala Lys Tyr Thr Asp Gly Phe Asp Lys Phe Leu Pro Tyr Leu Leu Gly
290 295 300
Glu Ser Asp Asn Ala Pro Lys Thr Leu Glu Trp Ala Ser Gin He Thr 305 310 315 320
Gly Val Ser Ala Glu Lys He Lys Glu Leu Ala Asp Leu Phe Val Ser
325- 330 335
Lys Arg Thr Phe Leu Ala Gly Asn Trp Ala Met Gin Arg Ala Gin Tyr
340 345 350
Gly Glu Gin Pro Asp Trp Ala Leu He Val Leu Ala Ser Met He Gly
355 360 365
Gin Val Gly Leu Ser Gly Gly Gly Phe Gly Phe Ser Met His Tyr Gly
370 375 380
Gly Asn Ala Gin Ala Ser Ser Gly Ala Arg He Val Pro Met He Ser 385 390 395 400
Gin Gly His Asn Ser Val Lys Ser Val He Pro Ala Ser Arg Val Ser
405 410 415
Glu Ala He Leu Asn Pro Asp Lys Glu He Asp Phe Met Gly Lys Lys
420 425 430
Leu Lys Leu Pro Lys He Lys Met He Tyr Asn Cys Gly Ala Asp Leu
435 440 445
Leu Gly His Glu Thr Asp Thr Asn Glu Leu He Arg Ala Leu Arg Thr
450 455 460
Leu Asp Cys Val He Val His Glu Pro Trp Trp Arg Leu Arg Gin Asn 465 470 475 480 Leu Leu He Leu Ser Leu Leu Pro Leu Ala Leu Trp Lys Glu Met He
485 490 495
Leu Leu Leu Glu Gly Val He Leu Arg Met Trp Phe Met Pro Cys Val
500 505 510
Arg Trp
(2) INFORMATION FOR SEQ ID NO: 1301:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 540 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 24...509 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1301:
ATTGACGCTT ATAAGGATAA AAG ATG AAT ATT TTT CAA ACG AGT TTG AAA TGT 53
Met Asn He Phe Gin Thr Ser Leu Lys Cys 1 5 10
TGC GTG GGG TTG GTT TTG TCT GTG GGG GTC TTA TTA GGG GAT TCT AAA 101 Cys Val Gly Leu Val Leu Ser Val Gly Val Leu Leu Gly Asp Ser Lys 15 20 25
GCT TTT AAG GTT AGG GTG GAT AAA AGT TTA ACC CCG CCT TTT TTG AAT 149 Ala Phe Lys Val Arg Val Asp Lys Ser Leu Thr Pro Pro Phe Leu Asn 30 35 40
GTG CTT TCA TTA GCT TTT AAA CAA GAC ATG AAA AAA GAG GTC ATT TTT 197 Val Leu Ser Leu Ala Phe Lys Gin Asp Met Lys Lys Glu Val He Phe 45 50 55
GTG ATT ACC AAA AGC AAT AAG TTG AGT AAA AAA GTG CTT TGT GAT TTT 245 Val He Thr Lys Ser Asn Lys Leu Ser Lys Lys Val Leu Cys Asp Phe 60 65 70
GAC GCT TTT TTA TTG CCT GAG ACT CTG ATG AGC GGC ATG CCT AAA AAA 293 Asp Ala Phe Leu Leu Pro Glu Thr Leu Met Ser Gly Met Pro Lys Lys 75 80 85 90
GCA CTA TTC CAT AAA GAG TTT TTA TTC CAA TCT AAA GAA AAT AAA ACG 341 Ala Leu Phe His Lys Glu Phe Leu Phe Gin Ser Lys Glu Asn Lys Thr 95 100 105
CTC TAT GCG TTT TCG CTG ATT GAT TCT CAA TAT TGC TCA AAA GGT GGA 389 Leu Tyr Ala Phe Ser Leu He Asp Ser Gin Tyr Cys Ser Lys Gly Gly 110 115 120
AAT TAC AGA TAC GAA CTA GAA AAA TTA GAA CGC TGG TTT GTG CAA AAA 437
Asn Tyr Arg Tyr Glu Leu Glu Lys Leu Glu Arg Trp Phe Val Gin Lys 125 130 135
GCA CCT GAG TTG GCT GAA AGC TAT AGG GTG AAT TAC AAA AAT CAA TAC 485
Ala Pro Glu Leu Ala Glu Ser Tyr Arg Val Asn Tyr Lys Asn Gin Tyr 140 145 150
AAT AAA ACA CAG ATC TCA CAA AAA TAAAGAATGA GCGATGATTT TAGTATTAGA 539
Asn Lys Thr Gin He Ser Gin Lys
155 160
540
(2) INFORMATION FOR SEQ ID NO: 1302:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 162 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1302:
Met Asn He Phe Gin Thr Ser Leu Lys Cys Cys Val Gly Leu Val Leu
1 5 10 15
Ser Val Gly Val Leu Leu Gly Asp Ser Lys Ala Phe Lys Val Arg Val
20 25 30
Asp Lys Ser Leu Thr Pro Pro Phe Leu Asn Val Leu Ser Leu Ala Phe
35 40 45
Lys Gin Asp Met Lys Lys Glu Val He Phe Val He Thr Lys Ser Asn
50 55 60
Lys Leu Ser Lys Lys Val Leu Cys Asp Phe Asp Ala Phe Leu Leu Pro 65 70 75 80
Glu Thr Leu Met Ser Gly Met Pro Lys Lys Ala Leu Phe His Lys Glu
85 90 95
Phe Leu Phe Gin Ser Lys Glu Asn Lys Thr Leu Tyr Ala Phe Ser Leu
100 105 110
He Asp Ser Gin Tyr Cys Ser Lys Gly Gly Asn Tyr Arg Tyr Glu Leu
115 120 125
Glu Lys Leu Glu Arg Trp Phe Val Gin Lys Ala Pro Glu Leu Ala Glu
130 135 140
Ser Tyr Arg Val Asn Tyr Lys Asn Gin Tyr Asn Lys Thr Gin He Ser 145 150 155 160
Gin Lys
(2) INFORMATION FOR SEQ ID NO: 1303: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1572 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 25...1548 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1303:
TCTCACAAAA ATAAAGAATG AGCG ATG ATT TTA GTA TTA GAT TTT GGG AGT 51
Met He Leu Val Leu Asp Phe Gly Ser 1 5
CAA TAC ACA CAG CTG ATT GCT AGA AGA TTG AGA GAG AGA GGG ATT TAT 99 Gin Tyr Thr Gin Leu He Ala Arg Arg Leu Arg Glu Arg Gly He Tyr 10 15 20 25
ACA GAA ATA GTC CCT TTT TTT GAA AGC ATA GAA AAC ATT CAA AAA AAA 147 Thr Glu He Val Pro Phe Phe Glu Ser He Glu Asn He Gin Lys Lys 30 35 40
GCC CCC AAA GGT TTG ATT TTG AGT GGG GGG CCA GCG AGC GTG TAT GCT 195 Ala Pro Lys Gly Leu He Leu Ser Gly Gly Pro Ala Ser Val Tyr Ala 45 50 55
AAA GAC GCT TAC AAG CCT AGT GGG AAA ATC TTT GAT TTG AAT GTG CCG 243 Lys Asp Ala Tyr Lys Pro Ser Gly Lys He Phe Asp Leu Asn Val Pro 60 65 70
ATT TTA GGG ATT TGC TAC GGC ATG CAG TAT TTG GTG GAT TTT TTT GGG 291 He Leu Gly He Cys Tyr Gly Met Gin Tyr Leu Val Asp Phe Phe Gly 75 80 85
GGG GTA GTG GTT GGT GCG AAT GAG CAA GAA TTT GGT AAG GCT GTT TTA 339 Gly Val Val Val Gly Ala Asn Glu Gin Glu Phe Gly Lys Ala Val Leu 90 95 100 105
GAA ATC ACT CAA AAT TCT GTG ATT TTT GAA GGC GTG AAG ATT AAA AGC 387 Glu He Thr Gin Asn Ser Val He Phe Glu Gly Val Lys He Lys Ser 110 115 120
CTT GTG TGG ATG AGC CAT ATG GAT AAA GTC ATA GAA CTG CCT AAA GGC 435 Leu Val Trp Met Ser His Met Asp Lys Val He Glu Leu Pro Lys Gly 125 130 135
TTT ACT ACC CTT GCA AAA AGC CCT AAT TCC CCC CAT TGC GCG ATT GAA 483 Phe Thr Thr Leu Ala Lys Ser Pro Asn Ser Pro His Cys Ala He Glu 140 145 150 AAC GGC AAG ATT TTT GGC TTG CAA TTC CAC CCA GAA GTC GTT CAA AGC 531 Asn Gly Lys He Phe Gly Leu Gin Phe His Pro Glu Val Val Gin Ser 155 160 165
GAA GAA GGG GGT AAG ATT TTA GAA AAT TTT GCC CTT TTA GTT TGC GGC 579 Glu Glu Gly Gly Lys He Leu Glu Asn Phe Ala Leu Leu Val Cys Gly 170 175 180 185
TGT GAA AAA ACT TGG GGG ATG CAG CAT TTC GCT CAA AGA GAA ATC GCA 627 Cys Glu Lys Thr Trp Gly Met Gin His Phe Ala Gin Arg Glu He Ala 190 195 200
CGA TTG AAA GAA AAA ATC GCT AAC GCT AAG GTT TTG TGC GCG GTG AGT 675 Arg Leu Lys Glu Lys He Ala Asn Ala Lys Val Leu Cys Ala Val Ser 205 210 215
GGG GGC GTG GAT TCT ACG GTG GTC GCT ACG CTG TTG CAC AGA GCC ATT 723 Gly Gly Val Asp Ser Thr Val Val Ala Thr Leu Leu His Arg Ala He 220 225 230
AAG GAT AAT TTG ATC GCT GTT TTT GTG GAT CAT GGC TTG TTG CGT AAA 771 Lys Asp Asn Leu He Ala Val Phe Val Asp His Gly Leu Leu Arg Lys 235 240 245
AAT GAA AAA GAA AGG GTG CAA GCG ATG TTT AAG GAC TTG AAA ATC CCT 819 Asn Glu Lys Glu Arg Val Gin Ala Met Phe Lys Asp Leu Lys He Pro 250 255 260 265
TTA AAC ACG ATA GAC GCT AAA GAA GTC TTT TTG TCT AAA TTA AAG GGC 867 Leu Asn Thr He Asp Ala Lys Glu Val Phe Leu Ser Lys Leu Lys Gly 270 275 280
GTG AGC GAG CCT GAA TTG AAG CGA AAA ATC ATC GGC GAG ACC TTT ATT 915 Val Ser Glu Pro Glu Leu Lys Arg Lys He He Gly Glu Thr Phe He 285 290 295
GAA GTG TTT GAA AAA GAA GCC AAA AAG CAC CAT TTA AAA GGC AAA ATT 963 Glu Val Phe Glu Lys Glu Ala Lys Lys His His Leu Lys Gly Lys He 300 305 310
GAA TTT TTA GCC CAA GGC ACT TTA TAC CCT GAT GTG ATT GAA TCC GTG 1011 Glu Phe Leu Ala Gin Gly Thr Leu Tyr Pro Asp Val He Glu Ser Val 315 320 325
AGC GTT AAA GGG CCT TCA AAA GTG ATC AAA ACC CAT CAT AAT GTG GGC 1059 Ser Val Lys Gly Pro Ser Lys Val He Lys Thr His His Asn Val Gly 330 335 340 345
GGA CTG CCT GAA TGG ATG GAT TTT AAA CTC ATA GAG CCT TTA AGG GAG 1107 Gly Leu Pro Glu Trp Met Asp Phe Lys Leu He Glu Pro Leu Arg Glu 350 355 360
TTG TTT AAA GAT GAG GTG CGC TTA CTG GGT AAA GAA TTG GGC GTT AGT 1155 Leu Phe Lys Asp Glu Val Arg Leu Leu Gly Lys Glu Leu Gly Val Ser 365 370 375 CAG GAT TTT TTA ATG CGC CAC CCT TTT CCA GGG CCT GGG CTT GCT GTA 1203 Gin Asp Phe Leu Met Arg His Pro Phe Pro Gly Pro Gly Leu Ala Val 380 385 390
AGG ATT TTA GGC GAA ATC AGT GAG AGT AAG ATC AAA CGC TTG CAA GAA 1251 Arg He Leu Gly Glu He Ser Glu Ser Lys He Lys Arg Leu Gin Glu 395 400 405
GCG GAT TTT ATT TTT ATA GAG GAA CTT AAA AAA GCC AAT TTG TAT GAC 1299 Ala Asp Phe He Phe He Glu Glu Leu Lys Lys Ala Asn Leu Tyr Asp 410 415 420 425
AAG GTT TGG CAA GCT TTT TGC GTG CTG TTG AAT GTC AAT TCT GTG GGG 1347 Lys Val Trp Gin Ala Phe Cys Val Leu Leu Asn Val Asn Ser Val Gly 430 435 440
GTT ATG GGG GAT AAC CGC ACT TAT GAA AAC GCT ATT TGC TTA AGA GCG 1395 Val Met Gly Asp Asn Arg Thr Tyr Glu Asn Ala He Cys Leu Arg Ala 445 450 455
GTA AAT GCG AGC GAT GGC ATG ACG GCG AGC TTT TCA TTT TTA GAG CAT 1443 Val Asn Ala Ser Asp Gly Met Thr Ala Ser Phe Ser Phe Leu Glu His 460 465 470
TCT TTT TTA GAA AAG GTT TCT AAC CGT ATC ACT AAT GAA GTG AGC GGT 1491 Ser Phe Leu Glu Lys Val Ser Asn Arg He Thr Asn Glu Val Ser Gly 475 480 485
ATC AAT AGG GTG GTG TAT GAC ATT ACC TCT AAA CCA CCA GGA ACG ATT 1539 He Asn Arg Val Val Tyr Asp He Thr Ser Lys Pro Pro Gly Thr He 490 495 500 505
GAA TGG GAA TGATTATCTT AAAAAATAGC ACTA 1572
Glu Trp Glu
(2) INFORMATION FOR SEQ ID NO: 1304:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 508 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1304:
Met He Leu Val Leu Asp Phe Gly Ser Gin Tyr Thr Gin Leu He Ala
1 5 10 15
Arg Arg Leu Arg Glu Arg Gly He Tyr Thr Glu He Val Pro Phe Phe
20 25 30
Glu Ser He Glu Asn He Gin Lys Lys Ala Pro Lys Gly Leu He Leu 35 40 45 Ser Gly Gly Pro Ala Ser Val Tyr Ala Lys Asp Ala Tyr Lys Pro Ser
50 55 60
Gly Lys He Phe Asp Leu Asn Val Pro He Leu Gly He Cys Tyr Gly 65 70 75 80
Met Gin Tyr Leu Val Asp Phe Phe Gly Gly Val Val Val Gly Ala Asn
85 90 95
Glu Gin Glu Phe Gly Lys Ala Val Leu Glu He Thr Gin Asn Ser Val
100 105 110
He Phe Glu Gly Val Lys He Lys Ser Leu Val Trp Met Ser His Met
115 120 125
Asp Lys Val He Glu Leu Pro Lys Gly Phe Thr Thr Leu Ala Lys Ser
130 135 140
Pro Asn Ser Pro His Cys Ala He Glu Asn Gly Lys He Phe Gly Leu 145 150 155 160
Gin Phe His Pro Glu Val Val Gin Ser Glu Glu Gly Gly Lys He Leu
165 170 175
Glu Asn Phe Ala Leu Leu Val Cys Gly Cys Glu Lys Thr Trp Gly Met
180 185 190
Gin His Phe Ala Gin Arg Glu He Ala Arg Leu Lys Glu Lys He Ala
195 200 205
Asn Ala Lys Val Leu Cys Ala Val Ser Gly Gly Val Asp Ser Thr Val
210 215 220
Val Ala Thr Leu Leu His Arg Ala He Lys Asp Asn Leu He Ala Val 225 230 235 240
Phe Val Asp His Gly Leu Leu Arg Lys Asn Glu Lys Glu Arg Val Gin
245 250 255
Ala Met Phe Lys Asp Leu Lys He Pro Leu Asn Thr He Asp Ala Lys
260 265 270
Glu Val Phe Leu Ser Lys Leu Lys Gly Val Ser Glu Pro Glu Leu Lys
275 280 285
Arg Lys He He Gly Glu Thr Phe He Glu Val Phe Glu Lys Glu Ala
290 295 300
Lys Lys His His Leu Lys Gly Lys He Glu Phe Leu Ala Gin Gly Thr 305 310 315 320
Leu Tyr Pro Asp Val He Glu Ser Val Ser Val Lys Gly Pro Ser Lys
325 330 335
Val He Lys Thr His His Asn Val Gly Gly Leu Pro Glu Trp Met Asp
340 345 350
Phe Lys Leu He Glu Pro Leu Arg Glu Leu Phe Lys Asp Glu Val Arg
355 360 365
Leu Leu Gly Lys Glu Leu Gly Val Ser Gin Asp Phe Leu Met Arg His
370 375 380
Pro Phe Pro Gly Pro Gly Leu Ala Val Arg He Leu Gly Glu He Ser 385 390 395 400
Glu Ser Lys He Lys Arg Leu Gin Glu Ala Asp Phe He Phe He Glu
405 410 415
Glu Leu Lys Lys Ala Asn Leu Tyr Asp Lys Val Trp Gin Ala Phe Cys
420 425 430
Val Leu Leu Asn Val Asn Ser Val Gly Val Met Gly Asp Asn Arg Thr
435 440 445
Tyr Glu Asn Ala He Cys Leu Arg Ala Val Asn Ala Ser Asp Gly Met
450 455 460
Thr Ala Ser Phe Ser Phe Leu Glu His Ser Phe Leu Glu Lys Val Ser 465 470 475 480
Asn Arg He Thr Asn Glu Val Ser Gly He Asn Arg Val Val Tyr Asp 485 490 495
He Thr Ser Lys Pro Pro Gly Thr He Glu Trp Glu 500 505
(2) INFORMATION FOR SEQ ID NO: 1305:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 834 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 26...808 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1305:
AAAAACCCCC ACGCTATGGT AAATC ATG CTC ATT TGT AAC GAT AAA TCC AAT 52
Met Leu He Cys Asn Asp Lys Ser Asn 1 5
CCA AAA ACC CTT TTA GAA GAA ATC ATG GCG TTA AGG CCA TGG CGT AAA 100 Pro Lys Thr Leu Leu Glu Glu He Met Ala Leu Arg Pro Trp Arg Lys 10 15 20 25
GGC CCT TTT GAA ATT TCT CAA ATC AAG ATT GAT AGC GAA TGG GAT AGC 148 Gly Pro Phe Glu He Ser Gin He Lys He Asp Ser Glu Trp Asp Ser 30 35 40
TCC ATT AAA TGG GAT CTA GTT AAA AAC GCC ACT CCT TTA AAA GAT AAG 196 Ser He Lys Trp Asp Leu Val Lys Asn Ala Thr Pro Leu Lys Asp Lys 45 50 55
GTT GTG GCT GAT GTG GGT TGC AAT AAC GGC TAT TAC TTG TTT AAA ATG 244 Val Val Ala Asp Val Gly Cys Asn Asn Gly Tyr Tyr Leu Phe Lys Met 60 65 70
CTA GAA CAT GGG CCT AAA AGT TTG GTG GGG TTT GAT CCG GGC GTT TTA 292 Leu Glu His Gly Pro Lys Ser Leu Val Gly Phe Asp Pro Gly Val Leu 75 80 85
GTC AAA AAA CAA TTT GAA TTT TTA GCC CCC TTT TTT GAT AAA GAA AAA 340 Val Lys Lys Gin Phe Glu Phe Leu Ala Pro Phe Phe Asp Lys Glu Lys 90 95 100 105
AAA ATC ATT TAT GAG TCT TTG GGG GTA GAG GAT TTG CAT GAA AAA TAC 388 Lys He He Tyr Glu Ser Leu Gly Val Glu Asp Leu His Glu Lys Tyr 110 115 120 CCT AAC GCT TTT GAT GTC ATT TTT TGC TTA GGG GTG CTA TAC CAC AGA 436 Pro Asn Ala Phe Asp Val He Phe Cys Leu Gly Val Leu Tyr His Arg 125 130 135
AAA AGC CCG CTA GAG GCT TTA AAA GCC TTG TAT CAC GCT TTG AAA ATA 484 Lys Ser Pro Leu Glu Ala Leu Lys Ala Leu Tyr His Ala Leu Lys He 140 145 150
AAA GGG GAG CTG GTG TTG GAT ACC TTA ATC ATT GAT TCG CCC TTA GAC 532 Lys Gly Glu Leu Val Leu Asp Thr Leu He He Asp Ser Pro Leu Asp 155 160 165
ATC GCC CTT TGC CCT AAA AAA ACT TAT GCT AAA ATG AAA AAT GTT TAT 580 He Ala Leu Cys Pro Lys Lys Thr Tyr Ala Lys Met Lys Asn Val Tyr 170 175 180 185
TTT ATC CCC AGT GTT AGC GCG TTA AAA GGG TGG TGC GAA AGG GTA GGG 628 Phe He Pro Ser Val Ser Ala Leu Lys Gly Trp Cys Glu Arg Val Gly 190 195 200
TTT GAA AAT TTT GAG ATT CTT AGC GTT TTA AAG ACC ACG CCT AAA GAA 676 Phe Glu Asn Phe Glu He Leu Ser Val Leu Lys Thr Thr Pro Lys Glu 205 210 215
CAG CGT AAA ACG GAT TTT ATT TTG GGG CAG AGT TTG GAA GAT TTT TTG 724 Gin Arg Lys Thr Asp Phe He Leu Gly Gin Ser Leu Glu Asp Phe Leu 220 225 230
GAT AAA ACA GAT CCC TCT AAA ACT TTA GAG GGG TAT GAC GCC CCT TTA 772 Asp Lys Thr Asp Pro Ser Lys Thr Leu Glu Gly Tyr Asp Ala Pro Leu 235 240 245
AGG GGG TAT TTT AAA ATG CTT AAA CCA AGC AAG CGT TAAATAAAGG ATTAAG 824 Arg Gly Tyr Phe Lys Met Leu Lys Pro Ser Lys Arg 250 255 260
ATAGTGCAAG 834
(2) INFORMATION FOR SEQ ID NO: 1306:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 261 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1306:
Met Leu He Cys Asn Asp Lys Ser Asn Pro Lys Thr Leu Leu Glu Glu
1 5 10 15
He Met Ala Leu Arg Pro Trp Arg Lys Gly Pro Phe Glu He Ser Gin
20 25 30
He Lys He Asp Ser Glu Trp Asp Ser Ser He Lys Trp Asp Leu Val 35 40 45
Lys Asn Ala Thr Pro Leu Lys Asp Lys Val Val Ala Asp Val Gly Cys
50 55 60
Asn Asn Gly Tyr Tyr Leu Phe Lys Met Leu Glu His Gly Pro Lys Ser 65 70 75 80
Leu Val Gly Phe Asp Pro Gly Val Leu Val Lys Lys Gin Phe Glu Phe
85 90 95
Leu Ala Pro Phe Phe Asp Lys Glu Lys Lys He He Tyr Glu Ser Leu
100 105 110
Gly Val Glu Asp Leu His Glu Lys Tyr Pro Asn Ala Phe Asp Val He
115 120 125
Phe Cys Leu Gly Val Leu Tyr His Arg Lys Ser Pro Leu Glu Ala Leu
130 135 140
Lys Ala Leu Tyr His Ala Leu Lys He Lys Gly Glu Leu Val Leu Asp 145 150 155 160
Thr Leu He He Asp Ser Pro Leu Asp He Ala Leu Cys Pro Lys Lys
165 170 175
Thr Tyr Ala Lys Met Lys Asn Val Tyr Phe He Pro Ser Val Ser Ala
180 185 190
Leu Lys Gly Trp Cys Glu Arg Val Gly Phe Glu Asn Phe Glu He Leu
195 200 205
Ser Val Leu Lys Thr Thr Pro Lys Glu Gin Arg Lys Thr Asp Phe He
210 215 220
Leu Gly Gin Ser Leu Glu Asp Phe Leu Asp Lys Thr Asp Pro Ser Lys 225 230 235 240
Thr Leu Glu Gly Tyr Asp Ala Pro Leu Arg Gly Tyr Phe Lys Met Leu
245 250 255
Lys Pro Ser Lys Arg 260
(2) INFORMATION FOR SEQ ID NO: 1307:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1224 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...1197 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1307:
TTAAATTCAA TTTTAAAGAA GAGTAGTTAA ATG GTT ATT GTT TTA GTC GTG GAT 54
Met Val He Val Leu Val Val Asp
1 5
AGT TTT AAA GAC ACC AGT AAT GGC ACT TCT ATG ACA GCG TTT CGT TTT 102 Ser Phe Lys Asp Thr Ser Asn Gly Thr Ser Met Thr Ala Phe Arg Phe 10 15 20
TTT GAA GCG CTG AAA AAA AGA GGG CAT GTG ATG AGA GTG GTC GCC CCT 150 Phe Glu Ala Leu Lys Lys Arg Gly His Val Met Arg Val Val Ala Pro 25 30 35 40
CAT GTG GAT AAT TTA GGG AGT GAA GAA GAG GGG TAT TAC AAC CTT AAA 198 His Val Asp Asn Leu Gly Ser Glu Glu Glu Gly Tyr Tyr Asn Leu Lys 45 50 55
GAG CGC TAC ATC CCC CTA GTT ACA GAA ATT TCA CAC AAA CAA CAC ATC 246 Glu Arg Tyr He Pro Leu Val Thr Glu He Ser His Lys Gin His He 60 65 70
CTT TTT GCT AAA CCC GAT GAA AAA ATC TTA AGA AAG GCT TTT AAG GGA 294 Leu Phe Ala Lys Pro Asp Glu Lys He Leu Arg Lys Ala Phe Lys Gly 75 80 85
GCG GAT ATG ATC CAT ACT TAT TTG CCT TTT TTG CTA GAA AAA ACA GCC 342 Ala Asp Met He His Thr Tyr Leu Pro Phe Leu Leu Glu Lys Thr Ala 90 95 100
GTA AAA ATC GCG CGA GAA ATG CAA GTG CCT TAT ATT GGC TCT TTC CAT 390 Val Lys He Ala Arg Glu Met Gin Val Pro Tyr He Gly Ser Phe His 105 110 115 120
TTA CAG CCA GAG CAT ATT TCT TAT AAC ATG AAA TTG GGG TGG TTT TCT 438 Leu Gin Pro Glu His He Ser Tyr Asn Met Lys Leu Gly Trp Phe Ser 125 130 135
TGG TTC AAC ATG ATG CTT TTT TCG TGG TTT AAA TCT TCG CAT TAC CGC 486 Trp Phe Asn Met Met Leu Phe Ser Trp Phe Lys Ser Ser His Tyr Arg 140 145 150
TAT ATC CAC CAT ATC CAT TGC CCG TCA AAA TTC ATT GTA GAA GAA TTA 534 Tyr He His His He His Cys Pro Ser Lys Phe He Val Glu Glu Leu 155 160 165
GAA AAA TAC AAC TAT GGA GGG AAA AAA TAC GCT ATT TCT AAC GGC TTT 582 Glu Lys Tyr Asn Tyr Gly Gly Lys Lys Tyr Ala He Ser Asn Gly Phe 170 175 180
GAT CCC ATG TTT AGA TTT GAA CAC CCG CAA AAA AGC CTT TTT GAC ACC 630 Asp Pro Met Phe Arg Phe Glu His Pro Gin Lys Ser Leu Phe Asp Thr 185 190 195 200
ACA CCC TTT AAA ATC GCT ATG GTA GGA CGC TAT TCT AAT GAA AAA AAT 678 Thr Pro Phe Lys He Ala Met Val Gly Arg Tyr Ser Asn Glu Lys Asn 205 210 215
CAA AGC GTT TTA ATC AAA GCG GTT GCT TTA AGC AAA TAC AAA CAA GAT 726 Gin Ser Val Leu He Lys Ala Val Ala Leu Ser Lys Tyr Lys Gin Asp 220 225 230
ATT GTA TTA TTG CTC AAA GGC AAA GGG CCT GAT GAG AAA AAA ATC AAA 774 He Val Leu Leu Leu Lys Gly Lys Gly Pro Asp Glu Lys Lys He Lys 235 240 245
CTT TTA GCC CAA AAA CTA GGC GTA AAA GCG GAG TTT GGG TTT GTC AAT 822 Leu Leu Ala Gin Lys Leu Gly Val Lys Ala Glu Phe Gly Phe Val Asn 250 255 260
TCC AAT GAA TTG TTA GAG ATC TTA AAA ACT TGC ACC CTT TAT GTG CAT 870 Ser Asn Glu Leu Leu Glu He Leu Lys Thr Cys Thr Leu Tyr Val His 265 270 275 280
GCA GCC AAT GTG GAA AGC GAA GCG ATT GCG TGC TTA GAG GCC ATT AGC 918 Ala Ala Asn Val Glu Ser Glu Ala He Ala Cys Leu Glu Ala He Ser 285 290 295
GTG GGG ATT GTG CCT GTT ATC GCT AAT AGC CCT TTA AGC GCG ACC AGG 966 Val Gly He Val Pro Val He Ala Asn Ser Pro Leu Ser Ala Thr Arg 300 305 310
CAA TTT GCG CTA GAT GAA CGA TCG CTA TTT GAA CCT AAT AAC GCT AAA 1014 Gin Phe Ala Leu Asp Glu Arg Ser Leu Phe Glu Pro Asn Asn Ala Lys 315 320 325
GAT TTG AGC GCT AAA ATA GAT TGG TGG TTA GAA AAC AAG CTT GAA AGA 1062 Asp Leu Ser Ala Lys He Asp Trp Trp Leu Glu Asn Lys Leu Glu Arg 330 335 340
GAA AGG ATG CAA AAC GAA TAC GCT AAA AGC GCT TTA AAT TAC ACT TTA 1110 Glu Arg Met Gin Asn Glu Tyr Ala Lys Ser Ala Leu Asn Tyr Thr Leu 345 350 355 360
GAA AAT TCA GTC ATT CAA ATT GAA AAA GTT TAT GAA GAA GCG ATC AGA 1158 Glu Asn Ser Val He Gin He Glu Lys Val Tyr Glu Glu Ala He Arg 365 370 375
GAT TTT AAA AAT AAC CCC CAT CTC TTT AAA ACC TTA TCA TAATGAAAGG AT 1209 Asp Phe Lys Asn Asn Pro His Leu Phe Lys Thr Leu Ser 380 385
AAAAAATGCA AGAAG 1224
(2) INFORMATION FOR SEQ ID NO: 1308:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 389 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1308:
Met Val He Val Leu Val Val Asp Ser Phe Lys Asp Thr Ser Asn Gly 1 5 10 15 Thr Ser Met Thr Ala Phe Arg Phe Phe Glu Ala Leu Lys Lys Arg Gly
20 25 30
His Val Met Arg Val Val Ala Pro His Val Asp Asn Leu Gly Ser Glu
35 40 45
Glu Glu Gly Tyr Tyr Asn Leu Lys Glu Arg Tyr He Pro Leu Val Thr
50 55 60
Glu He Ser His Lys Gin His He Leu Phe Ala Lys Pro Asp Glu Lys 65 70 75 80
He Leu Arg Lys Ala Phe Lys Gly Ala Asp Met He His Thr Tyr Leu
85 90 95
Pro Phe Leu Leu Glu Lys Thr Ala Val Lys He Ala Arg Glu Met Gin
100 105 110
Val Pro Tyr He Gly Ser Phe His Leu Gin Pro Glu His He Ser Tyr
115 120 125
Asn Met Lys Leu Gly Trp Phe Ser Trp Phe Asn Met Met Leu Phe Ser
130 135 140
Trp Phe Lys Ser Ser His Tyr Arg Tyr He His His He His Cys Pro 145 150 155 160
Ser Lys Phe He Val Glu Glu Leu Glu Lys Tyr Asn Tyr Gly Gly Lys
165 170 175
Lys Tyr Ala He Ser Asn Gly Phe Asp Pro Met Phe Arg Phe Glu His
180 185 190
Pro Gin Lys Ser Leu Phe Asp Thr Thr Pro Phe Lys He Ala Met Val
195 200 205
Gly Arg Tyr Ser Asn Glu Lys Asn Gin Ser Val Leu He Lys Ala Val
210 215 220
Ala Leu Ser Lys Tyr Lys Gin Asp He Val Leu Leu Leu Lys Gly Lys 225 230 235 240
Gly Pro Asp Glu Lys Lys He Lys Leu Leu Ala Gin Lys Leu Gly Val
245 250 255
Lys Ala Glu Phe Gly Phe Val Asn Ser Asn Glu Leu Leu Glu He Leu
260 265 270
Lys Thr Cys Thr Leu Tyr Val His Ala Ala Asn Val Glu Ser Glu Ala
275 280 285
He Ala Cys Leu Glu Ala He Ser Val Gly He Val Pro Val He Ala
290 295 300
Asn Ser Pro Leu Ser Ala Thr Arg Gin Phe Ala Leu Asp Glu Arg Ser 305 310 315 320
Leu Phe Glu Pro Asn Asn Ala Lys Asp Leu Ser Ala Lys He Asp Trp
325 330 335
Trp Leu Glu Asn Lys Leu Glu Arg Glu Arg Met Gin Asn Glu Tyr Ala
340 345 350
Lys Ser Ala Leu Asn Tyr Thr Leu Glu Asn Ser Val He Gin He Glu
355 360 365
Lys Val Tyr Glu Glu Ala He Arg Asp Phe Lys Asn Asn Pro His Leu
370 375 380
Phe Lys Thr Leu Ser 385
(2) INFORMATION FOR SEQ ID NO: 1309:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 947 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...903 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1309:
TAGAAAAGGT AGTTT ATG GAG TTA GAA GAA ATT GTT GAT AGT GAG AGG AAT 51 Met Glu Leu Glu Glu He Val Asp Ser Glu Arg Asn 1 5 10
ATC CAT AAG ACT ATA GAA GTT TTA GGA AAA GGC GGA CAG GGT ATA GTG 99 He His Lys Thr He Glu Val Leu Gly Lys Gly Gly Gin Gly He Val 15 20 25
TAT CGC TGT TTG GAT AAG GAT GTG GCT ATT AAG GTA GTA TTG AGG GAT 147 Tyr Arg Cys Leu Asp Lys Asp Val Ala He Lys Val Val Leu Arg Asp 30 35 40
GGA GAT TTT ATT AAA GAC AAA GAA TCC CTC AAA CAA TAT GAA AAA AGC 195 Gly Asp Phe He Lys Asp Lys Glu Ser Leu Lys Gin Tyr Glu Lys Ser 45 50 55 60
GTT CTA AAC TTA TCT TTT AAG CCG ATA GAG AGT CAT TTC CCT ATG TCA 243 Val Leu Asn Leu Ser Phe Lys Pro He Glu Ser His Phe Pro Met Ser 65 70 75
ATT CCA CTG GTA ACT TTG AAA GAA AAA CAA GGC TAT GTG ATG AAA ATG 291 He Pro Leu Val Thr Leu Lys Glu Lys Gin Gly Tyr Val Met Lys Met 80 85 90
GCT GAG GGC TAT GAA CCA CTA AAA ACT TTT TTA AAG AAG CCC AGC ATT 339 Ala Glu Gly Tyr Glu Pro Leu Lys Thr Phe Leu Lys Lys Pro Ser He 95 100 105
TTA GAA AAC GAA GAA AAA GAT GGG ATT TTT AGG ATC AAT AAT GCC ATT 387 Leu Glu Asn Glu Glu Lys Asp Gly He Phe Arg He Asn Asn Ala He 110 115 120
CAA GAA CTT TGC AAA GAT AAC CAT TAT ATG ACT TTA AGT TTA AGT TAT 435 Gin Glu Leu Cys Lys Asp Asn His Tyr Met Thr Leu Ser Leu Ser Tyr 125 130 135 140
TAC TCA CAA ACA CAA GGA TTG AGA TCA CGA TTA AAA ATA CTC ACC CAT 483 Tyr Ser Gin Thr Gin Gly Leu Arg Ser Arg Leu Lys He Leu Thr His 145 150 155
TTA GCA AAA CTT CTA TTC AGA TTG CAA AGT AAG GGT TTG GTG TAT GGG 531 Leu Ala Lys Leu Leu Phe Arg Leu Gin Ser Lys Gly Leu Val Tyr Gly 160 165 170
GAC TTG AAT TTA AAC AAT GTT TTT TAT AAA GAC AAT TCA GCG TTT TTA 579 Asp Leu Asn Leu Asn Asn Val Phe Tyr Lys Asp Asn Ser Ala Phe Leu 175 180 185
ATT GAT GCG GAT AAT GTG CGT TAT GAG AGC GAA AAA GCC CTG TGT GTT 627 He Asp Ala Asp Asn Val Arg Tyr Glu Ser Glu Lys Ala Leu Cys Val 190 195 200
ATT TTT ACG CCT AAC TAT GGG GCT TTA GAG ATT AGC CAA ACC TCT AAA 675 He Phe Thr Pro Asn Tyr Gly Ala Leu Glu He Ser Gin Thr Ser Lys 205 210 215 220
AAT AGC GAT ACA ACC AAT TAC AAC ACC ATG CTT AGC GAT ACC TTT TCT 723 Asn Ser Asp Thr Thr Asn Tyr Asn Thr Met Leu Ser Asp Thr Phe Ser 225 230 235
TTT GCT ATC ATA ACT TAT GAA CTT TTA AAT ATG GTT CAT CCT TTT GAT 771 Phe Ala He He Thr Tyr Glu Leu Leu Asn Met Val His Pro Phe Asp 240 245 250
GGG AAT AAG GCA GAT GAT AGT GTA GAA AAT TTT ATA GAA TTG CCT TGG 819 Gly Asn Lys Ala Asp Asp Ser Val Glu Asn Phe He Glu Leu Pro Trp 255 260 265
ATT GAA GAT AGA AAG GAT GAT AGC AAT CGT TCT TGT GGC TTA CTG CCT 867 He Glu Asp Arg Lys Asp Asp Ser Asn Arg Ser Cys Gly Leu Leu Pro 270 275 280
TTT TTC TTA ACA AGG GAT TTA AAA AAT TTA TTA GCG TAATGCTTTG AAGAAG 919 Phe Phe Leu Thr Arg Asp Leu Lys Asn Leu Leu Ala 285 290 295
GCAAAAAAGA TCCTTTGAAA CGCCCTAC 947
(2) INFORMATION FOR SEQ ID NO: 1310:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 296 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1310:
Met Glu Leu Glu Glu He Val Asp Ser Glu Arg Asn He His Lys Thr
1 5 10 15
He Glu Val Leu Gly Lys Gly Gly Gin Gly He Val Tyr Arg Cys Leu
20 25 30
Asp Lys Asp Val Ala He Lys Val Val Leu Arg Asp Gly Asp Phe He
35 40 45
Lys Asp Lys Glu Ser Leu Lys Gin Tyr Glu Lys Ser Val Leu Asn Leu 50 55 60
Ser Phe Lys Pro He Glu Ser His Phe Pro Met Ser He Pro Leu Val 65 70 75 80
Thr Leu Lys Glu Lys Gin Gly Tyr Val Met Lys Met Ala Glu Gly Tyr
85 90 95
Glu Pro Leu Lys Thr Phe Leu Lys Lys Pro Ser He Leu Glu Asn Glu
100 105 110
Glu Lys Asp Gly He Phe Arg He Asn Asn Ala He Gin Glu Leu Cys
115 120 125
Lys Asp Asn His Tyr Met Thr Leu Ser Leu Ser Tyr Tyr Ser Gin Thr
130 135 140
Gin Gly Leu Arg Ser Arg Leu Lys He Leu Thr His Leu Ala Lys Leu 145 150 155 160
Leu Phe Arg Leu Gin Ser Lys Gly Leu Val Tyr Gly Asp Leu Asn Leu
165 170 175
Asn Asn Val Phe Tyr Lys Asp Asn Ser Ala Phe Leu He Asp Ala Asp
180 185 190
Asn Val Arg Tyr Glu Ser Glu Lys Ala Leu Cys Val He Phe Thr Pro
195 200 205
Asn Tyr Gly Ala Leu Glu He Ser Gin Thr Ser Lys Asn Ser Asp Thr
210 215 220
Thr Asn Tyr Asn Thr Met Leu Ser Asp Thr Phe Ser Phe Ala He He 225 230 235 240
Thr Tyr Glu Leu Leu Asn Met Val His Pro Phe Asp Gly Asn Lys Ala
245 250 255
Asp Asp Ser Val Glu Asn Phe He Glu Leu Pro Trp He Glu Asp Arg
260 265 270
Lys Asp Asp Ser Asn Arg Ser Cys Gly Leu Leu Pro Phe Phe Leu Thr
275 280 285
Arg Asp Leu Lys Asn Leu Leu Ala 290 295
(2) INFORMATION FOR SEQ ID NO: 1311:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 509 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 44...469 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1311:
ATGCTTTGAA GAAGGCAAAA AAGATCCTTT GAAACGCCCT ACT ATG CCC TTA TTT 55
Met Pro Leu Phe 1 ATA GAG AGC TTA GAA AAA GCT AGC TTG CAA GTG TTA GAA TGT GAA AAT 103 He Glu Ser Leu Glu Lys Ala Ser Leu Gin Val Leu Glu Cys Glu Asn 5 10 15 20
TGT TCA ATG ACT TAT TAT GAT AGA GAT TAT AAT AGA GAA TGT GAG ATT 151 Cys Ser Met Thr Tyr Tyr Asp Arg Asp Tyr Asn Arg Glu Cys Glu He 25 30 35
TGC CCT TAT TGC GAT GCT AAA AAA CCT GTC AGA CTT GTA GCA ACA AGT 199 Cys Pro Tyr Cys Asp Ala Lys Lys Pro Val Arg Leu Val Ala Thr Ser 40 45 50
TAT TAC CAA AAG AGC GAA GTT TTT TAT TTT GTC TCG AAT TTT ACA GAC 247 Tyr Tyr Gin Lys Ser Glu Val Phe Tyr Phe Val Ser Asn Phe Thr Asp 55 60 65
CCT ATT TTT TTA CCG ACA ACC TTA TTT AAG GGG ATT GAA GTG GTT AAA 295 Pro He Phe Leu Pro Thr Thr Leu Phe Lys Gly He Glu Val Val Lys 70 75 80
AGC GAA TGG GAG TTT GCA GAG ATT GCT AAT AAT ATA TTG ATT TTT CAT 343 Ser Glu Trp Glu Phe Ala Glu He Ala Asn Asn He Leu He Phe His 85 90 95 100
CAT GAC ATA CAA CAA GAA AAG ATT CTC ATT AAT AAT AAA AGA TTG GAT 391 His Asp He Gin Gin Glu Lys He Leu He Asn Asn Lys Arg Leu Asp 105 110 115
CAC TAT AGG ATA GAA ATA GAT TTA GAA AAA GAA TTG ACT ATT TCA TAC 439 His Tyr Arg He Glu He Asp Leu Glu Lys Glu Leu Thr He Ser Tyr 120 125 130
AAT GGT TTT TTA ATT AAG GTT CAA AAA TGC TGAGTTTTAT CAAAGAAGAT AGC 492 Asn Gly Phe Leu He Lys Val Gin Lys Cys 135 140
ATCATCAAGG CTTATAA 509
(2) INFORMATION FOR SEQ ID NO: 1312:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 142 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1312:
Met Pro Leu Phe He Glu Ser Leu Glu Lys Ala Ser Leu Gin Val Leu
1 5 10 15
Glu Cys Glu Asn Cys Ser Met Thr Tyr Tyr Asp Arg Asp Tyr Asn Arg
20 25 30
Glu Cys Glu He Cys Pro Tyr Cys Asp Ala Lys Lys Pro Val Arg Leu 35 40 45
Val Ala Thr Ser Tyr Tyr Gin Lys Ser Glu Val Phe Tyr Phe Val Ser
50 55 60
Asn Phe Thr Asp Pro He Phe Leu Pro Thr Thr Leu Phe Lys Gly He 65 70 75 80
Glu Val Val Lys Ser Glu Trp Glu Phe Ala Glu He Ala Asn Asn He
85 90 95
Leu He Phe His His Asp He Gin Gin Glu Lys He Leu He Asn Asn
100 105 110
Lys Arg Leu Asp His Tyr Arg He Glu He Asp Leu Glu Lys Glu Leu
115 120 125
Thr He Ser Tyr Asn Gly Phe Leu He Lys Val Gin Lys Cys 130 135 140
(2) INFORMATION FOR SEQ ID NO:1313:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1260 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 27...1193 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1313:
AATGGTTTTT TAATTAAGGT TCAAAA ATG CTG AGT TTT ATC AAA GAA GAT AGC 53
Met Leu Ser Phe He Lys Glu Asp Ser 1 5
ATC ATC AAG GCT TAT AAC CTC AAT ACC GCA AAA CTA GAG CCA AAA GAT 101 He He Lys Ala Tyr Asn Leu Asn Thr Ala Lys Leu Glu Pro Lys Asp 10 15 20 25
AGA GAA AAA TTG GGA TTA TTA AAG ATT GAA AAA AAT AAA ATA TAT TTT 149 Arg Glu Lys Leu Gly Leu Leu Lys He Glu Lys Asn Lys He Tyr Phe 30 35 40
CAT CTA GAT GAA AAG CGT TAT TTG AAA TTA GAG ATC ATA GGC AAA ACC 197 His Leu Asp Glu Lys Arg Tyr Leu Lys Leu Glu He He Gly Lys Thr 45 50 55
AAA GAA AAA GAA ATT AAA AAC GCT TTT TGC AGT AAT GCT TTT CTT GCA 245 Lys Glu Lys Glu He Lys Asn Ala Phe Cys Ser Asn Ala Phe Leu Ala 60 65 70
GCT CAA GTC CTA AAT TTA AAC CAA GAA AGA CAA GTT TTA GAA TTG AAG 293 Ala Gin Val Leu Asn Leu Asn Gin Glu Arg Gin Val Leu Glu Leu Lys 75 80 85
TGC CAT TTC TTC AAG CAC CCT ATA AAA ATT CTT CCT GAA CCA TTA AAC 341 Cys His Phe Phe Lys His Pro He Lys He Leu Pro Glu Pro Leu Asn 90 95 100 105
ATT AAT TTC AAA GAC ACA ATC ATA AAA AAG TTA CTA AAA GAT ATG GGC 389 He Asn Phe Lys Asp Thr lie He Lys Lys Leu Leu Lys Asp Met Gly 110 115 120
AAA GAT AAA AAA ATA GAA GAT TTT AAA GAA ACT TGT ATT TTA AAA ATA 437 Lys Asp Lys Lys He Glu Asp Phe Lys Glu Thr Cys He Leu Lys He 125 130 135
GCT GGT TTT ACT TAT TTT GTG TGC GTA TTG CCT TAT GAA TAT GAG AAT 485 Ala Gly Phe Thr Tyr Phe Val Cys Val Leu Pro Tyr Glu Tyr Glu Asn 140 145 150
AAA GAG GAT AAA GAG AAT AGT GAA GAG ATT TTA AAA GAA GAT TTC AGG 533 Lys Glu Asp Lys Glu Asn Ser Glu Glu He Leu Lys Glu Asp Phe Arg 155 160 165
CTG TTA AAT ACC AAG GGG GGA TTA AGC GTT AAG CGT GCT TTG ATA AAT 581 Leu Leu Asn Thr Lys Gly Gly Leu Ser Val Lys Arg Ala Leu He Asn 170 175 180 185
AAC AGG CAT TCT TAT GAA GCG ATA AAA TTA AGA CCC ATT AAA CAA GAG 629 Asn Arg His Ser Tyr Glu Ala He Lys Leu Arg Pro He Lys Gin Glu 190 195 200
TTA GTG CCT GGT TTG TGT TTG TTT TTT CAA GGT TCA TTA GAA TTT AAT 677 Leu Val Pro Gly Leu Cys Leu Phe Phe Gin Gly Ser Leu Glu Phe Asn 205 210 215
GAT AAA ACC ACA AAA ACC ATG CGA ACG AGC CTT TTA GAC CAG ATC CAG 725 Asp Lys Thr Thr Lys Thr Met Arg Thr Ser Leu Leu Asp Gin He Gin 220 225 230
CAA GAT GAC AAA TCT TAT TTA AAA ATT TGG GAA AAA TAT CTC ATC AAA 773 Gin Asp Asp Lys Ser Tyr Leu Lys He Trp Glu Lys Tyr Leu He Lys 235 240 245
AGC GCT CAA AAA AGT TTT AAT GAG GCA AAA GAA GTG GGG GTT TTA GAG 821 Ser Ala Gin Lys Ser Phe Asn Glu Ala Lys Glu Val Gly Val Leu Glu 250 255 260 265
ATT GAA AGC GTG AGT AAA GAA GGA GGG AAT TTA AGA ATT CGT TTT AAG 869 He Glu Ser Val Ser Lys Glu Gly Gly Asn Leu Arg He Arg Phe Lys 270 275 280
CCA GCT TTA GGC AAG AAT AAA ATG GAA ATC TTA AAG AAA TCA CAA TTT 917 Pro Ala Leu Gly Lys Asn Lys Met Glu He Leu Lys Lys Ser Gin Phe 285 290 295
AAA AAG GGG AGT GAT TTA GGG GTT TTA GAG GAT TTA GAC CCA CAA AAT 965 Lys Lys Gly Ser Asp Leu Gly Val Leu Glu Asp Leu Asp Pro Gin Asn 300 305 310
GAA GAA AAT TTA ATC AAT CTT ATT TCT GAA CAA AAG AAA CAA ATT TCT 1013 Glu Glu Asn Leu He Asn Leu He Ser Glu Gin Lys Lys Gin He Ser 315 320 325
AAA AAC AAC AGC CAA TCA ATA ATG ATT GAA GAC ATT AGT GGG GAT GAT 1061 Lys Asn Asn Ser Gin Ser He Met He Glu Asp He Ser Gly Asp Asp 330 335 340 345
TTT ATT ATA GAT TAC GAT CTT TCC ATA AAA GAG GGC GAT GCT TTT CAT 1109 Phe He He Asp Tyr Asp Leu Ser He Lys Glu Gly Asp Ala Phe His 350 355 360
TTA AAT TAT ATG GGG GAT CTA AAT ACG CTT AAA AAA CAA TAT AGC GCA 1157 Leu Asn Tyr Met Gly Asp Leu Asn Thr Leu Lys Lys Gin Tyr Ser Ala 365 370 375
TTA GAT AAG ACA AAG AAA GGT TTG AAG CGC CAA TCC TAATTTAGGA TTAATT 1209 Leu Asp Lys Thr Lys Lys Gly Leu Lys Arg Gin Ser 380 385
TTAAACATTA AAGAGGATAA AGAGAATAGT GATAGCGATA ATGATACTGC A 1260
(2) INFORMATION FOR SEQ ID NO: 1314:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 389 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1314:
Met Leu Ser Phe He Lys Glu Asp Ser He He Lys Ala Tyr Asn Leu
1 5 10 15
Asn Thr Ala Lys Leu Glu Pro Lys Asp Arg Glu Lys Leu Gly Leu Leu
20 25 30
Lys He Glu Lys Asn Lys He Tyr Phe His Leu Asp Glu Lys Arg Tyr
35 40 45
Leu Lys Leu Glu He He Gly Lys Thr Lys Glu Lys Glu He Lys Asn
50 55 60
Ala Phe Cys Ser Asn Ala Phe Leu Ala Ala Gin Val Leu Asn Leu Asn 65 70 75 80
Gin Glu Arg Gin Val Leu Glu Leu Lys Cys His Phe Phe Lys His Pro
85 90 95
He Lys He Leu Pro Glu Pro Leu Asn He Asn Phe Lys Asp Thr He
100 105 110
He Lys Lys Leu Leu Lys Asp Met Gly Lys Asp Lys Lys He Glu Asp
115 120 125
Phe Lys Glu Thr Cys He Leu Lys He Ala Gly Phe Thr Tyr Phe Val 130 135 140 Cys Val Leu Pro Tyr Glu Tyr Glu Asn Lys Glu Asp Lys Glu Asn Ser 145 150 155 160
Glu Glu He Leu Lys Glu Asp Phe Arg Leu Leu Asn Thr Lys Gly Gly
165 170 175
Leu Ser Val Lys Arg Ala Leu He Asn Asn Arg His Ser Tyr Glu Ala
180 185 190
He Lys Leu Arg Pro He Lys Gin Glu Leu Val Pro Gly Leu Cys Leu
195 200 205
Phe Phe Gin Gly Ser Leu Glu Phe Asn Asp Lys Thr Thr Lys Thr Met
210 215 220
Arg Thr Ser Leu Leu Asp Gin He Gin Gin Asp Asp Lys Ser Tyr Leu 225 230 235 240
Lys He Trp Glu Lys Tyr Leu He Lys Ser Ala Gin Lys Ser Phe Asn
245 250 255
Glu Ala Lys Glu Val Gly Val Leu Glu He Glu Ser Val Ser Lys Glu
260 265 270
Gly Gly Asn Leu Arg He Arg Phe Lys Pro Ala Leu Gly Lys Asn Lys
275 280 285
Met Glu He Leu Lys Lys Ser Gin Phe Lys Lys Gly Ser Asp Leu Gly
290 295 300
Val Leu Glu Asp Leu Asp Pro Gin Asn Glu Glu Asn Leu He Asn Leu 305 310 315 320
He Ser Glu Gin Lys Lys Gin He Ser Lys Asn Asn Ser Gin Ser He
325 330 335
Met He Glu Asp He Ser Gly Asp Asp Phe He He Asp Tyr Asp Leu
340 345 350
Ser He Lys Glu Gly Asp Ala Phe His Leu Asn Tyr Met Gly Asp Leu
355 360 365
Asn Thr Leu Lys Lys Gin Tyr Ser Ala Leu Asp Lys Thr Lys Lys Gly
370 375 380
Leu Lys Arg Gin Ser 385
(2) INFORMATION FOR SEQ ID NO: 1315:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1185 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 16...1113 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1315:
TCCATAGGCT AGTTG ATG TCA AAA AGA AGC GAA GTT TTA GAA CAA TTT CAT 51 Met Ser Lys Arg Ser Glu Val Leu Glu Gin Phe His 1 5 10 GGC GGT TTA AAA AAT TTA GAA TTA CAA ACT AAA AGA CGC ATG GGT TTG 99 Gly Gly Leu Lys Asn Leu Glu Leu Gin Thr Lys Arg Arg Met Gly Leu 15 20 25
TGG GGC GAT CCA AAA GAG AAT GAA GAA CAA ACT TTG TTT TTA GAA GAA 147 Trp Gly Asp Pro Lys Glu Asn Glu Glu Gin Thr Leu Phe Leu Glu Glu 30 35 40
ATT GAA AAT GAA TTA AAG CAA TTA GAA AAC AAA GAA AAT CTT AAA GCA 195 lie Glu Asn Glu Leu Lys Gin Leu Glu Asn Lys Glu Asn Leu Lys Ala 45 50 55 60
GAC AAC AAC ACA GAA TTT AAA GAA GAA AAT CAA GAC ACT AAA GAA AAC 243 Asp Asn Asn Thr Glu Phe Lys Glu Glu Asn Gin Asp Thr Lys Glu Asn 65 70 75
CAG CCT AAC GAT TTG TTT TCT TTG CCA TTG CCC ACT CAA ACC ACC ATC 291 Gin Pro Asn Asp Leu Phe Ser Leu Pro Leu Pro Thr Gin Thr Thr He 80 85 90
AAT GGA ATT AAA GAA TTT GTA GAA GAG CCT GTG ATA GAA ACA GAG AAA 339 Asn Gly He Lys Glu Phe Val Glu Glu Pro Val He Glu Thr Glu Lys 95 100 105
AAA GAA ACA TCC CAA AAT GAG CCA ATC CAA GAA AAA AAA GAA AGA ATT 387 Lys Glu Thr Ser Gin Asn Glu Pro He Gin Glu Lys Lys Glu Arg He 110 115 120
TTT AAA AAC TTT TTC TCC AGA ATA GGC TTT GAT AAA AGT ATT GCC CCT 435 Phe Lys Asn Phe Phe Ser Arg He Gly Phe Asp Lys Ser He Ala Pro 125 130 135 140
ACA ATG CTT TTT GAA GAA GTG AGA GAT GCA AGC GTT ATC TAT CAT TTA 483 Thr Met Leu Phe Glu Glu Val Arg Asp Ala Ser Val He Tyr His Leu 145 150 155
GAG AAA AAA TTA GGC GAT TAT ATC TTT TAT GTA GCG TGT TTC TTC TTT 531 Glu Lys Lys Leu Gly Asp Tyr He Phe Tyr Val Ala Cys Phe Phe Phe 160 165 170
GGC ACA ACG GCA TTG CTT ATT ATC TTA CTG ACT ATT CTG TTG CCC TTA 579 Gly Thr Thr Ala Leu Leu He He Leu Leu Thr He Leu Leu Pro Leu 175 180 185
AAA CAA AAA GAG CCG TAT TTA GTG CAA TTT TCT AAC AAT AAA GAA AAT 627 Lys Gin Lys Glu Pro Tyr Leu Val Gin Phe Ser Asn Asn Lys Glu Asn 190 195 200
TTT GCT TTA GTT CAA AAG GCA GAT AGC AGC ATT ACA GCC AAT AAA GCT 675 Phe Ala Leu Val Gin Lys Ala Asp Ser Ser He Thr Ala Asn Lys Ala 205 210 215 220
CTT ATT CGT TCA TTA GTG GGA GCG TAT GTG CTA AAC AGG GAA AGC ATT 723 Leu He Arg Ser Leu Val Gly Ala Tyr Val Leu Asn Arg Glu Ser He 225 230 235 ACT CAT ATT GAG CAA CAT GAA AAA ATG CGT CAA AAC ACC ATT AAA GAG 771 Thr His He Glu Gin His Glu Lys Met Arg Gin Asn Thr He Lys Glu 240 245 250
CAA AGT TCC AAT GAA GTA TGG TAT GAA TTT GAA AAA CTC ATC GCT CAT 819 Gin Ser Ser Asn Glu Val Trp Tyr Glu Phe Glu Lys Leu He Ala His 255 260 265
TAT GAC AGC ATT TAC ACT AAT CCT TTA CTC ACA AGA AAA GTA AAG ATT 867 Tyr Asp Ser He Tyr Thr Asn Pro Leu Leu Thr Arg Lys Val Lys He 270 275 280
GCA AAT ATT TAC TTA GAT AAA GAT TTA GCC TAT ATT GAC ATT GAA GTG 915 Ala Asn He Tyr Leu Asp Lys Asp Leu Ala Tyr He Asp He Glu Val 285 290 295 300
AGC TTG TAT CAT AGT GGA GAA TTA GAG AGC TTG AAG CGC TAT AAA GTG 963 Ser Leu Tyr His Ser Gly Glu Leu Glu Ser Leu Lys Arg Tyr Lys Val 305 310 315
GTG ATG AGT TTT GAA TTT AAA AAA CAA GAA ATC AAT TTT GAC TCC ATG 1011 Val Met Ser Phe Glu Phe Lys Lys Gin Glu He Asn Phe Asp Ser Met 320 325 330
TCT TTA AAT CCT ACA GGC TTT ATG GTT ACA AGT TAT GAT GTA ACT GAA 1059 Ser Leu Asn Pro Thr Gly Phe Met Val Thr Ser Tyr Asp Val Thr Glu 335 340 345
ATT GCG ATT GTG AAT TAC CCA ACC GCT AAA GCG ATT GGG CTT TTT CTT 1107 He Ala He Val Asn Tyr Pro Thr Ala Lys Ala He Gly Leu Phe Leu 350 355 360
GCT TCA TAGCTCCATA ACTAGCTAGA TCCAATATGT TTCCATATTT AGAACTAACC CC 1165
Ala Ser
365
GTTAGAGGAA GCTCCACAAG 1185
(2) INFORMATION FOR SEQ ID NO: 1316:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 366 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1316:
Met Ser Lys Arg Ser Glu Val Leu Glu Gin Phe His Gly Gly Leu Lys
1 5 10 15
Asn Leu Glu Leu Gin Thr Lys Arg Arg Met Gly Leu Trp Gly Asp Pro
20 25 30
Lys Glu Asn Glu Glu Gin Thr Leu Phe Leu Glu Glu He Glu Asn Glu 35 40 45
Leu Lys Gin Leu Glu Asn Lys Glu Asn Leu Lys Ala Asp Asn Asn Thr
50 55 60
Glu Phe Lys Glu Glu Asn Gin Asp Thr Lys Glu Asn Gin Pro Asn Asp 65 70 75 80
Leu Phe Ser Leu Pro Leu Pro Thr Gin Thr Thr He Asn Gly He Lys
85 90 95
Glu Phe Val Glu Glu Pro Val He Glu Thr Glu Lys Lys Glu Thr Ser
100 105 110
Gin Asn Glu Pro He Gin Glu Lys Lys Glu Arg He Phe Lys Asn Phe
115 120 125
Phe Ser Arg He Gly Phe Asp Lys Ser He Ala Pro Thr Met Leu Phe
130 135 140
Glu Glu Val Arg Asp Ala Ser Val He Tyr His Leu Glu Lys Lys Leu 145 150 155 160
Gly Asp Tyr He Phe Tyr Val Ala Cys Phe Phe Phe Gly Thr Thr Ala
165 170 175
Leu Leu He He Leu Leu Thr He Leu Leu Pro Leu Lys Gin Lys Glu
180 185 190
Pro Tyr Leu Val Gin Phe Ser Asn Asn Lys Glu Asn Phe Ala Leu Val
195 200 205
Gin Lys Ala Asp Ser Ser He Thr Ala Asn Lys Ala Leu He Arg Ser
210 215 220
Leu Val Gly Ala Tyr Val Leu Asn Arg Glu Ser He Thr His He Glu 225 230 235 240
Gin His Glu Lys Met Arg Gin Asn Thr He Lys Glu Gin Ser Ser Asn
245 250 255
Glu Val Trp Tyr Glu Phe Glu Lys Leu He Ala His Tyr Asp Ser He
260 265 270
Tyr Thr Asn Pro Leu Leu Thr Arg Lys Val Lys He Ala Asn He Tyr
275 280 285
Leu Asp Lys Asp Leu Ala Tyr He Asp He Glu Val Ser Leu Tyr His
290 295 300
Ser Gly Glu Leu Glu Ser Leu Lys Arg Tyr Lys Val Val Met Ser Phe 305 310 315 320
Glu Phe Lys Lys Gin Glu He Asn Phe Asp Ser Met Ser Leu Asn Pro
325 330 335
Thr Gly Phe Met Val Thr Ser Tyr Asp Val Thr Glu He Ala He Val
340 345 350
Asn Tyr Pro Thr Ala Lys Ala He Gly Leu Phe Leu Ala Ser 355 360 365
(2) INFORMATION FOR SEQ ID NO: 1317:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 745 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 37...717 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1317:
TAAGGGGGGA TTTGGAAGGT GTTGGAATTG AAATTA ATG GGA CAG AAG AGG ATG 54
Met Gly Gin Lys Arg Met
1 5
AAT AAA TCA AAC AAA TTA GTC ATT ATC AAT CGC GCC ATT CCA GGT GGG 102 Asn Lys Ser Asn Lys Leu Val He He Asn Arg Ala He Pro Gly Gly 10 15 20
GGC AAG ACC TCT TTG ATC AAA CAG ATT GAA GAG TTG GCA AAA AGC TTG 150 Gly Lys Thr Ser Leu He Lys Gin He Glu Glu Leu Ala Lys Ser Leu 25 30 35
GGG CAT TCT ATT AGC GTT CAT TCT ACC GAT GAA TAT TTC ATC CAA ACA 198 Gly His Ser He Ser Val His Ser Thr Asp Glu Tyr Phe He Gin Thr 40 45 50
GAT GAA GAG GGT ATC AGG CAT TAT GTT GTT GAT AAA AAG AAA CTC AAT 246 Asp Glu Glu Gly He Arg His Tyr Val Val Asp Lys Lys Lys Leu Asn 55 60 65 70
GAA TAC CAC CAA AAC AAT CAA GAA GCC TTC AAA CAA GCT TTA GAA AAT 294 Glu Tyr His Gin Asn Asn Gin Glu Ala Phe Lys Gin Ala Leu Glu Asn 75 80 85
CGT ATA GAT ATT GTA GTG TGC GAT AAC ACC AAT TTT GAA TCG TGG CAA 342 Arg He Asp He Val Val Cys Asp Asn Thr Asn Phe Glu Ser Trp Gin 90 95 100
AGC AAA CCA TAT ACA GAT ATG GCT AGA GAA TTT GGC TAT AAA ATT TTG 390 Ser Lys Pro Tyr Thr Asp Met Ala Arg Glu Phe Gly Tyr Lys He Leu 105 110 115
TTG ATT GAT TTT AAG AAT AGA CAC TTA GAA ACC CCC ATG GAT TAT GGA 438 Leu He Asp Phe Lys Asn Arg His Leu Glu Thr Pro Met Asp Tyr Gly 120 125 130
TGG GAT GTT GCG CAA TGC ATC AAG AAG CCA CGA GGT ATT GCA AAG CAT 486 Trp Asp Val Ala Gin Cys He Lys Lys Pro Arg Gly He Ala Lys His 135 140 145 150
TAT GAC TAT GAT TTT TAT TTG GAG AGG GTT TTG GTT GAG CCA CAG GAT 534 Tyr Asp Tyr Asp Phe Tyr Leu Glu Arg Val Leu Val Glu Pro Gin Asp 155 160 165
TAT GAG AAA CAA AAT AGA GAG TTG AGC TTA AAA GCC TTA GAA TTT TTG 582 Tyr Glu Lys Gin Asn Arg Glu Leu Ser Leu Lys Ala Leu Glu Phe Leu 170 175 180
AAA TAC AAT TTT GAT TTT GAT GTG ATT TTT TAT TCT TTT GGG GAG CAA 630 Lys Tyr Asn Phe Asp Phe Asp Val He Phe Tyr Ser Phe Gly Glu Gin 185 190 195
TTA ATG CCT ATT CTT ACT AGA ATG TTA GTT TCT GTC TCT AAG TCT CAT 678 Leu Met Pro He Leu Thr Arg Met Leu Val Ser Val Ser Lys Ser His 200 205 210
AGA AAG AGA CTT GAA AAC TAT GGC AAA GAC ATT AAA ACC TAATTTAGAT AA 729 Arg Lys Arg Leu Glu Asn Tyr Gly Lys Asp He Lys Thr 215 220 225
AGATGAGTTA AACACA 745
(2) INFORMATION FOR SEQ ID NO: 1318:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 227 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1318:
Met Gly Gin Lys Arg Met Asn Lys Ser Asn Lys Leu Val He He Asn
1 5 10 15
Arg Ala He Pro Gly Gly Gly Lys Thr Ser Leu He Lys Gin He Glu
20 25 30
Glu Leu Ala Lys Ser Leu Gly His Ser He Ser Val His Ser Thr Asp
35 40 45
Glu Tyr Phe He Gin Thr Asp Glu Glu Gly He Arg His Tyr Val Val
50 55 60
Asp Lys Lys Lys Leu Asn Glu Tyr His Gin Asn Asn Gin Glu Ala Phe 65 70 75 80
Lys Gin Ala Leu Glu Asn Arg He Asp He Val Val Cys Asp Asn Thr
85 90 95
Asn Phe Glu Ser Trp Gin Ser Lys Pro Tyr Thr Asp Met Ala Arg Glu
100 105 110
Phe Gly Tyr Lys He Leu Leu He Asp Phe Lys Asn Arg His Leu Glu
115 120 125
Thr Pro Met Asp Tyr Gly Trp Asp Val Ala Gin Cys He Lys Lys Pro
130 135 140
Arg Gly He Ala Lys His Tyr Asp Tyr Asp Phe Tyr Leu Glu Arg Val 145 150 155 160
Leu Val Glu Pro Gin Asp Tyr Glu Lys Gin Asn Arg Glu Leu Ser Leu
165 170 175
Lys Ala Leu Glu Phe Leu Lys Tyr Asn Phe Asp Phe Asp Val He Phe
180 185 190
Tyr Ser Phe Gly Glu Gin Leu Met Pro He Leu Thr Arg Met Leu Val
195 200 205
Ser Val Ser Lys Ser His Arg Lys Arg Leu Glu Asn Tyr Gly Lys Asp
210 215 220
He Lys Thr 225 (2) INFORMATION FOR SEQ ID NO: 1319:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 531 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...468 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1319:
ATTTACTAAA GGAAAACA ATG ATT AAA CTA ATC TTA CAC AAG AAG TCC ATA 51
Met He Lys Leu He Leu His Lys Lys Ser He 1 5 10
CAA ATT GAT GAA ACA TTG CTG AAT GTA AAA GAG CAT TTA GAA AAG TTT 99 Gin He Asp Glu Thr Leu Leu Asn Val Lys Glu His Leu Glu Lys Phe 15 20 25
TAT TCA AAT AAA GAA CAA GAG ACA ATC GCT CAA ACT TTA GAG AAT GAA 147 Tyr Ser Asn Lys Glu Gin Glu Thr He Ala Gin Thr Leu Glu Asn Glu 30 35 40
ACA GAA ATT TCT TGT AGC TAT TTT TGG GAC AAA GAC TTC TTG TTG TTA 195 Thr Glu He Ser Cys Ser Tyr Phe Trp Asp Lys Asp Phe Leu Leu Leu 45 50 55
GAG CAA CTT TTA GAA AAT RAT TTA GGT CAT TTT ACC TTT GAG AGC GAG 243 Glu Gin Leu Leu Glu Asn Xaa Leu Gly His Phe Thr Phe Glu Ser Glu 60 65 70 75
TTT GCC CTA CTA AAA GAT AAA GAG ACT TTA AAC CTA TCT CAA ATC AAA 291 Phe Ala Leu Leu Lys Asp Lys Glu Thr Leu Asn Leu Ser Gin He Lys 80 85 90
CAA ATC GGT GTC TTA AAG GTT CTT ACC TAT GAR ATG ATA CAA ACC TTA 339 Gin He Gly Val Leu Lys Val Leu Thr Tyr Xaa Met He Gin Thr Leu 95 100 105
AAA AAT CAA ATC ATT CAT TTA GCA CAA GTT GTC AAT GAA GAA AAT TTA 387 Lys Asn Gin He He His Leu Ala Gin Val Val Asn Glu Glu Asn Leu 110 115 120
GAA AAA GAT GAA GAA CTT GTT GTC TAC CAC CTA AAT TTC ACG TCA CGC 435 Glu Lys Asp Glu Glu Leu Val Val Tyr His Leu Asn Phe Thr Ser Arg 125 130 135 AAC AAT CTT ACA AAA TAT TAT CCA AGT TCT GTG TGATTAAAAA AGAAAGAAAT 488 Asn Asn Leu Thr Lys Tyr Tyr Pro Ser Ser Val 140 145 150
ATCGCATGAA AAAATTAAGT CATTTTAGAA AGCTTATCGC CTT 531
(2) INFORMATION FOR SEQ ID NO:1320:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 150 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1320:
Met He Lys Leu He Leu His Lys Lys Ser He Gin He Asp Glu Thr
1 5 10 15
Leu Leu Asn Val Lys Glu His Leu Glu Lys Phe Tyr Ser Asn Lys Glu
20 25 30
Gin Glu Thr He Ala Gin Thr Leu Glu Asn Glu Thr Glu He Ser Cys
35 40 45
Ser Tyr Phe Trp Asp Lys Asp Phe Leu Leu Leu Glu Gin Leu Leu Glu
50 55 60
Asn Xaa Leu Gly His Phe Thr Phe Glu Ser Glu Phe Ala Leu Leu Lys 65 70 75 80
Asp Lys Glu Thr Leu Asn Leu Ser Gin He Lys Gin He Gly Val Leu
85 90 95
Lys Val Leu Thr Tyr Xaa Met He Gin Thr Leu Lys Asn Gin He He
100 105 110
His Leu Ala Gin Val Val Asn Glu Glu Asn Leu Glu Lys Asp Glu Glu
115 120 125
Leu Val Val Tyr His Leu Asn Phe Thr Ser Arg Asn Asn Leu Thr Lys
130 135 140
Tyr Tyr Pro Ser Ser Val 145 150
(2) INFORMATION FOR SEQ ID NO: 1321:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 334 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...294 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1321:
CTGGTTTATG AGTATTTTTT AAAAGAAGTC CCC ATG CAA TTA GTT GGT ATT TCA 54
Met Gin Leu Val Gly He Ser 1 5
GTT TCT AAT CTC AAA GAA ATC AGC TCC AAA GAA AAA TTT CTT TGG CTC 102 Val Ser Asn Leu Lys Glu He Ser Ser Lys Glu Lys Phe Leu Trp Leu 10 15 20
AAT GCT AAG AGT TTT TTA CTC TCA GGA TTT GTG CCT TTT ATT ATG ATA 150 Asn Ala Lys Ser Phe Leu Leu Ser Gly Phe Val Pro Phe He Met He 25 30 35
CCT TGG CTA GAT ATA TTG AAC TCT TTT GTG CTT TAT GTG TGC TTT CTC 198 Pro Trp Leu Asp He Leu Asn Ser Phe Val Leu Tyr Val Cys Phe Leu 40 45 50 55
TTA ATT TTT AGC ATA GCG GAG TTC TTT GAT GAA GAT ATA AGT GAC ATT 246 Leu He Phe Ser He Ala Glu Phe Phe Asp Glu Asp He Ser Asp He 60 65 70
TTA ATC GCT CAT TCC AAA ATT AAA ACC AAA GCT AAT TCA TTT TAC GCT T 295 Leu He Ala His Ser Lys He Lys Thr Lys Ala Asn Ser Phe Tyr Ala 75 80 85
AAAAGGAAAA AATATGCAAA AAGAAGTCTT AGTAGAAAA 334
(2) INFORMATION FOR SEQ ID NO: 1322:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 87 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1322:
Met Gin Leu Val Gly He Ser Val Ser Asn Leu Lys Glu He Ser Ser
1 5 10 15
Lys Glu Lys Phe Leu Trp Leu Asn Ala Lys Ser Phe Leu Leu Ser Gly
20 25 30
Phe Val Pro Phe He Met He Pro Trp Leu Asp He Leu Asn Ser Phe
35 40 45
Val Leu Tyr Val Cys Phe Leu Leu He Phe Ser He Ala Glu Phe Phe
50 55 60
Asp Glu Asp He Ser Asp He Leu He Ala His Ser Lys He Lys Thr 65 70 75 80
Lys Ala Asn Ser Phe Tyr Ala 85
(2) INFORMATION FOR SEQ ID NO: 1323: (i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 995 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 37...948 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1323:
TAAAAACACC CCTAAAAGAA AAAGAAAGTC TTCTTA ATG TTA GAA AGC GCC CTT 54
Met Leu Glu Ser Ala Leu 1 5
AAA TAT TGC AAG GAA AAA GCC ATA GAC CTT TTA GTA GGG TTT GTG CCA 102 Lys Tyr Cys Lys Glu Lys Ala He Asp Leu Leu Val Gly Phe Val Pro 10 15 20
AAA ACC TAT TCT ATG GCA CAA GAG TGC AAT ATT TTA GGC TTG TAT GAT 150 Lys Thr Tyr Ser Met Ala Gin Glu Cys Asn He Leu Gly Leu Tyr Asp 25 30 35
GAT GCT TTC ATT ATT ACC AAA CAA GAA AAT CTA GTA GGC ATT ATA TCC 198 Asp Ala Phe He He Thr Lys Gin Glu Asn Leu Val Gly He He Ser 40 45 50
TTA CAA GGA CTA AGC TAT TCT AAT TTA ATG CAA AAA GAC TTA GAG GGC 246 Leu Gin Gly Leu Ser Tyr Ser Asn Leu Met Gin Lys Asp Leu Glu Gly 55 60 65 70
TAT TTT GAT GCT AGA CAA AAT GTT CTC AAC ACC ATT AGT AAA GAC ATT 294 Tyr Phe Asp Ala Arg Gin Asn Val Leu Asn Thr He Ser Lys Asp He 75 80 85
CAA TTA AGA ATT GTG GCT AAA AGG CGT AAG GAA TTT ATC AAT CAA AGT 342 Gin Leu Arg He Val Ala Lys Arg Arg Lys Glu Phe He Asn Gin Ser 90 95 100
CCA AAT ATT GAC AAT ATT TAT GCC AAA GCT ATT ATC ACA CAA TTT GAA 390 Pro Asn He Asp Asn He Tyr Ala Lys Ala He He Thr Gin Phe Glu 105 110 115
AGC AAG GGA ATC TAT AAA ACA GAG TAT TTT TTA GTG TTT GAA ACT ATC 438 Ser Lys Gly He Tyr Lys Thr Glu Tyr Phe Leu Val Phe Glu Thr He 120 125 130
ACT TCT AAT GTC AAG TCT TTC TTT GAA AAA AAG AAA TTG GAA ATG ACT 486 Thr Ser Asn Val Lys Ser Phe Phe Glu Lys Lys Lys Leu Glu Met Thr 135 140 145 150
ACT TCA ATT AAT GAA GAG TTA GAA GAA AGC TCT AAA GAA GAT AAA CAA 534 Thr Ser He Asn Glu Glu Leu Glu Glu Ser Ser Lys Glu Asp Lys Gin 155 160 165
GAG AAT GAA AAT MGC TCC AAT GAA ACT CAT TCA AAC ACA AGC TCT AAA 582 Glu Asn Glu Asn Xaa Ser Asn Glu Thr His Ser Asn Thr Ser Ser Lys 170 175 180
AAA GAC AAG AAA AAC AAG TTC AAA AAA AAG ATA ACC TTT AGC ACC AAA 630 Lys Asp Lys Lys Asn Lys Phe Lys Lys Lys He Thr Phe Ser Thr Lys 185 190 195
AGT AAA AGA GCC TTA CTC ATT CAA ACC ATA GAA AGA GTA AAA AAC GCT 678 Ser Lys Arg Ala Leu Leu He Gin Thr He Glu Arg Val Lys Asn Ala 200 205 210
CTT AAA GAA TTT AAA CCC ACT TTA CTA AAT TCT AAA GAA GTA TTA AAT 726 Leu Lys Glu Phe Lys Pro Thr Leu Leu Asn Ser Lys Glu Val Leu Asn 215 220 225 230
TTC TAC GCA GAA TAC ATC AAT GGC AAA TAC ATC GCC TTT AAT CCT AAA 774 Phe Tyr Ala Glu Tyr He Asn Gly Lys Tyr He Ala Phe Asn Pro Lys 235 240 245
TTA AAG CGA TTA AGC GAT ACT ATA TTG CAT CTA ATG TGC ATT TTA AGA 822 Leu Lys Arg Leu Ser Asp Thr He Leu His Leu Met Cys He Leu Arg 250 255 260
AAG ATT ACT TTG TCA TTG AAT TTC AAA ATC AAA ACA CCT TTT GTG CGT 870 Lys He Thr Leu Ser Leu Asn Phe Lys He Lys Thr Pro Phe Val Arg 265 270 275
GTG TGG GGA TTA AGG CTT ATG AGA GCG AAG AAA TTT CTT CGC TCC CTA 918 Val Trp Gly Leu Arg Leu Met Arg Ala Lys Lys Phe Leu Arg Ser Leu 280 285 290
TAT CTA CTC TTT TAC ACA CCC AAA TTG AAC TAGATTTAAT CTTTCATATC CGC 971 Tyr Leu Leu Phe Tyr Thr Pro Lys Leu Asn 295 300
TCTTTAGGGC AATTTGAAAG CCTG 995
(2) INFORMATION FOR SEQ ID NO: 1324:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 304 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1324: Met Leu Glu Ser Ala Leu Lys Tyr Cys Lys Glu Lys Ala He Asp Leu
1 5 10 15
Leu Val Gly Phe Val Pro Lys Thr Tyr Ser Met Ala Gin Glu Cys Asn
20 25 30
He Leu Gly Leu Tyr Asp Asp Ala Phe He He Thr Lys Gin Glu Asn
35 40 45
Leu Val Gly He He Ser Leu Gin Gly Leu Ser Tyr Ser Asn Leu Met
50 55 60
Gin Lys Asp Leu Glu Gly Tyr Phe Asp Ala Arg Gin Asn Val Leu Asn 65 70 75 80
Thr He Ser Lys Asp He Gin Leu Arg He Val Ala Lys Arg Arg Lys
85 90 95
Glu Phe He Asn Gin Ser Pro Asn He Asp Asn He Tyr Ala Lys Ala
100 105 110
He He Thr Gin Phe Glu Ser Lys Gly He Tyr Lys Thr Glu Tyr Phe
115 120 125
Leu Val Phe Glu Thr He Thr Ser Asn Val Lys Ser Phe Phe Glu Lys
130 135 140
Lys Lys Leu Glu Met Thr Thr Ser He Asn Glu Glu Leu Glu Glu Ser 145 150 155 160
Ser Lys Glu Asp Lys Gin Glu Asn Glu Asn Xaa Ser Asn Glu Thr His
165 170 175
Ser Asn Thr Ser Ser Lys Lys Asp Lys Lys Asn Lys Phe Lys Lys Lys
180 185 190
He Thr Phe Ser Thr Lys Ser Lys Arg Ala Leu Leu He Gin Thr He
195 200 205
Glu Arg Val Lys Asn Ala Leu Lys Glu Phe Lys Pro Thr Leu Leu Asn
210 215 220
Ser Lys Glu Val Leu Asn Phe Tyr Ala Glu Tyr He Asn Gly Lys Tyr 225 230 235 240
He Ala Phe Asn Pro Lys Leu Lys Arg Leu Ser Asp Thr He Leu His
245 250 255
Leu Met Cys He Leu Arg Lys He Thr Leu Ser Leu Asn Phe Lys He
260 265 270
Lys Thr Pro Phe Val Arg Val Trp Gly Leu Arg Leu Met Arg Ala Lys
275 280 285
Lys Phe Leu Arg Ser Leu Tyr Leu Leu Phe Tyr Thr Pro Lys Leu Asn 290 295 300
(2) INFORMATION FOR SEQ ID NO: 1325:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1598 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 39...1556 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1325:
ATAATTATAT AGAATTAGTG CAAGCCAATC GTTTGAGC ATG CAA GAG TGT GCT TTA 56
Met Gin Glu Cys Ala Leu 1 5
AAC TTA GTT ATA AGG GCT AAA AGT AAA GCT AAA TTA GAC AAG TCT TTA 104 Asn Leu Val He Arg Ala Lys Ser Lys Ala Lys Leu Asp Lys Ser Leu 10 15 20
AAA GAG ATT TTA TCC TTG CTT AAT AAT GCT GGA CTA GGC AGT GTT ACA 152 Lys Glu He Leu Ser Leu Leu Asn Asn Ala Gly Leu Gly Ser Val Thr 25 30 35
GAA ACT ATA GGG CTA AAA CCA TCT TAT TTT TCA TTC TTC CCA AAT AAC 200 Glu Thr He Gly Leu Lys Pro Ser Tyr Phe Ser Phe Phe Pro Asn Asn 40 45 50
GCC AAT ATC AAC CCT AGA ATG AGA CAT CAA ACT TCC CAA GTC ATA GCA 248 Ala Asn He Asn Pro Arg Met Arg His Gin Thr Ser Gin Val He Ala 55 60 65 70
TCT TTG ATT TTG TTT GAG AAA AAT AAT ACA GGT TTT AGA GCA AAT TCT 296 Ser Leu He Leu Phe Glu Lys Asn Asn Thr Gly Phe Arg Ala Asn Ser 75 80 85
TGG GGG GAT ATG CCC TTA TCT GTG TTT AAG AAC CTA GAC CAT AGC CCT 344 Trp Gly Asp Met Pro Leu Ser Val Phe Lys Asn Leu Asp His Ser Pro 90 95 100
TAT TTG TTT AAT TTT CAT AAT CAA GAA GTC AAA CAT AAG GGC GTG TTA 392 Tyr Leu Phe Asn Phe His Asn Gin Glu Val Lys His Lys Gly Val Leu 105 110 115
GCC CAC AAT GTC GCA CGA GTA GTG GGA CAT ACC ATG ATT ATA GGA GCA 440 Ala His Asn Val Ala Arg Val Val Gly His Thr Met He He Gly Ala 120 125 130
ACA GGT GCT GGT AAA ACC ACA CTC ATT AGC TAT TTG ATG ATG AGT GCC 488 Thr Gly Ala Gly Lys Thr Thr Leu He Ser Tyr Leu Met Met Ser Ala 135 140 145 150
TTA AAA TAT TCT AAC ATT GAT ATT TTA GCT CTT GAT AGA CTA AAT GGT 536 Leu Lys Tyr Ser Asn He Asp He Leu Ala Leu Asp Arg Leu Asn Gly 155 160 165
TTG TAT TCC TTT ACC AAG TAT TTT GAT GGG ATT TAT AAT CAA GGC GAA 584 Leu Tyr Ser Phe Thr Lys Tyr Phe Asp Gly He Tyr Asn Gin Gly Glu 170 175 180
AAC TTT CAT ATT AAC CCT TTT TCA TTA GAA GAT AGC GCA ACT AAT AGA 632 Asn Phe His He Asn Pro Phe Ser Leu Glu Asp Ser Ala Thr Asn Arg 185 190 195
GCC TTT TTA TTG CAT TTT TAT GCC CAA ATG GCA AAA GTG GAT AGT TAT 680 Ala Phe Leu Leu His Phe Tyr Ala Gin Met Ala Lys Val Asp Ser Tyr 200 205 210
GAT GAC CAT AAG GAT AAA GTA GAA GAT AGA ACA GCC CTT TTA AAT GCT 728 Asp Asp His Lys Asp Lys Val Glu Asp Arg Thr Ala Leu Leu Asn Ala 215 220 225 230
ATT GAT ACG ATG TAT AGA AAT TAT AAC GAT GAA GTC AAA CAA GCC AAA 776 He Asp Thr Met Tyr Arg Asn Tyr Asn Asp Glu Val Lys Gin Ala Lys 235 240 245
TTT AGC AAC CAA GAA TTA CCC CTT CCT TTT GAT TTA AAA GAG TTT GTC 824 Phe Ser Asn Gin Glu Leu Pro Leu Pro Phe Asp Leu Lys Glu Phe Val 250 255 260
AAT GCC ATT GCT AAA ACC AAT ACA GAC ATT TTA GAT AGT AGT TTT GAA 872 Asn Ala He Ala Lys Thr Asn Thr Asp He Leu Asp Ser Ser Phe Glu 265 270 275
GAC TAT TTA AAA TCT TCC TTA TTT TCT AGC CGA ATG GAT AGT CTA GAT 920 Asp Tyr Leu Lys Ser Ser Leu Phe Ser Ser Arg Met Asp Ser Leu Asp 280 285 290
TTT AAA ACT CGT ATT AGC ACC ATA AAT ACC GAT AGC ATT TTA CAT AAT 968 Phe Lys Thr Arg He Ser Thr He Asn Thr Asp Ser He Leu His Asn 295 300 305 310
GAT GAT GAC GCT GGG CTT TTA GCC TAC TAT GTC TTT CAT AAG ATG ATT 1016 Asp Asp Asp Ala Gly Leu Leu Ala Tyr Tyr Val Phe His Lys Met He 315 320 325
GAC AGA GCC TTA AAA ATC AAT CGT GGG TTT TTA TGC TTT ATT GAT GAG 1064 Asp Arg Ala Leu Lys He Asn Arg Gly Phe Leu Cys Phe He Asp Glu 330 335 340
TTT AAG TCT TAC GCT CAA AAT GAA ATG ATG AAT AAA AAA ATC AAT GAA 1112 Phe Lys Ser Tyr Ala Gin Asn Glu Met Met Asn Lys Lys He Asn Glu 345 350 355
ATC ATT ACT CAA GCT AGA AAG GCT AAT GGG GTG ATT GTT CTA GCC TTA 1160 He He Thr Gin Ala Arg Lys Ala Asn Gly Val He Val Leu Ala Leu 360 365 370
CAA GAC ATT AAC CAA CTA AGC GAA GTG AGA AAC GCT CAA AGC TTT ATA 1208 Gin Asp He Asn Gin Leu Ser Glu Val Arg Asn Ala Gin Ser Phe He 375 380 385 390
AAA AAT ATG GGG CAA TTG ATT TTG TAT CCC CAA AGA AAT ATT GAT ACC 1256 Lys Asn Met Gly Gin Leu He Leu Tyr Pro Gin Arg Asn He Asp Thr 395 400 405
AAA GAT TTA AAC GAT AAA TTT GGC ATT AGA CTA AGC GAT ACA GAA AAA 1304 Lys Asp Leu Asn Asp Lys Phe Gly He Arg Leu Ser Asp Thr Glu Lys 410 415 420 CAT TTT TTA GAA AAC ACC GCC GTT AAT GAA TAC AAA GTC TTA CTC AAA 1352 His Phe Leu Glu Asn Thr Ala Val Asn Glu Tyr Lys Val Leu Leu Lys 425 430 435
AAC ATG AAT GAT GGC TCA TCT AAC ATT ATA GAT GTG AGC CTA AGT TCT 1400 Asn Met Asn Asp Gly Ser Ser Asn He He Asp Val Ser Leu Ser Ser 440 445 450
TTG GGT AAT TAC CTA CAA ATC TTT AGC TCT AAT TCT AGC ATG GTA GAA 1448 Leu Gly Asn Tyr Leu Gin He Phe Ser Ser Asn Ser Ser Met Val Glu 455 460 465 470
CAC ATT GAT AAT CTC ATT AAG CAT TAC CCT AAA ACT TGG CGA GAA GTC 1496 His He Asp Asn Leu He Lys His Tyr Pro Lys Thr Trp Arg Glu Val 475 480 485
TTT GTG AGT AAC AAA CAC GAA AAT TTT GAT GAC AAA AAA CAC TTA GAA 1544 Phe Val Ser Asn Lys His Glu Asn Phe Asp Asp Lys Lys His Leu Glu 490 495 500
AAG GTG CTT AAA TGAAAAACAT CATGCGTTTA GTTTTTGTGA TAGTGGCTAT GT 159£ Lys Val Leu Lys 505
(2) INFORMATION FOR SEQ ID NO: 1326:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 506 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1326:
Met Gin Glu Cys Ala Leu Asn Leu Val He Arg Ala Lys Ser Lys Ala
1 5 10 15
Lys Leu Asp Lys Ser Leu Lys Glu He Leu Ser Leu Leu Asn Asn Ala
20 25 30
Gly Leu Gly Ser Val Thr Glu Thr He Gly Leu Lys Pro Ser Tyr Phe
35 40 45
Ser Phe Phe Pro Asn Asn Ala Asn He Asn Pro Arg Met Arg His Gin
50 55 60
Thr Ser Gin Val He Ala Ser Leu He Leu Phe Glu Lys Asn Asn Thr 65 70 75 80
Gly Phe Arg Ala Asn Ser Trp Gly Asp Met Pro Leu Ser Val Phe Lys
85 90 95
Asn Leu Asp His Ser Pro Tyr Leu Phe Asn Phe His Asn Gin Glu Val
100 105 110
Lys His Lys Gly Val Leu Ala His Asn Val Ala Arg Val Val Gly His
115 120 125
Thr Met He He Gly Ala Thr Gly Ala Gly Lys Thr Thr Leu He Ser 130 135 140 Tyr Leu Met Met Ser Ala Leu Lys Tyr Ser Asn He Asp He Leu Ala 145 150 155 160
Leu Asp Arg Leu Asn Gly Leu Tyr Ser Phe Thr Lys Tyr Phe Asp Gly
165 170 175
He Tyr Asn Gin Gly Glu Asn Phe His He Asn Pro Phe Ser Leu Glu
180 185 190
Asp Ser Ala Thr Asn Arg Ala Phe Leu Leu His Phe Tyr Ala Gin Met
195 200 205
Ala Lys Val Asp Ser Tyr Asp Asp His Lys Asp Lys Val Glu Asp Arg
210 215 220
Thr Ala Leu Leu Asn Ala He Asp Thr Met Tyr Arg Asn Tyr Asn Asp 225 230 235 240
Glu Val Lys Gin Ala Lys Phe Ser Asn Gin Glu Leu Pro Leu Pro Phe
245 250 255
Asp Leu Lys Glu Phe Val Asn Ala He Ala Lys Thr Asn Thr Asp He
260 265 270
Leu Asp Ser Ser Phe Glu Asp Tyr Leu Lys Ser Ser Leu Phe Ser Ser
275 280 285
Arg Met Asp Ser Leu Asp Phe Lys Thr Arg He Ser Thr He Asn Thr
290 295 300
Asp Ser He Leu His Asn Asp Asp Asp Ala Gly Leu Leu Ala Tyr Tyr 305 310 315 320
Val Phe His Lys Met He Asp Arg Ala Leu Lys He Asn Arg Gly Phe
325 330 335
Leu Cys Phe He Asp Glu Phe Lys Ser Tyr Ala Gin Asn Glu Met Met
340 345 350
Asn Lys Lys He Asn Glu He He Thr Gin Ala Arg Lys Ala Asn Gly
355 360 365
Val He Val Leu Ala Leu Gin Asp He Asn Gin Leu Ser Glu Val Arg
370 375 380
Asn Ala Gin Ser Phe He Lys Asn Met Gly Gin Leu He Leu Tyr Pro 385 390 395 400
Gin Arg Asn He Asp Thr Lys Asp Leu Asn Asp Lys Phe Gly He Arg
405 410 415
Leu Ser Asp Thr Glu Lys His Phe Leu Glu Asn Thr Ala Val Asn Glu
420 425 430
Tyr Lys Val Leu Leu Lys Asn Met Asn Asp Gly Ser Ser Asn He He
435 440 445
Asp Val Ser Leu Ser Ser Leu Gly Asn Tyr Leu Gin He Phe Ser Ser
450 455 460
Asn Ser Ser Met Val Glu His He Asp Asn Leu He Lys His Tyr Pro 465 470 475 480
Lys Thr Trp Arg Glu Val Phe Val Ser Asn Lys His Glu Asn Phe Asp
485 490 495
Asp Lys Lys His Leu Glu Lys Val Leu Lys 500 505
(2) INFORMATION FOR SEQ ID NO: 1327:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 563 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 24...509 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1327:
CAACTCTTTT TTAAGGGGGA CAC ATG TCT AAT TTG CAA GAA CTT AGA GAG CAT 53
Met Ser Asn Leu Gin Glu Leu Arg Glu His 1 5 10
TTA AAA GAA TTA GAA AAT TCC TTT GAA ATA GGC TCT TTT ACT AAA GAA 101 Leu Lys Glu Leu Glu Asn Ser Phe Glu He Gly Ser Phe Thr Lys Glu 15 20 25
AAT ATT AAA GAA TAC GCT AAA TGC TTT TTT ATG AGT TTA AGC ATG TTT 149 Asn He Lys Glu Tyr Ala Lys Cys Phe Phe Met Ser Leu Ser Met Phe 30 35 40
TTA GAA GAA CAA GAA AAA AAC CAA CAA GAA GAG TTT TTA GAA CAA GAT 197 Leu Glu Glu Gin Glu Lys Asn Gin Gin Glu Glu Phe Leu Glu Gin Asp 45 50 55
ACC AAA GAA AAT CAA GAA GAG CTC ATT AAA AAC ATT CAA ACA AGC ATT 245 Thr Lys Glu Asn Gin Glu Glu Leu He Lys Asn He Gin Thr Ser He 60 65 70
GCT AAA AAC CAA GAG TTA GAA AAA ATC TCT TTT GAA AAA TGG GAG AAT 293 Ala Lys Asn Gin Glu Leu Glu Lys He Ser Phe Glu Lys Trp Glu Asn 75 80 85 90
AAA ATT CAA GAA AGG GTT TTG CCT AAG TTA AAA CGC ATT GTT ACG CAT 341 Lys He Gin Glu Arg Val Leu Pro Lys Leu Lys Arg He Val Thr His 95 100 105
AAG TTG CAA GAA AGT ATC ACA TCT AGC ATA AAC ACG CAA TTA GAG AGT 389 Lys Leu Gin Glu Ser He Thr Ser Ser He Asn Thr Gin Leu Glu Ser 110 115 120
TTT AAA AAA GAT GAG TTA GAT TTA TCT AGC GTG TTT GAA ATC CAA AGA 437 Phe Lys Lys Asp Glu Leu Asp Leu Ser Ser Val Phe Glu He Gin Arg 125 130 135
AAG AAC ACT CAA ATA GCG TAT AGA TTA GCT ATA GGG GGG CTT ATA GGT 485 Lys Asn Thr Gin He Ala Tyr Arg Leu Ala He Gly Gly Leu He Gly 140 145 150
ATC ATT GCT TTA AGC TCG CAA ATT TGATTATTAA CTCTATACTT CACGCTTTTT 539 He He Ala Leu Ser Ser Gin He 155 160 AGCCTTTGTG TGTTCTTTTG TAAA 563
(2) INFORMATION FOR SEQ ID NO: 1328:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 162 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1328:
Met Ser Asn Leu Gin Glu Leu Arg Glu His Leu Lys Glu Leu Glu Asn
1 5 10 15
Ser Phe Glu He Gly Ser Phe Thr Lys Glu Asn He Lys Glu Tyr Ala
20 25 30
Lys Cys Phe Phe Met Ser Leu Ser Met Phe Leu Glu Glu Gin Glu Lys
35 40 45
Asn Gin Gin Glu Glu Phe Leu Glu Gin Asp Thr Lys Glu Asn Gin Glu
50 55 60
Glu Leu He Lys Asn He Gin Thr Ser He Ala Lys Asn Gin Glu Leu 65 70 75 80
Glu Lys He Ser Phe Glu Lys Trp Glu Asn Lys He Gin Glu Arg Val
85 90 95
Leu Pro Lys Leu Lys Arg He Val Thr His Lys Leu Gin Glu Ser He
100 105 110
Thr Ser Ser He Asn Thr Gin Leu Glu Ser Phe Lys Lys Asp Glu Leu
115 120 125
Asp Leu Ser Ser Val Phe Glu He Gin Arg Lys Asn Thr Gin He Ala
130 135 140
Tyr Arg Leu Ala He Gly Gly Leu He Gly He He Ala Leu Ser Ser 145 150 155 160
Gin He
(2) INFORMATION FOR SEQ ID NO: 1329:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3222 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...3186 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1329: CCTTAAATCT AAGGGGTGTG C ATG CCA TAC AAT GAA ATC ACA AGG GTT CAA 51
Met Pro Tyr Asn Glu He Thr Arg Val Gin 1 5 10
ATC CCT GCC TTA ATG CAT TTA GCC AAG TTG GGC TAT GAT TTT ATC CCC 99 He Pro Ala Leu Met His Leu Ala Lys Leu Gly Tyr Asp Phe He Pro 15 20 25
ACT AAT TCT AAA GAA AAT AAG CCC AAC CTA GAC ACC GCC ACC AAC ATT 147 Thr Asn Ser Lys Glu Asn Lys Pro Asn Leu Asp Thr Ala Thr Asn He 30 35 40
TTA ACC AAT AGT TTC ACT AAA TCC TTT GAG CGG TTA AAC CCC ACT AAA 195 Leu Thr Asn Ser Phe Thr Lys Ser Phe Glu Arg Leu Asn Pro Thr Lys 45 50 55
AAC GCA CAA GAA ACG CTT GCT GAA ATG AAA AAA CGC TTG AAT TGC GAT 243 Asn Ala Gin Glu Thr Leu Ala Glu Met Lys Lys Arg Leu Asn Cys Asp 60 65 70
GAT TTG GGC AAA AGC TTT TAT GAA TAC TTG CTC AAA AGC GAG AAT CAA 291 Asp Leu Gly Lys Ser Phe Tyr Glu Tyr Leu Leu Lys Ser Glu Asn Gin 75 80 85 90
ATC ATA GAC TTT GAT AAC CCT AAC AAC AAT CTT TAT GAA ATG ATG ACT 339 He He Asp Phe Asp Asn Pro Asn Asn Asn Leu Tyr Glu Met Met Thr 95 100 105
GAA TTA CCC TAC AAA TCT TTT AGG CCT GAC ACC ACC CTT TTT ATC AAT 387 Glu Leu Pro Tyr Lys Ser Phe Arg Pro Asp Thr Thr Leu Phe He Asn 110 115 120
GGC TTG CCT TTG GTG AAT ATA GAA GTT AAA CAG CCT TAC GCC AAA AAA 435 Gly Leu Pro Leu Val Asn He Glu Val Lys Gin Pro Tyr Ala Lys Lys 125 130 135
GGC ATT AAA GAA GAA AGA GAT CGC CAC ATC AAA CGC TAT GAA AAC CCT 483 Gly He Lys Glu Glu Arg Asp Arg His He Lys Arg Tyr Glu Asn Pro 140 145 150
GAA AAC AAA GTT TTT TAT AAT CTC GCG CAA ATC TGG CTT TTT AGC GAT 531 Glu Asn Lys Val Phe Tyr Asn Leu Ala Gin He Trp Leu Phe Ser Asp 155 160 165 170
AAC TTA CCC TAT GAT GAA AAC AAA CCC GAT CAA GGC GCG TTT TAT AGC 579 Asn Leu Pro Tyr Asp Glu Asn Lys Pro Asp Gin Gly Ala Phe Tyr Ser 175 180 185
GCT TCT TAT TCG CCC ATT TTC CAA CGC TTT GTT GAA GCT CAT AGG CTA 627 Ala Ser Tyr Ser Pro He Phe Gin Arg Phe Val Glu Ala His Arg Leu 190 195 200
GAT ATT WCC CCC SSN CCC CSC CAA AAA AAT GAT CAA AAT CAT CAA AAC 675 Asp He Xaa Pro Xaa Pro Xaa Gin Lys Asn Asp Gin Asn His Gin Asn 205 210 215 GAT CAA AAT CAT CGA TCG CTT GAA GAA ATT CAA AAA AGC GTC TTA AAC 723 Asp Gin Asn His Arg Ser Leu Glu Glu He Gin Lys Ser Val Leu Asn 220 225 230
GAA TTT AAC CTT AAA GAC ACC GAC ACC CCA AAA AGC CCT AAA GAC ACC 771 Glu Phe Asn Leu Lys Asp Thr Asp Thr Pro Lys Ser Pro Lys Asp Thr 235 240 245 250
CCC ACA AAC TCC CTT TTA ACT TCG TTT TGC TCT CCA AAA AGG CTT TGC 819 Pro Thr Asn Ser Leu Leu Thr Ser Phe Cys Ser Pro Lys Arg Leu Cys 255 260 265
TTT ATC CTA AAA TAC GGC ATC AGT TTC TTA AAA GAA AAA TCA GAG TTT 867 Phe He Leu Lys Tyr Gly He Ser Phe Leu Lys Glu Lys Ser Glu Phe 270 275 280
AAA AAA CAC GTT TGG CGT TAT GCG CAG ATG TTT GCG AGC TTG AAC GTT 915 Lys Lys His Val Trp Arg Tyr Ala Gin Met Phe Ala Ser Leu Asn Val 285 290 295
TTA AAA GAA TTG CAA AAG CAT TAT GGA ACA AAC CAA AAC CTA AAA GAT 963 Leu Lys Glu Leu Gin Lys His Tyr Gly Thr Asn Gin Asn Leu Lys Asp 300 305 310
CCC CTA AAA GGC ATC ATC TGG CAC ACG CAA GGC AGC GGT AAA ACC GCC 1011 Pro Leu Lys Gly He He Trp His Thr Gin Gly Ser Gly Lys Thr Ala 315 320 325 330
TTA ACC TAC CAC TTA ACC AAA CTC ATC AGA GAC TTT TTT AGC CGA TCC 1059 Leu Thr Tyr His Leu Thr Lys Leu He Arg Asp Phe Phe Ser Arg Ser 335 340 345
AAC CTA AAC AAA AAG ACT AAA TTT TAT TTT ATT GTG GAC AGG TTG GAT 1107 Asn Leu Asn Lys Lys Thr Lys Phe Tyr Phe He Val Asp Arg Leu Asp 350 355 360
TTA TTG GAG CAA GCC AAA AAC GAG TTT TTA AAA AGA GGC CTT TGT GTG 1155 Leu Leu Glu Gin Ala Lys Asn Glu Phe Leu Lys Arg Gly Leu Cys Val 365 370 375
CAT GAG GCA GAA AAT AAA GAG GAT TTG AGC CAA AAA TTA AAA AGC TCT 1203 His Glu Ala Glu Asn Lys Glu Asp Leu Ser Gin Lys Leu Lys Ser Ser 380 385 390
AGC GTT TTT GAA GGC TCT CAA GGG AAT GAT GAA ATC ATC GTT GTG AAT 1251 Ser Val Phe Glu Gly Ser Gin Gly Asn Asp Glu He He Val Val Asn 395 400 405 410
ATC CAA AAA TTC AAA GCC CCC AAT GAA GAA AAA TCC CCC AAT GAA GAC 1299 He Gin Lys Phe Lys Ala Pro Asn Glu Glu Lys Ser Pro Asn Glu Asp 415 420 425
CCC TCT AAT AGC GCT CCT AAA GAA ATC ATT TCT AAA ACA GAA TTA CAA 1347 Pro Ser Asn Ser Ala Pro Lys Glu He He Ser Lys Thr Glu Leu Gin 430 435 440 GAA TCC ATT CAA AAC AGC CGC AAT TTA CAA AGG GTG TTT ATC ATA GAT 1395 Glu Ser He Gin Asn Ser Arg Asn Leu Gin Arg Val Phe He He Asp 445 450 455
GAA GCC CAC AGG AGC TAC GAT CCT AAA GGT TGC TTT TAC GCT AAT TTG 1443 Glu Ala His Arg Ser Tyr Asp Pro Lys Gly Cys Phe Tyr Ala Asn Leu 460 465 470
ATA GAA TGC GAC AAG ACA GCA ATT AAA ATC GCC CTC ACA GGC ACG CCC 1491 He Glu Cys Asp Lys Thr Ala He Lys He Ala Leu Thr Gly Thr Pro 475 480 485 490
CTA TTA GAA GAC AAC GCG CAA GAT AAA GCC ACT AAA AAC ACT TTT GGC 1539 Leu Leu Glu Asp Asn Ala Gin Asp Lys Ala Thr Lys Asn Thr Phe Gly 495 500 505
AAC TAC TTG CAC ACC TAT TCT TAT ACA GAA TCC ATT AAA GAC AGA CAC 1587 Asn Tyr Leu His Thr Tyr Ser Tyr Thr Glu Ser He Lys Asp Arg His 510 515 520
ACC CTA AAA CTC CAG TTA GAA AGC ATT GAA ACG AGC TAT AAA GAA AAA 1635 Thr Leu Lys Leu Gin Leu Glu Ser He Glu Thr Ser Tyr Lys Glu Lys 525 530 535
TTA CAA GAA ATC TAT CGC CTT TTA CAA GAA AGC ATC ACT ATT GAA GAC 1683 Leu Gin Glu He Tyr Arg Leu Leu Gin Glu Ser He Thr He Glu Asp 540 545 550
ACA GAA GTT AAA AAA GAA ACG ATT TTT AAC GAT GAA AAA TAC ATT AAC 1731 Thr Glu Val Lys Lys Glu Thr He Phe Asn Asp Glu Lys Tyr He Asn 555 560 565 570
GCC ATG CTC TAT TAT ATC ATT AGA GAT TTA TTG GAT TTT AGG CGT TTG 1779 Ala Met Leu Tyr Tyr He He Arg Asp Leu Leu Asp Phe Arg Arg Leu 575 580 585
AAT GAT AAT GAA CGC TTA AAG GCT ATG GTG GTT TGT TTT TCT AGC AAG 1827 Asn Asp Asn Glu Arg Leu Lys Ala Met Val Val Cys Phe Ser Ser Lys 590 595 600
CAA GCC AGA TTA GCT GAT TGT CTT TTT AAT GAA GTC CAA GAA AAA GTC 1875 Gin Ala Arg Leu Ala Asp Cys Leu Phe Asn Glu Val Gin Glu Lys Val 605 610 615
TTA CAA GAA AAC CCC AAC CTA AGG ATT TTA AAC AAA CTC AAA TCC AGC 1923 Leu Gin Glu Asn Pro Asn Leu Arg He Leu Asn Lys Leu Lys Ser Ser 620 625 630
CTG ATT TTG CAT GAT GAA CAA GAA GTC AAA GAA AAG GTT CAT TCT TTC 1971 Leu He Leu His Asp Glu Gin Glu Val Lys Glu Lys Val His Ser Phe 635 640 645 650
AAA CAT GAA GAT ACC GAT ATA GTC TTT GTG TTT AAC ATG CTT TTA ACC 2019 Lys His Glu Asp Thr Asp He Val Phe Val Phe Asn Met Leu Leu Thr 655 660 665 GGC TTT GAT TTA CCC AGT CTC AAA CGC CTT TAT ATC CAC AGA GAA TTA 2067 Gly Phe Asp Leu Pro Ser Leu Lys Arg Leu Tyr He His Arg Glu Leu 670 675 680
AAA GAT CAC AAT TTG CTC CAA GCC CTA GCC AGA GTG AAT CGC TCC TAT 2115 Lys Asp His Asn Leu Leu Gin Ala Leu Ala Arg Val Asn Arg Ser Tyr 685 690 695
AAA AAC ATG TCT TTT GGC TAC CTT ATA GAT TTT GTA GGC ATT CAA GAA 2163 Lys Asn Met Ser Phe Gly Tyr Leu He Asp Phe Val Gly He Gin Glu 700 705 710
AAT TTT GAC AAA ACG ACT GAT GAT TAC TTG AAA GAA TTA AAC CGA TTC 2211 Asn Phe Asp Lys Thr Thr Asp Asp Tyr Leu Lys Glu Leu Asn Arg Phe 715 720 725 730
AAT CAA AGC GGT GCC AAT AGC GAT TCT CAT ATC AAA GAC ATG TTT GCG 2259 Asn Gin Ser Gly Ala Asn Ser Asp Ser His He Lys Asp Met Phe Ala 735 740 745
GAT CGT AAG ACT TTA GAA GAA GAC ATT AAA AAC GCC TAT GAT GAT CTT 2307 Asp Arg Lys Thr Leu Glu Glu Asp He Lys Asn Ala Tyr Asp Asp Leu 750 755 760
TTT GAT TAC CCC ATT GAC GAT ATA GAG GGC ATG ACT AGC GCC ATT GTC 2355 Phe Asp Tyr Pro He Asp Asp He Glu Gly Met Thr Ser Ala He Val 765 770 775
AGC ATG AGC GCA ATG AAC GAG CTT GTA AAA GTC TCA CGC GCC ATT AAC 2403 Ser Met Ser Ala Met Asn Glu Leu Val Lys Val Ser Arg Ala He Asn 780 785 790
ACG CTC AAA GAG CGC TAC AAT TTA ATC CGC ACT TCT AAT GAT AAA AAA 2451 Thr Leu Lys Glu Arg Tyr Asn Leu He Arg Thr Ser Asn Asp Lys Lys 795 800 805 810
ATC CTT TCA CTA AAA GAA AAA ATT GAT ATT GAA AAG ATC CAT AAA ATC 2499 He Leu Ser Leu Lys Glu Lys He Asp He Glu Lys He His Lys He 815 820 825
TCT TCA ATG CTT CAT CAA AAA GCC AAA CAC CTC CAT GCG TTA AAG AAT 2547 Ser Ser Met Leu His Gin Lys Ala Lys His Leu His Ala Leu Lys Asn 830 835 840
ATC AAT GAG CCT AAA AAC CCA AAC GAT TTA ATG ATT TTA GAA GAC CTC 2595 He Asn Glu Pro Lys Asn Pro Asn Asp Leu Met He Leu Glu Asp Leu 845 850 855
ATC GCT CTT TTA GAC TTT AAA ATA GAG TTT AAA GAA CGC AAA GAA TTA 2643 He Ala Leu Leu Asp Phe Lys He Glu Phe Lys Glu Arg Lys Glu Leu 860 865 870
CGC TTT AAA GAA CAA GAA GAG ATT ACC ACC AAA CAA AAG CAA GCT AAA 2691 Arg Phe Lys Glu Gin Glu Glu He Thr Thr Lys Gin Lys Gin Ala Lys 875 880 885 890 GAG ATT TTA GAA AAA ATC CCG GAT CAA CAA GAT AAA GAA ATC CAA AAG 2739 Glu He Leu Glu Lys He Pro Asp Gin Gin Asp Lys Glu He Gin Lys 895 900 905
TTT TAC AAA GAC TTT TCA AAA TTA CTC CAA ACG CCC ACA ACA AGC CAG 2787 Phe Tyr Lys Asp Phe Ser Lys Leu Leu Gin Thr Pro Thr Thr Ser Gin 910 915 920
AAT TTT GAG GAA ATT TCT CAT TCC TAT GAT GCG ATC ATT TCA CAA CTC 2835 Asn Phe Glu Glu He Ser His Ser Tyr Asp Ala He He Ser Gin Leu 925 930 935
AAA CAA CAC AAA GAA CAA ACC ACC CAC TTA TTA AAC AAA TAC GAT AAT 2883 Lys Gin His Lys Glu Gin Thr Thr His Leu Leu Asn Lys Tyr Asp Asn 940 945 950
GAT TTG TCT TAT GCG ATC ACG AAC AAA CGC CTT CAT AAG CAC CTT ATG 2931 Asp Leu Ser Tyr Ala He Thr Asn Lys Arg Leu His Lys His Leu Met 955 960 965 970
GAA CAA AAC ATT TCT AAC TCA GCG GGA ATT TTC ACG CTT TTA AGC GCC 2979 Glu Gin Asn He Ser Asn Ser Ala Gly He Phe Thr Leu Leu Ser Ala 975 980 985
TTA AAA AAA GCT ATT GAT GCG CGT ATT TTT AAG CGT CAA GAA ATC TTA 3027 Leu Lys Lys Ala He Asp Ala Arg He Phe Lys Arg Gin Glu He Leu 990 995 1000
AAC GAA GAG TAT TAC CTA AAA AAT GCC ATA AAA GCA GAA TTA AAT AAC 3075 Asn Glu Glu Tyr Tyr Leu Lys Asn Ala He Lys Ala Glu Leu Asn Asn 1005 1010 1015
GCT TTC AAA AAA GAC CCC TCC TTA AAA GAT TTA GAA AAA GAA AAA GAA 3123 Ala Phe Lys Lys Asp Pro Ser Leu Lys Asp Leu Glu Lys Glu Lys Glu 1020 1025 1030
CTT ATC ATT CAA ACC CTT TTT AAC GAA CTC ACA CAA AAC CAC CAT CAA 3171 Leu He He Gin Thr Leu Phe Asn Glu Leu Thr Gin Asn His His Gin 1035 1040 1045 1050
GGA AAT CCG CAT GCC TAATAACGCT TTATTGCAAA TCAAACAAGA CACCCT 3222
Gly Asn Pro His Ala 1055
(2) INFORMATION FOR SEQ ID NO: 1330:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1055 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1330:
Met Pro Tyr Asn Glu He Thr Arg Val Gin He Pro Ala Leu Met His
1 5 10 15
Leu Ala Lys Leu Gly Tyr Asp Phe He Pro Thr Asn Ser Lys Glu Asn
20 25 30
Lys Pro Asn Leu Asp Thr Ala Thr Asn He Leu Thr Asn Ser Phe Thr
35 40 45
Lys Ser Phe Glu Arg Leu Asn Pro Thr Lys Asn Ala Gin Glu Thr Leu
50 55 60
Ala Glu Met Lys Lys Arg Leu Asn Cys Asp Asp Leu Gly Lys Ser Phe 65 70 75 80
Tyr Glu Tyr Leu Leu Lys Ser Glu Asn Gin He He Asp Phe Asp Asn
85 90 95
Pro Asn Asn Asn Leu Tyr Glu Met Met Thr Glu Leu Pro Tyr Lys Ser
100 105 110
Phe Arg Pro Asp Thr Thr Leu Phe He Asn Gly Leu Pro Leu Val Asn
115 120 125
He Glu Val Lys Gin Pro Tyr Ala Lys Lys Gly He Lys Glu Glu Arg
130 135 140
Asp Arg His He Lys Arg Tyr Glu Asn Pro Glu Asn Lys Val Phe Tyr 145 150 155 160
Asn Leu Ala Gin He Trp Leu Phe Ser Asp Asn Leu Pro Tyr Asp Glu
165 170 175
Asn Lys Pro Asp Gin Gly Ala Phe Tyr Ser Ala Ser Tyr Ser Pro He
180 185 190
Phe Gin Arg Phe Val Glu Ala His Arg Leu Asp He Xaa Pro Xaa Pro
195 200 205
Xaa Gin Lys Asn Asp Gin Asn His Gin Asn Asp Gin Asn His Arg Ser
210 215 220
Leu Glu Glu He Gin Lys Ser Val Leu Asn Glu Phe Asn Leu Lys Asp 225 230 235 240
Thr Asp Thr Pro Lys Ser Pro Lys Asp Thr Pro Thr Asn Ser Leu Leu
245 250 255
Thr Ser Phe Cys Ser Pro Lys Arg Leu Cys Phe He Leu Lys Tyr Gly
260 265 270
He Ser Phe Leu Lys Glu Lys Ser Glu Phe Lys Lys His Val Trp Arg
275 280 285
Tyr Ala Gin Met Phe Ala Ser Leu Asn Val Leu Lys Glu Leu Gin Lys
290 295 300
His Tyr Gly Thr Asn Gin Asn Leu Lys Asp Pro Leu Lys Gly He He 305 310 315 320
Trp His Thr Gin Gly Ser Gly Lys Thr Ala Leu Thr Tyr His Leu Thr
325 330 335
Lys Leu He Arg Asp Phe Phe Ser Arg Ser Asn Leu Asn Lys Lys Thr
340 345 350
Lys Phe Tyr Phe He Val Asp Arg Leu Asp Leu Leu Glu Gin Ala Lys
355 360 365
Asn Glu Phe Leu Lys Arg Gly Leu Cys Val His Glu Ala Glu Asn Lys
370 375 380
Glu Asp Leu Ser Gin Lys Leu Lys Ser Ser Ser Val Phe Glu Gly Ser 385 390 395 400
Gin Gly Asn Asp Glu He He Val Val Asn He Gin Lys Phe Lys Ala
405 410 415
Pro Asn Glu Glu Lys Ser Pro Asn Glu Asp Pro Ser Asn Ser Ala Pro 420 425 430
Lys Glu He He Ser Lys Thr Glu Leu Gin Glu Ser He Gin Asn Ser
435 440 445
Arg Asn Leu Gin Arg Val Phe He He Asp Glu Ala His Arg Ser Tyr
450 455 460
Asp Pro Lys Gly Cys Phe Tyr Ala Asn Leu He Glu Cys Asp Lys Thr 465 470 475 480
Ala He Lys He Ala Leu Thr Gly Thr Pro Leu Leu Glu Asp Asn Ala
485 490 495
Gin Asp Lys Ala Thr Lys Asn Thr Phe Gly Asn Tyr Leu His Thr Tyr
500 505 510
Ser Tyr Thr Glu Ser He Lys Asp Arg His Thr Leu Lys Leu Gin Leu
515 520 525
Glu Ser He Glu Thr Ser Tyr Lys Glu Lys Leu Gin Glu He Tyr Arg
530 535 540
Leu Leu Gin Glu Ser He Thr He Glu Asp Thr Glu Val Lys Lys Glu 545 550 555 560
Thr He Phe Asn Asp Glu Lys Tyr He Asn Ala Met Leu Tyr Tyr He
565 570 575
He Arg Asp Leu Leu Asp Phe Arg Arg Leu Asn Asp Asn Glu Arg Leu
580 585 590
Lys Ala Met Val Val Cys Phe Ser Ser Lys Gin Ala Arg Leu Ala Asp
595 600 605
Cys Leu Phe Asn Glu Val Gin Glu Lys Val Leu Gin Glu Asn Pro Asn
610 615 620
Leu Arg He Leu Asn Lys Leu Lys Ser Ser Leu He Leu His Asp Glu 625 630 635 640
Gin Glu Val Lys Glu Lys Val His Ser Phe Lys His Glu Asp Thr Asp
645 650 655
He Val Phe Val Phe Asn Met Leu Leu Thr Gly Phe Asp Leu Pro Ser
660 665 670
Leu Lys Arg Leu Tyr He His Arg Glu Leu Lys Asp His Asn Leu Leu
675 680 685
Gin Ala Leu Ala Arg Val Asn Arg Ser Tyr Lys Asn Met Ser Phe Gly
690 695 700
Tyr Leu He Asp Phe Val Gly He Gin Glu Asn Phe Asp Lys Thr Thr 705 710 715 720
Asp Asp Tyr Leu Lys Glu Leu Asn Arg Phe Asn Gin Ser Gly Ala Asn
725 730 735
Ser Asp Ser His He Lys Asp Met Phe Ala Asp Arg Lys Thr Leu Glu
740 745 750
Glu Asp He Lys Asn Ala Tyr Asp Asp Leu Phe Asp Tyr Pro He Asp
755 760 765
Asp He Glu Gly Met Thr Ser Ala He Val Ser Met Ser Ala Met Asn
770 775 780
Glu Leu Val Lys Val Ser Arg Ala He Asn Thr Leu Lys Glu Arg Tyr 785 790 795 800
Asn Leu He Arg Thr Ser Asn Asp Lys Lys He Leu Ser Leu Lys Glu
805 810 815
Lys He Asp He Glu Lys He His Lys He Ser Ser Met Leu His Gin
820 825 830
Lys Ala Lys His Leu His Ala Leu Lys Asn He Asn Glu Pro Lys Asn
835 840 845
Pro Asn Asp Leu Met He Leu Glu Asp Leu He Ala Leu Leu Asp Phe 850 855 860 Lys He Glu Phe Lys Glu Arg Lys Glu Leu Arg Phe Lys Glu Gin Glu 865 870 875 880
Glu He Thr Thr Lys Gin Lys Gin Ala Lys Glu He Leu Glu Lys He
885 890 895
Pro Asp Gin Gin Asp Lys Glu He Gin Lys Phe Tyr Lys Asp Phe Ser
900 905 910
Lys Leu Leu Gin Thr Pro Thr Thr Ser Gin Asn Phe Glu Glu He Ser
915 920 925
His Ser Tyr Asp Ala He He Ser Gin Leu Lys Gin His Lys Glu Gin
930 935 940
Thr Thr His Leu Leu Asn Lys Tyr Asp Asn Asp Leu Ser Tyr Ala He 945 950 955 960
Thr Asn Lys Arg Leu His Lys His Leu Met Glu Gin Asn He Ser Asn
965 970 975
Ser Ala Gly He Phe Thr Leu Leu Ser Ala Leu Lys Lys Ala He Asp
980 985 990
Ala Arg He Phe Lys Arg Gin Glu He Leu Asn Glu Glu Tyr Tyr Leu
995 1000 1005
Lys Asn Ala He Lys Ala Glu Leu Asn Asn Ala Phe Lys Lys Asp Pro
1010 1015 1020
Ser Leu Lys Asp Leu Glu Lys Glu Lys Glu Leu He He Gin Thr Leu 025 1030 1035 1040
Phe Asn Glu Leu Thr Gin Asn His His Gin Gly Asn Pro His Ala 1045 1050 1055
(2) INFORMATION FOR SEQ ID NO: 1331:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 574 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 26...511 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1331:
TAGAAGAATT TGAAAGGTTG CTCGC ATG CAA AGA GAA TTA AGG CTT TTA AAT 52
Met Gin Arg Glu Leu Arg Leu Leu Asn 1 5
AAC AAG CAT TGC ATG GAA TAC TTG CAA TTT CTG TCC AAA AAC CAT TTG 100 Asn Lys His Cys Met Glu Tyr Leu Gin Phe Leu Ser Lys Asn His Leu 10 15 20 25
AGT TTT AAC CTT TTG TGC GAA AGA GAT GCG ATT GAT TTT TCC CCC AAG 148 Ser Phe Asn Leu Leu Cys Glu Arg Asp Ala He Asp Phe Ser Pro Lys 30 35 40 CTC CCT AAA GAA ATT CAT GAA AAA TTC GGC GCG TTA GTG CTA TTT GTT 196 Leu Pro Lys Glu He His Glu Lys Phe Gly Ala Leu Val Leu Phe Val 45 50 55
TTA GCC GGA TAC ACC TTA GAA AGC TTG ATA ATT GAT ACA AAA AGC GTG 244 Leu Ala Gly Tyr Thr Leu Glu Ser Leu He He Asp Thr Lys Ser Val 60 65 70
CAA TTT GAA GCC GGG TTT GGC CCT AAT AAC ATT GGC AGT GTG GTT CAA 292 Gin Phe Glu Ala Gly Phe Gly Pro Asn Asn He Gly Ser Val Val Gin 75 80 85
GTA AAA CTT CCT GGC ATC ATT CAA ATC CTT ATC AAA GAA AAA AAT GAA 340 Val Lys Leu Pro Gly He He Gin He Leu He Lys Glu Lys Asn Glu 90 95 100 105
AAT GCC GTT TTA TTC AAT CGT TGC GAT TCG CTT GAA TTG TTT CAA AAA 388 Asn Ala Val Leu Phe Asn Arg Cys Asp Ser Leu Glu Leu Phe Gin Lys 110 115 120
GAA GAT TCA ATC GCG CAA GAG CCA AAA AAA GAC GAG CGG GAG TCT AAA 436 Glu Asp Ser He Ala Gin Glu Pro Lys Lys Asp Glu Arg Glu Ser Lys 125 130 135
GAA TGG CTG GAT TCT AAA GAG GCT CTT TTT TCC AAT TCC AAA AAC CGC 484 Glu Trp Leu Asp Ser Lys Glu Ala Leu Phe Ser Asn Ser Lys Asn Arg 140 145 150
GCG ATT TTA GAA AAT CTG CAC AAA AGC TAAAGGAATC ATTGATGAGC GTTTTGA 538 Ala He Leu Glu Asn Leu His Lys Ser 155 160
AATTGCATGT AAAAGTCTTT CGTTTTGAAA CCAATA 574
(2) INFORMATION FOR SEQ ID NO: 1332:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 162 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1332:
Met Gin Arg Glu Leu Arg Leu Leu Asn Asn Lys His Cys Met Glu Tyr
1 5 10 15
Leu Gin Phe Leu Ser Lys Asn His Leu Ser Phe Asn Leu Leu Cys Glu
20 25 30
Arg Asp Ala He Asp Phe Ser Pro Lys Leu Pro Lys Glu He His Glu
35 40 45
Lys Phe Gly Ala Leu Val Leu Phe Val Leu Ala Gly Tyr Thr Leu Glu
50 55 60
Ser Leu He He Asp Thr Lys Ser Val Gin Phe Glu Ala Gly Phe Gly 65 70 75 80
Pro Asn Asn He Gly Ser Val Val Gin Val Lys Leu Pro Gly He He
85 90 95
Gin He Leu He Lys Glu Lys Asn Glu Asn Ala Val Leu Phe Asn Arg
100 105 110
Cys Asp Ser Leu Glu Leu Phe Gin Lys Glu Asp Ser He Ala Gin Glu
115 120 125
Pro Lys Lys Asp Glu Arg Glu Ser Lys Glu Trp Leu Asp Ser Lys Glu
130 135 140
Ala Leu Phe Ser Asn Ser Lys Asn Arg Ala He Leu Glu Asn Leu His 145 150 155 160
Lys Ser
(2) INFORMATION FOR SEQ ID NO: 1333
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1697 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 14...1648 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1333:
TTTAGGGGGG TAA ATG CCT TCA AAC GCT CTT TTG ATT GAA GAA ATC ACT 49 Met Pro Ser Asn Ala Leu Leu He Glu Glu He Thr 1 5 10
CAT TTA ATC AAT GTT TCT CAT AGT AGC GTG CAT AAT TGG ATC AAA ACC 97 His Leu He Asn Val Ser His Ser Ser Val His Asn Trp He Lys Thr 15 20 25
AAT CTT TTA GAG AAA CTA GAA ATT GAT CAT AAA ATT TAT GTG AAA ACG 145 Asn Leu Leu Glu Lys Leu Glu He Asp His Lys He Tyr Val Lys Thr 30 35 40
AGT TCT TTT TTA GAT TTT TGC CGC AAC CAT TTA GGG AAA AAC AAG CTT 193 Ser Ser Phe Leu Asp Phe Cys Arg Asn His Leu Gly Lys Asn Lys Leu 45 50 55 60
AAC AAA TAC GCT AAC AAA TCC TTA AAA GGC GTG CAT AAC CAT CAA GAA 241 Asn Lys Tyr Ala Asn Lys Ser Leu Lys Gly Val His Asn His Gin Glu 65 70 75
TTG ATT TTA AAA TAC CTA GAA ATA TTA GAA AAT AGC TCT GAT CTA GAA 289 Leu He Leu Lys Tyr Leu Glu He Leu Glu Asn Ser Ser Asp Leu Glu 80 85 90
AAG TTG GGT TCT TAT TAT GAA GAA GAG CTT TCT AAC GCC ACC AGA AAT 337 Lys Leu Gly Ser Tyr Tyr Glu Glu Glu Leu Ser Asn Ala Thr Arg Asn 95 100 105
TTA GAA GGC ATT TAC TAC ACT CCT AAC AGG ATA GTA GAA CAA CTT TTC 385 Leu Glu Gly He Tyr Tyr Thr Pro Asn Arg He Val Glu Gin Leu Phe 110 115 120
ACC CTC CCT AAA GAT TTT GAT GTC TCT CAA GCG ATT TTT TGC GAT CCG 433 Thr Leu Pro Lys Asp Phe Asp Val Ser Gin Ala He Phe Cys Asp Pro 125 130 135 140
GCT GTG GGG AGT GGG AAT TTT ATC ATG CAT GCT TTA AAA CTG GGT TTT 481 Ala Val Gly Ser Gly Asn Phe He Met His Ala Leu Lys Leu Gly Phe 145 150 155
AAG GTT GAA AAC ATT TAT GGC TAT GAT ACG GAC GCT TTT GCT GTC GCT 529 Lys Val Glu Asn He Tyr Gly Tyr Asp Thr Asp Ala Phe Ala Val Ala 160 165 170
TTG ACT AAA AAG CGT ATT AAA GAG CGT TAT CAT TTA GAT TGC CTT AAT 577 Leu Thr Lys Lys Arg He Lys Glu Arg Tyr His Leu Asp Cys Leu Asn 175 180 185
ATT GTG CAA AAA GAT TTT TTA AAT TTA AAA CAC ACC CCG CAA TTT GAT 625 He Val Gin Lys Asp Phe Leu Asn Leu Lys His Thr Pro Gin Phe Asp 190 195 200
TGC ATT TTC ACT AAC CCG CCA TGG GGC AAG AAA TAC AAT CAA AAC CAA 673 Cys He Phe Thr Asn Pro Pro Trp Gly Lys Lys Tyr Asn Gin Asn Gin 205 210 215 220
AAA GAA AAT TTT AAA CAG CAA TTC AAC CTC TCT CAA AGC CTA GAT AGC 721 Lys Glu Asn Phe Lys Gin Gin Phe Asn Leu Ser Gin Ser Leu Asp Ser 225 230 235
GCG TCG CTC TTT TTT ATA GCG AGT TTG AAT TGT TTA AAA GAA AAC GCT 769 Ala Ser Leu Phe Phe He Ala Ser Leu Asn Cys Leu Lys Glu Asn Ala 240 245 250
CAT TTG GGG TTA TTA TTA CCC GAA AGT TGT TTG AAT ATT GAT GCG TTT 817 His Leu Gly Leu Leu Leu Pro Glu Ser Cys Leu Asn He Asp Ala Phe 255 260 265
AAA AAA ATG CGA GAA ATG GCT TTA AAG TTT CAC ATT AGA AGC CTG ATT 865 Lys Lys Met Arg Glu Met Ala Leu Lys Phe His He Arg Ser Leu He 270 275 280
GAT TTT GAC AAA CCT TTT AAA AAT CTA ATG ACT AAG GCT GTG GGT TTG 913 Asp Phe Asp Lys Pro Phe Lys Asn Leu Met Thr Lys Ala Val Gly Leu 285 290 295 300
GCG CTT AAA AAA ACC CCT AAT AAG GAT CAA AAA ATC TCA TGC TTT TAT 961 Ala Leu Lys Lys Thr Pro Asn Lys Asp Gin Lys He Ser Cys Phe Tyr 305 310 315
CAA AAT AGC AAG TTC AAA CGC TCG CCC TCT TCT TTT TTT AAC AAC CCT 1009 Gin Asn Ser Lys Phe Lys Arg Ser Pro Ser Ser Phe Phe Asn Asn Pro 320 325 330
AAA AAG ATT TTT AAT ATC CAT TGC TCT AGC AAA GAA AAT AAA ATT TTA 1057 Lys Lys He Phe Asn He His Cys Ser Ser Lys Glu Asn Lys He Leu 335 340 345
GAC CAC CTT TTT TCC CTC CCT CAT ATG ACT TTA AAA AAT AAC GCT CAT 1105 Asp His Leu Phe Ser Leu Pro His Met Thr Leu Lys Asn Asn Ala His 350 355 360
TTT GCT TTA GGG ATT GTT ACA GGC AAC AAT AAA GAA AAA TTA CAC CCC 1153 Phe Ala Leu Gly He Val Thr Gly Asn Asn Lys Glu Lys Leu His Pro 365 370 375 380
AAA CAA GAA AAA AAT ACC ATT CCC ATT TTT AGG GGT TCA GAT ATT TTA 1201 Lys Gin Glu Lys Asn Thr He Pro He Phe Arg Gly Ser Asp He Leu 385 390 395
AAA GAC GGA TTA AAA GCC CCT AGC CAA TTC ATT AAC GCT GGT TTA AAA 1249 Lys Asp Gly Leu Lys Ala Pro Ser Gin Phe He Asn Ala Gly Leu Lys 400 405 410
GAC TGC CAG CAA GTC GCC CCC TTA AGC CTT TAT CAA GCT AGA GAA AAA 1297 Asp Cys Gin Gin Val Ala Pro Leu Ser Leu Tyr Gin Ala Arg Glu Lys 415 420 425
ATC GTG TAT AAA TTC ATT TCT TCA AAG CTT GTC TTT TTT TAT GAC AAT 1345 He Val Tyr Lys Phe He Ser Ser Lys Leu Val Phe Phe Tyr Asp Asn 430 435 440
AAG CAA CGC CTT TTT TTA AAC AGC GCG AAC ATG TTT GTT TTA AAA GAA 1393 Lys Gin Arg Leu Phe Leu Asn Ser Ala Asn Met Phe Val Leu Lys Glu 445 450 455 460
AAT TTC CCT ATC AAC GCT CAT GCA TTA AAA GAA TTG TTA AAC AGC GAT 1441 Asn Phe Pro He Asn Ala His Ala Leu Lys Glu Leu Leu Asn Ser Asp 465 470 475
TTA ATG CAA TTC ATT TTT GAA TCG CTT TTT AAA ACG CAT AAA ATC TTA 1489 Leu Met Gin Phe He Phe Glu Ser Leu Phe Lys Thr His Lys He Leu 480 485 490
AGA AAA GAT TTG GAA TGT TTG CCC CTA TTT GTG CAA TTT ATT AAC GAT 1537 Arg Lys Asp Leu Glu Cys Leu Pro Leu Phe Val Gin Phe He Asn Asp 495 500 505
AAT TTT GAT GAA AAA TTT TAT TTA AAA AAT TTA GGG ATA GAA AAA AAA 1585 Asn Phe Asp Glu Lys Phe Tyr Leu Lys Asn Leu Gly He Glu Lys Lys 510 515 520 GAC CCT AAA CAT TTC ACC ATC AGG AAA AAT CAT GCA TGT TGC TTG TCT 1633 Asp Pro Lys His Phe Thr He Arg Lys Asn His Ala Cys Cys Leu Ser 525 530 535 540
TTT GGC TTT AGG GGA TAATCTCATC ACGCTTAGCC TTTTAAAAGA AATCGCTTTC A 1689 Phe Gly Phe Arg Gly 545
AACAGCAA 1697
(2) INFORMATION FOR SEQ ID NO: 1334:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 545 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1334:
Met Pro Ser Asn Ala Leu Leu He Glu Glu He Thr His Leu He Asn
1 5 10 15
Val Ser His Ser Ser Val His Asn Trp He Lys Thr Asn Leu Leu Glu
20 25 30
Lys Leu Glu He Asp His Lys He Tyr Val Lys Thr Ser Ser Phe Leu
35 40 45
Asp Phe Cys Arg Asn His Leu Gly Lys Asn Lys Leu Asn Lys Tyr Ala
50 55 60
Asn Lys Ser Leu Lys Gly Val His Asn His Gin Glu Leu He Leu Lys 65 70 75 80
Tyr Leu Glu He Leu Glu Asn Ser Ser Asp Leu Glu Lys Leu Gly Ser
85 90 95
Tyr Tyr Glu Glu Glu Leu Ser Asn Ala Thr Arg Asn Leu Glu Gly He
100 105 110
Tyr Tyr Thr Pro Asn Arg He Val Glu Gin Leu Phe Thr Leu Pro Lys
115 120 125
Asp Phe Asp Val Ser Gin Ala He Phe Cys Asp Pro Ala Val Gly Ser
130 135 140
Gly Asn Phe He Met His Ala Leu Lys Leu Gly Phe Lys Val Glu Asn 145 150 155 160
He Tyr Gly Tyr Asp Thr Asp Ala Phe Ala Val Ala Leu Thr Lys Lys
165 170 175
Arg He Lys Glu Arg Tyr His Leu Asp Cys Leu Asn He Val Gin Lys
180 185 190
Asp Phe Leu Asn Leu Lys His Thr Pro Gin Phe Asp Cys He Phe Thr
195 200 205
Asn Pro Pro Trp Gly Lys Lys Tyr Asn Gin Asn Gin Lys Glu Asn Phe
210 215 220
Lys Gin Gin Phe Asn Leu Ser Gin Ser Leu Asp Ser Ala Ser Leu Phe 225 230 235 240
Phe He Ala Ser Leu Asn Cys Leu Lys Glu Asn Ala His Leu Gly Leu
245 250 255
Leu Leu Pro Glu Ser Cys Leu Asn He Asp Ala Phe Lys Lys Met Arg 260 265 270
Glu Met Ala Leu Lys Phe His He Arg Ser Leu He Asp Phe Asp Lys
275 280 285
Pro Phe Lys Asn Leu Met Thr Lys Ala Val Gly Leu Ala Leu Lys Lys
290 295 300
Thr Pro Asn Lys Asp Gin Lys He Ser Cys Phe Tyr Gin Asn Ser Lys 305 310 315 320
Phe Lys Arg Ser Pro Ser Ser Phe Phe Asn Asn Pro Lys Lys He Phe
325 330 335
Asn He His Cys Ser Ser Lys Glu Asn Lys He Leu Asp His Leu Phe
340 345 350
Ser Leu Pro His Met Thr Leu Lys Asn Asn Ala His Phe Ala Leu Gly
355 360 365
He Val Thr Gly Asn Asn Lys Glu Lys Leu His Pro Lys Gin Glu Lys
370 375 380
Asn Thr He Pro He Phe Arg Gly Ser Asp He Leu Lys Asp Gly Leu 385 390 395 400
Lys Ala Pro Ser Gin Phe He Asn Ala Gly Leu Lys Asp Cys Gin Gin
405 410 415
Val Ala Pro Leu Ser Leu Tyr Gin Ala Arg Glu Lys He Val Tyr Lys
420 425 430
Phe He Ser Ser Lys Leu Val Phe Phe Tyr Asp Asn Lys Gin Arg Leu
435 440 445
Phe Leu Asn Ser Ala Asn Met Phe Val Leu Lys Glu Asn Phe Pro He
450 455 460
Asn Ala His Ala Leu Lys Glu Leu Leu Asn Ser Asp Leu Met Gin Phe 465 470 475 480
He Phe Glu Ser Leu Phe Lys Thr His Lys He Leu Arg Lys Asp Leu
485 490 495
Glu Cys Leu Pro Leu Phe Val Gin Phe He Asn Asp Asn Phe Asp Glu
500 505 510
Lys Phe Tyr Leu Lys Asn Leu Gly He Glu Lys Lys Asp Pro Lys His
515 520 525
Phe Thr He Arg Lys Asn His Ala Cys Cys Leu Ser Phe Gly Phe Arg
530 535 540
Gly 545
(2) INFORMATION FOR SEQ ID NO: 1335:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1884 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 46...1842 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1335:
TATTGTGTTA TACTTCTAAT TTCAATTTTG CTTGTTAGGA CATTT ATG AAA AAT ATT 57
Met Lys Asn He
1
AGA AAT ATC GCT GTA ATC GCG CAT GTT GAT CAT GGG AAA ACC ACT CTA 105 Arg Asn He Ala Val He Ala His Val Asp His Gly Lys Thr Thr Leu 5 10 15 20
GTA GAT GGC TTA CTT TCT CAA TCT GGC ACA TTT AGT GAG AGG GAA AAA 153 Val Asp Gly Leu Leu Ser Gin Ser Gly Thr Phe Ser Glu Arg Glu Lys 25 30 35
GTG GAT GAA AGG GTG ATG GAT AGC AAT GAT TTG GAA AGA GAA AGA GGG 201 Val Asp Glu Arg Val Met Asp Ser Asn Asp Leu Glu Arg Glu Arg Gly 40 45 50
ATT ACT ATC CTG TCT AAA AAC ACC GCT ATT TAT TAC AAA GAC ACT AAA 249 He Thr He Leu Ser Lys Asn Thr Ala He Tyr Tyr Lys Asp Thr Lys 55 60 65
ATC AAT ATC ATT GAC ACT CCC GGG CAT GCT GAT TTT GGG GGC GAA GTG 297 He Asn He He Asp Thr Pro Gly His Ala Asp Phe Gly Gly Glu Val 70 75 80
GAG CGC GTT TTA AAA ATG GTG GAT GGG GTG TTG CTT TTA GTG GAC GCT 345 Glu Arg Val Leu Lys Met Val Asp Gly Val Leu Leu Leu Val Asp Ala 85 90 95 100
CAA GAA GGG GTC ATG CCT CAA ACT AAA TTC GTG GTT AAA AAG GCT TTG 393 Gin Glu Gly Val Met Pro Gin Thr Lys Phe Val Val Lys Lys Ala Leu 105 110 115
AGT TTT GGG ATT TGC CCT ATT GTG GTG GTG AAT AAA ATT GAT AAG CCT 441 Ser Phe Gly He Cys Pro He Val Val Val Asn Lys He Asp Lys Pro 120 125 130
GCC GCT GAA CCG GAC AGA GTG GTG GAT GAA GTT TTT GAC TTG TTC GTA 489 Ala Ala Glu Pro Asp Arg Val Val Asp Glu Val Phe Asp Leu Phe Val 135 140 145
GCC ATG GGG GCT AGC GAT AAG CAA TTG GAT TTC CCT GTG GTG TAT GCC 537 Ala Met Gly Ala Ser Asp Lys Gin Leu Asp Phe Pro Val Val Tyr Ala 150 155 160
GCC GCA CGA GAT GGC TAT GCG ATG AAA AGT TTA GAC GAT GAA AAG AAA 585 Ala Ala Arg Asp Gly Tyr Ala Met Lys Ser Leu Asp Asp Glu Lys Lys 165 170 175 180
AAT TTA GAG CCT TTG TTT GAA ACG ATT TTA GAG CAT GTG CCA AGC CCT 633 Asn Leu Glu Pro Leu Phe Glu Thr He Leu Glu His Val Pro Ser Pro 185 190 195
AGC GGG AGC GTT GAT GAG CCT TTG CAA ATG CAA ATT TTC ACG CTT GAT 681 Ser Gly Ser Val Asp Glu Pro Leu Gin Met Gin He Phe Thr Leu Asp 200 205 210
TAT GAC AAT TAT GTG GGC AAA ATC GGT ATC GCT AGG GTG TTT AAT GGC 729 Tyr Asp Asn Tyr Val Gly Lys He Gly He Ala Arg Val Phe Asn Gly 215 220 225
TCG GTT AAA AAG AAT GAA AGC GTG CTG TTG ATG AAA AGC GAT GGG AGT 777 Ser Val Lys Lys Asn Glu Ser Val Leu Leu Met Lys Ser Asp Gly Ser 230 235 240
AAA GAA AAT GGC CGT ATC ACT AAG CTT ATA GGT TTT TTA GGG CTG GCT 825 Lys Glu Asn Gly Arg He Thr Lys Leu He Gly Phe Leu Gly Leu Ala 245 250 255 260
AGG ACT GAG ATT GAA AAC GCT TAT GCG GGC GAT ATT GTA GCG ATT GCC 873 Arg Thr Glu He Glu Asn Ala Tyr Ala Gly Asp He Val Ala He Ala 265 270 275
GGG TTT AAT GCA ATG GAT GTG GGC GAT AGC GTC GTT GAT CCT GCT AAC 921 Gly Phe Asn Ala Met Asp Val Gly Asp Ser Val Val Asp Pro Ala Asn 280 285 290
CCC ATG CCT TTA GAT CCC ATG CAT TTA GAA GAG CCT ACG ATG AGC GTG 969 Pro Met Pro Leu Asp Pro Met His Leu Glu Glu Pro Thr Met Ser Val 295 300 305
TAT TTT GCT GTC AAT GAT TCA CCC TTA GCC GGG TTA GAA GGA AAG CAT 1017 Tyr Phe Ala Val Asn Asp Ser Pro Leu Ala Gly Leu Glu Gly Lys His 310 315 320
GTT ACT GCT AAT AAA TTG AAA GAC AGG CTC TTA AAA GAA ATG CAA ACC 1065 Val Thr Ala Asn Lys Leu Lys Asp Arg Leu Leu Lys Glu Met Gin Thr 325 330 335 340
AAT ATC GCT ATG AAA TGC GAA GAA ATG GGC GAG GGC AAG TTT AAA GTG 1113 Asn He Ala Met Lys Cys Glu Glu Met Gly Glu Gly Lys Phe Lys Val 345 350 355
AGT GGG CGT GGG GAA TTG CAA ATC ACT ATT TTA GCT GAA AAC TTG CGC 1161 Ser Gly Arg Gly Glu Leu Gin He Thr He Leu Ala Glu Asn Leu Arg 360 365 370
CGT GAA GGG TTT GAA TTT AGC ATT TCA CGC CCT GAA GTC ATC ATT AAA 1209 Arg Glu Gly Phe Glu Phe Ser He Ser Arg Pro Glu Val He He Lys 375 380 385
GAA GAA AAT GGC GTT AAA TGC GAG CCT TTT GAG CAT TTA GTG ATT GAC 1257 Glu Glu Asn Gly Val Lys Cys Glu Pro Phe Glu His Leu Val He Asp 390 395 400
ACG CCC CAA GAT TTT AGT GGG GCT ATC ATT GAG AGA TTG GGC AAA AGA 1305 Thr Pro Gin Asp Phe Ser Gly Ala He He Glu Arg Leu Gly Lys Arg 405 410 415 420 AAA GCT GAG ATG AAA GCG ATG AAT CCC ATG AGT GAT GGC TAT ACA AGA 1353 Lys Ala Glu Met Lys Ala Met Asn Pro Met Ser Asp Gly Tyr Thr Arg 425 430 435
TTA GAA TTT GAA ATT CCT GCA AGA GGG CTT ATC GGT TAT AGG AGC GAG 1401 Leu Glu Phe Glu He Pro Ala Arg Gly Leu He Gly Tyr Arg Ser Glu 440 445 450
TTT TTA ACC GAC ACC AAG GGC GAA GGC GTG ATG AAT CAT AGC TTT TTA 1449 Phe Leu Thr Asp Thr Lys Gly Glu Gly Val Met Asn His Ser Phe Leu 455 460 465
GAA TTC CGC CCT TTC AGC GGG AGC GTG GAA TCG CGC AAA AAT GGG GCG 1497 Glu Phe Arg Pro Phe Ser Gly Ser Val Glu Ser Arg Lys Asn Gly Ala 470 475 480
CTA ATC AGC ATG GAA AAT GGC GAA GCG ACC GCT TTT TCC CTT TTC AAT 1545 Leu He Ser Met Glu Asn Gly Glu Ala Thr Ala Phe Ser Leu Phe Asn 485 490 495 500
ATC CAA GAA AGA GGC ACG CTT TTT ATC AAC CCC CAA ACG AAG GTT TAT 1593 He Gin Glu Arg Gly Thr Leu Phe He Asn Pro Gin Thr Lys Val Tyr 505 510 515
GTG GGC ATG GTC ATT GGC GAG CAC AGC CGG GAT AAT GAT TTA GAT GTC 1641 Val Gly Met Val He Gly Glu His Ser Arg Asp Asn Asp Leu Asp Val 520 525 530
AAT CCT ATT AAA TCC AAG CAT TTA ACC AAC ATG AGA GCG AGC GGG AGC 1689 Asn Pro He Lys Ser Lys His Leu Thr Asn Met Arg Ala Ser Gly Ser 535 540 545
GAT GAT GCG ATC AAA CTC ACC CCG CCT AGG ACT ATG GTG TTA GAA AGA 1737 Asp Asp Ala He Lys Leu Thr Pro Pro Arg Thr Met Val Leu Glu Arg 550 555 560
GCG TTA GAA TGG ATT GAA GAA GAT GAG ATT TTG GAA GTT ACC CCC TTG 1785 Ala Leu Glu Trp He Glu Glu Asp Glu He Leu Glu Val Thr Pro Leu 565 570 575 580
AAT TTA AGG ATC AGG AAA AAG ATT TTA GAC CCT AAC ATG AGG AAA AGG 1833 Asn Leu Arg He Arg Lys Lys He Leu Asp Pro Asn Met Arg Lys Arg 585 590 595
GCG AAA AAA TAAATAGAAT TTTTTGGAAT GCATGCCAAT TTATTCAACC AA 1884
Ala Lys Lys
(2) INFORMATION FOR SEQ ID NO: 1336:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 599 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1336:
Met Lys Asn He Arg Asn He Ala Val He Ala His Val Asp His Gly
1 5 10 15
Lys Thr Thr Leu Val Asp Gly Leu Leu Ser Gin Ser Gly Thr Phe Ser
20 25 30
Glu Arg Glu Lys Val Asp Glu Arg Val Met Asp Ser Asn Asp Leu Glu
35 40 45
Arg Glu Arg Gly He Thr He Leu Ser Lys Asn Thr Ala He Tyr Tyr
50 55 60
Lys Asp Thr Lys He Asn He He Asp Thr Pro Gly His Ala Asp Phe 65 70 75 80
Gly Gly Glu Val Glu Arg Val Leu Lys Met Val Asp Gly Val Leu Leu
85 90 95
Leu Val Asp Ala Gin Glu Gly Val Met Pro Gin Thr Lys Phe Val Val
100 105 110
Lys Lys Ala Leu Ser Phe Gly He Cys Pro He Val Val Val Asn Lys
115 120 125
He Asp Lys Pro Ala Ala Glu Pro Asp Arg Val Val Asp Glu Val Phe
130 135 140
Asp Leu Phe Val Ala Met Gly Ala Ser Asp Lys Gin Leu Asp Phe Pro 145 150 155 160
Val Val Tyr Ala Ala Ala Arg Asp Gly Tyr Ala Met Lys Ser Leu Asp
165 170 175
Asp Glu Lys Lys Asn Leu Glu Pro Leu Phe Glu Thr He Leu Glu His
180 185 190
Val Pro Ser Pro Ser Gly Ser Val Asp Glu Pro Leu Gin Met Gin He
195 200 205
Phe Thr Leu Asp Tyr Asp Asn Tyr Val Gly Lys He Gly He Ala Arg
210 215 220
Val Phe Asn Gly Ser Val Lys Lys Asn Glu Ser Val Leu Leu Met Lys 225 230 235 240
Ser Asp Gly Ser Lys Glu Asn Gly Arg He Thr Lys Leu He Gly Phe
245 250 255
Leu Gly Leu Ala Arg Thr Glu He Glu Asn Ala Tyr Ala Gly Asp He
260 265 270
Val Ala He Ala Gly Phe Asn Ala Met Asp Val Gly Asp Ser Val Val
275 280 285
Asp Pro Ala Asn Pro Met Pro Leu Asp Pro Met His Leu Glu Glu Pro
290 295 300
Thr Met Ser Val Tyr Phe Ala Val Asn Asp Ser Pro Leu Ala Gly Leu 305 310 315 320
Glu Gly Lys His Val Thr Ala Asn Lys Leu Lys Asp Arg Leu Leu Lys
325 330 335
Glu Met Gin Thr Asn He Ala Met Lys Cys Glu Glu Met Gly Glu Gly
340 345 350
Lys Phe Lys Val Ser Gly Arg Gly Glu Leu Gin He Thr He Leu Ala
355 360 365
Glu Asn Leu Arg Arg Glu Gly Phe Glu Phe Ser He Ser Arg Pro Glu
370 375 380
Val He He Lys Glu Glu Asn Gly Val Lys Cys Glu Pro Phe Glu His 385 390 395 400
Leu Val He Asp Thr Pro Gin Asp Phe Ser Gly Ala He He Glu Arg
405 410 415
Leu Gly Lys Arg Lys Ala Glu Met Lys Ala Met Asn Pro Met Ser Asp
420 425 430
Gly Tyr Thr Arg Leu Glu Phe Glu He Pro Ala Arg Gly Leu He Gly
435 440 445
Tyr Arg Ser Glu Phe Leu Thr Asp Thr Lys Gly Glu Gly Val Met Asn
450 455 460
His Ser Phe Leu Glu Phe Arg Pro Phe Ser Gly Ser Val Glu Ser Arg 465 470 475 480
Lys Asn Gly Ala Leu He Ser Met Glu Asn Gly Glu Ala Thr Ala Phe
485 490 495
Ser Leu Phe Asn He Gin Glu Arg Gly Thr Leu Phe He Asn Pro Gin
500 505 510
Thr Lys Val Tyr Val Gly Met Val He Gly Glu His Ser Arg Asp Asn
515 520 525
Asp Leu Asp Val Asn Pro He Lys Ser Lys His Leu Thr Asn Met Arg
530 535 540
Ala Ser Gly Ser Asp Asp Ala He Lys Leu Thr Pro Pro Arg Thr Met 545 550 555 560
Val Leu Glu Arg Ala Leu Glu Trp He Glu Glu Asp Glu He Leu Glu
565 570 575
Val Thr Pro Leu Asn Leu Arg He Arg Lys Lys He Leu Asp Pro Asn
580 585 590
Met Arg Lys Arg Ala Lys Lys 595
(2) INFORMATION FOR SEQ ID NO: 1337:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 880 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 54...839 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1337:
TTCTACTAGG CGGACCACCA TGCCAGAGCT ATTCTACCCT TGGCAAAAGA AAA ATG 56
Met
1
GAT GAA AAA GCG AAT CTG TTT AAA GAA TAT TTG CGG CTT TTA GAT TTA 104 Asp Glu Lys Ala Asn Leu Phe Lys Glu Tyr Leu Arg Leu Leu Asp Leu 5 10 15 GTA AAA CCA AAA ATA TTT GTT TTT GAA AAT GTG GTG GGT TTA ATG TCT 152 Val Lys Pro Lys He Phe Val Phe Glu Asn Val Val Gly Leu Met Ser 20 25 30
ATG CAA AAA GGG CAA TTA TTC AAA CAA ATT TGT AAC GCT TTT AAA GAG 200 Met Gin Lys Gly Gin Leu Phe Lys Gin He Cys Asn Ala Phe Lys Glu 35 40 45
AGA GAT TAT ATT TTA GAG CAT GCC ATT TTG AAC GCC CTA GAT TAT GGT 248 Arg Asp Tyr He Leu Glu His Ala He Leu Asn Ala Leu Asp Tyr Gly 50 55 60 65
GTG CCT CAA ATG AGA GAA CGA GTG ATT TTA GTG GGC GTG CTT AAA AGC 296 Val Pro Gin Met Arg Glu Arg Val He Leu Val Gly Val Leu Lys Ser 70 75 80
TTT AAA CAA AAA TTT TAC TTC CCT AAA CCC ATA AAA ACG CAT TTT TCT 344 Phe Lys Gin Lys Phe Tyr Phe Pro Lys Pro He Lys Thr His Phe Ser 85 90 95
CTG AAA GAC GCT TTA GGG GAT TTA CCA CCC ATT CAA AGC GGT GAA AAT 392 Leu Lys Asp Ala Leu Gly Asp Leu Pro Pro He Gin Ser Gly Glu Asn 100 105 110
GGT GAT GCT TTA GGT TAT CTT AAA AAT GCG GAT AAT GTT TTT TTG GAA 440 Gly Asp Ala Leu Gly Tyr Leu Lys Asn Ala Asp Asn Val Phe Leu Glu 115 120 125
TTT GTG CGA AAT TCT AAA GAA TTA AGC GAA CAT AGC AGT CCT AAA AAC 488 Phe Val Arg Asn Ser Lys Glu Leu Ser Glu His Ser Ser Pro Lys Asn 130 135 140 145
AAT GAA AAA CTG ATA AAA ATC ATG CAA ACG CTA AAA GAC GGA CAG AGT 536 Asn Glu Lys Leu He Lys He Met Gin Thr Leu Lys Asp Gly Gin Ser 150 155 160
AAA GAT GAT TTG CCA GAA AGT CTG CGT CCC AAA AGT GGT TAT ATT AAT 584 Lys Asp Asp Leu Pro Glu Ser Leu Arg Pro Lys Ser Gly Tyr He Asn 165 170 175
ACC TAT GCC AAA ATG TGG TGG GAA AAA CCA GCC CCC ACC ATT ACA AGA 632 Thr Tyr Ala Lys Met Trp Trp Glu Lys Pro Ala Pro Thr He Thr Arg 180 185 190
AAT TTT TCT ACC CCA AGC AGT TCT AGG TGT ATC CAT CCA AGA GAC TCT 680 Asn Phe Ser Thr Pro Ser Ser Ser Arg Cys He His Pro Arg Asp Ser 195 200 205
AGA GCG TTA AGC ATT AGA GAG GGG GCA AGA TTG CAA AGC TTT CCT GAT 728 Arg Ala Leu Ser He Arg Glu Gly Ala Arg Leu Gin Ser Phe Pro Asp 210 215 220 225
AAT TAT AAA TTC TGT GGG AGT GGT AGC GCT AAA AGA TTG CAA ATT GGC 776 Asn Tyr Lys Phe Cys Gly Ser Gly Ser Ala Lys Arg Leu Gin He Gly 230 235 240 AAT GCC GTG CCG CCT TTA TTG AGT GTA GCG CTC GCG CAG GCG GTC TTT 824 Asn Ala Val Pro Pro Leu Leu Ser Val Ala Leu Ala Gin Ala Val Phe 245 250 255
GAC TTT TTA AAG GGG TAAGATGTTT AACAATAATG ACTTTAAGGA TTACAGAAAA T 880 Asp Phe Leu Lys Gly 260
880
(2) INFORMATION FOR SEQ ID NO: 1338:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 262 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1338:
Met Asp Glu Lys Ala Asn Leu Phe Lys Glu Tyr Leu Arg Leu Leu Asp
1 5 10 15
Leu Val Lys Pro Lys He Phe Val Phe Glu Asn Val Val Gly Leu Met
20 25 30
Ser Met Gin Lys Gly Gin Leu Phe Lys Gin He Cys Asn Ala Phe Lys
35 40 45
Glu Arg Asp Tyr He Leu Glu His Ala He Leu Asn Ala Leu Asp Tyr
50 55 60
Gly Val Pro Gin Met Arg Glu Arg Val He Leu Val Gly Val Leu Lys 65 70 75 80
Ser Phe Lys Gin Lys Phe Tyr Phe Pro Lys Pro He Lys Thr His Phe
85 90 95
Ser Leu Lys Asp Ala Leu Gly Asp Leu Pro Pro He Gin Ser Gly Glu
100 105 110
Asn Gly Asp Ala Leu Gly Tyr Leu Lys Asn Ala Asp Asn Val Phe Leu
115 120 125
Glu Phe Val Arg Asn Ser Lys Glu Leu Ser Glu His Ser Ser Pro Lys
130 135 140
Asn Asn Glu Lys Leu He Lys He Met Gin Thr Leu Lys Asp Gly Gin 145 150 155 160
Ser Lys Asp Asp Leu Pro Glu Ser Leu Arg Pro Lys Ser Gly Tyr He
165 170 175
Asn Thr Tyr Ala Lys Met Trp Trp Glu Lys Pro Ala Pro Thr He Thr
180 185 190
Arg Asn Phe Ser Thr Pro Ser Ser Ser Arg Cys He His Pro Arg Asp
195 200 205
Ser Arg Ala Leu Ser He Arg Glu Gly Ala Arg Leu Gin Ser Phe Pro
210 215 220
Asp Asn Tyr Lys Phe Cys Gly Ser Gly Ser Ala Lys Arg Leu Gin He 225 230 235 240
Gly Asn Ala Val Pro Pro Leu Leu Ser Val Ala Leu Ala Gin Ala Val
245 250 255
Phe Asp Phe Leu Lys Gly 260 (2) INFORMATION FOR SEQ ID NO: 1339:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1376 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 13...1338 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1339:
TGGTAGTTAA GA ATG GGT AAT CAT TTT TCT AAA TTA GGA TTT GTT TTA GCC 51 Met Gly Asn His Phe Ser Lys Leu Gly Phe Val Leu Ala 1 5 10
GCA TTA GGA AGC GCG ATA GGT TTA GGG CAT ATC TGG CGT TTC CCC TAC 99 Ala Leu Gly Ser Ala He Gly Leu Gly His He Trp Arg Phe Pro Tyr 15 20 25
ATG ACT GGG GTG AGT GGT GGG GGT GCT TTT GTT TTA TTG TTT TTA TTT 147 Met Thr Gly Val Ser Gly Gly Gly Ala Phe Val Leu Leu Phe Leu Phe 30 35 40 45
TTA TCT TTA AGC GTT GGC GCG GCG ATG TTT ATC GCT GAA ATG CTA TTA 195 Leu Ser Leu Ser Val Gly Ala Ala Met Phe He Ala Glu Met Leu Leu 50 55 60
GGA CAA AGC ACT CAA AAA AAT GTA ACA GAA GCT TTT AAA GAG CTT GAC 243 Gly Gin Ser Thr Gin Lys Asn Val Thr Glu Ala Phe Lys Glu Leu Asp 65 70 75
ATT AAC CCC AAA AAA CGC TGG AAA TAC GCA GGG CTT TTG CTT GTT TCT 291 He Asn Pro Lys Lys Arg Trp Lys Tyr Ala Gly Leu Leu Leu Val Ser 80 85 90
GGG CCA TTA ATA CTG ACT TTT TAC GGC ACG ATT TTA GGT TGG GTG CTT 339 Gly Pro Leu He Leu Thr Phe Tyr Gly Thr He Leu Gly Trp Val Leu 95 100 105
TAT TAT TTG GTG AGT GTT AGT TTT AAT TTG CCT AAC AAT ATC CAA GAA 387 Tyr Tyr Leu Val Ser Val Ser Phe Asn Leu Pro Asn Asn He Gin Glu 110 115 120 125
TCT GAA CAA ATT TTT ACT CAA ACT TTG CAG TCT ATA GGG CTA CAA TCC 435 Ser Glu Gin He Phe Thr Gin Thr Leu Gin Ser He Gly Leu Gin Ser 130 135 140
ATA GGG CTT TTT AGC GTT TTA TTG ATA ACC GGA TGG ATT GTT TCT AGG 483 He Gly Leu Phe Ser Val Leu Leu He Thr Gly Trp He Val Ser Arg 145 150 155
GGG ATT AAA GAA GGC ATT GAA AAG CTC AAT TTG GTT TTA ATG CCC TTA 531 Gly He Lys Glu Gly He Glu Lys Leu Asn Leu Val Leu Met Pro Leu 160 165 170
CTC TTT GCT ACT TTT TTT GGT TTG CTT TTC TAT GCG ATG AGC ATG GAT 579 Leu Phe Ala Thr Phe Phe Gly Leu Leu Phe Tyr Ala Met Ser Met Asp 175 180 185
TCT TTT TCT AAA GCT TTT CAT TTC ATG TTT GAT TTC AAA CCA AAA GAT 627 Ser Phe Ser Lys Ala Phe His Phe Met Phe Asp Phe Lys Pro Lys Asp 190 195 200 205
TTG ACC TCT CAA GTG TTC ACT TAT TCC TTG GGG CAG GTT TTC TTT TCC 675 Leu Thr Ser Gin Val Phe Thr Tyr Ser Leu Gly Gin Val Phe Phe Ser 210 215 220
TTA AGC ATC GGT TTA GGG ATC AAT ATC ACT TAC GCT GCG GTT ACG GAT 723 Leu Ser He Gly Leu Gly He Asn He Thr Tyr Ala Ala Val Thr Asp 225 230 235
AAA ACG CAG AAT TTG CTT AAA AGC ACT ATT TGG GTG GTT TTA TCA GGA 771 Lys Thr Gin Asn Leu Leu Lys Ser Thr He Trp Val Val Leu Ser Gly 240 245 250
ATT CTA ATT TCT CTT GTG GCA GGA CTT ATG ATT TTC ACT TTT GTG TTT 819 He Leu He Ser Leu Val Ala Gly Leu Met He Phe Thr Phe Val Phe 255 260 265
GAA TAT GGG GCG AAT GTC TCA CAA GGC ACA GGG TTA ATC TTC ACT TCT 867 Glu Tyr Gly Ala Asn Val Ser Gin Gly Thr Gly Leu He Phe Thr Ser 270 275 280 285
TTA CCG GTG GTT TTT GGC CAA ATG GGA GCG ATA GGC ATT CTT GTT TCG 915 Leu Pro Val Val Phe Gly Gin Met Gly Ala He Gly He Leu Val Ser 290 295 300
ATT CTT TTC TTG CTC GCG CTC GCT TTT GCT GGC ATC ACT TCT ACG GTG 963 He Leu Phe Leu Leu Ala Leu Ala Phe Ala Gly He Thr Ser Thr Val 305 310 315
GCT TTA TTG GAG CCA AGC GTG ATG TAT CTT ACC GAA AGG TAT CAA TAC 1011 Ala Leu Leu Glu Pro Ser Val Met Tyr Leu Thr Glu Arg Tyr Gin Tyr 320 325 330
TCT CGT TTT AAG GTT ACT TGG GGT CTT GTA GCA CTA ATT TTT GTG GTA 1059 Ser Arg Phe Lys Val Thr Trp Gly Leu Val Ala Leu He Phe Val Val 335 340 345
GGC GTG GTG TTG ATT TTC TCG CTC CAT AAG GAT TAT AAA GAT TAT CTC 1107 Gly Val Val Leu He Phe Ser Leu His Lys Asp Tyr Lys Asp Tyr Leu 350 355 360 365
ACT TTC TTT GAA AAA AGT CTT TTT GAT TGG TTG GAT TTT GCA TCA AGC 1155 Thr Phe Phe Glu Lys Ser Leu Phe Asp Trp Leu Asp Phe Ala Ser Ser 370 375 380
ACC ATT ATC ATG CCT TTA GGC GGG ATG GCA ACC TTT ATT TTT ATG GGT 1203 Thr He He Met Pro Leu Gly Gly Met Ala Thr Phe He Phe Met Gly 385 390 395
TGG GTT TTG AAA AAA GAA AAA TTG CGT CTT TTG AGC GTG CAC TTT TTA 1251 Trp Val Leu Lys Lys Glu Lys Leu Arg Leu Leu Ser Val His Phe Leu 400 405 410
GGC CCT AAA TTG TTT GCA ACT TGG TAT TTC TTG CTT AAA TAT ATC ACC 1299 Gly Pro Lys Leu Phe Ala Thr Trp Tyr Phe Leu Leu Lys Tyr He Thr 415 420 425
CCT TTA ATT GTG TTT TCC ATT TGG TTG AGC AAG ATT TAT TAAAATATTT GG 1350 Pro Leu He Val Phe Ser He Trp Leu Ser Lys He Tyr 430 435 440
CATGGGAAAA TTTTCTAAAT TAGGCT 1376
(2) INFORMATION FOR SEQ ID NO: 1340:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 442 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1340:
Met Gly Asn His Phe Ser Lys Leu Gly Phe Val Leu Ala Ala Leu Gly
1 5 10 15
Ser Ala He Gly Leu Gly His He Trp Arg Phe Pro Tyr Met Thr Gly
20 25 30
Val Ser Gly Gly Gly Ala Phe Val Leu Leu Phe Leu Phe Leu Ser Leu
35 40 45
Ser Val Gly Ala Ala Met Phe He Ala Glu Met Leu Leu Gly Gin Ser
50 55 60
Thr Gin Lys Asn Val Thr Glu Ala Phe Lys Glu Leu Asp He Asn Pro 65 70 75 80
Lys Lys Arg Trp Lys Tyr Ala Gly Leu Leu Leu Val Ser Gly Pro Leu
85 90 95
He Leu Thr Phe Tyr Gly Thr He Leu Gly Trp Val Leu Tyr Tyr Leu
100 105 110
Val Ser Val Ser Phe Asn Leu Pro Asn Asn He Gin Glu Ser Glu Gin
115 120 125
He Phe Thr Gin Thr Leu Gin Ser He Gly Leu Gin Ser He Gly Leu 130 135 140 Phe Ser Val Leu Leu He Thr Gly Trp He Val Ser Arg Gly He Lys 145 150 155 160
Glu Gly He Glu Lys Leu Asn Leu Val Leu Met Pro Leu Leu Phe Ala
165 170 175
Thr Phe Phe Gly Leu Leu Phe Tyr Ala Met Ser Met Asp Ser Phe Ser
180 185 190
Lys Ala Phe His Phe Met Phe Asp Phe Lys Pro Lys Asp Leu Thr Ser
195 200 205
Gin Val Phe Thr Tyr Ser Leu Gly Gin Val Phe Phe Ser Leu Ser He
210 215 220
Gly Leu Gly He Asn He Thr Tyr Ala Ala Val Thr Asp Lys Thr Gin 225 230 235 240
Asn Leu Leu Lys Ser Thr He Trp Val Val Leu Ser Gly He Leu He
245 250 255
Ser Leu Val Ala Gly Leu Met He Phe Thr Phe Val Phe Glu Tyr Gly
260 265 270
Ala Asn Val Ser Gin Gly Thr Gly Leu He Phe Thr Ser Leu Pro Val
275 280 285
Val Phe Gly Gin Met Gly Ala He Gly He Leu Val Ser He Leu Phe
290 295 300
Leu Leu Ala Leu Ala Phe Ala Gly He Thr Ser Thr Val Ala Leu Leu 305 310 315 320
Glu Pro Ser Val Met Tyr Leu Thr Glu Arg Tyr Gin Tyr Ser Arg Phe
325 330 335
Lys Val Thr Trp Gly Leu Val Ala Leu He Phe Val Val Gly Val Val
340 345 350
Leu He Phe Ser Leu His Lys Asp Tyr Lys Asp Tyr Leu Thr Phe Phe
355 360 365
Glu Lys Ser Leu Phe Asp Trp Leu Asp Phe Ala Ser Ser Thr He He
370 375 380
Met Pro Leu Gly Gly Met Ala Thr Phe He Phe Met Gly Trp Val Leu 385 390 395 400
Lys Lys Glu Lys Leu Arg Leu Leu Ser Val His Phe Leu Gly Pro Lys
405 410 415
Leu Phe Ala Thr Trp Tyr Phe Leu Leu Lys Tyr He Thr Pro Leu He
420 425 430
Val Phe Ser He Trp Leu Ser Lys He Tyr 435 440
(2) INFORMATION FOR SEQ ID NO: 1341:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1120 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 17...1081 (D) OTHER INFORMATION: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1341:
GAAATAAGGA TGCTTG ATG AAA AGC ATT TTG CTC TTT ATG ATT TTT GTA GTT 52
Met Lys Ser He Leu Leu Phe Met He Phe Val Val 1 5 10
TGT CAG TTA GAA GGC AAA AAA TTT TCA CAA GAT AAT TTT AAG GTG GAT 100 Cys Gin Leu Glu Gly Lys Lys Phe Ser Gin Asp Asn Phe Lys Val Asp 15 20 25
TAT AAC TAC TAT TTG CGC AAA CAG GAT TTG CAC ATC ATT AAA ACG CAA 148 Tyr Asn Tyr Tyr Leu Arg Lys Gin Asp Leu His He He Lys Thr Gin 30 35 40
AAC GAT TTG TCC AAT TCT TGG TAT CTC CCT CCA CAA AAA GCC CCC AAA 196 Asn Asp Leu Ser Asn Ser Trp Tyr Leu Pro Pro Gin Lys Ala Pro Lys 45 50 55 60
GAA CAT TCT TGG GTG GAT TTT GCT AAA AAA TAT TTA AAC ATG ATG GAT 244 Glu His Ser Trp Val Asp Phe Ala Lys Lys Tyr Leu Asn Met Met Asp 65 70 75
TAT CTA GGC ACT TAT TTT CTG CCT TTT TAT CAT AGT TTC ACC CCC ATT 292 Tyr Leu Gly Thr Tyr Phe Leu Pro Phe Tyr His Ser Phe Thr Pro He 80 85 90
TTT CAA TGG TAC CAC CCC AAT ATC AAC CCG TAT CAA CGC AAT GAG TTT 340 Phe Gin Trp Tyr His Pro Asn He Asn Pro Tyr Gin Arg Asn Glu Phe 95 100 105
AAG TTC CAA ATT AGT TTT AGA GTG CCT GTA TTT AGG CAT ATT CTT TGG 388 Lys Phe Gin He Ser Phe Arg Val Pro Val Phe Arg His He Leu Trp 110 115 120
ACT AAA GGC ACG CTG TAT TTA GCT TAT ACC CAA ACT GAC TGG TTT CAA 436 Thr Lys Gly Thr Leu Tyr Leu Ala Tyr Thr Gin Thr Asp Trp Phe Gin 125 130 135 140
ATT TAC AAT GAC CCC CAA TCC GCT CCC ATG CGA ATG ATG AAT TTC ATG 484 He Tyr Asn Asp Pro Gin Ser Ala Pro Met Arg Met Met Asn Phe Met 145 150 155
CCT GAA CTC ATT TAT GTT TAT CCT ATC AAT TTT AAA CCT TTT GGG GGT 532 Pro Glu Leu He Tyr Val Tyr Pro He Asn Phe Lys Pro Phe Gly Gly 160 165 170
AAA ATA GGG AAT TTT TCT GAA ATT TGG ATA GGT TGG CAG CAC ATT TCT 580 Lys He Gly Asn Phe Ser Glu He Trp He Gly Trp Gin His He Ser 175 180 185
AAT GGC GTG GGG GGC GCG CAA TGT TAC CAA CCT TTT AAT AAA GAA GGC 628 Asn Gly Val Gly Gly Ala Gin Cys Tyr Gin Pro Phe Asn Lys Glu Gly 190 195 200
AAT CCT GAA AAC CAG TTT CCA GGA CAA CCT GTA ATC GTT AAA GAT TAT 676 Asn Pro Glu Asn Gin Phe Pro Gly Gin Pro Val He Val Lys Asp Tyr 205 210 215 220
AAT GGG CAA AAA GAT GTG CGC TGG GGG GGG TGT CGT TCG GTG AGC GCG 724 Asn Gly Gin Lys Asp Val Arg Trp Gly Gly Cys Arg Ser Val Ser Ala 225 230 235
GGG CAA CGC CCT GTG TTT CGT TTG GTG TGG GAA AAG GGA GGC CTA AAA 772 Gly Gin Arg Pro Val Phe Arg Leu Val Trp Glu Lys Gly Gly Leu Lys 240 245 250
ATC ATG GTC GCT TAT TGG CCC TAT GTC CCT TAT GAT CAA TCC AAT CCT 820 He Met Val Ala Tyr Trp Pro Tyr Val Pro Tyr Asp Gin Ser Asn Pro 255 260 265
AAT TTG ATT GAT TAC ATG GGG TAT GGT AAC GCT AAA ATT GAT TAC AGG 868 Asn Leu He Asp Tyr Met Gly Tyr Gly Asn Ala Lys He Asp Tyr Arg 270 275 280
AGA GGG CGC CAC CAT TTT GAA TTG CAG CTT TAT GAT ATT TTC ACG CAA 916 Arg Gly Arg His His Phe Glu Leu Gin Leu Tyr Asp He Phe Thr Gin 285 290 295 300
TAC TGG CGT TAT GAT CGC TGG CAT GGA GCT TTC CGC TTA GGC TAT ACC 964 Tyr Trp Arg Tyr Asp Arg Trp His Gly Ala Phe Arg Leu Gly Tyr Thr 305 310 315
TAT CGC ATT AAC CCT TTT GTG GGG ATT TAT GCG CAG TGG TTT AAC GGC 1012 Tyr Arg He Asn Pro Phe Val Gly He Tyr Ala Gin Trp Phe Asn Gly 320 325 330
TAT GGC GAT GGC TTG TAT GAA TAC GAT GTT TTT TCC AAT CGT ATA GGG 1060 Tyr Gly Asp Gly Leu Tyr Glu Tyr Asp Val Phe Ser Asn Arg He Gly 335 340 345
GTA GGA ATA CGC TTA AAC CCT TAAAAAAGCG TTCTTTTAYG CTATAATTAA GACC 1115 Val Gly He Arg Leu Asn Pro 350 355
AAAAA 1120
(2) INFORMATION FOR SEQ ID NO:1342:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 355 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1342:
Met Lys Ser He Leu Leu Phe Met He Phe Val Val Cys Gin Leu Glu 1 5 10 15 Gly Lys Lys Phe Ser Gin Asp Asn Phe Lys Val Asp Tyr Asn Tyr Tyr
20 25 30
Leu Arg Lys Gin Asp Leu His He He Lys Thr Gin Asn Asp Leu Ser
35 40 45
Asn Ser Trp Tyr Leu Pro Pro Gin Lys Ala Pro Lys Glu His Ser Trp
50 55 60
Val Asp Phe Ala Lys Lys Tyr Leu Asn Met Met Asp Tyr Leu Gly Thr 65 70 75 80
Tyr Phe Leu Pro Phe Tyr His Ser Phe Thr Pro He Phe Gin Trp Tyr
85 90 95
His Pro Asn He Asn Pro Tyr Gin Arg Asn Glu Phe Lys Phe Gin He
100 105 110
Ser Phe Arg Val Pro Val Phe Arg His He Leu Trp Thr Lys Gly Thr
115 120 125
Leu Tyr Leu Ala Tyr Thr Gin Thr Asp Trp Phe Gin He Tyr Asn Asp
130 135 140
Pro Gin Ser Ala Pro Met Arg Met Met Asn Phe Met Pro Glu Leu He 145 150 155 160
Tyr Val Tyr Pro He Asn Phe Lys Pro Phe Gly Gly Lys He Gly Asn
165 170 175
Phe Ser Glu He Trp He Gly Trp Gin His He Ser Asn Gly Val Gly
180 185 190
Gly Ala Gin Cys Tyr Gin Pro Phe Asn Lys Glu Gly Asn Pro Glu Asn
195 200 205
Gin Phe Pro Gly Gin Pro Val He Val Lys Asp Tyr Asn Gly Gin Lys
210 215 220
Asp Val Arg Trp Gly Gly Cys Arg Ser Val Ser Ala Gly Gin Arg Pro 225 230 235 240
Val Phe Arg Leu Val Trp Glu Lys Gly Gly Leu Lys He Met Val Ala
245 250 255
Tyr Trp Pro Tyr Val Pro Tyr Asp Gin Ser Asn Pro Asn Leu He Asp
260 265 270
Tyr Met Gly Tyr Gly Asn Ala Lys He Asp Tyr Arg Arg Gly Arg His
275 280 285
His Phe Glu Leu Gin Leu Tyr Asp He Phe Thr Gin Tyr Trp Arg Tyr
290 295 300
Asp Arg Trp His Gly Ala Phe Arg Leu Gly Tyr Thr Tyr Arg He Asn 305 310 315 320
Pro Phe Val Gly He Tyr Ala Gin Trp Phe Asn Gly Tyr Gly Asp Gly
325 330 335
Leu Tyr Glu Tyr Asp Val Phe Ser Asn Arg He Gly Val Gly He Arg
340 345 350
Leu Asn Pro 355
(2) INFORMATION FOR SEQ ID NO: 1343:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 697 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE: (A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...669 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1343:
AAAAAATTCA TTTTATCTTT TAGAGGGTTT TTA ATG TCT TAT TTT AAG AAT GCT 54
Met Ser Tyr Phe Lys Asn Ala 1 5
TTC AAT CAA AAA TCT TTA ATA GAT GAT TCT AGT GTG TAT TTA GAG CCT 102 Phe Asn Gin Lys Ser Leu He Asp Asp Ser Ser Val Tyr Leu Glu Pro 10 15 20
TGT TCT AGC TCT AAT TTC ATA GAA TTA AAA CGC ATG CAT TAT AAT GAA 150 Cys Ser Ser Ser Asn Phe He Glu Leu Lys Arg Met His Tyr Asn Glu 25 30 35
GAG AAT ACT AAG AAA ACA TGG GAT ATT ATT AAG TCT TTA GAC AGC GTG 198 Glu Asn Thr Lys Lys Thr Trp Asp He He Lys Ser Leu Asp Ser Val 40 45 50 55
GCG GTT TTA CTC TAT GAA AAA GAA TCC GAT TGC TTT GTG ATT GTG AAA 246 Ala Val Leu Leu Tyr Glu Lys Glu Ser Asp Cys Phe Val He Val Lys 60 65 70
CAA TTC CGC CCA GCC ATT TAT GCG CGC CGT TTT CAT TTT AAG TGC GAT 294 Gin Phe Arg Pro Ala He Tyr Ala Arg Arg Phe His Phe Lys Cys Asp 75 80 85
CAA GAT CAA ACT ATT GAC GGA TAC ACT TAT GAA TTG TGC GCA GGG CTT 342 Gin Asp Gin Thr He Asp Gly Tyr Thr Tyr Glu Leu Cys Ala Gly Leu 90 95 100
GTG GAT AAA GCT AAT AAG AGT TTA GAA GAA ATC GCT TGC GAA GAA GCG 390 Val Asp Lys Ala Asn Lys Ser Leu Glu Glu He Ala Cys Glu Glu Ala 105 110 115
CTA GAA GAA TGC GGT TAT CAA ATT AGC CCT AAA AAT TTA GAA ACC ATA 438 Leu Glu Glu Cys Gly Tyr Gin He Ser Pro Lys Asn Leu Glu Thr He 120 125 130 135
GGC CAA TTT TAT AGC GCG ACT GGG TTG AGT GGG AGT TTG CAA ACG CTC 486 Gly Gin Phe Tyr Ser Ala Thr Gly Leu Ser Gly Ser Leu Gin Thr Leu 140 145 150
TAT TAC GCT GAA GTG CAT AAG AAT TTG AAA GTT TCA AAG GGT GGG GGG 534 Tyr Tyr Ala Glu Val His Lys Asn Leu Lys Val Ser Lys Gly Gly Gly 155 160 165
ATT GAT ACC GAA AGG ATT GAA GTG CTG TTT TTA GAG CGA TCA AAA GCT 582 He Asp Thr Glu Arg He Glu Val Leu Phe Leu Glu Arg Ser Lys Ala 170 175 180 CTT GAT TTT ATA ATG GAT TTT CAA TAC GCT AAA ACC ACC GGA TTG TCT 630 Leu Asp Phe He Met Asp Phe Gin Tyr Ala Lys Thr Thr Gly Leu Ser 185 190 195
TTA GCC ATT TTA TGG CAT TTA AAA AAG TTT AAA AAT GTT TAAAAGGAAT TT 681 Leu Ala He Leu Trp His Leu Lys Lys Phe Lys Asn Val 200 205 210
TATGTTAAGG CTTTTG 697
(2) INFORMATION FOR SEQ ID NO: 1344:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 212 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1344:
Met Ser Tyr Phe Lys Asn Ala Phe Asn Gin Lys Ser Leu He Asp Asp
1 5 10 15
Ser Ser Val Tyr Leu Glu Pro Cys Ser Ser Ser Asn Phe He Glu Leu
20 25 30
Lys Arg Met His Tyr Asn Glu Glu Asn Thr Lys Lys Thr Trp Asp He
35 40 45
He Lys Ser Leu Asp Ser Val Ala Val Leu Leu Tyr Glu Lys Glu Ser
50 55 60
Asp Cys Phe Val He Val Lys Gin Phe Arg Pro Ala He Tyr Ala Arg 65 70 75 80
Arg Phe His Phe Lys Cys Asp Gin Asp Gin Thr He Asp Gly Tyr Thr
85 90 95
Tyr Glu Leu Cys Ala Gly Leu Val Asp Lys Ala Asn Lys Ser Leu Glu
100 105 110
Glu He Ala Cys Glu Glu Ala Leu Glu Glu Cys Gly Tyr Gin He Ser
115 120 125
Pro Lys Asn Leu Glu Thr He Gly Gin Phe Tyr Ser Ala Thr Gly Leu
130 135 140
Ser Gly Ser Leu Gin Thr Leu Tyr Tyr Ala Glu Val His Lys Asn Leu 145 150 155 160
Lys Val Ser Lys Gly Gly Gly He Asp Thr Glu Arg He Glu Val Leu
165 170 175
Phe Leu Glu Arg Ser Lys Ala Leu Asp Phe He Met Asp Phe Gin Tyr
180 185 190
Ala Lys Thr Thr Gly Leu Ser Leu Ala He Leu Trp His Leu Lys Lys
195 200 205
Phe Lys Asn Val 210
(2) INFORMATION FOR SEQ ID NO: 1345:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 2071 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...2022 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1345:
TGCGAACAAT TATGGGATGA TATTATAAAA ATTGGTGGGA ATGATAAG ATG AAC GGA 57
Met Asn Gly
1
CAT TTT ATC GGT TCT ATT TTG TAT GTG CTA GAT AGT AAT ACG CAC TCT 105 His Phe He Gly Ser He Leu Tyr Val Leu Asp Ser Asn Thr His Ser 5 10 15
AAC AAT ACA TTA CTC ATC ATT GAC GGC CAA CAA AGG CTC ACC ACT ATC 153 Asn Asn Thr Leu Leu He He Asp Gly Gin Gin Arg Leu Thr Thr He 20 25 30 35
ACG CTT TTA CTC ATC GCT TTA AGG AAT CAT CTA AGC GAA GAA GTT GAA 201 Thr Leu Leu Leu He Ala Leu Arg Asn His Leu Ser Glu Glu Val Glu 40 45 50
ATT TTG GAG AAA TTT TCG CGT AAA GAA ATA GAG AGC TAT CTT ATC AAC 249 He Leu Glu Lys Phe Ser Arg Lys Glu He Glu Ser Tyr Leu He Asn 55 60 65
AGC AAT AAG GAC GGC GAT AAG AAA TTC AGG CTC ATT CTT TCA GAG TCC 297 Ser Asn Lys Asp Gly Asp Lys Lys Phe Arg Leu He Leu Ser Glu Ser 70 75 80
GAT AAA GAC ACC TTG CTG TCT TTG ATT GAT AAA AAC AAA AGA AAG CCG 345 Asp Lys Asp Thr Leu Leu Ser Leu He Asp Lys Asn Lys Arg Lys Pro 85 90 95
AGC GAG CCT TCG GTA AAA ATA GTG GAA AAT TTT GAA TTG TTT GAA AAA 393 Ser Glu Pro Ser Val Lys He Val Glu Asn Phe Glu Leu Phe Glu Lys 100 105 110 115
TGG ATC AGT GAA AAC ACC GAC AAA CTA GAA ACG ATT TTT AAA GGA TTA 441 Trp He Ser Glu Asn Thr Asp Lys Leu Glu Thr He Phe Lys Gly Leu 120 125 130
AAA AAA CTC ATG ATA GTT TGG ATT TCT TTA GAT AAA GGA AAA GAT GAT 489 Lys Lys Leu Met He Val Trp He Ser Leu Asp Lys Gly Lys Asp Asp 135 140 145 CCT CAA CTT ATT TTT GAG AGC ATG AAC TCA AAA GAT ATC GAA CTC ACG 537 Pro Gin Leu He Phe Glu Ser Met Asn Ser Lys Asp He Glu Leu Thr 150 155 160
CAA ACG GAT TTG ATC AGA AAT TAT ATC GTA ATG GAA ACG GAG GTT GAA 585 Gin Thr Asp Leu He Arg Asn Tyr He Val Met Glu Thr Glu Val Glu 165 170 175
AAA CAG GAA GAC TTT TAT AAT CAA TAT TGG AGG GCT ATG GAG GAG AGA 633 Lys Gin Glu Asp Phe Tyr Asn Gin Tyr Trp Arg Ala Met Glu Glu Arg 180 185 190 195
TTT GAA CAA AAT GAA ACA TTG TTT AAT CGG TTT GTC CGG CAT TAT CTC 681 Phe Glu Gin Asn Glu Thr Leu Phe Asn Arg Phe Val Arg His Tyr Leu 200 205 210
ACG ATC AAA ATA GGA AAG ATT CCC AAT GAG AAA AGA GTT TAT GAA GCT 729 Thr He Lys He Gly Lys He Pro Asn Glu Lys Arg Val Tyr Glu Ala 215 220 225
TTC AAG GAT TAC CGG CAA AAA AAG GGG ATA GAA ATA GAG GAT TTA TTA 777 Phe Lys Asp Tyr Arg Gin Lys Lys Gly He Glu He Glu Asp Leu Leu 230 235 240
AAA GAT TTA CAA AAA TAC TGC GGG TAT TTT TGC CAG ATT GCA TTC AAA 825 Lys Asp Leu Gin Lys Tyr Cys Gly Tyr Phe Cys Gin He Ala Phe Lys 245 250 255
AAA GAA GAC GAT AAA GAT TTA AAC AAG GCT TTA AGT TTT TTG GTG AAT 873 Lys Glu Asp Asp Lys Asp Leu Asn Lys Ala Leu Ser Phe Leu Val Asn 260 265 270 275
TTA GAG ATG GAT GTG ATC TAT CCG CTA CTA CTA GAG CTT TAT AGC GAT 921 Leu Glu Met Asp Val He Tyr Pro Leu Leu Leu Glu Leu Tyr Ser Asp 280 285 290
TAT AAG GAT GGC GTT TTA TCC AAG CAG GAT TTT ATC CCT ATT ATC TAT 969 Tyr Lys Asp Gly Val Leu Ser Lys Gin Asp Phe He Pro He He Tyr 295 300 305
TTA ATA GAG AGC TAT ATT TGC AGA AGG GCG GTG TGT GGG CTT GGC ACA 1017 Leu He Glu Ser Tyr He Cys Arg Arg Ala Val Cys Gly Leu Gly Thr 310 315 320
AAT AGT CTC AAT AAA GTT TTT CCC TCT TTT ACA AAG CAC ATC CAA AAA 1065 Asn Ser Leu Asn Lys Val Phe Pro Ser Phe Thr Lys His He Gin Lys 325 330 335
GAT GAA TAT TTT AAA AGC CTA AAG GCG CAT TTT GTC TGT CTG ACA GAA 1113 Asp Glu Tyr Phe Lys Ser Leu Lys Ala His Phe Val Cys Leu Thr Glu 340 345 350 355
AAA CAA AGA TTT CCA AAC AAT GAC GAG TTT AAA AAG CTT TTT ATT ACG 1161 Lys Gin Arg Phe Pro Asn Asn Asp Glu Phe Lys Lys Leu Phe He Thr 360 365 370 ATA GAT TTT TAT AAG TTT AAA AAA AAT AAA TAC TTT CTT GAA AGG TTA 1209 He Asp Phe Tyr Lys Phe Lys Lys Asn Lys Tyr Phe Leu Glu Arg Leu 375 380 385
GAA AAT TTT GAC ACA AAA GAA CCG GTC GAT ACT CAA AAA TGC AAT ATA 1257 Glu Asn Phe Asp Thr Lys Glu Pro Val Asp Thr Gin Lys Cys Asn He 390 395 400
GAA CAT ATA ATG CCT CAA ACC CTT ACT CCA GAA TGG CAA AGG GAT TTG 1305 Glu His He Met Pro Gin Thr Leu Thr Pro Glu Trp Gin Arg Asp Leu 405 410 415
GGT GAA AAT TTT CAA GCA ATA CAC GAG AAA TAC CTC CAC ACA ATA GGG 1353 Gly Glu Asn Phe Gin Ala He His Glu Lys Tyr Leu His Thr He Gly 420 425 430 435
AAT CTC ACT CTA ACC GGT TAT AAC TCT AAG TAT AGC AAC AAT TCT TTC 1401 Asn Leu Thr Leu Thr Gly Tyr Asn Ser Lys Tyr Ser Asn Asn Ser Phe 440 445 450
CAA GAA AAA AGA GAT ATG GAG AAG GGC TTT AAA CAA AGC TCA TTA AAA 1449 Gin Glu Lys Arg Asp Met Glu Lys Gly Phe Lys Gin Ser Ser Leu Lys 455 460 465
CTC AAT CAA AGT TTG AAA GAT TTG GAA TCT TTT GGC GAA AAA GAG ATT 1497 Leu Asn Gin Ser Leu Lys Asp Leu Glu Ser Phe Gly Glu Lys Glu He 470 475 480
GAA AAA AGG GCT AGT GAT TTA GCG GAT TGG GCT TTA AAG ATT TGG ACT 1545 Glu Lys Arg Ala Ser Asp Leu Ala Asp Trp Ala Leu Lys He Trp Thr 485 490 495
TAC CCA ATT CTA GAG GCA GAA ACA TTA GAG GAA TAT AAA CCC AAA AAA 1593 Tyr Pro He Leu Glu Ala Glu Thr Leu Glu Glu Tyr Lys Pro Lys Lys 500 505 510 515
GAA AAG AAA GAA AAG AAA GAA AAA GAG GAG TAT AAA CTC AAG AAA GAA 1641 Glu Lys Lys Glu Lys Lys Glu Lys Glu Glu Tyr Lys Leu Lys Lys Glu 520 525 530
AAA AAG GTT TAT GAT TTA AGC TCT TAT AAG TTT AGC TCT GAT TCA AGG 1689 Lys Lys Val Tyr Asp Leu Ser Ser Tyr Lys Phe Ser Ser Asp Ser Arg 535 540 545
GAA TTG TTT GAT ATT TTA AGA GAA AAG ATT AAA GCT CTT GAT GAA AGG 1737 Glu Leu Phe Asp He Leu Arg Glu Lys He Lys Ala Leu Asp Glu Arg 550 555 560
ATA ACT GAA AAA TTT AAT CAA AAA TAT ATA GCT TAT AAG TTT TGT AAA 1785 He Thr Glu Lys Phe Asn Gin Lys Tyr He Ala Tyr Lys Phe Cys Lys 565 570 575
ATA AGT TTT GTG GAT ATT GTT GTG CAA GAA AAA GGC TTA AAA TTG TAT 1833 He Ser Phe Val Asp He Val Val Gin Glu Lys Gly Leu Lys Leu Tyr 580 585 590 595 TTA AAA ATG AAC TTG AAT GAA TTG CAA GAT GAA ATA AAG GAA AAA CTA 1881 Leu Lys Met Asn Leu Asn Glu Leu Gin Asp Glu He Lys Glu Lys Leu 600 605 610
AAA ATT AGA GAC GTT TCT AAT ATC GGT CGT CCA TGC GTT GGA AAC ATG 1929 Lys He Arg Asp Val Ser Asn He Gly Arg Pro Cys Val Gly Asn Met 615 620 625
GAA GTA GAG CTA GAA ACA AAA GAA AAT ATC CCT TAT TGT TTG GGA TTG 1977 Glu Val Glu Leu Glu Thr Lys Glu Asn He Pro Tyr Cys Leu Gly Leu 630 635 640
ATC AAG CAG GCT TTA GAA AAA CAG ATG GGT GGT AGG AAT AGG CAA TAAAA 2027 He Lys Gin Ala Leu Glu Lys Gin Met Gly Gly Arg Asn Arg Gin 645 650 655
ACCCAACTTA TTCAAAATAA AGAGTATAAT TACAAATTAC TTAC 2071
(2) INFORMATION FOR SEQ ID NO: 1346:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 658 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1346:
Met Asn Gly His Phe He Gly Ser He Leu Tyr Val Leu Asp Ser Asn
1 5 10 15
Thr His Ser Asn Asn Thr Leu Leu He He Asp Gly Gin Gin Arg Leu
20 25 30
Thr Thr He Thr Leu Leu Leu He Ala Leu Arg Asn His Leu Ser Glu
35 40 45
Glu Val Glu He Leu Glu Lys Phe Ser Arg Lys Glu He Glu Ser Tyr
50 55 60
Leu He Asn Ser Asn Lys Asp Gly Asp Lys Lys Phe Arg Leu He Leu 65 70 75 80
Ser Glu Ser Asp Lys Asp Thr Leu Leu Ser Leu He Asp Lys Asn Lys
85 90 95
Arg Lys Pro Ser Glu Pro Ser Val Lys He Val Glu Asn Phe Glu Leu
100 105 110
Phe Glu Lys Trp He Ser Glu Asn Thr Asp Lys Leu Glu Thr He Phe
115 120 125
Lys Gly Leu Lys Lys Leu Met He Val Trp He Ser Leu Asp Lys Gly
130 135 140
Lys Asp Asp Pro Gin Leu He Phe Glu Ser Met Asn Ser Lys Asp He 145 150 155 160
Glu Leu Thr Gin Thr Asp Leu He Arg Asn Tyr He Val Met Glu Thr
165 170 175
Glu Val Glu Lys Gin Glu Asp Phe Tyr Asn Gin Tyr Trp Arg Ala Met
180 185 190
Glu Glu Arg Phe Glu Gin Asn Glu Thr Leu Phe Asn Arg Phe Val Arg 195 200 205
His Tyr Leu Thr He Lys He Gly Lys He Pro Asn Glu Lys Arg Val
210 215 220
Tyr Glu Ala Phe Lys Asp Tyr Arg Gin Lys Lys Gly He Glu He Glu 225 230 235 240
Asp Leu Leu Lys Asp Leu Gin Lys Tyr Cys Gly Tyr Phe Cys Gin He
245 250 255
Ala Phe Lys Lys Glu Asp Asp Lys Asp Leu Asn Lys Ala Leu Ser Phe
260 265 270
Leu Val Asn Leu Glu Met Asp Val He Tyr Pro Leu Leu Leu Glu Leu
275 280 285
Tyr Ser Asp Tyr Lys Asp Gly Val Leu Ser Lys Gin Asp Phe He Pro
290 295 300
He He Tyr Leu He Glu Ser Tyr He Cys Arg Arg Ala Val Cys Gly 305 310 315 320
Leu Gly Thr Asn Ser Leu Asn Lys Val Phe Pro Ser Phe Thr Lys His
325 330 335
He Gin Lys Asp Glu Tyr Phe Lys Ser Leu Lys Ala His Phe Val Cys
340 345 350
Leu Thr Glu Lys Gin Arg Phe Pro Asn Asn Asp Glu Phe Lys Lys Leu
355 360 365
Phe He Thr He Asp Phe Tyr Lys Phe Lys Lys Asn Lys Tyr Phe Leu
370 375 380
Glu Arg Leu Glu Asn Phe Asp Thr Lys Glu Pro Val Asp Thr Gin Lys 385 390 395 400
Cys Asn He Glu His He Met Pro Gin Thr Leu Thr Pro Glu Trp Gin
405 410 415
Arg Asp Leu Gly Glu Asn Phe Gin Ala He His Glu Lys Tyr Leu His
420 425 430
Thr He Gly Asn Leu Thr Leu Thr Gly Tyr Asn Ser Lys Tyr Ser Asn
435 440 445
Asn Ser Phe Gin Glu Lys Arg Asp Met Glu Lys Gly Phe Lys Gin Ser
450 455 460
Ser Leu Lys Leu Asn Gin Ser Leu Lys Asp Leu Glu Ser Phe Gly Glu 465 470 475 480
Lys Glu He Glu Lys Arg Ala Ser Asp Leu Ala Asp Trp Ala Leu Lys
485 490 495
He Trp Thr Tyr Pro He Leu Glu Ala Glu Thr Leu Glu Glu Tyr Lys
500 505 510
Pro Lys Lys Glu Lys Lys Glu Lys Lys Glu Lys Glu Glu Tyr Lys Leu
515 520 525
Lys Lys Glu Lys Lys Val Tyr Asp Leu Ser Ser Tyr Lys Phe Ser Ser
530 535 540
Asp Ser Arg Glu Leu Phe Asp He Leu Arg Glu Lys He Lys Ala Leu 545 550 555 560
Asp Glu Arg He Thr Glu Lys Phe Asn Gin Lys Tyr He Ala Tyr Lys
565 570 575
Phe Cys Lys He Ser Phe Val Asp He Val Val Gin Glu Lys Gly Leu
580 585 590
Lys Leu Tyr Leu Lys Met Asn Leu Asn Glu Leu Gin Asp Glu He Lys
595 600 605
Glu Lys Leu Lys He Arg Asp Val Ser Asn He Gly Arg Pro Cys Val
610 615 620
Gly Asn Met Glu Val Glu Leu Glu Thr Lys Glu Asn He Pro Tyr Cys 625 630 635 640 Leu Gly Leu He Lys Gin Ala Leu Glu Lys Gin Met Gly Gly Arg Asn
645 650 655
Arg Gin
(2) INFORMATION FOR SEQ ID NO: 1347:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 598 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...558 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1347:
GTGGTGGCTG AGTAGAAA ATG TTT GAA GCG ACG ACG ATT TTA GGC TAT AGA 51
Met Phe Glu Ala Thr Thr He Leu Gly Tyr Arg 1 5 10
GGG GAA TTG AAT CAT AAA AAG TTC GCG CTC ATT GGA GGC GAT GGG CAG 99 Gly Glu Leu Asn His Lys Lys Phe Ala Leu He Gly Gly Asp Gly Gin 15 20 25
GTA ACT TTG GGT AAT TGC GTG GTC AAA GCC AAT GCG ACA AAA ATC AGA 147 Val Thr Leu Gly Asn Cys Val Val Lys Ala Asn Ala Thr Lys He Arg 30 35 40
AGC TTG TAT CAC AAC CAG GTT TTA AGC GGG TTT GCC GGA AGC ACC GCG 195 Ser Leu Tyr His Asn Gin Val Leu Ser Gly Phe Ala Gly Ser Thr Ala 45 50 55
GAC GCT TTT AGT TTG TTT GAT ATG TTT GAA CGC ATT TTA GAG AGC AAA 243 Asp Ala Phe Ser Leu Phe Asp Met Phe Glu Arg He Leu Glu Ser Lys 60 65 70 75
AAG GGG GAT TTG TTT AAA AGC GTG GTG GAT TTC AGT AAA GAA TGG CGC 291 Lys Gly Asp Leu Phe Lys Ser Val Val Asp Phe Ser Lys Glu Trp Arg 80 85 90
AAA GAT AAG TAT TTA CGC CGA CTG GAA GCG ATG ATG ATC GTT TTA AAC 339 Lys Asp Lys Tyr Leu Arg Arg Leu Glu Ala Met Met He Val Leu Asn 95 100 105
TTC GAT CAC ATT TTC ATT TTG AGC GGC ATG GGC GAT GTT TTA GAA GCT 387 Phe Asp His He Phe He Leu Ser Gly Met Gly Asp Val Leu Glu Ala 110 115 120 GAA GAC AAT AAG ATC GCT GCT ATT GGG AGT GGG GGG AAT TAC GCT TTA 435 Glu Asp Asn Lys He Ala Ala He Gly Ser Gly Gly Asn Tyr Ala Leu 125 130 135
AGC GCG GCT AGG GCT TTA GAT CAT TTC GCT CAT TTA GAG CCT AGA AAA 483 Ser Ala Ala Arg Ala Leu Asp His Phe Ala His Leu Glu Pro Arg Lys 140 145 150 155
CTT GTA GAA GAG TCC TTA AAA ATC GCA GGG GAT CTT TGC ATT TAC ACC 531 Leu Val Glu Glu Ser Leu Lys He Ala Gly Asp Leu Cys He Tyr Thr 160 165 170
AAC ACG AAT ATT AAA ATT TTG GAG CTT TAATGTCTAA ATTGAATATG ACCCCAC 585 Asn Thr Asn He Lys He Leu Glu Leu 175 180
GAGAAATTGT CGC 598
(2) INFORMATION FOR SEQ ID NO: 1348:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 180 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1348:
Met Phe Glu Ala Thr Thr He Leu Gly Tyr Arg Gly Glu Leu Asn His
1 5 10 15
Lys Lys Phe Ala Leu He Gly Gly Asp Gly Gin Val Thr Leu Gly Asn
20 25 30
Cys Val Val Lys Ala Asn Ala Thr Lys He Arg Ser Leu Tyr His Asn
35 40 45
Gin Val Leu Ser Gly Phe Ala Gly Ser Thr Ala Asp Ala Phe Ser Leu
50 55 60
Phe Asp Met Phe Glu Arg He Leu Glu Ser Lys Lys Gly Asp Leu Phe 65 70 75 80
Lys Ser Val Val Asp Phe Ser Lys Glu Trp Arg Lys Asp Lys Tyr Leu
85 90 95
Arg Arg Leu Glu Ala Met Met He Val Leu Asn Phe Asp His He Phe
100 105 110
He Leu Ser Gly Met Gly Asp Val Leu Glu Ala Glu Asp Asn Lys He
115 120 125
Ala Ala He Gly Ser Gly Gly Asn Tyr Ala Leu Ser Ala Ala Arg Ala
130 135 140
Leu Asp His Phe Ala His Leu Glu Pro Arg Lys Leu Val Glu Glu Ser 145 150 155 160
Leu Lys He Ala Gly Asp Leu Cys He Tyr Thr Asn Thr Asn He Lys
165 170 175
He Leu Glu Leu 180 (2) INFORMATION FOR SEQ ID NO: 1349:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 450 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...396 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1349:
TGTTTCATAG TAACAAATTG AAAATATACC ATT ATG TAT GGA GGT AAT GCT ATG 54
Met Tyr Gly Gly Asn Ala Met 1 5
GCT GAC ACA ATC AAT ACA ACT GAA GCA ACT CAT GAA ACA AAA AAA CCA 102 Ala Asp Thr He Asn Thr Thr Glu Ala Thr His Glu Thr Lys Lys Pro 10 15 20
AAC GCT TTT GTA AAT TTT TTC AAA AAC AAT TTG ACT GAT AAG CGT TAT 150 Asn Ala Phe Val Asn Phe Phe Lys Asn Asn Leu Thr Asp Lys Arg Tyr 25 30 35
GAT TCA TTA GGT CTC ATT GGA GCA GGG GTT TTA TGT TGT GTC TTG AGC 198 Asp Ser Leu Gly Leu He Gly Ala Gly Val Leu Cys Cys Val Leu Ser 40 45 50 55
GGT GCT ATG GGG ATT GTT GGG ATA ATC TTT GTC GCA ATA GGA ATC TTT 246 Gly Ala Met Gly He Val Gly He He Phe Val Ala He Gly He Phe 60 65 70
TTG TCT TTT TCT AAT ATC AAC TTA GTG AAA TTA GTT GAA AAA TTG TCC 294 Leu Ser Phe Ser Asn He Asn Leu Val Lys Leu Val Glu Lys Leu Ser 75 80 85
AAA AAA CAA TCT AAA GTG CCA ACA ACT GTC AAT AAC GAA ACT CAA AAA 342 Lys Lys Gin Ser Lys Val Pro Thr Thr Val Asn Asn Glu Thr Gin Lys 90 95 100
TCT CAA GCA ACA AGC GTT ACC AAC GAA CCA ACT GAA GCC AAA GAG ACT 390 Ser Gin Ala Thr Ser Val Thr Asn Glu Pro Thr Glu Ala Lys Glu Thr 105 110 115
AAA GAT TGAGGCAAAA CAACGATTTT GACTGAAGAA AGAATGAGAG AAAATTTCAA AA 448
Lys Asp
120 AT 450
(2) INFORMATION FOR SEQ ID NO: 1350:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 121 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1350:
Met Tyr Gly Gly Asn Ala Met Ala Asp Thr He Asn Thr Thr Glu Ala
1 5 10 15
Thr His Glu Thr Lys Lys Pro Asn Ala Phe Val Asn Phe Phe Lys Asn
20 25 30
Asn Leu Thr Asp Lys Arg Tyr Asp Ser Leu Gly Leu He Gly Ala Gly
35 40 45
Val Leu Cys Cys Val Leu Ser Gly Ala Met Gly He Val Gly He He
50 55 60
Phe Val Ala He Gly He Phe Leu Ser Phe Ser Asn He Asn Leu Val 65 70 75 80
Lys Leu Val Glu Lys Leu Ser Lys Lys Gin Ser Lys Val Pro Thr Thr
85 90 95
Val Asn Asn Glu Thr Gin Lys Ser Gin Ala Thr Ser Val Thr Asn Glu
100 105 110
Pro Thr Glu Ala Lys Glu Thr Lys Asp 115 120
(2) INFORMATION FOR SEQ ID NO: 1351:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 504 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 69...443 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1351:
TAACCATTAG TTTCAAGCAG TATGAAAATC TTCTCCATAT CCATCAAAAA GGTTGCGACA 60 ATGAAGTG ATG TGC AGA ACG CTC ATC TCT ATC GCT TTG TTA GAA AGC TCT 110 Met Cys Arg Thr Leu He Ser He Ala Leu Leu Glu Ser Ser 1 5 10 CTA GGG TTG AAC AAC AGG CGA GAA AAA TCC CTT AAA GAC ACT TCT TAT 158 Leu Gly Leu Asn Asn Arg Arg Glu Lys Ser Leu Lys Asp Thr Ser Tyr 15 20 25 30
TCC ATG TTT CAT ATC ACC CTA AAC ACC GCT AAA AAA TTC TAC CCT ACC 206 Ser Met Phe His He Thr Leu Asn Thr Ala Lys Lys Phe Tyr Pro Thr 35 40 45
TAC TCT AAA ACG CTC CTC AAA TTC AAA TTG CTA AAC GAT GTG GGT TTT 254 Tyr Ser Lys Thr Leu Leu Lys Phe Lys Leu Leu Asn Asp Val Gly Phe 50 55 60
GCG ATC CAA TTA GCC AAA CAA ATT TTA AAA GAA AAT TTT GAT TAT TAC 302 Ala He Gin Leu Ala Lys Gin He Leu Lys Glu Asn Phe Asp Tyr Tyr 65 70 75
AAA CAA AAA CAC CCC AAC AAA AGC GTG TAT CAA TTA GTA GAA ATG GCA 350 Lys Gin Lys His Pro Asn Lys Ser Val Tyr Gin Leu Val Glu Met Ala 80 85 90
ATA GGC GCT TAC AAT GGG GGA ATG AAA CAC AAC CCT AAT GGC GCT TAC 398 He Gly Ala Tyr Asn Gly Gly Met Lys His Asn Pro Asn Gly Ala Tyr 95 100 105 110
GTG AAA AAA TTC CGT TGC ATT TAT TCT CAA GTG CGA TAT AAC GAG TAGAG 448 Val Lys Lys Phe Arg Cys He Tyr Ser Gin Val Arg Tyr Asn Glu 115 120 125
CATACTCATT TTATAAGCAA TCTTGATGAC ACACTTCTAC TATCTTATGA ATTTAT 504
(2) INFORMATION FOR SEQ ID NO: 1352:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 125 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1352:
Met Cys Arg Thr Leu He Ser He Ala Leu Leu Glu Ser Ser Leu Gly
1 5 10 15
Leu Asn Asn Arg Arg Glu Lys Ser Leu Lys Asp Thr Ser Tyr Ser Met
20 25 30
Phe His He Thr Leu Asn Thr Ala Lys Lys Phe Tyr Pro Thr Tyr Ser
35 40 45
Lys Thr Leu Leu Lys Phe Lys Leu Leu Asn Asp Val Gly Phe Ala He
50 55 60
Gin Leu Ala Lys Gin He Leu Lys Glu Asn Phe Asp Tyr Tyr Lys Gin 65 70 75 80
Lys His Pro Asn Lys Ser Val Tyr Gin Leu Val Glu Met Ala He Gly
85 90 95
Ala Tyr Asn Gly Gly Met Lys His Asn Pro Asn Gly Ala Tyr Val Lys 100 105 110
Lys Phe Arg Cys He Tyr Ser Gin Val Arg Tyr Asn Glu 115 120 125
(2) INFORMATION FOR SEQ ID NO: 1353:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2329 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 31...2274 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1353:
ATTTTATATC AAACACAGGT AGTAGGCACA ATG GAA GAC TTT TTG TAT AAC ACC 54
Met Glu Asp Phe Leu Tyr Asn Thr 1 5
TTA TAT TTC ATA GAG GAT TAT AAG TTG GTT GTT ATT TTT AGT TTC ATA 102 Leu Tyr Phe He Glu Asp Tyr Lys Leu Val Val He Phe Ser Phe He 10 15 20
GGG TTA ATA GCG TTA TTT TTT CTT TAC AAA TTC ATA AAA GCT CAA AAA 150 Gly Leu He Ala Leu Phe Phe Leu Tyr Lys Phe He Lys Ala Gin Lys 25 30 35 40
AAG GCT TTT AAA GAT AAA GCT AAC CAG CCT CAA AAG AAA AAA AGC TTT 198 Lys Ala Phe Lys Asp Lys Ala Asn Gin Pro Gin Lys Lys Lys Ser Phe 45 50 55
AAA GAA ATC ATT ATA GAT GGG CTG AAA GAA AGG GTT AAA ACC TTT GGC 246 Lys Glu He He He Asp Gly Leu Lys Glu Arg Val Lys Thr Phe Gly 60 65 70
TTT TGG TTG CAA GCT ATA CTA TTA CTA TCC TAT TCT TTT ATC ACA TCA 294 Phe Trp Leu Gin Ala He Leu Leu Leu Ser Tyr Ser Phe He Thr Ser 75 80 85
GGA TTA TTT TTC TTG ATT CTC TTA GGT AAT TTT TAT GAT GAT AAT CGA 342 Gly Leu Phe Phe Leu He Leu Leu Gly Asn Phe Tyr Asp Asp Asn Arg 90 95 100
TCG CCT GAG AGC GAT GAT GAT CTT TTT GAT ATA TGG ATC TAT GCG ATA 390 Ser Pro Glu Ser Asp Asp Asp Leu Phe Asp He Trp He Tyr Ala He 105 110 115 120 CAA GAT TTT CCT AAT TAC TAT TTT AAA GCG CTT GGT TTT AGT TCA CTC 438 Gin Asp Phe Pro Asn Tyr Tyr Phe Lys Ala Leu Gly Phe Ser Ser Leu 125 130 135
AAG ATT TAT GGG TTC AAT ATA TCC TTA GTC GTA TAT GGT TCT ATT TTA 486 Lys He Tyr Gly Phe Asn He Ser Leu Val Val Tyr Gly Ser He Leu 140 145 150
TGC TCT TAT ATC TTC ATT ACC TTT TTT GTG TGG TTC TTA AAA TAC TTA 534 Cys Ser Tyr He Phe He Thr Phe Phe Val Trp Phe Leu Lys Tyr Leu 155 160 165
ACT CGG ACT AGA GAT ATA GGA GCG AAT AAA AAA GTT GAT GAT CTC TTT 582 Thr Arg Thr Arg Asp He Gly Ala Asn Lys Lys Val Asp Asp Leu Phe 170 175 180
GGT AGC GCG AGT TGG GAA ACT GAA GAG AAA ATG ATC AAA GCC AAA CTC 630 Gly Ser Ala Ser Trp Glu Thr Glu Glu Lys Met He Lys Ala Lys Leu 185 190 195 200
ATC ACG CCC AAC AAT AAA AAA CGC GCC TTT GAC AAA CGA GAG GTG ATT 678 He Thr Pro Asn Asn Lys Lys Arg Ala Phe Asp Lys Arg Glu Val He 205 210 215
GTA GGC AGG CGT GGC TTG GGG GAT TTT ATC GCT TAC GCA GGG CAG GCG 726 Val Gly Arg Arg Gly Leu Gly Asp Phe He Ala Tyr Ala Gly Gin Ala 220 225 230
TTC ATT GGC TTG ATT GCT CCT ACT AGA AGC GGT AAG GGG GTG GGT TTC 774 Phe He Gly Leu He Ala Pro Thr Arg Ser Gly Lys Gly Val Gly Phe 235 240 245
ATC ATG CCC AAT ATG ATC AAT TAT CCT CAA AAT ATC GTT GTG TTT GAC 822 He Met Pro Asn Met He Asn Tyr Pro Gin Asn He Val Val Phe Asp 250 255 260
CCT AAA GCT GAC ACT ATG GAG ACT TGC GGA AAA ATC AGA GAA AAA CGC 870 Pro Lys Ala Asp Thr Met Glu Thr Cys Gly Lys He Arg Glu Lys Arg 265 270 275 280
TTC AAC CAA AAA GTG TTC ATC TAT GAA CCT TTC TCC TTA AAA ACA CAC 918 Phe Asn Gin Lys Val Phe He Tyr Glu Pro Phe Ser Leu Lys Thr His 285 290 295
CGA TTT AAT CCT TTC GCT TAT GTG GAT TTT GGT AAT GAT GTG GTT TTG 966 Arg Phe Asn Pro Phe Ala Tyr Val Asp Phe Gly Asn Asp Val Val Leu 300 305 310
ACC GAA GAC ATA CTC TCT CAA ATT GAC ACA CGC CTA AAA GGG CAT GGC 1014 Thr Glu Asp He Leu Ser Gin He Asp Thr Arg Leu Lys Gly His Gly 315 320 325
ATG GTG GCT AGT GGA GGG GAT TTT TCC ACT CAA ATC TTT GGA TTA GCA 1062 Met Val Ala Ser Gly Gly Asp Phe Ser Thr Gin He Phe Gly Leu Ala 330 335 340 AAG CTC GTG TTC CCT GAA AGA CCT AAT GAA AAA GAT CCT TTC TTT AGC 1110 Lys Leu Val Phe Pro Glu Arg Pro Asn Glu Lys Asp Pro Phe Phe Ser 345 350 355 360
AAT CAA GCG CGA AAT CTT TTT GTC ATC AAT TGC AAT ATT TAC AGG GAT 1158 Asn Gin Ala Arg Asn Leu Phe Val He Asn Cys Asn He Tyr Arg Asp 365 370 375
CTC ATG TGG ACT AAA AAG GGG CTT GAG TTT GTC AAA AGA AAA AAA ATC 1206 Leu Met Trp Thr Lys Lys Gly Leu Glu Phe Val Lys Arg Lys Lys He 380 385 390
ATC ATG CCT GAA ACA CCC ACG ATG TTT TTC ATA GGT TCT ATG GCA AGC 1254 He Met Pro Glu Thr Pro Thr Met Phe Phe He Gly Ser Met Ala Ser 395 400 405
GGG ATC AAC TTG ATT GAT GAA GAC ACA AAC ATG GAA AAA GTC GTG TCT 1302 Gly He Asn Leu He Asp Glu Asp Thr Asn Met Glu Lys Val Val Ser 410 415 420
TTA ATG GAA TTT TTT GGA GGT GAA GAA GAT AAG AGT GGC GAT AAT CTA 1350 Leu Met Glu Phe Phe Gly Gly Glu Glu Asp Lys Ser Gly Asp Asn Leu 425 430 435 440
AGA GTG CTT AGT CCT GCC ACT AGA AAC ATG TGG AAT AGC TTC AAG ACA 1398 Arg Val Leu Ser Pro Ala Thr Arg Asn Met Trp Asn Ser Phe Lys Thr 445 450 455
ATG GGC GGC GCT AGA GAA ACT TAT AGC TCG GTT CAA GGG GTA TAC ACT 1446 Met Gly Gly Ala Arg Glu Thr Tyr Ser Ser Val Gin Gly Val Tyr Thr 460 465 470
TCA GCC TTT GCG CCT TAT AAT AAC GCA ATG ATT AGA AAT TTC ACG AGC 1494 Ser Ala Phe Ala Pro Tyr Asn Asn Ala Met He Arg Asn Phe Thr Ser 475 480 485
GCC AAT GAT TTT GAT TTC AGG CGT TTA AGG ATC GAT GAA GTG AGT ATT 1542 Ala Asn Asp Phe Asp Phe Arg Arg Leu Arg He Asp Glu Val Ser He 490 495 500
GGT GTG ATC GCT AAT CCT AAA GAA AGC ACT ATT GTT GGA CCG ATA TTA 1590 Gly Val He Ala Asn Pro Lys Glu Ser Thr He Val Gly Pro He Leu 505 510 515 520
GAG CTG TTT TTC AAT GTG ATG ATT TAT AGC AAT TTG ATT CTG CCA ATC 1638 Glu Leu Phe Phe Asn Val Met He Tyr Ser Asn Leu He Leu Pro He 525 530 535
CAT GAT CCA CAG TGC AAA AGA AGT TGC TTG ATG CTC ATG GAC GAA TTC 1686 His Asp Pro Gin Cys Lys Arg Ser Cys Leu Met Leu Met Asp Glu Phe 540 545 550
ACT TTA TGT GGC TAT TTA GAG ACC TTT GTT AAA GCG GTA GGG ATT ATG 1734 Thr Leu Cys Gly Tyr Leu Glu Thr Phe Val Lys Ala Val Gly He Met 555 560 565 GCA GAA TAC AAC ATG CGC CCC GCT TTT GTG TTT CAA AGT AAG GCG CAA 1782 Ala Glu Tyr Asn Met Arg Pro Ala Phe Val Phe Gin Ser Lys Ala Gin 570 575 580
CTA GAG AAT GAC CCC CCA CTT GGT TAT GGT AGG AAT GGC GCT AAG ACT 1830 Leu Glu Asn Asp Pro Pro Leu Gly Tyr Gly Arg Asn Gly Ala Lys Thr 585 590 595 600
ATT TTA GAC AAC CTT TCT TTG AAT ATG TAT TAT GGG ATT AAC AAC GAT 1878 He Leu Asp Asn Leu Ser Leu Asn Met Tyr Tyr Gly He Asn Asn Asp 605 610 615
AAC TAC TAT GAA CAT TTT GAA AAA CTT TCT AAG GTA TTA GGG AAA TAC 1926 Asn Tyr Tyr Glu His Phe Glu Lys Leu Ser Lys Val Leu Gly Lys Tyr 620 625 630
ACA AGG CAA GAC GTG AGC CGA AGC ATT GAT GAT AAT ACA GGT AAG ACC 1974 Thr Arg Gin Asp Val Ser Arg Ser He Asp Asp Asn Thr Gly Lys Thr 635 640 645
AAC ACT TCT ATC AGC AAC AAA GAG CGG TTT TTG ATG ACC CCT GAT GAA 2022 Asn Thr Ser He Ser Asn Lys Glu Arg Phe Leu Met Thr Pro Asp Glu 650 655 660
TTG ATG ACT ATG GGC GAT GAG CTT ATC ATT CTA GAG AAT ACG CTC AAA 2070 Leu Met Thr Met Gly Asp Glu Leu He He Leu Glu Asn Thr Leu Lys 665 670 675 680
CCT ATC AAG TGC CAC AAG GCG CTT TAC TAT GAT GAT CCA TTC TTC ACC 2118 Pro He Lys Cys His Lys Ala Leu Tyr Tyr Asp Asp Pro Phe Phe Thr 685 690 695
GAT GAA CTC ATT AAG GTA AGT CCA AGC TTG AGC AAG AAA TAC AAA TTG 2166 Asp Glu Leu He Lys Val Ser Pro Ser Leu Ser Lys Lys Tyr Lys Leu 700 705 710
GGG AAA GTG CCT AAT CAA GCA ACT TTC TAT GAT GAT TTG CAA GCC GCT 2214 Gly Lys Val Pro Asn Gin Ala Thr Phe Tyr Asp Asp Leu Gin Ala Ala 715 720 725
AAA ACT AGA GGT GAA TTG AGT TAT GAT AAA TCT TTA GTG CCT GTG GGT 2262 Lys Thr Arg Gly Glu Leu Ser Tyr Asp Lys Ser Leu Val Pro Val Gly 730 735 740
TCA AGT GAA CTG TGATTAAGAC AAAATATCTT AACAAAAAGA AAATTAAAAG ATAAT 2319
Ser Ser Glu Leu
745
GATATAAATA 2329
(2) INFORMATION FOR SEQ ID NO: 1354:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 748 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1354:
Met Glu Asp Phe Leu Tyr Asn Thr Leu Tyr Phe He Glu Asp Tyr Lys
1 5 10 15
Leu Val Val He Phe Ser Phe He Gly Leu He Ala Leu Phe Phe Leu
20 25 30
Tyr Lys Phe He Lys Ala Gin Lys Lys Ala Phe Lys Asp Lys Ala Asn
35 40 45
Gin Pro Gin Lys Lys Lys Ser Phe Lys Glu He He He Asp Gly Leu
50 55 60
Lys Glu Arg Val Lys Thr Phe Gly Phe Trp Leu Gin Ala He Leu Leu 65 70 75 80
Leu Ser Tyr Ser Phe He Thr Ser Gly Leu Phe Phe Leu He Leu Leu
85 90 95
Gly Asn Phe Tyr Asp Asp Asn Arg Ser Pro Glu Ser Asp Asp Asp Leu
100 105 110
Phe Asp He Trp He Tyr Ala He Gin Asp Phe Pro Asn Tyr Tyr Phe
115 120 125
Lys Ala Leu Gly Phe Ser Ser Leu Lys He Tyr Gly Phe Asn He Ser
130 135 140
Leu Val Val Tyr Gly Ser He Leu Cys Ser Tyr He Phe He Thr Phe 145 150 155 160
Phe Val Trp Phe Leu Lys Tyr Leu Thr Arg Thr Arg Asp He Gly Ala
165 170 175
Asn Lys Lys Val Asp Asp Leu Phe Gly Ser Ala Ser Trp Glu Thr Glu
180 185 190
Glu Lys Met He Lys Ala Lys Leu He Thr Pro Asn Asn Lys Lys Arg
195 200 205
Ala Phe Asp Lys Arg Glu Val He Val Gly Arg Arg Gly Leu Gly Asp
210 215 220
Phe He Ala Tyr Ala Gly Gin Ala Phe He Gly Leu He Ala Pro Thr 225 230 235 240
Arg Ser Gly Lys Gly Val Gly Phe He Met Pro Asn Met He Asn Tyr
245 250 255
Pro Gin Asn He Val Val Phe Asp Pro Lys Ala Asp Thr Met Glu Thr
260 265 270
Cys Gly Lys He Arg Glu Lys Arg Phe Asn Gin Lys Val Phe He Tyr
275 280 285
Glu Pro Phe Ser Leu Lys Thr His Arg Phe Asn Pro Phe Ala Tyr Val
290 295 300
Asp Phe Gly Asn Asp Val Val Leu Thr Glu Asp He Leu Ser Gin He 305 310 315 320
Asp Thr Arg Leu Lys Gly His Gly Met Val Ala Ser Gly Gly Asp Phe
325 330 335
Ser Thr Gin He Phe Gly Leu Ala Lys Leu Val Phe Pro Glu Arg Pro
340 345 350
Asn Glu Lys Asp Pro Phe Phe Ser Asn Gin Ala Arg Asn Leu Phe Val
355 360 365
He Asn Cys Asn He Tyr Arg Asp Leu Met Trp Thr Lys Lys Gly Leu 370 375 380 Glu Phe Val Lys Arg Lys Lys He He Met Pro Glu Thr Pro Thr Met 385 390 395 400
Phe Phe He Gly Ser Met Ala Ser Gly He Asn Leu He Asp Glu Asp
405 410 415
Thr Asn Met Glu Lys Val Val Ser Leu Met Glu Phe Phe Gly Gly Glu
420 425 430
Glu Asp Lys Ser Gly Asp Asn Leu Arg Val Leu Ser Pro Ala Thr Arg
435 440 445
Asn Met Trp Asn Ser Phe Lys Thr Met Gly Gly Ala Arg Glu Thr Tyr
450 455 460
Ser Ser Val Gin Gly Val Tyr Thr Ser Ala Phe Ala Pro Tyr Asn Asn 465 470 475 480
Ala Met He Arg Asn Phe Thr Ser Ala Asn Asp Phe Asp Phe Arg Arg
485 490 495
Leu Arg He Asp Glu Val Ser He Gly Val He Ala Asn Pro Lys Glu
500 505 510
Ser Thr He Val Gly Pro He Leu Glu Leu Phe Phe Asn Val Met He
515 520 525
Tyr Ser Asn Leu He Leu Pro He His Asp Pro Gin Cys Lys Arg Ser
530 535 540
Cys Leu Met Leu Met Asp Glu Phe Thr Leu Cys Gly Tyr Leu Glu Thr 545 550 555 560
Phe Val Lys Ala Val Gly He Met Ala Glu Tyr Asn Met Arg Pro Ala
565 570 575
Phe Val Phe Gin Ser Lys Ala Gin Leu Glu Asn Asp Pro Pro Leu Gly
580 585 590
Tyr Gly Arg Asn Gly Ala Lys Thr He Leu Asp Asn Leu Ser Leu Asn
595 600 605
Met Tyr Tyr Gly He Asn Asn Asp Asn Tyr Tyr Glu His Phe Glu Lys
610 615 620
Leu Ser Lys Val Leu Gly Lys Tyr Thr Arg Gin Asp Val Ser Arg Ser 625 630 635 640
He Asp Asp Asn Thr Gly Lys Thr Asn Thr Ser He Ser Asn Lys Glu
645 650 655
Arg Phe Leu Met Thr Pro Asp Glu Leu Met Thr Met Gly Asp Glu Leu
660 665 670
He He Leu Glu Asn Thr Leu Lys Pro He Lys Cys His Lys Ala Leu
675 680 685
Tyr Tyr Asp Asp Pro Phe Phe Thr Asp Glu Leu He Lys Val Ser Pro
690 695 700
Ser Leu Ser Lys Lys Tyr Lys Leu Gly Lys Val Pro Asn Gin Ala Thr 705 710 715 720
Phe Tyr Asp Asp Leu Gin Ala Ala Lys Thr Arg Gly Glu Leu Ser Tyr
725 730 735
Asp Lys Ser Leu Val Pro Val Gly Ser Ser Glu Leu 740 745
(2) INFORMATION FOR SEQ ID NO: 1355:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1037 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear (ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...1008 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1355:
TAAATTTGGA ATAAGAAC ATG ACT GAA GAC AGA TTG AGT GCA GAA GAT AAA 51
Met Thr Glu Asp Arg Leu Ser Ala Glu Asp Lys 1 5 10
AAG TTT CTA GAA GTA GAA AGA GCT TTA AAA GAA GCG GCA TTA AAT CCT 99 Lys Phe Leu Glu Val Glu Arg Ala Leu Lys Glu Ala Ala Leu Asn Pro 15 20 25
CTA AGG CAT GCT ACT GAA GAA CTT TTT GGT GAT TTT TTA AAA ATG GAA 147 Leu Arg His Ala Thr Glu Glu Leu Phe Gly Asp Phe Leu Lys Met Glu 30 35 40
AAT ATC ACT GAG ATT TGT TAC AAT GGG AAC AAG GTT GTA TGG GTT TTA 195 Asn He Thr Glu He Cys Tyr Asn Gly Asn Lys Val Val Trp Val Leu 45 50 55
AAA AAT AAT GGC GAA TGG CAA CCA TTT GAT GTG AGA GAC AGG AAA GCC 243 Lys Asn Asn Gly Glu Trp Gin Pro Phe Asp Val Arg Asp Arg Lys Ala 60 65 70 75
TTT AGC CTG TCT CGT TTA ATG CAT TTT GCT CGG TGT TGT GCA AGT TTT 291 Phe Ser Leu Ser Arg Leu Met His Phe Ala Arg Cys Cys Ala Ser Phe 80 85 90
AAG AAA AAA ACA ATA GAC AAC TAT GAA AAT CCT ATT TTG AGC AGC AAT 339 Lys Lys Lys Thr He Asp Asn Tyr Glu Asn Pro He Leu Ser Ser Asn 95 100 105
TTA GCG AAT GGT GAA AGG GTG CAG ATT GTC CTT TCC CCT GTT ACA GTT 387 Leu Ala Asn Gly Glu Arg Val Gin He Val Leu Ser Pro Val Thr Val 110 115 120
AAT GAT GAA ACC ATT TCC ATA TCC ATA AGG ATA CCT AGC AAA ACA ACC 435 Asn Asp Glu Thr He Ser He Ser He Arg He Pro Ser Lys Thr Thr 125 130 135
TAT CCT CAT AGC TTC TTT GAA GAG CAA GGT TTT TAT AAT CTA CTA GAC 483 Tyr Pro His Ser Phe Phe Glu Glu Gin Gly Phe Tyr Asn Leu Leu Asp 140 145 150 155
AAC AAA GAA CAA GCG ATC AGC GCG ATT AAA GAT GGT ATT GCT ATT GGT 531 Asn Lys Glu Gin Ala He Ser Ala He Lys Asp Gly He Ala He Gly 160 165 170 AAA AAT GTG ATT GTT TGT GGT GGC ACA GGA AGC GGT AAA ACG ACT TAT 579 Lys Asn Val He Val Cys Gly Gly Thr Gly Ser Gly Lys Thr Thr Tyr 175 180 185
ATC AAA AGC ATC ATG GAG TTT ATC CCT AAA GAA GAA AGG ATC ATA TCC 627 He Lys Ser He Met Glu Phe He Pro Lys Glu Glu Arg He He Ser 190 195 200
ATT GAA GAC ACC GAA GAG ATT GTA TTC AAA CAC CAC AAA AAC TAC ACA 675 He Glu Asp Thr Glu Glu He Val Phe Lys His His Lys Asn Tyr Thr 205 210 215
CAG CTT TTT TTT GGT GGG AAT ATC ACC TCT GCT GAT TGC TTA AAG TCA 723 Gin Leu Phe Phe Gly Gly Asn He Thr Ser Ala Asp Cys Leu Lys Ser 220 225 230 235
TGT CTG AGA ATG CGG CCT GAT AGA ATC ATT TTA GGG GAA CTC AGA AGC 771 Cys Leu Arg Met Arg Pro Asp Arg He He Leu Gly Glu Leu Arg Ser 240 245 250
AGT GAG GCA TAC GAT TTT TAT AAT GTG CTT TGT AGC GGT CAT AAA GGC 819 Ser Glu Ala Tyr Asp Phe Tyr Asn Val Leu Cys Ser Gly His Lys Gly 255 260 265
ACA CTA ACC ACT CTG CAT GCA GGG AGC AGT GAA GAA GCG TTT ATC CGT 867 Thr Leu Thr Thr Leu His Ala Gly Ser Ser Glu Glu Ala Phe He Arg 270 275 280
TTG GCC AAC ATG AGT TCA TCT AAT AGC GCA GCA AGG AAT ATC AAG TTT 915 Leu Ala Asn Met Ser Ser Ser Asn Ser Ala Ala Arg Asn He Lys Phe 285 290 295
GAA AGT CTT ATT GAG GGC TTT AAA GAT TTG ATT GAT ATG ATT GTC CAT 963 Glu Ser Leu He Glu Gly Phe Lys Asp Leu He Asp Met He Val His 300 305 310 315
ATC AAC CAC CAC AAA CAG TGT GAT GAA TTT TAT ATC AAA CAC AGG TAGTA 1013 He Asn His His Lys Gin Cys Asp Glu Phe Tyr He Lys His Arg 320 325 330
GGCACAATGG AAGACTTTTT GTAT 1037
(2) INFORMATION FOR SEQ ID NO: 1356:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 330 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1356: Met Thr Glu Asp Arg Leu Ser Ala Glu Asp Lys Lys Phe Leu Glu Val 1 5 10 15
Glu Arg Ala Leu Lys Glu Ala Ala Leu Asn Pro Leu Arg His Ala Thr
20 25 30
Glu Glu Leu Phe Gly Asp Phe Leu Lys Met Glu Asn He Thr Glu He
35 40 45
Cys Tyr Asn Gly Asn Lys Val Val Trp Val Leu Lys Asn Asn Gly Glu
50 55 60
Trp Gin Pro Phe Asp Val Arg Asp Arg Lys Ala Phe Ser Leu Ser Arg 65 70 75 80
Leu Met His Phe Ala Arg Cys Cys Ala Ser Phe Lys Lys Lys Thr He
85 90 95
Asp Asn Tyr Glu Asn Pro He Leu Ser Ser Asn Leu Ala Asn Gly Glu
100 105 110
Arg Val Gin He Val Leu Ser Pro Val Thr Val Asn Asp Glu Thr He
115 120 125
Ser He Ser He Arg He Pro Ser Lys Thr Thr Tyr Pro His Ser Phe
130 135 140
Phe Glu Glu Gin Gly Phe Tyr Asn Leu Leu Asp Asn Lys Glu Gin Ala 145 150 155 160
He Ser Ala He Lys Asp Gly He Ala He Gly Lys Asn Val He Val
165 170 175
Cys Gly Gly Thr Gly Ser Gly Lys Thr Thr Tyr He Lys Ser He Met
180 185 190
Glu Phe He Pro Lys Glu Glu Arg He He Ser He Glu Asp Thr Glu
195 200 205
Glu He Val Phe Lys His His Lys Asn Tyr Thr Gin Leu Phe Phe Gly
210 215 220
Gly Asn He Thr Ser Ala Asp Cys Leu Lys Ser Cys Leu Arg Met Arg 225 230 235 240
Pro Asp Arg He He Leu Gly Glu Leu Arg Ser Ser Glu Ala Tyr Asp
245 250 255
Phe Tyr Asn Val Leu Cys Ser Gly His Lys Gly Thr Leu Thr Thr Leu
260 265 270
His Ala Gly Ser Ser Glu Glu Ala Phe He Arg Leu Ala Asn Met Ser
275 280 285
Ser Ser Asn Ser Ala Ala Arg Asn He Lys Phe Glu Ser Leu He Glu
290 295 300
Gly Phe Lys Asp Leu He Asp Met He Val His He Asn His His Lys 305 310 315 320
Gin Cys Asp Glu Phe Tyr He Lys His Arg 325 330
(2) INFORMATION FOR SEQ ID NO: 1357:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5334 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...5250 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1357:
TAAATAAAAA GGCGTTAAGA C ATG AAT GAA GAA AAC GAT AAA CTT GAA ACT 51
Met Asn Glu Glu Asn Asp Lys Leu Glu Thr 1 5 10
TCT AAA AAA GCC CAA CAA GAT TCA CCC CAA GAT TTA TCC AAT GAA GAA 99 Ser Lys Lys Ala Gin Gin Asp Ser Pro Gin Asp Leu Ser Asn Glu Glu 15 20 25
GCA ACA GAA GCC AAT CAT TTT GAA AAT CTT TTA AAA GAA TCC AAA GAA 147 Ala Thr Glu Ala Asn His Phe Glu Asn Leu Leu Lys Glu Ser Lys Glu 30 35 40
AGC TCA GAT CAT CAT CTT GAC AAC CCC ACA GAA ACT CAA ACC CAT TTT 195 Ser Ser Asp His His Leu Asp Asn Pro Thr Glu Thr Gin Thr His Phe 45 50 55
GAT GGA GAC AAG TCA GAA GAA ACC CAA ACT CAA ATG GAT TCT GAA GGT 243 Asp Gly Asp Lys Ser Glu Glu Thr Gin Thr Gin Met Asp Ser Glu Gly 60 65 70
AAT GAA ACT TCA GAA TCT AGC AAT GGC AGT CTA GCA GAC AAG TTA TTC 291 Asn Glu Thr Ser Glu Ser Ser Asn Gly Ser Leu Ala Asp Lys Leu Phe 75 80 85 90
AAA AAA GCC AGA AAA TTA GTT GAT AAT AAA AAA CCT TTC ACT CAG CAA 339 Lys Lys Ala Arg Lys Leu Val Asp Asn Lys Lys Pro Phe Thr Gin Gin 95 100 105
AAG AAT TTA GAT GAA GAA ACC CAA GAA CTG AAC GAA GAA GAC GAT CAA 387 Lys Asn Leu Asp Glu Glu Thr Gin Glu Leu Asn Glu Glu Asp Asp Gin 110 115 120
GAA AAT AAT GAG TAT CAA GAA GAA ACT CAA ACG GAC TTA ATT GAT GAT 435 Glu Asn Asn Glu Tyr Gin Glu Glu Thr Gin Thr Asp Leu He Asp Asp 125 130 135
GAA ACT TCT AAA AAA ACC CAA CAA CAT TCA CCC CAA GAT TTA TCC AAT 483 Glu Thr Ser Lys Lys Thr Gin Gin His Ser Pro Gin Asp Leu Ser Asn 140 145 150
GAA GAA GCA ACA GAA GCC AAT CAT TTT GAA AAT CTT TTA AAA GAA TCC 531 Glu Glu Ala Thr Glu Ala Asn His Phe Glu Asn Leu Leu Lys Glu Ser 155 160 165 170
AAA GAA AGC TCA GAT CAT CAT CTT GAC AAC CCC ACA GAA ACT CAA ACC 579 Lys Glu Ser Ser Asp His His Leu Asp Asn Pro Thr Glu Thr Gin Thr 175 180 185
AAT TTT GAT GGA GAC AAG TCA GAA GAA ACC CAA ACT CAA ATG GAT TCT 627 Asn Phe Asp Gly Asp Lys Ser Glu Glu Thr Gin Thr Gin Met Asp Ser 190 195 200
GAA GGT AAT GAA ACT TCA GAA TCT AGC AAT GGC AGT CTA GCA GAC AAG 675 Glu Gly Asn Glu Thr Ser Glu Ser Ser Asn Gly Ser Leu Ala Asp Lys 205 210 215
TTA TTC AAA AAA GCC AGA AAA TTA GTT GAT AAT AAA AAA CCT TTC ACT 723 Leu Phe Lys Lys Ala Arg Lys Leu Val Asp Asn Lys Lys Pro Phe Thr 220 225 230
CAG CAA AAG AAT TTA GAT GAA GAA ACC CAA GAA CTG AAC GAA GAA GAC 771 Gin Gin Lys Asn Leu Asp Glu Glu Thr Gin Glu Leu Asn Glu Glu Asp 235 240 245 250
GAT CAA GAA AAT AAT GAG TAT CAA GAA GAA ACT CAA ACG GAC TTA ATT 819 Asp Gin Glu Asn Asn Glu Tyr Gin Glu Glu Thr Gin Thr Asp Leu He 255 260 265
GAT GAT GAA ACT TCT AAA AAA ACC CAA CAA CAT TCA CCC CAA GAT TTA 867 Asp Asp Glu Thr Ser Lys Lys Thr Gin Gin His Ser Pro Gin Asp Leu 270 275 280
TCC AAT GAA GAA GCA ACA GAA GCC AAT CAT TTT GAA AAT CTT TTA AAA 915 Ser Asn Glu Glu Ala Thr Glu Ala Asn His Phe Glu Asn Leu Leu Lys 285 290 295
GAA TCC AAA GAA AGC TCA GAT CAT CAT CTT GAC AAC CCC ACA GAA ACT 963 Glu Ser Lys Glu Ser Ser Asp His His Leu Asp Asn Pro Thr Glu Thr 300 305 310
CAA ACC AAT TTT GAT GGA GAC AAG TCA GAA GAA ATA ACT GAC GAC TCT 1011 Gin Thr Asn Phe Asp Gly Asp Lys Ser Glu Glu He Thr Asp Asp Ser 315 320 325 330
AAC GAT CAA GAG ATT ATC AAA GGA AGC AAA AAG AAA TAT ATT ATT GGT 1059 Asn Asp Gin Glu He He Lys Gly Ser Lys Lys Lys Tyr He He Gly 335 340 345
GGC ATT GTA GTC GCT GTT CTT ATC GTG ATT ATT TTA TTT TCT AGA AGC 1107 Gly He Val Val Ala Val Leu He Val He He Leu Phe Ser Arg Ser 350 355 360
ATT TTT CAC TAC TTC ATG CCT TTG GAA GAT AAA AGC TCT CGT TTT AGC 1155 He Phe His Tyr Phe Met Pro Leu Glu Asp Lys Ser Ser Arg Phe Ser 365 370 375
AAA GAC AGG AAT CTT TAT GTC AAT GAT GAA ATC CAA ATA AGG CAA GAG 1203 Lys Asp Arg Asn Leu Tyr Val Asn Asp Glu He Gin He Arg Gin Glu 380 385 390
TAT AAC CGA TTG CTG AAA GAA CGG AAT GAA AAA GGC AAT ATG ATC GAT 1251 Tyr Asn Arg Leu Leu Lys Glu Arg Asn Glu Lys Gly Asn Met He Asp 395 400 405 410 AAG AAT CTT TTC TTC AAT GAC GAT CCC AAT AGA ACC TTA TAC AAC TAT 1299 Lys Asn Leu Phe Phe Asn Asp Asp Pro Asn Arg Thr Leu Tyr Asn Tyr 415 420 425
TTG AAT ATT GCA GAA ATT GAG GAC AAA AAC CCG TTG AGA GCC TTT TAT 1347 Leu Asn He Ala Glu He Glu Asp Lys Asn Pro Leu Arg Ala Phe Tyr 430 435 440
GAA TGT ATT AGT AAT GGT GGC AAC TAT GAA GAA TGT TTG AAG CTT ATC 1395 Glu Cys He Ser Asn Gly Gly Asn Tyr Glu Glu Cys Leu Lys Leu He 445 450 455
AAA GAC AAA AAA CTT CAA GAT CAG ATG AAA AAG ACT CTA GAG GCT TAT 1443 Lys Asp Lys Lys Leu Gin Asp Gin Met Lys Lys Thr Leu Glu Ala Tyr 460 465 470
AAC GAC TGC ATC AAA AAT GCC AAA ACT GAA GAA GAA AGG ATC AAG TGT 1491 Asn Asp Cys He Lys Asn Ala Lys Thr Glu Glu Glu Arg He Lys Cys 475 480 485 490
TTA GAT TTA ATC AAA GAT GAA AAC CTA AAA AAA AGC TTA CTG AAC CAA 1539 Leu Asp Leu He Lys Asp Glu Asn Leu Lys Lys Ser Leu Leu Asn Gin 495 500 505
CAA AAA GTT CAA GTG GCG CTA GAT TGT TTG AAA AAC GCT AAA ACC GAT 1587 Gin Lys Val Gin Val Ala Leu Asp Cys Leu Lys Asn Ala Lys Thr Asp 510 515 520
GAA GAA CGA AAC GAG TGC CTA AAA CTC ATA AAT GAC CCT GAG ATT AGA 1635 Glu Glu Arg Asn Glu Cys Leu Lys Leu He Asn Asp Pro Glu He Arg 525 530 535
GAG AAA TTC CGT AAG GAA TTA GAG CTT CAA AAA GAG CTT CAA GAG TAT 1683 Glu Lys Phe Arg Lys Glu Leu Glu Leu Gin Lys Glu Leu Gin Glu Tyr 540 545 550
AAG GAT TGT ATC AAA AAC GCC AAA ACA GAA GCT GAG AAA AAC AAA TGC 1731 Lys Asp Cys He Lys Asn Ala Lys Thr Glu Ala Glu Lys Asn Lys Cys 555 560 565 570
TTG AAA GGC TTG TCT AAA GAA GCT ATA GAG AGA TTG AAA CAG CAA GCG 1779 Leu Lys Gly Leu Ser Lys Glu Ala He Glu Arg Leu Lys Gin Gin Ala 575 580 585
CTA GAT TGT TTG AAA AAC GCT AAA ACC GAT GAA GAA CGA AAC GAG TGC 1827 Leu Asp Cys Leu Lys Asn Ala Lys Thr Asp Glu Glu Arg Asn Glu Cys 590 595 600
TTG AAA AAT ATT CCC CAA GAC TTG CAA AAA GAA CTA TTA GCT GAT ATG 1875 Leu Lys Asn He Pro Gin Asp Leu Gin Lys Glu Leu Leu Ala Asp Met 605 610 615
AGC GTC AAG GCT TAC AAG GAT TGC GTA TCA AAA GCT AGA AAT GAA AAA 1923 Ser Val Lys Ala Tyr Lys Asp Cys Val Ser Lys Ala Arg Asn Glu Lys 620 625 630 GAG AAA CAA GAA TGC GAG AAA TTG CTC ACG CCT GAA GCG AGG AAA AAG 1971 Glu Lys Gin Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Lys 635 640 645 650
TTA GAA CAA CAG GTT CTA GAT TGT TTG AAA AAC GCT AAA ACC GAT GAA 2019 Leu Glu Gin Gin Val Leu Asp Cys Leu Lys Asn Ala Lys Thr Asp Glu 655 660 665
GAA CGA AAA AAG TGT TTG AAA GAT CTC CCT AAA GAC TTA CAA AGC GAT 2067 Glu Arg Lys Lys Cys Leu Lys Asp Leu Pro Lys Asp Leu Gin Ser Asp 670 675 680
ATT CTA GCC AAA GAG AGC CTG AAA GCT TAT AAA GAC TGC GTA TCT CAA 2115 He Leu Ala Lys Glu Ser Leu Lys Ala Tyr Lys Asp Cys Val Ser Gin 685 690 695
GCC AAA ACC GAA GCT GAG AAA AAA GAA TGC GAG AAA TTA CTC ACC CCT 2163 Ala Lys Thr Glu Ala Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro 700 705 710
GAA GCG AAA AAA CTT TTA GAA GAA GAA GCC AAA GAG AGC GTT AAG GCT 2211 Glu Ala Lys Lys Leu Leu Glu Glu Glu Ala Lys Glu Ser Val Lys Ala 715 720 725 730
TAT TTG GAT TGC GTA TCT CAA GCC AAA ACC GAA GCT GAG AAA AAA GAA 2259 Tyr Leu Asp Cys Val Ser Gin Ala Lys Thr Glu Ala Glu Lys Lys Glu 735 740 745
TGC GAG AAA TTG CTC ACC CCT GAA GCG AAA AAA AAG TTA GAA GAA GCT 2307 Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys Lys Lys Leu Glu Glu Ala 750 755 760
AAA AAA AGC GTT AAA GCT TAC TTG GAT TGC GTA TCA AGA GCT AGG AAT 2355 Lys Lys Ser Val Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg Asn 765 770 775
GAA AAA GAG AAA AAA GAA TGC GAG AAA TTG CTC ACC CCT GAA GCG AAA 2403 Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys 780 785 790
AAA CTT TTA GAG CAA CAA GCA CTA GAT TGT TTG AAA AAC GCT AAA ACC 2451 Lys Leu Leu Glu Gin Gin Ala Leu Asp Cys Leu Lys Asn Ala Lys Thr 795 800 805 810
GAT AAA GAA CGA AAA AAG TGT TTG AAA GAT CTC CCT AAA GAC TTG CAG 2499 Asp Lys Glu Arg Lys Lys Cys Leu Lys Asp Leu Pro Lys Asp Leu Gin 815 820 825
AAA AAG GTT TTA GCT AAA GAA AGC GTT AAA GCT TAC TTG GAT TGC GTA 2547 Lys Lys Val Leu Ala Lys Glu Ser Val Lys Ala Tyr Leu Asp Cys Val 830 835 840
TCT CAA GCC AAA ACT GAA GCT GAG AAA AAA GAA TGC GAG AAA TTA CTC 2595 Ser Gin Ala Lys Thr Glu Ala Glu Lys Lys Glu Cys Glu Lys Leu Leu 845 850 855 ACC CCT GAA GCG AGA AAA CTT TTA GAA GAA GCT AAA AAA AGC GTT AAG 2643 Thr Pro Glu Ala Arg Lys Leu Leu Glu Glu Ala Lys Lys Ser Val Lys 860 865 870
GCT TAY TTG GAT TGC GTA TCT CAA GCC AAA ACT GAA GCT GAG AAA AAA 2691 Ala Xaa Leu Asp Cys Val Ser Gin Ala Lys Thr Glu Ala Glu Lys Lys 875 880 885 890
GAA TGC GAG AAA TTA CTC ACC CCT GAA GCG AGA AAA CTC TTA GAA GAA 2739 Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Glu 895 900 905
GCT AAA GAG AGC GTT AAA GCT TAT AAA GAC TGC GTA TCA AAA GCT AGG 2787 Ala Lys Glu Ser Val Lys Ala Tyr Lys Asp Cys Val Ser Lys Ala Arg 910 915 920
AAT GAA AAA GAG AAA AAA GAA TGC GAG AAA TTA CTC ACG CCT GAA GCG 2835 Asn Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala 925 930 935
AAA AAA CTT TTA GAG CAA CAA GTG CTA GAT TGT TTG AAA AAC GCT AAA 2883 Lys Lys Leu Leu Glu Gin Gin Val Leu Asp Cys Leu Lys Asn Ala Lys 940 945 950
ACC GAA GCT GAT AAA AAA AGG TGT GTC AAA GAT CTC CCT AAA GAC TTG 2931 Thr Glu Ala Asp Lys Lys Arg Cys Val Lys Asp Leu Pro Lys Asp Leu 955 960 965 970
CAG AAA AAG GTT TTA GCT AAA GAG AGC GTT AAG GCT TAT TTG GAC TGC 2979 Gin Lys Lys Val Leu Ala Lys Glu Ser Val Lys Ala Tyr Leu Asp Cys 975 980 985
GTA TCA AGA GCT AGG AAT GAA AAA GAG AAA AAA GAA TGC GAG AAA TTG 3027 Val Ser Arg Ala Arg Asn Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu 990 995 1000
CTC ACC CCT GAA GCG AAA AAA CTT TTA GAA GAA GCC AAA GAG AGT CTT 3075 Leu Thr Pro Glu Ala Lys Lys Leu Leu Glu Glu Ala Lys Glu Ser Leu 1005 1010 1015
AAA GCT TAT AAA GAC TGC CTC TCT CAA GCT AGA AAT GAA GAA GAA AGG 3123 Lys Ala Tyr Lys Asp Cys Leu Ser Gin Ala Arg Asn Glu Glu Glu Arg 1020 1025 1030
AGA GCT TGC GAG AAA CTA CTC ACG CCT GAA GCG AGA AAA CTC TTA GAG 3171 Arg Ala Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu 1035 1040 1045 1050
CAA GAA GTT AAG AAA AGC ATT AAG GCT TAT TTG GAC TGC GTA TCA AGA 3219 Gin Glu Val Lys Lys Ser He Lys Ala Tyr Leu Asp Cys Val Ser Arg 1055 1060 1065
GCT AGG AAT GAA AAA GAG AAA AAA GAA TGC GAG AAA TTA CTC ACG CCT 3267 Ala Arg Asn Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro 1070 1075 1080 GAA GCG AGA AAA TTT TTA GCG AAG CAA GTG CTA AAT TGT TTG GAA AAA 3315 Glu Ala Arg Lys Phe Leu Ala Lys Gin Val Leu Asn Cys Leu Glu Lys 1085 1090 1095
GCT GGA AAT GAA GAA GAA AGA AAA GCA TGT CTT AAA AAT CTC CCT AAA 3363 Ala Gly Asn Glu Glu Glu Arg Lys Ala Cys Leu Lys Asn Leu Pro Lys 1100 1105 1110
GAC TTA CAG GAA AAT ATT TTA GCT AAA GAG AGT CTT AAA GCT TAT AAA 3411 Asp Leu Gin Glu Asn He Leu Ala Lys Glu Ser Leu Lys Ala Tyr Lys 1115 1120 1125 1130
GAC TGC CTC TCT CAA GCT AGA AAT GAA GAA GAA AGG AGA GCT TGC GAG 3459 Asp Cys Leu Ser Gin Ala Arg Asn Glu Glu Glu Arg Arg Ala Cys Glu 1135 1140 1145
AAA CTA CTC ACG CCT GAA GCG AGA AAA CTC TTA GAG CAA GAA GTT AAG 3507 Lys Leu Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Gin Glu Val Lys 1150 1155 1160
AAA AGC GTT AAG GCT TAT TTG GAC TGC GTA TCA AGA GCT AGG AAT GAA 3555 Lys Ser Val Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg Asn Glu 1165 1170 1175
AAA GAG AAA AAA GAA TGC GAG AAA TTA CTC ACG CCT GAA GCG AGA AAA 3603 Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys 1180 1185 1190
TTT TTA GCG AAA GAA CTC CAA CAA AAA GAT AAA GCG ATC AAA GAT TGC 3651 Phe Leu Ala Lys Glu Leu Gin Gin Lys Asp Lys Ala He Lys Asp Cys 1195 1200 1205 1210
TTG AAA AAC GCC GAT CCT AAC GAC AGA GCG GCT ATC ATG AAG TGT TTG 3699 Leu Lys Asn Ala Asp Pro Asn Asp Arg Ala Ala He Met Lys Cys Leu 1215 1220 1225
GAT GGT TTG AGC GAT GAA GAG AAG CTC AAA TAC CTG CAA GAA GCT AGA 3747 Asp Gly Leu Ser Asp Glu Glu Lys Leu Lys Tyr Leu Gin Glu Ala Arg 1230 1235 1240
GAA AAG GCT GTT GCG GAT TGT TTG GCT ATG GCT AAA ACC GAT GAA GAA 3795 Glu Lys Ala Val Ala Asp Cys Leu Ala Met Ala Lys Thr Asp Glu Glu 1245 1250 1255
AAA AGG AAA TGC CAA AAC CTT TAT AGC GAT TTG ATC CAA GAA ATC CAA 3843 Lys Arg Lys Cys Gin Asn Leu Tyr Ser Asp Leu He Gin Glu He Gin 1260 1265 1270
AAT AAA AGG ACA CAA AAC AAA CAA AAT CAA TTG AGT AAA ACA GAA AGG 3891 Asn Lys Arg Thr Gin Asn Lys Gin Asn Gin Leu Ser Lys Thr Glu Arg 1275 1280 1285 1290
TTG CAT CAA GCA AGC GAG TGC TTG GAT AAC TTA GAT GAC CCT ACT GAT 3939 Leu His Gin Ala Ser Glu Cys Leu Asp Asn Leu Asp Asp Pro Thr Asp 1295 1300 1305 CAA GAG GCC ATA GAG CAA TGT TTA GAG GGC TTG AGC GAT AGT GAA AGG 3987 Gin Glu Ala He Glu Gin Cys Leu Glu Gly Leu Ser Asp Ser Glu Arg 1310 1315 1320
GCG CTA ATT CTA GGA ATT AAA CGA CAA GCT GAT GAA GTG GAT CTG ATT 4035 Ala Leu He Leu Gly He Lys Arg Gin Ala Asp Glu Val Asp Leu He 1325 1330 1335
TAT AGC GAT CTA AGA AAC CGT AAA ACC TTT GAT AAC ATG GCG GCT AAA 4083 Tyr Ser Asp Leu Arg Asn Arg Lys Thr Phe Asp Asn Met Ala Ala Lys 1340 1345 1350
GGT TAT CCA TTG TTA CCA ATG GAT TTC AAA AAT GGC GGC GAT ATT GCC 4131 Gly Tyr Pro Leu Leu Pro Met Asp Phe Lys Asn Gly Gly Asp He Ala 1355 1360 1365 1370
ACT ATT AAC GCC ACT AAT GTT GAT GCG GAC AAA ATA GCT AGC GAT AAT 4179 Thr He Asn Ala Thr Asn Val Asp Ala Asp Lys He Ala Ser Asp Asn 1375 1380 1385
CCT ATT TAT GCT TCC ATA GAG CCT GAT ATT GCC AAG CAA TAC GAA ACA 4227 Pro He Tyr Ala Ser He Glu Pro Asp He Ala Lys Gin Tyr Glu Thr 1390 1395 1400
GAA AAA ACC ATT AAG GAT AAG AAT TTA GAA GCT AAA TTA GCT AAG GCT 4275 Glu Lys Thr He Lys Asp Lys Asn Leu Glu Ala Lys Leu Ala Lys Ala 1405 1410 1415
TTA GGT GGC AAT AAA AAA GAT GAC GAT AAA GAA AAA AGT AAA AAA TCC 4323 Leu Gly Gly Asn Lys Lys Asp Asp Asp Lys Glu Lys Ser Lys Lys Ser 1420 1425 1430
ACA GCA GAA GCT AAA GCA GAA AAC AAT AAG ATA GAC AAA GAT GTC GCA 4371 Thr Ala Glu Ala Lys Ala Glu Asn Asn Lys He Asp Lys Asp Val Ala 1435 1440 1445 1450
GAA ACT GCC AAG AAT ATC AGT GAA ATC GCT CTT AAG AAC AAA AAA GAA 4419 Glu Thr Ala Lys Asn He Ser Glu He Ala Leu Lys Asn Lys Lys Glu 1455 1460 1465
AAG AGT GGG GAA TTT GTA GAT GAA AAT GGT AAT CCC ATT GAT GAC AAA 4467 Lys Ser Gly Glu Phe Val Asp Glu Asn Gly Asn Pro He Asp Asp Lys 1470 1475 1480
AAG AAA GCA GAA AAA CAA GAT GAA ACA AGC CCT GTC AAA CAG GCC TTT 4515 Lys Lys Ala Glu Lys Gin Asp Glu Thr Ser Pro Val Lys Gin Ala Phe 1485 1490 1495
ATA GGC AAG AGT GAT CCC ACA TTT GTT TTA GCG CAA TAC ACC CCC ATT 4563 He Gly Lys Ser Asp Pro Thr Phe Val Leu Ala Gin Tyr Thr Pro He 1500 1505 1510
GAA ATC ACT CTG ACT TCT AAA GTA GAT GCC ACT CTC ACA GGT ATA GTG 4611 Glu He Thr Leu Thr Ser Lys Val Asp Ala Thr Leu Thr Gly He Val 1515 1520 1525 1530 AGT GGG GTT GTA GCC AAA GAT GTA TGG AAC ATG AAC GGC ACT ATG ATC 4659 Ser Gly Val Val Ala Lys Asp Val Trp Asn Met Asn Gly Thr Met He 1535 1540 1545
TTA TTA GAC AAA GGC ACT AAG GTG TAT GGG AAT TAT CAA AGC GTG AAA 4707 Leu Leu Asp Lys Gly Thr Lys Val Tyr Gly Asn Tyr Gin Ser Val Lys 1550 1555 1560
GGT GGC ACA CCC ATT ATG ACA CGC TTA ATG ATA GTC TTT ACT AAA GCC 4755 Gly Gly Thr Pro He Met Thr Arg Leu Met He Val Phe Thr Lys Ala 1565 1570 1575
ATT ACG CCT GAT GGT GTG ATA ATA CCT CTA GCA AAC GCT CAA GCA GCA 4803 He Thr Pro Asp Gly Val He He Pro Leu Ala Asn Ala Gin Ala Ala 1580 1585 1590
GGC ATG TTG GGT GAA GCA GGG GTA GAT GGC TAT GTG AAT AAT CAC TTT 4851 Gly Met Leu Gly Glu Ala Gly Val Asp Gly Tyr Val Asn Asn His Phe 1595 1600 1605 1610
ATG AAG CGC ATA GGC TTT GCT GTG ATA GCA AGC GTG GTT AAT AGC TTC 4899 Met Lys Arg He Gly Phe Ala Val He Ala Ser Val Val Asn Ser Phe 1615 1620 1625
TTG CAA ACT GCG CCT ATC ATA GCT CTA GAT AAA CTC ATA GGC CTT GGC 4947 Leu Gin Thr Ala Pro He He Ala Leu Asp Lys Leu He Gly Leu Gly 1630 1635 1640
AAA GGT AGA AGT GAA AGG ACA CCT GAA TTT AAT TAC GCT TTG GGT CAA 4995 Lys Gly Arg Ser Glu Arg Thr Pro Glu Phe Asn Tyr Ala Leu Gly Gin 1645 1650 1655
GCT ATC AAT GGT AGC ATG CAA AGT TCA GCT CAG ATG TCT AAT CAA ATT 5043 Ala He Asn Gly Ser Met Gin Ser Ser Ala Gin Met Ser Asn Gin He 1660 1665 1670
CTA GGG CAA CTG ATG AAT ATC CCC CCA AGT TTT TAC AAA AAC GAG GGC 5091 Leu Gly Gin Leu Met Asn He Pro Pro Ser Phe Tyr Lys Asn Glu Gly 1675 1680 1685 1690
GAT AGT ATT AAG ATT CTC ACA ATG GAC GAT ATT GAT TTT AGC GGT GTG 5139 Asp Ser He Lys He Leu Thr Met Asp Asp He Asp Phe Ser Gly Val 1695 1700 1705
TAT GAT GTT AAA ATT ACT AAC AAA TCT GTG GTA GAT GAA ATT ATC AAA 5187 Tyr Asp Val Lys He Thr Asn Lys Ser Val Val Asp Glu He He Lys 1710 1715 1720
CAA AGC ACC AAA ACT TTG TCT AGA GAA CAT GAA GAA ATC ACC ACA AGC 5235 Gin Ser Thr Lys Thr Leu Ser Arg Glu His Glu Glu He Thr Thr Ser 1725 1730 1735
CCC AAA GGT GGC AAT TAATTCAAGA GAAAGGATAA AATATATTCA TGTTACTAAA C 5291 Pro Lys Gly Gly Asn 1740 TCGGTTCTTT ACAAAATAAA AGACAAAACC AACAACAGGC TCT 5334
(2) INFORMATION FOR SEQ ID NO: 1358:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1743 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1358:
Met Asn Glu Glu Asn Asp Lys Leu Glu Thr Ser Lys Lys Ala Gin Gin
1 5 10 15
Asp Ser Pro Gin Asp Leu Ser Asn Glu Glu Ala Thr Glu Ala Asn His
20 25 30
Phe Glu Asn Leu Leu Lys Glu Ser Lys Glu Ser Ser Asp His His Leu
35 40 45
Asp Asn Pro Thr Glu Thr Gin Thr His Phe Asp Gly Asp Lys Ser Glu
50 55 60
Glu Thr Gin Thr Gin Met Asp Ser Glu Gly Asn Glu Thr Ser Glu Ser 65 70 75 80
Ser Asn Gly Ser Leu Ala Asp Lys Leu Phe Lys Lys Ala Arg Lys Leu
85 90 95
Val Asp Asn Lys Lys Pro Phe Thr Gin Gin Lys Asn Leu Asp Glu Glu
100 105 110
Thr Gin Glu Leu Asn Glu Glu Asp Asp Gin Glu Asn Asn Glu Tyr Gin
115 120 125
Glu Glu Thr Gin Thr Asp Leu He Asp Asp Glu Thr Ser Lys Lys Thr
130 135 140
Gin Gin His Ser Pro Gin Asp Leu Ser Asn Glu Glu Ala Thr Glu Ala 145 150 155 160
Asn His Phe Glu Asn Leu Leu Lys Glu Ser Lys Glu Ser Ser Asp His
165 170 175
His Leu Asp Asn Pro Thr Glu Thr Gin Thr Asn Phe Asp Gly Asp Lys
180 185 190
Ser Glu Glu Thr Gin Thr Gin Met Asp Ser Glu Gly Asn Glu Thr Ser
195 200 205
Glu Ser Ser Asn Gly Ser Leu Ala Asp Lys Leu Phe Lys Lys Ala Arg
210 215 220
Lys Leu Val Asp Asn Lys Lys Pro Phe Thr Gin Gin Lys Asn Leu Asp 225 230 235 240
Glu Glu Thr Gin Glu Leu Asn Glu Glu Asp Asp Gin Glu Asn Asn Glu
245 250 255
Tyr Gin Glu Glu Thr Gin Thr Asp Leu He Asp Asp Glu Thr Ser Lys
260 265 270
Lys Thr Gin Gin His Ser Pro Gin Asp Leu Ser Asn Glu Glu Ala Thr
275 280 285
Glu Ala Asn His Phe Glu Asn Leu Leu Lys Glu Ser Lys Glu Ser Ser
290 295 300
Asp His His Leu Asp Asn Pro Thr Glu Thr Gin Thr Asn Phe Asp Gly 305 310 315 320
Asp Lys Ser Glu Glu He Thr Asp Asp Ser Asn Asp Gin Glu He He 325 330 335
Lys Gly Ser Lys Lys Lys Tyr He He Gly Gly He Val Val Ala Val
340 345 350
Leu He Val He He Leu Phe Ser Arg Ser He Phe His Tyr Phe Met
355 360 365
Pro Leu Glu Asp Lys Ser Ser Arg Phe Ser Lys Asp Arg Asn Leu Tyr
370 375 380
Val Asn Asp Glu He Gin He Arg Gin Glu Tyr Asn Arg Leu Leu Lys 385 390 395 400
Glu Arg Asn Glu Lys Gly Asn Met He Asp Lys Asn Leu Phe Phe Asn
405 410 415
Asp Asp Pro Asn Arg Thr Leu Tyr Asn Tyr Leu Asn He Ala Glu He
420 425 430
Glu Asp Lys Asn Pro Leu Arg Ala Phe Tyr Glu Cys He Ser Asn Gly
435 440 445
Gly Asn Tyr Glu Glu Cys Leu Lys Leu He Lys Asp Lys Lys Leu Gin
450 455 460
Asp Gin Met Lys Lys Thr Leu Glu Ala Tyr Asn Asp Cys He Lys Asn 465 470 475 480
Ala Lys Thr Glu Glu Glu Arg He Lys Cys Leu Asp Leu He Lys Asp
485 490 495
Glu Asn Leu Lys Lys Ser Leu Leu Asn Gin Gin Lys Val Gin Val Ala
500 505 510
Leu Asp Cys Leu Lys Asn Ala Lys Thr Asp Glu Glu Arg Asn Glu Cys
515 520 525
Leu Lys Leu He Asn Asp Pro Glu lie Arg Glu Lys Phe Arg Lys Glu
530 535 540
Leu Glu Leu Gin Lys Glu Leu Gin Glu Tyr Lys Asp Cys He Lys Asn 545 550 555 560
Ala Lys Thr Glu Ala Glu Lys Asn Lys Cys Leu Lys Gly Leu Ser Lys
565 570 575
Glu Ala He Glu Arg Leu Lys Gin Gin Ala Leu Asp Cys Leu Lys Asn
580 585 590
Ala Lys Thr Asp Glu Glu Arg Asn Glu Cys Leu Lys Asn He Pro Gin
595 600 605
Asp Leu Gin Lys Glu Leu Leu Ala Asp Met Ser Val Lys Ala Tyr Lys
610 615 620
Asp Cys Val Ser Lys Ala Arg Asn Glu Lys Glu Lys Gin Glu Cys Glu 625 630 635 640
Lys Leu Leu Thr Pro Glu Ala Arg Lys Lys Leu Glu Gin Gin Val Leu
645 650 655
Asp Cys Leu Lys Asn Ala Lys Thr Asp Glu Glu Arg Lys Lys Cys Leu
660 665 670
Lys Asp Leu Pro Lys Asp Leu Gin Ser Asp He Leu Ala Lys Glu Ser
675 680 685
Leu Lys Ala Tyr Lys Asp Cys Val Ser Gin Ala Lys Thr Glu Ala Glu
690 695 700
Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys Lys Leu Leu 705 710 715 720
Glu Glu Glu Ala Lys Glu Ser Val Lys Ala Tyr Leu Asp Cys Val Ser
725 730 735
Gin Ala Lys Thr Glu Ala Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr
740 745 750
Pro Glu Ala Lys Lys Lys Leu Glu Glu Ala Lys Lys Ser Val Lys Ala 755 760 765 Tyr Leu Asp Cys Val Ser Arg Ala Arg Asn Glu Lys Glu Lys Lys Glu
770 775 780
Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys Lys Leu Leu Glu Gin Gin 785 790 795 800
Ala Leu Asp Cys Leu Lys Asn Ala Lys Thr Asp Lys Glu Arg Lys Lys
805 810 815
Cys Leu Lys Asp Leu Pro Lys Asp Leu Gin Lys Lys Val Leu Ala Lys
820 825 830
Glu Ser Val Lys Ala Tyr Leu Asp Cys Val Ser Gin Ala Lys Thr Glu
835 840 845
Ala Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys
850 855 860
Leu Leu Glu Glu Ala Lys Lys Ser Val Lys Ala Xaa Leu Asp Cys Val 865 870 875 880
Ser Gin Ala Lys Thr Glu Ala Glu Lys Lys Glu Cys Glu Lys Leu Leu
885 890 895
Thr Pro Glu Ala Arg Lys Leu Leu Glu Glu Ala Lys Glu Ser Val Lys
900 905 910
Ala Tyr Lys Asp Cys Val Ser Lys Ala Arg Asn Glu Lys Glu Lys Lys
915 920 925
Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys Lys Leu Leu Glu Gin
930 935 940
Gin Val Leu Asp Cys Leu Lys Asn Ala Lys Thr Glu Ala Asp Lys Lys 945 950 955 960
Arg Cys Val Lys Asp Leu Pro Lys Asp Leu Gin Lys Lys Val Leu Ala
965 970 975
Lys Glu Ser Val Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg Asn
980 985 990
Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys
995 1000 1005
Lys Leu Leu Glu Glu Ala Lys Glu Ser Leu Lys Ala Tyr Lys Asp Cys
1010 1015 1020
Leu Ser Gin Ala Arg Asn Glu Glu Glu Arg Arg Ala Cys Glu Lys Leu 025 1030 1035 1040
Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Gin Glu Val Lys Lys Ser
1045 1050 1055
He Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg Asn Glu Lys Glu
1060 1065 1070
Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Phe Leu
1075 1080 1085
Ala Lys Gin Val Leu Asn Cys Leu Glu Lys Ala Gly Asn Glu Glu Glu
1090 1095 1100
Arg Lys Ala Cys Leu Lys Asn Leu Pro Lys Asp Leu Gin Glu Asn He 105 1110 1115 1120
Leu Ala Lys Glu Ser Leu Lys Ala Tyr Lys Asp Cys Leu Ser Gin Ala
1125 1130 1135
Arg Asn Glu Glu Glu Arg Arg Ala Cys Glu Lys Leu Leu Thr Pro Glu
1140 1145 1150
Ala Arg Lys Leu Leu Glu Gin Glu Val Lys Lys Ser Val Lys Ala Tyr
1155 1160 1165
Leu Asp Cys Val Ser Arg Ala Arg Asn Glu Lys Glu Lys Lys Glu Cys
1170 1175 1180
Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Phe Leu Ala Lys Glu Leu 185 1190 1195 1200
Gin Gin Lys Asp Lys Ala He Lys Asp Cys Leu Lys Asn Ala Asp Pro 1205 1210 1215
Asn Asp Arg Ala Ala He Met Lys Cys Leu Asp Gly Leu Ser Asp Glu
1220 1225 1230
Glu Lys Leu Lys Tyr Leu Gin Glu Ala Arg Glu Lys Ala Val Ala Asp
1235 1240 1245
Cys Leu Ala Met Ala Lys Thr Asp Glu Glu Lys Arg Lys Cys Gin Asn
1250 1255 1260
Leu Tyr Ser Asp Leu He Gin Glu He Gin Asn Lys Arg Thr Gin Asn 265 1270 1275 1280
Lys Gin Asn Gin Leu Ser Lys Thr Glu Arg Leu His Gin Ala Ser Glu
1285 1290 1295
Cys Leu Asp Asn Leu Asp Asp Pro Thr Asp Gin Glu Ala He Glu Gin
1300 1305 1310
Cys Leu Glu Gly Leu Ser Asp Ser Glu Arg Ala Leu He Leu Gly He
1315 1320 1325
Lys Arg Gin Ala Asp Glu Val Asp Leu He Tyr Ser Asp Leu Arg Asn
1330 1335 1340
Arg Lys Thr Phe Asp Asn Met Ala Ala Lys Gly Tyr Pro Leu Leu Pro 345 1350 1355 1360
Met Asp Phe Lys Asn Gly Gly Asp He Ala Thr He Asn Ala Thr Asn
1365 1370 1375
Val Asp Ala Asp Lys He Ala Ser Asp Asn Pro He Tyr Ala Ser He
1380 1385 1390
Glu Pro Asp He Ala Lys Gin Tyr Glu Thr Glu Lys Thr He Lys Asp
1395 1400 1405
Lys Asn Leu Glu Ala Lys Leu Ala Lys Ala Leu Gly Gly Asn Lys Lys
1410 1415 1420
Asp Asp Asp Lys Glu Lys Ser Lys Lys Ser Thr Ala Glu Ala Lys Ala 425 1430 1435 1440
Glu Asn Asn Lys He Asp Lys Asp Val Ala Glu Thr Ala Lys Asn He
1445 1450 1455
Ser Glu He Ala Leu Lys Asn Lys Lys Glu Lys Ser Gly Glu Phe Val
1460 1465 1470
Asp Glu Asn Gly Asn Pro He Asp Asp Lys Lys Lys Ala Glu Lys Gin
1475 1480 1485
Asp Glu Thr Ser Pro Val Lys Gin Ala Phe He Gly Lys Ser Asp Pro
1490 1495 1500
Thr Phe Val Leu Ala Gin Tyr Thr Pro He Glu He Thr Leu Thr Ser 505 1510 1515 1520
Lys Val Asp Ala Thr Leu Thr Gly He Val Ser Gly Val Val Ala Lys
1525 1530 1535
Asp Val Trp Asn Met Asn Gly Thr Met He Leu Leu Asp Lys Gly Thr
1540 1545 1550
Lys Val Tyr Gly Asn Tyr Gin Ser Val Lys Gly Gly Thr Pro He Met
1555 1560 1565
Thr Arg Leu Met He Val Phe Thr Lys Ala He Thr Pro Asp Gly Val
1570 1575 1580
He He Pro Leu Ala Asn Ala Gin Ala Ala Gly Met Leu Gly Glu Ala 585 1590 1595 1600
Gly Val Asp Gly Tyr Val Asn Asn His Phe Met Lys Arg He Gly Phe
1605 1610 1615
Ala Val He Ala Ser Val Val Asn Ser Phe Leu Gin Thr Ala Pro He
1620 1625 1630
He Ala Leu Asp Lys Leu He Gly Leu Gly Lys Gly Arg Ser Glu Arg 1635 1640 1645 Thr Pro Glu Phe Asn Tyr Ala Leu Gly Gin Ala He Asn Gly Ser Met
1650 1655 1660
Gin Ser Ser Ala Gin Met Ser Asn Gin He Leu Gly Gin Leu Met Asn 665 1670 1675 1680
He Pro Pro Ser Phe Tyr Lys Asn Glu Gly Asp Ser He Lys He Leu
1685 1690 1695
Thr Met Asp Asp He Asp Phe Ser Gly Val Tyr Asp Val Lys He Thr
1700 1705 1710
Asn Lys Ser Val Val Asp Glu He He Lys Gin Ser Thr Lys Thr Leu
1715 1720 1725
Ser Arg Glu His Glu Glu He Thr Thr Ser Pro Lys Gly Gly Asn 1730 1735 1740
(2) INFORMATION FOR SEQ ID NO: 1359:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 877 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...825 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1359:
AACAAATAAA GGAGTATTAA A ATG AAA CAA AGT TTG CGC GAA CAA AAA TTA 51
Met Lys Gin Ser Leu Arg Glu Gin Lys Leu 1 5 10
TTG AAA ATT TTA GAA AAT GAT GTC TTG ACG ATT TTG GAT AGT TTT TCT 99 Leu Lys He Leu Glu Asn Asp Val Leu Thr He Leu Asp Ser Phe Ser 15 20 25
AAT TAT CTT TTT GAA CTG AGA GAA GAG TTG GAC TTC ATA GAA GAA GAA 147 Asn Tyr Leu Phe Glu Leu Arg Glu Glu Leu Asp Phe He Glu Glu Glu 30 35 40
ATG GAA GGT GAA ATC ACC GAA CAA AAC CTT ACC GCT CTT TAT GAT TTT 195 Met Glu Gly Glu He Thr Glu Gin Asn Leu Thr Ala Leu Tyr Asp Phe 45 50 55
TCT AAT TTC TTA GAA GAC CAT GTC AAT GTA TTT TAT GAG AAT GTT TTG 243 Ser Asn Phe Leu Glu Asp His Val Asn Val Phe Tyr Glu Asn Val Leu 60 65 70
AAT ATA GAT GAT GTC AAA ACA GAA CAC CTT TAT TCA GGT CTC ATA GAT 291 Asn He Asp Asp Val Lys Thr Glu His Leu Tyr Ser Gly Leu He Asp 75 80 85 90 AGT CTT AAC GCT AAT CTT CAC TTT GTC AAG TCA TTT CTC AGT AAT CAG 339 Ser Leu Asn Ala Asn Leu His Phe Val Lys Ser Phe Leu Ser Asn Gin 95 100 105
GAT TTA GAC TTC CGC TTT TTT AAG GAA ATA AAC GAT GGG CAA GAT CCC 387 Asp Leu Asp Phe Arg Phe Phe Lys Glu He Asn Asp Gly Gin Asp Pro 110 115 120
CAA AAA ACA TTA TCA AGA TTA ATT CCT CTT CAA AGT GGG AAA AAT GAT 435 Gin Lys Thr Leu Ser Arg Leu He Pro Leu Gin Ser Gly Lys Asn Asp 125 130 135
GCA AGC TCG TTT AAA GCC AAT AAT TCT TTT GTT TCA TTA GTT TAT GTT 483 Ala Ser Ser Phe Lys Ala Asn Asn Ser Phe Val Ser Leu Val Tyr Val 140 145 150
TAT GTT TAC TTC ATG CTA GAA ACT ATC ATG CAG TCG TAT AGG ATT CTC 531 Tyr Val Tyr Phe Met Leu Glu Thr He Met Gin Ser Tyr Arg He Leu 155 160 165 170
AGA TTG CTA GAA AAA CCT ATC AAT AAC AAC ATA AGC GAG GAC ATG CAG 579 Arg Leu Leu Glu Lys Pro He Asn Asn Asn He Ser Glu Asp Met Gin 175 180 185
AAC GAT ATA GAG AAT TTT TTT GTT CAA GCG AAT TTT TTA GAA TAC TAT 627 Asn Asp He Glu Asn Phe Phe Val Gin Ala Asn Phe Leu Glu Tyr Tyr 190 195 200
GTT CAG AAC AAA ATA TAC CCA ACC AAT CAT GCC TAT GAC TTC ACG CAT 675 Val Gin Asn Lys He Tyr Pro Thr Asn His Ala Tyr Asp Phe Thr His 205 210 215
TTG ATC ATG GAC TCC ATT ATT CCT AAT TGG ATT CAA ACT GAT ATG AGC 723 Leu He Met Asp Ser He He Pro Asn Trp He Gin Thr Asp Met Ser 220 225 230
GTT GAA GCT AAA AAG AAA GAG CTT TTT GAA AAA TAT TTT CAA AAC ATT 771 Val Glu Ala Lys Lys Lys Glu Leu Phe Glu Lys Tyr Phe Gin Asn He 235 240 245 250
GAT GAA GTA ACA AAC AAA ATG CTC GAT CAA AAA ANT CAA AAC AAA AGT 819 Asp Glu Val Thr Asn Lys Met Leu Asp Gin Lys Xaa Gin Asn Lys Ser 255 260 265
AAC GAT TGAGTGGCGT TAATGCGCTA GAATAGTGCT AAAAATAAGA ATAAAGGAGT CA 877 Asn Asp
877 (2) INFORMATION FOR SEQ ID NO: 1360:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 268 amino acids
(B) TYPE: amino acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1360:
Met Lys Gin Ser Leu Arg Glu Gin Lys Leu Leu Lys He Leu Glu Asn
1 5 10 15
Asp Val Leu Thr He Leu Asp Ser Phe Ser Asn Tyr Leu Phe Glu Leu
20 25 30
Arg Glu Glu Leu Asp Phe He Glu Glu Glu Met Glu Gly Glu He Thr
35 40 45
Glu Gin Asn Leu Thr Ala Leu Tyr Asp Phe Ser Asn Phe Leu Glu Asp
50 55 60
His Val Asn Val Phe Tyr Glu Asn Val Leu Asn He Asp Asp Val Lys 65 70 75 80
Thr Glu His Leu Tyr Ser Gly Leu He Asp Ser Leu Asn Ala Asn Leu
85 90 95
His Phe Val Lys Ser Phe Leu Ser Asn Gin Asp Leu Asp Phe Arg Phe
100 105 110
Phe Lys Glu He Asn Asp Gly Gin Asp Pro Gin Lys Thr Leu Ser Arg
115 120 125
Leu He Pro Leu Gin Ser Gly Lys Asn Asp Ala Ser Ser Phe Lys Ala
130 135 140
Asn Asn Ser Phe Val Ser Leu Val Tyr Val Tyr Val Tyr Phe Met Leu 145 150 155 160
Glu Thr He Met Gin Ser Tyr Arg He Leu Arg Leu Leu Glu Lys Pro
165 170 175
He Asn Asn Asn He Ser Glu Asp Met Gin Asn Asp He Glu Asn Phe
180 185 190
Phe Val Gin Ala Asn Phe Leu Glu Tyr Tyr Val Gin Asn Lys He Tyr
195 200 205
Pro Thr Asn His Ala Tyr Asp Phe Thr His Leu He Met Asp Ser He
210 215 220
He Pro Asn Trp He Gin Thr Asp Met Ser Val Glu Ala Lys Lys Lys 225 230 235 240
Glu Leu Phe Glu Lys Tyr Phe Gin Asn He Asp Glu Val Thr Asn Lys
245 250 255
Met Leu Asp Gin Lys Xaa Gin Asn Lys Ser Asn Asp 260 265
(2) INFORMATION FOR SEQ ID NO: 1361:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 736 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 26...706 (D) OTHER INFORMATION:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1361:
AAAAAATCAA TAAAAGGGGT TTAGC ATG CAA GCA GTA ATT TAT GGC AAG CAA 52
Met Gin Ala Val He Tyr Gly Lys Gin 1 5
GTG ATT ATG CAC CTT CTA AAC TCT CAT CAA GAA AAA TTG CAA GAA ATC 100 Val He Met His Leu Leu Asn Ser His Gin Glu Lys Leu Gin Glu He 10 15 20 25
TAT CTT TCT AAA GAA ATA GAC AAG AAA CTT TTT TTC GCG CTC AAA AAA 148 Tyr Leu Ser Lys Glu He Asp Lys Lys Leu Phe Phe Ala Leu Lys Lys 30 35 40
GCA TGC CCT AAT ATC ATC AAA GTG GAT AAT AAA AAA GCG CAA AGC TTG 196 Ala Cys Pro Asn He He Lys Val Asp Asn Lys Lys Ala Gin Ser Leu 45 50 55
GCT AAG GGG GGG AAT CAT CAA GGG GTT TTG GCT AAG GTG GAA CTG CCC 244 Ala Lys Gly Gly Asn His Gin Gly Val Leu Ala Lys Val Glu Leu Pro 60 65 70
TTA GCG GTT TCT TTA AAA GAG GTT AAA AAA GCT CAA AAA CTT TTG GTG 292 Leu Ala Val Ser Leu Lys Glu Val Lys Lys Ala Gin Lys Leu Leu Val 75 80 85
CTT TGC GGG ATT ACG GAT GTG GGG AAT ATT GGA GGT ATT TTT AGG AGC 340 Leu Cys Gly He Thr Asp Val Gly Asn He Gly Gly He Phe Arg Ser 90 95 100 105
GCG TAT TGC TTA GGA ATG GGT GGC GTT ATT TTA GAT TTT GCT AAA GAA 388 Ala Tyr Cys Leu Gly Met Gly Gly Val He Leu Asp Phe Ala Lys Glu 110 115 120
TTG GCT TAT GAG GGG ATT GTG CGA TCC AGC TTG GGG CTT ATG TAT GAT 436 Leu Ala Tyr Glu Gly He Val Arg Ser Ser Leu Gly Leu Met Tyr Asp 125 130 135
TTG CCT TTT AGC GTT ATG CCT AAC ACG CTG GAT TTA ATC AAT GAA TTG 484 Leu Pro Phe Ser Val Met Pro Asn Thr Leu Asp Leu He Asn Glu Leu 140 145 150
AAA ACG AGC GGG TTT TTA TGT TTG GGC GCG AGC ATG CAA GGC TCT AGT 532 Lys Thr Ser Gly Phe Leu Cys Leu Gly Ala Ser Met Gin Gly Ser Ser 155 160 165
CAA ATA GAA AAT CTA TCC TTA AAA AAA TGC GCT CTT TTT TTG GGG AGC 580 Gin He Glu Asn Leu Ser Leu Lys Lys Cys Ala Leu Phe Leu Gly Ser 170 175 180 185
GAG CAT GAG GGG TTG TCT AAA AAA ATC CTT GCT AAA ATG GAT ACT ATA 628 Glu His Glu Gly Leu Ser Lys Lys He Leu Ala Lys Met Asp Thr He 190 195 200
TTG AGC GTA AAA ATG CGA AGA GAT TTT GAT TCG CTC AAT GTG AGC GTG 676 Leu Ser Val Lys Met Arg Arg Asp Phe Asp Ser Leu Asn Val Ser Val 205 210 215
GCA GCA GGG ATC TTA ATG GAT AAA ATC AAC TAGGTGGTCA ATTGAATGGA ACA 729 Ala Ala Gly He Leu Met Asp Lys He Asn 220 225
GAATAAA 736
(2) INFORMATION FOR SEQ ID NO: 1362:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 227 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1362:
Met Gin Ala Val He Tyr Gly Lys Gin Val He Met His Leu Leu Asn
1 5 10 15
Ser His Gin Glu Lys Leu Gin Glu He Tyr Leu Ser Lys Glu He Asp
20 25 30
Lys Lys Leu Phe Phe Ala Leu Lys Lys Ala Cys Pro Asn He He Lys
35 40 45
Val Asp Asn Lys Lys Ala Gin Ser Leu Ala Lys Gly Gly Asn His Gin
50 55 60
Gly Val Leu Ala Lys Val Glu Leu Pro Leu Ala Val Ser Leu Lys Glu 65 70 75 80
Val Lys Lys Ala Gin Lys Leu Leu Val Leu Cys Gly He Thr Asp Val
85 90 95
Gly Asn He Gly Gly He Phe Arg Ser Ala Tyr Cys Leu Gly Met Gly
100 105 110
Gly Val He Leu Asp Phe Ala Lys Glu Leu Ala Tyr Glu Gly He Val
115 120 125
Arg Ser Ser Leu Gly Leu Met Tyr Asp Leu Pro Phe Ser Val Met Pro
130 135 140
Asn Thr Leu Asp Leu He Asn Glu Leu Lys Thr Ser Gly Phe Leu Cys 145 150 155 160
Leu Gly Ala Ser Met Gin Gly Ser Ser Gin He Glu Asn Leu Ser Leu
165 170 175
Lys Lys Cys Ala Leu Phe Leu Gly Ser Glu His Glu Gly Leu Ser Lys
180 185 190
Lys He Leu Ala Lys Met Asp Thr He Leu Ser Val Lys Met Arg Arg
195 200 205
Asp Phe Asp Ser Leu Asn Val Ser Val Ala Ala Gly He Leu Met Asp
210 215 220
Lys He Asn 225 (2) INFORMATION FOR SEQ ID NO: 1363-
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 344 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
( i) MOLECULE TYPE- Genomic DNA (ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 30...290 (D) OTHER INFORMATION.
(xi) SEQUENCE DESCRIPTION- SEQ ID NO : 1363-
AACCATGCCT TTTCATCGTC TCTATCAAA ATG TTA GGG TCT AAA ACA TAT TCC 53
Met Leu Gly Ser Lys Thr Tyr Ser 1 5
GTT TTA AGA TCG TAT GAA AAA ACA TTC TCG CCT GAA GAG CTT TGC ATT 101 Val Leu Arg Ser Tyr Glu Lys Thr Phe Ser Pro Glu Glu Leu Cys He 10 15 20
TTA ATG GGC AAA ACA TAC GAA TAC CCC ATC ATG CTT AAA GAA TTA TTG 149 Leu Met Gly Lys Thr Tyr Glu Tyr Pro He Met Leu Lys Glu Leu Leu 25 30 35 40
ATG CTT TTG GCA AAC GCT AGG GGA TTG CTT GAA GCC TTG AAA GTG ATT 197 Met Leu Leu Ala Asn Ala Arg Gly Leu Leu Glu Ala Leu Lys Val He 45 50 55
TTC AAC ATG CTT GGC TTG TCA AAA TTA AAA GAC AAA AGC CCG TTT TCT 245 Phe Asn Met Leu Gly Leu Ser Lys Leu Lys Asp Lys Ser Pro Phe Ser 60 65 70
TTG AGA GTG TTG AGC AGT TTC AAG GAA TCC AAA CGC CCC ATT ACA TAGAA 295 Leu Arg Val Leu Ser Ser Phe Lys Glu Ser Lys Arg Pro He Thr 75 80 85
AGCCTTACGA TTTTTAAACA AACGCTCTAA AAAAAGCTTG TTCGTATGA 344
(2) INFORMATION FOR SEQ ID NO: 1364:
(l) SEQUENCE CHARACTERISTICS.
(A) LENGTH: 87 ammo acids
Figure imgf002019_0001
(C) STRANDEDNESS: single
(D) TOPOLOGY- linear
(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1364:
Met Leu Gly Ser Lys Thr Tyr Ser Val Leu Arg Ser Tyr Glu Lys Thr
1 5 10 15
Phe Ser Pro Glu Glu Leu Cys He Leu Met Gly Lys Thr Tyr Glu Tyr
20 25 30
Pro He Met Leu Lys Glu Leu Leu Met Leu Leu Ala Asn Ala Arg Gly
35 40 45
Leu Leu Glu Ala Leu Lys Val He Phe Asn Met Leu Gly Leu Ser Lys
50 55 60
Leu Lys Asp Lys Ser Pro Phe Ser Leu Arg Val Leu Ser Ser Phe Lys 65 70 75 80
Glu Ser Lys Arg Pro He Thr 85
(2) INFORMATION FOR SEQ ID NO: 1365:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1365: CTGAATTCGA ATGAAAAGAA TTTTAGTCTC T 31
(2) INFORMATION FOR SEQ ID NO: 1366:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1366: CCGCTCGAGT TAAAACTCAT AATTCAAAT 29
(2) INFORMATION FOR SEQ ID NO: 1367:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1367: CGCGGATCCG AAGACATGTG CAACCGATG 29
(2) INFORMATION FOR SEQ ID NO: 1368:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1368: CCGCTCGAGC TAAAAGTTTT GCAAAATCAC 30
(2) INFORMATION FOR SEQ ID NO: 1369:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1369: CGCGGATCCG ATTTTACTTG AAAAATTTAA AC 32
(2) INFORMATION FOR SEQ ID NO: 1370:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1370: CCGCTCGAGT TAGAAAGTGT AGTTCAAATA C 31
(2) INFORMATION FOR SEQ ID NO: 1371:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1371: GCGGATCCTT TTCTTCAATG TTTG 24
(2) INFORMATION FOR SEQ ID NO:1372:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1372: CCGCTCGAGT CAAAGTTTTA AACAAATTC 29
(2) INFORMATION FOR SEQ ID NO: 1373:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1373: CCGAATTCGG TTATAAAGCC CCT 23
(2) INFORMATION FOR SEQ ID NO: 1374:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1374: CCGCTCGAGT TAAGGCTGAT TTAA 24
(2) INFORMATION FOR SEQ ID NO: 1375:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1375: CGCGGATCCG AGGAAATAGC ATGTTAATAA CC 32
(2) INFORMATION FOR SEQ ID NO: 1376:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1376: CCGCTCGAGT CACTGCTTGC ATGACTTATT CCA 33
Other embodiments are within the following claims.
What is claimed is:

Claims

1. An isolated polynucleotide that encodes:
(i) a polypeptide comprising an amino acid sequence that is homologous to the amino acid sequence of a Helicobacter polypeptide selected from the group consisting of GHPO 35 (SEQ ID NO:2), GHPO 55 (SEQ ID NO:4), GHPO 78 (SEQ ID NO:6), GHPO 89 (SEQ ID NO:8), GHPO 129 (SEQ ID NO: 10), GHPO 541 (SEQ ID NO: 12), GHPO 607 (SEQ ID NO: 14), GHPO
635 (SEQ ID NO: 16), GHPO 701 (SEQ ID NO: 18), GHPO 712 (SEQ ID NO:20), GHPO 761 (SEQ ID NO:22), GHPO 838 (SEQ ID NO:24), GHPO 1034 (SEQ ID NO:26), GHPO 1085 (SEQ ID NO:28), GHPO 1213 (SEQ ID NO:30), GHPO 1255 (SEQ ID NO:32), GHPO 1308 (SEQ ID NO:34), GHPO 1389 (SEQ ID NO:36), GHPO 1706 (SEQ ID NO:38), GHPO 234 (SEQ ID
NO:40), GHPO 314 (SEQ ID NO:42), GHPO 510 (SEQ ID NO:44), GHPO 603 (SEQ ID NO:46), GHPO 937 (SEQ ID NO:48), GHPO 1027 (SEQ ID NO:50), GHPO 1099 (SEQ ID NO: 52), GHPO 1151 (SEQ ID NO:54), GHPO 1275 (SEQ ID NO:56), GHPO 1365 (SEQ ID NO:58), GHPO 1578 (SEQ ID NO:60), GHPO 22 (SEQ ID NO:62), GHPO 58 (SEQ ID NO:64), GHPO 200
(SEQ ID NO:66), GHPO 558 (SEQ ID NO:68), GHPO 563 (SEQ ID NO:70), GHPO 695 (SEQ ID NO:72), GHPO 699 (SEQ ID NO:74), GHPO 702 (SEQ ID NO:76), GHPO 709 (SEQ ID NO:78), GHPO 741 (SEQ ID NO:80), GHPO 762 (SEQ ID NO:82), GHPO 827 (SEQ ID NO: 84), GHPO 852 (SEQ ID NO:86), GHPO 1013 (SEQ ID NO:88), GHPO 1020 (SEQ ID NO:90), GHPO
1031 (SEQ ID NO:92), GHPO 1052 (SEQ ID NO:94), GHPO 1127 (SEQ ID NO:96), GHPO 1149 (SEQ ID NO:98), GHPO 1176 (SEQ ID NO: 100), GHPO 1250 (SEQ ID NO: 102), GHPO 1312 (SEQ ID NO: 104), GHPO 1358 (SEQ ID NO: 106), GHPO 1490 (SEQ ID NO: 108), GHPO 1559 (SEQ ID NO: 110), GHPO 1651 (SEQ ID NO: 112), GHPO 1726 (SEQ ID NO: 114), GHPO 1780
(SEQ ID NO: 116), GHPO 895 (SEQ ID NO: 118), GHPO 1447 (SEQ ID NO: 120), GHPO 28 (SEQ ID NO: 122), GHPO 86 (SEQ ID NO: 124), GHPO 155 (SEQ ID NO: 126), GHPO 157 (SEQ ID NO: 128), GHPO 237 (SEQ ID NO: 130), GHPO 290 (SEQ ID NO: 132), GHPO 293 (SEQ ID NO: 134), GHPO 335 (SEQ ID NO: 136), GHPO 374 (SEQ ID NO: 138), GHPO 442 (SEQ ID NO: 140), GHPO 480 (SEQ ID NO: 142), GHPO 523 (SEQ ID NO: 144), GHPO 610 (SEQ ID NO: 146), GHPO 675 (SEQ ID NO: 148), GHPO 690 (SEQ ID
NO: 150), GHPO 829 (SEQ ID NO: 152), GHPO 850 (SEQ ID NO: 154), GHPO 876 (SEQ ID NO: 156), GHPO 984 (SEQ ID NO: 158), GHPO 989 (SEQ ID NO: 160), GHPO 1111 (SEQ ID NO: 162), GHPO 1145 (SEQ ID NO: 164), GHPO 1256 (SEQ ID NO: 166), GHPO 1264 (SEQ ID NO: 168), GHPO 1316 (SEQ ID NO: 170), GHPO 1368 (SEQ ID NO: 172), GHPO 1442 (SEQ ID
NO: 174), GHPO 1506 (SEQ ID NO: 176), GHPO 1543 (SEQ ID NO: 178), GHPO 1574 (SEQ ID NO: 180), GHPO 1627 (SEQ ID NO: 182), GHPO 1657 (SEQ ID NO: 184), GHPO 1664 (SEQ ID NO: 186), GHPO 1694 (SEQ ID NO: 188), GHPO 1704 (SEQ ID NO: 190), GHPO 1763 (SEQ ID NO: 192), GHPO 616 (SEQ ID NO: 194), GHPO 76 (SEQ ID NO: 196), GHPO 109 (SEQ
ID NO: 198), GHPO 163 (SEQ ID NO:200), GHPO 169 (SEQ ID NO:202), GHPO 208 (SEQ ID NO:204), GHPO 219 (SEQ ID NO:206), GHPO 445 (SEQ ID NO:208), GHPO 479 (SEQ ID NO:210), GHPO 525 (SEQ ID NO:212), GHPO 535 (SEQ ID NO:214), GHPO 731 (SEQ ID NO:216), GHPO 836 (SEQ ID NO:218), GHPO 879 (SEQ ID NO:220), GHPO 881 (SEQ ID
NO:222), GHPO 886 (SEQ ID NO:224), GHPO 893 (SEQ ID NO:226), GHPO 894 (SEQ ID N0.228), GHPO 976 (SEQ ID NO:230), GHPO 1011 (SEQ ID NO:232), GHPO 1024 (SEQ ID NO:234), GHPO 1084 (SEQ ID NO:236), GHPO 1329 (SEQ ID NO:238), GHPO 1330 (SEQ ID NO:240), GHPO 1346 (SEQ ID NO:242), GHPO 1360 (SEQ ID NO:244), GHPO 1388 (SEQ ID
NO:246), GHPO 1411 (SEQ ID NO:248), GHPO 1419 (SEQ ID NO:250), GHPO 1446 (SEQ ID NO:252), GHPO 1469 (SEQ ID NO:254), GHPO 1501 (SEQ ID NO:256), GHPO 1505 (SEQ ID NO:258), GHPO 1522 (SEQ ID NO:260), GHPO 1525 (SEQ ID NO:262), GHPO 1615 (SEQ ID NO:264), GHPO 1689 (SEQ ID NO:266), GHPO 1733 (SEQ ID NO:268), GHPO 18 (SEQ ID NO:270), GHPO 139 (SEQ ID NO:272), GHPO 142 (SEQ ID NO:274), GHPO 250 (SEQ ID NO:276), GHPO 257 (SEQ ID NO:278), GHPO
325 (SEQ ID NO:280), GHPO 355 (SEQ ID NO:282), GHPO 357 (SEQ ID NO:284), GHPO 454 (SEQ ID NO:286), GHPO 475 (SEQ ID NO:288), GHPO 515 (SEQ ID NO:290), GHPO 527 (SEQ ID NO:292), GHPO 551 (SEQ ID NO:294), GHPO 602 (SEQ ID NO:296), GHPO 626 (SEQ ID NO:298), GHPO 646 (SEQ ID NO:300), GHPO 653 (SEQ ID NO:302), GHPO 655 (SEQ ID
NO:304), GHPO 670 (SEQ ID NO:306), GHPO 739 (SEQ ID NO:308), GHPO 798 (SEQ ID NO:310), GHPO 1102 (SEQ ID NO:312), GHPO 1114 (SEQ ID NO:314), GHPO 1152 (SEQ ID NO:316), GHPO 1272 (SEQ ID NO:318), GHPO 1345 (SEQ ID NO:320), GHPO 1377 (SEQ ID NO:322), GHPO 1424 (SEQ ID NO:324), GHPO 1430 (SEQ ID NO:326), GHPO 1502 (SEQ ID
NO:328), GHPO 1600 (SEQ ID NO:330), GHPO 1714 (SEQ ID NO:332), GHPO 359 (SEQ ID NO:334), GHPO 678 (SEQ ID NO:336), GHPO 708 (SEQ ID NO:338), GHPO 759 (SEQ ID NO:340), GHPO 847 (SEQ ID NO:342), GHPO 1050 (SEQ ID NO:344), GHPO 1101 (SEQ ID NO:346), GHPO 1120 (SEQ ID NO:348), GHPO 1138 (SEQ ID NO:350), GHPO 1310
(SEQ ID NO:352), GHPO 1320 (SEQ ID NO:354), GHPO 1375 (SEQ ID NO:356), GHPO 1432 (SEQ ID NO:358), GHPO 21 (SEQ ID NO:360), GHPO 282 (SEQ ID NO:362), GHPO 1089 (SEQ ID NO:364), GHPO 1141 (SEQ ID NO:366), GHPO 1280 (SEQ ID NO:368), GHPO 1608 (SEQ ID NO:370), GHPO 15 (SEQ ID NO:372), GHPO 16 (SEQ ID NO:374), GHPO 36 (SEQ ID
NO:376), GHPO 38 (SEQ ID NO:378), GHPO 52 (SEQ ID NO:380), GHPO 57 (SEQ ID NO:382), GHPO 64 (SEQ ID NO:384), GHPO 79 (SEQ ID NO:386), GHPO 84 (SEQ ID NO:388), GHPO 86 (SEQ ID NO:390), GHPO 99 (SEQ ID NO:392), GHPO 106 (SEQ ID NO:394), GHPO 118 (SEQ ID NO:396), GHPO 122 (SEQ ID NO:398), GHPO 128 (SEQ ID NO:400), GHPO 138 (SEQ ID NO:402), GHPO 153 (SEQ ID NO:404), GHPO 160 (SEQ ID NO:406), GHPO 168 (SEQ ID NO:408), GHPO 179 (SEQ ID NO:410), GHPO
189 (SEQ ID NO:412), GHPO 229 (SEQ ID NO:414), GHPO 243 (SEQ ID NO:416), GHPO 244 (SEQ ID NO:418), GHPO 251 (SEQ ID NO:420), GHPO 267 (SEQ ID NO:422), GHPO 269 (SEQ ID NO:424), GHPO 279 (SEQ ID NO:426), GHPO 284 (SEQ ID NO:428), GHPO 296 (SEQ ID NO:430), GHPO 300 (SEQ ID NO:432), GHPO 305 (SEQ ID NO:434), GHPO 319 (SEQ ID
NO:436), GHPO 330 (SEQ ID NO:438), GHPO 340 (SEQ ID NO:440), GHPO 342 (SEQ ID NO:442), GHPO 344 (SEQ ID NO:444), GHPO 358 (SEQ ID NO:446), GHPO 373 (SEQ ID NO:448), GHPO 382 (SEQ ID NO:450), GHPO 384 (SEQ ID NO:452), GHPO 398 (SEQ ID NO:454), GHPO 409 (SEQ ID NO:456), GHPO 422 (SEQ ID NO:458), GHPO 430 (SEQ ID NO:460), GHPO
446 (SEQ ID NO:462), GHPO 447 (SEQ ID NO:464), GHPO 450 (SEQ ID NO:466), GHPO 451 (SEQ ID NO:468), GHPO 452 (SEQ ID NO:470), GHPO 456 (SEQ ID NO:472), GHPO 461 (SEQ ID NO:474), GHPO 476 (SEQ ID NO:476), GHPO 478 (SEQ ID NO:478), GHPO 491 (SEQ ID NO:480), GHPO 511 (SEQ ID NO:482), GHPO 519 (SEQ ID NO:484), GHPO 526 (SEQ ID
NO:486), GHPO 534 (SEQ ID NO:488), GHPO 536 (SEQ ID NO:490), GHPO 542 (SEQ ID NO:492), GHPO 544 (SEQ ID NO:494), GHPO 576 (SEQ ID NO:496), GHPO 578 (SEQ ID NO:498), GHPO 580 (SEQ ID NO:500), GHPO 585 (SEQ ID NO:502), GHPO 599 (SEQ ID NO:504), GHPO 639 (SEQ ID NO:506), GHPO 642 (SEQ ID NO:508), GHPO 647 (SEQ ID NO:510), GHPO
654 (SEQ ID NO:512), GHPO 669 (SEQ ID NO:514), GHPO 710 (SEQ ID NO:516), GHPO 713 (SEQ ID NO:518), GHPO 716 (SEQ ID NO:520), GHPO 718 (SEQ ID NO:522), GHPO 726 (SEQ ID NO:524), GHPO 734 (SEQ ID NO:526), GHPO 740 (SEQ ID NO:528), GHPO 770 (SEQ ID NO:530), GHPO 782 (SEQ ID NO:532), GHPO 786 (SEQ ID NO:534), GHPO 792 (SEQ ID NO:536), GHPO 797 (SEQ ID NO:538), GHPO 816 (SEQ ID NO:540), GHPO 828 (SEQ ID NO:542), GHPO 839 (SEQ ID NO:544), GHPO 840 (SEQ ID
NO:546), GHPO 842 (SEQ ID NO:548), GHPO 885 (SEQ ID NO:550), GHPO 889 (SEQ ID NO:552), GHPO 903 (SEQ ID NO:554), GHPO 912 (SEQ ID NO:556), GHPO 946 (SEQ ID NO:558), GHPO 958 (SEQ ID NO:560), GHPO 968 (SEQ ID NO:562), GHPO 987 (SEQ ID NO:564), GHPO 992 (SEQ ID NO:566), GHPO 996 (SEQ ID NO:568), GHPO 997 (SEQ ID NO:570), GHPO
1002 (SEQ ID NO: 572), GHPO 1026 (SEQ ID NO: 574), GHPO 1028 (SEQ ID NO:576), GHPO 1034 (SEQ ID NO:578), GHPO 1038 (SEQ ID NO:580), GHPO 1059 (SEQ ID NO:582), GHPO 1065 (SEQ ID NO:584), GHPO 1072 (SEQ ID NO:586), GHPO 1073 (SEQ ID NO:588), GHPO 1088 (SEQ ID NO:590), GHPO 1091 (SEQ ID NO:592), GHPO 1105 (SEQ ID NO:594),
GHPO 1115 (SEQ ID NO:596), GHPO 1159 (SEQ ID NO:598), GHPO 1177 (SEQ ID NO:600), GHPO 1187 (SEQ ID NO:602), GHPO 1192 (SEQ ID NO:604), GHPO 1195 (SEQ ID NO:606), GHPO 1224 (SEQ ID NO:608), GHPO 1225 (SEQ ID NO:610), GHPO 1228 (SEQ ID NO:612), GHPO 1229 (SEQ ID NO:614), GHPO 1231 (SEQ ID NO:616), GHPO 1236 (SEQ ID
NO:618), GHPO 1242 (SEQ ID NO:620), GHPO 1248 (SEQ ID NO:622), GHPO 1270 (SEQ ID NO:624), GHPO 1271 (SEQ ID NO:626), GHPO 1298 (SEQ ID NO:628), GHPO 1301 (SEQ ID NO:630), GHPO 1304 (SEQ ID NO:632), GHPO 1315 (SEQ ID NO:634), GHPO 1319 (SEQ ID NO:636), GHPO 1323 (SEQ ID NO:638), GHPO 1331 (SEQ ID NO:640), GHPO 1332
(SEQ ID NO:642), GHPO 1347 (SEQ ID NO:644), GHPO 1373 (SEQ ID NO:646), GHPO 1376 (SEQ ID NO:648), GHPO 1380 (SEQ ID NO:650), GHPO 1394 (SEQ ID NO:652), GHPO 1407 (SEQ ID NO:654), GHPO 1415 (SEQ ID NO:656), GHPO 1425 (SEQ ID NO:658), GHPO 1427 (SEQ ID NO:660), GHPO 1444 (SEQ ID NO:662), GHPO 1449 (SEQ ID NO:664), GHPO 1465 (SEQ ID NO:666), GHPO 1475 (SEQ ID NO:668), GHPO 1479 (SEQ ID NO:670), GHPO 1483 (SEQ ID NO:672), GHPO 1488 (SEQ ID
NO:674), GHPO 1496 (SEQ ID NO:676), GHPO 1524 (SEQ ID NO:678), GHPO 1536 (SEQ ID NO:680), GHPO 1539 (SEQ ID NO:682), GHPO 1540 (SEQ ID NO:684), GHPO 1542 (SEQ ID NO:686), GHPO 1555 (SEQ ID NO:688), GHPO 1560 (SEQ ID NO:690), GHPO 1564 (SEQ ID NO:692), GHPO 1570 (SEQ ID NO:694), GHPO 1588 (SEQ ID NO:696), GHPO 1604
(SEQ ID NO:698), GHPO 1605 (SEQ ID NO:700), GHPO 1619 (SEQ ID NO:702), GHPO 1629 (SEQ ID NO:704), GHPO 1642 (SEQ ID NO:706), GHPO 1654 (SEQ ID NO:708), GHPO 1661 (SEQ ID NO:710), GHPO 1673 (SEQ ID NO:712), GHPO 1687 (SEQ ID NO:714), GHPO 1692 (SEQ ID NO:716), GHPO 1693 (SEQ ID NO:718), GHPO 1699 (SEQ ID NO:720),
GHPO 1738 (SEQ ID NO:722), GHPO 1745 (SEQ ID NO:724), GHPO 1746 (SEQ ID NO:726), GHPO 1754 (SEQ ID NO:728), GHPO 1792 (SEQ ID NO:730), GHPO 1795 (SEQ ID NO:732), GHPO 1796 (SEQ ID NO:734), GHPO 7 (SEQ ID NO:736), GHPO 8 (SEQ ID NO:738), GHPO 9 (SEQ ID NO:740), GHPO 10 (SEQ ID NO:742), GHPO 12 (SEQ ID NO:744), GHPO
25 (SEQ ID NO:746), GHPO 27 (SEQ ID NO:748), GHPO 29 (SEQ ID NO:750), GHPO 30 (SEQ ID NO:752), GHPO 37 (SEQ ID NO:754), GHPO 49 (SEQ ID NO:756), GHPO 51 (SEQ ID NO:758), GHPO 54 (SEQ ID NO:760), GHPO 65 (SEQ ID NO:762), GHPO 66 (SEQ ID NO:764), GHPO 68 (SEQ ID NO:766), GHPO 70 (SEQ ID NO:768), GHPO 77 (SEQ ID
NO:770), GHPO 83 (SEQ ID NO:772), GHPO 85 (SEQ ID NO:774), GHPO 87 (SEQ ID NO:776), GHPO 91 (SEQ ID NO:778), GHPO 92 (SEQ ID NO:780), GHPO 96 (SEQ ID NO:782), GHPO 97 (SEQ ID NO:784), GHPO 111 (SEQ ID NO:786), GHPO 115 (SEQ ID NO:788), GHPO 117 (SEQ ID NO:790), GHPO 123 (SEQ ID NO:792), GHPO 124 (SEQ ID NO:794), GHPO 126 (SEQ ID NO:796), GHPO 127 (SEQ ID NO:798), GHPO 128 (SEQ ID NO:800), GHPO 131 (SEQ ID NO:802), GHPO 133 (SEQ ID NO:804), GHPO
140 (SEQ ID NO:806), GHPO 141 (SEQ ID NO:808), GHPO 145 (SEQ ID NO:810), GHPO 147 (SEQ ID NO:812), GHPO 166 (SEQ ID NO:814), GHPO 181 (SEQ ID NO:816), GHPO 187 (SEQ ID NO:818), GHPO 188 (SEQ ID NO:820), GHPO 192 (SEQ ID NO:822), GHPO 202 (SEQ ID NO:824), GHPO 204 (SEQ ID NO:826), GHPO 205 (SEQ ID NO:828), GHPO 212 (SEQ ID
NO:830), GHPO 218 (SEQ ID NO:832), GHPO 226 (SEQ ID NO:834), GHPO 231 (SEQ ID NO:836), GHPO 236 (SEQ ID NO:838), GHPO 239 (SEQ ID NO:840), GHPO 245 (SEQ ID NO:842), GHPO 246 (SEQ ID NO:844), GHPO 248 (SEQ ID NO:846), GHPO 253 (SEQ ID NO:848), GHPO 265 (SEQ ID NO:850), GHPO 266 (SEQ ID NO:852), GHPO 271 (SEQ ID NO:854), GHPO
272 (SEQ ID NO:856), GHPO 286 (SEQ ID NO:858), GHPO 291 (SEQ ID NO:860), GHPO 292 (SEQ ID NO:862), GHPO 297 (SEQ ID NO:864), GHPO 304 (SEQ ID NO:866), GHPO 307 (SEQ ID NO:868), GHPO 324 (SEQ ID NO:870), GHPO 326 (SEQ ID NO:872), GHPO 331 (SEQ ID NO:874), GHPO 343 (SEQ ID NO:876), GHPO 345 (SEQ ID NO:878), GHPO 346 (SEQ ID
NO:880), GHPO 352 (SEQ ID NO:882), GHPO 355 (SEQ ID NO:884), GHPO 363 (SEQ ID NO:886), GHPO 369 (SEQ ID NO:888), GHPO 376 (SEQ ID NO:890), GHPO 378 (SEQ ID NO:892), GHPO 388 (SEQ ID NO:894), GHPO 396 (SEQ ID NO:896), GHPO 403 (SEQ ID NO:898), GHPO 410 (SEQ ID NO:900), GHPO 415 (SEQ ID NO:902), GHPO 421 (SEQ ID NO:904), GHPO
439 (SEQ ID NO:906), GHPO 441 (SEQ ID NO:908), GHPO 443 (SEQ ID NO:910), GHPO 453 (SEQ ID NO:912), GHPO 455 (SEQ ID NO:914), GHPO 464 (SEQ ID NO:916), GHPO 467 (SEQ ID NO:918), GHPO 468 (SEQ ID NO:920), GHPO 470 (SEQ ID NO:922), GHPO 486 (SEQ ID NO:924), GHPO 487 (SEQ ID NO:926), GHPO 488 (SEQ ID NO:928), GHPO 489 (SEQ ID NO:930), GHPO 498 (SEQ ID NO:932), GHPO 501 (SEQ ID NO:934), GHPO 504 (SEQ ID NO:936), GHPO 512 (SEQ ID NO:938), GHPO 517 (SEQ ID
NO:940), GHPO 520 (SEQ ID NO:942), GHPO 528 (SEQ ID NO:944), GHPO 530 (SEQ ID NO:946), GHPO 532 (SEQ ID NO:948), GHPO 548 (SEQ ID NO:950), GHPO 561 (SEQ ID NO:952), GHPO 564 (SEQ ID NO:954), GHPO 572 (SEQ ID NO:956), GHPO 573 (SEQ ID'NO:958), GHPO 574 (SEQ ID NO:960), GHPO 577 (SEQ ID NO:962), GHPO 579 (SEQ ID NO:964), GHPO
583 (SEQ ID NO:966), GHPO 588 (SEQ ID NO:968), GHPO 593 (SEQ ID NO:970), GHPO 597 (SEQ ID NO:972), GHPO 598 (SEQ ID NO:974), GHPO 604 (SEQ ID NO:976), GHPO 606 (SEQ ID NO:978), GHPO 611 (SEQ ID NO:980), GHPO 612 (SEQ ID NO:982), GHPO 615 (SEQ ID NO:984), GHPO 632 (SEQ ID NO:986), GHPO 633 (SEQ ID NO:988), GHPO 637 (SEQ ID
NO:990), GHPO 651 (SEQ ID NO:992), GHPO 663 (SEQ ID NO:994), GHPO 686 (SEQ ID NO:996), GHPO 693 (SEQ ID NO:998), GHPO 698 (SEQ ID NO: 1000), GHPO 703 (SEQ ID NO: 1002), GHPO 704 (SEQ ID NO: 1004), GHPO 705 (SEQ ID NO: 1006), GHPO 707 (SEQ ID NO: 1008), GHPO 721 (SEQ ID NO: 1010), GHPO 727 (SEQ ID NO: 1012), GHPO 728 (SEQ ID
NO: 1014), GHPO 733 (SEQ ID NO: 1016), GHPO 758 (SEQ ID NO:1018), GHPO 763 (SEQ ID NO: 1020), GHPO 771 (SEQ ID NO:1022), GHPO 774 (SEQ ID NO: 1024), GHPO 776 (SEQ ID NO: 1026), GHPO 783 (SEQ ID NO: 1028), GHPO 800 (SEQ ID NO: 1030), GHPO 806 (SEQ ID NO: 1032), GHPO 807 (SEQ ID NO: 1034), GHPO 808 (SEQ ID NO: 1036), GHPO 809
(SEQ ID NO: 1038), GHPO 811 (SEQ ID NO: 1040), GHPO 815 (SEQ ID NO: 1042), GHPO 819 (SEQ ID NO: 1044), GHPO 841 (SEQ ID NO: 1046), GHPO 843 (SEQ ID NO: 1048), GHPO 846 (SEQ ID NO: 1050), GHPO 875 (SEQ ID NO: 1052), GHPO 892 (SEQ ID NO: 1054), GHPO 902 (SEQ ID NO: 1056), GHPO 904 (SEQ ID NO: 1058), GHPO 906 (SEQ ID NO: 1060), GHPO 908 (SEQ ID NO: 1062), GHPO 921 (SEQ ID NO: 1064), GHPO 923 (SEQ ID NO: 1066), GHPO 926 (SEQ ID NO: 1068), GHPO 933 (SEQ ID
NO: 1070), GHPO 939 (SEQ ID NO: 1072), GHPO 940 (SEQ ID NO: 1074), GHPO 943 (SEQ ID NO: 1076), GHPO 951 (SEQ ID NO: 1078), GHPO 961 (SEQ ID NO: 1080), GHPO 965 (SEQ ID NO: 1082), GHPO 990 (SEQ ID NO: 1084), GHPO 991 (SEQ ID NO: 1086), GHPO 998 (SEQ ID NO: 1088), GHPO 1001 (SEQ ID NO: 1090), GHPO 1005 (SEQ ID NO: 1092), GHPO
1033 (SEQ ID NO: 1094), GHPO 1039 (SEQ ID NO: 1096), GHPO 1041 (SEQ ID NO: 1098), GHPO 1043 (SEQ ID NO: 1100), GHPO 1044 (SEQ ID NO: 1102), GHPO 1051 (SEQ ID NO: 1104), GHPO 1058 (SEQ ID NO: 1106), GHPO 1060 (SEQ ID NO: 1108), GHPO 1075 (SEQ ID NO: l 110), GHPO 1077 (SEQ ID NO: 1112), GHPO 1082 (SEQ ID NO: 1114), GHPO 1083 (SEQ
ID NO: 1116), GHPO 1086 (SEQ ID NO: l 118), GHPO 1087 (SEQ ID NO: 1120), GHPO 1090 (SEQ ID NO: 1122), GHPO 1097 (SEQ ID NO: 1124), GHPO 1098 (SEQ ID NO: l 126), GHPO 1103 (SEQ ID NO:l 128), GHPO 1113 (SEQ ID NO: l 130), GHPO 1116 (SEQ ID NO:l 132), GHPO 1123 (SEQ ID NO: 1134), GHPO 1125 (SEQ ID NO: 1136), GHPO 1129 (SEQ ID
NO: 1138), GHPO 1130 (SEQ ID NO: 1140), GHPO 1134 (SEQ ID NO: 1142), GHPO 1161 (SEQ ID NO: l 144), GHPO 1166 (SEQ ID NO: 1146), GHPO 1170 (SEQ ID NO: 1148), GHPO 1175 (SEQ ID NO: l 150), GHPO 1181 (SEQ ID NO: 1152), GHPO 1186 (SEQ ID NO: 1154), GHPO 1188 (SEQ ID NO: 1156), GHPO 1191 (SEQ ID NO: 1158), GHPO 1193 (SEQ ID NO: 1160),
GHPO 1196 (SEQ ID NO: 1162), GHPO 1204 (SEQ ID NO: 1164), GHPO 1210 (SEQ ID NO:l 166), GHPO 1211 (SEQ ID NO: 1168), GHPO 1216 (SEQ ID NO: 1170), GHPO 1218 (SEQ ID NO: 1172), GHPO 1220 (SEQ ID NO: 1174), GHPO 1223 (SEQ ID NO: 1176), GHPO 1226 (SEQ ID NO: 1 178), GHPO 1240 (SEQ ID NO: 1180), GHPO 1246 (SEQ ID NO: 1182), GHPO 1251 (SEQ ID NO: 1184), GHPO 1252 (SEQ ID NO: 1186), GHPO 1261 (SEQ ID NO: 1188), GHPO 1265 (SEQ ID NO: 1190), GHPO 1267 (SEQ ID
NO: 1192), GHPO 1278 (SEQ ID NO: 1194), GHPO 1282 (SEQ ID NO: 1196), GHPO 1283 (SEQ ID NO: 1198), GHPO 1287 (SEQ ID NO: 1200), GHPO 1292 (SEQ ID NO: 1202), GHPO 1293 (SEQ ID NO: 1204), GHPO 1302 (SEQ ID NO: 1206), GHPO 1309 (SEQ ID NO: 1208), GHPO 1317 (SEQ ID NO: 1210), GHPO 1318 (SEQ ID NO:1212), GHPO 1321 (SEQ ID NO:1214),
GHPO 1325 (SEQ ID NO: 1216), GHPO 1341 (SEQ ID NO: 1218), GHPO 1351 (SEQ ID NO:1220), GHPO 1354 (SEQ ID NO:1222), GHPO 1363 (SEQ ID NO: 1224), GHPO 1371 (SEQ ID NO: 1226), GHPO 1381 (SEQ ID NO: 1228), GHPO 1401 (SEQ ID NO: 1230), GHPO 1402 (SEQ ID NO: 1232), GHPO 1403 (SEQ ID NO: 1234), GHPO 1408 (SEQ ID NO: 1236), GHPO
1416 (SEQ ID NO:1238), GHPO 1420 (SEQ ID NO:1240), GHPO 1428 (SEQ ID NO: 1242), GHPO 1437 (SEQ ID NO: 1244), GHPO 1439 (SEQ ID NO: 1246), GHPO 1460 (SEQ ID NO:1248), GHPO 1463 (SEQ ID NO: 1250), GHPO 1472 (SEQ ID NO: 1252), GHPO 1474 (SEQ ID NO: 1254), GHPO 1484 (SEQ ID NO: 1256), GHPO 1489 (SEQ ID NO: 1258), GHPO 1494 (SEQ
ID NO: 1260), GHPO 1495 (SEQ ID NO: 1262), GHPO 1498 (SEQ ID NO: 1264), GHPO 1499 (SEQ ID NO: 1266), GHPO 1500 (SEQ ID NO: 1268), GHPO 1503 (SEQ ID NO: 1270), GHPO 1504 (SEQ ID NO: 1272), GHPO 1510 (SEQ ID NO: 1274), GHPO 1518 (SEQ ID NO:1276), GHPO 1533 (SEQ ID NO: 1278), GHPO 1541 (SEQ ID NO:1280), GHPO 1544 (SEQ ID
NO: 1282), GHPO 1548 (SEQ ID NO: 1284), GHPO 1565 (SEQ ID NO: 1286), GHPO 1575 (SEQ ID NO: 1288), GHPO 1582 (SEQ ID NO: 1290), GHPO 1595 (SEQ ID NO: 1292), GHPO 1597 (SEQ ID NO: 1294), GHPO 1599 (SEQ ID NO: 1296), GHPO 1601 (SEQ ID NO: 1298), GHPO 1609 (SEQ ID NO: 1300), GHPO 1613 (SEQ ID NO: 1302), GHPO 1614 (SEQ ID NO: 1304), GHPO 1626 (SEQ ID NO: 1306), GHPO 1628 (SEQ ID NO: 1308), GHPO 1639 (SEQ ID NO: 1310), GHPO 1640 (SEQ ID NO: 1312), GHPO 1641 (SEQ
ID NO: 1314), GHPO 1646 (SEQ ID NO: 1316), GHPO 1662 (SEQ ID NO: 1318), GHPO 1667 (SEQ ID NO: 1320), GHPO 1668 (SEQ ID NO: 1322), GHPO 1670 (SEQ ID NO: 1324), GHPO 1671 (SEQ ID NO: 1326), GHPO 1672 (SEQ ID NO: 1328), GHPO 1678 (SEQ ID NO: 1330), GHPO 1684 (SEQ ID NO: 1332), GHPO 1695 (SEQ ID NO: 1334), GHPO 1697 (SEQ ID
NO: 1336), GHPO 1701 (SEQ ID NO: 1338), GHPO 1719 (SEQ ID NO: 1340), GHPO 1723 (SEQ ID NO: 1342), GHPO 1732 (SEQ ID NO: 1344), GHPO 1739 (SEQ ID NO: 1346), GHPO 1741 (SEQ ID NO: 1348), GHPO 1747 (SEQ ID NO: 1350), GHPO 1749 (SEQ ID NO: 1352), GHPO 1750 (SEQ ID NO: 1354), GHPO 1751 (SEQ ID NO: 1356), GHPO 1755 (SEQ ID NO: 1358),
GHPO 1771 (SEQ ID NO: 1360), GHPO 1786 (SEQ ID NO: 1362), and GHPO 1789 (SEQ ID NO: 1364); or
(ii) a derivative of said Helicobacter polypeptide.
2. The isolated polynucleotide of claim 1, which encodes a mature form of said Helicobacter polypeptide.
3. The isolated polynucleotide of claim 1 or 2, wherein the polynucleotide is a DNA molecule.
4. The isolated polynucleotide of claim 1, which is a DNA molecule that can be amplified by polymerase chain reaction from a Helicobacter genome.
5. The isolated DNA molecule of claim 4, which can be amplified by the polymerase chain reaction from a Helicobacter pylori genome.
6. The isolated polynucleotide of claim 1 , which is a DNA molecule that encodes the mature form or a derivative of a polypeptide encoded by the DNA molecule of claim 4.
7. The isolated polynucleotide of claim 1, which is a DNA molecule that encodes the mature form or a derivative of a polypeptide encoded by the DNA molecule of claim 5.
8. A compound, in a substantially purified form, that is the mature form or a derivative of a polypeptide comprising an amino acid sequence that is homologous to a Helicobacter polypeptide selected from the group consisting of GHPO 35 (SEQ ID NO:2), GHPO 55 (SEQ ID NO:4), GHPO 78 (SEQ ID NO:6), GHPO 89 (SEQ ID NO:8), GHPO 129 (SEQ ID NO: 10), GHPO 541 (SEQ ID NO:12), GHPO 607 (SEQ ID NO:14), GHPO 635 (SEQ ID NO:16),
GHPO 701 (SEQ ID NO:18), GHPO 712 (SEQ ID NO:20), GHPO 761 (SEQ ID NO:22), GHPO 838 (SEQ ID NO:24), GHPO 1034 (SEQ ID NO:26), GHPO 1085 (SEQ ID NO:28), GHPO 1213 (SEQ ID NO:30), GHPO 1255 (SEQ ID NO:32), GHPO 1308 (SEQ ID NO:34), GHPO 1389 (SEQ ID NO:36), GHPO 1706 (SEQ ID NO:38), GHPO 234 (SEQ ID NO:40), GHPO
314 (SEQ ID NO:42), GHPO 510 (SEQ ID NO:44), GHPO 603 (SEQ ID NO:46), GHPO 937 (SEQ ID NO:48), GHPO 1027 (SEQ ID NO:50), GHPO 1099 (SEQ ID NO:52), GHPO 1151 (SEQ ID NO:54), GHPO 1275 (SEQ ID NO:56), GHPO 1365 (SEQ ID NO:58), GHPO 1578 (SEQ ID NO:60), GHPO 22 (SEQ ID NO:62), GHPO 58 (SEQ ID NO:64), GHPO 200 (SEQ ID NO:66), GHPO 558 (SEQ ID NO:68), GHPO 563 (SEQ ID NO:70), GHPO 695 (SEQ ID NO:72), GHPO 699 (SEQ ID NO:74), GHPO 702 (SEQ ID NO:76), GHPO
709 (SEQ ID NO:78), GHPO 741 (SEQ ID NO:80), GHPO 762 (SEQ ID NO:82), GHPO 827 (SEQ ID NO:84), GHPO 852 (SEQ ID NO:86), GHPO 1013 (SEQ ID NO:88), GHPO 1020 (SEQ ID NO:90), GHPO 1031 (SEQ ID NO:92), GHPO 1052 (SEQ ID NO:94), GHPO 1127 (SEQ ID NO:96), GHPO 1149 (SEQ ID NO:98), GHPO 1176 (SEQ ID NO: 100), GHPO 1250 (SEQ ID
NO: 102), GHPO 1312 (SEQ ID NO: 104), GHPO 1358 (SEQ ID NO: 106), GHPO 1490 (SEQ ID NO: 108), GHPO 1559 (SEQ ID NO: 110), GHPO 1651 (SEQ ID NO: 112), GHPO 1726 (SEQ ID NO: 114), GHPO 1780 (SEQ ID NO: l 16), GHPO 895 (SEQ ID NO: 118), GHPO 1447 (SEQ ID NO: 120), GHPO 28 (SEQ ID NO: 122), GHPO 86 (SEQ ID NO: 124), GHPO 1 5 (SEQ
ID NO: 126), GHPO 157 (SEQ ID NO: 128), GHPO 237 (SEQ ID NO: 130), GHPO 290 (SEQ ID NO: 132), GHPO 293 (SEQ ID NO: 134), GHPO 335 (SEQ ID NO: 136), GHPO 374 (SEQ ID NO: 138), GHPO 442 (SEQ ID NO: 140), GHPO 480 (SEQ ID NO: 142), GHPO 523 (SEQ ID NO: 144), GHPO 610 (SEQ ID NO: 146), GHPO 675 (SEQ ID NO: 148), GHPO 690 (SEQ ID
NO: 150), GHPO 829 (SEQ ID NO: 152), GHPO 850 (SEQ ID NO: 154), GHPO 876 (SEQ ID NO: 156), GHPO 984 (SEQ ID NO: 158), GHPO 989 (SEQ ID NO: 160), GHPO 1111 (SEQ ID NO: 162), GHPO 1145 (SEQ ID NO: 164), GHPO 1256 (SEQ ID NO: 166), GHPO 1264 (SEQ ID NO: 168), GHPO 1316 (SEQ ID NO:170), GHPO 1368 (SEQ ID NO: 172), GHPO 1442 (SEQ ID
NO: 174), GHPO 1506 (SEQ ID NO: 176), GHPO 1543 (SEQ ID NO: 178), GHPO 1574 (SEQ ID NO: 180), GHPO 1627 (SEQ ID NO: 182), GHPO 1657 (SEQ ID NO: 184), GHPO 1664 (SEQ ID NO: 186), GHPO 1694 (SEQ ID NO: 188), GHPO 1704 (SEQ ID NO: 190), GHPO 1763 (SEQ ID NO: 192), GHPO 616 (SEQ ID NO: 194), GHPO 76 (SEQ ID NO: 196), GHPO 109 (SEQ ID NO: 198), GHPO 163 (SEQ ID NO:200), GHPO 169 (SEQ ID NO:202), GHPO 208 (SEQ ID NO:204), GHPO 219 (SEQ ID NO:206), GHPO 445
(SEQ ID NO:208), GHPO 479 (SEQ ID NO:210), GHPO 525 (SEQ ID NO:212), GHPO 535 (SEQ ID NO:214), GHPO 731 (SEQ ID NO:216), GHPO 836 (SEQ ID NO:218), GHPO 879 (SEQ ID NO:220), GHPO 881 (SEQ ID NO:222), GHPO 886 (SEQ ID NO:224), GHPO 893 (SEQ ID NO:226), GHPO 894 (SEQ ID NO:228), GHPO 976 (SEQ ID NO:230), GHPO 1011 (SEQ ID
NO:232), GHPO 1024 (SEQ ID NO:234), GHPO 1084 (SEQ ID NO:236), GHPO 1329 (SEQ ID NO:238), GHPO 1330 (SEQ ID NO:240), GHPO 1346 (SEQ ID NO:242), GHPO 1360 (SEQ ID NO:244), GHPO 1388 (SEQ ID NO:246), GHPO 1411 (SEQ ID NO:248), GHPO 1419 (SEQ ID NO:250), GHPO 1446 (SEQ ID NO:252), GHPO 1469 (SEQ ID NO:254), GHPO 1501
(SEQ ID NO:256), GHPO 1505 (SEQ ID NO:258), GHPO 1522 (SEQ ID NO:260), GHPO 1525 (SEQ ID NO:262), GHPO 1615 (SEQ ID NO:264), GHPO 1689 (SEQ ID NO:266), GHPO 1733 (SEQ ID NO:268), GHPO 18 (SEQ ID NO:270), GHPO 139 (SEQ ID NO:272), GHPO 142 (SEQ ID NO:274), GHPO 250 (SEQ ID NO:276), GHPO 257 (SEQ ID NO:278), GHPO
325 (SEQ ID NO:280), GHPO 355 (SEQ ID NO:282), GHPO 357 (SEQ ID NO:284), GHPO 454 (SEQ ID NO:286), GHPO 475 (SEQ ID NO:288), GHPO 515 (SEQ ID NO:290), GHPO 527 (SEQ ID NO:292), GHPO 551 (SEQ ID NO:294), GHPO 602 (SEQ ID NO:296), GHPO 626 (SEQ ID NO:298), GHPO 646 (SEQ ID NO:300), GHPO 653 (SEQ ID NO:302), GHPO 655 (SEQ ID
NO:304), GHPO 670 (SEQ ID NO:306), GHPO 739 (SEQ ID NO:308), GHPO 798 (SEQ ID NO:310), GHPO 1102 (SEQ ID NO:312), GHPO 1114 (SEQ ID NO:314), GHPO 1152 (SEQ ID NO:316), GHPO 1272 (SEQ ID NO:318), GHPO 1345 (SEQ ID NO:320), GHPO 1377 (SEQ ID NO:322), GHPO 1424 (SEQ ID NO:324), GHPO 1430 (SEQ ID NO:326), GHPO 1502 (SEQ ID NO:328), GHPO 1600 (SEQ ID NO:330), GHPO 1714 (SEQ ID NO:332), GHPO 359 (SEQ ID NO:334), GHPO 678 (SEQ ID NO:336), GHPO 708
(SEQ ID NO:338), GHPO 759 (SEQ ID NO:340), GHPO 847 (SEQ ID NO:342), GHPO 1050 (SEQ ID NO:344), GHPO 1101 (SEQ ID NO:346), GHPO 1120 (SEQ ID NO:348), GHPO 1138 (SEQ ID NO:350), GHPO 1310 (SEQ ID NO:352), GHPO 1320 (SEQ ID NO:354), GHPO 1375 (SEQ ID NO:356), GHPO 1432 (SEQ ID NO:358), GHPO 21 (SEQ ID NO:360), GHPO
282 (SEQ ID NO:362), GHPO 1089 (SEQ ID NO:364), GHPO 1141 (SEQ ID NO:366), GHPO 1280 (SEQ ID NO:368), GHPO 1608 (SEQ ID NO:370), GHPO 15 (SEQ ID NO:372), GHPO 16 (SEQ ID NO:374), GHPO 36 (SEQ ID NO:376), GHPO 38 (SEQ ID NO:378), GHPO 52 (SEQ ID NO:380), GHPO 57 (SEQ ID NO:382), GHPO 64 (SEQ ID NO:384), GHPO 79 (SEQ ID
NO:386), GHPO 84 (SEQ ID NO:388), GHPO 86 (SEQ ID NO:390), GHPO 99 (SEQ ID NO:392), GHPO 106 (SEQ ID NO:394), GHPO 118 (SEQ ID NO:396), GHPO 122 (SEQ ID NO:398), GHPO 128 (SEQ ID NO:400), GHPO 138 (SEQ ID NO:402), GHPO 153 (SEQ ID NO:404), GHPO 160 (SEQ ID NO:406), GHPO 168 (SEQ ID NO:408), GHPO 179 (SEQ ID NO:410), GHPO
189 (SEQ ID NO:412), GHPO 229 (SEQ ID NO:414), GHPO 243 (SEQ ID NO:416), GHPO 244 (SEQ ID NO:418), GHPO 251 (SEQ ID NO:420), GHPO 267 (SEQ ID NO:422), GHPO 269 (SEQ ID NO:424), GHPO 279 (SEQ ID NO:426), GHPO 284 (SEQ ID NO:428), GHPO 296 (SEQ ID NO:430), GHPO 300 (SEQ ID NO:432), GHPO 305 (SEQ ID NO:434), GHPO 319 (SEQ ID
NO:436), GHPO 330 (SEQ ID NO:438), GHPO 340 (SEQ ID NO:440), GHPO 342 (SEQ ID NO:442), GHPO 344 (SEQ ID NO:444), GHPO 358 (SEQ ID NO:446), GHPO 373 (SEQ ID NO:448), GHPO 382 (SEQ ID NO:450), GHPO 384 (SEQ ID NO:452), GHPO 398 (SEQ ID NO:454), GHPO 409 (SEQ ID NO:456), GHPO 422 (SEQ ID NO:458), GHPO 430 (SEQ ID NO:460), GHPO 446 (SEQ ID NO:462), GHPO 447 (SEQ ID NO:464), GHPO 450 (SEQ ID NO:466), GHPO 451 (SEQ ID NO:468), GHPO 452 (SEQ ID NO:470), GHPO
456 (SEQ ID NO:472), GHPO 461 (SEQ ID NO:474), GHPO 476 (SEQ ID NO:476), GHPO 478 (SEQ ID NO:478), GHPO 491 (SEQ ID NO:480), GHPO 511 (SEQ ID NO:482), GHPO 519 (SEQ ID NO:484), GHPO 526 (SEQ ID NO:486), GHPO 534 (SEQ ID NO:488), GHPO 536 (SEQ ID NO:490), GHPO 542 (SEQ ID NO:492), GHPO 544 (SEQ ID NO:494), GHPO 576 (SEQ ID
NO:496), GHPO 578 (SEQ ID NO:498), GHPO 580 (SEQ ID NO:500), GHPO 585 (SEQ ID NO:502), GHPO 599 (SEQ ID NO:504), GHPO 639 (SEQ ID NO:506), GHPO 642 (SEQ ID NO:508), GHPO 647 (SEQ ID NO:510), GHPO 654 (SEQ ID NO:512), GHPO 669 (SEQ ID NO:514), GHPO 710 (SEQ ID NO:516), GHPO 713 (SEQ ID NO:518), GHPO 716 (SEQ ID NO:520), GHPO
718 (SEQ ID NO:522), GHPO 726 (SEQ ID NO:524), GHPO 734 (SEQ ID NO:526), GHPO 740 (SEQ ID NO:528), GHPO 770 (SEQ ID NO:530), GHPO 782 (SEQ ID NO:532), GHPO 786 (SEQ ID NO:534), GHPO 792 (SEQ ID NO:536), GHPO 797 (SEQ ID NO:538), GHPO 816 (SEQ ID NO:540), GHPO 828 (SEQ ID NO:542), GHPO 839 (SEQ ID NO:544), GHPO 840 (SEQ ID
NO:546), GHPO 842 (SEQ ID NO:548), GHPO 885 (SEQ ID NO:550), GHPO 889 (SEQ ID NO:552), GHPO 903 (SEQ ID NO:554), GHPO 912 (SEQ ID NO:556), GHPO 946 (SEQ ID NO:558), GHPO 958 (SEQ ID NO:560), GHPO 968 (SEQ ID NO:562), GHPO 987 (SEQ ID NO:564), GHPO 992 (SEQ ID NO:566), GHPO 996 (SEQ ID NO:568), GHPO 997 (SEQ ID NO:570), GHPO
1002 (SEQ ID NO:572), GHPO 1026 (SEQ ID NO:574), GHPO 1028 (SEQ ID NO:576), GHPO 1034 (SEQ ID NO:578), GHPO 1038 (SEQ ID NO:580), GHPO 1059 (SEQ ID NO:582), GHPO 1065 (SEQ ID NO:584), GHPO 1072 (SEQ ID NO:586), GHPO 1073 (SEQ ID NO:588), GHPO 1088 (SEQ ID NO:590), GHPO 1091 (SEQ ID NO:592), GHPO 1105 (SEQ ID NO:594), GHPO 1115 (SEQ ID NO:596), GHPO 1159 (SEQ ID NO:598), GHPO 1177 (SEQ ID NO:600), GHPO 1187 (SEQ ID NO:602), GHPO 1192 (SEQ ID
NO:604), GHPO 1195 (SEQ ID NO:606), GHPO 1224 (SEQ ID NO:608), GHPO 1225 (SEQ ID NO:610), GHPO 1228 (SEQ ID NO:612), GHPO 1229 (SEQ ID NO:614), GHPO 1231 (SEQ ID NO:616), GHPO 1236 (SEQ ID NO:618), GHPO 1242 (SEQ ID NO:620), GHPO 1248 (SEQ ID NO:622), GHPO 1270 (SEQ ID NO:624), GHPO 1271 (SEQ ID NO:626), GHPO 1298
(SEQ ID NO:628), GHPO 1301 (SEQ ID NO:630), GHPO 1304 (SEQ ID NO:632), GHPO 1315 (SEQ ID NO:634), GHPO 1319 (SEQ ID NO:636), GHPO 1323 (SEQ ID NO:638), GHPO 1331 (SEQ ID NO:640), GHPO 1332 (SEQ ID NO:642), GHPO 1347 (SEQ ID NO:644), GHPO 1373 (SEQ ID NO:646), GHPO 1376 (SEQ ID NO:648), GHPO 1380 (SEQ ID NO:650),
GHPO 1394 (SEQ ID NO:652), GHPO 1407 (SEQ ID NO:654), GHPO 1415 (SEQ ID NO:656), GHPO 1425 (SEQ ID NO:658), GHPO 1427 (SEQ ID NO:660), GHPO 1444 (SEQ ID NO:662), GHPO 1449 (SEQ ID NO:664), GHPO 1465 (SEQ ID NO:666), GHPO 1475 (SEQ ID NO:668), GHPO 1479 (SEQ ID NO:670), GHPO 1483 (SEQ ID NO:672), GHPO 1488 (SEQ ID
NO:674), GHPO 1496 (SEQ ID NO:676), GHPO 1524 (SEQ ID NO:678), GHPO 1536 (SEQ ID NO:680), GHPO 1539 (SEQ ID NO:682), GHPO 1540 (SEQ ID NO:684), GHPO 1542 (SEQ ID NO:686), GHPO 1555 (SEQ ID NO:688), GHPO 1560 (SEQ ID NO:690), GHPO 1564 (SEQ ID NO:692), GHPO 1570 (SEQ ID NO:694), GHPO 1588 (SEQ ID NO:696), GHPO 1604
(SEQ ID NO:698), GHPO 1605 (SEQ ID NO:700), GHPO 1619 (SEQ ID NO:702), GHPO 1629 (SEQ ID NO:704), GHPO 1642 (SEQ ID NO:706), GHPO 1654 (SEQ ID NO:708), GHPO 1661 (SEQ ID NO:710), GHPO 1673 (SEQ ID NO:712), GHPO 1687 (SEQ ID NO:714), GHPO 1692 (SEQ ID NO:716), GHPO 1693 (SEQ ID NO:718), GHPO 1699 (SEQ ID NO:720), GHPO 1738 (SEQ ID NO: 722), GHPO 1745 (SEQ ID NO: 724), GHPO 1746 (SEQ ID NO:726), GHPO 1754 (SEQ ID NO:728), GHPO 1792 (SEQ ID
NO:730), GHPO 1795 (SEQ ID NO:732), GHPO 1796 (SEQ ID NO:734), GHPO 7 (SEQ ID NO:736), GHPO 8 (SEQ ID NO:738), GHPO 9 (SEQ ID NO:740), GHPO 10 (SEQ ID NO:742), GHPO 12 (SEQ ID NO:744), GHPO 25 (SEQ ID NO:746), GHPO 27 (SEQ ID NO:748), GHPO 29 (SEQ ID NO:750), GHPO 30 (SEQ ID NO:752), GHPO 37 (SEQ ID NO:754), GHPO
49 (SEQ ID NO:756), GHPO 51 (SEQ ID NO:758), GHPO 54 (SEQ ID NO:760), GHPO 65 (SEQ ID NO:762), GHPO 66 (SEQ ID NO:764), GHPO 68 (SEQ ID NO:766), GHPO 70 (SEQ ID NO:768), GHPO 77 (SEQ ID NO:770), GHPO 83 (SEQ ID NO:772), GHPO 85 (SEQ ID NO:774), GHPO 87 (SEQ ID NO:776), GHPO 91 (SEQ ID NO:778), GHPO 92 (SEQ ID
NO:780), GHPO 96 (SEQ ID NO:782), GHPO 97 (SEQ ID NO:784), GHPO 111 (SEQ ID NO:786), GHPO 115 (SEQ ID NO:788), GHPO 117 (SEQ ID NO:790), GHPO 123 (SEQ ID NO:792), GHPO 124 (SEQ ID NO:794), GHPO 126 (SEQ ID NO:796), GHPO 127 (SEQ ID NO:798), GHPO 128 (SEQ ID NO:800), GHPO 131 (SEQ ID NO:802), GHPO 133 (SEQ ID NO:804), GHPO
140 (SEQ ID NO:806), GHPO 141 (SEQ ID NO:808), GHPO 145 (SEQ ID NO:810), GHPO 147 (SEQ ID NO:812), GHPO 166 (SEQ ID NO:814), GHPO 181 (SEQ ID NO:816), GHPO 187 (SEQ ID NO:818), GHPO 188 (SEQ ID NO:820), GHPO 192 (SEQ ID NO:822), GHPO 202 (SEQ ID NO:824), GHPO 204 (SEQ ID NO:826), GHPO 205 (SEQ ID NO:828), GHPO 212 (SEQ ID
NO:830), GHPO 218 (SEQ ID NO:832), GHPO 226 (SEQ ID NO:834), GHPO 231 (SEQ ID NO:836), GHPO 236 (SEQ ID NO:838), GHPO 239 (SEQ ID NO:840), GHPO 245 (SEQ ID NO:842), GHPO 246 (SEQ ID NO:844), GHPO 248 (SEQ ID NO:846), GHPO 253 (SEQ ID NO:848), GHPO 265 (SEQ ID NO:850), GHPO 266 (SEQ ID NO:852), GHPO 271 (SEQ ID NO:854), GHPO 272 (SEQ ID NO:856), GHPO 286 (SEQ ID NO:858), GHPO 291 (SEQ ID NO:860), GHPO 292 (SEQ ID NO:862), GHPO 297 (SEQ ID NO:864), GHPO
304 (SEQ ID NO:866), GHPO 307 (SEQ ID NO:868), GHPO 324 (SEQ ID NO:870), GHPO 326 (SEQ ID NO:872), GHPO 331 (SEQ ID NO:874), GHPO 343 (SEQ ID NO:876), GHPO 345 (SEQ ID NO:878), GHPO 346 (SEQ ID NO:880), GHPO 352 (SEQ ID NO:882), GHPO 355 (SEQ ID NO:884), GHPO 363 (SEQ ID NO:886), GHPO 369 (SEQ ID NO:888), GHPO 376 (SEQ ID
NO:890), GHPO 378 (SEQ ID NO:892), GHPO 388 (SEQ ID NO:894), GHPO 396 (SEQ ID NO:896), GHPO 403 (SEQ ID NO:898), GHPO 410 (SEQ ID NO:900), GHPO 415 (SEQ ID NO:902), GHPO 421 (SEQ ID NO:904), GHPO 439 (SEQ ID NO:906), GHPO 441 (SEQ ID NO:908), GHPO 443 (SEQ ID NO:910), GHPO 453 (SEQ ID NO:912), GHPO 455 (SEQ ID NO:914), GHPO
464 (SEQ ID NO:916), GHPO 467 (SEQ ID NO:918), GHPO 468 (SEQ ID NO:920), GHPO 470 (SEQ ID NO:922), GHPO 486 (SEQ ID NO:924), GHPO 487 (SEQ ID NO:926), GHPO 488 (SEQ ID NO:928), GHPO 489 (SEQ ID NO:930), GHPO 498 (SEQ ID NO:932), GHPO 501 (SEQ ID NO:934), GHPO 504 (SEQ ID NO:936), GHPO 512 (SEQ ID NO:938), GHPO 517 (SEQ ID
NO:940), GHPO 520 (SEQ ID NO:942), GHPO 528 (SEQ ID NO:944), GHPO 530 (SEQ ID NO:946), GHPO 532 (SEQ ID NO:948), GHPO 548 (SEQ ID NO:950), GHPO 561 (SEQ ID NO:952), GHPO 564 (SEQ ID NO:954), GHPO 572 (SEQ ID NO:956), GHPO 573 (SEQ ID NO:958), GHPO 574 (SEQ ID NO:960), GHPO 577 (SEQ ID NO:962), GHPO 579 (SEQ ID NO:964), GHPO
583 (SEQ ID NO:966), GHPO 588 (SEQ ID NO:968), GHPO 593 (SEQ ID NO:970), GHPO 597 (SEQ ID NO:972), GHPO 598 (SEQ ID NO:974), GHPO 604 (SEQ ID NO:976), GHPO 606 (SEQ ID NO:978), GHPO 611 (SEQ ID NO:980), GHPO 612 (SEQ ID NO:982), GHPO 615 (SEQ ID NO:984), GHPO 632 (SEQ ID NO:986), GHPO 633 (SEQ ID NO:988), GHPO 637 (SEQ ID NO:990), GHPO 651 (SEQ ID NO:992), GHPO 663 (SEQ ID NO:994), GHPO 686 (SEQ ID NO:996), GHPO 693 (SEQ ID NO:998), GHPO 698 (SEQ ID
NO: 1000), GHPO 703 (SEQ ID NO: 1002), GHPO 704 (SEQ ID NO: 1004), GHPO 705 (SEQ ID NO: 1006), GHPO 707 (SEQ ID NO: 1008), GHPO 721 (SEQ ID NO:1010), GHPO 727 (SEQ ID NO:1012), GHPO 728 (SEQ ID NO: 1014), GHPO 733 (SEQ ID NO: 1016), GHPO 758 (SEQ ID NO: 1018), GHPO 763 (SEQ ID NO: 1020), GHPO 771 (SEQ ID NO: 1022), GHPO 774
(SEQ ID NO: 1024), GHPO 776 (SEQ ID NO: 1026), GHPO 783 (SEQ ID NO: 1028), GHPO 800 (SEQ ID NO: 1030), GHPO 806 (SEQ ID NO: 1032), GHPO 807 (SEQ ID NO:1034), GHPO 808 (SEQ ID NO: 1036), GHPO 809 (SEQ ID NO: 1038), GHPO 811 (SEQ ID NO: 1040), GHPO 815 (SEQ ID NO: 1042), GHPO 819 (SEQ ID NO: 1044), GHPO 841 (SEQ ID NO:1046),
GHPO 843 (SEQ ID NO: 1048), GHPO 846 (SEQ ID NO: 1050), GHPO 875 (SEQ ID NO: 1052), GHPO 892 (SEQ ID NO: 1054), GHPO 902 (SEQ ID NO: 1056), GHPO 904 (SEQ ID NO: 1058), GHPO 906 (SEQ ID NO:1060), GHPO 908 (SEQ ID NO: 1062), GHPO 921 (SEQ ID NO: 1064), GHPO 923 (SEQ ID NO: 1066), GHPO 926 (SEQ ID NO: 1068), GHPO 933 (SEQ ID
NO: 1070), GHPO 939 (SEQ ID NO:1072), GHPO 940 (SEQ ID NO: 1074), GHPO 943 (SEQ ID NO:1076), GHPO 951 (SEQ ID NO:1078), GHPO 961 (SEQ ID NO: 1080), GHPO 965 (SEQ ID NO: 1082), GHPO 990 (SEQ ID NO: 1084), GHPO 991 (SEQ ID NO: 1086), GHPO 998 (SEQ ID NO: 1088), GHPO 1001 (SEQ ID NO: 1090), GHPO 1005 (SEQ ID NO: 1092), GHPO
1033 (SEQ ID NO: 1094), GHPO 1039 (SEQ ID NO: 1096), GHPO 1041 (SEQ ID NO: 1098), GHPO 1043 (SEQ ID NO: 1100), GHPO 1044 (SEQ ID NO: 1102), GHPO 1051 (SEQ ID NO: 1104), GHPO 1058 (SEQ ID NO: 1106), GHPO 1060 (SEQ ID NO: 1108), GHPO 1075 (SEQ ID NO: 1110), GHPO 1077 (SEQ ID NO: 1112), GHPO 1082 (SEQ ID NO: 1114), GHPO 1083 (SEQ ID NO: 1116), GHPO 1086 (SEQ ID NO: 1118), GHPO 1087 (SEQ ID NO: 1120), GHPO 1090 (SEQ ID NO: 1122), GHPO 1097 (SEQ ID NO: 1124),
GHPO 1098 (SEQ ID NO: 1126), GHPO 1103 (SEQ ID NO: 1128), GHPO 1113 (SEQ ID NO: 1130), GHPO 1116 (SEQ ID NO: 1132), GHPO 1123 (SEQ ID NO: 1134), GHPO 1125 (SEQ ID NO: 1136), GHPO 1129 (SEQ ID NO: 1138), GHPO 1130 (SEQ ID NO: 1140), GHPO 1134 (SEQ ID NO: 1142), GHPO 1161 (SEQ ID NO: 1144), GHPO 1166 (SEQ ID NO: 1146), GHPO
1170 (SEQ ID NO: l 148), GHPO 1175 (SEQ ID NO: 1150), GHPO 1181 (SEQ ID NO: 1152), GHPO 1186 (SEQ ID NO: 1154), GHPO 1188 (SEQ ID NO: 1156), GHPO 1191 (SEQ ID NO: 1158), GHPO 1193 (SEQ ID NO: 1160), GHPO 1196 (SEQ ID NO: 1162), GHPO 1204 (SEQ ID NO: 1164), GHPO 1210 (SEQ ID NO: 1166), GHPO 1211 (SEQ ID NO: 1168), GHPO 1216 (SEQ
ID NO: 1170), GHPO 1218 (SEQ ID NO:1172), GHPO 1220 (SEQ ID NO: 1174), GHPO 1223 (SEQ ID NO: 1176), GHPO 1226 (SEQ ID NO: 1178), GHPO 1240 (SEQ ID NO: 1180), GHPO 1246 (SEQ ID NO: 1182), GHPO 1251 (SEQ ID NO: 1184), GHPO 1252 (SEQ ID NO: 1186), GHPO 1261 (SEQ ID NO: 1188), GHPO 1265 (SEQ ID NO: 1190), GHPO 1267 (SEQ ID
NO: 1192), GHPO 1278 (SEQ ID NO: 1194), GHPO 1282 (SEQ ID NO: 1196), GHPO 1283 (SEQ ID NO: 1198), GHPO 1287 (SEQ ID NO: 1200), GHPO 1292 (SEQ ID NO:1202), GHPO 1293 (SEQ ID NO: 1204), GHPO 1302 (SEQ ID NO: 1206), GHPO 1309 (SEQ ID NO: 1208), GHPO 1317 (SEQ ID NO: 1210), GHPO 1318 (SEQ ID NO: 1212), GHPO 1321 (SEQ ID NO: 1214),
GHPO 1325 (SEQ ID NO: 1216), GHPO 1341 (SEQ ID NO: 1218), GHPO 1351 (SEQ ID NO: 1220), GHPO 1354 (SEQ ID NO: 1222), GHPO 1363 (SEQ ID NO: 1224), GHPO 1371 (SEQ ID NO: 1226), GHPO 1381 (SEQ ID NO: 1228), GHPO 1401 (SEQ ID NO: 1230), GHPO 1402 (SEQ ID NO: 1232), GHPO 1403 (SEQ ID NO: 1234), GHPO 1408 (SEQ ID NO: 1236), GHPO 1416 (SEQ ID NO:1238), GHPO 1420 (SEQ ID NO:1240), GHPO 1428 (SEQ ID NO: 1242), GHPO 1437 (SEQ ID NO: 1244), GHPO 1439 (SEQ ID
NO: 1246), GHPO 1460 (SEQ ID NO: 1248), GHPO 1463 (SEQ ID NO: 1250), GHPO 1472 (SEQ ID NO: 1252), GHPO 1474 (SEQ ID NO: 1254), GHPO 1484 (SEQ ID NO: 1256), GHPO 1489 (SEQ ID NO: 1258), GHPO 1494 (SEQ ID NO: 1260), GHPO 1495 (SEQ ID NO: 1262), GHPO 1498 (SEQ ID NO:1264), GHPO 1499 (SEQ ID NO:1266), GHPO 1500 (SEQ ID NO:1268),
GHPO 1503 (SEQ ID NO: 1270), GHPO 1504 (SEQ ID NO: 1272), GHPO 1510 (SEQ ID NO: 1274), GHPO 1518 (SEQ ID NO: 1276), GHPO 1533 (SEQ ID NO: 1278), GHPO 1541 (SEQ ID NO: 1280), GHPO 1544 (SEQ ID NO: 1282), GHPO 1548 (SEQ ID NO: 1284), GHPO 1565 (SEQ ID NO: 1286), GHPO 1575 (SEQ ID NO: 1288), GHPO 1582 (SEQ ID NO:1290), GHPO
1595 (SEQ ID NO: 1292), GHPO 1597 (SEQ ID NO: 1294), GHPO 1599 (SEQ ID NO:1296), GHPO 1601 (SEQ ID NO:1298), GHPO 1609 (SEQ ID NO: 1300), GHPO 1613 (SEQ ID NO: 1302), GHPO 1614 (SEQ ID NO: 1304), GHPO 1626 (SEQ ID NO: 1306), GHPO 1628 (SEQ ID NO: 1308), GHPO 1639 (SEQ ID NO:1310), GHPO 1640 (SEQ ID NO:1312), GHPO 1641 (SEQ
ID NO: 1314), GHPO 1646 (SEQ ID NO: 1316), GHPO 1662 (SEQ ID NO: 1318), GHPO 1667 (SEQ ID NO: 1320), GHPO 1668 (SEQ ID NO: 1322), GHPO 1670 (SEQ ID NO:1324), GHPO 1671 (SEQ ID NO:1326), GHPO 1672 (SEQ ID NO: 1328), GHPO 1678 (SEQ ID NO: 1330), GHPO 1684 (SEQ ID NO: 1332), GHPO 1695 (SEQ ID NO: 1334), GHPO 1697 (SEQ ID
NO:1336), GHPO 1701 (SEQ ID NO:1338), GHPO 1719 (SEQ ID NO:1340), GHPO 1723 (SEQ ID NO: 1342), GHPO 1732 (SEQ ID NO: 1344), GHPO 1739 (SEQ ID NO: 1346), GHPO 1741 (SEQ ID NO: 1348), GHPO 1747 (SEQ ID NO: 1350), GHPO 1749 (SEQ ID NO: 1352), GHPO 1750 (SEQ ID NO: 1354), GHPO 1751 (SEQ ID NO: 1356), GHPO 1755 (SEQ ID NO:1358), GHPO 1771 (SEQ ID NO: 1360), GHPO 1786 (SEQ ID NO: 1362), and GHPO 1789 (SEQ ID NO: 1364); or
(ii) a derivative of said Helicobacter polypeptide.
9. The compound of claim 8, which is the mature form or a derivative of a polypeptide encoded by a DNA molecule of claim 4.
10. The compound of claim 8, which is the mature form or a derivative of a polypeptide encoded by a DNA molecule of claim 5.
11. A pharmaceutical composition for preventing or treating Helicobacter infection in a mammal, said composition comprising a prophylactically or therapeutically effective amount of a compound of claim 8, 9, or 10 admixed with a physiologically acceptable diluent or carrier.
12. The composition of claim 11, further comprising an antibiotic, an antisecretory agent, a bismuth salt, or a combination thereof.
13. The composition of claim 12, wherein said antibiotic is selected from the group consisting of amoxicillin, clarithromycin, tetracycline, metronidizole, and erythromycin.
14. The composition of claim 12, wherein said bismuth salt is selected from the group consisting of bismuth subcitrate and bismuth subsalicylate.
15. The composition of claim 12, wherein said antisecretory agent is a proton pump inhibitor.
16. The composition of claim 15, wherein said proton pump inhibitor is selected from the group consisting of omeprazole, lansoprazole, and pantoprazole.
17. The composition of claim 12, wherein said antisecretory agent is an
H2-receptor antagonist.
18. The composition of claim 17, wherein said H2-receptor antagonist is selected from the group consisting of ranitidine, cimetidine, famotidine, nizatidine, and roxatidine.
19. The composition of claim 12, wherein said antisecretory agent is a prostaglandin analog.
20. The composition of claim 19, wherein said prostaglandin analog is misoprostil or enprostil.
21. The composition of claim 11 , further comprising a prophylactically or therapeutically effective amount of a second Helicobacter polypeptide or a derivative thereof.
22. The composition of claim 21, wherein the second Helicobacter polypeptide is a Helicobacter urease, or a subunit or a derivative thereof.
23. The composition of claim 11, further comprising an adjuvant.
24. A pharmaceutical composition for preventing or treating
Helicobacter infection in a mammal, said composition comprising a prophylactically or therapeutically effective amount of a polynucleotide of claim 1 or 2 admixed with a physiologically acceptable diluent or carrier.
25. A pharmaceutical composition for preventing or treating
Helicobacter infection in a mammal, said composition comprising a prophylactically or therapeutically effective amount of a polynucleotide of claim 4, 5, or 6 admixed with a physiologically acceptable diluent or carrier.
26. A pharmaceutical composition for preventing or treating
Helicobacter infection in a mammal, said composition comprising a prophylactically or therapeutically effective amount of a polynucleotide of claim 7 admixed with a physiologically acceptable diluent or carrier.
27. A composition comprising a viral vector, in the genome of which is inserted a DNA molecule of claim 3, said DNA molecule being placed under conditions for expression in a mammalian cell and said viral vector being admixed with a physiologically acceptable diluent or carrier.
28. The composition of claim 27, wherein said viral vector is a poxvirus.
29. A composition that comprises a bacterial vector comprising a DNA molecule of claim 3, said DNA molecule being placed under conditions for expression and said bacterial vector being admixed with a physiologically acceptable diluent or carrier.
30. The composition of claim 29, wherein said vector is selected from the group consisting of Shigella, Salmonella, Vibrio cholerae, Lactobacillus, Bacille bilie de Calmette-Guerin, and Streptococcus.
31. The composition of claim 24, wherein said polynucleotide is a DNA molecule that is inserted in a plasmid that is unable to replicate and to substantially integrate in a mammalian genome and is placed under conditions for expression in a mammalian cell.
32. An expression cassette comprising a DNA molecule of claim 3, said DNA molecule being placed under conditions for expression in a procaryotic or eucaryotic cell.
33. A process for producing a compound of claim 8, which comprises culturing a procaryotic or eucaryotic cell transformed or transfected with an expression cassette of claim 32, and recovering said compound from the cell culture.
34. A pharmaceutical composition for preventing or treating Helicobacter infection in a mammal, said composition comprising a prophylactically or therapeutically effective amount of an antibody that binds to the compound of claim 8, 9, or 10 admixed with a physiologically acceptable diluent or carrier.
PCT/US1998/006371 1997-04-01 1998-04-01 Identification of polynucleotides encoding novel helicobacter polypeptides in the helicobacter genome WO1998043478A1 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
EP98917972A EP0977482A4 (en) 1997-04-01 1998-04-01 Identification of polynucleotides encoding novel helicobacter polypeptides in the helicobacter genome
CA002286306A CA2286306A1 (en) 1997-04-01 1998-04-01 Identification of polynucleotides encoding novel helicobacter polypeptides in the helicobacter genome
KR1019997008969A KR20010005893A (en) 1997-04-01 1998-04-01 Identification of polynucleotides encoding novel helicobacter polypeptides in the helicobacter genome
JP54194798A JP2001527393A (en) 1997-04-01 1998-04-01 Identification of a polynucleotide encoding a novel Helicobacter polypeptide in the Helicobacter genome
AU70995/98A AU756010B2 (en) 1997-04-01 1998-04-01 Identification of polynucleotides encoding novel helicobacter polypeptides in the helicobacter genome
NZ338039A NZ338039A (en) 1997-04-01 1998-04-01 Identification of polynucleotides encoding helicobacter polypeptides in the helicobacter genome

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US83345797A 1997-04-01 1997-04-01
US88122797A 1997-06-24 1997-06-24
US90261597A 1997-07-29 1997-07-29
US08/881,227 1997-07-29
US08/902,615 1997-07-29
US08/833,457 1997-07-29

Publications (1)

Publication Number Publication Date
WO1998043478A1 true WO1998043478A1 (en) 1998-10-08

Family

ID=27420239

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1998/006371 WO1998043478A1 (en) 1997-04-01 1998-04-01 Identification of polynucleotides encoding novel helicobacter polypeptides in the helicobacter genome

Country Status (8)

Country Link
EP (1) EP0977482A4 (en)
JP (1) JP2001527393A (en)
KR (1) KR20010005893A (en)
CN (1) CN1263436A (en)
AU (1) AU756010B2 (en)
CA (1) CA2286306A1 (en)
NZ (1) NZ338039A (en)
WO (1) WO1998043478A1 (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000022135A1 (en) * 1998-10-15 2000-04-20 Chiron Behring Gmbh & Co. Helicobacter pylori vaccine
WO2000026383A1 (en) * 1998-11-04 2000-05-11 Governors Of The University Of Alberta Alpha 1,2-fucosyltransferase from helicobacter pylori
WO2000073502A2 (en) * 1999-05-31 2000-12-07 MAX-PLANCK-Gesellschaft zur Förderung der Wissenschaften e.V. Essential genes and gene products for identifying, developing and optimising immunological and pharmacological active ingredients for the treatment of microbial infections
WO2001008486A1 (en) * 1999-08-03 2001-02-08 Smithkline Beecham Corporation CoaA POLYPEPTIDES AND POLYNUCLEOTIDES AND METHODS THEREOF
WO2001029198A1 (en) * 1999-10-15 2001-04-26 Csl Limited Polypeptide fragments comprising c-terminal portion of helicobacter catalase
WO2001092336A1 (en) * 2000-05-29 2001-12-06 A+ Science Invest Ab Lactoferrin polypeptides from h. pylori and vaccine compositions thereof
WO2002000888A1 (en) * 2000-06-28 2002-01-03 National Research Council Of Canada Helicobacter pylori heptosyl transferase polypeptides
WO2002005845A1 (en) * 2000-07-05 2002-01-24 Merieux Oravax Immunological combinations for prophylaxis and therapy of helicobacter pylori infection
WO2003080654A3 (en) * 2002-03-26 2003-11-13 Ca Nat Research Council Helicobacter flagellar, motility polypeptides
WO2003102025A1 (en) * 2002-05-30 2003-12-11 Japan Science And Technology Agency Protein inducing cell death of helicobacter pylori
EP1426441A1 (en) * 2001-08-24 2004-06-09 Kyowa Hakko Kogyo Co., Ltd. Alpha-1,2-fucosyl transferase and dna encoding the same
WO2005023851A1 (en) * 2003-09-05 2005-03-17 Karolinska Innovations Ab Plasminogen/plasmin binding polypeptides and nucleic acids therefore
EP1278768B1 (en) * 2000-04-27 2006-11-15 Max-Planck-Gesellschaft zur Förderung der Wissenschaften e.V. Method for identifying helicobacter antigens
US20070178110A1 (en) * 2003-11-21 2007-08-02 Ace Biosciences A/S Surface-located campylobacter jejuni polypeptides
WO2017102779A1 (en) * 2015-12-14 2017-06-22 Technische Universität München Helicobacter pylori vaccines

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20030024129A (en) * 2001-09-17 2003-03-26 국윤호 Detection and identification of helicobacter pylori
KR101985335B1 (en) * 2017-11-01 2019-06-03 연세대학교 원주산학협력단 A method for simultaneously detecting and identifying Helicobacter pylori, and an antibiotics-resistance gene thereof, and a use therefor based PCR-Reverse blot hybridization assay
CN113717248B (en) * 2020-09-30 2022-07-08 广州派真生物技术有限公司 Adeno-associated virus mutant and application thereof

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB8928625D0 (en) * 1989-12-19 1990-02-21 3I Res Expl Ltd H.pylori dna probes

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
WHITE O., ET AL.: "THE COMPLETE GENOME SEQUENCE OF THE GASTRIC PATHOGEN HELICOBACTER PYLORI.", NATURE, NATURE PUBLISHING GROUP, UNITED KINGDOM, vol. 388., 7 August 1997 (1997-08-07), United Kingdom, pages 539 - 547., XP002910083, ISSN: 0028-0836, DOI: 10.1038/41448 *

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000022135A1 (en) * 1998-10-15 2000-04-20 Chiron Behring Gmbh & Co. Helicobacter pylori vaccine
WO2000026383A1 (en) * 1998-11-04 2000-05-11 Governors Of The University Of Alberta Alpha 1,2-fucosyltransferase from helicobacter pylori
US7094578B2 (en) 1998-11-04 2006-08-22 Governors Of The University Of Alberta α1,2 fucosyltransferase
US6238894B1 (en) 1998-11-04 2001-05-29 Diane Taylor α1,2 fucosyltransferase
US6670160B2 (en) 1998-11-04 2003-12-30 Governors Of The University Of Alberta α1,2-fucosyltransferase
WO2000073502A3 (en) * 1999-05-31 2002-10-03 Max Planck Gesellschaft Essential genes and gene products for identifying, developing and optimising immunological and pharmacological active ingredients for the treatment of microbial infections
WO2000073502A2 (en) * 1999-05-31 2000-12-07 MAX-PLANCK-Gesellschaft zur Förderung der Wissenschaften e.V. Essential genes and gene products for identifying, developing and optimising immunological and pharmacological active ingredients for the treatment of microbial infections
WO2001008486A1 (en) * 1999-08-03 2001-02-08 Smithkline Beecham Corporation CoaA POLYPEPTIDES AND POLYNUCLEOTIDES AND METHODS THEREOF
WO2001029198A1 (en) * 1999-10-15 2001-04-26 Csl Limited Polypeptide fragments comprising c-terminal portion of helicobacter catalase
US7786260B1 (en) 1999-10-15 2010-08-31 Csl Limited Polypeptide fragments comprising c terminal portion of helicobacter catalase
EP1278768B1 (en) * 2000-04-27 2006-11-15 Max-Planck-Gesellschaft zur Förderung der Wissenschaften e.V. Method for identifying helicobacter antigens
WO2001092336A1 (en) * 2000-05-29 2001-12-06 A+ Science Invest Ab Lactoferrin polypeptides from h. pylori and vaccine compositions thereof
WO2002000888A1 (en) * 2000-06-28 2002-01-03 National Research Council Of Canada Helicobacter pylori heptosyl transferase polypeptides
WO2002005845A1 (en) * 2000-07-05 2002-01-24 Merieux Oravax Immunological combinations for prophylaxis and therapy of helicobacter pylori infection
EP1426441A1 (en) * 2001-08-24 2004-06-09 Kyowa Hakko Kogyo Co., Ltd. Alpha-1,2-fucosyl transferase and dna encoding the same
EP1426441A4 (en) * 2001-08-24 2005-03-16 Kyowa Hakko Kogyo Kk Alpha-1,2-fucosyl transferase and dna encoding the same
WO2003080654A3 (en) * 2002-03-26 2003-11-13 Ca Nat Research Council Helicobacter flagellar, motility polypeptides
WO2003102025A1 (en) * 2002-05-30 2003-12-11 Japan Science And Technology Agency Protein inducing cell death of helicobacter pylori
WO2005023851A1 (en) * 2003-09-05 2005-03-17 Karolinska Innovations Ab Plasminogen/plasmin binding polypeptides and nucleic acids therefore
US20070178110A1 (en) * 2003-11-21 2007-08-02 Ace Biosciences A/S Surface-located campylobacter jejuni polypeptides
WO2017102779A1 (en) * 2015-12-14 2017-06-22 Technische Universität München Helicobacter pylori vaccines
US10828358B2 (en) 2015-12-14 2020-11-10 Technische Universität München Helicobacter pylori vaccines
AU2016374289B2 (en) * 2015-12-14 2023-08-03 MAX-PLANCK-Gesellschaft zur Förderung der Wissenschaften e.V. Helicobacter pylori vaccines

Also Published As

Publication number Publication date
EP0977482A4 (en) 2002-03-06
KR20010005893A (en) 2001-01-15
CA2286306A1 (en) 1998-10-08
AU7099598A (en) 1998-10-22
CN1263436A (en) 2000-08-16
JP2001527393A (en) 2001-12-25
AU756010B2 (en) 2003-01-02
NZ338039A (en) 2001-04-27
EP0977482A1 (en) 2000-02-09

Similar Documents

Publication Publication Date Title
AU756010B2 (en) Identification of polynucleotides encoding novel helicobacter polypeptides in the helicobacter genome
AU2016219667B2 (en) Antibacterial phage, phage peptides and methods of use thereof
AU754264B2 (en) Chlamydia trachomatis genomic sequence and polypeptides, fragments thereof and uses thereof, in particular for the diagnosis, prevention and treatment of infection
JPH10210986A (en) New procaryotic polynucleotide, polypeptide and use thereof
JPH09322781A (en) Staphylococcus aureus polynucleotide and sequence
PT1812058E (en) Chlamydia trachomatis antigens for vaccine and diagnostic use
EP1012157A1 (en) $i(BORRELIA BURGDORFERI) POLYNUCLEOTIDES AND SEQUENCES
AU726892B2 (en) Nucleic acid and amino acid sequences relating to helicobacter pylori and vaccine compositions thereof
WO1997037044A9 (en) Nucleic acid and amino acid sequences relating to helicobacter pylori and vaccine compositions thereof
KR20000069297A (en) Nucleic Acid and Amino Acid Sequences Relating to Helicobacter pylori and Vaccine Compositions thereof
US20040219585A1 (en) Nontypeable haemophilus influenzae virulence factors
AU750792B2 (en) 76 kDa, 32 kDa, and 50 kDa helicobacter polypeptides and corresponding polynucleotide molecules
AU735391B2 (en) Helicobacter polypeptides and corresponding polynucleotide molecules
US20030158396A1 (en) Identification of polynucleotides encoding novel helicobacter polypeptides in the helicobacter genome
US20030124141A1 (en) Helicobacter polypeptides and corresponding polynucleotide molecules
US20020115078A1 (en) Identification of polynucleotides encoding novel helicobacter polypeptides in the helicobacter genome
US20020160456A1 (en) Identification of polynucleotides encoding novel helicobacter polypeptides in the helicobacter genome
US20030023066A1 (en) Helicobacter polypeptides and corresponding polynucleotide molecules
US20020044949A1 (en) 76 kda helicobacter polypeptides and corresponding polynucleotide molecules
US20020026035A1 (en) Helicobacter ghpo 1360 and ghpo 750 polypeptides and corresponding polynucleotide molecules
US20030069404A1 (en) Helicobacter antigens and corresponding DNA fragments
JP2001503637A (en) Helicobacter polypeptides and corresponding polynucleotide molecules
AU710880B2 (en) Nucleic acid and amino acid sequences relating to helicobacter pylori for diagnostics and therapeutics
AU3796099A (en) Assays using nucleic acid and amino acid sequences relating to helicobacter pylori
MXPA99004890A (en) Nucleic acid and amino acid sequences relating to helicobacter pylori

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 98805612.7

Country of ref document: CN

AK Designated states

Kind code of ref document: A1

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE GH GM GW HU ID IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US US US UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 338039

Country of ref document: NZ

WWE Wipo information: entry into national phase

Ref document number: 1019997008969

Country of ref document: KR

ENP Entry into the national phase

Ref document number: 2286306

Country of ref document: CA

Ref document number: 2286306

Country of ref document: CA

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 70995/98

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 1998917972

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWP Wipo information: published in national office

Ref document number: 1998917972

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1019997008969

Country of ref document: KR

WWG Wipo information: grant in national office

Ref document number: 70995/98

Country of ref document: AU

WWW Wipo information: withdrawn in national office

Ref document number: 1998917972

Country of ref document: EP

WWR Wipo information: refused in national office

Ref document number: 1019997008969

Country of ref document: KR