CN101087882A - PUFA-PKS genes from Ulkenia - Google Patents
PUFA-PKS genes from Ulkenia
- Publication number
- CN101087882A CNA2005800188787A CN200580018878A
- Authority
- CN
- China
- Prior art keywords
- ala
- val
- leu
- glu
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000623 proteins and genes Proteins 0.000 title abstract description 43
- 241001491678 Ulkenia Species 0.000 title description 91
- 235000020777 polyunsaturated fatty acids Nutrition 0.000 claims abstract description 54
- 238000004519 manufacturing process Methods 0.000 claims abstract description 16
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 7
- 230000002255 enzymatic effect Effects 0.000 claims abstract description 3
- 108020004414 DNA Proteins 0.000 claims description 74
- 150000001413 amino acids Chemical class 0.000 claims description 27
- 235000001014 amino acid Nutrition 0.000 claims description 18
- 230000014509 gene expression Effects 0.000 claims description 16
- 101000787132 Acidithiobacillus ferridurans Uncharacterized 8.2 kDa protein in mobL 3'region Proteins 0.000 claims description 15
- 101000827262 Acidithiobacillus ferrooxidans Uncharacterized 18.9 kDa protein in mobE 3'region Proteins 0.000 claims description 15
- 101000811747 Antithamnion sp. UPF0051 protein in atpA 3'region Proteins 0.000 claims description 15
- 101000827607 Bacillus phage SPP1 Uncharacterized 8.5 kDa protein in GP2-GP6 intergenic region Proteins 0.000 claims description 15
- 101000961975 Bacillus thuringiensis Uncharacterized 13.4 kDa protein Proteins 0.000 claims description 15
- 101000964407 Caldicellulosiruptor saccharolyticus Uncharacterized 10.7 kDa protein in xynB 3'region Proteins 0.000 claims description 15
- 101000768777 Haloferax lucentense (strain DSM 14919 / JCM 9276 / NCIMB 13854 / Aa 2.2) Uncharacterized 50.6 kDa protein in the 5'region of gyrA and gyrB Proteins 0.000 claims description 15
- 101000607404 Infectious laryngotracheitis virus (strain Thorne V882) Protein UL24 homolog Proteins 0.000 claims description 15
- 101000735632 Klebsiella pneumoniae Uncharacterized 8.8 kDa protein in aacA4 3'region Proteins 0.000 claims description 15
- 101000818100 Spirochaeta aurantia Uncharacterized 12.7 kDa protein in trpE 5'region Proteins 0.000 claims description 15
- 101001037658 Streptomyces coelicolor (strain ATCC BAA-471 / A3(2) / M145) Glucokinase Proteins 0.000 claims description 15
- -1 ketone compounds Chemical class 0.000 claims description 15
- 101000977023 Azospirillum brasilense Uncharacterized 17.8 kDa protein in nodG 5'region Proteins 0.000 claims description 14
- 101000961984 Bacillus thuringiensis Uncharacterized 30.3 kDa protein Proteins 0.000 claims description 14
- 101000644901 Drosophila melanogaster Putative 115 kDa protein in type-1 retrotransposable element R1DM Proteins 0.000 claims description 14
- 101000747702 Enterobacteria phage N4 Uncharacterized protein Gp2 Proteins 0.000 claims description 14
- 101000758599 Escherichia coli Uncharacterized 14.7 kDa protein Proteins 0.000 claims description 14
- 101000768930 Lactococcus lactis subsp. cremoris Uncharacterized protein in pepC 5'region Proteins 0.000 claims description 14
- 101000976302 Leptospira interrogans Uncharacterized protein in sph 3'region Proteins 0.000 claims description 14
- 101000778886 Leptospira interrogans serogroup Icterohaemorrhagiae serovar Lai (strain 56601) Uncharacterized protein LA_2151 Proteins 0.000 claims description 14
- 101001121571 Rice tungro bacilliform virus (isolate Philippines) Protein P2 Proteins 0.000 claims description 14
- 101000818098 Spirochaeta aurantia Uncharacterized protein in trpE 3'region Proteins 0.000 claims description 14
- 101001026590 Streptomyces cinnamonensis Putative polyketide beta-ketoacyl synthase 2 Proteins 0.000 claims description 14
- 101000750896 Synechococcus elongatus (strain PCC 7942 / FACHB-805) Uncharacterized protein Synpcc7942_2318 Proteins 0.000 claims description 14
- 101000916321 Xenopus laevis Transposon TX1 uncharacterized 149 kDa protein Proteins 0.000 claims description 14
- 101000760088 Zymomonas mobilis subsp. mobilis (strain ATCC 10988 / DSM 424 / LMG 404 / NCIMB 8938 / NRRL B-806 / ZM1) 20.9 kDa protein Proteins 0.000 claims description 14
- 101000666833 Autographa californica nuclear polyhedrosis virus Uncharacterized 20.8 kDa protein in FGF-VUBI intergenic region Proteins 0.000 claims description 13
- 101000977027 Azospirillum brasilense Uncharacterized protein in nodG 5'region Proteins 0.000 claims description 13
- 101000962005 Bacillus thuringiensis Uncharacterized 23.6 kDa protein Proteins 0.000 claims description 13
- 101000785191 Drosophila melanogaster Uncharacterized 50 kDa protein in type I retrotransposable element R1DM Proteins 0.000 claims description 13
- 101000747704 Enterobacteria phage N4 Uncharacterized protein Gp1 Proteins 0.000 claims description 13
- 101000861206 Enterococcus faecalis (strain ATCC 700802 / V583) Uncharacterized protein EF_A0048 Proteins 0.000 claims description 13
- 101000769180 Escherichia coli Uncharacterized 11.1 kDa protein Proteins 0.000 claims description 13
- 101000976301 Leptospira interrogans Uncharacterized 35 kDa protein in sph 3'region Proteins 0.000 claims description 13
- 101000658690 Neisseria meningitidis serogroup B Transposase for insertion sequence element IS1106 Proteins 0.000 claims description 13
- 101000748660 Pseudomonas savastanoi Uncharacterized 21 kDa protein in iaaL 5'region Proteins 0.000 claims description 13
- 101000584469 Rice tungro bacilliform virus (isolate Philippines) Protein P1 Proteins 0.000 claims description 13
- 101000818096 Spirochaeta aurantia Uncharacterized 15.5 kDa protein in trpE 3'region Proteins 0.000 claims description 13
- 101000766081 Streptomyces ambofaciens Uncharacterized HTH-type transcriptional regulator in unstable DNA locus Proteins 0.000 claims description 13
- 101000804403 Synechococcus elongatus (strain PCC 7942 / FACHB-805) Uncharacterized HIT-like protein Synpcc7942_1390 Proteins 0.000 claims description 13
- 101000750910 Synechococcus elongatus (strain PCC 7942 / FACHB-805) Uncharacterized HTH-type transcriptional regulator Synpcc7942_2319 Proteins 0.000 claims description 13
- 101000644897 Synechococcus sp. (strain ATCC 27264 / PCC 7002 / PR-6) Uncharacterized protein SYNPCC7002_B0001 Proteins 0.000 claims description 13
- 101000916336 Xenopus laevis Transposon TX1 uncharacterized 82 kDa protein Proteins 0.000 claims description 13
- 101001000760 Zea mays Putative Pol polyprotein from transposon element Bs1 Proteins 0.000 claims description 13
- 101000678262 Zymomonas mobilis subsp. mobilis (strain ATCC 10988 / DSM 424 / LMG 404 / NCIMB 8938 / NRRL B-806 / ZM1) 65 kDa protein Proteins 0.000 claims description 13
- 238000000034 method Methods 0.000 claims description 12
- 239000002773 nucleotide Substances 0.000 claims description 11
- 125000003729 nucleotide group Chemical group 0.000 claims description 11
- 230000004071 biological effect Effects 0.000 claims description 8
- 150000007523 nucleic acids Chemical class 0.000 claims description 7
- 108020004707 nucleic acids Proteins 0.000 claims description 6
- 102000039446 nucleic acids Human genes 0.000 claims description 6
- 108020004511 Recombinant DNA Proteins 0.000 claims description 5
- 102000053602 DNA Human genes 0.000 claims description 3
- 230000008521 reorganization Effects 0.000 claims description 3
- 239000002299 complementary DNA Substances 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 7
- TVEXGJYMHHTVKP-UHFFFAOYSA-N 6-oxabicyclo[3.2.1]oct-3-en-7-one Chemical compound C1C2C(=O)OC1C=CC2 TVEXGJYMHHTVKP-UHFFFAOYSA-N 0.000 claims 1
- 108020004635 Complementary DNA Proteins 0.000 claims 1
- 238000010804 cDNA synthesis Methods 0.000 claims 1
- 238000010276 construction Methods 0.000 claims 1
- 108010030975 Polyketide Synthases Proteins 0.000 abstract description 5
- 230000009261 transgenic effect Effects 0.000 abstract description 3
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical group SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 241
- 102220023258 rs387907548 Human genes 0.000 description 93
- 102220369447 c.1352G>A Human genes 0.000 description 91
- 102220023257 rs387907546 Human genes 0.000 description 73
- 241001298226 Ulkenia sp. Species 0.000 description 33
- 102220369445 c.668T>C Human genes 0.000 description 32
- MBMBGCFOFBJSGT-KUBAVDMBSA-N docosahexaenoic acid Natural products CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCC(O)=O MBMBGCFOFBJSGT-KUBAVDMBSA-N 0.000 description 30
- 102220023256 rs387907547 Human genes 0.000 description 30
- DVSZKTAMJJTWFG-UHFFFAOYSA-N docosa-2,4,6,8,10,12-hexaenoic acid Chemical group CCCCCCCCCC=CC=CC=CC=CC=CC=CC(O)=O DVSZKTAMJJTWFG-UHFFFAOYSA-N 0.000 description 28
- 235000020673 eicosapentaenoic acid Nutrition 0.000 description 23
- 108700026244 Open Reading Frames Proteins 0.000 description 21
- 101710146995 Acyl carrier protein Proteins 0.000 description 20
- 102000004190 Enzymes Human genes 0.000 description 19
- 108090000790 Enzymes Proteins 0.000 description 19
- 241000233671 Schizochytrium Species 0.000 description 16
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 15
- 150000002632 lipids Chemical class 0.000 description 15
- 239000013612 plasmid Substances 0.000 description 15
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 14
- 239000002253 acid Substances 0.000 description 14
- YZXBAPSDXZZRGB-DOFZRALJSA-N arachidonic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YZXBAPSDXZZRGB-DOFZRALJSA-N 0.000 description 14
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 13
- 210000004027 cell Anatomy 0.000 description 13
- 241000196324 Embryophyta Species 0.000 description 12
- 241000894006 Bacteria Species 0.000 description 11
- 238000006243 chemical reaction Methods 0.000 description 11
- 244000005700 microbiome Species 0.000 description 11
- 108090000364 Ligases Proteins 0.000 description 10
- 102000003960 Ligases Human genes 0.000 description 10
- 239000004927 clay Substances 0.000 description 10
- 239000012634 fragment Substances 0.000 description 10
- 229930194542 Keto Natural products 0.000 description 9
- 108091034117 Oligonucleotide Proteins 0.000 description 9
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 8
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 8
- JAZBEHYOTPTENJ-JLNKQSITSA-N all-cis-5,8,11,14,17-icosapentaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O JAZBEHYOTPTENJ-JLNKQSITSA-N 0.000 description 8
- 230000001276 controlling effect Effects 0.000 description 8
- 229960005135 eicosapentaenoic acid Drugs 0.000 description 8
- 239000003921 oil Substances 0.000 description 8
- 235000019198 oils Nutrition 0.000 description 8
- OYHQOLUKZRVURQ-HZJYTTRNSA-N Linoleic acid Chemical compound CCCCC\C=C/C\C=C/CCCCCCCC(O)=O OYHQOLUKZRVURQ-HZJYTTRNSA-N 0.000 description 7
- 108020005038 Terminator Codon Proteins 0.000 description 7
- HXWJFEZDFPRLBG-UHFFFAOYSA-N Timnodonic acid Natural products CCCC=CC=CCC=CCC=CCC=CCCCC(O)=O HXWJFEZDFPRLBG-UHFFFAOYSA-N 0.000 description 7
- 235000021342 arachidonic acid Nutrition 0.000 description 7
- 229940114079 arachidonic acid Drugs 0.000 description 7
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 7
- 229960004232 linoleic acid Drugs 0.000 description 7
- 241000251468 Actinopterygii Species 0.000 description 6
- 241000233866 Fungi Species 0.000 description 6
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 6
- 101001014220 Monascus pilosus Dehydrogenase mokE Proteins 0.000 description 6
- 241000294598 Moritella marina Species 0.000 description 6
- 101000573542 Penicillium citrinum Compactin nonaketide synthase, enoyl reductase component Proteins 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 235000019688 fish Nutrition 0.000 description 6
- 230000000968 intestinal effect Effects 0.000 description 6
- 241000588724 Escherichia coli Species 0.000 description 5
- 102000004867 Hydro-Lyases Human genes 0.000 description 5
- 108090001042 Hydro-Lyases Proteins 0.000 description 5
- 102000004195 Isomerases Human genes 0.000 description 5
- 108090000769 Isomerases Proteins 0.000 description 5
- 241001465754 Metazoa Species 0.000 description 5
- 108091081024 Start codon Proteins 0.000 description 5
- 241000700605 Viruses Species 0.000 description 5
- 238000006555 catalytic reaction Methods 0.000 description 5
- 239000000194 fatty acid Substances 0.000 description 5
- 235000020660 omega-3 fatty acid Nutrition 0.000 description 5
- 238000012546 transfer Methods 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 5
- 241000351920 Aspergillus nidulans Species 0.000 description 4
- 101000584877 Clostridium pasteurianum Putative peroxiredoxin in rubredoxin operon Proteins 0.000 description 4
- 108020004705 Codon Proteins 0.000 description 4
- JDMUPRLRUUMCTL-VIFPVBQESA-N D-pantetheine 4'-phosphate Chemical group OP(=O)(O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS JDMUPRLRUUMCTL-VIFPVBQESA-N 0.000 description 4
- 101000618323 Enterobacteria phage T4 Uncharacterized 7.3 kDa protein in mobB-Gp55 intergenic region Proteins 0.000 description 4
- 241000206602 Eukaryota Species 0.000 description 4
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 4
- 101001056912 Saccharopolyspora erythraea 6-deoxyerythronolide-B synthase EryA1, modules 1 and 2 Proteins 0.000 description 4
- 101001056914 Saccharopolyspora erythraea 6-deoxyerythronolide-B synthase EryA3, modules 5 and 6 Proteins 0.000 description 4
- 241000863430 Shewanella Species 0.000 description 4
- 241000490596 Shewanella sp. Species 0.000 description 4
- 101000819251 Staphylococcus aureus Uncharacterized protein in ileS 3'region Proteins 0.000 description 4
- 241000723873 Tobacco mosaic virus Species 0.000 description 4
- 230000008827 biological function Effects 0.000 description 4
- 102220369446 c.1274G>A Human genes 0.000 description 4
- 235000014113 dietary fatty acids Nutrition 0.000 description 4
- 229930195729 fatty acid Natural products 0.000 description 4
- 108091008053 gene clusters Proteins 0.000 description 4
- 235000020978 long-chain polyunsaturated fatty acids Nutrition 0.000 description 4
- 102220004457 rs11567847 Human genes 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 108700016155 Acyl transferases Proteins 0.000 description 3
- 102000057234 Acyl transferases Human genes 0.000 description 3
- 241000195619 Euglena gracilis Species 0.000 description 3
- 101000833492 Homo sapiens Jouberin Proteins 0.000 description 3
- 101001122476 Homo sapiens Mu-type opioid receptor Proteins 0.000 description 3
- 101000651236 Homo sapiens NCK-interacting protein with SH3 domain Proteins 0.000 description 3
- 102100024407 Jouberin Human genes 0.000 description 3
- 102100028647 Mu-type opioid receptor Human genes 0.000 description 3
- 101100131043 Oryza sativa subsp. japonica MOF1 gene Proteins 0.000 description 3
- 241001208362 Photobacterium profundum SS9 Species 0.000 description 3
- 241001491289 Schizochytrium sp. ATCC 20888 Species 0.000 description 3
- 241000233675 Thraustochytrium Species 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 230000004087 circulation Effects 0.000 description 3
- 238000005336 cracking Methods 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 239000012153 distilled water Substances 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 150000004665 fatty acids Chemical class 0.000 description 3
- 235000021323 fish oil Nutrition 0.000 description 3
- 235000003869 genetically modified organism Nutrition 0.000 description 3
- 238000005304 joining Methods 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 238000002156 mixing Methods 0.000 description 3
- 108090000765 processed proteins & peptides Proteins 0.000 description 3
- 235000018102 proteins Nutrition 0.000 description 3
- 102000004169 proteins and genes Human genes 0.000 description 3
- 238000006722 reduction reaction Methods 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 101000768957 Acholeplasma phage L2 Uncharacterized 37.2 kDa protein Proteins 0.000 description 2
- 101000823746 Acidianus ambivalens Uncharacterized 17.7 kDa protein in bps2 3'region Proteins 0.000 description 2
- 101000916369 Acidianus ambivalens Uncharacterized protein in sor 5'region Proteins 0.000 description 2
- 101000769342 Acinetobacter guillouiae Uncharacterized protein in rpoN-murA intergenic region Proteins 0.000 description 2
- 101000823696 Actinobacillus pleuropneumoniae Uncharacterized glycosyltransferase in aroQ 3'region Proteins 0.000 description 2
- 101000786513 Agrobacterium tumefaciens (strain 15955) Uncharacterized protein outside the virF region Proteins 0.000 description 2
- 101000618005 Alkalihalobacillus pseudofirmus (strain ATCC BAA-2126 / JCM 17055 / OF4) Uncharacterized protein BpOF4_00885 Proteins 0.000 description 2
- 101000645498 Alkalihalobacillus pseudofirmus (strain ATCC BAA-2126 / JCM 17055 / OF4) Uncharacterized protein BpOF4_10220 Proteins 0.000 description 2
- 102100020724 Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Human genes 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 101000967489 Azorhizobium caulinodans (strain ATCC 43989 / DSM 5975 / JCM 20966 / LMG 6465 / NBRC 14845 / NCIMB 13405 / ORS 571) Uncharacterized protein AZC_3924 Proteins 0.000 description 2
- 101000823761 Bacillus licheniformis Uncharacterized 9.4 kDa protein in flaL 3'region Proteins 0.000 description 2
- 101000819719 Bacillus methanolicus Uncharacterized N-acetyltransferase in lysA 3'region Proteins 0.000 description 2
- 101000789586 Bacillus subtilis (strain 168) UPF0702 transmembrane protein YkjA Proteins 0.000 description 2
- 101000792624 Bacillus subtilis (strain 168) Uncharacterized protein YbxH Proteins 0.000 description 2
- 101000790792 Bacillus subtilis (strain 168) Uncharacterized protein YckC Proteins 0.000 description 2
- 101000819705 Bacillus subtilis (strain 168) Uncharacterized protein YlxR Proteins 0.000 description 2
- 101000948218 Bacillus subtilis (strain 168) Uncharacterized protein YtxJ Proteins 0.000 description 2
- 101000718627 Bacillus thuringiensis subsp. kurstaki Putative RNA polymerase sigma-G factor Proteins 0.000 description 2
- 101000641200 Bombyx mori densovirus Putative non-structural protein Proteins 0.000 description 2
- 208000017667 Chronic Disease Diseases 0.000 description 2
- 101000947633 Claviceps purpurea Uncharacterized 13.8 kDa protein Proteins 0.000 description 2
- 101001132313 Clostridium pasteurianum 34.2 kDa protein in rubredoxin operon Proteins 0.000 description 2
- 241000199914 Dinophyceae Species 0.000 description 2
- 101000618325 Enterobacteria phage T4 Uncharacterized 12.4 kDa protein in mobB-Gp55 intergenic region Proteins 0.000 description 2
- 101000948901 Enterobacteria phage T4 Uncharacterized 16.0 kDa protein in segB-ipI intergenic region Proteins 0.000 description 2
- 101000653284 Enterobacteria phage T4 Uncharacterized 9.4 kDa protein in Gp31-cd intergenic region Proteins 0.000 description 2
- 101000805958 Equine herpesvirus 4 (strain 1942) Virion protein US10 homolog Proteins 0.000 description 2
- 101000790442 Escherichia coli Insertion element IS2 uncharacterized 11.1 kDa protein Proteins 0.000 description 2
- 101000788354 Escherichia phage P2 Uncharacterized 8.2 kDa protein in gpA 5'region Proteins 0.000 description 2
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 2
- 241000195620 Euglena Species 0.000 description 2
- 101000770304 Frankia alni UPF0460 protein in nifX-nifW intergenic region Proteins 0.000 description 2
- 101000797344 Geobacillus stearothermophilus Putative tRNA (cytidine(34)-2'-O)-methyltransferase Proteins 0.000 description 2
- 101000748410 Geobacillus stearothermophilus Uncharacterized protein in fumA 3'region Proteins 0.000 description 2
- 101000772675 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) UPF0438 protein HI_0847 Proteins 0.000 description 2
- 101000631019 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) Uncharacterized protein HI_0350 Proteins 0.000 description 2
- 101000768938 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 8.9 kDa protein in int-C1 intergenic region Proteins 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 101000785414 Homo sapiens Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Proteins 0.000 description 2
- 101000782488 Junonia coenia densovirus (isolate pBRJ/1990) Putative non-structural protein NS2 Proteins 0.000 description 2
- 101000811523 Klebsiella pneumoniae Uncharacterized 55.8 kDa protein in cps region Proteins 0.000 description 2
- 101000818409 Lactococcus lactis subsp. lactis Uncharacterized HTH-type transcriptional regulator in lacX 3'region Proteins 0.000 description 2
- 101001110310 Lentilactobacillus kefiri NADP-dependent (R)-specific alcohol dehydrogenase Proteins 0.000 description 2
- 101000878851 Leptolyngbya boryana Putative Fe(2+) transport protein A Proteins 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 101000758828 Methanosarcina barkeri (strain Fusaro / DSM 804) Uncharacterized protein Mbar_A1602 Proteins 0.000 description 2
- 101001122401 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF3 Proteins 0.000 description 2
- 241000592260 Moritella Species 0.000 description 2
- 101001055788 Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) Pentapeptide repeat protein MfpA Proteins 0.000 description 2
- 101000740670 Orgyia pseudotsugata multicapsid polyhedrosis virus Protein C42 Proteins 0.000 description 2
- 241000607568 Photobacterium Species 0.000 description 2
- 101000769182 Photorhabdus luminescens Uncharacterized protein in pnp 3'region Proteins 0.000 description 2
- 101710159752 Poly(3-hydroxyalkanoate) polymerase subunit PhaE Proteins 0.000 description 2
- 101710130262 Probable Vpr-like protein Proteins 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 101000961392 Pseudescherichia vulneris Uncharacterized 29.9 kDa protein in crtE 3'region Proteins 0.000 description 2
- 101000731030 Pseudomonas oleovorans Poly(3-hydroxyalkanoate) polymerase 2 Proteins 0.000 description 2
- 101001065485 Pseudomonas putida Probable fatty acid methyltransferase Proteins 0.000 description 2
- 101000758676 Pyrococcus woesei Uncharacterized 24.7 kDa protein in gap 5'region Proteins 0.000 description 2
- 101000711023 Rhizobium leguminosarum bv. trifolii Uncharacterized protein in tfuA 3'region Proteins 0.000 description 2
- 101000948156 Rhodococcus erythropolis Uncharacterized 47.3 kDa protein in thcA 5'region Proteins 0.000 description 2
- 101000917565 Rhodococcus fascians Uncharacterized 33.6 kDa protein in fasciation locus Proteins 0.000 description 2
- 101000790284 Saimiriine herpesvirus 2 (strain 488) Uncharacterized 9.5 kDa protein in DHFR 3'region Proteins 0.000 description 2
- 102100029437 Serine/threonine-protein kinase A-Raf Human genes 0.000 description 2
- 102100035254 Sodium- and chloride-dependent GABA transporter 3 Human genes 0.000 description 2
- 101710104417 Sodium- and chloride-dependent GABA transporter 3 Proteins 0.000 description 2
- 101000936719 Streptococcus gordonii Accessory Sec system protein Asp3 Proteins 0.000 description 2
- 101000788499 Streptomyces coelicolor Uncharacterized oxidoreductase in mprA 5'region Proteins 0.000 description 2
- 101001102841 Streptomyces griseus Purine nucleoside phosphorylase ORF3 Proteins 0.000 description 2
- 101000708557 Streptomyces lincolnensis Uncharacterized 17.2 kDa protein in melC2-rnhH intergenic region Proteins 0.000 description 2
- 101000691656 Streptomyces venezuelae Narbonolide/10-deoxymethynolide synthase PikA1, modules 1 and 2 Proteins 0.000 description 2
- 101000691655 Streptomyces venezuelae Narbonolide/10-deoxymethynolide synthase PikA2, modules 3 and 4 Proteins 0.000 description 2
- 101000691658 Streptomyces venezuelae Narbonolide/10-deoxymethynolide synthase PikA3, module 5 Proteins 0.000 description 2
- 101001125873 Streptomyces venezuelae Narbonolide/10-deoxymethynolide synthase PikA4, module 6 Proteins 0.000 description 2
- 101000649826 Thermotoga neapolitana Putative anti-sigma factor antagonist TM1081 homolog Proteins 0.000 description 2
- 241000607598 Vibrio Species 0.000 description 2
- 101000827562 Vibrio alginolyticus Uncharacterized protein in proC 3'region Proteins 0.000 description 2
- 101000778915 Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633) Uncharacterized membrane protein VP2115 Proteins 0.000 description 2
- 125000002252 acyl group Chemical group 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000003570 biosynthesizing effect Effects 0.000 description 2
- 230000001851 biosynthetic effect Effects 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 210000003763 chloroplast Anatomy 0.000 description 2
- 239000013611 chromosomal DNA Substances 0.000 description 2
- 238000006482 condensation reaction Methods 0.000 description 2
- 238000013016 damping Methods 0.000 description 2
- 230000009849 deactivation Effects 0.000 description 2
- 230000030609 dephosphorylation Effects 0.000 description 2
- 238000006209 dephosphorylation reaction Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 239000003085 diluting agent Substances 0.000 description 2
- 235000004626 essential fatty acids Nutrition 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 235000013305 food Nutrition 0.000 description 2
- 235000019867 fractionated palm kernel oil Nutrition 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- VZCCETWTMQHEPK-QNEBEIHSSA-N gamma-linolenic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/CCCCC(O)=O VZCCETWTMQHEPK-QNEBEIHSSA-N 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- PHTQWCKDNZKARW-UHFFFAOYSA-N isoamylol Chemical compound CC(C)CCO PHTQWCKDNZKARW-UHFFFAOYSA-N 0.000 description 2
- 150000002576 ketones Chemical class 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 235000016709 nutrition Nutrition 0.000 description 2
- 230000035764 nutrition Effects 0.000 description 2
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 230000037452 priming Effects 0.000 description 2
- 230000009465 prokaryotic expression Effects 0.000 description 2
- 239000011541 reaction mixture Substances 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 238000004062 sedimentation Methods 0.000 description 2
- 238000010008 shearing Methods 0.000 description 2
- 238000003756 stirring Methods 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 2
- XSXIVVZCUAHUJO-AVQMFFATSA-N (11e,14e)-icosa-11,14-dienoic acid Chemical compound CCCCC\C=C\C\C=C\CCCCCCCCCC(O)=O XSXIVVZCUAHUJO-AVQMFFATSA-N 0.000 description 1
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- NWXMGUDVXFXRIG-WESIUVDSSA-N (4s,4as,5as,6s,12ar)-4-(dimethylamino)-1,6,10,11,12a-pentahydroxy-6-methyl-3,12-dioxo-4,4a,5,5a-tetrahydrotetracene-2-carboxamide Chemical compound C1=CC=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(=O)C(C(N)=O)=C(O)[C@@]4(O)C(=O)C3=C(O)C2=C1O NWXMGUDVXFXRIG-WESIUVDSSA-N 0.000 description 1
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 1
- GZCWLCBFPRFLKL-UHFFFAOYSA-N 1-prop-2-ynoxypropan-2-ol Chemical compound CC(O)COCC#C GZCWLCBFPRFLKL-UHFFFAOYSA-N 0.000 description 1
- FFRBMBIXVSCUFS-UHFFFAOYSA-N 2,4-dinitro-1-naphthol Chemical compound C1=CC=C2C(O)=C([N+]([O-])=O)C=C([N+]([O-])=O)C2=C1 FFRBMBIXVSCUFS-UHFFFAOYSA-N 0.000 description 1
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 1
- ALYNCZNDIQEVRV-UHFFFAOYSA-N 4-aminobenzoic acid Chemical compound NC1=CC=C(C(O)=O)C=C1 ALYNCZNDIQEVRV-UHFFFAOYSA-N 0.000 description 1
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 1
- 241000224424 Acanthamoeba sp. Species 0.000 description 1
- 102000013563 Acid Phosphatase Human genes 0.000 description 1
- 108010051457 Acid Phosphatase Proteins 0.000 description 1
- 241000228431 Acremonium chrysogenum Species 0.000 description 1
- 108700037654 Acyl carrier protein (ACP) Proteins 0.000 description 1
- 102000048456 Acyl carrier protein (ACP) Human genes 0.000 description 1
- 241001136782 Alca Species 0.000 description 1
- 101000758020 Alkalihalobacillus pseudofirmus (strain ATCC BAA-2126 / JCM 17055 / OF4) Uncharacterized aminotransferase BpOF4_10225 Proteins 0.000 description 1
- 208000024827 Alzheimer disease Diseases 0.000 description 1
- 102100030343 Antigen peptide transporter 2 Human genes 0.000 description 1
- 241000003610 Aplanochytrium Species 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000606125 Bacteroides Species 0.000 description 1
- 244000178993 Brassica juncea Species 0.000 description 1
- 235000011332 Brassica juncea Nutrition 0.000 description 1
- 235000014700 Brassica juncea var napiformis Nutrition 0.000 description 1
- MIMFUSIYQURABA-UHFFFAOYSA-N C=CCCC.[C] Chemical compound C=CCCC.[C] MIMFUSIYQURABA-UHFFFAOYSA-N 0.000 description 1
- 101000946068 Caenorhabditis elegans Ceramide glucosyltransferase 3 Proteins 0.000 description 1
- 101100275473 Caenorhabditis elegans ctc-3 gene Proteins 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical group [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 102100027667 Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 2 Human genes 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-M Chloride anion Chemical compound [Cl-] VEXZGXHMUGYJMC-UHFFFAOYSA-M 0.000 description 1
- 108700031407 Chloroplast Genes Proteins 0.000 description 1
- 101000744710 Clostridium pasteurianum Uncharacterized glutaredoxin-like 8.6 kDa protein in rubredoxin operon Proteins 0.000 description 1
- RGJOEKWQDUBAIZ-IBOSZNHHSA-N CoASH Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-IBOSZNHHSA-N 0.000 description 1
- 102100032182 Crooked neck-like protein 1 Human genes 0.000 description 1
- 241000199913 Crypthecodinium Species 0.000 description 1
- 241000199912 Crypthecodinium cohnii Species 0.000 description 1
- 241000605056 Cytophaga Species 0.000 description 1
- ZAQJHHRNXZUBTE-NQXXGFSBSA-N D-ribulose Chemical compound OC[C@@H](O)[C@@H](O)C(=O)CO ZAQJHHRNXZUBTE-NQXXGFSBSA-N 0.000 description 1
- ZAQJHHRNXZUBTE-UHFFFAOYSA-N D-threo-2-Pentulose Natural products OCC(O)C(O)C(=O)CO ZAQJHHRNXZUBTE-UHFFFAOYSA-N 0.000 description 1
- 238000007399 DNA isolation Methods 0.000 description 1
- 241000238557 Decapoda Species 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 235000021297 Eicosadienoic acid Nutrition 0.000 description 1
- 101000653283 Enterobacteria phage T4 Uncharacterized 11.5 kDa protein in Gp31-cd intergenic region Proteins 0.000 description 1
- 101000618324 Enterobacteria phage T4 Uncharacterized 7.9 kDa protein in mobB-Gp55 intergenic region Proteins 0.000 description 1
- 108010087894 Fatty acid desaturases Proteins 0.000 description 1
- 241000589565 Flavobacterium Species 0.000 description 1
- 101150094690 GAL1 gene Proteins 0.000 description 1
- 101150038242 GAL10 gene Proteins 0.000 description 1
- 102100028501 Galanin peptides Human genes 0.000 description 1
- 102100024637 Galectin-10 Human genes 0.000 description 1
- 102100039555 Galectin-7 Human genes 0.000 description 1
- 241000702463 Geminiviridae Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 241000705948 Glossomastix Species 0.000 description 1
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 244000020551 Helianthus annuus Species 0.000 description 1
- 235000003222 Helianthus annuus Nutrition 0.000 description 1
- 101000652582 Homo sapiens Antigen peptide transporter 2 Proteins 0.000 description 1
- 101000725947 Homo sapiens Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 2 Proteins 0.000 description 1
- 101000736065 Homo sapiens DNA replication complex GINS protein PSF2 Proteins 0.000 description 1
- 101100121078 Homo sapiens GAL gene Proteins 0.000 description 1
- 101000608772 Homo sapiens Galectin-7 Proteins 0.000 description 1
- 108091029795 Intergenic region Proteins 0.000 description 1
- 241000003482 Japonochytrium Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 241001467308 Labyrinthuloides Species 0.000 description 1
- 241001491666 Labyrinthulomycetes Species 0.000 description 1
- 240000006240 Linum usitatissimum Species 0.000 description 1
- 235000004431 Linum usitatissimum Nutrition 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- LTYOQGRJFJAKNA-KKIMTKSISA-N Malonyl CoA Natural products S(C(=O)CC(=O)O)CCNC(=O)CCNC(=O)[C@@H](O)C(CO[P@](=O)(O[P@](=O)(OC[C@H]1[C@@H](OP(=O)(O)O)[C@@H](O)[C@@H](n2c3ncnc(N)c3nc2)O1)O)O)(C)C LTYOQGRJFJAKNA-KKIMTKSISA-N 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- ACFIXJIJDZMPPO-NNYOXOHSSA-N NADPH Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](OP(O)(O)=O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 ACFIXJIJDZMPPO-NNYOXOHSSA-N 0.000 description 1
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 1
- 239000005642 Oleic acid Substances 0.000 description 1
- 101710157860 Oxydoreductase Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 240000005373 Panax quinquefolius Species 0.000 description 1
- 235000003140 Panax quinquefolius Nutrition 0.000 description 1
- 108020002230 Pancreatic Ribonuclease Proteins 0.000 description 1
- 102000005891 Pancreatic ribonuclease Human genes 0.000 description 1
- 241000562398 Phaeomonas Species 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 241000031611 Pinguiochrysis Species 0.000 description 1
- 241000705982 Pinguiococcus Species 0.000 description 1
- 241000031610 Pinguiophyceae Species 0.000 description 1
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 1
- 241000031608 Polypodochrysis Species 0.000 description 1
- 101000961876 Pyrococcus woesei Uncharacterized protein in gap 3'region Proteins 0.000 description 1
- 101000912235 Rebecca salina Acyl-lipid (7-3)-desaturase Proteins 0.000 description 1
- 101001056915 Saccharopolyspora erythraea 6-deoxyerythronolide-B synthase EryA2, modules 3 and 4 Proteins 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000863432 Shewanella putrefaciens Species 0.000 description 1
- 101000877236 Siganus canaliculatus Acyl-CoA Delta-4 desaturase Proteins 0.000 description 1
- 101000819248 Staphylococcus aureus Uncharacterized protein in ileS 5'region Proteins 0.000 description 1
- 235000021355 Stearic acid Nutrition 0.000 description 1
- 102100028897 Stearoyl-CoA desaturase Human genes 0.000 description 1
- 241001466451 Stramenopiles Species 0.000 description 1
- 241000973887 Takayama Species 0.000 description 1
- 241001467333 Thraustochytriaceae Species 0.000 description 1
- 102000008579 Transposases Human genes 0.000 description 1
- 108010020764 Transposases Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 239000006035 Tryptophane Substances 0.000 description 1
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 1
- MUCRYNWJQNHDJH-OADIDDRXSA-N Ursonic acid Chemical compound C1CC(=O)C(C)(C)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@@]5(C(O)=O)CC[C@@H](C)[C@H](C)[C@H]5C4=CC[C@@H]3[C@]21C MUCRYNWJQNHDJH-OADIDDRXSA-N 0.000 description 1
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- AHANXAKGNAKFSK-PDBXOOCHSA-N all-cis-icosa-11,14,17-trienoic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCCCC(O)=O AHANXAKGNAKFSK-PDBXOOCHSA-N 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 239000010775 animal oil Substances 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 230000003925 brain function Effects 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 150000001721 carbon Chemical group 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 230000000747 cardiac effect Effects 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 description 1
- 239000005516 coenzyme A Substances 0.000 description 1
- 229940093530 coenzyme a Drugs 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 238000006114 decarboxylation reaction Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 108010011713 delta-15 desaturase Proteins 0.000 description 1
- 108010037489 delta-4 fatty acid desaturase Proteins 0.000 description 1
- 108010022240 delta-8 fatty acid desaturase Proteins 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 235000020669 docosahexaenoic acid Nutrition 0.000 description 1
- 229940090949 docosahexaenoic acid Drugs 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 210000002969 egg yolk Anatomy 0.000 description 1
- JAZBEHYOTPTENJ-UHFFFAOYSA-N eicosapentaenoic acid Natural products CCC=CCC=CCC=CCC=CCC=CCCCC(O)=O JAZBEHYOTPTENJ-UHFFFAOYSA-N 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 230000004438 eyesight Effects 0.000 description 1
- 235000019197 fats Nutrition 0.000 description 1
- 230000004136 fatty acid synthesis Effects 0.000 description 1
- 150000002190 fatty acyls Chemical group 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 229940098330 gamma linoleic acid Drugs 0.000 description 1
- VZCCETWTMQHEPK-UHFFFAOYSA-N gamma-Linolensaeure Natural products CCCCCC=CCC=CCC=CCCCCC(O)=O VZCCETWTMQHEPK-UHFFFAOYSA-N 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- 150000002327 glycerophospholipids Chemical class 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- 101150073906 gpdA gene Proteins 0.000 description 1
- 101150095733 gpsA gene Proteins 0.000 description 1
- 238000000227 grinding Methods 0.000 description 1
- 230000002650 habitual effect Effects 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 210000002415 kinetochore Anatomy 0.000 description 1
- 150000002617 leukotrienes Chemical class 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 125000000346 malonyl group Chemical group C(CC(=O)*)(=O)* 0.000 description 1
- LTYOQGRJFJAKNA-DVVLENMVSA-N malonyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-DVVLENMVSA-N 0.000 description 1
- LTYOQGRJFJAKNA-VFLPNFFSSA-N malonyl-coa Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-VFLPNFFSSA-N 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000004570 mortar (masonry) Substances 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 235000020939 nutritional additive Nutrition 0.000 description 1
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 1
- OQCDKBAXFALNLD-UHFFFAOYSA-N octadecanoic acid Natural products CCCCCCCC(C)CCCCCCCCC(O)=O OQCDKBAXFALNLD-UHFFFAOYSA-N 0.000 description 1
- 235000020665 omega-6 fatty acid Nutrition 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 229940049547 paraxin Drugs 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 150000003071 polychlorinated biphenyls Chemical class 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 150000003180 prostaglandins Chemical class 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000002787 reinforcement Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 201000000980 schizophrenia Diseases 0.000 description 1
- 230000001932 seasonal effect Effects 0.000 description 1
- 229930000044 secondary metabolite Natural products 0.000 description 1
- 238000004904 shortening Methods 0.000 description 1
- RPLOPBHEZLFENN-HTMVYDOJSA-M sodium;4-[(2r,3r)-2-[(2,2-dichloroacetyl)amino]-3-hydroxy-3-(4-nitrophenyl)propoxy]-4-oxobutanoate Chemical compound [Na+].[O-]C(=O)CCC(=O)OC[C@@H](NC(=O)C(Cl)Cl)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 RPLOPBHEZLFENN-HTMVYDOJSA-M 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 239000008117 stearic acid Substances 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 239000008399 tap water Substances 0.000 description 1
- 235000020679 tap water Nutrition 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 229960004799 tryptophan Drugs 0.000 description 1
- 208000001072 type 2 diabetes mellitus Diseases 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 238000010626 work up procedure Methods 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6436—Fatty acid esters
- C12P7/6445—Glycerides
- C12P7/6472—Glycerides containing polyunsaturated fatty acid [PUFA] residues, i.e. having two or more double bonds in their backbone
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Oil, Petroleum & Natural Gas (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Immobilizing And Processing Of Enzymes And Microorganisms (AREA)
Abstract
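The invention relates to genes encoding sequences specific for polyketide synthases (PKS). The PKSs synthesized from them are characterized by their enzymatic capacity to produce PUFAs (polyunsaturated fatty acids). The invention further relates to the identification of the corresponding DNA sequences and to the use of these nucleotide sequences for producing recombinant and/or transgenic organisms.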
Description
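The present invention describes gene sequences coding for polyketide synthases (PKS). The PKSs synthesized from them are characterized by their enzymatic capacity to produce PUFAs (polyunsaturated fatty acids). The invention further comprises the identification of the corresponding DNA sequences and the use of these nucleotide sequences for producing recombinant and/or transgenic organisms.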
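The term PUFAs (polyunsaturated fatty acids) denotes multiply unsaturated long-chain fatty acids with a chain length greater than C12 and at least two double bonds. There are two main families of PUFAs, the ω-3 and the ω-6 fatty acids, which are distinguished by the position of the first double bond relative to the alkyl end. They are important components of cell membranes, where they occur as lipids, especially phospholipids. PUFAs also serve as precursors of important molecules in humans and animals, for example prostaglandins, leukotrienes and prostacyclins (A.P. Simopoulos, Essential fatty acids in health and chronic disease, Am. J. Clin. Nutr. 1999 (70), pp. 560-569). Important representatives of the ω-3 family are DHA (docosahexaenoic acid) and EPA (eicosapentaenoic acid), which are found in fish oils and in marine microorganisms. An important representative of the ω-6 fatty acids is ARA (arachidonic acid), which occurs, for example, in filamentous fungi but can also be isolated from animal tissues such as liver and kidney. DHA and ARA occur side by side in human breast milk.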
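The definition above can be illustrated with a minimal Python sketch. It assumes the common "carbons:double bonds" shorthand (for example 22:6 for DHA); the function names are hypothetical and chosen only for this example.

```python
def parse_shorthand(shorthand: str) -> tuple[int, int]:
    """Split a shorthand such as '22:6' into (carbon atoms, double bonds)."""
    carbons, double_bonds = shorthand.split(":")
    return int(carbons), int(double_bonds)


def is_pufa(shorthand: str) -> bool:
    """Apply the stated criteria: chain length > C12 and at least two double bonds."""
    carbons, double_bonds = parse_shorthand(shorthand)
    return carbons > 12 and double_bonds >= 2


if __name__ == "__main__":
    # DHA, EPA, ARA, oleic acid and stearic acid in shorthand notation.
    for name, code in [("DHA", "22:6"), ("EPA", "20:5"), ("ARA", "20:4"),
                       ("oleic acid", "18:1"), ("stearic acid", "18:0")]:
        print(name, code, is_pufa(code))
```

The sketch deliberately ignores the ω-3/ω-6 family, which depends on the position of the first double bond and not on the counts alone.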
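PUFAs are essential for proper human development, in particular for the developing brain and for tissue formation and repair. DHA is thus an important component of human cell membranes, especially those of nerve cells. It plays an important role in the maturation of brain function and is essential for the development of vision. ω-3 PUFAs such as DHA and EPA are used as nutritional supplements because a balanced diet with an adequate supply of DHA is beneficial for the prevention of certain diseases (A.P. Simopoulos, Essential fatty acids in health and chronic disease, American Journal of Clinical Nutrition 1999 (70), pp. 560-569). For example, adults with non-insulin-dependent diabetes show a deficient or at least imbalanced DHA status that is associated with cardiac problems arising later. Likewise, neuronal diseases such as Alzheimer's disease or schizophrenia are accompanied by low DHA levels.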
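There are many sources for the commercial extraction of DHA, for example oils from marine cold-water fish, egg-yolk fractions or marine microorganisms. Microorganisms suitable for extracting n-3 PUFAs are found, for example, among bacteria of the genus Vibrio (e.g. Vibrio marinus), among the dinoflagellates (Dinophyta), in particular the genus Crypthecodinium, such as C. cohnii, or among the Stramenopiles (or Labyrinthulomycota), such as the Pinguiophyceae, e.g. Glossomastix, Phaeomonas, Pinguiochrysis, Pinguiococcus and Polypodochrysis. Other preferred PUFA-producing microorganisms belong in particular to the order Thraustochytriales (Thraustochytriidea), with the genera Japonochytrium, Schizochytrium, Thraustochytrium, Althornia, Labyrinthuloides, Aplanochytrium and Ulkenia.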
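Oils extracted from commercially known PUFA sources such as plants or animals are often characterized by a very heterogeneous composition. Oils obtained in this way must undergo costly purification before one or a few PUFAs can be enriched. In addition, the supply of PUFAs from these sources is subject to uncontrollable fluctuations. Disease and weather can reduce animal as well as plant yields. The extraction of PUFAs from fish fluctuates seasonally and can even halt temporarily because of overfishing or climatic changes (e.g. El Niño). Animal oils, especially fish oils, can accumulate harmful substances from the environment via the food chain. Animals, particularly in commercial fish farms, are known to be heavily burdened with organochlorines such as polychlorinated biphenyls, which counteracts the health benefits of fish consumption (Hites et al., 2004, Global assessment of organic contaminants in farmed salmon, Science 303, pp. 226-229). The resulting loss in quality of fish products reduces consumer acceptance of fish and fish oils as ω-3 PUFA sources. Moreover, concentrating DHA from fish is relatively expensive because of the high technical demands. DHA, on the other hand, is present in a few marine microorganisms at about 50% of the total cellular fat, and these organisms can be cultured relatively economically in large fermenters. A further advantage of microorganisms is that the oils extracted from them consist of only a few components.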
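Several biocatalytic pathways are known for the biosynthesis of long-chain PUFAs such as docosahexaenoic acid (DHA; 22:6, n-3) and eicosapentaenoic acid (EPA; 20:5, n-3). The conventional biosynthetic pathway for long-chain PUFAs in eukaryotes begins with the Δ6 desaturation of linoleic acid (LA; 18:2, n-6) and α-linolenic acid (ALA; 18:3, n-3). This yields γ-linolenic acid (GLA; 18:3, n-6) from linoleic acid and octadecatetraenoic acid (OTA; 18:4, n-3) from α-linolenic acid. For both n-6 and n-3 fatty acids, this desaturation is followed by an elongation step and a Δ5 desaturation, yielding arachidonic acid (ARA; 20:4, n-6) and eicosapentaenoic acid (EPA; 20:5, n-3). The synthesis of DHA (22:6, n-3) from EPA (20:5, n-3) can then proceed by two different biosynthetic routes. In the so-called linear pathway, EPA is elongated by a further two carbon units and then Δ4-desaturated to form DHA. The existence of this pathway is confirmed by the presence of Δ4 desaturases in organisms such as Thraustochytrium and Euglena (Qiu et al., Identification of a delta 4 fatty acid desaturase from Thraustochytrium sp. involved in the biosynthesis of docosahexaenoic acid by heterologous expression in Saccharomyces cerevisiae and Brassica juncea, J. Biol. Chem. 276 (2001), pp. 31561-31566, and Meyer et al., Biosynthesis of docosahexaenoic acid in Euglena gracilis: Biochemical and molecular evidence for the involvement of a delta 4 fatty acyl group desaturase, Biochemistry 42 (2003), pp. 9779-9788). The second route from EPA to DHA, the so-called Sprecher pathway, is independent of Δ4 desaturation. It consists of two successive elongation steps of two carbon units each, giving tetracosapentaenoic acid (24:5, n-3), followed by Δ6 desaturation to tetracosahexaenoic acid (24:6, n-3). DHA is then formed by shortening the chain by two carbon units through peroxisomal β-oxidation (H. Sprecher, Metabolism of highly unsaturated n-3 and n-6 fatty acids, Biochimica et Biophysica Acta 1486 (2000), pp. 219-231). This second pathway is the predominant route of DHA synthesis in mammals (Leonard et al., Identification and expression of mammalian long-chain PUFA elongation enzymes, Lipids 37 (2002), pp. 733-740). An alternative pathway for C20 PUFA formation exists in a few organisms that lack Δ6 desaturase activity. These include, for example, the protists Acanthamoeba sp. and Euglena gracilis. The first step in this alternative C20 PUFA synthesis is the elongation of the C18 fatty acids linoleic acid (LA; 18:2, n-6) and α-linolenic acid (ALA; 18:3, n-3) by two carbon units. The resulting fatty acids, eicosadienoic acid (20:2, n-6) and eicosatrienoic acid (20:3, n-3), are then converted by a Δ8 desaturation followed by a Δ5 desaturation into arachidonic acid (ARA; 20:4, n-6) and/or eicosapentaenoic acid (EPA; 20:5, n-3) (Sayanova and Napier, Eicosapentaenoic acid: Biosynthetic routes and the potential for synthesis in transgenic plants, Phytochemistry 65 (2004), pp. 147-158; Wallis and Browse, The delta-8 desaturase of Euglena gracilis: An alternate pathway for synthesis of 20-carbon polyunsaturated fatty acids, Arch. Biochem. Biophys. 362 (1999), pp. 307-316).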
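The two routes from EPA to DHA described above can be checked with simple bookkeeping on carbon atoms and double bonds. The following Python sketch is only an illustration under that simplification: the type and function names are hypothetical, and the model tracks counts only, not double-bond positions or enzyme specificity.

```python
from typing import NamedTuple


class FattyAcid(NamedTuple):
    carbons: int
    double_bonds: int
    family: str  # "n-3" or "n-6"


def elongate(fa: FattyAcid) -> FattyAcid:
    """Elongation adds two carbon units."""
    return fa._replace(carbons=fa.carbons + 2)


def desaturate(fa: FattyAcid) -> FattyAcid:
    """Any desaturase step (delta-4, -5, -6, -8) adds one double bond."""
    return fa._replace(double_bonds=fa.double_bonds + 1)


def beta_oxidize(fa: FattyAcid) -> FattyAcid:
    """One round of peroxisomal beta-oxidation removes two carbon units."""
    return fa._replace(carbons=fa.carbons - 2)


EPA = FattyAcid(20, 5, "n-3")
DHA = FattyAcid(22, 6, "n-3")


def linear_pathway(epa: FattyAcid) -> FattyAcid:
    # 20:5 -> elongation -> 22:5 -> delta-4 desaturation -> 22:6
    return desaturate(elongate(epa))


def sprecher_pathway(epa: FattyAcid) -> FattyAcid:
    # 20:5 -> 22:5 -> 24:5 -> delta-6 desaturation -> 24:6 -> beta-oxidation -> 22:6
    return beta_oxidize(desaturate(elongate(elongate(epa))))


if __name__ == "__main__":
    print(linear_pathway(EPA) == DHA, sprecher_pathway(EPA) == DHA)
```

Both routes end at 22:6, n-3; they differ in the intermediates and in whether a Δ4 desaturase is required.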
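Higher plants lack the ability to synthesize C20 PUFAs from precursors. Starting from stearic acid (18:0), they use various desaturases to form oleic acid (18:1; Δ9 desaturase), linoleic acid (18:2, n-6; Δ12 desaturase) and α-linolenic acid (18:3, n-3; Δ15 desaturase).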
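Some marine microorganisms, however, use an entirely different biosynthetic route to produce EPA and DHA. These PUFA-producing microorganisms include marine representatives of the γ-proteobacteria, a few species of the Cytophaga-Flavobacterium-Bacteroides group and, so far, one eukaryotic protist, Schizochytrium sp. ATCC 20888 (Metz et al., 2001, Production of polyunsaturated fatty acids by polyketide synthases in both prokaryotes and eukaryotes, Science 293: 290-293). They synthesize long-chain PUFAs by means of so-called polyketide synthases (PKSs). These PKSs are large enzymes that catalyse the synthesis of secondary metabolites composed of ketide units (G.W. Wallis, J.L. Watts and J. Browse, Polyunsaturated fatty acid synthesis: what will they think of next? Trends in Biochemical Sciences 27(9) (2000) pp. 467-473). Polyketide synthesis involves many enzymatic reactions analogous to those of fatty acid synthesis (Hopwood & Sherman, Annu. Rev. Genet. 24 (1990) pp. 37-66; Katz & Donadio, Annu. Rev. of Microbiol. 47 (1993) pp. 875-912).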
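The gene sequences of various PUFA-PKSs (PUFA-synthesizing PKSs) are known. A 38 kb genomic fragment containing the information for EPA production was isolated from the marine bacterium Shewanella sp. Subsequent sequencing of this fragment led to the identification of eight open reading frames (ORFs) (H. Takeyama et al., Microbiology 143 (1997) pp. 2725-2731). Five of these Shewanella ORFs are closely related to polyketide synthase genes. Similarly, US Patent No. 5,798,259 describes the EPA gene cluster from Shewanella putrefaciens SCRC-2874. PUFA-PKS genes have also been found in the marine prokaryotes Photobacterium profundum strain SS9 (Allen and Bartlett, Microbiology 2002, 148, pp. 1903-1913) and Moritella marina strain MP-1, formerly Vibrio marinus (Tanaka et al., Biotechnol. Letters 1999, 21, pp. 939-945). Similar PUFA-producing PKS-like ORFs have also been identified in the eukaryotic protist Schizochytrium (Metz et al., Science 293 (2001) pp. 290-293, US Patent No. 6,566,583 and WO 02/083870 A2). Three ORFs were identified in Schizochytrium that show partial identity to the EPA gene cluster from Shewanella. The presence of these conserved PKS genes in a few prokaryotes and in the eukaryote Schizochytrium suggests that PUFA-PKS genes may have been transferred horizontally between prokaryotes and eukaryotes.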
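Transgenic production of PUFAs with isolated gene clusters has even been demonstrated in microorganisms that do not normally produce PUFAs. Thus, the above-mentioned five ORFs (open reading frames) from the Shewanella sp. SCRC-2738 cluster are sufficient to produce measurable amounts of EPA in the non-EPA producers Escherichia coli (E. coli) and Synechococcus sp. (Yazawa, Lipids 1996, 31, pp. 297-300 and Takeyama et al., Microbiology 1997, 143, pp. 2725-2731).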
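In general, there is a constant need for new PUFA producers for the large-scale production of PUFAs. Whether this production takes place in, for example, prokaryotes, protists or plants is initially unimportant. The aim is always to produce large amounts of high-quality PUFAs as economically and in as environmentally friendly a manner as possible. The present invention pursues this aim by presenting suitable PUFA-PKS genes from the particularly efficient PUFA producer Ulkenia sp.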
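In view of the state of the art, the object of the present invention was therefore to identify and isolate, from the DHA-producing microorganism Ulkenia sp., further PUFA-PKS genes that are highly suitable for producing PUFAs. Furthermore, knowledge was to be gained about the position and arrangement of these genes and of their regulatory elements. This knowledge, and in particular the nucleic acid material obtained, should make possible the enhanced expression of PUFA-PKS genes in homologous as well as in transgenic organisms.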
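These objects, and others that are not explicitly stated but can readily be derived or inferred from the context discussed at the outset, are achieved by the subject matter defined in the claims of the present invention.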
1. PUFA-PKSs, characterized in that they
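a. comprise at least one of the amino acid sequences shown in SEQ ID No. 6 (ORF 1), 7 (ORF 2), 8 and/or 80 (ORF 3), as well as homologous sequences that have at least 70%, preferably 80%, particularly preferably at least 90%, more particularly preferably at least 99% and most preferably 100% sequence homology to them and that have the biological activity of at least one domain of the PUFA-PKS, or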
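b. comprise at least one of the amino acid sequences shown in SEQ ID No. 32, 34, 45, 58, 59, 60, 61, 72, 74 and/or 77, as well as homologous sequences that have at least 70%, preferably 80%, particularly preferably at least 90%, more particularly preferably at least 99% and most preferably 100% sequence homology to them and that have the biological activity of at least one domain of the PUFA-PKS.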
2. Isolated PUFA-PKS according to claim 1 having 10 or more ACP domains.
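Furthermore, in a preferred aspect the invention relates to a PUFA-PKS comprising at least one amino acid sequence that has at least 70%, preferably at least 80%, particularly preferably at least 90% and more particularly preferably at least 99% identity to at least 500 directly contiguous amino acids of the sequences SEQ ID No. 6 (ORF 1), 7 (ORF 2) and/or 8 and/or 80 (ORF 3).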
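To make the "at least 70% identity over at least 500 contiguous amino acids" criterion concrete, the following Python sketch scans two sequences with a sliding window. It is a simplified illustration, not the method used in the patent: it assumes the two sequences are already aligned position by position without gaps and have equal length, and the function name is hypothetical.

```python
def best_window_identity(reference: str, candidate: str, window: int = 500) -> float:
    """Return the highest percent identity found in any stretch of `window`
    contiguous positions (ungapped, position-by-position comparison)."""
    if len(reference) != len(candidate):
        raise ValueError("sequences must be pre-aligned to equal length")
    if window <= 0:
        raise ValueError("window must be positive")
    if len(reference) < window:
        return 0.0  # no stretch of the required length exists
    matches = [int(a == b) for a, b in zip(reference, candidate)]
    current = sum(matches[:window])  # matches in the first window
    best = current
    for i in range(window, len(matches)):
        # Slide the window by one position: add the new column, drop the oldest.
        current += matches[i] - matches[i - window]
        best = max(best, current)
    return 100.0 * best / window


if __name__ == "__main__":
    # Toy sequences (invented) with a short window for readability.
    print(best_window_identity("ACDEFGHIKL", "AXDEFXHIKL", window=5))
```

With real sequences one would first produce the alignment with a dedicated tool and then apply the threshold, for example `best_window_identity(ref, cand, 500) >= 70.0`.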
In a further preferred aspect, the invention relates to an isolated DNA molecule encoding a PUFA-PKS according to any one of the preceding claims.
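The latter is preferably characterized in that it encodes an amino acid sequence having at least 70% identity to at least 500 directly contiguous amino acids of the sequences SEQ ID No. 6 (ORF 1), 7 (ORF 2) and/or 8 and/or 80 (ORF 3).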
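The invention further relates to an isolated DNA molecule having at least 70%, preferably at least 80%, particularly preferably at least 90% and more particularly preferably at least 95% identity to at least 500 contiguous nucleotides of the sequences SEQ ID No. 3, 4, 5 and/or 9.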
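In a further preferred aspect, the invention relates to a recombinant DNA molecule comprising one of the DNA molecules described above, functionally linked to at least one DNA sequence that controls transcription, preferably selected from SEQ ID No. 3, 4 and 5 and/or 9, or portions thereof of at least 500 nucleotides, and functional variants thereof.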
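In yet another preferred aspect, the invention relates to a recombinant host cell comprising the recombinant DNA molecule described above.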
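In yet another preferred aspect, the invention relates to a recombinant host cell that endogenously expresses a PUFA-PKS according to the invention having at least 10 ACP domains.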
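Furthermore, in yet another preferred aspect, the invention relates to a method for producing a PUFA-containing oil, comprising culturing such a recombinant host cell, and to the oil produced in this way.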
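Furthermore, in yet another preferred aspect, the invention relates to a method for producing a biomass containing PUFAs, preferably DHA, comprising culturing such a recombinant host cell, and to the biomass produced in this way.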
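Accordingly, in yet another preferred aspect the invention also relates to a recombinant biomass according to claim 15, comprising a nucleic acid according to claim 8 and/or an amino acid sequence according to claim 1, or a portion of at least 500 contiguous amino acids homologous to it.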
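In yet another preferred aspect, the invention also relates to the use of individual enzyme domains, shown in SEQ ID No. 32, 33, 34, 45, 58, 59, 60, 61, 72, 74 and/or 77, from a PUFA-PKS comprising SEQ ID No. 6, 7, 8 and/or 80, for producing artificial polyketides, e.g. polyketide antibiotics, and/or new, altered fatty acids.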
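According to the invention, identity with respect to nucleic acids denotes identical base pairs at given positions of the strands being compared. Gaps are, however, permitted. Identity values in % can be calculated with, for example, the programs blastn and fasta.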
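As a worked illustration of a percent-identity value on a gapped alignment, the Python sketch below counts identical columns in two pre-aligned strings in which "-" marks a gap. The convention used here (identical columns divided by all alignment columns, so that gap columns count as non-identical) is an assumption made for this example; blastn and fasta each report identity under their own conventions.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences ('-' denotes a gap).

    Assumption for this sketch: identity = identical columns / alignment columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    identical = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * identical / len(aligned_a)


if __name__ == "__main__":
    # Toy alignment (invented): 4 identical columns out of 6.
    print(round(percent_identity("ACGT-A", "ACCTTA"), 2))
```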
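With respect to amino acids, the concept of homology also covers, for example, conservative substitutions in the amino acid sequence that do not affect the function and/or structure of the protein. These homology values, too, are calculated with programs known to the person skilled in the art, for example blastp with matrix PAM30, gap penalty 9 and extension 1 (Altschul et al., NAR 25, 3389-3402).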
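The blastp settings named above could be expressed as a command line roughly as follows. This Python sketch only builds the argument list and does not run anything. It assumes the NCBI BLAST+ `blastp` executable and its `-query`, `-subject`, `-matrix`, `-gapopen` and `-gapextend` options; the patent does not state which BLAST release was used, and the file names are hypothetical placeholders.

```python
def blastp_command(query_fasta: str, subject_fasta: str) -> list[str]:
    """Build a BLAST+ blastp argument list with matrix PAM30,
    gap opening penalty 9 and gap extension penalty 1."""
    return [
        "blastp",
        "-query", query_fasta,      # hypothetical input file
        "-subject", subject_fasta,  # hypothetical comparison file
        "-matrix", "PAM30",
        "-gapopen", "9",
        "-gapextend", "1",
    ]


if __name__ == "__main__":
    # Placeholder file names; pass the list to subprocess.run() to execute it.
    print(" ".join(blastp_command("orf1_protein.fasta", "reference_pks.fasta")))
```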
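The sequence information of the PUFA-PKS genes from Ulkenia sp. can be obtained from the nucleic acid and amino acid sequences defined in SEQ ID No. 3-5 and/or 9. SEQ ID No. 1 and 2 represent the complete genomic DNA sequences on the two cosmids isolated so far (see Examples 2 and 3). These in turn contain the information for the three relevant open reading frames, ORFs 1-3, required for PUFA synthesis, together with their flanking regulatory sequences. In addition, the protein sequences that can be derived from the genomic sequences are given.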
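The invention further comprises methods for the homologous and heterologous transformation of host organisms with nucleic acids according to the invention for producing high-purity PUFAs. The isolated open reading frames preferably lead to the production of PUFAs, in particular DHA, EPA and DPA, in homologous and in transgenic organisms.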
The PUFAs produced in this way are preferably present as components of the biomass or as oil.
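Before the present invention, the only PUFA-PKS genes known from a eukaryote were those of the protist Schizochytrium (US Patent No. 6,566,583, WO 02/083870). The sequence data determined there derive partly from cDNA and partly from chromosomal DNA. In the present invention, all PUFA-PKS genes of a eukaryotic protist that are required for PUFA synthesis are for the first time described completely from chromosomal DNA. This not only yielded the previously unknown PUFA-PKS coding information from Ulkenia sp., but also provided data on flanking regulatory elements such as transcriptional promoters and terminators. Moreover, the chromosomal sequence information allows insight into the position and arrangement of the individual PUFA-PKS genes.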
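Entirely surprising here is that a cluster of the kind previously known from prokaryotic PUFA-PKS representatives such as Shewanella, Photobacterium or Moritella no longer exists. The cosmid identified first (Seq ID No. 1) shows that the linear arrangement of the individual ORFs is disrupted in Ulkenia and that the reading directions of individual ORFs are reversed (Figure 1). This may be the result of the transposition of large gene segments. As a result of transposition, the individual ORFs are also clearly farther apart from one another. Thus, ORFs 1 and 2 are separated by about 13 kb. The third ORF could only be identified on a different cosmid (Seq ID No. 2), and no partial identity was found between the two cosmids (Seq ID No. 1 and 2) (Figure 1). This means that ORF 3 from Ulkenia sp. is no longer located in spatial proximity to ORFs 1 and 2. It follows that the PUFA gene cluster known from the above-mentioned prokaryotic representatives no longer exists in the eukaryote Ulkenia sp. The genomic position and arrangement of the individual PUFA-PKS genes of the protist Schizochytrium have been partly determined (WO 02/083870) and likewise show opposite orientations of the two ORFs A and B. There, however, the ORFs are separated by only 4224 base pairs. In patent application WO 02/083870 this sequence segment is discussed as an intergenic region with a bidirectional promoter element. At least for Ulkenia, a bidirectional promoter element between the homologous ORFs 1 and 2 seems unlikely, given the 12.95 kb spacer determined for Ulkenia. No other obvious ORFs are present within the 12.95 kb region between ORF 1 and ORF 2 from Ulkenia. This indicates that major recombination and/or transposition events have occurred in this region. Transposase-like events may also have occurred on the basis of a few repetitive sequence repeats.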
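Even more surprising is that, compared with the PUFA-PKSs of the EPA producers Shewanella (6xACP) and Photobacterium (5xACP) and of the DHA producers Moritella (5xACP) and Schizochytrium (9xACP), the PUFA-PKS from Ulkenia sp. has the largest number of acyl carrier protein repeats, with 10 ACP domains (Fig. 3). This means that the PUFA-PKS isolated from Ulkenia sp. not only has a deviating amino acid sequence relative to the PUFA-PKS from the related protist Schizochytrium, but is also structurally unique. A further distinguishing feature is that the third ORF from Ulkenia sp. is 38 amino acids shorter than ORF C from Schizochytrium and additionally contains an alanine-rich domain that is not present in this form in Schizochytrium (Fig. 6). Interestingly, this sequence resembles the regions located between the individual ACP domains of ORF 1 and may represent a linker region. The similarity lies in the sequence length and in the fact that the alanine stretch is interrupted only by individual prolines and valines. The largest part of the amino acids missing in ORF 3 relative to Schizochytrium ORF C results from a deletion 30 amino acids long, located between the dehydratase/isomerase domains (Fig. 6). As a result, these domains lie closer together on the corresponding protein, which may affect enzymatic activity. For ORF 3, ATG codons located further 5' could also serve as start codons, so that in theory an ORF of up to 1848 amino acids could exist (Seq ID No. 9 and 80). In this case it is even possible that variants of ORF 3 occur simultaneously.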
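In particular, ORF 1 from Ulkenia sp. (Seq ID No. 3 and 6) contains, first, a so-called β-ketoacyl synthase domain (Seq ID No. 14 and 32), which is characterized by the motif (DXAC) (Seq ID No. 12 and 30). The active-site motif of this enzymatic domain in Ulkenia ORF 1 can preferably be extended to a range of 17 amino acids (GMNCVVDAACASSLIAV) (Seq ID No. 11 and 29). The complete β-ketoacyl synthase domain can be divided into an N-terminal part (Seq ID No. 10 and 28) and a C-terminal part (Seq ID No. 13 and 31). The biological function of the β-ketoacyl synthase domain is to catalyze the condensation reaction of fatty acid and/or PKS synthesis. The acyl group to be extended is bound via a thioester bond to the cysteine residue in the active site of the enzymatic domain and is transferred in several steps to carbon atom 2 of the malonyl group on the acyl carrier protein, with release of CO2. The β-ketoacyl synthase domain is followed by a malonyl-CoA-ACP transferase domain (Seq ID No. 15 and 33). This domain catalyzes the transfer of malonyl-CoA to the 4'-phosphopantetheine group of the acyl carrier protein (ACP). Malonyl-CoA-ACP transferase domains can also transfer methyl- or ethylmalonate to ACP, thereby introducing branches into the otherwise linear carbon chain. This is followed by a linker region and then an alanine-rich sequence section (Seq ID No. 16 and 34) that contains 10 repeated acyl carrier protein domains (ACP domains) (Seq ID No. 17-26 and 35-44). These ACP domains are in turn separated from one another by linker regions consisting mainly of alanine and proline. Each ACP domain is characterized by a binding motif for a 4'-phosphopantetheine molecule (LGXDS(L/I)). The 4'-phosphopantetheine molecule is bound to the conserved serine within this motif. Via the 4'-phosphopantetheine group, the ACP domains act as carriers for the growing fatty acid and/or polyketide chain. A sequence with partial identity to ketoreductases follows (Seq ID No. 27 and 45). The biological function of these domains is the NADPH-dependent reduction of 3-ketoacyl-ACP compounds. This is the first reduction reaction in fatty acid biosynthesis, and it also occurs frequently in polyketide synthesis (see also Fig. 3).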
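The two motifs named above, DXAC for the β-ketoacyl synthase active site and LGXDS(L/I) for the 4'-phosphopantetheine binding site, can be written as regular expressions, with X read as any residue. The following is a minimal sketch using Python's standard `re` module. The function name is illustrative, and the two-repeat test sequence with its alanine/proline linker is a made-up toy, not taken from the sequence listing.

```python
import re

# X in the motif notation is treated as "any single residue".
KS_ACTIVE_SITE = re.compile(r"D.AC")      # DXAC
ACP_PPANT_SITE = re.compile(r"LG.DS[LI]")  # LGXDS(L/I)


def find_motif(pattern: re.Pattern, protein: str) -> list:
    """Return (1-based start, matched text) for each non-overlapping hit."""
    return [(m.start() + 1, m.group()) for m in pattern.finditer(protein.upper())]


# 17-residue active-site stretches quoted for ORF 1 and ORF 2.
print(find_motif(KS_ACTIVE_SITE, "GMNCVVDAACASSLIAV"))  # [(7, 'DAAC')]
print(find_motif(KS_ACTIVE_SITE, "PLHYSVDAACATALYVL"))  # [(7, 'DAAC')]

# Hypothetical toy protein: two copies of the repeat LGIDSIKRVEIL
# joined by an invented alanine/proline linker.
toy = "LGIDSIKRVEIL" + "AAPAAPAA" + "LGIDSIKRVEIL"
print(find_motif(ACP_PPANT_SITE, toy))  # [(1, 'LGIDSI'), (21, 'LGIDSI')]
```

Counting hits of the ACP pattern in a full-length protein would give a quick estimate of the number of ACP repeats, although a real domain annotation relies on more than a six-residue motif.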
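ORF 2 from Ulkenia sp. (Seq ID No. 4 and 7) likewise begins with a β-ketoacyl synthase domain (Seq ID No. 50 and 58), characterized by the motif (DXAC) (Seq ID No. 48 and 56). This active-site motif of the enzymatic domain in Ulkenia ORF 2 can preferably be extended to a range of 17 amino acids (PLHYSVDAACATALYVL) (Seq ID No. 47 and 55). The complete β-ketoacyl synthase domain can be divided into an N-terminal part (Seq ID No. 46 and 54) and a C-terminal part (Seq ID No. 49 and 57). The biological activity of this domain corresponds to that of the β-ketoacyl synthase domain described for ORF 1. Ketosynthases play a key role in the elongation cycle and show higher substrate specificity than the other enzymes of fatty acid synthesis. This is in turn followed by a sequence segment with lower partial identity to β-ketoacyl synthase domains. This domain, moreover, lacks the active-site motif DXAC. It has the characteristics of a so-called chain length factor (CLF) from type II PKS-like systems (Seq ID No. 51 and 59). CLF amino acid sequences show partial identity to ketosynthases but lack the characteristic active site with the corresponding cysteine residue. The role of CLFs in PKS systems is currently a matter of debate. Recent results indicate that the role of the CLF domain lies in the decarboxylation of malonyl-ACP. The resulting acetyl group can then bind to the active site of the β-ketoacyl synthase domain and thus represents the so-called priming molecule of the initial condensation reaction. CLF-homologous sequences have also been found as loading domains in modular PKS systems. Domains with CLF sequence characteristics are present in all previously known PUFA-PKS systems. This is followed by an acyl transferase domain (Seq ID No. 52 and 60). This domain catalyzes a range of acyl transfers, such as the transfer of acyl groups to coenzyme A or to ACP domains. The terminal domain of ORF 2 shows partial identity to oxidoreductases (Seq ID No. 53 and 61) and most probably represents an enoyl reductase domain. The biological activity of the enoyl reductase domain lies in the second reduction reaction of fatty acid synthesis: it catalyzes the reduction of the trans double bond of the fatty acyl-ACP (see also Fig. 2).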
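ORF 3 from Ulkenia sp. (Seq ID No. 5 and 8) comprises two dehydratase/isomerase domains (Seq ID No. 66, 68, 72 and 74). Both domains contain an "active site" histidine with a directly adjacent cysteine (Seq ID No. 67 and 73, and Seq ID No. 69 and 75). The biological function of these domains is the insertion of trans double bonds into the fatty acid or polyketide molecule, with elimination of H2O, and the subsequent conversion of the double bond into the cis-isomeric form. The second dehydratase/isomerase domain merges into an alanine-rich region (Seq ID No. 70 and 76), which has no known function but may represent a linker region. This is followed by an enoyl reductase domain (Seq ID No. 71 and 77) that has high partial identity with the enoyl reductase domain already present in ORF 2 of Ulkenia. Its biological function corresponds to that of the enoyl reductase domain described above (see also Fig. 2).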
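Preferably, the 2000 bp upstream of the ATG start codon of ORF 1 from Ulkenia sp. (Seq ID No. 62) are specified as the promoter sequence. Particularly preferred are the 1500 bp, and more particularly preferred the 1000 bp, upstream of the start codon.
Preferably, the 2000 bp downstream of the TAA stop codon (Seq ID No. 63) can be specified as the termination sequence of ORF 1. Particularly preferred are the 1500 bp, and more particularly preferred the 1000 bp, downstream of the stop codon. A potential termination signal for mRNA synthesis of ORF 1, with the base sequence AATAAA, is located 412 bp downstream of the TAA stop codon.
Preferably, the 2000 bp upstream of the ATG start codon of ORF 2 from Ulkenia sp. (Seq ID No. 64) are specified as the promoter sequence. Particularly preferred are the 1500 bp, and more particularly preferred the 1000 bp, upstream of the start codon.
Preferably, the 2000 bp downstream of the TAA stop codon (Seq ID No. 65) can be specified as the termination sequence of ORF 2. A potential termination signal for mRNA synthesis of ORF 2, with the base sequence AATAAA, is located 1650 bp downstream of the TAA stop codon.
Preferably, the 2000 bp upstream of the ATG start codon of ORF 3 from Ulkenia sp. (Seq ID No. 78) are specified as the promoter sequence. Particularly preferred are the 1500 bp, and more particularly preferred the 1000 bp, upstream of the start codon.
Preferably, the 2000 bp downstream of the TAA stop codon (Seq ID No. 79) can be specified as the termination sequence of ORF 3. A potential termination signal for mRNA synthesis of ORF 3, with the base sequence AATAAA, is located 4229 bp downstream of the TAA stop codon.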
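Locating a potential AATAAA termination signal downstream of a stop codon, as reported above for ORFs 1 to 3, amounts to a simple substring search. The sketch below is illustrative only: the function name is hypothetical, the example sequence is an invented toy, and the convention that the first base after the stop codon has offset 1 is an assumption, since the text does not say how the distances were counted.

```python
def signal_offsets(downstream: str, signal: str = "AATAAA") -> list:
    """Return 1-based offsets of every occurrence of `signal`.

    `downstream` is the sequence that starts immediately after the stop
    codon, so offset 1 is the first base following TAA. Overlapping
    occurrences are reported as well.
    """
    seq = downstream.upper()
    signal = signal.upper()
    hits = []
    pos = seq.find(signal)
    while pos != -1:
        hits.append(pos + 1)
        pos = seq.find(signal, pos + 1)
    return hits


# Invented 20-base example with one AATAAA starting at base 11.
print(signal_offsets("gctggaaaccaataaaggct"))  # [11]
```

Applied to the 2000 bp termination sequences, the first element of the returned list would correspond to the nearest candidate signal under this counting convention.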
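PUFA, for example DHA, can be produced homologously in Ulkenia sp., and can also be produced heterologously in hosts such as Escherichia coli using the sequence information determined in the present invention. The nucleic acid sequences according to the invention can be used to increase PUFA yield, for example by increasing the copy number of PUFA-PKS genes in a PUFA-producing organism. Naturally, individual nucleic acid segments, for example the sequence segments encoding the ACP domains, can also be amplified in homologous or heterologous production organisms. The ACP domains in particular lend themselves to increasing production, because the binding site for the cofactor 4'-phosphopantetheine is essential for PUFA synthesis. Naturally, the use of different regulatory elements, for example promoters, terminators and enhancer elements, can also increase yield in genetically modified PUFA producers. Genetic modifications within individual sequence segments can change the structure of the product and thus lead to the production of different PUFAs. In addition, the similarity of PUFA synthases to polyketide synthases makes it possible to construct mixed systems. This so-called combinatorial biosynthesis allows the production of new, artificial bioactive substances. For example, new polyketide antibiotics could be produced in transgenic microorganisms by mixed systems of PKS and PUFA-PKS units.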
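Besides E. coli, suitable hosts for heterologous expression of the PUFA genes presented here include, for example, yeasts such as Saccharomyces cerevisiae and Pichia pastoris, or filamentous fungi such as Aspergillus nidulans and Acremonium chrysogenum. PUFA-producing plants can be generated by introducing the genes according to the invention into, for example, soybean, rapeseed, sunflower, flax or other, preferably oil-rich, plants. For efficient heterologous expression of the PUFA genes, further accessory genes, for example a 4'-phosphopantetheinyl transferase, can also be used. In addition, host-specific promoter/operator systems can be used for enhanced or inducible gene expression.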
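A variety of prokaryotic expression systems can be used for heterologous PUFA production. Expression vectors can be constructed that contain, in addition to the corresponding PUFA genes, a promoter, a ribosome binding site and a transcription terminator. Examples of such regulatory elements in E. coli are the promoter/operator region of E. coli tryptophan biosynthesis and the promoters of phage λ. Likewise, selectable markers, for example resistance to ampicillin, tetracycline or chloramphenicol, can be used on suitable vectors. Vectors well suited for the transformation of E. coli are pBR322, pCQV2 and the pUC plasmids and their derivatives. These plasmids may contain viral as well as bacterial elements. Any strain derived from E. coli K12, for example JM101, JM109, RR1, HB101, DH1 or AG1, can be used as the E. coli host strain. Naturally, all other customary prokaryotic expression systems can also be used for heterologous PUFA production (see also Sambrook et al.). Oil-forming bacteria can also be used as host systems.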
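Mammalian, plant and insect cells, as well as fungi such as yeasts, can be used as eukaryotic expression systems. For yeast systems, transcription initiation elements from genes of glycolytic enzymes can be used. These include the regulatory elements of alcohol dehydrogenase, glyceraldehyde-3-phosphate dehydrogenase, phosphoglucoisomerase, phosphoglycerate kinase and others. However, regulatory elements from genes such as those for acid phosphatase, lactase, metallothionein or glucoamylase can also be used. Here too, promoters allowing enhanced or inducible expression are used. Galactose-inducible promoters (GAL1, GAL7 and GAL10) are of particular interest (Lue et al., 1987, Mol. Cell. Biol. 7, p. 3446 ff., and Johnston, 1987, Microbiol. Rev. 51, p. 458 ff.). The 3' termination sequences are likewise preferably derived from yeast. Because the nucleotide sequence immediately adjacent to the start codon (ATG) influences gene expression in yeast, efficient translation initiation sequences from yeast are also preferred. Where yeast plasmids are used, they contain an origin of replication from yeast and a selection marker. This selection marker is preferably an auxotrophic marker, for example LEU, TRP or HIS. Such yeast plasmids are the so-called YRps (yeast replicating plasmids), YCps (yeast centromere plasmids) and YEps (yeast episomal plasmids). Plasmids without an origin of replication are YIps (yeast integrating plasmids), which are used to integrate the transforming DNA into the genome. The plasmids pYES2 and pYX424 and the pPICZ plasmids are of particular interest.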
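If filamentous fungi, for example Aspergillus nidulans, are used as heterologous PUFA producers, promoters from the corresponding organism can likewise be used. Examples are the gpdA promoter for enhanced expression and the alcA promoter for inducible expression. Yeast plasmids such as pHELP (D. J. Ballance and G. Turner (1985) Development of a high-frequency transforming vector for Aspergillus nidulans. Gene 36, 321-331) and selectable markers such as ura, bio or paba are preferably used to transform filamentous fungi. 3' regulatory elements from filamentous fungi are likewise preferred.
PUFA can be produced in insect cells by means of baculovirus expression systems. Such expression systems are commercially available, for example from Clontech or Invitrogen.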
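Vectors such as the Ti plasmid from Agrobacterium, or whole viruses such as cauliflower mosaic virus (CaMV), geminiviruses, tomato golden mosaic virus or tobacco mosaic virus (TMV), can be used for the transformation of plants. A preferred promoter is, for example, the 35S promoter of CaMV. Other options for plant transformation are the calcium phosphate method, the polyethylene glycol method, microinjection, electroporation or lipofection of protoplasts. Transformation by bombardment with DNA-loaded microparticles (gene gun) is also preferred. An alternative route to PUFA production in plants arises from the transformation of chloroplasts. For example, N-terminal leader peptides make it possible to transport proteins into the chloroplast. A preferred leader peptide is derived from the small subunit of ribulose bisphosphate carboxylase, but leader peptides of other chloroplast proteins can also be used. Stable transformation of the chloroplast genome offers a further possibility. The biolistic method in particular, but also other methods, can be considered for this (Blowers et al., Plant Cell 1989 1 pp. 123-132; Klein et al., Nature 1987 327 pp. 70-73; and Schreier et al., EMBO J. 4 pp. 25-32).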
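Commercially available expression systems can also be used for mammalian cells. Examples include viral or non-viral transformation and expression systems, such as lentiviral or adenoviral systems or the T-REx system from Invitrogen. Likewise, the Flp-In system from Invitrogen can be used for targeted integration of DNA in mammalian cells.
The nucleic acids and amino acids that form the basis of the method according to the invention are described below by means of several examples. However, the sequences and the invention are not limited to these examples.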
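Brief description of the drawings
Fig. 1 depicts the position of the PUFA-PKS genes from Ulkenia sp. on the genome. In addition, the individual domains of the PUFA-PKS encoded by these genes are shown. KS: ketosynthase, MAT: malonyl-CoA:ACP acyl transferase, ACP: acyl carrier protein, KR: ketoreductase, CLF: chain length factor, AT: acyl transferase, ER: enoyl reductase and DH: dehydratase/isomerase.
Fig. 2 shows a comparison of ORF 2 and ORF 3 from Ulkenia sp. with the corresponding homologous ORFs from Moritella marina (GenBank accession: AB025342.1), Photobacterium profundum SS9 (GenBank accession: AF409100), Shewanella sp. SCRC-2783 (GenBank accession: U73935.1) and Schizochytrium (GenBank accession: AF378327, AF378328, AF378329). Gene transpositions within and between individual ORFs in the course of evolution are indicated alongside the domain structure.
Fig. 3 shows a comparison of ORF 1 from Ulkenia sp. with the corresponding homologous ORFs from Moritella marina (GenBank accession: AB025342.1), Photobacterium profundum SS9 (GenBank accession: AF409100), Shewanella sp. SCRC-2783 (GenBank accession: U73935.1) and Schizochytrium (GenBank accession: AF378327, AF378328, AF378329). The ACP domains and the number of repeats of the amino acid stretch LGIDSIKRVEIL are highlighted.
Fig. 4 contains a sequence comparison of ORF 1 from Ulkenia sp. with ORF A from Schizochytrium. The degree of partial identity between the two sequences is approximately 81.5%.
Fig. 5 contains a sequence comparison of ORF 2 from Ulkenia sp. with ORF B from Schizochytrium. The degree of partial identity between the two sequences is approximately 75.9%.
Fig. 6 contains a sequence comparison of ORF 3 from Ulkenia sp. with ORF C from Schizochytrium. The degree of partial identity between the two sequences is approximately 80.0%.
Fig. 7 depicts the sequence comparison, performed with FASTAX, of the PCR product described in Example 1 with database sequences (Swiss-PROT full database).
Fig. 8 shows the vector map of the cosmid SuperCos I (Stratagene) used to produce the cosmid library of Example 2.
Fig. 9 depicts the sequence comparison, performed with BLASTX, of the PCR product described in Example 3 with database sequences (Swiss-PROT full database).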
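Examples
Example 1:
Amplification of PUFA-PKS-specific sequences from DNA isolated from Ulkenia sp. SAM2179
1.1 Isolation of genomic DNA containing the genes encoding the PUFA-PKS
50 ml of DH1 medium (50 g/l glucose; 12.5 g/l yeast extract; 16.65 g/l Tropic Marin; pH 6.0) in a 250 ml baffled Erlenmeyer flask was inoculated with Ulkenia sp. SAM2179 and cultured at 28°C and 150 rpm for 48 h. The cells were then washed with sterilized tap water and centrifuged down, and the cell pellet was frozen at -85°C. For further workup, the cell pellet was transferred to a mortar and ground to a fine powder with a pestle under liquid nitrogen. About 1/10 of the powdered cell material was then mixed with 2 ml lysis buffer (50 mM Tris/Cl pH 7.2; 50 mM EDTA; 3% (v/v) SDS; 0.01% (v/v) 2-mercaptoethanol) and incubated at 68°C for 1 h. Then 2 ml phenol/chloroform/isoamyl alcohol (25:24:1) was added, and the mixture was agitated and centrifuged at 100000 rpm for 20 min. The upper aqueous phase was removed and transferred, 600 μl each, to two new reaction vessels; each portion was again mixed with 600 μl phenol/chloroform/isoamyl alcohol (25:24:1), agitated and centrifuged at 13000 rpm for 15 min. Then 400 μl of each upper phase was transferred to a new reaction vessel, 1 ml ethanol (100%) was added to each, and the vessels were inverted two to three times. The precipitated DNA was then wound onto a glass rod, washed with 70% ethanol, dried and dissolved in 50 μl distilled water. The DNA extracted in this way was mixed with 2 μl RNase A and stored at 4°C until use.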
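1.2 PCR reaction using motif-specific oligonucleotides
The PCR primers MOF1 and MOR1 were used as motif-specific oligonucleotides.
MOF1: 5'-CTC GGC ATT GAC TCC ATC-3' (Seq ID No. 81)
MOR1: 5'-GAG AAT CTC GAC ACG CTT-3' (Seq ID No. 82)
The genomic DNA from Ulkenia sp. SAM2179 described in section 1.1 above was diluted 1:100. 2 μl of this dilution was then transferred into a 50 μl PCR reaction mixture (1x buffer (Sigma); dNTPs (200 μM each); MOF1 (20 pmol), MOR1 (20 pmol) and 2.5 U Taq DNA polymerase (Sigma)). The PCR was carried out under the following conditions: initial denaturation at 94°C for 3 min, followed by 30 cycles of 94°C for 1 min, 55°C for 1 min and 72°C for 1 min, and a final 8 min at 72°C. The PCR products were then analyzed by gel electrophoresis, and fragments of suitable size were inserted into the vector pCR2.1 TOPO by T/A cloning (Invitrogen). After transformation of E. coli TOP10F', plasmid DNA was isolated (Qiaprep Spin, QIAGEN) and sequenced.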
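The sequence data obtained were compared with the official EMBL nucleotide sequence database (http://www.ebi.ac.uk/embl/) and evaluated. The sequence comparison obtained with FASTAX yielded, for the main PCR product from Ulkenia sp. SAM 2179, a partial identity of approximately 90% at the amino acid level with the acyl carrier protein of the PUFA-PKS (ORF A; ORF: open reading frame) from Schizochytrium sp. ATCC 20888 (Fig. 7). Surprisingly, only a single PCR experiment was needed to identify this PUFA-PKS in Ulkenia sp. SAM 2179. This demonstrates the particularly high effectiveness of the oligonucleotides used.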
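The primers MOF1 and MOR1 are described as motif-specific. A short sketch can check what they encode: translating MOF1 directly, and MOR1 as its reverse complement, with the standard genetic code. The link to the repeat LGIDSIKRVEIL highlighted in Fig. 3 is an inference drawn here from that translation, not a statement made in the text; the helper names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna: str) -> str:
    """Upper-case a DNA string and drop the spaces used in primer notation."""
    return dna.upper().replace(" ", "")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return clean(dna).translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna: str) -> str:
    """Translate complete codons in reading frame 1; trailing bases are ignored."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


MOF1 = "CTC GGC ATT GAC TCC ATC"  # forward primer, Seq ID No. 81
MOR1 = "GAG AAT CTC GAC ACG CTT"  # reverse primer, Seq ID No. 82

print(translate(MOF1))                      # LGIDSI
print(translate(reverse_complement(MOR1)))  # KRVEIL
```

Under these assumptions the two primers together spell out the twelve residues LGIDSIKRVEIL, which would explain why they amplify the stretches between neighbouring ACP repeats.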
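Example 2:
Production of a genomic library from genomic DNA of Ulkenia sp. SAM 2179
50 μg of genomic DNA from Ulkenia sp. SAM 2179 was partially digested with 2.5 U Sau3AI in a volume of 500 μl at 37°C for 2 min, then immediately extracted with an equal volume of phenol/chloroform, precipitated with ethanol and dissolved in distilled water. The Sau3AI-digested genomic DNA was then dephosphorylated with SAP (shrimp alkaline phosphatase; Roche) according to the manufacturer's instructions. The enzyme was subsequently inactivated by heating the reaction to 65°C for 20 min. The cosmid SuperCos I (Stratagene, Fig. 8) was used as the vector. 10 μg of SuperCos I was digested to completion with XbaI at 37°C for several hours. The enzyme was then heat-inactivated at 65°C for 20 min, and the cut cosmid was dephosphorylated with SAP (Roche) according to the manufacturer's instructions. Here too, the enzyme was inactivated by heating the reaction at 65°C for 20 min. The XbaI-digested and dephosphorylated SuperCos I cosmid was then digested to completion with BamHI at 37°C for several hours. The cut cosmid DNA was then extracted with phenol/chloroform, precipitated with ethanol and dissolved in distilled water. For ligation, 1 μg of the XbaI- and BamHI-digested cosmid DNA and 3.5 μl of Sau3AI-digested genomic DNA were combined in a volume of 20 μl and ligated for several hours with T4 ligase (Biolabs) according to the manufacturer's instructions. About 1/7 of the ligation mixture was then packaged into phage using Gigapack III XL Packaging Extract (Stratagene) according to the manufacturer's instructions. The phage were then used to transfect E. coli XL1-Blue MR. PUFA-PKS-specific cosmids were subsequently isolated from the gene library by QIAGEN (Hilden, Germany) in the form of a PCR screening using the Ulkenia-PKS-specific oligonucleotides PSF2: 5'-ATT ACT CCT CTC TGC ATC CGT-3' (Seq ID No. 83) and PSR2: 5'-GCC GAA GAC AGC ATC AAA CTC-3' (Seq ID No. 84). The cosmid DNA of the cosmid clone C19F09 identified in this way was then isolated and sequenced (Seq ID No. 1).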
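The cloning strategy above relies on Sau3AI-digested genomic DNA being ligated into BamHI-cut vector. The sketch below shows why this works: both enzymes leave the same 5'-GATC overhang. The recognition sites and cut positions (Sau3AI ^GATC, BamHI G^GATCC) are standard enzyme properties; the function names are illustrative, and the 20-base test string is simply the first 20 bases shown for Seq ID No. 1 in the sequence listing.

```python
# (recognition site, number of bases before the cut on each strand)
SAU3AI = ("GATC", 0)    # ^GATC
BAMHI = ("GGATCC", 1)   # G^GATCC


def five_prime_overhang(site: str, cut: int) -> str:
    """Single-stranded 5' overhang left by a palindromic site that is cut
    `cut` bases from the 5' end of each strand (requires cut < len(site) / 2)."""
    return site[cut:len(site) - cut]


def find_sites(seq: str, site: str) -> list:
    """1-based start positions of every occurrence of `site` in `seq`."""
    seq = seq.upper()
    hits = []
    pos = seq.find(site)
    while pos != -1:
        hits.append(pos + 1)
        pos = seq.find(site, pos + 1)
    return hits


# Both enzymes produce the same sticky end, so the fragments can be ligated.
print(five_prime_overhang(*SAU3AI))  # GATC
print(five_prime_overhang(*BAMHI))   # GATC

start_of_seq1 = "ggatccacagcgttcattta"  # first 20 bases of Seq ID No. 1
print(find_sites(start_of_seq1, BAMHI[0]))   # [1]
print(find_sites(start_of_seq1, SAU3AI[0]))  # [2]
```

Every BamHI site contains a Sau3AI site, but a Sau3AI/BamHI junction only regenerates a BamHI site when the flanking bases happen to fit, which is why such junctions are not always recleavable with BamHI.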
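Example 3:
Identification of ORF 3 from Ulkenia sp.
To identify the ORF from Ulkenia sp. SAM 2179, oligonucleotides were derived from highly conserved sequence segments of different PUFA-PKSs. Interestingly, a very high partial identity between individual species, which appeared suitable for PCR amplification, was found in the region of the sequence segments encoding the dehydratase/isomerase.
3.1 Isolation of genomic DNA containing the genes encoding the PUFA-PKS
See Example 1.1
3.2 PCR reaction using PUFA-PKS-specific oligonucleotides
The following PCR primers were used as PUFA-PKS-specific oligonucleotides:
CFOR1: 5'-GTC GAG AGT GGC CAG TGC GAT-3' (Seq ID No. 85)
CREV3: 5'-AAA GTG GCA GGG AAA GTA CCA-3' (Seq ID No. 86)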
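The genomic DNA from Ulkenia sp. SAM 2179 described in section 3.1 above was diluted 1:10. 2 μl of this dilution was then transferred into a 50 μl PCR reaction mixture (1x buffer (Sigma); dNTPs (200 μM each); CFOR1 (20 pmol), CREV3 (20 pmol) and 2.5 U Taq DNA polymerase (Sigma)). The PCR was carried out under the following conditions: initial denaturation at 94°C for 3 min, followed by 30 cycles of 94°C for 1 min, 60°C for 1 min and 72°C for 1 min, and a final 8 min at 72°C. The PCR products were then analyzed by gel electrophoresis, and fragments of suitable size were inserted into the vector pCR2.1 TOPO by T/A cloning (Invitrogen). After transformation of E. coli TOP10F', plasmid DNA was isolated (Qiaprep Spin, QIAGEN) and partially sequenced.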
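The two PCR protocols use different annealing temperatures (55°C for MOF1/MOR1, 60°C for CFOR1/CREV3). As a rough plausibility check, primer melting temperatures can be estimated with the Wallace rule, Tm = 2(A+T) + 4(G+C). This rule of thumb for short oligonucleotides is brought in here for illustration; the text does not say how the annealing temperatures were chosen, and the function name is hypothetical.

```python
def wallace_tm(primer: str) -> int:
    """Estimate the melting temperature (in °C) of a short primer with the
    Wallace rule: 2 °C per A/T and 4 °C per G/C. Spaces are ignored."""
    seq = primer.upper().replace(" ", "")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if at + gc != len(seq):
        raise ValueError("primer contains characters other than A, C, G, T")
    return 2 * at + 4 * gc


primers = {
    "MOF1": "CTC GGC ATT GAC TCC ATC",
    "MOR1": "GAG AAT CTC GAC ACG CTT",
    "CFOR1": "GTC GAG AGT GGC CAG TGC GAT",
    "CREV3": "AAA GTG GCA GGG AAA GTA CCA",
}
for name, seq in primers.items():
    print(name, wallace_tm(seq))
# MOF1 56, MOR1 54, CFOR1 68, CREV3 62
```

On this estimate the longer, GC-richer CFOR1/CREV3 pair melts at a higher temperature than MOF1/MOR1, which is at least consistent with the higher annealing temperature used in this example.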
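The sequence data obtained were compared with the official EMBL nucleotide sequence database (http://www.ebi.ac.uk/embl/) and evaluated. The sequence comparison obtained with FASTAX yielded, for the main PCR product from Ulkenia sp. SAM 2179, a partial identity of approximately 80% at the amino acid level with ORF C of the PUFA-PKS from Schizochytrium sp. ATCC 20888 (Fig. 9). Surprisingly, only a single PCR experiment was needed to identify this PUFA-PKS in Ulkenia sp. SAM 2179. This demonstrates the particularly high effectiveness of the oligonucleotides used. PUFA-PKS-specific cosmids were subsequently isolated from the gene library described in Example 2 by QIAGEN (Hilden, Germany) in the form of a PCR screening using the oligonucleotides already used for the PCR, CFOR1: 5'-GTC GAG AGT GGC CAG TGC GAT-3' (Seq ID No. 85) and CREV3: 5'-AAA GTG GCA GGG AAA GTA CCA-3' (Seq ID No. 86). The cosmid DNA of the cosmid clone 058G09 identified in this way was then isolated and sequenced (Seq ID No. 2).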
Sequence Listing
<110>Nutrinova Nutrition Specialties and Food Ingredients GmbH
<120>PUFA-PKS genes from Ulkenia
(PUFA-PKS Gene aus Ulkenia)
<130>SCT064799-47
<160>86
<170>PatentIn version 3.1
<210>1
<211>43372
<212>DNA
<213>Ulkenia sp.
<400>1
ggatccacag cgttcattta ctcaagatca cactcgtgtg cagtccttga accttgggaa 60
agctcatgtc tctaggtatt gctgtcatgg tttgaaattt tgtcctcaaa agaatcgctt 120
gtaatttttc acttggtggg gtgcacaatg gtctctcaga accatctgct ctaaggagtc 180
ctactgacac ctacctacca cccttccttc atacccatgc ctactaacca acctattgat 240
aactctaacc agggttctat gataggcaaa tcagccaatc tcccgtggaa attagtcttt 300
tcaatcgttg gccagcaagc accatcgcaa cgacagcgct gcatcagcag gaactcgagt 360
acgcttcacc gtcatcgtca tcggtatcac cactattcat gaaatcagaa cctagtcacc 420
cagttacttt ttacgaggca gttgattctg tggagagatg ctcctgatca atggatatgt 480
ctattttatc tacaggtcac acataatcaa tcattcgggg tcatgatttt ccgccatggc 540
gatagtccaa aaaaactcag gaggcaaaat cattgttcaa tttacaacta cccacggagt 600
aaattaatgt aagagctcca atttacaggc aggtatatca tcacggtgtg ctgcagtagg 660
ttctgggtta tcatcctcaa tcattcataa acataacatt cattcataaa cataacattc 720
attcattcat aaacataaca ttcattcatt cattcactca ttcactcatt cattcattca 780
ctcattaatc cgcttaattt aactttaaat tgattgattg attgattgat ggcagaacca 840
cctattagca attggttact ccttgtattg aaaggcctga ataagtaagc aagcaagcca 900
ttggtaaacc ttcctcgccg cgactcgagc gacctcgaga gcggtctgag tgagtctctc 960
acgcaggccc cccgcctcct gagccgtctg tctcgctcaa ctgaagctcc gacaagccaa 1020
gctcacagct gcaagcttgc aagcaagctc gcttctgtct actcgtcctg catcgaatca 1080
acaaccttct cttacgccat gacggacgcc tcttccgaga tgcgcaagcg taagcgctac 1140
gcataccgca tcctcactga tgagtcatcc tcctcccatg caccctctgc tgaggatggt 1200
tccgtgcagg actctcgtat gctccgccat gccggcagca tctgggatgc cgaagagcgc 1260
cgccgcgctg gcaaaatgtc ctcttccgca actgcagcca tgtccagtgt acctcctgga 1320
gaggaactct ggcttgtgtc tatccctgcg gacttcgacg cccatgacct caatggcctt 1380
cgcctgtctg ggaagaagcc cctcgcggac caagaaatcc aaattggcgc tacccacacg 1440
ctcactgctg acctgctctc gggctcttct caggtgcggt gcctgcgccc tactagctcc 1500
tatgtcaacg gcctgaggct tacaccgcct gccgcgcgtg ttttccacgt cgtagagcgt 1560
gatgccgctg atgatgaggc cagtgaagcg ggaggcagtg cccaagagga ggaggagcgc 1620
ctgcgcaagg ctgaagaggt cgtcaagaga cttttgccga agccgcgtga gcaaattgaa 1680
tttaggactt tttctatggc cgacaaagag gaactgctga agcgcatgca aaaggcaaag 1740
gcgcgtggag agaagaagag gggcagaaac gcgattaagg aagaagcaga agacgaggag 1800
gacaaggagg aagagaagtt ggtggccaag acagcaaaga aggacaagaa gaagggcaag 1860
aaggaaaagg agaaaaggcg caagtctgtg gcctgagctg gaaacccctt taaagtgaat 1920
aaaggctgtc ttgacatgtt caagaacgct tattcgatac atgaagacgt gctctggggt 1980
tatttcgatg aagcctgatc taaatactag tctgcttcag aatcatgcac agtgttcaaa 2040
ttgattctta actacagcct acgctgaagt tcagcttcaa attttggtct attttgaagt 2100
tcttcaccga aagtcatttc tagagtcccg ccccaaagtc tgatctacac tctctactcc 2160
attaccgcta atatccttta caactcttat ctttttcgac ttcttcaagc gctaaggagc 2220
ggaccactaa actgatgcaa gcttgcatca actctacgac cttttttatg tcaacacaag 2280
ttctggcctt acgctgaact cgtctctgat acacaatatg caacgaacac cgccaagacg 2340
gtcgctcatg cacatacgca cacatatata caaccaaaca tacaaataaa cacataagca 2400
ttggtcaagc cagctacagg accaatattc catcttttgc tgcttttctg caatttgggc 2460
cgctttttta tgtttggctg tatatatttt tcttggcatg caacctaaca agacacatga 2520
gcagaaaaaa taaatacggt caaagtcttg tctctgatgc tcatgtcttt cttctaatct 2580
taccagcgag aagacctttc taaagaataa tatcacatat actcaattgt ccaaattgct 2640
ttcaataagc attctttact ggatagctct cgccaaactg tcattcttag gaacactgct 2700
aatacgtggc tgaaagcact cccaacatgc acttttattc ctatgcattt tcttcttgga 2760
gctcaatttg acaaaatgcc ggtcgataag ctcgcggtct tgactttgat gcttacttcc 2820
ttgtttaact cgaaaacctt ctcatggctc attggaaaat catcaaatgg attatctatc 2880
atcttcactt aacccaattt ttgtttctct aaaacagccc caactatttt ttaaagaaat 2940
ttgtgtgctc tatcttctgt ttgcaactca aactaacaag ccacatcaac aaacatttat 3000
ttttttcaaa cttgataact ttagaccaac tttgcatcct cgatgctcgg gactccatct 3060
taccccttgt caggtatgaa gcatctgatg aagcttgcag tattattacc ttttccagaa 3120
cactactgct accttcaaag atttgttcat ttcttttctt tgggggaaac aatgaatgct 3180
gattacccga agcgtaatat ggttgttgca tatattcaaa tattttaaac cttctaagta 3240
tttatatgat aggtatatgt tatttttaaa gacctttaat gcagttattt catatcaata 3300
accaagctct cgcagttttg cgctgtactg gcagtggtgg aggacccgtt gatctttata 3360
aaataggatc actggaggaa ggtgagacca ggaaactaag actatataag tttgtgggtt 3420
tctgtcattg tcactgacaa ggatcaaagt tatcctaatg cagagcatcc aacctttgtc 3480
tcagggaccc acccaatcca ctcttcaagt tttcactttc aatttcaggc caatttaaga 3540
caggaataca actcaaacta aatcaggatt cttctttttt aactcccagt catgcgatct 3600
ttaaaattga tcacattgcc ggcataataa ccatgggttt cgcaacttcc tccctggttt 3660
ctttgccaaa taaaacttcc acacactcga gagcaaactc cattgccgtg ccaggccctc 3720
tagacgtcac aattttggcc tcgtgctcaa ccaccacgcg atcctctgaa catcctcctt 3780
gagcgctctc aagatctttc gcaaatgccg gatggcacgt ggctctgcgc cctttcacaa 3840
tacccaaagg tgctagcacc accgctggag cggcgcaaat tgcggcaacc caggctccgc 3900
gcgagttctg tgccaacaat agcgagcgga gaggctcgct cgcggcgaga tttgaagcgc 3960
cgggcatccc gccaggaacg attacgaggt cgaaagatgg agatgtagaa ctggcatcca 4020
ggaggtcatc caaacgaaca tcagcctcaa tacgcacgcc tcgagaacag gtgagggttt 4080
ttccgctatc gcaaacagcg gcaacaatca cggaggcacg ggctctgcgc aggacatcaa 4140
tcggaatcac gctttccatc tcttcactgc catccgccat gacaaccagc acggacggtg 4200
gggaggaagg ggaagaagac atcgtcgaat tatgggaaac gtcgagactg gagcaagcgg 4260
gggcgattgt ttaagcgagc acaaagtgac gaggaattga gttacaatgt gaatctatag 4320
ataaataggt acctgtgcct tgcgacgaca gaaagatatt ttctcataat aggcctatct 4380
aaaaccaata attttgaaca ttttcatcat tgacgaaaag ctcctgcctt ccaaattgga 4440
agtgactatc cttaatatag tgcaataacg cattggacca aacagaatcc tcctggaggt 4500
gaccaccatg ttaggacctt gaacttcgca attgattggt ttcgaccttt tctccctcct 4560
tttataaaat aagcggctca aattaattag cctatcacgg tttctctagt ttttgggggt 4620
ttcgctatta tttggttatt atgaacaaat gtacagcttc ttacttacca gcctcctcgt 4680
tcagcatggt gaatgcatga aataaggaat caacttcatg actcatgctc tgcgtacaac 4740
attagattat ttttgcatgt ggtgttgaaa gtaagtcttc aagtcttttt cgtcaggata 4800
aaaactttct ttcatttgaa gttgtatgca agtcgcacca agatgtgatg actattttgc 4860
ttttcattaa ctttcctttg cagcaaaaaa gctctgtgcc tatgaaagcg ttagaactta 4920
cttatataac ctccaaatgg tagtgactat tccacctaaa ttacatatca taatgattta 4980
agtctttgtt aaaaagtgga tgtttggtaa gaaactggaa taactaaggg accactaagc 5040
tccagacact acaagtgaag caaatcttca atttaaatta tcaaagtact tcaaccaaaa 5100
ttttagcgtc tcaacaagta cccttcgtgt gctatcccgg aggcaatcac atgtgcacaa 5160
gtaacgatgt tgaacgtacc tatggctctg gtttattttg gcagccatga gcaacgcaac 5220
actgaccgta tctttctcta cgctacaatg tcctccgcca agcaaaaaga gaatatccca 5280
gctcatttgc aaagccgaga ttttattcct gccagtggtg tcaactggtc atttacggag 5340
aggattgcac ttcaaagcca tgcaatgaat gtggtattat ccacgacaat cttggaaaat 5400
ccaagctttt aaaatgcccc aaaaccatgc aaacacgtag ccgatcgtga tatccacgcc 5460
ctccagctgc gccacctatc caaggacatg gtttaagaat tgtcgtttgg tcatatgtta 5520
gttttcaacc cgcaattggg ccttagtcca ccttgttacc ataggaaatg caagctttgc 5580
aaattttgta ggctaatctc taagtgtagc ttttgtcatt gtaaagacac aattcattga 5640
catgaggttg aaagctgttc tcatatgtaa caatccgcaa cattgactac gtcacatgtt 5700
cgtgcataga gggaacactt atcttgcata gtatgccctc acaactctcc tcccccgtac 5760
agcaatcgca cgcaccatca tttattcaaa tgagacaata cttgctatcg tcccgattgc 5820
tctttagttg gacatagaac taaatgcgcg tcgcgatgcg accggaaagg tttaccagca 5880
gactgttctg caatcgttcc gtaccctatt tcacaacatt agtcgatcga tcagaacaaa 5940
tcaagataga acctgcagga ggggtcgcgc aaagtttagg cacccaggca cagccgctct 6000
gtaagtggat tttcattcaa ttgtggtcct gtgcattcat tgtttgctcg tgtagcaaat 6060
agaaccacaa ggggttttgc agaaagaaaa caaggatcat ggggcgaaac cgaggccaga 6120
cggcgggacc actcgaccgc cagtcgaggt tcatgaccaa ggttctgcgg caccgcgcgg 6180
cagacatggg tcttgaaatg cgttcagatg ggtttgtgcg cgtagaagac cttctgaaac 6240
ttcagcaact taaagacatt ggccttgagg atgtcaaagc tattgttgct gctgataaca 6300
aacagcgatt tggccttcag caggaagagg accagacctg gtggattcgt gccaaccaag 6360
gtcactctat ggctagtgtc gagacagaag atcttcttga ggaggttgac ctcgatggga 6420
tttctctctg tttgcacggc acctatttgc ggttctggcc attgatagta cgcgatggtt 6480
taaagcgtat gcaacgtaac catatccact ttgcaacagg ccttcccggg gacgatggtg 6540
tccttagtgg atttcgcaac tctgctgagg tgcttattta tcttgatacc gtgcaggcga 6600
aaaaagctgg actcaaaatg tatcgctctg caaaccaggt gctcctaagt ccaggtcttg 6660
gcgacagtgg agtaatccct gtcaccttgt ttgctaaggc tgtcgagcgc cgctctggaa 6720
agctactttg gccaatagag gaaggtaaag agtcgcaacc ccctacagcg cctacttcag 6780
accaccaacc tcgacaagga caactagcaa gtaagcgaaa agctggtggc cacaacaaga 6840
aactatcgca catgcttagc cgtgtcctgc ggcactctgc agttgatgaa ggaatcacca 6900
ttcgtgaaga tggcttcgtg cgccttgaag atctccaaac caaactcaag cgtttcgaaa 6960
atgtaactct tgatgacgtt caagctgtgg tgcgtgacaa tgacaaacaa cgcttcacac 7020
tacgccagga gtcagacggg tcctggatta ttcgcgcaaa ccaaggtcat tccatggctg 7080
ttgtcaaaga atcttttctc ttgcgggaac ttgaccctac cacaattgat gtgtgtcttc 7140
atggtactta caaagaagct tgggcaaaga ttcgaaaaac tggtctctcg cgcatgaacc 7200
gaaaccatat tcactttgct cgtggattgc cctccgactc caatggtgtt atcagtggca 7260
tgcggaaatc atgcgaagta catctctata ttgatgcctc tgcagcaggc aaagatggga 7320
ttaaattctt tgaatctgac aacggtgtta tcttaagtcc tggtaatggt gatggcatta 7380
tccctcctaa atactttaag tctgtcacag atcgccaagg cgcttcctta gaaaacctaa 7440
aatgacaaat tatgtagatc ttagttgttg aggacttcat gtcctttttg ttgtttgatt 7500
ccttgtatag cttatacacc ctggttatgt acattgtcat tcttgttaga ggcaattctt 7560
catctttgat tgatattcta tagaacttcc tcatgggtgt acctatacac aattatttat 7620
tataccgtgt gatattgtga ggttctaaag ttagcatcgc ctctgacacc tatgatggat 7680
gcagagtgac gccaatcctt cctctatatt gtgcgtgcct gctcgagaat caaatgatgt 7740
taaaagtcgt cttcattcat tatataacag agcataatgg aataataaaa ggaggcagga 7800
gacaagggta cttctgttgt gtaaaattcc attactatgt tcgtgtatag tagtattcct 7860
tgcctttagg atagtaggga agatattctc tgtgactttc acctacttca ctcttatgca 7920
agctcttatg caatcacaga tggatgtaga ttccgcttct tcattctcac tacgagaaca 7980
gcgcaactac aaatcttaag gactgtcaac tggcctgaaa tagtgaccaa ttatatattc 8040
caaaataaat ttatttgtat aaaattgtaa agatgcagca tgatagctta ggtacacata 8100
aacaacggtt aagtgtatag ggatacgcaa acgcaagcga gaacatgcaa gcgagaccat 8160
cgcctttcac cataatgtta taaatgtcta ttcttctgcc aagagcacga tacactcaac 8220
gttggtctaa gcactaaaga cagcatgtat ttatgtaagg acaacaacaa gcacctatac 8280
ctcaaaactt agtaataggc ttactaaaca ttctaacact atgatcttca tgtgaaaata 8340
ctcagcagca tggatgttga agctccacaa atggaataca gaaaacacaa tctagcaaga 8400
cgatgaaaat tgttcttagg tttcaggatc agaataacca aaatgcgcac cacacctgtt 8460
tctgatgctg tagctgtcat gttatggtaa aaacgtgcac agggcaccac tagcctgtta 8520
ttgtgtcgat tttgatacag tttatcacac gagagcttac tgactatgtt gtagaatgta 8580
aataccctat tcaaataacc ttgtggacac actcatccaa catactctac tcaactctta 8640
ctaaaacaac caaaagattc cgctgaacta gaccaaaata atttgagtga tatgctgcaa 8700
ttcgtttgaa cacaatacat gtattgatgg ctgagatatg acttgccaaa gattgttcgt 8760
tgcaattaaa gtttactctc tgagtgcata tactcaatac aatgcagctt tatcgtggaa 8820
atccgggcta agcatgccat taggacccta tagcaggctc tgggcacgat ctttatatct 8880
tagcgatagt ttgtgcagca aaataatgga taaatcaaac ttcaacgagt cttaattcat 8940
agtttcgaat ccctacgagg ctatatatat aaagaaggtg tgagtcgaca gcacagttat 9000
gtaggaaaag ttataattat gtggaaaata accttagttg tcgaatcgtg gtgaataaaa 9060
gcttcattta agcgttttca gagatgccgg agcccatacc aaatattaat ttgctcaaag 9120
tcatcaattt cttatttgat agaatctaaa acagctttat attatatgaa gagcatatat 9180
attttaagct agtttagact tcaaccaagg ggatccaatt ttcgctcgtc actctgcgtc 9240
aaggtcgttt gcaaaaacat caaatctggt gcaagctcaa atgactaggg tcaataagga 9300
ctcctactaa ttatagttgt cactattatt tccactagga accgataaaa cagatgtaat 9360
taactctctt ggcgcttacc ttgtatagca agagtaaaga gtaaatgatg cggcaaaaac 9420
tatctctgtt acttatatgt tatagagtgc attggctgcg ccatgccata tgatagtagg 9480
taaactttgg aagttgaaag gggcgagaaa gggatcacag gtgatctata tataaaatgc 9540
aaatgaaaat tttaaagttt ggaaagttta tatgcgacac ataaaattat aatttgcata 9600
tgtggattaa gtgaatggaa tgagtctagc tataactact acctatccct atcataatca 9660
tgggaacaga tcaggagcaa attgggctta caggcgctca gtgggcacgt agatgtcatc 9720
aatctcggca gcaacctgct tggcgttagc cttcagcggg gcattacgga cagcttcgag 9780
gcggcgcaag aagcaggcac cacggaggat ctgcaagttg atttgcacaa catcggggta 9840
ctcgttggca acggcggggt caaggtaggt acccttgatg aagtcgttga aagatccaat 9900
cgctgggcca caccaaacct ggtagtccat ggcacggtcc gggatgccag cgtttgccca 9960
gaagctcgcc aaaccaaggt accagcggaa gcacaaggac atcttaagct tggggtcacg 10020
ctccgcgcgc tcaatcttct ccgggttctg caacctgttg atgtagaagt ccttggtctc 10080
ttcccaaact tctgacagag acttcttgaa aatgcgcttc tccacacgtt ccagctctcc 10140
aggagccatg gactcaaagg agtcatactt gacgaagagc tcatagagct tgttggcacg 10200
cgaggggaac atagttccct tcttgagcac ctggagcttg acaccttcct caaacatgtc 10260
agctgctggg gccatgcaga tgtcggagta ggtggcttgt gagagctgct tgcgaacggt 10320
gtcacaggtt ccagcttgct tactcatctg gtttacggta ccagtgacga tgaaggccgc 10380
gcccatgttg aaggtggcaa tggcggcctg agggcatcca atgccaccac cagcaccaac 10440
gcgaacgcga aggtgggcag ggtagccgca ctccttgtgc agacgatcac ggaggttgac 10500
aatgagaggg aggatgacgt ggatggggcg gttatcggtg tggccaccgg agtccgcctc 10560
aacggcaatg tcgtctgcca caggcactgt gcgtgcgaga gcagcctgct cttgggtgat 10620
ctcgccggac ttcagcagct tctcgaggag attctcgggc gcgggacgga taaacattgc 10680
ggcaagctct gtgcgagaaa ccttaccgat gacgcggttc ttaataaccg tggagccatc 10740
agcagcgcga gagagacctg cagcacggta gcgcacgagc tgcggggtca aggtcataaa 10800
ggcggaggct tcaacgacag tgacgccctt ctcgaggaag aggtcgacgt tacccttctc 10860
gaggttgctg tcgaagggag agtggatgag gttgacagcg taagggccct tgggcagttc 10920
agcctggata gcttcgagag ccttgcgtac ggtggcgata ggaagaccac cagcaccgag 10980
agaaccaagg atgccgcgct ttccggcagc gataaccatc tcagcggatg caatgccctt 11040
tgccatggcg ccggtgtaca tgggggcgga tacaccatat gtctccatga aggcacggct 11100
gccaagatcc ttgatatcgc acttgggcac aacaatagat gcttcacttg ggcttgcttc 11160
aacgagatca ccgttggcgt tgacaccaag catcaaagtg ctgttgagct ccaaaagttt 11220
ggcacggaga gcctcagagg aagccacaac agctccggag acgctgcggg ccggagcagc 11280
aggggcaagg gcaggagcag cggaaggttt gttatccttg ttgagaatag ggtcgcgtgt 11340
ggcgtcttgc tcctgaatgt cgagacgctc catgaacttg ggggcaatag gctgcatctt 11400
gcgagcctgg ataagagcct cgatcttggg gtccgcagga ggaagcttag ctagcacctg 11460
gggcggcacg agctgctttt tggggtcata gcgaccattg accacaatct tacgcaagaa 11520
cttgttctta gtaggcttct tgccagccac catatcgttg taactctgcg tagcctcctc 11580
aacagtctcg gggtggtaca gaggggagac cttcacgcca ggcacgcggt gggcttggag 11640
agaggcaacc agcttgacca tggttgtcca agcattctcg ttctggcggt ccatggatcc 11700
ggtgacaaaa ggcttgctat ttccaagggt ggcgcgaatt gcggcgctac ggtgggcgtt 11760
gggaccagtc tcaacaaaga cgtcaaagtt cttgtcgcta acggtcttgg cgatcttagg 11820
aaagtctgcc tgaacagtgt acagctgtgc tgcgtattca ccaaagctgg gtgcgtactc 11880
gtcgctggct ccagtggact tgttaacaag cttcttctgg ttgacgctcg tgtacaggtc 11940
aaggccggca acctcgggaa tctcgaggac gctatggatc tcagcgatct gcttgccgta 12000
cggctcgacc acggggcagt ggccacacat accaaggtcc acgggcaaag cagggaggtt 12060
gctgctcagg cgagcaatgg cagccttgca atcttcaggc ttgccactga tgagagcact 12120
gttggcatcg ttgacaatgg tcaagtgcac gtacttattg ttggggccga tggccgcttc 12180
aacggcctcg cgggttccac gtaccacgta tccttgccag aactcgctga caggggtatc 12240
ttggggaata ttccaggcct tgcggagggc gtcaaactca acagcgaggg ccttacgcca 12300
gacctccgag ttgcggagtt tagttgtcag ctcctcagag acaaggccgt tcttctcaga 12360
aaaggcaaaa accatggaaa tctctccaag gctcagtccg aaagcagcct tgggctggat 12420
gccaagcacg tcgcgagcga tgtgggtgaa gcacatggac atgagaatac cgagtcggaa 12480
catctccacc tggttgcggt tgaactcatc ttcctgcgcc ttaagctcct ccttcgtcga 12540
ggcgcgcggg atcaaccatc tgtcgccttg atcccaaagc ttgttggtct tggcgtttac 12600
aaactcgtga agttcgggcc agatgcggtg aatgtcaagg ccgataccat agtaagggct 12660
tcggccttcg ccgtacataa acgcaacgcg atcgcttgac agtggcttgg gtgcaaagtg 12720
gctgcccgag ggtgatgtcc agtcgcggcc catcttaaga ctccgcggga tgcccttgga 12780
ggcgagttca agctccttct ggagcttact aggagaggtc accaggcaca gagcgaaggc 12840
cggcaacggg gtcttggtct cctgggcaat gctctcgccg agcaactcca taaaagcaag 12900
acgtacatta gcgctaggct gggcgaggcg ctcgcggagc ttgtcaacac gctgcgtgat 12960
agcgtcatgg gagtctccgc ggattacgag gagtttgacg gcatcgtcat cgagcgaaat 13020
gcggctcttg gtctcgtggt ggccctccac atcagagagc agcaccgtgt agcatgaacg 13080
ggtctcggaa acacctgaga cagctgcgtg gcggcgagct ccagggttct tcaaccaggc 13140
ccgcgaggac tggcacgcgt acagagactt gccccactgt gtctcaggtg caggctcctc 13200
ccaggaggcg ccgtttgagg gcaagtagcg gttgtacaga cagagagccg tcttgatgag 13260
actggcagct cctgaggcgt agccggtgtc accgacagtg gacttgacgc tgctgacagc 13320
gacgttgtgg ggctccacag cttcgttgct agagcgctgg ctgagaatgg cctcaatgcc 13380
gcggatttcc tcctcagcag tgagttcctt aggcagaacg gaggggttct tgaggtggcg 13440
ggcagagtca gcggagagct cgagcatctc aacgtccttg gggttgacgc gagcctgggc 13500
gagagcctcc tccatgcagg ctgccggcat gttgccgggc acgatagcgt ccatgcaggc 13560
gtaaatgcgt tcgtccttgg tgcagtcgct ctcgcgcttg aggacgaggg caccacatcc 13620
ctcaccaaca aagtagccgt cagcgccgga gtcgaagctg gcccgcgggc tctcctgctc 13680
cgagaccttg aaacgacgcg acttcacgta gagattctca gcgctggcgc aaagatccac 13740
accggcgatc actacggcct cgacctcgcc agtctcgagc aagtacttgc ccaactctgc 13800
gcaacggtag acggagttgt tgccctctgt gatggtgaaa gaaggaccct cgaaacccca 13860
ttgtgaagac acgcgggtgg ccacgaggtt gccgatgtag gatgtgtacg aggtagcggt 13920
accgcaatcg ttgatgtagg acatcatatc attgagggct gaagcggctt cgggacgagc 13980
acgctccttg agggcaacgc gggcgcggtg acggtagagc tcaaggtcag tgccaaggcc 14040
gacgaagaca gcgaccttac ctcccttctt gaggccagag ttgagaatgg cacggtcgat 14100
ggttgtgaca gcaagtagct gcatggggcg caacatgtcg tctggcgtca tgggcgtgcg 14160
caggcggcta aagtccacct cgacgtcctc aatgtagcat ccgtggggca cctccttgac 14220
accgcacagg tccaaaaagt ccttgtcttt accaaggaaa cgccagcgct tctcaggcaa 14280
tggcacagca ccatgttggc cattgtagat ggcacgctca aaggcgtcca ggcccttgag 14340
ggagccgaag gtggcatcca taccggtaat agcaatgcgc atgttgccct ccccgccaca 14400
acgtgagctg agggaactga tgctatcgtg ggtggcacag gcagccttgg agcggtcaaa 14460
ctcctcaaag actgcgtggg cgttggtgcc accaaagccg aaagcggaga gaccagcgcg 14520
cttgggctcg ccctcagtgt cgggccatgg gatgggctca gagaccacaa gcgggtccat 14580
ttgggaagat ccatcgacac caggagtggg cgggatcaca ccatgcttca tggcaaggag 14640
taccttgcac atgcctgcga aaccagctgc aacgagtgtg tggccaaagt tacccttgga 14700
gcttccaaag cgaggcacct tgccctcgaa gcaagccttg acggcatcaa tctcaacgcg 14760
gtctccctgg ggagtacccg ttgcgtggca ctcgacgtac tggatcttgt gcgggtgcac 14820
gttgacgcgc ttgtaggtat caatgaggca ggacttctcg ctgggcaagt gcggcttgag 14880
gggaagacca cagccagcat tgctgatggt agcaccgagc agagtaccgt aaatgtggtc 14940
tccatcgcga atagcgtcgt caaggcgctt gagaaccata atggcaccac cttcaccagg 15000
ggtgagaccc tgactgtcct tgtgaagcgg gtacgagatg ccgtctcccg atacaggcat 15060
ggcctggaaa gtggagaatc cggagagaat gaaaaagggc tccgggaagc aagttgcacc 15120
agcgagcatg acatcagcag caccggaaac gaggtggtcc tgggcgaggc gaaggacgta 15180
aagggcggtg gcacaggcag catcgacaga gtagtgaaga ggaccgaggt tgagctcttc 15240
tgctacgaag gatgccgggt ccataaagat gcggcggtca ccagcctcgg ggttctgcga 15300
ctgctcacgc tcggaccact tggaggcatc cttgaagacg cgagcgccga gtttcttttc 15360
gacgtggttt tggtacacat tgaggagttc gccctggagg ttgtccatgg gaaaggacag 15420
gcatccgctc acaataccgc accttgtaga gtcggagacc gatgtctcgg agagagcctt 15480
cttggagagc ttaaggagaa gctcgtgttc gttatcgacg gagtcatcga cgcagccgta 15540
gttctcgttg caaaaggtat ctgcaaattt gctacgctct gctttgaagt gctcggctcg 15600
cttgttggat ccgaggcgtt tatcgctaat cttagtccat gcagcctcac cgcccatgac 15660
tactttccag aactcttcct tgtctttgca gcccgcgtat tgcacggcca tgcccaccac 15720
ggcaatgcgc ttctcgtcgt gcatttcgtg agcagcgctc acattcttgc gagaggccat 15780
ctttttgctt tcttgttgct gcttactgta aacaaaaaaa agagcttgcg tgtcacctga 15840
ccggcacttt tagatcgatc aaaaagcggt cgtgtagatg gtttgctttg gaggagatgt 15900
ataaatgatg tgattgacta ccttgagcaa gtgattacag ggatgccaga gcaatcaaat 15960
aatcaatcag ttaatcaacg ccgtaataaa ggctatcaat caatcaatca atcaatcagc 16020
caactagcta gccgaagctg cgatggactg gcgtttggac agcgcgaagc tgtaggaact 16080
ggcgccgcac gagctgcgag gctgccaagc tagaggctgt ctgcctttgt ctcactcctt 16140
ttccgaggaa ggagagagag agagagagag agagagagag tggggggatg aaagtttgga 16200
tgcacgatgc gtgctttgtg gtttgtttcc ttgtttcttt ctttgcttgt tttttctctc 16260
tttttctttg ttattttgtc tctcttgaag caaatagaaa gaacctcgaa ctagacgctc 16320
caaagggtct tcaagaggtc tcgaaggcta ggctggcgaa agcgcgcacg ctggtcaagc 16380
aagcaagcaa agcaagcagg caagcaagca agcaagcaag caagcaaagc aagcaagggg 16440
tggattccac gaatgcgaga agtcaaaact ctgcttcaaa cagagaacaa atgggcaaac 16500
gaatgaggat aaatgagcaa ctaagtgaag tttacatttt caaaactcaa caaaacgatt 16560
acccaatcaa ctatgagacg cgcagacgtc tgcggcagca tctcttttat gattttcaaa 16620
aacaaaaaca aaaaccaaaa caaaataatt tgcaacaaat taatgaaaag cgaaacaaca 16680
aacagaaaca ttgtttaaac taaaaagtca tttttattga aaatctgttc ttttcatctg 16740
tacgtatgta tgtttgtatg tacacacttt gcttcatcgg tttattcgag tgctcttcat 16800
tcttgaaatt gccttagttc ttgctgttat aactgtcaaa caaacctcgc gaccttgaca 16860
agcagctcca cctcaccttc gggcctgctc gtttgccttt ctcgcttttt tcgcgatctt 16920
ctgccatcct tgcctactct gtccttatct catcaggctg ctgcggcctc ttgacctagc 16980
agttcaagta taattaattt gaaaataaac aaaaaaacac tgccacttat tatgcagatg 17040
gcactctctc agtgttgcaa aagtagagtg aaattctggt ttacaaaaaa tatttattta 17100
ataaacaaat aaaataaata taaattcatg ttatgttaga tcattttatt ttgttttctg 17160
agggcgcgat aaacgcttac ttgagaacca agaaaagcaa gaaaagcaaa ggtgcgaaag 17220
aagcaaacac attgatttcc ctagttccca ccacttcttt ctttctttgt ttgtatattt 17280
gtttgtttct ttctttcctg ctttgttttg tttgttttgt ttgttttgtt tgtttgtctg 17340
tttgtctgtt tatctgtttg ttagtttgtt agttactaga ctgctaattg atttgaaaac 17400
caagccaaac ccacgcaatg aatacgcaga aagcacagct aaaaagaaga agaagaggag 17460
gaattccgaa tcaggcgaga aagtctcgaa agcagtgcac caaaatcctc atttggaatc 17520
aaagccctcc ttcccagcga ctacggaggc ccacgacgac gacgacgccg acgacgccgc 17580
ccgcccgccc atcctcctct ctctccgcct gctcctcgtc ttctccctcc ctccctccct 17640
ccctcgcgca cgccgctccg aatggaatga catgactgac gcaagcgcgc aatggccgcc 17700
gtgcgatggc tcgaagcagc atcgcatcgc attgcattgg cattattcat tgattcattc 17760
attgattcat tcattaattt attcatttta attcattcat tcattcaatc attcatttat 17820
tcattcattc attaatttat tcattttaat tcattcatta atttattcat taatttattc 17880
atttttattc attcatactc ccgagcgcta cccggcgcta ggtgggtgct aggcgtggat 17940
ggagcggacc tctctgccag cagaaagagg aatgaatcta tctggatact gcgcgcagct 18000
tcttgcttgc tttgcttcaa cttgcttgca aacagccagg aggccgaacg gcttcgaccg 18060
ctcagcgtgt tcgccagcaa agaaccacct ccgccctcgc agtcgccgga tggatgaacg 18120
agcgaatgcg aatcctcctc cgatcttgaa cctcgaacct tcaatcaact tgccttaatt 18180
ttactttcat gactctcact attttaaata tacatgtatg tatgtatgta tgtatgtatg 18240
tatgtatgaa tgcacctcat actgataggg acctgcgggg gactgatacc acctgtctga 18300
atcaatttgc gagaccgcga gactgagtgg caggtagtag ctagctaagt agctgcctaa 18360
gagtctatcg gcatgcatga atcaaaaact atcatgtcaa tgttcctttg aggcttcgaa 18420
gtccgtcatt tgtcacgaaa ggttttgggt gaacgatcca ctgtttcgag agagatggtg 18480
tgaatgtata ggtgatagtt gccgagctgg cgagccgtcc caagcggtgc cggcactcac 18540
ccggctgaag cttcttacat gctctccgtt cataatcgtc caaattgatc ctgattcatg 18600
attcatgatt catgattcat gatgacacga gttggagttg gacgataagt cagcgctcgc 18660
tcaaccaaac tacctctgct cgcctagctg ctgttaggta gtgctactga ggcaggaccc 18720
aacttgaagc tacctactgc ctaggtattc ctacgctgtt tcgctgattt gcaatctctt 18780
cgttaccaag agataaaatt aacgagttat gacattgcgt atgcagacta cataataaag 18840
attgtgtcat ttatttataa gtggaaaggt gtaagatcaa gaactaagca ctaggtagca 18900
attaggcgtt atttgttagc gcgtggaaga aaatgcctct ggacagatag ctattaatag 18960
ctattaatag ccggtgttgt atttacaacc ttctgaaaga atttctccat agaggaaagt 19020
aaagaaacat cttattctgt gaaaagagat aaacaacttt ctagaaaatg gatgacagag 19080
caaagaaggt cgatcgtctt caaccgcaga tctgggaatg ctaaggttgg cgccaggctt 19140
acattatgcg tcatgctgac caaagggcgt aaagtgccga tgggcatccg atatatgcgc 19200
gttcaaggtg aggaattcaa gatcatcaag tttgtttgaa tttcgaggtt gaaaacacag 19260
agttttgaca atcgatcaat caatcaatca atcaatcaat caatttaaaa ccaatttaaa 19320
accaaatgaa tgagtgaatg agtgaatgac tgaatcaatt taaactaaat gaatgaatga 19380
atttaaaacc aaatgaatga gtccttagcg atttcaagtt ctgcagtgaa atctacaaat 19440
ctacgacgaa agtagtgaga tcgtatcaac gtgtatagac agacaatgat gctgcggata 19500
cctaagtgct tgcgtggagg gactacgatg cagatcccga gttttaggtc ctagttcctc 19560
cgttctctgg taaaaaagaa agcctctcct tcttgacgcc attcagcgac gtggaacaag 19620
cgagacagag gcacaagttt tggagtcatt gagtcgggtc tgctctgctt tgaggatgaa 19680
ccaacgacct tcggagtctt gcagatagat ggtccattct tcaaacgaca cagagatcgt 19740
cgtctcgcgt aagttggcag tgggtctaga gctagctaaa aacatctgac agagagcaca 19800
tacagagcta aagaggagtg tactcggcaa aatagcgtgg acggatgaca tcatcaatcg 19860
ctcagctttt tcgtttctta ccaaaaaatt gacaaaccag agaaataaat agattgactc 19920
aacaaattaa attaaaacaa taaattaaaa aagatctctt aaagaagttt tctgaaagaa 19980
accaaaaaca ataaactctg cgacaagaac ttgaggccag aagggatgaa gaaggtacgt 20040
atctagatgg tgactgggga cacaaagaag caaggtctga attctcagaa gccagctgca 20100
gccagccagc tactaggagt gtctgccagc tccgtcgtca tgccacgagt gtccctgcca 20160
acgcttcaag cgtacttgca acttttattt gattactaca ttactacatt ataacttcat 20220
ctatagcttt aaaaaggaaa taaaggaaat aataaaataa atcaaataat ggtaaaaagt 20280
tataaataat caacgactaa aaaggaattt tattcgaagg tcctcggcag gaaataagtg 20340
gaatcaaaga gaaggcggga acggtaggga ccatacatga tagtcccaaa ctgaggaact 20400
acgaattgcg gggctaagca aattcatagg atcccagtta gggacagacc ctcgaggtcc 20460
gagttggtat cctgggccaa agcttgcgca agggtgctct agagctacaa ctcaatacca 20520
gtagttgcat ggccatctct gatagctttc ttcatgaata tggggtgagc ttagagacaa 20580
gcagtagaca ctctgtgacc tacgagctat atttgctgtc gcagagcatc tcctcaaaat 20640
aattcatcga agaaagacgg attgaaagtt ttgccttatt tgaacaaagt taatatttta 20700
actctcggta gttaaaccat gatagctcat ttatagcgta ggctgacaca gaagcgtagg 20760
ggcttagacg tcatgatgat tcgtgatgaa ataaatcaag gattctcgaa cgttgacacg 20820
cgcaatggag cgtgccaatg tcaaaagggt attgctgtat catcaacgta ggtaggtagt 20880
caaacgggct acagctctgt cctattcact cactaagaca aaatgttttc tctcaaacgg 20940
ccagctcgaa agtaatattg ggagcaagaa tgaaaatcat tctccagtac acttgcagtg 21000
agatcaagtt tcaagaccat caaacgatac gatacaggag gtactatctt tgctgaagtc 21060
agtagcagca gcattacgag cctggtagat ataaattgat aaaaagacaa gaggtatatc 21120
atatttcaga gtagagtaca tactgagctg gaaacataaa actagtgcac gcaatcgacg 21180
gttcaacttt tctcaagacg cttccagtcg tttcttaatt agctcagatg gtagcaaaag 21240
tgatatgcgc atcagacttt cgtaaacgta aaactcggca tctgtagatg ttgagtcatt 21300
gttttcttca ataatttact tctcgcagca gtgcacttgg aaaggtttgt caagtttgac 21360
ccagctaatg aaacacaaca tcatcaggcg gggctcgaaa agtagatctg aaagtctata 21420
aagaatgaaa gttactctca acacagaaag caatttgtgc aaacataaga gagaatggcg 21480
tctatgctgc aagagaaaat tcgacggtcg catcatagtc gtctacactg ctgtgcatgg 21540
gcaatttata atatcatgtc tgatcacggt ttctgagaac atttaaacga aataagtcaa 21600
aacgaatgcg ctctgtcgcg attatagttt tgttctgaca gtaactccta accaaagggc 21660
caaataagga cgagagaata aaatagattg ctctctcact tcggacccag gaatcccgaa 21720
tttatataat ttcaatgtac tcacgtaaca ctgacaagct atgcggcgtc aataactcat 21780
ccacgttggg agaatctcga aacaacgcaa cgagttattt tatcctgatt aataatctag 21840
cttgaaccgt ttgttgtaac tagaacccaa gctgcaaaga gctacaacca aggtttgatt 21900
tcgttccaag ctaacatgaa actctcaaac ttcgtcgatt tttttaatgt ttgtcaaaaa 21960
cctagtacag cggtcctagg taccgatttg agaagcaggc aacccgctta taaataaaag 22020
aaaaagagtc tttattattt tataaataga aaaaacttta attgggacaa tattctttat 22080
gtgttctctg tcttcttcct tcatgtatga cgtaatgatc atgctccttt catctccttc 22140
cttccaaaaa gttcattttt cctactaggt ctttttcaaa attaaaaata taattaagta 22200
agaaagaaag aaggaaagaa agaaaaacct gggtactaat cagtgtgata tgaggtgaat 22260
ggtggttttg ttttacttct cggaagtgtc gagtcctata aggagcacta tacctatcct 22320
agacgctttt ggtaccaagc cctgcgcggc aggcatacgt cagcaagcta cgatagcagt 22380
acacgctact cagaaaggcc tagtgaggta ggcgagcagg aagtagtgct cttgcgtcat 22440
gcttatgatg gcatcagcca cgcgagaacc tcattcgaat agtccttttg caattcattc 22500
acgcatgcat gcattgatgc ctgctacaga gtagctagtg agagagtatg atacttagtt 22560
agtgctactt atgcgttgtc acctatgcaa tagcattgga tagaaggaat cagattcacc 22620
gctgactctc gctgagagta agggccatac gcagtgctcc tgagttgttt cattaaacgg 22680
acttcaagct gagttctggc taggcacctg gtagctgggg ctagagggta cctacctacc 22740
tacctactga tagctaactt tcaaatgagg aaagattgga gattgaatag aaagaaagtg 22800
atacatactg tcagccgtat cgaaactccg aagtggcacg cggatggcgt cagcaaactg 22860
ccgtagcaag tgaataacgc acatctcaat tgggacgtcc atgaaaacaa aaaacaaaaa 22920
agcaaaaaaa agttgcaatc gatcatgaat cgtgctgatt catgggttgc ttgcttagtt 22980
gttatgctgg agggtgtcga gacttggatc tggtgagcag tgcgctctcc actcaagttg 23040
gaccctttgg tatcagggga gtgcgagtgg gcacactacc atagtatcct aaattacctc 23100
tacgttttga ttgcctttga tcacagcaga taattttcaa tttaaataaa aatcataaaa 23160
agaagaagaa gaagaagaaa gaaagtgaag gtggcgtttc tgatgtcatc attttcgcag 23220
tgcttcccag cgaagattta ctgtgaacta ctacgcatgt gagtatggca agcactgggt 23280
aagtaggtac ctaccactac catgttgtaa aacaaaacaa ggaatatgtt agctagaaca 23340
gagcgaatcc ggtgtgagtg ggagtcatca tcagatattg aaagttgtcc tctcaattaa 23400
tataaatatt tctaactaaa gcaattaaac atatatttat taatttaatt ataaattaaa 23460
taaatatgct gggtgggtcc gagtcattct gactatcatc tatgatgttt aataataaaa 23520
tattgaaagc agtcaaggtt atttggaatt atgggatgat cgtgatctgt gtatcattct 23580
gcatcattgt ggatgctggc ctacgaaact acgacggcat tgcaattgcc acctggcggt 23640
gcgatcgcgt gcactcctgc aattgcgagt gtcttccgcc ggcttcaagt tgaggtgctg 23700
cgacagtgcg ggcccagagc tcctaacatt tcgtggatga ccgactgact cagacagagg 23760
tctctcaagc ttagaaagtg cgctgcaaaa aagggcgcta gctagataag atacgagtga 23820
gtgagtgagt gagtgagtga gtgagtgagg ttctagctag tgctcctccc aaatcttgga 23880
gtgccgatgc tcgagaatac atacatactt caagacacga agaacttgaa cccgaagacg 23940
aatgccgtct tcgacgtcat ctttgccgtc gtcatggccc actgcagcaa cgatccagtg 24000
cgtgcgagca gcagggccag cccacgatca cgcagctcgt cgggctggac ttggctcaat 24060
gaatgaatga atcaatcaat gaaagaatga ctcaatgaat caatgaatca gcaagttgcc 24120
accaaagccc atcgcaacga cgggtcctgc ctgcgtgcgc cattcttagg atccagagca 24180
agcaagatct tcttcaccta tcgctcagca agcgagaacg caacctccct ctgcatcatg 24240
atgcaggata agtaagataa atccatcttg gacctcgagc tcaaatcgac gcttgctgca 24300
tctatctatc tttgtatcta tctatgtatc tatctttgta tctatgtgtc tatctatctc 24360
tctgcgtgcc tcgtcgtgtt tttgaaaagg agtttcgatc gtggcccaat cggaagagaa 24420
ggctctctct ccctctctct ctctctctct ctctctgcat cgcacagacc aatgagcctt 24480
gcggcaacac agcttcaact tcattgcagg atccaatcca tccaaggcat cgcttgggct 24540
ctcagtgaat gaattcgacc aaagctcgtt ggcaggcaga caaggcctgg acaacataaa 24600
gcaagggggc acgaaggcaa gatggcaagg aggcagagca ggcaccagcg actgcgatgc 24660
tggcgagaga agatcaaggc aaagcagagg ctgcaagcaa gctctgcagt agccacctcc 24720
tcagcagatt cgtcaagatc gggcaaactt cgtctgtggc tgccacgcca gagcagagca 24780
tgcctgcttc atgatccatg ctcaagaaag aaagacagac aagacagaca agacagatag 24840
atggatgaca gcgaacttac atttgcagac ttcgaaggtg cctgacgggt attggtgcca 24900
ctaagacgag aaggagcact tgcttccaga tcgctcacgc cgctcacatc accatgctac 24960
gtcttcaata cgcctggtcc ggttcgcaag agccgcgcgc cggcgattgg gcgaaaggcg 25020
gaggagtcga ggtacgcgtt atcagcagaa tgtaggaaca ccgcgacgcg gccgacgacg 25080
ctggtgagga ggaagaaaga cctggcgcct gtacgtacgt acctacgttc tagcagtagc 25140
ttgaagtgga ctgtgggtcc cctccatctt cttcaagacc ttcaagttgc ttgctgacgg 25200
catcgctgtt tgtttgtggc tgttaggtag gtaggtagct agctagctat agctgtgtcc 25260
tagctgcaca gggagcactc agcctctttc ctagtttctt tggttctgtg cttgtttttc 25320
tagcgagtcg tgcaaataac ctgcggcggc cacgagaagt ccgcgttgag gcgatcttgc 25380
gccagtgcgg cagttgccat cactcgtgca gacagagttg agttgcttct caatcgttac 25440
caatcgctcc aagcaggcct agacatagat tttccttctc tggaccatct actaaaatga 25500
tcaagttaga taggtagata gatagataga tagatagcta gggagatact aggcaccttc 25560
tatgccggca cgtctcgaac aaagcgaaga aagagctgtg ggcaagagca ctcattttga 25620
tcgtagatga tcgtagacgc gctgtagagg agagctctta gtggcggcta ctgtgatgga 25680
ctatgagagg ggacttcgca agacctgtct cggtcgcacg tagctgtggg aagcgagaac 25740
ccgcagagga ctgattctga ttagtgcgga taacttggtc gaggaagagc ggggacccgc 25800
agggaacccg catagcagcg acgttggcac ccgacgacgc tagggcaaag acgcagcatg 25860
cgtgcgaggt gcctataagc tgcgcaattc agagaattaa gacagcagcg ctgggaagga 25920
aggaggagat ttgaaggctc ggcgggagct gtcgagatgg aggcaggcag gcaagcaagc 25980
aagcaagcga aagaggcggc cagggctcgc gtcgaagccg ctgatggacg agagaatcgc 26040
acgaagaaga atacggagtg tttgttttca aagccaaaga aagccaaagc caaagccaat 26100
tcgttcgttc gtgagttaac ttattattta atttaattga catcttcatt tactactgtt 26160
gttatctatt atttatttat ttatttattt atttatttat ttatttattt atttatttat 26220
ttatttattt atttatttat ttatttattt atttatttat tgtttatatt tttttaaatt 26280
aaaaaaattc aaaattcaaa attcaaaatt cacgaataaa ttgcacttga aggagatgaa 26340
gcaaagcttt gtttcttcta aaaagagtat aaataataca aagtgatgac ggaaagaagc 26400
atcattctga tggtaagcac ttcggcaaga tgcacgcact agcacttgtc gccttgcttg 26460
cgatccgcgg aggtaatagt ggaggcgaaa gaaggagttc attcctgtta tttcgcgctg 26520
gggttacagc agtgccaaga tttcgaatat ttgaattttt gaatttttga atttttggat 26580
cttcgttccc cttcttcctg aactgttcaa acgactcgga ggttgtcgat cggatcactc 26640
aatctctcaa tctctcactc actcactcac tcactttttc tcagctgcct gatccttcgc 26700
aatgctcgcg aagcgcgagg gatatgcgtg ggcgagcacg caccatcttc tctccacgcg 26760
taaagaagag cagagccaga ggcaggtagg tatctccacc catctcaggc tgtgacttct 26820
ttgtttcttt ctttctttgc ttgttttctg ttctctctct gtgctctgtc cacacgagaa 26880
agagaaagag agagagaaag aaccacgggt ttatagagcg cactcgtcct tcctgcttca 26940
gcagaaagca ctgcgtagga gaactacggg ggaggaggaa gcacgcacgg aggaggcgtg 27000
gaaggaagga ggagacagag agagagagac actgagggac agagggggag aggcagaggg 27060
agaggcatct gatgtttgcg agaaaccaat aagttttgaa agtgatttga tttagctgat 27120
tgactgatct atggcctgaa agaaagcttt taaagcggag ggagatagat gacgagggca 27180
gctgcgatgg cgtacggcgc atccgtctct ctctgtgtct ctctctcttt ctctctcgtc 27240
agggcgtgga gacctcggaa gctgcacgcg gcgcggtgag gaggcagggc agcagaggga 27300
gaggagagat cccagagtcg aagagcattg attgattgca gatgatcttg ggcaacgcgc 27360
gtcagcttga gcgaggaatg ctttggactt caggttcttc gcttctgtgt ttcattcttt 27420
ctcgaagaaa gaaagaatga aagaaagaga gaaagaaaga aagaaagaaa gaaagaaaga 27480
aagaaagaaa gaatgaatga atgaaagaaa gagagaaaga aagaacgaat gaaagaaaga 27540
gagaaagaat caaagagaaa gcgcattcgc agttcttctt cgtgaaagaa aaggaaaaga 27600
gaggcgatgg taggctctga tctcatcatt tctggtttct ctgttgtacc tgtactctgt 27660
gcttgtggcc ttgcgaaggc tgaagacgcc atgcagacaa ccacgcctcc gcagagactt 27720
tgcgggaaag cagagggctt ctcgccactc tcgaagaaac gagctcgcca gttttcgggg 27780
ttgttctcag aattgcgagt gttggcttta tatgggatga tggtatggca cttcgtcatc 27840
gttactctcg ctcgcttgct tacgaagatt ttcaaaaggg cgaaagaagt gctcagcttt 27900
taaaataaag tcacaccaaa gactaggccg catagcagaa agctaaagta aacccaatct 27960
gtctgaagag agtgtcgtgg ttagatactt acgcaagagt ttaaaagctg taaatagtac 28020
aggaacaaaa acaaataaat atatatatat tcttttttat tagtaaaaca tgaaaccaaa 28080
aaactccttt aaaataaaat aaaataaaat aaaataaaat aaaataaaat aaatttacta 28140
ctatatatac atatatatat acaataaata aaaacaactt tttcagacca gaaaaagact 28200
gagaaaaaag gaaactaatg actctcgagc accgagagcg atataagagt ggattatatt 28260
tgctaggccc accacgagtg agtcccctag gaggaagcgc cctctgagac aggagcagag 28320
gcgtcgctgg tgctccaaaa agcgacggcg aatggaaagc aaaacccttt cgagggaggc 28380
ttgtggccgt gactattcaa atctccagca tctcagctcc agcacagcag aagctacctc 28440
gcttctcagc tctagctatc acatcgatcg cagcatctag ctcgtagaca gctagcgccg 28500
caccttcccc caaatcaact tgggcaactt aactcttttt tcaccagaac tcctcttttc 28560
ctttaatctt cgaaaagaag acgaataaaa gagataatcc tctgccgcag cacattctaa 28620
aagaaaagcg gcatactggc gtaggcaaga ctttcaagct cttcctcgcc tccaccccgt 28680
atttccctgt tcatctttgt gaaacgagga aacaagaaat tttataggac aagatggctc 28740
aacgtgagaa ccgtctcgag gccaacatgg atacccgcat cgctgtgatc ggcatgtccg 28800
ccatcctccc ctgcggtacc accgttcgtg agtcttggga ggctatccgc gatggtatcg 28860
actgcctcag tgatctcccc gaggaccgcg tcgatgtgac cgcctacttc gacccggtca 28920
agaccaccaa ggataagatc tactgcaaac gtggtggatt catccctgag tacgacttcg 28980
acgcccgtga gttcggcctc aacatgtttc agatggagga ctccgacgca aaccaaaccg 29040
tcaccctcct caaggtcaag gaggccctcg aggacgctgg catcgaagcc ctcagcaagg 29100
aaaagaagaa cattggatgt gttctcggta tcggtggtgg ccagaagtcc agccacgagt 29160
tctactcccg cttaaactat gttgtcgttg agaaggtcct tcgcaagatg ggcatgcctg 29220
aggaggatgt tcaagctgct gttgagaagt acaaggccaa cttccctgag tggcgccttg 29280
actccttccc cggtttcctc ggcaacgtta ctgccggtcg ctgtaccaac accttcaacc 29340
tcgatggtat gaactgtgtc gtcgatgctg cctgtgctag ttctctcatc gccgttaagg 29400
ttgccattga tgagcttctc cacggagact gtgacatgat gatcactggt gctacctgca 29460
cggataactc catcggtatg tacatggcct tctccaagac cccggtgttc tctaccgacc 29520
ctagcgtccg cgcatacgat gagaagacca agggtatgct tattggcgaa ggctctgcca 29580
tgcttgtgct taaacgttac gccgacgctg ttcgtgatgg tgacgagatt cacgctgtca 29640
ttcgcggctg cgcctcttcc tctgacggta aggcctccgg tatttacacc ccgaccatct 29700
ctggtcaaga ggaggctctt cgccgtgcct acatgcgcgc taacgtcgat cccgccaccg 29760
tcactcttgt tgagggccac ggtaccggta cccccgttgg tgaccgtatt gagctcaccg 29820
ctctccgtaa cctcttcgac agtgcctacg gcaacgagaa ggagaaggtc gctgttggca 29880
gcattaagtc caacatcggt cacctcaagg ctgtcgccgg tcttgccggt atgatcaagg 29940
tcatcatggc cctcaagcat aagactcttc cggccaccat caacgttgat gagcccccta 30000
agctttacga caacactccc atcaccgact catcgctgta cattaacacg atgaaccgtc 30060
cgtggttccc tgctccgggt gtgccccgtc gcgctggtat ctccagtttc ggttttggtg 30120
gtgccaacta ccacgccgtt cttgaggaag ccgagcccga gcaccagaag gcttaccgtc 30180
tcaacaaacg cccccagccg gtgcttctga tggcatcttc aacccaggct cttgcttccc 30240
tctgtgaagc ccagcttaag gaattcgaga aggctatcga ggagaacaag accgtcaaga 30300
acactgctta catcaagtgc gtcgacttct gtgagaagtt caagttccct ggatctatcc 30360
cgagctctaa cgctcgcctc ggttttcttg tcaaggaggc cgatgatgcc accgagaccc 30420
tccgtgccat cgttgcccag ttccaaaagt cagctggcaa ggattcttgg caccttcccc 30480
gccagggtgt gagctttcgt gctcagggca tcaacaccac tggtggtgtc gctgccctct 30540
tctctggcca gggtgctcag tacacccaca tgttcagcga ggtcgccatg aactggcctc 30600
agttccgtga gagcatctct gacatggatc gtgcccaggc taaggttgct ggcgctgaca 30660
aggactacga gcgtgtctcc caagtcctct acccgcgtaa gccttataac tctgagcccg 30720
agcaggacca caagaagatc tccctgacct catactctca gccctctacc ctcgcctgcg 30780
ctcttggtgc ctacgagatc ttcaagcagg ctggtttcaa gcccgacttc gctgccggtc 30840
actctctcgg tgagtttgcg gccctctacg ctgctgactg cgtcaaccgt gacgacctct 30900
ttgagctcgt gtgccgtcgt gcccgcatca tgggtggcaa ggatgcacct gctaccccca 30960
agggatgcat ggctgctgtc attggaccca atgccgagaa gatccagatt cgcactgctg 31020
atgtctggct cggcaactgc aactcccctt cgcagactgt catcaccggc tctgttgagg 31080
gtatcaagaa ggagtccgag cttctccaga gtgagggctt ccgtgttgtc cccctcgcct 31140
gcgagagtgc cttccactca ccgcagatgc aaaacgcctc ctctgccttc aaggatgttc 31200
tctccaaggt tgccttccgt cagcctagcg cccagaccaa gctcttcagc aacgtgtctg 31260
gcgagaccta ctccaacaat gcccaggacc tccttaagga gcacatgacc agcagtgtta 31320
agttcatctc tcaggttcgc aacatgcact ctgctggtgc tcgcatcttt gtcgagtttg 31380
gccccaagca ggtgctctct aagcttgttt ccgagaccct caaggacgat ccttccatta 31440
tcactatctc tgtcaaccct tcctctggca aggatgccga tattcagctt cgcgaggctg 31500
ctgtgcagct cgttgttgct ggagtcaacc ttcagggctt cgacaagtgg gacgcacctg 31560
acgccacccg ccttcagccg attaagaaga agaagactac tcttcgtctc tcggctgcca 31620
cttacgtgtc tgacaagacc aagaaggctc gcgaggctgc catgaacgac ggccgcatgc 31680
tcagctgtgt cagcaaggtc atcgcccccc ctgacgccaa gcccattgtg gacaccaagg 31740
ctcaggagga ggttgctcgt ctccagaagc agcttcagga tgcccaggcc cagatccaga 31800
aggccaaggc cgatgctgct gaggctgaca agaagcttgc cgctgctaag gatgaggcca 31860
agcgtgccgc cgcttctgca cctgtgcaga agcaggttga caccaccatt gttgataagc 31920
accgtgctat cctcaagtct atgcttgctg agcttgactg ctactccact cctggtgctg 31980
tgtccagctc tttccaggca cctgttgctg ctacccctgc tccggtcgct gcgcctgttg 32040
cagctgctcc tgctccggct gtcaacaatg ctctccttgc caaggctgag tctgttgtca 32100
tggaggttct tgccgccaag actggttacg agactgacat gatcgagccc gacatggagc 32160
tcgagactga gctcggcatt gactctatca agcgtgtcga gattctctct gaggtccagg 32220
cccagctcaa cgtcgaggcc aaggatgttg atgctcttag ccgcacccgc accgtcggtg 32280
aggttgtcaa cgccatgaag gctgagatcg ctggcagctc tggtgctgcc gctgctgccc 32340
cggccccggt tgctgctgct cccgctgccc ctgcccctgc tgtcaacagc gctcttcttg 32400
ccaaggctga gactgttgtc atggaggttc ttgccgccaa gactggttac gagactgaca 32460
tgattgagcc cgacatggag ctcgagactg agctcggcat tgactccatc aagcgtgtcg 32520
agattctctc tgaggttcag gcccagctca acgttgaggc caaggatgtt gatgctctta 32580
gccgcacccg caccgttggt gaggttgtca acgccatgaa ggctgagatc gctggcagct 32640
ctggtgctgc cgctgctgcc ccggcccctg ttgctgctgc tccggcgccc gtcgctgccg 32700
ctgcccctgc tgtcagcagc gctctccttg agaaggctga gtctgttgtc atggaggttc 32760
ttgccgccaa gactggttac gagactgaca tgattgaggc cgacatggag ctcgagactg 32820
agctcggcat tgactccatc aagcgtgtcg agattctctc tgaggtccag gcccagctca 32880
acgtcgaggc caaggatgtc gatgctctta gccgcacccg caccgttggt gaggttgtca 32940
acgccatgaa ggctgagatc gctggcagct ctggtgctgc tgccccggcc ccggtcgctg 33000
cggcccctgc tccggtcgct gccgctgccc ctgctgtcaa cagcgctctt cttgagaagg 33060
ctgagactgt tgtcatggag gttcttgccg ccaagactgg ttacgagact gacatgatcg 33120
agcccgacat ggagctcgag actgagctcg gcattgactc tatcaagcgt gtcgagattc 33180
tctctgaggt ccaggcccag ctcaacgttg aggccaagga tgttgatgct cttagccgca 33240
cccgcaccgt tggtgaggtt gtcaacgcca tgaaggctga gatcgctggc agctctggtg 33300
ctgccgctgc tgccccggcc ccggttgctg ctgctcccgc tcccgtcgct gcccctgctg 33360
tcagcagcgc tctccttgag aaggctgagt ctgtcgtcat ggaggttctt gccgccaaga 33420
ctggttacga gactgacatg attgaggccg acatggagct cgagactgag ctcggcattg 33480
actccatcaa gcgtgtcgag attctctctg aggtccaggc ccagctcaac gttgaggcca 33540
aggatgtcga tgctcttagc cgcacccgca ccgttggtga ggttgtcaac gccatgaagg 33600
ctgagatcgc tggcagctct ggtgctgccg ctgctgcccc ggcccctgtt gctgcctctc 33660
ccgctcccgt cgctgccgct gcccctgctg tcagcagcgc tctccttgag aaggccgaat 33720
ctgttgtcat ggaggttctc gccgccaaga ctggttacga gactgacatg attgaggctg 33780
acatggagct cgagactgag ctcggcattg actctatcaa gcgtgtcgag attctctctg 33840
aggtccaggc tatgcttaac gttgaggcca aggatgttga tgctcttagc cgcacccgca 33900
ccgttggtga ggttgtcaac gccatgaagg ctgagatcgc tggcagctct ggtgccgccg 33960
ctgctgcccc ggccccggtt gctgctgctc cggcgcccgt cactgccgct gcccctgctg 34020
tcagcagcgc tctccttgag aaggccgaat ctgttgtcat ggaggttctc gccgccaaga 34080
ctggttacga gactgacatg attgaggccg acatggagct cgagactgag cttggcattg 34140
actccatcaa gcgtgtcgag attctctctg aggtccaggc tatgcttaac gtcgaggcca 34200
aggatgttga tgctcttagc cgcacccgca ccgttggtga ggttgtcaac gccatgaagg 34260
ctgagattgc tagcagctct ggtgctgctg cccctgctcc ggctgctgcc gttgcaccgg 34320
cccctgctgc tgcccctgct gtcagcagcg ctctccttga gaaggccgaa tctgttgtca 34380
tggaggttct cgccgccaag actggttacg agactgacat gattgaggcc gacatggagc 34440
tcgagactga gctcggcatt gactctatca agcgtgtcga gattctctct gaggtccagg 34500
ctatgcttaa cgttgaggcc aaggatgttg atgctcttag ccgcacccgc accgttggtg 34560
aggttgtcaa cgccatgaag gctgagattg ctagcagctc tggtgctgct gcccctgctc 34620
ctgctgctgc cgctgcaccg gcccctgctg ctgcccctgc tgtcagcagc gctcttcttg 34680
agaaggctga gtctgttgtc atggaggttc tcgccgccaa gactggttac gagactgaca 34740
tgattgaggc cgacatggag ctcgagactg agcttggcat tgactccatc aagcgtgtcg 34800
agattctctc tgaggtccag gctatgctta acgttgaggc caaggatgtt gatgctctta 34860
gccgcacccg caccgttggt gaggttgtca acgccatgaa ggctgagatt gctagcagct 34920
ctggtgctgc tgcccctgct cctgctgctg ccgctgcacc ggcccctgct gctgcccctg 34980
ctgtcagcag cgctcttctt gagaaggctg agtctgttgt catggaggtt ctcgccgcca 35040
agactggtta cgagactgac atgattgagg ccgacatgga gctcgagact gagcttggca 35100
ttgactccat caagcgtgtc gagattctct ctgaggtcca ggctatgctt aacgttgagg 35160
ccaaggatgt tgatgctctt agccgcaccc gcaccgttgg tgaggttgtc aacgccatga 35220
aggctgagat cgctggcagc tctggtgctg ctactgcctc tgcccctgct gctgcagctg 35280
ccgcccctgc tatcaagatc tccactgttc acggtgctga ctgcgatgac ctctctgtga 35340
tgtctgctga gcttgtcgac attcgtcgcg ctgatgagct ccttcttgag cgccctgaga 35400
accgcccggt ccttattgtc gatgatggta ccgagctcac ctctgctctg gttcgtgttc 35460
ttggtgctgg tgctgtagtt cttacctttg acggtcttca gttggctcag cgtgctggtg 35520
ctgctgttcg ccatgtccag gtgaaggacc tctccgctga gagtgccgag aaggctatca 35580
aggaggctga gcaacgcttc ggccagcttg gaggcttcat ctctcagcag gctgagcgct 35640
ttgcccctgc tgacattctt ggtttcaccc tcatgtgcgc taagtttgcc aaggcttccc 35700
tctgcacccc tgtgcagggt ggccgtgcct tcttcattgg tgtggcccgt cttgacggtc 35760
gccttggttt cacctcccag ggatctactg actccctcac acgtgcccag cgtggtgcta 35820
tcttcggcct ctgcaagacc attggccttg agtggtctgc taacgaagtg ttcgcccgcg 35880
gtattgatat tgctcgtgag gtccaccctg aagatgctgc cgtcgccatc actcgcgaaa 35940
tgtcctgcgc tgacaaccgt atccgcgagg tcggcattgg cctcaaccag aagcgctgca 36000
ccatccgtgc tgtggacctc aagccgggtg cccccaagat ccagatcagc caggatgacg 36060
ttctccttgt gtctggtggt gctcgtggta ttactcctct ctgcatccgt gagatcaccc 36120
gtcaggtccg cggtggtaag tacattctcc tcggtcgctc caaggtccct gctggtgagc 36180
ctgcttggtg caacggtgtt tctgatgacg atcttggcaa ggctgctatg caggagctga 36240
agcgtgcttt ctccgccggt gagggcccca agcccacccc gatgacccac aagaagctcg 36300
ttggcactat tgctggtgcc cgtgaggttc gttcctcaat tgctaacatt gaggctctcg 36360
gtggcaaggc aatctactcc tcttgtgatg tgaactctgc tgctgatgtc gccaaggctg 36420
ttcgcgaggc tgaggctcag cttggcgccc gtgtaactgg tgtcgtccac gcttctggtg 36480
tccttcgtga ccgcctcatt gagcagaagc gccccgatga gtttgatgct gtcttcggca 36540
ccaaggtgac tggtctcgag aacctctttg gtgccattga catggccaac cttaagcacc 36600
tcgtcctctt cagctctctt gctggtttcc acggcaacat tggtcagtct gactacgcca 36660
tggctaacga ggccctcaac aagatgggtc ttgagctctc tgaccgtgtg tccgtgaagt 36720
ctatttgctt cggcccctgg gatggtggca tggttacccc ccagctcaag aagcagttcc 36780
agtctatggg tgttcagatc atcccccgtg agggtggtgc cgatactgtg gctcgcattg 36840
tcctcggctc ctcccctgct gagatccttg ttggcaactg gaccactccc accaagaagg 36900
ttggcagtga gcccgttgtg atccaccgca agatcagcgc tgcatccaac ccttttctta 36960
aggaccacgt catccagggt cgctgtgtgc tccccatgac cattgctgtg ggctgccttg 37020
ctgagacctg cctgggtcag ttccctggat actccctctg ggctattgag gatgctcaac 37080
tcttcaaggg tgtcaccgtt gacggtgatg tcaactgtga gatcactctc aagccttccc 37140
agggtactgc cggccgcgtt atgattcagg ccaccctgaa gaccttcgct agcggcaagc 37200
ttgttccggc ttaccgtgcc gtgatcgttc tctccactca gggaaagccc cctgctgcta 37260
ctacttccca gaccccctct ctccaggctg atcctgctgc ccgtggcaac ccttacgacg 37320
gcaagaccct cttccacggc cctgccttcc agggtcttaa ggagatcatc tcttgcaaca 37380
agtctcagct tgtcgccgag tgcaccttca ttccgtcttc cgagagcgct ggtgagttcg 37440
cttctgacta cgagtcccac aaccctttcg tcaacgacat tgctttccag gccatgctcg 37500
tctggattcg ccgcaccctc ggccaggctg ccctccccaa ctctatccag cgcattgtgc 37560
agcaccgtgc tcttccccag gacaagccct tctacttgac cctcaagagc aacagcgcga 37620
gtggccactc tcagcacaag acctccgttc agtttcacaa cgagcagggt gacctcttcg 37680
tggacatcca ggcttccgtc acctcttctg actcccttgc cttctaaagt tgtgaggctg 37740
tcttgtcttg tcagtcgcga aagtgtaagc aagaactttg tcatacaaag aagcaaccaa 37800
cttccgaacc aacacacctt gtaggattac aaccacaact ttctataaat agtgcgcaag 37860
aataaccagt aagctatcct tcgtgtacct gttacaacaa cgacattttt acttgatctt 37920
cctacttgtg atgggtagtc ccggcttgta ctgacagtga tgccacagca gagtagatca 37980
ctgtgaataa gtaaataagc ctacttatta tattcccaaa gtactcgctg ggatattatt 38040
agtatcacga aaagtgatat gttttataac tcgcttgtct tgccaagatc taaccttttt 38100
tttttaaatg gccaaaaagt cgccagaaca catcttacaa taaacaaaaa tttagattat 38160
atcgtatgta taatgtataa tatattatat tattatatac atacgatata atctaaagcc 38220
attccagact tattcggtga tgaaaaatgc tttcccagct ttatacaaac tattcaaaaa 38280
gttgcatgac ccattttcag atatatttaa tagtataaga ttatgtccat ttgttttcaa 38340
agttattcaa gagtttacat cttgaagttt catcccttta ctactacact gtttttcgtt 38400
tgggtttttt ctctaacggc gaaagaaaca agtcaccaag cttaactagt aggcatcttt 38460
gtggtgacga aattaaagtt gaatatataa attatagtta gtcattatgg aatctcagtt 38520
tgaacgaagc taagctattt ataaaaatca ctgcatggag ataatacttg aattttgatg 38580
atagtgttta tgaagaagtt taatcttgct ttttattaat gttattctct aatatagaaa 38640
tatttcaata aaaaaatcat atgaagggat aataaataca gagaatgatc gttatcattt 38700
gatatgtcga acgctaatct atcatcttat ctaggaaaca aaggtggaaa taaaggaaag 38760
ccctacacga gttaattcct caaacgaact actttggatt atcaaatcca actgctgaca 38820
ctggatacat gcatgtattt agtgggtgtt actgtacttc cttatttcct ttaattcaat 38880
tgtcttgatt tttacttcgg agattctact tgaaaatcat ctcccttcac ttccggttat 38940
acagaaagac ccttcaattc gaatgctggc caggtacaat aactatcagc gattcccctc 39000
cactagacat gaccgactgt aagcacctca acccgatttc aagcaacaca tgatgactag 39060
ctgtttccgc aaaacaacaa ataagagagg tagtggaaaa cacccagttc gctcgagctc 39120
ccctagtaga ttcgacattc actttctatt tgattgctaa ttgtgggtcc ggctatttaa 39180
ggaaagaact gatgaaagtc cacctcacgc aatcaaatcg cggtctagtt ggaagctaca 39240
atggccgacg tatgcgcgcc tctatctttt aggattgtag aacagggcgg caatctgcta 39300
acataaattt aataccttgc tcaagctgct ttccatactt ttcaatccat ttgtgataat 39360
cttgcaatgg accaatctcc aaatctgtag aagcaataac aaggacatcg cagggtcccg 39420
gttcgtttgc atgctcgtct tctggtgcca caacaatgct gcctgttatt atctcatgag 39480
agtctttata ctgcggatcc gtggctatag cgtgaataaa cgttgtgcgc aagcctatat 39540
cctcgcgatg gagatactgg cctgctacag tttgcgttcg tctgcctacg acaacgcatg 39600
gaacattctt tggtgtgcga gtgggccgta gcgttcgacc ctgggcaagg aagccatgca 39660
gacgtgattc cgagaggcca tctcgcgtgt aagacttatc ccaattttct ggatcctcta 39720
atttccagct agccataagc tcagtcaaca gaccaagcgt tcttgatctt ctttctaggt 39780
caaatacatc ttgatggaag cctgcagtaa tttctttgta agatttggaa acgacgttct 39840
tgaaatgaac acaaactgat attgcattca tgggtgcagg tgacagttgc aaatgaactg 39900
aaatgtctgg agaaaagttg aggaagcgtg gtttataaag cggccaagct gtcctcgcat 39960
gcgcaagacc tagtatatta ctaatgactc tgcgaccaca atcctccatg cgttcaaact 40020
tgctatgcgg aattccacga atgatgttac cttgaggatt tggggctctc caaaggagct 40080
gttgcagttg ctgtacgtat tcgcggtgtt cgcggacctg atctcgaagt cgggcatttt 40140
cctctgagca aggccctaca ggtggaaatc tgcacagcat attgtatgtt ctctctagat 40200
gtactgcccg ttgccgcaaa tgagctacat ccatctccag tttatttact gtgtcttcga 40260
gcgcaaacct ttcacagcgc ctgcgtttgc gttcatttct cgaaatctct cgccgccgct 40320
gcctgattcg ttctgcgcga tcaactcggt catcccctgt gtagcttggt gatgacgtgg 40380
atccatcttg tgaggcgtca aagccagaca ctgcctttac ttctaaatct cgccattcat 40440
ctgcaaaatc cctatatcct tccccataag tgtaatcgtc actacctatc aattctgtag 40500
atgccgcatc tacagtccta attatttgag gatttccttg cattgtaaag caaagatact 40560
cggaggctgg atttgtcaca aaaggtacga cagccctatt gatcaaattg aaggaagggg 40620
attgctttta ccagtacacg atgttactgt tgttgctatt gttgttgttc ccaatttctt 40680
cagacgtagc gtgccgcttc tgacattgcc aatagctgct tgtctttggt cttctttggg 40740
gaatgggcca gtaaaagaaa ccctaggcag ttcgattatc tactaatcta aagaacctgt 40800
ggcccctttc ccctcaaccc acgcccttcg ttgctctctt cggtcggtga agcgtttaga 40860
tgcgaggttt cctccactac gtgcttcttc aatgctaaac gcccaagtca actgaggaca 40920
ctgaaagcct gcacggagca gaagacccac acagacggtc gcaggatcaa ccctacctac 40980
gcctcgttgc cacgatggtc gctgccgatc ctcgatctct cgtcgattat tggtctcctg 41040
ttgcgctctt ggccacgcgg ccactcagac tctgcttctg tggcttctca ctgacgtgat 41100
gtagaaagaa atagaaagca cagagccact ttaaaaggaa aaggggaaag cagagaggaa 41160
agggaaaaag aagacctcag attgactcag agattgactc aatcgacgag agaatggaag 41220
ggaatggacg ccacggagac agaggcgcag cgagacggag cgagacggag gtaggcagag 41280
gcagaggcag aggtggaggc gaggggccgg gttgtcggca ctggcagagg gagagagaga 41340
gaaggagagg cggaccagtt tgaaaactct cgccagcttc gatagccgta ctcggtatgt 41400
atgtatgtat gtatgtatgt atgtatgtat gtatgtatgt atgtatgtat gcactcttct 41460
acttgtttcc aatgtgctgt tctatgcttt acagtgtttt ccgcgctcgc tacttgctac 41520
tttcatcagt ctgtctgcct gaggcggcgg tgatgcagaa tgcacctagg tacctatttg 41580
tcgccaactt tggatttgcg tggcggcagg attcctcttc tcctgcactt tgtttcgact 41640
cgccttagaa gggttgttgg aagacgccta aacgggtatt gcccggagat aggtgctgct 41700
ggtagctcat gtagatagtt cgttaggtag ttacactgga acagacagac gctctgtgtt 41760
tcgtggtgtt gcaggtcatg gactcagagg ggctgcgtga gttttgtgtt cgagagcaga 41820
gtgttgatat tcttttatgg gcaggacaca ttgcaacttg aagtaccgtg gttgtaacta 41880
caggacctcc atctgaagcg cggcatcacg tgaaaaagaa atgaaatgaa gagggaaagg 41940
acacccaaag gttcataatg tttggtttgc aaaggttatt cgaaagacac cttcttcgtg 42000
gtagatggtg attctgtcga aactgccgag attttgctga gagtgaacca aagcagggtt 42060
ttgagataga agaatcaatc gtgcatggac aacctattcg taggattgtt atagctgttg 42120
tttgttatag gtcaaacttt atagcttcaa cccctcgctg gcaagtacga agggaaagtg 42180
taaatataca ttcttggttt aacgcataat ctcaagagct tccatgctga aaagttagat 42240
agtatattct tctgatttta catatttaaa ccaagtaaac aagttccacc aagggactta 42300
cttggcaact taaccatggt catcataatt tgcgcatcac ttagatcact acgttaacat 42360
tcgttcttga tctcttcgag cgcctaaata agcaaactgg cagcgaatta ggtcaccata 42420
tttttccaag gaggaaaaac tgtattgtgc tacccgttgt ggtgtaaaac ttgtaattct 42480
tcgcatctct aattcctatc gttaaacttg tcatcttact ttctggaagg aagcttggta 42540
tctcagaaaa tcgaactttg caataatacg aaagcacaag taagggttta tggcagcata 42600
acattgtctt aagaaattga atttaaaagc agaccgaatg caccgcagaa tacattgtaa 42660
attggtgcca aatattatga gtagcaatca tcaatctaac gcacgatttt ttgaagaagt 42720
acaatacaaa tttccccgtc gtagagaatc aaatggtttt acacatctat ttcaacactt 42780
ttcttggatt gtgatttcat atcaagacaa ggcttaaatg atcttggctt tctctgcaag 42840
agcggttctc caaatttcct ctcctgtttc tggattcatg tcaaaacata gtttaacaat 42900
agaaagaagg tgaccaggta ggtacgcaat aatagtttcc gcaatgaatt ggggcttgta 42960
gcgtgcagag aaatgcatga gatatagggc ctggcagttg tccaatgcac ctcgttttgc 43020
aaacctcgcg agctcttcaa tgtggatgtg gccacgctct ctagcaaagg agatatcgcc 43080
atcaaaaaat gtaagctcca tgcaaagtgt ggcagcctga agaaatagag cctcagggat 43140
atccagggcg tctataattg tgtcacctgt atatgcaaat tcaatcgttt ctctgtacac 43200
gaaatcctca ggcataggag gtgacttttg aaagcgcttt cgtttctctt ctggactgag 43260
gttggcaagc tctgacctaa gctcttttcg tttcgtcttc accgcatagc ctacagaggg 43320
aactctatgc atcgtcttgc acaccacaac acttgcatct cctcctagat cc 43372
<210>2
<211>39976
<212>DNA
<213>Ulkenia sp.
<220>
<221>misc_feature
<222>(32086)..(32086)
<223>n stands for any base
<220>
<221>misc_feature
<222>(32086)..(32086)
<223>n stands for any base
<220>
<221>misc_feature
<222>(32084)..(32084)
<223>n stands for any base
<400>2
tcaagaattc gcggccgcaa ttaaccctca ctaaagggat ctgatgaact tggagcaaga 60
ataagaaatc catccattca agtcagcaca cccgatggca tcatcaatct tcgtcaactc 120
tttgtgcagg cagattggtg cttcgggcaa tcaatcggtt gacggattga ttgatcaatc 180
gctttgcttg cttgcttgct tgcttgcttg caattgatcg gcaaaagagg ccatccatcg 240
tagagcgtgc aatcttcaat gctctagcta gaggcgccat caggtagtta gttagctagc 300
tcgttagtta gttgctcttc ctgaaactaa caatgtatga catcagcatc atcgttcttt 360
cttctttatc catccaggat ccttcttttc aattcgtttg ttttgttttg tcttgttttg 420
tctttttctt tcaatgcaag catctcttaa ttcaacaaac caaacgaacc aagagatgaa 480
actcaaaaaa cgttttaaaa taaacaaaca attaaaatca aatagaaaat gaaattgaaa 540
gcacttttgt tttcgcctct ctagagagct agctatagct acctactatt cgttctcgct 600
cttcgtcgtc gggactgctg catcctgtca ttatcgggcc ctaagagtgc cctagtctta 660
gaaattgatg gcgataagat ggcggtcttt cttatccttc ttctcgttgc tgctgctgtg 720
ctctttgcct ctcggatcct tttgtttaca gctggccagt cagtcagaca gtcagttaat 780
cgattaacag gcaagcaagc aagcaagcaa gcacgcaagt cagccagctg gatagacagt 840
tagatagatc gtggcgtcgt cgttggcttc gtcgctgttt tggtgcttga ggattcgaag 900
tgcacgaggt tccttctacc tacagctctt cctttcactc ttcacctatt attatgcgct 960
gcaagttctt ttcgaaaggc tttttcttct ttcattctct ttcttttggc ctttgcgtta 1020
cagagcggag acgcctagtt ttatagatct aaataaacaa gagggaggac aacagaggcg 1080
gaaaacaagc aagttcaaga cggcaagaaa gcagcgcctt tgtttctttg tttcttttgt 1140
ttcttttcaa aagagccctt cctcggaaag ctttctttct ctcttgagcc aacttgaatt 1200
cgaatctgat cttcaaagcg agttagttcc tcaggcgcca ggcacctctc tccctccctc 1260
cctccctcta tcgcaggcag gccagcgtga cacctgtgac agcaggcagc tcaggcgtgc 1320
atgcaacgaa ggcgttgact catgcattgg cgctcactca ctcactcact cactcactca 1380
ctcgcgtacg tacgcacgca cactcacgca ctcacgcact caatcactca atcactcact 1440
cactcactca ctcactcacg ccagcattct cgaggagagg ccatgcgtag gtgaggtacg 1500
aaggaaagga gtccatagtt tggaggcgat gatggcgaat tgcagagcat aacagtgcag 1560
agggagaaac ttacatccat tcatacgtag ggaggcgcat acttacgtaa ctaagtgcaa 1620
tcggtggatc aagaaagaag gaatgaaaga atgaatgaag gaatgaatga aagaaagaaa 1680
gaaagaataa atgaataaat gaatgaatga atgaatgaat gaatgaatag ataaatgaat 1740
gaaagaaaga gccccgctta tttggtatcg atctcattgc aaatgttcct gaaagttgct 1800
tatttgcctc acaactatga gtaggtagtg atgataataa tagtaattgc tattgctatt 1860
acttgaattt gaatttgaat ttgaattcag gtagacaata aaataagatt agcaaaacat 1920
tttgagagga agcagaggat atgcagtgca aaaggaggtc ccgagtttcg atcttctttg 1980
cacctgctac gtatctagtg cacgtagagc aagaaagaat gaaagaaaga acgaaagaaa 2040
gaaagagaga gagagagaga gagagagaga gaaagcgaag atgatagcgg agagaactct 2100
tcttcgcagt cactctgttt ctcagtcagt cccgcaacca ataacaactc gaactcgcag 2160
cagtgttctt cggagtgcca gcgctcgctc gcactgcgtc ggcacagcag cagcagcagc 2220
aggccccgcg ctcgctgcac tcagcccggg caggagcaac agctgctgag cagctgaggc 2280
cagctggctg gcggctcgcc tcgcctcgcc tcgcgtcgcg tcgcgagaga aagcgatcga 2340
ccaactgtca atcgattatt cgagtccttc gagcgcttta tagggcactg attgatcact 2400
cattgattca ttgactcatt tattctttgc gtggtcagcc aaacggcgtt agcattgggc 2460
aaagcgggtc tttgctttgc tctaaaatag atttgctcgc gagagtacgt acttgcagga 2520
gtaggtaggc tctgcctagt acctgggcat ttgaatattt gaacttcgaa cttcgttgag 2580
tatctgaata tttgaatatc tgaatatttg aatttcgaaa gtttgaatat ttgaatattt 2640
gaattttgga atattggaat agctgggttt ggagataaga cttactaagc taagcgccga 2700
cgtaagagcg gcgagtaaat ccacacacaa gagagaggca gagagagagg gagggagaca 2760
actcgcgcag gcaagctgag cccactggac gcacggggcg cgtcccccct gacgggcgct 2820
ctggtggtgg cgtgtttggg agggttttgc atgcttgtga taggggctct ggcgcgggct 2880
ctgtacggtg cttggagatg cacgggcagg gcgagagagg ggacgggttc ccgggaggcg 2940
ctgcttggag gtgctgagag ggagggagaa ggcgtgcttt gcgatgcgcg gggcgaccta 3000
ggcgctgctg cgcggtgcag cagcagggac ctcggacgtg agtcgaagcc gtctgcagag 3060
gagatggtag aagggccgcg gattggtagc agagaagagg aaatagaaga agaagaagaa 3120
atagaagaag aagaaataga agaagaagaa atagaagaag aagaggagga cgggcaggcg 3180
ggaaagatgg agaaaggact cgcggcggga aaacaagaga atgtgaactt gggcttgaac 3240
tttggtttga atttgaatgt ggagaacgag gggttgaatt tgagtttgaa tttgaaagaa 3300
aacttacgga aagaaagttt agttgaaagt gagaaagaaa aaaatgagaa agaaaaagag 3360
aaagaaaaag agaaagaaaa agagaaagaa aaagagaaag aaaaagagaa agaaaaagag 3420
aaagaaaaag agaaagaaaa agagaaagaa aaagagaaag aaaaagagaa agaaaaagag 3480
aaagaaaaag aagaagaaaa agaagaagaa aaagagaaag aaaaagagaa agaaaaagag 3540
aaagaaaaag aagaaggaga tttaaaaagt tgtttagttg aaaaaggaga aggaggaaga 3600
agcagcgaca gcggcagaag aagaagtagt tgttgtaaga ggggaacgga ggcagtagca 3660
gtggagcagg cggaggcgac agcaaacctc gaactcgacc ccgtcgagcc gcagcaagaa 3720
caagagcccg accaggtgga cgaggacgag gtccgcttgt tgtcaggaac aacagaagtt 3780
gcaggactag ccgagagtgc taccactgca attcttagat ccacagacgc aagagcagaa 3840
aacttacaac tgctcgccac aacacaagaa ccaccttcag atacaaccag gttcgagaac 3900
tccacaagtc tagaagcagc aacagctcta gcagataatc aaacaggtcc agaaaaagct 3960
acgactagaa gagaaattat cgagtcgcaa cttgcaacca tggccactcg cgtgaagacc 4020
aacaagaaac catgctggga gatgaccaag gaggagctca ccagcggcaa gaacgtcgtt 4080
ttcgactatg acgagctcct tgagttcgcc gagggtgaca tcagcaaggt cttcggcccc 4140
gaattcagcc agatcgacca gtacaagcgt cgcgttcgtc tccccgcccg cgagtacctc 4200
ctcgtcaccc gcgtcaccct catggacgcc gaggtcaaca actaccgcgt cggtgcccgc 4260
atggtcactg agtacgacct ccccgtcaac ggtgagctct ctgagggtgg tgactgcccc 4320
tgggccgtgc tcgtcgagag tggtcagtgt gatctcatgc tcatctccta catgggtatt 4380
gacttccaga acaagagcga ccgcgtctac cgtctgctca acaccaccct caccttctac 4440
ggtgttgccc aggagggcga gaccctggag tacgacatcc gcgtgaccgg cttcgccaag 4500
cgtctcgacg gtgacatctc catgttcttc ttcgagtacg actgctacgt caacggccgt 4560
ctcctcatcg agatgcgcga cggctgtgcc ggtttcttca ccaacgagga gctcgccgcc 4620
ggcaagggtg tcgtctttac ccgcgctgat ctcctcgccc gcgagaagac caagaagcag 4680
gacatcaccc cgtacgccat tgccccgcgt cttaacaaga ccgttctcaa cgagactgag 4740
atgcagtccc tcgtggacaa gaactggacc aaggttttcg gccccgagaa cggcatggac 4800
cagatcaact acaaactctg cgcccgtaag atgctcatga ttgaccgcgt caccaagatt 4860
gactacaccg gtggccccta cggccttggt cttctcgttg gtgagaagat cctcgagcgc 4920
gaccactggt actttccgtg ccacttcgtc ggagaccagg tcatggctgg atccctcgtg 4980
tctgacggct gcagccagct cctcaagatg tacatgctct ggctcggcct ccaccttaag 5040
accggtccct tcgacttccg ccccgtcaac ggccacccca acaaggtccg ctgccgtggc 5100
cagatctccc cgcacaaggg taagctcgta tacgtcatgg agatcaagga gatgggctac 5160
gacgaggctg gtgacccgta cgccatcgcc gatgtcaaca ttctcgacat tgacttcgag 5220
aagggccaga ctttcgacct tgccaacctc cacgagtacg gcaagggcga cctcaacaag 5280
aagatcgtcg tcgacttcaa gggtattgcc ctcaagctcc agaagcgctc tggccctgcc 5340
gttgtcgctc ccgagaagcc cctcgctctc aacaaggacc tttgcgcccc ggctgttgag 5400
gccatccctg agcacatcct caagggcgat gctcttgccc ctaaccagat gacctggcac 5460
ccgatgtcca agatcgctgg caaccccacg ccctcgttct ctccctcggc ctaccctccc 5520
cgtcccatca ccttcacccc gttccccggc aacaagaacg acaacaacca cgtgcccggc 5580
gagatgccgc tctcgtggta caacatggct gagttcatgg ccggcaaggt cagcctctgc 5640
ctcggccctg agttcgccaa gttcgatgac tccaacacca gccgcagccc tgcatgggac 5700
cttgctcttg tgactcgtgt ggtctccgtt tctgacatgg agtgggtcca gtggaagaac 5760
gtggactgca acccgtccaa gggaaccatg gttggcgagt tcgactgccc catcgacgcc 5820
tggttcttcc agggatcttg taacgacggc cacatgccgt actccatcct catggagatc 5880
gccctccaga cctctggtgt cctcacctct gtgctcaagg ccccgctcac catggagaag 5940
aaggacattc tcttccgcaa ccttgacgcc aacgccgaga tggttcgctc tgatattgac 6000
ctccgcggca agaccatcca caacctcacc aagtgtaccg gctacagcat gctcggagac 6060
atgggtgtcc accgcttcag cttcgagctc tctgttgatg gtgtagtctt ctacaagggt 6120
accacctcct tcggctggtt cgtccctgag gtcttcatct cccagactgg tctcgacaac 6180
ggtcgccgca cccagccctg gcacattgag tccaaggtgc cttccgccca ggtcctcacc 6240
tacgacgtta cccccaacgg tgccggtcgc acccagctct acgccaacgc ccccaagggc 6300
gctcagctca ctcgccgctg gaaccagtgc cagtaccttg acaccatcga ccttgtggtc 6360
gccggtggct ccgccggtct tggctacggt catggccgca agcaggtgaa ccccaaggac 6420
tggttcttct cgtgccactt ctggttcgac tccgtcatgc ccggctcgct cggtgtggag 6480
tctatgttcc agctcgtcga gtccatcgct gtcaagcagg acctcgccgg caagtacggc 6540
atcaccaacc cgaccttcgc tcatgctccg ggcaagatct cctggaagta ccgtggtcag 6600
ctcaccccca cctccaagtt catggactcc gaggcccaca ttgtctccat cgaggcccac 6660
gacggcgtcg tcgacatcgt tgccaatggt aacctctggg ctgatggcct ccgcgtctac 6720
aacgtcagca acatccgtgt gcgcattgtt gctggcgccg cccctgctgc tgctgctgct 6780
gctgctgctg ttgctgctcc ggctgccgcc cctgctccgg ttgctgcatc tggccctgcc 6840
cagaccatca ccctcaagca gctcaaggct gagcttcttg acgttgagaa gcctctctac 6900
atctcctcca gcaacggcca ggtcaagaag cacgccgatg tggctggtgg ccaggccacc 6960
attgtgcagg cttgcagcct cagtgacctc ggtgatgaag gcttcatgaa gacctacggt 7020
gttgtggctc ctctctacac cggtgccatg gccaagggta ttgcctctgc tgaccttgtg 7080
attgccactg gtaagcgcaa gatcctcggt tccttcggtg ctggcggtct ccccatgcac 7140
attgtccgtg ccgctgttga gaagatccag gctgagctcc cgaacggccc cttcgccgtc 7200
aacctcatcc actccccctt cgatagcaac cttgagaagg gcaacgttga cctcttcctc 7260
gagaagggcg ttactgtcgt cgaggcctcc gccttcatga ccttgacccc gcaagtcgtc 7320
cgctaccgtg ctgctggtct ttcccgtaac gctgatggct ccattaacat caagaaccgc 7380
atcatcggta aggtctcccg taccgagctc gctgagatgt tcatccgccc tgccccgcag 7440
aacctcctcg acaagctcat ccagtctggt gagattacca aggagcaggc tgagcttgcc 7500
aagctcgtcc ccgtcgccga cgacatcgcc gtcgaggccg actctggtgg ccacaccgac 7560
aaccgcccca tccacgtcat cctccccctt atcatcaacc tccgcaaccg cctccacaag 7620
gagtgcggct accccgctca cctccgcgtg cgcgttggag ctggtggtgg tgttggatgc 7680
ccccaggccg ctgccgctgc tctcgctatg ggtgctgcct tccttgttac cggcactgtc 7740
aaccaggtcg ccaagcagtc cggcacctgc gacaatgtcc gcaagcagct ctgcatggcc 7800
acctactctg acgtctgcat ggctcccgct gctgacatgt tcgaggaggg cgtcaagctc 7860
caggtcctca agaagggaac catgttcccg tccagggcta acaagctcta cgagctcttc 7920
tgcaagtacg actccttcga gtccatgcct gccacagagc tcgagcgtgt tgagaagcgc 7980
atcttccagt gccctcttgc tgatgtctgg gctgagacct ccgacttcta catcaaccgc 8040
ctccacaacc cggagaagat cacccgtgcc gagcgtgacc ccaagctcaa gatgtctctc 8100
tgcttccgct ggtaccttgg tcttgcctct cgctgggcca acaccggtga ggctggacgc 8160
gtcatggact accaggtctg gtgtggccct gccattggag ccttcaacga cttcatcaag 8220
ggctcctacc ttgacccggc cgtctctggt gagtacccgg acgtcgtgca gatcaacttg 8280
cagatccttc gcggtgcctg ctacctccgc cgtctcaatg tcatccgcaa cgacccgcgt 8340
gtcagcattg aggtcgagga tgctgagttc gtctacgagc ccaccaacgc cctctaagcg 8400
agttatatct gtctagaaaa cttggcatgg ctagcaattt atgtctagct attccataca 8460
cacggtaatg ccagtagcct gttagttata gctcttttgg ttgttgtctc acaatacact 8520
gacatcagca gaacaaaatg aaaggggcct tggctaccat gaaatcaata cttcaaaagg 8580
tctcttggtt tctttactcg catgtcgcta tttacttaca ttcctcgagt acataacata 8640
tcatacatca aagaaattaa aaagaaaaca aacattcaaa tatgcattac tttccctact 8700
gtactagtaa gtacgtttct ggtattaagt tgttttttct caaaagaaca atgtgcttac 8760
ttgtaaaatc cacagctgct tacttgtaag cctcaactag ttagtgatgt gattatcata 8820
aaatgttcga cactgtacct cctttccagc tatcttccta cacctcctct gacgcaggtt 8880
gacggaggag gcgtgggggt tgattgaagt gcaacacaac gttttgttta agatattcct 8940
tgccttggcc gactccaaat ggatagcaca gaagcctaat gataatttga attaatttta 9000
tttcgagctt atttaatgct cttatcagag tccgtaggta tctcttttcc tactaattgt 9060
tgaaaaagga tgttttggac atagcaggtc atcatactat ttggttccat caaattcata 9120
tccatttctt tcgttcaagt gcttcccttc ctacttatta tatatattat atatccataa 9180
atgtaaaaga gacgattacg aatactttgc atacatgtat agcgaaacag agatggtagc 9240
aaaagttcac cttcactaat ctaagaatct ctccacgtgg gtaaaaactt cagcagtaag 9300
attgtaaatg atgtccaaga acaaaacgtc atgctagtcc aggggttact gagctaacga 9360
ttaataatgt ttcgtagtct tcctaattgc accatcaaaa cttgtctgca caagttttaa 9420
agtattggag cctttactga agaatcagag gacatagatg gggcacgttc gccttgaaaa 9480
aaatagtctt ctttacctgc atggtgttac aaacaaaaac gagttgaaaa tagctgtgca 9540
aggaggcaaa catgattgga aaagaaaaac gaggggaccc ttatacagga gggcgccaca 9600
tagtagaatg agtagattgt tagagtaggg tacgctttat gtgattgatt gaatgggcga 9660
gtgaaagttg ctgtcaaggt tctaaacaaa aggatgtttg agtttgtgag tattgtttgc 9720
ggcaaaaaga ttcagtagag agaaatgcac aaaaagataa tacgtgtgta gggcgattat 9780
ggaggcatgc atttggggga aatcatcgca tgcgcatgag tttctccatc tgccgaatct 9840
ttgcaaaggc attttcaagc tccatttgca tagcgtaggc ttgctgctca aactgagcgc 9900
gctgatgcgc cagattttct tcatgtcttt tgttcaaact acgctcaaga ccctcaagag 9960
ccgcaacctt gagcttgcgt tccttttgct gaatctccat aactcttcgt ttcacctgga 10020
gctcaatttc tgcagcatcc gtggtctttg cagcggcctg tgcgtcttgt gcggcctgtg 10080
cgttgtttgc gagctccttt cgcagctcct ccatctccgc gttctttttc tcctccatcc 10140
atttggcacc gagtttggca gcttgatcga tgcggccctt gagaacttct tcgttctcct 10200
caagttctgc gatacgcgcg tgtaagccga ggatctcctc cgagacagcc tcgccattga 10260
tcattatttc acttcccgag tcttgaatga caacatcagc cttggtgcca ggttcaccgg 10320
tatctcgctc gcaaccctgc tggcgcatag acagcataag gcgcgcatta tcctcacgca 10380
gatcatccac ctgttctgat aaaagtttga ctgcctgctc aagattacgg gggttcactt 10440
cgtgaaaaat ttcttgaagg tctcgaagct cagaaagctt ggcagagcaa gtgtgcatcg 10500
ctctgcactt tttaagacgt gcaagtgcat catcaagttt ggcattattt accttcatgg 10560
aggcttcagc tacttcggct tcttcgatta caattttctg cagctctaca acatcatggc 10620
caattaactt gcgatgcagc tcggcaatca ccccatgcat cttttcggta tggcctggac 10680
gcgcctcatc ctgcgttctt cggatctcct cctctagttc tcgatttaga cgaagggctg 10740
gtccaagggg cgggtaatta gcctgagtca agccaagctc tgttgctagt ccaaggcagt 10800
cggaaagtcg cagccggtcc ctatcagaaa cagccttttg caagtctacg ctcaaacgca 10860
cttcttgagc cttgcgcacc atcttcggtt ctgcctgtcg cagaagtttc gagtcgtagc 10920
cagcttgcca cgctagcacg atggcacgcg caagtgacct cagttgaccg ctgttcatgg 10980
cagacttgag caacattttg atttgcacaa atacctcatc tgattcatca tcttcagctt 11040
cctcaagctc tgcaggtgtc ttgcgctctc cagagacttg aagagcaggg ttcaaaccgc 11100
cctccaggac ctcgctcgca agcgcctcct ctgtctcagc tttgcgcaat agcgcagcag 11160
cattctccgc cattgtgttt gtcactcacg agattaatat cgttgccaga gtatacggta 11220
atgcgagtta aggattcaca gaatctctca aattaatctt ttcacctaat gatatccaca 11280
aaacgttgca atcgctcagc ccaacgacaa gcgtgcttct tgttttaaga ctgcaactgc 11340
tcctttttct attagtcaat atggaccgtc ctccaaacgt ccagaaaata gcacagaatt 11400
taccagcagc cgctgcagac aagaagtgca agagagcagg caagcaagtg agggtttgag 11460
caaataggcc aacctctcca cgcagaattc tagggtcgca accggaactc acagtcctta 11520
gaaaccgtgc gaagccctgg gctcaacttc aatttgtcca cgggaccttc agcaagcacc 11580
aagctcagca gcgtgaaggc aggcgctgac cacagtttga gctcagaggg cttggtgtgc 11640
ctcgcgattg atattgaagt caattgcgca ggacggcagc aacggaccag gtggtgaaga 11700
aggtaatctc cagcggagtg atgatggagc tcgaccgact actccggaat cgaccagggg 11760
aggtgcgggc gcccttcaca agcgggcgag aggcagggga gagaaggctc gactccacgt 11820
cttgaagcgt gtacgtgtgc gcgctcacgc gtgcgacacg ccggcaaggg cgccttagtg 11880
gcctgctgct gctgctggtc gccacgctgc gagcccaaga gatttgaatt gaactcgaag 11940
aaaataacta tcatttatca attccaatca atcaatgcat tatgaagcac ctctgaagtg 12000
aactattctc ctctccaata tacaacaaaa aacacacaca gtgggtttta ccctataacc 12060
tattgttccg cgagcgatca actactctat agagcgaatg accagttttt ctttctttct 12120
ttctttcttt ctttctttct ttctttcttt ctttctttct ttctttcttt ctttctgttt 12180
tcctatctaa taaccccttt aatcgaggaa acctttcgat ttaaaaggaa agctctgtct 12240
gtatatatct gttacagata ctgctatcat gccatgcaga aagaaacaca aaagaaaaac 12300
aaaagaaaga gagaaagaga gaaagaaaga gagaaagaaa gaaagaaaga aagaaagaag 12360
agcttttctc aatcggtttc ctcatcgacc gctcacatat ctacgattgt ggcaaagaaa 12420
gaaagaaaga aagaaggaaa gcctcagcag agtccgcacg aaagccttca ttgagccacc 12480
atgtcgtggt ccgctgcagt cagtgccgcc tctctgtgaa ttgagtgagt gagtgagtga 12540
gtgagttggt tggttagtta gttagtgcct cttcagctca aagcctttca cggtcgctct 12600
tcgagcgttt gctttttcat aaacaaataa acaaaccatc gaacgaacca tcgaacgaac 12660
gaacaatggt accccagaat agacggaatt aattgctaag taaaccagta acagtaagtt 12720
agtgtttctg acctgagccg ttttctttat ttattcctct cagctctgtg aagagaattt 12780
gggatgaaaa gaaacgtttt tatttattta aaagtttagt aacaagaaaa acatggtccc 12840
tcttcttcct tcatgtaaaa ataagtaagt aaaaaaaaga aaagaaaaaa aaaaaagctt 12900
ttaaagtagt aaagcgaggt agagataaaa gttctttctc agggctccta gtaggcactt 12960
aggaggtacg tctaagaccg cctcgtggga agaaaagaga aaacaagaag agaaaagaga 13020
gagagaaaca gcgctgaccc gagaggctca tgcgcagagc ccaaatctgc ccaactttgg 13080
caaaatgcag cgccgcctct gcggcggaga cggtcatgtg aatccgcaga gctgcacgca 13140
cgcgtcacag gctacagctg gatatttttt atacgagccc gcgcgagacc gcggcggaga 13200
aacggggtcc cgcgcgaagg gcctctgaaa agcaggcagc gaaccaggcc tgcaccagcg 13260
ccgacctccg cgagacttcc ttcgatctca ggaaggacct tctgaagagt ggctcaaagc 13320
agcgcaggcg gaggcagcgg cggagggcac gcccagcgag ggcatcggct cgaggctcca 13380
gggctgccag gtcgcgaggc atgcacggcc tcggttcgtg atcttggccc tgccgggtgt 13440
gccgggatcc aatatggtgc gcaccgtttt tgaagctgtc gctcttttct cgcgtcgcac 13500
attacgatgc gcagaactga gtgagtggac aaacgaagag ggcgatcgat ggcttggaat 13560
gcgaactccg tccatcgaca tcgacatcga tcaacccatc gacccatcca ctccgtgcac 13620
aagctgcact ccgtgcacaa gctggagacg agcgaccgaa gaggtgacga ttcgctctcg 13680
ctcgggatgc ttggatgatt ggatgattgg gtgcacgagc tgccacttgt tgttcttgtg 13740
ttgttcttgt tgctgttctt cttcttcttg gcggtcgttg agcgaatgcg ctgtttgtcg 13800
agaaccatga aatgagcgtc ttgaatatgg gtggcctcgg gaatccgcag aacgatggta 13860
tcgcattcgc atccctggtt gcaagaaggc ttgcgatgag gtaagcacat gccgactcgc 13920
cgatcgacca gcgcgggcct ctgtgccgaa ggagcgacag cttggacgca ggggaatggg 13980
gcctcgaagt tcttgtggtc actcaggaca gaaactcttg ttttaatttt tctagttgct 14040
tagctcaagt tagttagcca gttggctagt ttgcttttaa ttaaaaatga agaaaactaa 14100
aattgagttc tcaagtctga aagaacaagc aaacaaaagc gaaggatgtg ctgtgcatgc 14160
acgagcttcg gctcaggcag aggaagattg ccagctcgca tgaccttgga tcttccatac 14220
tgcgtaatgc tgagcgtcag agaaagatgc gggccaggtg ccggaagata taccttcatg 14280
gactttccgc agaggtgaag atcagcgatg atcatgtgga agtgacacga cgcacctcga 14340
gcatcccagg aattgcagtg tttgcccagg caggcagtga gtgcctggtc aattatggaa 14400
tagtcaatct agtaatatga gtgagtggaa ggcagaaaat aatttccatt ccttcattcc 14460
atgactagct gcatcaacat catgatgttg cttcagctcg tcagcagggt gaacaacgtg 14520
cgggctagaa gaattagaaa agaacaatga gtgtctatga atgcatgaga atcgagtgta 14580
atgcaataca gaaacgtgag aaattgcagg attgattaga aagtattagt agggcaagaa 14640
cagagagatt agagaagtga aaagggatga cggtgaaacc agtgtagtcg tagtaaagag 14700
tggcttgcaa ataggtgcac cgcatccatc aattggtcaa cgagcaaatt agtgcagcca 14760
gcgtactagc tatttactgc gacgatgtaa cgaagtcctc caaggacgcg tacacggtgg 14820
ccggcaagtc ttcattggcc ttgagcttgt ccaagataat gcggggaaac tggattgcct 14880
ggtcaatcac agccttacgg gcctgctcat cgttgacaag tgtagggtcg cgctcgaggt 14940
ggtcggtgaa gcgagagtgc aaatcaagag ctacagcaat gatgcgctca gccttgagct 15000
catcacggtc ccactcaact tcctgttgca taagcgaacg gatgcgcttt acaaggcagt 15060
ctcggacctt gggcaggtca ttaataccat cgtaataaat actcatgacc ttccacatgt 15120
tgggctcact cttacactta gaacgcattt tgtcaaagag ttcctccaat tgtgcgcgca 15180
aactagacac aagttcaggg ctcatctcct tacgaggggc ctccggcgtg tgcgggtctt 15240
cactagaggt ctgagaatcc gacttacgga cagagaccaa ggcatctaca actacaagga 15300
gactctgtaa gtcaaccatc tcagcaatgg cctcacgagt gccagcgcgc acatccacaa 15360
ggttgctcat accatcaatg gccatacccc actgatgagt gttgacagcc aacataatgt 15420
agttctccca aatacgccag ttggaacgtg actgacgaac ggcctcaata acagccttca 15480
aggcagcagg gtagtcgttg agctgaatca aaatcgaact caagttagcc caagcatcac 15540
cgctatccgg atcctggcga gtcacatgcg caaaggctgt acgggccaag gtccactgtt 15600
caaggcgcat agcgcatgag ccaaggcgga accacgactc cgggtacaac gggttgatct 15660
taagggcatc ctgaagatgg tcaatactct cctgcaaatc accgcggtca aatgccatga 15720
gggccaattc acgcttagca cgcgcatggc gcttgccaga aaactcccac gccttgctga 15780
accagtcctc atcctgaagc aaagagccca aaacgcacat aaggtgtgca gtgggctcaa 15840
ccgcaaggcg ctcacggatc aacttctcag cacgagcgcg cttgtccatg atcacaaggc 15900
agtccacagc ctcttcccaa agacgaacct cctcaaagat ctgcaacgca ctaccagcag 15960
cgcccacttc atagtacaat tgcgcgaggc cgcgcttgag ctcccaaact gcgggccagg 16020
atagcgcgtg cagaaaggcg agacgctcgg tcactggggc agcgttgtcc acgtcacgct 16080
ggcgaggctg tgttggtgtg agccggtcag tctgctggtc aacgaggact tgcatctgta 16140
aaatggcacg ctccttggtc ttgttgcgct caaactcaag ctgagactta attagaagtg 16200
cagtggagta taccatccag ttttcaggac tctggaggac acgctccacg taagcaagca 16260
tctcctctgc agtgagagcc tccatagcgt agctgttctt cacatccata cacaagccga 16320
gcacgataca ttggtccaaa aggctcaacg ttccgcggcg catgcgagcc tcctcatcac 16380
tggtctcctt tgcgtactgg atctcctcgt gaagtggagt ctcagcgtca acttcttcga 16440
ggcgcacctt ggggataccg agaacagaag tggatcctac aatctcgcca ccgttctctt 16500
cctcagttgc atcattatca tcctctgcca caatctcgtt gggtacctcg gtctcgatgg 16560
ccttgactgc aggggaatta gaattcttag gagcttcagt gtcttgctcg cgattttctg 16620
ggtctttggt ggtagctgac gatgcaagaa gaataagctg ggtcttctct tgtttctgaa 16680
acttggtacg ctttcccatt acaccagtca tctgaatatt cagctgtgct gtttcctttg 16740
cctttgcaaa agcacgctta gctccatctg cagccttgaa cttgtgccgt gctacaccgc 16800
attcaaccca cacgagggac tccagaagtt tatcatctgg gtacatgatc tgcactgctc 16860
ggaccgtgcg agcaaatcct ctctctgcct cctgctcgag gctaggagcc gctgcctcct 16920
tggcttcaag ggtctcctgg tgaacaacag cagatcgtgc tgcccaccag ctaggtgtca 16980
gcaaatgccg aagagctccg cggactgtgt tggagatctg ttcatgaggg ttagcgctca 17040
tgccgccgtt ggtctttccg ggaagatcgt cctcgccatc ttcttcatct tcatcagcgc 17100
caatgacagc cccgttctcg tccacaaggc gagcctcagc gagcatatca gttgggtcca 17160
cagcaccagg ggttactgtc ggattagcga cgacgcgaag aatgacgcga gcaacaagca 17220
agaagtgaag gtacttggca tcacggtaaa cctcctcacc gttcgcgcca agcataagag 17280
ctgttcttgc gtggagctcc ggatacgcgt ccaactgtga ctcgtggaag ctggcatcct 17340
tgccagacac agcggcagca gccttagcat cggaggcgag agcggagaga gcagtctcca 17400
catacctgag acccggctca gaggcagact tggagctagt atctacagaa ttcttggggg 17460
cattggggtt gtcgtagacg ccgcgaacaa aagggagggg gtagaacttg tcaataccat 17520
ggctagagac aggaggacct gtccagttgg cctggacgaa gatgtgaaga caagcaacac 17580
ctgcgaacat gcaggccata gctcgcagag cacgttggtt tacgtggtcc tcaggactcg 17640
agacactcgg gtccttgaag gagctgcgac ctgtgcaggg gccaatacca gtctcgatgt 17700
gctcaacaac acgctcatga agaaaacgac cttcacggtt tttaacgcga cttacagagt 17760
atcccttctt aagatcttct gcggcaaaga ggccttgcgc cgcaggagag gcgaggacct 17820
cgaagaaatc gccttgggca agagcgcacg ccatacgaag gacctcaagt tggagctctt 17880
tcacttcagg ctcgtcacgg agctcctcgc ggtcatctgc agcagccgat gcagccacaa 17940
gaacatcttc gagaccctcg gcattgctct cgagagcgag acgctcgaca agtcgcagcg 18000
agtaaagagg tttcgcaact ccagagccct ttttggagtt gaagacacca gcgccgttaa 18060
gatcaccatc gtcagcctcg tcgatctcat catcaggacc gtcagggagc tcaaagtcct 18120
gtggaaggcc taggaactcg ttcaggtctg cgtcagagct cgaatccgac gcgtagtccg 18180
ccatcctggc ctacaggacc gccgaaacag gttgcggcag ccgcccaaag tctaagctgc 18240
aagagtcaac cctcaatcgc gagcttgcgg cacaacgtcg ccgcaggatc tcgcgccaag 18300
acgtctccaa atgcaagtct ggtgctcaag tcatcctggc cacccgcgcc tttgcccctt 18360
aagctaggtc acctacctta aaccagagtt gccccgcggt gtcatattgt aaacatttta 18420
taacaatata cgtcatatta aaaacctaga tgtggggaca atgttataaa taagtaacaa 18480
atatagacta catcgagaag aaagaattct tcggcactcc gtgtgagttt gggcgaaact 18540
gcaatcacga agccatgcaa agtcttcgta tatctgagtg gagcctcgct ggagagaaga 18600
ccccatgtga atgggtgtag aacgacgaat ctacgcagcg ttgtctccgt tgagacgctc 18660
tgtccagata tgaggtccct cactattctc gtatttgatc atgccaagca tctccagttc 18720
caacaatgga gttttctatt gaaagaacat agacatgttt ggaacggttc ctttcagagg 18780
ggaaaaacta atcaaaaatc aattgaggaa tgcagggggg ttatttgctg cagttttagc 18840
aataaaataa aaatcctttg ttgatgtgat ttcattcgtt cctttgacat tcaatcattg 18900
aattgctctt caccggagct tttcaaggtg cccaactgcg atctccgctg cggctgctcg 18960
cggccgggct ctgagctcta tctccgtgtg ggaggcggga agccagcagg tgcggcgacc 19020
ctctccaaat agaggccgcg gcgaccttga ggcactcgcg tggcgggcgg attggcgatt 19080
ctgtgttcaa ccgagatatt tcatacatat tatttgctaa ttattagcaa atagaaataa 19140
atatacagac tttgcaagct cagtagagaa agtgaagatc caaaatgtcg gcctcttcct 19200
cgcaatctac ttcggagcag cgcaagtcac gcgtggcgta cttttacaaa cctgagattg 19260
gcagctacta ctatgggtaa gttagtatgg gaaaattggc gacagaaaaa tataataaaa 19320
aaagcaactg tatcgccacc gtttattcac ggtagttaga aggtatttgc ttcctgcgca 19380
cactcgatct gcaggatgta catgtcttga gtggcattgt ccaacgatcg ttctgtttgg 19440
cggaacattg cttttaaaca aaaacgagat agtgaatata ttctacccaa ctaccaccat 19500
ccggtttaag gagacaaata aatctgtctt tcgacccagg ataaggaggc ttgcatggga 19560
atcttttata atctagtctt tatgtcaaat tttcgcaggt tccagcctac catctctcat 19620
gctatttgtg attgcacaag atgatatgaa agtaaagaaa caaggcaaag gatataagat 19680
gcataaggat gtgcagaaaa ctaactagaa acattcatgt gatgaaacct tcctcttgaa 19740
aactcacctc ggtttgtttt ggatcttggt ttgtctttgc tcactttttt tcattattta 19800
cagcccgtcc catccgatga agcctcaccg cctgaaactg actcacaacc tgcttcttac 19860
atacggactc ttccgacaca tggaagttct gcgcccgcac gacgcgactg cggaagacat 19920
ggagcgtttc cactcgcacg aatatgttga ctttctaaag cgcatttctc ccgacaccga 19980
gcaagagttc gagaagcaaa tgacccgttt caacgttggt ccctattctg attgccctat 20040
ttttgacggc ttatacaatt ttatgtctag ctgctccggc gcatcgttgg atgccgcaat 20100
taagatcaac cacggacagg ccgatgtttg tgtcaactgg tctggtggtc ttcaccacgc 20160
aaagaagggt gaagcttctg gtttttgcta catcaacgat attgttctct gtattgttga 20220
gctcctcaag tatcaccctc gtgtactcta tgtggatatc gacattcacc atggtgacgg 20280
agttgaggaa gcgttttaca caaccaatcg tgtgatgacc tgctcttttc acaagtatgg 20340
tgacttcttt cccggtagtg gtgcctacac agataccggc gctcgcgctg gtaagaacta 20400
cgccgtaaac tttccgctca aggatggtct tgacgatgcc agctttgaga gcatcttcaa 20460
gcctgttctt gatggcatca tgaagcactt tcagcccggt gctgtggtga tgtgctgtgg 20520
tgctgattcc atctctggtg atcgccttgg gtgctggaac atgtcattgc gaggccatgg 20580
ctacgctgta cagtacgtga aatcctttgg cgtacctgtt gtgcttcttg gtggtggagg 20640
ttacaccccg cgtaacgtgg ctcgctgctg ggcttacgaa accggcattg cactcggcaa 20700
gcatgaggat atgcagaatg atattccatg gaacaactac cacaactact ttggccctaa 20760
ccatcttctt cacattactc ctgacccgca gatgaagaac gccaattcac gcacctacat 20820
ggacaagtac accaacatta ttctcgagaa cctttcgaag cttgaagcgg tgcccagtgt 20880
acagttccaa gatcgcccta acgactttgc aaacccagat gagcgtgctc gtattgctct 20940
tgacaacgct gaccctgatg aaaaggatta cattcaacgt cctcagcacg aggccgaata 21000
ttacgaagac gagaaacacc aagactcgga ccgtcccaat ccggctgatg gtggtgccga 21060
ctcaaaggta aagtctgaaa aatcctcagg cgatggagct gcggacgaag cggagaccgg 21120
atccagaaag ccttacaaaa agggcactga atgcggtggt ctacttgaaa ttgacgaggc 21180
tgtcatggaa gtggactcca atgaagcgcc caaggagact gctcctgctt cagattctgc 21240
tatcaagact gaggatgctc ctgctgctga gtctgctgcc tccccctcgg atgccaaggc 21300
ctaaacatga agactttgtt ttaatgcaat agacgtgctc ttttgctgct cgagtagcgg 21360
caaccctagt gccatgtcct ccttttttct tactcacttc tctctctacc tttgaaagag 21420
accaagtgga accaagcagc catttctgtg ttccacattg caatagatta tcttttaaca 21480
attctcatac atacatattt tcttcatttt tcttttctat gtatttttaa aataaaatat 21540
aacaacaaag tagtagtttg tatgaatttc ggccatgcag gtgacaaaag gtgaaagtaa 21600
tgagcgtcat tttggatcac attaccagcg aatccactca acgactcttc tcttctcgag 21660
ctttagaagc tgactgtgag ataatagaac agagcacggt ccatcaatca aaatacataa 21720
ttagctcgca atagcttcgc ctcacagtga tcgtttcacc tcatgatacc cttgttgggc 21780
gctcgctctt aggctctccc ttgttgttat atgatgcaac gatcatctaa gtgctgtccg 21840
cagtcatcaa gacatcctat tctgtagcaa gcaagcaagc aagcaagcta gctagtttag 21900
ctggctagct agtttagctg gctgagttcg cagtgaataa acaattaaca cctcaagtct 21960
tgaaggagca ggaaacttgg ctcctatgat atgccatcct ggaaggccat gttttggggg 22020
gtatgagaga caggtctttc cttttctact ctggttcggt ggatgacgag acaacaacca 22080
gacgtcccgc ctagtacctg ggtggtcgat ctgtcctccg ttcactccga gtgcagggct 22140
tgtgggacga ctcgctctgt tgaattgagg tccttcacgc gagcctatct gggcatcgat 22200
cgacctcatc catcaacaca cacacatatg ttcaatccgc gccaccctcg ctgactccca 22260
gactgcccag cgaaactttg aaaacttccc catctcgaaa cagcactccc aaaagacgca 22320
cacaagcaac gcttgagcct aggcaggctc tccgctggac gcacaaacca cctcgcagcc 22380
atccactctc tgactcccca agcatgcatg gccttctccc tcgatttggc gcttcgcgtt 22440
gctgtcttcg aagtcctcaa acacgaactt ttcactaatc atcctcgacc tcagcaggat 22500
gccccccctc ctaagctctg tttgctatgt atttattaga ggaaggacgg caagctgggg 22560
gtctgcggaa cgcattttgg gggtttgaaa attttcgaat tttcaaactc cccgaaacgg 22620
ccatggtttc ttccgagaag cggtagttag gtggggaaat gagagcacgg cggagttggc 22680
gagaagcata aatctgggcg ggcaagcaaa ccccaaacta tcctgcaatc aacaaaacac 22740
acgcactccg caatcaactt gcaccgtaag tctttggaat tgattatggt atctgcttcg 22800
ccgtcttcaa ctttaacttt gcgcctcgca acgagacttt gttttgtaat gtgcctttag 22860
atttgacgaa acatctttaa gcgagatagt acagcagcgc gttggtacca agagagatag 22920
atcctgggac cttttgaaat aaataaactg tgtgatgaac ggtcgactaa ctgggcttgt 22980
aattgatata ttgatgatac tcttggtcca catgggagtg agcacagtcc acaaacaact 23040
tgctaaccca cacaaaaacc tcccaaactt gcagacccgt tctgcattct tgtaaacaca 23100
taatcacaca gcacacataa tcacaatgac ctacggcaca gcacacaact acgtgcagga 23160
gcagattgag ttggacgaat gcttcaacaa ctttggcgaa gaagtgagca gctctgttga 23220
gcctcggtgg cagcgcaagg ccttggccgc tcgcactccc aagtctagcc gcaagcgtag 23280
ccgcaccggc aagaccccga gcaagggcaa gtctacgccc cagcacgacc gattcatccc 23340
caaccgtggc gccatggacc tcgctaacgc tcacttcaac ctcatgaagg agaacagcag 23400
ctccgcctct aaccagtgcg agtcccctac tcgtgctgaa ttcaacaagg ctttggcgtc 23460
cagcatgggt gcgggtgagt cccgtgtttt ggccttcaag aagaaggctc cggcaccgcc 23520
tgagggatat gaaaactccc tcaaggtttt gtacacgcag aacaaggaga agatggcgcg 23580
cactcagaag cccgttcgtc acattccttc ggcaccggag cgtatcctcg acgcacccga 23640
cctcttggac gactactacc tcaaccttgt cgactggggc gcctccaaca tgctcgccgt 23700
ggcccttggc cagacggtgt acttgtggaa cgccgagacc ggcggcattg aggagctctg 23760
ccagtgtgat gccgaggatg actacatcac ctcggttaag tttgttcagg agggcggtgg 23820
ctacttggct gtgggcacga acttcagcga gaccaagctc tttgatgtgg agacctgcaa 23880
gcttctccgc aacatggacg gtcacagctc tcgcgtgtcc tcgctctcgt ggaaccagca 23940
catcctttcc agtggcagcc gcgactcgac tattgtgcac cacgacgttc gcgtggccag 24000
ccacaaggtc ggtgttcttg agggtcacgt gcaggaggtc tgtgggcttt catggtcccc 24060
ggatggccag accttggcct ccggaggcaa cgacaacctg ctgtgcctct gggacgctcg 24120
ttactctggc gacggtcgct cccagcagac cgtgcagacc ccgcgtctta agatcgctga 24180
ccacctcgct gctgtgaagg ctcttgcctg gtgcccgcac cagcgcaatg tccttgccag 24240
cggaggtggt actgccgatc gcacgattaa gatctggaac gctgccaatg gcgcctgcct 24300
caacagcgtc gacactggat cccaggtgtg ctccctcctc tggaacccac acgagaagga 24360
gcttctgtct tctcacggct tcagtgagaa ccagctcagt ctctggaagt tcccttccat 24420
ggctcgtgtc aaggatcttc gcagccactc cgctcgcgtt ctccacttgg cgatgtctcc 24480
ggacggaacc actgtctgct ccgctgctgc tgacgagacc cttcgattct ggaaggtctt 24540
cgaggcagct aacccggtca agcgcaacaa gcgcgccgct ggagctgcca ctgcctctca 24600
cggtggcctc gcccgcatga gcatccggta agtttccccc cttcccttgt ccggttaatt 24660
cactttcgac tactgtctta cacagaagca aagcatggtt atgcaagcaa acttgctggc 24720
atgctctctt ttgtctcttc agtagcgaga ggccgtggtc aaggggctca tgcgggagct 24780
ccaatgtaat ctaccaccac ccggcctctc atgtatacat atatatatat ctatttatat 24840
gctgatcatg atgcaaaaaa atcccacgcc gtcatactaa agcgcgtcag tgtttacaat 24900
actgttggcg tatagttcgg tagtgaaaat taaaatcctt cagggtttgt acctatagct 24960
tttggtgatg aatgtgatct actactactg acgtgacaga agcaacaatt cttgtgaatc 25020
tgacttcttt tttgtgtatt ctatttcgca tgactgcctg attgtatgat atgggtctga 25080
tttggtcgac tgtactctat tttgcatgcc atgtaacttt ttgttcgatt atactatgaa 25140
tctgtggcaa cttttgctga gaagaaggga tggcagacag tttgattttc ttgatcaatg 25200
tgtttcgctg tcccgctgtg ttgaaagaat gcagtaaatg acccgagtat cggactggag 25260
tgcgtatgtt tcacgctgcc ttatgaatcc ccaggggttc gcagcagcac tttccctcgt 25320
ctgtctctgt gtttgctgtt tgttcgctcg taaatgtgtt ttgcctgtat catatgcatg 25380
taggatagaa agttattacg cagtgtgtat tatagattta tggaagatca ggtggactcg 25440
tatatgctga ctggtgggta tgcttcacgg gatactcgca ttaagttcaa attcgaggca 25500
atggttgctg ctgaagtcgc tgacgaagga gagctcattg ttcttgtcgc caatttgtaa 25560
gtaggtggca cctgattcct ctttcctctg ggaagagatg cagcgctctt gggatcagtt 25620
tctctctcaa tcacgcttgc cgagcagttt ttagtagcaa gcaataggtc tttaatgact 25680
tctagaacta gatgagcagg tatttgcatc atgcaaggct ggcatgtttg gtggctttgc 25740
aatttctctg tcttgaactt agctggatag atagcgagag agtgaagttg gtacaaacat 25800
aaccgacagc atgtagccgc tgccttcgct cgcagctcta gcgctcgcct gcagagacgg 25860
aagagtgtat aattgcccag tgtcaacttt tgggtggtgg gtctgactca caatcaatgg 25920
taccgttcag gtatctttcg gtagattatg acactggcca cttttctgaa gtgatttgag 25980
atttggtatc gatgatgaag agtgagagaa ttttgaaaga aatacctcat taacttccaa 26040
tagtcagtat cttgatgaaa aacgctgacc tgaaagctgc gcgtgttttg ttgacacggt 26100
ccttttattt tgttttttga tgatctattg gtacttatac ctgcgatttt tcttttgcaa 26160
gctaaggcac attcgacttt gtctagaagg aaagtgatca tcacgcttcg gcacacatct 26220
gttttcctca gttaagtttt cttcttggtt caggtatggt attacatgca ggaagaaagg 26280
ggatgcgggg acagccgtat agatgccacc aactttaaca tggtttgtgt tttggggaaa 26340
caaggaaaga gagcatacgc tatgagctac ttaaactagt gacacaagaa gcaacttatc 26400
ataccggaga tcacaatgga gtgattaggt tctatcagat agtagaagca gagtatgcga 26460
cctgcggtgg ctacgtacat gggtgaaaat aatagaacac ctcgcgtagc gtcgaaaacc 26520
gcctcgtaga ctctgtgtca ggtatgaacc acccactttt tttgtcctct ttatctccac 26580
actatttcct tcatggagac aaactcattc tcgaaagaca aacaatcaaa tcaatccatt 26640
accctcatgt tctcatgatg ggtatgttat acatatatgt ctcagacata tgtttatcct 26700
ttttaaaaca catacttaat aggcacttag cactgttact gctatagaaa actcatccat 26760
tcaagaggag ggagagaaca gagttggcaa aatcttggaa gggcaaagtt tatagcaagt 26820
aagtagtagc acagagagag tattatgtat gtgttcatct agcaaaatct aaatagaaga 26880
gccgatcgac tcagtcagtt gtaattagga ctagtcgtta atcatgacat ggctcataaa 26940
caactagtca gtttcttgat ttacttggca ctcaggaaca aagtatgttg ccatccctgg 27000
gcaatagatt tgatcccgtg cgttgagata aagcttgcca aggtcgggtc atgtaactgc 27060
agaggcactg ggcgtagatt ccagtcccag acataaggaa cagcaagatc ctcaccaacc 27120
acgcaaatgc cctcagttcc aattgtaact tcaagctgag gagtcttgtg ctcggcggaa 27180
agctcgaaag gggtaaaaac aggtacaggg tcaaggactg tgcgagctgt ggccttgtat 27240
ttgttggtgg acttccaaaa tccctcctcc atgaatggtt caatctgctt ggtcacagcc 27300
tcggagcttg aagtttcctt gtcggacatg agaccccact ggtaaagctt gcagccgtgg 27360
ccctgagaat ctttaactaa agcgacataa ctctgcgggc ctgcccaaat gtcaagcacc 27420
gggcccgcct cagggccgaa ctcgacctct cttggtgaaa cctggtcctc gtagttgctt 27480
ttggcctcgt ccaactggcg agattgcatc ttgccccaca caaagacacg accatccttg 27540
agcaaggctg cgctgctgtt catgccagca gcaaccttga tggccggtcc aggtagatct 27600
ctgacctctt gcatgacgaa gaagtcgtca ataccgcgca gaccgattcc gagttgaccg 27660
cgctgtccct tgccccacgc gaagactttg ccgctcactt tcgtagccac aactccgtgt 27720
ctgaatccca acgcaacgct ggccacggca tcatcatctt caggaagacc aattgtagtc 27780
cttgggtccc agaagtacga gtctgtggtt cccgtcgcgc actgtccata gacattctcg 27840
ccaaatacaa aaagcgtgtc cgtttccttc gtaatgaaag ctgtcacacc ggcaccacac 27900
acaacttccc gaataggttc tgtcgagtaa ccctcaaact ttgtctcaag accccttttc 27960
cgtgagtcct gctcaatatc gtcctcacca aggtccttgt acaccttgta gctcaatacc 28020
tcctttggct caatcgcatc cacagatgtc tccacaccca tcatccgcat gacatactgc 28080
atcaccatgc ttgatttcgc atagcgaccc agacgcacag taagccgggt atcgtgggtg 28140
cgaccaaaga gataaacgcg accttgggcg tcaagaacgg cgctgtggcc aaagcccgct 28200
gcaagtttta caggctgggc ctgcttggtg tcgaggtcgc cgtggatctg tgtagggctg 28260
tcagcgttat cgagactacc tgtaccgagg gcaccgttga taccaatgcc tcgagcccat 28320
acgccgcgaa gggcggttcc ggcagaagag ctaagcatcc gcttggcacc tgttagggag 28380
cccagcgccg tcatggtggt ggtctgtatg tcaatgtatc tgtagaaagg cagccagcta 28440
actaaccagc tgtactgtga accacagaag aggcttttgc aaaagatgct cgagagcaaa 28500
atggatgatc ggtggagatg cggagaagcg cacagcacga tccgagtccg aacttgattg 28560
aactcaagtt cggagtttgc aatttttcta caactaggta taccttcgta gtatcacgta 28620
gtaggtggta gtactagtag tcctttgaat tgcggcaggg aatttacgac agcaactctg 28680
gtaaattaat ttaggacgcc tcttttgtac taaagtcctt ctctttagaa cggaaagaac 28740
atatgatatt gagacatcat gaggacatgg gaaagggttg tgcatctttg gaactgtatt 28800
gcccagtatg gctggacttc accttggact tattcataga atgaccacag ctattcctgg 28860
ggtagatgga ggtctgacaa tgctcgagct aaccctgccc atccatgatc aagacgcacc 28920
caagcactat ggccgcaagt ttcagttcat ggagagcaga gctgctcaaa tttagcttct 28980
gcggtcgatt ggtcttggca caaccgctct taagagtcat ctacgacagg ctaccatcca 29040
ctcaagataa aaatggactc acagatagat agatagatag atagatagat agatagatgg 29100
caggcgacca atcgcagcgc actctcgctc tcaagatatg cccgcccatc gaaacacggc 29160
cttctcatgc ggcctgtttc gtctcaagct cgagcaggcg tcggcccatg ctccagcgca 29220
acgggcccgc aactttcagt ttcgagcttg gtcttgcttt tgagtttgct tttgcttttg 29280
agtttgagtt tgagtttgag tttgagttca aaattcaaat tcttcaaatt caaattcttc 29340
gaattcaaac tcaaattgga gaatccatct tttcaaaaac tcaattcacg ctctcgaaga 29400
agttcaaact ccgcagtcgc atccagctga ggcacgcact ccccatcgca tcgccggcgc 29460
tctctcctcg ctcctgccgc gtctaagcgt gctcgcgtct ctgtcctgct gctgcttgct 29520
tgccagtatc tccacttctc gcgagcagaa ggaggacgag cagaagaaga aggaaggatc 29580
aagaatcatc aagaaggaac actctctttg tttctgtggt tcgtcattag tttgttgtag 29640
cttgaaggag aaggagaaga cggagaagat ggagaagaag ggaatgaaca gcagtggcgt 29700
ttatctgtct ctagctagct aggtacctta cctaccaggt agagttagga ggagaggata 29760
gccgagacta aggaagcaag ccgtagtttt attttactat gtctgttgtt ctttctctcg 29820
actaccttct ctcgctaccc ccgtgggaag gaggtctctt gtgtcgagtc tgatccacgt 29880
ggacgcctcg aggatcttcc ctcgcacccc gggcccggtc gctgccggtg caaaacctcc 29940
tcagtggcct tgctcgcgct gtgtgctttc gttcctgcgt ctggaacgtc agatagcaga 30000
taaagagata taagatagtt agttgacgga agcagtcaaa gcaaacctcg aacggattga 30060
agcgaagcga ggacgctctc gcctctttgc tgactgctcc gcctattgct gctctggccc 30120
tcactctgag atattactat gtctgaacct gccgcagccg caccgccggc cgagcccaaa 30180
tcgtcgtggg cggatgaagt cgataatgac acggagggag acgctgtggc cgctctgagc 30240
gaacatgcgg ctaagttgga cctcgacgtc cacggagctc cagacctgca cagcggtgct 30300
cttgtagtac gcgaggccgg gtgccccgtg gacgagccca agacgcaggc agtgacaagt 30360
ttctcagccc ttgcgattga tgacgacctc aagaagtcta tcgcgaacgt caagggctgg 30420
agcactatgt ctaagatcca gcaaattgga cttccgcttg tgatcagcga ccctccacga 30480
aaccttatcg ggcaggctca agccggcacg ggtaagaccg gtacctttgt catctctatg 30540
cttgcaagga tctctgcaga taagaagccc agcacgcctc aggccattat cttggctgta 30600
actcaggagc tgtgcacgca gattgcacag gaggtcaacg cactgggatc cgacaagggc 30660
attaaagcac gcagagttat gtctgctagg tccaaaaatg gacccctcgc ggaagggagc 30720
gcggcggcgc cgtgggcact tagtgaaggt gaagactttg atgagcaggt cgttgtggga 30780
acacctggaa tggtcaagaa ctacctcaaa aatgccatgg gacgcaagaa gcgcaagccc 30840
atgatcgatc cgtctgagtg ccgcgttctt attcttgatg aagctgacaa gatggtgcag 30900
cagccacctc acggatttgg acaggacgtt caggagattc gcgacattat tctcaagaag 30960
cgcaaggaca agccgtgcca aattttgctc ttttcggcca ccttcaccga aaatgtacga 31020
cagattgccc gccagttcgt tggtggacat gacatggacg agtccaagta ccacgagatc 31080
acgctgcgca aggaggatgt cactctcgac aaagtcgtca acttcgttgt ctatattgga 31140
gacgagaatg agcgcaacga agaggaaatc tataagaaga agtttgaggc cattaatgag 31200
atctgggaga acctctctca gctcagcgag gggcagtccg ttatcttttg caatcgtaaa 31260
gatcgtgtac aacgcctcgc ggattatctt cgcgggctaa acttcccggt cggtcagatc 31320
catggtgaca tggataaggc cgagcgtgac attgtgctca gtgagttcaa gcgcggtgag 31380
cgcaaggctc tcgtttctac tgatgtcacc tcgcgcggta ttgacaaccc caatgtgact 31440
ttagttatta atgtcgacct tcctgttaac cgcgagcagg aagctgaccc ggagaatttt 31500
gtgcacagga taggccgctc gggacgttgg actaagaagg gtgcttctgt ttctcttgtg 31560
gctcgcagcc ctgccttccg tgaccttggc ctcatgaagg acattgagcg tgcactcttc 31620
gctaatgcag aggtaaaccg tccgcttatc cccgtcgatg atctctccaa ccttgagagc 31680
aagatcattt ctgctcttga agcatacaac taagtgccta cctaccttaa tcagccctta 31740
tcacttgcat tgcgagcccg ggtttccgca gcgcttgccc tgtgttgcta gagactgggc 31800
aagctggctc gcctgtctct ttctcgcatt caacaatgca ttcaccgttt ctcctagctg 31860
cacccgccct ctctcttgcg cccacgacaa gaaaaataca gttcatatca gcatcccccc 31920
caaaacaacc ataacaatta cgtaaatgaa ggccgtttat tctaccgtgc atcatgagca 31980
ctgcaccttt tctctcctcc atcgcgcctt ataccgataa acaaaaaata gataacacct 32040
ttttgtagag caaccaccac cattgtttcc cttccctccc tccncnctcc ctcccaaaat 32100
aacttgcttt gtttgtacgg cgttccttct atctactttt tctttaatct tcaatcatgt 32160
ctgacggttc ctttacttat tatgcgttgt tttattcggt cacaaggagg tacagccttg 32220
atggtcctgc gatagatgcc gtactttatt gtcatatgtt tataactttt aaaaaattaa 32280
ttttttagta cttatattca aaattcaaaa ttcaaaatat aaaattcaaa attcaaaaat 32340
tcaaaaattc gaaattcgaa attcgaaatt caatttagat tgtaatctga ttatctttga 32400
atccgtcacc ttctttttat tattttttaa aataatttat ttttaatgtt tttagttaag 32460
ctaattttgt aaaaacaatt atattgttat aataacctta tcacctgaat aataagatag 32520
aaaacgaaga tgcatcctta cctcagcata agaccaaaca gactaaaacg aaacatcttg 32580
gattgcattt tgtctcgact atatcccatc tcaagagagc aataaaagtt attactgagc 32640
cttttcaagt cagaaatgtg tagtcgtgtt caaatttgaa ctttagtttt cgctaaataa 32700
catataagat ctgaattttg caacgactgt gacacaacac tttggttctc aagagaacac 32760
aagttcttgg ttggccagtg cttgttattc cgtatagtat tttgggataa tggacaagga 32820
tccaaaccaa gcacaattga gaagcataat tgcaacacca aacctgaaaa gtaactattt 32880
tgaagacatt accttgtggt gcagtttgat cgatacgaga gcaacgaacg gagcattgag 32940
gttaagcgag gggagtcaaa gaaagttatg ggacaggcac tcaactccac gatgaatgcc 33000
atgcatgtat ccaaggctgg ctgctcctct gggtggatgg gtgtcggggc acatgattat 33060
gtagaggaca aagatgtccc ttctcttgag ccttctgagc atagccaggc accttttcgt 33120
tgttcttgcg tacaatctcg ggttgtaggc cccaaaagtc acgttgaaaa ggtaatgggc 33180
tcacgatgtt gtcaaagccc tcgatgtagc gcgggcaaag gcacgcttgc agaactcgac 33240
gaggtcatgg acaacaaagt ccgaatttct ctagacgttg gcgaagacgt cgatgtcggc 33300
catgaagtcg gcaaagaaga taagacgagg ggcaaggcga ctcttcatga tggaggtgtt 33360
agagacaaac tcagaatcgc tgatggtgtt attggctcta attatgttga tctcaaggcc 33420
aaagaaagtc atcaagcacc aggctcggtt atccaatcgc agcggcactc tcgagccgag 33480
aagtaccgcc gagcagacgc tgttcgcgaa gctcttcgcg atttgtcttc tttctcgcca 33540
agtacttcaa tgaatacttt tcctgactct tcgagccgaa caacacctgc atgctcccct 33600
gaatcagaaa ctagccttga tgaggagaag gagaatatag ggctggtaaa taacgttcta 33660
cttgaggaag aacacgttag tcgcccacga tcaatgacgt ttgatgcttc actttcgatg 33720
acggagctgg aaacccaaaa cgaagtggag cacgctgtgt tgacttcgtc tgtcatgtat 33780
gcagccgaga aaactctaag ttttattaag gagaattccg gagaattggg caaacatatc 33840
ggaaccgaag gcggaagtaa tatcaaagac attgttgaag aacatgcaaa tcaaaaatcg 33900
caagaaagtg ataatgaaat gtttatgagg ttgcttgaag atctgcctac tcaggcccaa 33960
caagtagttt ccgaaagttt gggaacacct actaccaaac atcattactt ttccagcgcc 34020
aacacgagca gtggagcatc gcgaagcttg cagtcaggtc gatcaagcac cccaaactgt 34080
gtcacggtat ctccatgcac agagctgggc tctcctcgtt gcgggcttga ctctgtactt 34140
ggtaaccaaa ttgatgaaaa acatggtgaa gggcttgacg atcaccatag gatcccgcag 34200
tttgatctct tacaacatga gcttttacaa gatagcaact ctattacagc acacagagat 34260
ggtgaaacga cttcgtcccc agttgcctgg gctggagatc ttcaagatga tcttacgcgc 34320
tctctgttga cagaagttga acatcctttc atctgtcgag aaacaaatat accaccggtc 34380
cattcaaaag ggaacgaggg tttgagaaca tgcaatggtt cgtcgcatag atctagtctg 34440
ggagcaattt tgcacgagat tctcgaaacc aagggagact ttcgtaaaaa cggtgaactg 34500
atcaccgacc tcgacatctt cctaggcgat aaattgccaa aaggcaaaac attttggtcg 34560
ctcttgacaa gtagcgagct aggtgagctt ggtgaaagag ttgaactcga aataatgagc 34620
cgccccctcg cgcaccagcc ttaccgagaa tcactctggt gtgttgcatt tcagacaatc 34680
cagctcactc cctatcgcca aagattggcg ctcagctgtc gcgatagact tttgcctcac 34740
gagcgggctt taagcgggtt ctccattgct caactaggtc gtgcgtgttt tgtacttcgg 34800
caaaggctcg tagactgctt ccaccacaac ggcaggataa agttcaaatg ttacaggcga 34860
acatgcaagt tgctggaagc aaggatgtgg caatgagcct caaaacatag gcttggcaca 34920
gggtgttgaa gcgcctttct gagacccatg aaactcctag tttgtttgct ttgcatcgct 34980
ctgtatcaat cgtgccgcat gcaaatgcaa taagctaaca ctcaaatcat ggtacagtct 35040
tttaatttgg accgagtcta gggcacccga ggcatttcga tgcaaacatc tttctcatca 35100
aagacttatt taggcgagtt aggcattgga gctcaccttc cctggcaggt cgcctttacg 35160
tggtaagtta tataagtcaa gaggaaaacc cgagcgacgc tggtctctat aagattgaca 35220
gatccctgga ggtgataaag gttgtatcgt acaacttgtt ctacgagaat caaatcttgt 35280
acgctccaag ccagcagctt gaaattggca gatgagttgt atctgcgtca ggagttatca 35340
gagagcttac tggactatca aatggtagac atgttgacac tgcgcacctg aaaagctctg 35400
ccaagcacct ccgctcccca gaaagcctgg tttacatgaa gtgtgatgta gtctgcagtt 35460
caagatctaa tctcatcaga gagcgcttag tacccattgg tgatctgtca cattttgagg 35520
ctacgcacag tttggatgac gctcttcgcg ctgtatgcaa cacatccgac gaacgagatg 35580
aacctacttc caaagactcg tgtgctggtt ggcgcgcggc ctagacctgg tcggggcact 35640
ggcgcatgct atgagattgc tggacgcgaa aaatgtggcg aagctgtgta cgcagtgaac 35700
tggggtgcca aatcaatgat tctaagagtg tttgccccaa agtatggctt aaaatgtttc 35760
aaactaccca agggttcccc gacatgaggc cacatgtggg aagtgtattt gccccccatt 35820
tgagaagttg ggacagagcg cttcgtcagg gatgatcatg aagcatgttc tatgaacttg 35880
caccacttgt ttagaacgga agtgtggctg gaatgaaacc tatatgtcag catatctgcg 35940
ggtaatcccc aactacataa tatttgctgg tatgcttgct ttaagcagca atcaagtttc 36000
tagcaacagg gtaataacca ggtcaccggt caatcgcaca atggcctttt tagttcggaa 36060
aatttgacaa cctgtggatg tttggggagt ccatggataa atgtggagct gtttggtgta 36120
acagaacatt gcaaagggtg acgccttaga tccttttctc atgacaggct tcgatcacaa 36180
agttgtacac tttcaaggtt gtaggtgcgt attgaacttg gcatttctgg aacaaacaga 36240
cactatatct cgaatctggg tctgcctgcc cctctagctc aggccctgat agtttgacta 36300
gagcatcgcc gtctcgtgta ttctctccga atctttctgc acattgagtt agacttctcg 36360
tcgtgtttgg agcatgtgta aatacatcag cgatattttt ttactcctaa aaatggcaaa 36420
ttcgcattta cctactgcaa ataatgaatc aaaatgagga aacaatgtgc tatatgaacc 36480
gtgctctttg gaacacaaat aaaaaataaa taaagtcaaa gatcgtgcca aatccgccca 36540
acttgagaga aaggcttggc tggtgacctg ccctgttgtg gcatcatcct atcttggctg 36600
ccgccctcca aagagaaatg tgagcctcgg aagagcgggc taggctggta accaatgaga 36660
gctatgtaaa tagcaaagga agagagaata aatctttggg aataaacctg tcagcaaggc 36720
tccaaagctt gctttctggg caaggcttac atgttgcttg atatgatttc acagaagcat 36780
ttggacacgc caaactctgc tactttgact gtgcctaggt ggtaaaccaa gcaactgcta 36840
tctttgacgc caccatgcag gtttccatca aaatagagat agaggagaag ttaccatatt 36900
tgaatccacc aattcttcaa gtgtgtggag acgctcgagt aatgagcata cttgaggaag 36960
atgctcatgg accttccgtg tgtttttctc ccgaggtatt acacgatatt ttcgtatttg 37020
caatgttgca gagtcttgat atcgtgtgac agtggaaaca aatgctacag ttgattcctt 37080
gatccccttc atcgcaaaga gcttgttatt ctctataata agagctagtt accggcaccg 37140
tagtcgcttt tgctcagcaa gtggcccttt tccagcatga gataagacct cctaattttg 37200
gctcgttttc tgattacaaa tgaaggtcct tgccaactac accatggtca cagctttctc 37260
tgccgagctc agggatgcaa ctgtcggctt agacaccaag tcagcgtcgg ttgcaagtgc 37320
tgcttctgag agctgactgc tgtagtgtgt gggtttgctc cacctatgag tgggtatgag 37380
taggtctgct ccacctatga ggaccaccaa gtttgctctc catgtgctac agcgcctgcg 37440
tctcttgtgc ggtgagacat attttttgag cttggtcttt acgaaatgaa ggcctgcgac 37500
agacaacgat cgcaacaatt ctgcctcgaa ggcgcttatc cctacgtaga cgtaggtctc 37560
tgttcccact aaagccactc ctgcgtcaat agaacaaaag caaaagctct tatggctgct 37620
gtacaaatag agtaaaactt cacctttcta ctcgtaacac tacagttata agtagcaagt 37680
caatcagagc aagacctttg cgagtaaacc tgcattgctc tatcgcagtc ttccagcatc 37740
ttcgcgaggc ggtctcgcac aacttcagtc agtctgtaat aacaggagct ttagcaccag 37800
ccaaagcagt tgcgttgcaa ccagcagaag acttggcatc atgctcattc ccgctgtgga 37860
cgtggccgtg ggcggtgctg tggcgtcctc tgagaagttt gatctcctca aacgcctgag 37920
ttggtgcggc ccccttcgca tcatcccttc acttgactct gtctccgcac caagtgtggg 37980
tgcccctgag gagaaggact tctggaaatc tgctgttcgc aagtggggca aagctttgtg 38040
ttcgtaccct tgccaagttg gtcccatcgc cgctacaagc gttgaggaag tgacgcaatg 38100
gctcaacgaa ggcgctgtcc aagtcattgt tgagggttct ttcgacgacc tcgaggacat 38160
tgcttcgcag cttcctcgtg aacgtcttgt tgccagattt tccgagaagg tccttgaaga 38220
cgacggtctc ctgagcaaac tttctggcag cgttgggggc gtttcaatta tttctgaggc 38280
caaaaattct gaagaagtcg tcaaggtcgc agagagggca tggcagcttt tgggaaaacg 38340
ccttgctatc gcattagagg tccccgagat cgaggccgga ggcgaggcgc agaagattaa 38400
caaccagctt gttggtaagc tccatggact ccactccaca gactttcctg tgaacgttgt 38460
gtctgagaac gtttccatgc caacagaagg gtctcttgcg acagatactg actcagaagc 38520
tgccttttgc gtggcaaggt cttttgtagc gtgccttcgc accgaccgta cagatggtct 38580
ctttgcgacg gtcgtcaccg atgagaatgg cgtggcactt ggcctcgtgt actccagcga 38640
acagtctgtg gttgcctcgt tggcgtgtgg ccgcggcgtg tactggtcaa gatccaggca 38700
gagtctgtgg cgcaagggcg acacaagcgg tgcctttcag gagcttgtgt ctatcgcatt 38760
tgactgtgat gccgacgcga tgaggttcaa ggtgcgccag cgtggaaacc ctcctgcatt 38820
ttgccatcaa cagacccgca catgctgggg ttatgacggt ggcatccccc acctctttcg 38880
cactcttgag tcccgcaagc ttaacgcccc agaaggatca tacacaaaac gtctttttga 38940
ggacaaggca ttgctgcgta acaagctcat tgaggaggca caagaggtaa ttgaggctat 39000
tgaggagaat gacccagagc atgttgcccg cgaggtcgca gacctcgcat acttcctctt 39060
tgccgcgtgc acgtgcggaa atgcgtcgct cgaggacgtt acacggcagc ttgacatgcg 39120
ttccctcaag gtcaagcgga ggccaggcaa tgcaaaggca gatcgcatcg ctgctggtga 39180
ggcagttctc caggctcagc agcagaaaaa gtctgcagag gagcccccag cagctcccaa 39240
ggaccaggcc taaattgcat gcttattatt acacccaaat cctgcttatt gtgacttgtc 39300
tgcacccttt tcacattgaa gaagcgtgtt ttcttacccg tcacaccacc actaagtctc 39360
atcctttctt tcttaccttt ttactagtcc gaacgatata aactttatct ttgcaaggct 39420
cttgttatac tgcaattgtt atttagtttg ttttctattg ataggcaaac cagacgtaat 39480
cgtctgagag tgtttgaaga ggataaaaca aagaatcatt aacaggtttt gtgtttctgt 39540
acacttgaat agttttatgc ctatctactt ctagagcctg ggcggagttg gcatttgtat 39600
aatctcaaca ttcgataaca aattgcttca aatgaagaac aaaaacagga aatgatttga 39660
attaaaatct aatatttgta gaaaagaaaa agcgagctga catcattcca tcaaattgac 39720
caattgactc cttagcacag tagatatttc ctaaacgact tcaactcatt cctcattatc 39780
ctcgctgttc ctgcttccgt gagtaccctt gctgattcgt acttccaaat cgccgccatc 39840
ctcccggtca tcatcatctt cgtcatcttc gtcttcatca tcagcccctg acgaggagta 39900
aatgtcaagg taaggtttgg gattctcgag ctttcgcaat tctccaatac ttattggttg 39960
gccacagacc ggatcc 39976
<210>3
<211>8994
<212>DNA
<213>Ulkenia sp.
<400>3
atggctcaac gtgagaaccg tctcgaggcc aacatggata cccgcatcgc tgtgatcggc 60
atgtccgcca tcctcccctg cggtaccacc gttcgtgagt cttgggaggc tatccgcgat 120
ggtatcgact gcctcagtga tctccccgag gaccgcgtcg atgtgaccgc ctacttcgac 180
ccggtcaaga ccaccaagga taagatctac tgcaaacgtg gtggattcat ccctgagtac 240
gacttcgacg cccgtgagtt cggcctcaac atgtttcaga tggaggactc cgacgcaaac 300
caaaccgtca ccctcctcaa ggtcaaggag gccctcgagg acgctggcat cgaagccctc 360
agcaaggaaa agaagaacat tggatgtgtt ctcggtatcg gtggtggcca gaagtccagc 420
cacgagttct actcccgctt aaactatgtt gtcgttgaga aggtccttcg caagatgggc 480
atgcctgagg aggatgttca agctgctgtt gagaagtaca aggccaactt ccctgagtgg 540
cgccttgact ccttccccgg tttcctcggc aacgttactg ccggtcgctg taccaacacc 600
ttcaacctcg atggtatgaa ctgtgtcgtc gatgctgcct gtgctagttc tctcatcgcc 660
gttaaggttg ccattgatga gcttctccac ggagactgtg acatgatgat cactggtgct 720
acctgcacgg ataactccat cggtatgtac atggccttct ccaagacccc ggtgttctct 780
accgacccta gcgtccgcgc atacgatgag aagaccaagg gtatgcttat tggcgaaggc 840
tctgccatgc ttgtgcttaa acgttacgcc gacgctgttc gtgatggtga cgagattcac 900
gctgtcattc gcggctgcgc ctcttcctct gacggtaagg cctccggtat ttacaccccg 960
accatctctg gtcaagagga ggctcttcgc cgtgcctaca tgcgcgctaa cgtcgatccc 1020
gccaccgtca ctcttgttga gggccacggt accggtaccc ccgttggtga ccgtattgag 1080
ctcaccgctc tccgtaacct cttcgacagt gcctacggca acgagaagga gaaggtcgct 1140
gttggcagca ttaagtccaa catcggtcac ctcaaggctg tcgccggtct tgccggtatg 1200
atcaaggtca tcatggccct caagcataag actcttccgg ccaccatcaa cgttgatgag 1260
ccccctaagc tttacgacaa cactcccatc accgactcat cgctgtacat taacacgatg 1320
aaccgtccgt ggttccctgc tccgggtgtg ccccgtcgcg ctggtatctc cagtttcggt 1380
tttggtggtg ccaactacca cgccgttctt gaggaagccg agcccgagca ccagaaggct 1440
taccgtctca acaaacgccc ccagccggtg cttctgatgg catcttcaac ccaggctctt 1500
gcttccctct gtgaagccca gcttaaggaa ttcgagaagg ctatcgagga gaacaagacc 1560
gtcaagaaca ctgcttacat caagtgcgtc gacttctgtg agaagttcaa gttccctgga 1620
tctatcccga gctctaacgc tcgcctcggt tttcttgtca aggaggccga tgatgccacc 1680
gagaccctcc gtgccatcgt tgcccagttc caaaagtcag ctggcaagga ttcttggcac 1740
cttccccgcc agggtgtgag ctttcgtgct cagggcatca acaccactgg tggtgtcgct 1800
gccctcttct ctggccaggg tgctcagtac acccacatgt tcagcgaggt cgccatgaac 1860
tggcctcagt tccgtgagag catctctgac atggatcgtg cccaggctaa ggttgctggc 1920
gctgacaagg actacgagcg tgtctcccaa gtcctctacc cgcgtaagcc ttataactct 1980
gagcccgagc aggaccacaa gaagatctcc ctgacctcat actctcagcc ctctaccctc 2040
gcctgcgctc ttggtgccta cgagatcttc aagcaggctg gtttcaagcc cgacttcgct 2100
gccggtcact ctctcggtga gtttgcggcc ctctacgctg ctgactgcgt caaccgtgac 2160
gacctctttg agctcgtgtg ccgtcgtgcc cgcatcatgg gtggcaagga tgcacctgct 2220
acccccaagg gatgcatggc tgctgtcatt ggacccaatg ccgagaagat ccagattcgc 2280
actgctgatg tctggctcgg caactgcaac tccccttcgc agactgtcat caccggctct 2340
gttgagggta tcaagaagga gtccgagctt ctccagagtg agggcttccg tgttgtcccc 2400
ctcgcctgcg agagtgcctt ccactcaccg cagatgcaaa acgcctcctc tgccttcaag 2460
gatgttctct ccaaggttgc cttccgtcag cctagcgccc agaccaagct cttcagcaac 2520
gtgtctggcg agacctactc caacaatgcc caggacctcc ttaaggagca catgaccagc 2580
agtgttaagt tcatctctca ggttcgcaac atgcactctg ctggtgctcg catctttgtc 2640
gagtttggcc ccaagcaggt gctctctaag cttgtttccg agaccctcaa ggacgatcct 2700
tccattatca ctatctctgt caacccttcc tctggcaagg atgccgatat tcagcttcgc 2760
gaggctgctg tgcagctcgt tgttgctgga gtcaaccttc agggcttcga caagtgggac 2820
gcacctgacg ccacccgcct tcagccgatt aagaagaaga agactactct tcgtctctcg 2880
gctgccactt acgtgtctga caagaccaag aaggctcgcg aggctgccat gaacgacggc 2940
cgcatgctca gctgtgtcag caaggtcatc gccccccctg acgccaagcc cattgtggac 3000
accaaggctc aggaggaggt tgctcgtctc cagaagcagc ttcaggatgc ccaggcccag 3060
atccagaagg ccaaggccga tgctgctgag gctgacaaga agcttgccgc tgctaaggat 3120
gaggccaagc gtgccgccgc ttctgcacct gtgcagaagc aggttgacac caccattgtt 3180
gataagcacc gtgctatcct caagtctatg cttgctgagc ttgactgcta ctccactcct 3240
ggtgctgtgt ccagctcttt ccaggcacct gttgctgcta cccctgctcc ggtcgctgcg 3300
cctgttgcag ctgctcctgc tccggctgtc aacaatgctc tccttgccaa ggctgagtct 3360
gttgtcatgg aggttcttgc cgccaagact ggttacgaga ctgacatgat cgagcccgac 3420
atggagctcg agactgagct cggcattgac tctatcaagc gtgtcgagat tctctctgag 3480
gtccaggccc agctcaacgt cgaggccaag gatgttgatg ctcttagccg cacccgcacc 3540
gtcggtgagg ttgtcaacgc catgaaggct gagatcgctg gcagctctgg tgctgccgct 3600
gctgccccgg ccccggttgc tgctgctccc gctgcccctg cccctgctgt caacagcgct 3660
cttcttgcca aggctgagac tgttgtcatg gaggttcttg ccgccaagac tggttacgag 3720
actgacatga ttgagcccga catggagctc gagactgagc tcggcattga ctccatcaag 3780
cgtgtcgaga ttctctctga ggttcaggcc cagctcaacg ttgaggccaa ggatgttgat 3840
gctcttagcc gcacccgcac cgttggtgag gttgtcaacg ccatgaaggc tgagatcgct 3900
ggcagctctg gtgctgccgc tgctgccccg gcccctgttg ctgctgctcc ggcgcccgtc 3960
gctgccgctg cccctgctgt cagcagcgct ctccttgaga aggctgagtc tgttgtcatg 4020
gaggttcttg ccgccaagac tggttacgag actgacatga ttgaggccga catggagctc 4080
gagactgagc tcggcattga ctccatcaag cgtgtcgaga ttctctctga ggtccaggcc 4140
cagctcaacg tcgaggccaa ggatgtcgat gctcttagcc gcacccgcac cgttggtgag 4200
gttgtcaacg ccatgaaggc tgagatcgct ggcagctctg gtgctgctgc cccggccccg 4260
gtcgctgcgg cccctgctcc ggtcgctgcc gctgcccctg ctgtcaacag cgctcttctt 4320
gagaaggctg agactgttgt catggaggtt cttgccgcca agactggtta cgagactgac 4380
atgatcgagc ccgacatgga gctcgagact gagctcggca ttgactctat caagcgtgtc 4440
gagattctct ctgaggtcca ggcccagctc aacgttgagg ccaaggatgt tgatgctctt 4500
agccgcaccc gcaccgttgg tgaggttgtc aacgccatga aggctgagat cgctggcagc 4560
tctggtgctg ccgctgctgc cccggccccg gttgctgctg ctcccgctcc cgtcgctgcc 4620
cctgctgtca gcagcgctct ccttgagaag gctgagtctg tcgtcatgga ggttcttgcc 4680
gccaagactg gttacgagac tgacatgatt gaggccgaca tggagctcga gactgagctc 4740
ggcattgact ccatcaagcg tgtcgagatt ctctctgagg tccaggccca gctcaacgtt 4800
gaggccaagg atgtcgatgc tcttagccgc acccgcaccg ttggtgaggt tgtcaacgcc 4860
atgaaggctg agatcgctgg cagctctggt gctgccgctg ctgccccggc ccctgttgct 4920
gcctctcccg ctcccgtcgc tgccgctgcc cctgctgtca gcagcgctct ccttgagaag 4980
gccgaatctg ttgtcatgga ggttctcgcc gccaagactg gttacgagac tgacatgatt 5040
gaggctgaca tggagctcga gactgagctc ggcattgact ctatcaagcg tgtcgagatt 5100
ctctctgagg tccaggctat gcttaacgtt gaggccaagg atgttgatgc tcttagccgc 5160
acccgcaccg ttggtgaggt tgtcaacgcc atgaaggctg agatcgctgg cagctctggt 5220
gccgccgctg ctgccccggc cccggttgct gctgctccgg cgcccgtcac tgccgctgcc 5280
cctgctgtca gcagcgctct ccttgagaag gccgaatctg ttgtcatgga ggttctcgcc 5340
gccaagactg gttacgagac tgacatgatt gaggccgaca tggagctcga gactgagctt 5400
ggcattgact ccatcaagcg tgtcgagatt ctctctgagg tccaggctat gcttaacgtc 5460
gaggccaagg atgttgatgc tcttagccgc acccgcaccg ttggtgaggt tgtcaacgcc 5520
atgaaggctg agattgctag cagctctggt gctgctgccc ctgctccggc tgctgccgtt 5580
gcaccggccc ctgctgctgc ccctgctgtc agcagcgctc tccttgagaa ggccgaatct 5640
gttgtcatgg aggttctcgc cgccaagact ggttacgaga ctgacatgat tgaggccgac 5700
atggagctcg agactgagct cggcattgac tctatcaagc gtgtcgagat tctctctgag 5760
gtccaggcta tgcttaacgt tgaggccaag gatgttgatg ctcttagccg cacccgcacc 5820
gttggtgagg ttgtcaacgc catgaaggct gagattgcta gcagctctgg tgctgctgcc 5880
cctgctcctg ctgctgccgc tgcaccggcc cctgctgctg cccctgctgt cagcagcgct 5940
cttcttgaga aggctgagtc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 6000
actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag 6060
cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat 6120
gctcttagcc gcacccgcac cgttggtgag gttgtcaacg ccatgaaggc tgagattgct 6180
agcagctctg gtgctgctgc ccctgctcct gctgctgccg ctgcaccggc ccctgctgct 6240
gcccctgctg tcagcagcgc tcttcttgag aaggctgagt ctgttgtcat ggaggttctc 6300
gccgccaaga ctggttacga gactgacatg attgaggccg acatggagct cgagactgag 6360
cttggcattg actccatcaa gcgtgtcgag attctctctg aggtccaggc tatgcttaac 6420
gttgaggcca aggatgttga tgctcttagc cgcacccgca ccgttggtga ggttgtcaac 6480
gccatgaagg ctgagatcgc tggcagctct ggtgctgcta ctgcctctgc ccctgctgct 6540
gcagctgccg cccctgctat caagatctcc actgttcacg gtgctgactg cgatgacctc 6600
tctgtgatgt ctgctgagct tgtcgacatt cgtcgcgctg atgagctcct tcttgagcgc 6660
cctgagaacc gcccggtcct tattgtcgat gatggtaccg agctcacctc tgctctggtt 6720
cgtgttcttg gtgctggtgc tgtagttctt acctttgacg gtcttcagtt ggctcagcgt 6780
gctggtgctg ctgttcgcca tgtccaggtg aaggacctct ccgctgagag tgccgagaag 6840
gctatcaagg aggctgagca acgcttcggc cagcttggag gcttcatctc tcagcaggct 6900
gagcgctttg cccctgctga cattcttggt ttcaccctca tgtgcgctaa gtttgccaag 6960
gcttccctct gcacccctgt gcagggtggc cgtgccttct tcattggtgt ggcccgtctt 7020
gacggtcgcc ttggtttcac ctcccaggga tctactgact ccctcacacg tgcccagcgt 7080
ggtgctatct tcggcctctg caagaccatt ggccttgagt ggtctgctaa cgaagtgttc 7140
gcccgcggta ttgatattgc tcgtgaggtc caccctgaag atgctgccgt cgccatcact 7200
cgcgaaatgt cctgcgctga caaccgtatc cgcgaggtcg gcattggcct caaccagaag 7260
cgctgcacca tccgtgctgt ggacctcaag ccgggtgccc ccaagatcca gatcagccag 7320
gatgacgttc tccttgtgtc tggtggtgct cgtggtatta ctcctctctg catccgtgag 7380
atcacccgtc aggtccgcgg tggtaagtac attctcctcg gtcgctccaa ggtccctgct 7440
ggtgagcctg cttggtgcaa cggtgtttct gatgacgatc ttggcaaggc tgctatgcag 7500
gagctgaagc gtgctttctc cgccggtgag ggccccaagc ccaccccgat gacccacaag 7560
aagctcgttg gcactattgc tggtgcccgt gaggttcgtt cctcaattgc taacattgag 7620
gctctcggtg gcaaggcaat ctactcctct tgtgatgtga actctgctgc tgatgtcgcc 7680
aaggctgttc gcgaggctga ggctcagctt ggcgcccgtg taactggtgt cgtccacgct 7740
tctggtgtcc ttcgtgaccg cctcattgag cagaagcgcc ccgatgagtt tgatgctgtc 7800
ttcggcacca aggtgactgg tctcgagaac ctctttggtg ccattgacat ggccaacctt 7860
aagcacctcg tcctcttcag ctctcttgct ggtttccacg gcaacattgg tcagtctgac 7920
tacgccatgg ctaacgaggc cctcaacaag atgggtcttg agctctctga ccgtgtgtcc 7980
gtgaagtcta tttgcttcgg cccctgggat ggtggcatgg ttacccccca gctcaagaag 8040
cagttccagt ctatgggtgt tcagatcatc ccccgtgagg gtggtgccga tactgtggct 8100
cgcattgtcc tcggctcctc ccctgctgag atccttgttg gcaactggac cactcccacc 8160
aagaaggttg gcagtgagcc cgttgtgatc caccgcaaga tcagcgctgc atccaaccct 8220
tttcttaagg accacgtcat ccagggtcgc tgtgtgctcc ccatgaccat tgctgtgggc 8280
tgccttgctg agacctgcct gggtcagttc cctggatact ccctctgggc tattgaggat 8340
gctcaactct tcaagggtgt caccgttgac ggtgatgtca actgtgagat cactctcaag 8400
ccttcccagg gtactgccgg ccgcgttatg attcaggcca ccctgaagac cttcgctagc 8460
ggcaagcttg ttccggctta ccgtgccgtg atcgttctct ccactcaggg aaagccccct 8520
gctgctacta cttcccagac cccctctctc caggctgatc ctgctgcccg tggcaaccct 8580
tacgacggca agaccctctt ccacggccct gccttccagg gtcttaagga gatcatctct 8640
tgcaacaagt ctcagcttgt cgccgagtgc accttcattc cgtcttccga gagcgctggt 8700
gagttcgctt ctgactacga gtcccacaac cctttcgtca acgacattgc tttccaggcc 8760
atgctcgtct ggattcgccg caccctcggc caggctgccc tccccaactc tatccagcgc 8820
attgtgcagc accgtgctct tccccaggac aagcccttct acttgaccct caagagcaac 8880
agcgcgagtg gccactctca gcacaagacc tccgttcagt ttcacaacga gcagggtgac 8940
ctcttcgtgg acatccaggc ttccgtcacc tcttctgact cccttgcctt ctaa 8994
<210>4
<211>6093
<212>DNA
<213>Ulkenia sp.
<400>4
atggcctctc gcaagaatgt gagcgctgct cacgaaatgc acgacgagaa gcgcattgcc 60
gtggtgggca tggccgtgca atacgcgggc tgcaaagaca aggaagagtt ctggaaagta 120
gtcatgggcg gtgaggctgc atggactaag attagcgata aacgcctcgg atccaacaag 180
cgagccgagc acttcaaagc agagcgtagc aaatttgcag ataccttttg caacgagaac 240
tacggctgcg tcgatgactc cgtcgataac gaacacgagc ttctccttaa gctctccaag 300
aaggctctct ccgagacatc ggtctccgac tctacaaggt gcggtattgt gagcggatgc 360
ctgtcctttc ccatggacaa cctccagggc gaactcctca atgtgtacca aaaccacgtc 420
gaaaagaaac tcggcgctcg cgtcttcaag gatgcctcca agtggtccga gcgtgagcag 480
tcgcagaacc ccgaggctgg tgaccgccgc atctttatgg acccggcatc cttcgtagca 540
gaagagctca acctcggtcc tcttcactac tctgtcgatg ctgcctgtgc caccgccctt 600
tacgtccttc gcctcgccca ggaccacctc gtttccggtg ctgctgatgt catgctcgct 660
ggtgcaactt gcttcccgga gccctttttc attctctccg gattctccac tttccaggcc 720
atgcctgtat cgggagacgg catctcgtac ccgcttcaca aggacagtca gggtctcacc 780
cctggtgaag gtggtgccat tatggttctc aagcgccttg acgacgctat tcgcgatgga 840
gaccacattt acggtactct gctcggtgct accatcagca atgctggctg tggtcttccc 900
ctcaagccgc acttgcccag cgagaagtcc tgcctcattg atacctacaa gcgcgtcaac 960
gtgcacccgc acaagatcca gtacgtcgag tgccacgcaa cgggtactcc ccagggagac 1020
cgcgttgaga ttgatgccgt caaggcttgc ttcgagggca aggtgcctcg ctttggaagc 1080
tccaagggta actttggcca cacactcgtt gcagctggtt tcgcaggcat gtgcaaggta 1140
ctccttgcca tgaagcatgg tgtgatcccg cccactcctg gtgtcgatgg atcttcccaa 1200
atggacccgc ttgtggtctc tgagcccatc ccatggcccg acactgaggg cgagcccaag 1260
cgcgctggtc tctccgcttt cggctttggt ggcaccaacg cccacgcagt ctttgaggag 1320
tttgaccgct ccaaggctgc ctgtgccacc cacgatagca tcagttccct cagctcacgt 1380
tgtggcgggg agggcaacat gcgcattgct attaccggta tggatgccac cttcggctcc 1440
ctcaagggcc tggacgcctt tgagcgtgcc atctacaatg gccaacatgg tgctgtgcca 1500
ttgcctgaga agcgctggcg tttccttggt aaagacaagg actttttgga cctgtgcggt 1560
gtcaaggagg tgccccacgg atgctacatt gaggacgtcg aggtggactt tagccgcctg 1620
cgcacgccca tgacgccaga cgacatgttg cgccccatgc agctacttgc tgtcacaacc 1680
atcgaccgtg ccattctcaa ctctggcctc aagaagggag gtaaggtcgc tgtcttcgtc 1740
ggccttggca ctgaccttga gctctaccgt caccgcgccc gcgttgccct caaggagcgt 1800
gctcgtcccg aagccgcttc agccctcaat gatatgatgt cctacatcaa cgattgcggt 1860
accgctacct cgtacacatc ctacatcggc aacctcgtgg ccacccgcgt gtcttcacaa 1920
tggggtttcg agggtccttc tttcaccatc acagagggca acaactccgt ctaccgttgc 1980
gcagagttgg gcaagtactt gctcgagact ggcgaggtcg aggccgtagt gatcgccggt 2040
gtggatcttt gcgccagcgc tgagaatctc tacgtgaagt cgcgtcgttt caaggtctcg 2100
gagcaggaga gcccgcgggc cagcttcgac tccggcgctg acggctactt tgttggtgag 2160
ggatgtggtg ccctcgtcct caagcgcgag agcgactgca ccaaggacga acgcatttac 2220
gcctgcatgg acgctatcgt gcccggcaac atgccggcag cctgcatgga ggaggctctc 2280
gcccaggctc gcgtcaaccc caaggacgtt gagatgctcg agctctccgc tgactctgcc 2340
cgccacctca agaacccctc cgttctgcct aaggaactca ctgctgagga ggaaatccgc 2400
ggcattgagg ccattctcag ccagcgctct agcaacgaag ctgtggagcc ccacaacgtc 2460
gctgtcagca gcgtcaagtc cactgtcggt gacaccggct acgcctcagg agctgccagt 2520
ctcatcaaga cggctctctg tctgtacaac cgctacttgc cctcaaacgg cgcctcctgg 2580
gaggagcctg cacctgagac acagtggggc aagtctctgt acgcgtgcca gtcctcgcgg 2640
gcctggttga agaaccctgg agctcgccgc cacgcagctg tctcaggtgt ttccgagacc 2700
cgttcatgct acacggtgct gctctctgat gtggagggcc accacgagac caagagccgc 2760
atttcgctcg atgacgatgc cgtcaaactc ctcgtaatcc gcggagactc ccatgacgct 2820
atcacgcagc gtgttgacaa gctccgcgag cgcctcgccc agcctagcgc taatgtacgt 2880
cttgctttta tggagttgct cggcgagagc attgcccagg agaccaagac cccgttgccg 2940
gccttcgctc tgtgcctggt gacctctcct agtaagctcc agaaggagct tgaactcgcc 3000
tccaagggca tcccgcggag tcttaagatg ggccgcgact ggacatcacc ctcgggcagc 3060
cactttgcac ccaagccact gtcaagcgat cgcgttgcgt ttatgtacgg cgaaggccga 3120
agcccttact atggtatcgg ccttgacatt caccgcatct ggcccgaact tcacgagttt 3180
gtaaacgcca agaccaacaa gctttgggat caaggcgaca gatggttgat cccgcgcgcc 3240
tcgacgaagg aggagcttaa ggcgcaggaa gatgagttca accgcaacca ggtggagatg 3300
ttccgactcg gtattctcat gtccatgtgc ttcacccaca tcgctcgcga cgtgcttggc 3360
atccagccca aggctgcttt cggactgagc cttggagaga tttccatggt ttttgccttt 3420
tctgagaaga acggccttgt ctctgaggag ctgacaacta aactccgcaa ctcggaggtc 3480
tggcgtaagg ccctcgctgt tgagtttgac gccctccgca aggcctggaa tattccccaa 3540
gatacccctg tcagcgagtt ctggcaagga tacgtggtac gtggaacccg cgaggccgtt 3600
gaagcggcca tcggccccaa caataagtac gtgcacttga ccattgtcaa cgatgccaac 3660
agtgctctca tcagtggcaa gcctgaagat tgcaaggctg ccattgctcg cctgagcagc 3720
aacctccctg ctttgcccgt ggaccttggt atgtgtggcc actgccccgt ggtcgagccg 3780
tacggcaagc agatcgctga gatccatagc gtcctcgaga ttcccgaggt tgccggcctt 3840
gacctgtaca cgagcgtcaa ccagaagaag cttgttaaca agtccactgg agccagcgac 3900
gagtacgcac ccagctttgg tgaatacgca gcacagctgt acactgttca ggcagacttt 3960
cctaagatcg ccaagaccgt tagcgacaag aactttgacg tctttgttga gactggtccc 4020
aacgcccacc gtagcgccgc aattcgcgcc acccttggaa atagcaagcc ttttgtcacc 4080
ggatccatgg accgccagaa cgagaatgct tggacaacca tggtcaagct ggttgcctct 4140
ctccaagccc accgcgtgcc tggcgtgaag gtctcccctc tgtaccaccc cgagactgtt 4200
gaggaggcta cgcagagtta caacgatatg gtggctggca agaagcctac taagaacaag 4260
ttcttgcgta agattgtggt caatggtcgc tatgacccca aaaagcagct cgtgccgccc 4320
caggtgctag ctaagcttcc tcctgcggac cccaagatcg aggctcttat ccaggctcgc 4380
aagatgcagc ctattgcccc caagttcatg gagcgtctcg acattcagga gcaagacgcc 4440
acacgcgacc ctattctcaa caaggataac aaaccttccg ctgctcctgc ccttgcccct 4500
gctgctccgg cccgcagcgt ctccggagct gttgtggctt cctctgaggc tctccgtgcc 4560
aaacttttgg agctcaacag cactttgatg cttggtgtca acgccaacgg tgatctcgtt 4620
gaagcaagcc caagtgaagc atctattgtt gtgcccaagt gcgatatcaa ggatcttggc 4680
agccgtgcct tcatggagac atatggtgta tccgccccca tgtacaccgg cgccatggca 4740
aagggcattg catccgctga gatggttatc gctgccggaa agcgcggcat ccttggttct 4800
ctcggtgctg gtggtcttcc tatcgccacc gtacgcaagg ctctcgaagc tatccaggct 4860
gaactgccca agggccctta cgctgtcaac ctcatccact ctcccttcga cagcaacctc 4920
gagaagggta acgtcgacct cttcctcgag aagggcgtca ctgtcgttga agcctccgcc 4980
tttatgacct tgaccccgca gctcgtgcgc taccgtgctg caggtctctc tcgcgctgct 5040
gatggctcca cggttattaa gaaccgcgtc atcggtaagg tttctcgcac agagcttgcc 5100
gcaatgttta tccgtcccgc gcccgagaat ctcctcgaga agctgctgaa gtccggcgag 5160
atcacccaag agcaggctgc tctcgcacgc acagtgcctg tggcagacga cattgccgtt 5220
gaggcggact ccggtggcca caccgataac cgccccatcc acgtcatcct ccctctcatt 5280
gtcaacctcc gtgatcgtct gcacaaggag tgcggctacc ctgcccacct tcgcgttcgc 5340
gttggtgctg gtggtggcat tggatgccct caggccgcca ttgccacctt caacatgggc 5400
gcggccttca tcgtcactgg taccgtaaac cagatgagta agcaagctgg aacctgtgac 5460
accgttcgca agcagctctc acaagccacc tactccgaca tctgcatggc cccagcagct 5520
gacatgtttg aggaaggtgt caagctccag gtgctcaaga agggaactat gttcccctcg 5580
cgtgccaaca agctctatga gctcttcgtc aagtatgact cctttgagtc catggctcct 5640
ggagagctgg aacgtgtgga gaagcgcatt ttcaagaagt ctctgtcaga agtttgggaa 5700
gagaccaagg acttctacat caacaggttg cagaacccgg agaagattga gcgcgcggag 5760
cgtgacccca agcttaagat gtccttgtgc ttccgctggt accttggttt ggcgagcttc 5820
tgggcaaacg ctggcatccc ggaccgtgcc atggactacc aggtttggtg tggcccagcg 5880
attggatctt tcaacgactt catcaagggt acctaccttg accccgccgt tgccaacgag 5940
taccccgatg ttgtgcaaat caacttgcag atcctccgtg gtgcctgctt cttgcgccgc 6000
ctcgaagctg tccgtaatgc cccgctgaag gctaacgcca agcaggttgc tgccgagatt 6060
gatgacatct acgtgcccac tgagcgcctg taa 6093
<210>5
<211>4398
<212>DNA
<213>Ulkenia sp.
<400>5
atggccactc gcgtgaagac caacaagaaa ccatgctggg agatgaccaa ggaggagctc 60
accagcggca agaacgtcgt tttcgactat gacgagctcc ttgagttcgc cgagggtgac 120
atcagcaagg tcttcggccc cgaattcagc cagatcgacc agtacaagcg tcgcgttcgt 180
ctccccgccc gcgagtacct cctcgtcacc cgcgtcaccc tcatggacgc cgaggtcaac 240
aactaccgcg tcggtgcccg catggtcact gagtacgacc tccccgtcaa cggtgagctc 300
tctgagggtg gtgactgccc ctgggccgtg ctcgtcgaga gtggtcagtg tgatctcatg 360
ctcatctcct acatgggtat tgacttccag aacaagagcg accgcgtcta ccgtctgctc 420
aacaccaccc tcaccttcta cggtgttgcc caggagggcg agaccctgga gtacgacatc 480
cgcgtgaccg gcttcgccaa gcgtctcgac ggtgacatct ccatgttctt cttcgagtac 540
gactgctacg tcaacggccg tctcctcatc gagatgcgcg acggctgtgc cggtttcttc 600
accaacgagg agctcgccgc cggcaagggt gtcgtcttta cccgcgctga tctcctcgcc 660
cgcgagaaga ccaagaagca ggacatcacc ccgtacgcca ttgccccgcg tcttaacaag 720
accgttctca acgagactga gatgcagtcc ctcgtggaca agaactggac caaggttttc 780
ggccccgaga acggcatgga ccagatcaac tacaaactct gcgcccgtaa gatgctcatg 840
attgaccgcg tcaccaagat tgactacacc ggtggcccct acggccttgg tcttctcgtt 900
ggtgagaaga tcctcgagcg cgaccactgg tactttccgt gccacttcgt cggagaccag 960
gtcatggctg gatccctcgt gtctgacggc tgcagccagc tcctcaagat gtacatgctc 1020
tggctcggcc tccaccttaa gaccggtccc ttcgacttcc gccccgtcaa cggccacccc 1080
aacaaggtcc gctgccgtgg ccagatctcc ccgcacaagg gtaagctcgt atacgtcatg 1140
gagatcaagg agatgggcta cgacgaggct ggtgacccgt acgccatcgc cgatgtcaac 1200
attctcgaca ttgacttcga gaagggccag actttcgacc ttgccaacct ccacgagtac 1260
ggcaagggcg acctcaacaa gaagatcgtc gtcgacttca agggtattgc cctcaagctc 1320
cagaagcgct ctggccctgc cgttgtcgct cccgagaagc ccctcgctct caacaaggac 1380
ctttgcgccc cggctgttga ggccatccct gagcacatcc tcaagggcga tgctcttgcc 1440
cctaaccaga tgacctggca cccgatgtcc aagatcgctg gcaaccccac gccctcgttc 1500
tctccctcgg cctaccctcc ccgtcccatc accttcaccc cgttccccgg caacaagaac 1560
gacaacaacc acgtgcccgg cgagatgccg ctctcgtggt acaacatggc tgagttcatg 1620
gccggcaagg tcagcctctg cctcggccct gagttcgcca agttcgatga ctccaacacc 1680
agccgcagcc ctgcatggga ccttgctctt gtgactcgtg tggtctccgt ttctgacatg 1740
gagtgggtcc agtggaagaa cgtggactgc aacccgtcca agggaaccat ggttggcgag 1800
ttcgactgcc ccatcgacgc ctggttcttc cagggatctt gtaacgacgg ccacatgccg 1860
tactccatcc tcatggagat cgccctccag acctctggtg tcctcacctc tgtgctcaag 1920
gccccgctca ccatggagaa gaaggacatt ctcttccgca accttgacgc caacgccgag 1980
atggttcgct ctgatattga cctccgcggc aagaccatcc acaacctcac caagtgtacc 2040
ggctacagca tgctcggaga catgggtgtc caccgcttca gcttcgagct ctctgttgat 2100
ggtgtagtct tctacaaggg taccacctcc ttcggctggt tcgtccctga ggtcttcatc 2160
tcccagactg gtctcgacaa cggtcgccgc acccagccct ggcacattga gtccaaggtg 2220
ccttccgccc aggtcctcac ctacgacgtt acccccaacg gtgccggtcg cacccagctc 2280
tacgccaacg cccccaaggg cgctcagctc actcgccgct ggaaccagtg ccagtacctt 2340
gacaccatcg accttgtggt cgccggtggc tccgccggtc ttggctacgg tcatggccgc 2400
aagcaggtga accccaagga ctggttcttc tcgtgccact tctggttcga ctccgtcatg 2460
cccggctcgc tcggtgtgga gtctatgttc cagctcgtcg agtccatcgc tgtcaagcag 2520
gacctcgccg gcaagtacgg catcaccaac ccgaccttcg ctcatgctcc gggcaagatc 2580
tcctggaagt accgtggtca gctcaccccc acctccaagt tcatggactc cgaggcccac 2640
attgtctcca tcgaggccca cgacggcgtc gtcgacatcg ttgccaatgg taacctctgg 2700
gctgatggcc tccgcgtcta caacgtcagc aacatccgtg tgcgcattgt tgctggcgcc 2760
gcccctgctg ctgctgctgc tgctgctgct gttgctgctc cggctgccgc ccctgctccg 2820
gttgctgcat ctggccctgc ccagaccatc accctcaagc agctcaaggc tgagcttctt 2880
gacgttgaga agcctctcta catctcctcc agcaacggcc aggtcaagaa gcacgccgat 2940
gtggctggtg gccaggccac cattgtgcag gcttgcagcc tcagtgacct cggtgatgaa 3000
ggcttcatga agacctacgg tgttgtggct cctctctaca ccggtgccat ggccaagggt 3060
attgcctctg ctgaccttgt gattgccact ggtaagcgca agatcctcgg ttccttcggt 3120
gctggcggtc tccccatgca cattgtccgt gccgctgttg agaagatcca ggctgagctc 3180
ccgaacggcc ccttcgccgt caacctcatc cactccccct tcgatagcaa ccttgagaag 3240
ggcaacgttg acctcttcct cgagaagggc gttactgtcg tcgaggcctc cgccttcatg 3300
accttgaccc cgcaagtcgt ccgctaccgt gctgctggtc tttcccgtaa cgctgatggc 3360
tccattaaca tcaagaaccg catcatcggt aaggtctccc gtaccgagct cgctgagatg 3420
ttcatccgcc ctgccccgca gaacctcctc gacaagctca tccagtctgg tgagattacc 3480
aaggagcagg ctgagcttgc caagctcgtc cccgtcgccg acgacatcgc cgtcgaggcc 3540
gactctggtg gccacaccga caaccgcccc atccacgtca tcctccccct tatcatcaac 3600
ctccgcaacc gcctccacaa ggagtgcggc taccccgctc acctccgcgt gcgcgttgga 3660
gctggtggtg gtgttggatg cccccaggcc gctgccgctg ctctcgctat gggtgctgcc 3720
ttccttgtta ccggcactgt caaccaggtc gccaagcagt ccggcacctg cgacaatgtc 3780
cgcaagcagc tctgcatggc cacctactct gacgtctgca tggctcccgc tgctgacatg 3840
ttcgaggagg gcgtcaagct ccaggtcctc aagaagggaa ccatgttccc gtccagggct 3900
aacaagctct acgagctctt ctgcaagtac gactccttcg agtccatgcc tgccacagag 3960
ctcgagcgtg ttgagaagcg catcttccag tgccctcttg ctgatgtctg ggctgagacc 4020
tccgacttct acatcaaccg cctccacaac ccggagaaga tcacccgtgc cgagcgtgac 4080
cccaagctca agatgtctct ctgcttccgc tggtaccttg gtcttgcctc tcgctgggcc 4140
aacaccggtg aggctggacg cgtcatggac taccaggtct ggtgtggccc tgccattgga 4200
gccttcaacg acttcatcaa gggctcctac cttgacccgg ccgtctctgg tgagtacccg 4260
gacgtcgtgc agatcaactt gcagatcctt cgcggtgcct gctacctccg ccgtctcaat 4320
gtcatccgca acgacccgcg tgtcagcatt gaggtcgagg atgctgagtt cgtctacgag 4380
cccaccaacg ccctctaa 4398
<210>6
<211>2997
<212>PRT
<213>Ulkenia sp.
<400>6
Met Ala Gln Arg Glu Asn Arg Leu Glu Ala Asn Met Asp Thr Arg Ile
1 5 10 15
Ala Val Ile Gly Met Ser Ala Ile Leu Pro Cys Gly Thr Thr Val Arg
20 25 30
Glu Ser Trp Glu Ala Ile Arg Asp Gly Ile Asp Cys Leu Ser Asp Leu
35 40 45
Pro Glu Asp Arg Val Asp Val Thr Ala Tyr Phe Asp Pro Val Lys Thr
50 55 60
Thr Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu Tyr
65 70 75 80
Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln Met Glu Asp
85 90 95
Ser Asp Ala Asn Gln Thr Val Thr Leu Leu Lys Val Lys Glu Ala Leu
100 105 110
Glu Asp Ala Gly Ile Glu Ala Leu Ser Lys Glu Lys Lys Asn Ile Gly
115 120 125
Cys Val Leu Gly Ile Gly Gly Gly Gln Lys Ser Ser His Glu Phe Tyr
130 135 140
Ser Arg Leu Asn Tyr Val Val Val Glu Lys Val Leu Arg Lys Met Gly
145 150 155 160
Met Pro Glu Glu Asp Val Gln Ala Ala Val Glu Lys Tyr Lys Ala Asn
165 170 175
Phe Pro Glu Trp Arg Leu Asp Ser Phe Pro Gly Phe Leu Gly Asn Val
180 185 190
Thr Ala Gly Arg Cys Thr Asn Thr Phe Asn Leu Asp Gly Met Asn Cys
195 200 205
Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala Val Lys Val Ala
210 215 220
Ile Asp Glu Leu Leu His Gly Asp Cys Asp Met Met Ile Thr Gly Ala
225 230 235 240
Thr Cys Thr Asp Asn Ser Ile Gly Met Tyr Met Ala Phe Ser Lys Thr
245 250 255
Pro Val Phe Ser Thr Asp Pro Ser Val Arg Ala Tyr Asp Glu Lys Thr
260 265 270
Lys Gly Met Leu Ile Gly Glu Gly Ser Ala Met Leu Val Leu Lys Arg
275 280 285
Tyr Ala Asp Ala Val Arg Asp Gly Asp Glu Ile His Ala Val Ile Arg
290 295 300
Gly Cys Ala Ser Ser Ser Asp Gly Lys Ala Ser Gly Ile Tyr Thr Pro
305 310 315 320
Thr Ile Ser Gly Gln Glu Glu Ala Leu Arg Arg Ala Tyr Met Arg Ala
325 330 335
Asn Val Asp Pro Ala Thr Val Thr Leu Val Glu Gly His Gly Thr Gly
340 345 350
Thr Pro Val Gly Asp Arg Ile Glu Leu Thr Ala Leu Arg Asn Leu Phe
355 360 365
Asp Ser Ala Tyr Gly Asn Glu Lys Glu Lys Val Ala Val Gly Ser Ile
370 375 380
Lys Ser Asn Ile Gly His Leu Lys Ala Val Ala Gly Leu Ala Gly Met
385 390 395 400
Ile Lys Val Ile Met Ala Leu Lys His Lys Thr Leu Pro Ala Thr Ile
405 410 415
Asn Val Asp Glu Pro Pro Lys Leu Tyr Asp Asn Thr Pro Ile Thr Asp
420 425 430
Ser Ser Leu Tyr Ile Asn Thr Met Asn Arg Pro Trp Phe Pro Ala Pro
435 440 445
Gly Val Pro Arg Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Ala
450 455 460
Asn Tyr His Ala Val Leu Glu Glu Ala Glu Pro Glu His Gln Lys Ala
465 470 475 480
Tyr Arg Leu Asn Lys Arg Pro Gln Pro Val Leu Leu Met Ala Ser Ser
485 490 495
Thr Gln Ala Leu Ala Ser Leu Cys Glu Ala Gln Leu Lys Glu Phe Glu
500 505 510
Lys Ala Ile Glu Glu Asn Lys Thr Val Lys Asn Thr Ala Tyr Ile Lys
515 520 525
Cys Val Asp Phe Cys Glu Lys Phe Lys Phe Pro Gly Ser Ile Pro Ser
530 535 540
Ser Asn Ala Arg Leu Gly Phe Leu Val Lys Glu Ala Asp Asp Ala Thr
545 550 555 560
Glu Thr Leu Arg Ala Ile Val Ala Gln Phe Gln Lys Ser Ala Gly Lys
565 570 575
Asp Ser Trp His Leu Pro Arg Gln Gly Val Ser Phe Arg Ala Gln Gly
580 585 590
Ile Asn Thr Thr Gly Gly Val Ala Ala Leu Phe Ser Gly Gln Gly Ala
595 600 605
Gln Tyr Thr His Met Phe Ser Glu Val Ala Met Asn Trp Pro Gln Phe
610 615 620
Arg Glu Ser Ile Ser Asp Met Asp Arg Ala Gln Ala Lys Val Ala Gly
625 630 635 640
Ala Asp Lys Asp Tyr Glu Arg Val Ser Gln Val Leu Tyr Pro Arg Lys
645 650 655
Pro Tyr Asn Ser Glu Pro Glu Gln Asp His Lys Lys Ile Ser Leu Thr
660 665 670
Ser Tyr Ser Gln Pro Ser Thr Leu Ala Cys Ala Leu Gly Ala Tyr Glu
675 680 685
Ile Phe Lys Gln Ala Gly Phe Lys Pro Asp Phe Ala Ala Gly His Ser
690 695 700
Leu Gly Glu Phe Ala Ala Leu Tyr Ala Ala Asp Cys Val Asn Arg Asp
705 710 715 720
Asp Leu Phe Glu Leu Val Cys Arg Arg Ala Arg Ile Met Gly Gly Lys
725 730 735
Asp Ala Pro Ala Thr Pro Lys Gly Cys Met Ala Ala Val Ile Gly Pro
740 745 750
Asn Ala Glu Lys Ile Gln Ile Arg Thr Ala Asp Val Trp Leu Gly Asn
755 760 765
Cys Asn Ser Pro Ser Gln Thr Val Ile Thr Gly Ser Val Glu Gly Ile
770 775 780
Lys Lys Glu Ser Glu Leu Leu Gln Ser Glu Gly Phe Arg Val Val Pro
785 790 795 800
Leu Ala Cys Glu Ser Ala Phe His Ser Pro Gln Met Gln Asn Ala Ser
805 810 815
Ser Ala Phe Lys Asp Val Leu Ser Lys Val Ala Phe Arg Gln Pro Ser
820 825 830
Ala Gln Thr Lys Leu Phe Ser Asn Val Ser Gly Glu Thr Tyr Ser Asn
835 840 845
Asn Ala Gln Asp Leu Leu Lys Glu His Met Thr Ser Ser Val Lys Phe
850 855 860
Ile Ser Gln Val Arg Asn Met His Ser Ala Gly Ala Arg Ile Phe Val
865 870 875 880
Glu Phe Gly Pro Lys Gln Val Leu Ser Lys Leu Val Ser Glu Thr Leu
885 890 895
Lys Asp Asp Pro Ser Ile Ile Thr Ile Ser Val Asn Pro Ser Ser Gly
900 905 910
Lys Asp Ala Asp Ile Gln Leu Arg Glu Ala Ala Val Gln Leu Val Val
915 920 925
Ala Gly Val Asn Leu Gln Gly Phe Asp Lys Trp Asp Ala Pro Asp Ala
930 935 940
Thr Arg Leu Gln Pro Ile Lys Lys Lys Lys Thr Thr Leu Arg Leu Ser
945 950 955 960
Ala Ala Thr Tyr Val Ser Asp Lys Thr Lys Lys Ala Arg Glu Ala Ala
965 970 975
Met Asn Asp Gly Arg Met Leu Ser Cys Val Ser Lys Val Ile Ala Pro
980 985 990
Pro Asp Ala Lys Pro Ile Val Asp Thr Lys Ala Gln Glu Glu Val Ala
995 1000 1005
Arg Leu Gln Lys Gln Leu Gln Asp Ala Gln Ala Gln Ile Gln Lys
1010 1015 1020
Ala Lys Ala Asp Ala Ala Glu Ala Asp Lys Lys Leu Ala Ala Ala
1025 1030 1035
Lys Asp Glu Ala Lys Arg Ala Ala Ala Ser Ala Pro Val Gln Lys
1040 1045 1050
Gln Val Asp Thr Thr Ile Val Asp Lys His Arg Ala Ile Leu Lys
1055 1060 1065
Ser Met Leu Ala Glu Leu Asp Cys Tyr Ser Thr Pro Gly Ala Val
1070 1075 1080
Ser Ser Ser Phe Gln Ala Pro Val Ala Ala Thr Pro Ala Pro Val
1085 1090 1095
Ala Ala Pro Val Ala Ala Ala Pro Ala Pro Ala Val Asn Asn Ala
1100 1105 1110
Leu Leu Ala Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala
1115 1120 1125
Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu
1130 1135 1140
Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu
1145 1150 1155
Ser Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp
1160 1165 1170
Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met
1175 1180 1185
Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro
1190 1195 1200
Ala Pro Val Ala Ala Ala Pro Ala Ala Pro Ala Pro Ala Val Asn
1205 1210 1215
Ser Ala Leu Leu Ala Lys Ala Glu Thr Val Val Met Glu Val Leu
1220 1225 1230
Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met
1235 1240 1245
Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu
1250 1255 1260
Ile Leu Ser Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp
1265 1270 1275
Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn
1280 1285 1290
Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala
1295 1300 1305
Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val Ala Ala Ala
1310 1315 1320
Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val
1325 1330 1335
Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met
1340 1345 1350
Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser
1355 1360 1365
Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Gln Leu Asn
1370 1375 1380
Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val
1385 1390 1395
Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser
1400 1405 1410
Gly Ala Ala Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val
1415 1420 1425
Ala Ala Ala Ala Pro Ala Val Asn Ser Ala Leu Leu Glu Lys Ala
1430 1435 1440
Glu Thr Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu
1445 1450 1455
Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr Glu Leu Gly
1460 1465 1470
Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala
1475 1480 1485
Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr
1490 1495 1500
Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala
1505 1510 1515
Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala
1520 1525 1530
Ala Pro Ala Pro Val Ala Ala Pro Ala Val Ser Ser Ala Leu Leu
1535 1540 1545
Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr
1550 1555 1560
Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
1565 1570 1575
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu
1580 1585 1590
Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu
1595 1600 1605
Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala
1610 1615 1620
Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro
1625 1630 1635
Val Ala Ala Ser Pro Ala Pro Val Ala Ala Ala Ala Pro Ala Val
1640 1645 1650
Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val
1655 1660 1665
Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp
1670 1675 1680
Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val
1685 1690 1695
Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys
1700 1705 1710
Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val
1715 1720 1725
Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala
1730 1735 1740
Ala Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val Thr Ala
1745 1750 1755
Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser
1760 1765 1770
Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp
1775 1780 1785
Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp
1790 1795 1800
Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Met Leu
1805 1810 1815
Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr
1820 1825 1830
Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Ser Ser
1835 1840 1845
Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala Ala Val Ala Pro Ala
1850 1855 1860
Pro Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala
1865 1870 1875
Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu
1880 1885 1890
Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly
1895 1900 1905
Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala
1910 1915 1920
Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr
1925 1930 1935
Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala
1940 1945 1950
Ser Ser Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala Ala Ala Ala
1955 1960 1965
Pro Ala Pro Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu
1970 1975 1980
Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly
1985 1990 1995
Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu
2000 2005 2010
Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
2015 2020 2025
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser
2030 2035 2040
Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu
2045 2050 2055
Ile Ala Ser Ser Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala Ala
2060 2065 2070
Ala Ala Pro Ala Pro Ala Ala Ala Pro Ala Val Ser Ser Ala Leu
2075 2080 2085
Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
2090 2095 2100
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu
2105 2110 2115
Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser
2120 2125 2130
Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala
2135 2140 2145
Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys
2150 2155 2160
Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Thr Ala Ser Ala Pro
2165 2170 2175
Ala Ala Ala Ala Ala Ala Pro Ala Ile Lys Ile Ser Thr Val His
2180 2185 2190
Gly Ala Asp Cys Asp Asp Leu Ser Val Met Ser Ala Glu Leu Val
2195 2200 2205
Asp Ile Arg Arg Ala Asp Glu Leu Leu Leu Glu Arg Pro Glu Asn
2210 2215 2220
Arg Pro Val Leu Ile Val Asp Asp Gly Thr Glu Leu Thr Ser Ala
2225 2230 2235
Leu Val Arg Val Leu Gly Ala Gly Ala Val Val Leu Thr Phe Asp
2240 2245 2250
Gly Leu Gln Leu Ala Gln Arg Ala Gly Ala Ala Val Arg His Val
2255 2260 2265
Gln Val Lys Asp Leu Ser Ala Glu Ser Ala Glu Lys Ala Ile Lys
2270 2275 2280
Glu Ala Glu Gln Arg Phe Gly Gln Leu Gly Gly Phe Ile Ser Gln
2285 2290 2295
Gln Ala Glu Arg Phe Ala Pro Ala Asp Ile Leu Gly Phe Thr Leu
2300 2305 2310
Met Cys Ala Lys Phe Ala Lys Ala Ser Leu Cys Thr Pro Val Gln
2315 2320 2325
Gly Gly Arg Ala Phe Phe Ile Gly Val Ala Arg Leu Asp Gly Arg
2330 2335 2340
Leu Gly Phe Thr Ser Gln Gly Ser Thr Asp Ser Leu Thr Arg Ala
2345 2350 2355
Gln Arg Gly Ala Ile Phe Gly Leu Cys Lys Thr Ile Gly Leu Glu
2360 2365 2370
Trp Ser Ala Asn Glu Val Phe Ala Arg Gly Ile Asp Ile Ala Arg
2375 2380 2385
Glu Val His Pro Glu Asp Ala Ala Val Ala Ile Thr Arg Glu Met
2390 2395 2400
Ser Cys Ala Asp Asn Arg Ile Arg Glu Val Gly Ile Gly Leu Asn
2405 2410 2415
Gln Lys Arg Cys Thr Ile Arg Ala Val Asp Leu Lys Pro Gly Ala
2420 2425 2430
Pro Lys Ile Gln Ile Ser Gln Asp Asp Val Leu Leu Val Ser Gly
2435 2440 2445
Gly Ala Arg Gly Ile Thr Pro Leu Cys Ile Arg Glu Ile Thr Arg
2450 2455 2460
Gln Val Arg Gly Gly Lys Tyr Ile Leu Leu Gly Arg Ser Lys Val
2465 2470 2475
Pro Ala Gly Glu Pro Ala Trp Cys Asn Gly Val Ser Asp Asp Asp
2480 2485 2490
Leu Gly Lys Ala Ala Met Gln Glu Leu Lys Arg Ala Phe Ser Ala
2495 2500 2505
Gly Glu Gly Pro Lys Pro Thr Pro Met Thr His Lys Lys Leu Val
2510 2515 2520
Gly Thr Ile Ala Gly Ala Arg Glu Val Arg Ser Ser Ile Ala Asn
2525 2530 2535
Ile Glu Ala Leu Gly Gly Lys Ala Ile Tyr Ser Ser Cys Asp Val
2540 2545 2550
Asn Ser Ala Ala Asp Val Ala Lys Ala Val Arg Glu Ala Glu Ala
2555 2560 2565
Gln Leu Gly Ala Arg Val Thr Gly Val Val His Ala Ser Gly Val
2570 2575 2580
Leu Arg Asp Arg Leu Ile Glu Gln Lys Arg Pro Asp Glu Phe Asp
2585 2590 2595
Ala Val Phe Gly Thr Lys Val Thr Gly Leu Glu Asn Leu Phe Gly
2600 2605 2610
Ala Ile Asp Met Ala Asn Leu Lys His Leu Val Leu Phe Ser Ser
2615 2620 2625
Leu Ala Gly Phe His Gly Asn Ile Gly Gln Ser Asp Tyr Ala Met
2630 2635 2640
Ala Asn Glu Ala Leu Asn Lys Met Gly Leu Glu Leu Ser Asp Arg
2645 2650 2655
Val Ser Val Lys Ser Ile Cys Phe Gly Pro Trp Asp Gly Gly Met
2660 2665 2670
Val Thr Pro Gln Leu Lys Lys Gln Phe Gln Ser Met Gly Val Gln
2675 2680 2685
Ile Ile Pro Arg Glu Gly Gly Ala Asp Thr Val Ala Arg Ile Val
2690 2695 2700
Leu Gly Ser Ser Pro Ala Glu Ile Leu Val Gly Asn Trp Thr Thr
2705 2710 2715
Pro Thr Lys Lys Val Gly Ser Glu Pro Val Val Ile His Arg Lys
2720 2725 2730
Ile Ser Ala Ala Ser Asn Pro Phe Leu Lys Asp His Val Ile Gln
2735 2740 2745
Gly Arg Cys Val Leu Pro Met Thr Ile Ala Val Gly Cys Leu Ala
2750 2755 2760
Glu Thr Cys Leu Gly Gln Phe Pro Gly Tyr Ser Leu Trp Ala Ile
2765 2770 2775
Glu Asp Ala Gln Leu Phe Lys Gly Val Thr Val Asp Gly Asp Val
2780 2785 2790
Asn Cys Glu Ile Thr Leu Lys Pro Ser Gln Gly Thr Ala Gly Arg
2795 2800 2805
Val Met Ile Gln Ala Thr Leu Lys Thr Phe Ala Ser Gly Lys Leu
2810 2815 2820
Val Pro Ala Tyr Arg Ala Val Ile Val Leu Ser Thr Gln Gly Lys
2825 2830 2835
Pro Pro Ala Ala Thr Thr Ser Gln Thr Pro Ser Leu Gln Ala Asp
2840 2845 2850
Pro Ala Ala Arg Gly Asn Pro Tyr Asp Gly Lys Thr Leu Phe His
2855 2860 2865
Gly Pro Ala Phe Gln Gly Leu Lys Glu Ile Ile Ser Cys Asn Lys
2870 2875 2880
Ser Gln Leu Val Ala Glu Cys Thr Phe Ile Pro Ser Ser Glu Ser
2885 2890 2895
Ala Gly Glu Phe Ala Ser Asp Tyr Glu Ser His Asn Pro Phe Val
2900 2905 2910
Asn Asp Ile Ala Phe Gln Ala Met Leu Val Trp Ile Arg Arg Thr
2915 2920 2925
Leu Gly Gln Ala Ala Leu Pro Asn Ser Ile Gln Arg Ile Val Gln
2930 2935 2940
His Arg Ala Leu Pro Gln Asp Lys Pro Phe Tyr Leu Thr Leu Lys
2945 2950 2955
Ser Asn Ser Ala Ser Gly His Ser Gln His Lys Thr Ser Val Gln
2960 2965 2970
Phe His Asn Glu Gln Gly Asp Leu Phe Val Asp Ile Gln Ala Ser
2975 2980 2985
Val Thr Ser Ser Asp Ser Leu Ala Phe
2990 2995
<210>7
<211>2030
<212>PRT
<213>Ulkenia sp.
<400>7
Met Ala Ser Arg Lys Asn Val Ser Ala Ala His Glu Met His Asp Glu
1 5 10 15
Lys Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr Ala Gly Cys Lys
20 25 30
Asp Lys Glu Glu Phe Trp Lys Val Val Met Gly Gly Glu Ala Ala Trp
35 40 45
Thr Lys Ile Ser Asp Lys Arg Leu Gly Ser Asn Lys Arg Ala Glu His
50 55 60
Phe Lys Ala Glu Arg Ser Lys Phe Ala Asp Thr Phe Cys Asn Glu Asn
65 70 75 80
Tyr Gly Cys Val Asp Asp Ser Val Asp Asn Glu His Glu Leu Leu Leu
85 90 95
Lys Leu Ser Lys Lys Ala Leu Ser Glu Thr Ser Val Ser Asp Ser Thr
100 105 110
Arg Cys Gly Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu
115 120 125
Gln Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu
130 135 140
Gly Ala Arg Val Phe Lys Asp Ala Ser Lys Trp Ser Glu Arg Glu Gln
145 150 155 160
Ser Gln Asn Pro Glu Ala Gly Asp Arg Arg Ile Phe Met Asp Pro Ala
165 170 175
Ser Phe Val Ala Glu Glu Leu Asn Leu Gly Pro Leu His Tyr Ser Val
180 185 190
Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val Leu Arg Leu Ala Gln Asp
195 200 205
His Leu Val Ser Gly Ala Ala Asp Val Met Leu Ala Gly Ala Thr Cys
210 215 220
Phe Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala
225 230 235 240
Met Pro Val Ser Gly Asp Gly Ile Ser Tyr Pro Leu His Lys Asp Ser
245 250 255
Gln Gly Leu Thr Pro Gly Glu Gly Gly Ala Ile Met Val Leu Lys Arg
260 265 270
Leu Asp Asp Ala Ile Arg Asp Gly Asp His Ile Tyr Gly Thr Leu Leu
275 280 285
Gly Ala Thr Ile Ser Asn Ala Gly Cys Gly Leu Pro Leu Lys Pro His
290 295 300
Leu Pro Ser Glu Lys Ser Cys Leu Ile Asp Thr Tyr Lys Arg Val Asn
305 310 315 320
Val His Pro His Lys Ile Gln Tyr Val Glu Cys His Ala Thr Gly Thr
325 330 335
Pro Gln Gly Asp Arg Val Glu Ile Asp Ala Val Lys Ala Cys Phe Glu
340 345 350
Gly Lys Val Pro Arg Phe Gly Ser Ser Lys Gly Asn Phe Gly His Thr
355 360 365
Leu Val Ala Ala Gly Phe Ala Gly Met Cys Lys Val Leu Leu Ala Met
370 375 380
Lys His Gly Val Ile Pro Pro Thr Pro Gly Val Asp Gly Ser Ser Gln
385 390 395 400
Met Asp Pro Leu Val Val Ser Glu Pro Ile Pro Trp Pro Asp Thr Glu
405 410 415
Gly Glu Pro Lys Arg Ala Gly Leu Ser Ala Phe Gly Phe Gly Gly Thr
420 425 430
Asn Ala His Ala Val Phe Glu Glu Phe Asp Arg Ser Lys Ala Ala Cys
435 440 445
Ala Thr His Asp Ser Ile Ser Ser Leu Ser Ser Arg Cys Gly Gly Glu
450 455 460
Gly Asn Met Arg Ile Ala Ile Thr Gly Met Asp Ala Thr Phe Gly Ser
465 470 475 480
Leu Lys Gly Leu Asp Ala Phe Glu Arg Ala Ile Tyr Asn Gly Gln His
485 490 495
Gly Ala Val Pro Leu Pro Glu Lys Arg Trp Arg Phe Leu Gly Lys Asp
500 505 510
Lys Asp Phe Leu Asp Leu Cys Gly Val Lys Glu Val Pro His Gly Cys
515 520 525
Tyr Ile Glu Asp Val Glu Val Asp Phe Ser Arg Leu Arg Thr Pro Met
530 535 540
Thr Pro Asp Asp Met Leu Arg Pro Met Gln Leu Leu Ala Val Thr Thr
545 550 555 560
Ile Asp Arg Ala Ile Leu Asn Ser Gly Leu Lys Lys Gly Gly Lys Val
565 570 575
Ala Val Phe Val Gly Leu Gly Thr Asp Leu Glu Leu Tyr Arg His Arg
580 585 590
Ala Arg Val Ala Leu Lys Glu Arg Ala Arg Pro Glu Ala Ala Ser Ala
595 600 605
Leu Asn Asp Met Met Ser Tyr Ile Asn Asp Cys Gly Thr Ala Thr Ser
610 615 620
Tyr Thr Ser Tyr Ile Gly Asn Leu Val Ala Thr Arg Val Ser Ser Gln
625 630 635 640
Trp Gly Phe Glu Gly Pro Ser Phe Thr Ile Thr Glu Gly Asn Asn Ser
645 650 655
Val Tyr Arg Cys Ala Glu Leu Gly Lys Tyr Leu Leu Glu Thr Gly Glu
660 665 670
Val Glu Ala Val Val Ile Ala Gly Val Asp Leu Cys Ala Ser Ala Glu
675 680 685
Asn Leu Tyr Val Lys Ser Arg Arg Phe Lys Val Ser Glu Gln Glu Ser
690 695 700
Pro Arg Ala Ser Phe Asp Ser Gly Ala Asp Gly Tyr Phe Val Gly Glu
705 710 715 720
Gly Cys Gly Ala Leu Val Leu Lys Arg Glu Ser Asp Cys Thr Lys Asp
725 730 735
Glu Arg Ile Tyr Ala Cys Met Asp Ala Ile Val Pro Gly Asn Met Pro
740 745 750
Ala Ala Cys Met Glu Glu Ala Leu Ala Gln Ala Arg Val Asn Pro Lys
755 760 765
Asp Val Glu Met Leu Glu Leu Ser Ala Asp Ser Ala Arg His Leu Lys
770 775 780
Asn Pro Ser Val Leu Pro Lys Glu Leu Thr Ala Glu Glu Glu Ile Arg
785 790 795 800
Gly Ile Glu Ala Ile Leu Ser Gln Arg Ser Ser Asn Glu Ala Val Glu
805 810 815
Pro His Asn Val Ala Val Ser Ser Val Lys Ser Thr Val Gly Asp Thr
820 825 830
Gly Tyr Ala Ser Gly Ala Ala Ser Leu Ile Lys Thr Ala Leu Cys Leu
835 840 845
Tyr Asn Arg Tyr Leu Pro Ser Asn Gly Ala Ser Trp Glu Glu Pro Ala
850 855 860
Pro Glu Thr Gln Trp Gly Lys Ser Leu Tyr Ala Cys Gln Ser Ser Arg
865 870 875 880
Ala Trp Leu Lys Asn Pro Gly Ala Arg Arg His Ala Ala Val Ser Gly
885 890 895
Val Ser Glu Thr Arg Ser Cys Tyr Thr Val Leu Leu Ser Asp Val Glu
900 905 910
Gly His His Glu Thr Lys Ser Arg Ile Ser Leu Asp Asp Asp Ala Val
915 920 925
Lys Leu Leu Val Ile Arg Gly Asp Ser His Asp Ala Ile Thr Gln Arg
930 935 940
Val Asp Lys Leu Arg Glu Arg Leu Ala Gln Pro Ser Ala Asn Val Arg
945 950 955 960
Leu Ala Phe Met Glu Leu Leu Gly Glu Ser Ile Ala Gln Glu Thr Lys
965 970 975
Thr Pro Leu Pro Ala Phe Ala Leu Cys Leu Val Thr Ser Pro Ser Lys
980 985 990
Leu Gln Lys Glu Leu Glu Leu Ala Ser Lys Gly Ile Pro Arg Ser Leu
995 1000 1005
Lys Met Gly Arg Asp Trp Thr Ser Pro Ser Gly Ser His Phe Ala
1010 1015 1020
Pro Lys Pro Leu Ser Ser Asp Arg Val Ala Phe Met Tyr Gly Glu
1025 1030 1035
Gly Arg Ser Pro Tyr Tyr Gly Ile Gly Leu Asp Ile His Arg Ile
1040 1045 1050
Trp Pro Glu Leu His Glu Phe Val Asn Ala Lys Thr Asn Lys Leu
1055 1060 1065
Trp Asp Gln Gly Asp Arg Trp Leu Ile Pro Arg Ala Ser Thr Lys
1070 1075 1080
Glu Glu Leu Lys Ala Gln Glu Asp Glu Phe Asn Arg Asn Gln Val
1085 1090 1095
Glu Met Phe Arg Leu Gly Ile Leu Met Ser Met Cys Phe Thr His
1100 1105 1110
Ile Ala Arg Asp Val Leu Gly Ile Gln Pro Lys Ala Ala Phe Gly
1115 1120 1125
Leu Ser Leu Gly Glu Ile Ser Met Val Phe Ala Phe Ser Glu Lys
1130 1135 1140
Asn Gly Leu Val Ser Glu Glu Leu Thr Thr Lys Leu Arg Asn Ser
1145 1150 1155
Glu Val Trp Arg Lys Ala Leu Ala Val Glu Phe Asp Ala Leu Arg
1160 1165 1170
Lys Ala Trp Asn Ile Pro Gln Asp Thr Pro Val Ser Glu Phe Trp
1175 1180 1185
Gln Gly Tyr Val Val Arg Gly Thr Arg Glu Ala Val Glu Ala Ala
1190 1195 1200
Ile Gly Pro Asn Asn Lys Tyr Val His Leu Thr Ile Val Asn Asp
1205 1210 1215
Ala Asn Ser Ala Leu Ile Ser Gly Lys Pro Glu Asp Cys Lys Ala
1220 1225 1230
Ala Ile Ala Arg Leu Ser Ser Asn Leu Pro Ala Leu Pro Val Asp
1235 1240 1245
Leu Gly Met Cys Gly His Cys Pro Val Val Glu Pro Tyr Gly Lys
1250 1255 1260
Gln Ile Ala Glu Ile His Ser Val Leu Glu Ile Pro Glu Val Ala
1265 1270 1275
Gly Leu Asp Leu Tyr Thr Ser Val Asn Gln Lys Lys Leu Val Asn
1280 1285 1290
Lys Ser Thr Gly Ala Ser Asp Glu Tyr Ala Pro Ser Phe Gly Glu
1295 1300 1305
Tyr Ala Ala Gln Leu Tyr Thr Val Gln Ala Asp Phe Pro Lys Ile
1310 1315 1320
Ala Lys Thr Val Ser Asp Lys Asn Phe Asp Val Phe Val Glu Thr
1325 1330 1335
Gly Pro Asn Ala His Arg Ser Ala Ala Ile Arg Ala Thr Leu Gly
1340 1345 1350
Asn Ser Lys Pro Phe Val Thr Gly Ser Met Asp Arg Gln Asn Glu
1355 1360 1365
Asn Ala Trp Thr Thr Met Val Lys Leu Val Ala Ser Leu Gln Ala
1370 1375 1380
His Arg Val Pro Gly Val Lys Val Ser Pro Leu Tyr His Pro Glu
1385 1390 1395
Thr Val Glu Glu Ala Thr Gln Ser Tyr Asn Asp Met Val Ala Gly
1400 1405 1410
Lys Lys Pro Thr Lys Asn Lys Phe Leu Arg Lys Ile Val Val Asn
1415 1420 1425
Gly Arg Tyr Asp Pro Lys Lys Gln Leu Val Pro Pro Gln Val Leu
1430 1435 1440
Ala Lys Leu Pro Pro Ala Asp Pro Lys Ile Glu Ala Leu Ile Gln
1445 1450 1455
Ala Arg Lys Met Gln Pro Ile Ala Pro Lys Phe Met Glu Arg Leu
1460 1465 1470
Asp Ile Gln Glu Gln Asp Ala Thr Arg Asp Pro Ile Leu Asn Lys
1475 1480 1485
Asp Asn Lys Pro Ser Ala Ala Pro Ala Leu Ala Pro Ala Ala Pro
1490 1495 1500
Ala Arg Ser Val Ser Gly Ala Val Val Ala Ser Ser Glu Ala Leu
1505 1510 1515
Arg Ala Lys Leu Leu Glu Leu Asn Ser Thr Leu Met Leu Gly Val
1520 1525 1530
Asn Ala Asn Gly Asp Leu Val Glu Ala Ser Pro Ser Glu Ala Ser
1535 1540 1545
Ile Val Val Pro Lys Cys Asp Ile Lys Asp Leu Gly Ser Arg Ala
1550 1555 1560
Phe Met Glu Thr Tyr Gly Val Ser Ala Pro Met Tyr Thr Gly Ala
1565 1570 1575
Met Ala Lys Gly Ile Ala Ser Ala Glu Met Val Ile Ala Ala Gly
1580 1585 1590
Lys Arg Gly Ile Leu Gly Ser Leu Gly Ala Gly Gly Leu Pro Ile
1595 1600 1605
Ala Thr Val Arg Lys Ala Leu Glu Ala Ile Gln Ala Glu Leu Pro
1610 1615 1620
Lys Gly Pro Tyr Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser
1625 1630 1635
Asn Leu Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly Val
1640 1645 1650
Thr Val Val Glu Ala Ser Ala Phe Met Thr Leu Thr Pro Gln Leu
1655 1660 1665
Val Arg Tyr Arg Ala Ala Gly Leu Ser Arg Ala Ala Asp Gly Ser
1670 1675 1680
Thr Val Ile Lys Asn Arg Val Ile Gly Lys Val Ser Arg Thr Glu
1685 1690 1695
Leu Ala Ala Met Phe Ile Arg Pro Ala Pro Glu Asn Leu Leu Glu
1700 1705 1710
Lys Leu Leu Lys Ser Gly Glu Ile Thr Gln Glu Gln Ala Ala Leu
1715 1720 1725
Ala Arg Thr Val Pro Val Ala Asp Asp Ile Ala Val Glu Ala Asp
1730 1735 1740
Ser Gly Gly His Thr Asp Asn Arg Pro Ile His Val Ile Leu Pro
1745 1750 1755
Leu Ile Val Asn Leu Arg Asp Arg Leu His Lys Glu Cys Gly Tyr
1760 1765 1770
Pro Ala His Leu Arg Val Arg Val Gly Ala Gly Gly Gly Ile Gly
1775 1780 1785
Cys Pro Gln Ala Ala Ile Ala Thr Phe Asn Met Gly Ala Ala Phe
1790 1795 1800
Ile Val Thr Gly Thr Val Asn Gln Met Ser Lys Gln Ala Gly Thr
1805 1810 1815
Cys Asp Thr Val Arg Lys Gln Leu Ser Gln Ala Thr Tyr Ser Asp
1820 1825 1830
Ile Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val Lys
1835 1840 1845
Leu Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala Asn
1850 1855 1860
Lys Leu Tyr Glu Leu Phe Val Lys Tyr Asp Ser Phe Glu Ser Met
1865 1870 1875
Ala Pro Gly Glu Leu Glu Arg Val Glu Lys Arg Ile Phe Lys Lys
1880 1885 1890
Ser Leu Ser Glu Val Trp Glu Glu Thr Lys Asp Phe Tyr Ile Asn
1895 1900 1905
Arg Leu Gln Asn Pro Glu Lys Ile Glu Arg Ala Glu Arg Asp Pro
1910 1915 1920
Lys Leu Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Gly Leu Ala
1925 1930 1935
Ser Phe Trp Ala Asn Ala Gly Ile Pro Asp Arg Ala Met Asp Tyr
1940 1945 1950
Gln Val Trp Cys Gly Pro Ala Ile Gly Ser Phe Asn Asp Phe Ile
1955 1960 1965
Lys Gly Thr Tyr Leu Asp Pro Ala Val Ala Asn Glu Tyr Pro Asp
1970 1975 1980
Val Val Gln Ile Asn Leu Gln Ile Leu Arg Gly Ala Cys Phe Leu
1985 1990 1995
Arg Arg Leu Glu Ala Val Arg Asn Ala Pro Leu Lys Ala Asn Ala
2000 2005 2010
Lys Gln Val Ala Ala Glu Ile Asp Asp Ile Tyr Val Pro Thr Glu
2015 2020 2025
Arg Leu
2030
<210>8
<211>1465
<212>PRT
<213>Ulkenia sp.
<400>8
Met Ala Thr Arg Val Lys Thr Asn Lys Lys Pro Cys Trp Glu Met Thr
1 5 10 15
Lys Glu Glu Leu Thr Ser Gly Lys Asn Val Val Phe Asp Tyr Asp Glu
20 25 30
Leu Leu Glu Phe Ala Glu Gly Asp Ile Ser Lys Val Phe Gly Pro Glu
35 40 45
Phe Ser Gln Ile Asp Gln Tyr Lys Arg Arg Val Arg Leu Pro Ala Arg
50 55 60
Glu Tyr Leu Leu Val Thr Arg Val Thr Leu Met Asp Ala Glu Val Asn
65 70 75 80
Asn Tyr Arg Val Gly Ala Arg Met Val Thr Glu Tyr Asp Leu Pro Val
85 90 95
Asn Gly Glu Leu Ser Glu Gly Gly Asp Cys Pro Trp Ala Val Leu Val
100 105 110
Glu Ser Gly Gln Cys Asp Leu Met Leu Ile Ser Tyr Met Gly Ile Asp
115 120 125
Phe Gln Asn Lys Ser Asp Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu
130 135 140
Thr Phe Tyr Gly Val Ala Gln Glu Gly Glu Thr Leu Glu Tyr Asp Ile
145 150 155 160
Arg Val Thr Gly Phe Ala Lys Arg Leu Asp Gly Asp Ile Ser Met Phe
165 170 175
Phe Phe Glu Tyr Asp Cys Tyr Val Asn Gly Arg Leu Leu Ile Glu Met
180 185 190
Arg Asp Gly Cys Ala Gly Phe Phe Thr Asn Glu Glu Leu Ala Ala Gly
195 200 205
Lys Gly Val Val Phe Thr Arg Ala Asp Leu Leu Ala Arg Glu Lys Thr
210 215 220
Lys Lys Gln Asp Ile Thr Pro Tyr Ala Ile Ala Pro Arg Leu Asn Lys
225 230 235 240
Thr Val Leu Asn Glu Thr Glu Met Gln Ser Leu Val Asp Lys Asn Trp
245 250 255
Thr Lys Val Phe Gly Pro Glu Asn Gly Met Asp Gln Ile Asn Tyr Lys
260 265 270
Leu Cys Ala Arg Lys Met Leu Met Ile Asp Arg Val Thr Lys Ile Asp
275 280 285
Tyr Thr Gly Gly Pro Tyr Gly Leu Gly Leu Leu Val Gly Glu Lys Ile
290 295 300
Leu Glu Arg Asp His Trp Tyr Phe Pro Cys His Phe Val Gly Asp Gln
305 310 315 320
Val Met Ala Gly Ser Leu Val Ser Asp Gly Cys Ser Gln Leu Leu Lys
325 330 335
Met Tyr Met Leu Trp Leu Gly Leu His Leu Lys Thr Gly Pro Phe Asp
340 345 350
Phe Arg Pro Val Asn Gly His Pro Asn Lys Val Arg Cys Arg Gly Gln
355 360 365
Ile Ser Pro His Lys Gly Lys Leu Val Tyr Val Met Glu Ile Lys Glu
370 375 380
Met Gly Tyr Asp Glu Ala Gly Asp Pro Tyr Ala Ile Ala Asp Val Asn
385 390 395 400
Ile Leu Asp Ile Asp Phe Glu Lys Gly Gln Thr Phe Asp Leu Ala Asn
405 410 415
Leu His Glu Tyr Gly Lys Gly Asp Leu Asn Lys Lys Ile Val Val Asp
420 425 430
Phe Lys Gly Ile Ala Leu Lys Leu Gln Lys Arg Ser Gly Pro Ala Val
435 440 445
Val Ala Pro Glu Lys Pro Leu Ala Leu Asn Lys Asp Leu Cys Ala Pro
450 455 460
Ala Val Glu Ala Ile Pro Glu His Ile Leu Lys Gly Asp Ala Leu Ala
465 470 475 480
Pro Asn Gln Met Thr Trp His Pro Met Ser Lys Ile Ala Gly Asn Pro
485 490 495
Thr Pro Ser Phe Ser Pro Ser Ala Tyr Pro Pro Arg Pro Ile Thr Phe
500 505 510
Thr Pro Phe Pro Gly Asn Lys Asn Asp Asn Asn His Val Pro Gly Glu
515 520 525
Met Pro Leu Ser Trp Tyr Asn Met Ala Glu Phe Met Ala Gly Lys Val
530 535 540
Ser Leu Cys Leu Gly Pro Glu Phe Ala Lys Phe Asp Asp Ser Asn Thr
545 550 555 560
Ser Arg Ser Pro Ala Trp Asp Leu Ala Leu Val Thr Arg Val Val Ser
565 570 575
Val Ser Asp Met Glu Trp Val Gln Trp Lys Asn Val Asp Cys Asn Pro
580 585 590
Ser Lys Gly Thr Met Val Gly Glu Phe Asp Cys Pro Ile Asp Ala Trp
595 600 605
Phe Phe Gln Gly Ser Cys Asn Asp Gly His Met Pro Tyr Ser Ile Leu
610 615 620
Met Glu Ile Ala Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu Lys
625 630 635 640
Ala Pro Leu Thr Met Glu Lys Lys Asp Ile Leu Phe Arg Asn Leu Asp
645 650 655
Ala Asn Ala Glu Met Val Arg Ser Asp Ile Asp Leu Arg Gly Lys Thr
660 665 670
Ile His Asn Leu Thr Lys Cys Thr Gly Tyr Ser Met Leu Gly Asp Met
675 680 685
Gly Val His Arg Phe Ser Phe Glu Leu Ser Val Asp Gly Val Val Phe
690 695 700
Tyr Lys Gly Thr Thr Ser Phe Gly Trp Phe Val Pro Glu Val Phe Ile
705 710 715 720
Ser Gln Thr Gly Leu Asp Asn Gly Arg Arg Thr Gln Pro Trp His Ile
725 730 735
Glu Ser Lys Val Pro Ser Ala Gln Val Leu Thr Tyr Asp Val Thr Pro
740 745 750
Asn Gly Ala Gly Arg Thr Gln Leu Tyr Ala Asn Ala Pro Lys Gly Ala
755 760 765
Gln Leu Thr Arg Arg Trp Asn Gln Cys Gln Tyr Leu Asp Thr Ile Asp
770 775 780
Leu Val Val Ala Gly Gly Ser Ala Gly Leu Gly Tyr Gly His Gly Arg
785 790 795 800
Lys Gln Val Asn Pro Lys Asp Trp Phe Phe Ser Cys His Phe Trp Phe
805 810 815
Asp Ser Val Met Pro Gly Ser Leu Gly Val Glu Ser Met Phe Gln Leu
820 825 830
Val Glu Ser Ile Ala Val Lys Gln Asp Leu Ala Gly Lys Tyr Gly Ile
835 840 845
Thr Asn Pro Thr Phe Ala His Ala Pro Gly Lys Ile Ser Trp Lys Tyr
850 855 860
Arg Gly Gln Leu Thr Pro Thr Ser Lys Phe Met Asp Ser Glu Ala His
865 870 875 880
Ile Val Ser Ile Glu Ala His Asp Gly Val Val Asp Ile Val Ala Asn
885 890 895
Gly Asn Leu Trp Ala Asp Gly Leu Arg Val Tyr Asn Val Ser Asn Ile
900 905 910
Arg Val Arg Ile Val Ala Gly Ala Ala Pro Ala Ala Ala Ala Ala Ala
915 920 925
Ala Ala Val Ala Ala Pro Ala Ala Ala Pro Ala Pro Val Ala Ala Ser
930 935 940
Gly Pro Ala Gln Thr Ile Thr Leu Lys Gln Leu Lys Ala Glu Leu Leu
945 950 955 960
Asp Val Glu Lys Pro Leu Tyr Ile Ser Ser Ser Asn Gly Gln Val Lys
965 970 975
Lys His Ala Asp Val Ala Gly Gly Gln Ala Thr Ile Val Gln Ala Cys
980 985 990
Ser Leu Ser Asp Leu Gly Asp Glu Gly Phe Met Lys Thr Tyr Gly Val
995 1000 1005
Val Ala Pro Leu Tyr Thr Gly Ala Met Ala Lys Gly Ile Ala Ser
1010 1015 1020
Ala Asp Leu Val Ile Ala Thr Gly Lys Arg Lys Ile Leu Gly Ser
1025 1030 1035
Phe Gly Ala Gly Gly Leu Pro Met His Ile Val Arg Ala Ala Val
1040 1045 1050
Glu Lys Ile Gln Ala Glu Leu Pro Asn Gly Pro Phe Ala Val Asn
1055 1060 1065
Leu Ile His Ser Pro Phe Asp Ser Asn Leu Glu Lys Gly Asn Val
1070 1075 1080
Asp Leu Phe Leu Glu Lys Gly Val Thr Val Val Glu Ala Ser Ala
1085 1090 1095
Phe Met Thr Leu Thr Pro Gln Val Val Arg Tyr Arg Ala Ala Gly
1100 1105 1110
Leu Ser Arg Asn Ala Asp Gly Ser Ile Asn Ile Lys Asn Arg Ile
1115 1120 1125
Ile Gly Lys Val Ser Arg Thr Glu Leu Ala Glu Met Phe Ile Arg
1130 1135 1140
Pro Ala Pro Gln Asn Leu Leu Asp Lys Leu Ile Gln Ser Gly Glu
1145 1150 1155
Ile Thr Lys Glu Gln Ala Glu Leu Ala Lys Leu Val Pro Val Ala
1160 1165 1170
Asp Asp Ile Ala Val Glu Ala Asp Ser Gly Gly His Thr Asp Asn
1175 1180 1185
Arg Pro Ile His Val Ile Leu Pro Leu Ile Ile Asn Leu Arg Asn
1190 1195 1200
Arg Leu His Lys Glu Cys Gly Tyr Pro Ala His Leu Arg Val Arg
1205 1210 1215
Val Gly Ala Gly Gly Gly Val Gly Cys Pro Gln Ala Ala Ala Ala
1220 1225 1230
Ala Leu Ala Met Gly Ala Ala Phe Leu Val Thr Gly Thr Val Asn
1235 1240 1245
Gln Val Ala Lys Gln Ser Gly Thr Cys Asp Asn Val Arg Lys Gln
1250 1255 1260
Leu Cys Met Ala Thr Tyr Ser Asp Val Cys Met Ala Pro Ala Ala
1265 1270 1275
Asp Met Phe Glu Glu Gly Val Lys Leu Gln Val Leu Lys Lys Gly
1280 1285 1290
Thr Met Phe Pro Ser Arg Ala Asn Lys Leu Tyr Glu Leu Phe Cys
1295 1300 1305
Lys Tyr Asp Ser Phe Glu Ser Met Pro Ala Thr Glu Leu Glu Arg
1310 1315 1320
Val Glu Lys Arg Ile Phe Gln Cys Pro Leu Ala Asp Val Trp Ala
1325 1330 1335
Glu Thr Ser Asp Phe Tyr Ile Asn Arg Leu His Asn Pro Glu Lys
1340 1345 1350
Ile Thr Arg Ala Glu Arg Asp Pro Lys Leu Lys Met Ser Leu Cys
1355 1360 1365
Phe Arg Trp Tyr Leu Gly Leu Ala Ser Arg Trp Ala Asn Thr Gly
1370 1375 1380
Glu Ala Gly Arg Val Met Asp Tyr Gln Val Trp Cys Gly Pro Ala
1385 1390 1395
Ile Gly Ala Phe Asn Asp Phe Ile Lys Gly Ser Tyr Leu Asp Pro
1400 1405 1410
Ala Val Ser Gly Glu Tyr Pro Asp Val Val Gln Ile Asn Leu Gln
1415 1420 1425
Ile Leu Arg Gly Ala Cys Tyr Leu Arg Arg Leu Asn Val Ile Arg
1430 1435 1440
Asn Asp Pro Arg Val Ser Ile Glu Val Glu Asp Ala Glu Phe Val
1445 1450 1455
Tyr Glu Pro Thr Asn Ala Leu
1460 1465
<210>9
<211>5547
<212>DNA
<213>Ulkenia sp.
<400>9
atgcttgtga taggggctct ggcgcgggct ctgtacggtg cttggagatg cacgggcagg 60
gcgagagagg ggacgggttc ccgggaggcg ctgcttggag gtgctgagag ggagggagaa 120
ggcgtgcttt gcgatgcgcg gggcgaccta ggcgctgctg cgcggtgcag cagcagggac 180
ctcggacgtg agtcgaagcc gtctgcagag gagatggtag aagggccgcg gattggtagc 240
agagaagagg aaatagaaga agaagaagaa atagaagaag aagaaataga agaagaagaa 300
atagaagaag aagaggagga cgggcaggcg ggaaagatgg agaaaggact cgcggcggga 360
aaacaagaga atgtgaactt gggcttgaac tttggtttga atttgaatgt ggagaacgag 420
gggttgaatt tgagtttgaa tttgaaagaa aacttacgga aagaaagttt agttgaaagt 480
gagaaagaaa aaaatgagaa agaaaaagag aaagaaaaag agaaagaaaa agagaaagaa 540
aaagagaaag aaaaagagaa agaaaaagag aaagaaaaag agaaagaaaa agagaaagaa 600
aaagagaaag aaaaagagaa agaaaaagag aaagaaaaag aagaagaaaa agaagaagaa 660
aaagagaaag aaaaagagaa agaaaaagag aaagaaaaag aagaaggaga tttaaaaagt 720
tgtttagttg aaaaaggaga aggaggaaga agcagcgaca gcggcagaag aagaagtagt 780
tgttgtaaga ggggaacgga ggcagtagca gtggagcagg cggaggcgac agcaaacctc 840
gaactcgacc ccgtcgagcc gcagcaagaa caagagcccg accaggtgga cgaggacgag 900
gtccgcttgt tgtcaggaac aacagaagtt gcaggactag ccgagagtgc taccactgca 960
attcttagat ccacagacgc aagagcagaa aacttacaac tgctcgccac aacacaagaa 1020
ccaccttcag atacaaccag gttcgagaac tccacaagtc tagaagcagc aacagctcta 1080
gcagataatc aaacaggtcc agaaaaagct acgactagaa gagaaattat cgagtcgcaa 1140
cttgcaacca tggccactcg cgtgaagacc aacaagaaac catgctggga gatgaccaag 1200
gaggagctca ccagcggcaa gaacgtcgtt ttcgactatg acgagctcct tgagttcgcc 1260
gagggtgaca tcagcaaggt cttcggcccc gaattcagcc agatcgacca gtacaagcgt 1320
cgcgttcgtc tccccgcccg cgagtacctc ctcgtcaccc gcgtcaccct catggacgcc 1380
gaggtcaaca actaccgcgt cggtgcccgc atggtcactg agtacgacct ccccgtcaac 1440
ggtgagctct ctgagggtgg tgactgcccc tgggccgtgc tcgtcgagag tggtcagtgt 1500
gatctcatgc tcatctccta catgggtatt gacttccaga acaagagcga ccgcgtctac 1560
cgtctgctca acaccaccct caccttctac ggtgttgccc aggagggcga gaccctggag 1620
tacgacatcc gcgtgaccgg cttcgccaag cgtctcgacg gtgacatctc catgttcttc 1680
ttcgagtacg actgctacgt caacggccgt ctcctcatcg agatgcgcga cggctgtgcc 1740
ggtttcttca ccaacgagga gctcgccgcc ggcaagggtg tcgtctttac ccgcgctgat 1800
ctcctcgccc gcgagaagac caagaagcag gacatcaccc cgtacgccat tgccccgcgt 1860
cttaacaaga ccgttctcaa cgagactgag atgcagtccc tcgtggacaa gaactggacc 1920
aaggttttcg gccccgagaa cggcatggac cagatcaact acaaactctg cgcccgtaag 1980
atgctcatga ttgaccgcgt caccaagatt gactacaccg gtggccccta cggccttggt 2040
cttctcgttg gtgagaagat cctcgagcgc gaccactggt actttccgtg ccacttcgtc 2100
ggagaccagg tcatggctgg atccctcgtg tctgacggct gcagccagct cctcaagatg 2160
tacatgctct ggctcggcct ccaccttaag accggtccct tcgacttccg ccccgtcaac 2220
ggccacccca acaaggtccg ctgccgtggc cagatctccc cgcacaaggg taagctcgta 2280
tacgtcatgg agatcaagga gatgggctac gacgaggctg gtgacccgta cgccatcgcc 2340
gatgtcaaca ttctcgacat tgacttcgag aagggccaga ctttcgacct tgccaacctc 2400
cacgagtacg gcaagggcga cctcaacaag aagatcgtcg tcgacttcaa gggtattgcc 2460
ctcaagctcc agaagcgctc tggccctgcc gttgtcgctc ccgagaagcc cctcgctctc 2520
aacaaggacc tttgcgcccc ggctgttgag gccatccctg agcacatcct caagggcgat 2580
gctcttgccc ctaaccagat gacctggcac ccgatgtcca agatcgctgg caaccccacg 2640
ccctcgttct ctccctcggc ctaccctccc cgtcccatca ccttcacccc gttccccggc 2700
aacaagaacg acaacaacca cgtgcccggc gagatgccgc tctcgtggta caacatggct 2760
gagttcatgg ccggcaaggt cagcctctgc ctcggccctg agttcgccaa gttcgatgac 2820
tccaacacca gccgcagccc tgcatgggac cttgctcttg tgactcgtgt ggtctccgtt 2880
tctgacatgg agtgggtcca gtggaagaac gtggactgca acccgtccaa gggaaccatg 2940
gttggcgagt tcgactgccc catcgacgcc tggttcttcc agggatcttg taacgacggc 3000
cacatgccgt actccatcct catggagatc gccctccaga cctctggtgt cctcacctct 3060
gtgctcaagg ccccgctcac catggagaag aaggacattc tcttccgcaa ccttgacgcc 3120
aacgccgaga tggttcgctc tgatattgac ctccgcggca agaccatcca caacctcacc 3180
aagtgtaccg gctacagcat gctcggagac atgggtgtcc accgcttcag cttcgagctc 3240
tctgttgatg gtgtagtctt ctacaagggt accacctcct tcggctggtt cgtccctgag 3300
gtcttcatct cccagactgg tctcgacaac ggtcgccgca cccagccctg gcacattgag 3360
tccaaggtgc cttccgccca ggtcctcacc tacgacgtta cccccaacgg tgccggtcgc 3420
acccagctct acgccaacgc ccccaagggc gctcagctca ctcgccgctg gaaccagtgc 3480
cagtaccttg acaccatcga ccttgtggtc gccggtggct ccgccggtct tggctacggt 3540
catggccgca agcaggtgaa ccccaaggac tggttcttct cgtgccactt ctggttcgac 3600
tccgtcatgc ccggctcgct cggtgtggag tctatgttcc agctcgtcga gtccatcgct 3660
gtcaagcagg acctcgccgg caagtacggc atcaccaacc cgaccttcgc tcatgctccg 3720
ggcaagatct cctggaagta ccgtggtcag ctcaccccca cctccaagtt catggactcc 3780
gaggcccaca ttgtctccat cgaggcccac gacggcgtcg tcgacatcgt tgccaatggt 3840
aacctctggg ctgatggcct ccgcgtctac aacgtcagca acatccgtgt gcgcattgtt 3900
gctggcgccg cccctgctgc tgctgctgct gctgctgctg ttgctgctcc ggctgccgcc 3960
cctgctccgg ttgctgcatc tggccctgcc cagaccatca ccctcaagca gctcaaggct 4020
gagcttcttg acgttgagaa gcctctctac atctcctcca gcaacggcca ggtcaagaag 4080
cacgccgatg tggctggtgg ccaggccacc attgtgcagg cttgcagcct cagtgacctc 4140
ggtgatgaag gcttcatgaa gacctacggt gttgtggctc ctctctacac cggtgccatg 4200
gccaagggta ttgcctctgc tgaccttgtg attgccactg gtaagcgcaa gatcctcggt 4260
tccttcggtg ctggcggtct ccccatgcac attgtccgtg ccgctgttga gaagatccag 4320
gctgagctcc cgaacggccc cttcgccgtc aacctcatcc actccccctt cgatagcaac 4380
cttgagaagg gcaacgttga cctcttcctc gagaagggcg ttactgtcgt cgaggcctcc 4440
gccttcatga ccttgacccc gcaagtcgtc cgctaccgtg ctgctggtct ttcccgtaac 4500
gctgatggct ccattaacat caagaaccgc atcatcggta aggtctcccg taccgagctc 4560
gctgagatgt tcatccgccc tgccccgcag aacctcctcg acaagctcat ccagtctggt 4620
gagattacca aggagcaggc tgagcttgcc aagctcgtcc ccgtcgccga cgacatcgcc 4680
gtcgaggccg actctggtgg ccacaccgac aaccgcccca tccacgtcat cctccccctt 4740
atcatcaacc tccgcaaccg cctccacaag gagtgcggct accccgctca cctccgcgtg 4800
cgcgttggag ctggtggtgg tgttggatgc ccccaggccg ctgccgctgc tctcgctatg 4860
ggtgctgcct tccttgttac cggcactgtc aaccaggtcg ccaagcagtc cggcacctgc 4920
gacaatgtcc gcaagcagct ctgcatggcc acctactctg acgtctgcat ggctcccgct 4980
gctgacatgt tcgaggaggg cgtcaagctc caggtcctca agaagggaac catgttcccg 5040
tccagggcta acaagctcta cgagctcttc tgcaagtacg actccttcga gtccatgcct 5100
gccacagagc tcgagcgtgt tgagaagcgc atcttccagt gccctcttgc tgatgtctgg 5160
gctgagacct ccgacttcta catcaaccgc ctccacaacc cggagaagat cacccgtgcc 5220
gagcgtgacc ccaagctcaa gatgtctctc tgcttccgct ggtaccttgg tcttgcctct 5280
cgctgggcca acaccggtga ggctggacgc gtcatggact accaggtctg gtgtggccct 5340
gccattggag ccttcaacga cttcatcaag ggctcctacc ttgacccggc cgtctctggt 5400
gagtacccgg acgtcgtgca gatcaacttg cagatccttc gcggtgcctg ctacctccgc 5460
cgtctcaatg tcatccgcaa cgacccgcgt gtcagcattg aggtcgagga tgctgagttc 5520
gtctacgagc ccaccaacgc cctctaa 5547
<210>10
<211>837
<212>DNA
<213>Ulkenia sp.
<400>10
acccgcatcg ctgtgatcgg catgtccgcc atcctcccct gcggtaccac cgttcgtgag 60
tcttgggagg ctatccgcga tggtatcgac tgcctcagtg atctccccga ggaccgcgtc 120
gatgtgaccg cctacttcga cccggtcaag accaccaagg ataagatcta ctgcaaacgt 180
ggtggattca tccctgagta cgacttcgac gcccgtgagt tcggcctcaa catgtttcag 240
atggaggact ccgacgcaaa ccaaaccgtc accctcctca aggtcaagga ggccctcgag 300
gacgctggca tcgaagccct cagcaaggaa aagaagaaca ttggatgtgt tctcggtatc 360
ggtggtggcc agaagtccag ccacgagttc tactcccgct taaactatgt tgtcgttgag 420
aaggtccttc gcaagatggg catgcctgag gaggatgttc aagctgctgt tgagaagtac 480
aaggccaact tccctgagtg gcgccttgac tccttccccg gtttcctcgg caacgttact 540
gccggtcgct gtaccaacac cttcaacctc gatggtatga actgtgtcgt cgatgctgcc 600
tgtgctagtt ctctcatcgc cgttaaggtt gccattgatg agcttctcca cggagactgt 660
gacatgatga tcactggtgc tacctgcacg gataactcca tcggtatgta catggccttc 720
tccaagaccc cggtgttctc taccgaccct agcgtccgcg catacgatga gaagaccaag 780
ggtatgctta ttggcgaagg ctctgccatg cttgtgctta aacgttacgc cgacgct 837
<210>11
<211>51
<212>DNA
<213>Ulkenia sp.
<400>11
ggtatgaact gtgtcgtcga tgctgcctgt gctagttctc tcatcgccgt t 51
<210>12
<211>12
<212>DNA
<213>Ulkenia sp.
<400>12
gatgctgcct gt 12
<210>13
<211>522
<212>DNA
<213>Ulkenia sp.
<400>13
cacgctgtca ttcgcggctg cgcctcttcc tctgacggta aggcctccgg tatttacacc 60
ccgaccatct ctggtcaaga ggaggctctt cgccgtgcct acatgcgcgc taacgtcgat 120
cccgccaccg tcactcttgt tgagggccac ggtaccggta cccccgttgg tgaccgtatt 180
gagctcaccg ctctccgtaa cctcttcgac agtgcctacg gcaacgagaa ggagaaggtc 240
gctgttggca gcattaagtc caacatcggt cacctcaagg ctgtcgccgg tcttgccggt 300
atgatcaagg tcatcatggc cctcaagcat aagactcttc cggccaccat caacgttgat 360
gagcccccta agctttacga caacactccc atcaccgact catcgctgta cattaacacg 420
atgaaccgtc cgtggttccc tgctccgggt gtgccccgtc gcgctggtat ctccagtttc 480
ggttttggtg gtgccaacta ccacgccgtt cttgaggaag cc 522
<210>14
<211>1380
<212>DNA
<213>Ulkenia sp.
<400>14
acccgcatcg ctgtgatcgg catgtccgcc atcctcccct gcggtaccac cgttcgtgag 60
tcttgggagg ctatccgcga tggtatcgac tgcctcagtg atctccccga ggaccgcgtc 120
gatgtgaccg cctacttcga cccggtcaag accaccaagg ataagatcta ctgcaaacgt 180
ggtggattca tccctgagta cgacttcgac gcccgtgagt tcggcctcaa catgtttcag 240
atggaggact ccgacgcaaa ccaaaccgtc accctcctca aggtcaagga ggccctcgag 300
gacgctggca tcgaagccct cagcaaggaa aagaagaaca ttggatgtgt tctcggtatc 360
ggtggtggcc agaagtccag ccacgagttc tactcccgct taaactatgt tgtcgttgag 420
aaggtccttc gcaagatggg catgcctgag gaggatgttc aagctgctgt tgagaagtac 480
aaggccaact tccctgagtg gcgccttgac tccttccccg gtttcctcgg caacgttact 540
gccggtcgct gtaccaacac cttcaacctc gatggtatga actgtgtcgt cgatgctgcc 600
tgtgctagtt ctctcatcgc cgttaaggtt gccattgatg agcttctcca cggagactgt 660
gacatgatga tcactggtgc tacctgcacg gataactcca tcggtatgta catggccttc 720
tccaagaccc cggtgttctc taccgaccct agcgtccgcg catacgatga gaagaccaag 780
ggtatgctta ttggcgaagg ctctgccatg cttgtgctta aacgttacgc cgacgctgtt 840
cgtgatggtg acgagattca cgctgtcatt cgcggctgcg cctcttcctc tgacggtaag 900
gcctccggta tttacacccc gaccatctct ggtcaagagg aggctcttcg ccgtgcctac 960
atgcgcgcta acgtcgatcc cgccaccgtc actcttgttg agggccacgg taccggtacc 1020
cccgttggtg accgtattga gctcaccgct ctccgtaacc tcttcgacag tgcctacggc 1080
aacgagaagg agaaggtcgc tgttggcagc attaagtcca acatcggtca cctcaaggct 1140
gtcgccggtc ttgccggtat gatcaaggtc atcatggccc tcaagcataa gactcttccg 1200
gccaccatca acgttgatga gccccctaag ctttacgaca acactcccat caccgactca 1260
tcgctgtaca ttaacacgat gaaccgtccg tggttccctg ctccgggtgt gccccgtcgc 1320
gctggtatct ccagtttcgg ttttggtggt gccaactacc acgccgttct tgaggaagcc 1380
<210>15
<211>996
<212>DNA
<213>Ulkenia sp.
<400>15
ctcttctctg gccagggtgc tcagtacacc cacatgttca gcgaggtcgc catgaactgg 60
cctcagttcc gtgagagcat ctctgacatg gatcgtgccc aggctaaggt tgctggcgct 120
gacaaggact acgagcgtgt ctcccaagtc ctctacccgc gtaagcctta taactctgag 180
cccgagcagg accacaagaa gatctccctg acctcatact ctcagccctc taccctcgcc 240
tgcgctcttg gtgcctacga gatcttcaag caggctggtt tcaagcccga cttcgctgcc 300
ggtcactctc tcggtgagtt tgcggccctc tacgctgctg actgcgtcaa ccgtgacgac 360
ctctttgagc tcgtgtgccg tcgtgcccgc atcatgggtg gcaaggatgc acctgctacc 420
cccaagggat gcatggctgc tgtcattgga cccaatgccg agaagatcca gattcgcact 480
gctgatgtct ggctcggcaa ctgcaactcc ccttcgcaga ctgtcatcac cggctctgtt 540
gagggtatca agaaggagtc cgagcttctc cagagtgagg gcttccgtgt tgtccccctc 600
gcctgcgaga gtgccttcca ctcaccgcag atgcaaaacg cctcctctgc cttcaaggat 660
gttctctcca aggttgcctt ccgtcagcct agcgcccaga ccaagctctt cagcaacgtg 720
tctggcgaga cctactccaa caatgcccag gacctcctta aggagcacat gaccagcagt 780
gttaagttca tctctcaggt tcgcaacatg cactctgctg gtgctcgcat ctttgtcgag 840
tttggcccca agcaggtgct ctctaagctt gtttccgaga ccctcaagga cgatccttcc 900
attatcacta tctctgtcaa cccttcctct ggcaaggatg ccgatattca gcttcgcgag 960
gctgctgtgc agctcgttgt tgctggagtc aacctt 996
<210>16
<211>3510
<212>DNA
<213>Ulkenia sp.
<400>16
gcccaggccc agatccagaa ggccaaggcc gatgctgctg aggctgacaa gaagcttgcc 60
gctgctaagg atgaggccaa gcgtgccgcc gcttctgcac ctgtgcagaa gcaggttgac 120
accaccattg ttgataagca ccgtgctatc ctcaagtcta tgcttgctga gcttgactgc 180
tactccactc ctggtgctgt gtccagctct ttccaggcac ctgttgctgc tacccctgct 240
ccggtcgctg cgcctgttgc agctgctcct gctccggctg tcaacaatgc tctccttgcc 300
aaggctgagt ctgttgtcat ggaggttctt gccgccaaga ctggttacga gactgacatg 360
atcgagcccg acatggagct cgagactgag ctcggcattg actctatcaa gcgtgtcgag 420
attctctctg aggtccaggc ccagctcaac gtcgaggcca aggatgttga tgctcttagc 480
cgcacccgca ccgtcggtga ggttgtcaac gccatgaagg ctgagatcgc tggcagctct 540
ggtgctgccg ctgctgcccc ggccccggtt gctgctgctc ccgctgcccc tgcccctgct 600
gtcaacagcg ctcttcttgc caaggctgag actgttgtca tggaggttct tgccgccaag 660
actggttacg agactgacat gattgagccc gacatggagc tcgagactga gctcggcatt 720
gactccatca agcgtgtcga gattctctct gaggttcagg cccagctcaa cgttgaggcc 780
aaggatgttg atgctcttag ccgcacccgc accgttggtg aggttgtcaa cgccatgaag 840
gctgagatcg ctggcagctc tggtgctgcc gctgctgccc cggcccctgt tgctgctgct 900
ccggcgcccg tcgctgccgc tgcccctgct gtcagcagcg ctctccttga gaaggctgag 960
tctgttgtca tggaggttct tgccgccaag actggttacg agactgacat gattgaggcc 1020
gacatggagc tcgagactga gctcggcatt gactccatca agcgtgtcga gattctctct 1080
gaggtccagg cccagctcaa cgtcgaggcc aaggatgtcg atgctcttag ccgcacccgc 1140
accgttggtg aggttgtcaa cgccatgaag gctgagatcg ctggcagctc tggtgctgct 1200
gccccggccc cggtcgctgc ggcccctgct ccggtcgctg ccgctgcccc tgctgtcaac 1260
agcgctcttc ttgagaaggc tgagactgtt gtcatggagg ttcttgccgc caagactggt 1320
tacgagactg acatgatcga gcccgacatg gagctcgaga ctgagctcgg cattgactct 1380
atcaagcgtg tcgagattct ctctgaggtc caggcccagc tcaacgttga ggccaaggat 1440
gttgatgctc ttagccgcac ccgcaccgtt ggtgaggttg tcaacgccat gaaggctgag 1500
atcgctggca gctctggtgc tgccgctgct gccccggccc cggttgctgc tgctcccgct 1560
cccgtcgctg cccctgctgt cagcagcgct ctccttgaga aggctgagtc tgtcgtcatg 1620
gaggttcttg ccgccaagac tggttacgag actgacatga ttgaggccga catggagctc 1680
gagactgagc tcggcattga ctccatcaag cgtgtcgaga ttctctctga ggtccaggcc 1740
cagctcaacg ttgaggccaa ggatgtcgat gctcttagcc gcacccgcac cgttggtgag 1800
gttgtcaacg ccatgaaggc tgagatcgct ggcagctctg gtgctgccgc tgctgccccg 1860
gcccctgttg ctgcctctcc cgctcccgtc gctgccgctg cccctgctgt cagcagcgct 1920
ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 1980
actgacatga ttgaggctga catggagctc gagactgagc tcggcattga ctctatcaag 2040
cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat 2100
gctcttagcc gcacccgcac cgttggtgag gttgtcaacg ccatgaaggc tgagatcgct 2160
ggcagctctg gtgccgccgc tgctgccccg gccccggttg ctgctgctcc ggcgcccgtc 2220
actgccgctg cccctgctgt cagcagcgct ctccttgaga aggccgaatc tgttgtcatg 2280
gaggttctcg ccgccaagac tggttacgag actgacatga ttgaggccga catggagctc 2340
gagactgagc ttggcattga ctccatcaag cgtgtcgaga ttctctctga ggtccaggct 2400
atgcttaacg tcgaggccaa ggatgttgat gctcttagcc gcacccgcac cgttggtgag 2460
gttgtcaacg ccatgaaggc tgagattgct agcagctctg gtgctgctgc ccctgctccg 2520
gctgctgccg ttgcaccggc ccctgctgct gcccctgctg tcagcagcgc tctccttgag 2580
aaggccgaat ctgttgtcat ggaggttctc gccgccaaga ctggttacga gactgacatg 2640
attgaggccg acatggagct cgagactgag ctcggcattg actctatcaa gcgtgtcgag 2700
attctctctg aggtccaggc tatgcttaac gttgaggcca aggatgttga tgctcttagc 2760
cgcacccgca ccgttggtga ggttgtcaac gccatgaagg ctgagattgc tagcagctct 2820
ggtgctgctg cccctgctcc tgctgctgcc gctgcaccgg cccctgctgc tgcccctgct 2880
gtcagcagcg ctcttcttga gaaggctgag tctgttgtca tggaggttct cgccgccaag 2940
actggttacg agactgacat gattgaggcc gacatggagc tcgagactga gcttggcatt 3000
gactccatca agcgtgtcga gattctctct gaggtccagg ctatgcttaa cgttgaggcc 3060
aaggatgttg atgctcttag ccgcacccgc accgttggtg aggttgtcaa cgccatgaag 3120
gctgagattg ctagcagctc tggtgctgct gcccctgctc ctgctgctgc cgctgcaccg 3180
gcccctgctg ctgcccctgc tgtcagcagc gctcttcttg agaaggctga gtctgttgtc 3240
atggaggttc tcgccgccaa gactggttac gagactgaca tgattgaggc cgacatggag 3300
ctcgagactg agcttggcat tgactccatc aagcgtgtcg agattctctc tgaggtccag 3360
gctatgctta acgttgaggc caaggatgtt gatgctctta gccgcacccg caccgttggt 3420
gaggttgtca acgccatgaa ggctgagatc gctggcagct ctggtgctgc tactgcctct 3480
gcccctgctg ctgcagctgc cgcccctgct 3510
<210>17
<211>219
<212>DNA
<213>Ulkenia sp.
<400>17
ctccttgcca aggctgagtc tgttgtcatg gaggttcttg ccgccaagac tggttacgag 60
actgacatga tcgagcccga catggagctc gagactgagc tcggcattga ctctatcaag 120
cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg tcgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgtcggtgag gttgtcaac 219
<210>18
<211>219
<212>DNA
<213>Ulkenia sp.
<400>18
cttcttgcca aggctgagac tgttgtcatg gaggttcttg ccgccaagac tggttacgag 60
actgacatga ttgagcccga catggagctc gagactgagc tcggcattga ctccatcaag 120
cgtgtcgaga ttctctctga ggttcaggcc cagctcaacg ttgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210>19
<211>219
<212>DNA
<213>Ulkenia sp.
<400>19
ctccttgaga aggctgagtc tgttgtcatg gaggttcttg ccgccaagac tggttacgag 60
actgacatga ttgaggccga catggagctc gagactgagc tcggcattga ctccatcaag 120
cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg tcgaggccaa ggatgtcgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210>20
<211>219
<212>DNA
<213>Ulkenia sp.
<400>20
cttcttgaga aggctgagac tgttgtcatg gaggttcttg ccgccaagac tggttacgag 60
actgacatga tcgagcccga catggagctc gagactgagc tcggcattga ctctatcaag 120
cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg ttgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210>21
<211>219
<212>DNA
<213>Ulkenia sp.
<400>21
ctccttgaga aggctgagtc tgtcgtcatg gaggttcttg ccgccaagac tggttacgag 60
actgacatga ttgaggccga catggagctc gagactgagc tcggcattga ctccatcaag 120
cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg ttgaggccaa ggatgtcgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210>22
<211>219
<212>DNA
<213>Ulkenia sp.
<400>22
ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 60
actgacatga ttgaggctga catggagctc gagactgagc tcggcattga ctctatcaag 120
cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210>23
<211>219
<212>DNA
<213>Ulkenia sp.
<400>23
ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 60
actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag 120
cgtgtcgaga ttctctctga ggtccaggct atgcttaacg tcgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210>24
<211>219
<212>DNA
<213>Ulkenia sp.
<400>24
ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 60
actgacatga ttgaggccga catggagctc gagactgagc tcggcattga ctctatcaag 120
cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210>25
<211>219
<212>DNA
<213>Ulkenia sp.
<400>25
cttcttgaga aggctgagtc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 60
actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag 120
cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210>26
<211>219
<212>DNA
<213>Ulkenia sp.
<400>26
cttcttgaga aggctgagtc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 60
actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag 120
cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat 180
gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219
<210>27
<211>609
<212>DNA
<213>Ulkenia sp.
<400>27
aagaagctcg ttggcactat tgctggtgcc cgtgaggttc gttcctcaat tgctaacatt 60
gaggctctcg gtggcaaggc aatctactcc tcttgtgatg tgaactctgc tgctgatgtc 120
gccaaggctg ttcgcgaggc tgaggctcag cttggcgccc gtgtaactgg tgtcgtccac 180
gcttctggtg tccttcgtga ccgcctcatt gagcagaagc gccccgatga gtttgatgct 240
gtcttcggca ccaaggtgac tggtctcgag aacctctttg gtgccattga catggccaac 300
cttaagcacc tcgtcctctt cagctctctt gctggtttcc acggcaacat tggtcagtct 360
gactacgcca tggctaacga ggccctcaac aagatgggtc ttgagctctc tgaccgtgtg 420
tccgtgaagt ctatttgctt cggcccctgg gatggtggca tggttacccc ccagctcaag 480
aagcagttcc agtctatggg tgttcagatc atcccccgtg agggtggtgc cgatactgtg 540
gctcgcattg tcctcggctc ctcccctgct gagatccttg ttggcaactg gaccactccc 600
accaagaag 609
<210>28
<211>279
<212>PRT
<213>Ulkenia sp.
<400>28
Thr Arg Ile Ala Val Ile Gly Met Ser Ala Ile Leu Pro Cys Gly Thr
1 5 10 15
Thr Val Arg Glu Ser Trp Glu Ala Ile Arg Asp Gly Ile Asp Cys Leu
20 25 30
Ser Asp Leu Pro Glu Asp Arg Val Asp Val Thr Ala Tyr Phe Asp Pro
35 40 45
Val Lys Thr Thr Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile
50 55 60
Pro Glu Tyr Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln
65 70 75 80
Met Glu Asp Ser Asp Ala Asn Gln Thr Val Thr Leu Leu Lys Val Lys
85 90 95
Glu Ala Leu Glu Asp Ala Gly Ile Glu Ala Leu Ser Lys Glu Lys Lys
100 105 110
Asn Ile Gly Cys Val Leu Gly Ile Gly Gly Gly Gln Lys Ser Ser His
115 120 125
Glu Phe Tyr Ser Arg Leu Asn Tyr Val Val Val Glu Lys Val Leu Arg
130 135 140
Lys Met Gly Met Pro Glu Glu Asp Val Gln Ala Ala Val Glu Lys Tyr
145 150 155 160
Lys Ala Asn Phe Pro Glu Trp Arg Leu Asp Ser Phe Pro Gly Phe Leu
165 170 175
Gly Asn Val Thr Ala Gly Arg Cys Thr Asn Thr Phe Asn Leu Asp Gly
180 185 190
Met Asn Cys Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala Val
195 200 205
Lys Val Ala Ile Asp Glu Leu Leu His Gly Asp Cys Asp Met Met Ile
210 215 220
Thr Gly Ala Thr Cys Thr Asp Asn Ser Ile Gly Met Tyr Met Ala Phe
225 230 235 240
Ser Lys Thr Pro Val Phe Ser Thr Asp Pro Ser Val Arg Ala Tyr Asp
245 250 255
Glu Lys Thr Lys Gly Met Leu Ile Gly Glu Gly Ser Ala Met Leu Val
260 265 270
Leu Lys Arg Tyr Ala Asp Ala
275
<210>29
<211>17
<212>PRT
<213>Ulkenia sp.
<400>29
Gly Met Asn Cys Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala
1 5 10 15
Val
<210>30
<211>4
<212>PRT
<213>Ulkenia sp.
<400>30
Asp Ala Ala Cys
1
<210>31
<211>174
<212>PRT
<213>Ulkenia sp.
<400>31
His Ala Val Ile Arg Gly Cys Ala Ser Ser Ser Asp Gly Lys Ala Ser
1 5 10 15
Gly Ile Tyr Thr Pro Thr Ile Ser Gly Gln Glu Glu Ala Leu Arg Arg
20 25 30
Ala Tyr Met Arg Ala Asn Val Asp Pro Ala Thr Val Thr Leu Val Glu
35 40 45
Gly His Gly Thr Gly Thr Pro Val Gly Asp Arg Ile Glu Leu Thr Ala
50 55 60
Leu Arg Asn Leu Phe Asp Ser Ala Tyr Gly Asn Glu Lys Glu Lys Val
65 70 75 80
Ala Val Gly Ser Ile Lys Ser Asn Ile Gly His Leu Lys Ala Val Ala
85 90 95
Gly Leu Ala Gly Met Ile Lys Val Ile Met Ala Leu Lys His Lys Thr
100 105 110
Leu Pro Ala Thr Ile Asn Val Asp Glu Pro Pro Lys Leu Tyr Asp Asn
115 120 125
Thr Pro Ile Thr Asp Ser Ser Leu Tyr Ile Asn Thr Met Asn Arg Pro
130 135 140
Trp Phe Pro Ala Pro Gly Val Pro Arg Arg Ala Gly Ile Ser Ser Phe
145 150 155 160
Gly Phe Gly Gly Ala Asn Tyr His Ala Val Leu Glu Glu Ala
165 170
<210>32
<211>460
<212>PRT
<213>Ulkenia sp.
<400>32
Thr Arg Ile Ala Val Ile Gly Met Ser Ala Ile Leu Pro Cys Gly Thr
1 5 10 15
Thr Val Arg Glu Ser Trp Glu Ala Ile Arg Asp Gly Ile Asp Cys Leu
20 25 30
Ser Asp Leu Pro Glu Asp Arg Val Asp Val Thr Ala Tyr Phe Asp Pro
35 40 45
Val Lys Thr Thr Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile
50 55 60
Pro Glu Tyr Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln
65 70 75 80
Met Glu Asp Ser Asp Ala Asn Gln Thr Val Thr Leu Leu Lys Val Lys
85 90 95
Glu Ala Leu Glu Asp Ala Gly Ile Glu Ala Leu Ser Lys Glu Lys Lys
100 105 110
Asn Ile Gly Cys Val Leu Gly Ile Gly Gly Gly Gln Lys Ser Ser His
115 120 125
Glu Phe Tyr Ser Arg Leu Asn Tyr Val Val Val Glu Lys Val Leu Arg
130 135 140
Lys Met Gly Met Pro Glu Glu Asp Val Gln Ala Ala Val Glu Lys Tyr
145 150 155 160
Lys Ala Asn Phe Pro Glu Trp Arg Leu Asp Ser Phe Pro Gly Phe Leu
165 170 175
Gly Asn Val Thr Ala Gly Arg Cys Thr Asn Thr Phe Asn Leu Asp Gly
180 185 190
Met Asn Cys Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala Val
195 200 205
Lys Val Ala Ile Asp Glu Leu Leu His Gly Asp Cys Asp Met Met Ile
210 215 220
Thr Gly Ala Thr Cys Thr Asp Asn Ser Ile Gly Met Tyr Met Ala Phe
225 230 235 240
Ser Lys Thr Pro Val Phe Ser Thr Asp Pro Ser Val Arg Ala Tyr Asp
245 250 255
Glu Lys Thr Lys Gly Met Leu Ile Gly Glu Gly Ser Ala Met Leu Val
260 265 270
Leu Lys Arg Tyr Ala Asp Ala Val Arg Asp Gly Asp Glu Ile His Ala
275 280 285
Val Ile Arg Gly Cys Ala Ser Ser Ser Asp Gly Lys Ala Ser Gly Ile
290 295 300
Tyr Thr Pro Thr Ile Ser Gly Gln Glu Glu Ala Leu Arg Arg Ala Tyr
305 310 315 320
Met Arg Ala Asn Val Asp Pro Ala Thr Val Thr Leu Val Glu Gly His
325 330 335
Gly Thr Gly Thr Pro Val Gly Asp Arg Ile Glu Leu Thr Ala Leu Arg
340 345 350
Asn Leu Phe Asp Ser Ala Tyr Gly Asn Glu Lys Glu Lys Val Ala Val
355 360 365
Gly Ser Ile Lys Ser Asn Ile Gly His Leu Lys Ala Val Ala Gly Leu
370 375 380
Ala Gly Met Ile Lys Val Ile Met Ala Leu Lys His Lys Thr Leu Pro
385 390 395 400
Ala Thr Ile Asn Val Asp Glu Pro Pro Lys Leu Tyr Asp Asn Thr Pro
405 410 415
Ile Thr Asp Ser Ser Leu Tyr Ile Asn Thr Met Asn Arg Pro Trp Phe
420 425 430
Pro Ala Pro Gly Val Pro Arg Arg Ala Gly Ile Ser Ser Phe Gly Phe
435 440 445
Gly Gly Ala Asn Tyr His Ala Val Leu Glu Glu Ala
450 455 460
<210>33
<211>332
<212>PRT
<213>Ulkenia sp.
<400>33
Leu Phe Ser Gly Gln Gly Ala Gln Tyr Thr His Met Phe Ser Glu Val
1 5 10 15
Ala Met Asn Trp Pro Gln Phe Arg Glu Ser Ile Ser Asp Met Asp Arg
20 25 30
Ala Gln Ala Lys Val Ala Gly Ala Asp Lys Asp Tyr Glu Arg Val Ser
35 40 45
Gln Val Leu Tyr Pro Arg Lys Pro Tyr Asn Ser Glu Pro Glu Gln Asp
50 55 60
His Lys Lys Ile Ser Leu Thr Ser Tyr Ser Gln Pro Ser Thr Leu Ala
65 70 75 80
Cys Ala Leu Gly Ala Tyr Glu Ile Phe Lys Gln Ala Gly Phe Lys Pro
85 90 95
Asp Phe Ala Ala Gly His Ser Leu Gly Glu Phe Ala Ala Leu Tyr Ala
100 105 110
Ala Asp Cys Val Asn Arg Asp Asp Leu Phe Glu Leu Val Cys Arg Arg
115 120 125
Ala Arg Ile Met Gly Gly Lys Asp Ala Pro Ala Thr Pro Lys Gly Cys
130 135 140
Met Ala Ala Val Ile Gly Pro Asn Ala Glu Lys Ile Gln Ile Arg Thr
145 150 155 160
Ala Asp Val Trp Leu Gly Asn Cys Asn Ser Pro Ser Gln Thr Val Ile
165 170 175
Thr Gly Ser Val Glu Gly Ile Lys Lys Glu Ser Glu Leu Leu Gln Ser
180 185 190
Glu Gly Phe Arg Val Val Pro Leu Ala Cys Glu Ser Ala Phe His Ser
195 200 205
Pro Gln Met Gln Asn Ala Ser Ser Ala Phe Lys Asp Val Leu Ser Lys
210 215 220
Val Ala Phe Arg Gln Pro Ser Ala Gln Thr Lys Leu Phe Ser Asn Val
225 230 235 240
Ser Gly Glu Thr Tyr Ser Asn Asn Ala Gln Asp Leu Leu Lys Glu His
245 250 255
Met Thr Ser Ser Val Lys Phe Ile Ser Gln Val Arg Asn Met His Ser
260 265 270
Ala Gly Ala Arg Ile Phe Val Glu Phe Gly Pro Lys Gln Val Leu Ser
275 280 285
Lys Leu Val Ser Glu Thr Leu Lys Asp Asp Pro Ser Ile Ile Thr Ile
290 295 300
Ser Val Asn Pro Ser Ser Gly Lys Asp Ala Asp Ile Gln Leu Arg Glu
305 310 315 320
Ala Ala Val Gln Leu Val Val Ala Gly Val Asn Leu
325 330
<210>34
<211>1170
<212>PRT
<213>Ulkenia sp.
<400>34
Ala Gln Ala Gln Ile Gln Lys Ala Lys Ala Asp Ala Ala Glu Ala Asp
1 5 10 15
Lys Lys Leu Ala Ala Ala Lys Asp Glu Ala Lys Arg Ala Ala Ala Ser
20 25 30
Ala Pro Val Gln Lys Gln Val Asp Thr Thr Ile Val Asp Lys His Arg
35 40 45
Ala Ile Leu Lys Ser Met Leu Ala Glu Leu Asp Cys Tyr Ser Thr Pro
50 55 60
Gly Ala Val Ser Ser Ser Phe Gln Ala Pro Val Ala Ala Thr Pro Ala
65 70 75 80
Pro Val Ala Ala Pro Val Ala Ala Ala Pro Ala Pro Ala Val Asn Asn
85 90 95
Ala Leu Leu Ala Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala
100 105 110
Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu
115 120 125
Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu
130 135 140
Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser
145 150 155 160
Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile
165 170 175
Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala
180 185 190
Ala Pro Ala Ala Pro Ala Pro Ala Val Asn Ser Ala Leu Leu Ala Lys
195 200 205
Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu
210 215 220
Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr Glu Leu Gly Ile
225 230 235 240
Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Gln Leu
245 250 255
Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val
260 265 270
Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly
275 280 285
Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val
290 295 300
Ala Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu
305 310 315 320
Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp
325 330 335
Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser
340 345 350
Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Gln Leu Asn Val
355 360 365
Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu
370 375 380
Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala
385 390 395 400
Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val Ala Ala Ala Ala
405 410 415
Pro Ala Val Asn Ser Ala Leu Leu Glu Lys Ala Glu Thr Val Val Met
420 425 430
Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro
435 440 445
Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val
450 455 460
Glu Ile Leu Ser Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp
465 470 475 480
Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala
485 490 495
Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro
500 505 510
Ala Pro Val Ala Ala Ala Pro Ala Pro Val Ala Ala Pro Ala Val Ser
515 520 525
Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala
530 535 540
Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu
545 550 555 560
Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser
565 570 575
Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu
580 585 590
Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu
595 600 605
Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala
610 615 620
Ala Ser Pro Ala Pro Val Ala Ala Ala Ala Pro Ala Val Ser Ser Ala
625 630 635 640
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
645 650 655
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
660 665 670
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
675 680 685
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
690 695 700
Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala
705 710 715 720
Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala Ala
725 730 735
Pro Ala Pro Val Thr Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu
740 745 750
Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly
755 760 765
Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu
770 775 780
Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala
785 790 795 800
Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg
805 810 815
Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Ser Ser
820 825 830
Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala Ala Val Ala Pro Ala Pro
835 840 845
Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser
850 855 860
Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met
865 870 875 880
Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile
885 890 895
Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu
900 905 910
Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val
915 920 925
Val Asn Ala Met Lys Ala Glu Ile Ala Ser Ser Ser Gly Ala Ala Ala
930 935 940
Pro Ala Pro Ala Ala Ala Ala Ala Pro Ala Pro Ala Ala Ala Pro Ala
945 950 955 960
Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val
965 970 975
Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met
980 985 990
Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile
995 1000 1005
Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val
1010 1015 1020
Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala
1025 1030 1035
Met Lys Ala Glu Ile Ala Ser Ser Ser Gly Ala Ala Ala Pro Ala
1040 1045 1050
Pro Ala Ala Ala Ala Ala Pro Ala Pro Ala Ala Ala Pro Ala Val
1055 1060 1065
Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val
1070 1075 1080
Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp
1085 1090 1095
Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val
1100 1105 1110
Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys
1115 1120 1125
Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val
1130 1135 1140
Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Thr
1145 1150 1155
Ala Ser Ala Pro Ala Ala Ala Ala Ala Ala Pro Ala
1160 1165 1170
<210>35
<211>73
<212>PRT
<213>Ulkenia sp.
<400>35
Leu Leu Ala Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210>36
<211>73
<212>PRT
<213>Ulkenia sp.
<400>36
Leu Leu Ala Lys Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210>37
<211>73
<212>PRT
<213>Ulkenia sp.
<400>37
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210>38
<211>73
<212>PRT
<213>Ulkenia sp.
<400>38
Leu Leu Glu Lys Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210>39
<211>73
<212>PRT
<213>Ulkenia sp.
<400>39
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210>40
<211>73
<212>PRT
<213>Ulkenia sp.
<400>40
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210>41
<211>73
<212>PRT
<213>Ulkenia sp.
<400>41
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210>42
<211>73
<212>PRT
<213>Ulkenia sp.
<400>42
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210>43
<211>73
<212>PRT
<213>Ulkenia sp.
<400>43
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210>44
<211>73
<212>PRT
<213>Ulkenia sp.
<400>44
Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 5 10 15
Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr
20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val
35 40 45
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
50 55 60
Thr Arg Thr Val Gly Glu Val Val Asn
65 70
<210>45
<211>203
<212>PRT
<213>Ulkenia sp.
<400>45
Lys Lys Leu Val Gly Thr Ile Ala Gly Ala Arg Glu Val Arg Ser Ser
1 5 10 15
Ile Ala Asn Ile Glu Ala Leu Gly Gly Lys Ala Ile Tyr Ser Ser Cys
20 25 30
Asp Val Asn Ser Ala Ala Asp Val Ala Lys Ala Val Arg Glu Ala Glu
35 40 45
Ala Gln Leu Gly Ala Arg Val Thr Gly Val Val His Ala Ser Gly Val
50 55 60
Leu Arg Asp Arg Leu Ile Glu Gln Lys Arg Pro Asp Glu Phe Asp Ala
65 70 75 80
Val Phe Gly Thr Lys Val Thr Gly Leu Glu Asn Leu Phe Gly Ala Ile
85 90 95
Asp Met Ala Asn Leu Lys His Leu Val Leu Phe Ser Ser Leu Ala Gly
100 105 110
Phe His Gly Asn Ile Gly Gln Ser Asp Tyr Ala Met Ala Asn Glu Ala
115 120 125
Leu Asn Lys Met Gly Leu Glu Leu Ser Asp Arg Val Ser Val Lys Ser
130 135 140
Ile Cys Phe Gly Pro Trp Asp Gly Gly Met Val Thr Pro Gln Leu Lys
145 150 155 160
Lys Gln Phe Gln Ser Met Gly Val Gln Ile Ile Pro Arg Glu Gly Gly
165 170 175
Ala Asp Thr Val Ala Arg Ile Val Leu Gly Ser Ser Pro Ala Glu Ile
180 185 190
Leu Val Gly Asn Trp Thr Thr Pro Thr Lys Lys
195 200
<210>46
<211>780
<212>DNA
<213>Ulkenia sp.
<400>46
aagcgcattg ccgtggtggg catggccgtg caatacgcgg gctgcaaaga caaggaagag 60
ttctggaaag tagtcatggg cggtgaggct gcatggacta agattagcga taaacgcctc 120
ggatccaaca agcgagccga gcacttcaaa gcagagcgta gcaaatttgc agataccttt 180
tgcaacgaga actacggctg cgtcgatgac tccgtcgata acgaacacga gcttctcctt 240
aagctctcca agaaggctct ctccgagaca tcggtctccg actctacaag gtgcggtatt 300
gtgagcggat gcctgtcctt tcccatggac aacctccagg gcgaactcct caatgtgtac 360
caaaaccacg tcgaaaagaa actcggcgct cgcgtcttca aggatgcctc caagtggtcc 420
gagcgtgagc agtcgcagaa ccccgaggct ggtgaccgcc gcatctttat ggacccggca 480
tccttcgtag cagaagagct caacctcggt cctcttcact actctgtcga tgctgcctgt 540
gccaccgccc tttacgtcct tcgcctcgcc caggaccacc tcgtttccgg tgctgctgat 600
gtcatgctcg ctggtgcaac ttgcttcccg gagccctttt tcattctctc cggattctcc 660
actttccagg ccatgcctgt atcgggagac ggcatctcgt acccgcttca caaggacagt 720
cagggtctca cccctggtga aggtggtgcc attatggttc tcaagcgcct tgacgacgct 780
<210>47
<211>51
<212>DNA
<213>Ulkenia sp.
<400>47
cctcttcact actctgtcga tgctgcctgt gccaccgccc tttacgtcct t 51
<210>48
<211>12
<212>DNA
<213>Ulkenia sp.
<400>48
gatgctgcct gt 12
<210>49
<211>477
<212>DNA
<213>Ulkenia sp.
<400>49
tacggtactc tgctcggtgc taccatcagc aatgctggct gtggtcttcc cctcaagccg 60
cacttgccca gcgagaagtc ctgcctcatt gatacctaca agcgcgtcaa cgtgcacccg 120
cacaagatcc agtacgtcga gtgccacgca acgggtactc cccagggaga ccgcgttgag 180
attgatgccg tcaaggcttg cttcgagggc aaggtgcctc gctttggaag ctccaagggt 240
aactttggcc acacactcgt tgcagctggt ttcgcaggca tgtgcaaggt actccttgcc 300
atgaagcatg gtgtgatccc gcccactcct ggtgtcgatg gatcttccca aatggacccg 360
cttgtggtct ctgagcccat cccatggccc gacactgagg gcgagcccaa gcgcgctggt 420
ctctccgctt tcggctttgg tggcaccaac gcccacgcag tctttgagga gtttgac 477
<210>50
<211>1278
<212>DNA
<213>Ulkenia sp.
<400>50
aagcgcattg ccgtggtggg catggccgtg caatacgcgg gctgcaaaga caaggaagag 60
ttctggaaag tagtcatggg cggtgaggct gcatggacta agattagcga taaacgcctc 120
ggatccaaca agcgagccga gcacttcaaa gcagagcgta gcaaatttgc agataccttt 180
tgcaacgaga actacggctg cgtcgatgac tccgtcgata acgaacacga gcttctcctt 240
aagctctcca agaaggctct ctccgagaca tcggtctccg actctacaag gtgcggtatt 300
gtgagcggat gcctgtcctt tcccatggac aacctccagg gcgaactcct caatgtgtac 360
caaaaccacg tcgaaaagaa actcggcgct cgcgtcttca aggatgcctc caagtggtcc 420
gagcgtgagc agtcgcagaa ccccgaggct ggtgaccgcc gcatctttat ggacccggca 480
tccttcgtag cagaagagct caacctcggt cctcttcact actctgtcga tgctgcctgt 540
gccaccgccc tttacgtcct tcgcctcgcc caggaccacc tcgtttccgg tgctgctgat 600
gtcatgctcg ctggtgcaac ttgcttcccg gagccctttt tcattctctc cggattctcc 660
actttccagg ccatgcctgt atcgggagac ggcatctcgt acccgcttca caaggacagt 720
cagggtctca cccctggtga aggtggtgcc attatggttc tcaagcgcct tgacgacgct 780
attcgcgatg gagaccacat ttacggtact ctgctcggtg ctaccatcag caatgctggc 840
tgtggtcttc ccctcaagcc gcacttgccc agcgagaagt cctgcctcat tgatacctac 900
aagcgcgtca acgtgcaccc gcacaagatc cagtacgtcg agtgccacgc aacgggtact 960
ccccagggag accgcgttga gattgatgcc gtcaaggctt gcttcgaggg caaggtgcct 1020
cgctttggaa gctccaaggg taactttggc cacacactcg ttgcagctgg tttcgcaggc 1080
atgtgcaagg tactccttgc catgaagcat ggtgtgatcc cgcccactcc tggtgtcgat 1140
ggatcttccc aaatggaccc gcttgtggtc tctgagccca tcccatggcc cgacactgag 1200
ggcgagccca agcgcgctgg tctctccgct ttcggctttg gtggcaccaa cgcccacgca 1260
gtctttgagg agtttgac 1278
<210>51
<211>801
<212>DNA
<213>Ulkenia sp.
<400>51
atgcgcattg ctattaccgg tatggatgcc accttcggct ccctcaaggg cctggacgcc 60
tttgagcgtg ccatctacaa tggccaacat ggtgctgtgc cattgcctga gaagcgctgg 120
cgtttccttg gtaaagacaa ggactttttg gacctgtgcg gtgtcaagga ggtgccccac 180
ggatgctaca ttgaggacgt cgaggtggac tttagccgcc tgcgcacgcc catgacgcca 240
gacgacatgt tgcgccccat gcagctactt gctgtcacaa ccatcgaccg tgccattctc 300
aactctggcc tcaagaaggg aggtaaggtc gctgtcttcg tcggccttgg cactgacctt 360
gagctctacc gtcaccgcgc ccgcgttgcc ctcaaggagc gtgctcgtcc cgaagccgct 420
tcagccctca atgatatgat gtcctacatc aacgattgcg gtaccgctac ctcgtacaca 480
tcctacatcg gcaacctcgt ggccacccgc gtgtcttcac aatggggttt cgagggtcct 540
tctttcacca tcacagaggg caacaactcc gtctaccgtt gcgcagagtt gggcaagtac 600
ttgctcgaga ctggcgaggt cgaggccgta gtgatcgccg gtgtggatct ttgcgccagc 660
gctgagaatc tctacgtgaa gtcgcgtcgt ttcaaggtct cggagcagga gagcccgcgg 720
gccagcttcg actccggcgc tgacggctac tttgttggtg agggatgtgg tgccctcgtc 780
ctcaagcgcg agagcgactg c 801
<210>52
<211>792
<212>DNA
<213>Ulkenia sp.
<400>52
gctgctttcg gactgagcct tggagagatt tccatggttt ttgccttttc tgagaagaac 60
ggccttgtct ctgaggagct gacaactaaa ctccgcaact cggaggtctg gcgtaaggcc 120
ctcgctgttg agtttgacgc cctccgcaag gcctggaata ttccccaaga tacccctgtc 180
agcgagttct ggcaaggata cgtggtacgt ggaacccgcg aggccgttga agcggccatc 240
ggccccaaca ataagtacgt gcacttgacc attgtcaacg atgccaacag tgctctcatc 300
agtggcaagc ctgaagattg caaggctgcc attgctcgcc tgagcagcaa cctccctgct 360
ttgcccgtgg accttggtat gtgtggccac tgccccgtgg tcgagccgta cggcaagcag 420
atcgctgaga tccatagcgt cctcgagatt cccgaggttg ccggccttga cctgtacacg 480
agcgtcaacc agaagaagct tgttaacaag tccactggag ccagcgacga gtacgcaccc 540
agctttggtg aatacgcagc acagctgtac actgttcagg cagactttcc taagatcgcc 600
aagaccgtta gcgacaagaa ctttgacgtc tttgttgaga ctggtcccaa cgcccaccgt 660
agcgccgcaa ttcgcgccac ccttggaaat agcaagcctt ttgtcaccgg atccatggac 720
cgccagaacg agaatgcttg gacaaccatg gtcaagctgg ttgcctctct ccaagcccac 780
cgcgtgcctg gc 792
<210>53
<211>1302
<212>DNA
<213>Ulkenia sp.
<400>53
agccgtgcct tcatggagac atatggtgta tccgccccca tgtacaccgg cgccatggca 60
aagggcattg catccgctga gatggttatc gctgccggaa agcgcggcat ccttggttct 120
ctcggtgctg gtggtcttcc tatcgccacc gtacgcaagg ctctcgaagc tatccaggct 180
gaactgccca agggccctta cgctgtcaac ctcatccact ctcccttcga cagcaacctc 240
gagaagggta acgtcgacct cttcctcgag aagggcgtca ctgtcgttga agcctccgcc 300
tttatgacct tgaccccgca gctcgtgcgc taccgtgctg caggtctctc tcgcgctgct 360
gatggctcca cggttattaa gaaccgcgtc atcggtaagg tttctcgcac agagcttgcc 420
gcaatgttta tccgtcccgc gcccgagaat ctcctcgaga agctgctgaa gtccggcgag 480
atcacccaag agcaggctgc tctcgcacgc acagtgcctg tggcagacga cattgccgtt 540
gaggcggact ccggtggcca caccgataac cgccccatcc acgtcatcct ccctctcatt 600
gtcaacctcc gtgatcgtct gcacaaggag tgcggctacc ctgcccacct tcgcgttcgc 660
gttggtgctg gtggtggcat tggatgccct caggccgcca ttgccacctt caacatgggc 720
gcggccttca tcgtcactgg taccgtaaac cagatgagta agcaagctgg aacctgtgac 780
accgttcgca agcagctctc acaagccacc tactccgaca tctgcatggc cccagcagct 840
gacatgtttg aggaaggtgt caagctccag gtgctcaaga agggaactat gttcccctcg 900
cgtgccaaca agctctatga gctcttcgtc aagtatgact cctttgagtc catggctcct 960
ggagagctgg aacgtgtgga gaagcgcatt ttcaagaagt ctctgtcaga agtttgggaa 1020
gagaccaagg acttctacat caacaggttg cagaacccgg agaagattga gcgcgcggag 1080
cgtgacccca agcttaagat gtccttgtgc ttccgctggt accttggttt ggcgagcttc 1140
tgggcaaacg ctggcatccc ggaccgtgcc atggactacc aggtttggtg tggcccagcg 1200
attggatctt tcaacgactt catcaagggt acctaccttg accccgccgt tgccaacgag 1260
taccccgatg ttgtgcaaat caacttgcag atcctccgtg gt 1302
<210>54
<211>260
<212>PRT
<213>Ulkenia sp.
<400>54
Lys Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr Ala Gly Cys Lys
1 5 10 15
Asp Lys Glu Glu Phe Trp Lys Val Val Met Gly Gly Glu Ala Ala Trp
20 25 30
Thr Lys Ile Ser Asp Lys Arg Leu Gly Ser Asn Lys Arg Ala Glu His
35 40 45
Phe Lys Ala Glu Arg Ser Lys Phe Ala Asp Thr Phe Cys Asn Glu Asn
50 55 60
Tyr Gly Cys Val Asp Asp Ser Val Asp Asn Glu His Glu Leu Leu Leu
65 70 75 80
Lys Leu Ser Lys Lys Ala Leu Ser Glu Thr Ser Val Ser Asp Ser Thr
85 90 95
Arg Cys Gly Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu
100 105 110
Gln Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu
115 120 125
Gly Ala Arg Val Phe Lys Asp Ala Ser Lys Trp Ser Glu Arg Glu Gln
130 135 140
Ser Gln Asn Pro Glu Ala Gly Asp Arg Arg Ile Phe Met Asp Pro Ala
145 150 155 160
Ser Phe Val Ala Glu Glu Leu Asn Leu Gly Pro Leu His Tyr Ser Val
165 170 175
Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val Leu Arg Leu Ala Gln Asp
180 185 190
His Leu Val Ser Gly Ala Ala Asp Val Met Leu Ala Gly Ala Thr Cys
195 200 205
Phe Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala
210 215 220
Met Pro Val Ser Gly Asp Gly Ile Ser Tyr Pro Leu His Lys Asp Ser
225 230 235 240
Gln Gly Leu Thr Pro Gly Glu Gly Gly Ala Ile Met Val Leu Lys Arg
245 250 255
Leu Asp Asp Ala
260
<210>55
<211>17
<212>PRT
<213>Ulkenia sp.
<400>55
Pro Leu His Tyr Ser Val Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val
1 5 10 15
Leu
<210>56
<211>4
<212>PRT
<213>Ulkenia sp.
<400>56
Asp Ala Ala Cys
1
<210>57
<211>159
<212>PRT
<213>Ulkenia sp.
<400>57
Tyr Gly Thr Leu Leu Gly Ala Thr Ile Ser Asn Ala Gly Cys Gly Leu
1 5 10 15
Pro Leu Lys Pro His Leu Pro Ser Glu Lys Ser Cys Leu Ile Asp Thr
20 25 30
Tyr Lys Arg Val Asn Val His Pro His Lys Ile Gln Tyr Val Glu Cys
35 40 45
His Ala Thr Gly Thr Pro Gln Gly Asp Arg Val Glu Ile Asp Ala Val
50 55 60
Lys Ala Cys Phe Glu Gly Lys Val Pro Arg Phe Gly Ser Ser Lys Gly
65 70 75 80
Asn Phe Gly His Thr Leu Val Ala Ala Gly Phe Ala Gly Met Cys Lys
85 90 95
Val Leu Leu Ala Met Lys His Gly Val Ile Pro Pro Thr Pro Gly Val
100 105 110
Asp Gly Ser Ser Gln Met Asp Pro Leu Val Val Ser Glu Pro Ile Pro
115 120 125
Trp Pro Asp Thr Glu Gly Glu Pro Lys Arg Ala Gly Leu Ser Ala Phe
130 135 140
Gly Phe Gly Gly Thr Asn Ala His Ala Val Phe Glu Glu Phe Asp
145 150 155
<210>58
<211>426
<212>PRT
<213>Ulkenia sp.
<400>58
Lys Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr Ala Gly Cys Lys
1 5 10 15
Asp Lys Glu Glu Phe Trp Lys Val Val Met Gly Gly Glu Ala Ala Trp
20 25 30
Thr Lys Ile Ser Asp Lys Arg Leu Gly Ser Asn Lys Arg Ala Glu His
35 40 45
Phe Lys Ala Glu Arg Ser Lys Phe Ala Asp Thr Phe Cys Asn Glu Asn
50 55 60
Tyr Gly Cys Val Asp Asp Ser Val Asp Asn Glu His Glu Leu Leu Leu
65 70 75 80
Lys Leu Ser Lys Lys Ala Leu Ser Glu Thr Ser Val Ser Asp Ser Thr
85 90 95
Arg Cys Gly Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu
100 105 110
Gln Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu
115 120 125
Gly Ala Arg Val Phe Lys Asp Ala Ser Lys Trp Ser Glu Arg Glu Gln
130 135 140
Ser Gln Asn Pro Glu Ala Gly Asp Arg Arg Ile Phe Met Asp Pro Ala
145 150 155 160
Ser Phe Val Ala Glu Glu Leu Asn Leu Gly Pro Leu His Tyr Ser Val
165 170 175
Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val Leu Arg Leu Ala Gln Asp
180 185 190
His Leu Val Ser Gly Ala Ala Asp Val Met Leu Ala Gly Ala Thr Cys
195 200 205
Phe Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala
210 215 220
Met Pro Val Ser Gly Asp Gly Ile Ser Tyr Pro Leu His Lys Asp Ser
225 230 235 240
Gln Gly Leu Thr Pro Gly Glu Gly Gly Ala Ile Met Val Leu Lys Arg
245 250 255
Leu Asp Asp Ala Ile Arg Asp Gly Asp His Ile Tyr Gly Thr Leu Leu
260 265 270
Gly Ala Thr Ile Ser Asn Ala Gly Cys Gly Leu Pro Leu Lys Pro His
275 280 285
Leu Pro Ser Glu Lys Ser Cys Leu Ile Asp Thr Tyr Lys Arg Val Asn
290 295 300
Val His Pro His Lys Ile Gln Tyr Val Glu Cys His Ala Thr Gly Thr
305 310 315 320
Pro Gln Gly Asp Arg Val Glu Ile Asp Ala Val Lys Ala Cys Phe Glu
325 330 335
Gly Lys Val Pro Arg Phe Gly Ser Ser Lys Gly Asn Phe Gly His Thr
340 345 350
Leu Val Ala Ala Gly Phe Ala Gly Met Cys Lys Val Leu Leu Ala Met
355 360 365
Lys His Gly Val Ile Pro Pro Thr Pro Gly Val Asp Gly Ser Ser Gln
370 375 380
Met Asp Pro Leu Val Val Ser Glu Pro Ile Pro Trp Pro Asp Thr Glu
385 390 395 400
Gly Glu Pro Lys Arg Ala Gly Leu Ser Ala Phe Gly Phe Gly Gly Thr
405 410 415
Asn Ala His Ala Val Phe Glu Glu Phe Asp
420 425
<210>59
<211>267
<212>PRT
<213>Ulkenia sp.
<400>59
Met Arg Ile Ala Ile Thr Gly Met Asp Ala Thr Phe Gly Ser Leu Lys
1 5 10 15
Gly Leu Asp Ala Phe Glu Arg Ala Ile Tyr Asn Gly Gln His Gly Ala
20 25 30
Val Pro Leu Pro Glu Lys Arg Trp Arg Phe Leu Gly Lys Asp Lys Asp
35 40 45
Phe Leu Asp Leu Cys Gly Val Lys Glu Val Pro His Gly Cys Tyr Ile
50 55 60
Glu Asp Val Glu Val Asp Phe Ser Arg Leu Arg Thr Pro Met Thr Pro
65 70 75 80
Asp Asp Met Leu Arg Pro Met Gln Leu Leu Ala Val Thr Thr Ile Asp
85 90 95
Arg Ala Ile Leu Asn Ser Gly Leu Lys Lys Gly Gly Lys Val Ala Val
100 105 110
Phe Val Gly Leu Gly Thr Asp Leu Glu Leu Tyr Arg His Arg Ala Arg
115 120 125
Val Ala Leu Lys Glu Arg Ala Arg Pro Glu Ala Ala Ser Ala Leu Asn
130 135 140
Asp Met Met Ser Tyr Ile Asn Asp Cys Gly Thr Ala Thr Ser Tyr Thr
145 150 155 160
Ser Tyr Ile Gly Asn Leu Val Ala Thr Arg Val Ser Ser Gln Trp Gly
165 170 175
Phe Glu Gly Pro Ser Phe Thr Ile Thr Glu Gly Asn Asn Ser Val Tyr
180 185 190
Arg Cys Ala Glu Leu Gly Lys Tyr Leu Leu Glu Thr Gly Glu Val Glu
195 200 205
Ala Val Val Ile Ala Gly Val Asp Leu Cys Ala Ser Ala Glu Asn Leu
210 215 220
Tyr Val Lys Ser Arg Arg Phe Lys Val Ser Glu Gln Glu Ser Pro Arg
225 230 235 240
Ala Ser Phe Asp Ser Gly Ala Asp Gly Tyr Phe Val Gly Glu Gly Cys
245 250 255
Gly Ala Leu Val Leu Lys Arg Glu Ser Asp Cys
260 265
<210>60
<211>264
<212>PRT
<213>Ulkenia sp.
<400>60
Ala Ala Phe Gly Leu Ser Leu Gly Glu Ile Ser Met Val Phe Ala Phe
1 5 10 15
Ser Glu Lys Asn Gly Leu Val Ser Glu Glu Leu Thr Thr Lys Leu Arg
20 25 30
Asn Ser Glu Val Trp Arg Lys Ala Leu Ala Val Glu Phe Asp Ala Leu
35 40 45
Arg Lys Ala Trp Asn Ile Pro Gln Asp Thr Pro Val Ser Glu Phe Trp
50 55 60
Gln Gly Tyr Val Val Arg Gly Thr Arg Glu Ala Val Glu Ala Ala Ile
65 70 75 80
Gly Pro Asn Asn Lys Tyr Val His Leu Thr Ile Val Asn Asp Ala Asn
85 90 95
Ser Ala Leu Ile Ser Gly Lys Pro Glu Asp Cys Lys Ala Ala Ile Ala
100 105 110
Arg Leu Ser Ser Asn Leu Pro Ala Leu Pro Val Asp Leu Gly Met Cys
115 120 125
Gly His Cys Pro Val Val Glu Pro Tyr Gly Lys Gln Ile Ala Glu Ile
130 135 140
His Ser Val Leu Glu Ile Pro Glu Val Ala Gly Leu Asp Leu Tyr Thr
145 150 155 160
Ser Val Asn Gln Lys Lys Leu Val Asn Lys Ser Thr Gly Ala Ser Asp
165 170 175
Glu Tyr Ala Pro Ser Phe Gly Glu Tyr Ala Ala Gln Leu Tyr Thr Val
180 185 190
Gln Ala Asp Phe Pro Lys Ile Ala Lys Thr Val Ser Asp Lys Asn Phe
195 200 205
Asp Val Phe Val Glu Thr Gly Pro Asn Ala His Arg Ser Ala Ala Ile
210 215 220
Arg Ala Thr Leu Gly Asn Ser Lys Pro Phe Val Thr Gly Ser Met Asp
225 230 235 240
Arg Gln Asn Glu Asn Ala Trp Thr Thr Met Val Lys Leu Val Ala Ser
245 250 255
Leu Gln Ala His Arg Val Pro Gly
260
<210>61
<211>434
<212>PRT
<213>Ulkenia sp.
<400>61
Ser Arg Ala Phe Met Glu Thr Tyr Gly Val Ser Ala Pro Met Tyr Thr
1 5 10 15
Gly Ala Met Ala Lys Gly Ile Ala Ser Ala Glu Met Val Ile Ala Ala
20 25 30
Gly Lys Arg Gly Ile Leu Gly Ser Leu Gly Ala Gly Gly Leu Pro Ile
35 40 45
Ala Thr Val Arg Lys Ala Leu Glu Ala Ile Gln Ala Glu Leu Pro Lys
50 55 60
Gly Pro Tyr Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser Asn Leu
65 70 75 80
Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly Val Thr Val Val
85 90 95
Glu Ala Ser Ala Phe Met Thr Leu Thr Pro Gln Leu Val Arg Tyr Arg
100 105 110
Ala Ala Gly Leu Ser Arg Ala Ala Asp Gly Ser Thr Val Ile Lys Asn
115 120 125
Arg Val Ile Gly Lys Val Ser Arg Thr Glu Leu Ala Ala Met Phe Ile
130 135 140
Arg Pro Ala Pro Glu Asn Leu Leu Glu Lys Leu Leu Lys Ser Gly Glu
145 150 155 160
Ile Thr Gln Glu Gln Ala Ala Leu Ala Arg Thr Val Pro Val Ala Asp
165 170 175
Asp Ile Ala Val Glu Ala Asp Ser Gly Gly His Thr Asp Asn Arg Pro
180 185 190
Ile His Val Ile Leu Pro Leu Ile Val Asn Leu Arg Asp Arg Leu His
195 200 205
Lys Glu Cys Gly Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly
210 215 220
Gly Gly Ile Gly Cys Pro Gln Ala Ala Ile Ala Thr Phe Asn Met Gly
225 230 235 240
Ala Ala Phe Ile Val Thr Gly Thr Val Asn Gln Met Ser Lys Gln Ala
245 250 255
Gly Thr Cys Asp Thr Val Arg Lys Gln Leu Ser Gln Ala Thr Tyr Ser
260 265 270
Asp Ile Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val Lys
275 280 285
Leu Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala Asn Lys
290 295 300
Leu Tyr Glu Leu Phe Val Lys Tyr Asp Ser Phe Glu Ser Met Ala Pro
305 310 315 320
Gly Glu Leu Glu Arg Val Glu Lys Arg Ile Phe Lys Lys Ser Leu Ser
325 330 335
Glu Val Trp Glu Glu Thr Lys Asp Phe Tyr Ile Asn Arg Leu Gln Asn
340 345 350
Pro Glu Lys Ile Glu Arg Ala Glu Arg Asp Pro Lys Leu Lys Met Ser
355 360 365
Leu Cys Phe Arg Trp Tyr Leu Gly Leu Ala Ser Phe Trp Ala Asn Ala
370 375 380
Gly Ile Pro Asp Arg Ala Met Asp Tyr Gln Val Trp Cys Gly Pro Ala
385 390 395 400
Ile Gly Ser Phe Asn Asp Phe Ile Lys Gly Thr Tyr Leu Asp Pro Ala
405 410 415
Val Ala Asn Glu Tyr Pro Asp Val Val Gln Ile Asn Leu Gln Ile Leu
420 425 430
Arg Gly
<210>62
<211>2000
<212>DNA
<213>Ulkenia sp.
<400>62
gagcacgcac catcttctct ccacgcgtaa agaagagcag agccagaggc aggtaggtat 60
ctccacccat ctcaggctgt gacttctttg tttctttctt tctttgcttg ttttctgttc 120
tctctctgtg ctctgtccac acgagaaaga gaaagagaga gagaaagaac cacgggttta 180
tagagcgcac tcgtccttcc tgcttcagca gaaagcactg cgtaggagaa ctacggggga 240
ggaggaagca cgcacggagg aggcgtggaa ggaaggagga gacagagaga gagagacact 300
gagggacaga gggggagagg cagagggaga ggcatctgat gtttgcgaga aaccaataag 360
ttttgaaagt gatttgattt agctgattga ctgatctatg gcctgaaaga aagcttttaa 420
agcggaggga gatagatgac gagggcagct gcgatggcgt acggcgcatc cgtctctctc 480
tgtgtctctc tctctttctc tctcgtcagg gcgtggagac ctcggaagct gcacgcggcg 540
cggtgaggag gcagggcagc agagggagag gagagatccc agagtcgaag agcattgatt 600
gattgcagat gatcttgggc aacgcgcgtc agcttgagcg aggaatgctt tggacttcag 660
gttcttcgct tctgtgtttc attctttctc gaagaaagaa agaatgaaag aaagagagaa 720
agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa tgaatgaatg aaagaaagag 780
agaaagaaag aacgaatgaa agaaagagag aaagaatcaa agagaaagcg cattcgcagt 840
tcttcttcgt gaaagaaaag gaaaagagag gcgatggtag gctctgatct catcatttct 900
ggtttctctg ttgtacctgt actctgtgct tgtggccttg cgaaggctga agacgccatg 960
cagacaacca cgcctccgca gagactttgc gggaaagcag agggcttctc gccactctcg 1020
aagaaacgag ctcgccagtt ttcggggttg ttctcagaat tgcgagtgtt ggctttatat 1080
gggatgatgg tatggcactt cgtcatcgtt actctcgctc gcttgcttac gaagattttc 1140
aaaagggcga aagaagtgct cagcttttaa aataaagtca caccaaagac taggccgcat 1200
agcagaaagc taaagtaaac ccaatctgtc tgaagagagt gtcgtggtta gatacttacg 1260
caagagttta aaagctgtaa atagtacagg aacaaaaaca aataaatata tatatattct 1320
tttttattag taaaacatga aaccaaaaaa ctcctttaaa ataaaataaa ataaaataaa 1380
ataaaataaa ataaaataaa tttactacta tatatacata tatatataca ataaataaaa 1440
acaacttttt cagaccagaa aaagactgag aaaaaaggaa actaatgact ctcgagcacc 1500
gagagcgata taagagtgga ttatatttgc taggcccacc acgagtgagt cccctaggag 1560
gaagcgccct ctgagacagg agcagaggcg tcgctggtgc tccaaaaagc gacggcgaat 1620
ggaaagcaaa accctttcga gggaggcttg tggccgtgac tattcaaatc tccagcatct 1680
cagctccagc acagcagaag ctacctcgct tctcagctct agctatcaca tcgatcgcag 1740
catctagctc gtagacagct agcgccgcac cttcccccaa atcaacttgg gcaacttaac 1800
tcttttttca ccagaactcc tcttttcctt taatcttcga aaagaagacg aataaaagag 1860
ataatcctct gccgcagcac attctaaaag aaaagcggca tactggcgta ggcaagactt 1920
tcaagctctt cctcgcctcc accccgtatt tccctgttca tctttgtgaa acgaggaaac 1980
aagaaatttt ataggacaag 2000
<210>63
<211>2000
<212>DNA
<213>Ulkenia sp.
<400>63
agttgtgagg ctgtcttgtc ttgtcagtcg cgaaagtgta agcaagaact ttgtcataca 60
aagaagcaac caacttccga accaacacac cttgtaggat tacaaccaca actttctata 120
aatagtgcgc aagaataacc agtaagctat ccttcgtgta cctgttacaa caacgacatt 180
tttacttgat cttcctactt gtgatgggta gtcccggctt gtactgacag tgatgccaca 240
gcagagtaga tcactgtgaa taagtaaata agcctactta ttatattccc aaagtactcg 300
ctgggatatt attagtatca cgaaaagtga tatgttttat aactcgcttg tcttgccaag 360
atctaacctt ttttttttaa atggccaaaa agtcgccaga acacatctta caataaacaa 420
aaatttagat tatatcgtat gtataatgta taatatatta tattattata tacatacgat 480
ataatctaaa gccattccag acttattcgg tgatgaaaaa tgctttccca gctttataca 540
aactattcaa aaagttgcat gacccatttt cagatatatt taatagtata agattatgtc 600
catttgtttt caaagttatt caagagttta catcttgaag tttcatccct ttactactac 660
actgtttttc gtttgggttt tttctctaac ggcgaaagaa acaagtcacc aagcttaact 720
agtaggcatc tttgtggtga cgaaattaaa gttgaatata taaattatag ttagtcatta 780
tggaatctca gtttgaacga agctaagcta tttataaaaa tcactgcatg gagataatac 840
ttgaattttg atgatagtgt ttatgaagaa gtttaatctt gctttttatt aatgttattc 900
tctaatatag aaatatttca ataaaaaaat catatgaagg gataataaat acagagaatg 960
atcgttatca tttgatatgt cgaacgctaa tctatcatct tatctaggaa acaaaggtgg 1020
aaataaagga aagccctaca cgagttaatt cctcaaacga actactttgg attatcaaat 1080
ccaactgctg acactggata catgcatgta tttagtgggt gttactgtac ttccttattt 1140
cctttaattc aattgtcttg atttttactt cggagattct acttgaaaat catctccctt 1200
cacttccggt tatacagaaa gacccttcaa ttcgaatgct ggccaggtac aataactatc 1260
agcgattccc ctccactaga catgaccgac tgtaagcacc tcaacccgat ttcaagcaac 1320
acatgatgac tagctgtttc cgcaaaacaa caaataagag aggtagtgga aaacacccag 1380
ttcgctcgag ctcccctagt agattcgaca ttcactttct atttgattgc taattgtggg 1440
tccggctatt taaggaaaga actgatgaaa gtccacctca cgcaatcaaa tcgcggtcta 1500
gttggaagct acaatggccg acgtatgcgc gcctctatct tttaggattg tagaacaggg 1560
cggcaatctg ctaacataaa tttaatacct tgctcaagct gctttccata cttttcaatc 1620
catttgtgat aatcttgcaa tggaccaatc tccaaatctg tagaagcaat aacaaggaca 1680
tcgcagggtc ccggttcgtt tgcatgctcg tcttctggtg ccacaacaat gctgcctgtt 1740
attatctcat gagagtcttt atactgcgga tccgtggcta tagcgtgaat aaacgttgtg 1800
cgcaagccta tatcctcgcg atggagatac tggcctgcta cagtttgcgt tcgtctgcct 1860
acgacaacgc atggaacatt ctttggtgtg cgagtgggcc gtagcgttcg accctgggca 1920
aggaagccat gcagacgtga ttccgagagg ccatctcgcg tgtaagactt atcccaattt 1980
tctggatcct ctaatttcca 2000
<210>64
<211>2000
<212>DNA
<213>Ulkenia sp.
<400>64
aaattaatga atgaatcaat gaatgaatca atgaataatg ccaatgcaat gcgatgcgat 60
gctgcttcga gccatcgcac ggcggccatt gcgcgcttgc gtcagtcatg tcattccatt 120
cggagcggcg tgcgcgaggg agggagggag ggagggagaa gacgaggagc aggcggagag 180
agaggaggat gggcgggcgg gcggcgtcgt cggcgtcgtc gtcgtcgtgg gcctccgtag 240
tcgctgggaa ggagggcttt gattccaaat gaggattttg gtgcactgct ttcgagactt 300
tctcgcctga ttcggaattc ctcctcttct tcttcttttt agctgtgctt tctgcgtatt 360
cattgcgtgg gtttggcttg gttttcaaat caattagcag tctagtaact aacaaactaa 420
caaacagata aacagacaaa cagacaaaca aacaaaacaa acaaaacaaa caaaacaaag 480
caggaaagaa agaaacaaac aaatatacaa acaaagaaag aaagaagtgg tgggaactag 540
ggaaatcaat gtgtttgctt ctttcgcacc tttgcttttc ttgcttttct tggttctcaa 600
gtaagcgttt atcgcgccct cagaaaacaa aataaaatga tctaacataa catgaattta 660
tatttatttt atttgtttat taaataaata ttttttgtaa accagaattt cactctactt 720
ttgcaacact gagagagtgc catctgcata ataagtggca gtgttttttt gtttattttc 780
aaattaatta tacttgaact gctaggtcaa gaggccgcag cagcctgatg agataaggac 840
agagtaggca aggatggcag aagatcgcga aaaaagcgag aaaggcaaac gagcaggccc 900
gaaggtgagg tggagctgct tgtcaaggtc gcgaggtttg tttgacagtt ataacagcaa 960
gaactaaggc aatttcaaga atgaagagca ctcgaataaa ccgatgaagc aaagtgtgta 1020
catacaaaca tacatacgta cagatgaaaa gaacagattt tcaataaaaa tgacttttta 1080
gtttaaacaa tgtttctgtt tgttgtttcg cttttcatta atttgttgca aattattttg 1140
ttttggtttt tgtttttgtt tttgaaaatc ataaaagaga tgctgccgca gacgtctgcg 1200
cgtctcatag ttgattgggt aatcgttttg ttgagttttg aaaatgtaaa cttcacttag 1260
ttgctcattt atcctcattc gtttgcccat ttgttctctg tttgaagcag agttttgact 1320
tctcgcattc gtggaatcca ccccttgctt gctttgcttg cttgcttgct tgcttgcttg 1380
cctgcttgct ttgcttgctt gcttgaccag cgtgcgcgct ttcgccagcc tagccttcga 1440
gacctcttga agaccctttg gagcgtctag ttcgaggttc tttctatttg cttcaagaga 1500
gacaaaataa caaagaaaaa gagagaaaaa acaagcaaag aaagaaacaa ggaaacaaac 1560
cacaaagcac gcatcgtgca tccaaacttt catcccccca ctctctctct ctctctctct 1620
ctctctctcc ttcctcggaa aaggagtgag acaaaggcag acagcctcta gcttggcagc 1680
ctcgcagctc gtgcggcgcc agttcctaca gcttcgcgct gtccaaacgc cagtccatcg 1740
cagcttcggc tagctagttg gctgattgat tgattgattg attgatagcc tttattacgg 1800
cgttgattaa ctgattgatt atttgattgc tctggcatcc ctgtaatcac ttgctcaagg 1860
tagtcaatca catcatttat acatctcctc caaagcaaac catctacacg accgcttttt 1920
gatcgatcta aaagtgccgg tcaggtgaca cgcaagctct tttttttgtt tacagtaagc 1980
agcaacaaga aagcaaaaag 2000
<210>65
<211>2000
<212>DNA
<213>Ulkenia sp.
<400>65
gcccaatttg ctcctgatct gttcccatga ttatgatagg gataggtagt agttatagct 60
agactcattc cattcactta atccacatat gcaaattata attttatgtg tcgcatataa 120
actttccaaa ctttaaaatt ttcatttgca ttttatatat agatcacctg tgatcccttt 180
ctcgcccctt tcaacttcca aagtttacct actatcatat ggcatggcgc agccaatgca 240
ctctataaca tataagtaac agagatagtt tttgccgcat catttactct ttactcttgc 300
tatacaaggt aagcgccaag agagttaatt acatctgttt tatcggttcc tagtggaaat 360
aatagtgaca actataatta gtaggagtcc ttattgaccc tagtcatttg agcttgcacc 420
agatttgatg tttttgcaaa cgaccttgac gcagagtgac gagcgaaaat tggatcccct 480
tggttgaagt ctaaactagc ttaaaatata tatgctcttc atataatata aagctgtttt 540
agattctatc aaataagaaa ttgatgactt tgagcaaatt aatatttggt atgggctccg 600
gcatctctga aaacgcttaa atgaagcttt tattcaccac gattcgacaa ctaaggttat 660
tttccacata attataactt ttcctacata actgtgctgt cgactcacac cttctttata 720
tatatagcct cgtagggatt cgaaactatg aattaagact cgttgaagtt tgatttatcc 780
attattttgc tgcacaaact atcgctaaga tataaagatc gtgcccagag cctgctatag 840
ggtcctaatg gcatgcttag cccggatttc cacgataaag ctgcattgta ttgagtatat 900
gcactcagag agtaaacttt aattgcaacg aacaatcttt ggcaagtcat atctcagcca 960
tcaatacatg tattgtgttc aaacgaattg cagcatatca ctcaaattat tttggtctag 1020
ttcagcggaa tcttttggtt gttttagtaa gagttgagta gagtatgttg gatgagtgtg 1080
tccacaaggt tatttgaata gggtatttac attctacaac atagtcagta agctctcgtg 1140
tgataaactg tatcaaaatc gacacaataa caggctagtg gtgccctgtg cacgttttta 1200
ccataacatg acagctacag catcagaaac aggtgtggtg cgcattttgg ttattctgat 1260
cctgaaacct aagaacaatt ttcatcgtct tgctagattg tgttttctgt attccatttg 1320
tggagcttca acatccatgc tgctgagtat tttcacatga agatcatagt gttagaatgt 1380
ttagtaagcc tattactaag ttttgaggta taggtgcttg ttgttgtcct tacataaata 1440
catgctgtct ttagtgctta gaccaacgtt gagtgtatcg tgctcttggc agaagaatag 1500
acatttataa cattatggtg aaaggcgatg gtctcgcttg catgttctcg cttgcgtttg 1560
cgtatcccta tacacttaac cgttgtttat gtgtacctaa gctatcatgc tgcatcttta 1620
caattttata caaataaatt tattttggaa tatataattg gtcactattt caggccagtt 1680
gacagtcctt aagatttgta gttgcgctgt tctcgtagtg agaatgaaga agcggaatct 1740
acatccatct gtgattgcat aagagcttgc ataagagtga agtaggtgaa agtcacagag 1800
aatatcttcc ctactatcct aaaggcaagg aatactacta tacacgaaca tagtaatgga 1860
attttacaca acagaagtac ccttgtctcc tgcctccttt tattattcca ttatgctctg 1920
ttatataatg aatgaagacg acttttaaca tcatttgatt ctcgagcagg cacgcacaat 1980
atagaggaag gattggcgtc 2000
<210>66
<211>1212
<212>DNA
<213>Ulkenia sp.
<400>66
ggcaagaacg tcgttttcga ctatgacgag ctccttgagt tcgccgaggg tgacatcagc 60
aaggtcttcg gccccgaatt cagccagatc gaccagtaca agcgtcgcgt tcgtctcccc 120
gcccgcgagt acctcctcgt cacccgcgtc accctcatgg acgccgaggt caacaactac 180
cgcgtcggtg cccgcatggt cactgagtac gacctccccg tcaacggtga gctctctgag 240
ggtggtgact gcccctgggc cgtgctcgtc gagagtggtc agtgtgatct catgctcatc 300
tcctacatgg gtattgactt ccagaacaag agcgaccgcg tctaccgtct gctcaacacc 360
accctcacct tctacggtgt tgcccaggag ggcgagaccc tggagtacga catccgcgtg 420
accggcttcg ccaagcgtct cgacggtgac atctccatgt tcttcttcga gtacgactgc 480
tacgtcaacg gccgtctcct catcgagatg cgcgacggct gtgccggttt cttcaccaac 540
gaggagctcg ccgccggcaa gggtgtcgtc tttacccgcg ctgatctcct cgcccgcgag 600
aagaccaaga agcaggacat caccccgtac gccattgccc cgcgtcttaa caagaccgtt 660
ctcaacgaga ctgagatgca gtccctcgtg gacaagaact ggaccaaggt tttcggcccc 720
gagaacggca tggaccagat caactacaaa ctctgcgccc gtaagatgct catgattgac 780
cgcgtcacca agattgacta caccggtggc ccctacggcc ttggtcttct cgttggtgag 840
aagatcctcg agcgcgacca ctggtacttt ccgtgccact tcgtcggaga ccaggtcatg 900
gctggatccc tcgtgtctga cggctgcagc cagctcctca agatgtacat gctctggctc 960
ggcctccacc ttaagaccgg tcccttcgac ttccgccccg tcaacggcca ccccaacaag 1020
gtccgctgcc gtggccagat ctccccgcac aagggtaagc tcgtatacgt catggagatc 1080
aaggagatgg gctacgacga ggctggtgac ccgtacgcca tcgccgatgt caacattctc 1140
gacattgact tcgagaaggg ccagactttc gaccttgcca acctccacga gtacggcaag 1200
ggcgacctca ac 1212
<210>67
<211>21
<212>DNA
<213>Ulkenia sp.
<400>67
tggtactttc cgtgccactt c 21
<210>68
<211>1197
<212>DNA
<213>Ulkenia sp.
<400>68
gtgcccggcg agatgccgct ctcgtggtac aacatggctg agttcatggc cggcaaggtc 60
agcctctgcc tcggccctga gttcgccaag ttcgatgact ccaacaccag ccgcagccct 120
gcatgggacc ttgctcttgt gactcgtgtg gtctccgttt ctgacatgga gtgggtccag 180
tggaagaacg tggactgcaa cccgtccaag ggaaccatgg ttggcgagtt cgactgcccc 240
atcgacgcct ggttcttcca gggatcttgt aacgacggcc acatgccgta ctccatcctc 300
atggagatcg ccctccagac ctctggtgtc ctcacctctg tgctcaaggc cccgctcacc 360
atggagaaga aggacattct cttccgcaac cttgacgcca acgccgagat ggttcgctct 420
gatattgacc tccgcggcaa gaccatccac aacctcacca agtgtaccgg ctacagcatg 480
ctcggagaca tgggtgtcca ccgcttcagc ttcgagctct ctgttgatgg tgtagtcttc 540
tacaagggta ccacctcctt cggctggttc gtccctgagg tcttcatctc ccagactggt 600
ctcgacaacg gtcgccgcac ccagccctgg cacattgagt ccaaggtgcc ttccgcccag 660
gtcctcacct acgacgttac ccccaacggt gccggtcgca cccagctcta cgccaacgcc 720
cccaagggcg ctcagctcac tcgccgctgg aaccagtgcc agtaccttga caccatcgac 780
cttgtggtcg ccggtggctc cgccggtctt ggctacggtc atggccgcaa gcaggtgaac 840
cccaaggact ggttcttctc gtgccacttc tggttcgact ccgtcatgcc cggctcgctc 900
ggtgtggagt ctatgttcca gctcgtcgag tccatcgctg tcaagcagga cctcgccggc 960
aagtacggca tcaccaaccc gaccttcgct catgctccgg gcaagatctc ctggaagtac 1020
cgtggtcagc tcacccccac ctccaagttc atggactccg aggcccacat tgtctccatc 1080
gaggcccacg acggcgtcgt cgacatcgtt gccaatggta acctctgggc tgatggcctc 1140
cgcgtctaca acgtcagcaa catccgtgtg cgcattgttg ctggcgccgc ccctgct 1197
<210>69
<211>21
<212>DNA
<213>Ulkenia sp.
<400>69
tggttcttct cgtgccactt c 21
<210>70
<211>90
<212>DNA
<213>Ulkenia sp.
<400>70
gctggcgccg cccctgctgc tgctgctgct gctgctgctg ttgctgctcc ggctgccgcc 60
cctgctccgg ttgctgcatc tggccctgcc 90
<210>71
<211>1299
<212>DNA
<213>Ulkenia sp.
<400>71
gaaggcttca tgaagaccta cggtgttgtg gctcctctct acaccggtgc catggccaag 60
ggtattgcct ctgctgacct tgtgattgcc actggtaagc gcaagatcct cggttccttc 120
ggtgctggcg gtctccccat gcacattgtc cgtgccgctg ttgagaagat ccaggctgag 180
ctcccgaacg gccccttcgc cgtcaacctc atccactccc ccttcgatag caaccttgag 240
aagggcaacg ttgacctctt cctcgagaag ggcgttactg tcgtcgaggc ctccgccttc 300
atgaccttga ccccgcaagt cgtccgctac cgtgctgctg gtctttcccg taacgctgat 360
ggctccatta acatcaagaa ccgcatcatc ggtaaggtct cccgtaccga gctcgctgag 420
atgttcatcc gccctgcccc gcagaacctc ctcgacaagc tcatccagtc tggtgagatt 480
accaaggagc aggctgagct tgccaagctc gtccccgtcg ccgacgacat cgccgtcgag 540
gccgactctg gtggccacac cgacaaccgc cccatccacg tcatcctccc ccttatcatc 600
aacctccgca accgcctcca caaggagtgc ggctaccccg ctcacctccg cgtgcgcgtt 660
ggagctggtg gtggtgttgg atgcccccag gccgctgccg ctgctctcgc tatgggtgct 720
gccttccttg ttaccggcac tgtcaaccag gtcgccaagc agtccggcac ctgcgacaat 780
gtccgcaagc agctctgcat ggccacctac tctgacgtct gcatggctcc cgctgctgac 840
atgttcgagg agggcgtcaa gctccaggtc ctcaagaagg gaaccatgtt cccgtccagg 900
gctaacaagc tctacgagct cttctgcaag tacgactcct tcgagtccat gcctgccaca 960
gagctcgagc gtgttgagaa gcgcatcttc cagtgccctc ttgctgatgt ctgggctgag 1020
acctccgact tctacatcaa ccgcctccac aacccggaga agatcacccg tgccgagcgt 1080
gaccccaagc tcaagatgtc tctctgcttc cgctggtacc ttggtcttgc ctctcgctgg 1140
gccaacaccg gtgaggctgg acgcgtcatg gactaccagg tctggtgtgg ccctgccatt 1200
ggagccttca acgacttcat caagggctcc taccttgacc cggccgtctc tggtgagtac 1260
ccggacgtcg tgcagatcaa cttgcagatc cttcgcggt 1299
<210>72
<211>404
<212>PRT
<213>Ulkenia sp.
<400>72
Gly Lys Asn Val Val Phe Asp Tyr Asp Glu Leu Leu Glu Phe Ala Glu
1 5 10 15
Gly Asp Ile Ser Lys Val Phe Gly Pro Glu Phe Ser Gln Ile Asp Gln
20 25 30
Tyr Lys Arg Arg Val Arg Leu Pro Ala Arg Glu Tyr Leu Leu Val Thr
35 40 45
Arg Val Thr Leu Met Asp Ala Glu Val Asn Asn Tyr Arg Val Gly Ala
50 55 60
Arg Met Val Thr Glu Tyr Asp Leu Pro Val Asn Gly Glu Leu Ser Glu
65 70 75 80
Gly Gly Asp Cys Pro Trp Ala Val Leu Val Glu Ser Gly Gln Cys Asp
85 90 95
Leu Met Leu Ile Ser Tyr Met Gly Ile Asp Phe Gln Asn Lys Ser Asp
100 105 110
Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu Thr Phe Tyr Gly Val Ala
115 120 125
Gln Glu Gly Glu Thr Leu Glu Tyr Asp Ile Arg Val Thr Gly Phe Ala
130 135 140
Lys Arg Leu Asp Gly Asp Ile Ser Met Phe Phe Phe Glu Tyr Asp Cys
145 150 155 160
Tyr Val Asn Gly Arg Leu Leu Ile Glu Met Arg Asp Gly Cys Ala Gly
165 170 175
Phe Phe Thr Asn Glu Glu Leu Ala Ala Gly Lys Gly Val Val Phe Thr
180 185 190
Arg Ala Asp Leu Leu Ala Arg Glu Lys Thr Lys Lys Gln Asp Ile Thr
195 200 205
Pro Tyr Ala Ile Ala Pro Arg Leu Asn Lys Thr Val Leu Asn Glu Thr
210 215 220
Glu Met Gln Ser Leu Val Asp Lys Asn Trp Thr Lys Val Phe Gly Pro
225 230 235 240
Glu Asn Gly Met Asp Gln Ile Asn Tyr Lys Leu Cys Ala Arg Lys Met
245 250 255
Leu Met Ile Asp Arg Val Thr Lys Ile Asp Tyr Thr Gly Gly Pro Tyr
260 265 270
Gly Leu Gly Leu Leu Val Gly Glu Lys Ile Leu Glu Arg Asp His Trp
275 280 285
Tyr Phe Pro Cys His Phe Val Gly Asp Gln Val Met Ala Gly Ser Leu
290 295 300
Val Ser Asp Gly Cys Ser Gln Leu Leu Lys Met Tyr Met Leu Trp Leu
305 310 315 320
Gly Leu His Leu Lys Thr Gly Pro Phe Asp Phe Arg Pro Val Asn Gly
325 330 335
His Pro Asn Lys Val Arg Cys Arg Gly Gln Ile Ser Pro His Lys Gly
340 345 350
Lys Leu Val Tyr Val Met Glu Ile Lys Glu Met Gly Tyr Asp Glu Ala
355 360 365
Gly Asp Pro Tyr Ala Ile Ala Asp Val Asn Ile Leu Asp Ile Asp Phe
370 375 380
Glu Lys Gly Gln Thr Phe Asp Leu Ala Asn Leu His Glu Tyr Gly Lys
385 390 395 400
Gly Asp Leu Asn
<210>73
<211>7
<212>PRT
<213>Ulkenia sp.
<400>73
Trp Tyr Phe Pro Cys His Phe
1 5
<210>74
<211>399
<212>PRT
<213>Ulkenia sp.
<400>74
Val Pro Gly Glu Met Pro Leu Ser Trp Tyr Asn Met Ala Glu Phe Met
1 5 10 15
Ala Gly Lys Val Ser Leu Cys Leu Gly Pro Glu Phe Ala Lys Phe Asp
20 25 30
Asp Ser Asn Thr Ser Arg Ser Pro Ala Trp Asp Leu Ala Leu Val Thr
35 40 45
Arg Val Val Ser Val Ser Asp Met Glu Trp Val Gln Trp Lys Asn Val
50 55 60
Asp Cys Asn Pro Ser Lys Gly Thr Met Val Gly Glu Phe Asp Cys Pro
65 70 75 80
Ile Asp Ala Trp Phe Phe Gln Gly Ser Cys Asn Asp Gly His Met Pro
85 90 95
Tyr Ser Ile Leu Met Glu Ile Ala Leu Gln Thr Ser Gly Val Leu Thr
100 105 110
Ser Val Leu Lys Ala Pro Leu Thr Met Glu Lys Lys Asp Ile Leu Phe
115 120 125
Arg Asn Leu Asp Ala Asn Ala Glu Met Val Arg Ser Asp Ile Asp Leu
130 135 140
Arg Gly Lys Thr Ile His Asn Leu Thr Lys Cys Thr Gly Tyr Ser Met
145 150 155 160
Leu Gly Asp Met Gly Val His Arg Phe Ser Phe Glu Leu Ser Val Asp
165 170 175
Gly Val Val Phe Tyr Lys Gly Thr Thr Ser Phe Gly Trp Phe Val Pro
180 185 190
Glu Val Phe Ile Ser Gln Thr Gly Leu Asp Asn Gly Arg Arg Thr Gln
195 200 205
Pro Trp His Ile Glu Ser Lys Val Pro Ser Ala Gln Val Leu Thr Tyr
210 215 220
Asp Val Thr Pro Asn Gly Ala Gly Arg Thr Gln Leu Tyr Ala Asn Ala
225 230 235 240
Pro Lys Gly Ala Gln Leu Thr Arg Arg Trp Asn Gln Cys Gln Tyr Leu
245 250 255
Asp Thr Ile Asp Leu Val Val Ala Gly Gly Ser Ala Gly Leu Gly Tyr
260 265 270
Gly His Gly Arg Lys Gln Val Asn Pro Lys Asp Trp Phe Phe Ser Cys
275 280 285
His Phe Trp Phe Asp Ser Val Met Pro Gly Ser Leu Gly Val Glu Ser
290 295 300
Met Phe Gln Leu Val Glu Ser Ile Ala Val Lys Gln Asp Leu Ala Gly
305 310 315 320
Lys Tyr Gly Ile Thr Asn Pro Thr Phe Ala His Ala Pro Gly Lys Ile
325 330 335
Ser Trp Lys Tyr Arg Gly Gln Leu Thr Pro Thr Ser Lys Phe Met Asp
340 345 350
Ser Glu Ala His Ile Val Ser Ile Glu Ala His Asp Gly Val Val Asp
355 360 365
Ile Val Ala Asn Gly Asn Leu Trp Ala Asp Gly Leu Arg Val Tyr Asn
370 375 380
Val Ser Asn Ile Arg Val Arg Ile Val Ala Gly Ala Ala Pro Ala
385 390 395
<210>75
<211>7
<212>PRT
<213>Ulkenia sp.
<400>75
Trp Phe Phe Ser Cys His Phe
1 5
<210>76
<211>30
<212>PRT
<213>Ulkenia sp.
<400>76
Ala Gly Ala Ala Pro Ala Ala Ala Ala Ala Ala Ala Ala Val Ala Ala
1 5 10 15
Pro Ala Ala Ala Pro Ala Pro Val Ala Ala Ser Gly Pro Ala
20 25 30
<210>77
<211>433
<212>PRT
<213>Ulkenia sp.
<400>77
Glu Gly Phe Met Lys Thr Tyr Gly Val Val Ala Pro Leu Tyr Thr Gly
1 5 10 15
Ala Met Ala Lys Gly Ile Ala Ser Ala Asp Leu Val Ile Ala Thr Gly
20 25 30
Lys Arg Lys Ile Leu Gly Ser Phe Gly Ala Gly Gly Leu Pro Met His
35 40 45
Ile Val Arg Ala Ala Val Glu Lys Ile Gln Ala Glu Leu Pro Asn Gly
50 55 60
Pro Phe Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser Asn Leu Glu
65 70 75 80
Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly Val Thr Val Val Glu
85 90 95
Ala Ser Ala Phe Met Thr Leu Thr Pro Gln Val Val Arg Tyr Arg Ala
100 105 110
Ala Gly Leu Ser Arg Asn Ala Asp Gly Ser Ile Asn Ile Lys Asn Arg
115 120 125
Ile Ile Gly Lys Val Ser Arg Thr Glu Leu Ala Glu Met Phe Ile Arg
130 135 140
Pro Ala Pro Gln Asn Leu Leu Asp Lys Leu Ile Gln Ser Gly Glu Ile
145 150 155 160
Thr Lys Glu Gln Ala Glu Leu Ala Lys Leu Val Pro Val Ala Asp Asp
165 170 175
Ile Ala Val Glu Ala Asp Ser Gly Gly His Thr Asp Asn Arg Pro Ile
180 185 190
His Val Ile Leu Pro Leu Ile Ile Asn Leu Arg Asn Arg Leu His Lys
195 200 205
Glu Cys Gly Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly Gly
210 215 220
Gly Val Gly Cys Pro Gln Ala Ala Ala Ala Ala Leu Ala Met Gly Ala
225 230 235 240
Ala Phe Leu Val Thr Gly Thr Val Asn Gln Val Ala Lys Gln Ser Gly
245 250 255
Thr Cys Asp Asn Val Arg Lys Gln Leu Cys Met Ala Thr Tyr Ser Asp
260 265 270
Val Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val Lys Leu
275 280 285
Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala Asn Lys Leu
290 295 300
Tyr Glu Leu Phe Cys Lys Tyr Asp Ser Phe Glu Ser Met Pro Ala Thr
305 310 315 320
Glu Leu Glu Arg Val Glu Lys Arg Ile Phe Gln Cys Pro Leu Ala Asp
325 330 335
Val Trp Ala Glu Thr Ser Asp Phe Tyr Ile Asn Arg Leu His Asn Pro
340 345 350
Glu Lys Ile Thr Arg Ala Glu Arg Asp Pro Lys Leu Lys Met Ser Leu
355 360 365
Cys Phe Arg Trp Tyr Leu Gly Leu Ala Ser Arg Trp Ala Asn Thr Gly
370 375 380
Glu Ala Gly Arg Val Met Asp Tyr Gln Val Trp Cys Gly Pro Ala Ile
385 390 395 400
Gly Ala Phe Asn Asp Phe Ile Lys Gly Ser Tyr Leu Asp Pro Ala Val
405 410 415
Ser Gly Glu Tyr Pro Asp Val Val Gln Ile Asn Leu Gln Ile Leu Arg
420 425 430
Gly
<210>78
<211>2000
<212>DNA
<213>Ulkenia sp.
<400>78
gcacgtagag caagaaagaa tgaaagaaag aacgaaagaa agaaagagag agagagagag 60
agagagagag agaaagcgaa gatgatagcg gagagaactc ttcttcgcag tcactctgtt 120
tctcagtcag tcccgcaacc aataacaact cgaactcgca gcagtgttct tcggagtgcc 180
agcgctcgct cgcactgcgt cggcacagca gcagcagcag caggccccgc gctcgctgca 240
ctcagcccgg gcaggagcaa cagctgctga gcagctgagg ccagctggct ggcggctcgc 300
ctcgcctcgc ctcgcgtcgc gtcgcgagag aaagcgatcg accaactgtc aatcgattat 360
tcgagtcctt cgagcgcttt atagggcact gattgatcac tcattgattc attgactcat 420
ttattctttg cgtggtcagc caaacggcgt tagcattggg caaagcgggt ctttgctttg 480
ctctaaaata gatttgctcg cgagagtacg tacttgcagg agtaggtagg ctctgcctag 540
tacctgggca tttgaatatt tgaacttcga acttcgttga gtatctgaat atttgaatat 600
ctgaatattt gaatttcgaa agtttgaata tttgaatatt tgaattttgg aatattggaa 660
tagctgggtt tggagataag acttactaag ctaagcgccg acgtaagagc ggcgagtaaa 720
tccacacaca agagagaggc agagagagag ggagggagac aactcgcgca ggcaagctga 780
gcccactgga cgcacggggc gcgtcccccc tgacgggcgc tctggtggtg gcgtgtttgg 840
gagggttttg catgcttgtg ataggggctc tggcgcgggc tctgtacggt gcttggagat 900
gcacgggcag ggcgagagag gggacgggtt cccgggaggc gctgcttgga ggtgctgaga 960
gggagggaga aggcgtgctt tgcgatgcgc ggggcgacct aggcgctgct gcgcggtgca 1020
gcagcaggga cctcggacgt gagtcgaagc cgtctgcaga ggagatggta gaagggccgc 1080
ggattggtag cagagaagag gaaatagaag aagaagaaga aatagaagaa gaagaaatag 1140
aagaagaaga aatagaagaa gaagaggagg acgggcaggc gggaaagatg gagaaaggac 1200
tcgcggcggg aaaacaagag aatgtgaact tgggcttgaa ctttggtttg aatttgaatg 1260
tggagaacga ggggttgaat ttgagtttga atttgaaaga aaacttacgg aaagaaagtt 1320
tagttgaaag tgagaaagaa aaaaatgaga aagaaaaaga gaaagaaaaa gagaaagaaa 1380
aagagaaaga aaaagagaaa gaaaaagaga aagaaaaaga gaaagaaaaa gagaaagaaa 1440
aagagaaaga aaaagagaaa gaaaaagaga aagaaaaaga gaaagaaaaa gaagaagaaa 1500
aagaagaaga aaaagagaaa gaaaaagaga aagaaaaaga gaaagaaaaa gaagaaggag 1560
atttaaaaag ttgtttagtt gaaaaaggag aaggaggaag aagcagcgac agcggcagaa 1620
gaagaagtag ttgttgtaag aggggaacgg aggcagtagc agtggagcag gcggaggcga 1680
cagcaaacct cgaactcgac cccgtcgagc cgcagcaaga acaagagccc gaccaggtgg 1740
acgaggacga ggtccgcttg ttgtcaggaa caacagaagt tgcaggacta gccgagagtg 1800
ctaccactgc aattcttaga tccacagacg caagagcaga aaacttacaa ctgctcgcca 1860
caacacaaga accaccttca gatacaacca ggttcgagaa ctccacaagt ctagaagcag 1920
caacagctct agcagataat caaacaggtc cagaaaaagc tacgactaga agagaaatta 1980
tcgagtcgca acttgcaacc 2000
<210>79
<211>4683
<212>DNA
<213>Ulkenia sp.
<400>79
gcgagttata tctgtctaga aaacttggca tggctagcaa tttatgtcta gctattccat 60
acacacggta atgccagtag cctgttagtt atagctcttt tggttgttgt ctcacaatac 120
actgacatca gcagaacaaa atgaaagggg ccttggctac catgaaatca atacttcaaa 180
aggtctcttg gtttctttac tcgcatgtcg ctatttactt acattcctcg agtacataac 240
atatcataca tcaaagaaat taaaaagaaa acaaacattc aaatatgcat tactttccct 300
actgtactag taagtacgtt tctggtatta agttgttttt tctcaaaaga acaatgtgct 360
tacttgtaaa atccacagct gcttacttgt aagcctcaac tagttagtga tgtgattatc 420
ataaaatgtt cgacactgta cctcctttcc agctatcttc ctacacctcc tctgacgcag 480
gttgacggag gaggcgtggg ggttgattga agtgcaacac aacgttttgt ttaagatatt 540
ccttgccttg gccgactcca aatggatagc acagaagcct aatgataatt tgaattaatt 600
ttatttcgag cttatttaat gctcttatca gagtccgtag gtatctcttt tcctactaat 660
tgttgaaaaa ggatgttttg gacatagcag gtcatcatac tatttggttc catcaaattc 720
atatccattt ctttcgttca agtgcttccc ttcctactta ttatatatat tatatatcca 780
taaatgtaaa agagacgatt acgaatactt tgcatacatg tatagcgaaa cagagatggt 840
agcaaaagtt caccttcact aatctaagaa tctctccacg tgggtaaaaa cttcagcagt 900
aagattgtaa atgatgtcca agaacaaaac gtcatgctag tccaggggtt actgagctaa 960
cgattaataa tgtttcgtag tcttcctaat tgcaccatca aaacttgtct gcacaagttt 1020
taaagtattg gagcctttac tgaagaatca gaggacatag atggggcacg ttcgccttga 1080
aaaaaatagt cttctttacc tgcatggtgt tacaaacaaa aacgagttga aaatagctgt 1140
gcaaggaggc aaacatgatt ggaaaagaaa aacgagggga cccttataca ggagggcgcc 1200
acatagtaga atgagtagat tgttagagta gggtacgctt tatgtgattg attgaatggg 1260
cgagtgaaag ttgctgtcaa ggttctaaac aaaaggatgt ttgagtttgt gagtattgtt 1320
tgcggcaaaa agattcagta gagagaaatg cacaaaaaga taatacgtgt gtagggcgat 1380
tatggaggca tgcatttggg ggaaatcatc gcatgcgcat gagtttctcc atctgccgaa 1440
tctttgcaaa ggcattttca agctccattt gcatagcgta ggcttgctgc tcaaactgag 1500
cgcgctgatg cgccagattt tcttcatgtc ttttgttcaa actacgctca agaccctcaa 1560
gagccgcaac cttgagcttg cgttcctttt gctgaatctc cataactctt cgtttcacct 1620
ggagctcaat ttctgcagca tccgtggtct ttgcagcggc ctgtgcgtct tgtgcggcct 1680
gtgcgttgtt tgcgagctcc tttcgcagct cctccatctc cgcgttcttt ttctcctcca 1740
tccatttggc accgagtttg gcagcttgat cgatgcggcc cttgagaact tcttcgttct 1800
cctcaagttc tgcgatacgc gcgtgtaagc cgaggatctc ctccgagaca gcctcgccat 1860
tgatcattat ttcacttccc gagtcttgaa tgacaacatc agccttggtg ccaggttcac 1920
cggtatctcg ctcgcaaccc tgctggcgca tagacagcat aaggcgcgca ttatcctcac 1980
gcagatcatc cacctgttct gataaaagtt tgactgcctg ctcaagatta cgggggttca 2040
cttcgtgaaa aatttcttga aggtctcgaa gctcagaaag cttggcagag caagtgtgca 2100
tcgctctgca ctttttaaga cgtgcaagtg catcatcaag tttggcatta tttaccttca 2160
tggaggcttc agctacttcg gcttcttcga ttacaatttt ctgcagctct acaacatcat 2220
ggccaattaa cttgcgatgc agctcggcaa tcaccccatg catcttttcg gtatggcctg 2280
gacgcgcctc atcctgcgtt cttcggatct cctcctctag ttctcgattt agacgaaggg 2340
ctggtccaag gggcgggtaa ttagcctgag tcaagccaag ctctgttgct agtccaaggc 2400
agtcggaaag tcgcagccgg tccctatcag aaacagcctt ttgcaagtct acgctcaaac 2460
gcacttcttg agccttgcgc accatcttcg gttctgcctg tcgcagaagt ttcgagtcgt 2520
agccagcttg ccacgctagc acgatggcac gcgcaagtga cctcagttga ccgctgttca 2580
tggcagactt gagcaacatt ttgatttgca caaatacctc atctgattca tcatcttcag 2640
cttcctcaag ctctgcaggt gtcttgcgct ctccagagac ttgaagagca gggttcaaac 2700
cgccctccag gacctcgctc gcaagcgcct cctctgtctc agctttgcgc aatagcgcag 2760
cagcattctc cgccattgtg tttgtcactc acgagattaa tatcgttgcc agagtatacg 2820
gtaatgcgag ttaaggattc acagaatctc tcaaattaat cttttcacct aatgatatcc 2880
acaaaacgtt gcaatcgctc agcccaacga caagcgtgct tcttgtttta agactgcaac 2940
tgctcctttt tctattagtc aatatggacc gtcctccaaa cgtccagaaa atagcacaga 3000
atttaccagc agccgctgca gacaagaagt gcaagagagc aggcaagcaa gtgagggttt 3060
gagcaaatag gccaacctct ccacgcagaa ttctagggtc gcaaccggaa ctcacagtcc 3120
ttagaaaccg tgcgaagccc tgggctcaac ttcaatttgt ccacgggacc ttcagcaagc 3180
accaagctca gcagcgtgaa ggcaggcgct gaccacagtt tgagctcaga gggcttggtg 3240
tgcctcgcga ttgatattga agtcaattgc gcaggacggc agcaacggac caggtggtga 3300
agaaggtaat ctccagcgga gtgatgatgg agctcgaccg actactccgg aatcgaccag 3360
gggaggtgcg ggcgcccttc acaagcgggc gagaggcagg ggagagaagg ctcgactcca 3420
cgtcttgaag cgtgtacgtg tgcgcgctca cgcgtgcgac acgccggcaa gggcgcctta 3480
gtggcctgct gctgctgctg gtcgccacgc tgcgagccca agagatttga attgaactcg 3540
aagaaaataa ctatcattta tcaattccaa tcaatcaatg cattatgaag cacctctgaa 3600
gtgaactatt ctcctctcca atatacaaca aaaaacacac acagtgggtt ttaccctata 3660
acctattgtt ccgcgagcga tcaactactc tatagagcga atgaccagtt tttctttctt 3720
tctttctttc tttctttctt tctttctttc tttctttctt tctttctttc tttctttctg 3780
ttttcctatc taataacccc tttaatcgag gaaacctttc gatttaaaag gaaagctctg 3840
tctgtatata tctgttacag atactgctat catgccatgc agaaagaaac acaaaagaaa 3900
aacaaaagaa agagagaaag agagaaagaa agagagaaag aaagaaagaa agaaagaaag 3960
aagagctttt ctcaatcggt ttcctcatcg accgctcaca tatctacgat tgtggcaaag 4020
aaagaaagaa agaaagaagg aaagcctcag cagagtccgc acgaaagcct tcattgagcc 4080
accatgtcgt ggtccgctgc agtcagtgcc gcctctctgt gaattgagtg agtgagtgag 4140
tgagtgagtt ggttggttag ttagttagtg cctcttcagc tcaaagcctt tcacggtcgc 4200
tcttcgagcg tttgcttttt cataaacaaa taaacaaacc atcgaacgaa ccatcgaacg 4260
aacgaacaat ggtaccccag aatagacgga attaattgct aagtaaacca gtaacagtaa 4320
gttagtgttt ctgacctgag ccgttttctt tatttattcc tctcagctct gtgaagagaa 4380
tttgggatga aaagaaacgt ttttatttat ttaaaagttt agtaacaaga aaaacatggt 4440
ccctcttctt ccttcatgta aaaataagta agtaaaaaaa agaaaagaaa aaaaaaaaag 4500
cttttaaagt agtaaagcga ggtagagata aaagttcttt ctcagggctc ctagtaggca 4560
cttaggaggt acgtctaaga ccgcctcgtg ggaagaaaag agaaaacaag aagagaaaag 4620
agagagagaa acagcgctga cccgagaggc tcatgcgcag agcccaaatc tgcccaactt 4680
tgg 4683
<210>80
<211>1848
<212>PRT
<213>Ulkenia sp.
<400>80
Met Leu Val Ile Gly Ala Leu Ala Arg Ala Leu Tyr Gly Ala Trp Arg
1 5 10 15
Cys Thr Gly Arg Ala Arg Glu Gly Thr Gly Ser Arg Glu Ala Leu Leu
20 25 30
Gly Gly Ala Glu Arg Glu Gly Glu Gly Val Leu Cys Asp Ala Arg Gly
35 40 45
Asp Leu Gly Ala Ala Ala Arg Cys Ser Ser Arg Asp Leu Gly Arg Glu
50 55 60
Ser Lys Pro Ser Ala Glu Glu Met Val Glu Gly Pro Arg Ile Gly Ser
65 70 75 80
Arg Glu Glu Glu Ile Glu Glu Glu Glu Glu Ile Glu Glu Glu Glu Ile
85 90 95
Glu Glu Glu Glu Ile Glu Glu Glu Glu Glu Asp Gly Gln Ala Gly Lys
100 105 110
Met Glu Lys Gly Leu Ala Ala Gly Lys Gln Glu Asn Val Asn Leu Gly
115 120 125
Leu Asn Phe Gly Leu Asn Leu Asn Val Glu Asn Glu Gly Leu Asn Leu
130 135 140
Ser Leu Asn Leu Lys Glu Asn Leu Arg Lys Glu Ser Leu Val Glu Ser
145 150 155 160
Glu Lys Glu Lys Asn Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu
165 170 175
Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu
180 185 190
Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu
195 200 205
Lys Glu Lys Glu Lys Glu Glu Glu Lys Glu Glu Glu Lys Glu Lys Glu
210 215 220
Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Glu Gly Asp Leu Lys Ser
225 230 235 240
Cys Leu Val Glu Lys Gly Glu Gly Gly Arg Ser Ser Asp Ser Gly Arg
245 250 255
Arg Arg Ser Ser Cys Cys Lys Arg Gly Thr Glu Ala Val Ala Val Glu
260 265 270
Gln Ala Glu Ala Thr Ala Asn Leu Glu Leu Asp Pro Val Glu Pro Gln
275 280 285
Gln Glu Gln Glu Pro Asp Gln Val Asp Glu Asp Glu Val Arg Leu Leu
290 295 300
Ser Gly Thr Thr Glu Val Ala Gly Leu Ala Glu Ser Ala Thr Thr Ala
305 310 315 320
Ile Leu Arg Ser Thr Asp Ala Arg Ala Glu Asn Leu Gln Leu Leu Ala
325 330 335
Thr Thr Gln Glu Pro Pro Ser Asp Thr Thr Arg Phe Glu Asn Ser Thr
340 345 350
Ser Leu Glu Ala Ala Thr Ala Leu Ala Asp Asn Gln Thr Gly Pro Glu
355 360 365
Lys Ala Thr Thr Arg Arg Glu Ile Ile Glu Ser Gln Leu Ala Thr Met
370 375 380
Ala Thr Arg Val Lys Thr Asn Lys Lys Pro Cys Trp Glu Met Thr Lys
385 390 395 400
Glu Glu Leu Thr Ser Gly Lys Asn Val Val Phe Asp Tyr Asp Glu Leu
405 410 415
Leu Glu Phe Ala Glu Gly Asp Ile Ser Lys Val Phe Gly Pro Glu Phe
420 425 430
Ser Gln Ile Asp Gln Tyr Lys Arg Arg Val Arg Leu Pro Ala Arg Glu
435 440 445
Tyr Leu Leu Val Thr Arg Val Thr Leu Met Asp Ala Glu Val Asn Asn
450 455 460
Tyr Arg Val Gly Ala Arg Met Val Thr Glu Tyr Asp Leu Pro Val Asn
465 470 475 480
Gly Glu Leu Ser Glu Gly Gly Asp Cys Pro Trp Ala Val Leu Val Glu
485 490 495
Ser Gly Gln Cys Asp Leu Met Leu Ile Ser Tyr Met Gly Ile Asp Phe
500 505 510
Gln Asn Lys Ser Asp Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu Thr
515 520 525
Phe Tyr Gly Val Ala Gln Glu Gly Glu Thr Leu Glu Tyr Asp Ile Arg
530 535 540
Val Thr Gly Phe Ala Lys Arg Leu Asp Gly Asp Ile Ser Met Phe Phe
545 550 555 560
Phe Glu Tyr Asp Cys Tyr Val Asn Gly Arg Leu Leu Ile Glu Met Arg
565 570 575
Asp Gly Cys Ala Gly Phe Phe Thr Asn Glu Glu Leu Ala Ala Gly Lys
580 585 590
Gly Val Val Phe Thr Arg Ala Asp Leu Leu Ala Arg Glu Lys Thr Lys
595 600 605
Lys Gln Asp Ile Thr Pro Tyr Ala Ile Ala Pro Arg Leu Asn Lys Thr
610 615 620
Val Leu Asn Glu Thr Glu Met Gln Ser Leu Val Asp Lys Asn Trp Thr
625 630 635 640
Lys Val Phe Gly Pro Glu Asn Gly Met Asp Gln Ile Asn Tyr Lys Leu
645 650 655
Cys Ala Arg Lys Met Leu Met Ile Asp Arg Val Thr Lys Ile Asp Tyr
660 665 670
Thr Gly Gly Pro Tyr Gly Leu Gly Leu Leu Val Gly Glu Lys Ile Leu
675 680 685
Glu Arg Asp His Trp Tyr Phe Pro Cys His Phe Val Gly Asp Gln Val
690 695 700
Met Ala Gly Ser Leu Val Ser Asp Gly Cys Ser Gln Leu Leu Lys Met
705 710 715 720
Tyr Met Leu Trp Leu Gly Leu His Leu Lys Thr Gly Pro Phe Asp Phe
725 730 735
Arg Pro Val Asn Gly His Pro Asn Lys Val Arg Cys Arg Gly Gln Ile
740 745 750
Ser Pro His Lys Gly Lys Leu Val Tyr Val Met Glu Ile Lys Glu Met
755 760 765
Gly Tyr Asp Glu Ala Gly Asp Pro Tyr Ala Ile Ala Asp Val Asn Ile
770 775 780
Leu Asp Ile Asp Phe Glu Lys Gly Gln Thr Phe Asp Leu Ala Asn Leu
785 790 795 800
His Glu Tyr Gly Lys Gly Asp Leu Asn Lys Lys Ile Val Val Asp Phe
805 810 815
Lys Gly Ile Ala Leu Lys Leu Gln Lys Arg Ser Gly Pro Ala Val Val
820 825 830
Ala Pro Glu Lys Pro Leu Ala Leu Asn Lys Asp Leu Cys Ala Pro Ala
835 840 845
Val Glu Ala Ile Pro Glu His Ile Leu Lys Gly Asp Ala Leu Ala Pro
850 855 860
Asn Gln Met Thr Trp His Pro Met Ser Lys Ile Ala Gly Asn Pro Thr
865 870 875 880
Pro Ser Phe Ser Pro Ser Ala Tyr Pro Pro Arg Pro Ile Thr Phe Thr
885 890 895
Pro Phe Pro Gly Asn Lys Asn Asp Asn Asn His Val Pro Gly Glu Met
900 905 910
Pro Leu Ser Trp Tyr Asn Met Ala Glu Phe Met Ala Gly Lys Val Ser
915 920 925
Leu Cys Leu Gly Pro Glu Phe Ala Lys Phe Asp Asp Ser Asn Thr Ser
930 935 940
Arg Ser Pro Ala Trp Asp Leu Ala Leu Val Thr Arg Val Val Ser Val
945 950 955 960
Ser Asp Met Glu Trp Val Gln Trp Lys Asn Val Asp Cys Asn Pro Ser
965 970 975
Lys Gly Thr Met Val Gly Glu Phe Asp Cys Pro Ile Asp Ala Trp Phe
980 985 990
Phe Gln Gly Ser Cys Asn Asp Gly His Met Pro Tyr Ser Ile Leu Met
995 1000 1005
Glu Ile Ala Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu Lys
1010 1015 1020
Ala Pro Leu Thr Met Glu Lys Lys Asp Ile Leu Phe Arg Asn Leu
1025 1030 1035
Asp Ala Asn Ala Glu Met Val Arg Ser Asp Ile Asp Leu Arg Gly
1040 1045 1050
Lys Thr Ile His Asn Leu Thr Lys Cys Thr Gly Tyr Ser Met Leu
1055 1060 1065
Gly Asp Met Gly Val His Arg Phe Ser Phe Glu Leu Ser Val Asp
1070 1075 1080
Gly Val Val Phe Tyr Lys Gly Thr Thr Ser Phe Gly Trp Phe Val
1085 1090 1095
Pro Glu Val Phe Ile Ser Gln Thr Gly Leu Asp Asn Gly Arg Arg
1100 1105 1110
Thr Gln Pro Trp His Ile Glu Ser Lys Val Pro Ser Ala Gln Val
1115 1120 1125
Leu Thr Tyr Asp Val Thr Pro Asn Gly Ala Gly Arg Thr Gln Leu
1130 1135 1140
Tyr Ala Asn Ala Pro Lys Gly Ala Gln Leu Thr Arg Arg Trp Asn
1145 1150 1155
Gln Cys Gln Tyr Leu Asp Thr Ile Asp Leu Val Val Ala Gly Gly
1160 1165 1170
Ser Ala Gly Leu Gly Tyr Gly His Gly Arg Lys Gln Val Asn Pro
1175 1180 1185
Lys Asp Trp Phe Phe Ser Cys His Phe Trp Phe Asp Ser Val Met
1190 1195 1200
Pro Gly Ser Leu Gly Val Glu Ser Met Phe Gln Leu Val Glu Ser
1205 1210 1215
Ile Ala Val Lys Gln Asp Leu Ala Gly Lys Tyr Gly Ile Thr Asn
1220 1225 1230
Pro Thr Phe Ala His Ala Pro Gly Lys Ile Ser Trp Lys Tyr Arg
1235 1240 1245
Gly Gln Leu Thr Pro Thr Ser Lys Phe Met Asp Ser Glu Ala His
1250 1255 1260
Ile Val Ser Ile Glu Ala His Asp Gly Val Val Asp Ile Val Ala
1265 1270 1275
Asn Gly Asn Leu Trp Ala Asp Gly Leu Arg Val Tyr Asn Val Ser
1280 1285 1290
Asn Ile Arg Val Arg Ile Val Ala Gly Ala Ala Pro Ala Ala Ala
1295 1300 1305
Ala Ala Ala Ala Ala Val Ala Ala Pro Ala Ala Ala Pro Ala Pro
1310 1315 1320
Val Ala Ala Ser Gly Pro Ala Gln Thr Ile Thr Leu Lys Gln Leu
1325 1330 1335
Lys Ala Glu Leu Leu Asp Val Glu Lys Pro Leu Tyr Ile Ser Ser
1340 1345 1350
Ser Asn Gly Gln Val Lys Lys His Ala Asp Val Ala Gly Gly Gln
1355 1360 1365
Ala Thr Ile Val Gln Ala Cys Ser Leu Ser Asp Leu Gly Asp Glu
1370 1375 1380
Gly Phe Met Lys Thr Tyr Gly Val Val Ala Pro Leu Tyr Thr Gly
1385 1390 1395
Ala Met Ala Lys Gly Ile Ala Ser Ala Asp Leu Val Ile Ala Thr
1400 1405 1410
Gly Lys Arg Lys Ile Leu Gly Ser Phe Gly Ala Gly Gly Leu Pro
1415 1420 1425
Met His Ile Val Arg Ala Ala Val Glu Lys Ile Gln Ala Glu Leu
1430 1435 1440
Pro Asn Gly Pro Phe Ala Val Asn Leu Ile His Ser Pro Phe Asp
1445 1450 1455
Ser Asn Leu Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly
1460 1465 1470
Val Thr Val Val Glu Ala Ser Ala Phe Met Thr Leu Thr Pro Gln
1475 1480 1485
Val Val Arg Tyr Arg Ala Ala Gly Leu Ser Arg Asn Ala Asp Gly
1490 1495 1500
Ser Ile Asn Ile Lys Asn Arg Ile Ile Gly Lys Val Ser Arg Thr
1505 1510 1515
Glu Leu Ala Glu Met Phe Ile Arg Pro Ala Pro Gln Asn Leu Leu
1520 1525 1530
Asp Lys Leu Ile Gln Ser Gly Glu Ile Thr Lys Glu Gln Ala Glu
1535 1540 1545
Leu Ala Lys Leu Val Pro Val Ala Asp Asp Ile Ala Val Glu Ala
1550 1555 1560
Asp Ser Gly Gly His Thr Asp Asn Arg Pro Ile His Val Ile Leu
1565 1570 1575
Pro Leu Ile Ile Asn Leu Arg Asn Arg Leu His Lys Glu Cys Gly
1580 1585 1590
Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly Gly Gly Val
1595 1600 1605
Gly Cys Pro Gln Ala Ala Ala Ala Ala Leu Ala Met Gly Ala Ala
1610 1615 1620
Phe Leu Val Thr Gly Thr Val Asn Gln Val Ala Lys Gln Ser Gly
1625 1630 1635
Thr Cys Asp Asn Val Arg Lys Gln Leu Cys Met Ala Thr Tyr Ser
1640 1645 1650
Asp Val Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val
1655 1660 1665
Lys Leu Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala
1670 1675 1680
Asn Lys Leu Tyr Glu Leu Phe Cys Lys Tyr Asp Ser Phe Glu Ser
1685 1690 1695
Met Pro Ala Thr Glu Leu Glu Arg Val Glu Lys Arg Ile Phe Gln
1700 1705 1710
Cys Pro Leu Ala Asp Val Trp Ala Glu Thr Ser Asp Phe Tyr Ile
1715 1720 1725
Asn Arg Leu His Asn Pro Glu Lys Ile Thr Arg Ala Glu Arg Asp
1730 1735 1740
Pro Lys Leu Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Gly Leu
1745 1750 1755
Ala Ser Arg Trp Ala Asn Thr Gly Glu Ala Gly Arg Val Met Asp
1760 1765 1770
Tyr Gln Val Trp Cys Gly Pro Ala Ile Gly Ala Phe Asn Asp Phe
1775 1780 1785
Ile Lys Gly Ser Tyr Leu Asp Pro Ala Val Ser Gly Glu Tyr Pro
1790 1795 1800
Asp Val Val Gln Ile Asn Leu Gln Ile Leu Arg Gly Ala Cys Tyr
1805 1810 1815
Leu Arg Arg Leu Asn Val Ile Arg Asn Asp Pro Arg Val Ser Ile
1820 1825 1830
Glu Val Glu Asp Ala Glu Phe Val Tyr Glu Pro Thr Asn Ala Leu
1835 1840 1845
<210>81
<211>18
<212>DNA
<213>Artificial Sequence
<400>81
ctcggcattg actccatc 18
<210>82
<211>18
<212>DNA
<213>Artificial Sequence
<400>82
GAGAATCTCG ACACGCTT 18
<210>83
<211>21
<212>DNA
<213>Artificial Sequence
<400>83
ATTACTCCTC TCTGCATCCG T 21
<210>84
<211>21
<212>DNA
<213>Artificial Sequence
<400>84
GCCGAAGACA GCATCAAACT C 21
<210>85
<211>21
<212>DNA
<213>Artificial Sequence
<400>85
GTCGAGAGTG GCCAGTGCGA T 21
<210>86
<211>21
<212>DNA
<213>Artificial Sequence
<400>86
AAAGTGGCAG GGAAAGTACC A 21
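Each of the six primer entries above (SEQ ID No. 81 to 86) declares its length under <211>. A minimal sketch of a consistency check, assuming only the sequences and lengths printed in those entries: it strips the block spacing from each <400> line, compares the base count with the declared length, and derives the reverse complement and GC fraction. The helper names (`normalize`, `reverse_complement`, `gc_fraction`) are illustrative and do not come from the patent.

```python
# Primer sequences copied from the <400> lines of SEQ ID No. 81-86,
# paired with the lengths declared on the matching <211> lines.
PRIMERS = {
    81: ("ctcggcattg actccatc", 18),
    82: ("GAGAATCTCG ACACGCTT", 18),
    83: ("ATTACTCCTC TCTGCATCCG T", 21),
    84: ("GCCGAAGACA GCATCAAACT C", 21),
    85: ("GTCGAGAGTG GCCAGTGCGA T", 21),
    86: ("AAAGTGGCAG GGAAAGTACC A", 21),
}

# Translation table for the four unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def normalize(raw):
    """Remove the 10-base block spacing and upper-case the sequence."""
    return "".join(raw.split()).upper()


def reverse_complement(seq):
    """Return the reverse complement of an upper-case A/C/G/T string."""
    return seq.translate(_COMPLEMENT)[::-1]


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


for seq_id, (raw, declared_length) in PRIMERS.items():
    seq = normalize(raw)
    # The declared <211> length should equal the number of bases listed.
    length_ok = len(seq) == declared_length
    print(seq_id, seq, reverse_complement(seq),
          round(gc_fraction(seq), 2), "length ok" if length_ok else "LENGTH MISMATCH")
```

The check only confirms that the listing is internally consistent; it says nothing about where the primers bind or how they were used.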
Claims (17)
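1. PUFA-PKS, characterized in that they
a. comprise at least one of the amino acid sequences shown in SEQ ID No. 6 (ORF 1), 7 (ORF 2), 8 and/or 80 (ORF 3), as well as homologous sequences having at least 70%, preferably 80%, particularly preferably at least 90%, more particularly preferably at least 99% and most preferably 100% sequence homology with them, said homologous sequences having the biological activity of at least one domain of the PUFA-PKS, or
b. comprise at least one of the amino acid sequences shown in SEQ ID No. 32, 34, 45, 58, 59, 60, 61, 72, 74 and/or 77, as well as homologous sequences having at least 70%, preferably 80%, particularly preferably at least 90%, more particularly preferably at least 99% and most preferably 100% sequence homology with them, said homologous sequences having the biological activity of at least one domain of the PUFA-PKS.
2. An isolated PUFA-PKS according to claim 1 having 10 or more ACP domains.
3. The PUFA-PKS according to any one of the preceding claims, characterized in that it comprises at least one amino acid sequence which has at least 70%, preferably at least 80%, particularly preferably at least 90% and more particularly preferably at least 99% sequence homology with at least 500 directly consecutive amino acids of the sequences SEQ ID No. 6 (ORF 1), 7 (ORF 2) and/or 8 and/or 80 (ORF 3), and which has the biological activity of at least one domain of the PUFA-PKS.
4. An amino acid sequence which has at least 70%, preferably at least 80%, particularly preferably at least 90% and more particularly preferably at least 99% identity with at least 500 directly consecutive amino acids of SEQ ID No. 6 (ORF 1), 7 (ORF 2) and/or 8 and/or 80 (ORF 3), and which has the biological activity of at least one domain of the PUFA-PKS.
5. An isolated DNA molecule which encodes an amino acid sequence according to any one of the preceding claims, and the DNA fully complementary to it.
6. The isolated DNA molecule according to claim 5, characterized in that it has at least 70%, preferably at least 80%, particularly preferably at least 90% and more particularly preferably at least 95% identity with at least 500 directly consecutive nucleotides from SEQ ID No. 3, 4 and 5 and/or 9.
7. The DNA molecule according to claim 5 or 6, characterized in that it encodes an amino acid sequence having at least 70% homology with at least 500 directly consecutive amino acids of the sequences SEQ ID No. 6 (ORF 1), 7 (ORF 2) and/or 8 and/or 80 (ORF 3).
8. A recombinant DNA molecule comprising a DNA molecule according to one of claims 5, 6 and/or 7, functionally linked to at least one DNA sequence that controls transcription, said DNA sequence preferably being selected from SEQ ID No. XX-YY (terminator/promoter), or parts thereof of at least 500 nucleotides, and functional variants thereof.
9. A recombinant host cell comprising a recombinant DNA molecule according to claim 8.
10. The recombinant host cell according to claim 9, which endogenously expresses a PUFA-PKS according to claim 1 having at least one further PUFA-PKS domain activity.
11. A recombinant host cell comprising a recombinant DNA construct, wherein the elements controlling translation are selected from SEQ ID No. XX-YY (terminator/promoter), or parts thereof of at least 500 nucleotides, and functional variants thereof.
12. A method for producing an oil containing PUFA, preferably DHA, comprising culturing a host cell according to claim 9 or 10.
13. An oil produced by the method according to claim 12.
14. A method for producing a biomass containing PUFA, preferably DHA, comprising culturing a host cell according to claim 9 or 10.
15. A biomass produced by the method according to claim 14.
16. The recombinant biomass according to claim 15, comprising a nucleic acid according to claim 8 and/or an amino acid sequence according to claim 1, or a part of at least 50 consecutive amino acids homologous to them.
17. Use of individual enzyme domains, from SEQ ID No. 6, 7, 8 and/or 80 comprising the PUFA-PKS, for producing artificial polyketides.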
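Claims 3 and 4 set their thresholds (70%, 80%, 90%, 99%) over a stretch of at least 500 directly consecutive amino acids. The claims do not say how the comparison is to be computed, so the following is only a rough sketch under a simplifying assumption: the two sequences are compared without gaps, and "homology" is treated as plain positional identity. The function names and the toy sequences are hypothetical; a real analysis would normally rely on an established alignment tool.

```python
# Identity thresholds named in claims 3 and 4, expressed as fractions.
CLAIM_THRESHOLDS = (0.70, 0.80, 0.90, 0.99)


def best_window_identity(a, b, window=500):
    """Highest fraction of identical positions over any ungapped stretch of
    `window` consecutive residues shared by sequences `a` and `b`.

    Returns 0.0 when no stretch of that length fits in both sequences.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    best = 0.0
    # Each shift is one diagonal: position i of `a` faces position i - shift of `b`.
    for shift in range(-(len(b) - window), len(a) - window + 1):
        start_a = max(shift, 0)
        start_b = max(-shift, 0)
        overlap = min(len(a) - start_a, len(b) - start_b)
        if overlap < window:
            continue
        matches = [a[start_a + i] == b[start_b + i] for i in range(overlap)]
        # Sliding-window count of matches along this diagonal.
        count = sum(matches[:window])
        best = max(best, count / window)
        for i in range(window, overlap):
            count += matches[i] - matches[i - window]
            best = max(best, count / window)
    return best


def thresholds_met(identity, thresholds=CLAIM_THRESHOLDS):
    """List the thresholds that the given identity fraction reaches."""
    return [t for t in thresholds if identity >= t]


# Toy example with a 10-residue window; the claims use at least 500 residues.
reference = "ACDEFGHIKL"
candidate = "MMACDEFGHIKMWW"
identity = best_window_identity(reference, candidate, window=10)
print(identity, thresholds_met(identity))
```

Because gaps are not modelled, a single insertion or deletion lowers the score sharply, so the sketch tends to underestimate what a gapped alignment would report.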
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE102004017370A DE102004017370A1 (de) | 2004-04-08 | 2004-04-08 | PUFA-PKS genes from Ulkenia |
DE102004017370.2 | 2004-04-08 |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410175661.8A Division CN103981156A (zh) | 2004-04-08 | 2005-04-08 | PUFA-PKS genes from Ulkenia |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101087882A true CN101087882A (zh) | 2007-12-12 |
Family
ID=35062272
Family Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2005800188787A Pending CN101087882A (zh) | 2004-04-08 | 2005-04-08 | PUFA-PKS genes from Ulkenia |
CN201410175661.8A Pending CN103981156A (zh) | 2004-04-08 | 2005-04-08 | PUFA-PKS genes from Ulkenia |
Family Applications After (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410175661.8A Pending CN103981156A (zh) | 2004-04-08 | 2005-04-08 | PUFA-PKS genes from Ulkenia |
Country Status (11)
Country | Link |
---|---|
US (1) | US7939305B2 (zh) |
EP (1) | EP1733029A2 (zh) |
JP (2) | JP2007532104A (zh) |
KR (2) | KR20130114225A (zh) |
CN (2) | CN101087882A (zh) |
AU (1) | AU2005231964B2 (zh) |
BR (1) | BRPI0509747A (zh) |
CA (1) | CA2563427A1 (zh) |
DE (1) | DE102004017370A1 (zh) |
IL (1) | IL178613A0 (zh) |
WO (1) | WO2005097982A2 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
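CN108753810A (zh) * | 2018-05-22 | 2018-11-06 | Kunming University of Science and Technology | Use of the transcriptional regulatory protein gene orf2 |
WO2018219171A1 (zh) * | 2017-05-31 | 2018-12-06 | Xiamen Huison Biotech Co., Ltd. | Strain of bacteria producing DHA and EPA, six gene fragments in the bacterial genome and their applications |
CN112567019A (zh) * | 2018-08-10 | 2021-03-26 | Kyowa Hakko Bio Co., Ltd. | Microorganism producing polyunsaturated fatty acids and method for producing polyunsaturated fatty acids |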
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5340742A (en) * | 1988-09-07 | 1994-08-23 | Omegatech Inc. | Process for growing thraustochytrium and schizochytrium using non-chloride salts to produce a microfloral biomass having omega-3-highly unsaturated fatty acids |
US8003772B2 (en) | 1999-01-14 | 2011-08-23 | Martek Biosciences Corporation | Chimeric PUFA polyketide synthase systems and uses thereof |
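KR20090064603A (ko) | 2000-01-28 | 2009-06-19 | Martek Biosciences Corporation | Method for enhanced production of lipids containing highly unsaturated fatty acids by high-density culture of eukaryotic microorganisms in a fermentor |
BRPI0510132A (pt) | 2004-04-22 | 2007-10-02 | Commw Scient Ind Res Org | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
CN102559364B (zh) | 2004-04-22 | 2016-08-17 | Commonwealth Scientific and Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
JP2009529891A (ja) | 2006-03-15 | 2009-08-27 | Martek Biosciences Corporation | Plant seed oils containing polyunsaturated fatty acids |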
EP2059588A4 (en) | 2006-08-29 | 2010-07-28 | Commw Scient Ind Res Org | FATTY ACID SYNTHESIS |
CN114045301A (zh) | 2008-11-18 | 2022-02-15 | Commonwealth Scientific and Industrial Research Organisation | Enzymes and methods for producing omega-3 fatty acids |
EP3192871B1 (en) | 2009-03-19 | 2019-01-23 | DSM IP Assets B.V. | Polyunsaturated fatty acid synthase nucleic acid molecules and polypeptides, compositions, and methods of making and uses thereof |
CA2823678A1 (en) | 2010-10-01 | 2012-04-05 | Kyushu University, National University Corporation | Transformation of a stramenopile for production of a microbial oil |
US8816111B2 (en) | 2012-06-15 | 2014-08-26 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising polyunsaturated fatty acids |
CN104726473B (zh) | 2013-12-18 | 2020-02-14 | Commonwealth Scientific and Industrial Research Organisation | Extracted plant lipid comprising docosahexaenoic acid |
ES2721269T3 (es) | 2014-01-28 | 2019-07-30 | Dsm Ip Assets Bv | Factors for the production and accumulation of polyunsaturated fatty acids (PUFAs) obtained with PUFA synthases |
CN105219789B (zh) | 2014-06-27 | 2023-04-07 | Commonwealth Scientific and Industrial Research Organisation | Extracted plant lipid comprising docosapentaenoic acid |
HUE051749T2 (hu) * | 2015-03-02 | 2021-03-29 | Conagen Inc | Regulatory elements from labyrinthulomycetes microorganisms |
CA3017225A1 (en) | 2016-03-16 | 2017-09-21 | Synthetic Genomics, Inc. | Production of proteins in labyrinthulomycetes |
KR102442450B1 (ko) | 2016-05-12 | 2022-09-14 | DSM IP Assets B.V. | Method for increasing omega-3 polyunsaturated fatty acid production in microalgae |
JOP20170154B1 (ar) | 2016-08-01 | 2023-03-28 | Omeros Corp | Compositions and methods of inhibiting MASP-3 for the treatment of various diseases and disorders |
US10633454B2 (en) | 2016-11-01 | 2020-04-28 | Conagen Inc. | Expression of modified glycoproteins and glycopeptides |
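EP3835410A4 (en) * | 2018-08-10 | 2022-05-18 | Kyowa Hakko Bio Co., Ltd. | EICOSAPENTAENOIC ACID PRODUCING MICROORGANISM AND PROCESS FOR PRODUCTION OF EICOSAPENTAENOIC ACID |
CN110577921B (zh) * | 2019-05-28 | 2021-04-02 | Zhejiang University of Technology | Recombinant Streptomyces nodosus producing amphotericin B and use thereof |
CN114107074B (zh) * | 2021-11-18 | 2024-04-09 | Xiamen University | Construction method and use of a genetically engineered Schizochytrium strain overexpressing a 3-ketoacyl synthase gene |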
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5798259A (en) | 1992-05-15 | 1998-08-25 | Sagami Chemical Research Center | Gene coding for eicosapentaenoic acid synthesizing enzymes and process for production of eicosapentaenoic acid |
DK0894142T4 (da) * | 1996-03-28 | 2014-02-24 | Dsm Ip Assets Bv | Microbial oil comprising a polyunsaturated fatty acid and process for producing oil from pasteurised and granulated biomass. |
US6566583B1 (en) * | 1997-06-04 | 2003-05-20 | Daniel Facciotti | Schizochytrium PKS genes |
TWI337619B (en) * | 2001-04-16 | 2011-02-21 | Martek Biosciences Corp | Pufa polyketide synthase systems and uses thereof |
- 2004
  - 2004-04-08 DE DE102004017370A patent/DE102004017370A1/de not_active Withdrawn
- 2005
  - 2005-04-08 CN CNA2005800188787A patent/CN101087882A/zh active Pending
  - 2005-04-08 JP JP2007506732A patent/JP2007532104A/ja active Pending
  - 2005-04-08 EP EP05751638A patent/EP1733029A2/de not_active Ceased
  - 2005-04-08 KR KR1020137020015A patent/KR20130114225A/ko not_active Application Discontinuation
  - 2005-04-08 CA CA002563427A patent/CA2563427A1/en not_active Abandoned
  - 2005-04-08 CN CN201410175661.8A patent/CN103981156A/zh active Pending
  - 2005-04-08 AU AU2005231964A patent/AU2005231964B2/en not_active Ceased
  - 2005-04-08 BR BRPI0509747-9A patent/BRPI0509747A/pt not_active IP Right Cessation
  - 2005-04-08 US US11/547,921 patent/US7939305B2/en not_active Expired - Fee Related
  - 2005-04-08 KR KR1020067023437A patent/KR101484097B1/ko not_active IP Right Cessation
  - 2005-04-08 WO PCT/EP2005/003701 patent/WO2005097982A2/de active Application Filing
- 2006
  - 2006-10-15 IL IL178613A patent/IL178613A0/en unknown
- 2012
  - 2012-07-27 JP JP2012167374A patent/JP2012205595A/ja active Pending
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018219171A1 (zh) * | 2017-05-31 | 2018-12-06 | Xiamen Huison Biotech Co., Ltd. | Strain of bacteria producing DHA and EPA, six gene fragments in the bacterial genome and their applications |
US10941185B2 (en) | 2017-05-31 | 2021-03-09 | Xiamen Huison Biotech Co., Ltd. | Strain of bacteria producing DHA and EPA, six gene fragments in the bacterial genome and their applications |
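CN108753810A (zh) * | 2018-05-22 | 2018-11-06 | Kunming University of Science and Technology | Use of the transcriptional regulatory protein gene orf2 |
CN108753810B (zh) * | 2018-05-22 | 2021-06-18 | Kunming University of Science and Technology | Use of the transcriptional regulatory protein gene orf2 |
CN112567019A (zh) * | 2018-08-10 | 2021-03-26 | Kyowa Hakko Bio Co., Ltd. | Microorganism producing polyunsaturated fatty acids and method for producing polyunsaturated fatty acids |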
Also Published As
Publication number | Publication date |
---|---|
DE102004017370A1 (de) | 2005-10-27 |
AU2005231964A1 (en) | 2005-10-20 |
KR20070056002A (ko) | 2007-05-31 |
CA2563427A1 (en) | 2005-10-20 |
WO2005097982A3 (de) | 2007-04-05 |
JP2012205595A (ja) | 2012-10-25 |
AU2005231964B2 (en) | 2012-03-08 |
BRPI0509747A (pt) | 2007-09-25 |
US7939305B2 (en) | 2011-05-10 |
US20090093033A1 (en) | 2009-04-09 |
KR20130114225A (ko) | 2013-10-16 |
JP2007532104A (ja) | 2007-11-15 |
KR101484097B1 (ko) | 2015-01-23 |
CN103981156A (zh) | 2014-08-13 |
IL178613A0 (en) | 2007-02-11 |
WO2005097982A2 (de) | 2005-10-20 |
EP1733029A2 (de) | 2006-12-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101484097B1 (ko) | PUFA-PKS genes from Ulkenia | |
AU2018203835B2 (en) | Recombinant dna constructs and methods for modulating expression of a target gene | |
KR102184432B1 (ko) | Production of DHA and other LC-PUFAs in plants | |
AU2020223681B2 (en) | Plant regulatory elements and uses thereof | |
KR101524398B1 (ko) | Production of polyunsaturated fatty acids in heterologous organisms using PUFA polyketide synthase systems | |
AU2021225152A1 (en) | Isolated polypeptides and polynucleotides useful for increasing nitrogen use efficiency, abiotic stress tolerance, yield and biomass in plants | |
KR102219621B1 (ko) | Fluorescence-activated cell sorting (FACS) enrichment for generating plants | |
KR101539470B1 (ko) | Chimeric PUFA polyketide synthase systems and uses thereof | |
KR20180127526A (ko) | Production of DHA and other LC-PUFAs in plants | |
KR20070084187A (ko) | PUFA polyketide synthase systems and uses thereof | |
BRPI0618965A2 (pt) | Nucleic acid construct, isolated polypeptide comprising an amino acid sequence, plant cell comprising an exogenous polynucleotide, method for increasing the tolerance of a plant to a stress condition, method for increasing the biomass, vigor and/or yield of a plant, method for increasing the fertilizer use efficiency and/or uptake of a plant, and plant cell | |
CN113366009A (zh) | Bidirectional multi-enzyme scaffolds for the biosynthesis of cannabinoids | |
KR20170116034A (ko) | Sex determination genes and their use in breeding | |
KR20170099884A (ko) | Materials and methods for PUFA production, and PUFA-containing compositions | |
RU2728854C2 (ru) | Production of omega-3 long-chain polyunsaturated fatty acids from oilseed crops using thraustochytrid PUFA synthases | |
CN1352680A (zh) | Compositions isolated from plant cells and their use in the modification of plant cell signalling | |
AU2020210193B2 (en) | Isolated polynucleotides and polypeptides, and methods of using same for increasing plant yield and/or agricultural characteristics | |
AU2017204404B2 (en) | Isolated Polynucleotides and Polypeptides, and Methods of Using Same for Increasing Plant Yield and/or Agricultural Characteristics | |
KR20230079107A (ko) | Genetically modified Methylobacillus bacteria having improved properties |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20071212 |