CN106661556A - 烟草蛋白酶基因 - Google Patents
烟草蛋白酶基因 Download PDFInfo
- Publication number
- CN106661556A CN106661556A CN201580038165.0A CN201580038165A CN106661556A CN 106661556 A CN106661556 A CN 106661556A CN 201580038165 A CN201580038165 A CN 201580038165A CN 106661556 A CN106661556 A CN 106661556A
- Authority
- CN
- China
- Prior art keywords
- plant
- tobacco
- sequence
- seq
- protease
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 235000002637 Nicotiana tabacum Nutrition 0.000 title claims abstract description 314
- 244000061176 Nicotiana tabacum Species 0.000 title claims abstract description 252
- 108091005804 Peptidases Proteins 0.000 title claims abstract description 116
- 239000000463 material Substances 0.000 claims abstract description 33
- 239000000796 flavoring agent Substances 0.000 claims abstract description 22
- 235000019634 flavors Nutrition 0.000 claims abstract description 22
- 241000208125 Nicotiana Species 0.000 claims abstract description 15
- 241000196324 Embryophyta Species 0.000 claims description 432
- 108090000623 proteins and genes Proteins 0.000 claims description 202
- 102000040430 polynucleotide Human genes 0.000 claims description 178
- 108091033319 polynucleotide Proteins 0.000 claims description 178
- 239000002157 polynucleotide Substances 0.000 claims description 178
- 238000000034 method Methods 0.000 claims description 151
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 131
- 229920001184 polypeptide Polymers 0.000 claims description 115
- 230000014509 gene expression Effects 0.000 claims description 114
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 113
- 230000035772 mutation Effects 0.000 claims description 109
- 239000004365 Protease Substances 0.000 claims description 93
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims description 92
- 230000000694 effects Effects 0.000 claims description 76
- 239000005418 vegetable material Substances 0.000 claims description 47
- 231100000350 mutagenesis Toxicity 0.000 claims description 43
- 108090000790 Enzymes Proteins 0.000 claims description 42
- 238000002703 mutagenesis Methods 0.000 claims description 42
- 102000004190 Enzymes Human genes 0.000 claims description 41
- 230000008859 change Effects 0.000 claims description 41
- 239000000203 mixture Substances 0.000 claims description 40
- 238000005516 engineering process Methods 0.000 claims description 28
- 239000013604 expression vector Substances 0.000 claims description 21
- 108091034117 Oligonucleotide Proteins 0.000 claims description 7
- 239000002028 Biomass Substances 0.000 claims description 5
- 108010017070 Zinc Finger Nucleases Proteins 0.000 claims description 5
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 claims description 5
- 230000001404 mediated effect Effects 0.000 claims description 5
- 238000010453 CRISPR/Cas method Methods 0.000 claims description 4
- 238000002744 homologous recombination Methods 0.000 claims description 3
- 230000006801 homologous recombination Effects 0.000 claims description 3
- 230000001105 regulatory effect Effects 0.000 abstract description 21
- 210000004027 cell Anatomy 0.000 description 134
- 239000002773 nucleotide Substances 0.000 description 70
- 235000019419 proteases Nutrition 0.000 description 70
- 125000003729 nucleotide group Chemical group 0.000 description 69
- 241001144493 Nicotiana obtusifolia Species 0.000 description 57
- 229920002477 rna polymer Polymers 0.000 description 48
- 102000004169 proteins and genes Human genes 0.000 description 46
- 239000000523 sample Substances 0.000 description 45
- 235000018102 proteins Nutrition 0.000 description 42
- 150000007523 nucleic acids Chemical class 0.000 description 41
- 102000039446 nucleic acids Human genes 0.000 description 38
- 108020004707 nucleic acids Proteins 0.000 description 38
- 238000009396 hybridization Methods 0.000 description 37
- 239000012634 fragment Substances 0.000 description 30
- 230000006870 function Effects 0.000 description 24
- 210000001519 tissue Anatomy 0.000 description 24
- 239000002585 base Substances 0.000 description 23
- 239000002253 acid Substances 0.000 description 22
- 230000002068 genetic effect Effects 0.000 description 22
- 150000001413 amino acids Chemical group 0.000 description 21
- 238000006243 chemical reaction Methods 0.000 description 20
- 229940024606 amino acid Drugs 0.000 description 19
- 235000001014 amino acid Nutrition 0.000 description 19
- 108020004999 messenger RNA Proteins 0.000 description 19
- 230000008569 process Effects 0.000 description 18
- 108091028043 Nucleic acid sequence Proteins 0.000 description 17
- 235000019504 cigarettes Nutrition 0.000 description 17
- 230000004048 modification Effects 0.000 description 17
- 238000012986 modification Methods 0.000 description 17
- 239000000047 product Substances 0.000 description 16
- 239000000443 aerosol Substances 0.000 description 15
- 238000012239 gene modification Methods 0.000 description 15
- 230000005017 genetic modification Effects 0.000 description 15
- 235000013617 genetically modified food Nutrition 0.000 description 15
- 230000009467 reduction Effects 0.000 description 15
- 241000894007 species Species 0.000 description 14
- 230000000692 anti-sense effect Effects 0.000 description 13
- 230000015572 biosynthetic process Effects 0.000 description 13
- 238000009395 breeding Methods 0.000 description 13
- 238000004519 manufacturing process Methods 0.000 description 13
- 235000019505 tobacco product Nutrition 0.000 description 13
- 230000001488 breeding effect Effects 0.000 description 12
- 229960002715 nicotine Drugs 0.000 description 12
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 11
- 230000003321 amplification Effects 0.000 description 11
- 230000033228 biological regulation Effects 0.000 description 11
- SNICXCGAKADSCV-UHFFFAOYSA-N nicotine Natural products CN1CCCC1C1=CC=CN=C1 SNICXCGAKADSCV-UHFFFAOYSA-N 0.000 description 11
- 238000003199 nucleic acid amplification method Methods 0.000 description 11
- 238000011144 upstream manufacturing Methods 0.000 description 11
- SNICXCGAKADSCV-JTQLQIEISA-N (-)-Nicotine Chemical compound CN1CCC[C@H]1C1=CC=CN=C1 SNICXCGAKADSCV-JTQLQIEISA-N 0.000 description 10
- 239000000306 component Substances 0.000 description 10
- 238000003306 harvesting Methods 0.000 description 10
- 102000035195 Peptidases Human genes 0.000 description 9
- 101710185494 Zinc finger protein Proteins 0.000 description 9
- 102100023597 Zinc finger protein 816 Human genes 0.000 description 9
- 230000027455 binding Effects 0.000 description 9
- 238000001514 detection method Methods 0.000 description 9
- 230000006698 induction Effects 0.000 description 9
- 230000002452 interceptive effect Effects 0.000 description 9
- 239000011780 sodium chloride Substances 0.000 description 9
- 239000000126 substance Substances 0.000 description 9
- 238000003786 synthesis reaction Methods 0.000 description 9
- 108700028369 Alleles Proteins 0.000 description 8
- 240000008042 Zea mays Species 0.000 description 8
- 230000004913 activation Effects 0.000 description 8
- 238000013461 design Methods 0.000 description 8
- 230000012010 growth Effects 0.000 description 8
- 238000003976 plant breeding Methods 0.000 description 8
- 239000002243 precursor Substances 0.000 description 8
- 235000019833 protease Nutrition 0.000 description 8
- 230000010153 self-pollination Effects 0.000 description 8
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 7
- 108091033409 CRISPR Proteins 0.000 description 7
- 244000025254 Cannabis sativa Species 0.000 description 7
- 108090000994 Catalytic RNA Proteins 0.000 description 7
- 102000053642 Catalytic RNA Human genes 0.000 description 7
- 241000701489 Cauliflower mosaic virus Species 0.000 description 7
- KRKNYBCHXYNGOX-UHFFFAOYSA-K Citrate Chemical compound [O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O KRKNYBCHXYNGOX-UHFFFAOYSA-K 0.000 description 7
- 241000208278 Hyoscyamus Species 0.000 description 7
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 7
- 240000003768 Solanum lycopersicum Species 0.000 description 7
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 7
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 7
- 239000003153 chemical reaction reagent Substances 0.000 description 7
- 238000005520 cutting process Methods 0.000 description 7
- 230000002255 enzymatic effect Effects 0.000 description 7
- 230000001965 increasing effect Effects 0.000 description 7
- 238000003780 insertion Methods 0.000 description 7
- 230000037431 insertion Effects 0.000 description 7
- 230000006798 recombination Effects 0.000 description 7
- 238000005215 recombination Methods 0.000 description 7
- 108091092562 ribozyme Proteins 0.000 description 7
- 238000012216 screening Methods 0.000 description 7
- 235000000346 sugar Nutrition 0.000 description 7
- 230000009261 transgenic effect Effects 0.000 description 7
- 229910052725 zinc Inorganic materials 0.000 description 7
- 239000011701 zinc Substances 0.000 description 7
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 6
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 6
- 244000099147 Ananas comosus Species 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 6
- 241000335053 Beta vulgaris Species 0.000 description 6
- 108091079001 CRISPR RNA Proteins 0.000 description 6
- 108020004414 DNA Proteins 0.000 description 6
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 6
- 244000020551 Helianthus annuus Species 0.000 description 6
- 235000003222 Helianthus annuus Nutrition 0.000 description 6
- 240000007377 Petunia x hybrida Species 0.000 description 6
- 108020004511 Recombinant DNA Proteins 0.000 description 6
- 108020004459 Small interfering RNA Proteins 0.000 description 6
- 229920002494 Zein Polymers 0.000 description 6
- 230000032683 aging Effects 0.000 description 6
- 230000000295 complement effect Effects 0.000 description 6
- 235000013399 edible fruits Nutrition 0.000 description 6
- 230000030279 gene silencing Effects 0.000 description 6
- 229910001385 heavy metal Inorganic materials 0.000 description 6
- 238000005259 measurement Methods 0.000 description 6
- 238000002844 melting Methods 0.000 description 6
- 230000008018 melting Effects 0.000 description 6
- 239000002777 nucleoside Substances 0.000 description 6
- 125000003835 nucleoside group Chemical group 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 238000013518 transcription Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- 230000002103 transcriptional effect Effects 0.000 description 6
- 241000208140 Acer Species 0.000 description 5
- 241000589158 Agrobacterium Species 0.000 description 5
- 241000219194 Arabidopsis Species 0.000 description 5
- 235000007516 Chrysanthemum Nutrition 0.000 description 5
- 108020004635 Complementary DNA Proteins 0.000 description 5
- 235000009854 Cucurbita moschata Nutrition 0.000 description 5
- 240000002395 Euphorbia pulcherrima Species 0.000 description 5
- 241000209082 Lolium Species 0.000 description 5
- 240000003183 Manihot esculenta Species 0.000 description 5
- 241000219000 Populus Species 0.000 description 5
- 241000124033 Salix Species 0.000 description 5
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 5
- -1 Sodium alkyl sulfate Chemical class 0.000 description 5
- 241000218636 Thuja Species 0.000 description 5
- 241000219793 Trifolium Species 0.000 description 5
- 241000209140 Triticum Species 0.000 description 5
- 235000021307 Triticum Nutrition 0.000 description 5
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 5
- 238000010804 cDNA synthesis Methods 0.000 description 5
- 230000001055 chewing effect Effects 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 150000001875 compounds Chemical class 0.000 description 5
- 238000011161 development Methods 0.000 description 5
- 230000018109 developmental process Effects 0.000 description 5
- 238000010362 genome editing Methods 0.000 description 5
- 235000009973 maize Nutrition 0.000 description 5
- 210000001161 mammalian embryo Anatomy 0.000 description 5
- 239000012071 phase Substances 0.000 description 5
- 210000001938 protoplast Anatomy 0.000 description 5
- 230000000306 recurrent effect Effects 0.000 description 5
- 230000000391 smoking effect Effects 0.000 description 5
- 230000001629 suppression Effects 0.000 description 5
- 238000012225 targeting induced local lesions in genomes Methods 0.000 description 5
- 239000005019 zein Substances 0.000 description 5
- 229940093612 zein Drugs 0.000 description 5
- 241000743339 Agrostis Species 0.000 description 4
- 244000291564 Allium cepa Species 0.000 description 4
- 235000007119 Ananas comosus Nutrition 0.000 description 4
- 241001494508 Arundo donax Species 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 235000016068 Berberis vulgaris Nutrition 0.000 description 4
- 238000010354 CRISPR gene editing Methods 0.000 description 4
- 235000002566 Capsicum Nutrition 0.000 description 4
- 235000003255 Carthamus tinctorius Nutrition 0.000 description 4
- 244000020518 Carthamus tinctorius Species 0.000 description 4
- 241000723377 Coffea Species 0.000 description 4
- 240000004244 Cucurbita moschata Species 0.000 description 4
- 235000009355 Dianthus caryophyllus Nutrition 0.000 description 4
- 240000006497 Dianthus caryophyllus Species 0.000 description 4
- PLUBXMRUUVWRLT-UHFFFAOYSA-N Ethyl methanesulfonate Chemical compound CCOS(C)(=O)=O PLUBXMRUUVWRLT-UHFFFAOYSA-N 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- 239000005562 Glyphosate Substances 0.000 description 4
- 241000208202 Linaceae Species 0.000 description 4
- 208000007466 Male Infertility Diseases 0.000 description 4
- 241000234295 Musa Species 0.000 description 4
- 240000007594 Oryza sativa Species 0.000 description 4
- 235000007164 Oryza sativa Nutrition 0.000 description 4
- 241000209117 Panicum Species 0.000 description 4
- 235000011449 Rosa Nutrition 0.000 description 4
- 235000007238 Secale cereale Nutrition 0.000 description 4
- 244000082988 Secale cereale Species 0.000 description 4
- FKNQFGJONOIPTF-UHFFFAOYSA-N Sodium cation Chemical compound [Na+] FKNQFGJONOIPTF-UHFFFAOYSA-N 0.000 description 4
- 235000002597 Solanum melongena Nutrition 0.000 description 4
- 244000061458 Solanum melongena Species 0.000 description 4
- 244000046109 Sorghum vulgare var. nervosum Species 0.000 description 4
- 235000009337 Spinacia oleracea Nutrition 0.000 description 4
- 244000300264 Spinacia oleracea Species 0.000 description 4
- 238000010459 TALEN Methods 0.000 description 4
- 241001122767 Theaceae Species 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- 239000003513 alkali Substances 0.000 description 4
- 239000001390 capsicum minimum Substances 0.000 description 4
- 230000032823 cell division Effects 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 4
- 210000000349 chromosome Anatomy 0.000 description 4
- 235000019506 cigar Nutrition 0.000 description 4
- 230000005782 double-strand break Effects 0.000 description 4
- 230000002349 favourable effect Effects 0.000 description 4
- 239000003205 fragrance Substances 0.000 description 4
- ZZUFCTLCJUWOSV-UHFFFAOYSA-N furosemide Chemical compound C1=C(Cl)C(S(=O)(=O)N)=CC(C(O)=O)=C1NCC1=CC=CO1 ZZUFCTLCJUWOSV-UHFFFAOYSA-N 0.000 description 4
- 229940097068 glyphosate Drugs 0.000 description 4
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 239000002609 medium Substances 0.000 description 4
- 230000010152 pollination Effects 0.000 description 4
- 230000005855 radiation Effects 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- 229910001415 sodium ion Inorganic materials 0.000 description 4
- 230000008685 targeting Effects 0.000 description 4
- 241000228158 x Triticosecale Species 0.000 description 4
- 235000003934 Abelmoschus esculentus Nutrition 0.000 description 3
- 240000004507 Abelmoschus esculentus Species 0.000 description 3
- 241000218642 Abies Species 0.000 description 3
- 235000002732 Allium cepa var. cepa Nutrition 0.000 description 3
- 108020005544 Antisense RNA Proteins 0.000 description 3
- 235000000832 Ayote Nutrition 0.000 description 3
- 235000011299 Brassica oleracea var botrytis Nutrition 0.000 description 3
- 240000003259 Brassica oleracea var. botrytis Species 0.000 description 3
- 241000208293 Capsicum Species 0.000 description 3
- 240000005250 Chrysanthemum indicum Species 0.000 description 3
- 244000241235 Citrullus lanatus Species 0.000 description 3
- 244000241257 Cucumis melo Species 0.000 description 3
- 240000008067 Cucumis sativus Species 0.000 description 3
- 235000009804 Cucurbita pepo subsp pepo Nutrition 0.000 description 3
- 235000001950 Elaeis guineensis Nutrition 0.000 description 3
- 244000004281 Eucalyptus maculata Species 0.000 description 3
- 241000234643 Festuca arundinacea Species 0.000 description 3
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 3
- 240000009088 Fragaria x ananassa Species 0.000 description 3
- 235000011363 Fragaria x ananassa Nutrition 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- 235000010469 Glycine max Nutrition 0.000 description 3
- 244000068988 Glycine max Species 0.000 description 3
- 244000299507 Gossypium hirsutum Species 0.000 description 3
- 108090001102 Hammerhead ribozyme Proteins 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- 101000976610 Homo sapiens Zinc finger protein 410 Proteins 0.000 description 3
- 240000005979 Hordeum vulgare Species 0.000 description 3
- 235000007340 Hordeum vulgare Nutrition 0.000 description 3
- 206010020649 Hyperkeratosis Diseases 0.000 description 3
- 206010021929 Infertility male Diseases 0.000 description 3
- 241000219745 Lupinus Species 0.000 description 3
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- 108091092878 Microsatellite Proteins 0.000 description 3
- 240000002853 Nelumbo nucifera Species 0.000 description 3
- 235000006443 Panicum miliaceum subsp. miliaceum Nutrition 0.000 description 3
- 235000009037 Panicum miliaceum subsp. ruderale Nutrition 0.000 description 3
- 241001520808 Panicum virgatum Species 0.000 description 3
- 244000130556 Pennisetum purpureum Species 0.000 description 3
- 235000007195 Pennisetum typhoides Nutrition 0.000 description 3
- 241000745991 Phalaris Species 0.000 description 3
- 241000746983 Phleum pratense Species 0.000 description 3
- 235000014676 Phragmites communis Nutrition 0.000 description 3
- 235000005205 Pinus Nutrition 0.000 description 3
- 241000218602 Pinus <genus> Species 0.000 description 3
- 235000011613 Pinus brutia Nutrition 0.000 description 3
- 240000000528 Ricinus communis Species 0.000 description 3
- 235000004443 Ricinus communis Nutrition 0.000 description 3
- 241000209051 Saccharum Species 0.000 description 3
- 240000000111 Saccharum officinarum Species 0.000 description 3
- 235000002634 Solanum Nutrition 0.000 description 3
- 241000207763 Solanum Species 0.000 description 3
- 235000002595 Solanum tuberosum Nutrition 0.000 description 3
- 244000061456 Solanum tuberosum Species 0.000 description 3
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 3
- 244000062793 Sorghum vulgare Species 0.000 description 3
- 244000299461 Theobroma cacao Species 0.000 description 3
- 235000009470 Theobroma cacao Nutrition 0.000 description 3
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 3
- 108091023040 Transcription factor Proteins 0.000 description 3
- 102000040945 Transcription factor Human genes 0.000 description 3
- 241000863480 Vinca Species 0.000 description 3
- 240000006365 Vitis vinifera Species 0.000 description 3
- 235000014787 Vitis vinifera Nutrition 0.000 description 3
- 208000005652 acute fatty liver of pregnancy Diseases 0.000 description 3
- 239000002671 adjuvant Substances 0.000 description 3
- 229910021529 ammonia Inorganic materials 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 235000018597 common camellia Nutrition 0.000 description 3
- 235000009508 confectionery Nutrition 0.000 description 3
- 238000005034 decoration Methods 0.000 description 3
- 230000007423 decrease Effects 0.000 description 3
- 230000008034 disappearance Effects 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 230000009088 enzymatic function Effects 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 238000003205 genotyping method Methods 0.000 description 3
- 229960002449 glycine Drugs 0.000 description 3
- 239000004009 herbicide Substances 0.000 description 3
- 210000004408 hybridoma Anatomy 0.000 description 3
- 238000007901 in situ hybridization Methods 0.000 description 3
- 230000002503 metabolic effect Effects 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 239000003471 mutagenic agent Substances 0.000 description 3
- 231100000707 mutagenic chemical Toxicity 0.000 description 3
- 239000005022 packaging material Substances 0.000 description 3
- 238000004806 packaging method and process Methods 0.000 description 3
- 230000001717 pathogenic effect Effects 0.000 description 3
- 239000000843 powder Substances 0.000 description 3
- 235000015136 pumpkin Nutrition 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 235000009566 rice Nutrition 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 239000000779 smoke Substances 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 241001515965 unidentified phage Species 0.000 description 3
- 230000000007 visual effect Effects 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- LWTDZKXXJRRKDG-KXBFYZLASA-N (-)-phaseollin Chemical compound C1OC2=CC(O)=CC=C2[C@H]2[C@@H]1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-KXBFYZLASA-N 0.000 description 2
- MYKUKUCHPMASKF-VIFPVBQESA-N (S)-nornicotine Chemical compound C1CCN[C@@H]1C1=CC=CN=C1 MYKUKUCHPMASKF-VIFPVBQESA-N 0.000 description 2
- PXFBZOLANLWPMH-UHFFFAOYSA-N 16-Epiaffinine Natural products C1C(C2=CC=CC=C2N2)=C2C(=O)CC2C(=CC)CN(C)C1C2CO PXFBZOLANLWPMH-UHFFFAOYSA-N 0.000 description 2
- 241000207965 Acanthaceae Species 0.000 description 2
- 241000234282 Allium Species 0.000 description 2
- 240000008025 Alternanthera ficoidea Species 0.000 description 2
- 241000219318 Amaranthus Species 0.000 description 2
- 241000234270 Amaryllidaceae Species 0.000 description 2
- 241000744007 Andropogon Species 0.000 description 2
- 241001327399 Andropogon gerardii Species 0.000 description 2
- 241000208327 Apocynaceae Species 0.000 description 2
- 241000233788 Arecaceae Species 0.000 description 2
- 235000003826 Artemisia Nutrition 0.000 description 2
- 235000003261 Artemisia vulgaris Nutrition 0.000 description 2
- 241000193388 Bacillus thuringiensis Species 0.000 description 2
- 241000133570 Berberidaceae Species 0.000 description 2
- 240000000724 Berberis vulgaris Species 0.000 description 2
- 235000021533 Beta vulgaris Nutrition 0.000 description 2
- 235000006011 Bixa Nutrition 0.000 description 2
- 241000934840 Bixa Species 0.000 description 2
- 241000934828 Bixaceae Species 0.000 description 2
- 241000339490 Brachyachne Species 0.000 description 2
- 241000219198 Brassica Species 0.000 description 2
- 244000178993 Brassica juncea Species 0.000 description 2
- 240000002791 Brassica napus Species 0.000 description 2
- 235000004221 Brassica oleracea var gemmifera Nutrition 0.000 description 2
- 235000017647 Brassica oleracea var italica Nutrition 0.000 description 2
- 244000308368 Brassica oleracea var. gemmifera Species 0.000 description 2
- 241000219193 Brassicaceae Species 0.000 description 2
- 241000209507 Camellia Species 0.000 description 2
- 241000218235 Cannabaceae Species 0.000 description 2
- 241000218236 Cannabis Species 0.000 description 2
- 240000004160 Capsicum annuum Species 0.000 description 2
- 241000219321 Caryophyllaceae Species 0.000 description 2
- 241000488900 Cephalotaxaceae Species 0.000 description 2
- 241000488899 Cephalotaxus Species 0.000 description 2
- 241000871189 Chenopodiaceae Species 0.000 description 2
- 108010022172 Chitinases Proteins 0.000 description 2
- 244000189548 Chrysanthemum x morifolium Species 0.000 description 2
- 241000157855 Cinchona Species 0.000 description 2
- 241000219109 Citrullus Species 0.000 description 2
- 235000012828 Citrullus lanatus var citroides Nutrition 0.000 description 2
- 241000131506 Colchicaceae Species 0.000 description 2
- 241000723375 Colchicum Species 0.000 description 2
- 235000021508 Coleus Nutrition 0.000 description 2
- 244000061182 Coleus blumei Species 0.000 description 2
- 101710190853 Cruciferin Proteins 0.000 description 2
- 241000219112 Cucumis Species 0.000 description 2
- 235000009847 Cucumis melo var cantalupensis Nutrition 0.000 description 2
- 235000010071 Cucumis prophetarum Nutrition 0.000 description 2
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 description 2
- 241000219122 Cucurbita Species 0.000 description 2
- 240000001980 Cucurbita pepo Species 0.000 description 2
- 244000052363 Cynodon dactylon Species 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 241000208296 Datura Species 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- 240000001879 Digitalis lutea Species 0.000 description 2
- 235000005903 Dioscorea Nutrition 0.000 description 2
- 244000281702 Dioscorea villosa Species 0.000 description 2
- 235000000504 Dioscorea villosa Nutrition 0.000 description 2
- 241000234272 Dioscoreaceae Species 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 244000127993 Elaeis melanococca Species 0.000 description 2
- 241000218671 Ephedra Species 0.000 description 2
- 241000218670 Ephedraceae Species 0.000 description 2
- 241001081474 Erythroxylaceae Species 0.000 description 2
- 241000735552 Erythroxylum Species 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 244000166124 Eucalyptus globulus Species 0.000 description 2
- 241000221017 Euphorbiaceae Species 0.000 description 2
- 241000234642 Festuca Species 0.000 description 2
- 241000220223 Fragaria Species 0.000 description 2
- 235000016623 Fragaria vesca Nutrition 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- 241000234271 Galanthus Species 0.000 description 2
- 241000219146 Gossypium Species 0.000 description 2
- 235000009438 Gossypium Nutrition 0.000 description 2
- 241000208818 Helianthus Species 0.000 description 2
- 244000043261 Hevea brasiliensis Species 0.000 description 2
- 241000209219 Hordeum Species 0.000 description 2
- 240000005385 Jasminum sambac Species 0.000 description 2
- 241000221089 Jatropha Species 0.000 description 2
- 241000208822 Lactuca Species 0.000 description 2
- 235000003228 Lactuca sativa Nutrition 0.000 description 2
- 240000008415 Lactuca sativa Species 0.000 description 2
- 241000208204 Linum Species 0.000 description 2
- 235000004431 Linum usitatissimum Nutrition 0.000 description 2
- 235000010649 Lupinus albus Nutrition 0.000 description 2
- 240000000894 Lupinus albus Species 0.000 description 2
- 241000195948 Lycopodiaceae Species 0.000 description 2
- 241000195947 Lycopodium Species 0.000 description 2
- 241000219071 Malvaceae Species 0.000 description 2
- 108020000290 Mannitol dehydrogenase Proteins 0.000 description 2
- 235000010624 Medicago sativa Nutrition 0.000 description 2
- 240000004658 Medicago sativa Species 0.000 description 2
- 235000014435 Mentha Nutrition 0.000 description 2
- 241001072983 Mentha Species 0.000 description 2
- 241000878007 Miscanthus Species 0.000 description 2
- 241000878006 Miscanthus sinensis Species 0.000 description 2
- 235000018290 Musa x paradisiaca Nutrition 0.000 description 2
- 241000234615 Musaceae Species 0.000 description 2
- 241000219926 Myrtaceae Species 0.000 description 2
- VZUNGTLZRAYYDE-UHFFFAOYSA-N N-methyl-N'-nitro-N-nitrosoguanidine Chemical class O=NN(C)C(=N)N[N+]([O-])=O VZUNGTLZRAYYDE-UHFFFAOYSA-N 0.000 description 2
- 108091061960 Naked DNA Proteins 0.000 description 2
- 241001230286 Narenga Species 0.000 description 2
- 235000006508 Nelumbo nucifera Nutrition 0.000 description 2
- 235000006510 Nelumbo pentapetala Nutrition 0.000 description 2
- 241000208126 Nicotiana acuminata Species 0.000 description 2
- 244000061322 Nicotiana alata Species 0.000 description 2
- 241000493375 Nicotiana quadrivalvis Species 0.000 description 2
- 241001144498 Nicotiana rosulata subsp. ingulba Species 0.000 description 2
- 241000228669 Nicotiana velutina Species 0.000 description 2
- MYKUKUCHPMASKF-UHFFFAOYSA-N Nornicotine Natural products C1CCNC1C1=CC=CN=C1 MYKUKUCHPMASKF-UHFFFAOYSA-N 0.000 description 2
- 101710163270 Nuclease Proteins 0.000 description 2
- 241000209018 Nyssaceae Species 0.000 description 2
- RCEAADKTGXTDOA-UHFFFAOYSA-N OS(O)(=O)=O.CCCCCCCCCCCC[Na] Chemical compound OS(O)(=O)=O.CCCCCCCCCCCC[Na] RCEAADKTGXTDOA-UHFFFAOYSA-N 0.000 description 2
- 241000209094 Oryza Species 0.000 description 2
- 240000001090 Papaver somniferum Species 0.000 description 2
- 241000218180 Papaveraceae Species 0.000 description 2
- 101710096342 Pathogenesis-related protein Proteins 0.000 description 2
- 241000209046 Pennisetum Species 0.000 description 2
- 244000115721 Pennisetum typhoides Species 0.000 description 2
- 244000081757 Phalaris arundinacea Species 0.000 description 2
- 241000218641 Pinaceae Species 0.000 description 2
- 241000013557 Plantaginaceae Species 0.000 description 2
- 241000500437 Plutella xylostella Species 0.000 description 2
- 241000209504 Poaceae Species 0.000 description 2
- LOUPRKONTZGTKE-WZBLMQSHSA-N Quinine Chemical compound C([C@H]([C@H](C1)C=C)C2)C[N@@]1[C@@H]2[C@H](O)C1=CC=NC2=CC=C(OC)C=C21 LOUPRKONTZGTKE-WZBLMQSHSA-N 0.000 description 2
- 101150111829 RBCS2 gene Proteins 0.000 description 2
- 244000061121 Rauvolfia serpentina Species 0.000 description 2
- 235000003846 Ricinus Nutrition 0.000 description 2
- 241000322381 Ricinus <louse> Species 0.000 description 2
- 241000220317 Rosa Species 0.000 description 2
- 241000220222 Rosaceae Species 0.000 description 2
- 241001107098 Rubiaceae Species 0.000 description 2
- 241000218998 Salicaceae Species 0.000 description 2
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 2
- 241001093760 Sapindaceae Species 0.000 description 2
- 241000242873 Scopolia Species 0.000 description 2
- 241000209056 Secale Species 0.000 description 2
- 108091081021 Sense strand Proteins 0.000 description 2
- 235000008515 Setaria glauca Nutrition 0.000 description 2
- 241000208292 Solanaceae Species 0.000 description 2
- 101000611441 Solanum lycopersicum Pathogenesis-related leaf protein 6 Proteins 0.000 description 2
- 238000002105 Southern blotting Methods 0.000 description 2
- 208000037065 Subacute sclerosing leukoencephalitis Diseases 0.000 description 2
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 2
- 241001116495 Taxaceae Species 0.000 description 2
- 241001116500 Taxus Species 0.000 description 2
- 241000219161 Theobroma Species 0.000 description 2
- 241000723873 Tobacco mosaic virus Species 0.000 description 2
- 108020004566 Transfer RNA Proteins 0.000 description 2
- 235000019714 Triticale Nutrition 0.000 description 2
- 241000489523 Veratrum Species 0.000 description 2
- 241000219094 Vitaceae Species 0.000 description 2
- 235000009392 Vitis Nutrition 0.000 description 2
- 241000219095 Vitis Species 0.000 description 2
- 235000009754 Vitis X bourquina Nutrition 0.000 description 2
- 235000012333 Vitis X labruscana Nutrition 0.000 description 2
- 241000209149 Zea Species 0.000 description 2
- 101001036768 Zea mays Glucose-1-phosphate adenylyltransferase large subunit 1, chloroplastic/amyloplastic Proteins 0.000 description 2
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 2
- 102100023547 Zinc finger protein 410 Human genes 0.000 description 2
- 210000005006 adaptive immune system Anatomy 0.000 description 2
- 230000000890 antigenic effect Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 244000030166 artemisia Species 0.000 description 2
- 235000009052 artemisia Nutrition 0.000 description 2
- 210000001367 artery Anatomy 0.000 description 2
- 210000004507 artificial chromosome Anatomy 0.000 description 2
- 229940097012 bacillus thuringiensis Drugs 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 229920003211 cis-1,4-polyisoprene Polymers 0.000 description 2
- 238000002485 combustion reaction Methods 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 2
- 239000003184 complementary RNA Substances 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 235000005822 corn Nutrition 0.000 description 2
- 235000004879 dioscorea Nutrition 0.000 description 2
- 238000006073 displacement reaction Methods 0.000 description 2
- 230000009977 dual effect Effects 0.000 description 2
- 238000004043 dyeing Methods 0.000 description 2
- 230000007515 enzymatic degradation Effects 0.000 description 2
- 241001233957 eudicotyledons Species 0.000 description 2
- 238000010195 expression analysis Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- MURGITYSBWUQTI-UHFFFAOYSA-N fluorescin Chemical compound OC(=O)C1=CC=CC=C1C1C2=CC=C(O)C=C2OC2=CC(O)=CC=C21 MURGITYSBWUQTI-UHFFFAOYSA-N 0.000 description 2
- 238000003209 gene knockout Methods 0.000 description 2
- 230000009368 gene silencing by RNA Effects 0.000 description 2
- 108091006104 gene-regulatory proteins Proteins 0.000 description 2
- 102000034356 gene-regulatory proteins Human genes 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 230000035784 germination Effects 0.000 description 2
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 230000002363 herbicidal effect Effects 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 230000001976 improved effect Effects 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 238000006317 isomerization reaction Methods 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- UMFJAHHVKNCGLG-UHFFFAOYSA-N n-Nitrosodimethylamine Chemical compound CN(C)N=O UMFJAHHVKNCGLG-UHFFFAOYSA-N 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 230000000050 nutritive effect Effects 0.000 description 2
- 239000003921 oil Substances 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 244000052769 pathogen Species 0.000 description 2
- 239000000575 pesticide Substances 0.000 description 2
- 150000004713 phosphodiesters Chemical group 0.000 description 2
- PTMHPRAIXMAOOB-UHFFFAOYSA-L phosphoramidate Chemical compound NP([O-])([O-])=O PTMHPRAIXMAOOB-UHFFFAOYSA-L 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 230000008929 regeneration Effects 0.000 description 2
- 238000011069 regeneration method Methods 0.000 description 2
- 230000008263 repair mechanism Effects 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 125000000548 ribosyl group Chemical group C1([C@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 2
- 235000012420 sanguinaria Nutrition 0.000 description 2
- 230000035040 seed growth Effects 0.000 description 2
- 239000011734 sodium Substances 0.000 description 2
- 229910052708 sodium Inorganic materials 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 238000011895 specific detection Methods 0.000 description 2
- 230000009897 systematic effect Effects 0.000 description 2
- 230000003827 upregulation Effects 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 229940057613 veratrum Drugs 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 230000003313 weakening effect Effects 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- 210000004885 white matter Anatomy 0.000 description 2
- 239000002023 wood Substances 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- ASWBNKHCZGQVJV-UHFFFAOYSA-N (3-hexadecanoyloxy-2-hydroxypropyl) 2-(trimethylazaniumyl)ethyl phosphate Chemical compound CCCCCCCCCCCCCCCC(=O)OCC(O)COP([O-])(=O)OCC[N+](C)(C)C ASWBNKHCZGQVJV-UHFFFAOYSA-N 0.000 description 1
- RBACIKXCRWGCBB-UHFFFAOYSA-N 1,2-Epoxybutane Chemical compound CCC1CO1 RBACIKXCRWGCBB-UHFFFAOYSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- UFBJCMHMOXMLKC-UHFFFAOYSA-N 2,4-dinitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1[N+]([O-])=O UFBJCMHMOXMLKC-UHFFFAOYSA-N 0.000 description 1
- WTLNOANVTIKPEE-UHFFFAOYSA-N 2-acetyloxypropanoic acid Chemical compound OC(=O)C(C)OC(C)=O WTLNOANVTIKPEE-UHFFFAOYSA-N 0.000 description 1
- MWBWWFOAEOYUST-UHFFFAOYSA-N 2-aminopurine Chemical compound NC1=NC=C2N=CNC2=N1 MWBWWFOAEOYUST-UHFFFAOYSA-N 0.000 description 1
- MWMOPIVLTLEUJO-UHFFFAOYSA-N 2-oxopropanoic acid;phosphoric acid Chemical compound OP(O)(O)=O.CC(=O)C(O)=O MWMOPIVLTLEUJO-UHFFFAOYSA-N 0.000 description 1
- HEGWNIMGIDYRAU-UHFFFAOYSA-N 3-hexyl-2,4-dioxabicyclo[1.1.0]butane Chemical compound O1C2OC21CCCCCC HEGWNIMGIDYRAU-UHFFFAOYSA-N 0.000 description 1
- 108010020183 3-phosphoshikimate 1-carboxyvinyltransferase Proteins 0.000 description 1
- AJBZENLMTKDAEK-UHFFFAOYSA-N 3a,5a,5b,8,8,11a-hexamethyl-1-prop-1-en-2-yl-1,2,3,4,5,6,7,7a,9,10,11,11b,12,13,13a,13b-hexadecahydrocyclopenta[a]chrysene-4,9-diol Chemical compound CC12CCC(O)C(C)(C)C1CCC(C1(C)CC3O)(C)C2CCC1C1C3(C)CCC1C(=C)C AJBZENLMTKDAEK-UHFFFAOYSA-N 0.000 description 1
- ARSRBNBHOADGJU-UHFFFAOYSA-N 7,12-dimethyltetraphene Chemical class C1=CC2=CC=CC=C2C2=C1C(C)=C(C=CC=C1)C1=C2C ARSRBNBHOADGJU-UHFFFAOYSA-N 0.000 description 1
- 241001075517 Abelmoschus Species 0.000 description 1
- 206010069754 Acquired gene mutation Diseases 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 102000005869 Activating Transcription Factors Human genes 0.000 description 1
- 108010005254 Activating Transcription Factors Proteins 0.000 description 1
- 240000007241 Agrostis stolonifera Species 0.000 description 1
- 241001136782 Alca Species 0.000 description 1
- 241000123646 Allioideae Species 0.000 description 1
- 235000005255 Allium cepa Nutrition 0.000 description 1
- 241000556588 Alstroemeria Species 0.000 description 1
- 241000556591 Alstroemeriaceae Species 0.000 description 1
- 241000931365 Ampelodesmos mauritanicus Species 0.000 description 1
- 241000746375 Andrographis Species 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 241001167018 Aroa Species 0.000 description 1
- 240000006891 Artemisia vulgaris Species 0.000 description 1
- 241001494510 Arundo Species 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000208838 Asteraceae Species 0.000 description 1
- 241001106067 Atropa Species 0.000 description 1
- 108010007337 Azurin Proteins 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 101100497219 Bacillus thuringiensis subsp. kurstaki cry1Ac gene Proteins 0.000 description 1
- 235000017166 Bambusa arundinacea Nutrition 0.000 description 1
- 235000017491 Bambusa tulda Nutrition 0.000 description 1
- 235000000318 Bindesalat Nutrition 0.000 description 1
- 244000106835 Bindesalat Species 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 235000011331 Brassica Nutrition 0.000 description 1
- 235000011303 Brassica alboglabra Nutrition 0.000 description 1
- 235000003351 Brassica cretica Nutrition 0.000 description 1
- 235000011332 Brassica juncea Nutrition 0.000 description 1
- 235000014700 Brassica juncea var napiformis Nutrition 0.000 description 1
- 235000005855 Brassica juncea var. subintegrifolia Nutrition 0.000 description 1
- 235000011293 Brassica napus Nutrition 0.000 description 1
- 235000006008 Brassica napus var napus Nutrition 0.000 description 1
- 240000007124 Brassica oleracea Species 0.000 description 1
- 235000011302 Brassica oleracea Nutrition 0.000 description 1
- 235000001169 Brassica oleracea var oleracea Nutrition 0.000 description 1
- 235000003343 Brassica rupestris Nutrition 0.000 description 1
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 1
- 241000234670 Bromeliaceae Species 0.000 description 1
- COVZYZSDYWQREU-UHFFFAOYSA-N Busulfan Chemical compound CS(=O)(=O)OCCCCOS(C)(=O)=O COVZYZSDYWQREU-UHFFFAOYSA-N 0.000 description 1
- 101100394003 Butyrivibrio fibrisolvens end1 gene Proteins 0.000 description 1
- WTEJQBARPJSNLZ-UHFFFAOYSA-N C(C(=O)N)(=O)O.P Chemical compound C(C(=O)N)(=O)O.P WTEJQBARPJSNLZ-UHFFFAOYSA-N 0.000 description 1
- 101150010856 CRT gene Proteins 0.000 description 1
- 101100342815 Caenorhabditis elegans lec-1 gene Proteins 0.000 description 1
- 235000003880 Calendula Nutrition 0.000 description 1
- 240000001432 Calendula officinalis Species 0.000 description 1
- 240000001548 Camellia japonica Species 0.000 description 1
- 241000759909 Camptotheca Species 0.000 description 1
- 235000008534 Capsicum annuum var annuum Nutrition 0.000 description 1
- 240000008574 Capsicum frutescens Species 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- WLYGSPLCNKYESI-RSUQVHIMSA-N Carthamin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1[C@@]1(O)C(O)=C(C(=O)\C=C\C=2C=CC(O)=CC=2)C(=O)C(\C=C\2C([C@](O)([C@H]3[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O3)O)C(O)=C(C(=O)\C=C\C=3C=CC(O)=CC=3)C/2=O)=O)=C1O WLYGSPLCNKYESI-RSUQVHIMSA-N 0.000 description 1
- 241000208809 Carthamus Species 0.000 description 1
- 241000208328 Catharanthus Species 0.000 description 1
- 241001674939 Caulanthus Species 0.000 description 1
- 235000012001 Cestrum nocturnum Nutrition 0.000 description 1
- 240000001918 Cestrum nocturnum Species 0.000 description 1
- 102000012286 Chitinases Human genes 0.000 description 1
- 108010077544 Chromatin Proteins 0.000 description 1
- 235000021513 Cinchona Nutrition 0.000 description 1
- 235000001258 Cinchona calisaya Nutrition 0.000 description 1
- 235000009831 Citrullus lanatus Nutrition 0.000 description 1
- 235000005979 Citrus limon Nutrition 0.000 description 1
- 244000131522 Citrus pyriformis Species 0.000 description 1
- 108091033380 Coding strand Proteins 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 235000009842 Cucumis melo Nutrition 0.000 description 1
- 235000009849 Cucumis sativus Nutrition 0.000 description 1
- 235000003949 Cucurbita mixta Nutrition 0.000 description 1
- 235000009852 Cucurbita pepo Nutrition 0.000 description 1
- 241000219104 Cucurbitaceae Species 0.000 description 1
- CMSMOCZEIVJLDB-UHFFFAOYSA-N Cyclophosphamide Chemical compound ClCCN(CCCl)P1(=O)NCCCO1 CMSMOCZEIVJLDB-UHFFFAOYSA-N 0.000 description 1
- 102000005927 Cysteine Proteases Human genes 0.000 description 1
- 108010005843 Cysteine Proteases Proteins 0.000 description 1
- 102100028717 Cytosolic 5'-nucleotidase 3A Human genes 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 1
- 238000000018 DNA microarray Methods 0.000 description 1
- 101710118613 DNA polymerase sliding clamp 2 Proteins 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 108010001682 Dextranase Proteins 0.000 description 1
- 240000003421 Dianthus chinensis Species 0.000 description 1
- 241001163054 Dichelachne Species 0.000 description 1
- 208000035240 Disease Resistance Diseases 0.000 description 1
- 208000035220 Dyserythropoietic Congenital Anemia Diseases 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241000512897 Elaeis Species 0.000 description 1
- 235000001942 Elaeis Nutrition 0.000 description 1
- 240000003133 Elaeis guineensis Species 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- PIICEJLVQHRZGT-UHFFFAOYSA-N Ethylenediamine Chemical compound NCCN PIICEJLVQHRZGT-UHFFFAOYSA-N 0.000 description 1
- 241000220485 Fabaceae Species 0.000 description 1
- 230000010558 Gene Alterations Effects 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- 101710186901 Globulin 1 Proteins 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010024636 Glutathione Proteins 0.000 description 1
- 108010070675 Glutathione transferase Proteins 0.000 description 1
- 102000005720 Glutathione transferase Human genes 0.000 description 1
- 235000009429 Gossypium barbadense Nutrition 0.000 description 1
- 235000009432 Gossypium hirsutum Nutrition 0.000 description 1
- 101710154606 Hemagglutinin Proteins 0.000 description 1
- 101000956263 Homo sapiens Uncharacterized protein C19orf48 Proteins 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 235000007457 Jasminum sambac Nutrition 0.000 description 1
- 241001048891 Jatropha curcas Species 0.000 description 1
- 241000588747 Klebsiella pneumoniae Species 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 241000207923 Lamiaceae Species 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 241000209510 Liliopsida Species 0.000 description 1
- 240000006240 Linum usitatissimum Species 0.000 description 1
- 241001582888 Lobus Species 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 241000227653 Lycopersicon Species 0.000 description 1
- 235000002262 Lycopersicon Nutrition 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 235000004456 Manihot esculenta Nutrition 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 241000219823 Medicago Species 0.000 description 1
- 229920000877 Melamine resin Polymers 0.000 description 1
- 241000489991 Melanthiaceae Species 0.000 description 1
- 206010027336 Menstruation delayed Diseases 0.000 description 1
- 101100409013 Mesembryanthemum crystallinum PPD gene Proteins 0.000 description 1
- 241001074116 Miscanthus x giganteus Species 0.000 description 1
- 241001562716 Muhlenbergia uniflora Species 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- 108091000020 Myo-Inositol-1-Phosphate Synthase Proteins 0.000 description 1
- 102000018463 Myo-Inositol-1-Phosphate Synthase Human genes 0.000 description 1
- FUSGACRLAFQQRL-UHFFFAOYSA-N N-Ethyl-N-nitrosourea Chemical compound CCN(N=O)C(N)=O FUSGACRLAFQQRL-UHFFFAOYSA-N 0.000 description 1
- ZRKWMRDKSOPRRS-UHFFFAOYSA-N N-Methyl-N-nitrosourea Chemical compound O=NN(C)C(N)=O ZRKWMRDKSOPRRS-UHFFFAOYSA-N 0.000 description 1
- 235000005807 Nelumbo Nutrition 0.000 description 1
- 241000250374 Nicotiana acaulis Species 0.000 description 1
- 241001144497 Nicotiana africana Species 0.000 description 1
- 241000250377 Nicotiana amplexicaulis Species 0.000 description 1
- 241001144490 Nicotiana arentsii Species 0.000 description 1
- 241000228653 Nicotiana attenuata Species 0.000 description 1
- 241000250375 Nicotiana benavidesii Species 0.000 description 1
- 241000207746 Nicotiana benthamiana Species 0.000 description 1
- 241000250376 Nicotiana bonariensis Species 0.000 description 1
- 241000250373 Nicotiana cavicola Species 0.000 description 1
- 241001609967 Nicotiana clevelandii Species 0.000 description 1
- 241001244271 Nicotiana cordifolia Species 0.000 description 1
- 241001144496 Nicotiana corymbosa Species 0.000 description 1
- 241000208113 Nicotiana debneyi Species 0.000 description 1
- 241000862464 Nicotiana excelsior Species 0.000 description 1
- 244000006449 Nicotiana forgetiana Species 0.000 description 1
- 241000208128 Nicotiana glauca Species 0.000 description 1
- 241001495644 Nicotiana glutinosa Species 0.000 description 1
- 241001144503 Nicotiana goodspeedii Species 0.000 description 1
- 241000250366 Nicotiana gossei Species 0.000 description 1
- 241000579278 Nicotiana kawakamii Species 0.000 description 1
- 241000250368 Nicotiana knightiana Species 0.000 description 1
- 241000250019 Nicotiana langsdorffii Species 0.000 description 1
- 241000250027 Nicotiana linearis Species 0.000 description 1
- 241000250024 Nicotiana longiflora Species 0.000 description 1
- 241001144499 Nicotiana maritima Species 0.000 description 1
- 241000250031 Nicotiana megalosiphon Species 0.000 description 1
- 241000250030 Nicotiana miersii Species 0.000 description 1
- 241000250041 Nicotiana noctiflora Species 0.000 description 1
- 241000228665 Nicotiana nudicaulis Species 0.000 description 1
- 241001144504 Nicotiana occidentalis subsp. hesperis Species 0.000 description 1
- 241000208132 Nicotiana otophora Species 0.000 description 1
- 241000876839 Nicotiana paniculata Species 0.000 description 1
- 241001144492 Nicotiana pauciflora Species 0.000 description 1
- 241000250042 Nicotiana petunioides Species 0.000 description 1
- 241000208133 Nicotiana plumbaginifolia Species 0.000 description 1
- 241001144487 Nicotiana raimondii Species 0.000 description 1
- 241001290303 Nicotiana repanda Species 0.000 description 1
- 241001144500 Nicotiana rosulata Species 0.000 description 1
- 241001144486 Nicotiana rotundifolia Species 0.000 description 1
- 241000208134 Nicotiana rustica Species 0.000 description 1
- 241001144491 Nicotiana setchellii Species 0.000 description 1
- 241000250044 Nicotiana simulans Species 0.000 description 1
- 241000249970 Nicotiana solanifolia Species 0.000 description 1
- 241001144495 Nicotiana spegazzinii Species 0.000 description 1
- 241000249966 Nicotiana stocktonii Species 0.000 description 1
- 241001144480 Nicotiana suaveolens Species 0.000 description 1
- 241000208136 Nicotiana sylvestris Species 0.000 description 1
- 241001144489 Nicotiana thyrsiflora Species 0.000 description 1
- 241000579280 Nicotiana tomentosa Species 0.000 description 1
- 241000208138 Nicotiana tomentosiformis Species 0.000 description 1
- 241000249968 Nicotiana umbratica Species 0.000 description 1
- 241001144494 Nicotiana wigandioides Species 0.000 description 1
- 241001597008 Nomeidae Species 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 240000002061 Nothoscordum fragrans Species 0.000 description 1
- 235000005215 Nyctanthes arbor tristis Nutrition 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 1
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 235000011096 Papaver Nutrition 0.000 description 1
- AVFIYMSJDDGDBQ-UHFFFAOYSA-N Parthenium Chemical compound C1C=C(CCC(C)=O)C(C)CC2OC(=O)C(=C)C21 AVFIYMSJDDGDBQ-UHFFFAOYSA-N 0.000 description 1
- 241001495454 Parthenium Species 0.000 description 1
- 240000004928 Paspalum scrobiculatum Species 0.000 description 1
- 235000003675 Paspalum scrobiculatum Nutrition 0.000 description 1
- 244000038248 Pennisetum spicatum Species 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- 101710163504 Phaseolin Proteins 0.000 description 1
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 1
- 244000046052 Phaseolus vulgaris Species 0.000 description 1
- 241000746981 Phleum Species 0.000 description 1
- 108010081996 Photosystem I Protein Complex Proteins 0.000 description 1
- 244000082204 Phyllostachys viridis Species 0.000 description 1
- 235000015334 Phyllostachys viridis Nutrition 0.000 description 1
- 241000132240 Pilea involucrata Species 0.000 description 1
- 235000008331 Pinus X rigitaeda Nutrition 0.000 description 1
- 241000018646 Pinus brutia Species 0.000 description 1
- 108700001094 Plant Genes Proteins 0.000 description 1
- 241000209048 Poa Species 0.000 description 1
- 241000209049 Poa pratensis Species 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 241000168036 Populus alba Species 0.000 description 1
- 241000161288 Populus candicans Species 0.000 description 1
- 240000007909 Prosopis juliflora Species 0.000 description 1
- 101710176177 Protein A56 Proteins 0.000 description 1
- 229940096437 Protein S Drugs 0.000 description 1
- 101150041925 RBCS gene Proteins 0.000 description 1
- 101150051143 RBCS1 gene Proteins 0.000 description 1
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- 108091027981 Response element Proteins 0.000 description 1
- 235000004789 Rosa xanthina Nutrition 0.000 description 1
- 229940127593 SEQ-9 Drugs 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- 241000746444 Saccharum sp. Species 0.000 description 1
- 101100352756 Schizosaccharomyces pombe (strain 972 / ATCC 24843) pnu1 gene Proteins 0.000 description 1
- 101100528946 Schizosaccharomyces pombe (strain 972 / ATCC 24843) rpa1 gene Proteins 0.000 description 1
- 241001249129 Scirpophaga incertulas Species 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 235000015503 Sorghum bicolor subsp. drummondii Nutrition 0.000 description 1
- 241000746413 Spartina Species 0.000 description 1
- 241000251131 Sphyrna Species 0.000 description 1
- 241000219315 Spinacia Species 0.000 description 1
- 241001149258 Sporobolus alterniflorus Species 0.000 description 1
- 241000923571 Sporobolus michauxianus Species 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 244000170625 Sudangrass Species 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 1
- 229940100389 Sulfonylurea Drugs 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- 239000005864 Sulphur Substances 0.000 description 1
- 241000404542 Tanacetum Species 0.000 description 1
- 208000031320 Teratogenesis Diseases 0.000 description 1
- 244000269722 Thea sinensis Species 0.000 description 1
- 235000006468 Thea sinensis Nutrition 0.000 description 1
- 102000002933 Thioredoxin Human genes 0.000 description 1
- 108010006368 Thioredoxin h Proteins 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 241000923617 Tripidium ravennae Species 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- 244000153888 Tung Species 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 102100038573 Uncharacterized protein C19orf48 Human genes 0.000 description 1
- 241000145124 Uniola Species 0.000 description 1
- 235000013419 Uniola paniculata Nutrition 0.000 description 1
- 240000007492 Uniola paniculata Species 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 235000007244 Zea mays Nutrition 0.000 description 1
- 101000662549 Zea mays Sucrose synthase 1 Proteins 0.000 description 1
- HYJODZUSLXOFNC-UHFFFAOYSA-N [S].[Cl] Chemical compound [S].[Cl] HYJODZUSLXOFNC-UHFFFAOYSA-N 0.000 description 1
- 230000036579 abiotic stress Effects 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- FRTNIYVUDIHXPG-UHFFFAOYSA-N acetic acid;ethane-1,2-diamine Chemical class CC(O)=O.CC(O)=O.CC(O)=O.CC(O)=O.NCCN FRTNIYVUDIHXPG-UHFFFAOYSA-N 0.000 description 1
- UELITFHSCLAHKR-UHFFFAOYSA-N acibenzolar-S-methyl Chemical compound CSC(=O)C1=CC=CC2=C1SN=N2 UELITFHSCLAHKR-UHFFFAOYSA-N 0.000 description 1
- KVHISFJMQMSZHQ-UHFFFAOYSA-N acridin-10-ium;chloride;hydrochloride Chemical class Cl.[Cl-].C1=CC=CC2=CC3=CC=CC=C3[NH+]=C21 KVHISFJMQMSZHQ-UHFFFAOYSA-N 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 229960003767 alanine Drugs 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- PPQRONHOSHZGFQ-LMVFSUKVSA-N aldehydo-D-ribose 5-phosphate Chemical group OP(=O)(O)OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PPQRONHOSHZGFQ-LMVFSUKVSA-N 0.000 description 1
- 150000001335 aliphatic alkanes Chemical class 0.000 description 1
- 229930013930 alkaloid Natural products 0.000 description 1
- 150000003797 alkaloid derivatives Chemical class 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 229910021502 aluminium hydroxide Inorganic materials 0.000 description 1
- 235000002783 ambrette Nutrition 0.000 description 1
- 244000096712 ambrette Species 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 235000019568 aromas Nutrition 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 1
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 101150036080 at gene Proteins 0.000 description 1
- MXWJVTOOROXGIU-UHFFFAOYSA-N atrazine Chemical compound CCNC1=NC(Cl)=NC(NC(C)C)=N1 MXWJVTOOROXGIU-UHFFFAOYSA-N 0.000 description 1
- 229960000190 bacillus calmette–guérin vaccine Drugs 0.000 description 1
- 239000011425 bamboo Substances 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 239000003139 biocide Substances 0.000 description 1
- QKSKPIVNLNLAAV-UHFFFAOYSA-N bis(2-chloroethyl) sulfide Chemical compound ClCCSCCCl QKSKPIVNLNLAAV-UHFFFAOYSA-N 0.000 description 1
- 239000007844 bleaching agent Substances 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 229960002092 busulfan Drugs 0.000 description 1
- 229910052793 cadmium Inorganic materials 0.000 description 1
- BDOSMKKIYDKNTQ-UHFFFAOYSA-N cadmium atom Chemical compound [Cd] BDOSMKKIYDKNTQ-UHFFFAOYSA-N 0.000 description 1
- 101150039352 can gene Proteins 0.000 description 1
- 230000000711 cancerogenic effect Effects 0.000 description 1
- 125000002837 carbocyclic group Chemical group 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 231100000315 carcinogenic Toxicity 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 239000002962 chemical mutagen Substances 0.000 description 1
- JCKYGMPEJWAADB-UHFFFAOYSA-N chlorambucil Chemical compound OC(=O)CCCC1=CC=C(N(CCCl)CCCl)C=C1 JCKYGMPEJWAADB-UHFFFAOYSA-N 0.000 description 1
- 229960004630 chlorambucil Drugs 0.000 description 1
- 229930002875 chlorophyll Natural products 0.000 description 1
- 235000019804 chlorophyll Nutrition 0.000 description 1
- ATNHDLDRLWWWCB-AENOIHSZSA-M chlorophyll a Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 ATNHDLDRLWWWCB-AENOIHSZSA-M 0.000 description 1
- 210000003483 chromatin Anatomy 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 230000008711 chromosomal rearrangement Effects 0.000 description 1
- LOUPRKONTZGTKE-UHFFFAOYSA-N cinchonine Natural products C1C(C(C2)C=C)CCN2C1C(O)C1=CC=NC2=CC=C(OC)C=C21 LOUPRKONTZGTKE-UHFFFAOYSA-N 0.000 description 1
- 235000015165 citric acid Nutrition 0.000 description 1
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid group Chemical class C(CC(O)(C(=O)O)CC(=O)O)(=O)O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000000701 coagulant Substances 0.000 description 1
- 235000008957 cocaer Nutrition 0.000 description 1
- ZPUCINDJVBIVPJ-LJISPDSOSA-N cocaine Chemical compound O([C@H]1C[C@@H]2CC[C@@H](N2C)[C@H]1C(=O)OC)C(=O)C1=CC=CC=C1 ZPUCINDJVBIVPJ-LJISPDSOSA-N 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000001143 conditioned effect Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- 239000008358 core component Substances 0.000 description 1
- 239000006071 cream Substances 0.000 description 1
- 230000037029 cross reaction Effects 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 230000002559 cytogenic effect Effects 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000000151 deposition Methods 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- QDOXWKRWXJOMAK-UHFFFAOYSA-N dichromium trioxide Chemical compound O=[Cr]O[Cr]=O QDOXWKRWXJOMAK-UHFFFAOYSA-N 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-K dioxido-sulfanylidene-sulfido-$l^{5}-phosphane Chemical compound [O-]P([O-])([S-])=S NAGJZTKCGNOGPW-UHFFFAOYSA-K 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-N dithiophosphoric acid Chemical compound OP(O)(S)=S NAGJZTKCGNOGPW-UHFFFAOYSA-N 0.000 description 1
- 229940000406 drug candidate Drugs 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 239000012149 elution buffer Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000004744 fabric Substances 0.000 description 1
- 230000035558 fertility Effects 0.000 description 1
- 230000004720 fertilization Effects 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 235000004426 flaxseed Nutrition 0.000 description 1
- 230000008124 floral development Effects 0.000 description 1
- 239000006260 foam Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 239000004459 forage Substances 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 102000054766 genetic haplotypes Human genes 0.000 description 1
- 238000003144 genetic modification method Methods 0.000 description 1
- 210000004602 germ cell Anatomy 0.000 description 1
- 235000007919 giant pumpkin Nutrition 0.000 description 1
- 230000000762 glandular Effects 0.000 description 1
- 239000003292 glue Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 235000004554 glutamine Nutrition 0.000 description 1
- 229960002743 glutamine Drugs 0.000 description 1
- 229960003180 glutathione Drugs 0.000 description 1
- 235000002532 grape seed extract Nutrition 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 210000004209 hair Anatomy 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 239000000185 hemagglutinin Substances 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 238000004191 hydrophobic interaction chromatography Methods 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 238000012744 immunostaining Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000008595 infiltration Effects 0.000 description 1
- 238000001764 infiltration Methods 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- JEIPFZHSYJVQDO-UHFFFAOYSA-N iron(III) oxide Inorganic materials O=[Fe]O[Fe]=O JEIPFZHSYJVQDO-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 239000010977 jade Substances 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical class O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 235000021374 legumes Nutrition 0.000 description 1
- 229960003136 leucine Drugs 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 210000002540 macrophage Anatomy 0.000 description 1
- 235000005739 manihot Nutrition 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- HAWPXGHAZFHHAD-UHFFFAOYSA-N mechlorethamine Chemical compound ClCCN(C)CCCl HAWPXGHAZFHHAD-UHFFFAOYSA-N 0.000 description 1
- JDSHMPZPIAZGSV-UHFFFAOYSA-N melamine Chemical compound NC1=NC(N)=NC(N)=N1 JDSHMPZPIAZGSV-UHFFFAOYSA-N 0.000 description 1
- SGDBTWWWUNNDEQ-LBPRGKRZSA-N melphalan Chemical compound OC(=O)[C@@H](N)CC1=CC=C(N(CCCl)CCCl)C=C1 SGDBTWWWUNNDEQ-LBPRGKRZSA-N 0.000 description 1
- 229960001924 melphalan Drugs 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 210000000473 mesophyll cell Anatomy 0.000 description 1
- 230000007102 metabolic function Effects 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- RXMBKOPBFXCPDD-UHFFFAOYSA-N methoxyphosphonamidous acid Chemical compound COP(N)O RXMBKOPBFXCPDD-UHFFFAOYSA-N 0.000 description 1
- MBABOKRGFJTBAE-UHFFFAOYSA-N methyl methanesulfonate Chemical compound COS(C)(=O)=O MBABOKRGFJTBAE-UHFFFAOYSA-N 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 239000003147 molecular marker Substances 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 235000010460 mustard Nutrition 0.000 description 1
- 229940087004 mustargen Drugs 0.000 description 1
- 239000000618 nitrogen fertilizer Substances 0.000 description 1
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 1
- 150000004005 nitrosamines Chemical class 0.000 description 1
- 108091027963 non-coding RNA Proteins 0.000 description 1
- 102000042567 non-coding RNA Human genes 0.000 description 1
- 230000006780 non-homologous end joining Effects 0.000 description 1
- 230000002352 nonmutagenic effect Effects 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 230000031787 nutrient reservoir activity Effects 0.000 description 1
- 238000010397 one-hybrid screening Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 150000002894 organic compounds Chemical class 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 235000006502 papoula Nutrition 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- LWTDZKXXJRRKDG-UHFFFAOYSA-N phaseollin Natural products C1OC2=CC(O)=CC=C2C2C1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-UHFFFAOYSA-N 0.000 description 1
- 238000012247 phenotypical assay Methods 0.000 description 1
- 229960005190 phenylalanine Drugs 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 235000008729 phenylalanine Nutrition 0.000 description 1
- 230000000243 photosynthetic effect Effects 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 239000001739 pinus spp. Substances 0.000 description 1
- 230000008635 plant growth Effects 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920000447 polyanionic polymer Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 230000023603 positive regulation of transcription initiation, DNA-dependent Effects 0.000 description 1
- 101150063097 ppdK gene Proteins 0.000 description 1
- CPTBDICYNRMXFX-UHFFFAOYSA-N procarbazine Chemical compound CNNCC1=CC=C(C(=O)NC(C)C)C=C1 CPTBDICYNRMXFX-UHFFFAOYSA-N 0.000 description 1
- 229960000624 procarbazine Drugs 0.000 description 1
- ZBAFFZBKCMWUHM-UHFFFAOYSA-N propiram Chemical compound C=1C=CC=NC=1N(C(=O)CC)C(C)CN1CCCCC1 ZBAFFZBKCMWUHM-UHFFFAOYSA-N 0.000 description 1
- 229950003779 propiram Drugs 0.000 description 1
- 230000004224 protection Effects 0.000 description 1
- 235000021251 pulses Nutrition 0.000 description 1
- 238000004080 punching Methods 0.000 description 1
- 238000010926 purge Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000012797 qualification Methods 0.000 description 1
- 229960000948 quinine Drugs 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 238000007634 remodeling Methods 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 230000005070 ripening Effects 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 230000008117 seed development Effects 0.000 description 1
- 230000009758 senescence Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 229960001153 serine Drugs 0.000 description 1
- 235000004400 serine Nutrition 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 230000001568 sexual effect Effects 0.000 description 1
- 239000002002 slurry Substances 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 235000011008 sodium phosphates Nutrition 0.000 description 1
- NTHWMYGWWRZVTN-UHFFFAOYSA-N sodium silicate Chemical compound [Na+].[Na+].[O-][Si]([O-])=O NTHWMYGWWRZVTN-UHFFFAOYSA-N 0.000 description 1
- 230000037439 somatic mutation Effects 0.000 description 1
- 108010048090 soybean lectin Proteins 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000005728 strengthening Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 150000005846 sugar alcohols Polymers 0.000 description 1
- YROXIXLRRCOBKF-UHFFFAOYSA-N sulfonylurea Chemical class OC(=N)N=S(=O)=O YROXIXLRRCOBKF-UHFFFAOYSA-N 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 238000000672 surface-enhanced laser desorption--ionisation Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 108060008226 thioredoxin Proteins 0.000 description 1
- 229940094937 thioredoxin Drugs 0.000 description 1
- 229960002898 threonine Drugs 0.000 description 1
- 235000008521 threonine Nutrition 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 229910052723 transition metal Inorganic materials 0.000 description 1
- 150000003624 transition metals Chemical class 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical class [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- 238000010396 two-hybrid screening Methods 0.000 description 1
- 229960004441 tyrosine Drugs 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 235000002374 tyrosine Nutrition 0.000 description 1
- 235000018322 upland cotton Nutrition 0.000 description 1
- 229960004295 valine Drugs 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- OGWKCGZFUXNPDA-XQKSVPLYSA-N vincristine Chemical compound C([N@]1C[C@@H](C[C@]2(C(=O)OC)C=3C(=CC4=C([C@]56[C@H]([C@@]([C@H](OC(C)=O)[C@]7(CC)C=CCN([C@H]67)CC5)(O)C(=O)OC)N4C=O)C=3)OC)C[C@@](C1)(O)CC)CC1=C2NC2=CC=CC=C12 OGWKCGZFUXNPDA-XQKSVPLYSA-N 0.000 description 1
- 229960004528 vincristine Drugs 0.000 description 1
- OGWKCGZFUXNPDA-UHFFFAOYSA-N vincristine Natural products C1C(CC)(O)CC(CC2(C(=O)OC)C=3C(=CC4=C(C56C(C(C(OC(C)=O)C7(CC)C=CCN(C67)CC5)(O)C(=O)OC)N4C=O)C=3)OC)CN1CCC1=C2NC2=CC=CC=C12 OGWKCGZFUXNPDA-UHFFFAOYSA-N 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 230000010148 water-pollination Effects 0.000 description 1
- 210000002268 wool Anatomy 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A24—TOBACCO; CIGARS; CIGARETTES; SIMULATED SMOKING DEVICES; SMOKERS' REQUISITES
- A24B—MANUFACTURE OR PREPARATION OF TOBACCO FOR SMOKING OR CHEWING; TOBACCO; SNUFF
- A24B3/00—Preparing tobacco in the factory
- A24B3/12—Steaming, curing, or flavouring tobacco
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8213—Targeted insertion of genes into the plant genome by homologous recombination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1025—Acyltransferases (2.3)
- C12N9/1029—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1025—Acyltransferases (2.3)
- C12N9/104—Aminoacyltransferases (2.3.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/63—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y203/00—Acyltransferases (2.3)
- C12Y203/01—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
- C12Y203/01091—Sinapoylglucose--choline O-sinapoyltransferase (2.3.1.91)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y203/00—Acyltransferases (2.3)
- C12Y203/02—Aminoacyltransferases (2.3.2)
- C12Y203/02002—Gamma-glutamyltransferase (2.3.2.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/21—Serine endopeptidases (3.4.21)
- C12Y304/21107—Peptidase Do (3.4.21.107)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/21—Serine endopeptidases (3.4.21)
- C12Y304/21112—Site-1 protease (3.4.21.112), i.e. subtilisin kexin isozyme-1
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/22—Cysteine endopeptidases (3.4.22)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/22—Cysteine endopeptidases (3.4.22)
- C12Y304/22002—Papain (3.4.22.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/23—Aspartic endopeptidases (3.4.23)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y304/00—Hydrolases acting on peptide bonds, i.e. peptidases (3.4)
- C12Y304/24—Metalloendopeptidases (3.4.24)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Botany (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Manufacture Of Tobacco Products (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Physiology (AREA)
- Developmental Biology & Embryology (AREA)
- Environmental Sciences (AREA)
Abstract
本发明提供蛋白酶基因,其在烟草植物材料的烘烤期间以特定方式经调节且其影响烘烤的烟草的风味。
Description
技术领域
本发明涉及使用在烟草中表达的蛋白酶来改变所烘烤烟草产品的特征。具体来说,本发明提供通过调节一种或多种烟草蛋白酶基因的表达来改变烟草叶的烘烤以及调节烟草叶组合物的方法。
背景技术
烟草烘烤是带出每个烟草品种的香气和风味的物理和生物化学变化过程。在烟草已收获之后,有必要在消费之前将其烘烤且接着使其老化以改进其风味。有四种常见烘烤方法,且所使用方法取决于烟草类型和其既定用途。
空气烘烤烟草在充分通风室中遮起来免受风和太阳,在充分通风室中其空气干燥六到八周。空气烘烤烟草的糖较低(此给予烟草烟草烟雾淡、甜风味)且烟碱较高。雪茄和白肋烟草是空气烘烤的。
在火烘烤中,来自低燃火的烟雾渗透叶子。此给予叶子独特的烟熏的香气和风味。火烘烤需要三到十周且产生糖较低且烟碱较高的烟草。烟斗烟草、嚼烟和鼻烟是火烘烤的。
烟道烘烤的烟草保持在密封加热区中,但其并非直接暴露于烟雾。此方法产生糖较高且烟碱水平中等到较高的香烟烟草。其为最快和烘烤方法,需要约一周。已经烟道烘烤的弗吉尼亚烟草也称为浅色烟,因为烟道烘烤使其叶子变成金色、橙色或黄色。
阳光烘烤的烟草在阳光中无覆盖的干燥。此方法在土尔其、希腊和其它地中海国家用于生产东方烟草。阳光烘烤的烟草的糖和烟碱较低且用于香烟。
烘烤在烟草叶中产生各种化合物,这些化合物给予烟草特殊的风味和口味,如甜干草、茶、玫瑰油或水果芳香风味。
在烘烤的第一阶段(对应于所谓的黄化阶段且也称为颜色烘烤)期间,叶绿素含量降低。此阶段需要2到8天,取决于烟草类型。在此阶段期间,叶子代谢活性大幅改变。不仅叶绿素降解,而且例如淀粉和蛋白也降解。
迄今为止,已提出的改变烘烤工艺的唯一方法是基于改变烟草在所选烘烤程序中暴露的实际条件。关于烟草在烘烤期间的基因表达已知极少,且此外很少有关于蛋白酶在烟草叶和其所得产品中的活性的数据报导。
我们已在三种主要烟草类型(白肋、弗吉尼亚和东方)中鉴定出80种在叶子烘烤期间激活的蛋白酶基因。我们已发现特殊蛋白酶表达与烟草中的特定风味概况相关。
发明内容
在空气烘烤时的白肋烟草、烟道烘烤时的弗吉尼亚烟草和阳光烘烤时的东方烟草中鉴定出上调的80种蛋白酶基因(SEQ ID NO:1-80)。图2和表1和表2中概述关于在一种或多种不同烟草类型中的此类上调的细节。
此类基因序列和其调节序列可用于调节或改变在烘烤期间的蛋白酶活性。多核苷酸序列SEQ ID NO:1-80包括外显子和内含子序列。涉及多核苷酸序列SEQ ID NO:1-80的编码序列部分的蛋白质序列描绘在SEQ ID NO:81-160中。
因此,提供一种突变型、非天然存在的或转基因烟草植物细胞,其包含:
(i)多核苷酸,其包括编码功能性蛋白酶的序列、由编码功能性蛋白酶的序列组成或基本上由编码功能性蛋白酶的序列组成,且与SEQ ID NO:1到SEQ ID No:80中的任一个具有至少95%的序列同一性;
(ii)由(i)中所示的多核苷酸编码的多肽;
(iii)多肽,其包括编码蛋白酶的序列、由编码蛋白酶的序列组成或基本上由编码蛋白酶的序列组成,且与SEQ ID NO:81到SEQ ID No:160具有至少95%的序列同一性;或
(iv)包含(i)中所示的经分离的多核苷酸的构建体、载体或表达载体,
且其中与所述蛋白酶的表达或活性未改变的对照烟草植物细胞相比,所述蛋白酶的表达或活性得以调节。
在烘烤工艺期间蛋白酶表达在烟草细胞中的改变赋予所烘烤烟草和由其制造的产品以不同的风味。下文进一步论述不同基因对不同烟草风味概况的影响。
在实施方式中,所述蛋白酶的表达或活性与对照烟草植物细胞相比上调。然而,在某些实施方式中,所述蛋白酶的表达或活性与对照烟草植物细胞相比下调。在再其它实施方式中,至少一种蛋白酶可与至少一种蛋白酶在同一细胞中下调同时地上调。
在一个示例性实施方式中,因此,提供一种根据前述权利要求中任一项所述的突变型、非天然存在的或转基因烟草植物细胞,其中选自以下各项的蛋白酶的表达或活性经调节:
SEQ ID NO:1到16中的至少一者;或
SEQ ID NO:30到41中的至少一者;或
SEQ ID NO:17到22中的至少一者;或
SEQ ID NO:42到44中的至少一者;或
SEQ ID NO:45到61中的至少一者;或
SEQ ID NO:62到80中的至少一者;或
SEQ ID NO:23到29中的至少一者。
在一个具体实施方式中,提供一种根据权利要求4所述的突变型、非天然存在的或转基因烟草植物细胞,其中选自SEQ ID NO:30到41中的至少一者的蛋白酶的表达或活性在东方型烟草中经调节。
在一个具体实施方式中,提供根据权利要求4所述的突变型、非天然存在的或转基因烟草植物细胞,其中选自SEQ ID NO:17到22的蛋白酶的表达或活性在弗吉尼亚型烟草中经调节。
在一个具体实施方式中,提供一种根据权利要求4所述的突变型、非天然存在的或转基因烟草植物细胞,其中选自SEQ ID NO:42到44中的至少一者的蛋白酶的表达或活性在白肋型烟草中经调节。
在一个具体实施方式中,提供一种根据权利要求4所述的突变型、非天然存在的或转基因烟草植物细胞,其中选自SEQ ID NO:45到61中的至少一者的蛋白酶的表达或活性在弗吉尼亚或东方型烟草中经调节。
在一个具体实施方式中,提供一种根据权利要求4所述的非天然存在的或转基因烟草植物细胞,其中选自SEQ ID NO:62到80中的至少一者的蛋白酶的表达或活性在白肋或东方型烟草中经调节。
在一个具体实施方式中,提供一种根据权利要求4所述的突变型、非天然存在的或转基因烟草植物细胞,其中选自SEQ ID NO:23到29中的至少一者的蛋白酶的表达或活性在白肋或弗吉尼亚型烟草中经调节。
突变型、非天然存在的或转基因烟草植物细胞可为烟草植物细胞,其中所述突变是杂合或纯合突变。
在本发明的实施方式中,一种或多种蛋白酶的表达增加约10%到约1000%,例如增加至少10%、至少20%、至少25%、至少50%、至少100%、至少200%、至少500%、至少750%或高达1000%。
在第二方面中,提供一种突变型、非天然存在的或转基因植物或其组分或部分,其包含根据本发明的前述方面的植物细胞。
在第三方面中,提供植物材料,其包含来自本发明的第二方面的植物的生物质、种子、茎、花或叶。
在第四方面中,提供制备具有经调节水平的蛋白酶的烟草植物的方法,所述方法包括以下步骤:
(a)提供植物,所述植物包括(i)多核苷酸,其包括编码功能性蛋白酶的序列、由编码功能性蛋白酶的序列组成或基本上由编码功能性蛋白酶的序列组成,且与SEQ ID NO:1到SEQ ID No:80中的至少一个具有至少95%的序列同一性;
(b)将一个或多个突变插入至所述烟草植物的所述多核苷酸中以产生突变型烟草植物;以及
(c)烘烤所述烟草植物材料。
在一些实施方式中,步骤(b)中的烟草植物为突变型烟草植物,优选地,其中所述突变型烟草植物在一个或多个其它序列中包括一个或多个突变,所述一个或多个其它序列编码功能性蛋白酶且与SEQ ID NO:1到SEQ ID No:80中的至少一者具有至少95%序列同一性。因此,可构建其中一个或多个细胞包括多个突变型蛋白酶的植物。
包括经调节蛋白酶表达或活性的突变型细胞在烘烤工艺期间赋予烟草叶以不同风味概况。通过在另一种烟草类型中复制一种烟草类型的叶子化学性质,可将风味特征转移到通常不具有那些特征的烟草类型中。
在实施方式中,烟草植物的细胞的基因组利用基因组编辑技术或基因组改造技术修饰,所述技术选自CRISPR/Cas技术、锌指核酸酶介导的诱变、化学或辐射诱变、同源重组、寡核苷酸引导的诱变和大范围核酸酶介导的诱变。
因此,在另一方面中,提供一种制造与对照植物材料相比风味概况改变的烘烤的植物材料、优选地烘烤的叶或花的方法,所述方法包括以下步骤:
(a)提供根据本发明的前述方面的植物或植物材料;
(b)任选地自其收获植物材料;及
(c)烘烤所述植物材料一段时间,使得至少一种蛋白酶的水平与对照经烘烤植物材料相比经调节。
在再其它方面中,提供一种
(i)多核苷酸,其包括编码功能性蛋白酶的序列、由编码功能性蛋白酶的序列组成或基本上由编码功能性蛋白酶的序列组成,且与SEQ ID NO:1到SEQ ID No:80中的任一个具有至少95%的序列同一性;
(ii)由(i)中所示的多核苷酸编码的多肽;
(iii)多肽,其包括编码蛋白酶的序列、由编码蛋白酶的序列组成或基本上由编码蛋白酶的序列组成,且与SEQ ID NO:81到SEQ ID No:160具有至少95%的序列同一性;或
(iv)包含(i)中所示的经分离的多核苷酸的构建体、载体或表达载体,
的用途,其用于在烟草烘烤程序期间调节烟草中的一种或多种蛋白酶的表达或活性。
根据本发明的这一方面的烘烤程序可选自由以下组成的群组:空气烘烤、火烘烤、烟雾烘烤和烟道烘烤。
蛋白酶活性在烘烤期间的改变或调节可经由(进一步)上调或下调进行。改变或调节可使用例如(至少)在此类烘烤期间活跃的特定启动子序列经由遗传改造进行。调节也可经由例如此类序列和/或其调节区的如以上所要求的诱变进行,引起由此编码的蛋白酶活性在相应的烘烤条件下的上调或下调或完全基因敲除。
在另一实施方式中,提供16种基因序列SEQ ID NO:1到16(参见表2)中的至少一者以及包含以下、由以下组成或基本上由以下组成的序列的用途:与所述16种序列中的一者或多者具有至少95%序列同一性的序列,其在烘烤期间在所有三种类型的烟草中上调,用于改变烘烤的烟草的风味。
在另一实施方式中,提供12种基因序列SEQ ID NO:30到41中的至少一者以及包含以下、由以下组成或基本上由以下组成的序列在东方型烟草中的用途:与所述12种序列中的一者或多者具有至少95%序列同一性的序列,其在空气烘烤的白肋和烟道烘烤的弗吉尼亚中均上调,以在烘烤期间改变所述烟草的风味。
在另一实施方式中,提供6种基因序列SEQ ID NO:17到22中的至少一者以及包含以下、由以下组成或基本上由以下组成的序列在弗吉尼亚型烟草中的用途:与所述6种序列中的一者或多者具有至少95%序列同一性的序列,其在空气烘烤的白肋和阳光烘烤的东方中均上调,以在烘烤期间改变所述烟草的风味。
在另一实施方式中,提供3种基因序列SEQ ID NO:42到44中的至少一者以及包含以下、由以下组成或基本上由以下组成的序列的用途:与所述3种序列中的一者或多者具有至少95%序列同一性的序列,其在烟道烘烤的弗吉尼亚和阳光烘烤的东方烟草中均上调,以在烘烤期间改变白肋型烟草的风味。
在另一实施方式中,提供17种基因序列SEQ ID NO:45到61中的至少一者以及包含以下、由以下组成或基本上由以下组成的序列的用途:与所述17种序列中的一者或多者具有至少95%序列同一性的序列,其独特地在空气烘烤的白肋中上调,以在烘烤期间改变弗吉尼亚或东方型烟草的风味。
在另一实施方式中,提供19种基因序列SEQ ID NO:62到80中的至少一者以及包含以下、由以下组成或基本上由以下组成的序列的用途:与所述19种序列中的一者或多者具有至少95%序列同一性的序列,其独特地在烟道烘烤的弗吉尼亚中上调,以在烘烤期间改变白肋或东方型烟草的风味。
在另一实施方式中,提供7种基因序列SEQ ID NO:23到29中的至少一者以及包含以下、由以下组成或基本上由以下组成的序列的用途:与所述7种序列中的一者或多者具有至少95%序列同一性的序列,其独特地在阳光烘烤的东方中上调,以在烘烤期间改变白肋或弗吉尼亚型烟草的风味。
由于某些基因序列仅在三种烟草类型(如根据烟草类型和烘烤方法定义)中的一种或两种中上调,故某些基因序列潜在地可用于在烘烤期间改变或调节蛋白酶活性,使得相对于所获得的烟草叶细胞的叶子化学性质(例如细胞的代谢物含量)和特性的结果得以改变,使得例如空气烘烤的白肋烟草在烘烤时获得烟道烘烤的弗吉尼亚型烟草或阳光烘烤的东方烟草的某些特征。此例如可通过调节在一种或两种烟草类型中上调且不在其它烟草中上调的基因序列中的一者或多者的表达来进行。举例来说,在烘烤期间,17种基因序列在空气烘烤的白肋烟中独特地上调,19种在烟道烘烤的弗吉尼亚中独特地上调,且12种在两种类型烟草中均上调。通过选择性地调节仅在空气烘烤的白肋中现在烟道烘烤的弗吉尼亚中上调的19种基因序列中的一者或多者,阳光烘烤的弗吉尼亚烟草的叶细胞组成在烘烤时可朝着更多的白肋型改变。使用遗传改造方法,此可使用例如在目标烟草类型的烘烤条件下活跃的启动子序列实现。因此,所用启动子序列例如为驱动此处列出的基因序列的表达的调节序列。使用诱变、基因组编辑或改造方法,突变型基因序列可在目标烟草类型的烘烤条件下活跃。
在一个例子中,调节序列为突变型,使得下游基因序列在所需烘烤条件下活跃。举例来说,通过选择性地改变或调节在烟道烘烤的弗吉尼亚中在空气烘烤的白肋型烟草中独特地上调的19种序列中的一者或多者的表达,白肋型烟草的叶细胞组成在烘烤时可朝着更多的弗吉尼亚型改变。此外,通过在阳光烘烤的东方烟草中选择性地调节在空气烘烤的白肋和烟道烘烤的弗吉尼亚中均上调的12种序列中的一者或多者的表达,阳光烘烤的东方烟草的叶细胞组成在烘烤时可改变,使得其获得白肋和弗吉尼亚特征。
在一个实施方式中,在SEQ ID No:1-80中列出的基因序列中的一者或包含以下、由以下组成或基本上由以下组成的序列上调:与所列出的序列中的一者或多者具有至少95%序列同一性的序列。在另一实施方式中,多于一个在SEQ ID No:1-80中列出的基因序列或包含以下、由以下组成或基本上由以下组成的序列上调:与此类序列具有至少95%序列同一性的序列。在另一实施方式中,在SEQ ID No:1-80中列出的基因序列中的一者或多者或包含以下、由以下组成或基本上由以下组成的序列下调:与此类列出的序列具有至少95%序列同一性的序列。在另一实施方式中,在SEQ ID No:1-80中列出的序列中的一者或多者或包含以下、由以下组成或基本上由以下组成的序列上调:与此类列出的序列具有至少95%序列同一性的序列,且在SEQ ID No:1-80中列出的一种或多种序列或包含以下、由以下组成或基本上由以下组成的序列下调:与此类列出的序列具有至少95%序列同一性的序列。
由于烘烤条件决定最终叶细胞化学性质,故此类改变或调节影响消费者体验由此类叶材料制备的产品的方式。
因此,本发明还提供根据以上所要求的方法获得的烟草叶和包括此类叶子的产品。此类产品包括(但不限于)包括此类叶材料或由其衍生的材料的嚼烟、烟草梗、由其获得的提取物和其它吸烟物品。
附图说明
图1.CYP82E4(AGD93125.1/GI:444237502)的表达在48小时烘烤之后在三种主要烟草类型(SC,阳光烘烤的;FC,烟道烘烤的;AC,空气烘烤的)中增加。
图2.80种衰老激活的蛋白酶基因的表达在三种主要烟草类型中增加
图3.一种APA1烟草基因(SEQ 68)仅在弗吉尼亚烘烤期间表达。
定义
在本申请的范围内使用的技术术语和表达通常被赋予在植物和分子生物学的相关领域中经常应用于它们的含义。所有以下术语定义适用于本申请的整个内容。词语“包括”不排除其它元件或步骤,且不定冠词“一(a)”或“一(an)”不排除多个/多种。单个步骤可以完成在权利要求中列出的几个特征的功能。在给定数值或范围背景下,术语“约”、“基本上”和“大约”指在给定值或范围20%内,10%内或者5%、4%、3%、2%或1%内的值或范围。
术语“经分离的”指取自其天然环境的任何实体,但该术语不意味着任何纯化程度。
“表达载体”是包括使得核酸能够表达的核酸组分的组合的核酸媒介物。合适的表达载体包括能够染色体外复制的附加体,例如环状双链核酸质粒;线性化的双链核酸质粒;以及其它具有任何起源的功能等价表达载体。如下文定义的,表达载体包含置于核酸、核酸构建体或核酸缀合物上游,且与核酸、核酸构建体或核酸缀合物可操作地连接的至少一个启动子。
术语“构建体”指包含一种或多种多核苷酸的双链重组核酸片段。构建体包括与互补“有义链或编码链”碱基配对的“模板链”。给定构建体可在两个可能定向上插入载体内,所述两个可能定向是就置于载体(例如表达载体)内的启动子定向而言的相同(或有义)定向或相反(或反义)定向。
“载体”指核酸媒介物,其包含允许核酸、核酸构建体和核酸缀合物等等转运的核酸组分组合。合适的载体包括能够染色体外复制的附加体,例如环状双链核酸质粒;线性化的双链核酸质粒;以及其它具有任何起源的载体。
“启动子”指通常置于双链DNA片段上游,且与双链DNA片段可操作地连接的核酸元件/序列。启动子可完全源自接近天然目的基因的区域,或可由源自不同天然启动子的不同元件或合成DNA区段组成。
术语“同源性、同一性或相似性”指通过序列比对比较的两个多肽或两个核酸分子之间的序列相似性程度。被比较的两个不连续核酸序列之间的同源性程度是在可比较位置处的相同或匹配核苷酸数目的函数。同一性百分比可通过目视检查和数学计算进行确定。作为另外一种选择,可通过使用计算机程序(例如ClustalW、BLAST、FASTA或Smith-Waterman)比较序列信息,确定两个核酸序列的同一性百分比。
“变体”是指大体上相似的序列。变体可与野生型序列具有相似的功能或大体上相似的功能。对于蛋白酶,相似功能为在相同条件下野生型酶功能的至少约50%、60%、70%、80%或90%。对于蛋白酶,大体上相似的功能为在相同条件下野生型酶功能的至少约90%、95%、96%、97%、98%或99%。举例来说,野生型蛋白酶序列阐述在SEQ ID No:81-160中。所述变体可具有一个或多个突变,所述突变使得所述酶与野生型蛋白酶相比具有降低水平的蛋白酶活性。所述变体可具有一个或多个突变,所述突变使得其蛋白酶活性被基因敲除(亦即,100%抑制且因此非功能性多肽)。变体也可具有增加的活性,产生更活跃的蛋白酶酶类功能。
术语“植物”指处于其生命周期或发育的任何阶段的任何植物或植物的部分以及其后代。在一个实施方式中,植物是“烟草植物”,这指属于烟草属(Nicotiana)的植物。优选烟草植物种类在本文中描述。
“植物部分”包含植物细胞、植物原生质体、从其可再生完整植物的植物细胞组织培养物、植物愈伤组织、植物团块以及植物或植物的部分中完整的植物细胞,所述植物的部分例如胚、花粉、花药、胚珠、种子、叶、花、茎、枝、果实、根、根尖及其类似物。经再生植物的后代、变体和突变体也包含在本发明的范畴内,限制条件为其包括本文中所描述的经引入多核苷酸。
“植物细胞”指植物的结构和生理单位。植物细胞可采取不含细胞壁的原生质体、经分离的单细胞或培养细胞的形式,或作为更高等组构单位的一部分,例如但不限于植物组织、植物器官或全植物。
术语“植物材料”指可得自植物的任何固体、液体或气体组合物或其组合,包括生物质、叶、茎、根、花或花的部分、果实、花粉、卵细胞、合子、种子、插条、分泌物、提取物、细胞或组织培养物、或任何其它植物部分或产物。在一个实施方式中、植物材料包含生物质、茎、种子或叶,或者由生物质、茎、种子或叶组成。在另一个实施方式中,植物材料包含叶或由叶组成。
术语“品种”指共享恒定特征的植物群体,所述恒定特征使其与相同物种的其它植物分开。虽然具有一种或多种独特性状,但品种的特征还在于该品种内的个体之间的极少总体变化。品种通常在商业上出售。
烟草的“类型”由来源和烘烤方法界定。烟道烘烤的烟草(其占全球产量的40%)也称为“浅色”和“弗吉尼亚”烟草。其几乎完全用于香烟掺合物。部分较重叶子可用于烟斗吸烟的混合物中。一些英国香烟为100%烟道烘烤的。烟道烘烤的叶子的特征在于较高糖:氮比率。此比率通过挑选处于成熟晚期的叶子以及通过允许叶子中出现某些化学变化的独特烘烤工艺增强。烘烤的叶子的颜色在柠檬色到橙色到赤褐色范围内变化。
白肋为来源于白肋(White Burley)的淡空气烘烤类型,其在1864年的俄亥俄州农场中作为突变型出现。白肋主要用于香烟掺合物。部分较重叶子用于烟斗掺合物且也用于咀嚼。
烘烤的白肋叶的特征在于较低糖含量和极低糖与氮比率(较高烟碱)。此通过高氮肥料、在衰老早期收获以及允许可出现任何糖的氧化的空气烘烤工艺增强。
马里兰(Maryland)为另一淡空气烘烤类型。其在一定程度上用于美国掺合香烟且在较高程度上用于某些瑞士香烟掺合物。
马里兰烟草极其蓬松,具有良好燃烧特性、较低烟碱和中性香味。
深型空气烘烤的烟草涵盖多种主要用于咀嚼、鼻烟、雪茄和烟斗掺合物的类型。大部分世界产量限于热带。
东方烟草提供具有极特征性香味的温和烟雾。由腺毛(毛状体)渗出的树脂、蜡和胶提供香味。烟碱较低,平均约1.0%。
深型烧制烟草用于制造鼻烟、嚼烟和烟斗掺合物。深型烧制叶在烘烤早期期间经历来自闷燃木的烟雾。所用木材的类型在决定口味和生长中极为重要。烘烤的叶子的颜色极深,且体型长且重。
术语“调节”可指降低、抑制、增加或以其它方式影响多肽的表达或活性。该术语还可指降低、抑制、增加或以其它方式影响编码多肽的基因的活性,所述影响活性可包括但不限于调节转录活性。
如本文使用的,术语“降低”或“降低的”指数量或活性约10%至约99%的降低,或者至少10%、至少20%、至少25%、至少30%、至少40%、至少50%、至少60%、至少70%、至少75%、至少80%、至少90%、至少95%、至少98%、至少99%或至少100%或更多的降低,所述活性诸如(但不限于)多肽活性、转录活性和蛋白质表达。
如本文使用的,术语“抑制”或“抑制的”指数量或活性约98%至约100%的降低,或者至少98%、至少99%但特别是100%的降低,所述活性例如但不限于多肽活性、转录活性和蛋白质表达。
如本文使用的,术语“增加”或“增加的”指数量或活性约5%至约99%的增加,或者至少5%、至少10%、至少20%、至少25%、至少30%、至少40%、至少50%、至少60%、至少70%、至少75%、至少80%、至少90%、至少95%、至少98%、至少99%、至少100%、至少500%或至少1000%或更多的增加,所述活性例如但不限于多肽活性、转录活性和蛋白质表达。
在对照植物背景下的术语“对照”意指这样的植物或植物细胞,其中酶的表达或活性未进行修饰(例如,增加或降低的),并且因此它可提供与其中酶的表达或活性已进行修饰的植物的比较。对照植物可包含空载体。对照植物或植物细胞可对应于野生型植物或野生型植物细胞。举例来说,对照植物或植物细胞可与用于产生标题植物的基因改变的起始材料属于同一基因型。在所有此等案例中,出于比较目的,使用相同的方案来培养和收获标题植物和对照植物。本文中所描述的基因或多肽的水平、比例、活性或分布的改变,或烟草植物表型的改变,确切地说蛋白酶产生的降低,可通过比较标题植物与对照植物来测量,其中使用相同的方案来培养、收获及烘烤标题植物和对照植物。对照植物可提供用于测量标题植物的表型改变的参照点。对表型的改变的测量可在任何时候在植物中测量,包含在植物发育、衰老期间或优选地在烘烤后。对表型的改变的测量可在在任何条件下生长的植物中测量,包含在生长室、温室或在野外生长的植物。表型的变化可通过测定本文在SEQ IDNo 81-160中鉴定的蛋白酶的表达或活性来测量。
具体实施方式
在一个实施方式中,提供一种分离的多核苷酸,其包含以下成分、由以下成分组成或基本上由以下成分组成:与本文描述的序列中的任一者,包含序列表中所展示的多核苷酸中的任一者,具有至少95%序列同一性的多核苷酸序列。适当地,所述分离的多核苷酸包括以下成分、由以下成分组成或基本上由以下成分组成:与之具有至少95%、96%、97%、98%、99%或100%序列同一性的序列。
适当地,本文所描述的多核苷酸编码具有蛋白酶活性的蛋白质,所述活性为阐述于SEQ ID NO:81-160中的蛋白质活性的至少约50%、60%、70%、80%、90%、95%、96%、97%、98%、99%、100%或更多。
如本文描述的多核苷酸可包括核苷酸聚合物,其可以是未经修饰或经修饰的脱氧核糖核酸(DNA)或核糖核酸(RNA)。相应地,多核苷酸可以是(无限制地)基因组DNA、互补DNA(cDNA)、mRNA、或反义RNA或其一个或多个片段。此外,多核苷酸可以是单链或双链DNA,其为单链和双链区的混合物的DNA,包含DNA和RNA的杂交物分子,或具有单链和双链区的混合物的杂交物分子,或其一个或多个片段。另外,多核苷酸可由包含DNA、RNA或两者的三链区或者其一个或多个片段组成。多核苷酸可含有一个或多个经修饰的碱基,例如硫代磷酸酯,并且可以是肽核酸。一般地,多核苷酸可由经分离或克隆的cDNA片段、基因组DNA、寡核苷酸或个别核苷酸或前述的组合装配。尽管本文描述的多核苷酸序列显示为DNA序列,但序列包括其相应的RNA序列,及其互补(例如完全互补的)DNA或RNA序列,包括其反向互补体。
如本文描述的多核苷酸一般含有磷酸二酯键,尽管在一些情况下,包括多核苷酸类似物,其可具有替代主链,包含例如氨基磷酸酯、硫代磷酸酯、二硫代磷酸酯、或O-甲基亚磷酰胺(O-methylphophoroamidite)键;以及肽多核苷酸主链和键。其它类似多核苷酸包括具有阳性主链;非离子主链和非核糖主链的那些。磷酸核糖主链的修饰可出于多种原因而完成,例如以增加此类分子在生理环境中的稳定性和半衰期,或作为生物芯片上的探针。可制备天然存在的多核苷酸和类似物的混合物;作为另外一种选择,可制备不同多核苷酸类似物的混合物,以及天然存在的多核苷酸和类似物的混合物。
多种多核苷酸类似物是已知的,包括例如氨基磷酸酯、硫代磷酸酯、二硫代磷酸酯、O-甲基亚磷酰胺键以及肽多核苷酸主链和键。其它类似多核苷酸包括具有阳性主链、非离子主链和非核糖主链的那些。还包括含一种或多种碳环糖的多核苷酸。
其它类似物包括其为肽多核苷酸类似物的肽多核苷酸。这些主链在中性条件下是基本上非离子的,与天然存在的多核苷酸的高度荷电的磷酸二酯主链形成对比。这可导致优点。首先,肽多核苷酸主链可显示出改善的杂交动力学。对于错配碱基对相对于完全匹配的碱基对,肽多核苷酸在解链温度中具有更大变化。对于内部错配,DNA和RNA通常显示出在解链温度中的2-4℃下降。在非离子肽多核苷酸主链的情况下,下降接近于7-9℃。类似地,由于其非离子性质,附着至这些主链的碱基的杂交对盐浓度相对不敏感。另外,肽多核苷酸可不被细胞酶降解或被细胞酶降解至更少程度,并且因此可以是更稳定的。
在所公开的多核苷酸及其片段的用途中有片段作为核酸杂交测定中的探针或用于核酸扩增测定的引物的用途。此类片段一般包含DNA序列的至少约10、11、12、13、14、15、16、17、18、19或20个或更多个邻接核苷酸。在其它实施方式中,DNA片段包含DNA序列的至少约10、15、20、30、40、50或60个或更多个邻接核苷酸。因此,在一个方面中,还提供一种用于检测编码具有烟碱N-去甲基酶活性成员的蛋白质或编码烟碱N-去甲基酶酶类的多核苷酸的方法,所述方法包括使用探针或引物或两者。
影响杂交条件选择的基本参数和设计合适条件的指导由Sambrook,J.,E.F.Fritsch和T.Maniatis(1989,Molecular Cloning:A Laboratory Manual,ColdSpring Harbor Laboratory Press,Cold Spring Harbor,N.Y.)描述。使用遗传密码的知识与本文描述的氨基酸序列组合,可制备简并寡核苷酸组。此类寡核苷酸可用作例如聚合酶链反应(PCR)中的引物,由此分离且扩增DNA片段。在某些实施方式中,简并引物可用作遗传文库的探针。此类文库包括但不限于cDNA文库、基因组文库、以及甚至电子表达序列标签或DNA文库。通过这种方法鉴定的同源序列随后用作探针,以鉴定本文鉴定的序列的同源物。
另外的潜在用途是多核苷酸和寡核苷酸(例如引物或探针),其在降低的严格条件下(通常为中等严格条件,且通常为高度严格条件下)与如本文描述的一种或多种多核苷酸杂交。影响杂交条件选择的基本参数和设计合适条件的指导由Sambrook,J.,E.F.Fritsch和T.Maniatis(1989,Molecular Cloning:A Laboratory Manual,Cold Spring HarborLaboratory Press,Cold Spring Harbor,N.Y.阐述,并且可基于例如多核苷酸的长度或碱基组成,由本领域普通技术人员容易地确定。一种实现中等严格条件的方法涉及使用预洗涤溶液(所述预洗涤溶液含有5×标准柠檬酸钠、0.5%十二烷基硫酸钠、1.0mM乙二胺四乙酸(pH 8.0)),具有约50%甲酰胺、6×标准柠檬酸钠的杂交缓冲液,和约55℃的杂交温度(或其它相似的杂交溶液,例如含有约50%甲酰胺的杂交溶液,伴随约42℃的杂交温度),以及在0.5×标准柠檬酸钠、0.1%十二烷基硫酸钠中,约60℃的洗涤条件。一般地,高度严格条件定义为如上的杂交条件,但使用在大约68℃下的洗涤、0.2×标准柠檬酸钠、0.1%十二烷基硫酸钠。SSPE(1×SSPE是0.15M氯化钠、10mM磷酸钠和1.25mM乙二胺四乙酸,pH 7.4)可替代杂交和洗涤缓冲液中的标准柠檬酸钠(1×标准柠檬酸钠是0.15M氯化钠和15mM柠檬酸钠);洗涤在杂交完成后执行15分钟。应当理解通过应用控制杂交反应和双链体稳定性的基本原则,洗涤温度和洗涤盐浓度可根据需要进行调整,以实现所需严格性程度,如本领域技术人员已知的和下文进一步描述的(参见例如,Sambrook,J.,E.F.Fritsch和T.Maniatis(1989,Molecular Cloning:A Laboratory Manual,Cold Spring Harbor LaboratoryPress,Cold Spring Harbor,N.Y)。当使多核苷酸与未知序列的靶多核苷酸杂交时,杂交物长度假定为杂交多核苷酸的那种。当杂交已知序列的多核苷酸时,可通过比对多核苷酸的序列且鉴定一个或多个最佳序列互补性区域,来确定杂交物长度。长度预期为小于50个碱基对的杂交物的杂交温度应比杂交物的解链温度低5至10℃,其中解链温度根据下述等式进行确定。对于长度小于18个碱基对的杂交物,解链温度(℃)=2(A+T碱基数目)+4(G+C碱基数目)。对于长度在18个碱基对以上的杂交物,解链温度(℃)=81.5+16.6(log10[Na+])+0.41(%G+C)-(600/N),其中N是杂交物中的碱基数目,并且[Na+]是杂交缓冲液中的钠离子浓度(1×标准柠檬酸钠的[Na+]=0.165M)。通常,每种此类杂交多核苷酸具有它与之杂交的多核苷酸的长度的至少25%(通常至少50%、60%或70%,且最通常至少80%)长度,并且和它与之杂交的多核苷酸具有至少60%序列同一性(例如至少70%、75%、80%、85%、90%、95%、96%、97%、98%、99%或100%)。
如本领域技术人员应当理解的,线性DNA具有两个可能定向:5'-至-3'方向和3'-至-5'方向。例如,如果参考序列以5'-至-3'方向放置,并且如果第二序列以5'-至-3'方向置于相同多核苷酸分子/链中,则参考序列和第二序列以相同方向定向,或具有相同定向。通常,启动子序列和处于给定启动子调节下的目的基因以相同定向放置。然而,就以5'-至-3'方向放置的参考序列而言,如果第二序列以3'-至-5'方向置于相同多核苷酸分子/链中,则参考序列和第二序列以反义方向定向,或具有反义定向。如果参考序列(5'-至-3'方向)和参考序列的反向互补序列(以5'-至-3'放置的参考序列)置于相同多核苷酸分子/链中,则就彼此而言具有反义定向的两个序列可替代地可描述为具有相同定向。本文所示的序列以5'-至-3'方向显示。
本文提供的重组构建体可用于转化植物或植物细胞,以便调节蛋白质表达及/或活性水平。重组多核苷酸构建体可包括编码如本文描述的一种或多种多核苷酸的多核苷酸,所述一种或多种多核苷酸可操作地连接至适合表达多肽的调节区。因此,多核苷酸可包含编码如本文描述的多肽的编码序列。蛋白质表达及/或活性水平经调节的植物或植物细胞可包括突变型、非天然存在的、转基因的、人造的或经遗传改造的植物或植物细胞。适当地,转基因植物或植物细胞包括已通过重组DNA的稳定整合而改变的基因组。重组DNA包括已在细胞外部经遗传改造且构建的DNA,并且包括含有天然存在的DNA或cDNA或合成DNA的DNA。转基因植物可包括由最初转化的植物细胞再生的植物,以及来自经转化的植物的以后世代或杂交的后代转基因植物。适当地,与对照植物相比较,转基因修饰改变本文描述的多核苷酸或多肽的表达或活性。
由重组多核苷酸编码的多肽可以是天然多肽,或对于细胞可以是异源的。在一些情况下,重组构建体含有可操作地连接至调节区,调节表达的多核苷酸。合适调节区的例子在本文中描述。
本发明还提供了含有重组多核苷酸构建体的载体,例如本文描述的那些。合适的载体主链包括例如本领域常规使用的那些,例如质粒、病毒、人工染色体、细菌人工染色体、酵母人工染色体、或噬菌体人工染色体。合适的表达载体包括但不限于源自例如细菌噬菌体、杆状病毒和逆转录病毒的质粒和病毒载体。众多载体和表达系统是商购可得的。载体可包含(例如)复制起点、支架附着区或标记。标记基因可对植物细胞赋予可选择表型。例如,标记可赋予杀生物剂抗性,例如对抗生素(例如卡那霉素、G418、博来霉素或潮霉素)、或者除草剂(例如草甘膦、氯磺隆或草胺膦)的抗性。另外,表达载体可包括设计为促进所表达多肽的操纵或检测(例如纯化或定位)的标签序列。标签序列例如萤光素酶、β-葡糖醛酸酶、绿色荧光蛋白、谷胱甘肽S-转移酶、聚组氨酸、c-myc或血凝素序列,通常作为具有所编码多肽的融合物表达。此类标签可插入多肽内的任何地方,包括在羧基末端或氨基末端处。
植物或植物细胞可通过具有重组多核苷酸整合到其基因组进行转化,以变得稳定转化。本文描述的植物或植物细胞可以是稳定转化的。稳定转化的细胞通常伴随每次细胞分裂保留所引入的多核苷酸。植物或植物细胞可进行瞬时转化,使得重组多核苷酸不整合到其基因组内。瞬时转化的细胞通常伴随每次细胞分裂丧失所引入的重组多核苷酸的全部或一部分,使得在足够数目的细胞分裂后,所引入的重组多核苷酸无法在子细胞中检测到。
用于转化植物细胞的许多方法是本领域可获得的,所述方法全部在本文中涵盖,包括生物射弹、基因枪技术、土壤杆菌属(Agrobacterium)介导的转化、病毒载体介导的转化和电穿孔。用于将外源DNA整合到植物染色体内的土壤杆菌属系统已就植物遗传改造进行广泛研究、修饰且探究。包含对应于主题纯化的烟草蛋白质的DNA序列的裸露重组DNA分子,通过常规方法连接至适当的T-DNA序列,所述DNA序列以有义或反义定向可操作地连接至调节序列。通过聚乙二醇技术或电穿孔技术,将这些引入烟草原生质体内,所述两种技术均为标准的。作为另外一种选择,将此类包含重组DNA分子的载体引入活土壤杆菌属细胞内,所述重组DNA分子编码主题纯化的烟草蛋白质,所述活土壤杆菌属细胞随后将DNA转移到烟草植物细胞内。通过裸露DNA而无伴随T-DNA载体序列的转化可经由烟草原生质体与含DNA脂质体的融合或经由电穿孔来完成。不伴随T-DNA载体序列的裸露DNA也可用于经由惰性、高速度微弹转化烟草细胞。
如果细胞或培养的组织用作转化的受体组织,则需要时,通过本领域技术人员已知的技术,可由经转化的培养物再生植物。
待包括在重组构建体中的调节区的选择取决于几个因素,包括但不限于效率、可选择性、可诱导性、所需表达水平、和细胞或组织优先表达。通过适当选择调节区且相对于编码序列放置调节区,调节编码序列的表达对于本领域技术人员是常规工作。多核苷酸的转录可以相似方式进行调节。一些合适的调节区仅或占优势地在某些细胞类型中起始转录。用于鉴定且表征植物基因组DNA中的调节区的方法是本领域已知的。
合适的启动子包括由组织特异性因子识别的组织特异性启动子,所述组织特异性启动子存在于不同组织或细胞类型中(例如根特异性启动子、枝条特异性启动子、木质部特异性启动子),或存在于不同发育阶段期间,或响应不同环境条件存在。合适的启动子包括组成型启动子,其可在大多数细胞类型中活化,而无需特异性诱导剂。用于控制RNAi多肽生产的合适启动子的例子包括花椰菜花叶病毒35S(CaMV/35S)、SSU、OCS、lib4、usp、STLS1、B33、nos或遍在蛋白或菜豆球蛋白启动子。本领域技术人员能够生成多种变化的重组启动子。
组织特异性启动子是仅在植物发育期间的特异性时间,在特定细胞或组织中(例如在营养组织或生殖组织中)活跃的转录控制元件。例如,当多核苷酸在某些组织中的表达是优选的时,组织特异性表达可以是有利的。在发育控制下的组织特异性启动子的例子包括可仅(或主要仅)在某些组织中起始转录的启动子,所述某些组织例如营养组织,例如根或叶,或者生殖组织,例如果实、胚珠、种子、花粉、雌蕊(pistol)、花或任何胚组织。生殖组织特异性启动子可以是例如花药特异性、胚珠特异性、胚特异性、胚乳特异性、珠被特异性、种子和种皮特异性、花粉特异性、花瓣特异性、萼片特异性或其组合。
合适的叶特异性启动子包括来自C4植物(玉蜀黍)的丙酮酸正磷酸双激酶(PPDK)启动子、来自玉蜀黍的cab-m1Ca+2启动子、拟南芥(Arabidopsis thaliana)myb相关基因启动子(Atmyb5)、二磷酸核酮糖羧化酶(RBCS)启动子(例如,在叶和光生长幼苗中表达的番茄RBCS 1、RBCS2和RBCS3A基因,在发育中的番茄果实中表达的RBCS1和RBCS2,或几乎专一地以高水平在叶片和叶鞘的叶肉细胞中表达的二磷酸核酮糖羧化酶启动子)。
合适的衰老特异性启动子包含在果实催熟、叶枯萎和脱落期间活跃的番茄启动子、编码半胱氨酸蛋白酶的基因的玉蜀黍启动子、82E4的启动子和SAG基因的启动子。可使用合适的花药特异性启动子。可选择本领域技术人员已知的合适的根优先启动子。合适的种子优先的启动子包括种子特异性启动子(在种子发育期间活跃的那些启动子,例如种子贮藏蛋白质的启动子)和种子发芽启动子(在种子发芽期间活跃的那些启动子)。此类种子优先的启动子包括但不限于Cim1(细胞分裂素诱导的信息);cZ19B1(玉蜀黍19kDa玉米醇溶蛋白);milps(肌-肌醇-1-磷酸合成酶);mZE40-2,也称为Zm-40;nuclc;和celA(纤维素合成酶)。γ-玉米醇溶蛋白是胚乳特异性启动子。Glob-1是胚特异性启动子。对于双子叶植物,种子特异性启动子包括但不限于豆β菜豆球蛋白、油菜籽蛋白、ββ-伴大豆球蛋白、大豆凝集素、十字花科蛋白(cruciferin)等等。对于单子叶植物,种子特异性启动子包括但不限于玉蜀黍15kDa玉米醇溶蛋白启动子、22kDa玉米醇溶蛋白启动子、27kDa玉米醇溶蛋白启动子、g-玉米醇溶蛋白启动子、27kDaγ-玉米醇溶蛋白启动子(例如gzw64A启动子,参见Genbank登记号S78780)、蜡质启动子、shrunken 1启动子、shrunken 2启动子、球蛋白1启动子(参见Genbank登记号L22344)、Itp2启动子、cim1启动子、玉蜀黍end1和end2启动子、nuc1启动子、Zm40启动子、eep1和eep2;lec1、硫氧还蛋白H启动子;mlip15启动子、PCNA2启动子;和shrunken-2启动子。
诱导型启动子的例子包括响应病原体攻击、厌氧条件、温度升高、光、干旱、寒冷温度或高盐浓度的启动子。病原体诱导型启动子包括来自发病机制相关蛋白质(PR蛋白质)的那些,所述发病机制相关蛋白质在通过病原体感染后诱导(例如PR蛋白质、SAR蛋白质、β-1,3-葡聚糖酶、壳多糖酶)。
除植物启动子之外,其它合适的启动子可来源于细菌源(例如,章鱼碱合成酶启动子、胭脂碱合成酶启动子及来源于Ti质粒的其它启动子),或可来源于病毒启动子(例如,花椰菜花叶病毒(CaMV)的35S和19S RNA启动子、烟草花叶病毒的组成型启动子、花椰菜花叶病毒(CaMV)19S和35S启动子、或玄参花叶病毒35S启动子)。
优选的启动子包括本文提供的控制元件(作为SEQ ID No.1-80的部分),其在烘烤程序期间在烟草叶中展示出所需表达。
在另一方面中,提供一种分离的多肽,其包括以下成分、由以下成分组成或基本上由以下成分组成:与本文描述的多肽序列中的任一者,包含序列表中所展示的多肽中的任一者,具有至少95%序列同一性的多肽序列。适当地,所述分离的多肽包括以下成分、由以下成分组成或基本上由以下成分组成:与其具有至少95%、96%、97%、98%、99%、99.1%、99.2%、99.3%、99.4%、99.5%、99.6%、99.7%、99.8%、99.9%或100%序列同一性的序列。
多肽可包含与SEQ ID NO:81-160具有足够程度或很大程度的同一性或相似性的序列以充当蛋白酶。多肽片段通常保持全长序列的一些或全部活性。
如本文所讨论,多肽还包括通过引入任何类型的改变(例如,氨基酸的插入、缺失或置换;糖基化状态的变化;影响重折叠或异构化、三维结构或自缔合状态的变化)而产生的突变体,所述突变体可以是有意改造或天然分离的,其限制条件是它们仍具有其作为蛋白酶的一些或全部功能或活性。适当地,调节、提高或降低作为蛋白酶的功能或活性。
多肽包括通过引入任何类型改变(例如,氨基酸的插入、缺失或置换;糖基化状态的变化;影响重折叠或异构化、三维结构或自缔合状态的变化)而产生的变体,所述变体可以是有意改造或天然分离的。变体可具有产生沉默变化且导致功能等价蛋白质的改变。有意的氨基酸置换可在残基的极性、电荷、溶解性、疏水性、亲水性和两亲性质中的相似性基础上作出,只要物质的二级结合活性被保留。例如,带负电的氨基酸包括天冬氨酸和谷氨酸;带正电的氨基酸包括赖氨酸和精氨酸;并且具有相似亲水性值含不带电极性首基的氨基酸包括亮氨酸、异亮氨酸、缬氨酸、甘氨酸、丙氨酸、天冬酰胺、谷氨酰胺、丝氨酸、苏氨酸、苯丙氨酸和酪氨酸。保守置换可例如根据下表进行。第二列中的相同块和优选第三列中的相同行中的氨基酸可彼此置换:
多肽可以是成熟蛋白质或不成熟蛋白质或源自不成熟蛋白质的蛋白质。多肽可采取线性形式或使用已知方法环化。多肽通常包含至少10、至少20、至少30或至少40个邻接氨基酸。
公开了包括如本文所描述编码蛋白酶的基因中的突变的烟草植物或植物细胞,其中所述突变使得所述蛋白酶的表达经调节或功能经调节。可增强蛋白酶的表达或功能。除了所述蛋白酶中的一个或多个突变外,所述突变型植物或植物细胞可在一个或多个其它基因或多肽中具有一个或多个其它突变。在某些实施方式中,除了蛋白酶基因中的一个或多个突变外,所述突变体可在一个或多个其它基因或多肽(例如,如序列表中所述的一个或多个其它蛋白酶基因或多肽)中具有一个或多个其它突变。适当地,蛋白酶在烘烤程序期间在突变型植物的叶子中表达。
还提供一种调节(烘烤的)烟草植物或(烘烤的)烟草植物材料中的蛋白酶水平的方法,所述方法包括将一个或多个调节至少一种蛋白酶基因的表达的突变引入所述植物的基因组中,其中所述至少一种蛋白酶基因选自SEQ ID No:1-80。
还提供了一种用于鉴定具有增加水平的蛋白酶的烟草植物的方法,所述方法包括针对SEQ ID NO:1-80中一个或多个突变的存在筛选来自目的烟草植物的核酸样品,且任选地使所鉴定突变与已知用以调节蛋白酶水平的突变相互关联。
还公开一种烟草植物或植物细胞,其对于编码蛋白酶的基因中的突变为杂合或纯合的,其中所述突变引起所述蛋白酶的表达或功能的调节(增强或减弱)。
大量方法可用于组合一种植物中的突变,包含有性杂交。在增强或减弱蛋白酶表达或活性的蛋白酶基因中具有一个或多个有利的杂合或纯合突变的植物可与在一个或多个增强或减弱蛋白酶活性的其它蛋白酶基因中具有一个或多个有利的杂合或纯合突变的植物杂交。在一个实施方式中,进行杂交以便引入相同植物内的蛋白酶基因内的一个或多个有利杂合或纯合突变。
如果蛋白酶活性在统计学上低于或高于烟草植物(其尚未经修饰以抑制所述蛋白酶多肽的活性且其使用同一方案培养、收获和烘烤)中的同一蛋白酶的蛋白酶活性时,那么烟草植物中的一种或多种蛋白酶多肽的活性根据本发明减弱或增强。
在一些实施方式中,使用诱变法将突变引入烟草植物或植物细胞中,且使用本领域技术人员已知的方法(诸如Southern印迹分析法、DNA排序、PCR分析法或表型分析法)鉴定或选择所引入的突变。可使用本领域熟知的方法来确定影响基因表达或干扰所编码的蛋白质的功能的突变。基因外显子中的插入突变通常导致空突变。经转化残基中的突变可在抑制所编码的蛋白质的代谢功能中特别有效。
还公开了用于获得突变型多核苷酸和多肽的方法。任何目的植物包括植物细胞或植物材料,可通过多种已知诱导诱变的方法进行遗传修饰,所述方法包括定点诱变、寡核苷酸指导的诱变、化学诱导的诱变、辐射诱导的诱变、利用经修饰的碱基的诱变、利用缺口双链体DNA的诱变、双链断裂诱变、利用修复缺陷型宿主株的诱变、通过全基因合成的诱变、DNA改组及其它等价方法。
还公开了由此编码的蛋白酶多核苷酸或多肽的片段。多核苷酸的片段可编码保留天然蛋白的生物活性的蛋白质片段且因此涉及烟碱向降烟碱的代谢转化。或者,适用作杂交探针或PCR引物的多核苷酸的片段通常不编码保留生物活性的片段蛋白质。此外,所公开的核苷酸序列的片段包含可在本文所描述的重组构建体内组装的那些。多核苷酸序列的片段的范围可介于至少约25个核苷酸、约50个核苷酸、约75个核苷酸、约100个核苷酸、约150个核苷酸、约200个核苷酸、约250个核苷酸、约300个核苷酸、约400个核苷酸、约500个核苷酸、约600个核苷酸、约700个核苷酸、约800个核苷酸、约900个核苷酸、约1000个核苷酸、约1100个核苷酸、约1200个核苷酸、约1300个核苷酸或约1400个核苷酸且至多达到本文所描述的编码多肽的全长多核苷酸。多肽序列的片段的范围可介于至少约25个氨基酸、约50个氨基酸、约75个氨基酸、约100个氨基酸、约150个氨基酸、约200个氨基酸、约250个氨基酸、约300个氨基酸、约400个氨基酸、约500个氨基酸,且至多达到本文所描述的全长多肽。
突变型多肽变体可用于制备包括一种或多种突变型多肽变体的突变型、非天然存在的或转基因植物(例如,突变型、非天然存在的、转基因、人造或遗传改造的植物)。适当地,突变型多肽变体保留未突变多肽的活性。与未突变的多肽相比较,突变型多肽变体的活性可更高、更低或大约相同。
本文描述的核苷酸序列和多肽中的突变可包括人造突变或合成突变或遗传改造的突变。本文描述的核苷酸序列和多肽中的突变可以是经由过程获得或可获得的突变,所述过程包括体外或体内操纵步骤。本文描述的核苷酸序列和多肽中的突变可以是经由包括人为干预的过程获得或可获得的突变。作为实例,所述过程可包含使用外源添加的化学品的诱变,诸如致突变、致畸形或致癌有机化合物,例如甲磺酸乙酯(EMS),所述化学品在遗传物质中产生随机突变。作为进一步例子,该过程可包括一个或多个遗传改造步骤-例如本文描述的遗传改造步骤中的一个或多个或其组合。作为进一步例子,该过程可包括一个或多个植物杂交步骤。
多肽可通过在适合表达多肽的培养条件下培养经转化的或重组宿主细胞进行制备。所得到的表达多肽随后可使用已知的纯化过程由此类培养物进行纯化。多肽的纯化可包括含有与多肽结合的试剂的亲和柱;一个或多个在此类亲和树脂上的柱步骤;一个或多个涉及疏水作用层析的步骤;或免疫亲和层析。作为另外一种选择,多肽还可以促进纯化的形式表达。举例来说,它可经表达为融合多肽,例如麦芽糖结合多肽、谷胱甘肽-5-转移酶、组氨酸标签或硫氧还蛋白的那些。用于融合多肽的表达和纯化的试剂盒是商购可得的。多肽可用表位加上标签,并且随后通过使用针对此类表位的特异性抗体纯化。一个或多个液相层析步骤-例如反相高效液相层析可用于进一步纯化多肽。以多个组合的前述纯化步骤中的一些或全部可用于提供基本上同质的重组多肽。因此纯化的多肽可基本上不含其它多肽,并且在本文中定义为“基本上纯化的多肽”;此类纯化的多肽包括多肽、片段、变体等等。多肽和片段的表达、分离和纯化可通过任何合适技术实现,所述合适技术包括但不限于本文描述的方法。
还能够利用亲和柱例如针对多肽生成的单克隆抗体,以亲和纯化所表达的多肽。这些多肽可使用常规技术,例如在高盐洗脱缓冲液中,从亲和柱中取出,且随后透析到更低盐的缓冲液内用于使用,或取决于利用的亲和基质,通过改变pH或其它组分,或使用天然存在的亲和部分的底物竞争性取出。
公开了分离的或基本上纯化的多核苷酸或蛋白质组合物。“分离的”或“纯化的”多核苷酸或蛋白质或其生物学上的活性部分大体上或基本上不含通常伴随多核苷酸或蛋白质或者与其相互作用(如在其自然存在的环境中发现的那样)的组分。因此,分离的或纯化的多核苷酸或蛋白质在通过重组技术产生时,其大体上不含其它细胞物质或培养介质,或当以化学方式合成时,其大体上不含化学前体或其它化学品。最佳地,“分离的”多核苷酸不含天然地侧接于所述多核苷酸的来源生物体的基因组DNA中的多核苷酸的序列(最佳地,蛋白质编码序列)(例如,位于多核苷酸的5'和3'末端的序列)。举例来说,在各种实施方式中,分离的多核苷酸可含有少于约5kb、4kb、3kb、2kb、1kb、0.5kb或0.1kb的核苷酸序列,所述核苷酸序列天然地侧接于所述多核苷酸的来源细胞的基因组DNA中的多核苷酸。大体上不含细胞物质的蛋白质包含具有小于约30%、20%、10%、5%或1%(以干重计)受污染蛋白质的蛋白质制备物。
多肽还可通过已知的常规化学合成来产生。通过合成方法用于构建多肽或其片段的方法是本领域技术人员已知的。由于与天然多肽共享一级、二级或三级结构或构象特征,合成构建的多肽序列可具有与之共同的生物特性,包括生物活性。
如本文使用的,术语‘非天然存在的’描述并非天然形成或在自然界中不存在的实体(例如多核苷酸、基因突变、多肽、植物、植物细胞和植物材料)。可通过本文描述或本领域已知的方法,制备、合成、起始、修饰、干预或操纵此类非天然存在的实体或人工实体。可由人制备、合成、起始、修饰、干预或操纵此类非天然存在的实体或人工实体。因此,例如,非天然存在的植物、非天然存在的植物细胞或非天然存在的植物材料,可使用传统植物育种技术(例如回交)或通过遗传操纵技术(例如反义RNA、干扰RNA、大范围核酸酶等等)进行制备。作为进一步例子,可通过第一植物或植物细胞基因渗入第二植物或植物细胞(其自身可为天然存在的)内,或通过将一个或多个遗传突变(例如一种或多种多态性)从第一植物或植物细胞转移到第二植物或植物细胞内,来制备非天然存在的植物、非天然存在的植物细胞或非天然存在的植物材料,使得所得到的植物、植物细胞或植物材料或其后代包含并非天然形成或在自然界中不存在的遗传组成(例如基因组、染色体或其区段)。所得到的植物、植物细胞或植物材料因此是人工的或非天然存在的。相应地,可通过修饰第一天然存在的植物或植物细胞中的遗传序列,来制备人工的或非天然存在的植物或植物细胞,即使所得到的遗传序列在第二植物或植物细胞中天然存在,所述第二植物或植物细胞包含与第一植物或植物细胞不同的遗传背景。在某些实施方式中,突变是非天然存在的突变,其天然存在于核苷酸序列或多肽(例如基因或蛋白质)中。
遗传背景中的差异可通过表型差异或本领域已知的分子生物学技术进行检测,所述分子生物学技术例如核酸测序、遗传标记(例如微卫星RNA标记)的存在或不存在。
还提供了与本文描述的多肽免疫反应的抗体。如本文描述的,多肽、片段、变体、融合多肽等等可用作生产与之免疫反应的抗体中的“免疫原”。此类抗体可经由抗体的抗原结合位点与多肽特异性结合。特异性结合抗体是特异性鉴定且结合多肽、同源物和变体,而不是其它分子的那些。在一个实施方式中,抗体对于具有与如本文所示的氨基酸序列的多肽是特异性的,并且不与其它多肽交叉反应。
更具体而言,多肽、片段、变体、融合多肽等等含有引发抗体形成的抗原决定簇或表位。这些抗原决定簇或表位可以是线性的或构象的(不连续的)。线性表位由多肽的单个氨基酸区段组成,而构象或不连续表位由来自多肽链的不同区域的氨基酸区段组成,所述氨基酸区段在多肽折叠后达到紧密接近。表位可通过本领域已知的方法中的任一种进行鉴定。另外,来自多肽的表位可用作测定中的研究试剂,且纯化来自物质例如多克隆血清或来自培养杂交瘤的上清液的特异性结合抗体。此类表位或其变体可使用本领域已知的技术或使用重组DNA技术产生,所述本领域已知的技术例如固相合成、多肽的化学或酶促切割。
针对多肽的多克隆抗体和单克隆抗体两者均可通过常规技术进行制备。本文还考虑了产生对于多肽特异性的单克隆抗体的杂交瘤细胞系。此类杂交瘤可通过常规技术进行生产且鉴定。对于抗体生产,多种宿主动物可通过用多肽、其片段、变体或突变体注射进行免疫接种。仅举几个例子,此类宿主动物可包括但不限于兔、小鼠和大鼠。多种佐剂可用于增加免疫应答。取决于宿主物种,此类佐剂包括但不限于弗氏(完全和不完全)、矿物凝胶如氢氧化铝,表面活性物质如溶血卵磷脂、普鲁兰尼克多元醇、聚阴离子、肽、油乳剂、钥孔血蓝蛋白、二硝基苯酚,以及潜在有用的人佐剂如BCG(卡介苗)和短小棒状杆菌(Corynebacterium parvum)。单克隆抗体可通过常规技术进行回收。此类单克隆抗体可具有任何免疫球蛋白类别,包括IgG、IgM、IgE、IgA、IgD,及其任何亚类。
抗体还可用于测定中,以在体外或体内检测多肽或片段的存在。抗体还可用于通过免疫亲和层析纯化多肽或片段。
除诱变以外,可调节本文描述的一种或多种蛋白酶的表达或活性的组合物包括但不限于:可干扰一种或多种内源基因的转录的序列特异性多核苷酸;可干扰RNA转录物翻译的序列特异性多核苷酸(例如双链RNA、siRNA、核酶);可干扰一种或多种蛋白质的稳定性的序列特异性多肽;可干扰一种或多种蛋白质的酶促活性或者一种或多种蛋白质就底物或调节蛋白质而言的结合活性的序列特异性多核苷酸;显示出对于一种或多种蛋白质的特异性的抗体;可干扰一种或多种蛋白质的稳定性、或者一种或多种蛋白质的酶促活性、或者一种或多种蛋白质的结合活性的小分子化合物;结合一种或多种多核苷酸的锌指蛋白;以及具有针对一种或多种多核苷酸的活性的大范围核酸酶。基因编辑技术、遗传编辑技术和基因组编辑技术是本领域众所周知的。
一种基因编辑方法涉及转录激活因子样效应物核酸酶(transcriptionactivator-like effector nuclease)(TALEN)的使用,所述TALEN诱导细胞可以修复机制响应的双链断裂。非同源末端连接使来自双链断裂任一侧的DNA再连接,其中存在很少的序列重叠或无序列重叠用于退火。该修复机制经由插入或缺失、或染色体重排诱导基因组中的误差。任何此类误差可致使在该位置处编码的基因产物无功能。另一种基因编辑方法涉及细菌CRISPR/Cas系统的使用。细菌和古细菌显示出称为规律成簇间隔短回文重复(clustered regularly interspaced short palindromic repeat)(CRISPR)的染色体元件,所述CRISPR是适应性免疫系统的一部分,所述适应性免疫系统保护不受侵袭性病毒和质粒DNA。在II型CRISPR系统中,CRISPR RNA(crRNA)与反式激活crRNA(tracrRNA)和CRISPR相关(Cas)蛋白质起作用,以在靶DNA中引入双链断裂。通过Cas9的靶切割要求在crRNA和tracrRNA之间的碱基配对,以及在crRNA和靶DNA之间的碱基配对。靶鉴定通过称为原型间隔序列毗邻基序(protospacer-adjacent motif)(PAM)的短基序的存在得到促进,所述PAM符合序列NGG。该系统可用于基因组编辑。Cas9通常通过双重RNA按程序工作,所述双重RNA由crRNA和tracrRNA组成。然而,这些RNA的核心组分可组合成单一杂交物‘引导RNA’用于Cas9靶向。对靶DNA使用非编码RNA引导用于位点特异性切割有希望比现有技术(例如TALEN)明显更直截了当。使用CRISPR/Cas策略,再靶向核酸酶复合物仅要求引入新RNA序列,并且不需要再改造蛋白质转录因子的特异性。
反义技术是可用于调节多肽表达的另一熟知方法。将待阻遏的基因的多核苷酸克隆且可操作地连接至调节区和转录终止序列,使得RNA的反义链被转录。重组构建体随后转化到植物细胞内,并且产生RNA的反义链。多核苷酸无需是待阻遏基因的整个序列,但通常与待阻遏基因的有义链的至少一部分基本上互补。
多核苷酸可转录成核酶,或催化RNA,其影响mRNA的表达。核酶可设计为与基本上任何靶RNA特异性配对,且切割在特定位置处的磷酸二酯主链,由此使靶RNA功能失活。异源多核苷酸可编码设计为切割特定mRNA转录物的核酶,从而阻止多肽的表达。锤头状核酶可用于破坏特定mRNA,尽管可使用切割在位点特异性鉴定序列处的mRNA的多种核酶。锤头状核酶在由侧翼区指示的位置处切割mRNA,所述侧翼区与靶mRNA形成互补碱基对。唯一要求是靶RNA含有5'-UG-3'核苷酸序列。锤头状核酶的构建和产生是本领域已知的。锤头状核酶序列可嵌入稳定RNA例如转移RNA(tRNA)内,以增加体内切割效率。
在一个实施方式中,可干扰一种或多种RNA转录物翻译的序列特异性多核苷酸是干扰RNA。RNA干扰或RNA沉默是进化上保守的过程,特异性mRNA通过其可被靶向用于酶促降解。一种双链RNA(双链RNA)通过细胞(例如双链RNA病毒、或干扰RNA多核苷酸)引入或产生,以起始干扰RNA途径。双链RNA可通过核糖核酸酶III而转换成长度为21bp至24bp的多重小干扰RNA双链体,所述核糖核酸酶III是双链RNA特异性核酸内切酶。小干扰RNA随后可由RNA诱导沉默复合物鉴定,所述RNA诱导沉默复合物促进通过ATP依赖性过程的小干扰RNA的解旋。小干扰RNA的解旋的反义链将活化RNA诱导沉默复合物导向靶向mRNA,所述靶向mRNA包含与小干扰RNA反义链互补的序列。靶向mRNA和反义链可形成A形螺旋,并且A形螺旋的大沟可由活化RNA诱导沉默复合物鉴定。可在由小干扰RNA链的5'末端的结合位点限定的单个位点处,通过活化RNA诱导沉默复合物切割靶mRNA。活化RNA诱导沉默复合物可再循环,以催化另一切割事件。
干扰RNA表达载体可包含编码干扰RNA多核苷酸的干扰RNA构建体,其通过降低mRNA、mRNA前体或相关RNA变体的表达水平,显示出RNA干扰活性。表达载体可包含置于干扰RNA构建体上游,且可操作地连接至干扰RNA构建体的启动子,如本文进一步描述的。干扰RNA表达载体可包含合适的最小核心启动子、目的干扰RNA构建体、上游(5')调节区、下游(3')调节区,包括转录终止和多腺苷酸化信号,以及其它本领域技术人员已知的序列,例如多种选择标记。
多个实施方式涉及通过将一种或多种多核苷酸的多个拷贝整合到(烟草)植物基因组内,用于调节本文描述的多核苷酸中的一种或多种(或如本文描述的其任何组合)的表达水平的方法,其包括:用表达载体转化植物细胞宿主,所述表达载体包含可操作地连接至多核苷酸的启动子。
提供了通过调节mRNA的翻译用于调节内源基因表达水平的多种组合物和方法。宿主(烟草)植物细胞可用表达载体转化,所述表达载体包括:可操作地连接至多核苷酸的启动子,所述多核苷酸以就启动子而言的反义定向放置,以允许与mRNA的一部分具有序列互补性的RNA多核苷酸的表达。
用于调节mRNA翻译的多种表达载体可包含:可操作地连接至多核苷酸的启动子,其中序列以就启动子而言的反义定向放置。反义RNA多核苷酸的长度可改变,并且可为约15-20个核苷酸、约20-30个核苷酸、约30-50个核苷酸、约50-75个核苷酸、约75-100个核苷酸、约100-150个核苷酸、约150-200个核苷酸、以及约200-300个核苷酸。
如本文所讨论,可通过非转基因方法来调节一种或多种多肽的表达,例如如本所讨论的在一个或多个基因中产生一个或多个突变。在基因序列中随机引入突变的方法可包括化学诱变、EMS诱变和辐射诱变。将一个或多个靶向突变引入细胞内的方法包含(但不限于)基因组编辑技术(确切地说锌指核酸酶介导的诱变和靶向诱导的基因组局部损伤(TILLING))、同源重组、寡核苷酸引导的诱变和大范围核酸酶介导的诱变。在一个实施方式中,使用TILLING。此为可用于生成及/或鉴定编码具有经修改表达及/或活性的多肽的多核苷酸的诱变技术。TILLING还允许选择携带此类突变体的植物。TILLING组合高密度诱变与高流通量筛选方法。用于TILLING的方法是本领域众所周知的(参见McCallum等人,(2000)Nat Biotechnol 18:455-457and Stemple(2004)Nat Rev Genet 5(2):145-50)。
还可制备多核苷酸中的特异性突变,其可导致调节的基因表达、mRNA的稳定性调节、或蛋白质的稳定性调节。此类植物在本文中被称为“非天然存在的”或“突变型”植物。通常,突变型或非天然存在的植物包括外源或合成或人造核酸(例如DNA或RNA)的至少一部分,其在被操纵前不存在于植物中。外源核酸可以是单个核苷酸、两个或更多个核苷酸、两个或更多个邻接核苷酸、或者两个或更多个不邻接核苷酸-例如至少10、20、30、40、50、100、200、300、400、500、600、700、800、900、1000、1100、1200、1300、1400或1500个或更多个邻接或不邻接核苷酸。
突变型或非天然存在的植物或植物细胞可具有一种或多种基因中的一个或多个突变的任何组合,所述一个或多个突变使得蛋白质水平得以调节。举例来说,突变型或非天然存在的植物或植物细胞可具有单个基因中的单个突变;单个基因中的多个突变;两个或多于两个或者三个或多于三个或者四个或多于四个基因中的单个突变;或者两个或多于两个或者三个或多于三个或者四个或多于四个基因中的多个突变。这些突变的实例描述于本文中。作为进一步例子,突变型或非天然存在的植物或植物细胞可具有在基因的特定部分中的一个或多个突变,所述特定部分例如基因中编码蛋白质的活性位点或其一部分的区域中。作为进一步例子,突变型或非天然存在的植物或植物细胞可具有在一个或多个基因外部区域中的一个或多个突变,所述区域例如其调节的基因的上游或下游区域,只要其调节基因的活性或表达。上游元件可包括启动子、增强子或转录因子。一些元件例如增强子可置于它调节的基因的上游或下游。一种或多种元件无需定位接近于它调节的基因,因为一些元件已发现位于它调节的基因上游或下游几百上千个碱基对处。突变型或非天然存在的植物或植物细胞可具有位于基因的前100个核苷酸内、基因的前200个核苷酸内、基因的前300个核苷酸内、基因的前400个核苷酸内、基因的前500个核苷酸内、基因的前600个核苷酸内、基因的前700个核苷酸内、基因的前800个核苷酸内、基因的前900个核苷酸内、基因的前1000个核苷酸内、基因的前1100个核苷酸内、基因的前1200个核苷酸内、基因的前1300个核苷酸内、基因的前1400个核苷酸内、或基因的前1500个核苷酸内的一个或多个突变。突变型或非天然存在的植物或植物细胞可具有位于基因的100个核苷酸的第一、第二、第三、第四、第五、第六、第七、第八、第九、第十、第十一、第十二、第十三、第十四或第十五集合或其组合内的一个或多个突变。公开了包括突变型多肽变体的突变型或非天然存在的植物或植物细胞(例如,如本文描述的突变型、非天然存在的或转基因植物或植物细胞等)。
在一个实施方式中,使来自植物的种子诱变且随后生长成第一代突变型植物。随后允许第一代植物自花授粉,并且使来自第一代植物的种子生长成第二代植物,所述第二代植物随后就其基因座中的突变进行筛选。尽管诱变的植物材料可就突变进行筛选,但筛选第二代植物的优点在于所有体细胞突变均对应于种系突变。本领域技术人员将了解到多种植物材料包括但不限于种子、花粉、植物组织或植物细胞可进行诱变,以便制备突变型植物。然而,当植物核酸就突变进行筛选时,可影响诱变的植物材料类型。例如,当在非诱变植物授粉前对花粉实施诱变时,使起源于授粉的种子生长成第一代植物。第一代植物的每一个细胞将含有在花粉中制备的突变;因此这些第一代植物随后可就突变进行筛选,代替等待直到第二代时。
主要制备点突变和短缺失、插入、颠换和或转换的诱变剂包括化学诱变剂或辐射,可用于制备突变。诱变剂包括但不限于甲磺酸乙酯、甲磺酸甲酯、N-乙基-N-亚硝基脲、三乙基蜜胺、N-甲基-N-亚硝基脲、丙卡巴肼、苯丁酸氮芥、环磷酰胺、硫酸二乙酯、丙烯酰胺单体、美法仑、氮芥、长春新碱、二甲基亚硝胺、N-甲基-N'-硝基-亚硝基胍、亚硝基胍、2-氨基嘌呤、7,12-二甲基苯并蒽、环氧乙烷、六甲基磷酸胺、白消安、二环氧烷烃(二环氧辛烷、二环氧丁烷等等)、2-甲氧基-6-氯-9[3-(乙基-2-氯乙基)氨丙基氨基]吖啶二盐酸盐和甲醛。
还考虑了无法由诱变剂直接引起的基因座中的自发突变,条件是它们导致所需表型。合适的诱变试剂还可包括例如电离辐射,例如X射线、γ射线、快中子照射和UV辐射。本领域技术人员已知的植物核酸制备的任何方法均可用于制备用于突变筛选的植物核酸。
可任选合并由个别植物、植物细胞或植物材料制备的核酸,以便加速源于诱变的植物组织、细胞或材料的植物群体中的突变筛选。可筛选植物、植物细胞或植物材料的一个或多个后续世代。任选合并的组的大小取决于使用的筛选方法的灵敏度。
在核酸样品任选合并后,可对它们实施多核苷酸特异性扩增技术,例如聚合酶链反应。对于基因或与基因紧相邻的序列特异性的任何一个或多个引物或探针可用于扩增任选合并的核酸样品内的序列。适当地,一个或多个引物或探针设计为扩增基因座的区域,在所述区域中有用的突变最可能出现。最优选地,引物设计为检测多核苷酸区域内的突变。另外,优选一个或多个引物和一个或多个探针避免已知的多态性位点,以便容易筛选点突变。为了促进扩增产物的检测,一个或多个引物或探针可使用任何常规标记方法进行标记。使用本领域充分理解的方法,可基于本文描述的序列来设计一个或多个引物或一个或多个探针。
为了促进扩增产物的检测,可使用任何常规标记方法来标记一个或多个引物或一个或多个探针。使用本领域充分理解的方法,可基于本文描述的序列来设计这些。多态性可通过本领域已知的方法进行鉴定,并且一些已在文献中得到描述。
在进一步方面,本发明提供了制备突变型植物的方法。该方法涉及提供包含编码本文所述功能多核苷酸(或如本文描述的其任何组合)的基因的植物的至少一个细胞。接下来,植物的至少一个细胞在有效调节本文所述一种或多种多核苷酸的活性的条件下进行处理。至少一个突变型植物细胞随后繁殖成突变型植物,其中与对照植物的那种相比较,所述突变型植物具有调节水平的一种或多种所述多肽(或如本文描述的其任何组合)。在制备突变型植物的该方法的一个实施方式中,处理步骤涉及使至少一个细胞经受如上所述的化学诱变试剂,及在有效获得至少一个突变型植物细胞的条件下。在该方法的另一个实施方式中,处理步骤涉及在有效获得至少一个突变型植物细胞的条件下,使至少一个细胞经受辐射源。术语“突变型植物”包括这样的突变型植物,与对照植物相比较,所述突变型植物中的基因型适当地通过除了遗传改造或遗传修饰之外的方式进行修饰。
在某些实施方式中,突变型植物、突变型植物细胞或突变型植物材料可包含一个或多个突变,所述一个或多个突变在另一种植物、植物细胞或植物材料中天然存在,且赋予所需性状。该突变可引入(例如基因渗入)另一种植物、植物细胞或植物材料(例如具有与突变源自于其的植物不同的遗传背景的植物、植物细胞或植物材料)内,以对其赋予该性状。因此,例如,在第一植物中天然存在的突变可引入第二植物内,例如与第一植物具有不同遗传背景的第二植物。技术人员因此能够搜索且鉴定在其基因组中天然携带本文所述基因的一种或多种突变等位基因的植物,所述基因赋予所需性状。可通过多种方法包括育种、回交和基因渗入,将天然存在的一种或多种突变等位基因转移至第二植物,以产生在本文所述基因中具有一个或多个突变的品系、品种或杂交物。可在突变型植物的库中筛选显示所需性状的植物。适当地,利用如本文描述的核苷酸序列的知识进行选择。因此,能够与对照相比较筛选遗传性状。此类筛选方法可涉及如本文讨论的常规核酸扩增和/或杂交技术的应用。因此,本发明的进一步方面涉及用于鉴定突变型植物的方法,其包括下述步骤:(a)提供来自植物的包含核酸的样品;和(b)确定多核苷酸的核酸序列,其中与对照植物的多核苷酸序列相比较,在多核苷酸序列中的差异指示所述植物是突变型植物。在另一个方面,提供了用于鉴定突变型植物的方法,与对照植物相比较,所述突变型植物累积增高或降低水平的蛋白酶,所述方法包括下述步骤:(a)提供来自待筛选的植物的样品;(b)确定所述样品是否包括在本文所述的多核苷酸中的一种或多种中的一个或多个突变;和(c)在烘烤程序期间确定所述植物的至少蛋白酶含量。
在另一个方面,提供了用于制备突变型植物的方法,与对照植物相比,所述突变型植物具有提高或降低水平的蛋白酶,所述方法包括下述步骤:(a)提供来自第一植物的样品;(b)确定所述样品是否包括在本文所述的多核苷酸中的一种或多种中的一个或多个突变,所述一个或多个突变导致经调节水平的蛋白酶;和(c)将一个或多个突变转移到第二植物内。适当地,在烘烤的叶材料中测定至少蛋白酶含量。可使用本领域已知的多种方法,例如通过遗传改造、遗传操纵、基因渗入、植物育种、回交等等,将一个或多个突变转移到第二植物内。在一个实施方式中,第一植物是天然存在的植物。在一个实施方式中,第二植物具有与第一植物不同的遗传背景。
在另一个方面,提供了用于制备突变型植物的方法,与对照植物相比,所述突变型植物具有提高或降低水平的蛋白酶,所述方法包括下述步骤:(a)提供来自第一植物的样品;(b)确定所述样品是否包括在本文所述的多核苷酸中的一种或多种中的一个或多个突变,所述一个或多个突变导致经调节水平的蛋白酶;和(c)将来自第一植物的一个或多个突变基因渗入到第二植物内。适当地,在烘烤的叶材料中测定至少蛋白酶含量。在一个实施方式中,基因渗入步骤包括植物育种,任选包括回交等等。在一个实施方式中,第一植物是天然存在的植物。在一个实施方式中,第二植物具有与第一植物不同的遗传背景。在一个实施方式中,第一植物不是栽培变种或优良栽培变种。在一个实施方式中,第二植物是栽培变种或优良栽培变种。进一步方面涉及通过本文所述方法获得或可获得的突变型植物(包括栽培变种或优良栽培变种突变型植物)。在某些实施方式中,“突变型植物”可具有仅定位于植物的特定区域,例如在本文所述的一种或多种多核苷酸的序列内的一个或多个突变。根据该实施方式,突变型植物的剩余基因组序列将与诱变前的植物相同或基本上相同。
在某些实施方式中,突变型植物可具有定位于植物的超过一个区域,例如在本文所述的多核苷酸中的一种或多种的序列内以及基因组的一个或多个进一步区域中的一个或多个突变。根据该实施方式,突变型植物的剩余基因组序列将与诱变前的植物不同或基本上不同。在某些实施方式中,突变型植物可不具有在本文描述的一种或多种多核苷酸的一个或多个、两个或更多个、三个或更多个、四个或更多个、或者五个或更多个外显子中的一个或多个突变;或可不具有在本文描述的一种或多种多核苷酸的一个或多个、两个或更多个、三个或更多个、四个或更多个、或者五个或更多个内含子中的一个或多个突变;或可不具有在本文描述的一种或多种多核苷酸的启动子中的一个或多个突变;或可不具有在本文描述的一种或多种多核苷酸的3’非翻译区中的一个或多个突变;或可不具有在本文描述的一种或多种多核苷酸的5’非翻译区中的一个或多个突变;或可不具有在本文描述的一种或多种多核苷酸的编码区中的一个或多个突变;或可不具有在本文描述的一种或多种多核苷酸的非编码区中的一个或多个突变;或其部分中的其两个或更多个、三个或更多个、四个或更多个、五个或更多个;或者六个或更多个的任何组合。
在进一步方面,提供了鉴定在基因中包含突变的植物、植物细胞或植物材料的方法,所述基因编码本文描述的多核苷酸,所述方法包括:(a)对植物、植物细胞或植物材料实施诱变;(b)由所述植物、植物细胞或植物材料或其后代获得核酸样品;和(c)测定编码本文描述的多核苷酸的基因或其变体或片段的核酸序列,其中所述序列中的差异指示其中的一个或多个突变。
锌指蛋白可用于调节本文描述的多核苷酸中的一种或多种的表达或活性。在多个实施方式中,通过锌指核酸酶介导的诱变,修饰包含多核苷酸编码序列的一部分或全部的基因组DNA序列。在基因组DNA序列中搜索锌指蛋白结合的独特位点。作为另外一种选择,在基因组DNA序列中搜索锌指蛋白结合的两个独特位点,其中两个位点在相反链上且紧靠在一起,例如相隔1、2、3、4、5、6个或更多个碱基对。相应地,提供了与多核苷酸结合的锌指蛋白。
锌指蛋白可进行改造,以鉴定基因中的所选靶位点。通过截短或扩增或与选择方法偶联的定点诱变过程(所述选择方法例如但不限于噬菌体展示选择、细菌双杂交选择或细菌单杂交选择),锌指蛋白可包含源自天然锌指DNA结合结构域和非天然锌指DNA结合结构域的任何基序组合。术语“非天然锌指DNA结合结构域”指锌指DNA结合结构域,其结合靶核酸内的三碱基对序列,并且在包含待修饰的核酸的细胞或生物中不存在。用于设计结合特异性核苷酸序列的锌指蛋白的方法是本领域已知的,所述特异性核苷酸序列对于靶基因是独特的。
在其它实施方式中,锌指蛋白可选择为结合多核苷酸的调节序列。更具体而言,调节序列可包含转录起始位点、起始密码子、外显子区、外显子-内含子边界、终止子或终止密码子。相应地,本发明提供了在本文描述的一种或多种多核苷酸附近或其内通过锌指核酸酶介导的诱变产生的突变型、非天然存在的或转基因植物或植物细胞,以及通过锌指核酸酶介导的诱变用于制备此类植物或植物细胞的方法。用于将锌指蛋白和锌指核酸酶递送至烟草植物的方法类似于下文对于大范围核酸酶递送描述的那些。
适用于遗传修饰的植物包括但不限于单子叶和双子叶植物和植物细胞系统,包括来自下述科之一的物种:爵床科(Acanthaceae)、葱科(Alliaceae)、六出花科(Alstroemeriaceae)、石蒜科(Amaryllidaceae)、夹竹桃科(Apocynaceae)、棕榈科(Arecaceae)、菊科(Asteraceae)、小檗科(Berberidaceae)、红木科(Bixaceae)、十字花科(Brassicaceae)、凤梨科(Bromeliaceae)、大麻科(Cannabaceae)、石竹科(Caryophyllaceae)、三尖杉科(Cephalotaxaceae)、藜科(Chenopodiaceae)、秋水仙科(Colchicaceae)、葫芦科(Cucurbitaceae)、薯蓣科(Dioscoreaceae)、麻黄科(Ephedraceae)、古柯科(Erythroxylaceae)、大戟科(Euphorbiaceae)、豆科(Fabaceae)、唇形科(Lamiaceae)、亚麻科(Linaceae)、石松科(Lycopodiaceae)、锦葵科(Malvaceae)、黑药花科(Melanthiaceae)、芭蕉科(Musaceae)、桃金娘科(Myrtaceae)、蓝果树科(Nyssaceae)、罂粟科(Papaveraceae)、松科(Pinaceae)、车前草科(Plantaginaceae)、禾本科(Poaceae)、蔷薇科(Rosaceae)、茜草科(Rubiaceae)、杨柳科(Salicaceae)、无患子科(Sapindaceae)、茄科(Solanaceae)、红豆杉科(Taxaceae)、山茶科(Theaceae)或葡萄科(Vitaceae)。
合适物种可包括下述属的成员:黄葵属(Abelmoschus)、冷杉属(Abies)、槭属(Acer)、剪股颖属(Agrostis)、葱属(Allium)、六出花属(Alstroemeria)、凤梨属(Ananas)、穿心莲属(Andrographis)、须芒草属(Andropogon)、蒿属(Artemisia)、芦竹属(Arundo)、颠茄属(Atropa)、小檗属(Berberis)、甜菜属(Beta)、红木属(Bixa)、芸苔属(Brassica)、金盏菊属(Calendula)、山茶属(Camellia)、喜树属(Camptotheca)、大麻属(Cannabis)、辣椒属(Capsicum)、红花属(Carthamus)、长春花属(Catharanthus)、三尖杉属(Cephalotaxus)、菊属(Chrysanthemum)、金鸡纳属(Cinchona)、西瓜属(Citrullus)、咖啡属(Coffea)、秋水仙属(Colchicum)、鞘蕊花属(Coleus)、甜瓜属(Cucumis)、南瓜属(Cucurbita)、狗牙根属(Cynodon)、曼陀罗属(Datura)、石竹属(Dianthus)、洋地黄属(Digitalis)、薯蓣属(Dioscorea)、油棕属(Elaeis)、麻黄属(Ephedra)、蔗茅属(Erianthus)、古柯属(Erythroxylum)、桉树属(Eucalyptus)、羊茅属(Festuca)、草莓属(Fragaria)、雪花莲属(Galanthus)、大豆属(Glycine)、棉属(Gossypium)、向日葵属(Helianthus)、橡胶树属(Hevea)、大麦属(Hordeum)、天仙子属(Hyoscyamus)、麻风树属(Jatropha)、莴苣属(Lactuca)、亚麻属(Linum)、黑麦草属(Lolium)、羽扇豆属(Lupinus)、番茄属(Lycopersicon)、石松属(Lycopodium)、木薯属(Manihot)、苜蓿属(Medicago)、薄荷属(Mentha)、芒属(Miscanthus)、芭蕉属(Musa)、烟草属、稻属(Oryza)、黍属(Panicum)、罂粟属(Papaver)、银胶菊属(Parthenium)、狼尾草属(Pennisetum)、矮牵牛属(Petunia)、虉草属(Phalaris)、梯牧草属(Phleum)、松属(Pinus)、早熟禾属(Poa)、一品红属(Poinsettia)、杨属(Populus)、萝芙木属(Rauwolfia)、蓖麻属(Ricinus)、蔷薇属(Rosa)、甘蔗属(Saccharum)、柳属(Salix)、血根草属(Sanguinaria)、赛莨菪属(Scopolia)、黑麦属(Secale)、茄属(Solanum)、高粱属(Sorghum)、米草属(Spartina)、菠菜属(Spinacea)、菊蒿属(Tanacetum)、红豆杉属(Taxus)、可可属(Theobroma)、小黑麦属(Triticosecale)、小麦属(Triticum)、北美穗草属(Uniola)、藜芦属(Veratrum)、长春花属(Vinca)、葡萄属(Vitis)和玉蜀黍属(Zea)。
合适物种可包括黍属物种(Panicum spp.)、高粱属物种(Sorghum spp.)、芒属物种(Miscanthus spp.)、甘蔗属物种(Saccharum spp.)、蔗茅属物种(Erianthus spp.)、杨属物种(Populus spp.)、大须芒草(Andropogon gerardii)(大蓝秆草)、象草(Pennisetumpurpureum)(象草)、虉草(Phalaris arundinacea)(草芦)、狗牙根(Cynodon dactylon)(狗牙根)、高羊茅(Festuca arundinacea)(高羊茅)、互花米草(Spartina pectinata)(草原索草)、苜蓿(Medicago sativa)(苜蓿)、芦竹(Arundo donax)(芦竹)、黑麦(Secale cereale)(黑麦)、柳属物种(Salix spp.)(柳树)、桉树属物种(Eucalyptus spp.)(桉树)、小黑麦(Triticosecale)(小麦杂交黑麦)、竹、向日葵(Helianthus annuus)(向日葵)、红花(Carthamus tinctorius)(红花)、桐油树(Jatropha curcas)(麻风树)、蓖麻(Ricinuscommunis)(蓖麻)、油棕(Elaeis guineensis)(棕榈)、亚麻(Linum usitatissimum)(亚麻)、芥菜(Brassica juncea)、甜菜(Beta vulgaris)(甜菜)、木薯(Manihot esculenta)(木薯)、番茄(Lycopersicon esculentum)(番茄)、莴苣(Lactuca sativa)(生菜)、香蕉(Musyclise alca)(香蕉)、马铃薯(Solanum tuberosum)(土豆)、甘蓝(Brassicaoleracea)(绿花椰菜、花椰菜、抱子甘蓝(Brussels sprouts))、山茶(Camellia sinensis)(茶)、草莓(Fragaria ananassa)(草莓)、可可(Theobroma cacao)(可可)、咖啡(Coffeycliseca)(咖啡)、葡萄(Vitis vinifera)(葡萄)、菠萝(Ananas comosus)(菠萝)、辣椒(Capsicum annum)(辣椒和甜椒)、洋葱(Allium cepa)(洋葱)、甜瓜(Cucumis melo)(甜瓜)、黄瓜(Cucumis sativus)(黄瓜)、笋瓜(Cucurbita maxima)(南瓜)、南瓜(Cucurbita moschata)(南瓜)、菠菜(Spinacea oleracea)(菠菜)、西瓜(Citrulluslanatus)(西瓜)、秋葵(Abelmoschus esculentus)(秋葵)、茄子(Solanum melongena)(茄子)、蔷薇属物种(Rosa spp.)(玫瑰)、香石竹(Dianthus caryophyllus)(康乃馨)、矮牵牛属物种(Petunia spp.)(矮牵牛)、一品红(Poinsettia pulcherrima)(一品红)、白羽扇豆(Lupinus albus)(羽扇豆)、燕麦(Uniola paniculata)(燕麦)、翦股颖(翦股颖属物种(Agrostis spp.)、松属(Pinus spp.)(松树)、冷杉属物种(Abies spp.)(冷杉)、槭属物种(Acer spp.)(枫木)、大麦(Hordeum vulgare)(大麦)、草地早熟禾(Poa pratensis)(早熟禾)、黑麦草属物种(Lolium spp.)(黑麦草)和猫尾草(Phleum pratense)(梯牧草)、柳枝稷(Panicum virgatum)(柳枝稷)、高粱(Sorghuyclise)或(高粱、苏丹草)、芒草(Miscanthusgiganteus)(芒草)、甘蔗属物种(Saccharum sp.)(能源蔗)、香脂杨(Populusbalsamifera)(白杨)、玉蜀黍(Zea mays)(玉米)、大豆(Glycine max)(黄豆)、油菜(Brassica napus)(芸苔)、小麦(Triticum aestivum)(小麦)、陆地棉(Gossypiumhirsutum)(棉花)、水稻(Oryza sativa)(稻)、向日葵(Helianthus annuus)(向日葵)、苜蓿(Medicago sativa)(苜蓿)、甜菜(Beta vulgaris)(甜菜)或珍珠粟(Pennisetum glaucum)(珍珠粟)。
多个实施方式涉及经修饰的突变型烟草、非天然存在的烟草或转基因烟草植物或植物细胞,以调节基因表达水平,由此产生与对照相比较,其中多肽的表达水平在目的组织中经调节的植物或植物细胞(例如烟草植物或植物细胞)。所公开的组合物和方法可应用于烟草属的任何物种,包括黄花烟草(N.rustica)和普通烟草(N.tabacum)(例如LA B21、LNKY171、TI 1406、巴斯玛(Basma)、Galpao、Perique、Beinhart 1000-1和Petico)。其它物种包含无茎烟草(N.acaulis)、尖叶烟草(N.acuminata)、非洲烟草(N.africana)、花叶烟草(N.alata)、阿米基诺氏烟草(N.ameghinoi)、抱茎烟草(N.amplexicaulis)、阿伦兹氏烟草(N.arentsii)、渐狭叶烟草(N.attenuata)、阿姆布吉烟草(N.azambujae)、贝纳莫特氏烟草(N.benavidesii)、本赛姆氏烟草(N.benthamiana)、印度烟草(N.bigelovii)、博内里烟草(N.bonariensis)、洞生烟草(N.cavicola)、克利夫兰氏烟草(N.clevelandii)、心叶烟草(N.cordifolia)、伞床烟草(N.corymbosa)、迪伯纳氏烟草(N.debneyi)、木丝烟草(N.excelsior)、福尔吉特氏烟草(N.forgetiana)、香烟草(N.fragrans)、粉蓝烟草(N.glauca)、粘烟草(N.glutinosa)、古特斯比氏烟草(N.goodspeedii)、哥西氏烟草(N.gossei)、杂交烟草(N.hybrid)、因古儿巴烟草(N.ingulba)、卡瓦卡米氏烟草(N.kawakamii)、奈特氏烟草(N.knightiana)、郎氏烟草(N.langsdorffii)、渐尖叶烟草(N.linearis)、长花烟草(N.longiflora)、海滨烟草(N.maritima)、特大管烟草(N.megalosiphon)、摩西氏烟草(N.miersii)、夜花烟草(N.noctiflora)、裸茎烟草(N.nudicaulis)、欧布斯特烟草(N.obtusifolia)、西方烟草(N.occidentalis)、西方亚种香芥烟草(N.occidentalis subsp.hesperis)、耳状烟草(N.otophora)、圆锥烟草(N.paniculata)、少花烟草(N.pauciflora)、矮牵牛状烟草(N.petunioides)、蓝茉莉叶烟草(N.plumbaginifolia)、夸德瑞伍氏烟草(N.quadrivalvis)、雷蒙德氏烟草(N.raimondii)、波缘烟草(N.repanda)、莲座烟草(N.rosulata)、莲座亚种因古儿巴烟草(N.rosulata subsp.ingulba)、圆叶烟草(N.rotundifolia)、赛特氏烟草(N.setchellii)、拟似烟草(N.simulans)、茄叶烟草(N.solanifolia)、斯佩格茨氏烟草(N.spegazzinii)、斯托可通氏烟草(N.stocktonii)、香甜烟草(N.suaveolens)、美花烟草(N.sylvestris)、拟穗状烟草(N.thyrsiflora)、绒毛烟草(N.tomentosa)、绒毛状烟草(N.tomentosiformis)、三角叶烟草(N.trigonophylla)、荫生烟草(N.umbratica)、波叶烟草(N.undulata)、颤毛烟草(N.velutina)、序叶烟草(N.wigandioides)和花烟草(N.x sanderae)。
本文还考虑使用烟草栽培变种和优良烟草栽培变种。因此,转基因、非天然存在的或突变型植物可以是烟草品种或优良烟草栽培变种,其包含一种或多种转基因、或者一个或多个遗传突变或其组合。一个或多个遗传突变(例如,一种或多种多态性)可以是非天然存在于个别烟草品种或烟草栽培变种(例如,优良烟草栽培变种)的突变,或可以是的确天然存在的一个或多个遗传突变,条件是所述突变并非天然存在于个别烟草品种或烟草栽培变种(例如,优良烟草栽培变种)中。
特别有用的普通烟草品种包括白肋烟型、深型、烟道烘烤型和东方型烟草。品种或栽培变种的非限制性例子是:BD 64、CC 101、CC 200、CC 27、CC 301、CC 400、CC 500、CC600、CC 700、CC 800、CC 900、Coker 176、Coker 319、Coker 371Gold、Coker 48、CD 263、DF911、DT 538LC Galpao烟草、GL 26H、GL 350、GL 600、GL 737、GL 939、GL 973、HB 04P、HB04P LC、HB3307PLC、杂交403LC、杂交404LC、杂交501LC、K 149、K 326、K 346、K 358、K394、K399、K 730、KDH 959、KT 200、KT204LC、KY10、KY14、KY 160、KY 17、KY 171、KY 907、KY907LC、KTY14×L8LC、Little Crittenden、McNair 373、McNair944、msKY 14×L8、窄叶Madole、窄叶Madole LC、NBH 98、N-126、N-777LC、N-7371LC、NC 100、NC 102、NC 2000、NC291、NC 297、NC 299、NC 3、NC 4、NC 5、NC 6、NC7、NC 606、NC 71、NC 72、NC 810、NC BH129、NC 2002、Neal Smith Madole、OXFORD 207、PD 7302LC、PD 7309LC、PD 7312LC、'Perique'烟草、PVH03、PVH09、PVH19、PVH50、PVH51、R 610、R 630、R 7-11、R 7-12、RG 17、RG81、RG H51、RGH 4、RGH 51、RS 1410、Speight 168、Speight 172、Speight 179、Speight210、Speight 220、Speight 225、Speight 227、Speight 234、Speight G-28、Speight G-70、Speight H-6、Speight H20、Speight NF3、TI 1406、TI 1269、TN 86、TN86LC、TN 90、TN97、TN97LC、TN D94、TN D950、TR(Tom Rosson)Madole、VA 309、VA359、AA 37-1、B 13P、Xanthi(Mitchell-Mor)、Bel-W3、79-615、Samsun Holmes NN、KTRDC 2号杂交49、白肋21、KY8959、KY 9、MD 609、PG 01、PG 04、PO1、PO2、PO3、RG 11、RG 8、VA 509、AS44、Banket A1、巴斯玛Drama B84/31、巴斯玛I Zichna ZP4/B、巴斯玛Xanthi BX 2A、Batek、Besuki Jember、C104、Coker 347、Criollo Misionero、Delcrest、Djebel 81、DVH 405、 Comum、HB04P、希克斯阔叶、Kabakulak Elassona、Kutsage E1、LA BU 21、NC 2326、NC 297、PVH2110、Red Russian、Samsun、Saplak、Simmaba、Talgar 28、Wislica、Yayaldag、Prilep HC-72、Prilep P23、Prilep PB 156/1、Prilep P12-2/1、Yaka JK-48、Yaka JB 125/3、TI-1068、KDH-960、TI-1070、TW136、巴斯玛、TKF 4028、L8、TKF 2002、GR141、Basma xanthi、GR149、GR153、Petit Havana。即使本文未特别指明,也考虑上述的低转化亚变种。
实施方式还涉及用于产生突变型植物、非天然存在的植物、杂交植物或转基因植物的组合物和方法,所述植物已进行修饰,以调节本文所述的一种或多种多核苷酸(或如本文描述的其任何组合)的表达或活性。有利地,所获得的突变型植物、非天然存在的植物、杂交植物或转基因植物可在整体外观上与对照植物相似或基本上相同。多种表型特征,例如成熟程度、每植物叶数、杆高、叶插入角度、叶大小(宽度和长度)、节间距离、以及叶片-中脉比可通过田间观察进行评价。
一个方面涉及本文所述的突变型植物、非天然存在的植物、杂交植物或转基因植物的种子。优选地,所述种子是烟草种子。进一步方面涉及本文所述的突变型植物、非天然存在的植物、杂交植物或转基因植物的花粉或胚珠。另外,提供了如本文所述的突变型植物、非天然存在的植物、杂交植物或转基因植物,其还含有赋予雄性不育的核酸。
还提供了如本文所述的突变型植物、非天然存在的植物、杂交植物或转基因植物或其一部分的可再生细胞的组织培养物,其中培养物再生能够表达亲本的所有形态和生理特征的植物。可再生细胞包括但不限于来自以下的细胞:叶、花粉、胚、子叶、下胚轴、根、根尖、花药、花及其一部分、胚珠、枝条、茎、杆、髓和荚膜或源自其的愈伤组织或原生质体。
再进一步方面涉及源自或可源自突变型、非天然存在的或转基因植物或细胞的烘烤的植物材料,诸如烘烤的叶或烘烤的烟草,其中本文所述多核苷酸中的一种或多种的表达或者由其编码的蛋白质的活性经调节。
适当地,所述植物(例如,叶)的视觉外观与对照植物大体上相同。适当地,所述植物是烟草植物。
实施方式还涉及用于产生突变型、非天然存在的或转基因植物或植物细胞的组合物和方法,所述植物或植物细胞已进行修饰,以调节本文所述多核苷酸或多肽中的一种或多种的表达或活性,所述修饰可产生具有调节水平的蛋白酶的植物或植物组分(例如叶,例如绿叶或烘烤的叶,或者烟草)或植物细胞。
在另一个方面,提供了用于调节(例如,提高)植物的至少一部分(例如叶,例如烘烤的叶,或者烟草)中的蛋白酶的量的方法,所述方法包括下述步骤:(i)调节(例如,提高)本文所述多肽中的一种或多种(或如本文描述的其任何组合)的表达或活性,适当地,其中多肽由本文所述的相应多核苷酸序列编码;(ii)测量在步骤(i)中获得的突变型、非天然存在的或转基因植物的至少一部分(例如叶,例如烘烤的叶,或者烟草或烟雾)中的蛋白酶含量;和(iii)鉴定与对照植物相比较,其中的蛋白酶含量已进行调节(例如,提高)的突变型、非天然存在的或转基因植物。适当地,所述突变型、非天然存在的或转基因植物的视觉外观与对照植物基本上相同。适当地,所述植物是烟草植物。
在另一个方面,提供了用于调节(例如,提高)烘烤的植物材料的至少一部分(例如烘烤的叶)中的蛋白酶的量的方法,所述方法包括下述步骤:(i)调节(例如,提高)多肽(或如本文描述的其任何组合)的一种或多种的表达或活性,适当地,其中多肽由本文所述的相应多核苷酸序列编码;(ii)收获植物材料(诸如叶中的一种或多种),且烘烤一段时间;(iii)测量在步骤(ii)中或在步骤(ii)期间获得的经烘烤植物材料的至少一部分中的蛋白酶含量;和(iv)鉴定与对照植物相比较,其中的蛋白酶含量已进行调节(例如,提高)的经烘烤植物材料。
与对照相比较,表达中的增加可为约5%至约100%,或者至少10%、至少20%、至少25%、至少30%、至少40%、至少50%、至少60%、至少70%、至少75%、至少80%、至少90%、至少95%、至少98%或100%或更多(诸如200%、300%、500%、1000%或更多)的增加,这包含转录活性或多核苷酸表达或多肽表达或其组合中的增加。
与对照相比较,活性中的增加可为约5%至约100%,或者至少10%、至少20%、至少25%、至少30%、至少40%、至少50%、至少60%、至少70%、至少75%、至少80%、至少90%、至少95%、至少98%或100%或更多(诸如200%、300%、500%、1000%或更多)的增加。
与对照相比较,表达中的降低可为约5%至约100%,或者至少10%、至少20%、至少25%、至少30%、至少40%、至少50%、至少60%、至少70%、至少75%、至少80%、至少90%、至少95%、至少98%或100%的降低,这包含转录活性或多核苷酸表达或多肽表达或其组合中的降低。
与对照相比较,活性中的降低可为约5%至约100%,或者至少10%、至少20%、至少25%、至少30%、至少40%、至少50%、至少60%、至少70%、至少75%、至少80%、至少90%、至少95%、至少98%或100%的降低。
本文描述的多核苷酸和重组构建体可用于调节本文描述的蛋白酶在目的植物物种(适当地,烟草)中的表达。
许多基于多核苷酸的方法可用于增加基因在植物和植物细胞中的表达。作为实例,可制备与待转化的植物兼容的构建体、载体或表达载体,其包括目的基因连同能够在植物或植物细胞中过表达所述基因的上游启动子。示例性启动子在本文中描述。在转化后且当生长在合适条件下时,所述启动子可驱动表达,以便调节(例如,降低)这种酶在植物或在其特定组织中的水平。在一个示例性实施方式中,生成携带本文所述一种或多种多核苷酸(或如本文描述的其任何组合)的载体,以在植物或植物细胞中过表达所述基因。所述载体携带位于转基因上游的合适启动子(例如花椰菜花叶病毒CaMV 35S启动子),从而驱动所述转基因在植物的所有组织中的组成型表达。所述载体还携带抗生素抗性基因,以便对经转化的愈伤组织和细胞系赋予选择。
在一个优选实施方式中,启动子和调节序列来源于SEQ ID No:1-80中的一者或多者。这些调节序列可与同源或非同源表达序列结合使用以增加所述序列在烘烤程序期间在烟草植物中的表达。
来自启动子的序列的表达可通过包括表达控制序列(包括增强子、染色质激活元件、转录因子反应元件等)增强。此类控制序列可为组成型的,且以通用方式上调转录;或其可为兼性的,且响应于特定信号上调转录。专门指示与衰老相关的信号和在烘烤程序期间活跃的信号。
因此,多个实施方式涉及通过将多核苷酸的多个拷贝整合到植物基因组内,用于调节(例如,提高)本文所述的一种或多种多核苷酸(或如本文描述的其任何组合)的表达水平的方法,其包括:用表达载体转化植物细胞宿主,所述表达载体包括可操作地连接于本文所述的一种或多种多核苷酸的启动子。由重组多核苷酸编码的多肽可以是天然多肽,或对于细胞可以是异源的。
携带本文所述的一种或多种多核苷酸(或如本文描述的其任何组合)的突变等位基因的烟草植物可用于植物育种计划,以制备有用的品系、品种和杂种。特别地,可使所述突变等位基因基因渗入上述商业上重要的品种内。因此,提供了用于植物育种的方法,其包括将如本文所述的突变型植物、非天然存在的植物或转基因植物与含有不同遗传同一性的植物进行杂交。所述方法可进一步包括将后代植物与另一植物杂交,且任选重复杂交直到获得具有期望的遗传性状或遗传背景的后代。此类育种方法发挥的一个目的是将期望的遗传性状引入其它品种、育种品系、杂种或栽培变种,尤其是具有商业利益的那些。另一个目的是便于在单一植物品种、品系、杂种或栽培变种中叠加不同基因的遗传修饰。考虑种内和种间交配。起于此类杂交的后代植物也称为育种品系,是本发明的非天然存在的植物的例子。
在一个实施方式中,提供了用于生产非天然存在的烟草植物的方法,其包括:(a)将突变型或转基因烟草植物与第二烟草植物杂交,以获得后代烟草种子;(b)在植物生长条件下,使后代烟草种子生长,以获得非天然存在的烟草植物。该方法还可包括:(c)将非天然存在的烟草植物的上一代与自身或另一烟草植物杂交,以获得后代烟草种子;(d)在植物生长条件下,使步骤(c)的后代烟草种子生长,以获得另外的非天然存在的烟草植物;和(e)将(c)和(d)的杂交和生长步骤重复多次,以获得非天然存在的烟草植物的更多世代。该方法可任选包括在步骤(a)之前提供亲本植物的步骤,所述亲本植物包含得到表征且不同于突变型或转基因植物的遗传同一性。在一些实施方式中,取决于育种计划,杂交和生长步骤重复0至2次、0至3次、0至4次、0至5次、0至6次、0至7次、0至8次,0至9次或0至10次,以便生成非天然存在的烟草植物的世代。回交是此类方法的例子,其中后代与其亲本之一或与其亲本遗传相似的另一植物进行杂交,以便获得在下一代中具有更接近于亲本之一的遗传同一性的后代植物。用于植物育种、特别是烟草植物育种的技术,是众所周知的且可用于本发明的方法中。本发明还提供了通过这些方法产生的非天然存在的烟草植物。某些实施方式排除选择植物的步骤。
在本文所述方法的一些实施方式中,使用标准田间操作,在田间评估起因于育种和筛选变体基因的品系。包括原始未诱变亲本的对照基因型包括在内,并且按随机化的完全区组设计或其它适当的田间设计,将入选者(entry)排列于田间。对于烟草,使用标准的农学实践,例如将烟草收获、称量且取样,用于在烘烤之前和烘烤期间的化学及其它常见测试。执行数据的统计分析,以确认所选择品系与亲本品系之间的相似性。任选执行所选植物的细胞遗传学分析,以确认染色体组和染色体配对关系。
DNA指纹鉴定、单核苷酸多态性、微卫星标记或类似技术可用在标记辅助选择(MAS)的育种计划中,以如本文所述的,将基因的突变等位基因转移或培育到其它烟草内。例如,育种者可由含有突变等位基因的基因型与农学期望的基因型的杂交,而制备分离的群体。可使用本文所列出的技术之一,使用从基因组序列或其片段所开发的标记来筛选F2中的植物或回交世代。鉴定为具有突变等位基因的植物可以回交或自花授粉,以制备待筛选的第二群体。取决于预期遗传模式或所用MAS技术,有必要在每轮回交之前对所选择的植物进行自花授粉,以帮助鉴定期望的个体植物。可重复进行回交或其它育种操作,直到恢复轮回亲本的所需表型。
在育种计划中,成功的杂交获得能育的F1植物。所选择的F1植物可与亲本之一杂交,并且第一回交世代植物进行自花授粉,以产生再次筛选变体基因表达(例如,基因的无效版本)的群体。将回交、自花授粉和筛选的过程重复例如至少4次,直到最终筛选产生能育且与轮回亲本相当相似的植物。如果需要的话,这种植物进行自花授粉,并且随后再次筛选后代,以确认植物显示出变体基因表达。在一些实施方式中,在F2代的植物群体中筛选变体基因表达,例如,根据标准方法如通过使用具有引物的PCR方法,鉴定由于基因缺失而未能表达多肽的植物,所述引物基于本文所述的一种或多种多核苷酸(或如本文描述的其任何组合)的核苷酸序列信息。
杂交烟草品种可通过以下方式产生:阻止第一品种的雌性亲本植物(即,种子亲本)的自花授粉,允许来自第二品种的雄性亲本植物的花粉使雌性亲本植物受精,且允许F1杂种种子在雌性植物上形成。可通过在花发育早期阶段将花朵去雄,来阻止雌性植物的自花授粉。作为另外一种选择,可使用雄性不育的形式阻止在雌性亲本植物上的花粉形成。例如,通过细胞质雄性不育(CMS)或转基因雄性不育可产生雄性不育,其中转基因抑制小孢子和/或花粉形成、或自交不兼容。含有CMS的雌性亲本植物是特别有用的。在其中雌性亲本植物是CMS的实施方式中,从雄性能育植物收获花粉且人工应用于CMS雌性亲本植物的柱头,并且收获所得到的F1种子。
本文所述品种和品系可用于形成单杂交烟草F1杂种。在此类实施方式中,亲本品种的植物可生长为基本上同质的相邻群体,以便于雄性亲本植物与雌性亲本植物的天然异花授粉。通过常规方式选择性收获在雌性亲本植物上形成的F1种子。还可大批种植两个亲本植物品种,且收获由于自花授粉在雌性亲本上形成的F1杂种种子和在雄性亲本上形成的种子的掺和物。作为另外一种选择,可进行三系杂交,其中单杂交F1杂种用作雌性亲本,并且与不同的雄性亲本杂交。作为另一选择,可制备双杂交杂种,其中两个不同单杂交的F1后代进行自身杂交。
可在突变型、非天然存在的或转基因植物群体中,筛选或选择具有所需性状或表型的那些群体成员。例如,可在单一转化事件的后代群体中,筛选具有由其编码的一种或多种多肽的所需表达或活性水平的那些植物。物理和生物化学方法可用于鉴定表达或活性水平。这些包括用于检测多核苷酸的DNA分析或PCR扩增;用于检测RNA转录物的RNA印迹、S1RNA酶保护、引物延伸或RT-PCR扩增;用于检测多肽和多核苷酸的酶或核酶活性的酶促测定;以及检测多肽的蛋白质凝胶电泳、蛋白质印迹、免疫沉淀和酶联免疫测定。其它技术如原位杂交、酶染色和免疫染色以及酶测定也可用于检测多肽或多核苷酸的存在或表达或活性。
如本文所述的突变型、非天然存在的或转基因植物细胞和植物包含一种或多种重组多核苷酸、一种或多种多核苷酸构建体、一种或多种双链RNA、一种或多种缀合物或者一种或多种载体/表达载体。
无限制地,在表达或活性已根据本发明进行调节之前或之后,本文所述的植物可出于其它目的进行修饰。下列遗传修饰中的一种或多种可存在于突变型、非天然存在的或转基因植物中。在一个实施方式中,修饰涉及含氮代谢中间产物转化的一种或多种基因,导致当烘烤时,产生比对照植物更低水平的至少一种烟草特异性亚硝胺的植物(例如叶)。可进行修饰的基因的非限制性例子包含编码烟碱去甲基酶的基因,诸如CYP82E4、CYP82E5和CYP82E10,其参与烟碱向降烟碱的转化,并且在WO2006091194、WO2008070274、WO2009064771和PCT/US2011/021088中描述以及如本文详细描述。在另一个实施方式中,修饰涉及重金属摄取或重金属转运的一种或多种基因,导致比不含一种或多种修饰的对照植物或其部分具有更低重金属含量的植物或植物部分(例如叶)。多药抗性相关蛋白质家族、阳离子扩散协助蛋白(CDF)家族、Zrt-,Irt样蛋白(ZIP)家族、阳离子交换剂(CAX)家族、铜转运蛋白(COPT)家族、重金属P型ATP酶家族(例如HMA,如WO2009074325中所述)、天然抗性相关巨噬细胞蛋白(NRAMP)同系物家族、以及参与重金属如镉转运的ATP结合盒(ABC)转运蛋白家族(例如MRP,如WO2012/028309中所述)。如本文使用的,术语重金属包括过渡金属。其它修饰的例子包括除草剂耐受性,例如,草甘膦是许多广谱除草剂的活性成分。通过转移aroA基因(来自鼠伤寒沙门氏菌(Salmonella typhimurium)和大肠杆菌(E.coli)的草甘膦EPSP合成酶),已开发草甘膦抗性转基因植物。通过转化来自拟南芥属(Arabidopsis)的突变型ALS(乙酰乳酸合成酶)基因,已产生磺脲抗性植物。来自突变型绿穗苋(Amaranthushybridus)的光系统II的OB蛋白已转移到植物内,以产生莠去津抗性转基因植物;并且通过掺入来自细菌肺炎克雷伯氏菌(Klebsiella pneumoniae)的bxn基因,已产生溴苯腈抗性转基因植物。另一示例性修饰导致对昆虫抗性的植物。苏云金芽孢杆菌(Bacillusthuringiensis)(Bt)毒素可提供延迟Bt抗性害虫出现的有效途径,如最近在绿花椰菜中所示,其中金字塔式的(pyramided)cry1Ac和cry1C Bt基因控制小菜蛾(diamondback moth)对任一单一蛋白质的抗性,且显著延迟抗性昆虫的进化。另一示例性修饰导致对由病原体(例如病毒、细菌、真菌)引起的疾病抗性的植物。已改造了表达Xa21基因(对白叶枯病的抗性)的植物,以及表达Bt融合基因和壳多糖酶基因(对三化螟的抗性和对鞘(sheath)的耐受性)两者的植物。另一示例性修饰导致改变的生殖能力,例如雄性不育。另一示例性修饰导致耐受非生物胁迫(例如,干旱、温度、盐度)的植物,并且通过转移来自拟南芥属的酰基甘油磷酸酶,已产生耐受的转基因植物;编码甘露醇脱氢酶和山梨糖醇脱氢酶的基因改善抗旱性,所述甘露醇脱氢酶和山梨糖醇脱氢酶涉及甘露醇和山梨糖醇合成。其它示例性修饰可导致具有改善的贮藏蛋白和油的植物、具有增强的光合效率的植物、具有延长的贮存期限的植物、具有增强的碳水化合物含量的植物以及对真菌抗性的植物;编码涉及生物碱生物合成的酶的植物。也可考虑这样的转基因植物,其中S-腺苷-L-甲硫氨酸(SAM)和/或胱硫醚γ-合成酶(CGS)的表达已被调节。
一种或多种此类性状可基因渗入来自另一烟草栽培品种的突变型、非天然存在的或转基因烟草植物,或可直接转化到其内。性状基因渗入到本发明的突变型、非天然产生的或转基因烟草植物中可通过所属领域中已知的任何植物育种方法实现,例如谱系育种、回交、双单倍体育种等(参见Wernsman,E.A,and Rufty,R.C.1987.第十七章.Tobacco.第669-698页于:Cultivar Development.Crop Species.W.H.Fehr(编),MacMillan PublishingCo,Inc.,New York,N.Y 761pp.)。如上所述的基于分子生物学的技术特别是RFLP和微卫星标记,可用于此类回交中,以鉴定与轮回亲本具有最高遗传同一性程度的后代。这允许加速生产与轮回亲本具有至少90%、优选至少95%、更优选至少99%遗传同一性的烟草品种,再更优选与轮回亲本遗传上相同的烟草品种,且还包括由供体亲本基因渗入的一种或多种性状。此类遗传同一性的确定可基于本领域已知的分子标记。
最后的回交世代可进行自交,以给出对于被转移的一种或多种核酸纯的育种后代。除转移的一种或多种性状(例如,一种或多种单基因性状)之外,所得到的植物一般具有本发明的突变型、非天然存在的或转基因烟草植物的基本上所有形态和生理特征。确切的回交方案将取决于被改变的性状,以确定合适的测试操作方案。虽然当被转移的性状是显性等位基因时,回交方法是简化的,但也可转移隐性等位基因。在这种情况下,可能有必要引入对后代的测试,以确定是否已成功转移所需性状。
多个实施方式提供了突变型植物、非天然存在的植物或转基因植物以及生物质,其中多核苷酸(或如本文描述的其任何组合)的表达水平被调节,以调节其中的蛋白酶活性。
此类植物特别是烟草植物的一部分,以及更特别地烟草植物的叶片和中脉,可掺入或用于制备多种消费品,包括但不限于气雾形成材料、气雾形成装置、吸烟物品、可抽吸物品、无烟产品和烟草产品。气雾形成材料的例子包括但不限于烟草组合物、烟草、烟草提取物、烟丝、切丝填料、烘烤的烟草、膨胀烟草、均质烟草、再造烟草和烟斗烟草。吸烟物品和可抽吸物品是气雾形成装置的类型。吸烟物品或可抽吸物品的例子包括但不限于香烟、小雪茄和雪茄。无烟产品的例子包括嚼烟和鼻烟。在某些气雾形成装置而不是燃烧中,烟草组合物或另一气雾形成材料被一个或多个电加热元件进行加热,以产生气雾。在加热气雾形成装置的另一类型中,通过将热从可燃性燃料元件或热源转移到物理上分开的气雾形成材料来产生气雾,所述气雾形成材料可位于热源之内、热源周围或热源下游。无烟烟草产品和多种含烟草的气雾形成材料可包含任何形式的烟草,包括沉积在其它成分上、混合于其它成分中、由其它成分包围或以其它方式与其它成分组合的干燥颗粒、碎片、小颗粒、粉末或浆料,所述其它成分采取任何形式,例如絮片、膜、卡(tab)、泡沫或珠。如本文使用的,术语“烟”用于描述一类气雾,其由吸烟物品如香烟、或通过燃烧气雾形成材料而产生。
在一个实施方式中,还提供了来自本文所述的突变型、转基因和非天然存在的烟草植物的烘烤的植物材料。烘烤绿色烟叶的工艺是本领域技术人员已知的,并且包括但不限于如本文所述的空气烘烤、火烘烤、烟道烘烤和阳光烘烤。
在另一个实施方式中,本发明描述了包括含有烟草的气雾形成材料的烟草产品,所述气雾形成材料包含来自本文所述的突变型烟草植物、转基因烟草植物或非天然存在的烟草植物的植物材料,例如叶,优选烘烤的叶。本文所述的烟草产品可以是混合的烟草产品,其还可包含未修饰的烟草。
突变型、非天然存在的或转基因植物可具有在例如农业中的其它用途。例如,本文描述的突变型、非天然存在的或转基因植物可用于制备动物饲料和人食物产品。
本发明还提供了用于产生种子的方法,其包括培养本文所述的突变型植物、非天然存在的植物或转基因植物,并且从培养的植物中收集种子。来自本文所述植物的种子可通过本领域中已知的方式进行条件处理,且包装在包装材料中,以形成制造物品。包装材料例如纸和布是本领域众所周知的。种子的包装可带有描述其中种子的性质的标记,例如固定至包装材料的标签或标记、印刷在包装上的标记。
用于植物基因分型以鉴定、选择或育种的组合物、方法和试剂盒可包括检测多核苷酸样品中的多核苷酸(或如本文描述的其任何组合)存在的手段。相应地,描述了组合物,其包含用于特异性扩增多核苷酸中的一种或多种的至少一部分的一个或多个引物,以及用于进行扩增或检测的任选的一个或多个探针和任选的一种或多种试剂。
相应地,公开了基因特异性的寡核苷酸引物或探针,其包含对应于本文所述的一种或多种多核苷酸的约10个或更多个邻接多核苷酸。所述引物或探针可包含以下或由以下组成:约15、20、25、30、40、45或50个或更多个邻接多核苷酸,所述引物或探针与本文所述的一种或多种多核苷酸杂交(例如,特异性地杂交)。在一些实施方式中,所述引物或探针可包含以下或由以下组成:约10至50个邻接核苷酸、约10至40个邻接核苷酸、约10至30个邻接核苷酸、或约15至30个邻接核苷酸,其可用于基因鉴定(例如,DNA杂交)、或分离(例如,细菌菌落或细菌噬菌体噬菌斑的原位杂交)、或基因检测(例如,作为核酸扩增或检测中的一个或多个扩增引物)的序列依赖性方法。可设计一个或多个特异性引物或探针,且用于扩增或检测一种或多种多核苷酸的部分或全部。作为具体例子,两个引物可用于聚合酶链反应方案中,以扩增编码核酸例如DNA或RNA的核酸片段。还可使用源自核酸序列的一个引物和第二引物(其与核酸序列上游或下游的序列杂交,所述序列诸如启动子序列、mRNA前体的3'末端或源自载体的序列)执行聚合酶链反应。可用于多核苷酸体外扩增的热和等温技术的例子是本领域众所周知的。样品可以是或可源自植物、植物细胞或植物材料,或者从如本文所述的植物、植物细胞或植物材料制备或衍生的烟草产品。
在另一个方面,还提供了检测样品中本文所述的一种或多种多核苷酸(或如本文描述的其任何组合)的方法,其包括下述步骤:(a)提供包含或疑似包含多核苷酸的样品;(b)使所述样品与多个引物之一或一个或多个探针接触,用于特异性检测所述一种或多种多核苷酸的至少一部分;和(c)检测扩增产物的存在,其中所述扩增产物的存在指示样品中存在一种或多种多核苷酸。在进一步方面,还提供了使用一个或多个引物或探针,用于特异性地检测一种或多种多核苷酸的至少一部分。还提供了用于检测一种或多种多核苷酸的至少一部分的试剂盒,其包含用于特异性检测一种或多种多核苷酸的至少一部分的多个引物或探针之一。试剂盒可包含用于多核苷酸扩增(例如PCR)的试剂,或用于探针杂交检测技术(例如DNA印迹、RNA印迹、原位杂交或微阵列)的试剂。试剂盒可包含用于抗体结合检测技术(例如蛋白质印迹、ELISA、SELDI质谱法或测试条)的试剂。试剂盒可包含用于DNA测序的试剂。试剂盒可包含用于测定至少蛋白酶含量的试剂和说明书。适当地,试剂盒包括试剂和说明书,用于确定植物材料、经烘烤植物材料或经烘烤叶中的至少蛋白酶含量。
在一些实施方式中,试剂盒可包含用于所述方法中的一种或多种的说明书。所述试剂盒可用于遗传同一性确定、系统发育研究、基因分型、单倍体分型、谱系分析或植物育种,特别是共显性评分。
本发明还提供了对包含如本文所述的多核苷酸的植物、植物细胞或植物材料进行基因分型的方法。基因分型提供了区分染色体对的同源物的手段,并且可用于区分在植物群体中的分离子。分子标记方法可用于系统发育研究、表征作物品种之间的遗传关系、鉴定杂交或体细胞杂种、定位影响单基因性状的染色体节段、图位克隆和定量遗传研究。基因分型的具体方法可采用任意数目的分子标记分析技术,包括扩增片段长度多态性(AFLP)。AFLP是由核苷酸序列变异性引起的扩增片段之间的等位基因差异的产物。因此,本发明还提供了使用诸如AFLP分析这样的技术,来跟踪一种或多种基因或核酸、以及遗传连锁至这些基因或核酸的染色体序列的分离的手段。
在一个实施方式中,本发明还提供了来自本文所述的突变型、转基因和非天然存在的植物的烘烤的植物材料。例如,烘烤烟叶的工艺是本领域技术人员已知的,并且包括但不限于空气烘烤、火烘烤、烟道烘烤和阳光烘烤。
在另一个实施方式中,描述了烟草产品,其包括包含来自本文所述的突变型、转基因和非天然存在的植物的植物材料(例如叶,适当地烘烤的植物材料,例如烘烤的叶)的烟草产品,或通过本文所述的方法生产的烟草产品。本文所述的烟草产品还可包含未修饰的烟草。
在另一个实施方式中,本发明描述了烟草产品,其包含来自本文所述的突变型、转基因和非天然存在的植物的植物材料,优选叶,例如烘烤的叶。例如,所述植物材料可加入烟草产品内部或外部,并且因此在燃烧后释放期望的香气。根据此实施方式的烟草产品甚至可以是未修饰的烟草或经修饰的烟草。根据此实施方式的烟草产品甚至可源自突变型、转基因或非天然存在的植物,其在除了本文公开的基因之外的一种或多种基因中具有修饰。
本发明还在下文实施例中描述,提供所述实施例以更详细地描述本发明。这些实施例阐述目前考虑用于进行本发明的优选方式,意欲举例说明而不是限制本发明。
实施例
提供以下实施例作为例证,而不是作为限制。除非另有说明,本发明采用分子生物学、植物生物学、生物信息学和植物育种的常规技术和方法。
实施例1
选择烘烤开始后的48小时时间点以基于Affymetrix数据筛选烘烤激活的基因,基本上如Martin et al.(2012)BMC Genomics,13:674)所述。
简单来说,将来自基因组DNA和来自EST重叠群的外显子候选物连接,且针对冗余清除基因组候选物(98%阈值)。此产生一组312,053个外显子候选物,其中12,925个由EST表示,但未包括在基因组集合中。如由制造商(Affymetrix)所述验证数据集。另外,质量检查包括探针水平模型,标准化的不成比例的标准误差(NUSE)和相对对数表达(RLE)曲线和DABG结果的分析,如由制造商所述。
由于外显子阵列设计不经受错配探针,故使用鲁棒多阵列平均(Robust Multi-array Average,RMA)方法进行概述。产生总共272,342个探针集表达值,且计算DABG P值以评估针对每一探针集获得的信号的显著性。此涉及散布在芯片上的背景探针。这些随机探针具有不同GC含量。质量检查涉及Affymetrix Power Tools(APT)和Bioconductor包装的组合,对其产生烟草外显子阵列(TobArray520623F)cdf环境。当表达值可供使用时,在线性模型LIMMA中使用调节t-统计进行差异基因表达分析。
实施例2
差异表达.组织样品使用RNA-seq测序;读段使用Tophat2映射到3个种类的基因组。将先前公开的基因模型用作差异基因表达分析的基础。在烘烤期间的表达变化使用Cuffdiff2软件基于映射的读段计算。如果基因表达水平在烘烤的前48小时期间显著增加,那么视为基因上调,且如果变化不显著或降低,那么不视为基因上调。烟草蛋白质通过BLAST搜索对照3个种类的转录物的数据库鉴定,且3个种类中的等效基因通过3个种类白肋、弗吉尼亚和东方的转录物的相互最佳BLAST命中搜索鉴定(e值截止值1e-80)。
数据(图2)展示3个烘烤种类中的衰老激活的基因的数量。
实施例3
针对已知蛋白酶家族的成员资格分析实施例2中鉴定的蛋白酶基因。结果阐述于表1中。
发现80种烘烤激活的蛋白酶基因属于21个不同蛋白酶家族。
在表中,AC,空气烘烤的;FC,烟道烘烤的;SC,阳光烘烤的。AC+FC+SC,在所有三种类型的烟草中上调;AC+FC,在空气烘烤的和烟道烘烤的烟草中上调;AC+SC,在空气烘烤的和阳光烘烤的烟草中上调;FC+SC,在烟道烘烤的和阳光烘烤的烟草中上调;AC、FC和SC,仅在相应的烟草类型中上调。
表1
实施例4
APA1由拟南芥中的单一基因编码且4在番茄中。烟道烘烤的弗吉尼亚烟草中激活的APA1基因(参见表1)接近APA1-Tomato-1。来自祖先美花烟草(S)和绒毛状烟草(T)两者的两种基因拷贝存在于普通烟草中。Affymetrix数据证实在弗吉尼亚烟道烘烤期间S形式(上图)和明显并非T形式的激活(下图)。
实施例5
表2说明三种烟草类型空气烘烤的白肋(AC)、烟道烘烤的弗吉尼亚(FC)和阳光烘烤的东方(SC)中的SEQ ID NO:1到80的差异上调。
表2
序列表
SEQ ID NO:1
ATGGCTCTTCGTTTCTCTTTAATTTTCCTATTTTCTCTTTTCTTAACGACGTCGTTATTGTTGTCCGTTAACGGCAACATTAACGGCGGTGAAGATGACGATATTTTGATCCGTCAAGTCGTAGGCGACGACGACGATCACTTGTTAAACGCCGATCATCACTTCACGATTTTTAAGAGGAGGTTCGGCAAAACCTACGCGTCCGATGAGGAGCATCATTACAGATTCTCGGTGTTCAAGGCTAACTTGCGCCGTGCAATGCGCCACCAGAAGCTTGATCCCTCCGCCGTTCACGGTGTGACTCAGTTTTCCGATTTGACTCCGGCCGAGTTCCGCCGGAATTTTCTAGGAGTTAACCGTCGGCTCCGGCTTCCTTCTGATGCCAATAAAGCTCCTATTCTTCCTACTGAGGATCTCCCTTCAGGTTTCGATTGGAGAGATCACGGTGCCGTCACGTCAGTAAAGAATCAGGTACTAGTATATATCAATGTTTGTGTAAAGTTTATCTTTTTTTGGATAGGCGAAGTGTTCGTCATTAATGAATAATTACATAATTTCTATTTGTATCGATTGAAAAACTAGGGTTCATGTGGCTCGTGCTGGTCATTTAGTACCACTGGTGCGTTAGAAGGTGCCACCTATCTTTCTACAGGGAAGCTTGTAAGCCTCAGCGAGCAACAACTTGTGGACTGTGATCACGAGGTTTGACGTTCTTCCTCTTTATCTTAGCTTAAAATCATGAATATATTGTCAATAGAGTTACTGTTTTTCTTTTTTCTTTTTTTCTGGGACGTTTGAATGTGTAAAATAATTTTCGCTGTGGTGTGTCACAGGATTTGGTCCATAGCTGTCATCTTTTTCTAGTTAAAGAAAATTGATAGCGTGAAGGACACTAACCGCATAAATTAAAGTGCTTTCTCTGATTCCGTCTCACTTTAAAGTTTAAGAACCCGTTTGGCCATGAAATTTCTTTTTTTTTTCCGTAAAATTTAACTTTTCTTCTAAATCAATGTTTGGCCATCAAATTTTTTATTTTCACTTGAAGATAATTTTACAATTTTTCAAAAATTTGAAAAACTTCAAAAACTGTTTTTCAAAATTTTGAATATTGTTGTTGATGTAAAAAACAGACACTAATTTATAAGAGTAATCTCCTCTTCTTTGTTTGGTGGATGGCCAGGGGTGGGGACTGGGGACCCATCTTAAGGGAGCGGAGGAAAAGTTGTTTTATTATTAGTTTATGGCTGGTTATGAAATTCAACTAATTGATACTCTGAGGATAACCACGGACAAAATTGTTTGGATGATGAGGAAATCGCATCCAAAAATTGTCTGCATCTGAATATACTTTTAACATTACTTGAAGTTTCAAGTTTAAGCTCGTGTATGCAACGTGGTGGGAGATGTACAAGGATAAATAGAAAGGCGTTGAGTTATTGAGATAGGTTTGTAAAACTCTTCTTAAATTTTCCATTGTTTGATTGCCATTATATAATCATTTGTATAATTTCCAACTTGGAAAAAGCTGTTCAAACTCAAAATAAGGTTTAGGCTTGAACTTATTGCTATTTACGGTGTCTGCCATTTTATAATCAGAAATGGGATTGAATACAGAGTTAATAAGACCACTGACTCGCCTTATTTACCTCACTCGTCTCAGATGAATTTTATACTTCCAAATTTCAGTGTTCCCCATCTCCCTGAAAAATGTATAATTTGGCCTTGCATTTATCTGCAGTGTGATCCAGAAGAAAAAGATTCATGTGACGCAGGGTGCAATGGTGGCCTAATGAATAGTGCCTTTGAATACACTCTGAAAGCTGGTGGACTTATGCGAGAAGAAGATTATCCATACACTGGCACCGATCGTGGAACCTGCAAATTTGACAACACCAAGGTTGCTGCTAAAGTTGCTAACTTTAGCGTTGTCTCCCTTGACGAAGAACAAATCGCTGCTAATCTTGTCAAGAATGGTCCTCTCGCTGGTAAATAGTCTCTCAAAACACTTTTCAATTTGCCTATCATTATGCTTCTTCTTTGTCCTTACTTGATATTGTCAAAGTATATACTTGGATTGTCATATTTATGCACTGGAATGTAAAAGGTATTTACACAATTAAGTCACTTATTAGGTAATTACAAGTAACTATTTTGATAAGTTTTAATTAGTAATGTGTTAAAATGATAATTAACTTGCTATTTAAATTCACTGATAGCCGTAACAAAATCTTTTAACTATTAATATATATAATATAAATATTTGTTTTTTAATAAACAACAAATATTATTTGTGAAAGATCCAGTTATGTAGCTTGAAACTACATTTTGGGATTTTGAATTATGTACTACTCTTCTTATGCTAATGGTTTTCAATTTTTCACTGATGTAAACTTCTGAAAGCATTTTTGTTGCTTGGCTTGCAGTGGCGATCAATGCAGTGTTCATGCAGACATACGTTGGCGGAGTTTCCTGCCCATATATATGCTCTAAGAAGTTGGATCATGGTGTCTTATTAGTTGGTTATGGTACTGGCTTTTCTCCCATTAGAATGAAAGAGAAACCATACTGGATCATCAAGAACTCATGGGGAGAGAAATGGGGTGAAAACGGATACTACAAAATCTGTAGAGGCCGCAATGTTTGCGGAGTGGATTCAATGGTTTCAACAGTTTCAGCTGTTAGTACCAGCTCACAC
SEQ 2
TTAAGCTGCTTCAGCAAATCCAACTCTGAGTTTGCCATAATCGAAGACTGTGTGATATCGACCCATGAAAACATCACCCAAGATCCTGTAACCAAAGGAATACCATAGAGAACTCAGTGAACAAAAGAACTGCAGGCTCAGGTTTAATTGTGCTGTAGCTCTATAGTTCGGATTAAACTTATATTTGGATTAACTGCATTGCTGATATTTATCTCTAAAACATAATTATAAACTAAAATAGAGAGAACATATAAAGATAACTTTACCAGAGTGGTCCGCGGGGAGGAGGAATGTCCAAGCCAGTGAAACCACTAATACACTGTGCCTTAGCACCCTCGCCCACCTTGAGTATGTACTGATCACGGGAAAAGAATCCAAAATTAGAACATAGATCAATCTAGGTCAGCAATCTAAACAGACAACTGAAACAAGTAAAAGGGAACCACACATCGGATTACAACTTCACTCTTTCAATCTTGAAAAAATTTGTTGAAAGGGTGGGAAAAGACTAGAGTGATAGTCTAGTAGAGAAAAGTTTTGCATATGGTCAAGGGGTTTGGTGTATCACTTGGATTTTTTCCTTTGTAAGATGTGGTCTATCCTGAATTATTCAAAGCTCAACCTCTTTATGTTACTAAACCACAAAACAAACAAATTCAGAAAAAATGCAAATGATCAAATTGATTTGGTGTACACTTGATGAATTTCTTCTTTGTAAGATTTGTTCTATACTGAATCCTTCAGACATAAAAAAAAATATTTTTTTTTTGGGGGGNCGGCCTGAATTAGCAAGGTCAGCAAGTAATACACTTCCATAAAAATAGCAAAGGGTAACTTTTTCACGGCACAAAGATCTTATGCAGGTTTTCTTAGATTACTTAGCTGGAAAATGAGACATCTAAATTTAAGTAAAGTCGAAATACTCACAACCTCAAAATATAGAAGTACTTCTTGATGACAACAAACATCTACTTCTCTGTAGAAACTGAAAACCTTAAACACTAGAATCGGTTTTGTAATATGACAATTAGTTGTAATGCCACAAAAGGACTCTATGATGAGCCACTTAATTTTTTCTCTCTTTGACAATGTTGAATTAGAAGAGGAATAGCAATGTTTATTACTGTCAAAGACCATTATAAAGCATACCTCCTTCGGGACGAGGTCAAAAACTTTGCCACCAATTGTGAAAGAGACTGTAGGCATTGAAGAAAGCTTTCCACAGTCAACAGCTGATTCCCCCAATGGGCTTGGGAGACGCTCGCAAAGCTGTCAGTCAGTCAGCAATCTTTTCAAGAAAAGAAGAAAATTGCAGTGACAGATGTTTTACCTCATTCACATAGTTTAATATGCGATCTTGAGTCTGGTTTTGTCTCAGTTGATTCTCCATCCATATGACCGCCATTTCACAAGCAGAGCACATACCATCCTGCAGTCCTGTGGATCTGCCAGCTTTCTCGTCTACAACACTCTCAATTCCCATACTGCAGAAAAAAGGCCACAGAATTATTCAGTATTTATATCAACATTATGGATTAACCAAATGCACTATATGTTCACATGAAGGGGCAAAGAGAGCCTGAAGACTAACCTAACTCCGCGGTTTCCATCGAAAGTGCATACTCCAACCTGTGAGCAAATCTTCTTTGGATGTGCCTGTTTAAATACTGACCAATTAGATAACCGGGAAAGGCAACTAGATTGCCAAGTGTCATTTTGCTGTAACTGCACAGGAAACTGCATATCAAACAAATGAAAATGCAGTTACACAGTTGAATGCTCACCTCTGCTAACAGCAAATCCATGATTGTCTGCCCGTACTGCTCCACTACAGATTTGCATTGTTGGCTAGCAACTCCAGAGGCTCCAATGGCTTGATTAATCATAGTGATTATGGTCTGTACGGAAAGGGGTGGGTTTAAGATTGCTCAACCTTGGAAGTGTTTTAATCGTACAATTGTAGAGACAAAAGGCAGCAGATTTTTACTTAATTTATATTGTCAACATTTCCAAGCCAACAGGATAAAACTTGGCTACAGTTTTCGGGTTGGATAATTTTCTTTTCAAATAGAAGAGGGGTAAATAAATAAGTCGACAGAAGACCAGGACTACAGCAGAAGTAAAAGCATCATCCTCATTGAAACGTAATAAAAGCAAGTAACACAAAACAACAAGTACCTACTGAGGCAGCTTTAAACATATTAAACTGAAAGACAGGGAAGAAAAAGCAGATTTACAGACTTCGGCCCAGTGATAAGCTAAGATGGTATATCCAAAGGTAACCTCAGAAATGAAAAACCAATTTCACTACCATCCTCTCTGTGATGAAAAATTAAAACACAACACAGATCAGATGATGGATTCGTGCTATTTAACTCATGAATCTTAGGAAAATGTTACTTTTCTTGCTGAGCTGTTGAAGGTTCAAAGGAACAAGGAAATCAATAATCGAATTGCGGTTGACTTTGATGATGCAAACAAATAACAAAAACATAACAAACAAGCGATATGTCCCAAATCAAGGCTATAGATAATACCGTTGGACCAGCCAAGAGAGAAGTCCCTGAATCCGCTATTGCAGAGCACCCACTTTCACAGTAACCTACACCCGATATGTTCATGTTAAAAACTCAAAGGAAGGGAAAATTCTATATCCAGGCACATAGCCTTCATCTATATTCCCGAAATTCGGCAATCCAATTCAAAAGGTACATAGCAAAACATACCAGTAGCTTTACCCTCGATAAGAACATCACCCATATCAAACTGCCAATCATATTACTAAGATCAAGATAGCATTTGTACAAAAAATGAACATACATAGTATCGAATTGACCGAATGACAAACCTGCCAATAACCTTTGTGTGTGACTGGGACATAAGTGATTTCTCCCTTATAGTGATTAGGATCAACCCCACCAAACACGATTTCTCCGCCTTGTTCTTCCTCTGTATTTCGGTTGAGCCAAAATGAGAAGACAGGATCCTTGATAAGACCCTGTTGGACCATGTTGTACCTGGAAAAGACAGGAGATGCTGCCCAGATGAATGTCAAATCAAATTTAAACAGAAAGAGACATCCAGCCTATCCTGCATTTATGGAAATCTAATCCTTCAATGTGTTAAACCTCTTCTGGAAAGGAAATTGTCTAGAGCTTTAATTTGGTTTGTGGGAAAGAAATAGAGCAAACTAAATACCGCCCACGTACCAAACTGGAACAGCATTGCCAACTGAAATCTCCTGGAATCCAAGACCCAATATACCGTCAAACTTGGCTACCAAAAATGTCACGCTGGGTTCTCTGGTTGCCTCAATAAATTCCTAGTACATGAACACCTTGAGATATAAGATTTCCACTTTCAAGAGATTTAAAACAAGTGAGGAGCCTCACTAACCTGATCTGTTACAACAAGGTCACCAACTTTGACGTTGTCTTGACTGAAGAATCCAGAAATAGCTCCACTACCATACTGAATTGCAGCAGACTTCCCTGTATGGCAAATCAAAAATTTATCACGAACTAAATCACATTAAATTACAATGCCAAATACGATCTCAGTCTTGTGGAAACATTCGATAAGATCTTAATGTTGTTCATTAAGGTAGGAGTGACCTAGTGTTCTTAAAAGCAAAACGTGCAAAAAAATAAAACAAGGTCCACGGACTTGCATTGAATTGCGAGAAGTGAAGCGCAAAATTAACAGGAAACAAGAAAATATCGAACATTTATGAATTTACTCTACCATAAAAATTAAACTGCAGGTTAAATAACTAATTTCGGCATCCAATATACAATAATCCCAGTATTAATTCAACTCCTCAAAATTGAGATTCAAAGAAGCAACCAATTCTAGTTGGAATCACTTTGTGCACCATTATTTGAAGCGCAACTTCTCTAAAGCGCATGGCTTCAGCAATGAAGTGATAGCCCTTGCTGCATCGCTTCATAACTTTAAACGACCAAGCAATGGCTTTCAATAACACTGGAGTGAACTCACCTGGCAAACCCATTCAACTGCTATAGACTGTTCAATCCATTTTCTTTGAGCAACATATATAATTGTAATAGAACAAAAAATAAAGAATAACTAGTTCTCTGCGGAAAATTTCTTATGTCACAGACCTACATGGATACAAACCGAGTTAATAGGGAAGAAGAAAGACCATCTAAAAAGGCATTGCATAGGTTAAGACTTAAGACTATACAAGGTGCAACGAAAAGGCACTAATCGCAGAGAGATATAAGGATATTGATGTTTCTTTTCCAAAACCTCACTAGTTACAGTAATATACTAAGAAACACAACATAAACATTAAACAGCCTCGTTTTATGTCTTAACAGTCAACTACATGTACTCGTCAATTAACCTTTCCAAGGGAATCCCTTGATGCACCGTGAGAAACACATAAGGACAATACAAAAGATGTTCCATAATGAACAAGATGGCACGTATTCTAAACAATAACAGGCATTAGAAGGAAGCATATGTTTCATGCAGCAATAAACAAGCAAATGGTAGAGAGAAACAATTTGCATCAACATACAGAAATGGAAACATAACATACCATTCTTCTTATAAGTACTTGATTCGCTTGATTTGAACTTGGAATGAAAGAAACAGGGAACCTACACCAAGATAGGCAGTCATCAAATTTTACATCACTCAAGATGGATGTACAATGCTATGCTTTGTATCATTTGCATGTATAGAAGCTTACAGAGAAATAGCACTTCGACGACGGCACCCACAAATTCGAGCTACCAGTGTCAAAGATTACAGTGAACTTCTGAGGTGGAGTGCCTACACCAATCTCCCCAAAATATTGAGCATCCATATAGTTCTTCAGTGCTACAATGTCTGTATCCTCAGAGTCCCCGAGTTTACCACGGAAGTTATACTTCCTAATAGACGCCCTCAAAACGTCCCCTTCCTTTGACTCAATGCGTGCAGCAAGCCGGTTATTTTGATCAAATTTCATTTTTTTCAAGCCAATTCTCATCAAGCCATCATTGGATGAGGAGGCCAAAGGAAAGAGCAGTGCTGAGAGAAACAGGGCAACAAGAAATACTTTTGCTCCCAT
SEQ 3
ATGGGTTCTTTCCTCTGTTTCTCCGTCATTGTTGTTCTCCTTGTTCTTCAGCCATGTTTAGCCAAGAAAGTTTACATTGTTCACATGAAAAATCACCAAATACCTTCTTCTTTTGCTACCCATCACGATTGGTACAATGCTCAGCTCCAATCTTTGTCCTCTTCTTCTACCTCTGATGAATCATCCCTTCTTTACTCTTACGACACTGCTTATTCTGGCTTTGCTGCTTCTCTTGACCCACATGAAGCTGAACTACTCCGTCAATCTGATGATGTTGTTGGAGTTTACGAGGATACTGTTTATACACTCCATACAACAAGGACTCCTGAGTTTCTGGGGTTGAATAATGAGCTCGGCCTTTGGGCTGGTCACAGTCCACAGGAACTCAACAACGCTGCTCAGGATGTTGTTATCGGAGTTCTTGACACCGGCGTTTGGCCGGAGTCGAAGAGCTATAACGATTTCGGTATGCCCGATGTGCCGTCGAGGTGGAAGGGTGAATGTGAATCGGGTTCCGATTTCGATCCGAAAGTACATTGCAACAAAAAGCTGATAGGTGCTCGTTTTTTCTCCAAAGGTTATCAAATGTCGGCCTCTGGCTCGTTCACGAACCAACCTAGACAGCCGGAGTCACCTCGTGACCAAGACGGTCATGGCACCCACACATCCAGCACCGCCGCTGGTGCACCTGTGGCGAACGCTAGCCTTCTCGGGTACGCTAGTGGGGTCGCGCGTGGTATGGCACCTCGAGCGCGTGTAGCTACGTACAAGGTATGCTGGCCTACTGGTTGTTTTGGTTCTGATATTCTAGCTGGTATGGAACGTGCTATTTTAGATGGAGTTGATGTACTTTCATTATCTTTGGGTGGTGGATCGGGTCCTTATTATCGTGATACAATTGCTATTGGTGCTTTCTCTGCTATGGAAAAAGGAATTGTTGTTTCCTGTTCAGCTGGAAATAGCGGTCCAGCTAAAGGCTCACTTGCAAATACAGCTCCTTGGATCATGACCGTTGGTGCTGGTACCATAGATCGTGATTTCCCTGCATTTGCTACTTTAGGTAACGGGAAAAAAATTACCGGAGTTTCGTTATACAGTGGAAAAGGAATGGGTAAAAAGGTAGTTCCATTAGTTTACAGCACAGACAGTAGTGCAAGTCTTTGTTTGCCGGGTTCACTTGACCCGAAAATGGTCCGAGGGAAAATAGTGTTATGTGATAGAGGGACAAATGCGAGAGTAGAAAAGGGTTTAGTAGTGAAGGAAGCTGGTGGAGTTGGGATGATATTGGCTAATACGGCGGAGAGCGGCGAGGAATTGGTGGCGGATAGTCATTTGTTGCCGGCGGTAGCTGTAGGTAGGAAATTGGGAGATTTTATAAGGCAGTATGTAAAGAGTGAAAAGAATCCGGCCGCCGTGCTCAGCTTTGGTGGGACGGTGGTGAATGTGAAACCGTCGCCGGTGGTGGCTGCGTTTAGTTCAAGAGGGCCCAATACTGTAACTCCACAGATTTTGAAGCCCGATGTTATTGGGCCTGGAGTTAATATTTTGGCTGCTTGGTCTGAGGCTATTGGGCCCACTGGGCTTGAAAAGGATACCAGAAGGACCAAGTTCAACATCATGTCTGGTAAGTATTACCAACAACGGCTAGTTTCTTAATTTAATCTTTTTCATGCTTAGCTTAATTATGGCCTTAATTATATTTTTATTAGATCTCGCAATTATTAATACTAACCGTACACACTTAAAAAGGAAAAGAGGAACGCGTAGAATAAAGACACCTGTGGGTGATCTGGAATTATGTACTATGCACATTCCTAAACTTTAGAGGGGTTCACATGTGTAGCATTGATAAGTTAATCCTAAATTACATTAGTTATAATTAAATATTAATGCAGTTTCCAAGAAAATAGATGGACTAAAATTTAGACTTATTTGTATGATGTGACGTGTGGAATTAAATTTAAAAACTGCCCAAGCCTATATCAAATTTATGGCTAAAATAGCAAGAAACGTCCCTTTAATAGGCACAGAAGAAATCCAAGAGGGGCTCGCTGTAGGAGTGTTAAGAGTTTCGATATGAACAAGGTCTAGAGAAGAATTTATTAATTAATTTCAATAATATACGCTAATGGTATTTGAAAACAATATATTGTAATTTATCGTAACAAGTTACTAATTTCGCTTATTATAGACCATTATTGTGAAGTTATTTCTATAGATAAGCCAATAGCATAAAATTCATCCGTCGGAATGTGCAAGGTGTAGTGGTAGGAGTGCTACTCATGATGTGACAAGTGCATGTCACGGGGTTTGAATCATAATGCAAACAAAAGCCTGATATGTTAGTGAAAAATGATAGAGGGACGGGTTCATTATTCACACAAAAGCTTGATATTTAAGTGAAAAATGATAGGGGAACGAGTTCATTATCCAAAGAGTTTCGAACCTAACCTTACACCATGGCCCTTCTTGGTATAATTTACACTAGTTTTATGAGGCTCCTTTTTGTCTCACAAATTTGTGGATCCCAATCTTACACTTCTGGGTCCACAAAATTGTGAGACAAAAAAGTTACCTCATACAACTAGTCTCGGTTGTGACTAGTTGTATGAGACACAAAATAAAATTTTCCGAAAAAGTAGTATGGTCTGTCTTCCGCTATGAGACTAGTTGTTTGAAACACAAAATAAAATTTTCCGAAAAAGTAGTATCGTCTGTCTTCCACTAGTGGGTCCTGGTCCCCTTGGAATCCCAGATTATTGGTCCCTACATAACTATAAAGGTCATAACCTTATCATGGATTTAACATCAACCCTTTGCCCCATCTGAGCACTCTGGACCTACCTTAATCACTTTATTGGCTGGAAATAAGTTGATGAACTTTTTGAATTTTTCTTGAAAAAACAACAACAAAAAACCACTTGTGATCCCACAAGTGGGTCCGGGGATTAGTGTGTTATAAAGAGGATGTTTCGGATAGACTTTCGGCTTAGGAAAGATCAATAAAGTAGTAGAAACAAGCAATAACAATAGCAAAATACTGAATTTTTCTTGAAAATCCTACACAAATCTCATACTTTGAAAATTGTATTTTGTTACATAATTTGATCATTTTTCACTTCGAACTCTTGTAGGCACATCCATGTCCTGTCCTCATATCAGTGGCCTAGCTGCACTGCTGAAAGCAGCACATCCTGAATGGAGTCCAAGCGCGATCAAATCTGCACTTATGACGACTGCCTATGTTCGCGACACCACCAACTCTCCTCTCCGCGACGCTGAAGGTGGCCAACTCTCCACTCCTTGGGCTCATGGATCAGGTCATGTTGATCCCCATAAGGCACTTTCCCCCGGTCTAATCTATGATATTACCCCAGAGGACTACATCAAATTCTTATGCTCCTTGGACTATGAGTTGAACCACATACAAGCCATTGTCAAGCGCCCGAATGTCACTTGTACTAAGAAATTTGCAGATCCTGGGCAGATTAACTACCCTTCATTCTCAGTTTTGTTCGGGAAATCAAGGGTTGTTCGTTACACCCGTGCAGTCATCAATGTAGGAGCTGCAGGATCCGTCTATGAGGTGACCGTTGATGCTCCCCCGTCTGTTACTGTAACCGTGAAGCCATCAAAACTTGTATTCAAAAGGGTAGGAGAGAGGCTGCGTTACACCGTTACATTCGTGTCAAAGAAGGGTGTTAACATGATGAGAAAGAGTGCATTTGGCTCCATTTCTTGGAATAATGCTCAAAACCAAGTTAGGAGTCCAGTTTCATATTCCTGGTCACAACTATTAGAC
SEQ 4
TCAAGCATCAGCACATCTTGTTGGTGCATATCCCAGTCTGGACCTTTTGGTGTCATATAAGATATGAAAATTCTGCTGCTGATAGTTTCCAATTATCGACAAAGCAGATCGAGGAGTCCCTAAAACTGCCAAACAAACGATATCCTCTGGTTCGAGTTTGATAAAGTAGTTCTCTACTGGAAAATTCCATACAGCTCCATCACCAAACACGATCCCAAACGAGGGAAATTCCAAGTTCTTCACACCAGACACATTGTAACACGGATTCAAAATAGGAAAGTCTTGTACAATGGGATATCCCTTAACCTTATTGACAAATGCCTCTTTTATAATCTCATAAGCAGGATCCGCGAAATAACTCAATGTGGTACCTGAATCAATGATTGCACCACCAAGACCTTCTAGCGATAAATTCCACGTCTCCTCGGGTATATTCAGTACCTCTCCTCCAACTATGACAGACTTTATCTGCACATAGTAGAATGTTTCCACTTCTTTGCCTCCAACCAATGAAGTAAAATTCAACTGTGGATGTTTCAAAAGTTCCTTATCTTCACCAAAAATCAACTTACTACTAACACTAGAATTGCTATTCCTATCAACAAGACAATACGAAAACGAATGACCATATAAAGATTGAAGCTGAGAAGCAAACGAAAGCGGCCCTCTCCCTAATCCTAACAAACCAGCAGCACCATGAAATAATCCTCTATTCCAATGACCACAACCAAACATCACATTTTCCACCTTCCTAAATTCACTCCCACTCGTCGTCGTGAGGTTAACAGTAAATGTCTCTAGCGCGAAATCGCCAGTAGTATTAGAACTATCACCATACCAATAGTAATAAGGACAAGTTTGATTCTCGGATTTACAAAGCTGAGGAGGATCAGGGGATGTAACAAATTTACACCTAGGATCATGACAACTTATATTTCTAAATGAAGTAGAGTCTTGAGGATTATAATGAGGTCCATTTTGTTCAAAACAATCAAAACAAGGAACACATTGAATCCAATTAAGATCACTACCAGTATCAAGAATTAAAGAAAAATGCTTAGGTGGTGTACCAACAAACACATCCATAAAATACTCACCAGAGCCAAGGCTTACACCTGACTCCAAAGTCGCCATTAGTTTGCCGGAAAGTTCATAAGATTCCAGCGAAACTGCTGCCGGAGCAATCACAGGCTTATGTTTGTCCACATGTTTTTCATTACTTTTTGCAAGTCTTGAATTGTAATTCTGATTTTTCTTCTCAACAATTCTTGTATGGAGTGTCTGAATTCTGCTTAAATCCCTTGCTCTTGACTCAAAGACTGAATCCTTAGCCTCAATTTTTTTACCAGCTGATCTGTGCCTTAACTGAAACTTTACAGCTTCTTTTTTCTGGTTTCCAAAAATGGAAACTTCTTCATTTTCTCCATTTTTTACATCAACACCATCTACTTCTTGAGCTATTGAATGGGTTTTTGATTTTTGAGAAACTCCATAGTTGCAATCTGAGTCAGCTGAAGAAGAAACAGCATTAAAGCTTGGATGGTTAGGGAATTCAATACCCGAAACACTAGAATTTAGATTTCTGAAGCTGTAAAATCCTCCACAGGCAACAAAACCAGAGGAAAACAAGAATATAAACAACAAAATGAAAAGAATGAACTTTGTCCCCAT
SEQ 5
TTACATTGAGGCATCAAGGAAAGCATTAGAGACATCATCTAATTCCACATTCAGATTTTTGGCTGAAGGCAATCCAGCCACCACATTATGTTCAATGCCACATTCATTTGTTCCTCTTCTGATCTTGAAGTAACCATCCTGTTATCATAGAGATAGACATACATTTAGACAAGAAGCTTATACAAATAGAGTTTAACTTTTGTGTATTGATAGTTTTCAGTTTGTTTAATCATTCAGGCTAAGATGACTTATGCAACTGTCTAAAGTAATTCTAATTTAGTAACTTTAAGAGTGCAGTAAATAACTTGCTTTAACTTTTAAGATACACAGATAGTGTAAAAATTATTTACGCTGTCAGTATAGTTAGCGCATCTACAACAAGGAGTTAGGATGTCGATGGTTTAAGGAAATTTGTGCTCATATATGAAAGCATAAGGAGAAAATAGAGTACATACATCACCCCAGCCTCTGTTCCAAGAATTAGCAATAAGCTGCAGAAGATAATTAAAAAAAAAGGTGATGATTAGATTTTGATAAAATGATATGATTTTAGATAGAAAATAAGACACCACAACTTGAACACATCAACATACCCAATAGTCCTCTCCCTGCTCACTGGTTCCCCATCCGATAAGCTTAACAGCATGGCCTCCCATACTTTGCCCTGTTACATGCTTGTAAACTCCAGACTTGTAGTGAGCAAAATCCTGTGATTGACAAAAAAGTTTTAAGTCATTGGGTTAGCGGAAAGCATTGCTAAGAAAAAGAGAAAATAACAATTATAATCAACGAGAACAATCAAATTCAAACAAGCATTAGTTTATACTAGAGAAACTGAGAAACTAACAATTATAACCAGATAACTTCTCATATTGTGCTAAGCATTAAATTATCTCATTTACAAGTATTAATTTGCAAGTAAATCTCGGACAACATAAAATGAGAAGGTACATCGGTAGTTAAAGTTTCTCAATTATCATAATAGTTACTATCAGGTCACTTAAAAGATCATTACATGTATCACATTATGGGATTGGCAGGTAATTGTTTCTATGTACACATACCACAGGACCAGCATATAAGTTTACCAGCTCAGACAATGTCATGTGTGAAAACTATATCAAATAATTTTAGTATCAGAACTTAACCCTCCCCCTTATCCTCATTACCTCGTAGACGGTAAAAGAGACCTCGACTGGTCCATTTTTGTAAATTTCTGTCATGATACTGTTGGGATCATGGTGGATCCTGTATGCATTGACACCATAATGCTTTGATTTCCCCCATAGTAGAATCTCCTTCACACACTTCCTCTGACACTTTGGGGTGGGATATCCTGGTTCACAACCAGGGTGGGAACATCCCTCATTATCAAAGTAAGGGTCACACTGCCAGAAAAAACAACTTTATTAGTGATTGATCATAAAGATCCACGGTAGCTAATGGTTTTAGAGGAAGCTGTAATCTCTTTTGGTGAAAATAAGTCACCATTTACCTCTTCTGTGACCACACCCCTACGGATAAAGTATCGCCATGCTGTTATTGGATATCCACCATCACAACCACTCCCACATAAAAAGCCACAGCATGCTAACAGATCATTTACAGACAGAGAGATATTCTGCATTACACATTAGAAGTTTAACATCAGTGACCATAACTACAGAAATAGATTCACATACGTTTTGTGCTAGGTGAGATGGTTTCCACATGCTTAGACCATGAAAAAGAATCATGAGGCTGGCACGTGAGAGCACTTGCTGAAGATATATAATTAAGTACATAAAAATGTGTCATCTAAAGCTTTTAAATGAGGCGGTCACACAGTTCAACAAAATTTACCAAGTTATGATGGATACAGAAACGATCAGACAGCGATTCAACAGCACCAAAAGCCCAACAAGAACCGCAATGTCCCTGATCTGTGACGCAAATTTGTCCCGGTGATTGATGTGCAAAGACGGAAAGCATTAGGATCACTATCATAGAATTATAATTCAATAGTAGTAAGCAAGAACAAGAAGAGACTGACCCAGAATTCTTCCGATAGTACTACATTGAGGCCAAGCTTTTCGTGCATCAAACTCTTTTGGTAGCTCCAAAAGTTTTGGATGAGTTAGAATAGGAATTCCCTCCAAATCACCTTCTCTTGCGGGCTTAACTCCAAGAAGGCGCTTAAATTGCGAAACCTGTGAAAACCAAAGAAAAACATAATTATAAATCGACGTTTTCGGTGTATGGCTGCAAAATAAGCAACTGTTATTGATGGTTAGAGAAAGCGCTAAGCCATATGATGATCTGACCGTGAAATTCGAGAATCGAGGGTTGAATGCAGCTTTCCACCCAGCTTTGGCATTTTCATTAACCTCTTTAATGATTGATTCCTACATGATTATCCAAAAAGCTCTCTTAGTTTTAAATTTGAAGCAACAAGGGCAATAACATCTTCTCTAAACATGAAAAGAAGAATATACCTGAAGGATTGCAGATTCAACTTTAGCTTCAGATATTGGCTGCTCTGCAACAACCTGTTTTCCATAAGAACAAAGAGATTCTACTCATCAAAAGACATCCTTAAAGCTTTTAGGAAACAGAGCTGCAACTCCAGGAACAAAAGCATGACACAATGAGTGACAAACGAAGAACTTCGGGCTCGTTTGGTACGAGGGATAAGGGATAATTAATCTCGGGATTAAATTTGAGATGAGTTTATCCCATGTTTGATTGTAGTGTTATTTTAATAATTATGGGAGGGTGGGATAAACAATCGCGGGATAACTAATTTCGGGATAATTAATCCTGCGAACCAAACAATCCCTAAAGGTTTCACTTTAATCAAGATGAAACTCTTCCACAACTTTTATTTTCAACATTATAATACTATTAGCCTGGAAAATTAATCAAAAGTTTGTAGGAAATTCATCATATGTCTAAAGCACTATAACGTAGAGGAAAAAGAATCATAGAACAAGCAGAAATTGTAATTAGTCCATTATTTCTCCTCCTTCTTCTCCCCTTTCTGTATTTCTTTGTGAAGCAATACTTCCTCTCATGTTATTATATTTCGACAAGTAAGTTAGCTAACACATAACATAAGTAATTTGCATCAAACCATATAATTAACTTCAGAAACATGTGTATACTTCTCTTTTCTCATTCTCACTAGGTAATGAGAAAATCATTAAAATTTGCTTCTACTCATGATTTCTAGTCAACGCTTAACTAAAGCATAAGAAGTCCAAAATACCCAACAATATTTGATCTTTCTGAAGAAACACAAAAAGGCTAATCCTTGTGTTCATCAAAAGCTATACAAATCAAATCAATACGCTAAATCCACCTAAAACAAAATCATCAATTCAATAGGCAAGAACTACCCATAAGACATACTCCTACTGTGAAAGGTTCAAAGAATGAAGAAACAAACCTGCAATATAAGGATAAACAAAGCACCAAAAAGCAAAGGAGTTGCTAAAGACTTCAGGGTCAAGGCCAT
SEQ 6
ACATTAGTCCTCCATACTTCTTTCTATCTTCTTCTGTCAGTCGCATCTCCCGGCGACTGTCTCCTCCTCTCCATTTTTCCTTTCTCTTTTTCCTCACCGAGATATTTTCCCTATAAACAAAACACCGTAAAAATCATCTCCTCTAATTTCCTATTTTCCCCATTTTTCCAAATGGGTTCTTTCCTCTGTTTCTCTGTCATTGTTCTTTTCCTTGTTTTTCAGCCATGTTTTTCCAAGAAAGTTTACATTGTTCACATGAAAAACCACCAAATACCTTCTTCTTTTGCTACACACCATGATTGGTACAATGCTCAGCTCCAATCCTTGTCCTCTTCTTCAACCTCTGACGAATCATCACTTCTTTACTCTTACGACACTGCTTATTCTGGCTTTGCTGCTTCTCTTGACCCACATGAAGCTGAACTACTCCGTCAATCTGATGATGTTGTTGGAGTTTACGAGGATACTGTTTATACACTTCATACAACAAGGACTCCTGAGTTTCTGGGGCTGAATAATGAGCTCGGTCTTTGGGCTGGTCACAGTCCGCAGGAACTCAACAACGCTGCTCAGGATGTTGTTATCGGAGTTCTCGACACCGGTGTTTGGCCGGAGTCGAAGAGCTTTAACGATTTCGGTATGCCCAATGTGCCGTCGAGGTGGAAAGGTGAATGTGAATCGGGTCCTGATTTCGATCCGAAAGTACATTGCAACAAAAAGTTAATCGGTGCTCGATTTTTCTCCAAAGGTTACCAAATGTCGGCTTCTGGTTCATTTACGAACCAACCTAGACAGCCGGAGTCACCTCGGGACCAGGACGGTCATGGGACTCACACATCCAGTACCGCCGCTGGTGCACCGGTGGCGAACGCTAGCCTTCTCGGTTACGCTAGCGGGGTCGCGCGTGGTATGGCACCGCGAGCGCGTGTAGCTACGTACAAGGTGTGCTGGCCTACTGGTTGTTTTGGTTCTGATATTCTAGCTGGTATGGAACGTGCTATTTTAGATGGCGTTGATGTACTTTCTTTATCTTTGGGTGGTGGATCGGGTCCTTATTATCATGATACAATTGCTATTGGTGCTTTCTCTGCTATGGAAAAAGGAATTGTTGTTTCCTGTTCAGCTGGAAATAGCGGTCCAGCCAAAGCTTCACTTGCAAATACAGCTCCTTGGATTATGACCGTTGGTGCTGGTACCATAGATCGTGATTTCCCTGCTTTTGCTACTTTAGGTAACGGGAAAAAGATTACCGGAGTTTCGTTGTACAGTGGAAAAGGAATGGGTAAAAAGGTAGTTCCCTTAGTTTACAGCACAGATAGTAGTGCAAGTCTTTGTTTGCCGGGTTCACTTGACCCGAAAATAGTCCGTGGGAAAATAGTGTTATGTGATAGAGGGACAAATGCGAGAGTAGAAAAGGGTTTAGTAGTGAAAGAAGCTGGTGGGGTTGGGATGATATTGGCGAACACGGCGGAGAGCGGCGAGGAATTGGTGGCGGATAGTCATTTGTTACCGGCGGTAGCTGTAGGGAGGAAATTGGGTGATTTTATAAGGCAGTATGTGAAGAGTGAGAAGAATCCGGCCGCCGTGCTCAGCTTTGGTGGGACGGTGGTGAATGTGAAACCGTCGCCGGTGGTGGCTGCGTTTAGTTCAAGAGGGCCCAATACTGTAACTCCACAGATTTTGAAGCCCGATGTTATTGGGCCTGGAGTTAATATTTTGGCTGCTTGGTCTGAAGCTATTGGGCCCACTGGGCTTGAAAAGGATACTAGAAGAACTAAGTTCAACATCATGTCTGGTAAGTATTACCAACAACGGCTAGTTTTTGTCATAATCTTTTTATTTATGCTTAGATTAATTATGGCCTTAATTATATTTTTATTAGATCTTGCAATTATTAATACTAATCGTACACACTTGAAAGGAAAAAGAGGAACATGTTTAATTAGTGCGTAGTGATCTGGAGCACATGCCTAAAGTTTAGAGGGGTTCACATGTGTTGCATTGATAAGTTAATCCTAAATTACATTAGTTATAATTAAATATTAATGCGCTTCCAAGAAAAAGTTGACTAAATTTATCATATATTTCCAAATTTGTTTTGAAAAATATGATTTTGGTGAAGTTTGGCTTGAAGATGAAAATGTGTTTGGACATCAATTTTCAAAACATATTTCCCAAATTTATTTTGGAAAAACATGAAACATTTCTTATACCCACAAGTTTAAAAAACTATCACAAATATCCAACGGTACCATTATCAATAACATTCATTATATTATCTCAAACCATAATCCTGAATATAAATAAATTTGGCACAATATTATCATTTTTATAATTAACTATATGATACACTATTAGATGATCGAGAATACGAAGCAACATCGTTTCAAAATAATAAATGAAAAATGGTGGACTCTTTTATATAATACAAAAGTTTGGAATAATTTTTAAAAAATATAATAATGATATTTTGACCCAAAACCAACATGTAGTCAAAATCTATGACCAAACATGTGTTTGCCAAATAAAACCCAAATTTATTTTGACAAAATATATGGCCAAACGGGGCTTAGTTGTATGATGTGTCGTGTGATATTAATAAAAGAACTGCCGAAGCCTATACCAAATTTATGGCTAAAATAGCAAGAAACGTCCCTTTAACAGGGACAGAAGAAATCCAAGAGGGGGCTCGCTGGTCTAGAGAAAAATTTATTAATTAATTTCAATAATACGCTGATGGTGTAAAAAATATTGACACCATCAATATATTGTAATATATCGTAAAAGTTTATTAATTTCACTTATTATAGACAATTATTTGAAGTTATCTTTATAGCTAACCCAATAGTGTAAAATTCATCCGTTGGAATGTGCAATATGATGTTTGTTTTCAGCTTTTGTGCAGTAGTGATTTTAAATAGGTATTACTTGGAGCTTTTGTGCGATGTGACAAAGTGCATGTCACAAGGTTAGAGTCATAATGAAGGCAAAAACATGATATTTAAGTGAAAAGTGATAGAGGGACGAGTTCATTGTCCGCACAAAAGCCTGATATTTAAGTTAAAAAAAATAGGTGACCAGCTTGATATTTAAGTGAAAAAGGATAGAAAGACGGGTTCATTATCCACCGAAAGTCGAACCTAACCTTTTGCCATGGCCTTTCTTGGTCATCAAAATAATTTAGGAGACTACCTAGGAAAAGTAGTATGGTCTGTCTTCCACTAGTGGGTCATAACCTTAATATCATCCCCCTTGCCCCGTTGAGTACTCTGGACCTATCTTAATCACTTCATTAGCTGGAAATAAGTTGATGAACTTTTTGAATCATTCTTGAAAATTCACAAATTCGAACCGTGGAAACAATCTATTACAGGAATGCAGTCTAAGTCTTCGTACAATAGACCCTGTGGTCCGGCCCTTATAGCAGGAGCCTACTGCACTGGGCTGACCTTTTTCTTTAAAATCTTACAGAGCTCAAAATTTGGACTTTGTACTGTTTCGTTACATTATTTGATCCTTTTTGTACGTCAAACTCTTTCAGGCACATCCATGTCCTGTCCTCATATCAGTGGCCTAGCTGCACTTCTGAAAGCAGCGCATCCCGAGTGGAGTCCAAGCGCGATCAAATCTGCACTTATGACGACTGCCTATGTTCACGACACCACCAACTCTCCTCTCCGTGACGCTGAAGGTGGCCAACTCTCCACTCCTTTCGCTCATGGATCAGGTCATGTTGATCCCCACAAGGCACTTTCCCCGGGTCTCATCTATGATATTACTCCAGAGGACTACATCAAATTCTTATGCTCCTTGGACTATGAGTTGAACCACATACAAGCCATTGTCAAGCGCCCGAATGTCACTTGTGCTAAGAAATTTGCAGATCCCGGGCAGATTAACTACCCTTCGTTCTCAGTTTTGTTTGGGAAATCAAGGGTTGTTCGTTACACCCGTGCAGTGACCAATGTAGCAGCTGCAGGATCCGTTTATGAGGTAGTCGTTGATGCTCCCCCATCCGTTCTGGTGACCGTGAAGCCATCAAAGCTTGTGTTCAAAAGGGTAGGAGAGAGGCTGCGCTACACCGTTACATTTGTGTCCAACAAGGGTGTTAACATGATGAGAAAGAGTGCATTTGGTTCCATTTCTTGGAATAATGCTCAAAACCAAGTTAGGAGTCCAGTCTCATATTCCTGGTCACAACTATTAGAC
SEQ 8
ATGAATCCTGAAAAATTCACCCACAAGACTAACGAGGCCCTTGCTGGGGCACACGAGCTAGCACTATCCGCAGGGCATGCTCAATTTACGCCTCTGCATATGGCTGTGGCCTTAATATCTGATCACAATGGTATTTTTCGACAAGCGATTGTCAATGCTGGTGGGAATGAAGAAGTAGCTAATTCAGTGGAGCGGGTATTGAATCAAGCGATGAAGAAGCTACCTTCTCAGACACCGGCTCCAGACGAAATCCCACCTAGCACTTCCCTTATCAAGGTGTTACGACGAGCACAATCGTCGCAGAAGTCTTGTGGTGACAGCCATTTAGCAGTGGATCAGTTGATTTTAGGACTGCTAGAAGACTCTCAAATCGGAGATCTTTTGAAAGAAGCTGGGGTGAGTGCATCAAGAGTGAAATCAGAGGTAGAGAAACTTAGAGGAAAGGAAGGAAGAAAAGTGGAAAGTGCTTCAGGGGATACCACATTCCAAGCACTCAAGACTTATGGCCGTGATCTTGTGGAACAAGCAGGAAAGCTTGATCCCGTGATTGGTAGGGATGAAGAAATTAGAAGAGTCGTTCGGATTTTATCAAGGAGGACGAAGAACAACCCGGTTCTTATTGGAGAGCCTGGTGTGGGTAAAACAGCAGTTGTTGAAGGGCTAGCACAGAGGATTGTACGTGGTGATGTTCCAAGTAATTTAGCTGATGTTAGGCTTATAGCATTGGATATGGGAGCGCTAGTTGCTGGAGCTAAGTACAGAGGTGAATTTGAAGAGAGGCTGAAGGCTGTGCTGAAAGAAGTTGAAGAAGCAGAAGGGAAAGTGATACTTTTCATTGACGAGATACATTTAGTCCTTGGTGCTGGTCGGACAGAAGGGTCTATGGATGCTGCTAATCTGTTTAAGCCAATGCTAGCCAGAGGTCAATTACGGTGTATTGGTGCAACTACACTTGAAGAGTACAGGAAGTATGTTGAGAAGGATGCTGCATTCGAGAGGCGTTTCCAGCAGGTGTATGTGGCTGAGCCTAGTGTTACTGACACTATTAGTATTCTCCGCGGGTTGAAGGAGAGGTATGAAGGGCATCATGGTGTTAAAATTCAGGACAGAGCTCTTGTGGTGGCTGCCCAGCTCTCATCTCGGTACATTACAGGTATCTATACTTTTGCTATTTTTACATAGCACCTTGTTTTGATGTCTTTTCTCCGTCAATAACTAAGCATGTATATGCACTACTTTTTCCTCGTGCATTTCATTAACTCTATAAATCAGAATGGGACTTAGATTCGGTTAAGCGAATGAAGGTGAATTTTAACCTAAAATGTTATGGTGTCGGAGCTATAGATGTATATTTGTCTGGTACTAAAATGACTTCTTGAAGCAGTAGCCAGAATTTTGATTCATTTAAGCAGGTAGGGCATGAGACTTAATTAGCATATCATTGTCTGCACTTCCTTCTGGACCTTTACCAGTGTATGAGTTGTTTTTGTGTTACAAGCTGCTCCCCATCTGGATAATGGTGGATTAAGACTTATATGATTGTCAGAAGTGTACTAAAACTTCTTGAGGATAATTAAAAATTGCTCAAATCAAATCCGTAGCTCGTTTTCCACTGTCAGTTTTTGCAAAATGCTTTTTATGTCTGTGTCGTGACAAATTAAGCAGTCAGCCAGTTAAATTTTGGCAGTTTGGCATGCAAATTGTCTTTGCTGCACATTTCAGGTGCAAAAATCACTAACCTCTTTGTATTTTCAGGTCGACATCTGCCAGATAAGGCTATTGACCTAGTTGATGAAGCTTGTGCAAATGTTAGAGTTCAGCTTGATAGTCAACCTGAGGAAATTGACAATCTCGAAAGGAAGAGAATTCAGCTAGAAGTTGAACTTCACGCTCTCGAGAAGGAAAAAGACAAAGCTAGCAAAGCACGTCTAGTAGAAGTAAGTATTATATACTACCAATGCTTTTACTGGTAATTGCTCTATTTTCTAAAAGATATGTTAAGAATTATACTGACTCGAATTATACTGACACTGGTCCAGGTGAGGAAAGAACTTGATGATTTGAGAGACAAACTCCAGCCCTTGATGATGAGGTACAAAAAAGAGAAGGAAAGGATAGATGAGCTTCGCAGGCTCAAGCAAAAGCGCGATGAGCTCATTTATGCTTTACAAGAAGCTGAAAGGAGATATGATCTGGCTAGGGCAGCAGATCTGAGATATGGGGCAATTCAAGAAGTGGAAACTGCAATAGCAAATCTTGAGAGTACCTCAGCTGAGAGTACAATGCTAACAGAGACTGTGGGTCCTGATCAGATTGCCGAAGTTGTGAGTCGCTGGACTGGTATTCCGGTCTCAAGGCTTGGGCAGAATGAGAAAGAGAAACTGATTGGTCTTGGCGATAGGTTGCACCAAAGAGTGGTCGGGCAAGATCATGCAGTTAGAGCTGTTGCTGAAGCCGTGTTAAGATCTAGAGCTGGTTTAGGAAGGCCACAGCAACCAACTGGTTCATTCCTTTTCTTGGGGCCAACTGGTGTTGGAAAGACGGAGCTCGCTAAAGCTCTTGCAGAGCAGCTCTTTGATGATGATAAACTGATGATCAGAATAGACATGTCCGAGTACATGGAACAACACTCTGTTGCCCGGCTGATTGGTGCTCCACCAGGGTAAGTTTGAATCTAATTCTTTTCTTTTAATGTCATGTCATATTATTACAGTATTCAATCACAGATTCTCATGTGTTCCACATCTGCAGTTATGTTGGGCACGATGAGGGAGGACAACTTACTGAAGCTGTTAGGAGGCGGCCTTACAGTGTTGTGCTCTTTGATGAAGTTGAGAAAGCCCATCCTACGGTGTTTAATACATTGCTTCAAGTTTTGGATGATGGAAGGTTAACAGATGGTCAAGGCCGCACAGTTGATTTCACCAACACCGTGATTATTATGACTTCAAACTTGGGAGCAGAGTATCTGTTGTCTGGATTAATGGGAAAATGTACGATGGAGACGGCTCGTGAAATGGTAATGCAGGAGGTAAATAGTCTCAAACTAGTAACTTCCCCTTTGCTGATAAAACTGGAAGAATACAGTGAAATAGTTTACCTTATTAGCTAGAATGACAACTGTTTACATGTGTGTATGCTTTGTGATAGGTGCGAAAGCAGTTTAAGCCCGAGCTCCTGAATCGGCTGGATGAGATTGTTGTGTTTGATCCCCTGTCCCACGAGCAGTTGAGGCAAGTATGCCGCTACCAGATGAAAGACGTTGCACTACGGCTGGCTGAGAGGGGTATTGCATTGGGTGTTACTGAGGCAGCTCTAGATGTCATACTCTCAGAGAGTTATGACCCGGTAAGTGTTATATCTTGTAATCTAGTCCAATATTTTAGGATTATTTTGCGAACTTGTACTTATTGTGGTGATCATGGCATTCAGGTTTATGGTGCAAGACCTATTAGGAGATGGTTGGAGAGGAAGGTGGTGACCGAGCTATCCAAGATGCTTGTGAAGGAGGAGATTGATGAGAACTCAACGGTTTACATAGATGCTGGTGTCGGCAGGAAAGATCTAACCTACAGGGTGGAGAAGAATGGAGGTCTTGTGAATGCTGCCACCGGGCAAAAATCTGATATATTGATTCAGCTTCCTAATGGTCCCAGGAGTGATGCTGTCCAAGCAGTCAAGAAAATGAGGATTGAAGAAATTGAAGAAGACGAAATGGAAGAT
SEQ 9
TTATGTAAATGCTTCACGTTGCTGTGTAGGTAGCTCCAGTTCAGGCTCAAATGCTGTTAGCCGAGAAAAATACTGATCTACCATGTCATCTGTTACTTTATCCAAGCTTGGAGGATCCCACTGCACGTTTTAACATAAATCAAGAAACTCTCTCTAAGGTTAAAGTTGACTCTTAGGAAAATTCCTCCAGGAAGGGGCTCATAATTCATAAAAATAGCATATTAGTTCGCAATAATATTGATTTACCTTAGGTGCAAAATCTCTATCCACAAGCCGTGCACGAACCCCCTGATAGGAAGATATTAGTACACTGTGATGATGATCTGTGGAATTTAGTACTGAGACTTTTTTTAGCAGCATATATTACCTCACAGAAGTCATTAGTTATTTGTCCAGAGAAAGCTTGTACTGACATTCGATACTCACGAATTAGACACTGGTCCAGAGTCTGATGTCTGCCTTCGCGTATCTGATCAGAAGCATTATTGGAAGGTCATCTCACAATGCCCTGGTGGCAGAAGGGAAAAGGAGAACATTCTATCTAGAGAAACTTACAGATCTCAGTGAAACCTTCAAGCTCAGTGGGGCTGTTTCTTGTAGTTTTCGCAATGTTGAAACACACCATGCATCTTGCTTCTTGGCTGCCTCACTTTCCTGGCAACATTTAAATTTCTTTTTAAAAATGTAATCACACCTGTCCTTAAAAACTTCCACACATGTAGCAGACTTTATATGTTTACTTAAATACTTTACTGTCAGTATTTTAAAAACAAGAGAAAATCCATCTCGCTCGAAAGAGCACCCAGAGGTCGTAGATGCAAACGCAATGACTCAGCTTGTACTATGGTGCAACAAAGTGCAGCACTTAAAGGCTTCTCCGTGCAATTTATACGTGATGTTAGAATCTCCACAATATGCTTTGCCCAAAGAATGATAGATATAAGTTAGTGTGACAACAGCCCCAGGACCTATAGGTACTTTTTGAACCTCAAGGAGCAAACAAAGAGTATGCTGAGCAATGATCATTGCATTGTTTCAGATTTTTCAAATAGTGGCAACTTGTGATGGCCGAGGTATTAAAAGCAAAGCAATCCGATGCTGAACATTATTGCATTGATCAACTAAAATATGTTTCCACTGGCACCTAAGGGGGTAGACTGTAGCCAAATAGTCAATGAAGTCGGTGAAAAACACGGGAGACTAGGTTTCAAATCTAGCAAAGAGTCGGAGAGAGTCAGAGACAGAAAGGCTAGGTGATTTCTTCCTATCTGCCTAAAATTTGGTAGATAGCAAGTACCTGTTGGAATAGTCGAGGTGCGAGCAAATTGGCCCACACCACCGTTATAGAAAAGAATAATGTTTCTACTGGCCACACTTTGATTGTATGCCCATCCGATCGAGTTTCTTACCAGGTAATAGCCTTCACACCCCAGGCTAGTGAATTAAGATGTGCTATCCTATAAAGACTTACTTCGTCCTCCACTAATATGATCGTATCTTGTTACTGGAGTCGAAGATGGTATGTTGGCGAGGCTTCCACCTTCTGAATACCACATGTTTAATCCCTGTTCTTTCTTTTCTCACTAGAGATATTAAGCAAATTAAAATCAATTAACATCAGAATTTTGTATCTTTCAGAGACCTTTTCAGTATATTTGGGAAACAAGTTACACACTCAGGAACTATATGCTCACCAAAGCATCAATAATTTCTTCGACTGTGTCATGGCTGAAACATTTATTGAGAGTTTCAATCCTAAAGAAAGAGAAAAGATGGATTAGATTAAGCTATCAAATACTCATTGATCTGCAAGCAAGATCACACAAACTTGATAAGAATTTATTTTTATAAAAACAAAAGGTTAAAGAACAAGACTAAATACAAAAACCAAAAGATCCATGTGAAACTATGACAAAAATTATCATGAACCACAAAGTTTTAAGAATGTAGTGTGAAGTTTTAGAGAGCTGAAGTTTGTTCCACTAGTTGATGTATTTTATATAACTCATGGAAAAGATGGTGCCACTACCAGTTTCTCTATAATGGAAAAAACAATTTCGCCACTTACAAATCTAAGTGAAGCTAAATATGAAGAGCTAATTGCAGGATTTTGTGTTTACTGAATTAAACAATTTACCAAAAATTACCAATTTGATTCCTCGAAGAAGAAAGTAAGAGATTGATTTGCAATTCTTTCTTTTTGAGTAGTATGAAAAATGTTATGAAATGAGGGCAAGTGTCTCATCCAATTAAACCTGTAACTTGCTTTCCAACAGCAGGTAAAATATTGAAGCAAGGCTGCAGCGTGTCCACTTATGATTTCAATTAGCCAAGGGATTCTTTCGATGTTCTATAAGAAACGTGAACGGGATTCCCCGAAGATGTAGGTAGGGAATCCGCTGACTGGTGTGGTATATATGTTTGGTTAACAACTAGAAAAGGTGTTTCAAATCCAAAAGCAGCCCTTAACATTAAACGGTAATATGTATCAGTCCACCCCCTTTCAAACTGTAGCAGGAACTAATATATTTATGGACATTCCAATTTCCATATTTAGCAAGGATGACAGGTACCTGTGAAGTACACTTGTTGGATCTGGATGGACAATCTCTCCACAATTTTCAAGAGATCTTTCAATCACTGAAGGATCATCAGTCATCAATTTACCAAGTTGTTCCTCAATTAAGGGAAGCTTCTGAGAGGAGTGGAGAAGGCAGAGAAGAAGTTTTCAGAGCAATAGTCAAATAACCTAAGTAGGCTATTGGTACCTAGGTGCAAAGCAAATTAAGCAAAGGTAGTAGACTTACTGCACTGTGTAAGTAGTGAGTAGCAAGCCCACAGGATATCATTTCTGCTCCATTGATCTTGTCTCCAGTTAGGGCCAGGTACTCTCCTGCAAATTGAAAACATTTAAATAGCTTTCTTCCATGCTGATTTTTTCTTTCATTTACGCAGTAGCAAATGTCCAAACAGATGAGAGACACAGAGAGAGGCAGCATACTAAAAACATATTCCACTTGAACCACAAACTAGTATAGCATAAGATAGATAGGGTTTACATGAGCTGACATGCGTGCATCAGGTTGCTTAACATTTATATGCCATAGAATATGAAGTCATCAAAGAGTCAGCCATGTAAATGCAGTGCTACATGATCTAGCATGAGTTATCAGTTATCTACAAAAAAAAGGCTTGTAACCCACTAATTTTAGTCCGCCACCATTTAAAGCTAAATATTAAATAGATAGAACCACATATTAACATCACCGTATGAAGTTAAATTTTTAAAAAAAAAGGCAGTCCAGACAAATTCCGAACATAGGTAGAATTCAAAGTCATAATCTCCAAAATACATGAAACAGGAGGAGGAACACATGCTTACTTCCCAATGAGTTAGGGAAAAATAAGATGAAGTAAAAAGAACATCACGTTAGTACAAATATTTTGATAACAAGCATATAAGAAGGGAAACGTCCAGATATGACATGCTTATTTTTTCCCCATAGAAGATGCGCATCTCCCCAGAGAAGAGGAGATCGGGGAAGGGGGTGTGTGTGTGTGTGTGTAGAGAGAGAGAGAGAGAGAGGGGGCGCGGGGATCAGAAAAGTAGTTTCCAACTAGATAATTCATCATCAAAGACGATCAGCTACATGACATGATAGAAGTTACTGATGGATCCAAACCACAGCATGAGATCAGCAAAAGAATATACCCAAATAACCAGGGAGGTGTGAAAGGTAAAATGACGCCCCAGCATCGGGATGGTAACCAATCAATGTTTCTGGTGTGGCAAAAACCTACATTGATACAAGACCCAATACCCTTGTAAGAACGGTGAAAAATCGATCTAACAGCACTTAAAACACCTAAACCATCACCTCGTGCACCGGCCAAATTTGATAGACCTCTTTGACAATTAAGTTCGATAAATTTTTTATTTCCTGCACATTCCTTTAGTTCTTGAGTAATTTAAAATCAGGTTATAGCATTTCAGTAAGACCAAGATATCTTAAATTTTTCAGCACTTTAAACGCTAAGCCTGTGGTTATAAGTCATATATAACCGAAATAAACAAGCTTCATCACGAAAAGTCAAAAGAATTGATAATATACAATTGAAATTACTATTTAACTTATTGGTACAAAGAAACTTCATCAATATAGAGCATTCAATTTAAGAAGATTAAAAGATGAATTTTATCAAATTAACTCGGTACGAACTCATAAATAGATAATAATAACTGATAAGGCCAAAACTATCAATAAAGGAAAAGCAAGAGAAGGGGCAGAGGAAAAATTTGCTGAATAAGTTTGTAGCACTAGGATTTGAACTTCTCTCTGAAGAGAAATAATTAAGATTTAAAGAACAGAAAATTCATGATATGTGAAGTCACTAAGCTGTATATAAGAATGAGCACAAATGGAACTTCACATTACTTAGCAATGTAGTCTAGCAGTTCTTGAAGTAGGAGAATTTATTCTGAACCAACAAATGAAAAGCTTAAACAAATAAAGCATGACTAATCTTTTCCATACAGTTTTCTCAGTTGCAACACGGAAAGTTCCAGGAATTGAGATGCCAGCCCCACCACCCATGGTAATTCCATTCAAAAGAGCAACCTGCATATGGCATCTGAGGATTAATATCTATACTTGGGAACCGCATCCAGGGACCAGCTAATGTCAACAAAAGTATGACAAGTACTGAATATGCAATGACTGAATTCCTTCAACATGGAATTGAGGAGGTGAGAGCGAGAAGTAACTCAAATCCGAAAGTGCGCAAAAAGCTCTAATCGATAAAAAGATACAAAGAAATATGGATGATAACTACTGATAAATGCACACCGGGTTTCTGAACTAAAAAAAGACGTATATTATGATTAATAAAAGAATGAAAAGACCATTATGGCTGCATCTTGTCAAAGCAAAAATGATCACATGATTTTAACAAAATAAACAACTTCCAAAATTGAGGAGATATTATTTCATGGGCAAAAAGGAACAGACAAACTATAGGTCTAAACAGGTGACCACTTTGTTGAGCATCAACAACAGCTTATCCCAAATCTTTCATTTCAGAAGTCAAAGAACACAGCAGCAAGAGAACTATGGATGAACTAAGAAAATGGAGTTGAATATCTATGAATAATTGGATCAAATTTGCATTTGTCCTAAGGAAGTTCTTTAAGCTTATGAATCATAATGTTAATGTGACTGATTATGTTTTCCTCAACAGGCATCGTTATGAAGATTGTATAGCAGCCAGCATGATAAAGTAGTGTTGCTACCATTTCAAGTTTATGAATAAGATCCAATCAAACTTTGGCAAGACACATCTAACCTCTCTGAATACCATTTTAATTCAGAAAGAAGTGATTGGTGTAATTAACTGTCGAGATGCTGTCTTTGCAGCAGCTTAGTCACAGTAAGATGAGAGAGAATCCAATTAACAGAACAAGCGATCTTCTAATAAATCCAGATATTTCTATAAGAGTATCTTTAAAACTGCCAGCATAAAGTACAAAGGTGTTGAAATTTCAATAAGCAATGGCGAATATAGGATGAAAATGTTGATTCAACATCTAATAGAACTTACATATATTTTTTAGAAATTCTTAGGTTCCAACTACAGAAGCATATTTTATTATGCGGAAGTCTAGACGCACAGTTAGATCACAACATAAAAGACATGTAACTACAAAATTTATAAGACGCTGGCCTCTATCAAGGTTTAGTAAATACAACAAAAGTTCCTGATACATATATAAGAGGCAGAAAACAGAAGAAAATTCAAGTCAACAGTTCTTTTAGTATCTGATGTTTGAACAATAATGATACTTACATGTGGCTTCAAGAGTGTGCCGACAACGTATACTAAGTTATTTATTGTCCAACAAAAATCTTTACAGTCTTGAAGATTCCCTGCATCTATCCAATACAAAAAATATTACACAAAGAATGGAGAAGATCAAATTAAAATTGAGGTAATAGTAAATTGGTAGCATGAAGATGCCGGCTGCAGTTTTTAAAACGATGTAGAAACGTTTTAATTGTTCCCAAAAATAAATAGATATCAATACCCTACTCAATAAAAACAACAGAACATCCAGCTATCAGCACAAATAGCTTAATTAAGATAATTCACATCTAAACAACTTCTTCACACCCCAATAATACAATTATCGACAATGTTTAAATATTTAAAATTCCTTTAACCATGTATCACTTGCAACATAGCAGAATATGAACAAGTTCGCCGTGAAGTACAACAGTCAAAATGACACAACATACCTTGTTTTAACAAATTATAAATAGTGACAATGTCACCCCCAGCAGAAAATGCCTTGCCACTTCCCTTCAAAGTTAAAGAATAACTGAGTTGCTACTCAGTAGATTAACATATGTAACTTCACCTTAAGATAATCCAGAAGAACAACAATAGGAAAAGCCATAAAATAATTCAAACAATAATTTACTTAACAAATTACCTTCAATACCACGAATCCAATATCAGGATCATCTTCCCAATTTTTGTACAGCTTTAGCAACCTATCCACCTAAACAACAGTATCATGCAGAGTTTTTATTTATTATACAAGGTGGAAATTAAGATTGCCAACCAACTGGAATATCAAGATTCCCCAGTGTCATGTAGGGTTCATTCCCAAAGCCAAAACAAAATCAATTCAGCATAAAAACACGATTTTGATGGTTGGATATTAATTAAAACACGATTTTACTTGGTGGGTATCAATAAAAAAAATCCAAAAAGAGAGCAATAGATTTCAAAAATGATCTTCTTGTGCGTACACACTGTAGATTTCAAAAATGGCAAAAACAGAAGCAAAAAGTAAAGGTCTTTAAAAGGCAGGAAACTAACAACTGAAAAATTGAGAGCATTTAACGCATGGGGTCTGTTAAGGATTGCTGTTCTCGAAGAAGCTTTTCCCTCCACTAACACCTGAAAAATGCAAACTCATATTTTATAACAAAATGGAAATTTTGATAATGAATCATAAATTGGATTGACAAATTTTCTTTTAAAAAAAATTCAGAACTCACAGTGCTTTGGGATTCATCAACAAGGGCATTGGTAGAGACACTGCAAAAGCTTCTGGAGTGAGAAACCAAGCGCGAATTCTGCAGTAAGCGCCTCAAAATACTTGCTGATTTGAAGCTCTGCAT
SEQ 10
ATGGCCTTGACTCTGAAGTCTTTAGCAACTCCTTTGCTTTTGGGTGCTTTCTTTATCCTTGTATTGCAGGTTTGCTTCTTCATTCTTTGAACCTTTCTACAGTAGGAGTATTTCTAATTATGGGTAGTCCTTGCCTATTGAATTGATGATTTTGTTTAGGTGGATTTAGCGTATTGGTTTAATAGCTTTTGATGATTTTACAATTTATATGGATTACCCTTTTCGTATTTCTTCAGAAAGATCAAAGATTATTGGGTATTTAGGACATCCTATGCTTTAGTTAAGCGTTGACTTGAAATCATGAGTAGAAGTAAATTTTAATAATTTTCTCATTACCTAGTGAGAATGAGAAAAGAGAAGTATAGACATGTTCCTGCAGTTAAATATATGGTTTGATGCAAATTACTTATGTGTTAGTTAACTTACATGTTTCTATATATAACATGAGAGGAAGTATTGCTTCACAAAGAAATACAGAAAGGGCAGAAAATGGACGAAAAACAATGGACTAGTTACAGTTTCTGCTTTGCTCTATGATTCTTCCTCTACGTTATAGTGCTTTAGACATATGATGCATTTGCTACAAATTTTTAATTAATTTTCCAGGTTAATAGTATTATAATGTTGAAAATAAAAATTGTGGAAGTGTTTCATCTTGATTAAAGTGAAACCTTTAGTTCTGCGTTTGTGACCCACTGTGTCATACTTTTGTTCCTGGAGTTTCAACTCTGTTTCCTAAAAGCTTTTAGCTTGTCTATTGATGAATAGAATTGATGTGTTCTTATGGAAAGCAGGTTGTTGCAGAGAAGCCAATATCTGAAGCTAAAGTTGAGTCTGCAATCCTTAAGGTATATTCTTCTGTTCATGATTAAAGAAGATGTTAGTGCCCTTGTTGCTTCAAATTTAAAACTTAAGAGCGCTTTTTGGATGATCGTGTAGGAATCTATCATCAAAGAGGTTAATGAAAATGCCAAAGCTGGATGGAAAGCTGCATTCAACCCTCAATTCTCGAATTTCACGGTCAGATCATCATATATCTTAGCGCTTTCTCTAACCATCAACAACAGTTGCTTATTTTGTTGCTATACACTGAAAACATGCATTTATAATTATGTCCATCTTTGGTATTCACAGGTTTCACAATTTAAGCGCCTTCTTGGAGTTAAGCCCGCACGAGAAGGTGATTTGGAGGGAATTCCACTTCTAACTCATCCTAAACTTTCGGAGCTACCAAAAGAGTTTGATGCACGAAAAGCTTGGCCTCAATGTAGCACTATCGGAAGAATTCTGGGTCAGTTTCTTCTTGTTCTTGCTTACTACTATTGAATTATAATTCTATGATAGTGATCCTAATGCTTTCCGTCTTTGCACATCAATCACTGGGACAAATTTGCATCACAGATCAGGGACATTGCGGTTCTTGTTGGGCTTTTGGTGCTGTTGAATCGTTGTCTGATCGTTTCTGTATCCATCACAACTTGGTAAATTCTGTTGAACTGTGTGACCACCTCATTTAAAAGCTTTAGATGACGCATTTTTATTTACTTATTTATATATCTTCAGCATACTCTCTCATGTGCGAGCCCTGATTCTTTCTCATGGGCCAAGCACGTGGAAACTATCTTATATTAGCACAAAATGCTTGTGAAGTTTTCACTATAGTTAATGTCACTAATGTTAACTTTTAATGTGTAATGCAGAATATCTCTCTGTCTGTAAATGATCTGCTAGCATGCTGTGGCTTTTTATGTGGATCCGGTTGTGATGGTGGATATCCTATATCAGCATGGCGATACTTTATCCGTAGGGGTGTGGTCACAGAAGAGGTAAATGTTGTCTTATTTTCACCTCAAAAGAGATTACAGCTTTCAGTAAAACCATTAGTTACCGTGGATCTTTATGATCAATCACTAATAAAGTTGTTTTTATTCTTGCAGTGTGACCCTTACTTTGATAATGAGGGATGTTCGCACCCGGGTTGTGAACCAGGATATCCCACCCCAAAGTGCCAGAGGAAGTGTGTGAAGGAGAACCTACTATGGGGGAAATCAAAGCATTATGGTGTCAATGCATACAGAATCCACCGTGATCCCTACAGTATCATGACAGAAATTTACAAAAATGGACCAGTTGAGGTCTCGTTTACAGTGTACGAGGTAATGACGATAAGGAAGAATGTTAAGTTCTGATCCTAAAACTATTTGATACAGCTTTCCGTACATGACATTATCTGAGCTGGTAACCTTATATGTGGTTGCCTACCTATCCCAAAATGAGATACATGTAATTATTTTTAGGTGACCTATAGTGTAACTGTTATGATAATTGAGAAACTTTAACTACCGATGTACCTTCCCAATTTATGTTTGCCCGAGATTTACTTGCAAACTAATATCTGTAAATGAGATATTTAATGCTAACCACAAGACAATATCAGAAGTTACCTGTTGTCGTAAAACTGCATCATCTCTTTCTCGGTGCAAGTAGATTTGTTTAGATTTTGTTTGTTGTCTTTGATCATAACTGTTATCATCTCTTTTTCTCAGCAATGCTTTCCTCTAACCAATGAGTCAATTTTTTTTATTTTTTTTTTGTCAATCACAGGATTTTGCTCACTACAAGTCAGGAGTTTACAAGCACGTAACAGGTCAAAGTATGGGAGGCCATGCTGTTAAGCTTATCGGATGGGGAACTAGTGAACAGGGAGAGGACTATTGGGTATGTAGATGTGTTCAAGTTCTGGTGTCCTGTTTTCTATTTAAAAGCATATCTTTTTGTCAAAATCTAATCACCTTATATATCATCTGCAGCTTATCGCAAATTCTTGGAACAGAGGCTGGGGTGATGTATGTCCTTAAATTCATCCCTATGTTTTCATATATGAGCAAAAAGTCCTTAGACATAGGCATGCTAGCTTCTTGTTGTTGATGCACTAACTGGCACATCAATAAATGGATTTCAACTTATATAAACTAACAACGTAAACAATTTTTGCACTATATTTCAACTGGTAAAGTTATCTCTGTGTGACCTATTGGTCACGGGTTCGAGCCGTGGAAGCAGCCACTAATGCTTGCATTTGGGTAGGCTGTCTACATCATACCCCTTGGGGCTACGGCCCTTCCCAGGACCCTGCGTGAACGCGGGATGCCTTGTGCACCAGACTGCCCCTTTTATATTTTAACCAGTTAAGGCAAGTTATTTACTGCATTTTTTGAAGTTACTCATTTAGGATTACTATAGAGAGTTACATGCCGTCGTATGTCATTTAACCTAATGATGCAAATAAATTGTATACTATTTTAATGCACAGAAGTTAAAGTAGCTTCTTCTCTAAATGAATGTATATCTCCAATATGACAGGATGGTTACTTCAAGATCAGAAGAGGAACAAATGAGTGTGGCATTGAACATAATGTGGTGGCTGGATTGCCTTCTGCAAAAAATCTGAATGTGGAACTTGATGATGTATCTGATGCTTTCCTTGATGCCTCAATG
SEQ 11
CTATACCATCATACCCATGTTGGAATGTGCCACTCTGACAACAAGTGGAGATGTTACCGATGTTCTTTTGTTCCTCCATGTCAAAGAACCAAATACATATCCCTGTGTTGGTGCAGCCACCTTGAAGTTCACTGTAAAATTCATCTTCTGGTAATATCTAGTGAAGGCTAATCTTCGGGGAACCACAGTGACATTGACACCCGTAGGTGCATAGACAACTGCCTTGTAAATGCTTCTTGCTTTTCCCACGTTAGTAACAGTTCGAGTTACTGAATATGTGCTTCTGAGGTTTGGTATTGTGATGGAGGGATAATTTAGTCCATTTGGTGATGCAAAGGTTTGATCACAGGTGCTATTGTCCCTTGTAATCAGATGCAGAGATTTCTCATCATAACCAATTGAACAAAGAAATGCTCTGTAATCTGCTGGCTGTGCATCGTATATAAGACCAGGATCCAGGACATTCGTAGGGTTAACAAAGCCAGAACCAAAATCAAATGGAGTAGCTCTCTTCCCTTCAGGATCTACTATTATGGGTTTGTGATGCTTATCTGACAGTTTAGCTGCATTAATATTGATGAGATTAGTGCATTGAAGGCTTGAATGAAAGAGTTAGATTATGTAAAAGCTTTTATTCTACCTGTCGTCATGATCGCGGATTTAATTGCAGAGGGAGACCAAGATGGATGCACAGCTTTTAACAAGGCAACAACTCCTGTTATGTGGGGGCAAGCCATAGAAGTTCCGGATAGTACATTGAAGTTCAACTTAGTAGAAGCTGCTGGAGACCATGCTGCCAGGATATTTAATCCAGGAGCTGCAATATCAGGCTACAAAGGCAAATCAGCCAGGAAATTACTTGCAAGAAAAAGCCAAATCCTCAAATAAGGTAAGAACCAAAGAAAATGCAAAGTAACAAAGAACTAATCCACAACCACATTGACGATCCAGGAAGTAAATATCAGGATGCAACATAAAACTTTGTTGGTCTACATGACAAAGGCAGAGAGAGATCATTGTTGAAAACAAGTGGCAGTTGAAATTAAGTCCCTATATTACTATTTTTAGCGCACAAATTACCTTCAAAATTTCTGGTGTTACAGAATTAGGACCTCTTGAAGAAAATGCTGCTACTCGAGGAGCAGGTTGAGCTCCCAAAACGGTTCTAGCAGAGAGAATCCTTGCCATGGGGAGGCTGTTAATATAATAAGATGTGAGATTGGGAAGAGATGATAGTTAACAACATCAAAAAGTCAAGAAGAAGGAAGTGAACCGTGTATTGTTAATGTAAGCTAGGATCTTGTTTCCAATCTTTTTCCCAACAGTTGCTGCAGGAATGACAAAAGGGATGGCCACACCCTTGTCTGCGTCATCTATAAGGATCATCCCAACTCCACCGGCTTCTTTAACTATAATGCTTTTCTCCATCTTTGACTCACTTGAGCTTCCAGCATGTAGGCACACAAGCACCTTCCCTTTGGCCTTAGTTCTATTCAAAGAACTATCTAAGCAATAACTAGTGGAAAAGAGAGGGAAGGAAAAAAGTAAATTAAGATAATTGTCAATGCATACATATCTACATTTAACAAACAGCGAAAGTACCTTATGATTCACCTGGATTGATAGGGAGTGAAGTATCCAGCATAAGCTTCAGAAGCAGGTATGATTCTTGTAGATGTATTCATTTGAGATAAGCTAAGACTTTCACCCTTCCAAATATCCACAACTTAATTAGAAAATAGAAATTGAATAATAATAACATGTAAACTTGTCGAAAACTGGGATTACCTTGAGCCGAACTCCATTTCCTAGTAAAATATCAGAAGTAAAATCTCTATCAGTTGAACTGGCTGCAACTGTGATCATCCAAGGAGCTAAATTTGTGGCTGAACCAGTGCTGCCTTCATTTCCAACTGAAGCCACCACAAGTATTCCGCGGCTAACAGCATGATATGACCCCACAGAAATGGCATCATTGAAATAATCTCCTTGGGGAGCATCAGGGCCCAAAGATAGAGAAATGACATGAACCCCATCTCTAATTGCATCATCAAATGCAGCCAATAAATCAACATCATAGCAACCAGAACTCCAGCAGGTTTTATACACTGCTATCCTGGCCATTGGGGCACCACCTCTGGCTCCTCCATTTGCCAAACCTTTGTAATTCATATTAGCTACGTAACGCCCCGCTGCTGTTGAAGCTGTGTGACTCCCATGACCAGAACTGTCCCTAGCAGACTTGTAAAACATGGTCTTCCCATTTTCTTCTTCAGCTTCATAGCCACTCATATAATATCTTGCCCCAATTATTTTCCTACATAAGACAAATCATATTGCACTTATCATCTAAATAACAAAAGAAGAGATGGTTGCCAATCAAAAGAAAACCTGTTGCATATAGAGGCATTGAATGCTTCTCCTGATTGGCATTGTCCTTTCCATCCAGCTGGCACTGGAGGCATGTTGGTATCACTAAAACTTGGAGACTCAGGCCAAATTCCTGTTTCATTAAATATCTTAAGAGCTTAACCTCAAGTTCTAATTAGCTCGAAAAACAAGGGAAACAGTGGAGCTGGGGACCAGGTTGAGGATAAACTGATAAGGTTGTGAACAGAGATAATATTTGCATTGAAACATGTCACTCTAATGTATAGTGGCTCTTCCCATAAAGTAACATTTACAAGTAGTTCAAGCACACTGTTAGGCATAAATGCAAATGGCAAAATATGGGAAGAGGTGAGAAAGATGAACTGGGAAGCTAAGAAATTGCATAAACTTGAGTTTTAAAAAAATCTAAGCAAACATATTTCATCATTCGAAAATGATTGAAACAAGAATTGATTGATGAAAGGAACTACTTTCCTCAGGTTCAGCCATATGTACCCAAATGACAAACTTAGCACTTTTGCAAAGTCATGTTATTGTACTCTTCTTAAGAAAACTAACAGAGACAAGAGCCCTTTTAAGTGACAATACATTAAATGAAGGGACCAACACTAACTTGGTTGGTGCATTCCATCATTAAACGATCATATCTTTCACCTAACTCGGAAAAGATTGCTAGAATTTAAGATAATTAAAGCAAAAGGAACAGAGAAACCACCTGTATCAATGAAACCAATGATTACATTAATTTGGTTCTTGGTAGAAAAACCTGGAATTTCCATTGTTTCATCATCACTGAGCCCCATAAAATCCCATGAATGAGTTGTGTGTAGGCTCCTCTTAGTATTTGGAAACACGGATACCACTCCAGGCATTTCTGTTTTATTTTAACATTAAAGACAAATTTCTCAGTACTTATTCATATCATTACCTTAAAGAAAAACATTTGGCATGGGCTTACTGGATATTTCAGAAGCCTGTGCCTCAGTCAACTTGGCTGCAAAGCCTTTAAAACCATGCCTATAACTATATACATGTGAAGTCTTGGCTTGTTCAATGCTGTAAAGTTACAACTTCAGTTTTTTTGCTAAAAGCAACCAGTGTAAAACCCAATGAACCAGCTCAAAAAAGGGAAAAACCCTCATAGCTAAAGGTAACAAGAACTGACCTTCCTTTATGAATAGCAGTCAGCATTTGATGGTTTTGCCTCAAAATCTCATCTGGGTGTTCATCACTATCTTTGCTTCCCATGTACACCACATATAACTGGAAAAAAAAAAAAACCAAGAACACAACTTTACTAACTATTCATCATTTCAAAACATCATTCAAATATACACCCTCAATAACAGCACATAAAAACCCATATCAACATACAGACTACAGAGCCAAAGTTTATTTACCTTGGAAGAAAAGCAGAGGCTAATATCTCCAAGAAAAACACAAAGAAAGAGTAAAAGAAGAGTCTTTTTTAGAACACCCAT
SEQ 12
ATGGGAGCAAAAGCATTTCTTGTTGCTATGTTTCTCTCAGCACTGTTATTTCCTTTTGCCTCCTCATCCAATGATGGCTTGATGAGAATTGGCTTGAAAAAAATGAAATTTGATCAAAATAATCGGCTTGCTGCACGCATTGAGTCAAAGGAAGGGGATGTTTTGAGGGGGTCGATTAGGAAGTATAACTTCCGTGGTAAACTGGGGGACTTTGAGGATACAGACATTGTAGCATTGAAGAACTATATGGATGCTCAATACTTTGGGGAGATTGGTGTAGGCACTCCACCTCAGAAGTTCACTGTAATCTTTGACACAGGTAGCTCGAATTTGTGGGTGCCGTCGTCGAAGTGCTATTTCTCTGTAAGCTTCTATACATGCAAATGATACAAAGGATAGCATTGAACATCCATCTTGAGTGATGTAAAATTTGATGACTGCCTATCTTGGTGTAGGTTCCCTGTTTCTTTCATTCCAAGTACAAATCAAGTGAATCAAGTACTTATAAGAAGAATGGTATGTTATGTTTCCATTTTTGTATATTGCTTCTCTCTACCATCTGGTTGTTTATTGCTGCATGAAACATATATATGCTTCCTTCTAGTGCCGGTTATTGTTTAGAATATGTGCCATCTTGTTCATTTTAGAACACTTTTTGTATTGTCCTTATGTGTTTCTCACGGTGCATCAAGGGATTACATTGGAAAAGTTAAATGATGAGTACATGTAGTTGACTGTTGAGACATAAAAAGAGGCTGTTTATGTTATGTTTCTTAGTATATTACTGTAACTAGTGAGGTTCCAGAAAAGAAACACCAATATCCTTATCTCTCTCTGCGATTAGTGCTTTTTGGTTGCGAGTTGTATAGTTTTAACCTCTGCAATGCCTCTTTAGGTGGCCTTTCTTCTTCCCTATTAACTAGGTTTTGTATCCATGTCTTGGCATGTCTGTGACATAAGAAATTTTCCGCAGAAAACTAGTTATTCTTGATTTTTTTGTTCTATTACAATTATATATGTTGCTCAAAGAAAATGAATTGAACAGTCTGTAGCAATTGAATGGGTTTGCCAGATGAGTTCACTCCAGTGTTATTGAAGGCCATTGCTTGGTCGTTTAAAGTTATGAAGCGATACAGCAAGGGCTATCGCTTCATCGCTGAAGCCATGCACTTTAGAGAAGTTTGCACTTCAAATAATGGTGCACAAAGTGATTCCAACGAGAATTTATTGTTTCATTGAACATCAACTTTTAGAAGTTGGATTAATGTTGGCATTTTCACAAAGTGATTCCTGTTGGGTATAAAAATAATATTCACGGTATTAGTGATAAACGCGGAACACTAAGTTATGCTTAAATCAGTAAGAATAAAAATGCAGCAAAAATGACACCAAGATTTTACCTAGAAACCCTTCTGAATAAGGGAAAAACCACGGCCAAGAAGAGCAACTGATATCACTATAGCGAGGATTTTACACTGTGTAGTAACGAGTACGAATACTCCTAAGACCACTACACCCTCAAAAGAAATAAACACTCTTTTGCTTTTTCACCTCACTACAATATCTCTCACACTCTATTTTCTTTACAAACTATTTTCTTATAGTTTATGGAATACCTTGCTCTCTCTTTTTTCTCTCTTTGTTGGTGTGTAGAAATGAGAGTTAAAGCTCTCCTTTTATAGCCAAAACCTCACTCTCTAAAGCCTACAATATTTGACAATTTGCACTCCCTTTTCACAATTTCAACAAGGTTGGCTACCAAACCAAACCAAATCAATAAAATTGTCTACCAAATCATACCAATTCAACAAGGTTGGCTACCAAACCAAACCAAGTCAACAAAATTGGCTACCAAATCATTTGAATGAGATGGAAACCATATCAATCTCCCCCTCCAGTCTCATTCATCTAGAGGAGGTAGCACTGTCTTCTAGTCTGAGTGCATGCCGACAAGTTCTTTGCATAGCTCGAACTTGTCTCTTGGTACCACCTTGGTCAACATATCAGCAGGATTTTCACTTGTGTGAATCTTTTTGACCTGAAGTGATTCGTTCTCCACTTGCTCACGAATCCAATGATATCTGACGTCGATGTGTTTTGTTCTCGCATGGTACATGGAGTTTTTGCTTAGGTCTATTGCACTCTGACTGTCGCAATAGACAACATACTCCATCTGTTGCAATCCAAGTTCTTGAAGAAATCTCTTGAGCCATATCATCTCTTTGCCAGCTTCAGTAGCCGCAATATACTCTGCTTCAGTTGTAGATAGTGCGCACTTCTGCAACTTCGACTGCTATGATATAGCTCCCCTTGAAAAAGTAAACAAATATCCGGTAGTGGATTTTCTGTTATCAAGGTCACCTGCCATATCAAAATCTGTATAACCTTTCAAAATTGGATTTGATCCTCCAAAACATAAGCATTCACGAGAGCTTCCTCTTAGATACCTGAGTATCCACTTTACAACTTCCCAATGCTCTTTTCCTGGATTTTCGAGAAATTTGCTAATAACACCGACTACATGAGCAATATCTGGTCGATTGCATACCATTGCATACATCAAACTTCCGACAGCAGAAGAATAAGGAATCTTGGCCATTCTCTCTTTTTCCTCCGTTGTTGTAGGACACATCTTCTTACTCAACTTCAAATGACCAGGAAGGGGTGTGCGAACCAACTTAGCACTTTTCATATTGAAGCGCTCCACTACACGTTCTATGTACTTCTCCGGTGACAAGTAAAGCTTTCTTTCGTTTCTCGAACGAGTAATTCTCATGCCCAAAATCTGCTTAGCATGACCCAAGTCTTTTATTGCAAAAGACTTATTCAACTGTTTCTTCAACTCGTCAATCTTGGATGCATTCCTGCCCACAATCAACATACCATCCACATATAGCAAGAGGATGATAAAATCATCATCAGAAAATCTTTGTACAAATACACAGTAATCTGAAGAAGTCTTCTTGTAGCCTTGCTCCCCCATAACAGACTCAAACTTCTTGTACCACTGTCTGGGAGCTTGCTTCAATCCATATAGACTCTTCTTAAGTTTGCATACAAGATTTTCTTTACCTTTTGCATTGAAGCCTTCAGGTTGTTCCATATAAATCTCCTCTTCTAAGTCACCGTGAAGAAAAACAGTCTTCACATCCATCTGCTCAATCTCCAAATCAAGATTGGCAGTTAAACCAAGAACTGTCGGAATGGAGGACATTTTCACGATAGGAGAAAATATTTCGTCAAAGTCAATACCTTTCCTTTGACCAAATCCCTTGACAACCAATCTAGCTTTGTATCTGGGCTTCAAACTATGTTCTTCAGCTTTAACTTTGAACACCCACTTGTTCTTCAAAGCTCTCATGCCCTTAGGCAATTTCACCAACTCATAAGTATGGTTCTCATGCAGAGATTTCATCTCATCTTGCATGGCTTCAATCCATTGATCCTTGTGCTCATCTTCTATGGCCTCCGCATAACATTCAGGTTCTCCCCCATCAGTGAGTAATACATATTCATTGGGTGAATAACGGGAGGAAGGAGTACGAGGTCTAGAAGACCTCCTGAGTGGAATATCTAACTCGTCCACAGCTTCGTGAGTAGGAGCATCTACCTCATCAACATTAGCATTGTTATCACCATCACCATCAATATGCTGATCCAGAATATGGTTCTGGGCATCACCATCATCATTGAGCCCACCAACGTCATCCACATTTGTATGAGGAACTTGATCAAGATTAACTAAACCTTCAGAACTTGAAGATTTTAGTTTCTCCGCTTTGTTAATATCTTCAATGGTTTGATCCTCCACGAAGATAACATCACGGCTTCTCACGACCTTCTTCTCAATTGGATCATATAACTTGTAACCAAACTCATCAAGGTCATAACCAATGAAGATGCATTGCCTTGTCTTGGCAGTTAATTTTGACCTCTCATCTTTAGGCACATGTACAAAAGCTTTGCAACCAAACACTTTCAAGTGGTCATAGGAAATATCCTTGCCATACCAAACTTTGTTTGGAACATCACTTTGCAAAGCAACCACAGGGGAAAGATTAATAACATGTGCGGCGGTCAACAAAGCCTCACCCTAAGAGGAATTCGGCAACTTTGCTTCAGAAAACAAACATCTGACTCTTTCCATCAAGGTCCTATTCATCTTTTCTGCTAAACTATTAAGCTGAGGAGTCTTAGGAGGAGTCTTCTGGTGTCTGATACCCTGTTGTTTGCAGTATTCGTCAAACAGTCCACAATATTCACCACCGTTATCAGTACGAATACACTTCAGCTTCTTTCCAGTTTCTCTTTTAGCTGAAGCCTAGAACTGCTTAAAGACACCCAACACTTGGTCTTTAGTCTTCAAGATGTAGACCCAAAGTTTCCTTGAGCAATCATCAATAAAGGTAGCAAAATAAAGTGCACCACCCAAAGTCCTTGTCTTCATTGGACCACATACGTCTGAATGCACCAACTCAAGCAACTCGTCTTTCTTGAAAGAAGATGAGACTGGAAAGAAACTCTTTTTTGTTTTCCAGCCAAGAAGTGCTCACATTTTTCTAATTTTGCACTTTCAAAATTTGACAACAATTTCTTCTTGGCTAGAACATTTAGTCCTTTCTCGCTAATGTGGCTAAACCTCTTATGCCATAACGTTGAAGAGTTATGGCTCTCAACGGCATTCACCATATCAACACAGGTAGAGGTCGTAGTCCAATATAGACCACGACGCTTTTCCCCACGAGCCATAATCATGGAGCCCTTAGTGAGCTTCCACTTTCTAGCACCATTGGTACTGACATATCCCTCATCATCCAAAACACCAACAGAGATCAAGTGCAAACGAACATCAGGTGCGTGCTTTACATTGTTTAAAACTAGTTTAGTTCCAATACTAGTTTCCAAACAAATCATTCCAACACCAGTCACCCTAGATAAGTTCCAAAGTCACCCTGAGTATAGGATGAGAAAATCCTTCCTTGATGTCACATGAGATGCGGCACCACTATCCACAACCCAGCTTGACTCATCACAAGCAATATTTATCAAATCCGCATCAAGGACAATAACAAGATCTTCTGTAGTGACGGTGGCCATACGATTGCCATCTTCTTTCTGTTCTTCCTTGTCTCTATTCTCCTTTTTCAAAATCCGCAGAACTTCTTTGTGTGCCCTTTCTTCCCGCAATGATAACACTTAATATCTTTAAGTCTGCTTCTGGATTTGCTTCTATTATGTTCTCTATTTTGAGAACCCCGATTCTTGCTTCTCCCCCTAGAGTCAGTCACCAAGACATCTGATGGGGAGGAACCTTGAGATTTTCTTCTCATCTCTTCATTTAAAAGACTGCTTTTGGCAAGATCCATAGAGATCACACCATCCGGAGCAGAATTTGATAATGAAGTTCTAAGAATTTCCCAAGAACTTGGTAGGGAACCAAGTAGAAACAGGCCTTGAATTTCTTCATCAAATTTAATGCTCATAGCAGATAACTGGTTCATGATCCCCTGAAAATTATTCAGATGATCTGTCATCGCAGAACCATCATGGTATTTTAAACCCAACATCTGCTTTATCAGAAACATCTTGTTGTTTCCAGTTTTCCGAGCATACAAACTTTCAAGGTGCTCCCATAGGGTCCGAGCATGTGTCTCCCCAGAAATATGGTTCAAAACATTATCGTCAACTCACTGTCTAATAAAGCCGCAAACCTGCCTGTGTAACAGATTCCACTCTTCATCTGATTTATTATCAGGCTTTACAGTGGCGAAGACAGGTTGATGAAAATTCTTGACATAGAGCAAATCTTCCATTTTGCCCTTCCAAATGGCATAATTTGTGCCATTCAAAGTAACCATTCTACTAGTGTTGGCTTTCATCGTTTATCACAAATACAAATACTATTTATTATGAGACCAAAGTAATTCTTTTCTGATGTGGAAGTTCAGACTGTGCTGCAACCACAGAGCATACTCANNNNNTATTTATTATGAGACCAAAGTAATTCTTTTCTGATGTGGAAGTTCAGACTGTGCTGCAACCACAAAGCATACTCAAACAGAACCTTGGCTCTGATACCACTTGTTGGGAATAAACCCCGTAAAAATAATATTCACGATATTAGTGATAAACGCGGAACACTAAGTTATGGTTAAATCAGTAAGAATAAAAATGCAGCAAAAATGACACCAAGATTTTACGTGGAAACCCTTCTGAATAAGGGAAAAACTACGGCCAAGAAGAGCAACTGATATCACTATAGCAAGGATTTTACACTGTGTAGTAACGAGTACGAATACTCCTAAGACCACTACACTCTCAAAAGAAATAAACACTCTTTTGCTTTTTCACCTCACTACAATATCTCTCACACTCTATTTTTCTTCACAAACTATTTTCTTATAGTTTATGGAATATCTTGCTCTTTTTTCTCTCTTTGTTGGTGTGTAGAAATGAGAGTTAAAGCTCTCCTTTTATAGCCAAAACCTCACTCTCTAAAGCCTACAATATTTGACAATTTGCACATCCTTTTCACAATTTCAACAAGGTTGGCTACCAAACCAAACCAAGTCAATAAAATTGGCTACCAAATCATACCAATTAGTAATGGTGCACAAAGTGATTCCAACGAGAATTTATTGTTTCATTGAACCTCAACTTTTAGAAGTTGGATTAATGTTGGCATTTTCACAAAGTGATTCCAACTAGAATTGGTTGCTTCTTTGAACCTCAATTTTGAGGAGTTGCATTAATACGGGGATTATTGTATATTGGATGCTGAAATTAGTTATTTCAACTGCAATTTGATTTTTATTGTAGAGTAAATTAATAAATGTTTGATATTTTCTTGTTTTCTGTTAATTGTGCGCCTCACTTCTCGCTATTCACTGCAAGTCTGTGGACCTTGTTTTATTTTGTTGCACGTTTTGGTTTTAAGAACACTAGGTCACTCCTACCTAGGGGTGTCAATGGATATTAGAAAACCGACTTAACCGACCGAACCGTACCGTACCGAACCGATTTTTAGGTTTCTTTTAAAGAAACCGTAGGTTTTTATATAAATCTATAATCGTACCGATAATTAGGGTAGGTTTTTTATTTTATAAAAATAAACCGAAAAAATACCGAACCGTACCGAATAAGTTTTACATATGAAAAATATATTCATATAGTAAGTTTAAAACTAGTAAAGTATTAAATTTTTCATTGGGTCTTGGAATTATGAAAACTGTTACAAGCCAATAAGTAATTAAACTCAAAATACTAATTCCTAAAACNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTTTTATATAATTTAGATTTATCTTTTTAAATATTTAATATAGACTTTATTCTTGAGTCCCAACTTGGTTAATATCTTTCCACTCGTGTGATTTATATCTTCTTTGCTTTTACTTGGTTTCTTTTCGTTGGTGTCGAATAGTTGTGTATTTATACTCTAGCCATCTTTCATGTTTTTTAATTCATTATCCTTTAAACAGTAAAAATGTCTAGAGAGTTTCGCTAAGTCCTATAAAAGAACGTACGTTATTGCATTCTATTTTTACTGGTGAATTTTATATGACATTTAAAAAATACCGAAAATTAACCGAACCGTACCGATACCGAAGTGAAACCGACATGATTGGGACGGTTTCGAAAAGTCTAGTTTTGGTTATACATAATAAAATAACCGAAAAATTGGTATGGTATAAATTTTATAAAATAAGCCGAACCGAACCATTGACACCCCTACTCCTACCTTAATGAACAACATTTATAACTTATCGAATGTTTCCACAAGACAGAGATCGTATTTGGCATAGTAATTTTAATGTGATTTCATTCGTGATAAATTTTTGATTTTCCATATAGGGAAGTCTGCTGCAATTCAGTATGGTAGTGGAGCTATTTCTGGATTCTTCAGTCAAGATAACGTCAAAGTTGGTGACCTTGTTGTAACAGATCAGGTGAGTGAGGCTCCTCACTTGTTTTAAGTGTCTTGAAGGTAGAAATCTTACATCTCAAGGTGTTCATGTACTAGGAATTTATTGAGGCAACCAGAGAACCCAGCGTGACATTTTTGGTAGCCAAGTTTGACGGTATATTGGGTCTTGGATTCCAGGAGATTTCAGTTGGAAATGCTGTTCCAGTGTGGTACGTGGACAGCATTTAGTTTGCTCTCTTTCTTTCCCACAAACCAAATTAAAGATCTAGACAATTCTTTTTTCCAGATGAGGTTTAACACATTGAAGGATTAGATTTCCATAAATGCAGGATAGGCTGGATGTCTCTTTCTGTTTAAATTTGATTTGGCATTCATCTGGGCAGCATCTCCTGTCTTTTCCAGGTACAACATGGTCAAACAGGGTCTTATCAAGGATCCTGTCTTCTCGTTTTGGCTCAACCGAAATACAGAGGAAGAACAAGGCGGAGAAATCGTATTTGGTGGGGTTGATCCTAATCACTATAAGGGAGAAATAACTTATGTTCCAGTCACACAGAAAGGTTATTGGCAGGTTTGTCATTCCGTCAATTCGTTACTATGTATGTTCATGTTTTGTACAAATGCTATCTTAATCTTAGTAATATGATTGGCAGTTTGATATGGGTGATGTTCTTATCGATGGTAAAGCTACTGGTATGTTTTGCTCTGTACCTTTTGAATTGGATTGCTGAATTTTGCGAATATAGATGAGGGCTATGTGCCTGGATATAGTCTTCCTTTGAGTTTTTAACATGAACATATCGGGTGTAGGTTACTGTGAAAGTGGGTGCTCTGCAATAGCGGATTCAGGGACTTCTCTCTTGGCTGGTCCAACGGTATTATCTATAACCTTGATTTTGGACATATCGCTTTTTTGTTATGTTTTTGTTATTTGTTTCCACCATCAAAGTCAACCGCAATTCGATTATTGATTTCCTTGTTCCTTTGAACCTTCAACAGGTTGCTCAGCAAGAAAAGTAACATTTTCCTAAGATTCATGAGTTAAATAGAACGAATCCAGTGTCTGATCTGTGTTGTGTTTTAATTTTTCATCACAAAGAGGATGTTAGTGAAATTGGTTTTTCATTTCTGAGGTTACCTTTGGATATACCATCTTAGCTTATCACTGGGTCGATGTCTGTAAATCTGCTTTCTCTTTCCTGTCTTTCAGCTTAATATGTGTAAACCTGCCTCAGTAGGTACTTGTTGTTTTGTGTTACTTGCTTTTATTACGCTTCAATGAGGATGATGCTTTTACTTCTGCTTTAGTCCTAGTGTTCTGTCGACTTATTTATTTACCCCTCTTCTATTTGAAAGAAAATTATCCAACCCGAAAACTGTAGCCAGGTTTTATCCTGTTGACTTGGAAATGCTGACAATTAAATGAAATAAAAATCTGCTGCCTTTTGTCTCTACAAATTCAGGGGCGAACGTTTACCCAAACGGTGTCATCCGACACCGCTTGGTCGAAATTTTTTACTGTATAGACATATATATGTGGGAAAAAACAGTACGCAATATAAATTATAAATGACCATTTATGCGTGTAGCTGTCTCTGGTATAATGGTCAAGTGCTGTTTTTTCCCTACTTGACTTGATGTCGTGGGATTGACCTCAACGGATGGCATTTTTTTATTTCAAATTTTTTAGCATGGTCCTTTTAAAAACTAGAGTTTATATAGAGTTGAACCTTCAATATCATCTTAAAAAAACAACTAAATCTGGGGGACATGACGGCTTCCACGTTATAAGTGATAAAAAATTTATTTGAATGCATAAAGGAAGCTGTTTAAGCATACATAATATAATTTAGAATAATAATTTTTAAAAAATTATTCGACACCGCTTACTAAAAGTTCTGCGTACGCCCTTGTCTACAATTGTACGATTAAAACACTTCCAAGGTTGAGCAATCTTAAGCCCACCCTTTTCCGTACAGGCCATAATCACTATGATTAATCAAGCCATTGGAGCCTCTGGAGTTGCTAGCCAACAATGCAAATCTGTAGTGGAGCAGTACGGGCAGACGATCATGGATTTGCTGTTAGCAGAGGTGAGCATTCAACTGTGTAACAGCATTTTCATTTGTTTGATATGCAGTTTCCTGTGCAGTTACAGCAAAATGACACTTGGCAATCTAGTTGCCTTTCCCGGTTATCTAATTGGTCCGCATTTAAACAGGCACATCCAAAGAAGATTTGTTCACAGGTTGGAGTATGCACCTTTGATGGAAACCGCGGAGTTAGGTTAGTCTTCAGGCTCTCTCTGCCCCTTCACGTGAACATATTGTGCATTTTGTTAATCCATAATGTTGATATAAATACTGAATAATTCTGTGGCCTTTTTTCTGCAGTATGGGAATTGACAGTGTTGTAGACGAGAAAGCTGGCAGATCCACAGGACTGCAGGATGGTATGTGCTCTGCTTGTGAAATGGCGGTCATATGGATGGCGAATCAACTGAGACAAAACCAGACTCAAGATCGCATATTAAACTATGTGAATGAGGTAAAACATCTGTCACTGCAATTTTCTCCTTTTCTTTGAAAAGAATGCTGACTGACTGACAGCTTTGCGAGCGTCTCCCAAGCCCATTGGGGGAATCAGCTGTTGACTGTGGAAAGCTTTCTTCAATGCCTAAAGTCTCTTTCACAATTGGTGGCAAAGTGTTTGATCTCTCCCCAAATGAGGTATGCTTTATAATGGTGTTTGCGAGTAATAAACTTTGCTATGCCTCTTGTAATTCAACATTGTCAGAGAGAAAAATTAAGTGGCTCATCATAGAGTCTTTTTGTGGCATTACAACTAATTGTCATATTACAAAACCGATTCTAGTGTTTAAGGTTTTCAGTTTCTACAGAGAAGTAGATGTTTGTTGTCATCAAGAAGTACTTCTATATTTTGAGGTTGTGAGTATTTCGATTTTACATCAATTTAGATGTCTCATTTTCCAACTAAGTAATCTAAGAGAACCTCCGTAAGATCTTTGTGCCGTGAAAAAGTTACTCTTTGCTATTTGTATGGAAGTGTATTACATGCTGACCTTGCTAATGCAAGCCGCTTGAGAAGGCCCAATGAATCGAGATTTAGATGTCGTCCCCCCCACCCCCCGGGCCCCAAAAATAATAATAATTTATATCTGGATGATTCAGCATGGAACAATTCTCTACCTTCATAGACAGGGGTAAGGTCTGCGTACACACTACCCTCCCACACCCCACTTGTGGATTCCACTGGGTTGTTGTTGTTGTTGTTGATTCAGCATAGAACAAATCTTACAAAGAAGAAAATTCATCAAGTGTACACCAAATCAATTTGATCATTCACGTTTTTTCTAAATTTGTTTGTTTTGTGGTTTAGTAAACATAAAGAGATTGAGCTTTGAATAATTCAGGATAGATCACATCTTATAAAGGAAAAAATTCAAGCGAGACACCAAATCACTTGACCATACGCAAAACTTTTCTTTACCAGACCATCACTCTAGTCTTTTCCTGCCCTTTCAACAAATTTTATTCAAGAATGAGAGTAAAGTTGTGATCCGATATGTGGTTCTGTTTTACTTGTTTCAGTTGTTTGTTTAACTTGCTGACCTAGATTGATCTATATTCTAATTTTGGATTCTTTTCCCGTGATCAGTACATACTCAAGGTGGGCGAGGGTGCTAAGGCACAATGTATTAGTGGTTTCACTGGCTTGGACATTCCTCCTCCCCGCGGACCTCTCTGGTAAAGTTATCTTTATATGTTCTCTCTATTTTAGTTTATAATTATGTTTTTTTGAGATAACTATCAGCAATGTAGTTAATCCAACAATAAGCTTAATCCGAATTATAGAGCTACGGCACAATTAAACCCGAGCCAGCAATTCTTTTGTTCACTGAGTTCTCTATGGTATTCCTTTGGTTACAGGATCTTGGGTGATATTTTCATGGGTCGATATCACACAGTTTTCGATTATGGCAAACTCAGAGTTGGATTTGCTGAAGCAGCT
SEQ 13
ATGACTTTTTTCAGGTCGTTCTTATTCTTTCTTCTCACCTTATTTGTTATTTCATCTGCACTCGACATGTCCATCATTAGTTACGACGAACAGCACGGCCAGATGGGGACAACACATCATCGTACTGACGATGAAGTCAGAGAATTGTACGAATCGTGGCTTGTTAAGCACGGAAAGAATTACAATGCCATCGGAGAGAAAGAGAGAAGATTTGAGATTTTTAACGATAATTTAAGATTCATCGACGAGCACAACGCTGAGAACCGCTCATATAAACTTGGGTTGAATCGATTCTCTGATCTTACCAACGAGGAATACCGTGCCATGTTCGTAGGTGGACGGTTGGATAGAAAGACGAGGTTGATGAAGAGCCCTAAAAGTAACCGTTACGCTTTTCAGGCCGGCGAAAAGTTGCCGGAATCCGTTGATTGGAGAGAGAAAGGCGCCGTTGCCCCTGTTAAAGATCAAGGCCAATGCGGTGAGTTTTTTTCTTCTTCAAAACTTTCCTACTATAAAGGAAAGCTCTGCTCTTTATCGTAAACATGTACTTTTGTTTTGTCTGCTTACGGAGTGAGACCAAGAGGAAGAGTTTGGATAGATTGTTGAAAGGAGTCATATGTAGGTCAAAAGTTTTTGATTTTTAGGTTGTTTTTTGACCTATGTTGTCGTCTTATACGGTCAATGATCTGTTATTGGGTAACTAATGATTCTGTTTTCATGTTTATTTCAGTCAACAAATTGGAGAATAAATTAATTGCTGCTCTGTCTGGTAGTTAATCTTCATGATATACACCTAAAGCTTACATCCTGATTTAGTATTTGGTGTCTCCAATTGGAATGTTTATTTGCTTTGCTAGTGTTTCCTCTCTCTCTCTCTAGGGTAAATATAAAAAGATCTAAAATTTAGAGGTACCTGGTGTATATCTTAATATATTCCATGTACAAACTTTAAAAAATTATTTAAGCTTCCCCTAATTTGTTTAATACGCTGATAAGGGGTAATCAAAAAGCATAAAGATTAGATTGAACGGACACAGTATATATTTTGCTTTTGCAAGTTGATCAGTTTCTTTCTCCATTCTAAATCGGAATCGACCAGAATTTAAAGCGGTATAACTTAAGATTAAGCCATGAAGACATATTTGGCTATTCTAGGTGTTATAAATTTTAACCCAAGTGTCCTAGGGAATTGATGTTTAATCTTGCTTTGATTATGACGAAACCCATATCTCGATTGGTTAGATATCAGTATATCTATGTTATGTATAGAATCCTCGTTTGAAATTTGAGATTTTCTTATGAAGGGAGTTGTTGGGCATTCTCAACGGTTGGCGCTGTTGAAGGAATAAATAAAATTGTAACGGGTGAATTAATTAGTCTGTCAGAGCAAGAGCTTGTTGATTGTGATAGGAGTTATAACCAGGGATGTAATGGCGGTCTCATGGATTACGCCTTTGATTTCATCAAAAATAACGGTGGCATTGACACTGAAGATGACTACCCTTACCATGCTCAAGATGGCACTTGTGATCCATACAGGGTAAGTAATTAACCATACTATCAAGAAAACATCCAAATATTAATTATGTACTATTTCAGAATGTAAGTCTATATAGCAAGTAATTAATAGTATTTGCTGACAAAATTTGGTCATTCAGAAAAATGCCCGTGTTGTCTCCATTGAAGGGTATGAAGATGTTCCAGAAAACGATGAGAAGTCGTTGATGAAGGCAGTGGCAAATCAACCAGTTAGTGTTGCTATTGAAGGTGGTGGCAGAGCTTTCCAGCACTACTCTTCGGTATGGTGGGCGGATCTTGACTAATATATCCTTCTGAATATATATGTTATTTGTGTCTGAACTCACTGGCCCTAAATTCTGGATTCGTTATTGCATTTTAGTATGCCTGTGTCCCTAATCTGCAAACACGGCTGCATTGTGCCTTGTTTTACTACTTAAAGCTAGTATACTCATTTACCCTTCCAATTTTTATCAAATCATGCAGGGTGTTTTCACTGGATATTGTGGAACGCAACTAGACCATGGTGTAGTTGTAGTTGGCTATGGAACAGAAAATGGCGAAGATTACTGGATTGTGAGGAATTCATGGGGTGCTAACTGGGGAGAAAGTGGTTACATCAAGCTTCAGCGCAATTTCGCTAATTCTACAACTGGAAAGTGTGGAATTGCAATGCAGGCATCTTATCCTCTTAAGTCTGGCGCAAATCCTCCTAATCCTGGTCCATCTCCTCCTACTCCTGTAACACCATCAACTGTTTGCGATGAGTACTATAGCTGCCCACAGGGCACTACTTGCTGCTGCATTTATCAATATGGCGAATACTGTTTTGGCTGGGGATGCTGTCCTTATGAGTCTGCTACCTGTTGTGATGATAACTACAGCTGCTGTCCCCATGATTATCCTGTATGTGATGTTGATGCTGGCACTTGCCTTATGGTAAATATTTTTTCCCTCCCATTCTGCTTTTTTCTCCTTTATAATAATGATCGTCAATTTCACTTATTACGTGTAATATTCTACCAGCACAGGATTAATTAGATAACTCTGTCTACCAAAACTTTGGCAGATATTTAAACCTTCGTCTTCACTCGTTTATTGACCGCTAGACCCACGTACAGATTCAACCTTTTATAGGTTTAATCATCAATGCAAGACTACTTATCACAATCTTTTTTCTTTTTATGTGACAGAGCAAGGACAATCCATTAAAAGTAAAAGCATTGAAGAGAGGTCCAGCTAGAGTAAACTGGTCAGGGATGAAATCTAACAGGAAAGTGAGTTACGTT
SEQ 14
TCATGAAGAAACAATGATCAAATAATAGCTAAAAAGGGAAAACAGAGCCATCATAAGTTGGCAAATGTAGGAATTGAAATGTGCTGGTGCATGGTTTATTGCAGGTCTAGATGACGGAACAGATGGAAACGAAGTAGCGGGTTCATTTCCACTTCCATTTCCCTTGGTGGCCTCTGGCACCACACTGGAGGGCGACGGCGCTTCAGTAGAATTACGTTTGTTCACTGGCAGAGTTGTCGATTTGTCGTTGGATTCTCTAGAATCATAACCTGACAAGAATGGTTTCTAGTTAAAATATGGACAGGTGTGCACACTAAAAAGGTCATACTCATGAATGCAAACTCACAATCGGATGGTTTCCAACCCAAAACCATCTTCTCTCGATCAAAAACCACGCGATAGCCTGTCATAAAATTTTCTGCAAAAAGATCCATGTATTAGTTTTCTGTCATAAATTCCAAATACAGAGACAAAATCGAAGTAAATCAACAAATAGCTCAGATTTTTGACTATGTAACCAGTTTTACTAGTTGGTTATGGACTTGATTCTTACTAATTTACTTTGTCACATTCACATTTCCAATAATATATAAGCTAGAAGTATGTAAAAAACTTGATAGGAACCAAAACTTCTAAAAGTTGGTAATTGTGAGATCATTAGCTGGCATGGTGCAAGTTATATTGCAAAATTCCATGGATAGAAACAAAATCTACGAGCAAGCAACTGATAGATACTTACGTCCAATGATGTTGACATCCCCACTTTTCACAACAGCTAAGCAAAATGCGCGAGAACCATCCTGATGATCATGACTGGGTATTAGAATATTCTAAAAGAGAACTTCTGTAATAAAGAAGGAGCAGAAACCATCTTACCTGGAGCGAGAGCATAATTATCGGATCGAAAAGAAAAAACTGGTTGCCGCCTTTCATTGTCAAATTTAAATCAGGAACTTCGAATGTAGTTTGATTTGCACTGCGTATGTAGCATGTTAATCCTGAGAGTTCAAAGGACATCAAGAAAGTAAAAGTGATGAAGATTATAAAAGATGGTTCACCTTAGCCCGTAGCAGTATTCAAAAGGAATTTCGCCATCAGGTTGAATACGTAGCTGTTTTGCTTGAGAATCAAACTGAAAAAAGAAACAAAATATCTTCAGTTTCACAATACAAAGAACAAACTCCCAACTAACAAATCATACAGTCAGCCTGTCACTCACGTTCTCTGTAATGACTTTGTAAGCTGGGTCGTTCAAGTATGTGAATGAGGTGCCAGAGTCAAAAATGGCTGTGAAATCAACATCAGTGATCTTGTTTCCCACTGTTATTCCTGTCAAGCTGATGTTATAGGTTGGGCTGCAATTCAAGGAAGGATAAGAGTGAATACATATTTGGTCGAATTCTCCTGAATCAAGCCAGGAACAGAGGCAAACGCTAAACCTAGAATATCAAATTGACTGCTTACTGTAGTTGATCAAGATTGAGTGGTGTTTCTCCTTGGTCTGGACTCCCTTTATCTCCAAACACTATTCTTCCAATACCATCAGGGCCAAAGCACATGGAGAAAGAATTTGCAGCAAGACCTTTACTTGCTAACATGCTCGGAACAGATATACTTTCCAAGCCAAGTCCAAATAGACCATTAGGAGCAGCGCCACTTAAAAATGCACCGGTTTGTCTTATCCCACACCTGAAATTGAACAAGTAGAAAATTAACATCATGGTGGATAAAGATTGCATCCAAATTACACTGCATTTCTCTCAAACCATACCCACCCTAGAGCAATTGGAGCCTCAACACTTTTTTGTTGAGCATTATCTGTCTCTAAGTGCAAGATGTCTTCCACCAGTACCCCTGATGATGAGGTATTATTGGAGAGATATGCAACTCCATAAGCACATGCGTTTTGTGAAGATAAGCATCGCCTCCTTTGTCCACACAGAGTGCCGTTGCAAGGAACAATCTGACCCGTTGACGACGTATTAGGGCTGTAAATATTGAGATTTATTCGCTGTTTCAAAGCAAACAAAGGATTATAAGTGTCAAATAATACATAAAGAAACATTAAACGGAAAAGGCAAAAGAGCAAAGAAAATTTTGTTCCTGAAGGTGAAATCCTGTGGAGTATGAAGTAATTGAGCTAGTAAGAATCTCAAAATTCTTCAATGACAAACAAGCCACGGAACAATATGGATCAGATATTTCATTTCAGAGTCAGTATATTACATGCATCAAACTCACTTTGCAAATATTGATTATGTTGAATTTTGCGCTAGTTATGTATCTTCCTTAACAGAAAAACAATTTTCTCAATACATTCTCCACCCCATATCTTGTTAACTAAGAGAATATATTATTGTCATAATGACGGAAAAAGACAGTTGAAGAGAAATCCACTGTGTTACAGATTCAGCCATTGTGTTATCAGCAGGTTCCTTTCTGAAAAAGGTTATCAGACGAGGATGATCATCTTCCTTGTAGCAGAAGGCAACATACAGACGAGGATGATCATCGAAAAAGAAAAACTCTTTAAACATTTGAAAGTAGAAAGAAAAAGGTACTAGAATAAAGCAAACATACTCGTCCAGAGCGTGTCTCGAGGGCGCGCACACAATTGCTGCAATCACAGGGTAGCCAAAACAAGTCACTGCCAGTGTCAAGTGCCACCAGAAATGATAGCCCAGGAGTGCCCACTGTCACATTTGCATAATGCAAACTGCAAATTGGCAAAGAGTATTAGTCACACACCTTAAGAAGAAAAAATCACAACTACAGATACTACATATTTTGCATTCAACTCTATCTTTATAACATATAAATTACAACATGCTACTGCAAGGAATTTTCAGAACAATTCCTTGTCTAGAGAGGAGATAAGTGGCAGCAGAGGAACGGAAAATCAGAAAAAAAAAAATGGAATTTATTTCCGGAGCATGGAATTCGAACTAGAAGAAGACTATAATTAAATTTAGAGTCAGTACTTTTAATATAGGAGTGAAATCGCCAAAATTCCAGTCCGATGAAACACACAAACAGAATTAAAGAACAGAAACAGGCCTAAATCTTTCTTTTTTTTGTTTTATCATATTTTCCTCCACATGAATCTCGTAAGAACTATTAATGGTACATGGAATTTATTTAAGTTAGGTAACCTATTTTTCCTGAACTGACACATCCAACTAAACAGACAAAAACAAACGCAAAGCTCAGTCAACTCTAACATCACACTAAACGGACAAAAGTAAAAGACAGTAAACAAGAATTTCCAAAAACGTACTAGTAGTTCACAATCAGGAATAACAACAAAAATATTTTAAAAAAAAAATAGAGCAGCAAATAAACAACTGCAAAAGCAATCAGAAAAGAAAAATAGAGTGAGCTTACAATCCCAAAGAACTGAGGCGGAAAGTTTCATTTCCTCCGGAAAAAGAGAGAGGAGTGGGATTAGTTGTATCAGCAAGGCGGCGACCTTTGATAAAGCGATCACGCTGAGTCCAAGCTGAATAATACTCAACACTTCCCTTCTCAGGCAATCCATGAAGGTCCAAAATACCCTTCACCGGATCCGAATACCGGTGATGGATATCAAACCCGAACGTCCCAAACCCATCGCTGCTCTGCAATTGCAATCCCAGAATCGCCAAGAAAATAATAGGGGCAAGGAAAAAATTAAAACTTGTATAAGAATTAGCCAT
SEQ 15
ATGGTGACAAAGTTTAGTATTTTTATTTTGGTGGTGTTGTTGAGGTTATTTTCATTTGGTTCTGTAGCCTCAAGGGAAATTCACAATTCTGGTCTTAATCTGAATTCTAGTGCTTCTGGTATTGAATTCCCTCAACATCCAAGTTTCAACTCAGTTACTGCTTCTGGAAATTCAGATTGCAGTTATGGAACATCCAAGAAATCAACAACCACCCATGTAATAACTCAAGAAGAAAATAGATCTGATGAAAAAGAAGATGAAGATTTAATGGTATCTAAAAACCAGCCAAGAGAAGCAGTCAAGTTTCACCTAAGGCACAGATCAGCTGGTCAAAATATAGAGGCCAAAGACTCAATATTTGAGTCCACAACAAGGGACTTAGGTAGAATTCAGACATTGCATACAAGGATTGTAGAGAAAAAGAATCAGAACTCTATTTCAAGGCAAACAAAAAATAGTGAAAAACCTACACAATCTTCTTCATTTGAATTCTCAGGCAAGCTCATGGCAACATTAGAGTCAGGTGTAAGTCATGGTTCAGGGGAGTATTTCATGGATGTTTTTGTCGGTACACCTCCTAAGCACTTCTCTTTGATTCTTGATACTGGTAGTGATCTTAATTGGATTCAGTCTGTTCCTTGTTATGATTGTTTTGAACAAAATGGTCCTCATTATGATCCTAAGGATTCTATCTCTTTCAAAAATATAAGCTGCCATGATCCTAGGTGTCACCTTGTTTCATCTCCTGACCCTCCACAGCCTTGCAAGTCTGAAAACCAGACTTGCCCTTATTACTATTGGTACGGAGACAGCTCGAACACGACTGGTGATTTCGCGCTTGAGACGTTTACGGTTAATCTCACAACCCCTAGTGGGGATTCAGAGATCAAGAAGGTGGAAAATGTGATGTTTGGTTGTGGACATTGGAATAGAGGCTTGTTTCATGGTGCTGCTGGTTTGTTAGGACTTGGTAGAGGACCGCTTTCGTTTTCGTCTCAGCTTCAATCTTTATATGGCCATTCTTTTTCGTATTGTTTGGTTAATAGGAACAGCAATTCTAGCGTAAGCAGCAAATTGATTTTTGGTGAAGATAAGGAACTCTTGAAACACGCGAATTTGAACTTCACTTCACTGGTTGGTGGGAAAGAAAATCATTTGGAAACATTCTACTATGTGCAGATAAAATCAGTTATAGCTGGAGGTGAAGTGCTGAATATACCTGAGGAGACATGGAATTTGTCTACAGAAGGTGTTGGTGGAACAATCATTGATTCAGGAACTACTTTGAGCTATTTTGCAGAACCAGCATATGAGATTATAAAACAGGCATTTGTTAACAAGGTGAAGCACTATCCTGTTTTAGAAGATTTTCCAATTTTGAAACCATGTTACAATGTTTCTGGAGTGGAGAAACTTGAATTGCCTTCATTTGGGATAGTTTTTGGTGATGGAGCTATATGGAATTTTCCAGTAGAGAACTACTTCATCAAACTTGAACCAGAGGATATTGTTTGTTTGGCAATGTTAGGAACTCCTCATTCGGCCATGTCGATAATTGGCAACTACCAACAGCAGAATTTTCATATCTTATATGACACCAAAAGGTCAAGGCTGGGATTTGCACCAACAAGATGTGCTGATGCC
SEQ 16
TCACATAGGAGCAAGATGACCTTCTTTAGACAATTTATCTTGCATCCACCTCTGAAGCATTTCCATTGCTGCCTTAGGCTGATCCATTGGAACCATGTGACCTGCATCATGGACCTTAAGGAAAGTTAAAGGTCCATAGTTTTTTTGAACTCCTTTCTCTACACCATCTACTGCAAAAGAAACTTGTGTGGCTTTTCCAAAGGCTTTTTGCCCTGTCCATTTCATTGCATGCACCCATCTCGAATTCCCTGTCCATATAACAAGAAATAGAGTTTTATAATATTATTGTTAGTTGGTAGCTTTGAATTGACTTATAAACAATATGATAGAGTTCAACTTCTATATTTTGACAGACGTAAAAGATAACTTGAAGTAACTTTTGTTAAAAGTAGAATTACTAGCATGAAAAAATAAGGTAGGTTAAAATACACTAATATAGTATGAAAATCCTCTTTTGTGTATATAAGTTAAATCCATATGATAATATAAAGCTTACCAAGCCAATTGCAGATAAGGTCATATTCCCCAGCATACACTAGTAGCTTGATACCATCCTCAAGGAGTGAAGGAATTCCCAATTCAAGATTCCTCATCCAGTCCAACTGCATTGCCTGGTAAACTTCAGAGCTACATGAAACAAACTCAATATCCCCAACACCAAGAGCCTTTTTAACTTGTTGATCATTGAGGAAAGTTTCCATTTTGGAGAAATCATAGCATAGATCGCCCTCACATCTCTTCCGCACATCATAGTACTGCAACTCGGAAATTACAAAATTCATATGACTTTAACTTTTGTATACTGACAGTGTAAAAAAAACTTTATTCTATCACGTCACTTAAAGGATTGTAACTATAGATACCCGTTCATTATAAGTGAGATCAGTAATTTGGAAAATAAGACAAGTAATATGTTGTACATCATTAGCTCAGAGAAAATGAGATTGGTCTTTCTTACGTTTTTGTCACCAGCAATGTCCATAATCTTGTTGAAGATGCTTGTACAAACAAGATATGCAGCCATGCAAGCAGTTCCGCCATCTTTTCCTAATATTATTTGCATAGAAGAAAGCTAATGTAAAGACTAGCTGCTGCTATATGAAGAAGGAAAAGGACATTGAGGATAAACAAAATGAATTACCACAAAGCTTAATTGCTAGTTGACATTTTGGATATGACTTCTCTATGGCATTGTAATCAGATTTTTTGATCAATTTCATATCCAGAGCATAGTCAGTGTAGGCTTTGTATTGAATTTCTGGATCAGTGAGTCCATTACCAATAGCAAATCCCTAAAAAAAATTGTACTTTGTTAAGTCATTGGCATGACGACAAATTCAAATTAAACCTAACTAAAGGTAATTACTGGATAAAGAAAAAGGGATGATATATGGTAAGAGTTAGAAATACCTTGAGATTTACGTAGATTCCTTCTTTATTTTTGTTTCCTTGGTGAACCCGAGAAGCAAATGCAGGAATGTAATGCCCAGCATATGATTCTCCAGTAATATAGAAATCATTTTTTGCATACTGTGGATGTGCCTTGAAGAAGGCCTATCATCAAAAGAATTTGAAAAAGTTTGAATTAAATTTTATTAATTATATCAGTTAAACTTTAGAGATTTATCACGAGCTAAAAAAAAGGAATGAAAGAATAGGATCAACCTGCAAGAAGTCATAGAGATCATTGCTTACGCCCCTTTCATCGTGACGAATATCATCATCGTTTGAACTATAACTGAAACCAGTTCCAGTTGGCTGATCGACGTATATAAGATTTGAGACCTGTAAAATTGCAATTTATCATATGTTATCATTCTTCAACTAACAAAGGAAAGTTGCATGTTTGATTATAGGATTTAACCGGTGTAAACGATTTTTACACTATTGTTATATTTTAACATGTTGTAACATGTTGTATTCGTCCCACTTAAATAAAGTGAAGAGAAGCGTAGTAGTCATTGATGTCAATAAACGTTGAACTACTTTCGAATTTTTGAAATTCTACAAGTCACAGCTAATGAACAACAAGTGTTAAAGAAAAAAATGCTAGTAGGTAAAAAGGTATTTTGCATGATGGAGAAAGGTTGAATAACAAATAAAAACATGGAGGGAATTCTTTTAGATTTTTACCATATTCAAAAGATCTAACTGACGTTTCTTGAGAAATTAATTGGGTAAAATAAAAAGAATAAACTGAAAAAAAGAGAGGAAAAAACAAAAGAAAAAGCAAAAGGAAGAAAACAAGAACCTTGTCCCAGCCGAAATCATTCCAGACAAGAGACATGTTATCTGCAATTTTGAATGGTCCATTTTCATAAAACACAGCCAATTCACTGCTACATCCTGGCCCTCCAGTTAGCCATATAACTACTGGATCATTCTTCCTGCTCCTCGATTCAAAGAAAAAGTAAAACATCCTGCCAAAAACAGATAATTTAGCATTAATTAATAATACCCATAAATTCATTTTTTTACCAAAATGAAGCAAGAAGAACATTTAATCCAATTCAAACCTTGCATCTTTAGTATGTGGAAGACGATAATAACCAGCGTGATGACCCAAGTCTTGAACTGTAGACCCAGAATTACCAACATAAGATAAATTCAATTTCTTTTCAAAAAGTCTCTGTTCAGTAACTGCTGCAGAATCCCCTGTTGCTGCAGCCTTGTTGATATCATGCTTAGGGAATAAATTAAGCTGTCTGATTAGCTTTTCTGCCATTGTTAATGGGAATTTTGGAGTAGAAGATAGGAAAAAATCATCATCATTAGAATTTAAAGTTGATGAGAAAGATAAGGAAATAGAAGCAAGAAGCAGAGTAAGAAAGAGAAGAGAGAAAGATGAAGGCAT
SEQ 17
TCACACGACACTTTGGGGTGGGATATCCTGGTTCACAACCCGGGTGTGAAATCCCTCATTATCAAAGTAAGGGTCACACTGCCAGAAAAAACAACTTTATTAGTGGTTGATCAAAAAGATCCACAGTAGCTAATGGTTTTAGTGGAAGCTGTAACCTCTGTGAGGTGAAAATAAGTCAGCATTTACCTCTTCTGTGACCACACCCCTACGGATAAAGTATCGCCATGCTGATATAGGATATCCACCATCACAACCACTCCCACGTAAAAAGCCACAGCATGCTAACAGATCATTTACGGACAGAGAGATATTCTGCAATACACATTAAAAGTTTAGCATCAGTGACCATAACTACAGAAATACTTCACAAACATTTTGTGCTAATTAAGATAAGATGGTTTCCATGTGCTTGGCCCATGAAAAAGAATCAGGGCTCGCACGTGAGAGAGCATGCTGAAGATATATAAATAACTAAATAAAAATGTGTCATCTAAAGCTTTTAAATGAGGCGGTCACACAGTGCAACAGAATTTACCAAGTTATGATGGATACAGAAACGATCAGACAGAGATTCAACAGCACCAAAAGCCCAACAAGAACCGCAATGTCCCTGATCTGTGATGCAAATTTGTCCCGGTGATTGATGTGCAAAGATGGAAAGCATTAGGATCACTAAAATAGAATTATAATTCAGTAGTAGTAAGCAAGAACAAGAAGAAACTGACCTAGAATTCTTCCGATAGTGCTACATTGAGGCCAAGCTTTTCGTGCATCAAACTCTTTTGGTAGCTCCAAAAGCTTTGGATGAGTTAGAATCGGAATTCCCTCCAAATCACCTTCTCTTGCGGGCTTAACTCCAAGAAGGCGCTTAAATTGTGAAACCTGTGAATACCAAAGATGGAGATAATTATAAAAGCATATTTTCATTGTATAGCTGCAAAATAAGCAACTGTTATTGATGGTTCGAGAAAGCGCTAAACTATATGATGATATGACGACAAGAGGGGGTTGCTCCGATGGTCAGCATCCTCCACCTACGACCCCAGGATTGTGGGTTCGAGTCACCAAAGGAGCAATAGCTCCAACAAAGAGGATCACAGGGGAATCAAAAGGGGAGGGGAATTTTAAAAAAATGATGATCTGACCGTGAAATTCGAGAATCGAGGGTTGAATGCAGCTTTCCATCCAGCTTTGGCATTTTCATTAACCTCTTTGATGATTGATTCCTGCATGATCATCCAAAAAAGCTCTCTCAGTTTTCGAATTGAAGGACAAGGGCTATAACATCTTGAATCATGAAAAGAAGAATAATGTACCTGAAGGATTGCAGATTCAACTTTAGCTTCAGATATTGGCTTCTCTGCAACAACCTGCTTTCCATAAGAACACATCAATTCTATTCATCAATAGACAAGCTAAAAGCTTTTAGGAAACAGAGTTGCAATTCCAGGAACAAAAGTATGACAGTACTGTGACAAACAAAGAACTAAAGGTTTCACTTTAATCAAGATGAAACGCTTCCAACATTTCTTATTTTCGACATTATAATCCTCTTAACTTAGGAAAATTAATAAAAAATTTGTAGCAAATGCATCATATGTCTAAAGCACTATAACATAGAGGAAGAACCATAGAACAAGCATAAATTGTAACTATTCCATTATTTGTCCTCCTTTTTCTCCCCTTTCTGTATTTCTTTGTGAAGCAATACTTCCTCTCATGTTATATATAGAAACATGTAAGTTAGCTAACACATAAGTAATTTGCATCAAACCATATATTTAACTTCAGAAACATGTCTATACTTCTGTTTTCTCATTCTCACTAGGTAATAAGAAAATCATTAAAATTTATTTCTACTCATGATTTCAAGTCAACGCTTAACTAAAGCATAAAAAGTCCAAAATACCCAACAATATTTGATCTTTCTGAAGAAATACAAAAAGGGTAATCCATGTAATCATCAAAACCTATATAAATTAAACCAATAATCTAAATCCATCTAAACAAAGAAATACTCTTACTGTAGAAAGGTTCAACGAATGAAGAAACAAACCTGCAATATAAGGATACAAAAAGCACCCAAAAACAAAGGAGCTGCTAAAGACTTCAGAGTCAAGGTCAT
SEQ18
ATGTTCCGACTAGTAATGGTGACAAAGTTTAGTATTTTTATTTTGGTGGTGTTGTTGAGGTTATTTTCATTTGGTTTTGTAGCCTCAAGAGAAATTCACAATTTTGGTATTAATCTGAATTTTAGTGCTTCTGGTATTGAATTCCCTCAACATCCAAGCTTCAACTCTGTTACTGCTTCTGGAAATTCAGATTGCAGTTATGGAACATCCAAGAAATCAACAACCACCCATGTAATAACTCAAGAAGAAAATAATTCTGATGAAAAAGAAGATGAAGATTTAATGGTATCTGAAAACCAGCCAAGAGAAGCAGTCAAGTTTCACTTAAGGCACAGATCAGCTGGTCAAAATATAGAGGCCAAAGACTCAATATTTGAGTCCACAACAAGGGACTTGGGTAGAATTCAGACATTGCATACAAGGATTGTAGAGAAAAAGAATCAGAACTTTATTTCAAGGCAAACAAAAAATAGTGAAAAAACTACACAATCTTCTTCATTTGAATTCTCAGGTAAGCTCATGGCAACATTAGAGTCAGGTGTGAGTCATGGTTCAGGGGAGTATTTCATGGATGTTTTTGTTGGTACACCTCCTAAACACTTCTCTTTGATTCTTGATACTGGTAGTGATCTTAATTGGATTCAATCTGTTCCTTGTTATGATTGTTTTGAACAAAATGGTCCTCATTATGATCCTAAGGATTCTATCTCTTTCAAGAATATAAGTTGCGATGATCCGAGGTGTCACCTTGTTTCATCTCCTGACCCTCCACAGCCTTGCAAGTCTGAAAACCAGACTTGCCCTTATTACTATTGGTATGGAGACAGCTCGAACACGACTGGTGATTTCGCGCTTGAGACGTTCACGGTTAATCTCACAACCCCTAATGGGGATTCAGAGATCAAGAAAGTGGAAAATGTGATGTTTGGTTGTGGACATTGGAATAGAGGCTTATTTCATGGTGCTGCTGGTTTGTTAGGACTTGGTAGAGGACCTCTTTCGTTTTCGTCTCAGCTTCAATCTTTATATGGCCATTCCTTTTCGTATTGTTTGGTTAATAGGAACAGCAATTCTAGTGTAAGCAGCAAGTTGATTTTTGGTGAAGATAAGGAACTCTTGAAACACCTGAATTTGAATTTCACTTCATTGGTTGGTGGGAAAGAAAATCATTTGGAAACATTCTATTATGTGCAGATAAAATCAGTTATAGTTGGAGGTGAAGTGCTGAATATACCTGAGGAGACATGGAATTTGTCTACAGAAGGTGTTGGTGGAACGATCATCGATTCAGGAACCACTTTGAGCTATTTTGCAGAACCAGCATATGAGATTATAAAACAGGCATTTGTTAACAAGGTGAAGCGCTATCCTATTTTAGATGATTTTCCAATTTTGAAACCATGTTACAATGTTTCTGGAGTGGAGAAACTTGAATTGCCTTCATTTGGGATAGTTTTTGGTGATGGAGCTATATGGACTTTTCCAGTAGAGAACTACTTCATCAAACTTGAACCAGAGGACATTGTTTGTTTGGCAATTTTAGGAACTCCTCATTCGGCCATGTCGATAATTGGCAACTACCAACAGCAGAATTTTCATATCTTATATGACACCAAAAGGTCAAGGCTGGGATTTGCACCAAGAAGATGTGCTGATGCC
SEQ 19
TTACAAAGGTTGCTGAGCTATCCATCTTTTAAACATATGAAAGCTCTCTTCACGTTTGTACTCAGGAGCTGTGTGCCCTCCTCCCTAAAGAGAAAAAAAGACAAGAAGGAAGAACAAAAACATCTGAGAACTGTGAAAATGTGAGCACAAGACAATTTTACTATCGAGTGTGAGTACAAAATTCTAACCTTTACTGTTGCATATGTCATATGATTAGAGAAAGATCTTGTGTAACTGCACATCAAAGCTATAAATTTAGAATTTCATAACAATTATATCGTCTATCCAACTGGAATAGAGATCCAGAGAAATTAAAATGGAGAATCATACCCTGCAACTTGACCATCAATTGTCCAAGGGCGCCAATCATCAATGATAGAATAATTTAGATACTTTATCCATGCTTGCGTCGATTGGAAAGGAACAACCATGTCATGATCACCACTGGGCAAAAAAAAAAAAAAAAAAAAAAAACTTTGTTATGTATATAATAACTTCAGCTAAATCTTTTTTGCATAAAGCACAAATTAGAGAACTACAAACATTGGTTTAGGTTTAAAGTATCAAACCTGTATATGAGTGATCGATAACCTTTGGAACTAAGGTTAACATGGTAAGGTATACTATTCATGAAAGTAACTCTATAAGTTGTACCCATAATACTTTGCCTACATCTCGCCCACGCTCTTCTTATAGTTCCCTAATCAAAGAAGAACCAAAAAGAAAAGAAAAAAAGCCTTTTAATATCTTGCACCTTAATGCCAAGCAACCTTCACTATGTACAAGTACAAACAATTGAAATACCTTTCTAACATGGAGAGCCTCTTGAACACTAGGGTCATTTGCCCAATGATTGGAGAGCTTGCGTGTCGCAACCTAAAATCAGAAAGCATATGTATATCTAAGTATCCTACATTTATCTGTTTATCCCGAGAAATCATGAAAAAGAACCGGGCATATATGCCTCTTTTAACATAAATAGGTATATATCTTAAAATTATTAAACAATATTCAAACTTTATTCGGTGTAAAGGCATTTAGTTGCTGCAATATTCAAACTTTATATAGTAATATTTGAGGGACTTACACGACTTTCTCGACAAATGAAGTCATCATGTTTTAGAAAGATAAAATCCTCTTCAAGAGATCTCCTTTCACCAGACAATTGGCGTGGGTTTGGCGATTCTGAGTCCGTTCCACAAAAAGGCTCTAGTATTTGCTGATCATTTATACTGCTTACAAGCTACAAAGTTGCATTACATGAACATTGTGGTATTACCTAATTACAAACTTAGTACCAATTCAAAACATTAAAACAAGAATATGTAAGAAGGAATATAACCTTACCTTCTTGAACATTTTAAAGTTTTCTAAACATAGTTTATTGGTAGGATCAATGTTTCGGCAATCGCCTTTGCAAGTCTCCTTCAGTGACTGATACAACAAGATCCAAATTATCATTTTTGTGCCAAACTGTGTTGGTATGATACAAAGTTAATATATGAATTGAAGTAAGAGATGAACCAACCTCATAAAGTTCATTAGATATTAGTCCCATACCATGACAGAAAGGAATTTGGTAATTGCTTTCTTCAGGAAATGTTAGCGGATTGCCAAGTGAATAACCCTGAAAGGATATTAATTACCATTAAAAATAATAATTTTTCATGTATGAGATATATTTTAAATAAAAGGAAAGGAATAGAAAACCTTAAGGTTGATTAGTGGCTTTTTGCCTGCTTCAATTCCTGCATTTTCATATCACTCTTGAAGTGTCAACAGAGAACACTTTGGCTAGAAATGTTTTGAGAATCTTCTCTTCAACCAAGTGAGGCTACACATTCAGTTTTGAAGAGACAACAAAAGAATGTAGTGTTCTATATATCTTATCTAATATCTGTGGAAGTAATGATCAATAATAACAAAGCATAAACTTGGCTTAGAGCCCTAGGAATGGAGAACCTTTGATCAATAGAGTCACTTTACTCCCCCTAGTGGACCAATTCAGCATTAATTGCCAGCGGGCTTCAAATAACGAATGGCTAAACCGGAAAAATAATAATAACAAAGCATACACTAAAGAGTGATGGGGTGATGAGAACGATCACAACCGCACATTATATGAAAATACTAAATGACACTTACCATCTGATATTAGTTGAACAATAACTGGAACTGTAATGCCTGAATATGAGTCCCCAGAAACATAGAAAGGGTTGGAAATGAATTCTGGATGATTATTGAACCACTGCAAAATAAATCACAACTTATTTCGCGATACATTTTTGTTATATTTGGTTGCTATTTAATTATTACTGTCTTCATTTCTTCACGGTTCATTTGATCAATCCACCAAGATACACATATATTAGGATTTAGGTGGGTGGAGGGAATTCAAATACGTATATATTGAGCCTAAAAAATTTATATCCTAGATCCTCTACCAATGAACAATGCAAAATGAGAAGTATATTGAGGAATACATGAAAATTCAGAGTTAACTCTGCTTACTAATTCAATCTAAAAAATACTCAGTTTACTAATTCATATGTATCTAACTAAATGAACTTGAAATGATTTTCAAATACCTTTAGTAGAAATTCATAGACCTGGTCGCACGCTTGTAGATCAGTACACTTGGATGCCGCTGAAGTTGTTGCATATGAGAACCCAGTATTTACAGGCTGTTCCAAGAAAAGTATGCTCGCAAACTACCAAGTTTAAGGAACACATTAATTTGATGATCTAATGTTAACCATAAGAGGATAAAAGAGAACATGAAGTGTGGATTAGAATATATATGCAGTTTGTTGACAGCAAAATGAAGTGTGCAATAGAAAAATAACATGCTCTTCAGTGTTACCTTTGTCCAGGAATATGGAGTTGAAACAAGAATTGGTAGGCTCCCATTGTATGCCTTCTGACCAAAAGCCAATGGCCCTACAAAAGAGAATTCAGAAGTTAATTTTCTCCAACTATGAGTTACACGTAATACCAAGCTTTTCCACACCTATATATTTACTACTAATTGACTAATTAGAAACATCGACTAAAAGTAAATTGCTTTTGTGATACTAACAAAAGCATTTCTCAAATGAGTTAAGTTAATCACAAAAGGTTAAACTCTATTTGACAAAAATTACATTTGAACACAAACAATAATGGTAACTGTTGGGATAAAGATAACCTCCAGACTATGATTACATATTTAAGGTGAGTTACTGATAAACATACTTGATAATACAATAGTAGTATAACTAACATACTATCATAGGTTAAATAATATTTTATAAAAAATATTTACATTGTCAGTATATACAATATAAATATTTACCTACTTCATACGCCACACCCGTGAAGGATGAGCAACCAGGCCCTCCCGTTAGCCATAGCAAGAGTGGATCTTTTTTAGGGTTGGATTCTGATTTGACAAAGTAATAGAATAGTTGCACTTCCTCGGATTTGCCAACTCCAATATATCTAACAATTGTATTCAAAATACATCACTTCAACAAACTTGTTTTACTACTCCACTATATATGTAGCCAGTATGTTCTGAATGAAGTAAATTACCTAAGAAAAAGTTTAGATTCTTTTTATACTAATTGATCTTTTGATCAAATACAAGTTAAAATTCAAAGGGTGGTAAAATTAACGTACCCAGTCTCAAGATAAAAAGGAAGAGGGCCATCAAAACCAGGAAGAAACTCAACAGTTGAGCTATTCTGAGGAAGACTTTGTACATATTGTAGAAAGAGAGTAAGAGGAAGAAGAAGATGAAACAATAGTGGCAGGCGAAAACCAGACAT
SEQ 20
TTAACCAGCTAGAGGATTCATCACACTGCCAATGAATAACACCGCACCAGTAGATTCGTCTCTTATAAGGAATAGGAATGGATGGTCCGCAACAAAATCCATTTCCTTCTCAATAATCAAGGACATGGTCATTATTACAGTAGCGGTAACAGCTGCAGCTTCGGTTCCTTCCTCATTTACCTCAATGAAAGACTTGTGAAAAACCTGTGAAACAGACAGGTTCTGAGGCATAGGAGAATCAACCATCTCAGTGAGGCTACCACCACAAAAAGGCAACGTGAGGCCGAGTCCCTTTAGAATGTTGGAAGCTTCAAATCCAAAAGTTATTTTAAATTTAGGGATAAGAAACTTGCGCGCTCTAACTTTTCCATATGGAACATGGTTATTTAAAAATCCTGGTTCTAAGCTGATTTTTTCCAGTAAAGCAGGTAATCCATCATGGGCATCTGGGAGAATGAAATACATACAGAAGCGACGCGTATCCGTGCCTTGTTTATAAGGAAGCCTCAATATTTTAAAGCAATCAAACGCTGCTATGTACTGCTTCTTCTTGCTAGTCATAAATGGTGCTTGAATAGACCCTCCATTGAGGAGATGGAAGTCATGATCTTTCGTTTCTGACACATCGAACTTCTCATTCCATTCTCCTTTGAAATATAGTGCATTGGACAAGATCAGCCTTGTCATGTTGTTCACTGCATCGCGAGGAAGAATCTCTTTGATAAGACCATTTGTCTCCAT
SEQ 21
TTAGAAAAAAAGCCAATGCTTCTTTCTGCGCACTCTAGCTGGAACCTCTGTGGCACATTATGTTCAAAGTAAAGCAAAAATTAGTATTCAAGAATAGCTATGACAAAAAATTCTGAACTCAGAAATAGTTAAAGCAAGAAGACACTTACCTCCAATTTCGTATAACGAATTGTAGTGAACTTCGCTCCAGAAACTCAGCCACAATTCTGAAAATCAAGAATAACAAGTCAAAAGTTTTGTCTTCAAGAGATTAAACATATCGAGAAGAAATGTTCGTCAGCGCCACACCAACACATTTCCGAACAAGTGATTTGGAATTTATTTACTTGTTAAGATAAGAGTTAATCTGATCAGGTTTACTCAGTCTTTCCTAAGATCACCATGGATGTGTTTGTTTATGAAAATATAATCAGCAGAACATCATCAAACATGTTAAAGGGGAGCTTTGGAGCAACGGTAAAGTTGTCTCCATGTGACCTATAGGTCACGGGTTCGAGCCGTGAAAGCGGCCACTAATGTTCGCATTAGGATAGACTGGCTACATCACACTCCTTGGGATACATCCCTTCCTCGGACCCTGCATGAACACGGGATGCCTTATGCACCGGGCTGCCTTTTTTTAATCATTAAACATGTTCAGCATATTCAATTTCTTGAGAAAACAATTTTAGCATAACAAAAGAGAACTATTAAAACAGGAGCGACGATCTGATGTTACTTTATTGGCAGCGGATCAAGTTTATCGTCTGTAACAAATAACGCTTAATAAGTGCTTGCATCAGTAAATTGTGTGTTTCACGTGAAGCAAGACTCTGGAGATAAATTCTCCTGATAGCAACTAATACTGCTCTTAGTAAGAGCATCAGGAAACTCTGTACAAAGCTTACATTCACTAATTGTAAACATACAAATGATCCACAGTTCACAAATGACAGCGAAGGATTCTTGTCATTAGAATCAGCCTCGTTCAATGATGCAACAATGTCATTACAAATTAAAGGTGTTTTAGTTTTACTATTTAAAAATTAGTTAGTACCAATTCACTTCGACATACTTCCACTCACAATCTATTGTGTGTGGTACCAAGCAATACGATGATACTTTCTCCTAAATGAGAAAAGCACATCCATTTGTCAAAATAAACAAAATGATTCATTTTTGTTTTCCTTTTTTTTCATTTTTGTTTTCCTTTTTTTTTCATTTTTGTTTTATATATGTATTCTTATAATAAAGTGGGAGCATGCTAGAGAGTTCAAAGTAGGACCATGCTACAAAGTTCAGAAGAATACTTTTGGTTTAGTGTCTCAAACAAAACCAGCAAGTACTATTATTTAAATTCTGGAATTTATATCATAATATCATTATTTTAGAGTTATTTGTAAATTTCAAGTATTTATTTTATTTTTTGAGTTTAAAAACTCAAGCAGGAAGTTAAAGAACTAATAATCAGAAGTATTGATTGTGCCATGATCATTAGATACAAATAACATAAAATGTATGTACCCCTAGAAGGTTGTATGTCCTTGGGAAGAATATCAATGTAGCCGTTGTCTCGGAAAGACGTGACCAAACATATTTTTACCCCAAACTGCAAACAAAATACAAGCAAACACAAGTACCATCAGTCACCCAAAACGGGAAAGCAAAAAAATAAAAATGCAGAGGGCATAAGGGCGAAAGGCTGATTAACAACACAAATATCTAAAGCAGTTCCACAAAGGTAAAAGAGCAAGAGAATGGTCATATTGCGGGAAATTCTTTAGGTAGTAGATCTCCTGCTTTTAAGTTGCTTTCGCGTTTGATGCTATGTTGAACTTCAAATATATTCAAGTTTGAACCCATAATTTCTAAAGTGTAGCAAATTTAGTGGTAAGAACCTAAAAGCTGAACCCACCAAACTTAAATCCTGAATCCGCCTTCGTATTTGATGTCTTAGACATTTGTATGCCTTCTAAGCGTAAATGTAGGTTGACAAGATGAAACAGGGCAAATTTAACTTTTTTTCTCATTTCTGCTCTAATATCTACAGTTAAATTTCAGTAACCTACATGCAACACAGGCATACTGCACAAAAAGTCAGGCTAAAAAATAAGAAGATGCCGGCGTTTGTAAGAAAGTAAGAGGATTATTAGAAATGTTCTTGGTTATTTTCTCCGACAGCCTTGGGCAAGAATAAAGCTCTAAATTTCACCGTCAAAGAGTTCTAATCAGGAACCATTTTAATACATTGAGGGGTTTTCTCTTAAGGAAAAATAATTTTCATAAAAGCCAAATGGTTAAAGGTTAACAGCTGAGCCGAATTGCCTTAATACTCATAACTCTTAATGCTACCATATTATATACCAACACCAACTCAAAAATTCACTTCCTAAAATTTTTCAACTTTTTCACAGGAAACTATTCCACTATGTGTATTATGCAATGTTAGATACTATTAACAAAGTTGCTTCATTGTTTTTTTTCTCATTTAAAAAGCTTAAATTTGCAATGCAACAGTTCCACTATCCAGGCAATACACCTTTATTGACAGTATAATTTGTTGGCTTTTCTTGTTCGTAGTGTTCAAATGCTGTCTTACATACTAACAAGTGTACCGTATATATGCCTCAAAAGCTAGTTGAGGACAAAAGAGAAACTTTCAACTTACTCTATCGGCAGCGGCTTGTAGAGTGACATGATCTCCCCACTCTCCCAACCTGGTTAGAGCAAGAATATGTTGGTAATATATTGAGATTTCCCGCAATAAGAAGCACTGAAAATTGACCAGACAGCTTACCTCTTCATTTTCCTCAAGTAGCTTTTGTATCTCATGGGCACATAACCTTCATATAACTTTCTAAAGCGCTTTAGCTGAAATGGATTGCCAACTAGGATAAGAAACGTGTAGGACATAAACTAGACATAGGCGTCAAACTCAAGCAAGTATAGGACTCTGTTGATGTTTAATAGATCCAGAAATTAGGTTAAGCAGTTGAGCTGATGAGTCTCAAGCAAACACCGGGATATTTTATTTTTAGTACTCTATTATTATGCACAAGTTACTTTAGTTCTACTAAAACTCACGAAAACGAATATAAACTGCAGGATTTGTTAATCTAGGATGAACACAATCCTCAAACATAATCATTATGTAATCTCAACTTTTCACCCATCCTGTTTCACAACCAAAGGCAAGAGATGAGAAGCAAATATTGAAAAAATGCAAACAGACCTGTTTAACGACCTCCTTCCTTACATGCTTATGATACTCTGGATTATGATACAACTGATCCGAAAGGGCCCGAAACTACAATAAGCCAAGTAGACAGTCATATAGCAGGATTAATCACCTGTGAATTAATAAGTAGCAGGTGGGTCTCAAGTTCAATGAGAGTTTGTCGACTAGTCAAAAGAGACCGACCTGGCAATTTCCATCTCCTTCAATTTGCATTTCAGCAAGACCATATGTCGCTAACCTGAAACAAAGAAGTTACAAATAGGATCCGAGTTGCCAAAAGGCGAAAAACATAGAAAAGCACCAAGAAATTTCACCCTTTAAGTGCTAAGTGCAATTAAAGCATGAATTAAGCTATATAGAAAGACACTAATGAAGAAAAAGATAACCTATGAGCAGAAACTATAATAATTTGGACAAAGAATTGAAAAACTAAAATTAATCAATAAATATGAAGTAAAAATCATGTCAATTAAGTTCCATAATACTGGATTCACAAAATAAGTCAACCAATGACCTCAAGCTGAACAACTATAGCATGCTCAGTAGTCAGTACAATGATGGATAACCTGCTAGAGAGCCTCCCATGGTCTAGTGTGGCATCATTTGGGTCAGGTATCTCCCCAATTACCCGTGGAGTGTGCTGCAAGGATAATAAGACTATAAGTAGCATGTAGAGATAGTGCAACATGGGTTTTCATGGGTTAAAGCTCAAAAAGGAATATATAGAGAAGCAACCTTTGGATACATACATGTGGTCACATTATACTCATTTAATTTGAAAGAGTTGCCCCCAAAAATGTAAATTTAGATAATTGTTCATAAAAAGGTCAAAAGTTCCCCGGATTTCTAAACTTCTTGAACATTCTCCTTCTTCCTCTATGGATAGGGTGACGTGACGCGCAATGGATCAGTTAGAAAGGAGAAGAAACTGATCAAGTGTCCCCTTCGCAACAACCTTTTTCCCTTTCTCGCATCCTTGGTCAAGCTTGAGATTTTCTTCTATCTGGATTTAGTTACCCGAACCTATGCAAAGCAAGCCTACACTCTAGCTGGGGCCAAGCCATCTTTGCCTAAGTTCCATTACCTCATTAACTCGATAAAGCTGGCATTAAGAGAAGCTTGGTAGCATAAAGTGTCAAGGAGGTGGTTTTCGCCTATCTATAAGGACAGTTGGAGTTGATGGTCCTGGTTTTGTTACAGTAAGGGATGTTGCAGGAATCACTTGCATTGATGGAACAGAAGCTCTTCCTGCTCTTGTATTCGACGAAGAATAGGCCCAAGGGACATCAATGGACAATCGCCAAGCACATGGTTTAGCTAATTCAATAGCCTTCGGAGAAAAGTGATAAGTACCATCGAACAAGCAAATCACTTCTGGATTATTAACCTTTAAGGATCCACTTATGGTGAAATTTTGGAATGATGCGGGGATAACCCAAGGGAGTCAGCTAAACAGAGACGGAAAATATCAGTACCATTCTGGTCGGCGTATGCTTGTTGAAGAGAACAAGCTCACTTCTAGCTAGCTAAAGGGAAAGAGAGATACTATAGAATACCTAAGTCAATTTATCATTGAGCTGCAAAGATCATAGCAGACACTACGAATTAATAGCAAATCTCGGCACAAACCTCTACATGTCAAGATAGTGAGTCCATCAGATCACAAAAGACTTAATACCAGATACCTTTCTAGATAAAGGGGTAAAATCACTTTCTTTCTTCCACAAGATCTCCATTTAAAGAGACTATCAACTGTACTTAAATGTAATCATATCAACTGTACTTAAATGTAATCATACCGGGATTGAGTCCAAATGAGAAAGTCTCCTCCCAAGTTTATTGCCACCATATTTGAGAGCATTTTCTTCTTCTTCAGCTAAAATTCTTGCAATGGTATGATCATCCTCAGTACCGTGTGAACTACTATTCAAACTAGATGTAGTAGAACTCGAGCTTGCTCTTGAATTTCCATAGGATTCATTCAT
SEQ 22
CTAGAAAGGGTAAATGAGACCTCCGAACTTCCCAGAAAATGCTTCCTTCTGAGGCTGTTGCACTATAGTTGCATTGCTCATTCGCTGCATTTAAAACAAATTAACTGTGAAAACTACAGTAGCAAAAGGTTAAAGAAAACGAACATGAATAGCACGTCAAGAGAAATTGGCTTTGCTTTAACGGTTATTTCATCTCTGTCAACAATGAAATGGCAATGAGTGAACCTTTTCAGAATAAGTTGGCTTATCTCATTAATGAGAGACAGATAACAAGAGATGTCTCCTCTAATCTCTAAATTGATATTTCATGTTGTATGGATCCTAATAGGATGAGAAATGCATCAAATACAGAAAGGAATGGCAGAAGTGGAGATATACCTTGAGGCAGAGGTTCCGACTTGTGTCACAAATAGGATAATCATGTGGGCAACAGTGGCGTCCATCTTTGCAACACACTGCAGAATCCAACCCACAACATTTCCAAGAAACGCAAACTCCAAGAAGCCTCCACCCACAGCAGCAGGTTTCACCTTGACCACATGAGGTAAACATACTGCATTTGCTTGGACCAGGAGATGGAGGAGATGGTGGATTTGGGCTACTCTTAGTTGGATATGAAGCTAGCTTATTGATCCCACATATCCCTTCTTGATTCCCACTATTACGCTGCATGTGCATATAACCATTTATTCCCCAGCTTGTTCCCCATGAATTTTTTATAATCCAGTAATCAACTCCATTTTCAGAACCATAGCCCACAATCAGTACCGCATGATCAAGTACTGTAGAACACGGTCCAGTGAATATCCCCTGCAACACAAAGATAGCACCTTTATATTTCTCTCCGACAAACAATTTAACTGATTAGGAGATTGGTAATTTGGAGATGGAAGATACCTTTGAATATGATTGAAATGCTCTCTCACTGCCGCATATCCCAACACTCACGGGTTGATTTGCCACCGCCTTTAGAAGCTTGTCCTCATCATATTGGGGAACATCAGTATATCCATCAATGGTTACAACACGTCTTTGTAGCTGCAAAGTCGACAAGTTAAGCCAAGCAATCATATGTTAACGTTGTTCATTATGTTTTATCTGGAATAAACTTGTCCTAGGTTCTCTATATTAATTATGAAATCCAGAAGCAGGAGGGCATATATAGGACAAAAAAAGATTTAGTACAAGAATAGGAGCAAGAAGATGAGAGAACAATTATGTACCTTGTTTTTGTTGCATGTTCCTTCTCTTTCATTAAAGGGGTAATCCTCTTCAGTGTCAATACCACCATTCTTTTTGACAAATTCAAAAGCATAGTCCATCAATCCACCTCCACAGCCGTCATTGTAACTTTTGTCGCAATCAATTAACTCCTGCTCAGAGAGACTTACAAGAGATCCAGTGACAATCTTATTGATACCTTCGATTGCTCCAGTGGCTGAGAATGACCAGCAAGCACCTGGGAAAATGAAACAGAAGTAACTGGTTTTAGTTACAGAAGCTAGTTGCTGAGATTAAGTATATGGAATGACAATAGAATGACAGTGTTGTGGACAAGGGCAAATTTGATTCATATTATCTCAGAACAAATTCACAAAAAGGCTAGATCTTCACTTCCGTCCTATATTCAGGCTGACATTACCAGACATATCTACAGAAAATAATTACTTGAGAAACATAAAGGCAGTGATAAATTTTAAGAATAAACTATACTTAGTGAGAATTGTGTGCAGTCATAAAAGTAACAAGTCTAAGTCCCTGAAGCAAATTCTGCATTGGGAGGAAAGTATATTTCCGCGTATATGACAGCCAAATTAGTTGCTATAAAACATCACACTAGTATGTGACTCAATATTGACAGTAAAATTATAAAACATGTTCCTTCGATTGCACAAGCAAGTAGAGAATCATAGGGTACAATTGTGTACACAGTTCCAAAAACAAGAAAGAAGAGCTAAAACAATGAATTGTGAGTCAATGATTCAATGGTCGAAAACAGGACCAAGAAATGGATCAGTGGATATTTATATTTATTTCTATTTTTAAAACTTAAAGGGCATGATGTGAAGACTGAAGCTGGTAATCCAGTTTTGATTGTGATGCACATGAATGGATGTGGAAAGTAATATTCTCTGGAAGACAGAAGACCACTAACCACCTCAGTTGCTCAGACCAAGATAGTGAGTAGATCTCCCCTAATCTATTCAACAAAGTTCATTGGAAAGAAACAAACAATGAAATGGCGGATCTCCGAGCAGTCTGGTGAAGTTTATGGTCCAGTGGTTAAAAAGAATGACAACTCAACCATATTTTACTCCTCCGATGTGCTCCGTTCATGATCTTTTTAATATTTCTCGCTAATCCGCTAATCAATAAAGAATGAGATACTGTATCAGTATGTCCTATTATTGTTGTCTTCCAGTCACTCTGAAGAAATGATTTTCACATACATAGAGACAAAAATTGAAAGTAAGAAACAACAACAACAAACCAGTGGAATCACATAAGTGGGGTCCGGGAGTATAATGTGTACGCAGACCTTAGAGGTTGTTTCTGATAGACCCTCGGCTCAAGAACAGTGAGAAAATTGAAAGTAAGAAACAAACAGTATATTCATTCCTAATCAACTCATGAAAGGACGAGCTCATGAGACTAAGTTTCAACAACAACAACCATATGATTGTTTATTCCACTTCATCTTGATTCCAATACCTAATAATTTGTCTTTTGGGGCAACTCAAGGGTTCCTAAAGCTAAGAATTCTCTAAATCTCACACTTCTCCTTATACAAACATTCAAATCCTAACCAAACTGAAAGTGCTCCTGTCTAATACTGATGAACTAAAGTAAGTGCTGAGGCTAGGTTTCAATGAAGTAAATTAGTCCTGAACTTCAACCTGTCAAATAATACAAGGAAAAGCAAAAAAGGGTAGCTCCCAGACAAGAGAAAAAGGCAAAACTAACATCACAAGTTTCCATTGTCATTTGAGAAAAAAATCATCAAAATCCAAACTTTGTAAAAATTTCTAATGTTGGCTCTACTATGCACAAGTTATATATCCTCCACATAAATGAAATCACTATAAAGATACAACTAAAAGATAACGCAATAAACTGAGCATACCACAACTGCCTTGATTCTTGACTTTAGTAACAGCTCCTTTCTCTCTCCAATCCAAAGAAGAAGGAATATCAACAACACCAACATCATTAAAAACTCCAGCAGAAGACGACCCAGTTTTCAATCTAATAAAATCATTAGCAGAAGAGGACAAACCCAAAAAAGAGTTCTTGAATTCATGGTGAGTGAGATCAGAAAAGGCATTGAGATTAAGGGTATAAGTGGAATTCCCCTTACTATTATGCTCTATAATATAAGCATAATTTTCTTCAAACACCTCGAGTCTGTACACCCTTTCTTGTTCAGAAGAATATGTCTTTCCATTTTGCTGACACCAACTTTCAAAAAGATCAGAAATTGATGAACAAGTGCAAATTGGTCCTTGAAAAATTAGAAGTACAAGAACCAAAGATGGACATAACCAACTCAT
SEQ 23
TTAATGCTTATTCCAGAAACTCCACTTCTTCTTCTTCTTCTTAAAGTCAAATGGCAGGAAGTCTGGAATGGAGCATCAAACTACAGTATTAGAATAATATGATAAGGGTAGTGTGTACGCCGGTGGCGGACCCAGGATTTTGTGCAAGCGGGTTCAATCTTAGAAGTATATAACTTTAGTTGTAAAATAGTAGTTGTCAAGTGGGTTCAAATAAAATATTTAAACAAAATTTACGCAGCTTTAATCCTAATTTATACATATATACAGTATTAGTTTTTGATGCTTGCCACCACGTGCGTCCACCACTGTGTACACATACCCCTTACCCCCTACCTTGTGAGGATAGAAATCTAAATGAAGCAAGAGCAGCAAACTTCCGGTCAACTCTCAATGTTCATCGCTTTGTCCATAAGCATGTGATAAACAAAAAGTGTTATTCCGTAATGCCCATGGTAACCCCCCCCCCCCGGGGGGGGNGGTTTAAACATGTAATTAATCAGATATAGGCCAATTAATAATAGTTGAGCGACCATGCTAAAACCACGGAACTCCGGAGTACCTAACCCCCCCCCCCCGGGGGGGGGATGTCCATGCTAAGACAAACTAAACAGAAACGGGACATAAAAGTACAAGCAACTACCTCCTTGAGGATAGATTGAGTTGTAGTGCACCTCTGCCCAGAAACTCAAGTATATGACTGCAAGAAGAGAAAAATAAGAGAAACAATTAAATTGGTGTGAGCGAATGAATATCAAGACTTCAAAACACCGCTCTTACAAAGTGCACATGCACAAAGAAATTGCATTCATACTTTAATTTCTCTTCCAAGCAATCTCAAGATTTGCTTGCCCACACTTGGCTTCATAGTATAGGTATGATACAATGGCATAGAATAAATGACATGCATACATAACCAATATAAAGCTTGCCCCCATTCATTAAACTTACAACCGTCTCATAATCATTCAATATGTTAAAACAGACAAATTCCGGTCTCTAAAAGGAGAATGTGAATGTCAAAGCATCCATGTTATGAGATGGAATTTAGATTTCAAAAGAGCTAAAACGGACGACTCTTCAAAAATCAAAATCTCCTTCTCATGAAACGCAAAATCGAATTTGCTTAAGATTGTCCTTAAGGGTTCATAGTCATCCATTCATCCCTCCTTCCCCTTGCGCAATTTTTTGGTCAAGGCAGGCCGAGGTACTAACTTTACAGTCCAAAGATCAAAAGTACTATTTGCATTCTTCACGACTCATGTAATAAACTTATGTTGTCTCTTTAACTCCAGTGGTCCTACTTTATCAGAGTCGTTATCTGATTTTGGACTTCTGAAAAAGTTTGATACAAAGATCGTACTAACTTTTCCATAGTTGGCACAAATTTCAAGAATCAAATTCATCCAGTAAATCAGGTTGCTCTGGTACCAGCTACTTCTATAATTTAATTTACTACTATTACTACAATATGCATAATCAAATTATCTGCTTCATCTCCATGTGTAGCCTGTGTCGTCTGAAACGCCAATGGGGGAGTACTAATTTGGTGGTTACGAATTATCATACTATCTCCCCTTTTTGAATTGTTGAATTTGGTCCTGAAAAATGTTGCGGTTTTGGCTAAAAGTCTAAAACTGCATTGCAGAAAGTATCAATAGAACGACATAAGAACTCGATAATGGTTTCTCAGTTAGTGGAATTACAGCTGAGGAAAGCATCTTTAACCGCAAACTGGAAATACGTACAGCATTAAGCGACTCGTGACTTTTTATTTGAGACACATGGAAATTGAGAAATAGGATCTATGTCACTCCCACTTCCAAATATTTTTGTAATAAAAACTTGTTCAATCGCATTTTGTGAAGTAGAGGATATTACAGAAAGGTAAAAGCAATTCAAAGTTTGAGAACTAGCCTACCTCTGTTTGACTTCTGATTCTTCGGAAGAATCTCGATGTAACATGTATCCTTGAATGACGTTATAACAAGAATTTTCACACCATACTGCCCACAAGACAAAAAGACAAAATCAATGTGCAACACAAGTAGGTTCATTCACAAACCAATGCATCCTAGTCTAGCATCATCACAAATAAAATCTTCATAAAAGGAGCTGCGCATATACAATAAATAAAAAATGCATCATAACCACTCAAAATGGAGAGTGAAAGAAAGAAATAGCAAAATAGAGGCACATGAATTAACAAAAGCTAGTAAAGCACCCAATGGAGGCACTATACCAGGACATCCAAATTATGGTCTGGCCAACAATAGCTTAAGCTTCTTATATTCCAAGGTTAAAAAGTAAACCAAAGTAATCAAATGGAGAGAAAAAACCAAGGAAGCAAATAAGGGGAATAATCAATACCGAGTCAGCAGCAGCCTGCAACGTAACATGATCGCCCCATTCCCCACTCCTGGTTCAACCAATATATAAGCAGATGTTGATTTAAAGGAGAAAAGATAATGCAGAATTTCAAAATGGAGAAACAAAGGTCTGAGAATTACTTGGACATCCTCGTCAAGTACTCTCCATACTCCATTGGGACATATCCCTCATACATCTCCGGATGATGTTGAAACTGACAAGACAAAATGATTTTAAAAAACATACTCAGCTAAAATGTATGTGAAATAATTCAAAAATGAAATCGAAATATCATAAAGAAGATTATCTTATTCACTTAGACAAGCACCTGGCTGACTACTTGCTGTCTGACAAATTTGTGGTGCTCTGGTGTACGATAGAATTGATCTGATAAAGCACGGAACTGGAACAAAAGAGGACATGTGAATAGATGTGCATTAGAAAGAAATAGGATGGAACCTTATTTCAACAAACTTAGAATCGGAGGCGAAGCCTAATCCACTTGGGAAAAATAGGCAAAGTTCCCATACCCCATAAAACTAAAGAGTTGAGAAAAAAATTGAATTATCCTTTTGTCAAACTACTAAAAATCCACAAATATTTTTAATAAAGTTGGAATGGCAAGCAGCCTGACAATACGTCACTTGAGTTTCATATTCCAATTTTTTTAAAAATCCTATTATGAACCACTTCATTCACCATTTCACTGTCACAAACACCAACAACAAGTTTCTATCAGAGCCAAAGAGTTAATTCCAAAGTAAGGAATACTGATAAACCGTCATCAAACATTATGTTTTTTGTTTTTCATTCCCTTTCTTCTTTGAACCAGAGAGTAAGACCTCCATACCACCTAGCACCTTGATATACTGAGATTTTTCATGAGACAAACTTAAAGAATTGGGGATCCTTCTTTTGTTTATGGTTAAAAATTATGCTACAAGTAGTTTAAGGAAAGGGAAAACAATTTTTTTCTTTCACTAGAAATCAAATCATGAGCGTCTCTAGACAGTTTGATATCATTACTGCAAGACAAATCAACCAAGTAATACAGTAACCTGGCAGTTGCCATCTCCTTGCACTTTGTGCTCCACCAAGTCAAATAATTGCAATCTGAAGCAAGAAAAACCATAGCATGGGATCCTTAGCAAAGAATAACTTGCAGCAATAATATTACCATTCAATTGCATTGGCAAAAATATCAACTTCACAGGAAACTTAGGTGGCACAAAACCTAATAAAAAACACAACAATATCAACTTCATAGGAAGCTTAGGTGGCACAAAATCTAATAAAGAAACACAACAAAAGAAAATAAAATGAAAAACCACATGCATTGTCCTGTGACTTTAAGAAAACAAAGGGTTAATAGTATCTATTGGAATGTGCTTCATAAGTGTTTATTTACAAAGCACAAAATACTAGTTGGCTCATTCCACACCAATTATTTTCTGTGCTTGTCTCGGCTCCCATCTCTCCCATTTGATTCTCGTTTTCTCAAAATGCTTGAGGGGTCAGATGTTACTTTTCGAATAGCAGGCAGTAAGAACCACCAGTAAAAGAATGAGTTAATAAAGAAAGAGAAAACCTAACAAAATAAAAATAAAGAAATGTTTCTGGAGACACAGGACCCCTATCACAATAGAGGTATGCATTTCTCACAGTGAAGCTATATTTCATTACCTTGAAAAGTGCTCAGAATTTGGAGAGAATTAAGCAAAGCACTATCATAAGATCATATGTCATTAACTTGTTCATACACAATTTTTATCTACTTGAAAACCTAAGTCGGAGGGATCAGGAGATACCTATTTAGCAGCCTTTGATGATCAGAAGTTGCTTCATCGACTGAAGGTATGTCCCCATTTATTCTAGGAACATGCTGCATGGTTAATAACACAAATTGTTATAAGAAACTAAAGCAGCCTCAAGAAAATGGCCATAGGTGCAAAGCACCACACATGTCCTCGTACACAAAGTGAATTAGTGTTCTCAATATACTAACAGACACTACACTTACAGGAACAGCACTCAGCTGGTTTATTCTCTTCCCTACTTCTCCATCAAGCTCAAATTCATCTTGTATTTCCAATGTGTAGGTATACTCTTCTCCATCGTATGATCTGTCTCCAGGACTCGAACAAGAACTTGAAGGCCCTACATCATCAGCTTCTAGGCTGGTGTCATGCCCTGTGAATCATATATAATTAGCAAATGTTTAAACTCAAGGAACATCACAGAATTGAAAACAAGAAATGTACCAGCATAATACTCTCTTGGAGGAGTATGCCAATGTTGTACACCAGTGGAGGCTTGCAAATACTGCTCGTCTGCATGTGAAGATTCAGCATCTTCTGCGATGGACAACTCTGACAAATCTTCTTGTAGAACATGAGCAATAGCCTCATCATTGTCAACATTGCAATATGATGTGTGATAGTGGTTTTCTCTGGCATATTGTTCATGACATATCTCAACGTCATGCTTTCTACCATCACCGTAATAGTTGGAACTAAAAAGTTGGTCCACATCGAGAAAACTAAGAACGCCACGAGCAGCTTCAGATTCCGGCTCACACAT
SEQ 24
ATGCCTTCACTTCTTCAAATTTTCCTTCCTTTGTTTCCATTCTTTTTCTTGGTTTCTTTCTCAGTTTCTCACGGACCCTTTTTGCCAAAGGCCATTATTCTTCCTGTAAACAAAGATCTGTCAACTTTTCAGTATGTTACTCAAGTTTACATGGGTGCTCATCTTGTTCCTACCAATTTAGTTGTAGATCTTGGAGGTTCATTTCTCTGGACTAATTGTGGCTTAACTTCTGTATCTTCAAGTCAGAAACTTGTCCCCTGTAATTCACTCAAATGCTCAATGGCTAAACCTAATGGTTGCACTAACAAGATTTGTGGTGTACAATCAGAAAATCCTTTTACAAAAGTGGCTGCAACAGGGGAATTAGCAGAGGACATGTTTGCTGTGGAATTCATAGATGAGTTAAAAACAGGTTCAATTGCTTCAATACATGAATTCTTGTTTTCTTGTGCATCAACTACTTTGTTGCAAGGTCTTGCTAGAGGTGCCAAAGGAATGTTAGGACTTGGAAATTCAAGAATTGCATTGCCATCTCAGTTGTCTGATACATTTGGTTTCCAGAGGAAATTTGCTCTCTGTTTGTCTTCTTCAAATGGTGCTATAATATCTGGTGAAAGTCCTTACTTGTCACTTTTGGGTCATGATGTTTCAAGATCTATGCTTTATACACCTTTGATTTCATCTAAAGATGGTGTTTCAGAAGAGTATTATATCAACGTTAAATCCATCAAAATTAATGGCAAGAAACTGTCGTTAAACACATCTTTGTTTGCAATGGATGAAGGTGTTGGAGGGACAAAGATTAGTACAATTCCCCCTTTTACCACCATGAAAAGCTCAATTTATAAGTCATTTATTGAAGCTTATGAGAAATTTGCTATTTCCATGGAATTGAATAAAGTGGAAGCTATAGCACCATTTGAGCTTTGCTTTAGCACAAAGGGGATAGATGTCACAAAAGTGGGGCCAAATGTGCCAACTACGGATCTTGTGTTGCAAAGTGAAATGGTTAAGTGGAGGATTTATGGGAGAAATTCAATGGTGAAAGTAAGTGATGAAGTGATGTGTTTGGGATTCTTGAATGGAGGGGTGAATCAAAAGGCTTCAATTGTTATAGGGGGTTACCAGTTGGAGGATAATCTTTTGGAGTTTAACTTGGGAACTTCTATGCTTGGATTTACTTCTTCACTTTCAATGGCAGAAACAAGCTGTTCTGACTTTATGTTCCATTCTGTATCAAAAGATTCAGCTTTTGATTCT
SEQ 25
TTAAGAAGAATGAGAAGTAAACTTATTTGTTGAATTTAAGAGGTAAGAATATGCAAAAGTAGCATGAATTGCAGCACCAATTGGAAGGACATCCTCATCAATGATGAAATGTGGATTGTGTGGAGGGTAAATAGCACCAATCTTTTCATTTTTTGTTCCCAAAAGGAAGAAGGAACCAGGAACTTTCTCTAAAAACACTGCAAAATCTTCACTTCCCATGAAGCTAGGTGCTATTTTGAAACTCTCTTCCCCAACAATCATTTTTGAAACTTTTCGGGCATGTTCGTATATTCTCTCATCGTTTATTGTTGGAGGAAGTGTTGGATTTTCTCGACCATCAAAGTCAATCTCGACCGTACATCGATGTACTGCTGCTTGTGCTCGTATCACCTGAAAATTTTACCAATAAAAAGTTTAATTACCAAATATTGAATATAATAATATGTTCTAAAAATAACATGGATGTCTATTCCTAATTATTAGCAAGTTATTTCATTCTCCCTAGTTGATTAGTGAATTACTGAAAGGTTATGGCGTTCTGATATTGTTAAATGTACCACTTATTTTATTGAAAAGTTATATTGCATCTCTAAAGAACTGAAAAGTCATTTGACCTCTTGGCTCGTGACCCTTCTCAAAAAACAGTTCTTTGGCTATAAATAAGATTATTTTGTGTTGAATGAATATATCAAGCAACTTGAAAATATTAAAACCTCTTTCCCGAAACAAATCCTACAATTTCCTCAAGTACTCGTCTTTTCAAACAAAGTATTAATGAAAAAGAAACGTAACTTGTTTGAATAAATAAAATTTGCACATATAAACTTTGTAAAGAGACGTGTGGGACTTTGAATATTGGTCAAAGTATCCAAGATTTTTGTTCTAAATTACAAGTAGTTTACCTCTTCAATTCTTTTCCTCAAACCGTAGAAACTCTTCTTACTGAATGCTCTATAGGTCCCGGAAATTGTAGCTAATTCTGGTATGATATTAAATGCATGCCCCCCTTCAATCATGGCAACAGAAACTACCTGAAATTTCAAAAACTAAATATATAAGAATTGATAAAATAAAAATTTAAAATTTGTTTGAATAAGTTTGGAAAAGAAAATTGTTAATCTCTATGTTTCAAAAAGATTGTTCTAGTTTGACTTGACACAAATTTTAATAAGGAAGAAAAGACTTTGAGATATGTGGTCCTAAATAAACCATATCATTTGTGTGACTGTAAAACTTTTGAAACTTGTGATCTTAAACTTACTATAACATTTGTGTAACTATAAATGCTTCTAATAAAAAAAATATTAAAATTTGTCAATTTTTTTGAAACAGACCAATAAATAAATAGTGTCAATGCTTTTGAAACGGAGGTAGTACCTGGGATTCAAGAGGATCAGTCTCTCTAGAGACAATACTTTGCAAACTAATAACAGAAGTAGAAGCAGCCAAAATTGGATCAACAGAATCGTGTGGAACAGCAGCATGACCTCCTTTTCCTCTAATTGTAGCTTTAAAGCTTCCACATCCAGCCAAGAATTCACCAGGCCTAGATGCAACTACTCCACTTTCATACTTATGAACTAAGTGCATTCCAAAAATGGCTTCCACATTTTCAAGAACTCCTTCTTCTATCATATCTTTAGCCCCATGCCCTCGTTCTTCAGCTGGTTGAAAAATTAACACCACTGTTCCCTGCAATATTATTATACACGGTAATTAAATTCATTACTTCAACTAATCCATTAGCTTAGAAGTATGTATTTAGAGCTTAATTAAGGGTTTTATTTAACCTGTAAATTGTGTCGGAGTTGTTGTAATATCTTGGCAGCACCAAGAAGCATGGCAGTATGGGCATCATGAGCACAAGCATGCATTTTTCCATCAACTTTGCTCTTGTGCTCCCATTTCGCCAATTCCTACATTAGAAGAATTCAACTTTGACTCACGACTCTTTTATTGATCAAATTATTCACTTTATAGATTTTTGAAGAATTGATTAATCGAGAATAAATATAGAGTCCTACTGTAGAGGCATATTATATGATATTGACCTCTACAACTTATAAAACCCGACTTATGATCTTTATTTTCTTTTTCTGTTGTTTAGATCACAATTGATATTTGATGTTCAAATTAAATGTTTTAGCGGTGTAATATTATTACTTATGGTACTTTCGGCCATCCTATCCAATTTTACTACTAGGAAAATAAAAAACGTGTTGACCCTTTATTCCACAACATATGAAACTAAAAGTAAAAAGAGATGGTCACCATAGAAGAAAACTAGCTAAAGTATATACCTACGAATTGAAGTGTTTTCTCTTTCCCAATGAAGTTCCAAAATTCAAGAATCTCTTTGTTTTAGGTATAATTAAGCTGTTTCGAACTCTATACTTAATTCAATATTAAGAGAGATCTGATTTATTACTTTCCTTTCATGGCTTAAATATTACCGCCGCCGCCGCCATTTCTGACAAAAACGGAAAGTAAACTGCCGCAAGTAATTTCTTCTTCTGCCATAGTTAATTTAGTCGCCCACAAAATTAATAAAATGACTCAAATTTACTGCCTACACCCTAGTTCCGACCGAATACAACATATAAATGATCCCCGTGCTGTTGTCATCTCGAACATCCTTAATAACAATCTCCAAAACCATTAATGAAATACAGACAAAGGTAAGAAGTAAATTTGAAGATATATAGTACTATATTAGGCCTATAGATCTACCTCCTAAACTCCACAAACTGTTTAAAGTGAATAAAACATTTAAAGAGTTCATATCAATTTTTTTTGATATGAAGAGTTATCCGTGGTTATAAATGAACTAAACGTGATACTAGTATAAATATTCTTACCGTTTTTTGTTTTGAATATAATTGCAGGGTTGAGAAAATTTCCAAGCAGAGACTACTAACCTGAATAGGCAAAGCATCCATGTCTGCTCTGAGAGCCACAAATGGCGGCTTACCGGAGCCGATGGTGGCAACAACTCCGGTCTTAGCCACCGGCCACCGGTACTTTACTCCCATCCGATCAAGCTCCTCTCTGATCAAACCACTCGTCTTAAATTCTTCATAAGCAAGTTCTGGGTTCTCGTGAATTTGTCTCCTTATTTTCATCATCCACTTCACTGTCTCCGTAGCATTTGCTAATTTTGTAATATAATCTTTCACGTAACAGTTTTGATCTACCAAAAACGGATTCAAGCACTCATCATCGCCGTGACACGAAGGAAAAACAATGAACATACATACAAGCACCAAAATTAGAACTTCCTTAGCACCCAT
SEQ 26
ATGAAACTGAATCCTTACTCATGGACAAAGGTAAGTACTTGATTGTGAATTATAACTGTATTATGTACATAAGGTCGCTGCACAACACAAAATGTTGAAAATAAGATGGAATTATTAGGTGGCAAGCATTATTTTCTTAGACTTACCAGTAGGCACTGGATTTTCCTATGCAAGAACTCCAACAGCTTTACAGTCATCTGATTTACAAGCAAGTGATCAAGCATATGAGTTCCTTTACAAGGTAATTAGATTCTTCACGAAATTATTAGTTAAATGTATTTTCTCCTTTGCCCCTCAATGTTGTTCAATATGTAGTAGAACAGTCAATAATTTTATGTTGTTTGCAGTGGTTCCTTGATCACCCAGAATTCTTAAAGAATCCATTGTATGTTGGCGGCGACTCATATTCAGGGATGGTTGTTCCCATCATTACTCAAATTATAGCAACTAGTAAGACTATATTTTCCCTCAAATAGTTGTGAAACAAGTAATGGCAGCCTAAGGTAGTAAGGTGTTCTGTTCTTGTACTATAACATTTTGTGGCCTTGTGATAATGCAGAAAATGAGATGGGAATAAAACCTTTTGTGGATCTTCAGGTTTGTCATTTTTCTTGTATATATTCTCTTTTCCCTACGGATAAGCAGACGGATTACATACCAACTCAGAATTTGTAACGAAATTGTTATGAGAATGTCACGACCCAAGCCCATAGCATGTATTGTCTGCTTTGGGCCTAGGCTCGCACGGATTTGTCTTTCGGGCTACGCCACCTCGAGCCCCAAAAGCGCGTGCACCATGTGAACTTGTGTCATACCTTATAAAGTTCATCACTTTCCTCTATTATTCCGATATGGGGATTCGTCTAAGGTGACATGTGCACCGCTTATTCAGAAGTTTGGCAGCCTAGAAGCTAGTCAGTCCTACTTAACTTGCCCTCATCAGCCCCCTCCTTCATGGGCATCACACAGAATCAAAAGTCACTGTAGAATGTGAGTTGATTTGCAAAATGTATGACCTGATATCTCTCGTCAAGTGGTTTCAGGGATATTTACTCGGAAATCCATCGACTTTTAAAGGTGAAAAGAATTATGAGATTCCATTTGCTTATGGAATGGGACTTATTTCTGATGAACTCTATGAGGTTGGTTTTCCTTTGGTGTTATATAGTACAGTCAAACCTTTCTATAATAGCTACATTTGTTCCGATATTTTTTGGATGCTATAATGAAGTGTTGTTATAGAGGATATATATTAGTATAACATAACATACAAAATCGGCTCCGAGAAAAACTTGGCTTTATAGTAAATGACTATTATATATGGATGCTGTTATACAGAGGTTTGACCGTAAGATCTTAAATATCCTCCAGTTATGCGCTTTAATTTAGTTTGCTTACATTGTCCTTAGAACTAATTGATTTCCCTTTCTCAAATAGTCCTTGACGAGAAATTGTAAAGGAGAGTATCAAAACACTGATCCAAGCAATACACAATGTTTGCAAGATGTTCATACTTTTCAAGAGGTTGGATCCTATTTTGAGGAAAATCAAATATCATCTGTTTGTTTTATGATAGGTTCATTAACATACTGACCTTATGCAGCTTCTGAAAAGAATTAATAATCCCCATATTCTGGAGCCCAAATGTCAGTTTGCTTCACCAAAGCCACACCTATTGTTTGGCCAAAGAAGATCTCTTAATGTGAAGTTTCATCAACTTAACAATCCTCAACAACTCCCTGCGCTAAAGTGTCGCGTGGGTACTCATCAACAAACTCTAGCATTCTTTATGCTATTGATTTTTTGTTTCACTGAGATACTTACGAGAATTTACAACTTGCAATTGATTTAGAATGATTGGTACAAACTTTCTTCTCATTGGGCTGATGATGGCCAAGTTAGAGAGGCCCTCCATATCCGAAAGGTACGTTAGTTCTTGTTGGAAGGGGAACCTTGGAGCAACGGTAAAAATATCTCTGTGTGATCTATAGAGCACGGATTTGAGCCATGAAAGCAGTAATGCTTGCATTATGATAGGCTGTCTATATCACACCCTTGAGATGCGGCCACCTTGCATGAATGCGTGATACTTTGTGCATCATGCTGCCTTTTTTTTTGAAGAACAACAAAATTTAACAAAGTGTGCTACACAAAACTAAAAATATGATCAATTTGATTACAGGGAACTATTGGAAAATGGGTGAGATGTGCAAGTTTGCAATACCAAAAGACAATCATGAGTAGCATACCATATCATGCAAACCTCAGTGCTAAAGGTTACAGATCTCTTATATACAGGTTGAGTAAGATTGTTGTGTTTGCAAGATTGGAATAACTACATAAATAGTTGAAGATTATTATCTCTGTGAAACTATTTACTTAGTTTTCTATGTTTTTTGAATTAAGCAGTGGAGATCATGACAAGGTTGTTACCTTCCTATCAACTCAAGCATGGATAAAATCTCTTAACTACTCCATTGTTGATGATTGGCGACCGTGGATCGTTGACAATCAAGTTGCCGGGTTAGTTTATGATGAAAACATTGTACGCTAGTCATAAGCTCTGTCAAGGTATAGAAGTTAAACTCATTTTTTGTCTTTTGCATGATTGTAGTTACACGAGAAGTTACTCAAATCGGATGACATTTGCCACAGTAAAGGCAAGATATCTCTTTCACTTGCTTTTCTCAGTTAAGTTTGAAGATAAAAAATTTTGTTAAATAGTTGGTGTTTAAATTGCACTATTTTGTTACAGGGAGCAGGGCATACTGCACCAGAGTATAAGCCTCGTGAATGTCTGGCCATGCTCAAAAGGTTGATGTCTTACAAGCCTTTG
SEQ 27
ATGTGTGAACCGGAGTCTGAAGCAACTCGTGGGGTTCTTAGTTTTCTCGATGTGGACCAACTTTTCAGTTCCAACTATTACGGCGATGGTAGAAAGCATGACGTTGAGATATGTCATGAACAATATGCCAGAGAAAACCAGTATCACACATCATATTGCAATGTTGACAGTGATGAGGCTATTGCTCATCTTTTACAAGAAGAATTGTCAGAGTTGTCCATCGCAGAAGATGCTGAATCTTCACATGCAGATGAGCAGTATTTTCAAGCCTCCACTGGTGTACAACATTGGCATACTCCTCCAAGGGAGTACTATGCCGGTACATTTCTTGTTTTCAGTTTTGTGATTTTTCCTCGAGTTTAAACATTTGCTAATTTATATATGATTCACAGGGCATGACACTGGTCTAGAAGCTGATGATGTGGGGCCTTCAAGTTCTTGTTCTAGTCCTGGCGACAGATCATACGATGGAGAAGAGTATACCTACACATTGGAAATACAAGATGAATTTGAGCTTGATGGAGAAGTAGGGAAGAGAATAAACCAGCTGAGTGCTGTTCCTGTAAGTGTAGTGTCTGTTAGTATATCAAGAACACTAATTCACTTTGTGTACGAGGACATGTGCGGCGCTCTGCAACTTTGGCCATTTTCTTGTCACTGCTTTAGTTTCTTATAACAATTTGTGTTATTAACCGTGCAGCATGTTCCTAGAATAAATGGAGACATACCTTCAGTCGATGAAGCAACTTCTGATCATCAAAGGCTGCTAGATAGGTATCTCCTGATCCCTCCGACTTAGGTTTTCAAGTTGACAGAAATTTTGTGTATGAACAAGTTAATGACATATGATCTTATGGTAGTGCTTTGCTTAATTCTCTCTCAGATTAGCACTTTCCAAGGTAATGAAATATAAGTTCACTGCGAGAAATGCATACCTCTATTGTGATTAGGTGTCCTGTGTCTCCAGAATCATTTCTGTATTTTTTTTAGGTTTTCTCTTTCTTTATTAATTCATTCTTTTCCCGGTGGTTCTTACTGCCTGCTATTTGAAAAGTAACATCAAACCCCTCATGCATTTTGAGAAAAGAGAATCAAATGGGAGAGATGGGACCCGGGACAAGCACAGAAAATAATTGTTGTGGAATGAGCCAACTAGTATGTTGTGCTTTCTAAATAAACACTTACGAAGCACATTCCAGTAGATACTGTTAACCCTTTGTTTGCTTAAAGTCACAGGACAATGCATGCGGTTTTTCATTTTGTTCTGTTTTTTTATTAGGTTTTGTGCCACCTAAGTTTCCTAGGAAGTTGATATTGTTGTGTTTTTTCATTAGGTTTTGTGCCACCTAGTTTCCTATGAAGTTGATATTTTTGCTAATTCATTTGAATGGTAATACTATTGCTATAATAACTTATTTTCTGCTAAGCATCCCATGCTGTGATTTTTCTTGCTTCAGATTGCAATTATTTGACTTGGTGGAGCACAAAGTGCAAGGAGATGGCAACTGTCAGGTTATCATATTACCTGGTTGATTTATCTTGCAGTAATGATATCAAACTGTCTAGATGCGCTCATGATTTGATTTCTAGTGGAAGAAAAAAACTGTATTCCCTTTCCTTAAACTACTTACAGCATAAGTATTAATCTTAAACATAATGTTTATCAGTATTCCTTCCTTTTGGAATTGTTCTGGTAGAAACTTGTTCTTGGTGTTTGTGACAATGTCTTAGCTTTCTTTATTACTTTTTAGTTATGCTTGAAAACAGTGGAAACAGTAAAGTTATCTCCATATAAAGTTGTCTCTGTGTGACATATAGGTCATGAGTTTGAGCCGTGGAAGCAGCCATTAATGCTTGCATTAGGTTAGGCTATCTATATCACACCCCTTGGGTGAGGCTCTTCTCGGGACCCTGCGTGAATGTGGTCGGGACCCTGCGTGAATGTGGGATGCTTTGTGCACTGGGCTGCCATTTTAGTTATGCTTGAAATTCTCAACTTTTTAATTTTCATATTTGGTTTTTACTTGTCTATTCTTTCCATTAGCCTTAAGCAGTTGCTCACTGTTCATCATATTTCATTTAAGTTTGTGAAGTGTGTGAGACCATATACAATATTGCTGAATTATGATATACATTGGGGATTGGCAATTTCATTTAAATTGAATTCTTTAGTGATTAGTTCAATAAAGTCACAAAAAGAAAATCGGACTTGAATTATTGATTTGGGAGTTATTTAATTATGAAATGAATACTAGTAAGAAGCGAGTCAAGAAATTTGAGACTGAATGTGAAAATTGGATGGAAGATGTTCACGGAGAAAAGCTGATTAATAGTAATGTTGGTAAAATAGGAAGGGATTAGAACTCGGATAATGAATGTAGAGCGAACTACAAAATATAAGAAGTTGAGAGTTCGGATGGAGTTGGGGGGATGGGTGGTGAATGGAAGTGGTTCATAATAGGATTTTGGAGAAAAACTGGAATATGAAAACTCAAGTGATATATCGGCAGGTTGCTTGCCATGCCAAGTGCCAACTTTATGAAAATTATTTGTGGATTTTCAGTTAGTTTGACAAAAGGACAATTCAAATTTTTTCTTAACTCTTTAGTATTATGGGGGTATGGGAACTTAGCCCGTTTTTCCTTTCTGTGAATTAGGTTTCACCTCCGATTCTAAGTTTGTTGAGATAAGGTTCCATCCTATCTCTTTCTAATGCACATCTATTCACCTGTCTTCTTTTGTTCCAGTTCCGTGCTTTATCAGATCAATTCTATCGTACACCGGAGCACCACAAATTTGTCAGACAGCAAGTAGTCAGTCAGGTGCTTGTCTAAGTGAATAAGATTATCTTCTCTATGATATTTCGGTTTTCATTTTTGAATTATTTCACATACATTTTAGCTGAGTAGGTTTTTTAAAATCATTTTGTTTTGTCAGCTTAAACATCATCCAGAGATGTATGAGGGATATGTCCCAATGGAATATGGAGAGTACTTGAAGAGGATGTCCAAGTAATTCTCAGACCTTTGTTTTTCCATTTTGAAATCGTGCATTACCTTCTCTCCTTTAATTTACATCTGACTTTTATATTGGTTGAACCAGGAGTGGGGAATGGGGCGATCATGTTACGTTGCAGGCTGCTGCTGACTCGGTACTGATTATTGCCCTTACTTTGGTTCCTTGGTTTTTCTCTCCATTTGATTACTGCCTTTTGGTTTGTTTCTTAACCTTGGAAATAAGAACCGTAAGCTATTGTTGGCCAAACCATAACTTGGATGTCCTCATATGGTGCCTTCTTTGGATATTGTAGTAGCTTGTTAATTGCAAGTTGATGGTATGTAGGAAGTAACATGCTTCTATAGAATTTGTGATCCTGTAGTTTTTCATGAGTATGTGTTAATCCTTTATTTTGTAGTGTGGAAGAAAATGTGTGTTTATGTGCCTCTCTTGCTATTTTTTTCTTTCAGTCTCCATTGTGTGGTCGTGTTGCATTTTTTATTGTATATGCTTAGCTCCTTTACGAAGATTTTGCTTGTGATAATATTAGATGAGGATGGATTGGTTTGTGAATGACCTATTTGTGCTGCACATTGAATTGTCTTTTTGTCTTGTGAGCAGTATGGTGTGAAAATTCTCGTTATAACGTCATTCAAGGATACATGTTACATCGAGATTCTTCCGAAGAATCAAAAGTCAAACAGAGGTAACTAGTTCTCAAATTTTGAATTGCTTTTACCTTTCTGTAATATCCTCTACTTCATAGAATGTGATTGAACAAGTTTTCATTAACAAAATATTTGGAAGTGGTAGTGGCATGAATTCTCAATTTCCATGTATCTCATATACAAATACATGTGTCACCGAATTGTGTACGTACTTCCAGTTCGCAGTTAAGGATTTTCCCCAACTTTAATTCCACTATGTGAGCAACCCTTACAAAGTTCTTCAAACATTCTTTATTGGTTATTTCTGAAATGTGGTTTTAGACTTATAGATAATACCAGAATATTATCCAGGGCCAAATTTCAACAATTCAAACAAGGAAAGATAGTATGATCATTCTTAACACCACTGTAGTTACCCCCCATTGAATTTTCCGATAACGAGGGCTACACATGGAGATGAAGGAGATGATCTGATTATGCATATTGTAGTAATAGTAGTAGATAAATTTATAAAAGTAGCTAATACTAGATAACCCGATTTATTATCTGAGTTTGATTCTTGAAACTTGTTCGAGTTATGGAGAAACTAGTATGGCCTTTGTATAGGACCTTTCCGAAAGTCCCAGATCCGATAACGATCTGATAAAGTAGGACCACTGGAGTTATGAGACAACATTAGTCTATAATATAAATGATCAGAATATTGCAACAGAAAACATCAGTTGTCTCTTTCCTCTCTTCGATAGAGGCAAAGGAGATTGAATCTAATTGATTCCGAAATGCTTCATTGGATATTCAATAGTTAAATCGAACTATTCATCTTGCAACTCTGAAAACAGACGTGCTATACATGCAGTATAAGAGCAACATAATTAACATACACTAGGTTGGAGGTTTACTTATCTATGTTTAGGTGGTCGGTTTATGTGCAAGTTTTCCATTTTTCAACAATTTAGAGTTATCAGAGCTACTTATAAGACATGATACTTTTGCTGTATTTAACTTTTTTTGTAAAGTTCAGCAAGAGTTTTTGCTAGTCCGAGTGGAAATAATTATTTGTTGACTACCCATTTTCCCTTTTTACTTGAGAAAAAGATTGAGAGGGGGGAGGCAGAGCAGCATCATGAGTCATCTGGAGAATGCAAAGAGTACTTTTGATCTTTGGACTGAAGAGTTAGAACGTTGGCCTTCCTTGATAACAAAATTTTCAAGGGGAGGGGGGATTAATGGATGACTATGAACCGTTATGGACAATCTTAAGCAAATCCGATCTTGGGTTTCATGAAAAGGAGATTCCCAAGGGTTGACCAGTTTTGGTTGTTTTGAAATCTAAAAGATGGACTGATGAGCATCCTTATTGCCTTTTTAGAGACCTAAAGTTGTCTACTTTAACATATTGAATGATTATGAGACAGTTCTAAGTTTAATGAATGGGGACAGCTTTATATCGGTCATGTATGCATGACATTTATTCCATGCCATTATATGATACCTCTACTGTGAAGCCAAGGGTGGGCACAACTTAGCAATGGATACTTAAATCTGGAGATTGCTTGGAACATCTTTGTGCATATGCACTTTGTAAGAGCGATGTTTGAAGTCTTGATATTGATTCGCTTACACCAATTTAATTGTTTTGTTTCTCTCGTTTTTCTCTTGGTGCAGTCATATACTTAAGTTTCTGGGCAGAGGTGCACTACAACTCAATCTATCCTCAAGGAGGTAGTTGCTTGTACATTTATCTCCTGTTCCTGTTTATTTTGTCTTAGCATTGAGAGTTGAGTGGGAGTTTGCTGATCTTGCTTAGTTTAGAATTCCATTCTATTATCATATTATTCTAATACTGTAGTTTGATGCTCCATTCTAGACTTCCTGCCATTTGATCTTAAGAAGAAGAAGAAGAAGTGGAGTTTCTGGAACAAGCAT
SEQ 28
TCAAGAATCAAAAGCTGAATCTTTCGATACAGAATGGATCATGAAGTCAGAACAGCTTGTTTCTGCTGTAGACAGTGAAGAAGTAAATCCAAGCATAGAAGTTCCCAAGTTAAACTCCAAAAGATTATTCTCCAACTGGTAACCCCCTATAACAATTGAAGCCTTTTGATTCACCCCTCCATCCAAAAATCCCCAACACATCACTTCATCACTTACTTTCACCATTGAATTTCTCCCATAAATCCTCCACTTAACCATTTCACTTTGCAACACAAGATCCATAGTTGGAACATTTGGCCCCACTTTTGTGACATCTATCCCCTCTGTGCTAAAGCAAAGCTCAAATGGTGCTATGGATTCCACTTTAGTCAAATTCACGGAAATAGCAATTTTTTCATAAGCTTCCATAAATGTCCTATAAATTGAGCTTTTCATGCTAGTAAAAGGGGAAATTGTACTAATCTTTGTCCCGCCAACACCTTCTTCATCCATTGTAAACAAAGATATGTTTAAAGACAGTTTATTGCCATTAATTTTTATGGATTTGACATTGATGTAATACTCTTCTGAAACACCATTTTTAGATGAAATCAAAGGTGTGTAAAGCATAGATCTTGAAACATCATGACCCAAAAGTGACAAGTAAGGACTTTCACCAGATATTATAGCACCATTTGAAGAAGACAAACAGAGAGCAAATTTCCTCTGGAAACCAAATGTATCAGACAACTGAGATGGCAATGCAATTCTTGAATTTCCAAGTCCTAACATTCCTTTGGCACCTCTAGCAAGACCTTGTAACAAAGTAGTTGATGCACAAGAAAACAAGAATTCATGTATTGAAGCAATTGAACCTGTTTTTAACTCATCTATGAATTCCACAGCAAACATGTCCTCTGCTAATTCCCCTGTTGCAGCCACTTTTGTGAAAGGATTTTCTGATTGTACACCACAAATCTTGTTAGTGCAACCATTAGGTTTAGCCATTGAGCACTTGAGTGAATTACAGGGGACAAGTTTCTGACTTGAAGATACAGAAGTTAAGCCACAATTAGTCCAGAGAAACGAACCTCCAAGATCTACAACTAAATTGGTAGGAACAAGATGAGCACCCATGTAAACTTGAGTAACATACTGAAAAGTGGACAGATCTTTGTTTACAGGAAGAATAATGGCCTTAGGCAAAAAGGGTCCATGAGAAACTGAGAAAGAAACAAAGAAAAAGAACGGAAACAAAGGAAGGAATATTTGAAGAAGTGAAGGCAT
SEQ 29
TTAGGCCTCAATCAGTTCTCTAATTGGTTTGCTGTTTATGTTAGCTGATGGGATATTAGTGAATTCAGAGAGTATAGCTCTGAATTCATCTCCTGTAAGAGTCTCCTTTTCTAGCAACACATCCACTAATTTGTCGATTGCCTCCCTGTTGTTCCTTATGTGGTTCTTTGCAATTTCATATGCTCTCTCAATTATGTGCCTTACCGATGCATCAATGTCTTCTGCTAGTTTCTCTGACATTTGATTCCTCGCCAGCATTCTCAGCACCACATCACCACTCTGTGTTGCTGGATCTGTTAACGCCCATGGTCCTATCTCAGACATCCCGAACATTGTCACCATCTGCTCATGATATAAACATTGGCAAGTTAATACTTGTGTGTATTCGAATATGTTGTTCTCTTTTAATGTGGTGCAACAAGATGATGTGTTAAGTAAATACCTGTCTTGCTATTTGAGTTATTTGTTGCAAGTCTCCGGCTGCACCAGTAGTGATTTCTGCTTCACCAAAAATTATTTCCTCTGCTGCTCTACCTCCTAAGCTTCCAACTATTCTAGCAAAAAGTTGCTGCTTAGATATCAAGGTTGGATCTTCACCAGGAATAAACCATGTAAGACCGCGAGCTTGCCCTCTTGGGATCAATGTAACTTTCTGTACTGCATCATGGCCAGGGGTCAATGTCCTATAAGCACAAGGACACATTCTTTAGTACTGTGTCTTTTGATTACAAATAACAAACTGAAAAGATTGAATACTTAGACAGCTTCTTAAATTTGTCCGGTTTTTTCATCTAAACACCTTGTCTAAGGGCCTGATATATTGAACACTTGATGTTAGTTGAAAATTCAATAAGGAGCAAATTACTCCCTTTTTTTGCTTCATGTATAATCTAGTATAAATGAAAATAATGAGAGGAAAGAAATGATTGTTAACTTACGCGCAGACACCATGTCCAACTTCATGATATGCTACCAAAATCTTGTTTTTGCCATCTGTCATCTTGGTTCCTTCCATTCCAGCAACAATTCTATCGATGGAATCATCAATCTCTTTCGAGGTAATCTTATCTTTTCCTCTTCTTCCAGCTAGAATAGCAGCTTCATTCATGAGGTTTGCAAGATCTGCACCACTGAATCCTGGAGTTCTCATTGCAATAACACTTAGAGACACATCTTTATCAAGCTTCTTGTTGTTACTATGAACCTTCAATATTTCTTCCCTTCCTCTTATATCAGGCAGTCCAACACTTACCTAATAAAATGAAATATCAATATAAGTGAAGTGTATTCTGGAAACTGTATAATACACCTCATTTTATTGGAATTTTACAATCAAAATCTCATTTTATACCTGTCTATCAAATCTTCCAGGTCGAAGCAAAGCTTGATCAAGAATTTCAGGCCTATTAGTGGCAGCAATGACAATGACTCCAGTGTTTCCAGTGAAACCATCCATTTCAGTGAGAAGTTGGTTAAGTGTCTGCTCTCTTTCATCATTTCCACCGCCAATACCAGTTCCTCTTTGCCTCCCAACAGCATCAATCTCATCAATAAAGACTAAACAAGGTGAATTTTCCTTTGCCTTGTTGAATAAGTCCCTAACTCTAGAAGCTCCCACACCAACAAACATCTCAACAAACTCTGAACCAGAGAGAGATAAGAATGGAACCTCTGCTTCTCCGGCAATCGCCTTAGCTAGCAATGTCTTCCCTGTCCCTGGTGGCCCTACTAAGAGAACTCCCTTTGGTATCTTTGCCCCAACTGCTGCAAACTTTTCTGGGGTTTTCAAGAACTCAACAATCTCTTGAAAATCTTGCTTTGCATCATCTACCCCAGCCACATCATCAAATGTTACTCCTGTATTTGGTTCCATCTGGAATTTTGCTTTGCTCCTGCATTATTCACAAACAAATACTAGTTATTAGTAGTTGTTGAAGATTACATCACTAGACATAATGTTCAATCTTGATCATGTTTATGGAATTTCTATTATAGCATACTGTTGGGTTTCTTAAAGAGATGGAAATGATTGAAATTGTCTCTCCTAAGTTTTATTAACTATAGAGCGATTTAAATAGCCAACTTGAAAATAAAATACACAAATTTATAAAATATTGAAAAACCTAAAATATCTCAACAACCTAAAATATCTAACCGAAATTTAAATTCAAACAAAGTAGACTACTTTTACCACTAAAAATTACTCCTTCTATTTCAATTTAGATGATACAATTTCCTATTAGTACGTTCCAAAAAGAATTATACATTTCTATAATTGAAAATAATTCAACTTTAAACTCTTTATTTTATCTATTTTAACCTTAATAAAAAACTTTTATAACTACACAAATATCATGCCCCCCACAAAGCTTTTACCTCTTAAACTTTTTCAAAAGTCTTCTGTTTTTTTTTTTTAAACTACGTGCCGAGTCAAACTAACTAATTTAAATTTAAACCGAGGAAGTATTATTCTAGTAAATTAACAGTAACAGAAGCTATATACAAGACATACCTTCCTAATCCAAAAGGCAGGTTTGGCCCTCCAGGAGTATTTGAAGAAGAGGTTCTCAACAGCAAAGAGCCAAGCAATATCAATGGAAAAGCTAAATTCCCAAGTAAATCAAGAAGTGGCCCTATGACATTCATTTCAGGGAGATGAGCAGCAAAATCTACATCCTTCTCTCTAAGTTTTCTCACCAATTCTGGTGGCAATCCTGGCAACTGAACTTTAACTCTCTGGACTTTGTTAAGAGCAGGATTGAATATCTCAGCAACAGCACTACTCTCAAAAAAATCAACTTTTTTCACAGCACCTTCATTCAAGTATTCCAAGAATCTTGAATATGACATTCTACTTGAAGTTGCTTCAATTGGTGCTTCAGTTTCTGCTCTTGCTGGTTTAGCCAAAGTCCCTGCTACAAGGCTCAAACCACTACCACTCAACAGCTTCCTCCTATTTATTCTGGTGTCTGAATATGATTTTTGACATGGGGTTTCTTTACTAAAGATTTTAGGATTGTTAGTATCCTTAGAAAGATCTTGGGATTTGCATAGGGGAAATTGAATGACAGACAAAGAAAGGGCAGGGGACATTTTCAT
SEQ 30
TTAGGCAGTGGGATAAGAAGCGTCCATAGCAAGTCCACAAAGGCCTTCTTTCTCATGAACATCCCTTTTGATGCGCATATATCCACTGTCACCCCATTTACTGCCCCATGAATTCTTTATAATCCAATATTTTGTACCGTCAGTTGTTGCACCATATCCCACTGCTGTAACAGCGTGGTTAAGCCAAGTGCTGCATGATCCACTGAATACACCACTTGAATAGAACTGGAAATCGAAGCTACTCCCGTCTATTGCCACCGAAACAGGTTGATTAGCCACTGCCTGCAATAGAGCCTTCTCACTGTTCGCTGGCACATCTTCATATCCTGTAATAAGAGGCGTAAGTCATAATTTCAAGCTTATGGATTCGGAATATTTATCGTTTGAAGTTGCTGGGCTAGATCATAATTAAACCAACTCACCCAATTGAAGATTTGTTCTACCCCTTATATTTTTATGGGCTTACCTGTAATTTTGGCTGCTGAAAGAGCTGACTTTTTCTTGTTGCAGACACCATCTTCTCCTTTGTATGGATAGTTTACTTCTGTTGTGAGGCCCTTGTTTTTCAGGATGAAATCAAAGGCAGTGTCCAAGAGTCCACCGCTGCAACCTTCGTCCTCGCCTTCGACATCACAGTCTACAAGCTCTTGCTCTGATAAAGGGATCAACTCTCCTGTTTTCAGTTGGTGTAGCCCTTCCAT
SEQ 31
ATGGGATGCCGCATGAAATTCTTGAATGTGGTTTTGGTGGTGGCGGCGGTGATGGCTGCTGCCGCCGCCGTGGCCTTCGGAGCTGAGAAATTGCCGGCGGGAGTGCTTAGTTTGGAAAGGATTTTTCCTTTGAATGGGAAGATGGAGCTGGAGGAGGTTAGAGCAAGGGACAGAGCTAGGCATGCTCGAATGTTGCAGAGTTTTGCTGGTGGTATTGTTAATTTTCCTGTTGTCGGTTCATCTGACCCTTATCTTGTCGGGTAATTACTTTGTTACGACCAATTTGATAAGATTATATTTGTGATGTTTTTAGTGTTTTCTTCCTTTTTCTAATGTGGAGTTATATTGCTATATTTGCTATATTTTATTTGGTATGATGACGATGATATGGCTTGAGCTTAAATGGAGAAGTGATGATTGGTATAGCGGACTCCAACTTGTTTGGGACCGAGGCGTTGTTGTTGTTGAGTTTTGTTTGGTTAATTTAGTCATTTTTTGGAAAGTTTGATTCTTTATGATGTTAAAACTTGGAACTTTTGGTGAATGTATGGAAGCTATGGACTATTTGATGTGTTATTAACGTCTTATGAATTTGATCTCATGAATTCTGATGTAAATTTTGTTTTAGTTTGAGGGTAATTGATTTTAAGTGTATTAAAGTACTTGTAACACAATGAATTTTGGTGTGCTGTTTTTCTTTTCTAGGTGCTTTCTTGTTAATTATCGGTTGGATGGTGTTGTTGATGGTAGTGATAGTTCTGTTTGCTTTATCTTGTATCGTTTCTTGTTCAGGTGCAAAATATTTAGGTACAATTGAATGATGAGTTTCGTGTTGCTCTGAATATGAACATTAGCTATATCAGTTTGGCATTTGCTTTCTGTTATTTGTGGATGAGGGATATTCTTATTGATTTGACTATCAATTTTGTTGACCATCGTCTCTTTCTCTCTCTTCTATGCTTTTGGTGTATTTGTAGCCTTTATTTTACAAAAGTAAGACTGGGAACTCCACCAAGAGAATACAATGTGCAGATCGACACTGGCAGTGATATCCTATGGGTCACATGTAGTTCCTGCGATGATTGTCCTCGGACAAGTGGACTTGGGGTAACTCATCTTCCCTTCATCTTGTTATTACTTTTTTAGTTTCTTGTTTAAAGTGTGGTGAAGGAATAAACTGTTACGTGGGTGCAGGTTGAGCTCAACTTCTATGATGCTACCATCTCGTCAACTGCTTCTCCCATTTCTTGTGCAGACCAAGTGTGCGCCTCTATAGTTCAAACTGCCTCCGCTGAGTGCTCTACGGAAACCAATCAGTGTGGTTACTCCTTTCAATATGGAGATGGGAGTGGCACAACTGGCCATTACGTAGCCGATTTACTATATTTTGACACAGTCCTGGGAACTTCTTTGATTGCCAACTCTTCAGCACCGATTATTTTTGGGTGAGTTCTTATTTTTTAAATACCCCTATATCTATACTTAAAATTTCATTAGAAATAGTTGTGGGTCATTTGAACCAGAAATATCTTTGGCCCAATTTACAAAAAAACCATGTTTGTTTACTCAAAGCTTATACTTGGATATGATTTAAAACAGGTGCAGCACCTCTCAGTCTGGGGACTTGACCAAGACGGACAGAGCAATTGATGGGATATTTGGGTTTGGTCAACAGGGTCTTTCAGTAATATCTCAACTGTCTTCTCATCGGATTACTCCTAAAGTATTTTCACATTGCTTGAAAGGAGAGGGAAATGGTGGAGGTATACTAGTCCTTGGTGAGATTTTGGATCCGAGAATCGTATATAGTCCCCTTGTTCCGTCACAGTACGTATTGTTACAGTACAATGAAGTTTCTTTTCTTGCTTATGACGAATATAGAGATTTAATTGTTTTCATCTTTAGTGTGCCTTGTGCTACATGATATAAAACAGTTGTGTTCTTTATAGTTTGTGATCCAGCTTGAGCATGTGAAATATACCTCTCATGCGCTACATCCTGATTTTATTGAAATTTCGTCACTATATTATTGGTTTTGCATCTACAGATATATAGTAGTTGGGTCTTGGGAAGATGACATCAATGAAACTTTACTTTGTACATATAAAAAAGGGCAGCCCGGTGCACAATTTTGAGTGTTATATATATATATATATATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGAACATGTCGATATATGTGTTTATTTTTTATGTTTTTCTACATTATTTGTTTTCTAACTATAAGCACGGGGATTGGCTGGGCACTAGGAAAGAAAATGGTTTGTAGCAAGCTTGATTGTATCCGCTTTCCACTTTTGCAGGGCGCATTACAATGTATATCTGCAGAGCATTGCTGTTAATGGACAGTTGGTGCCTGTTGATCCATCAGTGTTTGCGACATCTGGCAATCGAGGAACTATTGTGGATTCTGGTACAACTTTGGCTTATATTGCCACAGAAGCTTATGATCCCTTTGTCAATGCTGTAAGTTCCTACATTTTGCCAATTTATTTACTCCCTCCGTCTCAACTTGAATGTCCAATTGCTCTCTTTGTCTATACCAAAATTATATCCACATCTCCTAAATATAAAAACTGACCACATAACAACAAGAACAACTACGCCTGTTGGGGTTGGCGAAAAAGGGCAGATAACTGTTGATCAAAACCCCTGAAAAATGTTTTATCATTCATCTGCAGATAACTGCTGCTGTTTCACCATCAGTTAGGCCAATCATCTCACGAGGAAAACCGTGCTTTCTAGTGTCCTCGAGGTTCATCCTTGTATAATTCAATAGATATTTACTTTGAGCTTTTAATGACAAAAATGTCTTTACCCAACTGCTTTGGTGACTCGAATATGACACCCTGTTTTGTTCTGAAAACAGCATAGCAGAGATATTTCCCCCAGTTTCTCTAAACTTTGATGGTGGTGCATCGATGGCTTTAAGACCATCAGACTACCTTGTGCATATGGGCTTTGTCGTGAGTACCAGAATCTGTATTGTGTTTGAATGTCTTCTTGAAGTCTCATCTGAGACTATATCAACAATGCTATATGCAGCACTATTGTCTTTTTGATGAACTACAAAGCTAAGTGACTCAATTGAATGTTTCTAACAGGAAGGTGCTGCTATGTGGTGCATCGGCTTTGAAAAACAGGATCAAGGTGTAACAATTTTAGGAGGTTGGTTCCTTGTTTACTACAATATTTATGCCTCAACCCCAAGTGGGTTCGCTGGTGTATTTATCAACATATCTTCTAGGACTATGGTTACATCTAATATCTGCTGCATCATTACGTGTGCTAGCGTCCTTAATGGGTGCTTAAGCACTCCATCACTCAAACCATCTAAAAAGGACCTAACTATACCTTGTCCTGTATTAATTTAAAAAAAAATATAACTTGGCGAGTGTCTGCCTAGATGGAATGTACCTTAGTGATTGCATGCTTAGATAAAGTCATCATTCCTGCAGATATCGATGTTGATGAAACACGGCCATCCATTTTCTATTAGAATATTAGGTCATTGATTTGCAATTTGAAAATTGTAGCTTGAAGATTGCAGTCCCTTTGTCTTTTCTAATTTTGCATCCGTTCTTTTATTGTGATTGACCAGAGGATAGGTATACTTGTAGTACCAGTAATTCGGTTAGATATGTTCCGTAGCGTGCCAAGGAATTTATATTTCCTGTTTTGCCTAGTTTTTCGAAGTTTATGTCAAAGTTACTTGCTTTGGATTCCGTGCGTGGGACAACTATATATCTTGTAGAGGCAGTGATTGTTACTTTGATGTTGAAAGACAGCAGAGGGTGGAATTCATAGAAAGTTAATACGTGGAAAGGGGTGTTAGACTGGATACATGACTGCAGTCCTAAATCAAGGTTGGAGTTGCATTTAATATGGCTATTCCTAAAGGCTAAAAGACACAGTCTCATCAAGATTGTATGTTGAGAACAAGTCTACGAGAGTGCGCTCTGACTGTTCTCAGAGATAGGTGCCTCAGACTCTAGGGATTAAGCTAACTACGTGTTTTGAGTGGTGTTTTGTTCTTTCTTTTCCTTTTTCACTGCTGGCTAACACAACTTGAAGACTTGAATTCCAAACTGCTTCAAGTTTTAGGTCTATTATTCATGCCTTCTAACTTCTTAGAGTGTTGGTTCCCTAGTCCTCTTTTTTTTCCAACTCGTGCATGCACGCTCATGCCCGTATACGTACATGCAAGACATCTACAGTTGTAAACACAATTTATGACCAAATATTAACGGAAAGGTATTAATCCCTTTCTATTTCTTGTTTGTGTGTGTATGTGTAAACCAAGCACAATGTTATGACCAACATTAAGTGAAGAGAATTAATCTTCTTTTCTTTTGGCCCATGTCCGTGTATTGTATGCGCATTGAATGTGTGAGCCTTTCCTTGTTTATCCTGGTTATGAGTTAATCGGTAGCATACATGGTGAGGTTTCAGGATGCATGATAGTTGCAGATGATAGTGTGAATTAACAAAATTAGTCAAAAGCAGCTCCATGGTGAAACACTTAGCAAGGTTTTGGAATAAGTGGAAACAAGATACTATCCAGGCAATGCAAGTTTACTCCGCCTAAAAAGTATTAAGGTGAAATGATATTAGATTATTATGGTTCCTAAATGCTGGCAGTCAAGTTATTTAAGCTCAAATCTTCCAGGAGAATAGAGAAATGTCGTGGTAGATGAGGAAGGAACACATAGAACTAAAATATGCTGGTTGACATGAAAGAGTGTTGCATGAGTGTTATTAGACAGAAAGATTCCTAGAAAATTGAAATATAAGTTAGTATAACGGTTGTGAGATCAACAATAATATGTGAGAGTGAACATTACATGTCCGCTAATTGAATATCATAAAAATGCGGATGTAAAGATGGTTGTGCAATCATATAAGATTGAACATGATTAGAAACAATCACTTTTGTTGAAGGGTGCAAGTAGGGCACATATAGAGCATGAAAAGGACGTCACATGAGATGGTTTGGCCATATCCTATTAGTCCGCCATATGAACCGGTTATTAAGTGTGGCAATATTGTGTTTAAAGTGCTGAAAGGAACGAGGTAGACTGATGACTACATTCAAAAATTTGTCTCAAATGACTTAGAATCTCATGGAGTCAATACGGCTTGAACTATAAACAAAACCTTATGGAAGAAATGGATCCATATAGGCAATACTAACTAGTTGAAATAAGGCTTACTTGATTTTACTCTACTGGGTGCAGTATTTGTCAGGAGTCTTTATAATTTGGTTAGAGACATGTTTGTAAGTGTTGGTATAAGTTAGAGATTTAGAGAACCTCTAGATTTAAGAGAACCCCTTGTCTTAAAAAGATTATGTACTATCGGAGTGAATTATTTCAAAAAAAGAAGATTATTTACTATCAGAGTGAATTGGGTTCAAATAGAGCAGAATGGCCCCAAATGATATTAGACCTCAACTAGTTTGGGACAGAAGTGTAGTTGATTTATTGATATGCATCTTATGTGCAAATTTTATTTTAGTCATGCGTGTGCCCTTGAGCTCCTTCCTCTTCCCTCTTCCCTCCTAGTCATTCTACTAAATTTGGCTATTCAATTAGTTCTGGCTATGCTTTGGCATTTGAACCATGTATTCCTTGTCAGTTGACTTATTGCATTGCTCTGCATCTTCATTTGTCGAGTATTCTGGATCCATAGATAAACAAGAATCTAGGTTGGTCTAACTGTATTTTTCTTGATTTTCAGATCTTGTTCTGAAAGATAAGATCTTTGTGTATGACTTAGCTCGGCAAAGGATTGGATGGGCAGATTATGATTGTAAGTACCTCTTTTCAAAAATGAAGTCAACATTTGCTTTTGTCTCTATTTTGCCCCCTTTTCTTTTGGGGTGGGGAGGGGGTTGTTTGTTGTGACATGACTTACTTCTTCCCTCTGATTTTCTCTCTATTTTTAATGGTTTTCATGATTAATGTTTTTATTATAGGTTCATCATCTGTGAATGTGTCTATAACCTCTGGCAAGGATGAATTTATCAATGCTGGACAGTTAAGCGTGAACCGTGCATCAGGCAGTTTGCTGTTCAATCCGCGGCACACTAGAACTATATTTCATCTGCTATCGTTGGTTCTGATGATTGGTTCCCCATTTTTAACT
SEQ 32
TCAAGCAACAATAGGGTATGATGCGCAAGTTGCAATACCTACAAACATATCATTAGGATCAGTTATTTACCTCATCAAAATCATGACTACATTGACTAGAAAGTTTTTATTATTATTTGTGTCGATCTAAGATGAACTCAACTTGAACTCGTATATAGAATAGCATTATAGTTCAGATTTTGTGGCCTCTTTTTTTATGGTCCAAGTTGAAAGTTATTTTTCATTAATTTATCCTAATAGAATAAGCTTTCACAATCCGAAAGCAGTGTGCATCCAGTATGCAGAAAGGGGGAAAGGAGACTTGATGATGTGGGAGAAACTTACCACACATGTTCTTTCCCATCTCCATTTTGAAGTATCCATTGTCACCCCATTCAGCTCCCCACGAGTTCTTGATCAGCCAATATGGAGTACCATTATCAACACCATATCCCACAGCAAGAACAGCATGGTTCACATCCTGTCAATGCAATAGAAAATGTTAATGTAGTATTATGTCATTGTCTCGCATATTTAGTGAAGGTCGGACCATATACTTTGTTGATGTCATTGATATTTAACAAAAGAAGAAAAAGAATATGTGCCTCTTCATTGCAGCAAAATCAATCATGTCTCAGTGCCAGATTAGTTGGGTTGGGTGGGGGTATGAATCCTCTGTGTACACTGCTCATCAGACCTTTTTATTCCAATACTACATAATCTGTATATTAAGGCAGATTAGGAGTTTCTAAAAGTAGAGACTATTTCAATCTTTCCTATCATCCAGCATCTTTTTTTGGTTGACAAAGCATGGTATCTTTCCTATCAGTATTATGACACTCGAATTGTTAAAAATCAATAAACACAGAGAGAGAGAGACACACACACACATTACGCATAACTTAAAGATAATTGCAGACACCCTTCTCGACGAGTAATCAAGCACTGGGTTATAACTAACTACTGGGTACAATTCTTTCCCCTACGGTGAGTTACATGGAAAATGTGCTTCTTCATCGACAAAGAATAGCAGATTAAAAGGGCAATATCTCAAATTCATCGGAACTAAAAACTCACCTGGGGAGTGTTGCCACATACGGTGCTGCTGTAAATTCCACCCTTGTACTGTTTGAAACCTTTTACCACCTGATAAGCTACACTAACCGGTCTAATAAATGCAATCGCGTATTTTAGTTCATCTTCAGCACCCTGTAATTATAAATGTTTAATTGTAAGGAATCCAACCAATGCCCATTGACCTTTTCAGGTCAAATAACGCATATTAAAAGATCCACTCAACTGAAAATCATCAACAATGAAACACTAGAACTATGAAAGGTTTTTCCCGTCGGAAAACTACAATAATCTTCCTTTAGTGTGTCAAATACAAGAAATGATAAGCACTGATGTAAGCAGTTAAAAGCTGATCCAAGACATTGCTGTTCCAGAGTAAAGAGGTATCCTACCTTGGTAATATTAACAGAATCAACGACTTTAACAGCAACATTTTCTGACGAGAATTTGCATACACCAGCCTTTCCAGCATAAGGATATTCTTCTTCAGTGTCAAGACCACCACTGTATTTAATGTACTCAAAAGCCTGTGATGGAAGCCCGCCATGGCATCCAAAGTTATTAAAAGCTCCAGCACAGTCCAAGAGCTGCTGTTCAGACAGAGAGATGTTCTTCCCAAATGCCTGGGCATATGCTGCCTCCAGAGCACCAGTAGTGCTGCCAAAACATCATGCGCCGACTCCGTCATTTTTCAACTTCAGAAACCCGAATAGAATGGAAAGGATATAGAATTTACATGTTGTTAAGAATAAGTAGTTTAGGCTTAACGCATAGTTGATTAACGATTGATAGGTAGAATGCCATTCCAAACTGAAAATTTAGGAACCTAGTGACATGATCCTAACTACTCCTTACCTGAATGTCCAGCAAGACCCGCACTTGCCCTGCTTCTTCACTGGGCTTACTATCCCTGCTTCCCTCCAGTCTTTCTGAAATGACACATTCGGAGATCAAAATTAGGAACAAGAGCTACTTTTGGTAACCAACAATTCGTTCACTTAGCACATGCTCTTCTAGGCAGCGGCTGAATGGAAGGCAACTGTTCCTGAACTTACTACTACTTTTAGCTTGGAAAAGAAGAAAAAGTAAATATATATAGATAGACAATACACACACACGTTTTTCTAAACACACATATATACCAAATTTTTATTTCATTCAGATACAGTGGATGCAGCAGTACTCTGCACCTCTTGACTTAAAATCCTACGTCCAGCGTTGCTCCTAGATGCAACAGCTATCTATGTTGTCATCTCAAGAAATTCCATCATCTTGAAAGTAATAAAAGTTAAAACATAACAAAGTGATAAAAATCCAATGAAAGAAGAATGACAGCAATATTAGGCTGCAAAATGGACAAGACTAAGCATTTATGAATTTTCCTTTTAGGAGAACATACAAGATAGAGGCACAAAGAAAGCACTGAAAGCTAGATTTCCAATAACAGGATTTCCGGTTGGAGTGAATGGATGCAAATGGTAATGATTTTTGAGAAAGTATCATAAACAACTGCAGAAAAAGATATAATGGAGTTCTGATTGCAAATACCGTCTCTGGTAGGTTGACATTAGTGAGCTGAAGATCGCTCTTTGTGGTAGCGGAACAGTTTTGAGGAGCTCCTAGCCTTTCTCTCCTAAACTCATCCCATGTTAGGTCAGTAAACTCTGCCAAGAAGCATAGTCATTGTCAAGAGCAAAATACAGCAAGCAAAATATGCAGTGCCTTCTTGTTTCTTTATTTTGTTTGTTCTTCTCCTTTACCTTGACCTTGGTTTTCATATTTTTATTTTCCCTTTCTTCACTCACGTACAAGAAATCTGCTAGGAGTTCTAATTAGAAGTAACAAAAACATTAATCTACTACATTTTGAACCACAAATCAAGTTTTTGGCGTCGTTGAAGTCTTAAGAAGGCATCCACCTTAGCATCCGCACCTGGGCACAACAAGGTTGAGGGAACAGAGGGTGGACGCGCAGCATCCACCCCAGCATCCGATGCTGAGAAGATCAAGATGAGGCGGACGCGAAGCATCCAACTCAACATTAATCCCTGAAGCTGATTCGGGAAAGCAAAAGGAAAACTTTGGCCCATAACTTTTGGACGCAATATATAAGCCAAAAACGGCTCTTTTAGGTCATCGAACACACTTTTTGAAGGGGATTCGACCTAGGGAGAGCAAGGAGCCGCCGTGGAGGCCGAATTTCATCTTCTTCCGCCAAACTTAGTAATTTTTATGTTTCTTTGTATGATTTGTTGTTTGGCTACCATGTCTATGTGGAGCTAAACTTCACGTTCTAATGTTCTGGTTCTTTCATGACTATTGTTATTCGAGTTGATTTTCGTTTCTTGATTTATCATATTAGTTTATTTATTCAATCCTGCGCTTAATTATTTGATTGCTTGATCACCAATTAAAACTATCTACGAATCTAGAATTGAACTCGAAAGTGTGAATTCTAGATTGCATATAGGATTAAATAGAGCAAGTTCTTGAACCTGGGTATCGGGGAACGGATTTGCGGTTAGGATAAACATATATACCCGATTGCCTTGCTTGGTTGATTTACACGAATTTCAAATGCGTTCTTGTTAGTTCTAATTCCATAGACATATTGGCGTTAGGTTAGCTTGAATAGACGAGTAAGAACTCGAGAGATTCTTATGAGCAATATTAACACTGTCAACCAATAAACTAGATAAATTAGTTAGTCAATTCAATTGAAGAATACAATAGGAATGTTAGATAACTCATAACCCTAGATCGTTTTCATTACACTGATAATATAAAAATCAGCTCTTCCTTTGTTCAGAGTTCATTATTTATTTTCTTTTTAGTTTAGTTACTTTTGCATCACTACTTTTGGGTTTAATCCTTGTTTAGATAATTAACAAGTCCTCATGGGTTCGACACTCTATCTTATCACTTTATTACTTGACGACCGCATATACTATACAAGTCAACTTTATGTATCCACTCTATTCAGATCATTTCATGAACTATAAGTAAGAAGAAAAACCAAAACGAAAAAGGGCAAATTGTCCATAAAGGCATATATTGTCCAGCCTATAGCTAATAGGAAACCATATAGTATACACAAAGGCAATTACCATTGACACCAAGTTTGTAGGAAAGTCGTTGCTTGTTGTGAGACCTAATCATCTTCAGATTGTCCAAGTATATCTCAAACCTTTGCTTGATCTCCTCAACTGAGTCGTATCTCTTCCCATACCTTTTAAAGTTAACAGATATCCCAAAAATAAATAATTTTTAAGTAAAAAGGAAAACAAAGCTTATTCATTCAAATAGGAGGAAATTAGATGAATCGAACTGACCTGCGAACAAAGCGAACGAAGGAGAGAGCACGGCGCGTTTGGCCGATGAGTTGGAGAATTCCATTCTCCAGCTCCTGCAAACCGTCGGATACTACTACTTGCCTGATCGGATTATCATCGTCAAACGTCAACGCTCCGCCTTGTGCGGCGGCGATTGAGGTCGCGATGAGTAATAGTAATAGTATAATCGAGGCGCGAGTCAT
SEQ 33
ATGAATCCTGAAAAGTTTACTCACAAGACCAATGAGGCACTTGCTGAGGCACATGAACTAGCTATATCAGCAGGGCATGCTCAATTTACCCCCTTACATATGGCACTGGCCTTAATATCCGATCACAACGGTATTTTCCGGCAAGCTATTGTGAATGCTGCTGGTAGTGAAGAAACAGCTAATTCAGTTGAAAGGGTATTCAAACAAGCCATGAAGAAAATCCCTTCTCAAACACCAGCACCTGATCAAATCCCACCTAGCACATCACTGATTAAGGTGCTCCGACGAGCTCAGTCGTTGCAAAAGTCTCGCAGAGACACCCATTTGGCAGTTGATCAGTTGATTTTAGGCCTTCTAGAAGATTCCCAAATTGGTGATCTTTTAAAAGAAGCTGGGATTGGTGCAGCAAGAGTGAAATCAGAAGTAGAGAAACTTAGGGGAAAAGATGGCAAAAAGGTTGAAAGTGCTTCAGGGGACACTAATTTCCAAGCACTTAAGACTTATGGTCGTGATCTTGTTGAACAAGCAGGAAAACTTGATCCTGTGATCGGTAGGGATGAAGAAATTCGAAGAGTAATTCGGATTTTGTCGAGGAGGACGAAGAATAATCCGGTGCTTATTGGTGAGCCTGGTGTTGGTAAAACAGCAGTAGTTGAAGGGCTAGCACAAAGGATTGTTCGAGGCGATGTCCCGAGTAATTTGTCTGATGTTAGACTTATAGCATTGGATATGGGGGCATTAATTGCTGGAGCAAAATATAGAGGTGAATTTGAAGAGAGGTTGAAGGCAGTGTTAAAGGAAGTGGAAGAAGCAGAAGGGAAAGTGATCCTTTTTATTGATGAGATTCACTTGGTTTTAGGTGCTGGTAGGACTGAAGGGTCTATGGATGCTGCCAATTTGTTTAAGCCAATGCTTGCTAGGGGCCAATTAAGGTGCATTGGTGCAACAACTCTCGAGGAGTATAGGAAGTATGTCGAAAAGGATGCTGCGTTCGAAAGGCGTTTCCAGCAGGTATACGTGGCTGAGCCTAGTGTTCCTGACACTATTAGTATCCTTCGTGGGTTGAAGGAGAAGTATGAAGGGCATCATGGTGTCAAAATTCAAGATAGAGCTCTTGTGGTGGCAGCCCAGCTTTCGGCTCGATACATTACAGGTATGTCCTTTTTTGGATTGTCATTGTATTTTATGAATTTTACCTTTGATCTTTAATCGAGTAAAGATGCCACTACAGGAATATAGCAATGTATGTAATGTTGAAATGTGATGTGTCACACGTTTGTATTGTGGTTGTCAAAACATTTCCTAAAATTTTGAGGAGATAGTCCCTTTCCTTTATGTCTATGCAGGATGGATGTGAATCTAGTTTTATACTTAATTTAGCTGAATCACGTCCCATTTGAATGATAAAGTTATTTTCTGCTTCATTGTGCTTTTCAAGGTGATAACCTCTAACCTTTGGTTTGTAGTTTCAGACTTATAAAAGTATGATTGGTGCGTGCTCACCTTAATTGATTGGATGGGATTATGTGTTTGCTCTCTATTAATACGAATTTTCTTTAAAGCTTTTTCTCTCCCTTGCTATGGAGAATTGCTACTGTTGTTTTGCGTATCATTTGCCAGTTTGCCATAATTTTGTGCATATAGGGATTACTAATCTGTGAATTTACGTTCAGGTCGTCATTTGCCAGATAAGGCTATTGACCTAGTTGATGAGGCTTGTGCAAATGTAAGAGTTCAACTTGATAGTCAACCTGAAGAAATTGATAATCTTGAAAGAAAGAGGATTCAGCTAGAAGTTGAACTTCATGCACTTGAGAAGGAAAAGGACAAGGCTAGCAAAGCTCGACTCGTTGAAGTAAGTATACATCCCGGAAATGCTTTGACCTATAATTCTAGAACCTGTGTAGGAAATGTGGACAAATAACGTAATTACTATTTCAGGTGAGAAAAGAACTTGATGATTTGAGGGACAAACTCCAGCCTTTGACGATGAGGTATAAGAAAGAGAAGGAAAGAATTGACGAGCTTCGCAGGCTCAAACAAAAGCGTGATGAACTCACGTATGCTTTACAAGAAGCTGAAAGGAGATATGATCTTGCTAGAGCAGCAGATCTTAGGTATGGGGCTATCCAAGAAGTGGAAGCTGCTATAGCAAATCTCGAGAGTAGCACAGATGAGAGTACAATGTTAACTGAGACTGTTGGACCTGATCAAATCGCGGAAGTAGTCAGTCGGTGGACTGGTATTCCTGTGTCAAGGCTTGGTCAGAATGAGAAAGACAAATTGATTGGTCTTGCTAATAGATTGCACCAAAGAGTGGTTGGGCAGGATGATGCAGTTAGAGCTGTTGCTGAGGCTGTATTAAGGTCTAGAGCTGGGTTGGGAAGGCCACAACAACCAACTGGTTCATTCCTTTTCTTGGGACCAACTGGTGTTGGAAAAACTGAACTTGCTAAGGCTCTCGCTGAGCAGCTCTTTGATGACGACAAGTTGATGGTCAGAATTGACATGTCCGAATACATGGAACAGCATTCTGTTGCCAGGTTGATTGGTGCTCCACCAGGGTAAGGACCCTTTAACTATTGATAGGATAAAAGAACAAATCATACTTTTACGAGTAAACTGTATCTGCCATAATGAGATTGTGGATTGCACCTTTTGTAGAACTCTGTAGCCTCATATTTGTCTAGGTACTTAATAGTTTTACGTCTGAAGTGATGAATGCTGAACATGTTATGTGTGTGCAGTTATGTTGGACATGAGGAAGGAGGACAACTCACTGAAGCTGTGAGGAGGCGCCCTTACAGTGTAGTGCTTTTTGACGAAGTGGAAAAAGCTCATCCCACTGTATTCAATACCTTGCTCCAAGTGCTGGACGATGGACGATTAACAGATGGCCAGGGTCGTACCGTTGATTTTACTAATACAGTCATCATTATGACCTCAAATCTAGGAGCAGAGTATCTCTTGTCAGGATTAATGGGCAAGTGCACCATGGAGAAGGCCCGCGATATGGTCATGCAGGAGGTAAGCTAGAACAGCCTATTTTCTGCTAATTTTCTGAGCATTGTTTCCTAGTTTACATCTTTATTTGAGGAAGGATTGTTCACATATATCTTTTTGTGACAGGTGAGGAAGCAGTTTAAGCCTGAGTTATTGAACCGGCTAGATGAGATTGTAGTGTTTGATCCTTTATCACACGAGCAGTTGAGGCAAGTATGCCGTCACCAACTGAAAGATGTAGCAAGCCGTTTAGCTGAGAGGGGTATCGCCTTGGGCGTTACCGAGGCCGCGTTAGATGTCATACTTGCTCAGAGTTATGACCCTGTAAGTATCACCATCTGGTATTTCAACCTGACATTTCATGGTGATTAGACTAGGGTCTGAGTTGAGATACCAACTATGCAGATTTTTGCATTTATCTTGCTGTGGCGGGTTACACTTGTTTTTTCAGTTGCTAATTTCACTTATTATGGAAAATTATTTGTAGTTACATTTTAGGTGATCTAACATTCTAAAAATTATCTTAGAACCGTTGGCGTATAGAAGCGAAATACTTTTGACAATTGATTGTGCTAACTTTTGTTACAATTACATCACAGGTTTATGGTGCAAGACCTATTAGAAGGTGGTTGGAGAAAAAAGTGGTAACTGAGTTATCCAAGATGCTCGTGAAAGAGGAGATTGATGAGAATTCTACCGTCTACGTCGATGCTGCATCCAGTGGGAAAGATCTAAGCTACCGAGTGGAGAAAAATGGAGGGCTTGTCAATGCTGCCACTGGGAAAAAATCTGATATATTGATTCAGCTCCCTAATGGAGTGAGGAGTGATGCTGCTCAAGCAGTGAAAAAGATGAAGATTGAAGAAATAGTAGACGAA
SEQ 34
TCACTTGCTTTCAGGTATGATACTAACAAGGAGACATACTATGCCAGTAACAACAGGGCTCGCCACACTTGTGCCAGATAATCGTTTACAACGTGTGCTGATTTTGGATCCCATAATCTCACGCCCATATGCAACAATGTCTGGCTTTACACGACCATAACTGTCAAGGCACAGTACAGTTAAGAATCTATGGTTTGATAGACGAGTGTAATTATTTAACTGTTATTAGAAAAGCAAGATCCTAAAATATTCAAAGAAAAAGAAAGAAAAAAAAATGAAGCAAAGAACCATAATTTTGAACTTAATTCTTTTTCAAGAAAAAAAAGAAGCAACAATAACAACACCAATAACAAGCCCAGTATTTTTCCACAAGTGGGGTCTGGAGAGGGTGGGAAGTACGTACCCTTACCCCTACCCTAGAAGGACAGAGAGCTTGTTTCCGATAGACCCTCGGCTGGAGAATGGATGACAAAAATAATGGCAACAATAAGGAATAACAACAAGATAAAAATACTGAAGCCAAGAAAGCAGCTAAACTCTAGGTAATAATAGCAATCTATGAATAAAAGGATATCATACTAACACTGATGCTAGCGAACTGGGAAAGACAAAGAGATACGTTCGACTACCTACTAGCCTTCTACCCTAATTCTCGACCTCCACACCCTCCTATCTAGGGTCATGTCCTCAGTCAACTCCAGTTGCGCCATGTGTTGTAACCTCGCCCCAAGACTTCTTAGGCCTGCCTCTACCCCTCCCGATACCCATTGTGGCTAACCTCTCGCACCTTCTAACTGGGGTTTCTATACTTCTCCTCTTAACATGCCCGAACCATCTCAACCTCGTCTCCCGCATCTTTTCCTCCACCGAAGCCACTCCCACCTTATCCCGAATGATTTCATTCTTAATACCTAGTATGCCCACACATCCATCTTAACATCCTCATCTCAACTACTTTTATCTTCTGGACATGAGTGTTCTTGACCGGCCGACACTCTGTGTCATACAACATAGTTGGTCTAACTACAGCCTTGTAGAACTTACCTTTAAGTCTCATCGGCACATTTTTATTACACAAGACACCGGTAGCGAGCCTCCACTTCATCCATCCAGCTCTGATACTGTGTGTGACATCCTCATCAATCTCCACATTACCTTGTATTATAGATCTAAGGTACTTAAAACTACCTCTCTTAGGGATAACTTGTGTATCAAGCTTCACCTCCATGTCCGCTTCCCGGGTAACGTTGTTGAACTTGCACTCCAAGTATTCCGTCTTGATCTTGCTCAACTTGAAACCCTTAGACTCTAGAGTCTGCCTCCAAACCTCCAGCCTCTCATTAACACCACCCCGACTCTCGTCAATCAGAACTATGTCATCAGCAAATAACATACACCATGGCACCTCTCCTTGAATGTGTTGCGTCAATGCGTCCAAAAAAAAAAAGAAGCAAAGAGCTTAATTGTGACTTTTTTCTATTTCATGTTTACGGTTCATCTTTCTTCCTTTCCTTTTTTCCTTAGAAGCTGAGTGGATTGTACAAGAGGCATTCAACAGATGTCATGCTCCTATTCATCCATAAAGTTTTTGCCATTTTCACCCATCATTTTCCACTCAGCAGAATTTTACTCGAAGCATCACAACCATGGATAGAATAAAGCTCATAGAATGCTCGTTTGTTTCACAAGAGCTGATTTTTAAAGGCATCTTTTTTAAATGAAGTTGGTACATCCAACACACTCATTGACTTCCTATGTGGTCATATAGTAGAACACAACTTTAATACAGAGAAGGAGAGGGCAGAAAAATAAAGAATACAGACTATACTCCATTGCAAAGAGTAATACATAGCAAAGAAAGGAGAAAGAGATACCCATGAGGAATCTCCCAGGTACTCATTCCACGTGAAGAAAATGAGGCTAAATGATTACTTTGATCAATGGCACCAACACCAATAACATCACTTTGATCAGCAGGATTGTTAAGAGTACCATAAAGTGGTCCATCATTTCCAATAGCAGAAACCATGATAATATTGTTGGCAGTAAGCTCCCAAACCTAGCGGAAAATATTGATTACGTCCATCAATATAAAGCAACTATGAGGAGAGACTCCAGAGAGTAAAGGTGTTAAGACAAGTAGAAATAAGACTGAATATATGCATGCTAAGTTTAGCAAGCATGAGAAGAGTGAAGTTGAGGTAAGATTAGATAAGATTCCCATACCTAAAATGCACACAATTCAGATATCTAGACTCATTTTCCATGAAATTAGTATGATCCGTGAAGATATCACATCAAATTTGAATAGAATTGGTTGTAATGGAGGTGCGCTACAGAGGGTTATGTGATAAGAGGATACCTATCAAAGTTAAAGACAAGTTTATATTGGGGTAAATGTTTGGCTTTGTATTAAAGATATGAGCATCACCAAAATGCATATGCAAGATGGGTATTATAAAAGTTTCACAAGATAAGAAATGATCAAATTTGACAGAACATAAAGGATAAAATAAGAATGTCGTTTGAGATGGTCTCATCATGTCCTAAATAAATCTCCAAAGGCACTGGTCCTTAGATGGAAACCATGATGATTGGAGGTGCTAAAAGAGATGTATACCTAAAATCACATGGAAGGAAGTTGTCCCAAAAGACTTATAATCTCGTTGAATTCATACAGACTCAAAACAGAACACAACAGAAGCAAAAGTCTGTTATACGCGATATCGACTATTAAGAATCAAGGTGTAGTCATGCTAGTGCACTTACTTTAGGTCCAATGTCTACTAGGAATCTTTTTAGTCTGTCTGCACTTCTGAAGTTTGTATGTCAGTAGAAAAAGAACCTCCAGATTTATAGATATCCAAACTACTGAATCTTTGATGACTCCAAATGGAGCAGAATGGATGGTGAGGATTCATTAGCCAACCCAACTAGCTTGGAATTAAGGAGTAATTATTCGTGTTGTTGTACATCTCATCAATATAAAGGTGAAAAGTTCTGCTAATGTTGTTTCAGGTCCTGCTGAAAGTAATGTTAATTTCAGTGAAATGACAGGCTTAAACAACTCCGAATCTCTTTTACAAAATTGAGTACTAGATATAATATACAACCCTTGTGTTTAAGACATCCATGACATAGTTCAGCTTGCAAAATTAATAGATCTCATGAAACAAACGCCCCATTAAGCTCAAGAAAGCCAAGTAAAATCCATGCACTGCATAAGAAATATAAGATTACATGCTGCCAATTATAATCAAACTTCTAATACTTCCGAGACCACATATTATACAGAAACTTAGACAATAAGGGGTTATGGAACAACAGCAAGATCATTTCAATGCCTATGCTGTAGACAAAATGCAATCCAGTATCATACCACATAAAAACAATAAAGATATAAACCAATAGATAAGTGACCTCACCTTTTCCACAAAAGGGAGATCCAAATAATCAGGTCCACCTATGCTCAAATTCAGAACATCCATGTTGGTTGCAATTGCGTAATTAAATGCATCGAGAAACCACGATGTGTAAGAGACCTGCAATAAACTGAAGAGCCACTTCTTATAATGCTAAATTGGTCATTACAAGATTGATCTTTTTATTTCTAACTTTTTTATAGGTCGCCTAGCGTTGTCCTTGTCTGTAACAGTAGCTTTAGTACATGAGTTAGTGTTATTTATGTATTTTCGTATTCCTTGACTTATGTGATTACTTGTCGTTGCTTTCGTTCCGGCCTTCTAATTGCAATACTCAGTTTTAGTTTTGTTCCTTTGTATTTTTTGCTTCGGTTTTCTAATTGGTGTGCTTGTTGCTGCTCTTCCTTTTATCTTTCCTAAACCAAGGGTCTTCCGGAAATAACCTCTGCCTTCTTGAAGGTAGGGGTAAGGTCTGTGTATGTACTACCCTCCCTAGACCCCACTTGTGGGATTACACTGGGTCTGTTGTTGTTGTTGTTGATAATGATGGTGTCAAAGCAAAACTTGTCTCGACTATTCCAAGGATACCTGCAACCTCCCACTAGCACAGGTACCGGGTATCTCAACCCACCAAGGCTTAGGCAGATGGGTAGATATCACCTAGCATTTTTTATCTAGGCAAGGATTTGAACCATAGTCTCCAAAATTTTAACCCACTTCATTGAACGCTACCCAACACCCTTGGGTGCTACAAGATTGTTCCTTTTTGTGTGAATAGACTCTCTTTCAAACCCCAACATCAAGGATTCAAACCCATCGAACCCATGATGTGCGTCTAACTCACACATCACTTGTTGCGCTCTTACCACTACACCAAAGCCCTGGGGGTGAATACTCCATATCATCTTGATTGTCCTTACTTGCGGATATGGTTTGTAGCTACTGAAATCAATAGATTGCACAAAGCCAATGATAGGTAGCTTAATTGGAAAAACTGAGGTCCAAGAACGATAAGGTCTTGGCCTCAAAATCTCCATGAAAGGAAAATAGTATTTAATGTGTGCCTTATGTAAGTAATTTTGTTTTCCAATTTACTATTAGCAACACTGTTACTTGTTATAGTATCTCAAACAGCGCATACAGTTTTTATAATATTTCAAACTGCTTATACCTCCCCAATAGTGGGCTAATACTAAGTATGGGCTTCCTTGAAAACAAAATAGGAATTATAGATTGCACATAATTCGCAGACAAGTTCCTGGCTTTTCTAAAACATAAAAGTAAACAATGTCCTCCCTCCCCAATCCCCTGAAAAATAGTTGCAATCTTATCACTAAAGTCATAATAAGATGGCAGAAGAAATATTATATGTTCAATAACATAGCATGTAACATGGACTCCACCACTAAATCCAATCAGTGGGTTTGGCCGCTGGAAAGGGTATGAGAAAACTGTGTATATTTAGGTGCATATTCTTCCAAGATATTGTAGCTTAATGATGAGAAGTTAAAGCTAGCACAAAATAAGGTGCAGAAGCAGAACTTGTCATTTACAGAGACTAGGCAGTCTAAAGTATTTTTTCTTCCATTCCAGAGGACTTTTCACTAAAAACTATGACTGCAAGAATTTGCTATATTAGGTTCACCACTCATGAGGTGGATGTGGCACACTCTACTAGCAGAAAACTGGAAGGGAACGGGGGAAGGATCTTAACACATCAAGTATTTGCTTTGCTGCAATTAACAACGAAAGGACCGTTTGATCATAGGAATCATCATTAGCACTCAGCAAGAAGCAGACTTGTATAAAACATCAGTACAATAAATTAGAGGCAATAATCCAAGACATCAGATTGTTGAAGATCTTCAAGTCTCAGCTTACTTAAACAGTTTAAGAAAATAAAGCCCCGTCACCCCCCAAAGAAAAGGAATTGGAATACTCGTTCAAAACAATCCATTACCTGTGCATCTGTAAATACATGGAAAGCATAGATTTCCGCATCTGGAGCAAAACCGAGGCATTCTTCATCCTGACCAGCAATAACACCAGCTACAAATGTCCCGTGTCCAACATTGTCATTCAATGTATCTTCGTTGGTCCAATTTGTGCGTTCCTGAAATTTAAGGTACCAAGCCCACTGTCTGTTATTAATGACGTAAAGATGGTACGAAGATATATACATTGGCATATATGTAAATGTCAAGCCTCAAACCACATTGGATTCAACTAAAGCATTTCTTTGTTACAGAAAATATATTTTTAAACGACAAAAGACAAGATACCTTGATATTACGAAAATGTGGGTGATCTGCACGGATGCCTGTATCAAAAATTGCCATTTTGACCTTAGCACCAGTATGCCCTTTTGACCAAAGCTCATGTGCCCCAAAGAGGGATGTGACTCGAGATTTCTGCAAAATAATCATACAGGACAGACTGTATATCAAGAAAAATACAGCGAACAACAACAATATGTTCAATCTAAAAAAGAAATAGAAAATAGAGCAGCAACATGAGACCCAGACACTCAAAGAATGCAGACCATGTTTTTCAAAAAGGAAGTATGCCCTGACAACCTTGAGGAAATAAGAATGACAAATTATAAACCTATTAACACTCTCCGATCTGAATGTTACGCTCCAACGGCCAACCAAGTTAAAGTGAGCCACTTAATGATCTAAGGATATATAATCCATTCTCGTTAGAGGTACTTTAGTGTCTTGAAAGACTAAAAAAACAAAACATGGTCATATATCCAGTGCAAAAGAGAATATTGGGGCATAGCAGAGACAACTTGTGAAATTATATGGATCACAGGGTCGCACAAAGATTAAACTTTATAGTGATCAGTAAGGTGCAGCTTCTGTGTAATAATCAAGTTTCCCTTCATATTGCGTTGAATTCAGTGTGTCATGAAAGATATAAAAGAATATTATCAATTGTAACTTCGTCACAGAGAAGATAATCTCAGAAGTCATTTTCACAAGTTCGTGAAGTCGAATGCTTAGATTGTAGATATATCCACGAGTTCTTCACTAGTCCCTGAATTAGTCACATATGTAACAAGCACTAGAAAGGGACTGTTAGATAGTTACGGGAAAATAGCTAAATGTAAATACTTATATTTATTATAAGTGTCCCACTTCGGGAAAACACAGGTATAGATATTATTACATTGTCAAGTGTGCCCATAAAAGGAATCAGTTGTAAGATATTAGTCTTCAAGCATTCTCTAATCTTTCTCTTATTTTTCTCACTATGGAGTCTCAGCATCATGACATATTAATGGAATAACAGACACTTGATTAGACCAAAATAGGAAAAGGACAAAGACAAAAGGGAACTGAAAGAGATTAATTTCCTTTGAACATATACCATGCAATAAAGTTGCAACTATCATATGTCATGAATGCAAAGAGAAGAGTTGCATACATTCCAATGATGTAAATTATCAGTAAACGCATACAAAATAAAACACAAATAATCAGTGGTCTTGCCTGCATCAATAGATGTCTGCTCCAGCTAATTCTCATTATACTAGTGTTGGCCACCGCATAGTTTTGACCTTCACTGAAGGACATAGCAGTAAAAATCTTTCCTGGCCTCTTCTTCCCATTGGCAAAAGCCCCATTCTTCTCACTCTTCTCTTCAAGAACTATCCTTTGATAGCTCAAATCCAACGAAACGTCTTTTACAAGATTCATTTTTCTGAACTTTTCTAGCAAGAGTTCTTTCATTGACTCGTCGATTTCCACCAATCCAAAGTCAGTAGGAAATCTCGCAGCCGGATTTTTCCGCTCAATCCATTGCCAACCCTTAAATTTCAAGTTGTTTTGAAGATAATTCCAGTGATCCTCAGGTTCCTTATAATGATAGAATCGAACAATATAATTTCTGCTATCAGATTGTTGCTTCTGGTCATGTTGGCACTCATCGGAGCTACTAGAAATTAATGGCTCAGACTCTATTGGTGGGTTGAAGCGGATGAGTGTATATACCGGAAGGAAGGGGACAAGTGAGAGGGTGAAGAATGATTTCTTAGGAGCTTCAGGCAT
SEQ 35
TCATATTGAAGCGACCAAGTCTTCAGTCTCAGTCGTCTGCTGAGAAAGGGTGCCTCCAATCCACCTCTTGAGCATCTCTAAGGCAACTTTAGGTTGATCCATTGGAACCATGTGTCCAGCATCGTGAACCTGTTATAAGACACCAAAACAGTTAGCTCAAACATCCATCAGTAGAATTTGAACAATAACATCGACAAAGAAAGGCACCTTCAGGAAACTCAGAGGCCCATGGCTTTTCAACAATCCAGCTTCAGAACTGTCAACTTCAAAAGGAACATCGGGAGATGCTACAAACTCTTTCTGACCACTCCATTCCATAGCCTGAACCCATCTTGAGTTACCTGGTAAAAGAGAGGCGTTATAATATCCGAAATATTTATGTGAAAAGTTTCCATCATTAGGCTTAGAGTTGAGTCAAAGCTTACCAAGCCAGTTGCAAATAAGATCATATTCTCCAGCATAAACAAGCAACTTTATTCCATCCTCGAGCAAGGTTGGAATGCCAGCCTCAAGATTCCTCATCCAATCAACAAGCATGGCCTGGTACACAGTAGTGCTGCATGAGACAAACTCTATATCCTCAACTCCAAGAGCCTGCTTAACAGAGTGCATATTCAGCAATTTCTCCATGTTTGAGAAGTCATAGCAGAGTGCTCCAACGCATTTCTTTCTGATGTCGTAATGCTGCAGGTGAAAGCTCAATGGATCAGAAATATGGTTAATCAGTCATTTGTTCCAAACTTTGGAAGGCATGCCAATGACAATGTGACCTCTTGAGTAATTTAACATATTCAACATGAAATGATATGGAGTAGCGATTTAGAAGAAATAGATTTCTGGGATCATTTCTACTCTTTCTGAGGCTAGTAACACCTATTTCTCTACGAAGTACAGAATAAGTGTAATAGGCACTAGAATTAGTAGAAAACGGGAAACAGACAGAAAGGGCTGATAACTTACATTGATGTCAGCCCCAGCACGTGCACGAACAGCAGAGAATATAGAATTGCAAACAAAATAGGCAGCCAAGCAAGAGATTTTCCCATCAGTACCTAAAAGAATAAAAAAGACAGAGAAACTGAGAAGCAAAACAAAGTACAGAGAATTGATTTGCTGTGGTCAAAGAACATCCATTTACTTCATCAGCTCCTTCCCCTTTTCTTTTTCACCCAGGGAAAGCCCGAATGAGTCAAATGATATGGAGGAAAGAAAGATAGTAAACAGTAAATTAAATAATGTACCACAAAGGTTTATTGCAACTTCACAAACTGGAAGTATTTTGTTGATACGATCATGATCAGACTTTGAAATTAATCCCATGTCCAATGCATAGTCAGTATACGCAGCGTATTGTATTTTGGGATCTGTAAGCCCATTCCCAATGGCAAATCCCTGTTCAAATACTTTCAATGAAGTAAAGACACATGATTAAGGAAATAAGAATTCAATAACTGGGAAAATGAGGTACCTTTAAGTTTATATGTATTCCTTCTTTAGCCTTGTTTCCCTTGTGTACTCTAGCAGCAAAAGCAGGAATATAGTGCCCAGCATATGATTCTCCAGTTATGTAGAAGTCATTCTTTACAAGCTCAGGATGCTCTTCAAAGAAAGCCTACACCATTATTAATGATCATACCAACACAAACAAGTCAGGATCATATTATCTCTGTTACGCATCAATTTAGAAAAATGCTAACTGTGTCACATAAAAGAACGAATCTGAAATAGCCAATGTGTCACAAATGCTCAAGAGAATTCATACCACATTCGGCCTTCAGAATTTGCAAAGCAGAAAAATACAACAATAAAAGCAACATATAACAATATTTCCAACTAGAAATCTGTTGAAAAATTCACGTTCCGAATAGGTAATGTATAGTCTTAAGGCGGCTAAGCCAAGTTCTGCTAAATATTCTGGTTTAAAAGCTGTTATACATGCTAACAAAATGCATCATGAGGGAAACAACTGACCAACAAGTTACCCAGCCAAAATTCAGGGATCAGCTGCAGTTTGTAGTAAAAAACAGGAAAACCAGCCCATGAAGAAGGGTATTGAATACTGCAAAAAGGTTGAGGGACAGGGTTTTCTAGCAATGTGATCACATCTTTTGCCCCTAATGCATTTGGCAGATAAATGGAGTCAAAATATTTTAACGCCTCCATTTTTGTTGGGATCAGAACCCAGGCATCAACCAACTATCATTTCATAAGCACAATATAAGACTCAAGTCCTAGTATATGACATCTCTCCAATTATCTATATGGTAAAAGTATTAAGTGACCATGTTTCTTTTGACAAGAGTGGGTTGCTCTAGTGGTGAGCACCCTCCACTTCCAACCAAGAGGTTGTGAGTTCGAGTCACCCCAAGAGCAAGGTGGGGAGTTCTTGGAGGGAGGGAGCCTAGGGTCTATCGGAAACAGCCTCTCTACCCCAGGGTAGGGGTAAGGTCTGCGTACACACTACCCTCTCCTGACCCCACTAGTGGGATTATACTGGGTTGTTGTTGTTGTTCTTGTTGTTTCTGTAAGGCTCGAGTTCTAGCATTAGAAATCAGCCTTTTGAGCTCCTGTAGACCTATTGTACTGTACCCCGTCTTTGTATCACATGTACACAGGTGATCAACACACAGACAAGAGAACTGACAAGCAAACCGCACTCTGAGAGTCCGAGCAATTACTTGAAAGGGATCCCAGACAACCCTTACTGGCACCAGTACTTTAATTTGTCACCACTCACCAGGCGGCCAATTCTAGACAGGTCAGCAAGGCGATAAACAGAGGTGTCTTAATCATCACAATATCATCCCGCACACAGGAGTGGGGATCAGCAAACACTTCTTTTATAACTGAGAAACTATTGTTTCCAGCATTGAAACTGTGGATAATGAGCCTGCCCCTGTACTTTCTTCAATTTTTTTCAGAACAGGATTCAAACTACTGACATGCGCCTACCACACATCCCATGTTCGAAATTGAAACCAAAGCTCTGGGGCACTGGAGATTAGCAAATAGGCTAGGTGATTATAAAGATATCATTCAAGAGTTCTCCTACTAATTCACGGTACTTTCTACAAACCCCTCCCTCCTTCCACAGTTGATCACAATAAGCTTGACTACTGACGTATATGTCAATACCACAGCCTCTGTGAGATAGAAAAGCTTCCATAATGACTACTTGAAAGGAGACCAAGGGGGTTTAGAAAGTATTTATCATTCTGTAAGCTACTGCAACAATAATGATTTTACTTAACGGAAAGGAATGCCATAAATGAATTTGTATTCTTGAGGATGTTCACAACCAGGACTGAAGTTGCTTCCACCCCTAGCTACATTCTTTATCCGTATTAAGGAAAAGTTTACCATCCTTTTTTTCCAGGTGAAATGTTTTATTGGCCTTTAGAAGCAGGACAAATTGTCCAGTGCAAGCCTCATGAATAACATACATGAAACTAGGAATTGATAGGTGAAGAAAATATAGGAAACCATCCATATCATTAAGTTAAACAATTGACTCTCTGTCATCATACGACAATGATTACAACGGTTGATTCCAGAGGAATAAATATGGTTCCAAGTTGCTTTAGGGGTTTAATTTCACACAGAACAAACCTGTAGGAAGTCATACAAGTCGTCGCTAACACCTGCTTCACTGTGACGGATGTCATGTCTGTCAGAACTGTAACTAAAGCCAGTACCTGTAGGTTGGTCCACATAGATAAGGTTTGATACCTGCAGAAATAACACGTCAACATCATTTCTTCAGGNCGTCAACATCATTTCTCTAGGTTTGGAAAATCTATTAAGGTATTCTAGTGTCCTTGTAGGAGAAAAAGGAAAATCGATAATAAAAATGAAACATCTACTTTACAAGGAACAAATGTGGAACACAAGGCAAACTTGACACTCTAGGAGTCAGCAATAAAAGACCCAAACCACAAAAACCAAAACTCAAGATCTTATGGAACATAAAGCACTTTCCTCTCTGCATTCTTGAATTGCCGTGCAGGTAATTTTTTTCAGTAATGAGAAAAAAGAACATTAAAACAGGCAGAAGCATGACATGGAATTAGGGAAGTGCAGTATCAGAGGTCTAATGAAAAAAATATGGCTGACATGTTTCCTATGCAAAGCATTAAGATTTCAGTAAAACACAAAGCTCCTTCCGGGAAAAAAAGTTTTCTTGCAGTCTGGTTGATGATAACTACATAAATGCTGAAAGTGTAACTATCTAAAGGCTAAAAAGGACATTTCTCCTGACAGATGTATCATGTGCAAGAAAGAAATGGTCCCCACAGATTTTCTTCACATTGATGAAGTATCAATTTGTGCCTGTGAATTGGCATTGCATTCATTTCAGGTTGGTTACTGGTACATCTCGAAGAAGGATAATGGTGAAGCTTGGATTATCAAAGCTTAATCACAAACCAGTATCCTTATACTTATGATTCTTCGATTTTACACAACATTGAACTAGATATAGACATGTTATTAACTGTTTTATCTTGATTTCCTTTTCAAGTAATCTTTTCAAAACTAAAAATGATAATAACAAAGAGGAACATCCACAACGAACAAGCTTTTTCGCTAATTGGCACTTTACAAATAAATTAGCAGGCTGCAAAACTACTTATCATGAAAACGACTGGATAAAGGACATGCAAAAGATGTTGCCAATGCAAAATCTAATAATCTATACGCCCATGCACCATTATACTCTTAACTTTCATGTACCATTGAGTAAAAGCAAGAGAAAGTCTTATACTATCGAGCTGAATCATCTACTAAAGAGCAAAGAAGGTAAATTGCTACTATTGCTTAAGCAAAGCTGTGTTTACATACCACAATAAACTGCTGTTAAATAGCATAAACCATTAAGACTATAAGGTGGTCTTAAAGATAGTCAATTACCTTGTCCCACCCATATTCATTCCGCACAAGTGACAAATTATTTGAAATAGAGAAAGGTCCATTTTCATAGAAAAGGGCCAACTCACTGCTGCAACCAGGCCCTCCACTCAACCAGATGACAACAGGATCGTCCTTACTACCGCGTGATTCAAAGAAGAAATAGAACAACCTACAAAATGAAAGATAACACAGTTTCAAGCTAAATAAGAGCCCAACCAACAATAAGAAGAAGCTAGTTGATACGTTAGACAGGAAGACGAAGATATAAAGCCTTATTACTCCGGAGATGTATATTTTGGTAAATGCTCACGAAAGACAAATATTTAAATGAAACAATGTTGCAGCATTCCTTTCAATTCTTACAACTGATTTGGAAACTTCTTAAGCGCACTGATTGAGTTAAACAGATAAGTGGTCAAACTGTAACAATCTGATGGTTCCCCACAAAAGCTACATGTAATGACATTTGAAGGTAAATAAATCAAGCAGACAGAATATTTCTCCATTAATCAACCAATCAAATAATTCCAAATTAATTGGGTTGACTAAATGAATACTCTTTAACAATTCTGCTATATTTAAATCCAATTTGTTGCAATATTAAATAAATATCCTTTGAATTCATTTCCACACCAGTTACAACCGTTTTATTCGAATTTCCGAAGTGGAAAGTGAATTTGTAGGAGGATAGAGGGGAATGCAGGGGTGCACTGCACGATAGAAGACTACTGAGAACTTGGCTACATTTTGTAGACAACAAAAACCTATTACCCCAGTACACCACAATGCAATGCTGAATCCGACAGTATTTTAACATAAAAGAGAAAAGAAAAAATTGAAACTATAATTTGGTTGGATGCCTATGATAATGCAAATCGGAACAAGAATCACCAAAAACCAGGAAGTGTTTGAAGCAACAAATATGGTACTAGTCTGTTCCAATTTGATAAACTTTCTATTTTTGGGACGTCCGGGAACATTTGACACCAGTTTGCTAGTATCTTAACAATTTTCAAGATACTCTAATTCATTTCTTTGAAAAGAATAGACATTAGGCCTAACTCAACATCAAAAGCTAGCTCATGAGGTGATTGATTGTTCATTCAATATATAAGGAGACAACAGTCCACTCACTCCACCAATTTGGGACACTTTAATATTTCCACACGACGAGGCCGTGGACAACTGGAGTGTGGACAGCATAACTTGCGACCCCAACATGGAGAAACACAATGTCGACCTTACCCTACCACAACCTCAAAAACTAGCTCACGAGGTGAGGAACGGAGGATTGCTTGATACCATAGGAGACAACACTCCATTCCCTCAACCAATGTGACACTTCAACATTATGTGTTATTTTTATCGAGTTACATTTTCTACAGTTATCGAAATATGTTTCTTACAATACTACATCGTACATACTCTAATAATTAAATCAAATCAACATCTTAAATTCCGTGTTCAACTAAACACTACCATAAAAATCGAGACAGAGATTGAAAAAAAAACTTAACCTAGGGTTTGAAAAGGTACCTAGCAGCATGAGAATGCTTAATCTTATAATAACCGGCATGATGCCCCAAATCTTCAAAGGATATAACACTCGAATTCGTCAAATTAGCGAAATTAAATCGCTTCTCGACGATTCTAGAAGCAGCGGTGGGAAATGGATCCCGATCGACAATGTTATCGGATTCTTTCGGGAACAAATTTAGCTCGTGTATCAACTTCTCAGCTTGCTTCGATGCTAACTTCGAAGATATTGAAACCTTCGCGAATGAAGAAGGAGAAAAAGCAAGGAGAAGAACAAGAGAGAGAAAAAGGGAAAGTGAAAGCTTCATTTGCGCCAT
SEQ 36
TCAAGAAGAAGGGGTCTTCTTTCCCTCATTTCCGAAGGTTCCAATAGGTTTGGCAAGAATGTGCTGCATTGCTTTGACTCCTAGTGGATTGTTCTTGCTCTGTACCAACAACATTTAAAACACAATCACGTAAGTAAATGAAACAACCATATCCTTCTGAACAAAACTGTTAAAAGATAAACCTTGGCCTAACTAAAGGGCAGCACGGGGCACTAAGCTCCCGCTATGCGCGGAGTCCGGAGAAGGGCCGGACTACAAGGACCTATTGTACGCAAACTTACCCTGCATTTTTGCAAGAGAATGTTTCTAAAGCTCGAACATATGATAGCTACTTTACCAGTTAAACCTTGGCCTAACTCAAACAAAAATCTAGCTCATGAGATGAGAATTGCCAAAGAACCCATTCCCTTAATCAATGTGCGACAATCTAACAATAACAAGCAGAGATCACTTACAATTGAGCAGGTGCCTGCTTTAACATTGCAGACAGGATAATCATGTGGGCAACAACTGTTATGGTCTTTACAGCAAGTAGCTCCTTCCATGGGACAACAACCCCAAGCAAAACAGTAGTTATAATACTTGTAGACACAGCAGCACGTTGTTCCGGCTGGGCATTCGTTATAATCATCACATTGAGTGGGTGGCTTGACTGGTGATGGAGGAGATGGAGCTGGTTTTGGGGGGTTTTGGCCTGTCTTTACAGGGTAAGAAGCAATTGTAGCAATACCACACAAACCTTTGGGGTTGCCAATGTTTCGCTGCATCCTGAGGTAACCATTTTCTCCCCACGAAGCACCCCATGAGTTCCTCACGATCCAATAATCCATGCCATTTTCACTACCATATCCTACTGCAACCACACCATGGTCCACTGCTGCACCACATTTTCCGGTAAAGATACCCTGGAGCATAGGCGGATCCAAGATTTAAACTTAATAGGATCAACATTTAAATTTTTTAATACTGAACTCATTGTGACTTTGAAAATACAGAAATATTTGTTGAATCCGTGTAAATACTGGCTAATTCGATCAAAAAGATAACAACTTTATTGACAATAGGCAAATCTGAAGTAATACCGATTTATAGTGCTGGAAGTCTTTGCCGCCAGCTTCGATAGCAACGCTGACGGGTTGACCCGCGACGGCCTTTTTCAGTGCCTTTTCATCATTAGCAGGAACATCTTCATACCCGTCGATGGTGACAACCTTGGCATTTTTCTATAATTGTTTCAAACAAAATATTAATAGTCATAACTTGTATTTAGTTCATTTCCAATAATCCCTTTTAAGACAGAGAACACCATACTGACCCTTGCTTGATCGCATTTTCCATCTTTGGCTTTGTAGGGGTAGTCTTCTTCAGTGTCTATTCCTCCATTTTGAATGACGAATTTAAAGGCATCGTCCATTAGACCCCCTTGGCAGCCTTGGTTATCGGCAGTATCACAATCTACCAGCTCTTGTTCAGATAACGAGATCAGATTACCTGTCATTATCTTGTTTACTGCTTCAATTGAAGCAACTGCTGAGAAAGCCCAGCAACTCCCTATTTCATTTCCCAATTGTAACACAATTAAGACGAAAATCATTAGTTGCTTTACGTAATATAAAAATATCATTATTTCTTCAGCATTTATTTATGCATAAAATGGAAGATAATCCAATCAATCAAGGCTGCATAATTTGCCAAAACAATTGCACAAAAATACATACTGATGAACATAGTGTTTTGTTCAAGGGAAAAAACATTACTTATATTGCCATAATTAGATGCGGAAATTGGAGCAAAAGTCCTGCTAACCTAATTTATCATACTTAACAACAACAATAACAAATCCACTGAAATCCCATCGTGTGGAATCTCCTTACCCCATCTAGATAAAGTATAGAGACACTTTCTGACAGACCCCAAAAGATACAAAGGTAAAGAGGAAAAAAGGCCCACGAAAGGTAAAAAGCAGGGAATTAAATAACAATAAACAGCAATACCAACAAAGCAATCAGAAAATTTAAAAATTAAACATGCTTAACCTTAGCTAAAATTAACAATAATGAAGGGTCAACTGAACTTGCCATTGATCTTAGGTTAAAACTTATGACACTGTGATCATTTGAATCCCTTACCCTAGCAAACTACTAAATTTTCCGAATTTCCATCAAAGCCATCAGTAAAGAAAGCATTGACAAATGTTAAACAGAAAATAAGTATTAACAAAAAGGAAAACAAAAACAAGTAAGAGGTGCAGAGATGAAGAAGGCAGAGTACGAAAGGAAAATACCACATTGTCCTTGATTTTTGACGTCAACAAGAACACCTTTCTTCCTCCAGTCAACGGAATCCGGCAAACTATCTCCGACCTTGGGGGCATAACGGTCACTTTGGGTATACGACAACCTACTACGACCATCGGGCTTAGTACCCAAGTAGATGGACTTGTACTCCTCGTTGGTCAAATCTGCAAACTGAGTCAAACCCAGCTTGTAACTTTTTTCAGGCGCAGAGTTCTGTTCATCGATGTATCTAAGGTTGTCCTTAAAGATCTGGAACCGCTTGTCCTTTTCTCCTAAGGCGTTATACACTTTTTTATGTTCAACTAGCCAAGATTCATACAAAGACACGATTTCATCGTCTGTTCGCCAGACCGTTGACTCGCCGTTCGTGTGATGTTTTTCGTTGTAGCTTATAATGGACATGTCCTCCGCCGATGATGTTACGGCGGAGAACATGAGCATTACGAGTATGGAGATGGAGAGGGTGGAAGTATGAATCGCCAT
SEQ 37
TCAGGAAGGCGAAACAGCAAGAGGATTCAACAAAGTGCCCACGAACAAAACAACACCAGCAGTCTCATCTTTCACAAGGAACATGAAGGGATGATCAGCAACAAAGTCTATCTCCTCTTCAACCTTCATCATCGAGCACCCGAACATCATTGTAGCAACAGTAACAGCTGGAGCTACTGGAGCCTCCTCATTTACTTCGATAAAGGCCTTGTGAAAAGCTTTTGCAGCTGGAGCTTCTGCACCTTCCTCATTTACTTCAATAAAGGCCTTGTGAAAAACGTTTGCAACTGCCAGAGGATAATTCTCGCCCACCATCTCAGTGAGACCACCTTTAAAAGGTAATGTGAGCTCGAGTCCTTTTAGAACTTCTAAAGCTTCAATCCCCAAAGATATTTTGAACTTAGGGATAAGAAACTCGTGCACTTTAACTTTTTCATATGGAACATGGCGATCCAAAAATCCAGGTTCCGAACTAATTTTCTCCAGTAAAGTTGGTAATCCATCACGGGCATTTGGGAGATACACATACATGTTGAGAAATCGCTTGTCCTCGCCCTGTTTATAACGAAGCCTTAACACTTGGAAACCATCAAAGGCCTTCACGTATTGCCTTTTTTTGCTGGTCATTAAGGGTGCTTGAACAGATCCTCCATTAAGGAGATGGAACTCATGGTCTTTTGTATCTGAAGCATTCAACTTTTCAGTCCATGCTCCTTTGAAATATAGTGCATTCGCTAAGATCAGACTTGTACCGCTATTGACTGCAACAGGAGGAAGAATTTGTTTGATAAGACCATTCGTTTTCTCTTCAGCCCACTTATTGACTTCACCAGTAACCTCATCACCCTACATAAGAAAAATTAAAACAAACAGAGAACATAGAATCAGCAGGCTAGTAACAGGATAGAGTTGAAAGTAAAAGAAACAAAAAAGATGTCCTATATGACATTAGATTCGTTTGTTTGTAATCTTATTCGAAAAGCATAATAAACGTTTCTAATGTGCTTACTAATTTGAAAATATTCTGTTTACCAACATGCCCATAATATTTTGTTATACATTATAATACTCCCTTTGTTTCGTACTTCGAGGGTCAAACTTTTCAATTTTAACCGTGAATTCGAACATGAAATTTTAATTTTTGACATAAAAGTCACATATTTAGAGACTGTAAAAGTAATATAAGTCATTTATAAGGAATATAAGAAAAATCGCAGTCAAGAAAAACTCGACTCTCGAAATCCAAAAGGTGTCACATAAATTGGGATGGAGGGAGTATCATGTAATTTCAAACATCAATTTCTTTTCTAGTGAGTGAAGGATTAAGACTAACCTTGTTCCGAAAATCAACAGAAGCCGAAGCAGCCTTATAAACATTGTCCATAACCTGTTTGAAAGAATGCTTAAAAGACAAAGATTGGTCAACCCAGGCCCAATTAGTGACAGACAAACGAGGACCTCCCATGGGGCTGCCATCGGCTAAGACGTCGGTGATGACCCGAGAATAAACAGAGTTAAGTTCTTCAACAGAGTTGAATTTGAGAAAAGCCAACAGTTGATCCAATGTGGAACCACTGGAGCCTGCTGCAATAAGAGCAAAAATTATTTGAATGGAGACCGGGGAAAACACCATGTTTGCGTTGTTGGACTCGTCCTCGTCGGCTTTAAACTTGCTGAAGAATACATGCTTTGAAAGAATCAATGGAACATCCATCATTTCTGATGAGGGAAACTAAAATTGAAACAGAGAACGTAAATCCATAAGATGAGGGAGACTGAAACTGAAACGGAGAACAGAAATCCATAATTTCTGATGAGGGGAACTGAAACTGAAACTGAAACGGAGAACGGAAATCCATAATTTCTGACGAGGGAAACTAAGGTACCTTCACAGTTGAACTTGACAGCATGGAGAGAATGCTGATGCATATAGAACTAACAGTCATGGTAGGCGACCACGAGTCATACAGAATATCTGCAAGACAGAAAATAGTTGTCAACTGCTGCTTTTATTAGATGCATTCTAACTTTTTTCATGAGTCCGGTAGAAAATCTTAATTATTAAAAAAAAAAACTTATTCTGGAAGATGAGAGGAATAACTGTGAATGAAAGGCGTTTTGCTTTCACTTTATTAGGCAGGTAACTATAGAAAGTTTAAGAGATTAATAAGCAAGCCATAGGCATAGAGCAGAAACCAAAATTATGCAGGAAAAGAAACTAGTATGATCTCTGCAATTCTTAGATGAAATGACAAATTTGACGAGACCCACAGACAAACAAAAAACAGATCCAAAGTTTATTCTACTAGGGCTAAGAGTTCTGAATCAGTAAACAACTGTGTTTTTTAATCATCCGTCAACCATCATTGACTATTACTCCATAATTAGATACGAATCAACAAATATTATCGGGACATGAAAGGGGAAAAGAATCAATAACAAAAACACCAAGACCAAGTAAAAAGCTCACGGATATAGCGAGACAATCTACCATCATACCCATCATCAACTAAGTTGGCATAGAGCATTTCCAAAACTATAATACCTGTCATCTAATCTTTATTAAGAGTTCATATCACAAATCATCCATAATTGCCAAGGATTTTCGTTAAATGCTTCATCTAATGCATTCTAAATCAAGCCATTAAGCTCGAAGAGTAACCAAATAAATAAATTTCTGTTGGTGATAAGAAATAGAGGAGACTTGAGCTGTTGAATAAATACCTAAACAAATGTGCCCGTCCCTATAGATATCAGGGTGCAAAGGAGCTGGAGGCAAAAATATCACCTGCATCAATGTAGTCAGCCCTAATTAGTATGATAGGAATTTCAATCAAGAACTTTAACAAGAAACCACATCTTATATCCACCTGGGGAGCTTTAATAGGATAATGTTCAGGGAATTCGGCTTGAAGCTGATAAGTTTCATTAGCATACAGCGTCCCAGGAGCACCATTCACTTCAATTAACCACCTTTTCAAAAATCAAAAGTTCAGTAATAATAGCATGGGACTTCGAATTCAATATTAGAAATTTATTGGAAAACGACAAAAAACTTAACAACAAACAGGAAAAATGAAGACCTTTGAAGATAATCGGAGGGTTCAAGATTGAAGCCAGACGGGGGATTGACCTGCCAGTTCCCTAGCTCTGTATGGAGTCGATTGCATGCCAT
SEQ 38
CTGAAAGTTGGTTCCTTTTTTTCTTCTCTTATTTATTCATGCAATAAAGCATCTCCAAACTTCTATTCTTATTCATTCTCTCTGCTTTCTTGCTTCATCGAACTGGTGAGTAGTTGTTTTGTTTCCTTTTTCTTCTATTTAAGAAAAAATTAACTCTCTTTTTGTCGATATATTTTATCCTTTTTTTTTCTTTTTTCTTTTTTTGTTTCTGTGGGTATTAGAGGTTTTGTGCTTCATTTCATATATTTGTCTCATGATTTTACTACTTTCAAGGTTGGGCTTTTTCCTTAGCAAGAAAGAACTATTTCTGTTTATGTTTCATTTTTCTTTTGGACCTTGGTTTTCTGTTCTCGAGGATTGTATCTGTTAAAAATTGAAGTACTTTTTTTTCCCTTCATCTTTTTAATTGATGTTCTGTTTAGTGTTATTTTCACCTTTTATGGCATTTAGCAATGTTTGTGCTTTGACGGGTTGCTGTTATAAACATAAATTTTGGGAAAATAATTACCAGGTAAACTTGTTATTATGCAAGTGCAATTTGTGTGCGTGTGGTGGTTTTGTTGCTAGGGAGCAAGGCATGTGATTAGTGATAAGAGGGTTAAAAGGGGAGTAGATAAACAAAGCTCCACTTTTTAGGCTATTGTTTTTACTTGGGTTCTTCCATTTTTTATTATAGCTTGATGAAGTAATATGTAGCTTATAAATTTCCCAGAATAAGAATCATCTCTTGCCTTAGAAAAAATAATTTACCAGTAAGAGCAGAATATATGGTAGGATTCATCCACTCAACTCCAATTAGTTTGTGACTGAGGCAAAGTTGATTGAGTGATCGATTGAGTTTAGTCTCATTAGATTGTCATTTATCCATTAAAAACATGCAGCAGGCATAACATGAGTGATTTGATCTTCTGAGCATTTTCTCTTGTTTGTTGAATTTAATATATCTTCACTAATTGCTTGGCCTAAATTTTATTAACTCAAAGTGATGATTTGCCTAGGTCAATATGGGAGCAAAAGCTTTTCTTGTCACCATTTTACTCTCATCGCTGTTATTTCCTTTGGCCTTGTCTACGTCAAATGATGGCTTGGTTAGAATTGGACTGAAAAAGATAAAATTTGATCAAAACAATCGACTTGCTGCACGCGTCGAGTCCAAGGAGGGCGAGGCTGTGAGAGCCTCTATTAGGAAGTATAATAACTTCCATGGTAATCTTGGGGCCTCTGAGGATACAGACATTGTAGCACTGAAGAACTATATGGATGCTCAGTACTTTGGGGAGATTGGTATAGGCTCTCCCCCTCAGAAGTTCACAGTCATCTTTGATACTGGTAGCTCTAATTTGTGGGTGCCTTCATCAAAGTGCTACTTCTCAGTAAGTTATTTTTTTCCTTAAAAGAATGCATAATAGAGAAAGCTAGTATTGGCTACATAATTTGATGATCATCAATATTTATGTTTCTCTATGTTTGTGCAGGTTCCCTGTTTTTTCCATTCCAAGTATAAGTCAAGCCAATCAAGCACTTATAAGAAAAATGGTCTGTTTCTGACCTTTGTCTATATTTGATAATTGCAACACGACACGTGCTTTTCTCTTATACTTGTTATTTATGCTCAATGCTTGCTTGTAAGAGAAAGCGTTCCATTATTGGCATTATACATGACATGTCTTAGGTTTTGAGATCAAAACTATTAACTCTGCTACCAACTTAGGATTTTTTTAAAAAGAAAATAAAGGAAACCCTCACCATTTTTATTGTTGTCATCCAATTATGTGCCTTGTATCAAAGTTTTTTGTTGAAAAATATAATTTGGCAAGTTTATGTTGTTGGCTTTCCCTGCCAAAAATGTGCTAATGTTATCTCTCTGATTTTTTTTACTCATGATTTGCAATAAAAGCTTGTGCCTTTTAAACTGTTTTGTCTATCAAGGAATCTGTTATGCTGGAGTTCCTTTATTGAGTTTTGATATCTATCATAATTTACTTTCCTGGAAAATTGATGTCTGCTGTGTGTTTGATATGACCTTTGAATATTCTTCTCTGTCGTTGAGTTGGTCAACGTGTTCAATTGGTTGTTGACCTAAGAACCTGTTCATCCAAACCTTTTTCTGTTTAATATGCCATACAGGGAAGTCTGCTGCAATTCGTTATGGTACTGGAGCAATATCTGGATTTTTCAGTCAAGATAGCGTTAAAGTCGGTGACCTTATTGTGCAAAATCAGGTGAATGTGGCTTCTCACTTCCTTTTTTTTAATTTTTTTTTATGTTTCTTGAATATATGGTCTCTCATCTGTCGAGATTGTTAATGACATCAGGAGTTCATTGAGGCAACAAGAGAACCCAGTGTGACTTTTTTGGTAGCCAAGTTTGATGGTATATTGGGTCTTGGTTTCCAGGAGATTTCTGTTGGAAATGCTGTTCCAGTATGGTATGTGGGTTTATTTTGTTTGCGTTCTCTTCTTTCCAAATGTTTCTTCAATTTCCTATTAACCAAGTGCGTGCCTTGTGAATTTCATTATTATTGAAATGATTTTATCTTCTGGATTGCAGAATTTCATGAACATTTTCTTCTATATAAAGTTTTAAGTGATACCGGTCTTGACGGTTTCTTCTGTGTTTTATAGGTACAACATGGTCAAACAGGGTCTTGTCAAGGAGCCTGTCTTCTCATTTTGGCTCAACCGAAATACAAAGGAAGACGAAGGGGGCGAAATTGTGTTTGGTGGGGTTGATCCTAACCACTATAAGGGAAAGCACACCTATGTCCCAGTCACACGGAAAGGTTATTGGCAGGTAAATATCCCTATATCTTCGGAAGATTGATGTTTTGCTTTCTGCAACTGTTTTCTTACTCTTCAGAATATAATATGCAGTTTGACATGGGTGATGTTCTGATTGATGGTCAAGCTACTGGTATGTTACGTTACTTCCTTTTCTATTTTTTTGTGTGTGGAGATTTCGAGGATATTGATGAGAGCACTTTCCCATGATTTCCCTGCTTTTTCGTTGTATTGACATACTGAATAATGTAGGTTACTGTGACAATGGATGTTCTGCAATAGCGGATTCTGGGACTTCTCTCTTGGCTGGTCCAACGGTATTCTCCAAAGCATATTCCACTTTTTGTCCCTATTATTCAGCTATTTTCAATAGTGAACTAGCTCAGAATATTTTTTGTACCTTCTTGTTCATGTGTAGCTTCAACAATCTTCGAGCGATGAATAGGTTTAGTTTTTGGTTGGAATATCAGTTAAATAATAATCAGCCATTCCTTTGAACTTTTCTCGTTTTTTCCTTTTCCTATTCAAAAAAAGGACGACGGGAAGTGCAGTGGAATTGATGTTCATCCCAGTATCAGGACAAACTACCTTGTTGATTGTCATACCTAAGAAATGTTTTTTTTTAACTTTTGCCTGTTGTTTCTGTCTTATTAAATTAATGCAACTTGAGAACTGCTTCTTTCTTCTCATCTTTAAGGCATGGTTGACAAATATGATACAAGGAAAAAGCTGCAGCTTTATTTGTCTAGACAATTGCAGTAGTGAAATGCTTTACTACTACATTTTCTAGTTCTCATCACTGTATCCTTCCTCCTCTATCTTGCAGACTGTTATCACTATGATTAATCATGCCATTGGCGCCTCGGGGGTTGTAAGCCAACAATGCAAAGCTGTTGTTGAACAGTATGGACAAACAATAATGGATATGCTTTTAGCGGAGGTGAGCAATTAATTATTTTAGTTGATAGTTTGTTTTTGTTTTTACCAATAGTTTTCCGTGGTATCTGCAAAGAGGGTGGTTTCGTGCTACTAGTTGCCTTCCCAATATTCTGATGGATTGGCGTCTTAACAGGCACATCCAAAGAAGATCTGCTCGCAGGTTGGGTTATGCACCTTTGATGGAACTCGTGGCATTAGGTTAGGCTAATCATTTCTTTCCTAACCTTGGCCAATCATTTGATATGTTAAATCCTATTATAAAATGTGTGCTGAGTGGATTTATGTCCTCCACGTGTAGTATGGGCATTGAGAGTGTTGTAGATGAGAATGCTGGCAAATCTTCAGGACTGCATGATGCTATGTGCTCCGCTTGTGAAATGGCGGTTGTCTGGATGCAGAACCAACTTAGACAGAACCAGACCCAAGAACGCATCTTGAACTATGTGAATGAGGTAAATAGCATCAGTCACATGCTTTCTCTTCTCATCTTAGGTTAGATTACTGACCATCTTTAACAGCTTTGCGAGCGACTACCAAGCCCAATGGGACAATCTGCTGTTGATTGTGGAAAACTTTCTGGCATGCCTAGTGTTTCCTTCACAATTGGTGGCAGAACATTTGACCTCTCTCCTGAGGAGGTATGTCTGATATCAATCTTCTGTAGTATACATGGTGTCTTCTCAACTTGTAAATGGCTTTTGATTCTTCTGAACGACGTGGTTGGTTGTAGAATCCTTTTGTCATGTTTCAGTTTGGCAGTTCAATTCTTTTTGGTTTTCACTAGATTAGCTAGCAAGGTGTTACGCTGCTTTCAAGAGAAGTACACTTGTCTTGTAGAAAATTTCAACCATGACAGCTAAGTGTAGTTTGGATAATTAATGATATTGAATGTGTCGAGCTTCAATATCAGTTTCTTTGCTTGATAAGTTAACTTATGATTGGATAATTAATGTCATTGAAGTGTGTCGAGCTTTGATATCAGTTTCTTTGCTTGGTAAGTTCATATGATTGTACTAAGCTTGCATGCTTGTCTTGTCACCAGTACATACTCAAGGTGGGCGAGGGTCCTGCTGCACAATGTATTAGTGGCTTCATTGCCTTGGATGTTCCTCCACCCCGTGGACCTCTCTGGTATGTTTTCTTTTCGTCTTAACACACGTGCAGATTCTGTTATTCTAGAAAAGTTATACCAGCTCCCTTTTGATAATGCTGTTTGCTTATGGCTTTGGTGGTGCAGGATCTTGGGGGATGTTTTCATGGGTCGATATCACACCGTCTTTGATTTTGGCAAACTTAGAGTTGGATTTGCAGAAGCAGCT
SEQ 39
TCATATGGCTGCAGGTCTTCCATCTTTCCTAGGATCACTTACCGCCACAAGCATCCCATGAAAAACCCCGTTTTTATATTCTTTCCCGCTCCTTCTTCCCAACTTTAAATGAGAATTTGGAAGGTTTTGAACAATTAGCTGACAGATGGCTCCTCCGTTATGTGCTTCGAGTTGATGACCCCTCTCTTCCAAGAAATGCTTTTTCTCATCCGATAGCTCAATGTGATCGCCATCGATGCATGTCCAGTTCTCATACAGAACTACATTCGGAATTAGCTGCAAAAAGAGGAACACGAATCAATACTCGGGGTTTCAGAATTCGTCCGTATCAAGAAATACGTGAATTGCTCAGTAGTGCTTCGCCACTACGCTAGCTGGCGGAAATAGCTTAATGGTAGAGCATAGCCTTTCCAAGGCTGAGGTTGAGGGTTCAAGTCCCTCCGCTCCTGGCTTCGTCGTTTAGTGGTAACAAGTTCCGTGCATAAGCCACTTTAGAGATAGGTGATCCTTAAAAATACTCCCTCTGTTCCACTTTATGTGAGCCTTTTCGGAGCACGAGGTTCAAATTGACCAATTTTCCTTGTGGATTGAGACATAGAATTTTCAAAAATTACTACATAAAAAGTACTATAAGTCATACTAATAGTTAACAATTCAAAATAAGAAAACTTTGTCTGACTCCCTAAATAGTAATAGATTCACATAAAGTGGAACAGAAGGAGTAATATACATTGCTGATCAGGCAAAGGGACCGACCTCGTGGTAGACTCTTGGACTCTGCACTGCTGCTAAAGGATCCATTCCCAAGATGAAATGGTTGATGAAAACCTGGACCACCGCGGGGATTATTTTCATGCCACCACTACCACCAATTACACCAGCCAACTGATTATCCTGTTAACAAGAAGAAGAAAGAATCAATAAAAGATATACTAAACAACAACTACTGAGGAAGTTCGGATTGTCCATATACTAAGCAGTTAAGAAAGATGACGCATTGCTCAAAGGATCGTTTCCTGGATTGGATACCTTGAGAACAATGATTGGAGCCATGGACGACAACGGTCTCTTTTTTGGTTGAATAAAATTAGCCGGGGCAGGAGGGAGTTCATCAGGGGATATCTCACTAGGTGTTGAGAAATCTCCCATTTCGTCGTTGAGTACAATACCAGTTGATGGAGAGAGCACACCGGCTCCAAATGGATAGTTTACTGTGGTAGTTACTGATACAGCATTTCGATCAGAATCTACAATACAAAAGTGACTTGTTCCGTGATCTCTTAGCTGACTCCACCTACAAATCAAGGACCAAAACCGCTTTAGAGCCATTGGTGTGCAATCAATAGGCTTGTTAGTTCTGGGCTCAGTCTGAAATACTCCTTCTGTCCCAATTTACGTGGCGGTGTTGGATTTCGAGAATCAAAAAAGTTTTTCTTTGACTGCGATTTTTTCATAAGCCTTTTAAATATTTTGATTTAATTATTATTGTGACTTATAGTACTTTTTGCGTAGTTTCCAAAGATTTAAATTTTATTTCAAGACTAAAAGATTCTATGTCCAAATTCATGGTCAAAGTTAACTTATTTGACTCTCGAAATTCACAAACCGCCACATAATTGGGACGGAGGAAGTAGAATTTATCTAACCTGGGCATATAGTATTCAGGAGGAAAGGTGGTATTGTCGAAAATCTTCTGTCGAATTGCTTTGGCAAAAGATGGGGAAAGCATGTCTGATACAGTTTTGCTGATATTTACAAAGTCGGGATCACCGAGGTCCATCCGAAATGCAAACATGTGTTTCATCGCCTCAATTAGTCGATGCAGACCTAAAGAACCTTCTGCAGCATTATAGCTTTCAAGGATTTTAAGAATCTGCATTTCCCCAATCGCAATGGTTTAGTCAGACAATTGCGTTGTATTAAGTTTAGACATAATTGAGGAAGATTGGAACCAATACGTCTACGTCTGTGTTATTAGAAGGAAAATAGCAGTACAAGCTAAGTTGGTTTATATACATACCAGAGAAATCCCCAGTGTTCCACTGGACGGAGGTGGCATTCCAACGATGGTGTAGCCCATAGCATTAACGGTAACTGCTTCTGGAGTTTCCACTTTGTAATTCCTCAAATCGTCCATTGTCAAAATTCCACCCGCTTTTTTCACATCTTCGACAAGCTTTTCACCAACCTCTCCATTATAGAATGCTTCAGGCCCTTGTTCAGCAATAAGCTCTAAGCTGTGGCTAAGTTTTACATTATGGCAAATATCACCTGCCCGTAACAATTTCCCCTCTGGTGCAATTACTTGTCGTAAACCAGGATCTTTAAGTATCAACTTTGCTTTTGACGCAATATGATGTGCAAGATATGGAGCAACCACGAATCCATCTCTAGCAAGTTTAATCGCTGGTTGAAATAGGGTCTTCCACGGCAACCTGCCATGTTTTGACCAAGCGGCGTGAAGACCAGCTAACTCACCGGGAACTCCCATGGACAATGCTCCCTCTAACTTGGATTTTCCATTATTATCATACATGTTCTGCTTGAGGACAAACAAAACAAATTCAGAACCTGCAACTTCGTGTTGTCTATGTTTATATACTAAATACTAATTCCAATTCCCAATAAGAGCAGAACTGCAGTTTTCTCATGTAGACAGCTACAACAGCTATTACTACTATGCCTCAATTCCAAGCAAGTTGGATCAGCTACATGAATCCTCACTGTCCATTTCGCTTCATTAAGCCACAGTTTACTTATGCCGATACAAATTAAAGAATTTAACTTCTATACACTAATAACCTAATTGTATTTTACAATATCAGTACTTCAATCCAAAGCAAGTTGGGATCGATTATACGAATCCTTAGTGTTCATGTCTCTCCATTTGAGCCAAAGTTTACTTATGTCGATACAAATTAAAGAATATAAACGTAATACACTAACAACCTAATTGTCTTTTTACAATATCAGTACTTCAATCCAGAGCAAGTTGAGATCCGCAATATGAATCCTCAATGTTCATGTCGCTCCATTTAAGCCACAATTTACTCATGTAGGTACAAGTTAAAGAATTTTAACTTATATACACTGATAATATAATTCTTTTATACAATATCGGTGTATTTAACTTGGTGTAATAGGCAGCGTGTCTTATTTTCCACATTAGTATTCCCACTATTTATGGACGATTGCATGTAATTTTCGGTTAGGTGACCTGATACTGTAAAAATTCATTTCTCCAAATTAAATAGTTATTTGATGATGTTTGGATGAGTTGAAACTTGAATGAGAAAAAGCAATGCTAAACGAGTGAATTACGAACCTGTGAAGCAGCTAAAGGAGCAGTTTCCCTCATATCAATAGCTTGAACTTCTGATGTTGATGAAGATCTAACAACCATAAAACCTCCACCGCCAAGTCCGCTGGCCATTGGATTGACAACTCCAAGGCAAAGTGCTGTGGCAACTGCAGCATCAACAGCATGTCCACCAATTTTAAGCATGGATATACCAATTTCCGAGCATCGACCATCATCAGCAGCAACAACTGCTTGCTCCGATTCAACAACGTCAGCATTTTGCTGCAGTTTTCCATTATATCTCTCAACATCTCCGATTAGCCAAATACCAACGTGTCCATGGTGTCTAAGGCCTATAACTAAATTAAGATGGAGGAAAAAAAGGAACTTAAAAATCAAATGACAAAGAAGTGCAAATTGCAGGTGCTAAAATAATTGTGTAAGCTCGAGCAACTTCTTGATTCTTATTTCAGAAGATAATTAGACAATCGATGTTAAAGGTAGGAATATACATGCCAATTCTTTATATTTTTTATAATGGAAATATGCCCGAGAGCTAATGGCGCATTGTTCGAAACTCAATGGATAGTGGGCCCGCTCCTCTAATTCTCACTTAAAGTAGGATTTTTGTCTATGACAATGTTCCACTGCTAATTTTGGTTTAAAAAAGTAGGAATGTACATGCCAAATTTTATTTTTTGATAACCGACAAGCGATTATCAGAAAAGTGCAACCCGGTGCACTAAGCTTCCGCTATGCGCGGGGTCCGCAAAAAGGCCCTTGTGGTCTGGCCCTTTCCTGGACCCGTTGCATAGCGGGAGCTTAGTGCACTGAGTTGCCTTTTTTGGTAACCGACAAATCCCAGGGTCATTAGCGTATTGTTCGAAACTCAACGGATAATGGGTCCGCTCCTCTAAATTCTCACTTAAATACTAGGATTTTTGTGTATGACAAGGTTTGAACCTGTGAAATGCGCACTCACACATCACAAGTTGTGCTTTTACCACTAGACCAAAACCCCACAAGCTTGTACATGCCAATTTGCCAGGTCAAATATTATCCAATTCAAGAAAGCCTGTAAATTTGACAATTTTTAGGCAAAAACCCAGAATTTGATTTTTTTTTAAAATATGTCCTTACGATAAAGGGTCAGCAAAATTTTACGCTCTACGGTAATTACTCAACAAATTCTTTCACAATTTCAAGATTTATAACATAACACTTCACTTCTCTTAAGTAAAGCTAAAGAGAGAAATTCCTGTTAGGATCAAAAAGTCACGTGTCATGCGGAAGCTAGTAACACAAATCTTGAACGACGATAAATCAAACAACAAAAGAGAAATATACCAAAAGAGACACAAACATTTAACGTGGTTCGGTCAACTGACATACGTCCACGGCGGAGATGAGCAATCCACTATATATAAAAGAGAGTTCAAAATATCGAGATAACAACCTCACGAAGAGGCAAACACAAGTGATACACTAACATTTGTCCCGTAAAATTCTCCCCCTAAACACGACTCTCAAACCTCATATGGCTACATCGTGGATGTTAGAGATAAAGTTCAATCTCTATAAGTTAGGATAGAAATCTCTATTAGTTAGGATAGCTATGTTCTGTTAGCTATATTTTAGGATATATGATTGTTCTATTAGTTACCTTATCTCCCTAGTCTTCTATAGTGTGTTGTAGACTGTTGTATATATATTCAACTATGTACTCAATAGAAAATCATCGAATTCTCTCAACATCATCTCTCATAATGCTACTGAATGGGAAAGAAAGATCTCAATTTATAGAAGTTCAAACATTTTTCTACCAGAAAAGGGACTAGCCAACTATGGAAGCATTATATTTTCCTTCTAGGAAAAGAAAAACTGAATTATGGTAAATATGTTGTTCTTTCCTCCGTGAAATAGGAAAATCAATTATAGTAAAAAAATCTAGACAAACACGTAACAATTCCATAATCATGGTGTTAATTAACTTCATTTTTCATAGCTTTTTAAAGCCCAATTAACGAAATTCCTACAGAATTCAACTGAATATTCTGTTAACAGAATTGCAAATACTAAGAAAACAAAGAAGAAGACAAAAAGTCAAAGGTGAAAAACTCACATGAAATGGCAGTGAGTGCAAATAAAAAGCAAAGAGCAAAACTCCATTTCTTCCTTCTATTGAAAGTAGCAGGAGAAGGGTCCAACAATGGAGCTTCTAAATTCTGTTTACTCAT
SEQ 40
TCAGTTGTGTCCTGTCAAAGGATCTACTTTTATGCTTGTGGCAACAATTGGACTTCTAACCACATATTTACCTTCAGTTTCTACCCAGCTCAAAGAACCATAAACCACAATATCATCCATAACTAATGGACCTTCTATTCTTAGCTTGTAGCTCAACTTTTCATACTTCTCTTTGAAAACCAACTTTTCAGGTACAAGATTAACTTTAAATTTACCCATTGTGGTCAATTTTGCTGTGTATACTGACATACCATCTCCAATATTAGTCACGGTCCTCTGGAATTCTTGTATCCTTCTAGGATCCGACTCGCTGCTGTTCCCATTGAAAAATCCAATGAAAGATGGATAGTTTAAGTCCAATGATGGGTTGGAGCAAGTATAAGATGAGGATCTTGTGATGGTTTTTATTTGTTTGGATGTGAAGTTCAGAGCACAGAGAAGATTGACATAATCTTGTGGTGTCGCATCATAGATAAGTCCAGGATCTAGTGCCTTGTTTGGATCGATATGGCCAGCTCCCATGGCTAGAGGAGTAGCAGCAGCATTCTTACTACCTGTTGAGATATAATATAATTAAATGATTAAATATATGCTCTCTCTAACATATAAGCTTACTTGATTCAACATAGTATCAGAGCATGCAAGAGGTCCTAGGTTCAAATCTCACCGCCACCAAAAAAGTCATAAAATAATTCCAAGTGTTTGGTCCATGAAAAAAAATCAAACTTTTAGATGAGATGGTCACACAATTCAATATTACCTATGTCTCGGATGGGACTTTGTGTGTTGTCCATCGCATTGGAAGTGGTCATCATGGCAGATCGGATGGCTGCAGGGCTCCATTCAGGGTGTGCGGCTTTTAGAAGTGCTGCTACACCAGAAGCATGTGGACATGACATTGATGTACCAGATATAATATTGAAGTTACTAAAAAGTTTTCCTGAGGTAACATCAGTCACTGGTGATTGTTGTGGCCATGAAGCTAGTATTAAGGCACCAGGAGCCATGAGATCAGGCTTGAGGATACTTGGACAGCTCGGTGACGGTCCTCTTGAGCTATAGGTAGCAACTTTTGGTGCTGGTTTAGCACCAATATGTGTCACTCGGAATTCAAGTTTTCCTTTAGGTGCAGAGTTGCTCTTAATGTACTCTAGAACTTTATCACCCTCTTGTAAGTTCAAGAACACAGCCGGGAATTCGCTTTGGAGGTAGAATTCCAAATCAGTTATATTAGTTATGAAGACAGCCCCAGCAACTTTTGAATTTCTCACATTGTACACATGCTCACTGACCGAATCATTCTTGTCAAGGCAGACAACAATATTGTGTGCACTTTTTTGCAGTTCCTTGTCATCTTGGCATTCAACATAGACAATGGAGCTTTCACTTGAACTAGAATTCCCAGGGTAGAGCGATAAGCCAGTGACTGAAACTCCATTTCCAAGAGTTAATGCGCCAATAAATTCGCGGTCAACTGTGCCAGCTGCAACAGTTAGCACCCAAGGTGTTCCATTGTGCAAAGTCTCATAATAAGGCCCTTCATTTCCTGCAGAGGTGGAAACAAATATACCTTTCTCCAATGCAGCAAATGCGGCAATTGCCACAGGATCTTCGTGTAGTGGAATCGCGTCTATGCCTAATGACAAGGATAAAACATCTACACCATCTGTAATTGCTTGATCAATTGCAGCAAGAACATCAGACAAGTATACACCTTCTTCCCATAGAGCCTTGTACATAGCCACATGAGCCTTTGGTGCTATGCCAATAGCAGTGCCGGTGGCATAGCCAAAATAAGATGCACCCTCGACATAACTTCCCGCAGCTGTGGAAGAAGTGTGAGTTCCATGTCCATCTGTATCTCTAGCAGAATTCATTGAAATGTTAAGATTTGGATTGTTGGCAAGTAGGCCTTTATTGAAGTAACGAGCGCCAATGATTTTCTTGTTACACAAAGAGGAATTGAACTCAATGCCACTTTCACATTCTCCTTTCCATCTTGATGGTACTTCACTAATCCCATAATCACTATAGCTTTTACTCTCTGGCCATATTCCAGTATCAACTAAGCCAATTATGATATCTTTACCATAGTCGGACGTTGGCCATACACCAGACTCAGAGTTTAGGCCAAGGAATTGGGATGTGTGAGTTGTGTCAATTTTAACTGACATATCCTTAATTGAAGAAACATAACCTGGAGAATTTTTTATGGCTTCAAATTCAGAAGGAGAAAGACTTGCACTAAAACCATTGATGGCATTAGTATAAGCATAGACTAGTTTTGAGGACAAGAATTCTTTGTGATTTGTACTACTGTCTGATAAAGAAGCAAGTGTTGTCAAGTACCAATTATGATGGCTAGCAAAAGCTTTTGGCATGGCTGACAAATCCATATGAATGATATATGTTTCTGGCTTTGCTAGTGAAATTATAGAAATAAAGAAGAAAAGCAACCAAATACACAAGGTAATATGACTTGCCATGTTGAGTAATATATTGAAGGAGGATATTTTTTTTAACAT
SEQ 41
ATGGAATTTTACCAAAAACTGGCAACATGTTCTCATTTGTCGCTTTTGTGCTTCATCCTCTTACATTCCATTCAAGTTCAAGGTAGCTACTTTGATCAAGAATATGGTAAGCAGGTACTGAGCTCAGCAATACAAGATAAAGATTGGTTAGTATCCATAAGAAGGATAATTCATGAATACCCAGAACTCAGATTCCAAGAATATAACACCAGTGCTCTCATTCGTACTGAACTTGATAAACTTGGCATTTATTATGAATACCCTTTTGCCAAAACTGGTCTTGTTGCTCTAATTGGCAGCAGTTCTCCTCCTGTTGTTGCTTTACGAGCTGATATGGATGCCCTTCCTCTCCAGGTTCATACACAATTTTTTTACTATCAATCAATTATACCTCAATCGTCAATTAGTTGGGCAGTTATATGCAGTTCGGAGCTAGGTTGTTCCCTAAGGGGAATCAACATATAAAGAAGTAAAGACGAAAAAGCCACGGAGATTCAATATATAGTGTATATACAAAAAAAAAATAAAAAAATTGACCTATTTACCCTGTGTAATTTTCGACCCAAAGGGTATCAGTTAACTCCCCTTGGATAAGGTTGCTCTGCCCCTAGTTATATGAATCTTCTTGTATCTAATTGAGAGGATTCAATATAGTTAAATTATTTATGCACCGGTCGTCAACCTAGCACAATCCTCCAACTTTATTTGAATCTGCAACTGGCTATGCTTTGTGAAGCTTAAATAGGTGTAGTTAGAAAGAAATATTCTTAATAGTGTGCATATTTAGTTATGGAATGTCTCTAACATTATTCTCGAGTGAATATAACCATAGGAGCTTGTTGAATGGGAGCATAAGAGCAAAGTTACTGGCAAAATGCATGGATGTGGACATGATGCCCACACGGCGATGCTTCTTGGCGCTGCTAAGCTGCTGAATGAGCGAAAGGACAAACTTAATGTAAGTTTGTTAACCTTACCCACTTCACTAATGCTGATTCATTTGGAATGTAATTTGTGCTTGTGTGATTCTTTAACAAAAGATTTTTTGCACAATGTTGACCAATGACCAGATTGTCTTGTTCTCAGAAGTAATAATATTAGGTTTGCGCTATAGTGATTATGCTGATCATTTTATCCGTTGTGCTTTGACTTCTTATCTAGGTTTGCATGTACACTAGGCCTTTGGAGCTTATTCTAAAAGGGGGTATTTCTTAAACATAGAGGACTGTAAGAAGATAGATGAAAACATTCTTTAATAGAGGGGGGTATTAAGTGTACTTTGTCGAGATAATGAAAGAACAGACTCAAAAGGAATAGACCAAAAAGGATATCTTTTTGCTTTGTTATCAGATTTAGTTCACTTATTCACATGTCTCCCTCGGAACAGTCCAAATTTCATAGCAGTGTCGCAAAAAAGGAATAGTTGTGCTGTTTGTTATCGATGATGCTTCTTAACTTGGATATGACCATGTTATTCTTTGATTCTTTAAATCTGAAACTTGGATCGTCCTTCTGTGGGTGACTAGCAGTGCCTGTGGGTAATCATTTTTGCCTTTTCCTTAGATGAACATATAAAGTGATTTTGCCCATTGAACATAGTTGTGACCATTCATGATTCATCAATTGTCTCGATGTGGAGAACCTAGCCCTCTGATCCTCCATGGCTTGCGAGTTCACATCCAGATGAAGCAACCAGAGAAACTAATTCAGGCATGACGAGAAATTTTCCGGTCAAGAGAGAGGATCGATCAGAACCTGTTGAAGGAAATGGTAGATGACGGAGCATTGGCCCAAATCAATTTCTCTCTGGAACCACGAAAAAGAAGCTGAGAAGACCGATAACTTCTATCTACATTACAATAACAATACATGGCTGCATGTATAGGAAACGAGGAAACCATGAATGTTTTTTTGAATTCTTTTTTGCTTGACCAATAAAAAGGAATTCAAGACTGAACCACACTTTCTAATTACTTGTTAGTCTGTAATTGTCTGACTGATACTATTAGATATTTCTTTTCAACTTTATAAGAATACATTTGTCACATGACACTCGTAAAGCACTGTTCGAATTGACTTAATCTGTTTTTGCCCTTTGTGTGGCATCATTCATTATCTATCCATTCTTGGGGTAGTCTACAATAGAAAGTTGATTTGTTGCTTGTCTCTATTTTTATTTTTTGAACCCGAAAAGGGAACGGTAAGACTTGTTTTCCAACCTGCGGAGGAGGGAGGAGCTGGTGCATATCATATGATCAACGAAGGGGCTCTAGGTGATGCAGAAGCTATATTTGGAATGCATGTTGATTTTAAAAGACCTACAGGGAGCATCGGTACTAGTCCTGGGCCGATTTTAGCTGCTGTTTCCTTCTTTGAGGCAAAAATAGAAGGAAAAGGTGGGCATGCTGCAGAACCCCATGCTACTGTGGATCCAATACTTGCTGCATCATTTGCAGTTGTGGCATTGCAGCAGCTCATCTCAAGAGAAGTAGATCCCCTTCATAGTCAAGTATGTAGCCTAATCTCAATTAGAAGTATAAATCTTTGGTTTACACACACACAGAGACACACAGACACATAATTATGTAGGTACATATATTCCCTTCAGGAACATTTCTTGTTTTAGAAAGCAGTATAGCATTTGAGACCTGAAGCCTCATTGACAGTTAAGCTGACTGAGATTGAAATTCTCATTTCTGCCTGAAGGTTCTTTCTGTTACTTATGTCAGAGGTGGATCAGCATCAAACGTAATTCCGCCTTATGTTGAATTTGGGGGAACTCTGAGGAGTCTTACAACTGAAGGCTTGCTTCAACTTCAAAAGAGGGTGAAAGAGGTAGGTTGCTTACATGAACCTTTGACTGTTGTTGACTATCAACATCTGCACACTAGATTGTCTGCCAGATGTCTTCAACATGTAGTTTTCTGTTAAAAAATTTAGTGATTTTTTTGAGTGATGTTTAATAGCCTTAAACTGAGCCTTCTTAGGTACTGAGAGCTACGTAATCAAATTAATAAGATTAAGGGTGAATAATTCTCGAACACGTGTTCACATGAATATAGAAGTCTCAGCTGAATGAATGATATAACTTGTGGTCTGCTTGCAATTTTCCCATGAAAATGCCATGTAACTCTAGCATTCATAACTGATCATCTTTCCCTGCTTTGCTTCTCTTTCTTTGTCAAAATCAATTTTATGCCTGTCCTCAACATAGAAGCTTATCATTTTTATTATTGAATCCTCTATTTCTATTTCGCATTGTTGAATTAGATGCTAATCGTCTTCAATGTCAAGTATTGCGGCAAGATCTTACTAATTAATGTGAACAGAACCTAGATTTCTTGTGGCAATTTTGTGCATTTGTAACACATATTTACATGGAGCCTGCAGGTAATTGAAGGACAGGCTGCTGTGCATAGGTGTAAGGCGTACATTGACATGAAAGAGGAGGATTTCCCAGCATATCCAGCTTGCATAAATGATGAGCGCTTACATCAACATGTAGGGAGGGTTGGCAAACTCCTGCTTGGTTCCGAGAACATCAAGGAAACTGAAAAGGTTATGGCAGGTGAGGACTTTGCCTTCTATCAAGAATTGATCCCTGGAGTTATGTTTCAAATTGGAATCAGAAATGAAAAACTGGGCTCTACCCACGCTCCACACTCCCCTCACTTCTTTCTCGATGAGGATGTCCTGCCAATTGGAGCAGCGTTGCACACAGCCATAGCAGAGATGTATCTGAATGATTACCAACATCCCATTGCGGTT
SEQ 42
TTAGATTTCCTCAACTCGTCTATAAAATAGGACATAGGCGGCCGAGGTTTTGAGCTTGTCCTGGCTGATGGGATACACATGGCTGTCATCGAAGTCATACCACCGATCAGCACCTTGCTAGATTATTAGAAGAAAAAAACACAAAGTTAGAATATCTGGATTAAACTGGGAAGACTGTAAAAGTCTGAATATTTGACCTTCTTGCATATTCTTCATGATGAAGAAATAAAAACAATGAAGATGCATGACCAAAGTTAAATATATAATAGATGCACATATGTGCAATATGTGATTAATTTAGATGCAGATGATGCATTGGAACTAAAAAATACATCAAAGGAACACTCACTACGAAATAGGAAGTTTCTATATTTGCCCTTGGAGGTAGGTTTCAGTGACCGCCCACCACCCACCCTTCTCCCCAAGTACTGATCTTTAGATGCCAATTTTATCAGGTATAAAAGAATCCTATTTACCATAGAATAGAAACAAAACCAAGAAAAAGAAGCAAGGTAATCCAAGTCGCAGCACCACTTACATGAACAAACGCAGTGTAGTGACCCCCTCCCATGCTTCCATAATGGTTGCTAATTGCATAAAGCATATACCGGTAGGAAGATTTGCCATCTTTGTAGGCCAAATATGAGGATAAATCAAGATCATGAGTTGGGAAGTCAACATACGTCTCCAACTTGTTCTTCAGAAACCGGTTGTACGAGAACCTCTTCAGGTGGATGACCAGAATCTCCGGCAGTCTCCAAAGATCCAACTTTTTAGTAGCTTGGCGATGCTGCTTGCATGCAGGGCAGTACCTAATGTTGGATATAAGACAAAGAAAGTTAGGCAGATCATATTCATACTTCCCAAGCGAATGACACAAAATTAGATGATAAGATAGAATACTAGAAAGATTTCATGAAAGAAGTCTTTTCCCCCATGAGTGGCCCCAGGTAAATAGCTGAATTTTATTATCTCAATTACTGTCTGTAGTTGCTATGAATATACAAGAAACAAAGAGCAACACAAAAACTATTTCAGAAGCACAATGTGCAGAAAATCAATAGGTGTTACATAAGATCATCAGATGCTGTCATTGTTTCTTTAACATAACTGAACTAGAAGTTGGAAGGGTACTACTCATGCCATGTTGCAAAGTCAAGGTCCAATTTGGTTGAGGGGAGGCTAATAAGAATGGTTTATCCACAATAAGCAATCACGACGTGGTGGAAAATATTGACAGAAATGAAATAAAATGGTATAATTGGAAAAATACAATTATGTAACTACAAAAGTTGAGGGTTTATAATATTAGCATAAACCAAAGAAGAAGCAAAAAGGGAAGCATTACGAGAAATCTCTGAACATTCATATGTTATGAATAGTGAATACATAGCTTGCATATGAATGGTGTACAATCACAGAAGTGAGGGAGTTGCATGCTTACCACATATCTTCTGGCCCTAGAGGCTCTTCCTTCAGAAATGCCTCAAGACATTTATACAGAGAGACAGATTCTTGTGGTCTTTTGGCAAAAAACCCAGATTTAAAAACTTCTGGCAGTGAGCTGAAAAGGCCTGTATTGTACTGTTCAAGCATTTTAGGTGACCAACTTACAAGTACATTTAACCGTCCAGAGATATCTGTGGACTGTAATGGCTCATTCATTACAATCTCGGAGCCTTTAAAGGTTGCCTTATCATCTGATAGGTAAAATTCAAAGTCCATGTCTAAAGGTTCGGCAGTATCTTCTTCAGCAATGCTTTCTGGAACCCCGTTAACTATTGAGTTGCCAGGTTCCATGTCTGTGCTGACTTCTGAATCTGTACATACTTCAGTAGCACTTCTATCACAGTTAAGAGTAGCACTTCTATCACAGTTAAGATTATCTGCTTGGGCTGTAGTGTGGACTAAGAATGGTGTAAGTATCTGTAGATAAAGACTACGGATATAAGATCCTGTAAGAACTCTACTATGCGCGGCAAGCGGAATTCCAAATGTCTTCATATTTGAGGTCAGCTTTCCGTATATGTAATGCCT
SEQ 43
CTAGCTAACCTGGTTTTCACTAGCCTTGTAATACAGCATTTTCTGCACATAAAACATCATGTATCCTTGCGCAGCTCTCACAATACTCTCGCTCACCTGAGTTATCCAGGCATCGTCACATTTGTACCATTGATTGCTTAACCTCAGATATGTTACGTAATGACCAGCATCAAGTTTACCAGTATGGGTGATGACAGCAAACAACTCAAATTCCGAGGACGATTCACAGGACGCATCTTGCTCGTCCCCGTCAAAGGAGAAGATTCTATTTCCAAATCGACTCCTCAAGATAGATGAAGAGAGATAAGGCGACATGTCCAAGGAAAAAGGAAACTGTAGGTAGTGATCAACCTTCCTTGACATTTTCTTAATCACAGAATGCTCAAACCTTTTGATATGGAAGCAAGAAACTAAAGGCAGTTTTCTTATGGACATCTGTTTAAGAGATTCCTGTCTCACTTGACAATGTTGGCAGAAGAACTTCTGATCAGAACCCAATTTCTCAGGTCTTGTGAAATGATCTAAGCATCCCATCAACGTAGAAATTCGACCGTTTTGGCTAAACTTTCCAGATTCTGCCTCCTTTTTGTGAGTATTATGAGACTTCTTTGATGTCATCTTGGCGGAACTCCCCTGGCTCAGTTCCAAGTCCAAGGAGATGTCTATACATGGATCATATGTAGTAGATGTGAAGCCACAAGCTGTACACATGACATCAGACCGCAAGATCCCAGAAAATACTCTATGAGCAATGCAACAGTCTCCGCTGCCTGCACAAAAGAAAAAAATTAGTCAAGATTAGATTACAGATGACAAAGTGCAATGTGTACTGCTCCTAATAAGTTACATACAGTAACATAGTATGCACAGCTTAAAGTGCTCCTCAACACATTTCATAAAAAGCAAAAGTCCAGTTATGCCATAAAGACAAAACAACAATGGTCCTATGAGGAATGGCAACCAACAGAATGAGTAGCTACAAGCAATGGAGTACAATTGACTTTTGTAAAAAAAACCTCAGTAAGAAAAGTATCCGCAAGACACTTTATTGCCTCAAAAGGTCTATTATTCCATTTACACAAAACCATTCAAAATCATAGATCCTTGTATTCATTATTTAACTGCAACAACCAGGTGTTAACTAAATTTTGCAAACTCAGGGTAGCCCTGTACTAATCCTCTAATTAAGAGGAATAAGAATAGGTGTTAACTCTCTCAATTAAAGTACGTGTGACTTAAGCGGGAATCAAATCAGCATATTTACAGTGGCAGAAATGAATTGTTTGCTAAGGAGTTACCCTACTAATGATTGCTCTGCTAAAGAAATTTGAGGTACCGAGGAGGTAGACTCACAAGAATAATAAGACACACAGGGGCTTATAGAAAAAAGTAAGCAAAAAGTTGTCAACCTAGTAAAGACTTCAAGCTTTCTTGAAGGTTGCGATCAGCTCACTCGAGAGTAATACCCTAAAACATGGTAAAAGTGCGAGTTAATATGAGAACTATTGTACCAGAAGAAAACTCCTATGCTGAAATCAATAAGACTAACCAAACTGAGACTTACTGGGAAGAGTAGGAGGAAATCTTTTTCCTTTTGAGAAACATCTCTAAGCCTGACAGCGAATATCAACTGGTAAATCGCCTATGGCAAAGAAAAGACCTAAACCATAACCTGCATTCAAATATTTCTATCTTTTCTCAGTGACAACGGAAGTTGGGATTGCCATGAGAGATGAGATGCTAGAACAACAAAGATAAGCCTATCAAGTAGGCCCTGAGTGCATTTTAGTTAGACGTTACTTGACATCACAAAGATGCTTGAACACACTATTCTACTTCCTACAGAAATTACTTTTTCCACCCCCTCTCCACCAACAAAAAAGTTCAAAAAATTCACCTACTGAAGTACTTGACCTTTGCAAGTAACTATACAAATTTCAGTAATCCACTTATTTGAGGTTTTTACTGCACTAGCCTCCCTTGACTATGTTAGATTTATGTTGCTTTATTAAAATATGCATAAACAATGTTGCCAATAATTTTCACAGCACAATAAAAAATATTTAATCTCAATGATTGGTCTAATTCGGAAGAAAAGGAAAAAAGAAAGAACAAGAAACTAACTATGCAAGGTGATGGGGGGAGAAAAGATGGGTAACTAATAGATACTCTAACGCAACAAACAAATATACCTGGACTCAACGCCTTTCCTTTATCGTTCTGCATCCTTTCATGAATCCCATCAAGCACGGAAATGAAAAACTCATGAGCATCCTGCTGTTCATAACTTGCAAGATTTGATGCATGCTTCCACCAACTGTTCCAAAGAAAGGTTTACATCCATCAGAATCAGATACAGAACAGACTGCAATTAGATACTCAAAAAAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCAATATAACAAATTACAAATGTATTATATCACAACCAAGTTACACACCACTAGCTAATTTTACTTCAATGAAGGGATAAGGGTGGTACATATAACCCCAACAATAAAAAATTAATCTTGTCCCTACATCATTTGCAAATAAGGGCACAACCAAGCTAAATAAGGAATCTTTTACCAATATATCATTATCCTTAAAAAAGAATTATAAGTATACAACAACTATGCCTCAATTCCAAGCAAATCTGGATCACCTATACGAACTTCTCACCCAAAAAAAAAAAAGAACGAGTGCACATTATAAGAAAGCCAAACATGATCTCAACAATCCAAGAAAACATGATCAAATGCAGACCTGTAGAGGAACTTTGCAGGACTAATAGGGGTCCAATCGCCGGAGAAAACAGCAGAAAACATTGCATCCAAATCACAAGCTAAACACAGCATTGTTGAGTTCTTATTCCCATTATCACTACTACTCCTTGTTATAACACTGTTATTCTTTCGCTGGCAAAAATATCTGTTATGCTTGTCACTCAGAAAGTAATTCCTCAATGGTGGTGTATGAAGCAATGCTTGAAGCACTGAATTCATAAAACACGTGTTTCCAAGATTGTTAAGGCCCCTCAAACCCCATTGTACTTCTGGGGTTGTTGAGTCATTGCCGAGCTGACTCGGCAACGGACTCGAATTCCCAACGATCAAGACCTGCTCTTTCACATCAGGCGTCCACGGCTTATACTCCACGCGCCTCCTCTTGCGCGTGTTCTCCGGCTGCGGCGGAGGGTCTTGTATTGATCCGATCACGGTAGCCTCCGTCTGCGCTAGTACTACGGCGGCGTCGAAGTCTCGATCGTATACCTGGTCCCTACACCCGCAGCAGAACAGCTCGGCCCTGTCGATGTCCACGGCGATGCTGTGCAGCGAGGGATCGGAAGCGTTCCCCACCGGATGCGACGGCGCGTGCACACGGCAGAATACCGCGGCGCACGTGACGCAGGCGTACAACCGCGGCGGCGCCTGTCCGCACGCACCACATCTCACCAGCTCATTCGGCGGCTCACGGCGGATGGACGCACGCCCCAATGGCCGGACCTTCACGCAGCCCCGGAAGTTGAAAAATGGGTTCGACCCAACTCTGGATCGGAGCTCCGACAAATGCTCGCATGTACCACTATCGGTATAAACAGGCGACCCGTTTTTATTTTCGGATCCGGGTTGGATCTTACCGATTGACAAATGGATCTTAAGACTGTTCCTAGACAT
SEQ 44
TCAAGAACTGCTCTTCCTTCCTCCATTTATGAAGGCCTCAATTGGTTCGGCATGGATGTGCTTCATTGCCTTCACTCCCAGCGGGTTGTCCTTGCTCTGTATCAACAAATGCAATCACGAGCATTGTTTAGAAAGAATGGCGAGCAACATGCTACAACCAGTTAAATTACACTGATAATATCAAAATTCTTTGGACGATCAATGTATAGAAATCAAACTTCTCTTTAGAAATAATAAGTGCGAGTCACTTACTATTGAGCAGGTGCCTGCACGAATATTGCAGACAGGATAGTCGTGTGGGCAACAACTGTAGTGATCTTCACAGCAAGTGGCTCCTTCAAGAGGGCAACATCCCCAAGAAAAGCAGGAGTTATAGTATTCAAAGACACAACAGCAAGTGGTACCCTCGGGGCATTGAGCATAATCATCACACTGAGTAGGTGGCTTGATTGGAGATGGAGGAGAAGGTCCAGGTTTAGGGGGATTTACGCCTGTCTTGACTGGGTAAGAAGGCTCTATGGCTAATCCACACAAACCTTTAGAACTGGCAACGTTACGTTGAACCCGAAGGTAACCTTTCTCTCCCCAGGAAGCACCCCAAGAGTTCCTGATAATCCAATAATCCATACCATTCTCACTACCATATCCAACAGCAACCACGCCATGGTCCACTGCAGTTCCACATTTTCCAGTAAAGATACCCTGCAAAATATATGTATTAGAACCATATAGCTACTAGCCAATAAGAACTTCATTATAATAAGATTTAAATATAGAATTGTAAGTAGTGATAATTGGTAAATCATAGTTTTTGAATTATAACAGGAATCTATTAACTCATGCAGCCAGCGTTATCTTTTGCTACATTCTTTGGTCGAAACAAAACTATTATATCTGATCGAAAGGTAACGTATTACTGGAACGACGATGTCTTTCTTTGAATTCTGTGACTAACTAAAAGACTTCAGATTTTGCATGTTCACTTGTATTTATTACTTTCAGAAGAAAAAGATAGGTCAATGCGAATAAAAACATTACCGATACATAGTGTTGAAAGTCGTGGCCACCAGCTTCAATGGCAATGCTCACAGGTTGATTTGCAACAGCCTTTTGCAGTGCCTTTTCATTATTTGCAGGAACATCTTCATACCCATCTATGGTAACAACCTTGGCATTTTTCTGCATGACACAAATGTTTCAATAAATAAATATTCAACTCACTACATAAATCAATATATAAAGTGAATTTAGTCATTTCAAGAATGCAAAATTTACCCTTGACTGGTCACATCTGCCATCACGGCCTGTGTAAGGGTAGTCTTCCTCAGTGTCAATTCCTCCATTTTTGATGATGAAATCAAAGGCATAGTCCATAAGACCGCCATTGCAGCCGTCATTGTAGGAAGTATCACAATCCACCAGTTCTTGCTCCGATAGTGAAATCACATCTCCGGTGACTATCGAGTTCACTGCTTCAACGGAAGCAATTGCAGAGAATGCCCAACAGCTCCCTGGTTCATTAACCGATAGATTAATACACAAATATTGAGCCATACAACATAAATAATGAATTGAGAAATTTGAACTCAGTGATTTATAAAAGAAACTGATATGCAGGCATTAATAAAGAATACACGTCAGTGTCACATGGTATTTAGAAGGCCCGTCGGGTCACTTCGTGTAACATTTGAGCACATTAAGAGGTCGTTTGGTAGAATGTGTTAGAGAAAATAATGCATGCATTAGCTTTGTGTATTAGTAATGCTTTGTTTGATACACTTTTTCAACCTATGTATAACGGATACAAGCATTAGTTATACAGTCTATTTGGTATTATCCTATGTATAGCTAATGCATAGAAAACCATGACATTAGCTATATCGAGGCTATTAATACTTGCATTAGCATGGTTAAAGACAAAATTATCCTTAAAGTCCCTTAAGTAAAGAATATGGAGGGCATTTTTGTAAACAATTAAATATCTAAAAAATTATGCAATGCATTTTAATTTTTAATACACCACACCAAACAATGCATAAAAAATAATCTCTGTATAACTAATGCTTGCATTACAAACCCCTGCATTACTAATGCACCTTATTTAGCATTATTCTTATACGCCCTACCAAACGACCCCTAAATGTTGTAGCAACTAGCAAATATTCTAGTGCCAAAACTCAGCATCAATTGGTCAGCATTTTCTATCATTATCAAGTTGTGCTGTTAAGCTTTTCACCCTATAATAATTGCTCATGCGTATAGAAAGATGGGGTCGGGGAATAATGACTAGGAAAAAAGAAAAGATGGGGTCCTGGAATAATATGATTAAAAGGGAAATTCGGTGCCCTAAGCTCAGGGTATGTGCGAGGTGCGAAGAAGAAGTGGATCATAAGGATTTATTGTACGCGACCTTACCCTGCATCTTTCTATACACCAAAAAATGTAAAGAATTGTTACATATTACTAGGTAAAATTTGATCTATAACAAATAACTCTTCATATATATAATACCGGCATAGGACTATTTAAACAATAAAATAACAAACTGTTACACCCAATTAAATTACACTAATACTATAAAAAATTATACTATCAAAGTATAGAATTTAAACTCGAAGGAAATTGTAAAAATAGCACGGTATAGCCAGTTTTCGGATTGGTCATTCAAAAATAGTCAGCGTTTATCAAGTCAATGAAAAATAGCCACTATTTTGCTGCAACAAAGACCGATCCAACATAATATACTGGAGTTCGGTGCAACTGTGTATGAACTACAACATATTATGCTGGACCGATATACTTTGTTAGCTCCAGTATATTACTGTAGCACCGGTGCTCCAAACTCCAGTATATTATGTTGGACCGGTATACTTGATGGAACTCCAGTATATTATGCTGGAGTTCTAGTGCGCTTATGCAGATAGAGTTCCAGCATACTTATCCTGGAACTCCAGTATAATATGTTGGAGTTCAAGTATACTTATGCTGGAACTCCAGCATAATATACTGGCGTATTTTCTGAGTTTTAAACAGTGTTTTCGCTCAAATTTATCTTTACATAAAAAGTGGCTAAATTTCGATTACTTTTAAAATTTGGCTATTTTTGAACGACCAGCTATTTTTTATTTTCACACAAGGAAATCATAAAGGTATCCTACGTTTATGCGTAGACACACAAATTGGCTTTCTACTCATAGAGGACTAGAACTGAACGCCACCATACTATTGACATGAGAGTTTATTTAAGTGCAATGAAAGTGTAAAGAACAGAAAAAATATTTAAATCTGATATAATAACATAGAAAATAATTCTTGATATCGATTCTAATCTAATAAAATAGGAAATAATAAATAACTAGTTATATTTAGAAACAATTTAACTTTAACGTTTTCATTTTATCTATTTTATCATTAGAGAGAAACTTTTATAATCACACGAATGTTACCCACAAATCTTTTGCCCTTGACCTTTTAGGACCACATGATCAAAAGTTTTCTTTTCTTCTTTTTTTTTAAAAACTTTATATCAAGTCAAATTATATCATTTAAATTGAAACGGGTAGAGTATTTATATATTTTAGCACAATAAGGCACGTATGATTTCCTTTGTTTGTCAAGTTCGTAGAAATACTTTCTACATATAAATTAATTATGGTAGTGGAATACCAATAGCCTATCTCTATATGCTTTAATAACAAATTAAATCAGGAAAATATCATCTAAAACGCCGTCTAATTAATTATAGATCCATAACCCAAAAACCGGAGATAAACGAAAAAGACAAAGCCATTGCTAATTAACTTACCACAGCTTCCTTGATCCTTAACTCCAACAAGAACACCTTTCTCTCTCCAATCAACTGAGTCCGGCAAGCTATCCCCAACTTTAGGAAGATACCGATCGCTTTTGTTTTTCAACAACCTGCGACGATCACTGGTCTTAGTACCTAAGTACATGGACCTGTACTCCTCGTTGGTCAGATCAGCAAATTTGGTTAAACCAAGCTTGTAACTCTTGTTTGGAACGGAGTTTTGTTCATCGATGTATCTTAAGTTATCTTTAAAGATCTGAAACCGCTTGTCTTTTTCGTCTAAGGCGTTGTACGATTTTCCATGTTCGAGTAGCCATGACTCGTACAAGGACATGACTTCATCGTCCGTTCGAAAGTGTTGGTTTTCGTCGTAGGTTAAGATGGACATGTCGGAAGCGGAAGATAAGGTGGAGAAGAAGAAGAAGAAGAGGAGGAGAAGTAGGGATATGGATATGGTGAGAGTGGAGCTATGAGTTGCCAT
SEQ 45
TTAGAGTTCATCCTTAGGTGCTGTTGCAGGAGACCCTGTGGGGTTGCTTGATGAAGTCTTGATGGGGTAGGATGGTTGCATTGCTATACCACACAATCCCTCTTCAGCATCAATCTCGCGTTGCATCCTAATGTATCCTTTTTCTCCCCATTCAGGTCCCCACGAGTTCCTCACAATCCAGTATTTGGTTCCATCAAGGGTTGTGCCATAGCCCACAATTGCCACACCATGGTCCAACTCAGTACCACAGTCTCCGGTGAATACACCCTGCCATATTTACAGTTCGTAAATGTTTATACCTAGTAAAAAACTTTTTAACTTGAGATAAATGGTCTATCATCATTTACCTCAGAGTAGAACTGGAAGTCAGAACCTGAAGCTTGTATAGCTACAGAAACAGGCTGGTTGGCTACTGCTTTAAGTAGGGAATCCTCATCATTAGGAGGAACATCCTCATATCCGTCAATTGATACCACAGGAGAATTCCTCTGCCAATTCCATAAAATTCATGCACGTGGATTAGAAACAAGACTGGTTCGATCTGACAGACTGACACCCTACAGATGTAACAGAATCTTACCTTTTGAATATCACACTCGCCACCTTCAGCCATGTATGGATAGTTCTCTTCAGTATTGATGCCTCCCTTCTTCTTGATGAATTCAAATGCCATGTCCATCAACCCTCCATTGCATCCTTGGTTTTGACTAGTGTCACAGTCAACAAGTTCTTGTTCTGATAAAGATACTAACTCATTTGTTTTGATTTGGTTTATCCCCTCTACTGCAACGACAGTTGAAAATGCCCAGCAACTTCCTATAACAGGCAAAAGGTCAGTTTCCATCAGCTATAATATTTTGAAAGAACATATCATATGGTTTACCCTTATGTTATTATGCTAGAGGTGTAAAGCGGCATAGAATTAATGACATGCTACTCTTTTCTTACCACATTTGCCTTGGTCTTTGACAGGAGTAACAGCACCCTTCTTCCTCCAGTCAACAGAGGGAGGGACATCTTCCACATTGGCGTACATGAAAGTTCCATTTGCTCGTGAAGCTCCAAGAAAAGAACGATGATGCTTAATCTTGGAACCAGCATAATGGTGTCTGAATTCATGGTTAGTCATGTCTGCAAACTTGTTCAATTTCAACTTATAAGGCTTATCCTTCTTGTTGAAGTTGTGAACATAGTGTACATTAGCCTTGAACACATTGAACCTCTTGTCTTTCTCATCAAGGCTCCTCGATACAGTGTGATGGCTTCTCCATCTCTCATACAACTCCCACAATTTTTCCTCAGTTTCCAACTCCTTCTCGTGGAAATCGAAACTCTCCCCAAGCCTAAGTACCAAAGCCAAAGAGAAAAGAACCAGAAATAACTTCTTCAT
SEQ 46
AAAACCAACCTGTGAGACATTAACATCCAACTCTTGGGCAATGAAATGGGCAAGTTCTGGAATGCGAGGCTTCAACTCTGAGCAGTTGACACTTAGTGACAAGTAAAATGTAATGAGCCCAACTTTAATTTCCACTGCAAGTCGACATAAAAGATGAATGTGATTACAACCATAAGTCTTTGTAATGGAATTATCTAATTTCAATAGCCATCATATCTGCACCGAAGCCTAGCTCAAGTTTGTGAGAATAGATGTAGTGAAGTAGAAAAAGGGGACTAATACTTGCAAAACTAAATTGAAATCTTGAAAAGTTTTACAGCAGATAAAAGTCAAAGCATTTGAGATTATGCAAACCATTGAAGAGGTACATCAAATTGAAATAATACAAAACAGGGCTATGTTTCAACAATGCAAACAGGAAATATTAGGCAGGAAAAATTTTGCGATTCTGTCATTACTTTAAGGTCTTGCCACAAATTTCTCATGCTTGTCGTTGTCCAGTCACAAAATTCACTAGAAATTTGACAATTGATTACTATAACTTAGTGGATGGATTTTCAGATAGTCGGTATATGGTCAATGCATGTTCACTTGGTATCAGTTGTCGTAGTCCTTAGAAATAACTTTTTGGTCCCTTGATTATACCATATTTGTACTTTAGATCCCTCAACTATTCAGCTTTACACATTAAGCCTACAATTTAACGAACTTTACAGATGTAGTCCAATAATAAACAAAACTAACTAACCCACGACATATTCATATTTCAAGTCTCCTTTTTTAATAATACAAATTTCAAAAGGAGCATTAATGTTGTAAAAGTACCGTTGCCTACTAAAATATCCCAAAAAGATGAACACGCTGCTTTGGAAATGGAACGCACCAATCACACGAGTTGCGGATTAACAAAATCTAAAAATTCTAGTTCTAATTAAGATCTGACTAAATCTGCAAACTCGACAAATTAAAAGGCAGATGTCAAAGTCGGAGAGTTGGACTAAAAGTGGAAACAGGGGGATAACAGGGGACCAAAGATTTTCTCAGTATTCCTTGAAGTTATTATAATAAATTTCCAGTTTAAGTAAATCTTTCTCAAACTACAAGAAGGCTGAGAATGCTGTGACATCAGTACAATTTTATGCAGGCAGTTGCTTCTTAGAAATTTTAAAATACCAGGGAAACAAGATGATTACAAGACTAATTTCAGAGAAAGGTCAGATGTCAACTTGAATAGGATTATAAACAGGGATCTTTACACAAATAGCCGGCTATATTCATGTTTACTTTTTCTAGCCATATACACAGATTATACATTGATGATACACAATTATGCACATATAATACATAAATTATGCATTCACACAAATACCAGCATTCTGGACATAAGAGACAGAATGTTGATTGCCCAAAAATGATCTAATCGAAGGCAACACATCAAAATCAGCATGATGCAATTCTAATTTTGATTCTCATTTTCATAAAAGAAACCACAAACCATTATAGTTCAAAATTGAAGGGAAAATTGAAAAGGGAATTGTATTAATCTATTAGAAAACAGAGCTAGTAGAGATGCGAAAAATGAAACCAGTTCAGGTTATGACAGCATTCTAATGGGCAGAGTAACCTTACAGTAAAAATACTATGTGAAGAAAAGCTGTCCCTTCATACCAGGTGTGTTGTACCCAGGGGGTCCACTAGGAGCTGATGAAGGAGACAGGTGCGCACTGGAGTTTGTGTTATCCAAGCTTGAGACCGATGGTGAAGGTGGAGATGGAGGGGATAAATTGAGTCTGTCCCATAACTCAGAACAATTAGTTTTCCAAAAACCAATTCTTTTATTTTCACGATCGTAAGTAACAAGAGTGTTGCGAACAACGATTCCTGTTGCAATTAAAAGCACGCATGTTACAGCACCGAAGACCTAGGACGATGAACTTTGTCAAGTAGTTAGAAATACCTCCAAGAAGACTAGCTGGATTCTTTCCATTCGGGAAAATTCCTAGGCAATAAGCACCACGTACTTTGAAGTGCTTTATAGCATACAAGTTACAATGAAGAAGGGGGAAATTAATCAATACTAAATCTTATAAAGAAAAAGAAGCCAGTAGATGTGACGCCAAAATAGTGTCGGCAGATAGTTAAGAAAGAACTAAAACAGAATAAAATGCCTACCTGAAACAAGTAATTTTCAGGAGAGAGAGTTAGTTTCTTTCCATCGCTGAATACCATATCGACACGCGGAAAGTTCTTTGAGAGTTCTGATATGTTGCTGGAATATATAAGACATTAAATAATAAGGGTAACACCAACAATAGGAAAAAAAAACAAATTTAGACAAGAAAATCATAGACGTCTAATTAATTTTGGAACTCTCCTTATAAATACCTTCCAGCACCAGAAAAGCAGATATCTTTAAAACTAGGATCTGGCCCTTCAATCTGTTTTAAAGAATGAAGCTCTTTCACTACCTGAAAAGAAAAGTTGAAGAGGTTAATCCCAAAAAGGAAAATACTTAAGTTTATCTAAAGATAATCGCCACAAGTTACATACTGTATAAATTCACAACCAGTGACTCCACTTTTCCTTCGCTAAATGTAACATAGCAAAAGCAATGAAAGACAACATCAAAGATATGCAGAAACTTGCAAGGAACTCTGTATATGTAGATAATTACTACCTGCTAACATCATTGGAAGAAAATGTTGTGTGGTGTATTACCAAATTCAGAAGTTTAAAACAGGAAAAATCAACTATTATAAGTGGTTAAGCAATCTAATTAGGTCCTTTAATAGCAGAGAAACTAACAACAAAAGAAGGGATAGAGGCCATAATATAGCCTTCCTTTCTTTTGATAAAGTACATAGCCTCCAACTCATCACAGGAATAAGTTAGGTGGCAGAGTTATAGGATCACAAGATACCTGGTTCTTACTTTTCTTCAAAAACATGTTCTAGATATCAAAGTGTTAGCTCTCAAACTATTATTGACCAAGATGTTTTAGTTTCAGTCCCACTCATGTACAGGAATCTCCAACTATATATTTGATCCACGTCACAATAGGACATGATGGTTTGTTTCACCAATCAGTGGTGTAAATAGAACTTTGAATGGCAAGAAAAATTTAACAGATTGAGTTACATATAGAAAACACATTATTCAAATGAATATCTTTTCACATCTAATTTCTTCTAGGACATTCTGCTCATCAATAGCGTGCTATTCGCCACTAGTCACTTTCTCAGACAGAACGAATCAAACAAACAGACAGAAACATTGGCAGATATGTAGATAAACTATATCTTACAGCATTCTTGAAAGCTGCAAATGCTGCTTCTGGAAGGTACGCATAGGTGGTACCACTATCAAGTATAGTCCCATGTTTTCCACCAAAAACCCGTGGATTTAGGTTTAGCGGCTTCCCAGCGACATGTATCTCCTTCAGGTCAATATTGTAGTACGGGCTGTTGCCATCCGAGACTAAAGTAAATGCGTAAGATTTATCAAGTGTATTTGCTAGGAGGAGACCTAACATCACTGTGTCTCTAAACAATTGTTTCCAATACATACCTGTGACCAAAATCTGATTTGGTAAAGGCCATGTCAGCAGGGGGTTTTACTCCACCAAGAACCATTGCCCCGCCACCAAAATCCATCCCTCCATAGCACAAGGAGAAAGAATCACTAATTACATGTTTTTCAACAAGTTGATCAACTATACTAAGATCACCTCGGCCCAAACCCATTATACCATCAGCACGTTGGCTGTAAAGATCACCAGTTTCCGCAATTTCACATCCAAAAACAGCTCGTTGTGGTGCAAGCTCACTTAGATTTCCAAAAGATATGATGTCCTCTCCAAGCAACCCATAACTTGCACTCATCTCAGCGTACCGTCTCTCATAAATACATTGCTGCCTCTTATGGTCGCAGGGACAAGCCTTATTGCATTTCACAGATTGATAAGTGCTTGACATTTCCGGCTGAAACTTAGGATCCTAAATAAGACAACATACAGAGTACCACCGATCAAAAGACAAAAATCAAATGCCAAGCAATTCTAAGTGCTATATTAAGCATTTCTCACAAAATTAATAGGCTGACTCAAAGTAAAATTACTAGTCATGAAGTTTCTTAAGTGCTGATATTTTCTCCATTAGGAGATCTTTTATGATTACAAGGCAAATTGAGCAAGAATCACATTTAAAACATCATGAAACTATACAATGGATTTGTACAGCTTATCAACAAAGAGAGGCTTAGAACTATTTGTATCCTAGTTAGATTGGTTTGTTCTTGTTTACCCTCTTTCCCTTAATACTTACAAATAACTGCATCCTACTAAGCGATTTCCTCACAAACAAAAAGTATATTAAGTAATGTTTATGAAATAGCACCTATACATAAACAGTTTCAAATTTTAATTTCCATATCAAGCTATCAAAACACACTAACTGCAAAATTAAGATAAATATGTAACCTTTAATGTTTATCCAGAAAAAGAAAAGAGGGAAAAAACCTGATGGTTGCCACACTTTTTACACTCAGAGCAAGGGACATAGGTAACTGTACTCCCTGTATCAACAATAAGAGCGAACTTCTGCGGTGGTGTTCCAATCCAAATATGAGTTGTATAGTATCTGCATCACAAGTTACTTGGAATCCATTCAACAAAATAAAAATTACTAATAAAGAAATTAGTCTGAACTAGAGGTATATAGAAATATATTCCACAGTGAAAACGCAAAATACGCATGTGAATCAGCAGCCAAAAGAGTTAGTAACAGTGAATTTAAATTTTCTGAGCAAAAGCTACGAATTTGAACCCGTTGAGGAGGAGATCATCATGGAGAGACATGCGAGCGCTGGCAGGACTTTTCTGGAGGTGGCGACGGGAGATTTCCGCACGGCGTGAAGTGTCTTTCGGAGGAAAGAGCGGCAGCAGCATGGTTGTGTGACGGCTGCCGTCGGCCGGCGAAGGGAGGAAAACGGAGCTGCCGTTAGTAACATCAGATAATCGGAAGCCGGAAACGACACCGTAATGGATCAACAGAGAGATGATCGCGAGAATAACGGTGAACTGTGGCCGTGCCAT
SEQ 46
TCAATATTGTAGTACGGGCTGTTGCCATCCGAGACTAAAGTAAATGCGTAAGATTTATCAAGTGTATTTGCTAGGAGGAGACCTAACATCACTGTGTCTCTAAACAATTGTTTCCAATACATACCTGTGACCAAAATCTGATTTGGTAAAGGCCATGTCAGCAGGGGGTTTTACTCCACCAAGAACCATTGCCCCGCCACCAAAATCCATCCCTCCATAGCACAAGGAGAAAGAATCACTAATTACATGTTTTTCAACAAGTTGATCAACTATACTAAGATCACCTCGGCCCAAACCCATTATACCATCAGCACGTTGGCTGTAAAGATCACCAGTTTCCGCAATTTCACATCCAAAAACAGCTCGTTGTGGTGCAAGCTCACTTAGATTTCCAAAAGATATGATGTCCTCTCCAAGCAACCCATAACTTGCACTCATCTCAGCGTACCGTCTCTCATAAATACATTGCTGCCTCTTATGGTCGCAGGGACAAGCCTTATTGCATTTCACAGATTGATAAGTGCTTGACATTTCCGGCTGAAACTTAGGATCCTAAATAAGACAACATACAGAGTACCACCGATCAAAAGACAAAAATCAAATGCCAAGCAATTCTAAGTGCTATATTAAGCATTTCTCACAAAATTAATAGGCTGACTCAAAGTAAAATTACTAGTCATGAAGTTTCTTAAGTGCTGATATTTTCTCCATTAGGAGATCTTTTATGATTACAAGGCAAATTGAGCAAGAATCACATTTAAAACATCATGAAACTATACAATGGATTTGTACAGCTTATCAACAAAGAGAGGCTTAGAACTATTTGTATCCTAGTTAGATTGGTTTGTTCTTGTTTACCCTCTTTCCCTTAATACTTACAAATAACTGCATCCTACTAAGCGATTTCCTCACAAACAAAAAGTATATTAAGTAATGTTTATGAAATAGCACCTATACATAAACAGTTTCAAATTTTAATTTCCATATCAAGCTATCAAAACACACTAACTGCAAAATTAAGATAAATATGTAACCTTTAATGTTTATCCAGAAAAAGAAAAGAGGGAAAAAACCTGATGGTTGCCACACTTTTTACACTCAGAGCAAGGGACATAGGTAACTGTACTCCCTGTATCAACAATAAGAGCGAACTTCTGCGGTGGTGTTCCAATCCAAATATGAGTTGTATAGTATCTGCATCACAAGTTACTTGGAATCCATTCAACAAAATAAAAATTACTAATAAAGAAATTAGTCTGAACTAGAGGTATATAGAAATATATTCCACAGTGAAAACGCAAAATACGCATGTGAATCAGCAGCCAAAAGAGTTAGTAACAGTGAATTTAAATTTTCTGAGCAAAAGCTACGAATTTGAACCCGTTGAGGAGGAGATCATCATGGAGAGACATGCGAGCGCTGGCAGGACTTTTCTGGAGGTGGCGACGGGAGATTTCCGCACGGCGTGAAGTGTCTTTCGGAGGAAAGAGCGGCAGCAGCATGGTTGTGTGACGGCTGCCGTCGGCCGGCGAAGGGAGGAAAACGGAGCTGCCGTTAGTAACATCAGATAATCGGAAGCCGGAAACGACACCGTAATGGATCAACAGAGAGATGATCGCGAGAATAACGGTGAACTGTGGCCGTGCCAT
SEQ 47
ATGGGAGCAAAATCTTTTCTTGTCGCCTTTTTCCTTTCATTGCTGTTATTTCCTTTGGCCTTCTGTACATCAAATGATGGCTTGGTTAGAATTGGTTTAAAAAAGATAAAATTCGATCAAAACAACCGACTTGCTGCACGCGTCGAGTCCAAGGAGGGGGAGGCTTTGAGGGCCTCTTTTAGGAAGTATAATAATCTCCGTGGTAATCTTGGGGCCTCTGAGGATACAGACATTGTAGCACTGAAGAATTATATGGATGCTCAGTACTTTGGGGAGATTGGTATAGGCAGTCCCCCTCAGAAGTTCACTGTCATCTTTGATACTGGTAGCTCTAATTTGTGGGTGCCTTCATCAAAGTGCTACTTCTCAGTAAGCTTTCTATTACATTTTTACTGTCATAAAACATAACAGAGAAAGCTAATGTTGGCGTATGCATAATTGACGAGCATCCATATTTATGCGTCTCTGTATTTATGCAGGTTCCATGCCTTTTCCATTCTAAGTACAAGTCAAGCCAATCAAGCACTTATAAGAAAAATGGTTTGTGTCTTGACCTTTGTCTATAGCTGAAATTGCTGCATGAAAACATGCTTTTCTCTTAAACTTGTTATTACGCTCAATGCTTGCTTGTAAGAGAAAGTGTTCAATTATTGCGTTTTGAGATCAAAACTGTTAACCCTGCTCCCAACTTAGGAGATTTAAAAAAAAAAAAGAAAATAAAGAAGACCCTTACCATTCTTATTGTTGTCATCCAATTATGTGCCTTGCACCAAAGATTTCTGTTGAAAAATATAACATGCGAGATTATGTTGTTGGCTTTCCCTCCCAAAAGATGTGCTAATGTTATATCTCTGATTTTTTTCTTTCAATTATTGGCAATAAAAGCTTGTGCCTTTTGAACCGTTTTGTCTATCGAGGAACCTGTTATGGTGGAGTTCCTTTATTGAGTTTTGGTATCCATCATAATTTACTTTCCGGGAAAATTGGAGTCTGCTGTGTGATTGACATGACATGATTTTTGATTATTCTTCTCTGTCTGCTTTCTAAGTTTCTACATTCTCGGTAGAGGTAAGATATGCGTACTATCTACCCTCCCCGGACCCCACTTATGGGACTAGGTTTTTTTTGTTGTTGTTGTCGTCATCTACTTTCTAAGTTGGTCAACGTGTTCACTTGGTTGTTGACATAAGAACCTGTTCATTCAAACTTTTTTCCTGTTTAATATGCCATACAGGGAAGTCTGCTGCCATACGTTATGGTACTGGAGCAATATCTGGATTTTTCAGTCAAGATAGCGTTAAAGTTGGTGATCTGGTTGTGAAAAATCAGGTGAATGTGGCTTCCCACTTTGTGTGTGTGTGTGTGTGTGTGTTTTAAAATGTTTCTCGAGCATATAGTCTCTCATCTTGTTAATGACATCAGGAGTTCATCGAGGCAACCAGAGAACCCAGTGTAACTTTTTTGGTAGCCAAGTTTGATGGTATATTGGGTCTTGGTTTCCAGGAGATTTCTGTTGGAAATGCTGTACCAGTATGGTATGTGGGTTTATTTTGTTTGTGTTCTCTTCTTTCCAAATGTTTCTTCAATTTCCTATTATCCAAGTGCGTGCCTTGTGAATTTCATTATTACATTGAAATGATTTTATCTTCTGGACAGAATTTCATTAACATCTCCTTCTGTATAAAGGTTTAAGTGATACTGGTCTTGACAGTTTCTTCTGTGTTTTATAGGTACAACATGGTCAAACAGGGTCTTGTCAAGGAGCCTGTCTTCTCATTTTGGCTCAACCGAAATACAGAGGAAGATGAAGGGGGCGAAATTGTGTTTGGTGGGGTTGATCCTAACCACTATAAGGGAAAGCACACTTATGTCCCAGTCACACGGAAAGGTTATTGGCAGGTAGATATCCCTATATCTTTGGGAGATTGATGTTTGGCTTTTGCAACCGTTTTCTTACTCTCAGAATATAATTTGCAGTTTGACATGGGTGATGTTCTGATTGATGGTCAAGCTACTGGTATGTTATGTTACTTCCTTTTCTATTTTTTTGTGTGGAGATTTCGAGGATAAGATGAGAGCACTTTCACATGATTTCCATGCTTTTTCGTTGTATTGACATACTGAATACTGTAGGTTACTGTGACAATGGATGTTCTGCAATAGCGGATTCTGGGACTTCTCTCTTGGCTGGTCCAACGGTATTCTCAAAAACATGTTCCATTTTTTGTTCCTCTTATTCAGCTATTATCAATAATGAACTGTCTCATAATTTTTTTTGTACCGTCCTGTTCATGTGTAGGTTTAATTTTTTCGCTGGAATATGAGTTGAATAATAATCAGCCATTCATTTGAAGTATTCTCATTTTTTCCGTTTCTATTCAAAAAAAAAGGAGGATGGCAAGTGCAGTGATATTGATATTCATTCCAGTATCTGGACATACTTCCTTGTTGATTTTCATACCTAAGAAATGTTTCTTTTTACTTTTGATCTGTTGTTTCTGTCTTCTTTGTGTGCTCTTCTTCTTTATTAGGAAAAAAATTGTGCATCTTGAGAACTGCTTCTTAATTGTTTTCTTTTATGGCATGGTTGACAATATGATACAAGGAAAAACTGCAGCTTCTTTTGTCTAGACAATTGTAGTAGTGAAATGCTTTACTACTACATTTCTAGTTCTCATCATTCTTCCCTGTATCCTTCCTCCTCTATCTTGCAGACTGTAGTCACTATGATTAATCATGCCATTGGCGCCTCGGGGGTTGTAAGCCAACAATGTAAAGCTGTTGTTGAACAGTATGGACAAACAATAATGGATATGCTTTTAGCAGAGGTGAGCAATTATTTGTTTTAGTTGATAGTTTTTTGTTGTTTTTACCAATAGTTTTCCGTGGTATCTGCAAAGAGGATGGTTTCATGCTACTAGTTGCCTTCCCAATATTCTGATGCATTGGCGTCTTAACAGGCACATCCAAAGAAGATCTGCTCACAGGTTGGGTTATGCACCTTTGATGGAACTCGTGGCGTTAGGTTAGGCTTCAGACCCTTTCTTTCCTCGCCTTGGCCAATCATTTGATATGGTAAATCCTATTATAAAATGTGTGCTGAGTGGATTTATGTCCTCCACGTGTAGTATGGGCATTGAGAGTGTTGTGGATGAGAATGCTGGCAAATCTTCAGGACTGCATGATGCTATGTGCTCCGCTTGTGAAATGGCGGTTGTCTGGATGCAGAACCAACTTAGACAGAACCAGACCCAAGAACGCATCTTGAACTATGTGAATGAGGTAAATAGCATCAGTCACATGCTTTCTCTTCTCATCTTAAGTTAGATTACTGACCATCTTTAACAGCTTTGCGAGCGACTACCAAGCCCAATGGGACAATCAGCTGTTGATTGTGGAAAGCTTTCTGGCATGCCTAGTGTTTCCTTCACAATTGGTGGCAGAACATTTGACCTCTCTCCTGAGGAGGTATGTCTGATATCAATCTTGCGTAGTGTACATGGCGTCTTCTCATTTTGTAAATGGCTTTGATTTTTCTGAACAAAGTGATTGGTTGTAGAATCCTTTTGTCATGTTTCAGTTAGGCAGTTCATTTCTTTGTGGTTTTCACTAGATTAGCTAGCAAGGTGTTACTCTGCTTTCAAGAGAAGTACACTTGTCTTTTAGAAAATTTCAACCATGACAGCTAAGTGTCGTTTGGATAATTAATGATATTGAAGCGTGTCGAGCTTTAATATCAGTTTCTTTGCTTGATAAGTTAACTTGTGATCGGATAATTAATGTTATTGAAGTGTGTCGAGCTTTGATATCAGTTTCTTTGCTTGATAAGTTCATATGATTGTACTAAGCTTGCATGCTTTTCTTGTCACCAGTACATACTCAAGGTGGGCGAGGGTCCTGCTGCACAATGTATTAGTGGCTTCATTGCCTTGGATGTTCCTCCACCCCGTGGACCTCTCTGGTATGTTTTCTTTTCGTCTTAACGCACAAATGCGTGGATTCTGTTATTACCAGCTCCCTTTTGATAATGTTGTTTGCTTATGGCTTTGGTGGTGCAGGATCTTGGGGGATGTTTTCATGGGTCGATATCACACCGTCTTTGATTCTGGAAAACTTAGAGTTGGATTTGCAGAAGCAGCT
SEQ 48
ATGGTTGTTGCATTTGTGGGCATAGCCAAGTCTATCGGGCAACAATGCTTGAGGCGATCAAAACCCTACTCTTACTCTTACTTCTCCAGCTATGTTCGTTCCTCAAATTCTAAGTATGGACTCCAAAATTGGCAATTTCAGAGTCATAGAACTCTAATTTTACAATCGGCTTCTGAATCCGTCAAATTAGAAAGACTCTCCGATTCCGATTCCGGTAATTCCACTCACTTTGTCCTATTTTACGGCGTACTGTTACTATTTGGGGAATCAAACTTCTTTTAATTTTGGGTACAATTGCTTTCTGGGGTTAATTAACAGGGATTTTGGAGGTTAAATTGGATAGGCCCGAAGCGAGAAATGCAATAGGGAAGGATATGTTAAGAGGATTACAGCAAGCGTTTGAAGCCGTGAGTAATGAACGTTCAGCAAATGTTTTGATGATCTGCAGTTCGGTTCCCAAAGTGTTTTGTGCTGGAGCTGATTTGAAGGTATAACAGCTTCTCTTCATGTTGTGTTTTTAGGAAAAAATGAGGCAAAAAAAAAACTTTTGAAATCTTGTGCAGAGTATAGGACACACTATTTGGTTAACAAAAAATGTGTAATCAAAGGGTTCGAAGTCAGATTAAATAACAAAAATTATGGGTAAATATTAAAAATTTTAAAACTTTTAAAAAATCATACTCGTTCAAATTATATTAAATTTTTTCATTTCTCCTCTTTGCTTCTTCTTCTCCTTCTTTCTTCGTTCTCCTCCTTAATTTCTTATTTCTCTTCAACTTTCGTTGTTGTTGCTGCTGGTTCGTCCATGTCATCTTCTTCTTCTTCATTTTATTTTTCCATCTTCGTCTCATTTTCATCTTTCTTCTTTTTTTCATTTTTTCTTTTCTTTCTAGTGGTTTAAAATATACGAGAAAAAGAGAAAAATATGAAATTTTACAAAGTAAGTATTTTGCAAAATACCCTCGAGATATTTTGAGACATACCCATAAATGAGTATTCTACTGAAACATCCCCGGGTTGTTGTTTGAAACATCCTACGGGATATTTCTTCTACTTCTTTCTTTTTTTCTAGTGTTGTTCTCATTGATCTTTTTTTCACAGATGTTTCAAACGTTCCCACAGATGTGTGCAGATGTTTGAAACATTTCAATAAATGTTTGAAACATTTCTTTAGATATTTGAAACATCTTTCCTAAATGTTTGAGGATTAGGAGTGGCGGGGATAGAGAGAGGAGCGTTGCGAGATGGAGTAACAAGGGGAGGAGGAGGAGTGGCGGGAGGAGGAGGAGGGTTTTTTTTTTTGTAAATAAGAAAACTTTGGGGGTTTTAAAAATAGTGTCATTAACCCTAATCACAGAAGTGTCCCTTTTACCCCATTCTTAACACTTTTGTCTTAAAAAGTGATATAAGTTTTGAAGTGTCTTAAAAATTTAAATGCCCCGTGTTTTTAGTTGTAATATGTTTTTTCTTGCAAGCAGTATCACAACTTGTAGATATTACATCTTCCTTTTCGTTATGTATCCTCTTCTCTTCATTGTGCCAACCATTGTATTTGTGTTTTCAAGCAAGAGACAGATCCAAGATATGAACTTTATGTGTTCGAGTTCTAAATTCTGCCTAGTCCATTTGATTTACGGGGTTTGAAATCTATATTTGTACAAGTTTTGGTGAGTTTTTTAACACATATATGTTTCGTTGATTCTCTCTGTAATGTCTTGTTGTCTTCTAGTATATTTTTCTTTTGTATCGCGTGACATGTTGACAAGAAAAACAATAAAATTTCACAAGCTAGGAGCATCTGCCAAACATACAACTTTCAATGTTGAAAAGTTTCTCTGTGGTTTGATCTCTTCGCTTGTGCAGCCTCTGTGGGACTCAAACATCTTCACAGATTCAATTTTCTTTTGTGGGGACTGCATTTATGATTGGCATGGCATATGATTGAATAGCTGATGTTTTTGGTGTCAGATGTTGATTTACGCACTTTAGTAGTCTTTCTACTGTTTGTTATCTGTGCTGTATTTTGTACCTACTGATAGCTAATTCAATTGATTGGCATTTGCTACATAGGAACGAAAAACTATGATTCTTTCTGAAGTCCAGGATTTTGTAAGCACTTTGAGATCAACTTTTTCCTTTTTGGAGGTACGTGATTTTTATTGATGTTTTGTTTAATATATTAAAGATCATAGTGTCTTAAAGCTCAAGAAGAAGTTTTTTTTATCAATTTATGAAAAGCAGAAGTTATCAGTTTATAATCTTCTGAATTCTTCCTTCAAAATGATTGGTTCATGAGATCATACTATGTCTCGTTTCTTCTTCCTCACTTATTACTTTATCAACCATAAAGTGGTCCAGTTGTACATATCCTCTTCTTTTCTTTCACTCTATTTGAGTAAATATTTTCTTGCTCAGGGTCTTCATATTCCTACAATTGCTGCCATTGAAGGTATAGCATTGGGTGGGGGGCTTGAAATGGCGATGTCTTGTGATATCCGTATATGTGGTACGTGCTTTCTTGCACTTCTGGGTGTACCATATTTTCTCCTTCTTGCTCTCTAGTTTGATAATGTGTTAACAGGTGAAGATGCAGTGCTGGGCTTGCCAGAAACAGGACTTGCTGTAATACCAGGGTAGGTATGCCTTAATTACGCATTATATGTTTGCTTATGCAAAATCCCAAAATTCTTTGAAGGATGTGTTAGCTATGTGTGGTTTATTTCTAAATTTATCAGATCAGTGGGACGCATTTCCACTATCTTTTTGTCACTTTGTAATCTTTTACTATTCAAAGGTTTCCAACTTCAGAAGTTGCTATAAATACTCTGTATGCATAGATGATCTTCTTAATGGTATCCTCTTATTTCATACTGGCATTGTGCAGAGCAGGAGGAACACAACGGCTTCCTAGATTGGTTGGAAAATCAATTGCAAAAGATATAATATTTACTGGCCGAAAGATAAGTGGGAAAGATGCTGTATCAATAGGTACGTGTATGACTTGTCAGAGCTCATTTGTCAAGAGACAGGACTCCTTTGTCTTTCCAAGTTCTCTCTTGTTAATATAAAGATAGCAGTGATGTCAGCACTTCATTACAAATTATGGGTTAACAGTGTCCTCCAAGGTTTAGGCAGATAGAAAGAAATCATCTAATTTTTGCTTCTGCTGTAATTTTGGACCTTGATCTCCTATGGTTTTCTTTTCCAATTTCTTAGTGAATAATACATTGTATGCAGGGCTTGTCAATTACTGTGTTCCTGCTGGTGAGGCTCGCCTCAAGACACTTGAACTTGCTAGGGATATTAATCAGAAGGTTAGACTTTAGTTATTGAGATAAAGAGGATGTGATGTATTTATCCAGTGTGCCACCCATATGACTTCCAATTGCAATTTAGTCACGAACAAAGAAGAAACATAAAAGAAGTCCAACTCTTCCTATAAACAAAATGATTTCAAACTGTACTGTACATAGATAATTGTAAAGATTCGTTAGCAGTAACGTGTACTCTTTTGTACCCTTTTCACCTTTTATGAGTTATGCACCCTCTTTTGTACCCTTTTCACCTTTTATGAGTTATGCACCCTCAAGGCCATGAAATATGCTTGTCACTGGATTTTCTTTTCTTTTGTGTGTGTTGAATGAAGTTGAGGCTCTTGTTTGAACTTTTTATTGTCATCCATGGACCTTAATTTAATGGCATTTACTAATCCTATGCTTGTTTGTTTTCCACTTTGTGCACTGCACCTTCATTTTTTGTGACAAGCTTTGTTTCTTGCTTTTGGTCTTTTTCTGTCTTGTTTTTTCTTAGGTGGAGGATCATTGACTATTGCATAGTTCCTTGCTTTGGTTTCTTTGTTTTCCTTTTCCCCTTCTTTTTCGATTTTTAGCTATTTTATGGCAGGTTCACATAGAAAACAAGTGTTACTCTATTGTTTTTCTTTCCCTTTTTTCTCCATGCATCTTATAGAAACCGAAGCTTAAGGTTTCTCCACGCAAGCTGCAACGTTCTGTTTTATAAGCTTCATATATTTCTGGTTTTCATGTAGTATGAAATGATATTGAGTGGGATTATTAGGAAGCTGAGACAGATTGAAAAGAACAAGTAAAAGCCACATTGGTGATTCCCTCATGCTTTCAACCTTAAAAGGTCATTTCAATGTCCAGGGTCCAGAAAAGGGACTCACTCTATTCATGTTCTATAAGAATGGAGTAATCCACTTGACAAGTTCTGGTGGTACTACTTCTTCATAAGGTTTTATTATACTAGTCAGGGCTGTGTTGAAAGGATATGGTAGCATCAACTAAGTCCAATTGTTTCGTATTGTATAGAGCCTTCCTTTTTCCTTTTCCTTTTCCCTTGTTGACTCTGCTTTTATACCTCCAAATGGAATGGTCGTAAAGCTTTTGCTCTTTATTCAATAAGCTAAAACTTCTGATGAATAAATTTGGTTTAAGGATGCACGAGGAGTGAAAGTAAAATAAATATTGATGAAGGTTTTGCTAAAGATGCTCTTTTTTATGCTCGGGTTTTGCATGTCAACTGACATATACCTTATCAATGTCGACTGACATATATTCTCTGACAGGGTCCGGTGGCGTTAAGGATGGCAAAATGTGCTATTGACAAAGGAGTGGAGCTAAATATGGAGTCAGCCTTAGCTTTAGAGTGGGATTGCTATGAACAACTGTTAGACACAAAAGATCGGTTAGAAGGCCTTGCTGCATTTGCCGAGAGGAGAAAACCTAGGTATAAGGGTGAA
SEQ 49
CTAGCAGCAACCAGCTATAGGAACAAATGTGTCAGCTCGAAGTGACATTGGTTGGCAGCTAACATCCTCACAACTCTTGTGGTAAAGCATCTTTTGTACATAGTACATCAGATAGCATTGTGAGGCCCTGACAACTTCTTCATCGACTTCAGTAATCCATGCATCGTCGCATTTGTACCATTGGTTTCTCAAACGCAAATAAGTCACATAGTGACCTGATTCTAACATCCCTGAATGTGTGACCACAGCGAAAATTTCAAATTCCGTAGAAATATCTGATTCATCACCGTCGAATGAAAAGATTCTGTTCCCGTATCTCTTTCGTACAATTGAAGATGATAAATATGGTTTCATGTCTAAAGAAAAAGGAAATTGCAGGTGGCGGTCAATCTTTCTGGACATTTTTCGGGTGGGAGAATGTTCAAAGCGTTTTATATGAAAAGATAGCACCAGCGGAAGCTTCTTGATGGACATTTGTTTCAATGCATCTTGCTTTTCCTGACAATTTTCACAATACAGTTTCTGATCAGATCCCAACTTTTCTGGTCGTGTGAAGAGGTCCAAGCAACCTACAAGAGATTCATTTGGCTTACTCGACTTATTAGCAAAATCCTTTGGGCTGGAGTTGCAGCTATTCAAGTCAAGAGAAATGTCCATACAAGGATCATGAGTTGTTGAAGTGAATCCGCACGATGTACATGTGACATCAGATCTCAAGAGCCCATAGAAAGTCCTATGAGCAATACACTGGCAATCTCCATTATCTGTTTAAGAAATGTTATTTTTATCAGAGAAAAAGAACTAGAGGAAGCCACAAATCTTCAGTTCTCAAGGTTATATATGAATCATCCCTCTTCTGATCAATGCATTGACAGCAAATGGAAGTACAGGCCAAAATTGATATCCCAGAAAGGAAATTGCGCATGTACATAGACAAGATCTGTACTATATTTACTCACTGCGTAGCATAATGTCCTACATAAACAAGAAGAATACCAGCCATTCTCAGCAAAAGATACCTTTGGTTGCCAAACTAGCTTTCCCCTCTTTATCATGGATCCTGTCCATAACTGAAATGAAGAACTCATGAGCATCCTGCTGCTCGTAGGTAGCAAGATTTTCTGAATGCTGCCACCAACTGCAAAAATAAACAACATTTCAATTAAAAGAATCACTACATAAGTTTTCAACATGTCAACGTCCAATAAAAGTTAACATTTTCCACGTTTAAGCATCCAATCTAAATTAAACAAATGATGATATCTTAGGTGGACATAGCAGCATAGAGAAAATTTCAACAGATTTATTTTATCACGATAAGAGAATAGCAATCTTGCTTATTGTCTGATTTTAGCGGTAATGCACCAAATCTTGTTTATCCAAAAACTTCAAAGTGAAAACCTATGCAGTTAGCGCTAAAAATTGTAAACATTATTTACAATTTTGCAGGCCATGTTATAATCAAATATCCAGTATCAATACATTAAATGGTGAGCACAGATAAAAAGCAATTAAGAGATAATGACAGGAGATAAATCCTTTTAACACTTACAGACCTGTAAAGAAACCGAGCTGGACTATAAGGGGTCCGATCACCAGAAAAGACAGCTGAGAAGATAAGGTCAATATCACAAGGCAGGCACAACCGATCCGACGACATCTTTCTGCAAATATCTCGGTTATGCCTATCGCTAAGGAAGTAATTTCTCAAAGGGGGTGCATGAAGTAACACTTGCAACACAGAGTTCATGAAACAAGTATTCCCCAAATTGTTCAAACCCCTTAATACTAAAGGAAAACATGATTTCGACTTCTGATCCCTCCTTAAAAACAACGTCTTCATATTCTTTGAATCTAAATCCATCCCAAAACTCAACCTTCTCCTCTTGCTCAACCTCAACTCACTCTCCACAACCCCAATTTCTGTTCTAGGAAACCCCATTATGTGTTTACACATCACAACCTTATCAAAATCAGGATCATACACCTGATCACAACACACTGAGCAATAAAGCTCAGCCCTTTCCATGTCTACTGAAATCTCATGCCCAGCTTTACACTGACTATGCAAAAGGGCATGGTTTGATTCAGGTGACAAACAACATAACACTGATGAACAGATCAAACACATGTAAAATCTACCCTCATGTCCACTACAAATACTACATCTAGGTAGCTCTGATTTGGATATTTCTAATGTGGTCCTACCATATGGGGTTGTCTTAAAACATTCTTGGATCAAACTATACCCACTCATACCATTTTTCACCTTGTAATCTGCAAGATGCTTACAGGGCTTTGGATTTATATATAAAGAGTTACTTGAGCACAT
SEQ 50
ATGAAAGAACTTCATTCTCTAAGAGAGATCGAAGGGCCTGACCCGAATTATAAAGATATATGCTTTTCTGGTGCTGGAAGGTAATAAATTAACTATAGTAATGTTAGATCATTAACTTTTTCTTTCCTTTATTTTTGGTGTTGTTTCTTGTATTGAATGTCTTGTATATTGCAGTGACATCTCAGAGCTCTCAAAATCATTTCCTCCTATCGACATGGTATTTAGCAATGGAAAGAAACTATCTCTCACTCCTGAAAACTACTTATTCAGGGTAGGCATCTTAATCCACATGGTTTTTATCTTACTACCTTGCCTTTAGAATCTGTATCTCCTTTTGGCTTCATCTCTCCTAGTGGCTACATTTTTTGTCTTTGTTTTGATAAGTGGCTCCATTTTCTCTCTGTGTTTCTATATTGACTAATTCTGCCCTTTTGCTTACTGTAACTGATTGCTATAAAGCACTCAAAGGTGCGTGGGGCTTACTGCTTGGGAATTTTTCAGAATGGGAAGGATCCAACTACTCTTCTTGGAGGTATTTGTCATATATATCTTTTAGAATCTTGGGAAAGTTCATCTGCCAAATTCTTCAGTTGTATAAGCTGTAACATGCGTGCCTTTGCTTTTAATTGCAACAGGTATTGTTGTCCGCAACACTCTTGTAACCTACGATCGTGAAAATGAAAGGATTGGTTTTTGGAAAACCAATTGTTCTGAGTTATGGGACCGACTAAATTTATCTCCTTCACCTCCACCTCCACCATTGCCCTCAGGCTTGGACAACACAAACTCCAGTGCAAATTTGACTCCAGCACTGGCACCTAGTTTACCTCTGGAGCATGCACCTGGTACGAAGAAACTGTTCTCCTATCTTTTTGTCACCATTAGTATGCCTTTCAGTCATGCTTTTATCCAGTTTTGTAGTGGAACTGGTTTTATTTCAATTATTCTACCGGAAGGGGGGAGCCTTAGAGCAACGGTAATGTTGTCTCCGTCTGGCCTATATGTCATGGGTTCGAGAAGTGGAAGCAGCCACTATTGCTTGCATTAGGGTAGGCTGTCTACATCATACTCACACCCCTTAGGGTACGGCCCTTCCCGGAACCACATCAATCCGAGATGCTTTGTGCATCGGGCTGTCCATTATAATTCCGCCAGGCTGTTTTGCATCATTTCCCCCTAATATTTTTAATCCATTTTGGTTTCTGATTTGCTATGCTGGTTTTTTGCTATATCGCCTAAGATTAGGTTAGCTTTGATGATTTCACATCCTTTCTTTGATTAAGGTCCATGAATGTTCCTGTGTCTCCAAATGTCAGCTTTCAAAATGACATTTGAGCTTGCGTTTTGTATTGTTTCATCAAGTTTTGTATTCATCTATCTCCTTAGCATTCCAGAGTTCCTGAGAAGCACTCGCTAGTAAAGACGTATTCTGATGTCATGACATTTTTAACCTTGTTGGAGTTTGGACCCAAACAAACTTTTTTTACAGAAGGAAACTATAATTTTAAGGAGTACAACAGTTGCTGTATATAGAACATGGTGAGTAACTCCACTCTTGAGATGCCTTCTCTTCACTGAAGTCAATTTCTAAAAACCTCCGTGCTTGACACAGATTTGTTGGTATAGATTTGTGCTCCAGAGATGCAGATGGGTCCAGCGAATTTTTCACTAGAATTTTTCTTTTTTTTCTCACCACCTGTCCAGCATAGTGCTGTCAAAAGTGACAAGCTTAGAAAAAAACCATGTGCTTGGTGGGGCTTTAACTGCAACATGCTATAAAAACGTTCACTATAAATGTAATGTAAATAAATAACCATAAACACAAAATAGCAATTGTTGGAAAAATTGCAATTTAGTGAAATACACGAGGTGTCAATCAAGTTCAGATCATATTGTAAGTCTTGATTCAAGGTGCATTTTTAAATTAGTAATTTGGATCAATGATTAGTTTTCTTAGACTATAACTTTCAATTTTCATACTCGTAAGAACTGATATACTCATATATAATATAACGTTTTTCTTAATAACTAATAAATGCTTTCCTAACTATATTTATTTTTGTGCTATCCTATTAACAACAGAGCCTGTGGATGTTAGGCACCCACTTTAAGGCCTTTTTCCTCGCACTGAAGCCCTACTTTTAAGGTTTACTGTCACGACCCAAAATTCCACCTTAAGGATCGTGATTTCACCTAGTCTCTAAAACTAGGTAAGTCGATCACTTACAACAGTTAAACCATTAAAACATGATATTATGAAGCGGAGTTTAATATAAATGCGAAAATAAAGGTGATACAAGCCAACACGGCGTTAATCACAACAAATCCCCAAGACTAGGTAATACAGAGTCACGAACTCTAACTGAATACATAGAAATATTTCAAAACAAAGATACAATACTGTTCTGGCAGATAATTGACAGTATAAAGATAAGGAAAGACTACAAGGGACTTCGACGATCAAGCAGCTCTACCTTGAATCCTCGTGATCAAAAAGCTAACTCTGCCTAGGTCCTATGCCTCCAACACCTTGATTTGCACAAAATGTGCAGAAGTGTAGTTTGAATACACCATGGTTGGTACCCAGTAAGTATCAAGGCTAACCTCGATGGAATATTGGCGAGGTTCAAGTAAAGACACTCACTAGTCAAATAACCTGTGAAAAATATCAAAAATGGGCAAATGGAATAATAACATAAAGTCATAACTGTAATCTCTTCAAATTAAACGATACCTATTTAGAATAATTAAAGGTCCCGTTCTGACAATAAGCCATCAAATAGAATCACGCACACCCGGCACCTCGTACCCACATTAACAATCACCCTCGCACGGCAAAGGCCTCGTGCCACAACATAAGATATACCTCGCACGACGAAGAGCTTGTGCCACAATATAAGTCACAACCGCATGGATAACTCATATGCCAATATCACAATCCGCCTGGCGTGGTCACATGCTCAATATCACAATTCGCCCGGCGTGGTCACATGCTCAATATCACAATCCACCCGGTGTGGTCACATGCTCAATATCCCAATCCGGCTGGCATGGTCACCGGCTACCTGTCCAAATGTACATGATCAATGGACATCAAGTTTCATACTCCTGGACTGATATTAATGACATGTTATGGTATATGCATGTGCAAGTGTATTATCACAGCTTAAATCATCTAAGTAATATCAGAGACACCAAGTGGCACATTAGGAACAACACAACAAATCACGTAATATGTATGACACACACAAGGAAGTCAAAAGCAACAACCAGAATACTCCTCTTTCATCAACAACATGCCCCCAGGCCATCACATAACATCCCCTTATTGCCACCCTTATGTCACCACGTTGACAATATCATAATAGCCACCCGTATCGCTCCGCCTAGGCAGTATATCAATAGCCACCCGTGTCACTGCGCGCAGACAATATATCAATAGCCATCCGTGTCACTCCGCACAATCAACAACAGTGAATTGTCATCCTTGTGCTCCGGATAACAACAATCGATCCACACATGTCCACATATGCCACAATATCACAGGATAGTAGTATTAGAGATTTATCACGATACAAGCTCACCACTCATCAACAAAGTGCACAAGGACATATCATTAATATAGAATTGCTGAGGGGTATTCAACATTTAAGCATGAAAGCTACTCAAATTAACAAGAGTCTCACAAGCGCCCAACTTGGCCAAATAAGGAATTAAGATCCTAAAACATGATTTGTACATGGAATATAAATAACTTAATGTCAAAAATAACTTGATGTCATAAATAAAAGCCATAGGAAACGATTCTGAATAATAAAGCTTCTATCTTGAACAAGAATAAAAAGTAATCCCAAAAAGTCAACCCCGGGCCCACACCGTGGAATCCGACAAAACTCACAAATTCCGAACACCCGTTCAAATACGAGTCCAACCATACCAAAATCATCCAATTCCGGCCTCAAATCGGCCTTCAAATCATCAATTTATGTTTTAAAAAAGTTTTTACTATGATCTCCAATTTCTCCCATTCAAATCATCAATCAAACACTAAAATTGAGATTGGAATCATGAGAATAAACAAATCCGAGTAAAAAATACTTACCCCAATCCAAATCGTGGAAATTCCCCCAAAATCGCCCAAATCCGAGCTCTATAACTCAAAATGTGATAAAATAACCAAAACCTTTGAAATAGAGTACTTATAGATCTGCTCCAGGTAAACCCTTCTCAATTGCAGGACCAGCTTCGCAATCGCAAAGCACAAACTTAACTGACCACAGAAATACCCTTCGCGTTCGCGGTACATACCTCGCGAACGCGATGCATGGCTGAGCCAGACCTACGCGAACGCGGCGTAGACCACGTGACCGCGAAGACAATACCACCAGCTCCCAGTTCTTCATCGCGAACGCGTCATTGCCATCGCGAACGCATTGACCAAGCCCCACAAAGCTACGGGAACGCGACCCTCCAGTTGTGAATGCGAAGAGGAAAAACACTCAGCTCCAATCATACACTGCGCGATCGCGGTTAGCCCCTTGCGATCGCTAAGAACGTCAGCAACAACAGAAAACCAGCAACACAACATGAAGGAAAATGGTCCGAAATCACCCCGAAACTCACCCGAGCCCCTCGGGGCCCCGTTCGAACATACCAACAAGTCCCAAAACATAGACAAACCTACTCGAGGTCCCAAATGACACCAAACAACATCAAAACTACGAATCACACCGCAAATTCAAGCCTAATGAACTAATGAACTTTCAATTTCCAAAACTCATGCCGAACCATACCAAATCAACTCAGAATGATCTCAAATTTTGCATGCAAGTCCCAGATGACATAAACGGACCTATACCAACTCTCAGAACCGCAATCCGAACCCGATATCAACAAAGTCAACTCTCGGTCAAATCTATCAACCTTCCAAACCTTCAACTATCCAACTTTTGCCGGTTCAAGCCAAAACAACCTAGGAGACTCCAAATCCACATCCGGACACACGCTAAATCCAAAATCACCATCCAGACCTAACAGAACCATCCAAACTCTGATCCGAGATCAAATACGCAAAAGTCAAACTTGGTCAACTCTTCCAATTTAAAGCTTCTAAAATGAGAATTATTCTTCCAAATCAATCCCGAAATGCTCGAAAACCGAAACCGACCATACACGCAAGTTGTAATACATCATATGAAGCTACTCACGACCTCGAACCACCGAACAGAAATGCAAATGATCAAAACGACCGATCGGGTCGTTACATTTATGTATGCTTCAAATGAGCATTCAGTGACACTGTTCAGCAAAAGGAGAAACTCTACTAGCCACTTGTAGCCACCTCCAGGGACCCTCTCTGTCTCGGCCATGGATTACTTTGAGGAGTAATAGGGCTTCTCCAAGCGGAAACATTCCACGCATGCTGTGATCCTCCATGTTTTCTCTGCTAATCTTTGCTACTTTTTCTGGCGGTCCAACTGGTTGATCAATTCTACTAACACCATCCGAATGAGCCCTCAGGAATTCATCTCCCTACTTCTTTGCCAGATACAAGGAAGGCGTTGTCCTCATAGGGTTGGCCTTGGCATAACTCTCTGATAATTTGACACGTTGCTTTAATTTGGTAACAATTTACAATTTTGGGGGTGCTGCACTTGATTCCATTGAGTTACACCATTTCTCATATTTAGGAATGGTCCTGTGCAAGATTGAAATGTACTGAACTCAGTTTTTCCCTGTGCAGATATTTATGATTGTTATTATTATTTCATCTTTGACCTGAATTGGCAGGGAAAATCAAAATTGGACTCGTATCATTTGATATGTCACTGAGTGTTGATTACTCAGCATTGAAGCCTCGTGTTCCAGAGCTTGCCCATTTTATTGCGCAAGAGTTGGAGGTTAACGTCTCACAGGTAGTTTTTGCATGACCCAAAGTTGTGTCAGTCTGATGTAATCTAAAACTGTATATCCCATTTTCTTTAAGTTACTTAACTGTATTTTAATTTTGTTCAATATGATATGTCACTTATTGGAAGATACCTTGCAGGTTCACTTAATGAACTTTTCGACAGAAGGAAATGATTCCCTCATTAGATGGGCCATCTTTCCTGCAGGATCTGCAAACTACATGCCAAATGCCACTGCAACAGTAACTCTAAACATCTAGAATATGTGAGGACTATTTCTTGATTGAAGAACCCTTTATTCATCATTTACCTATTTGCAGGAAATAATAAACCGGTTGGCTGAGAATCGTTTTCATCTTCCTGATACATTTGGAAGTTATAAATTAGTCAAATGGGACATTGAACCCCCACCAAAGAGGTATAAAAGCTATCTCCATTCTTTGCATGTTCATAAAATATTGAGTTCTGCTGTACAAACTTTTAGCATCATAGCATTACTTATAAAATTATTCTGAATTGTCAAAACAAATGTGCCTTTTCTTTTCAAAATGCAAAATAAATCTCCGCATTGCATTTCAGATGGGAAAAACATGACACGCATCTTTTCATCTTGCCTTAAACACATGTTTGTAAGTTACATTCTAAATTAGGAAACGTGAATGAGTCTACATTGCATCGCACCAGTTCGACTGCATATTCCAAGGATAATGATGAATAGGTGATGACTTTCGTCTCCATTTTTCATTGTTTCAATTTTTCTCAAAGTTTCTTACTTGGATTGGTGGATAAAGTGGCAAAGCCAGAATTTTTTTATAAGGGATTCGAAAATACTAGAATGTCATAATTGAGATCTGAACTTGTGACTTGAAAGCAACTTTTGAATCCTCTTTGCTACTAAACTAAAAAATTTCCCCTATGGCAAGGAGATTCAATAGCTTATATATAACCAAAAAACTTCATTTTTACCCTATTCGCATACTATAATTTGAAATGTTTTTGGTCAAAGTTTAATTTGCTGCATCTCAAAATCTTAATAGCAAAATATTACCTTAATTAACTCTAATGTAAAGAGATTGGATAACACACCACAACAATATTTTTGGTAGGTGAATATTACTTTAATTTTTTTGACAATTAATTGAGATAGGAGTTCTTGATATTTTTTTTTTGGTTTTGGTAACTATCAAGTTGTTGGTTTGATATGGTTTCATTGCAACGATTTAGGATACGATGGCAGCAAAATTACCTTGTTGTAGTGTTTGCGCTACTAGTTGTCCTGATAATTGGATTATCAGCTTCTCTGGGATGGTTAATTTGGAGACGAAGGCAAGAAATCCCATATAATCCTGTTGGAAGCGCTGAAACACATGAAAAAGAACTCCAGCCGCTAAAT
SEQ 51
ATGGTCACAGGTCTGAACTTCCGCCATAACTTTCTCGTGTCAGCTTTACTATACTCTAAATTTGAACTATAATACCACATTGATGTGAAAATTCACACTTAGGTATCGATTTTTTAACACAGAGATTTATTTTGTGTTCATGCTTTGGTTTCAAGTATTGGAGAACCTCGTAATCGTTCTCTATAAGCTTCTGGTTTAACAGATCCTAATTTTTCTTAGAAGCTCGAATTATTTTGTATTGGAATGAAATGAACCTGAATATTGTGGACGATACAGAGGAATTATTGTGGTATAGTTGATTGATTGATTGATAGTCTTAAGTAAGAAAAAGACCTATTGGAGATTATGGTGAAGCTTATACAGGAGCAGCTGGACTTGGTTTTCACAATATCTTTTTTGTTAAGGTTAGAATAAACCTGCTAAAATTTTTTACTTATCAAAATAAATAAATAAACTTGCTAGAATTTTTTTCAAGTTGGTGATTGTTTAAGTTTTTTCGATTGTTTTTTCCTTTGGTAAAAACGTTTTTGGCAAGAACTATATTTTGAAGTTGTGGTTTGAGAGTGTTTGTCAAATAATCTTTTCAAACAAACTCTCTTTTCAAATATCCGAACATCTTCAACTTCCACGAAATAGGTGACAACTGGATTAAATTTGGGGGGGGGGGGGAGTGGTTGATGGTGTATTAAGTTCAAACATCATTTATCTTTTTCTCTAGGAGCAGATTTTTAAATCATTATAGATACATTGCTTGATGTATGTTTGAGAAATACCATTGGTGTTTCATTTAGGCATCATCACTGTAATAGTTTGGTTAATGTTTTGTTAATTCATCATGGTGGTTCATTCAGACAGCATCATTCGGCTATGATGTTGATGTTGATTTGGTTACACAAGCAACTCATTGGTGGAAACAGTTTTCTAGGATACCCCTTTTATCATTTTCATTTGATTGTACCTCCTGTTTATTTTTGCACTTGGACAATTACGGGCTACAATTCTCTCCTTGCAAATCTGGTGATTGGTTGCAGTAAGTGTAAAGTGGCAAAAAGAAGTCTATCCTGCTGTGGAAATTGACACTAGCCAGCCTCCATATGTTTTCAAAGCCCAGCTGTATGATCTAACAGGGGTACCACCTGAAAGGCAAAAGATAATGGTCAAAGGTGGTTTGCTTAAGGTATAAAATTTCGTTTCACTTAGCTTGTTATGCCATTTTTCACTTTGCAAATAAAGCACAAAACTCATTTGTGTTTTGAGGAACGCTGAAATTCTCAGCATCTGATGCTTTGCTTGTTAATTTTGTTGTTAACTCTTCGGTTATTTCTATGGTTGTTTAGGACGATGCCGACTGGTCGAAAGTAGGAGTAAAAGAGGTACACGGCTACTCATTGAATTACTCTCTATTTTTATGCAATGAAGTGCCAATTATCTAGAAGCATCTGTTATTTATTATTTCCAGGGTCAAAGGCTGATGATGATGGGAACTGCAGATGAGATTGTGAAGGCCCCCGAGAAGGGTCCTGTTTTTGCTGAAGATTTACCTGAAGAAGAGCAAGTGGTTAATGTAGTAAGTTTTTTGACACTGATGTTGTTGCATCAAATCGAATGATCCGGAGATGTGTGATTCCTTATGTTTAACTGCTTACATAGTTAGTCTTGTCTCATATGCTGTACTTATACCAGCACTGGATCCCTAGTAGATTTATTGGTATAACTTTACCGCAATTGCTTTGTTCATTTTTTTTCAAAAGCAGTTGCCTTTTCCAACTTCTACATGCAAATAAGCTTTAATATATAATTCTCTATTCTTTTTCCGCTGGCACAAGTGATTTTGTGGATGCCAAGCGCTTGTCGAATGCGTTTCTTGTTCCGCTGGCACAAGTGATTTTGTGGATTCCAAGCGCTTGTTGAATGCGTAATTTCTTCATTTACACATTATGAATCGGCCCTTCCCAAGACCCCGCACATAGCAGGAGCTTAGTGCACTGGGCTGCCTTTTTACACATTGTGAATCAGATTACTATGTTGTTTTAGAGTCCTGTCTAAAAGAACTGCTAACTTTTATAATGGCAAGGCTTAGTTTTGTACTTTTAATCAGTAAATGGGTGATGAGAATTTTTATAATTTTGTTTCCTCCAGGGTCATTCTGCTGGATTATTTAATCTCGGAAATACATGCTACATGAACTCCACAGTACAGTGCTTGCATTCAGTTCCAGAACTGAAGTCTGCTCTAACAGAGTGAGCATTTGCTTCTTTCATCCTTTCCTTCATTTTTGGGAGTCTTTTGGTTTAGGCTTTTTTTTGGTCCTTTTGCTTTAGCCTTGATTTCCCAAAACTTGATCAAATTCAATATGGTTGCTTTTAAGTCTAGTTCTGAAAATATTTAGGTCTATTTTGATTGCGTTGCACTTTTTTGGTTAGGCAATTCGATCTATTTGCACCTAATCCGTAATTCCTGTTTTTGCTAAAATTGATAATTCTGATTTTACTGTTTATATTTGTCAAATCTATTAATCATAATTTAACTTATATAATGTGTCGCGTTGTATACACCTAGATAGTATGTATTTACGGAGACAAAGCGGAGAAAACAGTAATTAGTAGAGGAGACATAAATTATCCTGTTTTAATTCCTATATATCCTCCCTTATATAAATATGGACTCGTTTCTCGGCATGTTCTCCTTTGGATGAAATCAATCCAAAATGTAATCCACTTTGAATCAATTTGGACTCCGAAACTGTGGATCTTTCCCGAACATTATCAGAAAAAAGATCAAAATGGCTCCTGTTAAATACCAAGTGTAGGAGTTCCAAAAACAACTCCGTTAGGTACATTTCTTTTTGTGTTCCTGAGATTCTGAGTTTATTTATTCTTCCTGTTAGGTATAACCAGCTTGGTAGAAGCAATGATTTGGATCACTCATCTCATCTCTTGACAGTTGCAACAAGAGATCTGTTTAATGACCTGGATAAAAATGTCAAACCAGTGGCACCAATGCAATTCTGGACGGTACTTTATTTGTCTTTATTCTACTCCTAATATTTTTGGTTACGACTTAGTATTCCTGACTTTGTATTCTTAGAAAATGTGTTTGGATTTCGAACAAAGTTACCATACCTTTGAAGGAGAATACGTATGCTGAGTAGGAGATAGTGTTTGCCAATAATTTCCTATTGGCAGACTTCAAAATATACTCGTTTAGCGTTGAACACTGAACTCGATATATTTTGTCATGACTTTTGTGTGCAAATGGATTTGCTCTTAGAGAGCAAGGATGAAGTACTTTGTATAGGATGCGAATAGAATGACTTAAGCTTGGGCCTGTTGTCTACATGGTCAAAACTTTGTGCATTATCATTCTTGCACAAGGTTACTTGGATTTATATGAGAACTATAAATGTAGCCTATGTTGATATGTTTGTCTTTTTAGTGTTTTCGTCACATGAGCTACTCGGGCATCAACATTTGATTAGGTTTATGTTCACAGGTTTTGCGGAAGAAATATCCTCAATTTGGCCAGCAGAGCAATGGAGCTTTCATGCAACAGGTTCCAAGCTTACCTAAGCTACCACAATGCTTCCTTGTTATTAAAAAAAAAAAAGTGTACCACTATTGCAATTGCTATATAGAGGTCCTACTGACATGTCCTGGATAATAACAGGATGCTGAAGAATGTTGGACGCAACTACTTTACACCCTTTCTCAGTCTCTTAAATCACCGAACTCTAGGTACTACATCTCCTCTCGGGATATTTCTTGCAGATGAAAGTCCCTTTTCTAAATAATTTCCATGTTTTGTTTCGCTAGTTGTTTTCTTTGTTTCAGTTTGGACATATGGTCCCTATTTTGTAAAAATGTGAAGGAAAACTCTCCTATATATACATTGTGCTTCTTTATGTCATGATTGTGACTGCTCTTTATCTCGGTACTTGCAGTGGAAGTCCGGATATTGTGAAGGCTCTCTTCGGTATTGAGTTTGACAACAGGTATTTCTGCAGTCAAATGTTGTTTACCTTCCAGTTATTCTGTTACCTTATCCCCTTTGCATAGAGTTGTTCTGCACCTAAATATTATAAGAGGCATGTGAACTTACTGCTGTATATGTATTGAGATAGGAAGGAATGCAGCTAGTGGTCCTAGGATGTAGGATGTTCCCTGTTCTGACTTTGAGTATCTTCTGGGCAACCTGATGAGAATCAACATCCTCAACTTTTACTCTGTCATATTGTGAATCATGTAGTTGACAATAAGAGATGAATTACTGAAGTTGTTTTGAAAGTTGAAGCTAAAAATCATGTTTATGTTGACTTCTTTTAGTTTCTCCTACTGTTAGTTAAGTGTACTATAGTGCTACTAGTGTGTATTTGTATTACTGCTAATGAAAGGTCTGGCTGGTTTATGCCGTTCTAGGGTATTTTCAACAGGCTGTGCTTTTCTGCTGTAGCCTAGACCTCTGGGCTAATATTTCTTGTCTAGGACCTGACCTACTGCAATGAGGTTGGGAAGATCCCAATGCCCATCCCAGAGTTCTCATGTGCTCAAATTCATCTACTGATATACAAAATTTAATTTTCGATAGTGTGGGAAGCTGTTTATCATTCATGTCTGACTTGATCTATACTGTTCTGACTGGATTATGGTGGTTGTGCTAGAGTCCACTCTGTGACTATGTTCCATATCTGATATTAACTGCTAGATACTGAAGAAAAATGACTTAACTCTGCCCCTTATTCTCATGGTACTGATATGGACCAGGATTCATTGTGCTGAAAGTGGTGAAGAAAGCACAGAAACAGAAACTGTATATTCCCTTAAATGCCACATTTCACAGGAAGTGAACCATTTGCATGAGGGTTTGAAACGTGTAAGTTCGGTTCTTTTCCTCCTTGTATGTCCCAACTTCTAACTTTAGTCTTGTTTCCTCCCAAATGTTTCATATTACTGCTAAGTTCTGTCTCAATTTTTTCTGTTGTGCCAATCCAGTAATCATCCAATTTGATTAAGAGGACAGTCCCAAAGTGAAAAATGACGTATCTAATTCATAGAAATTCCTTTGGTAGCTGTAACCTTTAAGGATAACTAACAGTTAGTCCTGAAATGGTTGGTTGGATGGAGAAACTATTATATAAGATGCCTCACGGGCGGCACATTGGGGGGGGGGGGGGCTTTTNGGGGGGGGTCTTTTGCTTTAGAATTTTTTCATGCCATGATTGGACCAAATGTGGGGCCTGCATGTTGAGGTTCATGCAATAGTTTCACATCAGAGGTAGGCTGAGCCATTGGAGCCGACTCATAATGCTTTTGGTGGGAAAAGATGCTTTGGAGATTTTGTTTAACTTGGTGCAAAATCGTTGATGATGTTTATACTGGAAATCTGGTATTTATCCCCTCAAAATAATTTAAATGCACTCATATGGCCATTGCTTTTTCTGACAGGGTCTGAAATCAGAACTGGAGAAGGCGTCTCCGTCACTTGGACGGAGTGCAGTTTATGTGAAAGACTCCCGAATCAATGGCTTGCCAAGGTATTAACTGGCTCGATTAAATTCCATGGCGATGTAGCGACATATGTATGATCCGTAGCTTCTGCATATAGACTATCTTAATCCACGCCTTTCACATAACAAAAATGCCTTTTGATGTGTTGAAGTAGTTCACCTCATTTTTGGCATCACTTCTTTCTCATTCTCCTTTCTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGCTTTTAATTCTCACACTTTTCTGGATAGTTTCTTTACTATTTGGTGCTGATTATTGACTGGTCTATGTCAAAATCTTCTTCTATTTTTACTATTTCTTTTTCATTTGATGAATAGATACTTGACCATTCAGTTCGTCCGGTTTTTCTGGAAGAGGGAATCAAATCAAAAGGCAAAGATTTTGCGGGTATGTTGACTTACTCATCTTCCTTTTCACATTGATCAAATAGTTGGTCCTCCTTGAAAATGTGTCGGAAGAGGGAAAAAGGATCAGGTTACCATGTCCTGAACCGAGAAAGGGGGATGGAGAAAGGGAGTTCACTGTTTTACTTTGTTTTGTGAGGATGAGCGGTTCTGTCATGTTTAGGTGCTCTAAAGTCTCGCCTATTCTCTCGATTGTCAAATTCTGAACTGTATATTTTAACTTTAGGCTGGTTTGTGATGGTGTCGGTGTAGGAGTCTTGGTCTATAATCCCTTCCCAATAAAAAATTAACTGAAACTTCTCTTAAGTTTCTAATTGATCTTGGAGTAGGTGTAACTTGGGCACAATTACAAAGAGGAGTTCTTATGCACCAATGACTAGATTTCAGCTACTAGAAGTAAGAAGTAAGAGAAGGATGTGATATTAATACTTTGCTATTCTAGTGAGATAGTATGTACATAATATTTTTTTGAAAGAGAGTTCTGGTGAGAAGCTGATATGTTTTTTTTTCTTTTTTCCTTTGTGTTGATACAAGATCTTCTTAACAATCAAAAATATCTGAAAAGCTTTTTCCTGGATTCGGCCATTCGGGATAATAACCCACCCATTGCCATAAGTTCTGGGATCTATAAAGTAGTTGTACTGGGTGTTTCAGATATCTTTGTGTTTGTGAGAATGCACAGGAGAAATCTATTCTATTGATTAAAGTGTTGTACAACCCTATTTATATACAGTAATTACATAATAATAGGTATCTACTTCCCGATGTGGGACACTATAATACGAGAACCAGTAAGAGACTTAGTGAAAATATCTGCTATGCTAGCTAATCATTCTACTTTACAAACTTTGTAACAATATCTCCTGAGAGTATCTTTTCTCTGCCAAAGTGACAGTTGATCTCAATGTGTTTAGTCCTCTCATGGAACACCAGATATGACGCAATATGAAGAGCAACTTGATTATCACACGCCAGTTCCATCGTGCTGATTTCTCCGAACTTCAACTCCTTAAGCAACCGCTTGATCCAAACTAGCTCACACGTTGCCACAGCCATGGCCCGATATTCGGCTTCGGCGCTAGATCAAGCAACTACATTTTGTTTCTTGCTCTTCCAAGACACCAAATTACTTCCTACTAGAACGCAATATCCAGACGTAGAACGTCTATCAGAAGGTGATCCTGCCCAATCAGCATCTGTGTACCCAACAATCTGCTCGTGGCCTTGATCCTCGAATAGTAACCCTTTGCCTAGAGCTGACTTTATATACCGAAGAATGCGAACAGCTGCAATCCATAACTGACTTACAACACTCACCGGAAAAGAAATGTCAGGTCTACTCACAGTGAGGTAATTCAATTTGCCAACCAACCTCCTATATCTTGTAGGATCTCTAAGAGGCTCCCCCTGTCCAGGCAGAAGCTTAGCATTCGGATCCATAGGAGTGTCAACCGGTCTACAGCCCATCATTCCCGTCTCCTCAAGAATGTCTTAAGGCATACTTTCGTTGTGAAATAACAATACCTGAGCTAGACTGAGCGACAGTCTCGAAGTGCTGAAAGAGTTGCTGCTTCAGATTAGTAATACCATCCTGATCATTGCCAGTAATAACAATATCATCAACATAAACCACCAGATAATACACAGATTAGGAGCAGAATGCCGATAAAACACAAAGTGATCAGGTTCACTACGAGTCATGCCGAACTCCTGAATAATTGTGCTGAACTTACCAAACCAAGTTCGAGGTGATTGTTTCAAACCATATAGTGACCTGCGCAATCGGCATACAAGATCACTAGACTCCCCCTAGCAATAAAACCAGGTGGTTGCTCCATATAAACTTCTTCCTCAAGATCACCATGGAGGAAAGCATTCTTAATGTCTAACTGATAAAGAGGCCAATGACGTACAACAACCATGGACAAGAAGAGACGAGCAGACTTTAGCCGCGGGAGAGAAAGTGTCACTATAATCAAGCCCAAAAATCTGAGTGTATCCTTTTGCAACAAGACGAGCCTTAAGACGATCAACTTGGCCATCCGGGCCGACTTTGACTGCATAAACTCAACGACAACCAACAATAGACTTACCTGAAGGAAGAGGAACAAGCTCCCAAGTGCCACTCGCATGTAAAGCAGACATCTCCTCAATCATAGCATGTCGCCATCCTGGATGAGATAGTGCCTCACCTGTAGACTTGGGATAGAGACAGTTGACAAAGATGATATAAAAGCATAATGAGGTGATGACAGACGATGATAACTTAAACCGACATAATGAGGATTAGGATTAAGAGTGGATCGCGCACCTTTCCAAAGTGCAATCGGTTTACTAGGAAGAGGCAAGTCCGCAGTAGGAGCAGGATTAGGTGCAAAACGTGAATCAGCTGGGCCTGATGCTGGCTGCGGACGACGATGATATGTCAAGAGTGGTGTTCCTGTGGCGGGGAATCTAGGAGGAGTCTGTGGCGAAAGGTGAAGGAGGAGCTATAGTAAGCTCCTTAAAGGTCGATACAGGTAAGACCTCAGATATATCAAGGTGGTCAGAAGAGGTAAAGAAAGGTTTAGACTCAAAAAATATGACGTCAGATGATATAAAGTACTTATGAAGATCAGGTGAGTAACAACGATATTCCTTCTGAACACGAGAATAACCAAGGAAGACACACTTGAGAGCACGAGGAGCTAACTTATCTTTCCCAGGGGCTAAGTTATGAACGAAGCAAATGCTCCCTAAAACACGAGGAGGAACAGAGTATAAGGGTGATTGGGGAACAATACTGCATACGGAATCTGATTCTGGATGGGAGATGAAGGCATCCGTTTAACCAAATAACAAGCTGTGAGAACTGCATCAGGTGGTCAGAAGAGGTAAAGAAAGGTTTAGACTAAAAAAATGTGATGTCAGATGGCATAAAGTACTTATGAAGATCAGGTGAGTAACAACGATATCCCTTCTGAACACGAGAATAACCAAGGAAGACACACTTGAGAGCACGAGGAGCTAATTTATCTTTTCCAAGGGCTAAGTTATGAACGAAGCAAGTGCTACCAAAAACACGAGAGGAACAGAGTAGAAGGGTGACTGGGAAATAGTACTGCATACGGAATGTGATTCTGGATGGGAGATGAAGGCATCCGATTAACCAAATAACAATCTGTGAGAACTGCATCGCCCCAAAAACGCAACAGAACATGAGATTCAATGAGAAGTGTGCGAGCAATCTCAATGATGTGCCTATTCTTTCTCTCTGCAACCCTATTTTACTGAGGGGTATAAGAACAAGAGGTCTGATGAATAATTCCTTGAGAAGTCATAAACTGCTGAAATTGAGAGGATAAATATTCTAAGGCATTATCACTGCGAAAAGTGCGAATGGAAACACCAAATTGATTTTTAATTTCAGCACAAAAATTCTGGAATATAGAAAACAACTCAGAACGATCTTTCATTAAGAAAATCCAAGTAATCTTGAATGATCATCAATGAGACTAACAAAATAACGAAATCCCAAGGTTGAACTGGCTCTACTAGGACCCCATATATCAGAATGAACTAAAGAAAAAACAGACTCTGCATGACTCTCAATACTACGAGAAAAGGTTTGGGAATGTTTTCCGAGCTGACATGACTCACACTATAATCTAGATAAACTAGACAAACTAGGCACCATCCTCTGAAGCTTGGATAAGCTTGGATGTCCTAAACTTATGTGAATTAGGTCCGGAGGATCTGTAGCTAGACATGTCTTGGAGGAATTGAGTGAGTTAAGGTAGTAAAGGCCTTCTGATTCAAGTCATGTTCCAATCGTCTGTCCCGTACTACGGTCCTGCATAATAAAAGAATCATCAATAAAATATATACCACAATGGAGGGCACGAGTCAAATGACTAGCAGATGCAAAGGACAGCCAGGGACATAAAGAACAGAATCTAGAGTGACAGAGGGTGGGGGATTTGCTTGTCCAACTCCTTTTGCTTTAGTTTGAGACCCATTGGCTAAAATAATAGTGGGAAGAGACTGTGAATATGCAATATTTGACAAAAGTGATTTATTACCAGAGATATGATCAGAAGCTGCTGAGTCCACAACCCATTATCCAAGAGTACTAGACTGGGAAACACAAGCAAAAGAATTATCAACAACAGAAGTATCAGTCTGAGCAATAGAGGCTACTTGTGGAGATGTCTGCTTGCTCGATACGGAGGGAACTCATTATATTCCCCTTCAAATAAAGAAAATCCCTGGTTACCTGTAGTCTCGGTCTGAGCAACATAAGCATTTTTGAGTGGACGACCTTGTAAAGAATAGCACACGTCACGAGTGTGTCCAAGTTTATGACAATAAGAGCAATTGGGCCTAGATCTTCCAAAACGACCACCTCCTCGTCTATTCTCCATAGTTTGAGATACCTGATTGTCCACTGACTGGGATACGAGAACAGATGAGTCAAGTGTCTGTGATGAGCTTACTAGGTGACTTGGTATTGCAGCAAGGCGAAGTAATTGAGAGAATAATTCATCAACATTGGGGACAGTCGGACTAGCCAAAATCTGGTCACATACTGAATCAAGGTCATTAGGGAGTTCAGCGAGTGTAAGAACTAGAAACATCTTCTGTCGTTGCTCTTGTTGCTTTTCAATACTAGCAGAAACTGACATCAATTGCTCAAATTCTTCCATGACTGCCTGTACTTGTCCTAAGTGTGTAGACATATCCAATTCCTGTTTCTTCAAGCTTGTCATTCGCGATATTACATCATAGAAACGAGATAGTCATTAGTGTATAAAGTACGAGCCTTCTCCCAAACTAAATAACATGTCTGGAATGGATGGAACAAAGGCATCAACTTGGAATCAATAGATCGCCACAAGATACTACATAACTGAGCATCGACCTTCTCCCAAAGTGTTTTGGCCTTTTCATCACCGTCGCTAGCCCTTTTTGTTAAATGATCTTGAACTCCTTGACCTTTACACCACAACTCGACAGACGAAGCCCAAGCTAAGTAGTTTGAACTTCCCATTAAAGGTTCTGAGGCAATCATAATACCATAACTTCCAGAACCCGTGTTTTTAGACCCAAATACATCCACTCCCAAAGACATTATTGGATTGAAAAGAGATCTAGCAAATTAGCACCAAATAAAACAAAGAATCAACTGTGGTTGCCGAAAAACTGCCGGAAAAAATACTGTAGTTGCAGGAAAATTTTCAAAGTGCTCGGAATCAAAAAATAAAAATATGGGAAGGCTCGGAATTGCAGGGCGATCAGACTGTTCTAAAGAAGTTTTCTGAAAAAATGGACGGAACGGGCTCCACGCGCCGGCGCGTGGAGTAGATCTTGCCGGCGAAAATTGTCTTCGGGCGGCGCGTGAGGCGGAGTCTGACGGAGTTGTTTGCTGGGGTTTGGTCGCCGGAGGTTGGGGACCTTATGGTGGTGTTGGTTTTTGCACAACACCGATGGAATTGGTTTTGACGAAAAAATAGCCCTAAAAGGTCACCGGGATGAAGCACGTCGACGACTGGGTTTTCATTCCCGGATGTTTTCTCACTGCCGCTCTGATACCATGTGAGAATGCACGGGAGAAAAATCTATTCTATTGATTAAAGTGTTGTACAACCCTATTTATATACAGTAATTACATAATAATAGGTATCTACTTCCCGATGTGGGACACTAAACATGACTAACTACTTAACAGTGTTGAACATGGGTAACCCAGGGGGGTCTTCACCTAATTCCAAAACTGAAGTGAAAAGAGGAAAAAGAAAGCAGGCTTGTGAGGTTCCGCGTTCCTCTCCATCTGATATGACCTTTCTATATATATATATTCTTTTAAATGATGCTTCCCGGCTAGCTTATGCGCACCTCGATTATTCTATTTAGTACATGCTACCTCCCATCAGAACATGCACAAGGTAACTCTGTCCATCAAGGCTTAGGAAAATAGAAGAAATCACCTACTCTCTCCGTTCCAATTTATGTGATCCTGTTTGACTGGGCACAGAGTTTAAGAAAAAATGAAGACTTTTAGAATTTGTGGTCCTAAACAAGTCAAAAAGGGGCCTAGATTATTTGTGTGGTTATAAAAGCTTCTCATTAATAGTAGAATTGTAAGTTTAAGCTAAATTGTTACCAAATTTAGAAATGGGTCATTCTTTTTGAAACGGACCAAAAAGGAAATAGGTTCACATAAACTGGAACAGAGGGAGTAGTATTTTTTGTTTCCATTGGGATTTGTGATTGAGATCTCATGGTTATGTATGCATATGTTGTGATGTAGTTCCATATCTTTACTGTTTAGATGGTTGACAGGGAAATAGTAAGTTCTTTTTTAACTTAATTACGAAAATAAATTGTCTTCTTTTATTTAAGCTTATGTGACATTATTTCCTTTTTAGTTTGCTTATAAAAGAATTAACCCTTTCTAAGTTTGGAGAACTAAATGTTCTCATTTTACGCTTAATGGCAAGCATTTATAGCCACACATTCGTTCTGCATATTTAAAACCTGAAGTTTCAAAAGTCTTATTTAACCATAGCATGTTTAAGACTACAAGTTTCAAAAGTCATTCTTTTTTCCTTATTAAACTTCCTGTGTAGTTAAACAAGGTCGCAATAAAATGAAATAGAGGGAGTACTATATTTAAGAATGGTATCATACTTGGTAGTTTTTCTCTTCTCGTCTCTCCTTTTTTGGGTAGGGGATGCATCAAGCTGTAGGTTCAATTGTTTATAACTTTTAAATAGCTAAAGAACTTGCTCTAGTAGTGATTTCGGGGGTAAGATAATCTTTGTGGTTAAGAATCCATTGAATATGGGAATAATAAAAAAAAAAAAAGAAAAAGAATGCATTGGATTAGAGATCACAACCATTTTTAGCCGATATTGGCTGCGATGTGTTCTGGCTAATTTTTTCTTGAAAGGTAACAGAGGATCGGTTCTGGCGAATTAACTAAGGTTCATTTTAAATATCACCCATATCCAACCTAGCCACCCATCTGTCACGTATAAAACTTATTGTGGACAAAAATAGAATGGTCACTTCCTTTTCCACCCACCGCTAGTTGCGACGTGACAACAAGGAGTTGTAAATGTGATGTGATGGATTTTAGTTATAGCAGGAACTTAAATGATAATCTAGGAAGCTACAATTTTGCTATGCTTCAAACAACAGTCATATCTCGTATTAAAACTCAGCAAGTCTTGCTTTTTTAGTAATATAGGTTTGCTAATATTTCAAAATCCTATTTTATATTTTCTGCATGTATTGGATCTCTCATTGCTTTAAGATATAAGAAATGGTAATCTTAAACTATGTCTATGCCATCAGAAAGTGGATTACCCGCTGTCGTTGGATGTATATGATTTTTGTTCGGAAGACCTTCGCAAGAAACTGGAAGGTCCTCGCCAGGTACTGTCTTTTTCCCATTGATCAATGTCTTTTAAGAAATGAGGAAAGACCAGACCCTCTTTGGCCCCTCTTTCTCTTTCTTGTTCTGTTATTACATGACTCTAAATTTGCTGCTAGGTTTTGAGGGATGCTGAAGGTAAGAAGGCCGGTTTAAAAACCAGTGAGAAAACTTCAAGTTCAACTGACGGCGACGTTAAAATGACTGAGGCTGAGGTATGAATTAATCTTTGTAATGTAGGAGTGACTTAAGGGGATAAAGAGGGACCTTTCGGGCCCACCTATGGCGGATTGTACAAGGTGGTTTAAAGTGGGAAATTAAGAATGTCTATAAGTAACTTCTGCCTTTTCTCCTTTTTTTTCCTATTGTTTATGCAGGAATCATCTAGTGGAAGTGGAGAAGCGTCTAAAACAACCCAAGAAGGTAGAGAAACACCTCCTTTTCTTGATAACTTGATGACTTGATAAACATATGCTGCTGCTGTATTTTAATTGGTAACAATGTCTGGCATTAAAATTGTAATATTTGGGAGAGAAGTTATTGTCATGAAATTACCTTCCAACTCACATATCCTTTTCAATGCTTTAAATGAAACTCTGTTAGTTAATTGTCAGATATAATTCTGCAGTAATATTGCGGTTCCATGAGGTTTTACAGTTCTTATGACAACAATGTTCTGCCCTGGGTATTCAAACTTCTTTTCACAAAGTCACTGTTAGTATCTTTGATTACAAGCGATTGACCTTCTATTAACAATTTTGGGATCCCATAAAGATTATTAACTTGGATCAGATTTATTCCTTTTTAAATTACTATATGTCCCTAGACACCGGCGGACGTTTGCCAAGTTTCTTTTGAAGGGGGCGCCTTTTAATTTTTAGAACTATGGAAGATCCTTAAGGTTTAGCTCTGCTGATACAAGTACATTTTAATTTGTTTGGACATTTGTTTGTATGTAGAAAATGTAGAGAACCTCTAGCATAGATAACCCCGTACTTGCCTTTGAATTTATATAACTACGAAAGATTCTAAAGGTTTAGCTCTGCTGATACACTTACATTAAGTTGGTGTTTTCTCGGAGTCCTTTAATTTGTTTAGACATTTATTTGTATGTGGAAAATGTAGAGAACCTCTAGCATTGGATAACCCCATAAATTGTCTTAAAAGAAAATTTTCTGAGTATTGGAATGAACTAGGGCCCAGATGGAGCAGAATGAATGTCGGGGATTAAATAAGGGACTCTAACTGGTTCTGGATTGGGAGCAGTAGTTGTGAATGATCAATTTTATCAACAGTTTAGTGTTTCTGATATATAAGGGGAAGTTAGTCTAAGAGCTCAATTTTGGAATTCTGTTATGAATGAGGAGTCGCAATGATTAGTGCTTTTTTTTTTTTAATGATGCCCGATTAGTAAACTCCATTTCAGGTGTTCTGCCTGAGAAGGAACACCACTTGACTGGAATATATGATTTGGTGGCCGTGCTGACTCACAAGGGAAGAAGTGCTGACTCTGGGCATTATGTTGCCTGGGTCAAGCAAGAAAACGGTCAGTTTAACTGGGAAGAGATTTTGTTCTAGTAATCGTTGCTCTTGGACTACCATCTGATACAATATATTGAAAATCTCTTTGTAAACCACAGGAAAGTGGGTTCAATTTGATGATGACAATCCAATTCCGCAGAGAGAAGAGGACATCCCTAAACTTTCAGGAGGTGGTAAGTGAATCACTTGTGTATTACGTCTTCGGCAAATTTTCAAAGTCTGGCAAGCATATCCTTTCTTATAACAACAAGATGTAAAGCAGATGGAATATTTTGTTGCTTGTGTGCCTGAATGTGTTTTTCGTTCTGTCAGTTTATAGAAGTGCTTTATTTTTGGTTTCAGGTGATTGGCATATGGCTTATATTTGCATGTACAAGGCCCGTGTTGTTCCCATG
SEQ 52
CTATTTCACTTGATGCAAGGAAAATTGATTACTCCTGGCACTACGAAGAAATGTTTGGTTAAAAGGACAAAGGTCAGTGCAAAAACCAGAATAGTTCATTGCTACACTAGGTCCAACTAGCCAGATAGATGCACACAATTTGCGGCAGAGATAACTATAAGACACCACAGTGCTTAACTGCCTTTTAACCAAAGACCAAAAACACTCATGAAGAAGAAAAATGACAACCTTTTTATAGCTATGGACTGTCCATTTCATACGATTCCAGTTGTTAATACTTTATGCATGAGTCAAATAGAATTTGCATACAAAAAAATAGGTCCATTTGCAACAAATCCGAGTATAACTGAATGAACAGATGAGAGCCATTAAAACCTTAATGTCAAATCCTACAAAACAATTGGATCATCTCCACAATGCATGACGTAATTCATCTCATGCTGTAAATATATCAGTCTGTGGTTAATATGAAAGTTATAGATTAAAGATTCACAAGACACAAATGTGCCTCCTGAACTTCCTAGAGCAGCATATCAGCTAAAGCAAAGAATAGAAATACACTAATACAGAACAGCAAAGAGAGACTTAATAGACCTGAGTTCCATATTTTCAATGCGGATACGTTCAACATATAAGTCATGATCTTCTGCTTCACTGTTGATCTCATCGTCTTTTTTGGGATCCTTTTTGCGCCGTGAACTTGATCTCCTCGAAAATAGTCTATCAATCCTGGCAGCATAAGAACCTAAGAGTGCTAAGACACAATGCATTCACTAAAACATTGAAAAAAGGCCCAAAACAGAGGCAAAGTGGAGATGAACTTTAGAATTCTATCTTCACATTTTCACCAGAAGGGTTCAGAAAGTTATAGGAACTTCATTTCAAGTACAATGCAAACAGTAAACAGGTTTTTACTATACTAAAGTATCTAAAAGTCACTTACCTTGTTTTTCTCAAAATCCATGCCCAAGTGCGTCCAATAATCATGTCCACTTTTTCCATGAAAAAGTAATCAGTAGTCCTATCAATTCCAATGCCGCGTCGAAAGATTACATACTGGAAAGAAAAGGTACGTATATAGAGATTCAGTGGCATGATCAAACTCTTGACTCTCATCCTTTTCCCTCGAGAAATATATGTTCATATTACACATTGATATACTTAATAAATTTCGAGGTAAAACATTGATATACCATATAAGTTGAACAGCTCAAATACAATATAATACATCCATGATACTATTTGCAGGTTCAAAAAGATATAAGTGACAACACGTGCACAAGTCACTCTTTACCTTTGTTGAAGCTTGTGTGCACTATTACAGGTTTGAGGAGCAGACCTTGAACAATGATTAATCTTTTGAAAACTAGCAAGAAGATGAGAGGATAAAGATTCCAGTTTGTGCTCAGTCATAACACACTTTTTTGAATCCGTGGTGAAAATAAAAATCTTTCATCAAAGAAAAAGAAGAAAGGGCCAGTAAGCACAATATAACTCTAAAGTTCTTCGCAATAAATGATCGCATACTTTTGCTTGTGTTTTATTGGAGTGCAAATTCCTTCTTGCCAAAATCATGGGCAGTAATGGACATCCTGACACCTAAGAGCTAATGTTGTATAGGGACAGTGGTTACTTCAAGATTCACTTTAAACAGCCAATCATACTTGCTGCAAAGCAACAATGAGATAGGGAAATAAATGGACACCATATGTGGATATCAAGGATTCAAGTCGCATTCTTGAACCCCAAACAATGCAAGGCACCAACAAAAAGCAGATTGGGAAATATTCAATTTGGTGAAAATTTCATAAATACAACCAGAAGCTACACTTGTCGACCTAGCATCTGGGAGCCATTAACTTCATACCAAACGTTCCAGCAAAAGTAAAGGCCTCATTTCAAATTGTAACTTGAAGCTACACTAAGAGATATACCTTGTCAGCAAACTCTGGAAGGTCCTCATGAGGATGCTCTGCAAAATACTTCTCTAAAAGTTTTTTGTCAAGCTGCATCCAGGAAAAGGTAGAGTGAATCGCATAGCGAACATGACAAAATGGAACTATATCAAGAGAGCCAATAATTGTCGGAACCAAATTATATGCGGAAGCAGCTGGTTAAGAACCTTAGATTCATCAACAGTGATTGGAAGATTTAGAAGATACTGTCCTGAATGTGCAACATCAATCTCTTCATCACTAGCTATTTTAAAATTGCTTTTGTGCATAATCTGCATCAGCATAAGAGAAATGATGATAGAAAGTGGAGAATGAAATAAGGGTGCCAGAACACAAATGTCATACCCATTTCCCTTAATCCTCCAGCTAGATGTGGAAAAGCATTCAGAATTCACTGCACTTAGAGTTTGATGCAAGTTTAAAATCCATTACAGGCTTGGCCTCAATAGCATGATAATGCAGTGCAAAATAACGGAAAACGACTTTAGCGATAAATGAATAATATATAGCCGTTCCCCATGCTTGAAAATAAAGAAATATCCCCTTCCATTTCGATTTTGCAACAAAACAGCATAAATATGCGAAACTTTCTTATTGTAAGGCTACTCCTTATGATTAAAAATGGGAAGCATGTGTTAGAAAAAACAAAGTAAAAGAGAAAAAAAACAGGAAAATGAAGGACGAAGCAACTCCCACTCCCTGAAGACAAAAACCTCGACCTCTTACCTTTCACCTCACTGGTTGTCTCCCTCCCCTGAGAAAAAACAATTATTCGCTATGTCCCAATTTATGTGATGCACTTTCCTTTTTAGTATGTCCCAAAAAGAATTATACCTTACTATATTTAAAAAAAAATTAAAACTTTCCATTTTACCCTTAATGAGATAATCTATAGCCACAAAAATATCTATGACTTGTTTAGACCACAAGTTCCAAAAGCTTTCTTTCTTTCTTAAATTTTGTGCCCAGCTAAACAACATCACATAAAATGGGACGGAAGGAGTAGTTTTTCCTCTCAATTTAATCCAATAGAATTTTCTTCCATTTCAGTGGAGATCTTACACTTAATAACTGATGCAGGATTATATCTTTTGTTATATTTTTATTTCCTAGGCTCGGGAAGGAGAGGATAGCATGTTTTCACCTCCGCTGGGGATTTTCTTTTGAAGGTACATAAATGTCAGGGCCCTCATACAAGGGTTGTAGATCGAAAGTATTCTAACTATCTTCTTTTTTTCGGGTAAATTGTTGAGCCATGGGTACACACATACACAAATATATAGAGAGAGATATCTTCATATAAGCAACTGCAGACTATAAATATGCACACACAGAAGACAGATATGAGATTGGATTTTTTATCAAATTTAAACTGGAACACCAATATCCCGACTAAAGATACGCGGATACAATAGAATATAGCAGCTGATGGTGTTACTGAAAGGTCCAACACGGGTGCTCAAAGTCCACTTGGGCAAATGCGGCTAGGGAGTCAGAAAGAATGAGATGGGCGACGGACTTGCCCTACTGTCAGAGCTAGTGGTGTGTTCTTGAGATGAAGGCTTAGCATTCACCACCTTCCACGTTTCCGACTTTCTCTTATTTGAACTTTTTAGATCAGTCAACTTGAGTAAGCAACTTGGGCCAAGGGATGAAACTCAAGTCTCTAGCTCTCAAGTATACTGAGTAGGCAACTTGGGCTTGCATGTTATACATATTTTTTTAACTAGTATACTGAGTAGGCAACTGGGGCTTGCATGTTATATAATGAATCCAAACAGCCCATATAGCTGGCTCAGGTACCAGAGGAAAACAAACTTTAGATGAAGCTGCAGATTTTTTACCTGAAACAAATATGTCAAGAAATTCTGTTCAAGAATGTCAATCTCTTCTGGAGATAACTTCTGTTGTTCCAATTTCTTAGCCCCGTTCACAGGATCAAACAAAGAGTATAATTGCTGCAATCAGACAAATATTATTGATCAAAACACAATATCAACATCAGCATATTATATTCATTTGCAGCAGACTGATGAGAAAAGATGTCTGATCAAATAAAAAGTAGCATAAGATTATATTACCATGAGATCCTCAAATTGTAGAAGATACCAAGCATGAATTGTGTACTCAACCCTCTTGCAAAGCTTCAGAAATTCAGCCCGGTCAGAACTATGTTCTGGGCAAGGAGGAAGGTCGGATCATTAGAACCTCAAAAAAGACTTTTACTTAAATAACCAATCATAAAAAAGCACTAAAGGGACTAGAAGCACAGATATTCATGCAGCATTTTACAGTTTCAAGAAAAGTAGATTTTCTGAAGAAGAAAAAAAGAGTGGAAATGAGCTGCCAAGACGCAGACATGATTTACGTATATCCAGAAGGAAAACTTGTTTCTGTTAAAAATTCTAAACATATAACACGTCAAAGCTTATCAGTGAAGGAAGGGTTTTGAAACCTTTTGGGGATTTTGATCCATGACCATCAAGTGTCTCCGTAAAATTCACATGGAGGGTGTACAGGGTCCCCCCTTCTTTTGAACTGGGGTCTCCTCCTTTTGAAATGTCCAGCACTAATTTACTGCGTTGTTTTTACTGTGAGTAATACTGTTACTTATCTATCCAAAAAATTACATAGAGGGAAGTCGTTTCAAAAGCTTACAATTTCTTGAAATCCATGAGGACTTGATGAAAAGATGGAAGCCTTTCTATTTCGGTTAGAGTTTATATGCTCGAGTGACCGTGTAAGGAGAAGTGTAAGATTTAGACAATTCACGGTTATAGAGAAACCCTTATTTGTCCTAAAAGACAATTTATTGAGTAGTGGAATGAAACAAGCCAATGGAGCAGAATGATAATGAGAAAATCACATAACCAACTCCAACTAGTTATGAATCAGGGTGGAGTTGCAAAAGTAAAATCTCTATTAAGCATCTAATTTCTTAAATTCAAGAGATTGTGAACTTCTGAGTTGAGTGAAAGTGACTTACTCTAAGAAAGTATTTAATGATAAAACTCGCAATTGAGATAAGTCAAATGGTAAACCTTGGAAAATAATCTAATTTAATCTGGGAAAATATCTAATTCACACAACAGTAAATCTAATTAATCTGGGAAAATAATCAAGGTCAAATTGTCTGCTTTGTCACAATACAATTAGAGTGCCTCAGAAAACAACATTAAATTTTGCAAAATTAAACCATTAAGGGGGTTGCTGTTTACACGAAAAAAGCTAAAACTCAGACGTCACCAAAACTATGAGATCGTGTAAAGTTAGTAACATTTTAGGTTTAAATTCATGGCACGCAATAATTGCTTGACATTCAGGAGAAAGACAAAAAAGTGACTGCCGTTCCCTATAGAGTCTAGCGTAGAAAGTTGTATTATTTGCTTCAAAGATCTCCTCTTGGAGAAAGAAGCATTGTTTCCTTACACTGCCATTTTCTATGGTCTAGTATAGGAAGTTCTATTGTTTGCTTTAAAAGTCTCCTCGGATTATAGCAGAGTTGTACTTGTGCTAGTCAAATTTGGTAGTTTTTTCCACAGAAGTAACAGAAAGTGAAAATATCAGAGATATTATCCAATTAAATTAGAGGAGAAGAATTGTTTCAAAATTCAAACAACTGATCGACCACAAAATAGAGAGGGAAGAAAGAGTAGCCGGTTGTTTGGGAGCATCCAATTTTTACCTATAAGGTCGGCGAGGGCCATGATGAGCCTGGGTTTGAGGACTGGAATAACAGACTCGCGCTCCAAACGGATCACCTCTTTCTTCTTCTCCAT
SEQ 53
TCACCGGTTTGTGACAACTGGATTTCCGTTCGCATCAGACAAATGGATACTCTGGGTTGACTTGTTAGCGACAACAGGATTTCCAGACTCATCTAAATTAATTATAGCAACATCACCAGGCTTGAGATCCCCAGAAAGGAAAGATTCACTCAGGAGATCTTCAACCATTTGAGTAACAGCCCTCCTAAGAGGGCGTGCACCGTAGTTTCTGTCAAATCCTTGTTGGCATATAAGCTCCATTACTGCTTCTGACACCTCCAAGCTTATTTCCAATGAAACAAGCCTAGCCCTCACCTCCTGCAGCATCAGGTCTAGTATCTGGAGCATCTGAAAGGGAAAAAACAAAGCAGTTACTCGCAGTAGCCGACTGCTTTCTAAAGAGATGTGCCAGAATAAAAGATCACCCCAATGTACATTCAGTTATCAGATCAATGCAACTTTCCAAATCAAACAAGAGGTATATTATACGAACCTGGGGCTTCTCTAGAGGACGGAATACTACTACTTCGTCTAGCCTATTCATCAACTCAGGGCGGAAATATGTCTTGAGCTCTTCCATCACTATTGCTTTCATACCAGCATAGGAGGCTGCTGATTCATCATCAGCAAGCAAGAAGCCAATAGTATTCTGTCTACCCTTTACTATGGCTGTAGAACCCACATTAGAAGTCATCACTATCAGGGCATTCTTAAATGACACTCTTCTTCCCTGTTGAATCAAGTTTCATAATTATCAACGCCAATCTTCCAAAACAAGGTTTACACCCCATATTGTGGAACATTCTAGAACAATAAGCCAGATGTAAAAACGGATCTAACCTGAGAGTCTGTTAGGTGACCATCTTCAAACAACTGAAGGAGAATATTGAATATGTCAGGATGAGCCTTTTCAATCTCATCTAGCAGCACTACAGTGAAGGGCTTTCTTCTGATAGCTTCAGTAAGTGTTCCTCCTTCTCCATAGCCTACATAACCAGGAGGCGATCCAATTAACTTGCTCACAGTATGCCGCTCCATGTATTCACTCATATCCAATCTTAGCATGGCAGATTCCTGTGCAACACAAAAAGATACTTCACTGAGTACATAGAAATTCACAAAACCAAGTGATTGTTAAACTGAAACAGAACCGAAGATAGCAAATATAACTCTAGCAATTTGATGATGAACTTCTAAGAAAAGTAGATGCATAACTCTTGATGATGAAATTCTTTTTGATAAAATCCTAGCCACAATTTTCATCTAGAAGGTTGAACAGAGAATGTCGCTCAGATGAATTTGATTTGATTCAATCTCTGCACTGAAGTCACAAACCGTCTACGACCAAGAGATCCAAATTCAGTGTGATGACAAGATTAAAAGCAGCGCTAACTATATGACTATGATCCATTATCAAAGCTCACTTAGTACTTTTCATTTATTTTGCCAAATTACGTGCCTCCACCATAGGTGATTAGAGACTCCTTCTGTTCTTTCTTTTGGGGCGGGCTGAGAAGGGGTGGAAGGTGCAAGGATCTAGTAAATGTAACAGACTTATCAGATATATATACAATGGTTGACTTCATTTCCCATTGAAATGGATGAAGGAATAATCTGATCCTGGCAACAGGGAAAGAGATTTGAAATAAGCCAGTAATAGAGCACTACCTAGTACCTAGTATGTCTAAACTTGAAGTACTTATTAGTGCCCACCAATAATCAGAAGTCGTGCTCCAACAAGATTAATTGGTGGTATTTCCAACGCTACATTTGACCATGCCGCAATAGCCATAGAAACCTAGAATGGACACACAGCACAGCCTCCGGTACCCTTTCCTTCTCTCCTTTTGTTTTTATTATGTGTGTCCGACCACCTAAAACACAGTCACCACATCTTACTATCATAGTATAATACTTTCTTTACCCGAATCACCACTACCAACAACATAGAAAATTCCCAAGAACACCAGAACCAAAAAAAGTCCTGAGAACACCAGTACTATCACCAAAACCTATCCATTGTCACTGAAGTGGCTGGATTGCATAACTTCAGTGGCAAAACATGGTGACTGTCTGGTGAAACAATTAAGGCATCTAGAATGAAAAGATGAAGCACATTTCTTATCTTACATAATAATTCTTCTCAAATTTAGACAAACTAAAAAGAGCAAGATTGTGTTTGTGCAAATTATGCTGCCAGAACTCTTGGTCAACACGATTCAATTTCAGAGTTCCACAATTTCTACTCAATTGCTTAATCTGGAGACGCATCTTTGGAGGAATAATGCAAAACAGCTCATTTTATTAATACTTACAGAACCAAAATAAGACGCTGCCAAAGCTTTAGCTAGTTCAGATTTTCCAACTCCAGTAGGACCACAGAAGAGCATTGCCGAAATTGGTCTATTTGGGTCCTTAAGACCAGTTCTAGATCTCTTAACAGCCCGACAAATGGCTGCAACAGCCTCATCCTGACCAACAACCCTTTTTTTAAGCTGCTCATCAAGACCAACCAGAAGCATTCTTTCATCAACAGTAAGCTGCTTAAGGGGAATGCCTGTCCAGAGTGAAGCAACTGCTGCTATTTCCTCAGGTCCAACTACCGGAGGTCTAGAGTAAAAATCAAAGGATTTAAGTTAATTTACTTCTGATTTAAATTTATGACTTCAGACAATCTTTATTCTACTATAATGCCCTTGTAGGAACATGGAGGCAGATACGTACTCATCTTCATCAGATGTAGAAGGTGATGCTGGCTGTAAATGAAGTTCACTGCCATCATTCAAACGAGATGCATCATCATTTTCTGTCAGCTTGCTTGCCAAGATCTGTTTATGGAGGGGGCACAGGTAATTGTATTTAGACCACATAAGGAAGTTCAAATTTTTCAAGACGTGCAAAATCTAGAGAGTTTGTATAACATTGAGTACTATTACCACTTCATGCATGGCTTGAACAGCTCTAATCTCCTGCCAATAATCACTTGGTGATTGTGAGAGTACAGATATCTGCTGTTCCTTTCTTCTTTTGTGAGCTTGCATACGAGATTTACTACCAGCCTCATCAATAAGATCAATAGCTTTGTCAGGAAGATACCTATCCGGTATATATCTTGCTGACAGTTGCACAGCAGCATTTATGGCTTCCAAACTGTATATACACTTATGATGTGACTCATATTTCTCACGCAATCCCAACAGTATCTGGACAGCATCCGCCTATATAATTTAGGAAACAAGAATATCTGTTAGGGCCTTAGACACCCACTTGAGCATACACATGCATAGAGTATTATTCACCTGACTTGGTTCATTAATCAAGACAGGCTGGAATCTTCGGGCAAAGGCCTTGTCCTTCTCAATATGCAATCTGAACTCATCCATGGTGGTAGATGCAATACACTGTTACAAGAACAAAATTTAAGCGCAGAGTCCATAAAAGCCAACATTATGAATGCAACTACATGGAGTAATGCATGGAACCAGAATAGGAAATTACCTGCAGTTCGCCCCGCCCAAGTGCTGGCTTTAGCAAATTAGCAATGTCAAGACCAGAACCCTTATTTCCCCTTCCAACTGTACCAGCACCAACAAGGATGTGGACCTCATCTATGAATAGAATGATATTGCCTGCAAATTATGTCACAAGCTAAGCCAATGATCTAAGAATTTGACCAAATTTTACTTCCATTCTATGCTCACCTGACTTTTTGACCTCCTTAATTAATGTAGTCACACGCCCCTCTAGTTCGCCCCTCTCCTTTGCACCTGAAATGAGTAGGCCAATGTCTAAAGACATTACCCGCTTTTTCTGTGTATATAGCAAATAACCACTAGTTAAGTAGGTGCAGCAGCTAGTGGAACAAGAGTATAGGAGATGCATATTAAATTACAAAATAGTTCAAAGAATCCCTGGAAAAAAAATGAATTCATTGAAAAGCCCACCATTAAAAATGCAGGAATATTTCCCTCAGCAATGTTTATCGCCAGCCCTTCGGCTATCGCTGTTTTCCCAACCCCAGCTTGACCAAGCAGAATAGGATTGTTTTTGGTTCGACGGCAGAGAATCTCGATAATTCGCTGAACTTCAATCTCTCTGCCAATTACTGGGTCTATAAGGCCCTCACTCACACGGGCAGTAAGATCTACACAGAATTGCTCCAGCGCATTTTTCTCTGCTGGAAATGAAATGCATTTGAGACAGCGTGCCAAACCAAACACTAACAAGAGAGAGGAACTGGCCATACCTTTTGCTTTCTCAGCGGATCTGTCGATAGTTATTTTTCCAGGAAAGGATTTCTCACGCGACCTTTTGAATGAAATTGGCTCTCTACCATCTTTAGCAAGCTCTCCTTGAAGCCTGGAAACTGCCTCAGCTGCCAAACGATTTACATTTACTCCTAACCTGAAATTAGGTCAGTCGACCATGCAAATCTTAATTTCTTATATAAATAGCGTGTAATAGATGACAGAAAGATAATCATATTGCAAGAGAGAAGGCATATTAAACTCAATCAGGAAGGTGAAGAAGGATCCTATTCCCAAGCCAGTAGCATTGTAAGATAAATTATGCAGCAAAAAGGAAATCCGTAAGCATTTGTTTATTTATTAACAAGTACATCAGTATGCTTGGTTTAGAGCACAACATGTGCTCATGCATAGCGTTCCATCAGAAACAGAAATTGCTTATCAACAGGAGGTCAAAATATTAAGCTAGGGTTGTCAGGCGTACAACTTGAAAAAGAATATGTACACATACATGTACACAACACAAGAAGACACCAAAAAAAAAAGAGAGGCAGATACCATTCATAAGTTAAGAACAGAATACTAAGGTAACAACAACAATCCAGTGTATTCCCACAGAGGATAAGATGTACGTAGCCTTACCCCTACCCCGGAAAGGCTGAGAGACTGTTTCCGGTAGACCCTTGGCAAACCAAAATATCTCTTGGTATTTTGTATACGCTTGGGAGGGAAAACAACACCAAATGGAATGTAAAATATGGCAAGCAAGGATAAATGAGTGAAGGATAAATTCTACCAATAAGTTCATTGCTTCCTAGCAACAGTGGGAGACAACATCTCATTCCTCCTCAATTGTAGAAGCTGTTTTGGACTCAAAATAAGCAACAATAGCTAATCTATTCATGATCCCAAGATTGTTTCAGCTTACCAAAAGCCTATTACTTTTGTTCTTAATAACCTCATCATCCACTACTAGAGGCAGATGTAGTGCTTGAGTTAAAGGTTCATCTAAACCCATTAACTTTGGTTCAAACTTGTATGTATGTTAAAATATGCACCAAATAAGTACAAATAATATATTTCAAACCAAGAATGGGCTGCGGAACGCATATTCAAATCGTTGATCTGCCTCTAACTACCTACTTGCTCTACCAAAAAAACTAGCAGTTCCTATACTGAAAACAAAACTACTTGCCAAAAAAAGTTAAGAAAAAAAAAAGCATTCTTGACTGACTATCTGTAATACCTCTTGAGCACACGAGTGGCGTTACCATCATCAACAGTAAACAAACCAAAGGCCATATGCTCGGGAGCAATAAAATTATGCCCCATGGTCCTTGAATACTCAACCGCAGCCTCAAAAACGCGCTTCGTACTTGAAGAAAACGCCACATCAGTAGCCGACGTAGCAGAACCGGAGTCCTGAGAAGCCAATTTTTCTTTATCATCCTCCACGTCATCATGCCATATGCTCCGAACAGCTTCGCGGGCTTTATCAATTGTTATTCGAGAACCAAGGAATCCACCAGGGCTACGATCCTCTGCGATCAGACCCAGCAAAAGATGCTGTGTATACACCATATCTTTGCCCAAAGCCTTTGCTTCTTTTTGAGAAAACATCACAGCTTTGATTGATCTCTCAGTAAATCTCTCGAACACTCCAGAGACAATATACAAAGAGCGCTTGATTTTACGAGGAATTGAGCTGCAGGGCCTATGAGAAAGGGAAATTCCAAAAAGGGACGAAGTAGAACTGCTAGTACTACAAGCAGCGGTAGTGGCGGTAGTAATAGTAATATGAGAGGAGGAAGAAGGGCAATATGGGAAAAGCGAAAACACGGTTTGACATCTCTTGTGAGGGTACACAGAGCCATAGCGACGAAGCTGAGGATTGAAGCTGATTGTTGAGTTCACAGAAAGTGGAGAAGAACACGTTAATTCCAT
SEQ 54
ATGAAGAATATCGAGCGTCTCGCAAATGTTGCTTTATTAGGTATGGTTTCTTTTGATTTTGATTCATACATTATATCTTTTGATGATAGCTGAATTGCATAGTATACTTGTTTGATTTGTGCTTGTGAAGTTCAGAAAGTAAAGTAAATCCTGTTTGATTTATAGCTTCGTTTTTTGCCCCTTAGTTTGTGTTGGTTACTCATTCAGATCATTTTTCCGCTAGATAGATTGCTAAAGCTTTTGATGCTAATTCTTTGTTGTTAATTGGAAGGAGGCTACCCTTCCAGGGTAGGGGTAAGACTACGTACATCTTATCCTCCCCAGACCCCACTCGTGGGAATTCACTGGGTTTATTGTTGTTGTTGTTGTTAATTGGAAGGACTCAATGTAGGGAAAGGTGCTAATTATTGTGTAGTTGGAATTTGAGGTGTGGTTGATGGTTACCCTAAATATATCTATCCAGCATGTTGGAGTAGATTCTATAGCGGTTGGAATGAATATTCAAATCCCCTTGGGCAGTCACATTACTACTGTTACCCGCTTTCCTTTATGTCACAGTAGGTTCCACTTCCACAGTTCCAGTTCAATCGGTAGACAAAGATGGTCATGTGGGTTCTTTATATCAGTTTTAGCATTTTCTTATATGTTGGATGTTTGTTTCATCATATTGCCTTTTTGAGGACATTTCACTACGTAATAGCAGCTATGCCGTTCTTGGAAATTTACAATGTACGATTATTTGGTCATGGCAATTTCACATCACTTTCCAAATTTTATGTTGACGCAATTACCTTGAAACTCTTGCTTTTTTGGTGGATTTCAGGTTTGAGTCTGGCACCACTGGTGGTGAATGTGGATCCAAATGTAAATGTCATAGTAACAGCTTGCCTTACTGTCTTTGTGGGATGCTACCGTTCTGTCAAGCCTACTCCACCTTCAGTATATCTTCTGTACTCCAAGTTGCAGCTTCCCTTTTTCTTAGATCTGTTTTGATGTCACTTAAACATATTCTACTGCTGTTTTCCAGGAAACAATGTCTAATGAACACGCAATGAGGTTCCCCTTGGTTGGAAGTGCAATGCTCTTGTCATTGTTCTTGCTTTTTAAGTTCCTGTCAAAAGACCTGGTTAATGCCGTATTGACATGCTACTTCTTCGTTCTTGGCATTGCTGCACTTTCGTATGTTCTCTCCGTATGGATCATTCTGTGATGCTTAATATTTTCTATAACAAGTTCTTGAATAGTAGTTTTTCTGTGGGTGTATTGGATGTCATCTCTTTCTTTGTGTCTTTGCAGGGCGACATTGTTACCTGCTATCAGACGATTCTTGCCCAAAAAGTGGAATGATGATCTCATAATATGGCACTTCCCATATTTCCGCTGTAGGCACCACCTTTCTTGTCTCTTTTGAAATGCCAATTGATCCTTTAGAATCCTTGGGCATACAGATCTCATCTTAGTTATTTTGTTTCGTCTTTTTTCAGCTTTGGAGATTGAGTTCACAAGATCTCAGATTGTTGCCGCAATTCCTGGAACCATCTTCTGTGTTTGGTATGCTAAACAGAAGCATTGGCTAGCTAACAACGTTTTGGGCCTTGCCTTTTGCATTCAGGTTTGTCGGCATATCCATCCAAGTTACATTCTCATTCTTCAGGATATCTCAAAATGAAAAGTTGTGTAAAATAGTATTATTAGTACAATGGTAATATACAATTTTGGATATTTCAAAGTGAAAAGAGTATCATATAAATTGGGATAGAGGAAGTACTAAGACACTTGAATGAAGAGATCATATTTCATCACTAAAAAAGTTGCACTTATCTGTCCATACATGTTCTCGTAACCAAGCATGGTTGCTCTTTAATACCAGATGCAAAGGCTACCCGCCTTTATATCTAGCATTTAAATCCACGATAGCACTTGATGGCTTCCTCTTTAATTTGTTTGATAACTAGAATTCTCCCAGGAGTTGGCCTACATTTATTAAACTATGGGAGTATAATAGGCCTCCTCTATCATGCTCCCACTAATATAGCGGCTCCTTGTTAGTGATAGGGTTCTAACTCATGACGTGGACCCATATTCTGACATTGCGTCATTACATTGAACCGGAGCCCCAGGGGCTTACTATTTGTGATTTTCTATTTATATATACATTGAGTTAATGGAGATTTTTGCAAGGGAGAAAAGGTTTGATCCTCTCTTATGTCATGTCTACATCAATGATTGATATTGATTTTCCCATTGCGATTTTGATTTTCAGGGTATTGAAATGCTTTCACTTGGATCATTTAAGACTGGCGCCATACTATTGGTAAGAAAGAAAATTTGTTTTCTAATTTCTATCTGTAATTATACATGGCTGACAGCTGTATTCTGTTTATGTGTTTCGCCTTAAACTACATATTGCTTGTCTTTTTGAATTTGATGCTAACCACATATCTCTTTATTCAAGCGGGAAGAGAATTTCATGAAATGAGCTATTAATGATTGTTATTGTTGAGCTATTAATGATTTTACATACAAAAATACATAACATTTGCATGGATTATCCCTAATTGCAGAGTTTTTAGACATTTTTGAGGTATTCTTTTATGTTGGCATTTTTGCTTGTTTATGCAACTATTTATATCCATTAACTTGTAGCTGATGTTGAATGCACATGGTTTTCGAGAATGCAGGCAGGACTTTTTGTGTATGACATCTTCTGGGTCTTTTTTACCCCAGTGATGGTCAGTGTTGCCAAATCTTTTGATGCTCCTATCAAGGTGTGCATACTGATTTTCTCATATAGCTATTTCTTTTGAATTTTCATTTCATGCCTTTATTAGTTACAGAGTCCTGATTATAACTTCGCTTTCTCTGCAGCTTTTGTTCCCCACAGCAGATGCTAAACGCCCCTTCTCAATGTTGGGTCTTGGAGACATAGTTATCCCCGGTATAACCTCCATTTGCGTGAAAACTCCATTCACTTTATGTGGTTAGAACAGAGAGGTTTAGCATTTTGCCTAGCGGAGGGATCCTCCACCTCAAACCATGTGGTTTGGGGTTTGAGGCAGTAGGGAAACGGTGGGAAAAGCCACTGTTGGTCCCTGGAGGGGAAAAAATGGGGGTGGGTGGGGGGATGAGGTTTACCATATTAAAAATGAAAAGCTTACTTTTTGTGAGTAGCTACTTGAATGTATTTTTCTGTTTCTTACACATGCTTATTATTAGCTTTTGCCATGATGCTGTATTTGTTTTCATTTTTCAACTTGTTTTTTGCTTGAATAGATACACTTGGTAACATTTGATCACTTCAATCATGCAGGTATTTTTGTTGCATTGGCCCTCCGCTTTGACGTTTCCAGAGGGAAGGGGCCCCAATACTTTAAGAGTGCATTTTTAGGATACACATTTGGTTTGGCTCTTACCATATTTGTTATGAACTGGTTTCAAGCTGCACAGGTTGGTGAATCAAAATAAAGCTTTTACACTTTATTTCTCTTGCTAGAATTGCAGCGCCCTTATGTTTGACTTGGCCTTTGTTTTTTCCAGCCTGCTCTGCTATATATTGTTCCAGCAGTGATTGGATTCTTAGCCGTACACTGCATATGGAACGGGGACGTGAAGCCTGTACGTTTTTTCTTTTGACAATCTGTTTCAACTTCATCCACTTGCTAACTTTACTGTTTATGTTCTTTATATGCTGTTTACTACCATTTTAGCTACACACTATTTGTAGATTATATTTTCTAGGAGTTAATAGATATGAGAAAATGCATCTTATGGTTTACCTTTAATTCATCCAAAGAAAACATGCATGTCATGATTTGTTTGGAAACTATGGACAATAAGTTAAAGGTAGGGAAGGGAAGTTTTCTCGTTTCTTCGAGTTGGAAGCAAAAAAGGACGATCAAGAACTTCTTTTTCGCCTAACCTTTGATAGAGAGAGTACAAACCCAAACCTTCATTGCCTTTTCTAGTTTATACGGATACAGAGTTAACGAAATTTTCGTTTATGGAAGTAAGATTGGGGTCTATTCTCAAGTCGTAGAAAATTACATTTTCGTTTTGATGCTGAGTTTGTATTCTTGATTTTCTGTTGAGCAGTTGTTGGAGTTCGACGAGGGAAAGACGAAAGGCGCTGAAGAAGCCGATGCCAAAGAAAGCAAGAAGGTAGAA
SEQ 55
CTAATTTTGTTTGAGATCTCTATAGTACTCTTCTAACCACTTTTGAATAATAGCCACTTCTTGCTTCCTTTGCATGATCAACCAGCCAGGGTCATTCTTTGTCTCAGAACGAAAGTCAACATGGTGTGCACCTAATTAAATAAGAAAAATTAAAACATGAAACTAGTAATAAAATAAATCAATGTCTCTAATCAACAGCAATCGTTTTTCATTATGCTTTAATTCAATGAATATCATGATATACAAAAGCATATATTACACAAATAGCCGGTCATATAGCCGATGTACATAGATTATACAGTAATTATATATAGTTATACACATTTTATATATGAATTATACATAAATTGTACATGCGCTAGTTATTTTTAATTTAAGAAATCAGATCAGTGGCTATTTGGGTTAACTCTTCGTACCTTTTTGAGTTACGAGTGCCACAATGCTAGCTGATATATTTTTCAGCACACTGTCACAAAAAGAGAAAGAAAAAAATCAATTTAATAACAATAAACAAAGACTTATTCAAGATTTAGAGTCTATATGAGTTTATAATATAAGTCGAAAATAATAGATTCAATTAAAACAACAGGACTTTACCATCACAGATAGATATAGGACCACATTTAGTGGTTTTAATTGTATCTATCTATTGTTTTCCACTCCAATTATAAACTCATATGGACTCTAAATCTTGAATTTACGAGTCCTCACATTATTATATTTTTATTTTTAAATTTTGAACTCACCCTCCTCTGCTCCATGGATCTTGCATTCCGTTAGAGAATATCATATTACTGCCAAATCTCTTAAGAACTTGCTCAATTCTCTGAAATATAAACATATTTTTTTTAATTTCTTATTTTCGTGAATAAAAAAATATGTAAAGAGAATTTTCATTTGTTAACAAAAAAAGAATGTAAATTAGAAAACTTACATAGCCACCAAATTCAGTAGTGATCCAATGTGGTCGAGGCTCTACTCCATATTTCTTTTTGCAATCTTCTTTGAATTCCTTGTAACTATAGGAAGATGGAGGAAACATGCTTTCATTTGAACAAGTCATTGGCATAACCATCTCTGTACATGCCTTACTCAAAAACCATGGAAAGAAACACACTAAAATTAGGCATACATTAGACCAAGATTTCAAGAATCATTTAATTTTAATCACACCATAACCAAACTAAAGTTTAATATAAGTACCAGTGGCGAATCCAACAATGCATTTACGGATTCGATCGAACTTAGTATTTACAGTATAGAAAAATTTGTATATACGACAACAACATATCAAGCCCACCAAGCAGGGGCGTACGCATGAATTTTTGTAAGTGATGTCAAAATTTATAGAAGAACGGGAATTATAACTTTAATTATCATACTTCTAGACAAGAAATAACTTTAATCTATTGGTTTTGTTGAGTCTTAAGTTAGAAAAAGTCCAGAATTAAATTCTACTTCATGACAGTTTTAACCACATAGATTCTCTAAAATATTTGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGAAAGCAAAAGAAGCCTGTCAGTTCTCAAATTCACAACTTACAATTGAACAGAAACAACCTTCTTTTTTCGTNAGCAAAATCAAGCACATAACCAATGCATCAAGAAATAATTTATGTCAAGTGGTGTCATTTTTTTCTACTTATCTGCTTCTTCACAGTATTAACACATATATTTAGTAAAAAAATTCGACGAAGCGGTGTCGCGTGACACCGCTTCGATACATCTGCATCCGCCCCTACCACCAAGTGGGATTTGGGGGGTGGGTAGGACATATGCAAACATTATCCCTACTTTTGTGAAGGAAAAAATATTATTTCCGGTAGACCCCCAACTCAAAGAAAGATAAAAAGAGATAGAAACAACAAATAGTAACAATCACAAGATAAGGAGATAAGGTGATAGAATAATGAGAGATAGAATAACATTTAATAGAAAAATCTGCGAATAAAAAGCTACAAAAATACCATACTAAAAAAATACGTATAGATAGAGTTAGATATATTTTTCAAAATCTACACGTTTAAATTCTGAATCTGCCACTAATAAGTACAAAGAATAATAAATGGAATAGAATAAAAGGTAATTAAGGTAAAGAATTCTAAACCTGCCAATCCCAACCACGAAGACCATGAGCATCATCACCACCTTCTAAATTGAAACATTTTTCTCTTTTTGTATAATTGTAATATAAACTTGCCGCAGCAAATGCCCGGCTGATTTTGGAAGCTCCTTTTGGTAATCCATCAATTATCTTGCACATCTTAATTTAAAAAAAAAAAAAAGTTTAAGAATTTTTTCACTCGACAATATAATTTTTTTACACAATTAGTCATTTATAAGATAATTACAAATAAATTTTTATGATAAGTATTAATTGATGAACTGATAAAAATGATAAATAACCTGTTATAATATGTTAAACTACACTAATCATGTACAAAATCTCCACATTGTCATTGCATTAAGTCTTATGTACAAATGGTATAAAAAATATTTATGCCAATCAGATAAATTATAAAGTACCTACAAGTAACCTTTGTGATAAACATTAATTGGTAAATGGCTAAAAATTGTATCTAGCATGTCATATCAGATTAAACTATACCCACTTTTGTAAAAATATTTATCCTGTCAATTAGGGGCGGACCTACGTGGGGAGGGGTCACTAGACCCCGTCAGTCTCGACAAAAAAACTGTATATAAATTTCATATATATCTATATATACAGTAAAGACGCCTTAAATACTTTATGCGCCCCCCTAAAAGCACAAAAACTGGACAGAGGCACTGGTTTGTAGGAGTGCTTAATCCAAGTTCGAATCTATGCTCCTACAACTTATATTTTTTATTTTATTTTTAAAGTGGTGTCGCCGTAATACTCAAATCCCAGGTCCGCATCTGTTATCAATGTATATCAAGTTCAACAAAAGTTGATCAGGGGCGAAGCTATATATCCCAAAGGGTAGTCAACTGACCACCCTTCCTCGAAAAATTTACTTTGCGTATATAGGTAACATATTAGGTTTTAGAGGTATATAACATATATGAATACTCTTTATTAGAGAATTTTTTCCACTTCTTTAAGTTTGAACACCCTTGAGCCTTGAGAAAATTACTGATTTCGCCACTGATGCTGATAGAAAAAAAGGAAATGTAATTAAATGCAATGATTTTGAAACCAATTTAGTGCTAATTAAGCAGAATTGTCTACCTCTTGTACTGGATATGCTGGCAATGGCATCATAAAGTTGGCTTTAGTAGGATAATTCACCATTGCTGTATATACAAAAGCTTCCCATAGCCAATCTCTAGCTGAATAAACTGAATGTAAACCCCTGAAATATAATATAAAACAAATCATTTTATTGAATCAAACTTGACCTTGTAATTTGCTACAATAAAAAATTATCATGTTTCTATTTTTATATTCAAATATACTTACTTGCAAGTTCTGAAAAGTTTACTAACTTCAGTCAAGCCTTCTTCATGTTTTGATAAAGCATCCAACTCTGTCCAACTTCCCTTTATCACTCTATAACAATTCAAGCTTACCTCCTATTCGTGAGGGCAAATAAATAAATAATAATAAGGTCTCTTATCCAAGAAAGATTTTTATGAATATATCCTATTCGAGATAAAAGATAATTACAGTAATCGCTCATAATAAAGTGAGATTAGTAATCTGCAAATAAGTCAAGTTACTTGTTACAACATGTTGAAAATATTGAGTGTAAAAATTTATTTACATTGTCAATAACGGTGGATCCTGATGGGTTCATCCTCTAGAGTTCAAATAATTTAAGGGTTTGTTTGGCCATGATTTTTTTTTTTACTTTTTTTTTGGAATCAGTGTTTGGCGATGAAAAATTCTAACATTTGAATTTCTAAATTTTTTCGAATTTGAAAAACTTCAAAAAACTATTTTTCAAGATTTTCACTTCAAAACACTTAAAAAAATTTAAAAACAACCCCAAATTATATTCATGTCCAAACACAATTCTAATTTTAAAATACCATTTTCAACTTGAAAACAAAAATTACTTGTTTAAGGAATTTCACAATTCTTATGTCCAAACACCCACAAATTTGTTGCGCTTTTAGAAGTAGAGTTTCAAAATTTAATATTTATTAAAATTTACAATCTTTTTGCATTTCTTTATAAACCTAGTAAATAAGGCGATATTCGTCCCAGTGTATATAAATTAAATCCTATTATAAAAGGAGCATCTAGGTGCATGAATGTGACCATGTAGGATCAGAACATTTTAAACAAAGGGGCTTAATAGATTGTAAAGGCTTTACTATTACTAACGACAAGAGCGGGTTGCTCCAGTGGTGAGCACCCTCCACATTCAATCAAGAGGTTGTGAATTCGAGTCACCCCAAAAGCAAGGTTAGGAGTTCTTCGAGGAAGGGAGCCGAGGGTCTATCGGAAACAACCTCTCTATCCCAAGATAAAAGTAAAGTTTACGTACACACTACCCTCCTCAGACCCTACTAGTAAAATTTTACCTGGTTATTATTGTTGTTGCTGCTTTACTATTACTAACCTTGAAATCTTGGGAAACAGCATCATAAAAGCTTGACCATGGGGTGATTTTGTCAAACTGCAAGATTGGTGCTGAAGATGCCACTGCACCTATTGCTATATGTGGGTACTTCAATCTAAACCAAGAAGCCAACACTGGTTGACCAGAAAAAACAAAGTTAAAAGAGGAAAGGAAAACAAGTATTTAAGAGTGAGAGCAATTTATTAATTATTGGATACTACTCGAGAAGAATGAACCTAAATAGCCGTTCACTCAACTGCTTAAACTAAAATAGCTGACGGATGTATAAAATAGGAAAATTTTATTTTGTGTCTCAGACAACTTACACTAGTTGTATGAAGCATTTTTTTTATCTCACAGTTTTGTGGACCCGAATTTTTGTGGACCTAGAAGTGTAAGATTGGGGTCCACAAATTTATGAGACAACAAAGAGCCTCGCACAACTAATGTCAGTTGTGTGAGGCACACAATAAAACTTCTCTATAAAATATATAATTCATACTCTTATATAATACACCATCCGGTCCACTTTCATTGATTTTTTGACTCTTTTCACATATATTAGAAAATCACATTTTAGCATTAATTCACAATGAAATTGACCATATTAACCTTATTTTGTTCCTTGAAAATATAACAAATGCTCCTATGCTCTTTACTTCAAATGCAACTTTAAAAAAAAAATTAACTTATTCTTAATATCTGGAAAAAATCAAATATTGTGGACCACAAAAAAAATTAAAAATTCAATTAAAATTGACCGGAGAGAGTATATGCATAACTATGTATAATCTATATATATCGGCTAGAAACAAACAGTAAATTGAACTGGCTATTTGTGTAAAGATTCCTACTTAACAAATGCAAAAGTGGAAGAAAAGTTCTGTTTATTTGAATAATTGAATGCATCTAATGCTAATGCTAAATTCATACAAAAGAGAACTTTCCATGAACATTTAGCAACCATAGAATGTAATTATCATTGATTCACATGGATTGGACACTCAATAAGTCAATATGTCCACACATGTAATGTCATGTCATTTCCATCTATCATTATGTCAAGGCAAAAAATTAGCTAAAAGTTAAAACTTTTTCACTTATATTATTACTTTTCTTTCATTACTTTTTTTTTGTTTGTTTGTGTGGTGTTCTACTATATTAGTGGCAGTTTGGACATAAGAATTGTAAAATTTCAAAAAAAAAAAAATTAACAAAATTTAAGTAAAAATAATATTTGAAAATTAGAGTTGTATTAGAATATGAACATAATTTAAAGCTGCTTTTGATTTTTTTTATGAATGATTTGAAATGAAAATTTTGAAAAACAGCTTTTTGAAGTTTTTAAAATTTTCGAAAAATTCCAAAATTCAACTTCAAGTGAAATTTAAAATTTGCATGGCCAAACACTGATTTCGGGAAAAGTGAATGTTTTTTATGGCCAAACAGTTCCTTACTTACTTCCTCCATAAGAGCCACCAAAAACCACAACCGGTGATGATTCAGAAGAAAGATTCTGCTTTAAACTCCTTATTAGAACAGCATAATCAGCCAATGCTTGCTGTGAATTCAAGTATCCCAAAGTCTTTGGTGACTTGTAAGATTTCTTTCCAAATGGCATTGAATCCCCATAAAACCTATGCTAAATTATTACAATACAAAAAACCATTATCAATTTCATTCCCAACAAAGATAAATAATAATAATAATAATAAAATATAAAAAAGGTTCAATTTTACCTTTCTATATGAAGTATCTTAATATTACAATTCATTATACTTTGGGCCACTAATATCTTATTTTTGGAAAAAATTCTTGTATTTGTCTTGATTCTAACGAAGTTCCAACTTGAAGTATAATAGATGGTAATTTTAAATCATAGTGAATAGCTGGATAAATTTGGATTTTTTCTAGTAGTATTTTGATACGTAGAATCTACCAAATCAATATTGGAGTTTCATTAAACGTAGTATAAATACGATTCGATTTAATAACGGCAAGAATATAAATAATCCCTTAAATAAAACGAAGTGTAAAACTAAAATACTTCGTATAGTACAACAACAATAAATTCAGTGTAATTTCACATGTGAAGTTTGGGGAGGATAGTGTGTACGTAGATCTTACACATATCTTGGGAAGATAAAGAAGTTGTTTTCGATAGACCCTCGGCTCAACGAATAGTGAAAACAAAGTAACAAACAGTAGCAACAACAACATAATATGAACAAAAGGCAAAATACTTCGTATAGTATAGGAGTAAATTTAAATATTTTTCTCAAAAATAAATACTTCAAATAAAAAAACATTTCAAGATTATATACATACTTCAATGAAGACTAGAAGAGCATGAAACTTAGGAGCAATATCAAGCATAAATCCAGTATTTGCAGCAAACCAATCAATATTTCCTTCATTTCCAGTGTAGACAAAGATAGGGCCTCCTTGTTTCCAATAATTATCATTTATGAGATATTTCTGTTTAAAAACTTTAGAACTCTTTGGTAGAAAAGTGAAATGGTCAAGAATTTGAGGAAAGTAATGGACTTTAAATGGTATTTTTGACTTGACATGTTGTTTTTCTAATGAAGATTGATAAGTTCCAGGTAGATAAATTGGCTTAATTTCTCCAACTACAAAAGAGATAATAAACAGTAAAATCAAGAAAATGAAAGAAAAATAAGAAGAAGAAAAAGCCAT
SEQ 56
ATGTCTCGTTTCTCACTCCTATTGGCTCTCGTCGTCGCCGGTGGCCTTTTCGCCTCCGCACTCGCCGGACCGGCGACCTTTGCCGATGAGAATCCGATCAGACAAGTCGTTTCTGACGGTTTACATGAGCTGGAGAACGCAATTCTCCAAGTCGTCGGCAAGACCCGCCATGCTCTCTCCTTCGCTCGCTTTGCTCACAGGTACGATGATCTCTACATGGAAATGAGATTTTTTTTTGTTATTTGCTTATTAATAGTAATTGTTTTATTTTGAGTTTAAGTTCTATATATGCAGTGAGCATATTTTTTTTTTTACATAAATAAGATAATAACAAATAAATCACTTAATATGTATTAGTTGGTAATGATAGTGTAAAAAAATATTATACTGTAATGTGTATATAACTTAAATCTTTTTATTTTTGGGACGATATTTAAGGTATGGGAAGAGGTACGAGTCAGTTGAGGAGATAAAGCAAAGGTTCGAGGTATTTTTGGACAATTTGAAGATGATTCGATCGCACAACAAGAAAGGACTATCATACAAACTCGGTGTCAATGGTATAATTAATATTATGGCATAACGCTAAGGCCCTGCTCTTTTCCTTTTTTCTCTTTTGCTTAAGTGGAGTCTTAATTTGTTGATTTGGAGGTAACAAGTTATAGTTTTGTGGTTCCTTTACCGGAATACTCTTTGTTTTTATCTTCAGCTAAGGTAACAGATTAAGGCGTAATTATAGTTATTATTGTAAAATAAGGTAATTTTTATTTAGAAGCTTCAAAATTAAGTACAAGCAATTGAATACTACTCTTTGTAAGTAGACTTTGTATATATGTTTTTATTTCATTCTCTTTTTTCATTTGGAGAGATGTGGACAAATTAAAATTATAATATAATGCGATAAAACATATGTTCCACTACAGTATCAGTATGGTATTTATAGTTTGCATATTTTATTAGTAATTAATTGGTCTAGTGCCTTATCATGTGTAGATATTTCATTCATATTGTGTGGCTAGTGGGTACCCTTTCTCTCTCCAATCAAAAAACTTTTTTTAAAAGCTCAATTCAAAAGCTTTTCTTCATTACAACTGATCCTGCTTAAAGACTAAAAACAATCTAAATTGAATTCTTAATTCTTCTCTATTCATTCATATATGGACATAAAAACAAAATCACAGTACATGGAAAGAATATAAGCACCTAAGCATTGGACTGCCCAAATGAAAAGTTTTTGCAACTTAATCTAGTTGTGCATAGATTCAACAACAAAAAGTAAAGAAATAAGTATGCATTTTATGCTTCTAAGTTCTAGTATATATGGCCCTTATTGTTTATCGATTATTATGTTTCATGACAGAGTTTACCGACCTAACATGGGACGAGTTCCGGAGAGACAGGTTGGGGGCAGCTCAAAACTGTTCAGCCACCACAAAGGGCAATCTCAAAGTCACTAACGTTGTTCTGCCGGAGACGGTATATGCACTCAGAACTCCTCTGTATCTATTTCTGGAGTTAGTGATCATTAGAGTTAAACTACTTTCTGATGATTTATTATTTCCAGAATTGTGGAGTGCTCTGAGTTTAATTATGCTGTAACTATAGAAACACTAACTAAAAAGATCTTGAATAGGTATCCTACAACAATAAATAGAATCCTCATAAGAAATACCACTAGATCGAGCACCAGTCATGATTTCATATCTGGTAAAAATCTTGGCTAATTGATCGAAGTGGAGTAGACTAGCGAGCATGTACTGAGCTAATGCACAATTGGTTGCAAAAAGAAGTTTTTTCTTTCCTAACCGAAATTTCCAATTTCGTAATTATAGAAAGACTGGCGGGAAGCTGGGATTGTCAGCCCAGTCAAGAACCAGGGCAAGTGCGGATCTTGCTGGACATTCAGGTAAGAATTAGTTAGAATCTCACATCATTGGACTCTTAAATTGTAAGTCTTGAAATTGCACTCTTAAGCTGAAATATAACGGAGAAGGCACTTGGCAGCACTACTGGTGCACTAGAAGCAGCATATAGCCAAGCATTTGGGAAGGGAATCTCTCTATCTGAGCAGCAGCTTGTGGACTGTGCTGGAGCTTTTAATAACTTTGGCTGCAATGGTGGGCTCCCATCACAAGCCTTTGAGTATATTAAATCCAATGGTGGTCTTGACACTGAAGAAGCATATCCATACACTGGCAAGAATGGCTTATGTAAATTCTCATCAGAAAATGTTGGTGTCAAAGTCATCGATTCCGTCAATATTACCCTGGTATGATATCTCTTTCCTCCAGTATGCAACCAATCTTTGCCAGTGTTAATATCCAACCTTAATGGTCAATAAGGATTGGTTAAGTTCCTTACATACGTGTCATTACAGGGTGCTGAAGATGAACTAAAATACGCGGTTGCATTGGTTAGGCCCGTTAGTATAGCTTTTGAGGTGATAAAAGGTTTCAAACAATACAAGAGTGGTGTTTACACCAGCACCGAATGCGGCAACACTCCCATGGTAAGTCATCTGTCCCTAGGAACGTGATATGCAAATATATTGACATAGTTACCTAAATACAGGGGAAAGCTACAGCCGACCAAGGGTCGTCAGTTGAACACCCTTCACTTCACTGTCGTGCATATATTAAATCTTGAACACCCTAAGTGAAATTTATAACTTCGCTAAATAGGCATATACACAATATTACAAACATTGTGTGTTGCATTGGCAGGATGTAAACCATGCTGTTCTTGCTGTGGGTTACGGTGTTGAAAATGGTGTTCCCTATTGGCTCATCAAGAATTCATGGGGAGCAGATTGGGGTGACAATGGATACTTCAAAATGGAGATGGGAAAGAACATGTGTGGTATTGCCACTTGCGCATCCTACCCTGTCGTTGCC
SEQ 57
ATGGAGAAGGAACACAAATACTCTTTGTTTCTCACAAAGTTGAAGTTGTTTTTTCTTGTTACATTAAGTACTTTCCATGGCCTTAGCCATGGCTTCCAAATGGATCAGGCACGTACATTAATGTCTTGGCGTCGTTCTAAAATGCATGCTCAGACAACTACTTATGCTACTAATGAGGATGAGACAGAAAACTTAGTATTTTCCGATGAAAAACATGTCGGAAATATGGAGGATGATCTTATTAAAGATGGTCTTCCAGCGCAGCCTTCAAATGTGATGTTTAAGCAATATGCAGGATATGTTAATGTTGATGTAAAGAATGGAAGAAGCCTTTTCTATTACTTTGCTGAAGCTTCTTCTGGAAATGCTTCTTCAAAACCTCTTGTTCTTTGGCTAAATGGAGGTAAATTATATGTGTTGATGATTCTTTCTCAACTTAATTTTGTCTTACTAATTACTCATCTTCTCTTAATTCTTTTGTCATGCACCTAATTTGATTAAGTACTCATTTGTTTTGTTTCGATTTAATCTAACTTACCCTTTATGCACATATATTCTGCATCAAATTAAGTTGAATATTACTCCACGTGTTCCACATTATATACTTAACATTTTTTTTTTCCAATCTATTTTACACATTTTATATATTTGAATACTTTTTTAACTTTAGACATTTCAATTTACCCTTAATAGTATATTCTTGTAGCCGATCAAATATCTATGAGATATTTTTGAAGTCTTTTTTCTTCCTAAATCAAGTCAAATGTTAATGTATAAAATAAAGCAGAGGGAGCAATAACTTTCGTTTTATGTTTGTAATTTTTCTTAATAGTGATACTCATTACTCTCCCCGGTCCACAATAAGTGACTATTTTACTTTTTTATTTTGGTCAAAAATAAGTATCCATTTACCTAATCAATAAGGAATTAATTTTATTTTTCTAAAATTTACCCTTATTTACATATTCCAACGTGTCAAGGAAATAATTAATTAAGGTTAATTTAGTGAATATATTTTTTTTCTCTAAGAGTTCGTATTTCTTTAATGGATGTGCCAACTATAAAATGGTCACTTATTAGGGACCAGAAGAGTAACTCTTTATTATTTGAAATTTTGATTTCCCAAAAGGTATAAATGGTCCGAGGAACATCTAATTTTCGCTCATTTGCAAGAAAGTGGTTCATAGACAAATGGAGTTATTAAGTGGGGACGCGCAACAGAGAATTATTGGTCACTTTATTCGTTTTGTTCGCTCTTTTTCTTTCTTTACTTCTTTACTAAAGTAGAAGAGAAAAAAAGGAAGTTTAAAAAATTGTTAGTGTATATGAAGATAAGAGCTGTCATTTTCTTCGGCTATTGAATGACGAATAAAAGCACAATTGGGTACAGGTCCAGGATGTTCATCATTAGGATTCGGGGCCATGCTAGAGCTTGGGCCTTTTGGTGTAAACCCTGATGGTAAAACCCTTTATTCCAGAAGATTTGCATGGAACAAAGGTACATTTCATTTGCTAAACTAATATAGACCTACTTATAATTAATGAAACTAATTTCTCAAGAAATAAGACAACTATTTTTTGTAATAACGTATACTCTATCTTTTTCAATTTATGTGACAACATTATGTTAAAGGTGTCACGTGAGTAGGAAGCCAGCTGACACTTAGTAGGCAAAGAGTCTGTTAGATTAGTTGTTAATTATACAATTAGAAATTAGTTAGAATCAGTTGGATTACATTGTATATGTATGTGTATAGACGGTTATTCAATACAACAGTAATTTTCTCATCTTCTCTTTTCCTCTCTAAGCTGCGATCTCTCTTAGCTCAATCTAGAAGCATCCACGACAGATGTTTGGCATGGTATCAGAGCTTTGTGCGATCATTGCTCTCGTCTAATTCTCCTCTGAGTTCATGTGAACGAAACTTCAACTCGTTCGTCTTCATCTCCTTCCCTCGAGAACATCGAATCGACGATGACGACGGAAAAAATTGACCACATTCATCCTCTGTTTGTGCATCCCTCAGATACTCCAAGTTTCATGTTGATTCCAGTCCAACTCACTGGATCTGAGAATTACGAATTATGGCGGAGATCGATGAAAATTGCACTTAGGCAAAACGAAAGTTAGGGTTCGTCAATGGCACACCCACTAAGGATCAGTTTAGGTCAGAGCTACATGAAGACTGGGAGACATGTAATGCGATTGTGCTCTCGTGGATTATGAACACAGTATCTCCAAATTTACGTACTTAGTGGAATTGTGTATGCTTTTAATGCTCACCTAGTATGAGAAGATCTAAAGGAGATGTTTGATAAGGTGAATACGATGAGGATCTTTCAATTTCATAGAGAAATTGCTACAATTTTCCAACGAACAGATTCAGTGTCCATGTATTTTATAAAATTGAAGGAGCTCTGGCTGAGTATGATGCAATGGTACCCTCAACAAATTCGAAGTAGTATGCTGATCATCTTCAGCGGCAGAGGCTATTACAATTTCTAAGTGGACTGAATGATTCCTATGCTCAAGCTAGAAGACAGATTCTAATGAAATCAGTAGAACCTACTTTGAATCGGCCTTATGCTCCAATTGTTGAAGACGGAAGTCAAATGAGTACATCGGGAACTTTATCACACATTGGGCTGAACTCAATAGCCGAGGAAAATGACATTACAACATTGTGGAGCTCAGCAGTAAAATGAGGTTCAATCAAGAAGAACAAAAGGAATTACAGTATATTTTGTGAACATTGCAAGATGAAAGGACATAGTAAAGAAAATTAGTACCAGCTCATTGGTTATCCGACAGACTTTAAAGATAGAAGAAAACAAGGAGCACCTACTGGTTACCAATGAGCACCTATTGGTCACCAAGGAACAATTGAGGAAGATGCAATGCAAGGGAAACAGGTATGACTGTAGATTTTGGGAATCTTTATGCAGGCATATCATATGGGGACAACAACATATGCAGGTGCAAAGGCAAGGGACTCATAATCCTGTACACATGGAAGATGCTCAATCTCAGGGACAATCTTAGGGATATACAGGTGGTGTCACTGTTATATTTACTCCGGAACAGTATAGTCAAATCTTACAAATGCTCAACAAAGATTATGTTCCAGAAACATCAGCTAATATGGCAGGTACTATTTGTTCTTTTCTGGCTAGTAAAACCGGGCACAATTGGATAATGGACATAGGAGCAACAGATCATATGGTATCTACTCCTTAAATGTTATTTGATTTGAATGACTATGCTAAGCAAGGCTCACTGTTGCATTTACCTGATGGAAAAAGTTGCCTATTAGTTATGTTGGTAAATGTAGATTGGCACAAGGGGACATCAGGGATGTGTTGTGTGTACCAGACTTCAAGTTTAACTTGTTGTCAGTGGCTAAACTAACTAGAGAATGCAGTGTTTCATGTCTTTCTATCTTGATTTTTTTCTGATGCAGGACCTTCACATTGGGAAGGTGAAAGGGACTGATAGAATGCACAATGACTTGTACTATTGGAGAAATAATATAGAGAATAAGATACCACAATCATTGGCTACTACTTTGACTCAATCTGCAGCATTGTGGCATAAGAGGTTGGGGCATGTTCATCATAGAATACTACAACAAATGAACTTTTTTAAAGATATCAAGACAAATACTGGCAGAACTTGTTCTATATGTCCTTTAGCTAAGCAAACTAGGCTTTCTTTTCCTCAAAGTACTAGTGGAACTACTACACTGTTTGAGCTAGTTCATGGTGATGTATGGGGTCCATACAATGTACCTACATATGGTGGTCATAGATTCTTTCTTACACTTGTAGACGATTGTAGCAGGATGGTCTGGGTTTTCTTGTTAAGGTTGAAGAGTGATGTCTCATTTGTATTAAAAGATTTTATGTCATTAATAAAGACACAGTTTTATAGTTCAATCAAGGTTTTCAGAAGTGATAATGGTACAGAGTTATTTAACTCACATTGTATAGATTTGTTCAGTGGTGCATGAATTGTACATCAAAACTCATGTGTTCATACTCCACAGCAGAATGAAGTTGTTGAACGAAAGCACATATAATTTTTTGAGGTAGGAAGAGGTTTCAGGGTTGCATTCCTCTAACTTTCTGGGGATTATGTGTTCAGAATGCTGCGTATCTGATTAACAGGATTCCATCCACTACTGTGGCAAGAAAGTCACCATTTGAGGCATTCTATAGGAGGAGTCCTAACCTACAACACCTAAGGGGTGCTTATGTTATGCCATAAGTGTGGGTGCCAAAAGTGACAAATTTGGAGCAAAAGCAATCCCAACAGTGCATATGGGATACTCTACCACTCAGAAAGGCTATAGGTTGTATAACACAGCCAATAAACTGATCTTTGTCAGCAGGGATGTTTCATTTAGAGAAGATATATTTCCCTTCAAGTCCTCCTACTATCAACTTAGACCACCTAATCTTGTGGAGTATTGGAATGGTCGCCATGATCCCTTTGTTCTTGAAACTACTATTGATGCAGCTCCATTGGAGACTTCATCTATAGTTGAGCCAGTCTTTGTCCCCTCTAGTCCTTCTATTCCTACTTCTTTGAATTTAGGAGACTCTACAGCTGGTGTCTCTGAGAATGCTACTACTGTATCAGTCCCTGCTGCTAGTACTGATTCTCTCATTCTTAGTAAGGCTCCTTATGATAATGTAGCAGATATTACTGTTGCTCCAGATTCTTGAGAGCTTACAGTCACAAGAAAGTCATGCAGAACCTCCAAGACTCCTAGTTGGCTTAGTGACTATGTTCATAAGGGGTCCAAGCCTCTATCACATGCTGTAATGGGCACAAGTTATCCTTTATCAGTATATATGTCATATCCTTCACTTTCAGACCCCTATTACAAGGTCATTTATAGCATCTCATCTGTGAGGGAGCCTGATACTCATGAAGAAGCTCTTTATGATCCACAGTGGGTAGTAGCTATGCAACAAGAACTGCAAGCCTTTCAAGACAATCACACTTGACAGCTGGTTAATATACCTCCTGAAAAGAGAGTCATTGGTTGTAAATGAGTATTCAAAGTCAAATACAATGCTAAAGGTGAGGTGGATAGATACAAAGTTCGTTTGGTAGCCAAGGGATATACTCAGCAGGAGGGGTTGGATTACTAAGAGACTTTTTCTCCTGTGGATAAGATGGTCACTGTGAGGACTATCTTATCCTTGGCTGCAATGCATGGTTGAAGGTTGCATCAAATGGATATATTCAATGCATTCCTCCAGGGTGATCTTGTAGAGGATGTTTACATGGTTCTACCTCCTGGTCTTCTAGGACATGGGGGGGAGGGNNNNNAGGGGGGGGGGATGTAGGAGAGTATGCAAGCTACATAAGTCTATGTGTGGCTTGAAACAAGCCTCTCGACAGTGAAATCTTAAGCTTTGTGAGGCACTTCTCTCCTCAGGCTTTATTCTAAGTCATCATGACTAGTCCCTCTTCACTCAAAGATCAGGGAATGAGCTGTTCCTCATCCTAGTTTATGTGGATAACCTCCTCATCACATGTTCTTCTCCTTCTCTCATTCATGCAGCTAACTCATGCTCCATCAGCATTTCAAGATCAAGGATCTGGGGGAGATGAGATACTTTCTTGGTCTTGAAATTGCAAAGAGCACAATGGAATACTAGTATGTTAAAGAAAGTTTGCACTAGCTCCTTATTGCAGACTTATGAGTGGCTGCTTCTAAGCCTACTAGCATACCTACGGAGGTCAATCAAAGGTTCACTAGTGAATAATTTGATCACAACTATAAGACTGAGGGCAATACTGATGAGTTGTTGTCCGATCCTACTGGCTATCAGAAACTAGTAGGGAAGCTGCTATACCTAACAATGACTCGACCAGATATAAGATACACAGTGCAGAACCTGAGTCAATTTATGCATAAACCAAAGAGATCACACGTGGAAGGGGCTCTAAGGGTGGTGAAGTACTTAAAGAATGCACCTGGTTTGGGCATCTTGTTACCTTCTAAGCCATCCTCACAACTTACAGTCTACCGTGATGCAGACTAGGCCAATTGTCCCATGACAAGAAGGTTAGTTAGTGGCTTCATAGTCAAGCTGGGAGACTCCTTGATTTCTTGAAAATCAAAGAAGCAAAGTACAGTGTCAAGAAGTTCAGCAGAGGCATAATACAGAAGTATGGCCAATGCAATTGCAGAAATAGTTTGGCTCATTAGACTGTGTGAGGAACTGAAGGTGAAGCTGGAGTTGCCTGTTAAACTATATTGTGATAGCAAGGGAGCACTTCAAATTGCTGCTAATCCTATCTATCATGAACGAACGAAGCACATAGAAATCGACCGTCACTTCATTAGGGAAAAGATACATGAGGGCATTATACACACAGAACATGTGTCCACAAGTTTGCAGCTGGCAGATATTCTAACTAAAGGTTTAGGAAAGGCGCAATATGACTTCCTATTATCCAAGCTAGGAATGTTCAATTTGTTCATATTACATAGCTTGAGGGGGAGTGTTAAAGGTGTCACATGAGTAGGAAGCCAGATGACACTTAGCTGGTAAAGAGTGTTAGATTAGTTGCTAATTATACAATTAGAAGTTAGTTAGAGTCAGTTGGATTACACTTTATATGTATGTGTATAGACGGTTATTCAATACAATAGTGAAAATAATTTTCTCATCTTCTCTTTTCCTCTCTAAGTTGCGATCTCTCTTAGCTTAATCTAGAAGCATCCATGACAGATGTTTGACACATTACTATTTGGGGAGCCAAAGAGGTTCTTCTTTATCATGTGTTTTCTTAAATGTTTTATAAATATTTTGAATTATAATTTTTTTTATGAATTATAGTACTTTTTATGTAAAAAAAATGAATTTGTATCTAAATTTACGGTGTAAAGTAAGCTAGCGTTTGGCCATAGATTCCCAAATTTGTTCTGAAAAATCTGATTTGGGTGAAGTTTGGTTTGGAGATGAAAATGCGTTTGGACATCAGTTTTCAAAACATATTTCCCAAATTTATTTTGGAAAAACATGAAATATGATTTATACCCACAAGTTCTAAAAACTATCACAAATACCCAACAGTACCATTATCAATAACATTCATTAAAAAACTTTGATTCTCGTAAAAACTTTGATTATCAATCACAAATATCCAAATTTATTTTGGCAAAATCTATGGTCAAACGGGTATTAAGAACTTTGATTCTCGTGCTATGTACCTTGCCCGGAATGGAATTGTACGGGTATTAGAAAAATACATTGTGCTGCCAAATGCATTGTACAATAACTATAACTACTTATAGTTATTGTATGTTTTTTCTTTCTTTTATTTATTTACATATGTAATGATGGTATACAGTTGCGAATGTGATGTTTCTGGAGTCGCCGGCAGGGGTTGGGTTCTCTTATTCCAACACTACCTCGGACTATTCAAAGTCAGGCGATAAGAGGACTGGTACACACCGAAAAATCTCGTTAATACAATAGTAATAATTGTCAGTTTCATTATTATTTTTTTAAAACAATTTTAACAGTCAAAATGATGAAATTTTACTCTTTCATTTAACTCCTCAACTTCAATTTCAACTTCACATGCTCTATTCGTCAACACTCAACTCCAATCAAACATTGTGCAAACAGTTATATTATTATCGTTTGTAGTCTGTAACTATTTTTTAATTTTTTTTAAAGACTACACTTGAGTATCGTTAAAAACATGGTCAAATCTTTTGGTCACTTAAAGTGAGGCAGAGGTGGTGTCTTTTCTTATCAAACGAGATTTTTCATTTTTTTATTTATCATTAATTCAGTTATATATTTATTTCTTTTCCTTAAACTATATTCTTTTTTATTGGGTGACAGCTGAAGATGCATATAGGTTTCTAGTGAATTGGTTCAAGAGGTTTCCACATTACAAAGGCAGGGATTTCTACATCATGGGAGAAAGCTATGCAGGTATCTAGTACAGTATCAGTAATTAACAAACGAAAAAATACAAAACAAAAACTTTTGATATTCTTGACTTATCCTTCTAGTAGTGAGAGCCCTCATTATTGAGTTCTGTCCAATAAAATTTGTAAGAATTAAGGACCACTGTATCAGAGACAGCTTGTGCATATTTCAGACCATTCACAGAAATGTCCTCCTGTACAGTCTCAGCTCAAAGCCGAAATGAGCATTCTGATTGAAAATTTTGGCTATATTTGTAATACAATCTTATATAAGTAGTCTATATCTAAAGTCTAATATTGACATGACAGCTAATATTGCTGTGACTGCTCATCACAGGATTCTACGTACCAGAGCTAGCAGATATCATTGTCAAGAGGAACATGTTGCCTACCACAAACTTCTACATCCAATTCAAAGGAATCATGGTATTATATCATTTAATTTGTTGACCTTTTAATTTGTTTGATCTCTCTGTTATCAAATCTTACTTGTATACCTAGTGATGAGGGGCGGATTTAGGGGTGCAAGGGTGTTCACCCGAATCCCTTCGCCGAAAAATTACACGGTATATATAAGAAAAAGTCTGATATTTACCTTTATATATTATGTTTTGAATTTCCTTTACACAGCCCAAAAGTCTACTCTATGTCATGACATAAATTATTTCTTTATATTGCAGATAGGGAATGGTATAATGAATGATGAAACAGACGAGAAAGGGACATTGGATTATTTATGGAGTCATGCACTAATCTCAGACGAGACTCATCGAGGTCTCCTACAACACTGCAAAACGGAGACCGAAACATGCCAACATTTTCAGAACATAGCAGAGGCTGAGTTGGGAAACGTCGATCCTTACAACATCTATGGTCCCCAATGTTCCATTAATTCAAAGAGCAGATCTTCTTCTCCGAAACTGAAGAATGGATATGATCCTTGCGAACAACAATACGTTCAGAATTATCTCAATCTTCCTCATGTGCAGAAGGCCTTGCATGCTAACCTCACTAACCTTCCTTATCTTTGGAACCCATGCAGGTAATCCAACTAAGTAAATATTATGTATAGCATATCGATTTAACTTATATATACCGATAGTATAAACAATTTTTACACTGTCGTTGTATATGTATTGATTTATATATATACACTGTTAGTGTAAAATGTGTTGTAACAAATAATCTATGTTATTTTTCTATTTATAAATTAAATTCTACTTTTTATAAATAATAACTTGTACTTATCTTTTTGGTCACCTGATAGAAACTCTTTTATATCATCCAATGTGTATTTAAATCTGTTGCGGCAATGATATTTCTTATTTTCAAGATTACAAAATCTCACTCTTTATGTTTAGTTTATGTCACTTTTAATATGTAGAAGGTAATTGAACTCATATAAAAAATAGTGTATATGATATGATATGATGATTTTTTTTCTTTTTTTTTTTTCATTTGGTATGGTAGCAATTTGGATTGGAAGGATACTCCAGCAACCATGTTTCCGATATACAAGAGACTTATTGCATCTGGTCTACGTATACTTCTTTACAGGTAACTTTATTATGGGCTTATCTTAGACTTTGGTTTATGTTCATGATACAATATTTTTAATTGTTCGAATAAAGAACAAGTGGATTTGTATTGTTTGGAAACAGTGGAGATGTTGATGCAGTAGTTTCAGTTACTTCAACTCGCTATAGCCTTAGTGCTATGAACCTTAAGGTGATCAAACCTTGGCGTCCTTGGCTTGATGACACACAAGAAGTACGTTCTTCGAATATATTTTTTAATGATAATTTTATATATTTGTGGTGAGAAATAAATCTTATTGTTTCGTTCTTTGTTTTTTTTTTATAATTTAAAGGTAGTTTGTATAATTTCTGCAGGTAGCTGGATATATGGTGGTTTATGATGGATTAGCTTTCGCAACAGTTAGGGGAGCAGGGCACCAAGTTCCACAATTTCAACCACGTCGAGCTTTTGCTTTGTTGAATATGTTCTTTGCCAATCATTCT
SEQ 58
ATGGCTAATTCTTATACAAGTATTAATTTTTTCCTTGCCCCTATTATTTTCTTGGCGATTCTGGGATTGCAGTTGCAGAGCAGCGATGGTTTTGGGACATTCGGGTTTGATATCCATCACCGGTATTCGGATCCGGTGAAGGGTATTTTGGACCTTCATGGATTGCCTGAGAAGGGCAGTGTTGAGTATTATTCAGCTTGGACTCAGCGTGATCGCTTTATCAAGGGTCGCCGCCTTGCTGAAGCTGATACAGCTAATTCCACTCCCCTCTCTTTTTCAGGAGGGAATGAAACTTTCCGCCTCAGTTCTTTGGGATTGTAAGCTTCCCTCTATGCATTTTTCTGATTGCTTTTTGCACTTGTCTATATCTTTATTGTTTACTTTTTCTAGTCATATACATAGATTATATACTAATTATACATAATTATACATATATAATACAAAAATTATACCTTTTAAGTGGTTGGGTGGGCGGCTATTTGGGTTAATTCTTCTTCTTTTTTTGTATGTGTGTTTTGTATCTGTGTTATTATTCCTGATTGTGAACTAGTACGTCTTTGGAAATTCTTGTTTACTGTCTTTTCCTTTTGTCTGTTTAGTGTGATGTTAGAGTTGACTGAGCTTTACGTTTGTTTTTGTCTGTTTAGTTGGATGGCAGTTCAGGAAAAATAGGTTACCTAACTTAAATAAGTTCCATGTGCCATTTTTAACGAGATTCAAGTGGAGAAAATATGAAGAAGAAAAAGAATGATTTAGGCCTGTTCTGTTCTATAATTCTGTTTGTGTGTTTGATTGGACTGGAATTTTGTCGATTTAACTACTACATAAAATACTGACTCTTAATTTGATTTTACTTTTCTTCTATTTCGAATTCCAAGCTCCGGAAATAAATTCCGTTCTTTTTTCTGATTTTCCTCTCCTCTGCCGCCACTTAACTCCTCTCCAGACAAGGAATTGTTCTGAAGTTTCTGGCAGTAGCATGTTGTAATTTATGTGTTATAAAGATAGAGTTGCAAAATCTGTAGTATCTGTAGTTGTGATTTTTTCTTCTTAAGGTGTGTGACTAAATATTCTTTGGCAATTTGCAGTTTGCATTATGCAAATGTGACAGTGGGCACTCCTGGACTATCATTTCTAGTGGCACTTGACACTGGCAGTGACTTGTTTTGGCTACCCTGTGATTGCAGCAATTGTGTGCGTGCCCTCGAGACACGCTCTGGACGAGTATGTTTGCTTCATTCTAGTACCTTTTTCTTTCTACTTTCAAATGTTTAAAGAGTTTTTCTTTTTTTTGATCGTCATCCTCGTCTGTATATTGCCTTCTGCTACAAGGAAGTTGTGCATACTTCTCTTCCTTTTGTAATTATGAGACTTTCTGATAACCTTTTTCAGAAAGGAACCTGCTGATAACACAATGGCTGAATCTGAAACACAGTGGATTTCTCTTCAACTGTCTTTTTCGGTCATTATGACAATAATATATTCTCTTAGTTAACAAGATATGGGGTAGAGAATGTATTGAGGAAATTGTTTTTCTGTTAAGGAAGATACATAACTAGCGCAAAAAAGAAGATTTAAACATAATCAATATTTGCAAAGTGAGTCTGATGCATGTAATATACTGACTCTGAAATGAAATTTCTGATCCATATTGTTCCGTGGCTTGTTTGTCCTTGAAGAATTTTGAGATTCTTACTAGCTCAAGTACTTCAACTTGTCACGACCCAAAAATCCCACCACAGGCGTCGTGATGGCACCTAGTCTCTAAAACTAGGTAAGCCGATTTCAATTACATTTTTGGAGCCATTTTTTTTTTAATTAAATAAGTAACCAAAACTAACAGCGGAACAAATATGAATGTACAATCTCCCAAGACTGGTAGTACTAAGTCACGAACTCTAACTGAATACATGGAATGATCACGAGGACCGAATATACAATACTGTTTGATTAAAAACTCCACAGGAGTTCACCTTGAAGAACAAAATTTTCTTTGCTCTTTTGCCTTTTCCTTTTAATGTTTCTGCATGTATTATTTGACACTTGTAATCTTTTGTTTGCTTTTGAAACAGCGAATAAATCTCAATATTTACAGCCCTAATACGTCGTCAACGGGTCAGATTGTTCCTTGCAACAGCACTCTGTGTGGACAAAGGAGACGATGCTTATCTTCACAAAATGCATGTGCTTATGGAGTTGCATATCTCTCCAATAACACCTCATCATCAGGGGTACTGGTGGAAGACATCTTGCACTTAGAGACAGATAATGCTCAACAAAAAAGTGTTGAGGCTCCAATTGCTCTGGGGTGGGTATGCTTTAGTTTTTTCTCTTTATCTTTGGAAGAGATTATCTTTGGATCTTCTGATGCATTTCTTTATCCGCCATGATTTTTTATATTCTACTTGTTCAATTTCAGGTGTGGGATAAGACAAACTGGTGCATTTTTAAGTGGCGCAGCTCCTAATGGTCTATTCGGACTTGGCTTGGAAAATATATCTGTTCCGAGCATGTTAGCAAGTAAAGGTCTTGCTGCAAATTCTTTCTCCATGTGCTTTGGGCCTGATGGTATTGGAAGAATAGTCTTTGGAGATAAAGGGAGTCCAGCCCAAGGAGAAACACCACTCAATCTTGATCAACTACAGTAAGCAAGTCACTTTGATATTCTGGGTTTATCGGTTGCTTCTGTTTCTGGCTTGATTTAGGAGAATGCGACTGAATATTTATTAACTCTTACCCTTTCCTGAATTGCAGCCCAACTTATAACATCAGCTTGACAGGAATAACAGTGGGAAACAAGATCACTGATGTTGATTTCACAGCCATTTTTGACTCTGGCACTTCATTCACATACTTGAATGACCCAGCTTACAAAGTCATTACAGAGAACGTGAGCGACAAGCTGACTGTATGATTTTAAGTTGGAGTTTGTAACTTTGTATTGTAAAACTGAAGATATTTTTTTTCTTTTTTCAGTTTGATTCTCAAGCAAAACAGCCACGTATTCAACCTGATGGCGAAATTCCTTTTGAATACTGCTACGGGCTAAGGTGAACCATCTTTTATAATCTTCATCATTTATTACTTTCTTGACGTCCTTTGAACTCTCAGGATTAACATGCTACATACGCAGTGCAAATCAAACTACCTTCGAAGTTCCTGATGTAAATTTGACAATGAAAGGCGGCAACCAATTATTTCTTTTTGATCCGATAATAATGCTCTCGCTCCAGGTAAGATGGTTTCTGCTCCTTTTATATTACAAAAGTTCTCTTTTAGAATATCCTAATATCCAGTGATGATCATCAGGATCGTTCTGGCGCATATTGCTTAGCTGTTGTGAAAAGTGGGGATGTCAACATCATTGGACGTAAGTATCTATCAGTTGCTTGCTCGTAAGATTTTGTTTCTATCCATGGAATTCTGCAATATAACTTGCACCATGCCAGCTAATGATCTCACAATTACCAACTTTTAGAAGTTTTGGTTCCTATCGAGTTTTTTACATACTTCTAGCTTATGTATAATTGGAAATGTGAATGTGACAAAGTAAATTAGTAAAAACCAACTAGTAAAACTGGTTCCATTGTCAAAAGTCTGAGCTATTTGTTGATTTACTTGGATTTTGTCTCTCTATTTGGAATTCATGACAGAAAACTAATACACGGATGTTTTTGCAGAAAATTTTATGACAGGCTATCGCGTGGTTTTCGATCGGGAGAAGATGGTTTTGGGTTGGAAACCATCGGATTGTGAGTTCGCATTCCTGAGTATGACCTCTTTAGTGTGCACACCTGCTCATATAATTTAACTATAAACCTTTCTTGGCAGGTTATGATTCTAGAGGATCCAACGACAAATCGACAACTCTGCCAGTGAACAAGCGTAATTCTACTGAAGCGCCTTCGCCCTCCAGTGTGGTGCCAGAGGCCACCAAGGGAAATGGAAGTGGAAATGAACCCGCTACTTCGTTTCCATCTGTTCAATCATCTAAACCTGCAGCAAACCAAGCACCAGCACATTTCATTTGCCAACTTATGATGGCTCTGTTTTCCCTTTTTAGCTATTATTTGATCATTATTTCTTCA
SEQ 59
ATGGCGATTCATACTTCCACTCTCTCCATCTCCATACTTGTAATGCTCATGTTCTCCGTCGTATCATCATCGGCGGCGGAGGACATGTCCATTATAAGCTACAACGAAAAACATCACACGAACGGCGAGTCAACGGTCTGGCGAACAGACGATGAAGTCATGTCTTTATATGAATCTTGGCTAGTTGAACATAAGAAAGTGTACAACGCCTTAGGAGAAAAGGACAAACGGTTTCAGATCTTTAAAGATAACCTTAGATACATCGATGAACATAACTCTGTGCCCGATAAAAGTTACAAGCTGGGTTTGACCCAGTTTGCAGATTTGACCAACGAGGAGTACAAGTCCATCTACTTGGGTACTAAGCCCGATGGTCGTAGCAGGTTGTTAAATACCCAAAGTGACCGTTATGCCCCTAAGGTCGGAGATAGTTTGCCGGATTCCGTTGACTGGAGGAAGAAAGGTGTTCTTGTTGACGTCAAAAATCAAGGGCAATGTGGTATTTTCCTTTTACCCTCTGCCTTGACTCTGCACCTGTTGTTTTTGTTTTCCTTTTTGTTCGTACTTATTTTCTGTTTAAAGTTTGTCCATGCTTTCTTTACTGATGGCTTTGATGGAAATTTGGAAACTTTAGTAGTTTGATAAGGTAAGATATTAAAATAATCACAGAGTCATGAGTTTTAATCTAAGATCAATTTTAATGGCAAGTTCAGTTGACCCTGCATTATTGTAAATTTTAGCTTAACATTAAGTATGATTAATTAGGTCAGCACGATGAAGTTGACAACTTTTGCTCCAATTTCCGCATCTAATTGTGGCAATATAAGTAATGCTTTTTTCCCTTGGACAAAACACTAGTTTCCGGAATTGAGCTATTTTATTCAATTTAAAATGAAAATTTTCTGTTTTAATGTATTAGAACTATAAAGAAACCGAAACATTAAGTAAACTTCGGATTGATCTGTGTTTTTCGGGAATTTAGTTGTTAGTGGTCTAATTTTCGGTTTAAATGCAGTTCTTAATATTGGATAGGCATTTTGGCACTTTTCTTGGCTGTCGCTTCTCTTACCTTAAAATTAAAATTATGGAGTACCTACCAAGTTCAAGATCTTATGGTTGTAAATTGAATTTGTAAAAGGGGTTCTTCTTCGTTTGCTCTGAGATCCTTCTTTTAGCTCGCTCCTTAAATATTTACTAATCAGTGGTTTGTAGCTCCAACCGAGTGTCTATCGGAAACAAACTCTTTACCCTTCTAGGGTAGGGGTAAGGCTGCGTCACTTGTGTGAACTCACTGGGTTTGTTGTTGGTCTGTAGTCCGATATACCCCCATCAAACACCCTTGGAGTTGTTTCACTATGTCTAGTTGTGTCAATTGTTTTGGCAAATTATGCAGCCTTGATTGATTGGATTATCTTCCATTTTATGCATAAGTAAATGCTGAGGAAAAAATGATATGTTTATATCACATAAAGCAACTAATAATTTTCTTCGTAATTGGTGTTGCAATTGGGAAATGAAACAGGGAGTTGTTGGGCTTTCTCAGCAGTTGCTTCAATTGAAGCAGTAAACAAGATAGTGACAGGTAATCTGATCTCGTTATCTGAACAAGAGCTGGTAGATTGTGATACGTCCGATAACCAAGGCTGTCAAGGGGGTCTAATGGACGATGCCTTTAAATTCGTCATTCAAAATGGAGGAATAGACACTGAGGAAGATTATCCTTACAAAGCCAAAGATGGAAAATGCGACCAAGCAAGGGTCAGTATGGTGTTCTCTGTCTTAAAGGGATTATAGGAAATGAACTAAATACAAGTTGTGACTATTAATATTTTGTTTGCAGAAAAATGCCAGGGTTGTCACCATCGACGGGTATGAAGATGTTCCTGATAATGATGAAAAGGCACTGAAAAAGGCCGTTGCTGGTCAACCCGTCAGCGTTGCTATCGAAGCTGGTGGCAAAGACTTCCAGCACTATAAATCGGTATTACTTCAGATTTGCCTATTGTCAGTAAAGTTGTTTTCTTTTAATCGAATTAGCTAGTGTTTACACAGGCTCAACAAATATTTCTGTATTTTCAAAGTTACAGTGAGTTCAGTATTAAAATTTTTAAATGTTGATCCTATTAAGTTTAAATGTTGGATCCGCCTATGCCCCAGGGTATCTTTACCGGAAAATGTGGTGCAGCAGTGGACCATGGTGTGGTTGCAGTAGGGTATGGTAGTGAAAATGGCATGGATTATTGGATTGTGAGGAACTCGTGGGGTGCTTCGTGGGGTGAAAAGGGCTACCTCAGGATGCAGCGAAACATTGGCAACCCCAAGGGTTTGTGTGGTATTGCTACGATTGCTTCTTACCCTGTAAAGACAGGCCAAAACCCTCCAAAACCAGCTCCATCTCCTCCACCAGTCAAGCCGCCCACTCAATGTGATGATTATAACGAATGCCCAGCTGGAACGACGTGCTGCTGTGTCTACGAGTACTATAAATACTGCTTTGCTTGGGGTTGTTGTCCCATGGAAGGAGCTACTTGCTGTAAAGACCATAACAGTTGCTGCCCACATGATTATCCTGTCTGCAATGTTAAAGCAGGCACCTGCTCAATTGTAAGTGATCTCTGCTTGTTATTGTTAGATTGTCCCGCATTGGTTGAGGGGAAGTGTTGTTGTCTCCTTATATAGTCTTCGGCAAGTCTTTTTAACAGTTAAGGTTGTTTCCTTTACTTATGGAATCATGTTTTTGTTGATACAGAGCAAGAACAACCCACTAGGAGTCAAAGCAATGCAGCACATTCTGGCCAAACCTATTGGTACCTTCGGAAATGAGGGAAAGAAGAGCCCTTCTTCT
SEQ 60
CTAAGCACTTTCTGCAAATCCAATTTGTGAACTGCCAAAGTCGAAAACTGTGTGATATGCTCTCAAGAATGCATCTCCAAGAACCCTGCAAAAGTAGAGTTGAATATATCATACAACTGGATCTTCATGAATATATAATATATTATAACTTATGGCAGTGAAAATAGTCTTACCAGAGGGGACGTCGCGGATGCGCGTTTAAAGTTGTAAATCCACTAATACAGTGGACACCTTGGCTGTCATCAACTCTGATAACATACTATTACACAGAAAGGAAGATAGTAAGTGGAAGAAGAGAAAGCACTGCTATTTAAAACTATAATATGTTACTAATATACGGCCTAAAAACAAGACGCTAACGGCTTTTCCAATGTACCTTACTGGAAAACCAGTTCAAGTTGTCGAAGCAATTTTAATCTACATTGCTCCGATCCTCCAAAAATGCTAACCGCACTCATGTTGGATCCTCCAAAAAGTGCAATGAGATTTTTGCGGGATCCCAGCAATCAGTGGCGAATCCAGGATTTGAATTTTATGGGTTCAATCTTTAAGATTTTTAGTATTGAACTCATTGTATTTTGAAGTTATTGCTTCAGTACTACTATTTATTAGATTTGACTGAACCCGGTACTAATATGATGCATCTGCCTCTGCCAGCAACATAAATTTTAATGGGAAGATAAGAACTTTTCTCCGATATTATGTCATCTCCGATAAGAACAGATGTAATTCTAGCACCAGTTGCTACCTTTAACACATATTGATAATGGAGGAATAGTATCAGATAACAACAAACCTGATCTGGAGAAAGGGGAAAAGATTTGTCTCCAATGGTAAATGTTATATGTGGCAGGGCAAAGACATCACAGTTGATAAATGATTTTCCCCCGGGATTCGGAAGCTTCTCACACAGCTTCAGCACAAATTACATGTATTAGTAGTAAGTAACTAATGAGAACTCGAAAAACAAAAAGACACAACTGTGACATGAATACCTGATTGGCATATTGAAACGCTTTTTCTTTTGATCTCTCTTTTCTGATCTCTACTTGTATCCAGAACACTATCATCTCACAAGAAGAACATAACGACCCATTATTTGTACAGAGTCCAATTCTGTTGCATACGTTCTCCGGTTGTAACTGCATGCAGCCAAACAAAAGGATGACATCTAAATAAGAGAACACCTAAAATACCCGCACACAACAATTAGAAGTTAAAAAGCAGCTACCCCTGCTATCAAGCGTTCCCAGATCGAATCCCCATAACTCGAGACAACTTTTTTGCATTCCAAACTAATAATTCCTTCCGCTCCAATGGCATGATTTATTTGAGTTAAAATAGTCTGAAATGGAGATGGCTGAGAAGAAATTGATCTTGCAAAAAAGAATTAAAATCAACAATGGTTTAAATACAGACAGTTGGACCAGCGATAAATGATGTCCCTGTATCCACAATAGCTGGACATCCATCCTTACAAAGGCCTGAAAGGGAAAGAGCAGAATTACACTAGAAATGCATTGTTTTTATATAGTAAATCACTTATGTATATGAAGATATTACCTGTTGAATTGCTTCCTATAAAAAGATCCCCTATCTCAATCTGGAATTTTTGAAAAAAAACTCAGTTTTTTCAGGTTACCAAGTAAACAATACAAATAAAGTTCCATTAAACGGAGGCGCAGGTTACCTCCCAATAACCATTTTGAGCGACTGGTACGTATGTATGCTGACCCCTGAAGTGAGTCCAATCCATGCCTCCAAAGATAATTTCACCCGCTATCTTAGACGTAGGATCTCGATTTAGCCAGAATGAGAAGATTGACTTGGTAACCATATGCTGAAGCAACATGTTATACCTGAAATTAACTCACACAAAAGAATGTGGATTTCAATTTCAATGACACAGGTAAGAAATGAATGAGAGAACAAGCTCATTAAAACTATGTCAGAAAGCATAAATTACTATGTCAAAAGTTATTATAATATGAAGAGATAATAATTTACCATACTGGTGTGACATTCCTTGATGTCGTGCTCTGATCAAATCCAAGTCCTAGTACTCCATCAAATCGTGCACGCAACAATGTCAAGTATCCCTCCCGTGTTACCTCAGTGAAAACCTGTTTAATAAATTTTATTCAACATGTAACTTGAAAACATATATATCTACAATTTCAGCTGCAAAGACCGGCACCTGCTGCTTTAAGACAGCACCTCCAACTTTCACATTGTCTTGGCTGAAGAATCCATGAACTGAACCAGTGCCAAAAGGGATTTTGCTAGACTTTCCTATCATAAAAGCCCATGTATAGAAGTGATAAGTATTTCGATTAAGCTATTCTACGCTTAAATATCAACAATGCTCTAACCAAAATAAAGGTTTGGAGGTCCATTGGGTAACTACCAATTTTTGTATACGTATTTGATAGTCTTGATTTGTACCTGGAACGAAGATAACATGCAATCTGCATTGGATAAATCCATTCAAAAGTTAGTTAGTTTCATGTAGTGAAAATTGTTAATCCACAACTTGACAAAGACTAACCGAGAAGAAACATCTGGAAGAAGGGACCCAAAGATTGGAACTTCCAGTATCAAACACAACAATGAAGCGTTGGGGCGGTGAACCAATACCAATCTCCGCGAAGTACTGAACATCATGATAATTTTTGAGGTAAACTATCTGGTCATTCGGAGCAGCCAAATTTCTATTGCGACCCCTGAGATCTTTAGCGTAGATTCTTGCATCGCTTATGCTAGAAAGGTCCAACGATTGCCTTTTTAGCTCAATCCTAACCATATCATCAGCATATACGTTGATGCAGGTTATATACCATATTACAAGTGATGCAAGAAGGATTTTGATCTCCAT
SEQ 61
ATGGCGTCAATTTTCGCTCTTTCATTATTTTTCATTATTATCTCTTTCTGCATCACTTCGATCACCATTCCCGTTCAATCCGACGGTCACGAAACTTTCATCATTCACGTTTCTAAATCCGATAAGCCCCGTGTGTTCGCCACCCACCACCATTGGTACTCCTCCATCATCCGATCCGTTTCTCAACACCCTTCTAAAATCCTCTACACCTACTCACGCGCTGCCGTGGGCTTCTCCGCCCGCCTCACCGCCGCGCAGGCCGATCAGCTCCGCCGTATTCCCGGCGTAATCTCCGTCCTTCCCGACGAAGTACGCCACCTCCACACCACCCATACCCCTACCTTCTTAGGCCTTGCTGACTCCTTCGGCCTTTGGCCCAACTCCGATTACGCCGATGACGTCATCATCGGAGTTCTGGACACAGGTATATGGCCGGAAAGACCGAGTTTTTCCGATGAGGGTCTCTCTCCTGTTCCTTCAAGTTGGAAAGGGAAGTGCGCTACTGGACCGGATTTTCCTGAAACCTCATGTAATAAAAAAATCATAGGTGCCCAAATGTTTTACAAAGGCTATGAAGCTTCACATGGCCCAATGGATGAATCAAAAGAATCGAAATCGCCAAGAGATACTGAAGGACATGGAACACACACAGCATCAACTGCAGCTGGTTCTGTAGTGGCAAATGCTAGCTTTTATCAATATGCCAAAGGTGAAGCTAGAGGTATGGCTATAAAAGCAAGAATAGCTGCTTACAAGATTTGCTGGAAAAATGGTTGTTTTAATTCTGATATATTGGCTGCCATGGATCAAGCTGTTAACGATGGTGTGCATGTGATTTCACTTTCCGTTGGGGCTAACGGTTATGCTCCACATTATCTCCTTGATTCTATTGCAATTGGAGCTTTTGGTGCATCTGAACATGGCGTCCTCGTCTCATGTTCAGCTGGAAATTCTGGTCCCGGCGCTTATACGGCAGTGAACATTGCCCCCTGGATTCTCACCGTTGGTGCTTCAACTATAGATCGTGAGTTCCCTGCAGATGTTATTCTAGGAGATAATAGAATATTTGGTGGCGTATCATTGTACTCCGGCGATCCATTGACCGATGCCAAATTGCCGGTGGTTTATTCCGGCGACTGTGGTAGCAAATACTGTTATCCAGGAAAGCTAGACCATAAAAAAGTCGCTGGAAAAATTGTTTTGTGCGATAGGGGAGGCAACGCTAGGGTTGAAAAAGGGAGTGCAGTGAAGCAGGCAGGCGGAGTAGGGATGATACTCCTTAATTTGGCCGACTCCGGTGAAGAGCTCGTCGCCGATTCACATCTTCTCCCCGCGACGATGGTAGGTCAAAAAGCAGGAGACAAAATAAGACACTACGTAAAGTCTGATCCTTCACCGACGGCGACGATCGTGTTCAGAGGAACCGTGATCGGAAAATCACCGGCGGCGCCACGTGTAGCGGCGTTCTCGAGCAGGGGACCGAATCATTTGACGCCGGAGATTCTCAAACCGGATGTTATTGCACCTGGAGTTAACATTTTGGCCGGTTGGACCGGATCTGTTGGACCGACCGATTTGGATATTGACACGAGAAGAGTGGAATTTAATATTATTTCTGGAACTTCCATGTCGTGCCCTCACGCTAGTGGATTGGCTGCGTTACTTAAAAGGGCCCACCCTAAATGGACCCCAGCAGCGGTAAAGTCAGCACTCATGACAACAGCTTACAATTTGGACAATTCTGGTAAAGTATTTACAGATCTTGCCACTGGCCAAGAATCTACTCCTTTCGTTCATGGATCAGGTCATGTAGACCCGAACCGAGCATTGGATCCGGGTTTGGTTTACGATATCGAAACTAGCGATTACGTGAATTTCCTATGCTCCATTGGCTATGACGGCGACGATGTCGCCGTGTTCGTGAGAGATTCTTCTCGAGTGAATTGCAGTGAACAGAATTTGGCTACTCCAGGAGACCTGAATTACCCGTCGTTCTCTGTTGTTTTTACCGGTGAGAGTAACGGTGTGGTTAAATACAAGCGGGTGATGAAAAATGTAGGGAAAAATACAGATGCTGTTTATGAAGTGAAGGTGAACGCGCCGTCGTCTGTGGAGGTGAGTGTGTCGCCGGCGAAGCTTGTATTCAGTGAGGAAAAGAAAAGCTTGTCGTATGAGATTAGCTTTAAGAGTAAAAGCAGTGGTGATTTGGAGATGGTGAAGGGGATTGAATCTGCATTTGGGTCGATTGAGTGGAGTGATGGAATTCACAATGTGAGAAGCCCAATTGCAGTGCGTTGGCGTCACTATTCTGCGGCATCCATT
SEQ 62
TCACATAGGAGCAAGATGACCTTGTTTGGACAATTTATCTTGCATCCACCTGTGAAGCATTTCCAGTGCTGCCTTAGGTTGATCCATTGGAACCATGTGACCAGCATCATGGACCTTTAAGAAAGTTAAAGGCCCATAGTTCTTTTGAACACCTTTCTCTACACCATCTACTGCAAAAGAAACTTGTGTGGCTTTTCCAAAGGCTTTTTGCCCTGACCATTTCATTGCATGCACCCATCTCGAGTTTCCTGCCCATTATAGCAAGAAATTGAGTTTAGTTGTCAATTACTAGGTTGTTTTCATCTTTCAGTTTATTGTAACAAAATTATGTTTATATATCACATATAAAATAAAAATAGTTACCGGAAATATATACTAAATCCGGTCAAAGAGAGATCATATCCATGTAACACAATGTATAGGAAGGCCCATTTTTTTTTTGGTTGGTCGGGTAATGTTTTGTTTTGACAATAAGTAGTGTCACATGGATATTCTGAATGTAAGGTCAGTTGCCGATATGACTATATTATTACATGTAAATGTTATACATTTGGCAGCCTACTTGCACTTTCTTAGAACCGGCTTTAACATTTTTTGACTTTGTATTTTAATTTTCTGTGATACATAGGTATAATTAAGTTATTTTGCTTTACAGAAACATGCTTAATTTTGTCCATATATGAAATTAGATGTTGGGTCAGAGATGAATGAAGCTTACCAAGCCAATTGCAGATAAGGTCATATTCCCCAGCATACACAAGTAGCTTGATACCATCCTCAAGGAGTGAAGGAATTCCCTCTTCAAGATTCCTCATCCAGTCCAACTGCATTGCCTGGTAAACCTCAGAGCTACATGAAACAAACTCAATATCCCCAACACCAAGTGCCTTTTTAACTTGTTGGTCATTGAGGAAAGTTTCCATTTTTGAGAAGTCATAGCAGAGATCACCCTCACATCTCTTCCGCACGTCATAGTACTGCAACTCGGAAATTACATGACTTTAACTTCTCTACACTAACTAACAATGTAAAGAAATTTTTATTGTTACTTAAAAGATAACTACAGATAATAAGTGAGATCAGTAATTTGGAAAATGAGACTGCAAGTAGCGTGTTGTACATCATTAGCTCAAAGAAAATGAGATTGCGCTTTCTTACATTTTTGTCACCAGCAATGTCCATAATCTTGTTGAAGATGCTTGTACAAACAAGATATGCAGCCATGCAAGCAGTTCCGCCATCTTTTCCTAATATTATTTGCATAGAAATAAAGCTAATTAATGTCAAGATTATATTAGCTGCTGCTATATGAAGGAGAAAGAACTTACAGGATAAACAAAAAATTAAGAATTACCACAAAGCTTAATTGCCAACTGACATTTTGGATATGATTTCTCTATGGCATTGTAATCAGATTTTTTTATCAATTTCATATCCAGAGCATAGTCAGTGTAGGCTTTGTATTGAATTTCTGGATCAGTGAGTCCATTACCAATAGCAAATCCCTATACAAATTAAATACACTTGGTTAAGTTATCGGCATGATGACAAATTTAAATTAAACCTACCTAAAAGTTTAACTGAAAAAAAAAAAGAATGGTGGAGGAGCTAATGAGTTAGAAATACCTTGAGATTTACGTAGATGCCTTCTTTATTTTTGTTTCCTTGGTGGACCCGAGAAGCAAATGCAGGAATGTAATGCCCAGCATATGATTCTCCAGTAATATAGAAATCATTCTTTGCATACTGAGGGTGTGCCTTGAAGAAGGCCTATCATCAAAAGAATTTGAATTAAATTTTATTAATTATATCAGTTAAACTTTAGAGACTTATCACGAGCTAAAAAAAGAAGAATGAAAGAATAAGATCAACCTGCAAGAAGTCATAGAGATCATTGCTTACGCCCCTTTCATCGTGACGAATATCATCATCGTTTGAACTATAACTGAAACCAGTTCCAGTTGGCTGATCGACGTATATAAGGTTTGAGACCTGTCAAATTGCAATTTATCTTATGTTATCATCATTCTTCAACTAACAAATGAAAGTTGCATGTTTGATTATAGGATTTAACCAATGTAAACGACTTTTACAGTATTGTTATATATATATATATATATATATATATATTAACATGTTGTATTAGTCCGGTTACTAATCCCACTTAAATAAAGAGAAGCGTAGTAGTCATTGCTGTCAATAAGCGATGAACTACTTTTAAACTTTTGAATTCTACAAGTCACAACTAATGAACAAGTGATAAAGAAAGGAAATGCTAGTAGGTAAAAAGGTACTTTGCATGATGGAGCAAGGTTGAGTAACAAATAAAAACATGGAGGGAATTCTTTTAGACTTTTACCATATTCAAAAGATCTAACCGACGTTTCTTGAAAAATTAATTGGGTAAAATAAAAAAAATAAAAAATAAAAAGCAAAAGGAAGAAGACAAGAACCTTGTCCCAGCCGAAATCATTCCAGACAAGAGACATATTATCTGCAATTTTGAATGGTCCGTTTTCGTAAAACACAGCCAATTCACTGCTACATCCTGGCCCTCCTGTTAGCCATATAACTACTGGATCATTCTTCCTGCTTCTCGATTCAAAGAAAAAGTAAAACATCCTGCGAAAAACATATTAAAAAAACACACAGATAATTTAGCATTAACTAATAATACCCATAAATGAAGCAAAAAGAGCATTTAATCCAATTCAAACCTTGCATCTTTAGTATGTGGAAGACGATAATAACCAGCGTGATGACCCAAGTCTTGAACTGTAGACCCAGAATTACCAACATAAGATAAATTCAATTTCCTCTCAAAAAGCCTCTGTTCAGAATCCCCTGTTGCTGCAGCCTTGTTAATATCATGTTTAGGGAATAAATTAAGCTGTCTGATTAGCTTTTCTGCCATTGTTAATGGGAATTTAGGAGTAGAAGATAGGAAAAACTCATCGTCATTAGAATTTAAAGTTGATGAGAAAGATAAGGAAATAGAAGCAAGAAGCAGAGTAAGAAAGAGGGATGAAGGCAT
SEQ 63
ATGTTAGTTATCAGTGATTGTTATATAAATTCTTGCAAAGCTTTCAACTTTGTGATCAATTTGCCCGTCATGGGACACTCTCACTCTCATTCTTCTCATTCTCACTCTCACTTTCACTCATCTAAATCTTCCGATGATCAAAATATGGATATGGGGGAATCGATCACCACCCAAACAGACGTTTCTTTCATGCTCGCTAAGCATGTTTTCTCCAAAGAAGTTAAGGGCGATTCCAACCTGGTGTTTTCTCCTCTCTCAATTCAAATAGTACTTGGCCTGATTGCGGCCGGTTCTAAGGGGCCAACTAAGGATCAGCTGCTCTGCTTCCTCAAGTCCAAATCCATTGATGAACTCAACTCTCTTTATTCTCATTTTGTCAGCGTCGTCTTTGTTGATGGCAGCCCCAATGGAGGTCCTCGTTTGTCTGTTGTTAATGGTGTTTGGATCGACCAAACACTGCCTTTTAAGCCTTCTTACAAAAAGGTTGTGGATAAAGTTTACAAAGCAGCTTCCAATTCTGTTGATTTTCAGTGCAAGGTTAGGCCTTTATTCGTTTGTTTCATTCAAATCTTGTTTCTTTTGTGCTGGGGTTTAATATTCTTTGTTCATGCTGACTGCTGAAAATTGGTTCTTTAACTAGTATAATTGACCCTGCATATTACTCTCATCATAAGCCCTCCAAATATATCATATAAAATGGATATACATATAGTAAACTGCAACTAATTAACTTGGGATTGAGGTATAAATGATTGATTGACCAATCTGACTTTAAAATAATGAAAAGTGTTAAACAATTAGGACAGAAGCTATATTGCTTAGCCTCAAGTAGTAACAAAACTAAATAATGTCAACGGTTGATACTCGTTTCACAGAATTGAGGCAGTTTAAAGAGTAAAAAGTATTGGTTGTTAATTTGAAAAGTAAGATGAAAGAGTCACAATTCATCTTCATCAATGCTTATTGTTTAGCAGGTTAGTTGACTAGTTCGACATTTTACTGAGTGGTAATAAGCTTCTTTTTTGTAGGTAGCTAAAGAAGCCTAAGTAGTTCTAAGCTCAACTGGATATGTGGCCGTGCTTAATTTTGTAAAACTTCAGTTTTTGGCCTAAATCTACACCCAATCAGTGCTTAAAATATACCATGTAAAGCATCCAAATTCTCACTTACCCCTTGCAAGTACTGTAATCAATCTTCTTACTGCAAACTCCCTTTGTTGGCTAAAGCATATACGTGTTAATTCTGTCGTATACTCTGTTTGTCTTGCTAATTGAATAAGGCTGCTGAGGTTGCCAATCAAGTCAATCAGTGGGCTAAAATGAAGACAAATAATCTCATTAAAGAGATTCTTCCTCATGGAACAGTAAACAATATGACAAGGCTCATCTTTGCAAATGCATTATATTTTAAAGGAGTATGGAATGACAAGTTCAATGCTTCAGAAACAAAAGACCATAAATTCCATCTCCTCAGTGGAGGGTCTATTAAAGCGCCGTTCATGACTAGCAAGAACAAGCAATATGCAGTAGCCTTTGATGGCTTCAAAGTGTTGGGACTTCATTACAAGCAAGGCAAAGATATGCGTCGTTTCTGCATGTATTTAATTTTGCCAGATGCTCGTGATGAATTACCAGCTCTATTGGACAAGATTAGTTCAGAACCTGGTTTTATAGATCATCACATTCCGTTTGAAAAAGCTAAAATGCGCAAGTTTCTTATCCCTAAATTCAAAACAACTTTTGGTTTTGAAGCTTCCAAGGTTCTAAAGGGACTTGGCCTCACATTGCCTTTCTCCAGTGGTGGCCTCACTGAGATGGTGGATTCCCCGTTAGCTGGGAGGTTGTTTGTTTCGCAGATTTTTCACAAGTCCTTCATTGAGGTAAATGAGGAAGGAACAGAAGCTGCAGCTGTTACAGCTAGTGTAATAATGACCAAGTCCTTGATAATTGAGAAGGAAATGGAGTTTGTTGCTGACCATCCATTTCTATTCCTTATAAGAGACGAATCTACCGGTGCTGTGTTTTTCATAGGGAGCGTGCTGAATCCTCTAGCTGGT
SEQ 64
TTAGCTTGAGCAAGCTGACTGAAGTTCAACTGCATTTTCATCATCAGTAAGGTCACTAGACATGGCATGAGGTATTCTATGGCGTTTCAATATACGCGATGTGGCTATTCTTGCCGATTCATAGTTCAAAACAATCACCTTCTCATCATCCAAATCAAATCTTACATTCTTTTGGTTACCATCTTCCACAAGCTGACGTAAATGTTTCAAATTCAGAACTTCTACGCCATTAACCTTCTTCACCTGTACAAAACGGTTCAAAATTTCCCCGAAAACAAAGAGGAAAAATAATTATAAGGCGCCCTTATGTTGTCTTTTAAGGGAGAATGAAGTAATAGAAGCAACAAGTGTTTTACCTGCAACTCGGCAAGGCGCTCATAACCAGCATTAATATCATCCATCAACACCTAATTAATTGCAAACGGTCATATTAGCAGCATATAGCTCATTGATCTGCGAGTACAATCCTAAAGGAAAAACAGACCTCCGTCTCTATGTATAGGTTAGGAATCATGTTGCGAAGCAAAAGTTAAGGCACAAAAAATTGAAGTAAACAAATACAAAGTTTATTATTTATGAAGTTAGCCAAATGACATAGATTGTCAAGTAAAATAAGACATGCCTTCCTCCCATGTCAGAAATCTTACTGGAAACATTTACAAAAGGTTATCTAGCAGCAAATATCTTAATACGTTAACAATGTCTGCCCATTTGGATAGACCCGTCCACCATCAGTTTTGTGTGTCTCAGTATGTATTTAGCATGTGAAATTATGGTGGAACTTCATGGCGTCTTTATGCTTTTTTGTTTTTCTATTTTTATAAGAGATCAAATTATTTGAGTTTTCTATTTGACAAAAGGATAAGAAACTCTAGATACCTGAGAAAGGATGATGAATTGTTCACCAGGTTTCTTAGGTAGTTCCCGAAGGGCTCGTTCACACAACCGACGGGGTGAGGCATTATACCAGTCTTCTCCATACTCGTGAAGGAATGGTTGAGTTAATGGAATAAAGACGAGACCAGCAAATATGAAATAACTCGGAAGCTTGTCAAATTGATGAACTGGAACAAGTGGCTGCAACTGCATAATGTTAACTAGTTGTTAGATACAGTAGTCAGTTACACAGCATCAAAGAAACCTTTAGATTACCAATCATACAAGTTGATAAAGTGTTAAATCTAAGGCTTAGCAAAAAGTGCAGTCACCTTCCCAACACCTTACTCCTATGCATCAGATTCCCAAGAGCGACAAAGCAAAAAATAACCTTGGTCCACTCGCACACCAAAATACTTCTAACAAAACAAAAATTTACCTTGCTCCAATACCGCACCAAAATACTTCTAACTACATAAATTATGTGACACCCCCAAATACTTCAAGGCTTCTTTCATCTTTTCTATTTTTCTATTTTTGTAAAAGATCAGATTGTTTAAGTTTTCCATTTAACAAAAGGATACTTGATGAGAGCAACTATAGATACCTGAAAGAGGATGATGCATTGTTCAACCTTCATATTATAACTTGTGCAAATGATGAATTTACTTCTACCGGAGCGATAATGTCAATAAGTTATTTTACCAGAGTAATTCTAATTGAACAATGTAGATGCAGTGATTTCGATCTATTCGTCTGATTACATGCACCCAGGATATGAGCAGAAGTTTGAAAAATCACTGGGAGATACATTCGAAACAACAATTAATTATGAAGGGTTAATAGAAGGGTAATTATGAAGGGTAAATAAAAGTGACTTACAGGATGAAGCGTGATTTTGAAGTCATGCACTTTGCCATTTCTCAAGACTTTAAGTTCAGCAGTTTCATTAGGTTTCTTCATAGATACCAGATGGTCAAATGTGATCCTCTCTCTGTTTCGGAAAGGAACTGCAGATCATAAGCAATCTCAAAACTTTAGTCCAGGGAAATAAGGTTGCTAATTTTGATATGCATTTTAGTCAAAATGCAGAAGGGCAAACTTTTCTAACAGAAAATAAGTTATTCTAGCCTTACAATTTTCATATCCTGCACACCCACTTATCTGGTTTCTGGTATAATTATCAGCTATTCACATGGCAAGAGAAGAAACTCAGAATTAAAAACGACTAGACTCTAGGCTTTTCCATCTCTCAAAAGGAGGTCATTAATTTGTTACTCATAGTTGCTCGTTGAACATGAATACTTTTTAGTTGGTGCTGGCCTGCCTAAAAGAGCCTCTACAGCAAGAACCACATCCATGTTTCATCCAAATCCGTCATTTTACGTCACGAAACATTTAGAGAAAGAAAAAGGCATGTGCCATAAAATGTAAGAAAAAGCTGTGAAGAAGAATGTTGTACTAGTTTAACTGTCGTACATAAGAATTACAAGAAATAGAGAGAAGCAAGGCAATAAGAGAAAGCCTTACCTGTTCCATCATTTGCTATGGGTACGCCATCAAATGAGAGGATTATGTCGTCTTTCTTTAATACTCTAGAAGCATCAGAAAGTGGGTTGATTCGGCTAACAAGCACACCTGTCAATTTGGACTGCATTTGGAAGTACTCTCGAATTTGTGCATTTTCAGTAGGTTGGCATGACAAGCCCAGAGAGCAAAACCCAATGTATTCACCCCGTTCTTCTACTCCAGCTATAAAATGCTTTATCACAGGAACAGGAATAATGTAGCTGCAAAATCCAAATAGAACTGAAACTTTTAAGGCCAACTATGCACACTGTAAATTTCCTTTCCAATTACAACAGTTTTTCTCCAAGGTAGATTGTTTAATCAAGGGTATCAGTTTAGATTTGATTGTTTGCAGCACTGAGGAAGAGTCGAAATAGAATGAACTTGAATTCGAGACCGCTATTTGAAATCATAGTGAAATACTGGAATTTTTATCTCATGTCTAAGAGCTACTAAATGTTCCCACAAGCTAAGCAAATGTTGATTAAAACTAGTAAATGTCATCAACCAAATCTCCTTATCAACTGCATGTCATCACAACTAAAAGCTTTCAGGCATTCCCACATATGCCATCCTTTGTAACCCCTCTGAGATGAAAAGAATAATATTATGAAGCTAGAGCCAAAGGGCTACAACTCAAGCTTCAAATTTGTGAATGCATGAACAAGGACTGCGTGAAGGGAAAAATCTGAATATTATGACGAAGAAAAATGGAAGAGAAGATTTGCAGGAGAATACATGTGTGAAGGGAAGAATAACATGCAGTCAGATTCAGGTAAAGGAGAGAAAAATCTGAATTTTGTGATGATGCAAGTGGATATTAGAGTATATACCCCATCAATATTTAACTAAATAATATATGGTAGACCCAGCAATAAATGATGCAATGGAACTAAATCTGTATAGTAGTTCTATCCCTCCGGGGTAGGGGTAAGGTCTGCGTACACTCTACCCTCCCCAGACCCCACTTGTGGGATCCTACTGGGTTGTTGTTGTTGTTGTAGTTACAGAGACACAAAGTACCAAAATAAAATTCTAATTTACCCAATATTCTCTGCACCAGAGAGGTTTTGGAAAGCAACTCCAGCAACTTTGTCACCCATAATTGCTGGTCCTCCACTATTCCCTGGATTTATAGCCGCATCAATTTGTATTGCCAATAGTTGACTAGCGCCGTGTACATATTGCGTAGGTTCTACCCTTGAGACAACACCTTTTGTCACGGATATATTATCTCCCCCTAGAAAAAAGCAAAAATTATTAGAGAACTCCTCAGGTGAAGGTATTTGTGGCTTACAATTAAGTAAAGAAAAAAAAAAAGAGAACATAGTATAGTGAAGAAAACAAAAATATAACTAGACGTCAACAAAAGATTAAGAAGGATCTGCACTATTGAAGACAAGAATCTAGTATATGCAAGCTACAAATATCCAGCCTTGCACCTAGTTGACACCAGAGAGAAACAAAATACATCATGAAAGTTTCCTTTTCACTCTTCTGGATCTTATTTGTTCGCTGCTCGTATGAGCCCTCGAAAAGGGTACACCGAATCCGAGAAAATAGTGCACTCTCTCGCGGAGTATAGCTCACATCCAAACATCTGATTAGGGAATGGGGCAATGCCCATGAAGCTCTGGCGGAAAGGGAAGGCATGCCAGGCCGTATGCCTATGGGTGCACAATTCTTCGAAAAAGCGCATGCTACCTCGGAGACCTGGGACCTTGGCTTAGTAATGAATGAAGGGAAGCTCTTCGAGCTTTCTCCGCCAGCGGCTTATGTAGTGGTCGGCCTTATAAAGCTCGCTAAGCCTCGCTTCCCTCTCCCCTTCACTTATTAATTAAGTGGAAAATAGTCGTCGGCATTCTATAAGCGACTTGACCGAGTCTACGAAGCTTTGCTTTTCTTGTAGTCGGCCCGTAATGCCTCCCTTCATTTGCTTGCCTCCTTTCAGCTCAGAATGCCTAATGCCTATTATTTAGTGCTAGAAGCTAACCGCCATAAGCTCACCCTTCGGGTCTCGCTTCCAGCGCAGGAGGCCAAGCATTCTGCCAGACGTCCCGGCCTGGGAGCCCCGTTCGCCTTTGGTNGCTTATTAATTAAGTGAAAAATAGTCGTCGGCATTCTATAAGCGACTTGAGCAGAGGAGACAAGAAAAGGCTCAGTGGTAACTGTTATGAATGTGGAGGAGTGGGGAATTTTTGTCGTTACTGTTACAATGTTCAGACGGACGAATGCTGCTAGGGGAGTGCAGAATAGACCAGCTGAACGTAGTCGGAGAAGCTCTAAAGGTAAGTTCAAGTCGGAAACATTGTGTTTCCACAAGAAAAGAAGCACATATTTTTTATAGGATTTGATTTGATAAATGGAAGGAAATTGTCAATTGAATTGGCAGAAAACTATGGTAAAGGGGAGCCCAACGTCCTCAGCAGAGAGGAATGGCAGATGCATTCCCCCCNAATGACATTAGCCTAATTTATTTGAACTATAGACCAAAAAGAGCCCCATTGGAGTGAAATATATATAGTACTCATACTGCTGTTCCCAATTGATTTGGATTGAGGTGTAGCTGATTGGTTGATTTCGTGAGAAAACATGAATAAAAGGAGCTCACTTGTGAATTGTGATTCATCTGCCGTTGTATTTTAAGACACGAAAAGAAGAATGTGAATTTCATATTAACTCGACACCGTATCAACAACTCAGTTATTTTTATTTTTCTGAAAATTATATCAATAACTCAGTTATTTCGAATAGGTATAGCATCCCATTGCTAAACATGAATGTTGTATATCACACAAACAATAGGGGGGAAAGGGAGGTTCAGAGTTCGATTCAAGTGTATGGGGGGTTGGGTGTTTACATTACATCTTGAATCAGTTACAAGAAAAATAATTATCTTGAGGATGACCGCTGATTAAAAAAAAAATTACCTTGAGGATAACCAACAACAGCCACAGCTTCTTGGAGAAATGGAACATCACCAAGCTCCAAAGAGTTCATGCCCTCCCAGAATTCTTCACTTTCTACCACCAGAATAGCCAAGTCACATTCATGACCAACAGCTTGCACTGTTGCTCTATACTTGGTAGGAGAACCATGCTTTCTTACAAGTACAAACGTATGATCAGCCACAACATGAGCATTTGTTAGGATCCTCTTTCCCCGAATAACAAAACCTATAACATTAAGAATGCAAAATCCGAAAGTAAGCTTTATGTTCTCTTAAATTATTAGTAGGTCAGATACTTGATAAGTTGATATGAGCTCTGTATTCTCTTCTTTTAAGAAAAAGAAATTATTACACATAGAGGGTGGGGAAGGGGAAATGGGGGAGGGGATTACAAGTGGGGAATCGAAACCCAAACTTGTAACAAAAGCAGATAAGAGAATTAACCTTCCAAAATCGAGTCCCCATAACAAATCAATCAACAATCCTCAATCCTAAACTAATTGTAGTTACAACATCAATAATTTCATGCTTTAATCCTAAACTAGTTGGCGTTGGAGTTGGGACAACAAATCTCTAATACTTGCAGCCCTTGGGACAACAACAAACTATTTATCTCAATCAGTGGCGGAGTCACCTTATACCAAGGGGTGTCAATTTGACACCTGACACCCCTTCACGGGAAAAAAATATACTACATAGGTAGGTAAAAAAAATTATATATATGTTGACTCCCCTTAATTTTTTCGTCTATTTACTTATATATATTTTGACACCCCAATGAAAAGCATGCCTCCGCCACTGATCTCAATGAACAACAACTTCATAAAAAGTAGCCTTTTGACAAGGCTTCCTTAGTAAATGAAGTGCCAATGTAAGATTTTCACAATAACCAAATGGCTAGTAAAAAAACGGAAGCATTACACTGATAGAGAATAACATATTTAGAAAGTAAATGAAAGAGAATAAAAATACCAGAGCCCGTAGTTTCACGCTGGGACTTGTTCTGCCATGGAAGGAAGTAATTAGGACTACTGGAAACAGTGAATATTTTAACTACAGAATCCAATGCTAGCTCTATTGCTAAATAAGCATCCACCATTCCACTACTTAATCGCTGCTCCACCGCCGCCACCGGTTCTACTTCCGATGCATTGGATTGAAGACTATCATTTTCCTCAGCTCTCTCGACATGAGGTGTCGTAGAAGTTGTTGAGCTATTGCTGTTATTCAAGGTGGAAAAAAATGAGGCGGAAGCAGAAGTATTACCAACAGTGCTGCTATAATTGCAGCGCCGAACAAATCGATGAAGCTCTTGACGTCGGTGATGTACCGGAGCTACATCTCCGGCGATAATAGGGGATTGAAAGTGGAGATTTCTGTTTAAGAGCTTTCGTGCTGTACGGAGACTTGGACCTATTCGTAACAT
SEQ 65
CTAGCTAACTTGCTTTTCACTAGCCTTGTAATACAGCATTTTCTGCACATAGAACATCATGTATCCTTGCGCAGCTCTCACAATGTTTTCGCTCACTTGAGTGATCCAAGCATCATCACATTTGTACCATTGATTACTTAACCTAAGATATGTTACGTAATGACCAGCATCAAGTTTACCGGTATGGGTGATGACAGCAAACAGCTCAAACTCTGAGGACGATTCACAGGACGCATCTTGCTCGTCCCCATCAAAGGAGAAGATTCTATTTCCAAATCGACTCCTCAAGATAGATGAAGAAAGGTAAGGCGACATGTCCAAGGAAAAAGGAAACTGTAGGTAGTGATCAACCTTCCTTGACATTTTCTTAATCACAGAATGCTCAAACCTTTTGATATGGAAGCAAGAAACCAAAGGCAATTTTCTTATGGACATCTGTTTAAGAGATTCCTGTCTCACTTGACAATGTTGGCAGAAGAACTTCTGATCAGAACCCAATTTCTCAGGTCTTGTGAAATGATCTAAACATCCCATCAACGAAGAAATTCGACCGTTTTGGCTAAACTTTCCAGATTCTGCTTCCTTCTTGTGAGTATTATGAGACTTCTTTGATGTCATCTTTGAGGAACTCCCCTGGCTCAGTTCCAAGTCCAAGGAGATGTCTATACATGGATCATATGTAGTAGATGTGAAGCCACAAGCTGTACACATGACATCAGACCGCAAGATTCCAGAAAATACTCTATGAGCAATGCAACAGTCTCCACTACCTGTGCCAAAGAAAAAGATTTGATAGAATGAGATCACAGCTGAAAAAACACAATGTGTACTCCTAATAAGTTACACAGAGTAACATACTTAATATGCACAGCTTAACGTGCTCCTCGACACACTTCATAACAAGTAAAAGTCCAGTTATGCCATAAAGTTCTATCAGCTATTGCATTACTAGACAAAACAAAAATCGTCCTATCAGGAATGGCAACCAACAGAATGAGTAGCTAAAAGCAACGGAGTACTATTGAAATAAAAGTAAAAAACTCAGTAAGAAATGTATCCGCAAGACACATTTATTGCCTCAAAAGCTCTATCTATTCCATTTACACAAAACCATTCAGAATCATAGATAATGTGTATTCATTATTTAAATGCAATACCCAGATGTTAACTAAATTTTGCAAACTCAGGGCAGATAGCCCTGCACCAATCCTCTAATTAAGAGGAATGAGAATAGGTGTGAACTCTCACAATCAAAGTACGTGTCACTTAAGTGAGAACCAAATCAGCATATTTACAATGGTAGAAATGAATTGTTGCTAAGGAGTTACCCCTCTAATGATTGCTCTGCTAAAGAAATGCGAGGTACCGAGGAGCTAGACTCACAAGAATAATAAGGCACATAAGGGCTTATAGAAAGGAGGAAGCAAAAGGCTGTCAACCTAGTAAAGATTTCAAGCTTTCTTGAAGGTTGCGATCAGCTCACTCGAGAGTAATACCCTAAAACATGGTAAAATTGCGAGTTAATACGAGAACTATTGTACCAGAAGAAAACTCCTATGCTGAAATCAATAAGACTAACCAAACTGAGACTTACTGGGGAGAGAGTAGGAGGAAATCTTTTTCCTTTTGAGAAACATCTCTAAGCCTGACAGCGAATATCAACTTGTAAATCGCCTATGGCAAAGAAAAGACCTAAACCATAACCTGCATTCAAATATTACTATTTTTTCTCAGTGACAATGGAAGTTGGGATTGCCATGAGAGATGAGATGCTAGAACAAAGATAAGCCCATCAAGCAGGCCCTGAGTGCCTTTTAGTCAGACGTTACTAGACATCACAAAGATGCTTGAACACACTATTCTGCTTCTGACAGAAATTGCTTCTTCCACCCCCTCCCCACCAACAAAAGAAATTCAAAAAATTCACCTACTGAAGGACTTGACCTTTGCAAGTACCTGTACAAGTTTCAGTAATCCACTTGTTTGAGGTTTTTACAATACTAGCCTCCCTTGGCTATGTTACATTTATGTTACTTTAAAGTTGCTGCCATGTGACCTGGAGGTCACGGGTTCGAGCCGTGGAAACAACCTCTGCAGAAATGCATGGTAAGGCTGCGTCCGATAGACGCCTGTGGTCCAGCCCTTCCCCGGACCCCGCGCATAGCGGGAGCTTAGTGCACCGGGCTTCCTTTTTTTTTTATTAAAATATATATAAACAATGTTGTCAATAATTTTCCCAGTACAACAAAAAAAAGAAATCTCAATGATTGGTCTAATTCGGAAGAAAAGGGAAAAAGGAAGTATAAGAAACTAATATAGGCAAGGTGATGGGCGGAGAAACGATGGGCAACTAATAGGTACTCTAATGCAACAAACAAATTTACCTGGACTCAACGCCTTTCCCTTATCGTTCTGCATCCTTTCATGAATCCCGTCAAGCACGGAAATGAAAAACTCATGAGCATCCTGCTGTTCATAACTTGCAAGATTTGATGCATGCTTCCACCAGCTGTTCCAAAGAAAGGTTTACATCCATCTGAATGAGACACAGCACAGACTGCAACTAGATACTCAAAAAGTCGAAATCCACATCTAATAAAACAAATTACAAATGTATGTATATCACAACCAAGTTACACACCACCAGTGGCGGAGCCAGGATCTCCGCGAAGGGGGTTCAAGAAAAAAAAAATCGTAGCTAGTGGGAATTGAACCTATGACCTTTCAAAGATTTTGAACCCCCTTGACCACTAAGCTACACTTATGGTTGTGTCAAGGGGGTTCAAAACTTAATATATAGAGGTAAAAAACAGATTTTGCCTTATATATACAGTGTAATTTTTCGGCGAAGGGGGTTCGGGCGAACCCCCTTTCGCCCCCCTAAATCCGCCCCAGCACACCACTGTCTAATTTCACCTCTATGAAGGGAAAAGCGTGGTACATACAATCCCAATAATAAAAAACTAATCTTGTCCCTACATCATTTTCAAAGAAGTGCACAACCCAAGCTAAATTAAGGAATCTTTTACCAATGTATTGTTATCCTTATAAAAAAGAATTATATACAACTATACCTCAATCCCAAGCAAATCGGGATCAGCTATATGAACTTCACAACACACACACACACACACACAAAAAAAAAAAACGTGCACATTATAACAAAGCCAAACATTATCTCAACAAACCAAGAAAACATGATCAAATGCAGACCTGTAAAGGAACTTTGCAGGACTAATAGGGGTCCGATCGCCAGAGAAAACAGCAGAAAACATTGCATCCAAATCACAAGCCAAACACAGCATTGTTGAGTTCTTATTCCCATTATCACTACTACTCCTTGTTATAACACTGCTATTCTTTCGCTGGCAAAAATATCTGTTATGCTTGTCACTCAGAAAGTAATTCCTCAATGGTGGTGTATGAAGCAATGCTTGAAGCACTGAATTCATAAAACACGTGTTTCCAAGATTGTTAAGGCCCCTCAAACCCCATTGTACTTCTGGGGTTGTTGAGTCATTGCCGAGCTGACTCGGCAACGGACTCGAGTTCCCAACGATCAAGACTTGCTCTTTCACATCAGGCGTCCACGGTTTGTACTCCACGCGCCTCCTCTTGCGCGTGCTCTCCGGATGCGGCGGAGGGTCTTGTATTGATCCGATCACGGTGGCCTCCGTCTGCGCTAGTGCCACGGCGGCGTCGAAGTCGCTATTGTACACCTGGTCCCTACACCCGCAGCAGAACAGCTCGGCCCGGTCGATGTCCACCGCGATGCAGTGCAGCGAGGGGTCCGCAGCGTTTCCCGCCGGATGTGACGGCGCGTGCACGCGGCAGAATACCTCGGCGCACGTGACGCAGGCGTACAACCGCGGCGGCGCGTGTCCGCACGCACCACATCTCACCAGCTCATTCGGCGGCTCGCGGCAGAT
SEQ 66
TCATAACTTACTGTGCACGAGCTTATTTGAAAGACGTTCAACCTTGCCAAGTTGAGTTGCGCTCTTCAGCATTGCCTCGCTGTCAGCTATAAACTTTGCAAACTTAGAACCCTTTGTATCACCATCACTATCTCCTCGAGGAAGCATTGGCAGATGAAGCATAGAGGGACTTTGCTTGGAAGATAAAGGCGGAGTTAATGCCCACACAGAAGAAAGCTGTTCATCTGGTTTGTCAAGATATTCCAATGATAGATTCTGCATGTCTGTNTAACGGTGATAGATCCTGCATGTCTGTCAGGAAGAACAGAACAGAAATATCAGACGTTGGTATCGTGCCTACCAAGTGTTATTCGGAACTGATTTCTTGAGAGTTGCAAAACTTATTAAGAGCTTGCAGATCAAATGTTACTGCTTTTATTCTTCATACGGGAAAAGACCACCCATTATCTGAAAATGGAAGTATCAGGAACAGTCGAATAAAGTACCTTCAACAAACTTGAAGATGGGCTCTAAAGCTGCACATGGAATGCTGAAGTTCAAATGTGGAATGACCGTCCCTCCACCATGTCTAGCATTACTGCATCAAAAATAAAACTACAAGACATGAAGTTTCAACAAGAAAGATATTAATAGATGGAAAAACTATAGAAAGACCATGATATGTGAAGATTAACATAAATGAGATGACTAAAAGATCTTCTGTATGAGATCATTCAAATTGGACACCATTATTTTTCCTGTATGAAAAGCGTATAACCAATATACATTTTTGGTAAGGACAATTAATATACATTCAAACAGGAATATCTTTCTTCAAGGACTTCTCAAAGTACTCCAGGACGCAGTGTACCACAATATCATTAGATTGAACTTCAAGAACAAACACATGTAAATATACATAAGCTGAAAAAGAAATATCCTCAATTATAAGCATCCCCAGTTGTCATCCAAAAGTTAATTACCTTGTTACAAGGGCAATCATGTGTCCTTCTGAATTAACAACAGCTCCACCACTACCACCAGGGTGTACAGCAGCCGTTGTTTCAAGCATTGCCGGAAAATGTTCTCCTAAACTTGATTGGTTGAGCAGAGACCGCTTGGCTTCAACTACCTTAGCTATTGCACCCACACAAGCAGATGGAAGGAAGTCTATATTATAAGAAAAAGTAAGACAAATTACAAATACAACTAAAGAAGTCTAATATAGGTATGAAACATATGAAATAAATATACGTAGTATATCATGTTCAAAATGAAAGAACTTAACAAAATTATTCACATGAAAAGCAATTTAACCTCCAGAACAACATGATTTAGTACTATTGGGCGCACAAAGATAGTCAGTTCCAGAAAATTATGTTCAGCAAAGGTTATGGAACAGACAAGTTATCTGTATCAACGAAAAAAGATGGAACAGACAAGTTAAGATTGCATCAATAAACAATAGTAGCACTTGCAACAACCTAGCTACTATTAAAATATCCTTGAGATACAGCCCGACTCGAATAAGTGAGTTACCAGGAATTTCCTATTCAAAAATCCCATTCTTTAAAGCTGATCATTTGTACTTGCTTTCACAATAGAAAACATCAATTTAATGCTCCAGAAATTTACCTTTTTTCGTGATAACTGATAAGTGACTTCAAAACTCTAGATTTGATTCCCCAATTCCACTTTGTTAGCATAGGTATTAGGTATATATCATTCTTATGGATGAAGATCTGAATTAGTGCCTATGGCTTTTATTAGCCCACGAAAGGAAAACGCTTCTTTTTAATTTCGTCTACCTTTCTCCTTGTTCTGCTAGCCTTGTTTGAGCCCTACAACAACCTCGCTATTCTTAATCTGACGTGCAATTTTTTTTAACCAGAAGATCAAAACACTGACTTGGACTACAAATCAAATTCAGTATCAGTAAACAATGTCTTCACCTAAAAGATTACCCAGTTTTGAGCCCTCCCGACCAATCTGGTTATTATTTCTCCATTGGAAGACACCTCAAGTTCCCTTGCGGAGGCGTCGACATCACCTCATAACTACTCAATCAGTCATCAAATGATCATCATTTGGTGAAGGAAGAAACATCAAGTATTCCAGCAGTAACAAGGAGATGAAAATGATATAATACGACCCAATCCTGCAATTGATTATAATGACACTTCAACAAATTCTTAACACAAGAGAAGCAAGGTGGAAGAGAGAGAAATTCAAGATAAACAAAGTTTTTGTAGAATATTCTAAAATTTCAGATTTACTGTGATGCGTGTGTCAAAATAAAAGTAAAGGCAAAAATATTTTATTTAGACAACAAATCTAAAGCAAGATTTACCACATCGTGGTCCAAATAGCCCATGTCCGAGAATGTATGCTTTTGATCCAGGGGACGGGCACATGAAGTCAGCAGTAATGGGACAGAGCTGATCAGGAACTAGCTCAAGTTGTAGTAATGCAACATCCAGAGGTCCTCTGGAGACATGAACTACCTTTGCATTTGTCCATACCCAGGGATCCATAAAATCCAAGCGAACACGAATGATCCTACTGCCTGTCTTTGCCAGGTTAACTCTAAAGCTACCTTGCTCATTGTCAACCAAGAAATGCGGAGTTTTTAGTTCCTTTTGAATCAAATGTTTATTCCTACGCTGAATGTCAAATTTCTCAACCCCTGGATGCTCAGATTGATCAGAAGGGATGAGAACTACATCAGATTTGGTGTTATATCCTGAACCGTTTACAGATGTTTTTCCAAATCTCCATGGCTCTAGAAGATGAGCATTTGTAAGAAGAAGACCCTGCTTGTTGAGCAAAACTCCAGAAGCCCATGCTCCATCATCAACAGTGATAAGGCAGATAGATGTCATTGCCTTCTCAATCAAGGATGGGGGAACAGGATCTATTAGGAGATGCTCTCGAGTATCATTGGAAGGTCCATTCCGAATATTATTGGAGGGTGATTCATTTTTAACACTGATTAGGTTTCCATTATCAAAATGGATCTTTCTCCTAGTTTGTAGCTCTTCTTTAAGCAGGCTACCACAAGCAGATGTAATAGCTTCCCATGGAATCACCATCTGCCACCCAATGCTTCAATTTCAAATTAAAGCAATTTATGTGAAGATAGATCTCAGCAAGAACATAGAGCACATTATGAAATGTTTACCTGAATTTCAGCAGCAGTAGCCCTTTGTCTGAGTGGCCGAGACAGAACACCAATAAGCTCTGCATGTTCCCCTAACACTGGGCTACCTTCCAT
SEQ 67
CTGGATATAAGTTTGAACATGATCCTTCCAGAGAGAACTTGTCTTCAAAACATCCCTTAGGGTTGATAGAACTTCAAAAGATGTCGCCTTTATAACATCGTCATCCTTGTTGTACGGCTGTTCCTGCAACATAATTAAGAAAGCAAATTTTAGAAACAATTTCAAAACTGAACATCAAGCTTTATGAGTTTATCATTGCTGGGTAGAAAGAATGCACAAAAAACTGAGGCATGCCTTGAGATGATCAACTTTCACTGTAAGGGGTTCTTCACTGACCTGCAAATGAATGCCACTGTTTCAGTCCTCAAAGTGTTACTCTTGTACATATCACAAAATGGGCTATCGAGACGGGAAAAAACAACATGGTGAATAAAGTAAGGAAATCTGACAGTGAACATTCAAGTGTGACAAATCTACTGCATAGTCATCAAATTGTTACACTAGTATATTGAAGAAAATTTTTATTGAAACAAAAGAAATAACTACATAAGGGAATCTGACTGCAAACATCCAACAATAAATAAACCCAGTGTAATCCCACATGTGGGGGTCCAAGGAGGGTAGTGTGTGCGCAAACCTTACCCTACCTTGTGAAGGTAGAGAGGTAAACATCCAACAGTATGTATAATCTAATAGGAGAATTATTAATAAGGAGCTCAGGGGTTGAGAGATTTAAGTCCAATAGAAGAGGATAAGAGGTCCTCAATCACAAAATAAAACGAAAAAATAAGAAGACATAAGACCCTCCAATGGCTAAGTAAGCCCATAAATGGCTGACCATGCTTGAGTACTCCAGCTTTACAGCTAAAGCCAAGAATACGACTCTATACCAAATTAGAGCTCAAACAAATAGGATTCTATTCAATCTTTTATGTGTTTTATATATGTTTAAGCCCCAGGGCTTTGGTCTAGTGGTAAGAGTACAGCCCGTGATATGTAGGTTGGGTGCACATCACAGTTCGTGCTCTGGCGCAAACAAAAGCCTAGTATTTAGGTAGACAATGGTAGAATGGCGAGCCCATTATCCACCGAGTTTGAAACCATGAGCCACTTACCCTCAGATTTCTCGGTTATCAAAAGTCTAGGATTTCATTCAAACCAAATGCATCGGCAATCAAACAGATGCTAAACTAGCACATCATAGAAGAAACTCAATAGCTTTTCTCTTCTACAACCCCGAAGTGCAATAAGAAAGACATTCCTATGACTTTGCAGGAAACCAGTTGTCCAAAATTATTACAAAGTGCTACTTTCTGTTGCAAACAAGCAATCAAGAATAACTAAAAGAGTGTTTCATTCTATTGGTAACACATCATACACAACTGGGAAAAACAGCTCATTTAGCCTCAAATTGCAACAGACCTCTATCGCTTGACAACGCAGCTGCTCAGTTATATGAAGCCAAAAGGAGAAAGACTGGGTGTTGGGACACAAAAGGTAGGAAAGAGAGAAAAGTATACAAGTGCAATGAACCCAAGATGTCAAATCGGCAGTCTCAAAATCAGGAGATGATGGATATGTAGATGAAAAATACAACTAAATAGGACACAAAGAAAACATATGGAGCTACTTATAACCAAAAAATAACGCAGACATTTACTTACCACCTCTGTCATACGTATCCGCCTGTGACCAATAAGAATAACCTGGTCGTCTTTAATACTTGTTATCTGGCTCATCATAAGTTGCATCAACCTTGATCAGAAACTTCTTCAATAACTTAAAGCAACAGAATAACACAAGGAACAACAAGATGTACCTGAGCAAGTGTACCAACTTCATGAAGACGGTTCAACATGTCTTTTCCTTTAAGCTCATAGATGTTTTTTTCTGTATCTGAGGCAGACACAACATTAGGATCAGTCCCTTGCTCGTCTTTCATAAGGAAAGCGCCAGCATAAGGTGCTTGCCTTTTTCGACTTTCCAGCAAGGCTGCTAATACCTTGGGATCCTGCAGTTAAATTTACGGATTATGTGGAGCCTCAGATCAATTTGTTGCATGCCATTACAGTATAATTTATCTTTTTCTGGAAACTAACAAATGAGGCTAAAGAATATTACAATTGAGCAAAGACGCTCTCCGAATTGAAGGTTTTAGCTTTTCAAGCTTACAACTTCAGCCAAGAACTTTCCATCAATTAAAGTAGAGCATTCTTAAAGCTGAATGCTGGTTACAGACTTAAGAGGAAATAGGGTCCAAAACACTAGCTGAAACAGTACAACTCTCAGAAAGTTGACAGTAATTATACACTTAATGTTTAAAGCTTATATCTATTTTATTTTGAAAATTACGAAGACTAAGACATGGTTTAACCCATCCTAGACAAGGTACCATACCTTCACATAAATATGCATATAAAACCCTGGAAATAACGGTCTGTGTGGAAGTGGCAGTGCTAGAACCTGCACCAAGAAAACAACATGCATCATTATAAATTATAAAGAATTAAAAAAGTACAGCCCACACAGACTTTATTTATTTTTAGTATAATAAATGACATAAAGCCCTTAATTGCAAGTCTTACAGGTAAATCTTATCCAAAGATAAAACCATAACCAACCAAGAAGGCAAGAAATGGGTCGCATCCATTAAATTTACAAACAGATGAATTATGAGCAAACAAGAAGCCTCCAAGAAACAGTGTTTTCTTTGCATTCGTTCAGTATCTCTCTTTTTTAGGGAAGGTTGATAAATACCATTTTTACACTGATCTTGCAACAACTAGCAAAAACTGAAATACACATATGAATCTTCCTTTCAAAGAAAAACGAGATAGTAGGGCAACCTATTACTTTATCCTCACATGTACTTTCCAATGCTTTACTTCTTAAATAACCTAGAAGCTGTTGATTAAATTGAAACTAAAGCATGAAGTAACCCTCTCTAGGCCTCCAAATATAGGATGAAATGTTAAAAATTAACATCCATCAAGGTGCCTTCTGTACGGCCAGAAGTCAGAGGCATGAGGCATAACTGGTCGAATGTTGACTCTGAAAAGAGCATATGGGAAGGAGAGTATTCTAATTACAATGGACAAGCAAACAAAAAAAAATTCTTTTGCTGAACAATGTGAATCACCAGCTTCTTGCAGGTGAGAAATCATTCAAAATCATTAGGAACTTTCTGAAGCTTCAGTCTTGACATAGAGCTGATCAATTTGCCTCTACAAGGTGGGAGTTACACATGGTCCAGAGAGAGACAAGGCAGGAAGTACTCAAGGATCGACGGGTTTCTAATATCAGGAGATTTCAAAGAAACAATCTCAACCAATACGGATGGTTCAACTGACAATGTGCTCAAAACAAATCAATCTTCTTGATTATGGGATTCTCTGTAAAAAAAATAAGCCATTTAAATCTGAAAATTGTGAGACAATAGAAGAAGCAGATGAAAAGATAAGTTGATGAATACTTATTACATAATACATGACACTTAATTTTCAGGTGTGAAAAGAACAATAATCTTTGATGTTGGACTAACTTAACATTAACATTCTACTAACTAACATTGCCATTGTATTATGTGTCCTCGAACTCAAGGTAAGAAGAAGTGAAGAACACCTTGAATTTGCTTTTCTGGAAAACACGGAATAGTGAACAATAGTCGTAATAGACTAATAGTAAAGCATCACCTAGAAAGTGAACAGTTCAAATCGTGATCAGTCACCAGAACACCATAATGGATTTTTTCGCTGAAAGTCACGAAAATAGCCCAGCTAAAGAAATCATTAATATTCAGCATAGACTAAGGGAACCGACTCTCAATGACCAGAGAGAACGCGATACCAGGAATCTCAAAACCAACCTTGGTGTGGCGTAAGGTGGCAGCTAATGTGGCTCGAATTGGCGGCTAAACAGTGACTAGAATAGCAATGAAATAGCGTCCAAAATAGATATCTGGAACAATAATCATAATACTGGCCTGAAAGATGGTAGAAAACTCATTGGAAGAATGGCCAAAATAGTTCTTTTACACTGTAAATGGGACATGGTTGGGATGGTGGCAGAAACTCAATGGAAGAATGGTCAAAAAGGAGGAATTTTACAGCTAAAAAGATGACTGAGAAAGAGGCATGATACAGTCAAATGGTCTACAGAAATAGGTATGAGGTTAACGATCACCAAAAGTGGAAAGAATAGTAAAAAAGTTGACCAATCGGGTAGCATAGGCTGCCCCTTATTAAGATCACCAATGGTATGTGATACGATCTGAGAAAATAAGAAAGAGGATGAATTGGTAACTTGATGAAAATTGGAATACAAGACACATCTATTTGTACCTAACAATCAAATAGAAAAGACACATCTATGTGTAGGTGAGAAATACATTAACTTTATGATATCTTTCCAATGTGAGACTAATTTAACACTACTAAGATAACACACATTAACATTTGAACAAGAACAAGTGGGTGCTACAAGATGGTCAAGGGATGACAAGAATAATATCACATTGCCGAAAATCCAAGCTGTGTGATTGGAAAAAAACTTGAAATTACTTACAGATGATCTTAGAAGATGAAAATAGAGAGGTATATGCTTGAGCGGAGATGGAGATGAAAGAATTGTTGGAAAAATTGAAACCTTTAGAAATCATTAGGTGAAGCAAAAGAGATGATTGCAAATGAATTACCCCTAATGGAAGACTTGATTAAGGAGTTGGCAGACTGTGCCTTAGTGGAGGAGGTATCAAGGAAGCAAAAATCAATAACATTGTGGTTGAAGGAGGGGTACAGAAATACCACATTCTTTCTTCCTCCTCTTCTTTTTTTTTTTTGGAAGAAAGTGGAATACCACATTCTTTCATTGAAAGGGTAATGCTAATTCAAGCTACAATAGCATTAAAAAAGTTGCTCACTGATGAGAGGCTCTCTGACGATCCTAAGAAGATTCAGGACAAGAAATCTTTAAAAAGACGACTAGATTTTAAAAATTTGTCAATTGTCATTTCATTAAAATACTAATGCTAATTGAAGCTACAATAACATTAAAAGTGTCGGTCATCGATGTGAGGTTGTCTGAGGATCTTAAGAAGATTGATATCAAGATTCTTTACCACTACAGCCTGATTTAAAAAAGTCTACACTACTATTCTTTCGTCTCAAACGGAATGGCTGGGTTCAAAATATGAGAATCAAAGCATCTGATAATAAAACCGTTCTTTTTGAGACAAAGAGGGTATCAAAAGCTGTTCTTTGGAAAATAAAATGTTGAAGACCTTCACAGCATTATATCACGTCAAGGTTGGGAAGGCCTTTTACTAAAGAATAATTGAGTGTACAAGTCTACGAGAAATATAATCTTGGCGATCCACCTTTTATATCCTAGAAAGTTAACTTCTTAGCCAACCATCCCTTTTGATTGCCGGCAGAGTGTTGCAACCAAAGTTAAAAATATTTTCTATATCAAATTGATCCCAATTCTGCGTCAATGCCAACCACAAAGATTAGAAATTTCATCATCAGCGTCCATGTTTGCATCCATATGTACAAATTTGTATAATAATGAATAAAATCATATCACAACTACCCAAAGAAGACATAAAACCAGCACTAACAACAGAAGGAACTCACTTAATATCTTCTCTTTTGGTGCACAATCTGACATGGCCACGCCGTTATAACAAAAAGTTGACATCTTTTCTTAAGGAGTCATTTTACCAGCCATCAAATATTGGAATAAAATGTACTTATATAATGATGGAGCAGGAAAATACTTAATTATGCGCCAAATTATATTAGTTAATAACAATGAAGTGATAAATTGAACTGAGAATGTACATTAGTTAACAAAAGTCATAAAATTTATTTTAAAGAGACGTGTAATTCTATTGGTGCGGGTGCAAGAAATTATTATGAACTAACCGTAAGGCAATCTTCAGGCTTAAAAACAGTAGGAACAATAGCAGCGGAAGCCTTAGAATCAGCATCTCCGCCTTTCTCGGCCGGCTTGGCTTCGGATGCAGCAGCTTCGGAATTCGATTCGGACCCATCAGTAGAATCCGAACAAAAGAATCGTCGAGACAAATAAGGGCCCCTGCGATTCGAACTTCTTAGCGAACCTAAAACCCGAAGCAAGGGCGTATTCGAGTCAGTGCCACGGCGAACTTGAGGGGTAAATGCCGTTGTGACGGCGTGGAAACGATTCTGCAGACATGAGGATGTGAGAGCCTTCAACAT
SEQ 68
TCACGCAGCTTCAGCAAAACCCAGTTGAAGATTACCATAGTCAAATACAGTATGATACACCCCCATGAATACATTACCAAGAATCCTGCATGTACATGAATCTTTTCGTATGAAACAGATCGAGAAGCTTTTGAATAAATTGAGGTATTCTTGTCTATGCATGCAATTTCTATGACCTTATTAAAGCGAAATGGTTGTAAGAATAATAAAAACGTACCAAAGAGGACCACGAGGTGGTGGCACATCCAAAGCAACAAACCCACTGAGGCAAATGGTAGCAATCCCCTCTCCAGTTTTCAAAATATACTGGAAACAGAAACAAAGCCACAGTTGAAACAAGGCTGAGTATGCAACATAACAATGTAACATATATAAAATTAGGAAGAGAGGCATAACCTGATCTGGAGTCAGGACAAAATCCTTATCACCAATATTGAATGTAACATTTGGCATGGATGATATGCTATTGCAGTCGATTACGGATTGCCCCATAGGACTTGGTAATTTCTCACAAAGCTGTGAAATACCAAACATTAGGCATTAAGACAATGCTCTGCACTACTTAAAGAGAAAATTCACTCACCTGATTCACATATTCTAGCACACTCTCCTTTGTTGTCTTCTGTTTCAGCTGGTTCTGCATCCAAATTACAGCCATCTCACAGGCAGTGCACAATGGGGCCTCTCCTATGGAACTTCCTTCATTTTCCTTCTCAACCACACTTCTGATATTCGAGCTGCAAAATTGGTTGTAGATTGTCACATGGTTACACAATTCAACATGTATTGTAACAGACATAGATCCTTGGTTCAAATCTCATTACCAACCTTCATCATAATGAATTTCCAAGTACTAGGCCCATGAAAAGATCATACTTACACGTAAGAGAGCATGTCCAAGATATAATAATAATTGAATAAGATTGTGACATCTTTGACAGCTTAAGTTTTTAAACGAGATGGTTACACAATTCAACGTAAACAACATAGCTCTAGAGCCTTGACATATGTTCCAAAGACAGATTGTCCAGTTCAATTTAGTAACTTCACCAAAACCACCTTAATTCACATATGCTCCAAAGAACAGGAGAAAAGCATCCATACCTCAAATGCTGAGCTCCATTAAGATAACATAAACCTACTTGTAAACAAATTTGATCTGGTGTGACCTGCAGACAATTACAAGTAGCATTATATCTACAATGTCACATGCAGTAGTGGCCGGCTCAAATAAATAAAAGAAGAGAAATAGAACAAACCCCTGATACTAGTAAATCCCAAATCATTTCCCCATATTGAGAAATGGTTTCTTTACATTCCATGCTCAATACTCCTTCTGCTCCAATGGCATGGTTGACTTGTGTCACAACAGCCTGTTATGTACTTGCTATAATTACTTTCAAGAATTTGTTCAAGATAACTAGATAGTCGAAAGGTCTACAACTTCTGCAACAACCCAACTAAAGTAAATCATACAGTTGGACCAGCAAGCAATGATGTTCCAGAATCAACTATAGCAGCACAACCGCCTTCACAGAAGCCTGTAAGGGAAAAGTATTAAGACAAAAAGAAATTATTCTTATCTTAGGCTCGTAATCAACAGCAACGCTCACCTGTTGATTGGTTCCCAATAGAGAAATCTCCCATTTTAAACTGAAACAGAAAAAGGAAAAGAACAGATGGGGGGACCGTGAAAAAAGGCAAAAAGGAATGGAAACCATAGTGGAAAAAACAATTCAAGTACCTGCCAGTAACCTTTCTGAGTCAAAGGAACATAAGTATGTTTATCCTTGAAGTGTTTTGGATCAACACCACCAAAAACAAGTTCACCTCCCTCTTTTGCATTTATATCGCGATTAAGCCAGAAAGAGAACACAGGCTCCTTTACGAGATCTTGCTTCACCATATTGTACCTGCACATATATCATGAATCAAACAGACAAGATTCCAAAAACTGAGAAGAAAGGAAAACAAGATAGAATTGGTTAAAAACTGAACTGAACAAATCAATTGCAGGCCTTATTACCAGACAGGTGTAGTGTTTCCAACAGCAATTTCCTTGAAACCAAGCCCAAGTATTCCATCAAACTTTGCAACTATAAATGTAACACTTGATTCCCGTGTCGCCTCAATAAAGACCTGAAGAAATTGATGTAAAAAATTCTCATCCATTGTGTTTTCAGAAGAGCAGAAAGGACCATAATATGAGGCAGTGATGACTTATTGCCAAGCAAGATTTCACCTGATCCGTGACTACAAGATCGCCAACTTGAACATTATCTTGACTGAGAAATCCTGAAATTGATCCAGATCCATAGTGGATTGAACAAGATTCTCCTGGATTAGAATAAACATCAAATATAATCAGAAGCCATCAATAAATAACTTCTTAGTCTTTCAATTAATGTGAAAGAAATATAACATTAAACTATGATATGAACATCACCTTTTTTTGTGTATGTACTAGACTTCCTTGCCTTGTATTTGGAATGGATCCAGCATGCAATCTGGAAATAATTCAAGTTTAAGGAAAAATTCTGTATAAACCGGTAATTCAACAAAGGAACAAATAACTAAAAGAATTCAAGTTATATACACTAAAAGTGTAAGGATTTTTGTTACTATCAGTATAGTTTAACTTCTGATAGAAATAATTAAGTACCAATTATTGTTACCAATTAATGTTTTCTTATAGAGATTTACATGTAATTACCTTATAAGTGACCTGATTATATAACTAACCTTTGCACCGTTAGCACATATAGAACGTAAACATTAACTAAAAAGAAGCACAACTTACAGAGAAATAACATCTTGATGATGGAACCCAGAGATTAGAACTTCCTGTATCAAATATGACAGTGAAATTTTGAGGGGGTGAACCAATACTAATATCTCCATAATATTGAGCATCCAAGTAGTTCTTTAAGGACACTATATCTGAATTTGTGTCAGATTTCTTCTTCTTCTTCTTCTTCTCTATGTCCTTCATCACATGCTTTCCATATCTGTCTTCAAGTCTTGCTACATTGGCTACATTTAAGCTACTGATATCTAATTGTCGCTTCTTCAGACTAATTCTTAGCAAACTATCAGAGGAAGCAGGAAATACAAAGCAGGCAATGGCCAATAAAAGGAGAGCAGCCCAAAGATGCTTCCTTTCCAT
SEQ 69
TCAAGATCTTATTACAACATACTTCTATTACAATATCTTTTTCTTTTTGTAATGGCTTTGATCCTTGGATGGAAAATACTATTTATCCTTCTTTTTGTGATAATTGGGATGTGTACATCTCAAGTCACTTCTCGTAATATTCAAGCTTTATCCATGTTAGAAAAGCACGAGTTATGGATGTCAAGTCATGGACGTACTTACAAAAATGAAGCAGAGAAGGAAAAGAGATTGAATATATTTAAAGAGAATGTGAAATTTATTGAGTCTTTCAACAATAATGGGACTAAAAAGCCATACAAATTAGGCATCAATGCATTTGCTGATCTTACTGCAGAGGAATTCTTGAGTTATTATACTACTGGACTTAAGTTGTCTAATTCCTACTCTCAAATTCAATCATCATTTAAGTATGAAAACTTGAGTGATGTTCCATCTGTTATGGACTGGAGAAAGAGTGGTGCTGTCACTAGAATCAAACATCAAGGTCAATGTGGTAAGGCACAGTTTCCTATTCAAGAAAAGTTTCATATTCTCTTCTTATTAAGTGCTGACGTAACTAGTAAAGTTGATGATATGTGACCAGCAGGTCACGGGTTCAAGTCATAGAAACATTCTCTTGCAGAAATGTAGGGTAAGGCTGCGTACAATAGACCCTTGTGGTCCGGCCCTTCCCCAGACTCCGTACATAGCGAGAGTTTAGTGCACTGAGCTGCCCTTTTTATTAAGTATTGAGAAAGGATTTAAGTAAAATACTACATACTCCTTTCAAATTTGTGATCTTAAACATGTTTTATCATTGTATTATAACGGAGTATCACTAAGGTTAAAATGAGAATATTAGAAGCAAGCATACTAAATATAAAAATACATTCTTTCTGTAATAGACTAAAATGGAAAATAAGATATGCATAGAGTACTCTCTTCTTGTCCAATAATGTTGACAAGGCACTTAAATTATGAGTGTGTGAAGTCTCACATTGGTAACTGAAAAAATTAGGAGTCTACATATAAGCCTACATATAAGGTTTAGAGTTTTTTTATGGTGTGAGGTCTTTTGAAAAAAATCGTGCGGACTTAATCCAAAGTGGATAATATCACACTATTCTAAGAGTATCTTTGAGCTGTTTTAGCTCAACAACTCGTATCAGATCCCAGGTTCTGCGGACGAGCATAGCGATGGCGACCTGTGGATCGTGGTAATAGCCACATGAAACTGGTTCGACGGGGAGACCCGTGGATCATGATCATGGTAGTGAGCCACATAAAACTTAGTTCGAGAGAAGGATTATTGGGTATGCAAACAAAGTCTCACATTAATAGCTAAAAAGTTTGGGAGCCTGCATATAAGGCGTAGAGAACTTTTAATATTGTGAGTCCTTTTGGGGAAACCGTACAGTTTGGCCAAAGCGGACTATATCATACTAAGTTAAGAGTATCTTTGAGCCATTTTAGCCCAACAAATCATATATGATAATTTAAATTTGTTTTACACTACCAATAATGTATTTGACCTACTTTGCAGTATAGTTACTATTTTTGTATGTTTATCATAAAAGTTAACCTTTAAAACAATACAAGTGATATGATTTGTATAAATATGTGCATAGAACTTCCAACTCATTAATAAATTGCATGAAATATAGGATGTTGCTGGGCATTTTCAGCAGTTGCAGCCTTAGAAGGAGCAAACAAACTCTCAACGAACAACTTGATTTCACTCTCCGAACAACAACTGTTAGATTGCACCACCGAAAATAACGGTTGCAACGGCGGTTTAATGACCACAGCCTACGATTTCATCATTCAAAATGGCGGCATTGCCACAGAATCCAACTACCCTTACGAGGAATATCAAGATTCATGCAAAAGCCAAGAGATGAACTCTGCAGTGAAAATCAATCGTTACGAAACTCTGCCCTCGACTGAATCAGCATTGTTAAAAGCCGTAGCTAAACAACCGGTCTCTATCGGTATTGCAGTGAATGAAGATTTTCATCTGTACCAAAATGGTGTTTACAATGGAAATTGCGAGGGTCAAGAACTAAATCATGCAGTTACTGTAATTGGTTATGGGACAGAAAATGATGGTACAAAATATTGGTTGATCAAGAATTCTTGGGGGACAAGTTGGGGTGAAAATGGTTACATGAAAATTGCTAGAGATACTGGAATTGAAGGAGGTCTTTGTGGGATCACCACTTTAGCTTCCTATCCTGTTCTT
SEQ 70
TCATAACTTACTGTGCACGAGTTTATTTGAAAGACGTTCAACCTTGCCAAGTTGAGTTGCACTCTTCAGCATCGCCTCGCTGTCAGCTATAAACTTTGCAAACTTAGAACCCTTTGTATCACCATCACTATCTCCTCGAGGAAGCATTGGCAGATGAAGCATAGAGGGACTTTGCTTGGAAGATAAAGGCGGAGTTAATGCCCACACAGAAGAAAGCTGTTCATCTGGTTTGTCAAGATATTCCAATGATAGATTCTGCATGTCTGTCAGGAAAAACAAAACAGAAACATCAGACGTTGGTATCGTGCCTACCAAGTGTTATTCGGAACAGATTTCGTGAGAGTTGCGAAACTTATTAAGAGCTTACGGATCAAAGATTACTGCTTTTATTCTTCATACGGGAAAAAACCACCCATTATCTGAAAATGGAAGTATCAGGAATAGTCGAATAAAGTACCTTCAGCAAACTTGAAGATGGGCTCTAAAGCTGCACATGGAATGCTGAAGTTCAAATGTGGAATGACCGTCCCTCCACCATGTCTAGCATTACTGCATCAAAAATAAAGCTACAAGACATGAAGTTTCAACAAGAAAGACATTAATCGATGGAAAAACTATAGAAAGACCATGATATGTGAAGATTAGCATAAATGAGATGACTGAAAGATCTTCCATATGAGATCATTCAAATTGGACACCATTATTTTTTTCCTGTATGCAAAGCGTATAATTAATATACATTTTTGGTAAGGACAATTAATATACATTCAAACAGGAATATCTTTCTTCAAGGACTTCTCAAAGTACTCCAGGACGCAGTGTACCACAATATCATTAGATTGAACTTCAAGAACAAACACACGTAAACATACATAAGCTGAAAAAGAAATATCCTCAATTATAAGCATCCCCAGTTGTCATCCAAAAGTTAATTACCTTGTTACAAGGGCAATCATGTGTCCTTCTGAATTAACAACAGCTCCACCACTACCACCAGGGTGTACAGCAGCCGTTGTTTCAAGCATTGCCGGAAAATGTCCTCCTAAACTTGATTGGTTGAGCAGAGGCCGCTTTGCTTCAACTACCTTAGCTATTGCACCCACACAAGCAGATGGAAGGAAGTCTATATTATAAGAAAAAGTAAGACAAATTACAAATATAACTAAAGATGTCTAAATAGGTATGAAACATATGAAATAAATATACGGTATTATATCATGTTCAAAATGAAAGAACTTAACAAAATTATTTACATGAAAAGCTATTTAACCTCCAGAGCAACATGATTTAGTACTATTGGGCGCACAAAGATAGTCAGTTCCAGAAAATTATGTTCAGCAAAGGTTATGGAACAGACAAGTTAACTTTATCAACGAAAAAAGATGGAACAGACAAGTTAAGATTGCATCAATAAACAATAGTAGCACTTCCAACAACCAAGCTACTATTAAAATATCCTTGAGATACAGCCGACTCGATTAAGTGAGTTACCAGGAATTTCCTATTTTAAAACCCCATTCTTTAAAGCTGATCATTTGTACTTGCTTTCACCATAGAAAATATCAATTTAATGCTCCAGAAATTTACCTCTTTTCGTGATAAGTGACTTCAAAACTCTAGATTTGATTCCCCAATTCCGCTTTGTTAGCATAGGTATTAGGTATATGATCATTCTTATGGATGAAGATCTGAATTAGTGCCTATGGCTTTTATTAGCCCACGAAAAGAAAACGCTTTTTTGTTTTTTAATTTGGTCTACCTTTCTCCTTGTTCTACTAGCCTTGTTTGAGCCCAACAACAACCTCGCTATTCTTAATCTGACAAGTGCAATTTTTTTTAACCGGAAGATCAAAACGTTAACCTGGACTACAAATCAAATTCAGTATCAATAAACAATGTCTTCACCTAAAAGATTACCCAGTTTTGAGCCCTCCCGACCAATCTGGTTACTATTTCTCCACTGGAAGACACCTCAAGTTCCCTTGCGGAGGCATCGACATCACCTCATAACTACTCAATCAGTCATCAAATGGTCATCATTTGGTGAAGGAAGAAACATCAAGTATTCCAGCAGTAACAAGGACATGAAAATGATATAATACGACCCAATCCTGCAATTGATTATAATGACACTTCAACAAATTCTTAACACGAGAGAAGCAAGGTGGAAGAGAGAGAAATTCAAGATAAACAAAGATTTTGTAGAATATTCTAAAATTTCAGATTTACTGTGATGCGTGTGTCCAAATAAAAGTAAAGGCACAAATTTTTTATTTAGACAAGAACATATCTAAAGCAAGATTTACCACATCGTGGTCCAAATAGCCCATGTCCGAGAATGTATGCTTTTGATCCGGGGGATGGGCACATGAAGTCAACAATAATGGGACAGAGCTGATCTGGAACTAGCTCAAGTTGTAGTAATGCAACATCCAGAGGTCCTCTGGAGACATGAACTACCTTTGCATTTGTCCATACCCAGGGATCCATAAAATCCAAGCGAACACGAATGGTCCTACTGCCTGTGTTTGCCAGGTTAACTCTAAAGCTACATTGCTCATTGTCAACCAAGAAATGCGGAGTTTTTAGTTCCTTTTGAATCAAATGTTTATTCCTACGCTGAATGTCAAATTTCTCAACCCCTGGATGCTCAGATTGATCAGAAGGGATGAGAACTACATCAGATTTGGTGTTATATCCTGAACCGTTTACAGATGTTTTTCCAAATCTCCATGGCTCTAGAAGATGAGCATTTGTAAGAAGAAGACCCTGCTTGTTGAGCAAAACTCCAGAAGCCCATGCTCCATCATCAACAGCGATAAGACAGATAGATGTCATTGCTTTCTCAATCAAGGATGGGGGAACAGGATCTATTTGGAGATGCTCTTGAGTATCATTGGCATGTCCATCCTGAATATTATTGGAGAATGATTCTTTTTTAACGCTGATTAGGTTTCCATTACCAAAATGGATCTTTCTCCTAGTTTGTAGCTCTTCTTTAAGCAGGCTACCACAAGCAGATGTAATAGCTTCCCATGGAATCACCATCTGCCACCCAATGCTTCAATTTCAAATTAAAGCAATACATGTGAATGTAGATCTCAGCAAGAACATAGAGCACATTATGAAATGTTTACCTGAATTTCAGCAGCAGTAGCCCTTTGTCTGAGTGGCCGAGACAGAACACCAATAAGCTCTGCATGTTCCCCTAACACTGGGCTACCTTCCATTCCTGAAACATAAGGTACCATACGTTATCAGGTACCCAAAGATGGAAAGGATTAATTAACTAAAGTTGCAAGGCAAAAGCTCAAACCAGGGAGACAACGGATGTCAGCAATCAACAGTGCTTTATTCTGTGGACTAGGTGGATAGCTGTTTGCAATGGACCCAACTGATATGCTGTATAGCAACAAAAGATCAAATTGAGCTTTCACAAGGAGATGTAACAAGAATCAAAGTGAATATTAATCCTATGTATGATCACAACTTGGCAGACAAATAATCATCGAACAACATCTGCATACACCAGATACAGCTTATATAGCTTACTCAAGTAAAGATGAGCTAAAACATCTTTAGCACTAGCAACAAGAATTACACCAGTGCTTTCATATTCAGATAGTCTATTAACACAGCACAAATGTTCAGTCATCATACATTTTGATGCTGACAAGCAAGGATGGAAAGATTACAAGAAAGTTGTAATTCCAACATATGATAGAAACAATACCAAAGCAACTACCTGTTGAAAAAGTGACTGGGAGACAAGATACCGAAAGGAGAACCCATACCCAGAAGAAGATCACCTCTCCTACTCCAGGTAGCCACTTTTAATGCAGGCAGGTCCTGAAAGGAAATACCTATGCAGTCATTATCATCAAAAACCTACTAAATGGAGCTTTCAAAAATTACCAGCAAAAAAGCAAGCCCATTCAGCAAAATGCAATGCCATGCTTCATTTCACAAGTTTGATGCCACTTAGTGCAGTAAAAGCCTGAAACCTCATATGGATTTGAAGAAACTCTGAGAAGAGCAATTCTTGTAGTTGATGTTCCTATCACACTTGGCAGGCTGGACTGCGCCTCCATCATTGGAGTTTGACTTGGGAAAGAAATTTTCTCAACCTGAAATTATCAGTTGACAGTTCCTGTTAAAAAATAGTCAATCTACAAAATCAATCTCCGTGGTCCTTTGAGACTAGCATCTCATAGACCTTTGGCTATGAAAACTAGAATGCTTTGTGCAAAAATTGTTTGGGCCATCTTGAACATAGTACTAGGGACAATTGAGATGCCACCGTTGAAGAACACCATGGTAAGAGTTCCTTATGTCATGCCATTTTGCACTTGACAGCGTGAGACTCAAATGTGCTCCTTTCCAATAACACTGCCGAATGCTTATACCATTACAAATATGTAACCAGAGTAGCTTACTGAGTTGGAGTTCTTAATATGAATTATTTTAAGAAATGCCTCCAAGTTTTACGGGTGGTAGACTACCTTGTATTCAGACACGGGAAGGTGATATTCAATCATTAATCTTTGAATTGGAATTAGGTGTTGAGACATTCTATGATGATAAAGAGGCAACTATTTTGGCATGAGTAGAGACATGAAACAATGTGATGTCACTCTATTGTAAATATAGGGAGCATGAAATAAGGAACTCATTACATGAATTTCCATTATTCTCCTATAAAAAGAGGTTACAGAACATATAAAAGGTTCTGCTTGGAAACAACTCAACTGATAATGCAATTAATGCTAAATATATGGAGGAAACTTGCATGTTCCACAACTCGAAGAAGTGGTGTGCAACATATATTAGTTCAAATCCATACTGCTCTTATGCCAGAAAAGAGAAGAAAAAACAGAACCCTGAAAGTGCAAAGTGGTCATGATTAGACGTAAGACAGGAATAAAATGGCATTTCTGCTCAAAAGAAATAGGTGCTTGATTTATTTAGTTATTTAAGAATGATAAATGATATGCCTTTCAGTTGATAGAACTTTAAAGTGTTAGCCTGATAAAATTATAGTTATTTGATGAAGTCTTTTGAAATTTGAACCACAAAGTGGGATCATAGCAGAAGTTAGCTCATGAAAAATGACCAAGGATCTACAGCATCCAATCAAAATATGCATGCAAGAGAATTTGGCTTATCTTGGTCTCGGCGACATTTAATTATCTTTGAATGGAAGTATTGTCATGTTAATCCTATTTATGACTATGTTATGCATTAATGAAACAATCACCTAATTCAATAATAACACAGCTAGATAGTGTCAGGAGTATTTAGAAAGCGAGTGAGAGGCTCGGACACATCTATCAATAGGTTGTACCAATCATATATAGATGAAGTACAGAGACTATGGTTACATATTTACTCATTAATACATAAAGGTACAGAGATTGTTATTGGGTACACATTTACTGCATGTAAGTGCCAAACAGAGGAAACATCATACCTGTGTGCGCTTAGTGTTAGTAAATGATTGACGGGAATTTCCATAGGCAGCTAGGGACCAACCAACCTCCCACCCATGTTCAATTGAACTAGACGAACCTTCAACTAGGGACTGGACAGCAGCAGATGACACAGGGATGTCAACCTTCAGCAAGAACACCAAATAGACGGTGTCAAGTGCAACTTTTACGTTAGATCAGTAACAAAATAGCGACAAAGAAGAACACCAGCCATAAAAGTGATCAGTTAACCCATGCTAAAAACTAGGATAAAGTTCAAAACTAGGTATTCCTTCTCCATTTTATACGGCACACTTTCCTTTTTAGTATGTTCCAAAAAGAATAGAACCCTTCTCTATTTGGAATATCTTAAAACTTTAAACTTCCCACTTTACCTTAATGACATGCTCTTATAGCCATAGAAGTGTTATGAAATGTTTAAAACCACAACTTCTAAAGGTAATTTGGTATGTGTCAAAATCTTTTGTGCACGGGCACAAAACACCAGATGACATCAAACTAGAATTTATATGCATGCATCAAAATGAGAGCACTTATCAATTCAATATTGCAAATAAAAAACATATATAAGATAAAGTAACAGGTTTTATGATAATCAGCATTCAGATTACAAAATCCTTTCAGTCTCCTACTAAATCCTACCACTCTTAGGAGTTCTGCAGGTAGCCAGTTCAAACCCTCTTTGTTGGTCACTTTGATATCATTTTGCAATGTATTTCCTCCCTGTGAACCACGAGCAGGACATCCAATGTCAATTTAATATATCAATGAACCTAAAAAACGTATTTTCTTCCGTAGGCTCAAATAAATACCTCCCACAGTATATCAATTTGAGCACCAGGAATCAGCTCCGGCTTATCCTGCAAGATGGTACATTGTAAAAGAATTATGAAAAAGCATAACGACAAAGAAATAATCATGTTGTTCCATTTTATACTACATCCATTCAAAAACCTACCTTTCTATACATAGAATTTTTTTAATCTTGTAATTCTCATTTTGCACTTAATGGCATGCTCTTATAGGAATTGACATGGCATGCTAAAGACAACTAGATAACTTCTTACACATGTAATTAAATATGTGACAAAAGTGGTTCTTTCTTTATTAAACTCCTTCTCCAATCAAACACCATCATATAAAGTGAAACCAAACAGAGGGAGTAATTATCAACTGAAAGAAGACTAAAGATCCAAACCTTTGATATGTCCCCTCTATCCTGTTGTACAACAAAAGGCTCAATAACAGAAGCAACTGTTAAAACCAAGAAGTGACCTCCAAAAGAGTGCAACTTGCTTTCACCTTGAATCTGCTTAGACACTGAAGCATTAACAAAGGAACTGGGCAAAAGCATCCCAGATGCTGATAGTGTCGTCTTCCCAGAACTACACCACAAAATTACAACCCAAATTTAAAACTTTCAGTTCAAAACACATAACATAAACAACTATAATATAGAGACAGAGAGAGCTATGTAAAATCACTTACTTGTACAGGTGGAAAGCATGTTTTCGCATTTTTAGGCCTTTAGGGTCCTTCAAGAAGAAGAATTTATTTGTATTTTTTCAAAAATTAATTAGTAGAATAAGCAAAGTGATTGAAAATTACAGTATCTTGGCTTAAGAAAAGGGACTCACTGGGCCTTGAATTCTGACCATGACGGCATAATTGCGGGCAACATCAACCACTTCAGGAAGACCCAT
SEQ 71
ATGGATAACCCATCGGAGGATTCCTCGGATTCTCCTCAACAGCAGCCCGAATCTCCTGTAAACGATGACCAACGTGTTTATTTAGTTCCTTACAGGTAAAATCTCCCTTCCCCGTTTTGACCCATTCCTCATGCAACTGTTTGTTTATGTATATCAACATAAAAGTAAAAATAAATAAAAATAAAGAATTGAATTCTCGGATTTTGCTTTCCCAATTGATTTTATGATTTGGTTTGATCCAATTCAGCTAAACCCGAATCTGAACCCATGAGATAACGAGAAAGTCGAAACAAGTTCTAGTTTTTTTTTTCTTTTTTCTTTTTGTTTAAATTACTTATATTTTTATTTGTATTACTTGTCATTTAGATTGGTAATTGTATTAGCTTCCCTACATTGGAATGTTGTAGTTTTTTTAATCAAGTCTTATTATCTGGATCAAATCGTGTTGTGAGTTTTTTTATTTTTTTTATTAGTTGCCATTTGGATTGGTAATTGTATTAGCTTTTGTACATTGAACTAGTGTTGGTTTTTAATCAATGTTGTTGGTTTTTGTTATCTGTTAACCGGTGGATCAAATCATGTTGTGGGTTGTATATTTTTGTTTTGTGAGCTTAAGCATAAGAAAGTATCGGCCTTGGATTTTCAGTTGTGTTTTTTTGATGAAGTAAATAGTTTCACCAATGTCATCAAGAAGATGCAAGTATTACGAAAGATTAGGCCAGAGAGTATCAGCTTCAATTACATTGGTCTAGATTGCTAAGGAGCTGATAAAGTCCAGAAAGTTAACAGGGTAAGTTACAATAGATAGTTTTGCCAACTAAATAAAAGTAAAAGACACCTAGCTATCAGTTGTTAACAATGGAGAAGTAGTATAGCAAAGTGCCGGCAAGATCTGAAAGTGGTGGTTATAGGGACCTGTTTAATAACTTAGTAAACCTTAGAAGAAGCTGACAAATTGTTCCATCTACAATTTGTCAACCTTAATAGAGGTGCACACAAGCTGGTCGGACACCACGGTTATCAAATTTTTTGTTTAAAAAATGTTCCATCTCGATAAAATATCAATTGATTATGCATTATGTTGTCAGTTCAAATATTGTTTCTCGCAATTATTATAAAAAGTGCATATCTGTGGAGAAGTGCTCCGCGGGCTAGTGCGGTGGTAGGGGAGAGTGGTAAAAATGACACAAATGATGCTTTCCACTTGCTAGTGGTTGTTAAGAAGAGAGAGAATGTTTGAGCGGGAAGGACGGGGTAAATAGCATGGAAATGTTAATTGAAAGAAGTTAAAAGTTACCCTTTGCAGCATCTTCTCTAGGTAAGAATTTTTTGTCTGTGTTTTCCCGAGTAGAGGGTTAAAGTGTTGCACACACATATATTACAGGTGCCACAGACACGTATATGTTTAGAGTACTATATAAGAAAGCGTGTTTGTGTTCTAGGTGGTGGAAAGAAGCACAGGAGTCATCACCATCAGATGGGAAGTCAGTGACTTTGTACGCAGCGGCACCAGCTCCATCTTATGGAGGGCCAATGAAAATCATTAACAACATATTTAGCCCAGACGTCGCATTTAACTTGAGGAGAGAGGAGGAATCTTTATCACAGAGTCAGGAGAATGGTGAAGTTGGGGTATCTGGTCGGGACTATGCTTTGGTCCCTGGCGACATTTGGCTGCAGGCACTCAAATGGTCAGTATTTTAGAGCAGTTTCCAATTTGTATTCCTTGAAGTGTGTTAGATAAAGCCTCTTCTGACGGAGATTTACGCCATAGTTGTTGAGCATTCTGAGGATACCATTTGCATATGTGTTTTTCTCGACTTCAAATAAAACATTGATTTTTCACTTCTGGTTACAACAACCACTTGCAATTTGTTGTTTGGTTTCTTCTGCTTTTCAGACCATTCACATTTTCATTTCACATGAAAGAGGCCTCAAGCCTTTCGAGGCTTCATTGTTGTTGCTAGTCCGATGGCAATTCCCAGTTATAAATATATATTGTTAAATGCCTTGTGAATGCATATGGAAGCTCGTTTTTTAAAGCATTTTGAGATTTCATTCTAAAAAGACCACTGTTTATTCTTTCAGCTTTAAAGTGCTAAGCTCAATCTATTAATTCGCTTCCTTATTTTCTTTGTCTCTTTCATATATTTTTTTTGTGTGTGTGGGGGGTGGGGATTGGTGTTAACTTATAACTGATTATTTCACTTTCCTTTTTGGTGTTTTTGCACATCTAAGAAAGGGAATTTGTCTTTTGATCCTAGTAACATGTTATTTAGCACGTTAATTTCATACATCTGGCACTATGTAAAAGTTGATCTTTTGATTATAGAGTTCTGATTAGTTTGATTGGAATTGCTCCTTTCCATCCAGGCACAGTAACTCTAAAGCTGCGGCTAAGAATGGAAAAAGCTTTTNCAAAATTGCTCCTTTCCATCCAGGCACAGTAACTCTAAAGCTGCGGCTAAGAATGGAAAAAGCTTTTCAGCTACAGATGAGGATATTGCAGATGTCTATCCTTTACAGCTGAGGCTTTCTGTTTTGCGGGAAACCAGTTCCTTGGGAGTCAGGATAAGCAAAAAGGTTAATAATAACTTTGGAATTTCTAGTTTCATCTACAATTCCCATGAGATTTGTACTGTCATAATATCCATAGAGTGCATAACCACATGTGATCTTTTTGGTACAGACCTTGCCACAATTTGTGAAGCTGACTTTCTTTTTCATCAGCTGCTTTGCCATCTTTCATCCTTCACTTTTGTTGTTGCTGTTTATTGTTGAAACTGAAGGATCTAAGATGGACAAGTCCAAAATATCAACTTTAAGAACAAGGGTTATGCATGAAGCCCTCTATTCTCATCCTCATATTAATTACTCGGAATGGGCATAGCTGTTAAGTGCTTCCATTTTTGTTTGATATTTAAATATTAACAGTAAGAGCTTTTTATGGTTCTGGGTTGTGCAAAAAGAGGACAAGTATGTTAGCTGGATACGTATCTTTTTGCTGACAAAGTGGGATAAATTTTGCATGGTAATTTTTGGCTTTACATGAATTTGTTGACAAGGATATCAATCCTATTGATATTTATGTAATTCACTTGAAGCATTTAATACTTCTATTTCCCAATTACAGAATTGCTCATCAGCAATTTTTGACCACAGTGAGTTCTAGAGAAGCGAGTCTTTTGAAAATGATAGTAGAAAGAGCGCTTCCTTTCTCTAGATTGTCTTTCTAGCAAAGTAAATTAATCTAGAGTTAAATTGTATTGACAGGAATAAACAATATAAGCGGTATTTTCATCCAGACACCCCTCCCCTGTTTGTGGAGCAGAAAGAAACTATGTAATTGGGAATCACTTAGTTTTGAGATAACATAGTAGACTATGTGACAGTTTTATTCTTTTAATTTAAAACTAACAAGTTGTCTTATATCTAAAGTTTTAGCAGAATCATTTATTCTGCCTCTAATAGCTTGGAAGAACTATATATCATTTGAGGTCTTTACTTGCCGAAAGACATCGGAGATGAAGTTAGTTTTTTATTAGATCAGATATGAAATTAGTTCTAGCTTTTTTTTATATACTCAAGGATGCCCTGACTTCCTCCATCTTTATCTATTTTTGAGAAATTCTCTTTCTTGACTGCCAAATGCTAAGAGGAAATGGTACCAAGCGGTTACATGCAAACCCTGGTCTGAAGAAGTAAAATAGCTAGAGTTCTAATTTTCATAAAGCTAATAGGAAATAATTTGATCACTTGTGAAAATAAGCCAAAAGAATGATGCTCATTCGAACAAAGTTCTCTTAGAGTTACTACATATTTCGTTGTGTATACATGATCCTTCAATGCTACTTCATATTTATTATTTACCCGAAAGTTGATGTTAATTTGAGCTCTTTTTTTCTTAACAATGTTATTGCTGACTTGTCCGTTTACTGCCTCAGCTTGTGTACATAAAAATGTAGTTTCCAAAATGTTGTTTGTATTTTGTAACTGTTGCATGTTAAACATTTGCAATATTGCGGTGGACAACGTTTCTTTTTTTTTTTCTTGCAATAATATCCTCACCGATGTTCTTTTTTTTTTTTTAACTATCCTCACCGATGCTTTGGAGTCAGTTAAGCTGCTTTATTTCCTTTAGAGTCTAGTTTTACATTTGTCTTCTCACATTTGATTCAGGACAATACAGTTGAATGCTTTAAAAGAGCCTGCAGAATTTTTAGTGTCGATACAGAACCCGTAAGTTTCAATACTGTTGTTAATCAATTGCAATGGTATCTCTTTCAGGAGATTAATGGTATTTTGGTTCTCTGCAGTTACGGATTTGGGATTTATCTGGGCAGACGGCATTGTTTTTTTCAGATGAAAACAATAAGATCCTCAAAGACTCTCAGAAACAGTCAGAGCAAGATGTATGTACTTTCAACTGTGTCATACTTCATGACTAACCAATAAACAAGTCGACCAATGCTTCTGCGGCATTCACTATTTTTCCTGTCTTTACTAAGGAAATAATTTAGTTATGCTTTTTTCTAATTGTTTTCTAATTAAGTGTTTTTAGCAGATTTTTCCATTTCTTTATCTAGTGTTTGTGTCGTAAAAAGATATATAATGATTGAGGTGATGAATATGCTTACTTAACACTTCATCTAGGAATGAAGTGAGACAATGATTTTTCTCCATTTTCTATATAAGTGTTGTTTTTTCTTGAGCATGGACAATGCTAAGCCCACCAAAATTCAGTTTTATGCGACTCTCTTTCATTTTAGGGTTCGTTCTGGAGTTATTCATTTATAAGCAGTAGATGTGCTCTTTCCTTGTACTTTCAATGATGTACACTCTAAGAAACTTTAGCTCTTTTATTACCCTGGGACAAAAGAAACACATAATAAACGGGACTGTCATGTCTAACGACCAGCTTATACACCCATTCGTCTTGGAGAACAGGCGGGTAAATTCAGCTTAGTTATGCTGTTTTTCAGCTGGATCTAAGATTACAAAAAGAGCAACTGTTTTGTTTTTTGTTTTTTTCCATTTGTGGCAGTTATTACCGGTGTGGTTCATCATTGATTGTTTTTGTATTTTCTTAGGCTGTTTATCTGTGAAACAAATTGAAAGAGATGCCAAAGTTTTATCTGTTTTATGTTTCTTTTTTCCTTGTGGGTTACCATTTAACTGAGAGCAAGGTAAACCTTTACTGTTGAAGGCATTTTGCTGGTTATGGGTTGCCTTATAGTCTTATTACTGACTCTTGAATTAACTCTAGAATTTAGTGTTTAATGGTTCGCACCGCTTGTAAGCAAGAAATGATTTGGACAAACTTCTTATTTTGTCCTCTTATGTTTTTGCTTGCAGATGCTCTTGGAGTTGCAGGTCTATGGGTTATCAGATTCTGTTAAAAATAAAGTGAAAAAAGATGAGATGTCAATGCAATACCCTAATGGTTCTTCTTTTCTGATGAATGGTACTGGCAGTGGTATAACCTCTAATCTCACTAGGAGCAGTTCTTCATCATTTTCTGGAGGTCCATGTGAAGCTGGTACCTTGGGCTTGACTGGATTGCAAAACCTAGGGAACACCTGTTTCATGAACAGTGCTCTTCAGTGCCTTGCACATACGCCAAAGCTTGTTGATTACTTTCTCGGGGACTACAAGAGAGAAATAAATCATGATAACCCTTTGGGAATGAATGTAAGCAATCTTGAATATTTCAAGATCATTACGTGCTGCTTTAGATGTTTTCTTCAGTTCTCTCTGAATAAGTCAATGTTGACATCCCTTAACCTATTCTACATTATATGTGGTTGGAAAAGTAAAAGAAAAAGAGAAATTCATTTGATTACTCTCCAGGTGAGGAATTCTTTATTTACCTCCAATTGTTTTGTTAGCCCGGACAAAAGAAAACGATATGCTTATCCGTTCCATTCAATTTAGTAGGGGTTGAGAAAATTGACTCGGAGGGTATTCAATATCTCCACTTTTTGTTTCGTACCAAACAAGGGGAATAAACTTTACCTCTTTTACTTTTCCTCCTCCTTCCACCTCATCTCATCCCAATCAAACATTGTGTTCTAATCTGTCTCCTTACATATTTTATTGTCTAAGTTCCTCTCTTTAAATTCTTTCAGGGTGAAATTGCATCTGCTTTTGGTGACCTTTTGAAGAAATTATGGGCTCCTGGAGCGACTCCTGTGGCACCTAGAACATTCAAATTAAAGCTTGCTCATTTTGCTCCTCAATTCAGCGGCTTTAATCAGCATGATTCTCAGGTCCTTTCAGTCCTTCCTGTTGGATTTAGTTTCCCAGTTTTAGGTCACTTATTAACGCTCTCTTTTCTGTCCTCTCATTTTGTGGGCATCTTTTGACATCTAATTCTCCTATTTATATCTGCAGGAGCTCCTAGCTTTTCTATTGGATGGACTCCACGAAGATTTGAACCGTGTCAAGAATAAACCTTATGTTGAAGCTAAGGATGGAGATGATCGTCCAGATGAAGAAATTGCTGATGAATACTGGAATAATCATCTGGCTCGTAATGATTCCATCATAGTGGACGTTTGCCAGGTAAGTAACATCCGATGGTCTCTTGTATCTCACTAGAAGTAGGAAACATTTGATATCACCGGCACTCAGTGGTCTCTCGTCTCTCACAGGCAAATGTGAATTATTGATCTCATTTCAACATTGACTTGAAAAAGCAAGAAGAATGAAGTGGCATATTTTTTAAAAATATCTGAACTCTACTGTATTTGTGCTGCGAGAGTTGTTCTAGGATGAGAGAGTAATTATACCCCAACTGTTTGGGGAAGTTTAACCAGTGTTCCTAAAGCTTGCTTCAAATTTCTCAGATATTTTTGTCTAGATTCTCTGCCTTTTTCATCAATAAGATTTCTTACCTTACTCAAAAGAAATTGTAAGTAATGGAAATTGAATTCAACTCTTACCATAAGTAATGGAAATTGAATTAGTCGTCCCTTTCAAATCAATCCGATAACCAACTTGGTTCAATAATTCGGAATAGTGGGAGTACTATTTGTTAACAACTGACATACTATTTTCCTAGAATGCAGTCCTGAACTAAGAGCTGAATTTTGGATTCAGGTTTATTTTAAAACAAATTAGTTTTATTTGTAGTGCTCCCTGTTTTCTCTATATGGTCTCCGCATTTTATCTTTGATGTCTTTTTGAGTTTTACTGAAATTTCCTAAAAGAAGAAGAATATTCCAGCCTTTAATCTCCATAAGAAAGTTAAATTTTTGTTCTTCCATAGTTTACAAGTTTAATTATATAAAAACTTGAACCTACCTTATCAAAAAAGAAAAAGAATTATGTGAAAACCTTGAACATTGTGAATGTTTCTGACATTTGTGCACTCTATAGGTGTGTTTGGAAGTAATTGGTCTTTTTATGTGTTATGTAGTCTATGGCGGTATGTAATTTCTAAATTCCTCTTTGGCCAATCATCTGTGAGATAAAGCTCTGCAATTCTAAGATAATTTCGAATACCACCAATTGATGATAGCTTGTCATTTTTTTTAGTTTTTTTATTCTGTAATTTTTTGCATTAATAAAGTTGACTCTTAGCCATGTCTATCATTAGATGTCGTAAATTTTGAAATTTCACTTTAGAAACCATGTACATAGTACATGTTTCCTAGCAGGTCGTGATTTGTCTATGACTTTTAGGAGGATTTAAAAGTTTACTTCATCTGACTCCCTTTTTCTGCACATTTATACAAGTTTCTTTTCTAGTCTTGCTTCTCAGAGCTTATCTCGATTGTTGCAGGGTCAATATCGTTCCACATTGGTCTGTCCTGTTTGCAAAAAGGTCTCCATCATGTTTGATCCTTTCATGTATTTGTCACTGCCTCTTCCATCTACATCTATGAGGTCAATGACTGTCACAGTTATAAAAAATGGCAGTGATATTCAGATATCTGCCTTTACAATCACTGTTTCCAAGGATGGAAGACTTGAAGATCTTATTCGTGCTTTAAGCACTGCATGCTCTTTGGACGCTGATGAGACCCTTTTGGTGGCTGAGGTAAAGTGCAGAATTTCCAGTGATGAGAAATGGTTATGGATTTCAAGTTGTTGCTTTATTGTTTCCTAAATAGAACTTATTACATACTGTGTATTGGATAGTCAAGTAGAGTCCTTTTTCCTATTTCCAAAATTTTATTTCCAGCTCTTGCTGGGTTGTTGTTGTTGTATTTCCAGCTCTTACTCCATTTAATGTTACAGATATACAACAACCGCATTATACGTTATCTTGAGGAGCCAGCCGATTCATTATCCTTAATAAGAGATGGTGACCGACTTGTTGCTTATCGGTTGCACAAGGGTACTGAAGAAGCCCCCTTGGTTGTGTTTACGCATCAACAGATTGATGAGTATGTCTTGACTTCATAATTTGGGCATTATCTTTTTTTGCTTTAAAGTTCATCAAACATTACTAGCCATTACTCAGATGTGTCTTGCATGCACAGCTATGTTTCATAAGTAATAAGTTGGGGGAAAAAGTACTCCAAGGGTGGTGCTTCCACATCATCACTCTTAATCATGGCAGGGTTTGGATGTGGGCGTACTTGATGACTACATTGCTTAAAAGAATTGACAAAATATTTTCGCAGATGACATATGTAGTAATATCTCAGTCTATTAGTTTGCTTTATGGAGATCGGGTGATTAATTCATGATCGACACAACTCCAGTTAGTTAATAGAGTAGGCTGTTAGTTGTCATATACTTCTATCTTGTATAAAGTAAAAATGTGAGGTGGTTTATTTAGTACTGTTGAGCATCCTCAGTCTCAATTCGCTTCACTTGAATACATTACAAATCATTGTTATGCATGGTTCGTCGAGCAACATGTAGTTCAGATGATGTGTGTGATCCTTCTAGATTATTTGGACAATCATGAAACTATTGCTTCTTCCATGCATCTTACTGCTGAAGCTGTATATGATATGGAATTTCATGCTGTTTTGTTTGCTGATTATGTTTAGTTTAACTTTTGATCCATTGAAAGATTCTCATAGTGGTCCTTGACTCATTAAATGAGATGGCTGATATTTATTTTGGCAAAATATCATTTCCTTCTTGATTTCCTCTTCCATTCTAGCAATCTTATGAAGACTGCATGTGCAGGCATTATATATACGGAAAGCTGACCTCAAACATGAAGACATTTGGCATTCCGCTTGCCGCGCATAGTAGAGTTCTTACAGGATCTGATATCCGTAGTCTTTATCTACAGATACTTACACCATTCTTAGTCCACAATACAGCCCAAGCAGATAATCTTAACTGTGATAGAAGTGCTACTGAAGCATGTACAGATTCAGAAGTCATCACAGACATGGAACCTGGCAACTCAATAGTAAACGGGGTTCCAGAAAGCATTGCTGAAGAAGATACTGCCGAACCTTTAGACATGGAATTTCAATTTTACCTATCAGATGATAAGGCAACCTTTAAAGGCTCCGAGATTGTAATGAATGAGCCATTACAGTCCACAGATATCTCTGGACGGTTAAATGTACTTGTAAGTTGGTCACCTAAAATTCTTGAACAGTACAATACAGGCCTTTTCAGCTCACTGCCAGAAGTTTTTAAATCTGGTTTTTTTGCCAAAAGACCACAAGAATCTGTCTCTCTGTATAAATGTCTTGAGGCATTTCTGAAGGAAGAGCCTCTAGGGCCAGAAGATATGTGGTAAGTATGCAACTCCCTCACTTCTGTGATTGTACACCATTCATATGCAAGCTATGTATTCATAACATATGAAATTTCTCGTAATGCTTCCCTTTTTGCTTCTTCTTTGGTTTGTGCTAATATTATAAACCCTCAACTTTTGTAATTACATAATTGTATTTTTCCAATTATACCATTTTATTTCATTTCTGTCAATATTTTCCACCGCGTCGTGATTGCTTATTGTGGATAAACCATTCTTATTAGCCTCCCCTCAACCAAATTGGACCTTGACTTTGCAACATGGCATGAGTAGTATCCTTCCAACTTCTAGTTCAGTTATGTTAAAGAAACAATGACAGCATCTGATGATCTTATGTAGCACCTATTGATTTTCTGCACATTGTGCTTCTGAAATGCTTTATGTGTTGCTCTTTGTTTCTTGTATATTCATAGCAACTACAGACAGTAATTGAGATAATAATATTCAGCTATTTACCTGGGGCCACTCATGGGAGGAAAGACTTCTTTCATGAAATGTTTCTAGTATTCTATCTTATCATCTAATCATTTATTTGTGTCATTCGCTTGGGAAGTATGAATATAATCTGCCTAACTTTCTTTGTCTTATATCCAACATTAGGTACTGCCCTGCATGCAAGCAGCATCGCCAAGCTACTAAAAAGTTGGATCTTTGGAGACTGCCGGAGATTCTGGTCATCCACCTGAAGAGGTTCTCGTACAACCGGTTTCTGAAGAACAAGTTGGAGACGTATGTTGACTTCCCAACTCATGATCTTGATTTATCCTCATATTTGGCCTACAAGGATGGCAAATCTTCCTATCGGTATATGCTTTATGCAATTAGCAACCATTATGGAAGCATGGGAGGGGGTCACTACACTGCGTTTGTTCATGTAAGTGGTGCTGCGACTTGGATTACCTTGCTTCTTTTTCTTGGTTTTGTTTCTATTCTATGGTAAATAGGATTCTTTTATACCTGATAAAAATGGCATCTTAAGATCAGTACTTGGGGAGAAGGGTGGGTGGTGGGCGGTCACTGAAACCTACCTCCAAGGGCAAATATAGAAATTTCCTCTATTGGTCTTATTCTTATTGTTCGTAGTGAGTGTTCCTTTGATGTATTTTTTAGTTCCAATGCATCATCTGCATCTAAATTAATCACATATTGCACACATGTGCATCTATTATATATTTAACTTTGGTCATGCGTCTTCATTTTTTTTATTTCTTCATCATGAAGAATATGCAAGAAGGTCAAATATTCAGACTTTTACAGTCTTCCTAGTTTAATCCAGATATTCTAACTTTGTGTTTTTCTTCTTCTAATAATCTAGCAAGGTGCTGATCGGTGGTATGACTTCGATGACAGCCATGTGTATTCCATCAGCCAGGACAAGCTCAAAACCTCGGCCGCCTATGTTCTATTTTATAGACGAGTTGAAGAAATC
SEQ 72
TTACACTTGCCTACTACACTCTCCTTTGCCAAAACCTACTCGTCGATTTCTCATATCAAATTCTACCCATAAATTTTGCTGGTGAAAATTACCAATAATATTGCTTGCTATTCCAAGTGATTCTGACCGTCCGATTCCAACACAATGGATCCCACCTTCTACTTCATCCAACATCCTTTCCTTATTGATCAAAATATCAACCCCGTTTTCAAATTGCAATGTCATATCACCTATCAACCGTCCGATTTCGATCGGACGGTTATCGAAGCACATGTCGAGTGCACCACCATAAACGTAACCTTTTTTCAATCTTGGACCTACTAACCTAACAATTTCTTCTCTGACCTTATTGTACGCTTCTTCCACTAAGAAAGTGTACTCCGTGCCGGAATCAATGATCGTCTGGCCGGAACCACCAGCGTTTGGCCGGAAAACCCTCCCGGAGATGTTTAATTTTTTGCCGCCAATTTTTATCCCCACCATGCCAACAGTAAAAGCTAGTGGATCCAAATTTGGCATGCGTTGACTTTGAGGAAAAGTCAAAAGATTTATGTATTGAAATGTATGGGAATTAGGGTTTTGGCCTAGGTAAAATGTTCCACTAGGTTTAACTGCATGGCTACCTTGTCTAATTGGCACGCAATATGAGAATTTTTGTACCTTAGCTTGGGAGGCAAAAGAAAACCGTCCAAGATTCATTCCCAAAATACCCTCAGCATCTTCGGACTCGGTCGCACAACCAAGAATCAAAGGAGGGGTACTTTGGGAACGTGAAAATGTAATTTTTTCACGGACAAGATTACCCTCAGCTAAAGTACCATCAGCATAAAAGTAGGAATAGTGGCACAAACGATTTTGGTCACAAGTAGTTGGAAGGGTAAAATCGGGAATTCTTGGCTTACATAAAGGATGAGTACAAGGAAGAACAGAGAAAGTAGAAGACAAAGAAGGATCAAACGACGTCGTTGGTGGGGGTCTTTTGGGAATTTTCTTATGACATTGAATCCAAGAAAGTTGGCTACCAGTGTCCAAAACCATTTGTTGATTTTGTGGTGGTGTTCCTATTGGTAGTGTAACAATTAAAGCCATTGAATATTTAAAAGTTGATTTATAGTTCAAAGATGGAATTCTAGACATAGTTTTTGTATTTTGAGTTTGTCTTCTATTATTAGAAGCCATAAAAGAAGAAAGAAAAAGAGCTTTAGAAGAAGAGTTATGTGATAAAGATGTTGAAATAAGAGGAAATGACATAGAAAAAGGCTTATGTTTAATGGTTTTTTGTGCTGAGATGTAGAGAAAATTGAAGATTATGAGAAGAAGAAGAACAAAAACTCTAGAAGAAGAAGCCAT
SEQ 73
TACACTATAATTATATTTTCGTTAAATATGAAGATTTTTTCCATATTCTCTTTGCTTCTTCTCCTTCTCCTTCCCATCTTGGCTTCATGTCATGAAAAACAGGTACAAGCATATACAATTCTAGTTTCTCATTGATTCTTTAATCGCAGTTCTACTTCTGTTTATTCTTTGTTTTAATTATGGGGTTTTGTTTTGCAGGTTTATATAGTGTATTTTGGAGGACATAAAGGGGAGAAAGCATTGCATGAGATTGAAGAAAACCATCACTCATATCTCATGTCAGTGAAGGAAAGTGAAGAAGAAGCCAGATATTCTCTTATTTACAGTTACAAACATAGCATCAATGGCTTTGCTGCACTTCTCACCCCACATGAAGCCTCCAAGTTATCTGGTATAATAACCACGAAAAAAGTTCACTCTTTCAAAGAAAGAGTTTAAGTTACATATAGTAAAATTTAATTGGTTATAGCAGGTTATTGCTCTATTTTCTAGGTCAGAGTAACTTGTTTTCATATGTCAAATTAATCTGATAGTGTAAAAAATCCTGTATAAGAAACACAAGGTTCTTGTATGTAGAAGAACTTACCTTATGTATTATTTGAACACAGAATTGGAAGAAGTGGTATCGGTGTATAAAAGTGAGCCAAGGAAATACAGATTGCAAACAACAAGGTCATGGGAATTTTCTGGAGTGGAAGAGTCAGTGCAACCAAATTCCTTGAACAAGGATAACTTGCTACTGAAAGCCAGATATGGCAAAGATGTCATTATTGGCGTTCTTGACAGCGGTACATACATATATATTTGCTTACCATTATTTCCAATATGGCATTATTTTCCCTTTGTTTTAAATTTTAAATGTATTTCCACAAAGGGCTACATAATCTAGCATGTGATTATCGTTTCTCCAATAGTGATACAGACAATCTTATTAGTAAGACTAATGCCTTGTATGTATAATAGTAGAAAGGGATAACACGTGAGGAATCAACCTATATATATATATATATATATATATATAAATGTATTTCAAAAAATACTACTTATAGACATATAAGGAAAATTGTGAGAAGCCTTGTACCAAAGGGAGTCTAAAGTTAAAATAAAAATTCAACATGTTTAAGGATTATGGTTATATAGGATGGACGTGTAACTGTGTCTATCCTCCGGCTTATCACTGGCAACTGAACACGAGGGTTGCGCTCGTTGCGGGACTCATTAATTATGAGATTATCAACTGTAACTAGTGTTAATTGACTAGTCTGATACTTAAAAAAAAATTGGAGTATGATATTATGTGATGAATGTTGTTGGATGATTTACCAGGGCTATGGCCAGAATCTAAGAGCTTTAGTGATGAAGGGTTGGGACCGATTCCAAAGTCATGGAAAGGAATCTGCCAATCTGGAGATGCTTTCAACTCTTCAAACTGTAATAAGTGAGTGTAATTCCTCTTCCATATGTTTTATATCTTTCCTTTAACTTTTTCTTTCTTTCTTTATCTTATCCCTTTTTATTATCTCGATGATCTGATGTCTACCTGTTTTACAATGATTTAATGTGGATTTTAGCCATTCTTGGGTTAGAAAATGTTCAGCTGCTCTACAACCCTAGACCACATTCTTTTTGTTTTGGGAATTCCTGCTAAAATAAGCTGATTTACTACCTTAGACGTTTGGTTTATCAAATATACCAACCTATACGTATTTCTTTATTTTTCTTTTTTTAATAAACTTTATTAAATTTTATAAGGCTGAGATGACTTTGAACGAAAAATATGATTCATTTAGTTTAAATCCAACTTATTTGGAACTGGCATAATAGTTGTTGTTGCTATTAAATTTCATAAGTAGGCTTAATAAACATGTCATCAAGTTTTGTGCGCACCTATCATATGATGCCTTGTTTATCCAATTATGGATTTCAGGATTTGCTTGGAAATGAAGTGTTTGGCACTTATCTCTTGTCTCTTATACATTGATCTAACTTCGTAAGATTAATTGTATTTAAATGGCTTGTAATAGAAAAGGCCAAAGGTCAATTTCAAGGCCGATTTTTGGAAGTTTTCCTTTGTCTTCTTTATCAGTTGACCCTAAACCATTCTCATAATTTAGCTTAATTAAAATCAATTAAAAGAAAGCAGATACATGTTTAGTTTTTTAATCTTGTACCTCTCTAAAGAGTGAAAGAGAGTTTTTTTGAGAGGACAGGACCCATTGGGTGTCCATGCCTGTCCTTTGGTGGCCTTAGGATATCAGTGTAATAATTTCAATATTGTCCATTTCAATCAAACCAAGAGAGGTTATGCTGACAAGTTGCTAATTGTTTTTTGGATTCTTGCTTTGCCATCTTGTGAACTTTGTATCCTTCCAATGCTTTGTTGTGCAGTAATTTGTTTTTTGCATGTGTGTTGTCATTATGGTTATTGTGAAGTCTATAGTGAAATTTTGTGAGGCCCTTACTTCCAGTTTTGCACGGATATTCTCAGTAGTAGCCAGTAATATTATCCATTTTGACTATCTCATGACTTCCATGCAGCAGCTTTTTGACCTTTAGAAAGTTGATGATGAAATTCTACCATTTTAGAATGATAAGTCATTTTCTAGCTGTTAAGTCACAAAAAGAGCACTAGAGCAGTAAAACTTTTGAAGTTTCATTGTGAGGTTGGGAGGAGTGGGCCTGATTATCACATTCTTGTCCTAATTTGTTACTGCTACTATCCTTTTTTTTTTTTCTTATTAAGAAGAAGAAAGCCTTTTCTTCCCTTCTTTTCAAAGGGTAGGGGGTGGGTGGAATATATTAGCCTAATTTGTCATATTTTCCTTCTCGTATATAACCATGCTACTATATATGTTGTACTCAAAATATAAGATTTTGTATACCTTTTCCTCTATATACTAGATAGTGTGATCCCCTCATGCATCATCTTCTTTTCTCTAGAAGAAAATGTTTTATTCATGGTGACAGGGGAGGGAGAGGGTGGGAATGTTGGGATCATATCTTGATATCTTGTCTAATTGATCATCTCAGGCAAATTTAGGGTGGTCATGTGAGTTAAACTAAATAATTTTATTTCATAGGATCAGCCCGCCCTGATCAAGAATTACTTACTAGCTAGCCAGACTAGTGGAGCCCTAGCCGGAGACATTCTCTAAATCATGCCTTAACGCGCCCATCTTCCAAATAAAAAAGGGCTAGTTAGTAAGAAAGATGGAAAGACCTTTATCCATAATTCTTTCCCAGTCTACCTCCTTCCTTAATTGTGACATGTCCCGTTGATCCCACCTACGAGCTATCTGTCTTTGCCTAGCAAGATAATTTTTGGTCTCCTATTCTTGCCTATTTTTATAGCCTGTCTTTATCAAGCGAGATAATTCTAGTTCTTTTATTTTTGCCTATCATGGTAGGAAATTGGTTCGGCTTGATTGAAATTTTTTAAAATGTTTACATATAAAAAGAGTACACGCATTCTGAACCCACCAACTCTAAATCCTGAACTTGCTTCTCCTAATTATGTAAGATAACTTTAATATTTATTCTCCTATGCTACTTTGGGACTTCTATTGCAGGAAAATAATTGGAGCTAGGTACTACATCAAAGGTTACGAGCAATATTATGGCCCTCTAAACCGAACTCTAGATTATCTATCTCCACGAGACAAGGATGGACATGGAACTCATACATCATCAACAGCAGGAGGCAGAAAGGTTCCAAATGTCTCTGCCATTGGTGGCTTTGCATCTGGCACCGCCTCGGGTGGCGCGCCACTCGCACGGCTAGCAATGTACAAAGTCTGCTGGGCTATTCCGAAGGAGGGCAAAGAAGATGGAAACACTTGCTTTGACGAAGATATGTTAGCAGCAATGGATGATGCTATTGCAGATGGTGTTGATGTTATTAGCATTTCTATTGGAACAAAAGAACCTCAGCCTTTTGATCAAGATAGCATTGCTATTGGAGCACTTTATGCTGTGAAGAAAAACATTGTTGTGTCTTGTAGTGCAGGGAATTCAGGACCTGCACCTTCTACATTGTCTAACACAGCTCCCTGGATTATCACTGTTGGTGCTAGCAGTGTTGACAGAGCATTCTTGTCACCTGTTATCCTAGGAAATGGCAAGAAATTTACGGTAACACGATAATCTATTCATTTTCTGTACACTATTTCATCTAAAATGTTGTAACACTAGGATCATAACGTTTTCCTTTATCTATTTAATTACATTCATATTGGAATGAAATTGAATCCATTTTTCGTTTGCTTAATATCAGGGACAAACAGTTACACCTTACAAGCTCGAGAAGGAGATGTACCCTCTAGTTTATGCAGGACAAGTAATCAACTCTAACGTAACCAAAGATGTAGCAGGGTACTCTCCTTGCCTCAAAGTTTCAATATTTTTAATTAATAATCATAATTTTCTTTTGGTTGATTATGTTAAACACTATCTGAAACTTTTTCAAAAAAAAAATTCAGGCAATGTTTACCAGGTTCCCTTTCGCCGAAAAAGGCCAAGGGGAAGATAGTAATATGCTTGAGAGGGAACGGGACAAGAGTAGGAAAAGGTGGAGAGGTGAAAAGGGCAGGAGGAATTGGTTACATACTAGGAAATAATAAAGCAAATGGAGCTGAATTAGTAGCTGATCCTCACTTTCTTCCAGCCACTGCAGTGGACTATAAAAGTGCAATGCAGATTCTCAACTACATCAATTCTACAAAGTCCCCAGTGGCATATATTGTCCCAGCTAAAACAGTTTTGCATTCTAAACCAGCACCTTACATGGCTTCCTTCACTAGTAGAGGTCCAAGTGCAGTTGCACCTGATATCCTCAAGGTCAGAATTTACATAACAAACTTAAGATATTTACCTGACTTATGATTTATGCTTCCTCATCTAAATTAAATTCTGATTTTCGCTACTTCCACAGCCTGATATCACCGCACCAGGGCTGAATATATTGGCAGCATGGAGTGGCGGATCTTCCCCAACGAAACTAGATATCGATGATCGTGTGGTTGAGTATAACATAATCTCAGGTACTTCCATGTCTTGCCCACATGTCGGTGGCGCCGCTGCACTTTTGAAGGCTATACATCCCACTTGGAGCAGTGCTGCAATAAGATCTGCTCTTATAACCTCAGGTACCTCTCAACTACTTTTGAACTTAACTTATATACACTAACTACAGTATTTTAACCTGTTATAACATATATAGTTATTTTGCTGCAGGTGACCTGATAGTGTGTAAACATTATTTTACATTGTCGGTGTATAGAATTTAAACTCCTTTTTCGTCCAAAATTTTGTATTTTGAACTGATCAATCGTTATATTTTCAGCTGGATTACGAAATAATGTTGGTGAGCAAATAACGGATGCATCAGGGAAGCCAGCAGATCCATTCCAATTCGGAGGAGGGCATTTCAGGCCATCAAAGGCAGCAGATCCTGGACTTGTCTACGATGCTTCCTACCAAGACTATCTTCTCTTCCTTTGCGCTTCTGGTATTAAGGATCTTGACAAATCCTTCAAGTGTCCCAAGAAATCACATTTACCTAACAACCTAAATTATCCATCTCTGGCTATTCCCAATCTCAATGGTACTGTTACTGTTAGCAGAAGGTTGACAAATGTTGGTGCACCAAAGAGTGTTTACTTTGCCAGTGCTAAACCTCCATTGGGATTCTCTGTTGAGATTTCTCCTCCCGTCTTGTCTTTTAAGCACGTTGGTTCGAAGAGGACGTTCACTATTACAGTGAAAGTTCGAAGTGATATGATTGACAGTATTCCGAAAGATCAGTATGTGTTTGGATGGTATTCCTGGAATGATGGAATCCATAATGTTAGGAGTCCAATTGCAGTCAAATTGGCA
SEQ 74
ATGGCAACACGTAGAAGCTCTAGCTCTGCTCTCACGGCCCTTGCGGCGTCTCGTTCCCGCCTACTCTCGCGGTTTCGTCCTGCAGTTTCTCGTCTCTCTCAGAATACTTTACTCGGCACCGGCAGGTGTCCACCTCCCAATAGTGGATTTTTTGTTGCAGAAACAACTGCTGCACTTTGGCCGAATTATAACGTGTTGTCCAAAAGTTTCGTGCACTCTTACTCTACTACTGCTGCTAGCTCCGGACAGGCACGACTTTCTTCTTCCTAATTGCATTCTTCTCTGTTCAACGACTTTTCTTCTTCCTAATTGCATTATTCTGTCCGTTCAATTGGAAGTGCTAATAGAATTAACTCTAATTGACGTTTAGATTAAACTTGAATGAATGCTGTTGGTTCTTTTATTTAGCTTTTGATGCGAAGTGAAGTAATCTCTATTTAGATATTGTCAGTTAGAGAACTATTTTCTCAACGTTAAGGAACATCATTTCCAGCCTTTTTTTTTTTTGCAGAGTGGAAGCCTTAAATTGTGTATTTTTGGACGAGAAATAACAAAAATGGTCCCTTATATGTGGGGTAGAATAAAATAGTCCCTTAATATACTCCTGAGCAGTTTTGGTTCTTCAAGTTTGCAAAAAAGTGAGCAGTTTTAGTAGTCGTCAATTATTTTAACAAACTCTGGTTGTTTAATTTGACGAATACGAAGTCGCATTTGGAGGTGCATTTTTTGCCGTTTATGATATATTTGGTCTATTTCTGGTGTCTGATGGAGTTTCTGGGATTTAATGGGTCTTTCCTAGTGGTTCTAGTTAAATCCTTTTCTTTTTCATAGTTTCGCTAAATCTAGCAGCCAAATTTTGATAAACAGTTGACAAGATAAAAAATGCTCATGCGTGGCAAACATAGATCCTTCTGATAAGCGTCAAGCAGTGGAAAACACTTTTAGGTGCTGAAGTGGATTTTTATAAATTGGCAGTTACGTGTTAAGTGAGAAGTGGAACTGATAATCAATTAGTATGGTTGGTAAAAAAACTGTTGATAAACACTTTTTTTGCTAAAATAACTGTAATGACCTTAAAGTTATTTACAAATTCTATAATTTTAAAGTATTTATTACATAAAAAGACGAAAAATAGAGGTAATTAAAAGTTATGTTAGAAGAATATATTGGAGATTACAAAAGATCATAGGGATAAAATCGTAAAAGGCTTGGTCAAACAAAAAATGTTTATAAGGTATAACTTTTGACTGATTTTGGCTTACAAGTTCTTCTCGTACGAGCACTTTTGATGTTTATCAAACGTGTAGATAAGCCAAAATGTGCTTACAAGCTAGTAGGACCCTCTTATAGCTTAGACAAATACATGTATTTAAGAGTCTATTTTATACCTACCTGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAATAAAAGTTACTACTTTTGTCATTTTCTAATTGTTGGACAGGCGGTTTCATTCTACTATTTTTTGACAATTTCGTGGGTCATCCTTGGGGTAGGGTAGGCTGCCTTCGTCACACACCCTTGGGGTGCGACCCTTTCTCGGGCCCTACATGAATGCACGATGTTTCGTGCACTGGACTTCCCTTTGTGGAGTACACTTTATCAAAGTTACGTATAAACACATGCACTAAACTAGACTTTTGTGATAGTTGTTTCACTTGAGGCTGTCAGTTTACTTCTTTTTTTGTTTGGCTGCTATTGGTGTTGCCATAAAAACTACTCTAGAGTTTACTATTGATGCTTAGAACTTATGGGGTGAATGAAAGGACGGGTGACTGAAAGGACATCTCCTTCCTCAAACGTGTTATATCTTCTTAAACAAGAAATTTTGATGTACTCAAACAATACTAAGGGGGGTTGTGCAGATAAAAACAGCTATTTATAAAGGTCCATTGGTCTCCCTGGCCTTCTCTAGAATGAGGAGTGCCATGATTGTAGATATTTACCTATACCATATCGCCATTTGTCAATTCTCTTGGATTTGTCCTGATGATTACTTATCTTTAATTTTGGGTAATTTTAAATTGTTTTGGCATATTATTTGTTGTTATGTTAATTAGTATGAAGATTTAACAGATTAATAATATGGACTACACCGAGATGGCTCTTGAGGGTATTGTTGGTGCTGTAGAGGCGGCACGGACTAGCAAGCAACAAGTAGTTGAGACTGAGCACTTAATGAAAGCTCTTTTGGAGCAGAAGGATGGGTTGGCTCGAAGAATATTCACTAAGGCTGGGTTGGACAACTCATCAGTTCTGCAAGAAACAGATCAATTTATATCTCAGCAGCCAAAGGTATGAAAAATGGAGACTGATTGTGGATTCTGATGAGTTCTTGGACTAGAGAATCAGATATTTTTTCTTGCAGGTAGTAGGTGATACTAGTGGCCCCATATTGGGGTCACATCTTAGTTCTCTCCTAGAGAATGCGAAGAAGCACAAGAAAGAAATGGGAGATTCCTTTGTGTCTGTGGAGCATATGTTGTTATCTTTTTTGTCAGACACAAGATTTGGTCAAAAGTTATTCAGGAATCTCCAGCTTACGGAGAAGGCTTTGAAGGATGCTGTCAATGCTGTTCGTGGAAGTCAGAGAGTAACTGATCCAAGTATGTATATATTTATATAGCTTCATGTTTCGTGGCCATGTCTCTTATGATTTCATTCGTTCTGGTTGAATGGTAAATCCCAGTTGGCGAAAGGATCTTACTTTATCACAGCATGAGATGATCCTTACTTTGGTTGATTGGGTGGATCGTGCAATATTGTCTCTCTGGTTTGTCATGGACATTAAATTTTTCCTATTGATGGGGTGGGGGGGGGAGTCTTGTAAGGTGGTTAGAGGCTTAGAGCTGATAATATTTCAAAAAGTACTTGGCAAAAAAGGCTAATTCCAAGTAGATGGTATAATACTCCATTCACATGAACTTAGATTTAAATCTGAATTTGAAGAATGACTTTTGATATCAAAAGGTAGAATGAGGAATTAGCAAAATTGCATTGAGGGAGTATGGATGTCCTTGATATATATGACAATGCATATTACTGTCTCTTAGTGACTACATCATGAAGTTTCAAAATGATTGACCATGCTCAGTGCAAATAAGATATCTAATTGTGTCAAAATAGATTTTAGTATGGCTCCGATATTTATCAAGATCTTCCAGCGGCTCTTCCTGTCTAGGAATGAGTTTGAAATCTCGATCCTAAGGGTGTGAACATGTTTGCAATCTGACACGTTTGTTTACTATAGTCGTCAAAAGCATGCTTTGTGAGATAGCAATGCCTATCTTTGATTGTGCTACCTCAATCTCCTGTAATTTTCTCAAATCCTGAGCCTTACAGCAATTGAACAAGTGTTGCTAGAGTTAAGAAACTATTTGGTCATTGCCTTCTATAATAAAATGTCATCAACATAGTCCACTAGTAAATACATTAACCACATTGTTGTGAAGAGCGTGAAGTGAGAAAAAGCGACAACCCCCATTTCGCTTAAAGCGAGAAGCGTAGCACTCACTTTTTTGAAGTGAAGCCGAATTTTCAAAAAAAAATTAAAATAAATACTGCATAGACAACACATGTAATTGTAAGCAAATGTTCAATACTTCAATGTAAAAACTAAATAGTAGCATCAATTAAAGCACAAAATGAGCATCCTATTCTTCTACAAGATTGTGAAATTCTTGTATTCCACTATCATTATTATATTGCTCGTCATCTTCTTCAACTTCTTCTTCTTCATCAACTAGGGACAAAGTAGCCGCCTCTTTTCCCTTCCTCTGTGAACTTGAAGTTGAGGTACTCCCCTTCAAACCATAAATCCTCTCCCCAATTTCACACGCCTCCGCAACATCACCCCAAGTGAAATCAGAAGTTTCCTCAAATACTTCTTCATCTGCATGATCTTCCGGGACTCCAATTAGCCATTCATTAGCATCATTGATGTTGTCCAAACTAATTGGATCAATTACATTGCGAGCATTGTAACGACGCCTCAATGTTCTATTGTACTTAATGAAGACTAGATCATTGAGACGCTTCAAGGTTACTTTGTTCCTCTTTTTGGTGTGGATCTGCAAATAATAAGTAATTAGTAAGATTAGGACAATTGAGATATAATTTAATTATCTCAATATACTAAATCTTTCTTCTTAGTTCTTACATGTTCAAACACGCTCCAATTCCTTTCACACCCGGATGAACTACATATTAGACTTAGAACCTTAATGGCAAACTTTTGTAAATCTGGGGTGAAATGGCCATATTGCTTTCACCATTCAACTGTAGAAGTAGAACAAATTTTCAAAACATAATATTGATAAATTAATAACTTATTAACTATAAAACAACATATAAGCACTCGTTCACCTGGTGACTTCGTCTTTTTTTGTCTAATCGCCATGTTTTTTCCAAAAGTTGCTCAGCATTCCTATAAATACTAAATTGCTCTGTTATTTTATCTTGCACGGATTCTTTGGGTATCAACTTCTCAGTACATTCATAGTATCCATTCCACAAATGTTCATCTCCTAGAATCCTCTCTTCATTGTCATAAAACAGTTCCGGGTTCAAAATAAGTCCAGCTACATGCAAAGAGCTATGAAGCTTACTATCCCACCTTTTATTTATGAAATACACTTTTGTATTTTCTTTGATCACTAAAAGAGACTTGAATAGCCTTCTTTGCCCTATCCATTGCTTCGTACATGTAGCCCATTGGTGGCCTTTGCTCCCCATCCACCAAACAAAGCACTTTAGCTAAAGGACCACCAATCTTCAATGCATGAACCACATTGTTCAAGAATGAAGGAGAAAGTATAATATCTGCAAATTCTCTCCCTCGAGCTTCCCTTCCATAGGCACCGTTAGTGTACTCATCTGAAACAAACAACTTCTTCAAATTGCTTTTTTGCTCATACATCCTATGCAAAGTCAAGAAAGCGGTAGTAAATTTTGTCTTTGCAGTTTCACCAAGCTTCTTTGTTTAGTGAATCTCTTCATCATATTCAATAACAAAGGCCTTTGAACAATAGAGGAATGCACTCTAATTGCCTGATTAAAGACTGAATTGATGGGTCTTTCCTTGAAAATGTCACCGAAAATCAAATTAATGCAATGTGCTGCACACAGAGTCCAATAAATATGCGCGTACCCAACAGACATCAAATCACCAGCTTTAACATTTTCACTGGCGTTGTCCATGACAACATGAACAACATTTTCTGCTCCAATAGAGTCTATTGTACTCTTGAACAAGGAGTATATTTTGGTTGAATCAGTCAAAGAGTCGCTTGCATTAACGGACTCAAGAAACATACTTCTTTTAGGAGAATTCACCAAGATATTGATGATCATTTTTCCATTTCTCGCCGTCCACTTATCCATCATAATGGAACAACCAAACTTGTTCCATTCTACTTTGTGATCCTCCCCGATTTTGTTCAACTCTGCCGCCTCTTTTTTTTAGATATAGACCTCTAACTTCATGATAAGTTGGAGGCTTCATTCCTGGACCATATTGGCCTACGACATCAATAAAAGCAGAAAAAGTGTCAGTATAATTAACACAATTAAAAGGAAGACCTGCATCATACATCCACCACGCAAACATTGTGACTGCACGAACCCTCAAAATCGCTTTGGCATCAATTTGAGGATTACCACTTTTTCCTCCCTTATCCCCAGATTTTTGCGGGAAGTAACAATCCATAGGACCTTTGGTCTTGCCAGTAGATCCACAGCTAGAAGACATTGGTTGCATCCTTCCCTGCTTTTGTGATTTTGGTGGAAGCGACGAAGCATCGTCACCTTCTTCTGTTTCATCATCGTCATCATCATGATTATACAGTTCTTGTTCATGAATCATTTGAGTCTTTAACTCTTTTTTTTTTTGAAGGAATGCTTTCAATTCTGCCTTCACATGCGATGGAACTTTAGGACAATATGCGACATTTGGATCACCACCGATTAGATGCGCTATTTGACCGATAGATTCCCCCATTTGAAATCTTGTCACAAAAAAGACATCTAATTGCCATCTTGTTTTGTTGGTCTCGCTAACTCTTTCAGAGTAAGTCCAAGCCGGATCTTTCCTATCTTCTTTTGGTGCCATTAAAAGAACAACTACATAACACATAAGAAAGACAATAAGAACTAAGAATAAAGGATATAAAAATTAGAGGAAGGGCAGTAATTCTCTTTTAAAAATTGGGCAGAGTTAAAAAAAAAAAAACTAGGCAGTAATTCTCTGTTAAAAATTGGCATCAATTCTGAGAAAGAACCGAAGGTTAAAAAAAAAAATTTGCCAGCATTTCGCTGCTTTTTGGACGTTTAAACAGTGAAAGAAAAAGAAAAGAAGAAGAAGAAGGAGGAAAATCAGAACATACCTGTTGCTGTTGAAGACTTGAACTTGAAGTTGAAGACTTGAACTTGAAGTTGAAGTTGAACTCTTGAAGAAGAAGAAGAACCCCAGTCGATGCTTGTCGATGCTTTCAGAAGTTGATGAAGAAGAAGAAGTCGACTAGTGTCTCTGCTTTAAAACCCTAGTCGCATTTGTTCGTTTAATGAAAGACCAGACATCTTGTTTTTAATAAAATAGGGTTGAGTCTGATTTAAAACACAGAAGCGATCGCTTCTCTCGCATCGCATCGCTTTCCTGCTTCTCGTTTTTTAGTGGGAAGCGGTCGCTTTTCTACACCTAAGTCGCTTCACCCTGTTGAAGCGTGCACTTTCTTGCTTCGCTTCGCTTCTCGCTTAAAGCGAGGAAGCGGACGCTTTTTTAAACACTGATTAACCATGACCATTATAATTGTATACGGGTAAAACCGAGCCCATGATGCACCTCGATTTCCGACAAGAGAAGCCAGGCTCGAGATGTGATGGCAAGGGACAAATATCAAGCCGAAAGTCCCATTGAGCCAGAGCCCTGGGACACGATGCCTGCCCTCGGGAATATCGAGGTCATAATTACAGAATCGGTCCTAACCTCGAACAACTTCGAGGAACATTATCGGACGATCAAGCGTAGCCAACAGAAAGCCGAAATATCCATGACCGGCCGAGTATCACGACGGGGATCTCGGCACGTATCGATAAGGAACCTTCAACCAGTTAATCAGAAGACCTTTTACCTTTTACAGAGTTGTACCTAAAGTAGGACTCCCCTACTATATAAAGGGGGTTTGATAATTCATGTAACACATTGAAAACACGCGTTCCAAGGAAATATATTATCATTTTCTCTTTTATCTAGCTTTTTTCACTTGTTCATCAGTGTTGACTATAGCAAGCCCGGGATCGAGGGTGAACAATTTTACTAAGGTTGAATCTGTCTTATTCGCATGGTTTGAATTCATTTTATCTTTACTAGTTCAATCTAATCCAATTTATAGCTTTGTGTCAAATTAATCCGCGTATCCTTAAAACCACTTATAAATTCAATTGTTATCCGATTTTGAGGGTAAACAATAATGAAAGGAAATTGAACGATCAGCTTCACTTTGGACACTTCCCATGATATTGGACCACGAACTGAATTAGAGAAATTCCAAGCCACACTCCCTATAAGTGTTTTAGACCATATAAGTGACTGTCAACAGCATACTAAGCCTGAATCCTTTGAGCAACAAGCTAGAGTGGTTGCTTCATGTACACCTCCATGGCTAATTCTCCATGCAAGAAGGCATTTCAGATATCCTGTTTATAAAGTGTACATTAAAACATTGACGCTAAAAAGTTAAACAGTTGTGTGGGTGGATACCATATAAGTAGCGACGTCCAAAAAACACTTACCAGAATTAGCATGAGTTTCAGATTCAAATTCCTGCGGAGGCAAAAGACACTAGGTGATTTCTTTTTGTATGTCCAAGTCTTGGTGAACATATGTGAGAAAGAAAGAGTTTGAGAGAGAACTCAGCTTGATTATTACCTTAAGCAGTGGGCTTTAAATAGCTAAGTGCACATATTCTAAGTATTAGATCAAGTTTGAGACCTAGTAAACTTCCACAGGAAAAGGGATAGAGTCAGATACCTAAACCTAGTTATACTCTAAAGTTTTTATGTAATTAGACCTCCTTAGTCTCTATCCTAATGGTATAGTCTCATCTGTGCTGTCCCAAATTAATATATTGAAGAGAAATTCAATCCCAAAGTTGTGTGTGAATTTATATGGTATCAGAAGCCATGTCGATATCTTCCTCCTCCTCCCCTCCATCACCTACACTCGTTAACCCTCTTTCTTCGTCATCTTCCTCGCATGCACCCCTTGACCATGCTCATCACTTCATTTCAGTTAAATTAACTTCTACGAAATTTTTCTTTTTTGGAAGACGCAACTATTACATTTTCTTCGAGGACAAAATCTCCACAAAAACTCCACGGCTATATTGATGGAACTAATCCTTGCCCACCATCACACACTACGGTTGAAGGCAAAGAAATACCAAATACAACCTATGTATAATGGATCTAACAGGACCAATTGATTCTTAGCCTGTTGATTTCATCACTTTCCAAAGAAATGTTGCCCATGAAAATTGGTTTAAATACCTCCAAAGCAGTTTGTGATGCACTTGAGGCAGCCCTATCCGAACCTTCAAATGCACGAATCCTCAATCTTCATATGCAACTTTAAAACTTGAAGCAAGAAGATCTTTCGGTTACTCAATACTTGCACAAGGCCAAACTCATCTCCGACGAGTTGGCAGTTGCTGCAAGGCCCCTTCGTCTTGCCGATCAAAATGTGTACATCTTTAAGGGACTGAGATCTGATTTCAAGGACATTGTTACAACTCTCTCAGAACGACATGAACCAATCACATTCTCAGAACTTCACAGCCTCTTGCTTAACCATCAATTTAGACATGGTTCCTCTATCTCCTCACTTTCCTTAACCACCCCAAAACCACCTGCTCTTCAACAATACCCACAGCTAACTTCAATCAACGAACTACAAATCTGATCGTAATAATGGTTTCAATTCAAATAGGGGACGAGGCAGATCTTCGTGTGAAAGAGGGGGTAGAGGTGGTTGTTCATCCTCAAGGAATTTCTCTAACAATGGACAATCTTGGTCTCAATATGATCAGCGAACCCGGTGTCAAATATGCAATGGTACCAACCATCTTGCATCAACTTGCTTCCAGAGGTACAATCACTTGATTAACCCTATGGCTTATTTGTCTAACCAAGCTCCTTTACCCTCAACTTTGCAATGGTTTGCGGACATTGGAGCCACTCGCTACATCACTTTGGATCTCACAAATATTCATCAAGTTGAAGATTATAGGGGTTCAGATCAGGTCCAAATTGGCTATAGACATGGCCTTTCTATCCATCGCACTGGTAACTCCTCTCTCTGATCACCCTCTTGGTCTCTCTATCTTAAGAATATCCTTCATGTTCCTTCAATTACCAAACGTTTACTCTTTGTTCAACTTTTGCTCGTCACAATAATGTCTTCGAACTTCATCCCTTTCATTTTGTTGTCAAGGATCTACAATCCAGGACACCTCTTTTTACAGGGCAGAGTGATGGCGATTTATACACACTTCCATCCAAGTCTTCTTCTTCTTCCATCTCCCAGCCAGTTCCAGCCTCTCCAACAGCTTCTCTATCCATCAACACATCACCTTCATGCTGACATCTTCATCTTGGTCACCCCCATCAACTAGTACTTACGCAGATTCTTAGGACCTACTACAATCTGAAATGAATGCTTTGCTACGAAATAATACCTGGTCTTTGGTTCCTCATAATCTTTCAATGAATGTTTTAGGATGCAAATGGGTGTTTCTCATTTAAAAAAATTTCTATTGGGGCAATTAAGAGATGAAAAGCCCATCTTGTGGCTAAAGGTTTTCATCAACTTGAAGGCCAGGACTACTCTAAGACTTTCAGTCCAGTTGTAAGGCTGCAACCATTTGCATTGTTCTATTTTTAGCAGCTTCACATGGGCGGTCTCTCCAACAATTTGATGTGCAAAATGCAGTTTTACATGGTGAGCTTCAAGACCATGTGTTCATGAGCCAGCCTTCAGGTTTCATCCATCCTCTTTTTCCTCATCATGTTTGTCAACGTAAGAAGTCACTATACGAGCTCAAATGGCTCCCAAGGCATGGTATATGCGTCTCCATAAGTTCTTGCTCAGCGTAGGCTTCATCACCTCTAGATCGGACACTTCCCTGTTTGTCTGCAACTCAAATGGTGTTGTCGCCTACCTCTTAGTATACGTTGATGATACATAGTCACTGGCAGTGGTACCTCCTTTTTAGAATCCATTTTCCTCAAACTTGGAGATGTCTTTTCCATATGTAATCTTGGTCCTCTCAGTTTCTTTCTTGGTCTTCAGGTTTCACGTGATCACCATGGCATCTCTATGTCCCAAGCTGAACACATTAAGACTATTCTTGCAAGAGCACGTATGTAGCACTGCAAACCTTTAATTACTCCCATGGAAGTGAATGTCAAACTTCACAATGGAGAAAGTCTTAGCTTTCATGATCCTACCTTGTACTGTCATATTGTGGGCCTTACAGTATGTTACTCTCACTTGGCCGGACTTAGCTTTTGTGGTGAATAAAGCTTGTCAATTCATGCACAATCCTACTATGAGTCAGTGGGCAGCAGTCAAGCGCATACTCTGCTATTTGATGCATACCCAACGTATGTGTTTTCACATTCCTAGGTCTCTTACACTCACTTTTCAAGCCTTCACACACTCAGATTGGGCAGGTTCACTCGATGATCGTAAGTCCACTACGGTTATGCCATTATCTTGGGTGAAGCTATTCTCATGGTCGTTCAAAAAGCGGCGCATTGTAGTAGATCTTCCACAGGTTCAGAGTATAAAGCTTTAGTAGATGCAGCTGCCGAGCTGACTTGGATTCTGTCTCTCTTGTTTGAGCTTGGTGTTTAACTTCCCAAAGCTCCAATTCTATGGTTTGACTACCTATCTTTCGGTAATCCTGTGTTTCATGCACGAACCAAGCATGCGGAAATTAATTTTCACCTTGTTAGACAAAGTAGCTCGAAAGGATCTCACAGTTCAATTTTTATCCTCCAAAGATCAGCTTGGTGATGTCTTCACAAAGCCACTAGCTTCCTCTAGATTTGAGTTCCTTTGGTCGAAGCTCAATGTGGTTTATCCACCTCAGCTTGCAAGGGAGTATTGTATCAACTTTGAGTCCTGGTAAACTTAGGATATAGTCGGGTACCTAAACCTAGTTATACTTTGAAGTTCTTATGTAGTTAGATCTCCTTAGTCCCTATCCTATAGTATATACTCATCTCTATAAATGTACGACCGCTGTACCAAATTAATACATTGAAGAGAAATTCAATCACAAAGTTGTGTGTGAATTTATACTAAAAGGAAAGAGAATTAATCACAATGAAAATACAATCAAGCTATACTCTATTTACAAGATCCTAGAATATTCTAGAATATTGACAAGATTCTAATAAGAGTTAGCCCGTATCTGTGCTGGTGGGAGGTAGCAGGTATCCTTTAGAATTAGTGGAGGTGTGCGCAAGCGCCAGAACACCGTGGTTATTAAAAAAAATCCCTACGAATTATTGGTATGAATAACAACAATCTCCTCTATCATGGAATGACTCCATATTAGATGTATCAATAATTTTCCAACGTCTTTTAAATGTGGAAACACTAGAAAAAGTTGACACATTGATTAAGAAAGTAACAAAATGATACGGGAAGTACGAATATCTTTGCGTGAGGTAATGAATGGAGTTGTTATGTCTGTTGAAGGCATGATTAGGATTAGTGATGGAGGAGGAGGAGAAAGTCATTGAATGCTGGCGATGTACTTGTATCATGAGAAAAATCTCATCTGTATTAGTAGGAGGATGTTTATGAACTGATTAGACTCGGACAGGACAAACTTGTGTTTTAAGAAGTTAGGGACTAGTAGATGTCAGGAGGGAAATTTTATTTCATAGAATTTGGTGAAAAGTATAGAGATGAGTGGGAGGTTACATTTGCTGGGACGAACTTTACTTTTTGGGGGATTATAAACATTTGCAACTTTTCTTGAAAATGGGTGAGTAAAAAAAGACACAATTCTGATTAGGATTGAGGTCTGTCTTTTCCAGTAGTGTGATCATGAAGGAAGCAAGTACATCCAAGCACTTCCAATTGTAGATGGTGGAGATTTTGATCTGGAAATTTTAGTGTGGACAGTAAAGTTGCAACATAGAAAAGATTGATTTGTACATTTGCCATCGATAACAATGCTTCTAGCAACATGATTCTAGGGTGAATGGGTTTGTTGGGGCTCTATGAGTCTTCCTCGAGTGGCGGTGAGGTCTAGAAGTCGATGCAAAGAGAATTGAGTGGTACAGTCAGTGACTATGACTCATTCTGTCAGCCTCTGTAATTTTTTGCCGCTATAGCTTTTCAGTAAGTAGTTTCTTCCCTGGTTTTAATGTCATGAAAACTAGAAATAAAAGAATGAAAAATGATGGAAAAACTTGATTATTTCCTCAAGTTTGATTAAGAACTTAAACTAGTTACAATGTTGGTAAAATACAGAATATGGTGAAGTAATTCTCCTAATGTGACTCATTCATAGCAACTGAGTTAACCCAATAAATGCAAATGACTGGACGATCTCGACAAATCCAAGTCTATCAATTTCAACAAGTTTCGCTGCCTTAATCATGACAATATATTTAAGTGATGGTCATTAAATTGGAAAGAGTTGCTGTTGCTCTTGTTTTTGCCATCATTCAGCTGTTCACTGTGGTAGATTATGGTTTCCTACCAAGTCCAATGAAACTGAGCAGTCTTGACAATGCCTGATGTTCAGTTTCTAAAGTCTGTTCTCTCTCCAAAAGTAGAGGAACATAATGTTATCTGATTGGCTCGGGAAAAAGTTGTATGTAGGGTGAAACCTTAATGACATAATGAAACATGTAATGGTCTTTGTGCCTTTGGTTCATTATGTCTGCTACTAGATACTGAAATTGCTGCTGAAAGTGCTTTTTGAGGTGTCACTCATTTTTTCTTGCTGCTATTAATACATAGCGTTCTGATTCTTTTCAGACCCAGAGGGAAAGTATGAGGCACTTGAGAAATATGGAAATGACTTAACTGAACTTGCCAGACGTGGAAAACTTGACCCGGTGATAGGAAGAGATGATGAAATACGGCGCTGCATCCAAATATTAAGTCGGAGGACAAAGAATAATCCTGTTATTATTGGTGAGCCTGGAGTGGGGAAAACTGCAATTGCCGAAGGGTATGATCTCTAGCCTTTTTTGGTCTCACGGGGTGATGTATGAACATGTTTTTCCTTATATTTATTTGTCTGGATCCTGGTTCAGTTGAAAACAATCCAAAAACAGTAATGGAATAGCAGATCTGTGGAGGACATTTTTATTATTACTTGTCTACAATGATATTCTTTTGGTTGGATATGCTGTTTTAATTAGTTTTGTTGAATAATGCTGCCTGGCAAACCAAACATTGAACTTTAAAAGAGTTATGTTTCTAGAAAAGATATGCTCGGAATAGCATGATTATCCACTCAGAGAGGTTTGTGATATTTAGCAGACTGTATGTGGGGTTTACAGGAAAAAATCATTGCTAATACATATTGTTTCTGGACAATACGTTATTCCTGTTAACTATTTGAATTGTGAGATGGGTCAGGGTGTGTTATGTGGAGTTGTTGAAAATTAGTAATTGTAGGATGGAATGAAACAATTATAATCTTTTTTGTTTACCATGGTTCCTTGTGTTTATTTTTAATGAGCAGTGGCGTGGTAATGAAATGTAGTTATAGAACTTCTTACTAAGGGGTCTGCCTATATGTTGCAGAATCAGGATCGCTGACAAGGTTCAGTGAAGAGTTCTGATGGATGCATTCAACATTTCTTGAAAATATATATCAAAATAATGCTGTTGTTAGTGAATCACTGAATTTGTGTTAACTTCCCTTTGATTTGAAACTACCTTACCCGTAAGCCTGTAAAATTGAGGTGGCTTGCAGAGCCTTTGAATGATCAAATTTCTATTTAATAATTAATATAATGCCAAAATTGTGGCAGCTTGTAGAGCCTCTGTGTGACCTCGTTTTAACTTTTTACATAGGTCCAAGGTAGTGTTAGCTGTAATATTTGCGTGTACTTAAGAGTCCTGTTCAAGTGGTGGCATTTTTCACGTCATCTACTTTATGCAATATGTTATGTTTCCAGCCTTCGTGAAATGGGGATGTGTTTTTGACAGATTAACTGATAAAAGTCAATCAGTTCTGTCTCTTGTAATGATCTTTTTCCAGCAAAGGAGGCTTTTTATTTATTACTTGTTACCAGTACGTTATGAGATTTAGAGCCTTTGTGGTTTGGATATTTAAAAAGTTAATCAATACTTACTTTTTATTAGATCAGTTGTGCTAGTTGTAACTTACTTATCTTACATATCGGAATAATTAGTTTGGTTTAACTGCCAGACGAATGTGTTTCTGCACCGGAAGTGAATAATTCTAAATTGATATTGGATAGTGACTTATTTGAGTGTATCTGCAATATAGTTTTTTTTTGCTTAGACGTAGAGAAAGCATATGATGAGTTGGGAAAAAATGTCGTTTAGTGGGTGCTTGAGAACAAAGAAATTTCTACTCAAAATATATAAGTACCATAAAAGATATATATGAAGGAGTGGTCACAAGTGAGGGACCAGTGGGCGGAGATATTGAGGAGTTCCTTGTAACCGTGAGTCCATAATAGGGAGCTGCCCCATAGTTGCTTACCCTGGCTATGGAATAAGTTATTCAGTAGCAAATAGGACGGCATTATGGGATATGCTATTTGTCGATGATATTTTGTTATTTGACGAGGGGCAGGGCAAGAAAGTAACAATCATACCCGAGAGAGTCAATTGTTAATTGACGAAATTAGTTAGGAGCCGATCAAAAGTTGGAATTATGGAGAAGCACGCTAGAGAATAAGGATTTTACAATAGGTAGAAGTTAAACAGAAGATATGCAATGCAAGTTTAGCTTTTGAAGTGAGGTTAGATAGGATACTAGTGTCGAGACAAAAAATTCAGATATCTAGACTCGATCTTTCAATAGAATGACATGATAGATGAATATGTAACACAATAATATGATGGCCGAATTGAAGGAAGCCCATGGAAATGCTTTGCGATAATGATATAGCTACTAAAGTAGAACATAAGGTCTATGGAAACTGGTGATATAAACAGGGTTATATAGGAGTAAAGGCTGGAATTTTAAGACCCACAATATCGGCAAGATTAGCATCGTAAATATGTGGTGTCAGATGGATGTACTATACGACCCAAAAATGGTTGAGTCCAAGTCACTACATTGACAAGATGAGCATCGCAAATATGTTGCGTCAGATGGATGTATACACACCAAAAATGGTTGTGTCCATGTGAAGGTGCATGTAGCACACAGTGATAGTAAATTGAGAGATGGCCACATTTCCATCATGTCTTTCCTAGGCCTTCCAAGTGCATTGGTTTATTGGTGAGACTATGATGACTAAAGCTGTTGAAAAGGTATGAACTAGATCTAAAATTACATGAAGAGAAGTCATCTAGAATCACCTACAATCTCACAGAATCTGCGTGGATTCATTATGAACATAGCACAATGAAAGCAAATTATCAAAAGCATGCAATAGTAATTAGCTAGAGTTGAAGCCTAGTCGCTATTGCTCTTACATTAGGTTGTGTGTTTTCCAGGAGCTTTTAATTTGGTTAGAGATTTGTATATCAAATGTGAGATATAGAGACCCTCTAGTTTTAACAAACACCCTTAAGTTATCTCAAAAGAGAGCAAAATAGATAGATGATTCATATAACGATCCCAACTAGGTTGGATTTGAGGTATTGATTAGAATGATTGATATAGTTATCCAAATTTTAAAAATCAACCTTATTAGTAAGGCAAAGATGCTCTTAACATGTTAAAAAGAAGTCGAACAAGAAATTGTTCTTCCTTTCCTTTGATATAAGATATTTCTCTCCCACCATCCTGGAAGGAGAAGGTAAGGATTGTGAGAATGCATGGGAGAAAAATCTATCTTTTTAAATATATGATACAACGAGCCCTATATATAATATATATTCTACTCCTACTACATATAGGACTAGGACATATTCTACTCCCACTGACTGGAATGATGCACACAACGGCTAGAATAGCCTCCAGAGAGGGTGGAGCAGCAACCGAAGAAAATGTTGGCGGGTTGGGCTGCCGGTAGACAGAAGTTATTGAGTCTCTACTAAGAGAATAGAGGTTTTGGTATCATGAGAAAAGAAGAGAGTACTTATTGACTTGATTATTGACACAATGAGAGTGTTTTTTATAAAGGATTCTTATTCTAGTGATATAAGCTTAAGTATTTATATTATGCTAATGATATGAATAGTGATTTTTCTCTTGTAGTTAAGTAAAGAATACTCCAAAATATCATATAAAATATTTTACATATTCTCCTATTCAGATACAATGAGACTAGGCAATATTAACCTATAATTACTTTGATTCTTTTGGTATTTTAACACGGTTTTTAGTCTTTAAAAGAAATCAGGAAAAAGAGAGGTGAAAGATGGTATCAGCCAAATCTATATGACAATTAACCTCGGAATGATTATGCTTTCTGCTCCTCATCCATGCAGGTTAGCTCAAAGGATAGTCCGCGGTGATGTTCCTGAACCTTTGATGAATCGGAAGGTTATAACTTCTCTTCCTTGGTCTTAATTTGATTGACTTTATTTTATTGTGAAAAGGTCAAGTGATTGTGCATATGCAAGATCTTGGAAGATGGCCTTTGCGTAGTGTGCATTGGATTGCTTTCTCTTCAATTTGAAAAAGATAGTATCTGGGAAAGGCATGTCTTGAAGAAATAAATCAATGGTTGAAAACCTGCTGCCTTTCTCTTGAGAACCATGAGCCGTAACTTGTTGCTTACTGCAAATAGTTCTGTTTCTCGTGTTGCAGTGATTCATTTGTATGACGCCAGTATTATTGGATCATAATTTTCTTTTCAAGATATGAGTCTGCTAAAGACAGTTGATGCAGTCTTTACTCAAATGAACGACCGTGGTTAAGTAACATATTGTTTTAATCATTTCTGGGAGCAATTTCAGTCGAGTTTGATACTAAGTGAGATATGCATAACATTGCACTCAAATGTCAAACTAAATTTTGATCATCTAAGCTAGTCCGATGAGTTAGTCTCTTTTCTGAAACCCCCAACCCCTCACTCAGCAAAGGGGTAGGAGAAAAGGAAACGAACCACCTCATTCTCTAAAAAGAGTGGAACTAAATTATGATGCCTCCTAAAGTAAAGTAGTTGGACATAAGTTTCTGAAGTTTCAAATTTGAGAGTGGGATTCCGAGGAAAGCCTTCTGTAGCTTGAACCTTAATTGTCTACTAGGATTTGTTAATTTCGACTTTTAAGTGCTAATACTGTAATTTGATTTCCTTTTTCCTTTTCATTTTATTTAGTTGATGTCTCTCGATATGGGTGCCTTGCTTGCTGGTGCAAAGTACCGTGGAGATTTTGAGGAAAGGCTGAAAGCTGTTTTAAAGGAAGTCTCTTCATCCAATGGGCAGATAATATTGTTTATTGATGAGATACACACTGTAGTTGGTGCAGGTCTGGTACTTTTTTTTTAATATCCATTTCTCCATGAAGGAAGAAGTTTATTTCTACCGACTGGTTAGAAAATTTGCCAAATGTATTCTTTCTCTCTAAGATCAAATCTCTATTATTTATAGAGTTCGATATTAAAGAAAAGTGCTGACTCAAACTGCTTGTCTGCTTTCAAATCTCATGGCGTGTGGACAGCAAGTAGTGACTGCTCTTTTTGTTGGATGATTTTTATCCTTGCTTAATTCAATCTTAAACAATCCAAAGTTCATGATTTAATTCATTTGATGTCACTGGGAAACACTTGTTCTTCATTCTAGTGAGGTTAAAAAGCACTATTGCTTCGTGATTGTGTTTTAGGTGATCCATTTTTAAATTTGAGCCTGGAGCTGGGAAATGCTTGAACTAAAGTTTACTTTTCTGTTAATTTGAGCCTGGACCTTGGAAATGATTGGAATAGGTTTATTTTTCTGTTCCATAAAGCTGATAAAACATTACTGATGATGTCCTTTATGTATGCAACCGAGTAAAGGAGAAGGTACCTTTACACCCAAGATATAGTTTCTGTTTGTAGCTGGCTTATATTAGTAACTAATTCAGCGAATGTTACTTGGTCAAATTATGTTGATACGTTATATTCTAAAACTTTTGTTTGTTTATACCTTCCTAATCATTGTGTTTGCTTATGCTTTGTGTCTAGGAGCTACTAGTGGGGCCATGGATGCAGGGAATTTGTTGAAACCCATGCTTGGTCGGGGTGAACTTAGATGTATCGGAGCAACCACTTTGAATGAATATAGGAAGTACATTGAGAAGGACCCTGCTCTGGAGCGCAGATTTCAACAAGTATATTGTGGCCAACCATCTGTGGAAGATGCAATTTCCATCCTCCGTGGATTGCGTGAACGATATGAGCTGCATCATGGTGTTAAAATATCAGACAGCGCTCTTGTATCAGCTGCAGTTCTTGCAGATCGATATATCACTGAGCGATTTTTGCCGGACAAGGGTAGGCTAATGTATCCTTAGAACTGCAAGTTGTCTGAAATACTTGCTTTTCATTCCTATAAAATTCTTGTGAACGTTTTTCATGATATCTTCAAATAATACAGCAGCCTAATGTTACTTTTACATAATAAGAAAGTTACAGGGTTACAAGTAGCTTATTTTTATGGCTTCTTTACATGTTTTATTGCATTGAGTGGATCAATGGGTCCAGATTTTCAAGCTTCTTCTAAATGTTTTTAGCTGTGCGTGATCTGACATACGTTACTTGGGGCTTTTCACTTATGCTCAGTTCTTTCTTTTCAGCCATTGATCTTGTTGATGAAGCTGCTGCAAAACTAAAAATGGAAATTACTTCAAAGCCAACTGAATTGGATGAGATAGATAGGGCAGTGCTAAAGTTGGAAATGGAGAAACTCTCCCTGAAAAATGACACGGATAAAGCATCTAAAGAAAGACTTAACAAGCTAGAAAGTGATTTGAAGTCCCTTAAGGCAAAGCAGAAAGAGTTAAACGAACAGTGGGAACGCGAGAAAGATCTGATGACACGTATACGTTCTATAAAGGAGGAGGTAAATTGCATCTTTCATTGATGAGGTCAAATCAAAGTTGCAGTTTTTCTTTGTTTTCTCATGATTACTGTTCAATTTTTTCCGTTGCGTAGATTGACAGGGTGAACTTAGAGATGGAAGCTGCTGAACGTGAGTATGACTTGAATCGTGCTGCTGAACTCAAGTATGGCACCCTAATCTCCCTTCAACGGCAGCTAGGAGAAGCAGAGAAAAACCTGGCAGACTACCGGAAGTCTGGGAGTTCGTTGCTTCGTGAAGAAGTAACAGATCTTGATATTACTGAAATTGTTAGCAAGTGGACGGGTATACCACTATCAAACCTTCAGCAGTCTGAGAGGGACAAGCTTGTCTTTCTAGAGAATGAACTTCACAAAAGAGTTGTTGGTCAGGATATGGCAGTAAAATCTGTGGCTGATGCAATCAGGCGATCTCGGGCAGGCCTGTCCGATCCAAATCGGCCCATTGCAAGCTTCATGTTCATGGGTCCCACTGGAGTTGGCAAAACTGAACTTGGAAAAGCTCTTGCTGCGTACCTTTTCAATACTGAAAATGCTCTGGTGCGTATTGACATGAGTGAATACATGGAAAAACATGCTGTTTCACGGTTGGTTGGTGCACCACCAGGTTATGTTGGATATGAAGAGGGTGGGCAACTCACTGAAGTGGTCCGTCGGAGGCCTTACTCTGTGGTCCTTTTTGATGAAATTGAGAAAGCGCATCATGATGTTTTTAACATTCTCTTACAGTTGTTGGATGATGGAAGAATAACTGATTCTCAAGGGAGGACTGTTAGTTTCACAAACACTGTTGTAATAATGACATCAAACATCGGGTCACATTACATTCTTGAGACGCTGCAAAACACTCGAGATAGCCAGGAGGCAGTTTATGATGCGATGAAAAAGCAGGTTATTGAATTGGCAAGACGGACTTTCCGGCCTGAGTTCATGAATCGGATTGATGAATACATTGTTTTCCAACCTCTGGACCTTAAGCAAGTTAGCAGAATTGTTGAGCTCCAGGTAATACAGATCTGTAATCTGTTGAATTCTGATTCTCCTGACTTCATACGTTTTTCTTCTGTGTTGTTTTCTGTTTGCTGCGGTGTCATCTGCTTTCTGATTACTTTGACTTTAAGAGTTTTATAAGCACTACAGCAGATTACTGTTTGTGCGTTATCTCTGTAAATTTCAGTTTTTCTGTGTGAGAACAAAAAAATGTTTTAGTGTGCATTAGATCTCAAAATTACACATAAGTACATCTCATTTGCTTGGTGGTCGTCGTCCTAGTTTGTCCTCCTTGCTGCTTTCTGATGAGTGCATGGTTGAGTATGTCAAGCTTGAGAACTGCAGCGCACTGCGCATCCTGTCTAATGTCTGCTCTTGCAGTAGTTTTCTAACAGAGTATAATGTAAAATATATCATTTCATCTGGTGGTTAAGCTTTCTCCAAGATGAAACATAATTTGATATCTGTTCTTTGTGGTTCTTAATTTGGGGAAAGTGTTTGGCTATTTCTTATTTTAACCTTATCATCGCATCTGCAGCCATAGCATAACCTTGGTGAGGTTCATGGGAAAGAAAGTTACCGAGGCTACCTGACAATATCGTTAACTGATGAAAATATTTGTAGAACAAAACTTTGTGTTTTCATTATCATTACTATATTAGTTGCTCTCTTTTATCTTTTTTTCGTCCATCTTTTCTGTTTGAAGAGATTTTTCTTCTGTTTCATGAGATAATTCGAGGTGGAACTGCTGAGTGCTATGTATTACATGCGGCTGATTATCTATTTCATTTTCTATAAAACCTTCTCTCTATCGTGAGAAAGCGAAGGTCTCCCTCTGAAGTTATACTGCTGATATTAGAGTTTCTTAGAACCTGACGACTAGTTCTCTTTTTCTCTTCGCAGATGAGAAGGGTGAAAGACAGACTCAAACAGAAGAAAATTGATCTTCATTACACGCAGGAAGCTATCAGTCTACTGGCAAATATGGGCTTCGACCCTAACTATGGAGCTCGACCCGTTAAACGAGTGATTCAGCAGATGGTTGAGAACGAAGTAGCAATGGGTGTTTTAAGAGGAGATTTTTCGGAGGAAGACATGATTATCGTTGATGCTGATGCTTCTCCTCAGGGGAAGGACCTTCTTCCCGAGAAGAGACTGTTGATACGAAGAATTGAAAATGGTTCCAACATGGATGCCATGGTTGCCAACGAT
SEQ 75
GTGAATGTGAAATGTTTCTTTGTTTCTTTCTTTTTTTCTTTTTCTTGTATGTCACTTTTTTTTTTGCAAGGCTGGAACTTTGAAACTTTTTGTTTGAAAACACAATCATTCGCAGTAACAAACAAGAACCACCGTCCCCATCTTCACTCCCATCACTCTTCTTTTCTTTGTTTTCACACTTCATATTTACTCTTCTTTCTCATCCTTTATATTTACATAGCAAAAACAACGTCAAGATTTGCAAAAACACAGCAACCCCCCCAAAAAATGTCAAGATTTACAATGCTAGTAGTTCTTGTTCTTCTTCTTCTATGTCTATGCCATTTATCAGTAGCAACAATAGGAAGTAGTAGTAATAAGAAGAGTACTTACATAGTACACGTGGCAAAATCCCAAATGCCGGAGAGTTTTGAAAACCATAAACACTGGTATGATTCATCACTAAAATCAGTTTCTGATTCAGCAGAAATGTTGTATGTTTACAACAACGTTGTACATGGTTTCTCAGCAAGACTGACTGTTCAAGAAGCAGAATCACTTGAGAGACAAAGTGGGATTCTGTCTGTTTTGCCGGAGATGAAATATGAACTTCACACGACAAGAACACCATCTTTTCTGGGTCTTGATCGAAGTGCTGATTTTTTCCCAGAATCAAATGCTATGAGTGATGTGATTGTTGGGGTTCTTGATACTGGAGTTTGGCCAGAAAGTAAGAGTTTTGATGATACTGGACTTGGACCTGTTCCTGATTCTTGGAAAGGAGAGTGTGAATCTGGTACCAATTTCAGTTCTTCAAATTGCAATAGGAAATTAATTGGTGCAAGGTAAAACTTTTCTAAAAGTTTATGCGGTTAGAGACAAGACATTTTTAAGTTAGTTAATTATATTATATCTCAAATTGTGGTCGCGAGGATTCATATTGCTTACTTCAACTTTTTTGGGACTGGGACGTAGCAGTTGTTATTATATGTTAAACTCGTCCTCTCGCATGTTGGTCTGATTAATTTTATGATTTCTCTAGTTGGCAGTGAAATTTGAATCTGGGATTTTTTGCTTGGTTTGATACCATGTTGAGTTGTCTGATTAGTTGCATAACTAAGTGGTAACTGGTAAAGCTGCTCCCATATGATCGGAAGGTCACGGGTTCGAATCGTGAAACCAGCCTCTTGCTGAAATGCACGGTAAGGCTGTATACAATAAACCTTTTGTGGTCCGGTCCTTACCCGGATACTGCTATGGTATAGCGGGAGCTTAGTGCACGGGGCTCCCTTTTGGCATAGCTAAGTGTGTTGAAGAGGGTTTAGTTAAATCCCATTCATCAGAGTTGTACTGTACAAATAAGCTAAAAATGAATTATTTTTGTGTATGTAAATTGGTGTATCCCATTGATAATACAGGTTTGTACTTTTTTGAATTTCCTTGTTAGAATTATTTTAAAAAAAAATAAAAAATATCATGGCTCTGCCACTGTTGTGCTCAACTTATCTAAAAGCTAAAACTATTAGAGATAAGATATACTTTTAATTACTTAATCATATTATGTCTGTTGATAGGTACTTCTCGAAAGGTTATGAGACCACTTTGGGTCCAGTTGATGTATCCAAAGAGTCGAAATCTGCGAGGGACGATGACGGACATGGAACACACACTGCTACTACTGCAGCTGGTTCAATTGTTCAGGGCGCTAGTCTCTTTGGTTATGCTTCTGGAACTGCTCGTGGAATGGCAACACGCGCTAGAGTTGCTGTGTACAAAGTTTGCTGGATTGGTGGTTGTTTTAGTTCTGATATATTAGCAGCTATGGACAAAGCAATTGATGATAATGTGAATGTGCTTTCTTTGTCACTTGGTGGTGGCAATTCAGATTATTATAGAGATAGCGTCGCAATTGGAGCATTTGCTGCTATGGAGAAAGGGATTCTAGTCTCTTGCTCTGCAGGTAATTATGCTAGTCGGAAAATATGAAGAACTTCTAGTACTTCTTAATTATTACATTTTATTTTATACTAGACCAGACTAGTTTAAAACTGAGCGACATTAACAATGAAGATTCATTCATATTGCCGATTCTAACTTGCTTGGGATTGAGACGTAATTGTTGTTGTTGCTCTGCAGGTAACGCTGGTCCTGGTCCCTATAGTTTGTCCAATGTAGCGCCGTGGATAACTACTGTGGGTGCAGGAACATTGGACCGTGATTTTCCTGCATATGTAAGCCTTGGCAATGGTAAGAATTTCTCTGGTGTTTCACTTTACAAAGGGGATTTGTCGCTGAGTAAAATGCTTCCGTTTGTGTACGCTGGTAATGCTAGTAATACTACAAATGGAAATCTTTGCATGACGGGTACCTTGATTCCTGAGAAGGTTAAAGGGAAAATTGTTCTATGTGACCGCGGGATAAATCCCAGGGTCCAAAAAGGTTCTGTGGTAAAAGAAGCTGGTGGGGTCGGTATGGTTTTGGCTAACACTGCCGCCAACGGGGATGAGCTGGTGGCTGATGCCCATTTGCTTCCAGCAACGACAGTTGGTCAGACGACAGGGGAAGCAATCAAGAAATACTTAACCTCGGATCCTAATCCAACCGCTACAATTCTTTTCGAGGGAACTAAGGTGGGGATCAAACCATCACCAGTGGTTGCTGCATTTAGCTCCAGAGGACCAAACTCAATCACGCAGGAAATTCTCAAACCGGACATCATAGCACCAGGTGTTAACATTCTCGCAGGGTGGACAGGTGGTGTTGGACCAACAGGGTTGGCCGAGGACACGAGACGTGTCGGGTTCAACATTATCTCGGGCACGTCTATGTCTTGCCCGCACGTGAGTGGTTTGGCTGCTTTGCTTAAAGGAGCGCACCCCGATTGGAGTCCAGCGGCTATTCGCTCGGCTCTTATGACCACGGCTTATACAGTGTACAAGAACGGCGGTGCACTCCAAGATGTCTCGACGGGAAAGCCATCCACACCATTTGATCATGGTGCAGGACATGTAGACCCTGTTGCAGCACTAAACCCCGGACTTGTTTACGACTTGAGGGCTGATGATTATCTGAATTTCCTCTGTGCCTTGAACTACACATCAATCCAGATTAATAGCATTGCTAGAAGAAACTACAACTGTGAAACAAGTAAGAAATACAGTGTCACTGATTTGAATTACCCTTCATTTGCTGTTGTTTTTCTAGAACAAATGACTGCAGGCAGTGGAAGCAGTTCTAGCTCCGTTAAATATACACGAACGCTTACTAATGTTGGACCAGCAGGAACATACAAAGTTAGTACTGTTTTTTCATCAAGCAACTCAGTAAAAGTCTCGGTTGAGCCTGAAACATTGGTTTTTACTCGTGTGAACGAGCAGAAGTCATATACTGTGACTTTCACTGCTCCTTCAACTCCATCAACTACGAATGTGTTTGGTAGAATCGAGTGGTCAGATGGCAAGCATGTAGTTGGTAGTCCAGTGGCCATTAGTTGGATA
SEQ 76
ATGTTGAAGGCTCTTACATCCTCATGTCTGCAGAATCGTTTCCACGCCGTCACAACGGCATTTACCCCTCAAGTTCGCCGTGGCACTGACTCGAATACGCCCTTGCTTCGGGTTTTAGGTTCGCTAAGAAGTTCGAATCGCAGGGTCCCTTATTTGTCTCGACGATTCTTTTGTTCGGATTCTACTGATGGGTCCGAATCGAATTCCGAGGCTGCTGCATCCGAAGCCAAGCCGGCCGAGGAAGGTGGAGATGCTGATTCTAAGGCTTCGGCTGCTATGGTTCCCACTGTTTTTAAGCCTGAAGATTGCCTTACGGTTAGTTCAAAATAATTCTTTGCACCCGCACCGATAGATTTAGACGTGTCTTTAAAATAAATTGTATGACTTTTGTTAACTAATGTACATTCTCAGTTCAATTTATCACTTCATCATTATTAACTAACATAATTTGGTGCATAATTATGTATTTTCCTGCTCCATCATTATATAAGTACATTTTATGCTAATATTTGATAACTGCTAAATGACTCCTTAAGAAAAGATGTTAACTTTTTGTTATAACGGTGTGGCCATGTCTGCTTGTGCACCAAAAAAGAAGATATTAAGTGAGTTCCTTCTGTTGTTAGTGCTGGTTTTATGTCTTCTTTGGGTAGTTTTGATATGATTTTATTCATTATTATTCAAATTTGTACATATGGATGCAAACATGACGCAGAATTGGGAACAATTTGATATAGAAATTATTTTTAACTTTGGTTGCAACACTCTGTTAAGATTTCGGCAATCGAAAGGGATGGTTGTTTAAGAAGTTAACTTTCTAGGATATAAAAAAGGTGGATCACCAAGATTATATTTCTCGTAGATTCGTACACTCAATCATTCTTTACTAAAAGACCTTCCCGACCTTGACTTGATATAATGCTGTGAAGGTCAACATTTTATTTTTCCAAAGAGAGGCTTTTGATACCCTCTTTGTCTCTAAAAGAATGGTTTTATTATTAGATGCTTCGATCCTCATATTTTGAACCCAGCCATTCCGTTTGAGATGAAAGAATAGTATTGTAGACTTTTTTAAAATCAGGCCGTAGTATTAAAGAATCTCGATATCAATCTTCTTAGGATCCTCAGACACCTCACATCGATGACCAACACTTGTAATGTTATTGTAGCTTCAATTAGCATTAGTATTTTAATGAAATGAGAATTGACAAATTTTTATAATCTAGTCGTCTTTTTAAAGATTTCTTGTCCTGAATCTTCTTAGGATCATCAGAGAGCCTCACATCAGTGAGCAACTCTTTTAATGCTATTGTAGCAATGTAGCTTCAATTAGCATTACCCTTTCAATGAAAAAATGTGGTATTTCAGTACCCCTCCTTCAACCACAATGTTATTGATTTTTGCTTCCTTGATATCTCCTCCACTATTTATTCTGCTTTTATATGGCTTTTTGGTACTATCCCTTCTTGTCTATATTTTCATTAATGTGGTGCTTATGCTTTCCTGAGCCGAGGGTCTATTGGAAACAACCTCTCTTTCATCACAAGGTAGGGGTAAGGTCTGCGTACACACTACCCTCCCCAGACTCCACGGGGTGGGATAAGACTGGGTATGTTGTTGTTGTTGATACTTCCTCCACTAAGGCACAATCCGCCAACTCCTTAATCAAGTCTTCCATTAGGGGTAATTCATTTGCAATCATCTCTTTTGCTTCACCTAATGATTTCTAAAGGTTTCGATTTTTCCAACAATTCTTTCATCTCCATCTCCACTCAAGCATATACCTCTCCATTTTCATCATCTATGATCAGCTCTAAGTAATTTCAAGTTTTTTCCCAATCACACAGCTTGGATTTTCGGCAATGCACTTGTTCTTGTTCAAATGTTAATGTGTGTTATCTTAGTAGTGTTAAATTAGTCTCATTGGAAAGATATCATAAAATTTATGTATTTCTCACCTACACATAGATGTGTCTTTTCTATTTGATTGTTAGATTTTCTCAGATTGTATCACATACCCTTGGTGATCTTAATAAGGGGCAGCCTATGCTACCCGATTGGTCAACTTTTTCACTATTCTTTCCGCTTTTGGTGATCTTTAACCTCATCCCTATTTCCGTAGACCGTTTGACAGTATCATGCCTCTTTCTCAGTCATCTTTTTAGCTATAAAATTCCTCCTTTTTGACCATTCTTCCAGTGAGTTTCTGCCACCATTCCAACCATGTCCCATTTACAGTGTAAAATTACTATTTTGGCCATTCTTCCAATGAGTTTTCTACCATCTTTCAGGCCAGTATTATGATTATTGTTCCAGATACTATTTTGGACGCTATTCCATTGCTACTCCAGTCACTGTTTAGCCGCCAATTTGAGCCACATTAGCTGCCACCTTACGCCACACCAATGATGGTCTTGATATTCGTGGTATCGCGTTCTCTCTGGTCATTGAGAATCGGTTCCCTTTTTTTTTTGATAAGGTAAATTGTATTAATCAAAAGGGAGAAAAAAAAAACTCCCGCATACAAGAAGTATACAAAAAGTAGAGAATTTACATCAGAACATGATTCTCTACAAATGACGCCCAATCTTCTACGCAAGTAGGGGCTACATGTGTGCACCATTAAGCGATCAAAGATAAAAGACTATTCTTAAGATGTGAAAACGGAGACTCAATCCCCTCAAAAACTCTCTTATTTCTCTCTCCCCAGACTAACCACATGAAAGCTAACGGGACGAACTTCCACGCCTTTTGCTTGCTTCTTCTACGGAAATTGGCCCAGCTATATAGCATTTCCTTTACAGTGTTTGGCATCACCCATTGAACACCAAAAAGATTCAGGATTGCCCTCCATAAACCCCAGGACACAAGGCAACGCATCATAAGATGATCAACTTCTTCCCCTGAGCTTTTACACATGTAGCACCAACTAACATGTGTAATTCCTCTCTTTCACAGATTTTCAGATGTCAAGATCACTTCCCTCGCTGCTAGCCATGCAAAGAAGCACACCTTCGTGGGCGCCTTAGGAATCCAGATCGATGAGTATGGGAAAGTAGCTTCCTGTCTCNTGCCGCATCCCAGAGAAAATCCCTTTGAAGCCGCTCCAGTTTTTCTGTGATGCTCACAGGTGCTTGCAACAGGGATAAGTAATAGGTAGGGATACTTGACAAAGTGCTTTTAATAAGCACTTCCTTACCGCCTTTTGACAAATACCGTTTCTGCCAGCCTGCTAATCGTTTTTCAACCCTTTCAATGACTGGATTCCAAACNGAATTTGTTGGACCATCAATTCCTTCTGGTTTGAAACTCTGAAGACGTGGGGGAAAGAGACCCTCAGAGTAGCCTCTCCACACCACCTGTGATTCCAAAAGCTGATTCTCCTTCCATCACCCACCTTGTAAGTGATGTTGCCATAGAAAGCTTCCCAGTTCTTCATGATGTTCCTCCACATGCCACACCCGAACGGTGTTGTGATTGCCTTGGTCCTCCAACCCCCTCCCGTGGAATCATACTTTTCTGCTATGACCTCCCTCCATAGAGCATGCTCTTCTACCCCGAATCTCCACAGCCACTTCCCCAGCAAAGCTCTGTTGAATACCCTGAGATCTTTTACTCCAAGTCCACCCCACTTCTTTGGGGAAGTGACTGTCTGCCAATTCACTAGATGAAACTTTCTAGTTCCATCTGCCGCATCCCAGAGAAAATCCCTTTGAAGTCGCTCCAGTTTTTCTGTGATGCTCACAGGTGCTTGCAACAGGGATAAGTAATAGGTAGGGATACTTGACAAAGTGCTTTTAATAAGCACTTCCTTACCGTCTTTTGACAAATACCGTTTCTGCCAGCCTGCTAATCGTTTTTCAACCCTTTCAATGACTGGATTCCAAACAGTAGTATCCTTTTGCAAAGCACCCAATGGTAGACCCAGGTAGGTAGTGGGGAGAGAGCCCATCTTGCATCCGAGAACATGAGACAAAGCATCAATGTTAGCAACCTCATCCACCGGGAAAATCTCACACTTGCTGAGGTTGATTTTGAGTCCTGATACTATCTGAAATCACTGCAGTAGCTGCTTCAGGCAGGTCAACTGATCCATATCGGCATCACAGAAAACTAGGGTGTCATCCGCAAAAAGCAAATGAGAGACCCTTCGGGCACTGAGCACCTCGATCGGAGCTGAGAAACCTCTCAAGAAGCCTCCACTCGCTGCACGATCCATCATTTTACTCAGAGCATCCATCACTAAAATGAATAGCATGGGGATAAGGGGTCACCTTTCCTAAGCCCCCTGGAGCTGCCAAAGAAACCACACGAGCTACCATTAACCAGGACAGAGAATCTGACTGATGAAATGCAAAACTTGATCCATCCCCTCCATCTTTCCCCAAACCCCATCCGTTTCATAATGAAGTCCAGGAACTCCCAATTGACATGATCAAAGCCTTCTCAAGGTCCAACTTGCACAGTAATCCGGATTCTCTATTTTTCCTTTTGGAGTCTACAAGTTCATTTGCCACCAGAGCAGCATCCAGGATCTGCCTACCTTCCACAAACGCATTCTGGGAGGACGAAACAGACACGTCAAGAACCTTCTTTAGTCTGTTAGAGAGCACTTTAGAAATAATCTTGTAAATGCTCCCCACTAGACTGATAGGCCTATAGTCTCTGATACAAGATGCACCTTCTTTCTTAGGCACAATGGTAATAAAAGAAGCATTGATGCTTCTCTCGAAAGCACCATTCACGTGGAAGTATTCGATGGCTTCCATCAACTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAGCATCGTGCACCTCCTCCTCCTCGAAAGCTCTTTCTAGCCACTCATTGTCTCCCTCCCCTATGTGGTTGAACTCTATTCCCCCCAAAGTAGGCCTCCAACTAACTTCCTCTTTATATAAGTTCTCATAAAACCCCACTATCGTCCCGTTTACCTCCTCCTCACCCTCGATCCTCACCCCATCTACAACTAAGGACTCTATGAAGTTTCTCCTTCGATTTGCAATCGCAACCCTGTGAAAAAACTTAGTATTCGAATCCCCCTCCTTTAACCAGAGCGCCCTTGATTTTTGTCGCCAACTAGTCTCCTGAGCTATTGCTAGCTCGACTATTTCACTCTTAACCTCCCTCAATCTCACTTTCTCAGATTCATCTAGCTCTCTCGCCCCTTCCCCGCTCTCTAATTCCCCCAATTCATGCATAAGCTCCCTCATTTTGACCTTCACCCTCCCGAAGACCTCCTTGTTCCACCTAATAATATCTCCCTTGAGCAATTTGAGTTTCTTCAACAGACGGAATGAGGGTGACCCTGATACCCCGTAGTTGGTCCACCATTCTTTCACTTTGTCACAAAATCCTGGCACTTTCAACCACATGTTCTCGAACCGGAAGGGAGCACGGACCCCCCTCCCTCTGCATCCATCTAGCAAGATAGGCAAGTGATCCGAAGTCAGCCTAGGTAAAGGGATCTGCAACACATNTTGCAACACATTCGGTACCAGCTCATCCCAGGAGGACGATATCAAAAATCTATCGAGTCTGGACCTTGAGTTTGAATCCTCCGCCCTGGTCCAAGTAAACCTTTCCCCTGACAGAGGAAGATCAATGAAAAAGTGATCATTGATGAAGTCAGAAAACTCCTCCATGGCCGTCGAAATCAAATTGCACCCTAATCTCTCCTCCGGGAATCTAATTGTGTTAAAGTCTCCCCCTAGAACCCATGGAATGTCCCATTCCCCCATAATATTAGCAAGTTCCTCCCAGAAAGACACTTTGGACTCTTCCCCTACCGGCCCATACACTCCCCCAAATCTCCACTCCACCCCACTCACTCTATCTTTGACGAGAGCTGCCAAAGAGAAAGCTCCCTTCCTAATCTCCTTTACCTCCAAACTTCTATCATCCCACATAAGTAGAATACCCCTGGCACTTCCTGCTGCAGGGACCCAGTCATATTTCACCCAACAACCCCCCAGACACTTCTCACGATCGCATCCGACAAAACTTCCATTTTGGTCTCTTGAAAACAAACTAGGTTGGCACCCCACTCCTTAACTCCCACTTTGATGATGGCCCTTTTGTTTGGGTCATTCATCCCCCTGACATTCCATGAAAGGATCTTAACTTCCATAAGCAACAATAGCCCCCTTTTCCCCACCCCCCTAGACGAGATCCCTTTCTCCATGTCACACACTCGACCCCTTCTCTGATACACCACTACATCCCCTATTTTATTTTGATTAACACCTTTACCCAACTCCCTCCCCGCGCCTTCGTAACAAGGCTGTTGCTCAATCTCCCTCAACAGTTCTAGCACCCTATCTTCCTTCCCTTCAAAAGAAACACCTAAGAACTTCCCAAAGTTTAATAGTTTATCCTCCATCCATAAGGACATGTCCACTTCTCCAATCCGTGACGAAATTGGACATGCGTCTTCTGTAATTACCTTTTCCTTGCCCCTACCGGAGGAATGCAAGACCATAGCATTGCTAGTAGTAGGAACTCTGGGTGTCGGGTGGATCACGTCGTTTCGTTGAGAGGCACCAACCTCAGGAATTAATGCCTCTCTTGGAGGAGATTGTATTGTAACCTCAGCACATCTAAGCACAGTGGGAGATGAGGGGGCGATGGAGCTGGGTCCATCAGAGGCGGGGGGCCGTGGAATAACACCGCTAGCTTTTGCCGGTGCTGCCCCTGGAACAGTGGCTGAAGAGGAGCACAAGTCTCTAGTGTTTGGTATAGCACTGTTTGAGGCATCATTGGCAGAAGGAGGAGGAGTGACTTGCTCCCATATGTTTAGGCGCCTTATCGGACGTCGCTTGATATAATTTGGGCCCTTTTGGATCGATTCTAATAGAATTGGGAAAAGTTGCTGGGCCCATGTGTAGAGGCCCAGACCTATGCTTGCTGAAATCTTGCCCGACCCTATGATAGTTAAGAGAGGACGCATCACTATCCCATTCACGTCTTTTCCTCCTGTTAAAATTTTGTCTTCTTCTCCCAAGTTCAAACCTCCCGTCACTTCTCGATCTACCGTCTGATTCCAGCTTAGTCCACTGGTCCGGTGAGCCCTCCCCTCCTCTCAGAAACTGGTTTCCCCTTCCCTCCCTTCTATTTGACCTTTCCTTTTCTTCCGCCAAAATCGCAGGCATAAAGACAGGGCGAAATTCAGGGCTCAACCAAACCCTGTAGCTCAAGTAGCCATCTTCCACAACAGTGGAGACAGGGATTCGGCCCCCTTTCCTCACCACAATCCTTACTCGCGAGAGATCGTGCAAGCTGCAAGTGACATCAATGAAACCCCCACAGATTTCCCCAACCTTCTTGAATAGGTCAAGACACCAGAGATGCACCGGGAGACCCACCATGTTGACCACCAATGTTTCGGCAGGGAAACGAGGGTCGAGACACCCATCGTGCTCCACCCATCTGTCTAACTTTAAGAAATTCCCATCAAACCACCTGTTTCCTCTTACGAGAATTTCTGAAGCTTGAGACTCGGTGACAAATCGAAAGAGATATTGCGTGTCACCCAGTTTCGATATTTTCAGCCCGTCCTTGACGTTCCATGCCTCTGCTGTCCAGTTCCTAACAGTTTCTGGAGAACCAACCCATTTGGAGAAAGACCCAATCAAACAAGAGTTCAAGAAACCTTTACGCCTGAGAGTGTGCTCCTCGGACAGCGAGAAGTGTGGCTCCTGCCCTACGTCAAGCTTTCGCTGGCATCTTCCCCCAAGTTCGCGGGCTGGCCAAGATGCAGCTTTGCTAAAAGACATGCTTTCTAATTTTCTAGATTCTAGGGAGAAGTAGGAGCTTCTCTCTGTAGAGTCCGTCATAGCTAAACCAAGAGCCTCGCAAGGAGACTGACTGAAGAAGGGAAAGTGAGAGGGAGTGAAATCTGGCCGGAAAAAGACACCTCCGTCGCCGAAAAATAGCAAAAGTGGTCGGAAGTTGGAGGTTGATGGATGGGTGAGGTCGGAATAGCCTCGCGAAGAGGCTGTCTGAAGGAAAAAAGTTGGGATCTGGCCTGAAAAGGCGTCACGTGCCGGCGCGTGGGGTGTCAGATCCCGGCGATTCTTTCTGGGGTAGTGTCGCGACAGGGTTGGCTCTCGGATGGTGGTGGTGGAATCCTAGGATTCATGGTTCATGGTGGTGGTGAAGGGGTGGGATGGGGTGTCTGTGTGCTACAGTGGTTTTTTTCCGCTTCATGCACATTGAGTTCTGGTGACTGATCTCGTTTTGAACTGTTCACTTTCTAGGTGACGCTTTACTATTACGACTATTGTTCACTATTCCGTGTTTTCCAGAAAAAGCAAACTCAAGGTGTTCTTCACTTCTTCTTACCTTGAGTTCAAGGACACATAATACAATGGTAATGTTAGTTAGTAGAATGTTAGTGTTAAGTTAGTCCAACATCAAAGATTATTGTGCTTTTCACACCTGAAAATTAAGTGTCATGTATTATGTAATAAGTATTCATCAACTTATCTTTTCATCTGCTTCTTCTATTGTCTCACAATTTTCAGATTAAAATGGAGTATTTTTTTTACAAAGAATCCCATTAGCAAGAAGATTGTGATTGTGTCTTGGGCACATTGTCAGTTGAACCATCCGTATTGGTTGAGATTGTTTCTTTGAAATCTCCTGATATTAGAAACCCGTCGATCCTTGAGTACTTCCTGCCTTGTTTCTCTCTGGACCATATGTAACTCCTACCTTGCAGAGGCAAATTGATCAACTCTATGTCAAGACTGAAGCTTCAGAAAGTTCCTAATGATTTTGAATGATTTCTCACCTGCAAGAAGCTGGTGATTCACATTGTTCAGCAAAAGATTTTTTTTTTGTTTGCTTGTCAAATTAACATTGTAATTAGAATACTCTCCTTCCCATATGCTCTTCTCAAAGTCAGCATTCAACCAGTTATGCCTCTTACTTCTGGCCGTACAGAAGGCACCTTGATGGATGTTAACTTTTAACACTTCATCCTATATTTGGAGGCCTAGAGAGGGTTACTTCATGCTTTAGTTTCAATTTAATCAACAGCTTCTAGGTTATTTAAGAAGTAAAGCATTGGAAAATACATGCGCGGTTTAAGTAATAGGATTGCCCTAGTATCTCATTTTTCTTTGAAAAGAAGTTTGATTCATATGTGTATTTCAGTTTTTGCTAGTTGTTGCAGGATCAGTGTAAAAATGGTATTTATCAGCCTTCCCTAAAAAAAGAGAGATACTGAACGAATGCAAAGAAAACACTGTTTCTTGGAGGCTTCTTATTTGCTCATAATTCATCTGTTTGTAAGATTTATGGGTGCGACCCATTTCTTGCCTTCTTGGTTGGTTATAGTTTTATCTTTGGATAAGATTTACCTGTAAGACTTGCAATTAAGGGCTTTATGTCATTTATCATGCTAAAAATAACATAAAGTCTGTGTGGGCTGTACTTTTTTAATTCTATATAATTTATAATGATGCATGTTGTTTTCTTGGTGCAGGTTCTAGCACTGCCACTTCCACACAGACCGTTATTTCCAGGGTTTTATATGCATATCTATGTGAAGGTATGGTACCTTGTCTAGAATGGGTTAAACTATGTCTTAGTTAGTCTTCGTGATTTTTGAAATAGAATAGATATAAGCTTTAAACATTAAGAGTAAAATTACTGTCAACTTTCTGAGAGTTGATACTGTTTCAGCTAGTGTTTTGGAGCCTATTTCCTCTTAAGTCTATAAAACCAGCATTCAGCTTTAACAATGCTACTTTAATTGATGGAAAGTTCTTGGCTGAAGTTGTAAGCTTGAAAAGCTAAAACCTTCAATTAGGAGAGCGTCTGTTGCTCAATTGTAATATTCTTTAGCCTCATTTGGTAGTTTCCAGAAAAATATAAATTCTGCTGTAATGGCATGCAACAAATTGATCTGAGGCTCCACATAATCCATAAATTTCACTGCAGGATCCCAAGGTATTAGCAGCCTTGCTGGAAAGTCGAAAAAGGCAAGCACCTTATGCTGGCGCTTTCCTTATGAAAGATGAGCAAGGGACTGATCCTAATGTTGTGTCTGCCTCAGATACAGAAAAAAACATCTATGAGCTTAAAGGAAAAGACATGTTGAACCGTCTTCATGAAGTTGGTACACTTGCTCAGGTACATCTTGTTGTTCCTTGTGTTATTCTGTTGCTTTAACTTTATTGAAGAAGTTTCTGATCAAGGTTGATGCAACTTATGATGAGCCAGATAACAAGTATTAAAGACGACCAGGTTATTCTTATTGGTCACAGGCGGATACGTATGGCAGAGGTGGTAAGTAGTTGTCTGCTTTATTTTTTGGTTATAAGCAGCTCCATATGTTTTCTTTTTGTCCTATTTAGTTCTATTTTTCATCTACATATCCATCATCTCCTGATTTTGAGACTGCCCAATTGACATCTTGGGTTCATTGCACTTGTATATTTTTCTCTCTTTCCTACCTTTTGTGTCCCAACACCGAGTCCTTCCCCTTTGGCTTCATATAACTGAGCAGCTGCATTGTCAAGCGATAGAGGTCTATTGCAATTTGAGGCTAAATGAGCTGTTTTCCCCAGTTGGTTATGACGTGTTATCAATAGAATGAAACCCTCTTTTAGTTATTCTTGATTGCTTGTTTGCAAGAGAAAGTAGCACTGATTTTGTAATAATTTTTGGACAACTGGTTTCCTGCAAAGTCATAGGAATGTCTTTCTTATTGCACTTCGGGGTTGTAGAAGAGAAAAGCAATTGAGTTTCTTCTATGATGTGCTAGTTTTGCATCTGTTTGATTGCTGATGCATTTGGTTTGAATGAAATCCTAGACTTTTGATAACCTAGAAATCTGAGGGTAAGTGGCTCACGGTTTGAAACTCAGTGGATAATGGGCTCACCATTCTACCATTGTCCACCTAAATACTAGGCTTTTGTTTGCGCCACGGCACGAACTGTGATGTGCACCCAACCTACATATCACGGGCTGTACTCTTACCACTAGACAAAAGCCCCGGAGGCTTAAACATATATTAAAACACATAAAAGTTTGAATAGAATCCTATATATTTGGGCTGTAATTTGGTATAGAGTCGTATTCTTGGCTTTAGCTGTAAAGCTGGAGTACTCATGCATGGTCAGCCATTAATGGGCTTACTTAGCCACTGGAGGGTCTTATGTCTTCTTATTTTTTCGTTTTATTTTGTGATTGAGGACCTCTTATCCTCTTCTATTGGACTTAAATCTCTCATCCCCTGAGCTCCTCAGTAATAATTCTCCTATTATACTATACTTACTGTTGGATGTTTACCTCTCTACCTTCACATGGTAGGGTAAGGTCTGCGTACACAGTACCTTTCTTGGACCCCACCTGCACGATTACACTGGGTTTGTTTGTTGTTGGATGTTTGCAGTCAGATTTCCTTATGTAGTCATTTCTTTTGTTTCAATAAACAATTTCTTCAATATACTAGTGTAACAATTTGATGACTATGCAGTAGATTTGTCACACTTGAATGTTCACTATCAGATTTCCTTACTTTATTCACCATGTTGTTCTTTTTCCGTCTCAATAACCCATTTTGTGAAATGTACAAGAGTAACACCTTGAGGACTGAAACAGTGACATTCATTTGCAGGTCAGTGAGGAACCCCTTACAGTGAAAGTTGATCATCTCAAGGCATGCCTCAGTTTCTTGTGCATTCTTTCTACCCAGCAATGATAAACTCATAAAGCTTGATGTTCAGTTTTGAAATTGTTTCTAAAATTTGCTTTCTTAATTATGTTGCAGGAACAGCCGTACAACAAGGATGACGATGTTATAAAGGCGACATCTTTTGAAGTTCTATCAACCCTAAGGGATGTTTTGAAGACAAGTTCTCTCTGGAAGGATCACGTTCAAACTTATATCCAGGTGTTAGTCATTTCTTTCTAAATGTTAAGTCCTAATTGTTTGATTTGGTGATAACTCCAAAAAAAAAAAATTTACTCTCTCAAAATGCATTGCCTTTTGATATTCTAGCCCACACTATTGTGTGAAGCTCCACAATGCCGGTGGATTGTGTTAAACTTAGATCATGCCCTAAGTTAGGCATTTCATGCCATTCTTTTTAATGGAAAATTATGTTCGTGTTGGTGGTAATCAAATCCATGACTAGATTCTCTGCCTTATAGTTGATGCCCTGCTGAAAGAGGAGACTAATTAAAGATGTCAAGACTGCTAGGAAATGGAAGGTCATAAATGTTCTCTGAACCAACTGTATACATATTGTTATACATATTTCTAAAGAGACGCTAGGAAGTTTATACAACTGAACAACTGTATAAGATGTATTTGGAGCTAGGAAGTTTATACAATACCTAATTCTCTCTCTGCTGAATTTCTAAAAAGATGCTACTGATTGTTATGAATAGTGTCAAAATCAAAAGATAATATTGTGGCATGTTTTGCTGAAGAAGTGGGAGTGGAGGGATAAAGGGATAGGGACTTACTGAATACTTCTAAGGGAAGCATTTGTGTCCCTCCAAGTTATTGGTACACTAGTAATTGGAGTAAGAGACATTACTCCTTTCGACCAATTTCAGAGGGGTTATCAAGTCTTCAAAGCGTGGCGATATTTGTGGGGGTTGGCTGGAAATTGAAAAAAAGACAACTCTAAGAAATCATCCTAGATGGGCTTCGATCAAGGAGAGGAGGCAGATGGAGAAGATTTCGAAAACCCTAGAAGTGGAGAAGAAGGGTTTTCTTTATAAATTACATGTTTGGAGTGTAGCGTTGGTAACAATTATCTCCAATCGACGAAATGGAGAAGAAGATGGACCTGCTAGGTACTATGATGAAGTTTGAGATAGGCCTGCTACCGTAGAAAAGATCGGTCATGTGGGGGTTGGGAACAATATTTCAAATTCTAAAAAAGTGTGGACAGATGTGGGAGTGGCACGTGGAAAGGTCTTAAGGTAGAATGGGTCAGTAGTAAAGGAAAGGGCTCCAGAATTGGGCCACTTATGTCATGATGTTTGGAAGCTTAACATAACTTTTGGGTCATAAAAAATAATAGATATAGGCTCAGATGAGTCAACCTTTATTTTCAATAAGGGGCTTGGTCTTTTAGCCATGAACCTTTAGGGTCTCAGTTTTGATACTCTTTTAGTGGAGATTAATGTCATTAATTTGTCTGAAACATGATACATTTTAGACTCCTTTCACAACACCATGATCGGCAAACCGATGAAAGAGCAGGAAAGCCCTAGTCTGCCACAAATTTCACAGAACTCACATGCAGCTGATTACACAAAAATTGTCATTCAAGATTAAGAAAACTCTTTCTTCGTACAATCGAAAGAGGCTGACCAAAATCAGATAGTTTTTTGAATTTGAAAGAACTAAACCATGGCTTATCGAAGAGGCTACTCAAGCTTTCTTATGGCTGCATAATAATGTTACAAAGATGAGCAAACAATATGGATTGAATTTTTGTGAATGTGAGATTGAGGGTCTAGCTCTGTTCACGAAGCTTGACAGGAGAAGGCATAAGAGAAATGAAGCTACCATGTCCAGATTCACAATTCCAAAAGTGATAGGTATAAAGGAGCTCCAAAAACTGTTTTTTAATGTGAATTATGGGGAGCTCGGATCAATGATGGGAAGGGGGATCACAAACACTAGGTACCCATGAAGCTGAATATTCTCACTTGGAATATTAGGGGGTTGAATGACAGGGAAAAGATAAAGGTGATAAAAAGTTAATCCATAAATTGAAGGCAAATATTTATTGCTTTCAAAGACGAACTTAGAAGGGGGTGTGGAAATACTAGTTAAACAAATGTGGTCAGACCCATCTCAAGTGTTGTTTGGAGTCCAACGACAGGAAAGGGAAAATATTGGTGATGTGGGAAAAGAATGTTTGGACAAGAGAAACTATCAACAAGGGATGTATACTATCACTCGCAAAATTTCCTCATTATCACAGAATTTCTCCTGGCACCTCACAGGGGTGTATGAATCACACTGCAAGTTGGAGAAACAAGAACGCTGGTGGGAGATAGTAGCATCTAAGGCATTTGTGCAGGGCCTTGGGTGGTGTATAGAGATTTTAACACTAATAGATTCATAGCAAAAAGAAAGAACAACAATAAACTCACTAGGGCTATGATGGACTTCTCTAATTTTATAGATCATTAGAAGCTTGTAGATTCTAATCTTAATGGGGCTCCTTTTACTTGGACAAAGGGTAATAATCAGGAAAACTCTTCAAGATTGGATAGATTTTTTTCCCGGCTAAATGGGCTGAGGAATTAAAGAACAAAAGGCAAGCAGTACTCCCCAGTGTATTTTCTGATTGTACTCCCGTTTCTTTTCAATGCGGAGATGGGGAAGTTTAAAATCTTACTTCAAGTTTGAAAGCTGGTGGTTGGGTGTTGAGAGATTCAATGAAATGGTGAAAAGCTGTGGAACTCTTTTGAAGTACAGGGTAGACTAGACTCCATTCTTTCAAGCTAACTGAAGTTGTTGAATACAAAGCAGTGGGATTGGGAATCTTCAGACTATGACAGTGTTGTGAAGAGCCTGAAGCGAGGAAAAGCGATAAGCCCCTTTTCGCTTAAAGCGAGAAGCGAGAAGCGAAGCGCTCGCTTTTTTGAAGTGAAGCGGTTTAAAAAGATATTAAAATAAATAATGCATAGACAACACATGTAACTGTAAGCAAATGTTCAATACTTCAATGTAAAAACTAAAGAGTAGCATCAATTAAAGCACAAAATGAGCATCATATTCTTCTTCAAGATTGTCAAATTCTTGTATTCCACTATCATTATTATATTGCTCGTCATCTTCTTCAACTTCTTCTTCTACATCAACTAGAAATGAAGTAACTGCTTCTTTTCCCTTCCTCTGTGAGCTTGAAATTGAGGTACTCCCCCTCAAACCATAAATTTTCTCCCCAATTCCACGCGCCTCCGCAACATCACCCCAAGTGAAATCAGAAGTTTCCTCAAATACTTCTTCATTTGCATGATCTTCCGGGACTCCAATTAGCCATTCATTAGCATCATCGATGTTGTCCAAACTAATTGGATCAATTACATTGCGAGCATTGTAACGACGCCTCATTGTTCTATTGTACTTAATGAAGACTAGATCATTGAGACGCTTCAAGGTTAGTTTGTTCCTCTTTTTGGTATGAATTTGCAAATAATAAGTAATTAGTAAGATTGCATGCGCATAACTGTCTGTCATCATTCAACATTCTAACTTCTTGAAGTTGCAGCATATTGGTGATTTCAATTATGCAAGGTTAGCAGATTTTGGAGCAGCAATATCTGGAGCCAACAAGCTACAATGCCAGCAAGTGCTTGAAGAGCTAGATGTAAGTCCGTGGTTCAAGAAGTTAGATATTCCCTCTGTTTCAATTTAGATGACACACTTTCCTTCTTAATCCGTTCCAAAAAGAATGACACATTTCTACAATTGAAAATAATTCAACTTTAAACTTTTCATTTTACCCATTTACCCTTAGTGAGAAGTTTTTATAACCACACAAATGTTATGCCCCCACAAAGCTTTTACCCCTTAAGCTTTTAAGTCCACAAGTTTCAGAAGTCTTTTTTCCTCTTAAACTTCCTGCCAAGTCAAACTACCTCATCTAAATTTAAACGGAGGGAGTACATAATATTTTCTGTATGTGTGCCTTTTTCAATCCAAACCTCAAAGAGAGAGGTTTACCAGTATTGTGAGTTCTAACCACTCTGGAAGAGAGCGAGGTTTTATTACTAGGAGGTAGATCTAAGAATTTTTCCTTGCCATTTGCTTCCTATTGTAACAAAAAAAAAATATATGTTTGGTCAGTGCTTCCCTGCTCTGAAAAGGAAAATGGTTGAAAGTTAAAAAAAAGGAGGATCAAAGCGTATTAGCGGTAAAGCTTCATTTCTTGCTGATGGGTAAAATGGCTTCTCATTGGTACTCTTGCTCGAAAAATTAGGAATCTAAAAAAGTTATCAGTTTGTTCACCACTTCATAAAAAGAAATTGTCAGTTTGTTCAAGTGGAGCTACTTATTTTGGGAAAATTAGAAAGGGAAATATTCAGTTTAGAACAGCTAGCAATATTTTGTATTTCCTCGTAAGAATCATAAACTTGCATTACAATGGTTATTTCATGATTTTCATCGAACTGAGCCTTTTGTCTGCTCAGGGATCAGAATATAGTGTGTGCATGAGATAAAATGAATGTTTCCCGCTTGTTTCATAGCTAACACCATAATACCTGCATCAGGTGCATAAGCGGCTACAGCTTACCCTGGAGCTAGTGAAGAAAGAAATGGAGATTAGTAAGATTCAGGTAAATGCACATCAAGACGCATACCTGAACTTTAAATAGGTGCTATGCATGCATTTAGCATTTTACGTCTTTTCTGTTGTCTGCAGGAATCAATAGCAAGAGCAATTGAAGAAAAAATAAGTGGAGAGCAACGCCGTTATTTGTTGAATGAACAATTAAAGGCCATAAAGAAGGCATGTGGATTATGTGCAGCTTTTTTTGTGTTATCATCCTTAAACTTAGTACTTACATATGTTTATTCCTAGATAATATATATGTTAAGCTTCTGATTCTTATGTTTGATTCACAGTTTAAACAAAGAGTTGATTTGAAAAAAAAGTGCTTCCTTATGACGTGATTTTGATTGGCTTAATGCAGGAACTAGGTTTGGAGACTGATGACAAGACAGCTCTTTCTGGTTCGTTGTCTCTTAATTACTACTGAAATGAATAATGTTCTTTTTGGATTTATTACGGCCAGACGTGTTTTCCCATCTGGCCAATGAAACATCTTATGTTGGCCCTGGAAAAATTCCATGTAGCAGTAGCATAGAAAAGGCCATTTGCAATACGTTTTGCCCTTTCTGTTGTAACAGTTTTATGGTGGCTGATTGATCTCCTGTTTACCATTGAAAATCTTCACTAGAAGCATGAATAGCATTCAGGCGATAATTGGCTTTACAGATAGGAAATTGAAGGTTGAATTTTCTTTGTACACGGTCCAACCAACTTTATTGACATGCTTTCTAAAGCTATTTGAGACATACCAAACTTTTGACAGATAAGAATACATTGTACCTATAGGGGGCTCTTTGTGTTTTCACTAAAGCACTTAATAAGACAATACTGGAATCCTTCAACTTCAATTTCCGAAAGGTTACTGTCTACCATATTTGGTAATGTGGCACTAAATCAACATGGATTTACTGGAAGGCCACATTGTTAGCACTGCATTGTTGTTTATTCAAGTTATGGACTGCTCATGATTCAGATTTATCTTCCTCCTCCAATGTTTCTTTTTCTCTTGAACTCTGTCCTGTTTCCCTTTTCAATGCACTTCCTTATTAGAAGCCTACTTGCTTTTTCTGTCATGTACTCTTTCTTCTCCCCTTTGCTCCTGTTGAGGAGTGTAATGCTTATGAAGGTGCTTCTTCCAGGTTTTGGTGATAATAGACATTACTGCAAACCCTAAAACGCGGTTCTCATTGTTTTTTCTCTATAATTTTCACAAAAATAGAGTTAAGCTGCGTGTTGTAACATTGTTTTGGTAGACCTGGTTTTGGTGTTGAAATTGAGAAGAAACTGCCTAATTATTAGGTTTGCTTTAAGGGCTTCGATTTTTCTGGTCAATACTAACGAAAACCTGCAGTTTTTTGTTCCCTTTTTTATTACATCTAATGGTGTGACACTATATTTGTTTATTACTGCAGCAAAGTTCAGGGAAAGATTGGAGCCTAATAAAGAAAAAATACCAGTACATGTTATGCAAGTTATTGAAGAAGAACTGACAAAACTGCAACTGTTGGAAGCTAGTTCCAGTGAATTTAACGTAACACGTAATTATCTTGATTGGTTGACTGCCTTGCCATGGGGTAATTACAGGTTTGTTGTCTATCGATTCTGCCTTACATTGTCTTGGGTTCAACCCAACTGATGTTATCCTTATCCTTGGCTAGCTGTACTAGAGGAATCTGTTTGAGAAGCTGGCTAAACAGTCCAGCGAGAAATAAAAATGTTATTCTCTGAAATTTGCTGCTTCCAAGTTAACCTTACTGCCTAGTGATGTGACTTGCCTAAATATCTATCGAGTAATATCCATTTGTCTTTAACTTTTCTTTCTCCTCCAGTTCCTTATTTTGGGTTCTTACATGTCATGCTTCTGGCTTTGAGGATGCTTACTTGACATCCCAATGTATGAGTTTAGACCAGGATCTCATGAGAGCAGCAAAACTAGGATTGTACTTATGATGAGCTCCTTAAGATGGGGGCTTGATTTGCCGTAGTTCGTGTTGTTTGCTGCTGATGGTGGTGGTGTTGGCTTTATAGTTTTGTTCTCTGCCATGGGTGTATTGCATTGGTTCCTGAAGTTTTCTTTTTATGATCCAAATGCAGTGATGAAAACTTTGATGTACTACGGGCAGAACAAATTCTTGATGAAGACCACTATGGGTTAACCGATGTTAAGGAAAGGATCTTGGAATTTATAGCTGTGGGAAAACTCAGAGGAACCTCGCAAGGTTGGTAAATGCCTTTTTTTAAAAATAATAACCCTCATTTTTATTAAAAAAAATCCTATTTTATAAGGTTCAGCCATAATCATATTAAAAGAACGGAAAATGATCCAGCCATCTCCTTACTGTCCATTGTCATAACATTATAATGGACCAATGGAAAATATATCCATAGAACATGAGATTTATGGTTCCCAAATACTTTATTGACATCAAATTGAAACGAGTAAACGGAAAGAAGTGAACATTTTAGGGAATTTGAGAAATATTTATTGGTCAAACTTAGGTAATACTTTTTGTGTCAGTCTAGAGTTCCTCCAATGTTTTCTTGTGATTATCTGTGGAGTAAAGAAATATATCTTGAGCTTAATTTCTTCCCTTGAAAAGCAACTAATGTGAATTAAACTGCTGCACCTTGGGCCATAGTTTGTTGGTGTTCTTCTTACATTCTGATTTTGTGCTGTCCATGATTGGGCACTCGCTGTGTGGTATTCGATTGATAACTTACTTTCACCACCAGTTGTACTTGTATATCTTTGGGACATTGAACTTGAGATGTAGTTGTTTGTTGAGGATAATCTTTGGAAACTATGAAGTGTTGAGAAAAAAACAGGTTGAATGAAAGTTAACAATATAATCCAAAGACAAAGGTTAACTCTAGAAAAATGTGAATTGCATCATAGCTAGGACAATATTAGGTTCAAATGATAATATTAATCCCAATTATACACTGGCACTGTCTTTCATATGTGCGGCTTCACCTGATTTATCTCAGTTTAATTTTGAATCTGAGTCAGGAACTAGAAACAGACTCATTGCTTATTTTTGTTTGAACAGGGAAAATCATATGCCTCTCTGGCCCTCCTGGGGTGGGCAAAACCAGTATAGGTCGTTCAATTGCACGTGCATTGAACCGCAAATTTTACCGATTTTCTGTTGGAGGGCTGTCTGATGTTGCTGAAATAAAGGTAATGGGAATATCTGGCCAGCTAAAACAGAGTTGTTTTGTGGCGCACAGAATCTTGAACTTTCATGACTAACTTTGGGATACACTTCAAGGGACATCGACGAACTTATATCGGTGCCATGCCGGGGAAGATGGTGCAATGTTTAAAAAGTGTGGGAACCGCTAATCCTCTTGTTTTGATAGACGAAATTGACAAGGTATTTTATGGTTTGTGAGTTCATGCTTCAATTGTATGGCTTTGACTATGAGAGGAAGTCTAACTTCTTTTTCACCATTTAATCTCGTTTTTCTGTATATGACCACTGGAAGAATCTTGAGCCTGAACATATTATGTTTTTGCTTGGATTTCCTCTGCAATCTAAATGTTTGAAGAAAGTGTTTATCGATCAGTTTAATAATAGCCTTGTATTTTTTCTTCTATGGCAGTTGGGAAGAGGACATGCTGGTGATCCAGCAAGTGCTATGTTGGAGCTTCTTGATCCAGAACAGAATGCAAATTTCTTGGATCATTATCTTGATGTTCCTATTGACTTATCAAAGGTAGTTGTTTTCTGGAGCACTTATCAAATTATTGTGGCTGTTGATTGGCCCTTATGAAATGCCCTCTAACATCATTTGATGAATGGGGACTAATGTGATATTAAAAATCTTGCAAATATCTACTATCATTTTGCTTTTTCACCTTTTGATTTCCCCCCCTTTTTCTTTACTGATTGATGTTCTCTTTCGTCTCTTACCTTAGTTAAGTTTGGAAAAGGTTTCGTGACTGAGCCTGTTTCTTTTATTGTCCATAATGGAAGGTTATCTGGAAGTATTTTATTTCACGTCTGTGTTACCTTTGTCTCTGTCATTATGACTGTAATATGATTAGTGTAAGTAGAATGTTGTTCTCTTGATACTTGAGAAAAAACTAGGTTTCATTGGCATTGGTTTTGGTGATCATTCAATAGGAAAAAGGTTGATGCATTGAGTTTTCCTACTGTCTGTTCATATTATCTTTGATGGCTTTACTGATGAGGGATATGGTTTTATACCTCTTGGAGTTACCGATAATTGGAGCGATAGACTTTGAGTGTTGAGTGTTAGTTTACCTATTGATTAAAGAATTTGGCCCATAATTCAAATATGACAGTTATCGATACTATTTCATTTTATTATATCAAACTTCAGCAAATCAACATGGCTACAGAGAGAAGGTTGATGGGTATATTCGCATACAATTTCATTTATTTTGTCCTTTAATGTGATAATTTACTGTGTGTTCCTATGTCTTTTCATAATGGTAAATATTTGTGTTATATTTCCAAAGGGCTTTGTCAAAAGAAGATAGCATCTTTCGATTATTTTGGTAGTATTTTGGGTCTGACTTGGTTATGGGGAGGGAGGCTATATCAAAGCCCTGTTATGGATGTTGAGTTATATATGAACCTGAAGAAATTATAAAATCCAACCTCTAGGTTTTAATGTACTTCTAAAATTTGAAAAAATCGAGCTTTGGCACCTTCAGTGTTTTCTATCCTCTTTGCTTAATACAGTTACCTATTCCACTGTAGTTTATACGAGAATGACATCTGCACATTTATCAATTTTTTGAGTGCATAATTTGAGGTCGATCATTTTCCTTTTTGCGGCTGCAGGTTTTGTTTGTCTGCACAGCCAATGTTGTAGAAATGATACCTAATCCTCTTTTGGATAGAATGGAAGTAATTTCAATTGCTGGTTACATTACGGATGAGAAAATGCACATAGCCAGGGATTATTTGGAGAAAGCTACTCGTGAAACATGTGGGATCAAGCCTGAGCAGGTATGTCTTATAGAAACATCTCTAGTTGCATCTCTTTCATTATCTCCTGTGCATATTCATTATCGGGTGAATAGTTTTGTATTTTTTCCGCTCCCAATTTTTGACATTCAGAAGCAGACATTGGTCTGTCGGGAAAATGTTCGAAACATGATTGGGCAAGAATGTAACTTCTTAAAGTAAAATGTGAAAAGCTTTATTCAGTTAGAAGTTTATGGAATAAAACATATGATTGGAACATTCAGGGGATCAAGCTCTTTACACCCTTCCATATAGAACATGGCTAAAGGAAAAAATCGCAGTAAATCTCATTATAGGTTTTTTGCCATATGCTTTAATATTTCCTAGAAAATATACTGTTTGATGTATTGATATCATTTTCATCTTAAGCCTCTACATTTAGAATATACTTGCCTCCAAAGCTAATGGATTATAAAGTTTAATACAAACTCTTCCATCACTATCCTTTTCAAGAAGTATTTAAGTGGTTCACATAAACAACTGAACGAGTCCACATTGCTGTATATATCCTCTCAAGTGCTGCTTTCTCTTTGAGGAATGTTTGCTTGACCGAGCCAAACTGTGACTGCCTATGCTAGCCAATGTGCTCGTGCATCATTAACACAACCACTGTGCCACACTTTGCTGGGAGTATAAACGCAATTGAAGGTGGTTCTGTCGTGGTAGTCGCCAATCTCTATTAGTAAATGGAGGTATAATGAGTTTCTTAGTGGTAGGGATACATGTGCCTGGTGGGTGGGGTTGCTGGATGGTATTGTCTAACAGATTTGAGCGGGGCGTGAACTAAAGCAGGATGGATGTTTAAAATATGCCAAGTGTTTTGGTGTTTTTGCTGTTGACTTCAATTTGACCATCGTGTAAAAGAGAAACTAGACTAGTTTGTTACACAGTAGTGAATTTTTTATTACTGTACAAGGAATCATCATACGATGCCAAGATTGTGGTGATGACTCCAAAAAACACTTGACAATGTCTATTGAGTGAGCTTAAGTTATAATATGTCTGCAGTAAACATGTTCTCCCACAGCCTTAAAGATGAGATCCGTGGAATTCGGAGAACCTTATATTTAGCTCTCTAATGTGATCTTCAGTTGAGTTACATGCCATATGGATCTCTGGTGAAGAAAAGGGAGCTGAAGACTTACTGATTAAGGGCATCAAAATGACATTTGACCTCATCTAACTCTGTGGTGTCATGTGGTGTCATTTACTGTGCATATATACTCTAATAATTAACGTTTTCCACTTTTCCGGGAAAAAACACCCTAAATATGGGTGTCAGAAGTTAATAAGTTGATATGTGGGAGGGCTGATGGCTACTTTGGTTTGTTTTGGCCCTCCATAGCGGATAGGAGCAGTTCTAATTTTTGAAGTATCTTGTAGGCTGTTGGTACTTTTTCATGACATAGTAGTGGCGTTATGCTTATCTAATGGCAAATGTATGACAGGACGAGGGTAGAGCTTTGGGTACCATTTCTCGGGAAATGTTAACTCCCTAATGCTCCTGTAAAATTACTCGAGTTGATATGTAATCAAGTTGAAGAGATGCAACGTTTGAGCCCATGTATATGTCATCTCTCTGTCTCTCAGGCTTTTAACATGCTGAGGGCCGTTTCTCAGATGCAGAGATGTGATATCAAGCCGGAAAATTAAGTGTGCATGATTTATGTTAGCAATTGATTGCAATTCAAAGCCTTGAATGGTTGATGTTTGGTTTAAATATTTCAACACGTATAACCATCCTGGCAGTTAGTATATGTGGTTATAGGAAGACTTTCAATTGATGCCTTGATGTTGTTTTTCTAAATATTTCACAAGTTACACTGAAGCGTTACTTGCAATAAGCTAATTTTATTGGCTTCATGAATTCTTTATTTCAGGTTGAAGTGACCAATTCAGCTCTTCTTGCTTTAATAGAAAATTACTGCAGAGAAGCTGGTGTACGCAATCTGCAGAAGCAGATTGAAAAGATTTATCGCAAGGTTCTGTGACCTGTCCTCTTTTGTATAATACATCTATGATGAAATTGTCTCGAGTCTCGTACCTAGTCAACTTTTGATGAAAACTGATGGTTATTTCATGAAATCCAATACGAACACTTTGAATCGTTTTCATGCTCTGATACTTTGCACATCCTGTTCTTGCTTTTCAGATAGCTCTAAAGCTTGTCAGGGAAGATGGAGAGATTGAGCCTCAGAATGCAGAGGTAGGTGAGGTAGAAGCAGAATCTATCCATCTATCAGACGAAATCAAGTCTAAGGAAGAAATTCAAGCTGGAGCTGAGTCCGCAAACGGTAGCAATGATGACAAGGCCTCTGAAAATAATGCTGAAGCTGAAGCACAGGGAGCACCAGTGAATCAAACACAGAAATCTGCTAATGAAGATGCTTGTTTACAGGTAAATGAAAAACATTAAAAAGCAAAATTATAATGTTTAGTACTTCAGGTGATTCTTGCCAGTTGTAACTATGTATGCTACAATGTATTTTAATGCTTCTAAGTTTTATCTATGCTCAAAATAAAAATACAAGATGCAGTGTAATAGGTAGTTTTGTAGGCACAGAAGTGTCTTTTTACAACTTGTCTTACTGTCACATGTTGGATTAAATATGGTTAACAAATGAATGTAAGAAATCATTTATCTGGCAATATAACACCAAACAAGCAGGGGAAGCACTTTCTTTTGAAATGTTAACTAAGAACAAGCAAGGGAAGCGCTTGGTATTCATTTTCTACGCTAAGTCCCCCCTGTATCTTAAAAAGGTATCTATGCATAATTTGCATTTATATCAGACTGTAAGACAAGAGTGGGTTGCTCTAGTGGTGAGCACCCTCCACTTCCAACCAAGAGGTTGTGAGTTCGAGTCACCCCAAGAGCAAGGTGGTGAGTTCTTGGAGGGAGGGAGCCGAGGGTCTATCGGATACAACCTCTCTACCTCAGGGTAGGGGTAAGGTCTGCGTACACACTACCCTCCCCAGACCCCACTAGTGGGATTATATTGGGTTGTTGTTGTTATCATACACTGTAATCAGTTAAAATCAAATTCTTTGGGATGGAGGGGTTTCTTTAGTCTGGCTAATTAGACTTGATAGTCAAATTTGTTTTAAGAAGTTGTGCCAAATGTTAACTCACTTGTTATAATCTAGCGCGCGCACACACATTTTGATATGTTTGATGTTAAGCTGCTAATGAATAACGTCCTTATATTTTCCTTGACATTGCATGATTTTATGTTTACATATTTATCTGCGTTTAATTTAATGCTATTGTGTCCTTGCAACTGAATTTTGGTTCTGAGTGGTTGTGATCTTAACAGGATACTCAAGAAACTGAGAAAGCAACAGAAAGTGAAGCGAGTAAAACAGTAAATAAAGTGGTTGTTGACTCGCCAAACCTAGCTGATTATGTTGGCAAACCTGTTTTCCATGCGGAGCGCATATACGATCAGACACCAGTTGGAGTTGTGATGGGTCTTGCTTGGACTTCAATGGGTGGCTCAACACTCTATATAGAAACATCTCTGGTGGAGCAAGGAGAAGGGAAAGGGGCTCTCAATGTAACAGGACAACTAGGCGACGTTATGAAAGAAAGTGCCCAAATTGCCCATACGGTTGCCAGGACCATTTTGCAGGAAAAGGAGCCTGATAACCAATTCTTTGCAAATAGTAAGCTTCATCTTCATGTTCCTGCAGGTGCTACCCCTAAGGATGGCCCTAGTGCTGGTTGTACTATGATAACGTCCTTGTTGTCTCTTGCCATGAAAAAGCCTGTTAAAAAGGACCTGGCAATGACAGGGGAAGTCACGCTAACTGGCAAAATTCTTCCTATCGGCGGGGTATGTTAACAATTCTTACACCTCTCCTTATAATTTCATGCAGCTTTTGTGTCTGATCATCTATCATGTTTTCTTTTTATTTTTCGTTGATTTTGTCTTTAATGTTCTTTATGCTTTAATTATTTCCTGTGTCTGTATGTGTTACATGCATGCGCATGTAAGCATAATAAGAGTGGTCTTTTCTTTTTGACCCAACCAGTGTTGGGTTCCTTTCTTGATTTTACAAAAGCTTTACCTTTTGGTTCAATAATAGGTCAAGGAGAAAGCCATAGCAGCGCGAAGAAGTGATGTGAAAACTATAATATTCCCTTCAGCCAATCGCAGAGATTTTGACGAGCTTGCTCCTAATGTCAAGGAAGGCCTTGATGTACACTTTGTGGATGACTACAAGCAAATATTTGATTTGGCATTT
SEQ 77
ATGCAGTTTTTCCGAAGAAACCCATCACTTCACAGAATCTCCTCCAGATTCCTTAATCAAGTTCGTTTTCTTTTCTTTTCCTTTTCCGAAGTATAACTAGCTTTTCAATTTTTGTTTGGCTTTTCGATTATCTTACTAGTGTAACATATATTTCATATGTCTTGAGGTTCATTGCAAAAACTCGTTATATTTCTAAATGGGGTTGAGCACGTGGTCCAATACAATTCAGTAGGACTTACAGCTGTAGCCTGTAGTTAGAGGACATTGGATTAATTAATTATATGGCTGCAATTCAGATATTCAAAACGTTTCTTTTCCCTGTTTGCAATTTTTTCCTCCAAGTAGTGAAACAGTGGAATTTTCTCCCCATTCTTAGGTCAAGCTATCATCTTTTTGCTTAAGAGTTGGTTTGGATGTTTACATTTATTTTCTAACACAATTTGTTTTGGTTTGCTGCTGATATCTCATGAATTGGATACGATAGGTAGTCAAAACCAGTGCATATTCAACCAAGAAAGTTTACAATGCTGGGCAGCCGACTGCTGCTACTCACCCTCAGGTACTTCTACACTGCATATGTAGTATTGACATTTGGTAGCATAGAATTAAGACCGTTGACATTAACAAATGATAAATGTGCAGTAAATTTAAAATCTTGTTTTCTTGGTGGTTGTTTTCCTTGTATGGGACATACTTCGCTGTCCTTTGGAGCTCCTTTGTGAATTTCTGTTAAATGTTGTTACATCACTGCATGCAGTTAATGAAGGAAGGGGAGATTACTCCTGGCATTACCAGTGAAGAATATATGCAGAGAAGGAAGAAATTATTGGAGTTTCTTCCGGAGAATAGCTTAGCAATTGTTGCAGCCGCTCCCATAAAAATGATGACTGATGTTGTACCATACAATTTTAGGCAGGATGCTGACTATTTGTACATCACCGGATGCCAACAACCTGGTGGTGTTGCAGTTCTAGGGCATGACTGTGGTTTATGCATGTTCATGCCAGAACAAAGCCCCCAGGTATTTCAGGAACCATTCACTTGCTTCCTTCTTGTTGACAAGAAGCTGTTAATAAGAGAAAAGCTTCGTCCTATAATTTAGTGACATTTTTCTTTAGATTCAGTTACTACCATGATTTTTTGGTAGTTAGTATACATTGTAGCAAGTTAAAGATTGTTTCCATACTAAAAGTGAAAAAGTATTTTTAGGACGCTCTTTGGCAAGGAGAAACTGCTGGAGTTGATGCAGCTCTACAGATATTCAAGGCTGACCTTGCTTACCCTATTAACAGATTGCCTCAGGTAAATCTTTTTTAAAATCATATCTCCAACTGCAAATAAGTTTGAGATTCTTTTTAGAAGCGAATACCTCTCACACTGATAAGTAAAAGGGCATATGATAACATCCCTTCTTTTATTCCTTTCAATAGGACAATGAAGTACTTTATCTAAAAAGGGAGTGGAGACCTATTTGTCTCCTTTCCACTTGATTATAGAAATTTATGTCAGAGAATTGAATCTGTTAGAGTTGGCTTGTAGATACCTTTTGACTGTTGATGCAATTCTTAATATGCGTAAAAGAATTGTTTTTCTCCTTTTTCTCTTTTCTTGCCGGGGAAAAGAATTGTTTTCCTCCTTTAATATGCGTAAAAGGTATAGAGGGAACAAAGTGGATGGAAGTTAGAGTTTTCACCTAAGTTGCTCCGACACGGCAATTTAGGTGCCGCACCCATATCGACACGACACTAGTATGGGTGTGGGTATGGGATCCGTACCGGATCTGGTCAAACAATTTTGGGTACTTTGACCACGACGGATGGAAAAATTCGAGACGAGATACAATTTGATTCCCAAAATCAGAATCTAAGGTAAATTTAAATAAAATAATATACCTTATCTAGAAAATCAATCCTTTACTTATCTATAACTTGAAAATAAAAAGGAAATCCACACTTTACAAGCTATACGTAAGTAATCCACAAAATTTCTCATAATTTAAAAATATTTTTATTTTTTTTGAATTATTTTTAGTCGGATCCCCGCACCCATATCTGTACTAGGATCTGTATCCCCGAATCTTAGAATTTACATCTCGAAGGATCCAACCTCTAGATTCGCACCCATGTCGGACACCCGCACCCGTGTCCGAGCAACTTAGGTTTTCACATATATAGGAGTCGGGCCTGGCTTATTACTATAAATTCATGTTTGATAGGACCTATTACTGGATGTAGCCTTTCCTCATAATTTTGAAAATCAAGCAGGCATCACGCTAGGATCGGTTGAGAATATAATATTATGGGTAAGAAATTGAAGAATGAAGGTTAACAGAAAGTGGACACTGTGTTCCAAATGGAAACTAGGTAAATGATTTAGGCAGACGGAATATTTTTTGGTTGGCTGTATTTGGCTCTCAGGTTAACCGTTTGACCACTTAATGGTAATTTACTATTTAAATCGGCAACACAGAGAACAGGAAGTGAAGATGTATATATGACTGTGTTATTTGTAGAGAACCAGTTTATGGTGTAGGTTTTCTAGTTATTGTAGAGCACTTGCGTACAAGAGTTTAAATTCCGCAACATCGATAAATTCTTACCTTATAAAAAAGAGCAGGAAATGAAGACGAGATGCCGATATCATGATTAGATCTATGTCAGCAAAGAAAAAATGTCAATTATTTCCTTCTAAGCTGTCTTTCCTGTACATGGCTTCAATGTAGTGACTTTGTTTCACTTTCTCCATGTTCCAATTTCTCTTCTCTAATTTTTGCTGGACTTGTCAGATTCTCTCCAGGATGATAGAAAGTTCTTCCACTGTGTTCCATAATGTGAAGACAAGGACTTCATCCTACCTGGAGCTTGAGGCCTATAAAAAAGCAGTTAGCAATTACAAAGTGAAAGATTTCTCTGTGTACACTCATGAAGCCCGATTTGTGAAGTCTCCAGCAGAGCTGAAATTGATGAGAGATTCTGCATCTATAGCTTGTCAGGTAATGGTAGTTCTTTCATTTTTGTCAGGTTCATGGGTTAGAGTGGTAGTTCTTACTCATAGAGGTTCTTGTTTTTATTGGACATGGAAAAGCAGTCTCTGGGTTATAGAGATGGAAAGATAGAGTGTACACTTGACACTACTTTTGTATGTTTATTTGTTTTCCATTGAAGTTGATACTCTTCACACAGTTAACATGTGACTAAGGTATTGATATGCCAGCGAGGTGTTTTCAGAATTTTAAAAAGCTTATTTGCAGGTCAGCTGTTATACAATCTTAAGTAACATGTTTGTCATTTTGCTATACGACAAAATTTTTTAGAAAGGTAAAATAGGTATTTGCATTTCCCTTTTTTTCCTCTTCTTCTTTCATGTCTAGGGTGGTGTCTTCAGGTTGAAGATACTACACTTCTGAGGATCTAAAAAATATTTCACAAAAGGAAAAAGGGTACAGTCAGATAAAAGGATCACCAGTCTAAAAGAAGACGGTTCTTAATATTCCAAAAGTTGGAGTCCCAGCTTTCTTACTTGGGTCAACATATTCTTGGTCTAATTGTGAAGGAACAGTTCTTGCATGTACAATCCTTTCTTTGATAATGTGCTTCTGTGTTAAGTAGTTCAGAAGCTCTAGGCATGCTTAACCAAAAGATGTGTATATACTACTCATTCATTCTATTTCACAATCATGATTTGCATGTTTTCTTATGAGAGACTGGTCTAGAAAATGCTTCTTCCTATTCCTGGATTTGTATGCAGTTGCCTAGCAATAAAGTTGCCAGTTATATGGGAGTTGAGATATTTTCCTTTCACTAATTCAGTCCTTTTTTATACTGTATAAAGGATATTTTTTATTTCTTGATCTTTTAATGTCTGTCTTGTCTTTCGGAAACAGCCTCTCTACCCCTCGGGGTAGGGGTAAGGTCGGCGTACACACTACCTTCCCCAGACCCCACTAGTGGGATTTCACAGGGTCGTTGTTATTATTGTTGTTCTTTTAATGTATATATTTTTGGTAGGCACTTGTCCAGACCATGTTGTACTCGAAGTTGTTTCCTGATGAAGGAATGCTGTCAGCCAAATTTGAATATGAATGCAGAGTTAGAGGTGCCCAAAGAATGGCGTAAGCTTTTTCTTGTAATAATTTTTGGAAGTTTGTATATAGAGAGGAGCACGTTGCAATTTCTAAGTATTTTAGTCTAACATGAGTTGCAGGAGAGTAAATCAAAATGCCACTAAGACCTCATGTGTAAACATGCAATTGATTTTCTTTTTCTTCTATTCTCTGCGATTCTGATAATTTGTTGTTTTTCCTGACTGTTAGTTTTGGTCATACTTCTGGTTGAGATAGTTTCAAGGATTATACATTTTTCTTTTCCTGTTCCACAGGTTTAATCCTGTTGTTGGTGGCGGACCTAATGGCAGTGTCGTGCATTATTTTCGTAATGACCAGAAAGTATGTTTACTGTCTTTAAGCACAGTTGAATTTGAATATCAAGCATATTGAGTAGTAGTATCTAATTTGTTGTTTTAACAGATTGAAGATGGGGACTTTGTTGTTTTAACAGATTGAAGATGGTAACCTTGTCCTCATGGATGTTGGATGCGAGCTCCACGGTTATGTCAGTGATCTTACTCGTGTTTGGCCGCCCTTTGGAAAATTTTCTTCTGTTCAAGTAAGTATAGAATCCATGATTTTCTTCTCCGTTTTCCCCTTAAAACTCAAGTCAAACCCCACTCCTCTGGGTAAAAACCCTGTTAGCTGATCAAAGTCATAGACAACCTTCCATTTCAGAAAGAATGCACTGACCTAAATTGAAACCCCAGACTACCAAAAGCAATCTAGAAAGACAAACGGTAAAATGAAAATCATCAGGTAATGTAGCCTAGCAGCTAGCTTCACCCTCCAGTGGTATGAGTTATGAATCTTAATTCCAGATCCTCAATGGCCTTGCTGATTTGATGTGGATGTGAAGAATGAGAATGATATAATCATATAAAGTTCCTCCTTATTAGAAAAACAAAATTTCAATTTTACTTCTAAGCTACAGATTATGCTTGAGAAATAAATTCTTCCCTTGGTATGGTTTTAAATTGCTACTTTTCTGTGATATAGTCTCTATACATTATTGTCTGGAACTTAGTGATACGCTCCAAGATACATTTCAGGAGGAACTTTATAATCTTATTTTGGAGACAAACAAGGAATGCGTGGAGCTGTGCAGACCTGGCACAACCATCCGAGAAATACACCACTACTCGGTACTATTTTAGTTAATCCATCTCGTAATTTCTTTTGGTTTATATTCAAGGGGTAGCTGAGAGTAGGAATTTAATTTTTTTTCTCTTGCCTTTCATAGACTTAGACCCAGTTATATTGCCAAGTTACATTGGTAGTCTCGGTGATAGAAATTTGGGTACAGTTGTGAAGGCCCTACTCTTGCTTTATGTTTTGCCTAATTCTCAAGTTACACTGACTTCCACCTCCTATTGTGAATAGCAATGTTGCTTCAAAGTTTTCGTTCCTATGCATGAGCCGAGGAAAATGAGGCTTGATGGCATTTTCGAAGATAAGGAAAAAAGTTTAATTCCTTTTAAGCCTTTGAGTATAGAGGTGTTGGGAAGAGATAGATGCCTTTTGATTGCCCTCCCTTGATTTGAAAATAGTATTTGTTCCCCCATTTCTTCATATATGATGAATAATGCTTTGTAAAATAAGCCAGTAAGGTAATGATTAGAGGTGTCTAATTAGTGTAGTGTGTGGTAATTGATTCTCAGTGAAGGGTAGTATTACCTGGTTGATGCAAATGCTTTATGAATTAAGGGTCATTCTCTGCATTTGTTGTTAGGGCAGCTGTTAAAGGTCTTATTACTGTTCAAGATATGGGCGTATATGGTATATATTCTGTAGTGAGTAATTGCCCTATATCAGCATGCTCTTTTCTTTAGATACTTGAGGACTGCCAAGGTCTCATTCTTTTTTTTTTATTTGATGTGTATAGGTAGAAACGCTGCGAAGAGGATTCAAAGAAATTGGGATACTAAAAAATGATCGGCGTGGAAGATATGAAATGTTAAATCCTACAAATATAGGTCTTTCCTTTTAACCCTTACTCTTCCGCTGCAGATATAAGTACAAATGCATGTGCAATAGCAGCAGAAACCTGCTCCCTCTCAATTGTCTTCACAGTTGCTAATGCTATTCCTATTATGCTTTTTGCTGAAAAGAGAAATGATTTCTTGTACCGCTGCAACCATCTCTGAATGAATTTGGTTTGTTCAATTATATTTCCAGGTCACTATCTAGGAATGGACGTTCATGATTGTTCTACAATTGGAAATGATCGACCTCTGAAACCTGGTGTAGTAAGTTTCCTTCCTTACTGATGATTGCTTTGATATTTGAAAATAATCGGGAAACTGCTAGGTTTGCAAAAGAAATTGGTTGTCATTATTTTGAAATCCTCCTAACGAATGAGGACCAGTGACTTGCTCATTTGGAAACAAATGAGCTTTGCCATAATGCATCACTCTCTTTTAGCAATTTACAATGAGTTTTCCTCCATAGGATAGTTCAGTCATTCTCTCTTTGCTTCTTGACTGGATTACAATATGAACCAACTAAGAATGCTTATGTTTTTAGAGCCATGTGGTCAAATTTCCCTTTTCCTTCACTTTTTTCATTTTTATAGAGATGCAAAGGGTTAAAAGAGAGGATTGAATGATGATATATATGAATATTTTCTAAACTGCTTTTGTCATATGCACCTCTTTGGTCCTATTGCAGCTTCTAATTTCATCACTTCTACTAGTAGTTACATGGAGAAAGTTAAATTCAGAAATGAAGTTGGTTAGGTTACATGGAGAAAATTAAATTCAGAAATGAAGTTGGTTAGGTTTATTTCTATTGAAGGATGTGTACATCAGGGTAGTGCGTGTATTTGCATAAAAATTATGTTTGAAATATCTGACGGTCCAATTAGCAGAAAGATCAAAGTATCTTTTGCTTCTCTCTAGATTCTATATAGACCTTTGTTTTGATTGATTAATTTGAAATATTTGAAATGATTAATTCCTCGACCGCTTAGTTCATTAACCCCGGCTTCATACACGATGCAATTTTTGTTATTGATAAAGGTTTCGTTCCTGGTATGTCTACTCTTTATCAAAAAGTAAAGCTCTTGTATATCTTTTCATTCTGAGTCACAAGGAATGTAATATTCCTGCAGAATGTGTAATGTTATATGCAATTGTAGTAAAATTTCTCAGTAGCTCGCGCATCTTGTTTTCATGTTGGTACTGCAAACTGTTAACTTATACTTTATGTATAATCCATTTCATGACAAGTACCTTGCTCTTGTGAAAGGTACTTAACGTCCAACATGCTTGCTTCCATGTATGAAAAATTATTAGTTATCCCATTTTGCTCCTTTTTCCTTCATTCTTCTAATCATAAAAAATTGGAATATGCTCCCGACCTGTCTGATATGACAATAAAAACATACACAATATATATCAAGTCAGCTGTTATATGCAAAATTACTGAAGGTAATTCCAGTAATACAGCTTATTGGTGTCAGCGGTAGATTTATGTTAAATTATGCTTTAACTGAGGTCTATTTTGCCAGTGATATCTGTATCCATGCATTTGTTTTTAGTCCTTACAAAAAACAATTTCAGAACTACTATGTTTTTGAATAAGAAGCACCATGAACATGCTACTTAGAGGTCTTAGTTTGTATAATTTATGTTGATCATATCTGTGGAGAAATATGCTAATTTTGCGTGGGCCGTGGCATTTGTATTTGAAAGAAGTACATGACTTTTGATTGTTCTGAGATTATGTGCAGAGTCTAGTTCTTTGACTTCGGGACTATGGAAAACTTTGTGTTTACTCTACATTCATACATCTAGACAAGGTCATGGCCAATTGCGAAACATGCTTTACGTTTTTTTAAAAGTGACGGAGACGCATATAGCTCAGAATGATGAGATGAAATTGGACAAAGCTAATAGTCTAGGATAAAATTGCCTGGTCATTGTTTAGACATTTGTAAACTCCTGTTCCCTCTGTTTGTTTCTACATTAACCTTGATGAGGCGGTCATTGATAAGAAGAATCTCTGACCCCATAAATAGATGAGTCCTTTTCATCTTAGCTTCCAATTACTTGTGATTTCTGCAAGAACTTGTGATCAATCTTCATGGACTGTTATATGTAGGTCATCACAATTGAACCAGGAGTATACATCCCTTCATGCTTTGATTGTCCAGAAAGGTAATACTTGTTACCTCATCAAATTAATGTTCCTTTTGGCATGCATTCAGAAGTTACTGTATCTTAGATCATCCTCCAGATTCTTGGTTTATTGAACTGGTTCTATCTGCCAAAATGTATTATGGTGGATGGACGAAGAGTTACTCTTTCATGCAGAAAAGATTGAGAATCATATAAAGATGTCTGTTCTAGGTGGGTCAGGTAGATCTTGTAGGTCTAGTCAACATGTTAATCTAAGGACATTGTAGCAGAAGTAGTCGTATCGTGCAAAGGAGTCCCAGTTTGGTTGGGCTTTCAGTGAATGACTGATGGCTACTTGTATGTTCAACCGAGAATGGGATCAGGAACCTTTACTTCGAAATTTCTTTTGCAAGGTTCGAGAAAACTTCCCAGAAACAAATAGCTCGAAAGAAATATGAAAGAAATATTCCACGTATCAATGGCATATTCTGGCTGTCCCATTCTAGGGAAAATTGTTGCTTTTGTCATAAATCTTAGGGAGAAAGAATTATTGCTCACTTGAACTAAGAAGCTTCATCTGCTGATTCATCTATTGTTAAGGAGCTAGATTATTCTCTTACCCCAATAAGGAACAACGTGTCTGTTTCCTTAGAATTCTTGATTTTTACTCACTTATATATGTCTATATTCACTTATGCTCATTCAGGTTCCAAGGCATTGGATTTAGGATTGAAGATGAAGTCCTTATTACAGAATCAGGTTATGAGGTATAGTTACAGAAATCGTTCAATTGTTTGAACAACCGAGTTATACAAGTACCAGTTCATATGATCTCTGATACTTTGATCACTTCCGACACTTGTTAGCATTCAAGACCTGATTTTCTGCCCTACTGGAAACAGGTACTTACTGCATCCATACCGAAGGAAATTAAACACCTCGAGTCCTTGTTGAACAACTTTGGCAGTGGGAGAGGAACAGAAATTAGAGCTGCTCTCAGT
SEQ 78
CTACTCACCTCTCACAAAAACCATATAATTCTCCTTCCCTTTCTTCTCTACAAAATCTTCATTTCTCTCCAAAAACAAACTCTCATGGCTTCTTCTACTAGAGTTTTTGTTCTTCTCCTTCTCATAATCTTCAACTTTCTCTACATCTCAGCACAAAAAACCATTAAACATAAGCCTTTTTCAATGTCATTTCCTCTTACTTCAACATCTTTATCACATAACTCTTCTTCTAAAGCTCTTTTTCTTTCTTCCCTTTTGGCTTCTAATCAAAGAAAACAAGCTCCAAACACAAAAACTGTGTCTAGAATTCCATCTTTGAACTATAAATCAACTTTCAAATATTCAATGGCTTTAATTGTTACACTTCCAATAGGGACACCACCACAAAATCAACAAATGGTTTTGGACACAGGCAGCCAACTTTCTTGGATTCAATGTCACAAGAAAATTCCAAAAAGACCCCCACCAACGACGTCGTTTGATCCTTCTTTGTCCTCCACTTTTTCTGTTCTTCCTTGTACTCATCCTTTATGTAAGCCAAGAATTCCCGATTTTACCCTTCCAACTACTTGTGACCAAAATCGCTTGTGCCACTATTCTTACTTTTATGCTGATGGTACTTTAGCTGAGGGTAATCTTGTCCGTGAAAAAATTACATTTTCACGTTCCCAAAGTACCCCTCCTTTGATTCTTGGTTGTGCTACGGAGTCCGAAGATGCCGAGGGTATTTTGGGAATGAATCTTGGACGGTTTTCTTTTGCCTCCCAAGCTAAGGTACAAAAATTCTCATATTGCGTGCCAATTAGACAAGGTAGCCATGCAGTTAAACCTAGTGGAACATTTTACCTAGGCCAAAACCCTAATTCCCATACATTTCAATATATAAATCTTTTGACTTTTCCTCAAAGTCAACGCATGCCAAATTTGGATCCACTAGCTTTCACTGTTGGCATGGTAGGGATAAAAATTGGCGGCAAAAAATTAAACATCTCCGGTAGGGTTTTCCGGCCAAATGCTGGTGGTTCTGGCCAGACGATCATTGATTCCGGCACGGAATACACTTTCTTAGTGGAAGAAGCGTACAATAAGGTCAGAGAAGAAATTGTTAGGTTAGTTGGTCCAAGATTGAAAAAAGGTTACGTTTATGGTGGTGCACTTGACATGTGCTTCGATAACCGTCCGATGGAAATCGGACGGTTGATAGGTGATATGACATTGCAATTTGAGAACGGGGTTGAGATTTTGATCAATAAGGAAAGGATGTTGGATGAAGTAGAAGGTGGGATCCATTGTGTTGGAATCGGACGGTCAGAATCACTCGGAATAGCAAGCAATATTATTGGTAATTTCCATCAGCAAAATTTATGGGTAGAATTTGATATGAGAAATCGAAGAGTAGGTTTTGGCAAAGGAGAGTGTAGTAGGCAAATG
SEQ 79
ATGGCTGCACTCAATTTCTTCATAATCTTCACATCACTAGTCTTACCAATTGCATCTGATCCTCTGTTGTCAACTTATGTTGTCCATGTTGACACCAAAGCCAAGCCATCTCATTACTTAACTCAAGATGAATGGTATAATTCAGTGGTTGAGTCAGTTCTTGCAAACAAAATGGACTCAGATTCTACTTCTCCAAGATTGTTCTACTCATATGATGTAGTGTTACAAGGTTTTGCAGCAAGATTGACTGATCAAGAATCTGAAAAACTAAATAAATTTCCAGAAGTCATTCACATTTTCAAAGATCAGTCTAGAATCAAGCTTGACACAACACGTTCGCCGAATTTTCTTGGCCTAAACACAGGTTATGGTCTGTGGCCACAATCTAACTTTGGAGATGATGTTATAATTGGCCTTGTTGATACAGGGATTTGGCCTGAGAGTGAGAGTTTCAAGGACAATGGTATTGGTCCTATTCCAACAAGGTGGAAAGGTAAATGTGTTGATGGAATTGAATTCAACGCGACGAGTAGTTGTAACAGAAAACTTATTGGTGCTAGGAATTTCGTTAAGGGTGTTGAGAATGACTATCATCATCAATCGGCACGAGATCAAAATGGACATGGAACACATACTGCTTCAACTGCAGCAGGTACAGAGGTAAATGGTGCCAATGTATTTGGTTTTGCTAAAGGGAAAGCACGAGGGATTGCGAGTAAAGCTAGGATTGCAATGTACAAAGCTTGTGGGAGTAGTTCTTGTGCAGAATCTGATATTTTAGCAGCTATTGAAAGTGCTATAAAAGATGGCGTAGACATACTTTCGCTCTCTTTAGGATACGATGATGCTCCGTTTTATGAAAATCCAGTGGCAATTGCAACATTTGCTGCTGTTAAAAGGAACATATTTGTTGCTTCTTCAGCTGGAAATCTTGGACCTTATCCATTTTCAGTTCACAATACAGCACCTTGGGTTACAACAGTTGGAGCTGGATCACTTGATCGCGATTTCCCCGTTGAAATCAACTTATCAAACAACAAGACTTTTGTTGGTTCTTCTCTTTATCCAGGGAGAATCAGTGGTAAAAGTTACTCTCTTGTTTATATTGAAAATTGTTCTATAATGACAATCGATCGTTCTAAAGTTGAACGAAAGATTGTAGTTTGCAACACTAGTAAAATCGAAGCTCTTAGAAATGGGATTTTAATTCAGAAAGCAGGTGGTTTTGGACTGATTCAATTAAATCTTCCAACTGAAGGAGAAGGGATTAGAGCAATGGCTTACACATTGCCTTCTGCAACATTGGGTTATAAAGAAGGTATAGAGCTTCTTTCTTATATCAAATCCAATGCTAATCCAAGAGCAGGGTTCGTACGTCGAAAGGATACAGTAATTGGGAAAAAAGTTAGAGCTCCAATTGTTGCTAGCTTTTCTTCAAGAGGGCCTAATGTTGTTGTTCCTGAAGTCCTCAAACCTGACCTCATTGCTCCGGGTTTGAACATTCTTGCTGCATGGCCAGGTAACCAGAGACGGATCCAGGATTTATACCTTATGCATTCAACCTTTATTCTTTACCATTGACCCCACGACACTTTTAAACTTATGAGGTAGGAATTTTATACTTTTTGAAATTGTTGTGATTTTTCATATTGCGTGGAAGCAACCACTAATGCTTATGGTAGGATAGGCTGTCTACATCACACTCCTTAAGTGCGGCCCTTCGCCCGACCCTGCGTGAGCAAGGGATACTTTATGCACTAGTCTACCGCTTTTCTTTATTTAGTGATTCTTCACATTGTGTGTGTCTATGCAGGTGACATTTCCCCAACACGTCTCAAGATGGATCCAAGGAGAGTGAAGTTCAATATAAACTCGGGAACATCAATGGCGTGCCCTCACATAGCCGGAGTAGCTGCATTAGTCCGCGCTGTTCATCCAGATTGGTCCCCGGCTGCTATAAAATCCGCACTCATGACTACATCCACAGCATTCGACAATGCACAACTCCCTATCATAAAACACGAAGACATGGAGCTAGCAACTCCGATCAGCATTGGAGCCGGGCACGTGAACCCTGAATCGGCTATTGATCCGGGCCTAATATACGACACTGATACATCAGACTACATCAACCTACTATGCAGCTTGAATTACACAGAGAAACAAATGAAACTTTTCACGAACGAGTCAAATCCTTGCTCGGGTTTCACTGGATCTCCACTTGATCTTAACTATCCATCACTTTCTGTTATGTTCAGGCCTGATTCCTATGTTCATGTAGTTAAGAAGACACTGACACATGTCGCGGTATCTAAGCCCGAGGTGTACAAAGTAAAGATAGTGAATCTGAATTCTGAAAAGGTGAGTTTAAGTATAGAGCCAAGGAAGCTGATTTTCAATGAATCTTTACAGAAACAAAGCTATGTGGTCAAATTTGAGAGCCATTATGCATTCAACAGCAGCAGGAAAATAGCTGAGCAAATGGCGTTTGGTTCGATATTGTGGGAGAGTGAAAAGCACAATGTTAGGAGCCCCTTCGCTGTTATGTGGGTTCAGCAAAATTTCAATAACAGTAGATTATACAAA
SEQ 80
TCAAAATGCCAAATCAAATATTTGCTTGTAGTCATCCACAAAGTGTACATCAAGGCCTTCCTTGACATTAGGAGCAAGCTCGTCAAAATCTCTGCGATTGGCTGAAGGGAATATTATAGTTTTCACATCACTTCTTCTCGCTGCTATGGCTTTCTCCTTGACCTATTATTGAACCAAAAGGTAAAGCTTTTGTAAAATCAAGGCACTTCAGAAAAGGAACCCAACACTGGTGGGGTCAAAAAGAAAAGACCACTCTTATTATGCTTACATGCACATGCATGTAAACACATACACACAGAGGAAATAATTAAAGCATAAAGAACATTAAAGGCAAAATCAACAAAAAATAAAAAGAAAACATGATAGACGATCAGACATAAAAGCTGCATGAGATTATAAGGAGGTGTAAGAATTGTTAACATACCCCACCAATAGGAAGAATTTTTCCAGTTAGTGTGACTTCCCCTGTCATTGCCAGGTCCTTTTTAACAGGCTTTTTCATGGCAAGAGACAACAAGGACGTTATCATAGTACAACCAGCACTAGGGCCATCCTTGGGGGTAGCACCTGCAGGAACATGAAGATGAAGCTTACTATTTGCAAAGAATTGGTTATCAGGCTCCTTTTCCAGCAAAATGGTCCTGGCAACCGTATGGGCAATTTGGGCACTTTCTTTCATAACGTCGCCTAGTTGTCCTGTTACATTGAGAGCCCCTTTCCCTTCTCCTTGCTCCACCAGAGATGTTTCTATATAGAGTGTTGAGCCACCCATTGAAGTCCAAGCAAGACCCATCACAACTCCAACTGGTGTCTGATCGTATATGCGCTCCGCATGGAAAACAGGTTTGCCAACATAATCAGCTAGGTTTGGCGAGTCAACAACCACTTTATTTACTGTTTTACTCGCTTCACTTTCTGTTGCTTTCTCAGTTTCCTGAGTATCCTGTTAAGACCACAACCACTCAGAACCAAAATTCAGTTGCAAAGACACAATAGTCATTATATTAAATGCAGATAAATATGTAAACATAAAATCATGCAACGCCAAGGAAATACAAGGACATTATTCATTAGCAATTTAAATTGGTAGAATTCTATATATTTTTCTTTCAGTACAGACCAATATGAGGGTAGAAATAACATCAAACATATCAAAATGTATGTGCGCGCGCTAGATTATAACAAGTGAGTTAACATTTGGCACAACTTCTTAAAACGAATTTGACTATCAAGTCTAATTAGCCAGACTAAAGAAACCCCTCCATCCCAAAGAATTTGATTAACTGATTACAGTGTATGATATAAATGCAAATCATGCATAGATACTTTTTAAGATACAAGGGGGACATAGCATAGAAAATGAATACCAGGCGCTTCCCTTGCTTGTTCTTAGTTAACATTTCAAAAGAAAGTGCTTCCCCTGCTTGTTTGGTGTGATATTGCCAGATAAGTGATTTCTTACGTTTTGTTAACCATATTTAATCCAACATGAGACAGTAAGACAAGTTGTAAAAAGACACTTCTGTGCCTACAAAACTACCTATTACACTGCATCTTGTATTTTTATTTTACGCGTAGATAAAACTTAGAAGCACTAAAATACATTGTAGCATACATAATTACAAATAGCAAGAATTACCTGAAGTACTAAACACTATATTTTTGCTTTTTAATGTTTTTCATTTACCTGTAAACAAGTATCTTCATTAGCAGACTTCTGCGTTTGATTCTCTGCTCCCTGTGCTTCAGCTTCAGCATTATTTTCAGAGGCCTCATCATCATTGCTACCGTTTGCTGACTCAGCTCCAGCTTGAATTTCTTCCTTAGACTTGATTTCGTCTGATAGATGGATAGATTCTGCTTTTACCTCATCTACCTCCGCATTCTGAGGCTCAATCTCTCCATCTTCTCTGACAAGCTTTAGAGCTATCTGAAAAGCAAGAACAGGATGTGCAAAGTATCAGAGCATGAAAACGATTCAAAGTGTTCGTGTTGGATTTCATGAAATAACCATCAGTTTTCATCAAAAGTTGACTAGGCACGAGACTCGGACAATTTCATCATTAGATGTGTTATACAAAAGAGGACAGGTCACAGAACCTTGCGATAAATTTTTTCAATCTGCTTCTGCAGATTGCGTACACCGGCTTCTCTGCAGTAATTTTCTATTAAAGCAAGAAGAGCTGAATCGGTCACTTCAACCTGAAATAAAGAATTCATGAAGCCAAATAAAATTAGCTTATTGCAAGAAACACTTCAAGTGTAACTTGTGAAATATTTTGAAAAACAACATCAAGGCATCAATTGAAAATCTTTCTACAACCACATATACTAACTGCTAGGATAGTTACACGTGTTAAAATATTTAAACCAAACATCAACCATTCAAGGCTTTGAATTGCAATCAATTGCTAATATAAATCATGCACACTTAATTTTCCAGCTTGATATCACATCTTTTTTTTGATAAGGTGAAGATTTTATTAAAAACAGTATCAAGCTGATACTGTAAAAATACAAGGACACTGCTGGCTTAAAAACATTAAAATCCTAAGCGGTCTAGCATGTCCAGCTTGATATCACATCTCTGCATCTGAGAAACAGCCTCAGCATGTTAAAAGCCTGAGAGACAAAGAGATGACATATACATGGACTCAAATGTTGCATCTTTTCAACTAGATTACATATCAACTCGAGTAATTTTACAGGAGCATTAGGGAGCTAGCATTTCCCGAGAAATGGTACCCAAAGCTCTACCCTAGTCCTGTCATATGTTTGCCATTAGATAAGCATAAAGCCACTACTATGTCATCAAAAGTACCAACAGCCTCAAGATACTGAAAAAATTAGAACTGCTCCTATCCGCTCTGGAGGGCCAAAACAAACCAAAGTAGCCATCAGCCCTCCCACATATCAACTTATTAACTTCTGACGCCCATATTTAGGGTGGATTTTTTTTTTTTTTTTTTTTTTTTTGGGGGGGGGGGGGGGGTAGGCACAGTAAATGACACCGCAGATATAGATGAGGTCAAATGTGTCATTTTGATGCCCTTAATCAGTAAGTATGCAGCTCCCTTTTCTTCACCAGACATCCATATGGCATGTAACTCAACTGAAGATCACATTAGAGAGCAAAATATAAGATCCACCGAATTCCATTGATCTCATCTTTAAGGCTGTGGGAGAACATATTTACTGCAGTCATCTTATAACTTAAGCTCCCTCAATAGACATTATCAAGTGTTTTTTGGAGTCATCACCACCAATCTTGGCATTGTATGATGATTCCTTGTACAGTAATAAAACATTCACTACTGTGTACTAGTCTAGTTTTTCTTTTACACGATGGTCAAATTGAAGTCAACAACAAAAAAAACAAAACACTTGGCATATTTTAAACATCCATCCTGCTTTAGTCCATGCCACACTCAAATCTGTTAGACAATACCATCCAGCAACCCCAGCCCACCAGGCACATGTATCCCTACCAGTAAGAAACTCATTATACCTCCATCTACCAATAGAGATTGGCGACTACCACGACAGATCCACCTTCAATTGCGTTTATATTCCCAGTAAAGTGTGGCACAGTGGTTGTGTTAATGATGCACGAGCACATTGGCTAGCAAAGGCAGTCACAGTTTGGCTCGGTCAAGCAAACATTCCTCAAGAGAAAGCAGCACTTGAGAGGATGTATACAGCAATGTGGACTCATTCAGTTGTTTATGTGAACCACTTAAATACTTCTTGAAAAGTATAGTGATGGAAGAGTTTGTATTAAACTTTATAATCCATTAGCTTTGGAGGTAAGTATATTCCAAATGTAGAGGCTTAAGATGAAAATGATATCAATACATCAAACAGTATATTTCCTAGGAAATATTAAAGCATATGGCCAAAAACCTATAATGGGATTTACTATATCCGTGCGATTTTTTTCCTTTAGCCATGTTCCATATGGAAGGGTGTAAAGAGCTTGATCCCCTGAACTTTCCAATCATATGTTTTATTCCATAAACTTCTAACTGAATAAAGCTTTTCACATTTTACTTTAAGAAGTTACATTCTTGCCCAATCATGTTTCAAACATTTTCCCAACAGACCAATGTCTGCTTCTGAAATGTCAAAAATTGGGAGCGGAAAAAATACAAAACTATTCACCCGATAATGAATATGCACAGGAGATGATGAAAGAGATGCAACTAGAGACGTTTCTATAAGACATACCTGCTCGGGCTTGATCCCACATGTTTCACGAGTAGCTTTCTCCAAATAATCCCTGGCTATGTGCACTTTCTCATCCGTAATGTAACCAGCAATTGAAATTACTTCCATTCTATCCAAAAGAGGATTAGGTATCATTTCTACAACATTGGCTGTGCAGACAAACAAAACCTACAGCCGCAAAAAGGAAAATGATCTACCTCAAATTATGCACTCAAAAAATGGATAATGTGCAGATGTCATTCTCATATAAAGTACAGTGGAATAGGTAATTGTATTAAGCAAAGAGGATAGAAAACACTGAAGGTGTCAAAGCTCCTTTTTTTCAAATTTTGGAAGTACATTAAAACCTAGAGGTTGGATTTTATAATTTCTTCAGGTTCATATATAACTCAACATCCATAACAAGGTTTTGATATAGCCTCCCTCCCCATAACCAAGCCAGACCCAAAATGCTACTAAAATAATCGAAAGATGCCATCTTCTTTCGACAGAGCCCTTTGGAAATATAACACAAATATTTACCATTATGAAAAAGACATAGTAACACACAGTAAATTATCACATTAAAGGACAAAACAAATGAAATTGTATGCGAATATACCCATCATCCTTCTCTCTCTAGCCATGTTGTTGATTTGCTGATGTTTGATATAATAAAATGAAATAGTACCGGTAACTGCAAAGTCGTCATATTTGAATTATGGGCCAAATTCTTTAATCAATAGGTAAACTAACGCTCAACACTCAAAGTCTATCGCATAATTGTAAAGTCGTCATATTTGATTTATGGGCCAAATTCTTTAATCAATAGGCAAACTAACGCTCAACACTCAAAGTCTATCGCTCCAATTATCGGCAACTTCAAGAGGAATAAAACCATATCCCTCATCAGTAAAGCCATCAAAGATAATATGAACAGACAGTAGGAAATCTCAATGCATCAACCTTTTTCCTATTGAATGATCACCAAAACCAATGCCAATGAAACCTAGTTTTTCTCAAGTGTTAGATATAGAAATGTAGTTGTCCCACATTGGAATAGGTGTAGTATGCCTTTGTATAGAGTAGCTATAAATAAGCCCATCTTGTATTGCATTAGACACACAATATCAATATATCATATTTTCTCCCGTGTCTTCTCACATGGTATCAAAGCAATCGTGAGAGATTTATCGTTGTGCATAAATTCCAGCGACTCCGGGAAGGAAAATCAGTTGACCGGAAGCCTTTTCCGGCAGGTCTGCCGCAAGTAAAAAAAAAGCCACTTCGTCAGTGTTGTGCAAAAAAACCAACACCACCACGAAGTAGATCGGGCTCTGGCAACCAACCCATAAAAAAATCTCCGTCAGAATACCCTCCACGCGCCGTCACTTGCTACCGGAAGAAAATTTTCCGGCGAAGTTCCGACGTCGCGTGGGCCACCTTCCAGCCATTTTTTGGCGACGACTCTTCAGGACAAATTATTCCCCTTGCAATTCCGAGCCTACCCATCCAGGTTACACCAAATTCCAGACAACTTATATATTTTTTCCAGCATGCATAGTGATTTCAAAAGTGGACTTCCGGCAATTTTTTGAAAACGTTTCTTCAGAACAGTTGGGTCATCTGGTAATTCCGATCCTACCCCTACTGTTTTTATTTCATTCCGACCACTTTGAATTTTCCCGGCAGCTACAGTACTATTCCGACTGCTACAGTAATATTCCGATAGCTACAGTATTTCCTTATTCTGTTTCACTGTTCCTTACTCTGTTTTAGTGGATTAAATTTGATTATTTCTATAATTTGGTAATAATTTGCAACGATGTCTATGGGAATTGATGCTTTTGGGTCTAAAAACATGAGTTCTGGAAGCTCTAGTGTTATGATTACTTCAAAACCTTTAATGTGAGGTTCAAACTACTTAGCTTGGGCTTCATCTGTCGAGTTGTGGTGTAAAGGTGAAGGTGTTCAAGATCATCTAATTAAACAGTCTAGCGAAGGAGATGAAAAGGCGATAGCGCTTTGGGCAAAGATTGATGCTCAATTATGTAGCATCTTGTGCCGTTCTATTGATTCCAAGTTGATGCCTTTGTTTCGTCCATTCCAGACATGTTATTTGGTTTGGGCAAAGGCTCGTACCTTATACACTAATGACATATCTCGCTTCTATAATGTGATATCACGGATGACAAACTTAAAGAAGCAAGAATTAGATATGTCTACTTAATTGGGTCAAGTACAAGCAATCATGGAGGAATTTGAGACATTAATGCCAGTTTCTGCTAGTGTGGCAAAACAACAAGAGCAGCGACAAAAGATGTTCTAGTTCTTACACTCGCTAGACTTCCTAATGATCTTGATTCAGTGCGAGACCAGATTTTGGCTAGTCCGACTGTTCCCACAGTTGATGAATTATTCTCTCGATTACTCCGCCTTGCCGCACCACCAAGTCACCCAGTGATCTCATCACAAATACTTGATTCCTCTCTCACATCGCAGACGGTGGATGTTCGGGCGTCTCAAACTATGAAGAACAGAGGAGGACGAGGTCGTTTTGGGAGATCTAGACCCAAGTGTTCTTATTGTCACAAACTTGGATACACTCGTGAAATGTGCTATTCCTTACATGGTCGTCCACCCAAAAATCTTACGTTGCTCAGACTGAGACTACATGTAACCAAGGTTTTTCTGTATCTAAAGAAGAATATAATGAGCTCCTTCAGTATCGAGCAAGTAAGCAGACATCTCCACAAGTAGCCTCAATTGCCCAGACTGATACTCCAGTTGTTGGTAATTCTTTTGCTTGTGTTTCCCAGTCTAGTACTCTTGGACCATGGGTCATGGACTCAGGCGCTTCTGATCACATCTCTGGTAATAAATCACTTTTGTCGAATATTGTATATTCACAGTCTCTTCCCACTGTTACTTTAGCCAAGGGATGTCAAACTAAGGCACAAGGAGTTGGACAAGCTAACCCATTGTCTTCTATCACCCTAGATTCCGTTCTTTATGTCCTTGGTTGTCCTTTTAGTCGTGCATCTGTTAGTCGTTTGACTTGTGCCCTCCATTGTGGTATATATTTATTAATGATTCTTTTATTATGCAGGACCGCAGTACGGGACAGACAATTGGTACAGGACGTGAATCAGAAGGCCTTTACTACCTTAATTCACTCAGTCCTTCCACAACATGTCTAGTTACTGATCCTCCGGACCTAATCCACTGTCGTTTAGGACACCCAAGTTTATCCAAACTTCAGAAGATGGTGCCTCTTTTAGGACACCCAAGTTTATCCAAACTTCACAGTCTACATTAGATTGTAAGTCGTGTCAGCTTGGGAAACATACCTGAGCTTCCTTTCCGCGTAGTGTTGAGAGTCATGTAGAGTCTGTTTTCTCCTTGGTTCATTCTGATATATGGGGTCCTAGTAGAGTCAGTTCAACCTTGGGATTTCGTTATTTTGTTAGTTTCATTGATGATTACTCAAGATGTACTTGGCTTTTCTTAATGAAAGATCGTTCTGAGTTATTCTCTATATTCTAGAATTTTTGTGCTGAAATAAAAAATAAATTTAGTGTCTCTATTTGCATTTTTCGTAGTGATAATGCCTTAGAATATGTATCTTCTCAGTTTCAGCAATTTATGACTTCTCATGGAATTATTCATCAGACATCTTGCCTTATACCCCTCAGCAAAATGGGGTTGCAGAGAGAAAGAATAGGCACCTTATTGAGACTGCTCGTACACTTCTAATTGAATCTCGTGTTCCGTTGTGTTTTTGCGGCGATGTAGTTCTCACAGCTTGTTATTTGATTAATAGGATGCCTTCATCTCCCATCAAGGATCAGATTCCGCTTTCAGTATTGTTTCCCCAGTCAGCCTTATACCCTCTTCCACCTCGTGTTTTTGGGAGCACATATTTTGTTCATAACTTAGCCCCTAGGAAAGATAAGTTAGCTCCTCGTACTCTCAAGTGTATCTTCCTTGGCTATTCTCGTGTTCAGAAGGGATATCGTTGTTATTCACTTGATCTCCGTAGGTATCTTATGTCAGCTGACGTCACATTTTTTGAGTCTAAACCTTTCTTTGCTTCTGCTGACCACCATGATATATCTGAGGTCTTACCTATACCGACCTTTGAGGAGTTTCCTATAGCTCCTCCTCCACCTTCGAACACAGAGGTTTCACCCATACTAACCATTGAGGAGTCTAGTGTTGTTCCTCCTAGTTCCCCAGTCACAGGAACATCACTCTTGACTTATCATCGTCGTCTGCGCCCTACATCAGGCCCAACTGGTTCTCGTCCTGCACCTAACCCTGCTCCTACTGCGGACCCTGCTCCTAGGACACTGATTGCACTTCGAAAAGGTATACGGACCACACTTAACCCTAATCCTCATTATGTTGGTTTGAGTTATCATCGTCTGTCATCTCCCCATTATGTTTTTATATCTTCTTTGTCCTCGGTTTCCATCTCTAAGTCTACAGGTGAAGCGTTGTCTCATCCAGGATGGCGACAGGCTAGGAGTGATGAGATGTCTGTTTTACATACAAGTGGTACTTGGGAGCTTGTTCCTCTTCCTTCGGGTAAATCTACTGTTGGCTGTCGTTGGGTTTATGCGGTCAAAGTTGGTCCCGATGGCCAGATTGATCGACTTAAGGCCCATCTTGTTGCCAAAGGATATACTTAGATATTTGGGCTCGATTACAGTGATACCTTCTCTCTTGTGGCTAAAGTGGCATCAGTCCGCCTTTTTCTATCCATGGCTGCGGTTCGTCATTGGCCCCTCTATCAGCTGGACATTAAGAATGTCTTTTTTCACGGTGATCTTGAGGATAAGGTTTATATGGAGCAACCACCTGGTTTTGTTGCTCAGGGGGAGTCTCGTGGCCTTGTATGTCGCTTGCGTCGGTCACTTTATGGTCTTAAGCAATCTCCTCGAGCCTGGTTTGGTAAGTTCAGCACGGTTATCCAGGAGTTTGGCATGACTCGTAGTGAAGCTGATCACTCTGTATTTTATCGGCACCCTGCTTCAAGTCTATGTATTTATCAGGTAGTCTATGTTGATGATATTGTTATTACTCGCAATGATCAGGATGGTATTACTAATCTGAAGAAGCATCTCTTCCAGCATTTTCAAACTAAGGATCTAGGCAGATTGAAGTACTTTCTAGGTATTGAGGTTGCTCAATCTAGCTCAGGTATTGTTATTTCTCAAAGGAAATATGCTTTAGACATTCTTGAGAAGATAGGGATGATAGGTTGCAGACCTGTTGATACTCCAATGGATCCGAATTCTAAACTTCTGCCAGGACAGGGGGAGCCGCTTAGCGATCCTGCAAGCTATAGGCGGTTGGTTGGTAAATAAAATTATTTCACAGTGACTAGACCCGACATTTCTTATCCTGTGAATGTTGTAAGTCAGTTTATAAATTCTCCCTATGATAGTCATTGGGATGCAGTCGTCCGCATTATCCGGTATATAGAATCGGCTCCAGGCAAAGGATTACTGTTTGAGGATCGAGGTCATGAGCAGATCGTTGGGTACTCAAATGCTGATTGGGCAGGATCACCTTCTGATAGACGTTCTACGTCTGGATGTTGTGTTTTAGTAGGAGGAAATTTGGTGTCCTGGAAAAGCAAGAAACAGAATGTAGTTGCTCGGTCTAGTGCAGAAGCAGAATATCGAGCAATGGCTATGGTAACATGTGAACTAGTCTGGACCAAACAATTGCTCAAGGAGTTGAAATTTGGTGAAATCGGTTAGATGGAACTTGTGGAACTTGTGTGCGATAATCAAGATGCCCTTCATATTGCATCAAATCTGGTGTTTCATGAGAGAACTAAACACATTGAGATTGATTGTCACTTCGTAAGAGAGAAGATACTTTCAGGAGATATTACTACGAAGTTTGTGAGGTCGAATGATCAACTTGCAGATATTTTCACCAAGTCCTTCACCGATCCTTGCATTGGTTATATATGTAACAAGCTCGGTACATATGATTTGTATGCTCCGGCTTGAGGGGGAGTGTTAGATATAGATATGTAGTTGCCCCACATTGGAATAGGTGTAGTATGCCCTTTGTATAGAGTAGCTATAAATAAGCTCATCTTGTATTGCATTAGACACACAATATCAATATATCATATTTTCTCCCGTGCCTTCTCACATCAAGTATCAAGAGAACAACATTCTACTTACGCTAATCTTATTACAGTCATAAAGACAGAGACAAAGGTAACACAGACGTGAAATAAAATACTTCCAGATAACCTTTCATTATGGACAATAAAAGAAACATGTTCAGTCACGAAACTTTTTCCAAACTTAATTAAGGTAAGAGACGAAAGAAAACATCAATCAGTAAAGAAAAAGAAGGGGAAATTAAAAGGTGAAAAAGCAAAATGATATTAGATATTTGCAAGATTTTTAATATCACATTAGTCCCCACTCATCAAATGATGGTAGAGGGCATTTCATAAGGGCCAATCAACAGCCACAACAATTCGATAAGTGCTCCAGGAAACAACTAACCTTTGATAAATCAATAGGAACATCAAGATAATGATCTAAGAAATTTGCATTCTGTTCTGGATCAAGAAGCTCCAACATAGCACTTGCTGGATCACCAGCATGTCCTCTTCCCAACTGCCATAGAAGAAAAAATACAAAGCTATTATTAAATTTGATCGATAATTATACACATTCTTTAAACATTTAGATTGCAGAAGAAATCCAAGCAAAAACATAATATGTTAATGCCCAAGATTCTTCCGGTGGTCATACACAGAAAAGCAAGATTTAAATAGTGGAAAAGAAGTTACACTTCACTCATAGTCAAAGCATATAATTGAAGTATGAACTCACAAACCATAAAATACCTTGTCAATTTCATCGATCAAAACAAGAGGATTAGCGGTTCCCACACTTTTTAAACATTGCACCATCTTCCCCGGCATGGCACCAATATAAGTTCGTCGATGTCCCTTGAAGTGTATCCCAAAGTTAGTCATGAAAGTTCAAGATTATGTGCACAACAAAACAACTCTGTTTTAGCTGGTTAGATATTTCTGTTACCTTTATTTCAGCAACATCAGACAGCCCTCCAACAGAAAATCGGTAAAATTTGCGGTTCAATGCACGTGCAATTGAACGACCTATACTGGTTTTGCCCACCCCAGGAGGGCCAGAGAGGCATATGATTTTCCCTGTTCAAACAAAAATAAGCAATGAGTTTGTTTAGTTCCTGACTCAGATTCACAAATTAAACTGAGATAAATCAGATAAAGCCGCACATATGAAAGACAGTGCCAGTGTATAATTGCGATTAATATCATCACTTGAATCTAACATTGTCCTAGCTATGGTGCAATTCACATTTTTCTAGAGTTGCCCTTTGTCTGTTTTTTCCTCAACACTTCATAGTTTTCAAAGATTTTCCTCAACAAACAACTACATCTCAAGTTCAAATGTCCCAAAGATATACAAGTACAACTGGCAGTGAAAGTAAGTTATCAATAAAATACCATAGAGCGAGTGCCCAATCATGGACAACACAATATCAGAATTTAAGAAGAACACCAACAAAGTATGGCCCAAGGTGCAGCTATTTAACTCACATTAATTGCTTTTCAAGGGAAGAAATTAAGCTCAAGATATATTTCTTTACTCCACAGATAATCACAAGAAAACATTGGAGGAACTCTCGACTGTCACAAAAAGTATTACCTAAGTTTGACCAATAAATATTTCTCAAATGCCCTAAAATGTTCACTTCTTCCAATTTACTCGTTCCAATTTGATGTCAATATAGTATGTGGGAACCATAAATCCCATGTTCTATGGATATATTTTCCATTGGTATATTATAATGTTATGAAAATGGACGGTAAGGAGATGGCTGGATCATTTTCCGTTCTTTTAATATGATTATAGCTGAACCTTATAAAACTGAGGATTTTATTAAAAATGAGGGTCATTATTTTTAAAAAATAAGGCATTTACCAACCTTGTGAGGTTCCTCTGAGTTTTCCCACAGCTATAAATTCCAAGATCCTTTCCTTAACATCGGTTAACCCATAGTGGTCTTCATCAAGAATTTGTTCTGCCCGTAGTACATCAAAGTTTTCATCACTGCATTTTGGATCATAAAAGAAGAAAAGTTCAGGAACCAATGCAATACACCCAGGCAGAGATCAAAACTATAAAGCCAACACCACCACCATCAGCAGCAAACAACACGAACTATGACAAATCAAGCCCCCATCTTAACCTACATAAGGAGCTCATCATAAGTACAATCCTAGTTTTGCTGCTCTCATGAGATCCTGGTCTAAACTGATACATTGGGATGTCAAGGAAGCATCCTCAAAGCCAGAAGCATGACATGTAAGAACCCAAAATAAGGAACAGGAGGAGAAAGAAAAGTTAAAGACAAATGGATATTACTCGATAGAAATTTAGGCAAGTCACATCACTAGGCAGTAAGGTTAACTTGGAAGCCTAGCAAATTTCAGACAATAACAATTTTATTTCTCATAACTGTTTAGCCAGCTTCTAACAAACAGATTCCTCTAGTACAGCTAACCAAGGATAAGGTTAACATCAGTTGGATTGAACCCAAGACAATGTAAGGTAGAATAGATAGACAACAAACCTGTAACTACCCCATGGCAAGGCAGTCAACCAATCAAGATAATTACGTGTCACGTTAAATTCACTGGAACTAGCTTCCAACAGTTGCAGTTTTGTCAGTTCTTCTTCAATAACTTGCATAACATGTACTGGTATTTTTTCTTTATTAGGCTCCAATCTTTCCCTGAACTTTGCTGCAGTAATAAACAAATATAGTGTCACACCATTAGATGTAATTAAAAAGGGAACAAAAAACTGCAGGTTTCCCTTAGTATTGACCAGAAAAATCCAAGCCCTTAAAGCGAACCTAATCATTAGGCAGTTTCTTCTCTCAATTTCAACACCAAAACCAGGTCTACCAAAACAATGCTACAATGCGGAGCTCAACTTTATCTTTGTGAAAATTATAGAGAAAAAATAATGAGAACCGCATTTTAGGGTTTGCAGTAACGTCTATTATCACCAAAACCTGGAAGAAGCACCTTCTTAAGCATCACACTCCTCAACAGGAGCAAAGGAGAGAAGAAAGAGCACATAACAGAAAAAGCAAGTAGGCTTCTAATTAGGAAGTGCACTGAAGGGAAACAGGACAGAATTCATGAGAAAAAGAAGACATTGGAGGAGGAAGATAATCTGAATCATGAGCAGTCCATAACATGAATAAACAACAATGCAGTGCTAAAACGGAAAATGTGGCCTTCCAGTAAATCCATGTTGAATTTGTGCCACATTACCAAATATGGTAGACAGTAACCTTTCGGAATTGAAGTTGAAGGATTCCAGTATTGTCTTATTAAGTGCTTTCGTGAAAACACAAAACAGCCCCCTATAGGTACAATGTATTCTAATCTGTCAAAAGTTTGGAATGTCTCAAATAGTTTTAGAAAGCATGTCAATAAAGTTGGTTGGACTGTGTACAAAGAAAATTCAACCTTCAATTTCCTATATGTAAAGCCAATTATCGCTTGAATGCTATTCATGCTTCTAGTGAAGATTTTCAATGGTAAACAGGAGATCAATCAGCCACCATAAAACTTTTACAACAGAAAGGGCAAAACATATTGCAAATGGCCTTTTCTATGCTACTGCTAAATGGAACTTTACCAGGGACAACATAAGATGTTTCACTGGCCAGATGGGAAAACACGTCTGGCCATAATAAATCCAAAAAGAACATTATCCATTTCAATAATAATTAAGAGACAACTAACCAGAAAGAGCTGTCTTGTCATCAGTCTCCAAACCTAGTTCCTGCATTAAGCCAATCAAAACCACGTCATAAGTACTTTTTTTTCAAATCAACTCTTTGTTTAAACTGTGAATCAAACATAAGAACCAGAAGCTTAACATATATATTATCTAGGAATAAACATATGTAAGTACTAAGTTTAAGGATGATAACACAAAAAAAGCTGCACATAATCCACATGCCTTCTTTATGGCCTTTAATTGTTCATTCAACAAATAACGGCGTTGCTCTCCGCTTATTTTTTCTTCAATTGCTCTTGCTATTGATTCCTGCAGAAAACAAAGACGACGTAAAATGCTAAATGCATGCATAACAACTATTCAAAGTTCTGGTATGCGTCTTGACGTGCATTTACCTGAATCTTACTAATCTCCAT
SEQ 81到160为与SEQ 1-80相关的推定蛋白质SEQ
SEQ 81
MALRFSLIFLFSLFLTTSLLLSVNGNINGGEDDDILIRQVVGDDDDHLLNADHHFTIFKRRFGKTYASDEEHHYRFSVFKANLRRAMRHQKLDPSAVHGVTQFSDLTPAEFRRNFLGVNRRLRLPSDANKAPILPTEDLPSGFDWRDHGAVTSVKNQGSCGSCWSFSTTGALEGATYLSTGKLVSLSEQQLVDCDHECDPEEKDSCDAGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGTCKFDNTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICSKKLDHGVLLVGYGTGFSPIRMKEKPYWIIKNSWGEKWGENGYYKICRGRNVCGVDSMVSTVSAVSTSSH
SEQ 82
MGAKVFLVALFLSALLFPLASSSNDGLMRIGLKKMKFDQNNRLAARIESKEGDVLRASIRKYNFRGKLGDSEDTDIVALKNYMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFSVPCFFHSKFKSSESSTYKKNGKSAAIQYGSGAISGFFSQDNVKVGDLVVTDQEFIEATREPSVTFLVAKFDGILGLGFQEISVGNAVPVWYNMVQQGLIKDPVFSFWLNRNTEEEQGGEIVFGGVDPNHYKGEITYVPVTHKGYWQFDMGDVLIEGKATGYCESGCSAIADSGTSLLAGPTTIITMINQAIGASGVASQQCKSVVEQYGQTIMDLLLAEAHPKKICSQVGVCTFDGNRGVSMGIESVVDEKAGRSTGLQDGMCSACEMAVIWMENQLRQNQTQDRILNYVNELCERLPSPLGESAVDCGKLSSMPTVSFTIGGKVFDLVPKEYILKVGEGAKAQCISGFTGLDIPPPRGPLWILGDVFMGRYHTVFDYGKLRVGFAEAA
SEQ 83
MGSFLCFSVIVVLLVLQPCLAKKVYIVHMKNHQIPSSFATHHDWYNAQLQSLSSSSTSDESSLLYSYDTAYSGFAASLDPHEAELLRQSDDVVGVYEDTVYTLHTTRTPEFLGLNNELGLWAGHSPQELNNAAQDVVIGVLDTGVWPESKSYNDFGMPDVPSRWKGECESGSDFDPKVHCNKKLIGARFFSKGYQMSASGSFTNQPRQPESPRDQDGHGTHTSSTAAGAPVANASLLGYASGVARGMAPRARVATYKVCWPTGCFGSDILAGMERAILDGVDVLSLSLGGGSGPYYRDTIAIGAFSAMEKGIVVSCSAGNSGPAKGSLANTAPWIMTVGAGTIDRDFPAFATLGNGKKITGVSLYSGKGMGKKVVPLVYSTDSSASLCLPGSLDPKMVRGKIVLCDRGTNARVEKGLVVKEAGGVGMILANTAESGEELVADSHLLPAVAVGRKLGDFIRQYVKSEKNPAAVLSFGGTVVNVKPSPVVAAFSSRGPNTVTPQILKPDVIGPGVNILAAWSEAIGPTGLEKDTRRTKFNIMSGTSMSCPHISGLAALLKAAHPEWSPSAIKSALMTTAYVRDTTNSPLRDAEGGQLSTPWAHGSGHVDPHKALSPGLIYDITPEDYIKFLCSLDYELNHIQAIVKRPNVTCTKKFADPGQINYPSFSVLFGKSRVVRYTRAVINVGAAGSVYEVTVDAPPSVTVTVKPSKLVFKRVGERLRYTVTFVSKKGVNMMRKSAFGSISWNNAQNQVRSPVSYSWSQLLD
SEQ 84
MGTKFILFILLFIFLFSSGFVACGGFYSFRNLNSSVSGIEFPNHPSFNAVSSSADSDCNYGVSQKSKTHSIAQEVDGVDVKNGENEEVSIFGNQKKEAVKFQLRHRSAGKKIEAKDSVFESRARDLSRIQTLHTRIVEKKNQNYNSRLAKSNEKHVDKHKPVIAPAAVSLESYELSGKLMATLESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPHYNPQDSTSFRNISCHDPRCKFVTSPDPPQLCKSENQTCPYYYWYGDSSNTTGDFALETFTVNLTTTSGSEFRKVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQSLYGHSFSYCLVDRNSNSSVSSKLIFGEDKELLKHPQLNFTSLVGGKEVETFYYVQIKSVIVGGEVLNIPEETWNLSLEGLGGAIIDSGTTLSYFADPAYEIIKEAFVNKVKGYPIVQDFPILNPCYNVSGVKNLEFPSFGIVFGDGAVWNFPVENYFIKLEPEDIVCLAVLGTPRSALSIIGNYQQQNFHILYDTKRSRLGYAPTRCADA
SEQ 85
MALTLKSLATPLLFGALFILILQVVAEQPISEAKVESAILQESIIKEVNENAKAGWKAAFNPRFSNFTVSQFKRLLGVKPAREGDLEGIPILTHPKLLELPKEFDARKAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHHNLNISLSVNDLLACCGFLCGSGCDGGYPITAWRYFIRRGVVTEECDPYFDNEGCSHPGCEPGYPTPKCQRKCVKEILLWGKSKHYGVNAYRIHHDPNSIMTEIYKNGPVEVSFTVYEDFAHYKSGVYKHVTGQSMGGHAVKLIGWGTSEQGEDYWLIANSWNRGWGDDGYFKIRRGTNECGIEHNVVAGLPSAKNLNVELDDVSNAFLDASM
SEQ 86
TLVLHTSFYLLLSVASPGDCLLLSIFPFSFSSPRYFPYKQNTVKIISSNFLFSPFFQMGSFLCFSVIVLFLVFQPCFSKKVYIVHMKNHQIPSSFATHHDWYNAQLQSLSSSSTSDESSLLYSYDTAYSGFAASLDPHEAELLRQSDDVVGVYEDTVYTLHTTRTPEFLGLNNELGLWAGHSPQELNNAAQDVVIGVLDTGVWPESKSFNDFGMPNVPSRWKGECESGPDFDPKVHCNKKLIGARFFSKGYQMSASGSFTNQPRQPESPRDQDGHGTHTSSTAAGAPVANASLLGYASGVARGMAPRARVATYKVCWPTGCFGSDILAGMERAILDGVDVLSLSLGGGSGPYYHDTIAIGAFSAMEKGIVVSCSAGNSGPAKASLANTAPWIMTVGAGTIDRDFPAFATLGNGKKITGVSLYSGKGMGKKVVPLVYSTDSSASLCLPGSLDPKIVRGKIVLCDRGTNARVEKGLVVKEAGGVGMILANTAESGEELVADSHLLPAVAVGRKLGDFIRQYVKSEKNPAAVLSFGGTVVNVKPSPVVAAFSSRGPNTVTPQILKPDVIGPGVNILAAWSEAIGPTGLEKDTRRTKFNIMSGTSMSCPHISGLAALLKAAHPEWSPSAIKSALMTTAYVHDTTNSPLRDAEGGQLSTPFAHGSGHVDPHKALSPGLIYDITPEDYIKFLCSLDYELNHIQAIVKRPNVTCAKKFADPGQINYPSFSVLFGKSRVVRYTRAVTNVAAAGSVYEVVVDAPPSVLVTVKPSKLVFKRVGERLRYTVTFVSNKGVNMMRKSAFGSISWNNAQNQVRSPVSYSWSQLLD
SEQ 87
MASSCLHAILLCFLLFITSTTAQNQTSFRPKGLILPITKDASTLQYLTQIHQRTPLVPVSLTLDLGGQFLWLDCDQGYVSSSYKPARCRSAQCSLAGAGSGCGQCFSPPKPGCNNNTCSLLPDNTITRTATSGELASDTVQVQSSNGKNPGRNVTDKDFLFVCGATFLLEGLASGVKGMAGLGRTIISLPSQFSAEFSFPRKFAVCLSSSTNSKGVVLFGDGPYSFLPNREFSNNDFSYTPLFINPVSTASAFSSGEPSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTILETSIYNAVTNFFVKELVNITRVASVAPFGACFDSRNIVSTRVGPAVPSIDLVLQNENVFWRIFGANSMVQVSENVLCLGFVDGGVNPRTSIVIGGYTIENNLLQFDLAGSRLGFTSSILSRLTTCANFNFTSIT
SEQ 88
MNPEKFTHKTNEALAGAHELALSAGHAQFTPLHMAVALISDHNGIFRQAIVNAGGNEEVANSVERVLNQAMKKLPSQTPAPDEIPPSTSLIKVLRRAQSSQKSCGDSHLAVDQLILGLLEDSQIGDLLKEAGVSASRVKSEVEKLRGKEGRKVESASGDTTFQALKTYGRDLVEQAGKLDPVIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSNLADVRLIALDMGALVAGAKYRGEFEERLKAVLKEVEEAEGKVILFIDEIHLVLGAGRTEGSMDAANLFKPMLARGQLRCIGATTLEEYRKYVEKDAAFERRFQQVYVAEPSVTDTISILRGLKERYEGHHGVKIQDRALVVAAQLSSRYITGRHLPDKAIDLVDEACANVRVQLDSQPEEIDNLERKRIQLEVELHALEKEKDKASKARLVEVRKELDDLRDKLQPLMMRYKKEKERIDELRRLKQKRDELIYALQEAERRYDLARAADLRYGAIQEVETAIANLESTSAESTMLTETVGPDQIAEVVSRWTGIPVSRLGQNEKEKLIGLGDRLHQRVVGQDHAVRAVAEAVLRSRAGLGRPQQPTGSFLFLGPTGVGKTELAKALAEQLFDDDKLMIRIDMSEYMEQHSVARLIGAPPGYVGHDEGGQLTEAVRRRPYSVVLFDEVEKAHPTVFNTLLQVLDDGRLTDGQGRTVDFTNTVIIMTSNLGAEYLLSGLMGKCTMETAREMVMQEVRKQFKPELLNRLDEIVVFDPLSHEQLRQVCRYQMKDVALRLAERGIALGVTEAALDVILSESYDPVYGARPIRRWLERKVVTELSKMLVKEEIDENSTVYIDAGVGRKDLTYRVEKNGGLVNAATGQKSDILIQLPNGPRSDAVQAVKKMRIEEIEEDEMED
SEQ 89
MQSFKSASILRRLLQNSRLVSHSRSFCSVSTNALVDESQSTVLVEGKASSRTAILNRPHALNALNFSVVDRLLKLYKNWEDDPDIGFVVLKGSGKAFSAGGDIVTIYNLLKQDAGNLQDCKDFCWTINNLVYVVGTLLKPHVALLNGITMGGGAGISIPGTFRVATEKTVFATPETLIGYHPDAGASFYLSHLPGYLGEYLALTGDKINGAEMISCGLATHYLHSAKLPLIEEQLGKLMTDDPSVIERSLENCGEIVHPDPTSVLHRIETLNKCFSHDTVEEIIDALESEAAKKQDAWCVSTLRKLQETAPLSLKVSLRSIREGRHQTLDQCLIREYRMSVQAFSGQITNDFCEGVRARLVDRDFAPKWDPPSLDKVTDDMVDQYFSRLTAFEPELELPTQQREAFT
SEQ 90
MALTLKSLATPLLLGAFFILVLQVVAEKPISEAKVESAILKESIIKEVNENAKAGWKAAFNPQFSNFTVSQFKRLLGVKPAREGDLEGIPLLTHPKLSELPKEFDARKAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHHNLNISLSVNDLLACCGFLCGSGCDGGYPISAWRYFIRRGVVTEECDPYFDNEGCSHPGCEPGYPTPKCQRKCVKENLLWGKSKHYGVNAYRIHRDPYSIMTEIYKNGPVEVSFTVYEDFAHYKSGVYKHVTGQSMGGHAVKLIGWGTSEQGEDYWLIANSWNRGWGDDGYFKIRRGTNECGIEHNVVAGLPSAKNLNVELDDVSDAFLDASM
SEQ 91
MGVLKKTLLLLFLCVFLGDISLCFSSKLYVVYMGSKDSDEHPDEILRQNHQMLTAIHKGSIEQAKTSHVYSYRHGFKGFAAKLTEAQASEISKMPGVVSVFPNTKRSLHTTHSWDFMGLSDDETMEIPGFSTKNQINVIIGFIDTGIWPESPSFSDTNMPPVPAGWKGQCQSGEAFNASICNRKIIGARYYMSGYEAEEENGKTMFYKSARDSSGHGSHTASTAAGRYVANMNYKGLANGGARGGAPMARIAVYKTCWSSGCYDVDLLAAFDDAIRDGVHVISLSLGPDAPQGDYFNDAISVGSYHAVSRGILVVASVGNEGSTGSATNLAPWMITVAASSTDRDFTSDILLGNGVRLKGESLSLSQMNTSTRIIPASEAYAGYFTPYQSSYCLDSSLNRTKAKGKVLVCLHAGSSSESKMEKSIIVKEAGGVGMILIDDADKGVAIPFVIPAATVGKKIGNKILAYINNTRLPMARILSARTVLGAQPAPRVAAFSSRGPNSVTPEILKPDIAAPGLNILAAWSPAASTKLNFNVLSGTSMACPHITGVVALLKAVHPSWSPSAIKSAIMTTAKLSDKHHKPIIVDPEGKRATPFDFGSGFVNPTNVLDPGLIYDAQPADYRAFLCSIGYDEKSLHLITRDNSTCDQTFASPNGLNYPSITIPNLRSTYSVTRTVTNVGKARSIYKAVVYAPTGVNVTVVPRRLAFTRYYQKMNFTVNFKVAAPTQGYVFGSLTWRNKRTSVTSPLVVRVAHSNMGMMV
SEQ 92
MGAKAFLVAMFLSALLFPFASSSNDGLMRIGLKKMKFDQNNRLAARIESKEGDVLRGSIRKYNFRGKLGDFEDTDIVALKNYMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFSVPCFFHSKYKSSESSTYKKNGKSAAIQYGSGAISGFFSQDNVKVGDLVVTDQEFIEATREPSVTFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLIKDPVFSFWLNRNTEEEQGGEIVFGGVDPNHYKGEITYVPVTQKGYWQFDMGDVLIDGKATGYCESGCSAIADSGTSLLAGPTAIITMINQAIGASGVASQQCKSVVEQYGQTIMDLLLAEAHPKKICSQVGVCTFDGNRGVSMGIDSVVDEKAGRSTGLQDGMCSACEMAVIWMANQLRQNQTQDRILNYVNELCERLPSPLGESAVDCGKLSSMPKVSFTIGGKVFDLSPNEYILKVGEGAKAQCISGFTGLDIPPPRGPLWILGDIFMGRYHTVFDYGKLRVGFAEAA
SEQ 93
MTFFRSFLFFLLTLFVISSALDMSIISYDEQHGQMGTTHHRTDDEVRELYESWLVKHGKNYNAIGEKERRFEIFNDNLRFIDEHNAENRSYKLGLNRFSDLTNEEYRAMFVGGRLDRKTRLMKSPKSNRYAFQAGEKLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINKIVTGELISLSEQELVDCDRSYNQGCNGGLMDYAFDFIKNNGGIDTEDDYPYHAQDGTCDPYRKNARVVSIEGYEDVPENDEKSLMKAVANQPVSVAIEGGGRAFQHYSSGVFTGYCGTQLDHGVVVVGYGTENGEDYWIVRNSWGANWGESGYIKLQRNFANSTTGKCGIAMQASYPLKSGANPPNPGPSPPTPVTPSTVCDEYYSCPQGTTCCCIYQYGEYCFGWGCCPYESATCCDDNYSCCPHDYPVCDVDAGTCLMSKDNPLKVKALKRGPARVNWSGMKSNRKVSYV
SEQ 94
MANSYTSFNFFLAPIIFLAILGLQLQSSDGFGTFGFDIHHRYSDPVKGILDLHGLPEKGSVEYYSAWTQRDRFIKGRRLADTTNPTPLSFSGGNETFRLSSLGFLHYANVTVGTPGLSFLVALDTGSDLFWLPCDCSNCVRALETRSGRRINLNIYSPNTSSTGQIVPCNGTLCGQRRRCLSSQNACAYGVAYLSNNTSSSGVLVEDILHLETDNAQQKSVEAPIALGCGIRQTGAFLSGAAPNGLFGLGLESISVPSMLASKGLAANSFSMCFGPDGIGRIVFGDKGSPDQGETPLNLDQLHPTYNISLTGITVGNKITDVDFTAIFDSGTSFTYLNDPAYKVITENFDSQAKQLRIQPDGEIPFEYCYGLSANQTTFEVPDLNLTMKGGNQFFLFDPIIMLSLQDGSRAFCLAVVKSGDVNIIGQNFMTGYRVVFDREKMVLGWKPSDCYDSRESNDKSTTLPVNKRNSTEAPSPSSVVPEATKGNGSGNEPATSFPSVPSSRPAINHAPAHFNSYICQLMMALFSLFSYYLIIVSS
SEQ 95
MVTKFSIFILVVLLRLFSFGSVASREIHNSGLNLNSSASGIEFPQHPSFNSVTASGNSDCSYGTSKKSTTTHVITQEENRSDEKEDEDLMVSKNQPREAVKFHLRHRSAGQNIEAKDSIFESTTRDLGRIQTLHTRIVEKKNQNSISRQTKNSEKPTQSSSFEFSGKLMATLESGVSHGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQSVPCYDCFEQNGPHYDPKDSISFKNISCHDPRCHLVSSPDPPQPCKSENQTCPYYYWYGDSSNTTGDFALETFTVNLTTPSGDSEIKKVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVNRNSNSSVSSKLIFGEDKELLKHANLNFTSLVGGKENHLETFYYVQIKSVIAGGEVLNIPEETWNLSTEGVGGTIIDSGTTLSYFAEPAYEIIKQAFVNKVKHYPVLEDFPILKPCYNVSGVEKLELPSFGIVFGDGAIWNFPVENYFIKLEPEDIVCLAMLGTPHSAMSIIGNYQQQNFHILYDTKRSRLGFAPTRCADA
SEQ 96
MPSSFSLLFLTLLLASISLSFSSTLNSNDDDFFLSSTPKFPLTMAEKLIRQLNLFPKHDINKAAATGDSAAVTEQRLFEKKLNLSYVGNSGSTVQDLGHHAGYYRLPHTKDARMFYFFFESRSRKNDPVVIWLTGGPGCSSELAVFYENGPFKIADNMSLVWNDFGWDKVSNLIYVDQPTGTGFSYSSNDDDIRHDERGVSNDLYDFLQAFFKAHPQYAKNDFYITGESYAGHYIPAFASRVHQGNKNKEGIYVNLKGFAIGNGLTDPEIQYKAYTDYALDMKLIKKSDYNAIEKSYPKCQLAIKLCGKDGGTACMAAYLVCTSIFNKIMDIAGDKNYYDVRKRCEGDLCYDFSKMETFLNDQQVKKALGVGDIEFVSCSSEVYQAMQLDWMRNLELGIPSLLEDGIKLLVYAGEYDLICNWLGNSRWVHAMKWTGQKAFGKATQVSFAVDGVEKGVQKNYGPLTFLKVHDAGHMVPMDQPKAAMEMLQRWMQDKLSKEGHLAPM
SEQ 97
MTLTLKSLAAPLFLGAFCILILQVVAEKPISEAKVESAILQESIIKEVNENAKAGWKAAFNPRFSNFTVSQFKRLLGVKPAREGDLEGIPILTHPKLLELPKEFDARKAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHHNLNISLSVNDLLACCGFLRGSGCDGGYPISAWRYFIRRGVVTEECDPYFDNEGFHTRVVNQDIPPQSVV
SEQ 98
MFRLVMVTKFSIFILVVLLRLFSFGFVASREIHNFGINLNFSASGIEFPQHPSFNSVTASGNSDCSYGTSKKSTTTHVITQEENNSDEKEDEDLMVSENQPREAVKFHLRHRSAGQNIEAKDSIFESTTRDLGRIQTLHTRIVEKKNQNFISRQTKNSEKTTQSSSFEFSGKLMATLESGVSHGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQSVPCYDCFEQNGPHYDPKDSISFKNISCDDPRCHLVSSPDPPQPCKSENQTCPYYYWYGDSSNTTGDFALETFTVNLTTPNGDSEIKKVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVNRNSNSSVSSKLIFGEDKELLKHLNLNFTSLVGGKENHLETFYYVQIKSVIVGGEVLNIPEETWNLSTEGVGGTIIDSGTTLSYFAEPAYEIIKQAFVNKVKRYPILDDFPILKPCYNVSGVEKLELPSFGIVFGDGAIWTFPVENYFIKLEPEDIVCLAILGTPHSAMSIIGNYQQQNFHILYDTKRSRLGFAPRRCADASEQ 99
MSGFRLPLLFHLLLPLTLFLQYVQSLPQNSSTVEFLPGFDGPLPFYLETGYIGVGKSEEVQLFYYFVKSESNPKKDPLLLWLTGGPGCSSFTGVAYEVGPLAFGQKAYNGSLPILVSTPYSWTKFASILFLEQPVNTGFSYATTSAASKCTDLQACDQVYEFLLKWFNNHPEFISNPFYVSGDSYSGITVPVIVQLISDGIEAGKKPLINLKGYSLGNPLTFPEESNYQIPFCHGMGLISNELYESLKETCKGDCRNIDPTNKLCLENFKMFKKLVSSINDQQILEPFCGTDSESPNPRQLSGERRSLEEDFIFLKHDDFICRESRVATRKLSNHWANDPSVQEALHVRKGTIRRAWARCRQSIMGTTYRVTFMNSIPYHVNLSSKGYRSLIYSGDHDMVVPFQSTQAWIKYLNYSIIDDWRPWTIDGQVAGYTRSFSNHMTYATVKGGGHTAPEYKREESFHMFKRWIAQQPL
SEQ 100
METNGLIKEILPRDAVNNMTRLILSNALYFKGEWNEKFDVSETKDHDFHLLNGGSIQAPFMTSKKKQYIAAFDCFKILRLPYKQGTDTRRFCMYFILPDAHDGLPALLEKISLEPGFLNNHVPYGKVRARKFLIPKFKITFGFEASNILKGLGLTLPFCGGSLTEMVDSPMPQNLSVSQVFHKSFIEVNEEGTEAAAVTATVIMTMSLIIEKEMDFVADHPFLFLIRDESTGAVLFIGSVMNPLAG
SEQ 101
MNESYGNSRASSSSTTSSLNSSSHGTEDDHTIARILAEEEENALKYGGNKLGRRLSHLDSIPHTPRVIGEIPDPNDATLDHGRLSSRLATYGLAEMQIEGDGNCQFRALSDQLYHNPEYHKHVRKEVVKQLKRFRKLYEGYVPMRYKSYLRKMKRLGEWGDHVTLQAAADRFGVKICLVTSFRDNGYIDILPKDIQPSRELWLSFWSEVHYNSLYEIGEVPARVRRKKHWLFF
SEQ 102
MSWLCPSLVLVLLIFQGPICTCSSISDLFESWCQQNGKTYSSEQERVYRLEVFEENYAYIIEHNSKGNSTYTLNLNAFSDLTHHEFKNSFLGLSSSANDFIRLKTGSSSAGVFNDVGVVDIPSSLDWREKGAVTKVKNQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAFEFVKKNGGIDTEEDYPFNEREGTCNKNKLQRRVVTIDGYTDVPQYDEDKLLKAVANQPVSVGICGSERAFQSYSKGIFTGPCSTVLDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYMHMQRNSGNQEGICGINKLASYPTKSSPNPPSPPSPGPSKCSMFTSCGQGETCCCGWRLLGVCVSWKCCGLDSAVCCKDGRHCCPHDYPICDTSRNLCLKRMSNATIVQQPQKEAFSGKFGGLIYPF
SEQ 103
MCEPESEAARGVLSFLDVDQLFSSNYYGDGRKHDVEICHEQYARENHYHTSYCNVDNDEAIAHVLQEDLSELSIAEDAESSHADEQYLQASTGVQHWHTPPREYYAGHDTSLEADDVGPSSSCSSPGDRSYDGEEYTYTLEIQDEFELDGEVGKRINQLSAVPHVPRINGDIPSVDEATSDHQRLLNRLQLFDLVEHKVQGDGNCQFRALSDQFYRTPEHHKFVRQQVVSQFQHHPEMYEGYVPMEYGEYLTRMSKSGEWGDHVTLQAAADSYGVKILVITSFKDTCYIEILPKNQKSNRVIYLSFWAEVHYNSIYPQGDFLPFDFKKKKKKWSFWNKH
SEQ 104
MPSLLQIFLPLFPFFFLVSFSVSHGPFLPKAIILPVNKDLSTFQYVTQVYMGAHLVPTNLVVDLGGSFLWTNCGLTSVSSSQKLVPCNSLKCSMAKPNGCTNKICGVQSENPFTKVAATGELAEDMFAVEFIDELKTGSIASIHEFLFSCASTTLLQGLARGAKGMLGLGNSRIALPSQLSDTFGFQRKFALCLSSSNGAIISGESPYLSLLGHDVSRSMLYTPLISSKDGVSEEYYINVKSIKINGKKLSLNTSLFAMDEGVGGTKISTIPPFTTMKSSIYKSFIEAYEKFAISMELNKVEAIAPFELCFSTKGIDVTKVGPNVPTTDLVLQSEMVKWRIYGRNSMVKVSDEVMCLGFLNGGVNQKASIVIGGYQLEDNLLEFNLGTSMLGFTSSLSMAETSCSDFMFHSVSKDSAFDS
SEQ 105
MGAKEVLILVLVCMFIVFPSCHGDDECLNPFLVDQNCYVKDYITKLANATETVKWMMKIRRQIHENPELAYEEFKTSGLIREELDRMGVKYRWPVAKTGVVATIGSGKPPFVALRADMDALPIQELAKWEHKSKVDGKMHACAHDAHTAMLLGAAKILQQLRHNLQGTVVLIFQPAEERGHGAKDMIEEGVLENVEAIFGMHLVHKYESGVVASRPGEFLAGCGSFKATIRGKGGHAAVPHDSVDPILAASTSVISLQSIVSRETDPLESQVVSVAMIEGGHAFNIIPELATISGTYRAFSKKSFYGLRKRIEEVIRAQAAVHRCTVEIDFDGRENPTLPPTINDERIYEHARKVSKMIVGEESFKIAPSFMGSEDFAVFLEKVPGSFFLLGTKNEKIGAIYPPHNPHFIIDEDVLPIGAAIHATFAYSYLLNSTNKFTSHSS
SEQ 106
MKLNPYSWTKVASIIFLDLPVGTGFSYARTPTALQSSDLQASDQAYEFLYKWFLDHPEFLKNPLYVGGDSYSGMVVPIITQIIATKNEMGIKPFVDLQGYLLGNPSTFKGEKNYEIPFAYGMGLISDELYESLTRNCKGEYQNTDPSNTQCLQDVHTFQELLKRINNPHILEPKCQFASPKPHLLFGQRRSLNVKFHQLNNPQQLPALKCRNDWYKLSSHWADDGQVREALHIRKGTIGKWVRCASLQYQKTIMSSIPYHANLSAKGYRSLIYSGDHDKVVTFLSTQAWIKSLNYSIVDDWRPWIVDNQVAGYTRSYSNRMTFATVKGAGHTAPEYKPRECLAMLKRLMSYKPL
SEQ 107
MCEPESEATRGVLSFLDVDQLFSSNYYGDGRKHDVEICHEQYARENQYHTSYCNVDSDEAIAHLLQEELSELSIAEDAESSHADEQYFQASTGVQHWHTPPREYYAGHDTGLEADDVGPSSSCSSPGDRSYDGEEYTYTLEIQDEFELDGEVGKRINQLSAVPHVPRINGDIPSVDEATSDHQRLLDRLQLFDLVEHKVQGDGNCQFRALSDQFYRTPEHHKFVRQQVVSQLKHHPEMYEGYVPMEYGEYLKRMSKSGEWGDHVTLQAAADSYGVKILVITSFKDTCYIEILPKNQKSNRVIYLSFWAEVHYNSIYPQGDFLPFDLKKKKKKWSFWNKH
SEQ 108
MPSLLQIFLPLFPFFFFVSFSVSHGPFLPKAIILPVNKDLSTFQYVTQVYMGAHLVPTNLVVDLGGSFLWTNCGLTSVSSSQKLVPCNSLKCSMAKPNGCTNKICGVQSENPFTKVAATGELAEDMFAVEFIDELKTGSIASIHEFLFSCASTTLLQGLARGAKGMLGLGNSRIALPSQLSDTFGFQRKFALCLSSSNGAIISGESPYLSLLGHDVSRSMLYTPLISSKNGVSEEYYINVKSIKINGNKLSLNISLFTMDEEGVGGTKISTISPFTSMKSSIYRTFMEAYEKIAISVNLTKVESIAPFELCFSTEGIDVTKVGPNVPTMDLVLQSEMVKWRIYGRNSMVKVSDEVMCWGFLDGGVNQKASIVIGGYQLENNLLEFNLGTSMLGFTSSLSTAETSCSDFMIHSVSKDSAFDS
SEQ 109
MKMSPALSLSVIQFPLCKSQDLSKDTNNPKIFSKETPCQKSYSDTRINRRKLLSGSGLSLVAGTLAKPARAETEAPIEATSSRMSYSRFLEYLNEGAVKKVDFFESSAVAEIFNPALNKVQRVKVQLPGLPPELVRKLREKDVDFAAHLPEMNVIGPLLDLLGNLAFPLILLGSLLLRTSSSNTPGGPNLPFGLGRSKAKFQMEPNTGVTFDDVAGVDDAKQDFQEIVEFLKTPEKFAAVGAKIPKGVLLVGPPGTGKTLLAKAIAGEAEVPFLSLSGSEFVEMFVGVGASRVRDLFNKAKENSPCLVFIDEIDAVGRQRGTGIGGGNDEREQTLNQLLTEMDGFTGNTGVIVIAATNRPEILDQALLRPGRFDRQVSVGLPDIRGREEILKVHSNNKKLDKDVSLSVIAMRTPGFSGADLANLMNEAAILAGRRGKDKITSKEIDDSIDRIVAGMEGTKMTDGKNKILVAYHEVGHGVCATLTPGHDAVQKVTLIPRGQARGLTWFIPGEDPTLISKQQLFARIVGSLGGRAAEEIIFGEAEITTGAAGDLQQITQIARQMVTMFGMSEIGPWALTDPATQSGDVVLRMLARNQMSEKLAEDIDASVRHIIERAYEIAKNHIRNNREAIDKLVDVLLEKETLTGDEFRAILSEFTNIPSANINSKPIRELIEA
SEQ 110
MEGLHQLKTGELIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKGEDGVCNKKKSALSAAKITGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA
SEQ 111
MGCRMKFLNVVLVVAAVMAAAAAVAFGAEKLPAGVLSLERIFPLNGKMELEEVRARDRARHARMLQSFAGGIVNFPVVGSSDPYLVGLYFTKVRLGTPPREYNVQIDTGSDILWVTCSSCDDCPRTSGLGVELNFYDATISSTASPISCADQVCASIVQTASAECSTETNQCGYSFQYGDGSGTTGHYVADLLYFDTVLGTSLIANSSAPIIFGCSTSQSGDLTKTDRAIDGIFGFGQQGLSVISQLSSHRITPKVFSHCLKGEGNGGGILVLGEILDPRIVYSPLVPSQAHYNVYLQSIAVNGQLVPVDPSVFATSGNRGTIVDSGTTLAYIATEAYDPFVNAITAAVSPSVRPIISRGKPCFLVSSSIAEIFPPVSLNFDGGASMALRPSDYLVHMGFVEGAAMWCIGFEKQDQGVTILGDLVLKDKIFVYDLARQRIGWADYDCSSSVNVSITSGKDEFINAGQLSVNRASGSLLFNPRHTRTIFHLLSLVLMIGSPFLT
SEQ 112
MTRASIILLLLLIATSIAAAQGGALTFDDDNPIRQVVVSDGLQELENGILQLIGQTRRALSFVRFVRRYGKRYDSVEEIKQRFEIYLDNLKMIRSHNKQRLSYKLGVNEFTDLTWDEFRRERLGAPQNCSATTKSDLQLTNVNLPETKDWREAGIVSPVKKQGKCGSCWTFSTTGALEAAYAQAFGKNISLSEQQLLDCAGAFNNFGCHGGLPSQAFEYIKYSGGLDTEEEYPYAGKAGVCKFSSENVAVKVVDSVNITKGAEDELKYAIAFIRPVSVAYQVVKGFKQYKGGIYSSTVCGNTPQDVNHAVLAVGYGVDNGTPYWLIKNSWGAEWGDNGYFKMEMGKNMCGIATCASYPIVA
SEQ 113
MNPEKFTHKTNEALAEAHELAISAGHAQFTPLHMALALISDHNGIFRQAIVNAAGSEETANSVERVFKQAMKKIPSQTPAPDQIPPSTSLIKVLRRAQSLQKSRRDTHLAVDQLILGLLEDSQIGDLLKEAGIGAARVKSEVEKLRGKDGKKVESASGDTNFQALKTYGRDLVEQAGKLDPVIGRDEEIRRVIRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSNLSDVRLIALDMGALIAGAKYRGEFEERLKAVLKEVEEAEGKVILFIDEIHLVLGAGRTEGSMDAANLFKPMLARGQLRCIGATTLEEYRKYVEKDAAFERRFQQVYVAEPSVPDTISILRGLKEKYEGHHGVKIQDRALVVAAQLSARYITGRHLPDKAIDLVDEACANVRVQLDSQPEEIDNLERKRIQLEVELHALEKEKDKASKARLVEVRKELDDLRDKLQPLTMRYKKEKERIDELRRLKQKRDELTYALQEAERRYDLARAADLRYGAIQEVEAAIANLESSTDESTMLTETVGPDQIAEVVSRWTGIPVSRLGQNEKDKLIGLANRLHQRVVGQDDAVRAVAEAVLRSRAGLGRPQQPTGSFLFLGPTGVGKTELAKALAEQLFDDDKLMVRIDMSEYMEQHSVARLIGAPPGYVGHEEGGQLTEAVRRRPYSVVLFDEVEKAHPTVFNTLLQVLDDGRLTDGQGRTVDFTNTVIIMTSNLGAEYLLSGLMGKCTMEKARDMVMQEVRKQFKPELLNRLDEIVVFDPLSHEQLRQVCRHQLKDVASRLAERGIALGVTEAALDVILAQSYDPVYGARPIRRWLEKKVVTELSKMLVKEEIDENSTVYVDAASSGKDLSYRVEKNGGLVNAATGKKSDILIQLPNGVRSDAAQAVKKMKIEEIVDE
SEQ 114
MPEAPKKSFFTLSLVPFLPVYTLIRFNPPIESEPLISSSSDECQHDQKQQSDSRNYIVRFYHYKEPEDHWNYLQNNLKFKGWQWIERKNPAARFPTDFGLVEIDESMKELLLEKFRKMNLVKDVSLDLSYQRIVLEEKSEKNGAFANGKKRPGKIFTAMSFSEGQNYAVANTSIMRISWSRHLLMQKSRVTSLFGAHELWSKGHTGAKVKMAIFDTGIRADHPHFRNIKERTNWTNEDTLNDNVGHGTFVAGVIAGQDEECLGFAPDAEIYAFHVFTDAQVSYTSWFLDAFNYAIATNMDVLNLSIGGPDYLDLPFVEKVWELTANNIIMVSAIGNDGPLYGTLNNPADQSDVIGVGAIDQSNHLASFSSRGMSTWEIPHGYGRVKPDIVAYGREIMGSKISTRCKRLSGTSVASPVVTGIVCLLVSIIPESK
SEQ 115
MAQMKLSLSLFLSLVLLLAFSPSSFAKVSISSKLASKQAEKLIHELNLFPKESDNIVDRDPFPTAASRIVEKRFNFANLTNSSVISFEDLGHHAGYYKIKHSHAARLFYFFFESRGSKDDPVVIWLSGGPGCSSELALFYENGPFSISNNLSLVRNEYGWDKVSNLIYVDQPTGTGFSYSSDRHDIRHSEAGVSDDLYDFLQAFFEEHPELVKNDFYITGESYAGHYIPAFAARVHKGNKAKEGIHINLKGFAIGNGLTDPKIQYAAYTDYALDMGLISKSDHDRINKILPVCEVAINLCGTDGKISCLAAYFVCNSIFSAVRARAGADINHYDIRKKCVGALCYDFSNMEKLLNMHSVKQALGVEDIEFVSCSTTVYQAMLVDWMRNLEAGIPTLLEDGIKLLVYAGEYDLICNWLGNSRWVQAMEWSGQKEFVASPDVPFEVDSSEAGLLKSHGPLSFLKVHDAGHMVPMDQPKVALEMLKRWIGGTLSQQTTETEDLVASI
SEQ 116
MAIHTSTLSISILVMLMFSAVTSSAEDMSIISYNEKHHTNGESTVWRTDDEIVSLYESWLVEHKKVYNALGEKDKRFQIFKDNLRYIDEQNSAPEKSYKLGLTQFADLTNEEYKSIYLGTKPDGRSRLSYTQSDRYAPKVGDSLPDSVDWRKKGVLVDVKNQGQCGSCWAFSAVASIEAVNKIMTGNLISLSEQELVDCDTADNQGCQGGLMDDAFKFVIQNGGIDTEEDYPYKAKDGKCDQARKNAKVVTIDGYEDVPANDEKALKKAVAGQPVSVAIEAGGKDFQHYKSGIFTGKCGAAVDHGVVAVGYGSENGMDYWIVRNSWGASWGENGYLRMQRNIGNPKGLCGIATIASYPVKTGQNPPKPAPSPPSPVKPPTQCDDYNECPAGTTCCCVYKYYNYCFAWGCCPMEGATCCKDHNSCCPHDYPVCNVKAGTCSISKNNPLGVKAMQHILAKPIGTFGNEGKKTPSS
SEQ 117
MACNRLHTELGNWQVNPPSGFNLEPSDYLQRWLIEVNGAPGTLYANETYQLQAEFPEHYPIKAPQVIFLPPAPLHPDIYRDGHICLDILYDSWSPTMTVSSICISILSMLSSSTVKFPSSEMMDVPLILSKHVFFSKFKADEDESNNANMVFSPVSIQIIFALIAAGSSGSTLDQLLAFLKFNSVEELNSVYSRVITDVLADGSPMGGPRLSVTNWAWVDQSLSFKHSFKQVMDNVYKAASASVDFRNKGDEVTGEVNKWAEEKTNGLIKQILPPVAVNSGTSLILANALYFKGAWTEKLNASDTKDHEFHLLNGGSVQAPLMTSKKRQYVKAFDGFQVLRLRYKQGEDKRFLNMYVYLPNARDGLPTLLEKISSEPGFLDRHVPYEKVKVHEFLIPKFKISLGIEALEVLKGLELTLPFKGGLTEMVGENYPLAVANVFHKAFIEVNEEGAEAPAAKAFHKAFIEVNEEAPVAPAVTVATMMFGCSMMKVEEEIDFVADHPFMFLVKDETAGVVLFVGTLLNPLAVSPS
SEQ 118
LKVGSFFSSLIYSCNKASPNFYSYSFSLLSCFIELVNMGAKAFLVTILLSSLLFPLALSTSNDGLVRIGLKKIKFDQNNRLAARVESKEGEAVRASIRKYNNFHGNLGASEDTDIVALKNYMDAQYFGEIGIGSPPQKFTVIFDTGSSNLWVPSSKCYFSVPCFFHSKYKSSQSSTYKKNGKSAAIRYGTGAISGFFSQDSVKVGDLIVQNQEFIEATREPSVTFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLVKEPVFSFWLNRNTKEDEGGEIVFGGVDPNHYKGKHTYVPVTRKGYWQFDMGDVLIDGQATGYCDNGCSAIADSGTSLLAGPTTVITMINHAIGASGVVSQQCKAVVEQYGQTIMDMLLAEAHPKKICSQVGLCTFDGTRGISMGIESVVDENAGKSSGLHDAMCSACEMAVVWMQNQLRQNQTQERILNYVNELCERLPSPMGQSAVDCGKLSGMPSVSFTIGGRTFDLSPEEYILKVGEGPAAQCISGFIALDVPPPRGPLWILGDVFMGRYHTVFDFGKLRVGFAEAA
SEQ 119
MSKQNLEAPLLDPSPATFNRRKKWSFALCFLFALTAISFIGLRHHGHVGIWLIGDVERYNGKLQQNADVVESEQAVVAADDGRCSEIGISMLKIGGHAVDAAVATALCLGVVNPMASGLGGGGFMVVRSSSTSEVQAIDMRETAPLAASQNMYDNNGKSKLEGALSMGVPGELAGLHAAWSKHGRLPWKTLFQPAIKLARDGFVVAPYLAHHIASKAKLILKDPGLRQVIAPEGKLLRAGDICHNVKLSHSLELIAEQGPEAFYNGEVGEKLVEDVKKAGGILTMDDLRNYKVETPEAVTVNAMGYTIVGMPPPSSGTLGISLILKILESYNAAEGSLGLHRLIEAMKHMFAFRMDLGDPDFVNISKTVSDMLSPSFAKAIRQKIFDNTTFPPEYYMPRWSQLRDHGTSHFCIVDSDRNAVSVTTTVNYPFGAGVLSPSTGIVLNDEMGDFSTPSEISPDELPPAPANFIQPKKRPLSSMAPIIVLKDNQLAGVIGGSGGMKIIPAVVQVFINHFILGMDPLAAVQSPRVYHELIPNVVLYENWTCIDGDHIELSDEKKHFLEERGHQLEAHNGGAICQLIVQNLPNSHLKLGRRSGKEYKNGVFHGMLVAVSDPRKDGRPAAI
SEQ 120
MLKKISSFNILLNMASHITLCIWLLFFFISIISLAKPETYIIHMDLSAMPKAFASHHNWYLTTLASLSDSSTNHKEFLSSKLVYAYTNAINGFSASLSPSEFEAIKNSPGYVSSIKDMSVKIDTTHTSQFLGLNSESGVWPTSDYGKDIIIGLVDTGIWPESKSYSDYGISEVPSRWKGECESGIEFNSSLCNKKIIGARYFNKGLLANNPNLNISMNSARDTDGHGTHTSSTAAGSYVEGASYFGYATGTAIGIAPKAHVAMYKALWEEGVYLSDVLAAIDQAITDGVDVLSLSLGIDAIPLHEDPVAIAAFAALEKGIFVSTSAGNEGPYYETLHNGTPWVLTVAAGTVDREFIGALTLGNGVSVTGLSLYPGNSSSSESSIVYVECQDDKELQKSAHNIVVCLDKNDSVSEHVYNVRNSKVAGAVFITNITDLEFYLQSEFPAVFLNLQEGDKVLEYIKSNSAPKGKLEFRVTHIGAKPAPKVATYSSRGPSPSCPSILKPDLMAPGALILASWPQQSPVTDVTSGKLFSNFNIISGTSMSCPHASGVAALLKAAHPEWSPAAIRSAMMTTSNAMDNTQSPIRDIGSKNAAATPLAMGAGHIDPNKALDPGLIYDATPQDYVNLLCALNFTSKQIKTITRSSSYTCSNPSLDLNYPSFIGFFNGNSSESDPRRIQEFQRTVTNIGDGMSVYTAKLTTMGKFKVNLVPEKLVFKEKYEKLSYKLRIEGPLVMDDIVVYGSLSWVETEGKYVVRSPIVATSIKVDPLTGHN
SEQ 121
MEFYQKLATCSHLSLLCFILLHSIQVQGSYFDQEYGKQVLSSAIQDKDWLVSIRRIIHEYPELRFQEYNTSALIRTELDKLGIYYEYPFAKTGLVALIGSSSPPVVALRADMDALPLQELVEWEHKSKVTGKMHGCGHDAHTAMLLGAAKLLNERKDKLNGTVRLVFQPAEEGGAGAYHMINEGALGDAEAIFGMHVDFKRPTGSIGTSPGPILAAVSFFEAKIEGKGGHAAEPHATVDPILAASFAVVALQQLISREVDPLHSQVLSVTYVRGGSASNVIPPYVEFGGTLRSLTTEGLLQLQKRVKEVIEGQAAVHRCKAYIDMKEEDFPAYPACINDERLHQHVGRVGKLLLGSENIKETEKVMAGEDFAFYQELIPGVMFQIGIRNEKLGSTHAPHSPHFFLDEDVLPIGAALHTAIAEMYLNDYQHPIAVSEQ 122
RHYIYGKLTSNMKTFGIPLAAHSRVLTGSYIRSLYLQILTPFLVHTTAQADNLNCDRSATLNCDRSATEVCTDSEVSTDMEPGNSIVNGVPESIAEEDTAEPLDMDFEFYLSDDKATFKGSEIVMNEPLQSTDISGRLNVLVSWSPKMLEQYNTGLFSSLPEVFKSGFFAKRPQESVSLYKCLEAFLKEEPLGPEDMWYCPACKQHRQATKKLDLWRLPEILVIHLKRFSYNRFLKNKLETYVDFPTHDLDLSSYLAYKDGKSSYRYMLYAISNHYGSMGGGHYTAFVHQGADRWYDFDDSHVYPISQDKLKTSAAYVLFYRRVEEI
SEQ 123
MSRNSLKIHLSIGKIQPGSENKNGSPVYTDSGTCEHLSELRSRVGSNPFFNFRGCVKVRPLGRASIRREPPNELVRCGACGQAPPRLYACVTCAAVFCRVHAPSHPVGNASDPSLHSIAVDIDRAELFCCGCRDQVYDRDFDAAVVLAQTEATVIGSIQDPPPQPENTRKRRRVEYKPWTPDVKEQVLIVGNSSPLPSQLGNDSTTPEVQWGLRGLNNLGNTCFMNSVLQALLHTPPLRNYFLSDKHNRYFCQRKNNSVITRSSSDNGNKNSTMLCLACDLDAMFSAVFSGDWTPISPAKFLYSWWKHASNLASYEQQDAHEFFISVLDGIHERMQNDKGKALSPGSGDCCIAHRVFSGILRSDVMCTACGFTSTTYDPCIDISLDLELSQGSSAKMTSKKSHNTHKKEAESGKFSQNGRISTLMGCLDHFTRPEKLGSDQKFFCQHCQVRQESLKQMSIRKLPLVSCFHIKRFEHSVIKKMSRKVDHYLQFPFSLDMSPYLSSSILRSRFGNRIFSFDGDEQDASCESSSEFELFAVITHTGKLDAGHYVTYLRLSNQWYKCDDAWITQVSESIVRAAQGYMMFYVQKMLYYKASENQVS
SEQ 124
MATHSSTLTISISLLLLLFFFFFSTLSSASDMSILTYDENQHFRTDDEVMSLYESWLLEHGKSYNALDEKDKRFQIFKDNLRYIDEQNSVPNKSYKLGLTKFADLTNEEYRSMYLGTKTSDRRRLLKNKSDRYLPKVGDSLPDSVDWREKGVLVGVKDQGSCGSCWAFSAIASVEAVNSIVTGDVISLSEQELVDCDTSYNDGCNGGLMDYAFDFIIKNGGIDTEEDYPYTGRDGRCDQSRKNAKVVTIDGYEDVPANNEKALQKAVANQPVSIAIEAGGHDFQHYVSGIFTGKCGTAVDHGVVAVGYGSENGMDYWIIRNSWGASWGEKGYLRVQRNVASSKGLCGLAIEPSYPVKTGVNPPKPGPSPPSPIKPPTQCDDYAQCPEGTTCCCVFEYYNSCFSWGCCPLEGATCCEDHYSCCPHDYPVCNIRAGTCSISKDNPLGVKAMKHIHAEPIEAFINGGRKSSS
SEQ 125
MKKLFLVLFSLALVLRLGESFDFHEKELETEEKLWELYERWRSHHTVSRSLDEKDKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFRHHYAGSKIKHHRSFLGASRANGTFMYANVEDVPPSVDWRKKGAVTPVKDQGKCGSCWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINTEENYPYMAEGGECDIQKRNSPVVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQFYSEGVFTGDCGTELDHGVAIVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQREIDAEEGLCGIAMQPSYPIKTSSSNPTGSPATAPKDEL
SEQ 126
MARPQFTVILAIISLLIHYGVVSGFRLSDVTNGSSVFLPSPADGSRHTTMLLPLFPPKDTSRRAEISRRHLQKSPASARMSLHDDLLLNGYYTTHIWIGTPPQKFALIVDTGSTVTYVPCSECKKCGNHQDPKFQPEMSSTYQSVKCNKACPCDHKRQQCIYERRYAEMSASYGLLGEDIISFGNLSELAPQRAVFGCEIAETGDLYSQRADGIMGLGRGDLSIVDQLVEKHVISDSFSLCYGGMDFGGGAMVLGGVKPPADMAFTKSDFGHSPYYNIDLKEIHVAGKPLNLNPRVFGGKHGTILDSGTTYAYLPEAAFAAFKNAVVKELHSLKQIEGPDPSFKDICFSGAGSNISELSKNFPRVDMVFSDGKKLTLSPENYLFQHFKVRGAYCLGIFPNGKNPASLLGGIVVRNTLVTYDRENKRIGFWKTNCSELWDRLNLSPPSPPSPSVSSLDNTNSSAHLSPSSAPSGPPGYNTPVEIKVGLITFYLSLSVNCSELKPRIPELAHFIAQELDVNVSQVGF
SEQ 127
MGAKSFLVAFFLSLLLFPLAFCTSNDGLVRIGLKKIKFDQNNRLAARVESKEGEALRASFRKYNNLRGNLGASEDTDIVALKNYMDAQYFGEIGIGSPPQKFTVIFDTGSSNLWVPSSKCYFSVPCLFHSKYKSSQSSTYKKNGKSAAIRYGTGAISGFFSQDSVKVGDLVVKNQEFIEATREPSVTFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLVKEPVFSFWLNRNTEEDEGGEIVFGGVDPNHYKGKHTYVPVTRKGYWQFDMGDVLIDGQATGYCDNGCSAIADSGTSLLAGPTTVVTMINHAIGASGVVSQQCKAVVEQYGQTIMDMLLAEAHPKKICSQVGLCTFDGTRGVSMGIESVVDENAGKSSGLHDAMCSACEMAVVWMQNQLRQNQTQERILNYVNELCERLPSPMGQSAVDCGKLSGMPSVSFTIGGRTFDLSPEEYILKVGEGPAAQCISGFIALDVPPPRGPLWILGDVFMGRYHTVFDSGKLRVGFAEAA
SEQ 128
MVVAFVGIAKSIGQQCLRRSKPYSYSYFSSYVRSSNSKYGLQNWQFQSHRTLILQSASESVKLERLSDSDSGILEVKLDRPEARNAIGKDMLRGLQQAFEAVSNERSANVLMICSSVPKVFCAGADLKERKTMILSEVQDFVSTLRSTFSFLEGLHIPTIAAIEGIALGGGLEMAMSCDIRICGEDAVLGLPETGLAVIPGAGGTQRLPRLVGKSIAKDIIFTGRKISGKDAVSIGLVNYCVPAGEARLKTLELARDINQKGPVALRMAKCAIDKGVELNMESALALEWDCYEQLLDTKDRLEGLAAFAERRKPRYKGE
SEQ 129
MCSSNSLYINPKPCKHLADYKVKNGMSGYSLIQECFKTTPYGRTTLEISKSELPRCSICSGHEGRFYMCLICSSVLCCLSPESNHALLHSQCKAGHEISVDMERAELYCSVCCDQVYDPDFDKVVMCKHIMGFPRTEIGVVESELRLSKRRRLSFGMDLDSKNMKTLFLRRDQKSKSCFPLVLRGLNNLGNTCFMNSVLQVLLHAPPLRNYFLSDRHNRDICRKMSSDRLCLPCDIDLIFSAVFSGDRTPYSPARFLYSWWQHSENLATYEQQDAHEFFISVMDRIHDKEGKASLATKDNGDCQCIAHRTFYGLLRSDVTCTSCGFTSTTHDPCMDISLDLNSCNSSPKDFANKSSKPNESLVGCLDLFTRPEKLGSDQKLYCENCQEKQDALKQMSIKKLPLVLSFHIKRFEHSPTRKMSRKIDRHLQFPFSLDMKPYLSSSIVRKRYGNRIFSFDGDESDISTEFEIFAVVTHSGMLESGHYVTYLRLRNQWYKCDDAWITEVDEEVVRASQCYLMYYVQKMLYHKSCEDVSCQPMSLRADTFVPIAGCC
SEQ 130
MKELHSLREIEGPDPNYKDICFSGAGSDISELSKSFPPIDMVFSNGKKLSLTPENYLFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRENERIGFWKTNCSELWDRLNLSPSPPPPPLPSGLDNTNSSANLTPALAPSLPLEHAPGKIKIGLVSFDMSLSVDYSALKPRVPELAHFIAQELEVNVSQVHLMNFSTEGNDSLIRWAIFPAGSANYMPNATATEIINRLAENRFHLPDTFGSYKLVKWDIEPPPKRIRWQQNYLVVVFALLVVLIIGLSASLGWLIWRRRQEIPYNPVGSAETHEKELQPLN
SEQ 131
MVTVSVKWQKEVYPAVEIDTSQPPYVFKAQLYDLTGVPPERQKIMVKGGLLKDDADWSKVGVKEGQRLMMMGTADEIVKAPEKGPVFAEDLPEEEQVVNVGHSAGLFNLGNTCYMNSTVQCLHSVPELKSALTEYNQLGRSNDLDHSSHLLTVATRDLFNDLDKNVKPVAPMQFWTVLRKKYPQFGQQSNGAFMQQDAEECWTQLLYTLSQSLKSPNSSGSPDIVKALFGIEFDNRIHCAESGEESTETETVYSLKCHISQEVNHLHEGLKRGLKSELEKASPSLGRSAVYVKDSRINGLPRYLTIQFVRFFWKRESNQKAKILRKVDYPLSLDVYDFCSEDLRKKLEGPRQVLRDAEGKKAGLKTSEKTSSSTDGDVKMTEAEESSSGSGEASKTTQEGVLPEKEHHLTGIYDLVAVLTHKGRSADSGHYVAWVKQENGKWVQFDDDNPIPQREEDIPKLSGGGDWHMAYICMYKARVVPM
SEQ 132
MEKKKEVIRLERESVIPVLKPRLIMALADLIEHSSDRAEFLKLCKRVEYTIHAWYLLQFEDLMQLYSLFDPVNGAKKLEQQKLSPEEIDILEQNFLTYLFQIMHKSNFKIASDEEIDVAHSGQYLLNLPITVDESKLDKKLLEKYFAEHPHEDLPEFADKYVIFRRGIGIDRTTDYFFMEKVDMIIGRTWAWILRKTRIDRLFSRRSSSRRKKDPKKDDEINSEAEDHDLYVERIRIENMELSARSNQFSLHQVK
SEQ 133
MELTCSSPLSVNSTISFNPQLRRYGSVYPHKRCQTVFSLFPYCPSSSSHITITTATTAACSTSSSTSSLFGISLSHRPCSSIPRKIKRSLYIVSGVFERFTERSIKAVMFSQKEAKALGKDMVYTQHLLLGLIAEDRSPGGFLGSRITIDKAREAVRSIWHDDVEDDKEKLASQDSGSATSATDVAFSSSTKRVFEAAVEYSRTMGHNFIAPEHMAFGLFTVDDGNATRVLKRLGVNVNRLAAEAVSRLQGELAKDGREPISFKRSREKSFPGKITIDRSAEKAKAEKNALEQFCVDLTARVSEGLIDPVIGREIEVQRIIEILCRRTKNNPILLGQAGVGKTAIAEGLAINIAEGNIPAFLMKKRVMSLDIGLLISGAKERGELEGRVTTLIKEVKKSGNIILFIDEVHILVGAGTVGRGNKGSGLDIANLLKPALGRGELQCIASTTMDEFRLHIEKDKAFARRFQPVLINEPSQADAVQILLGLREKYESHHKCIYSLEAINAAVQLSARYIPDRYLPDKAIDLIDEAGSKSRMQAHKRRKEQQISVLSQSPSDYWQEIRAVQAMHEVILASKLTENDDASRLNDGSELHLQPASPSTSDEDEPPVVGPEEIAAVASLWTGIPLKQLTVDERMLLVGLDEQLKKRVVGQDEAVAAICRAVKRSRTGLKDPNRPISAMLFCGPTGVGKSELAKALAASYFGSESAMLRLDMSEYMERHTVSKLIGSPPGYVGYGEGGTLTEAIRRKPFTVVLLDEIEKAHPDIFNILLQLFEDGHLTDSQGRRVSFKNALIVMTSNVGSTAIVKGRQNTIGFLLADDESAASYAGMKAIVMEELKTYFRPELMNRLDEVVVFRPLEKPQMLQILDLMLQEVRARLVSLEISLEVSEAVMELICQQGFDRNYGARPLRRAVTQMVEDLLSESFLSGDLKPGDVAIINLDESGNPVVANKSTQSIHLSDANGNPVVTNR
SEQ 134
MKNIERLANVALLGLSLAPLVVNVDPNVNVIVTACLTVFVGCYRSVKPTPPSETMSNEHAMRFPLVGSAMLLSLFLLFKFLSKDLVNAVLTCYFFVLGIAALSATLLPAIRRFLPKKWNDDLIIWHFPYFRSLEIEFTRSQIVAAIPGTIFCVWYAKQKHWLANNVLGLAFCIQGIEMLSLGSFKTGAILLAGLFVYDIFWVFFTPVMVSVAKSFDAPIKLLFPTADAKRPFSMLGLGDIVIPGIFVALALRFDVSRGKGPQYFKSAFLGYTFGLALTIFVMNWFQAAQPALLYIVPAVIGFLAVHCIWNGDVKPLLEFDEGKTKGAEEADAKESKKVE
SEQ 135
MAFSSSYFSFIFLILLFIISFVVGEIKPIYLPGTYQSSLEKQHVKSKIPFKVHYFPQILDHFTFLPKSSKVFKQKYLINDNYWKQGGPIFVYTGNEGNIDWFAANTGFMLDIAPKFHALLVFIEHRFYGDSMPFGKKSYKSPKTLGYLNSQQALADYAVLIRSLKQNLSSESSPVVVFGGSYGGMLASWFRLKYPHIAIGAVASSAPILQFDKITPWSSFYDAVSQDFKEVSLNCYRVIKGSWTELDALSKHEEGLTEVSKLFRTCKGLHSVYSARDWLWEAFVYTAMVNYPTKANFMMPLPAYPVQEMCKIIDGLPKGASKISRAFAAASLYYNYTKREKCFNLEGGDDAHGLRGWDWQACTEMVMPMTCSNESMFPPSSYSYKEFKEDCKKKYGVEPRPHWITTEFGGYRIEQVLKRFGSNMIFSNGMQDPWSRGGVLKNISASIVALVTQKGAHHVDFRSETKNDPGWLIMQRKQEVAIIQKWLEEYYRDLKQN
SEQ 136
MSRFSLLLALVVAGGLFASALAGPATFADENPIRQVVSDGLHELENAILQVVGKTRHALSFARFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQNCSATTKGNLKVTNVVLPETKDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKNGLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGVYTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCASYPVVA
SEQ 137
MEKEHKYSLFLTKLKLFFLVTLSTFHGLSHGFQMDQARTLMSWRRSKMHAQTTTYATNEDETENLVFSDEKHVGNMEDDLIKDGLPAQPSNVMFKQYAGYVNVDVKNGRSLFYYFAEASSGNASSKPLVLWLNGGPGCSSLGFGAMLELGPFGVNPDGKTLYSRRFAWNKVANVMFLESPAGVGFSYSNTTSDYSKSGDKRTAEDAYRFLVNWFKRFPHYKGRDFYIMGESYAGFYVPELADIIVKRNMLPTTNFYIQFKGIMIGNGIMNDETDEKGTLDYLWSHALISDETHRGLLQHCKTETETCQHFQNIAEAELGNVDPYNIYGPQCSINSKSRSSSPKLKNGYDPCEQQYVQNYLNLPHVQKALHANLTNLPYLWNPCSNLDWKDTPATMFPIYKRLIASGLRILLYSGDVDAVVSVTSTRYSLSAMNLKVIKPWRPWLDDTQEVAGYMVVYDGLAFATVRGAGHQVPQFQPRRAFALLNMFFANHS
SEQ 138
MANSYTSINFFLAPIIFLAILGLQLQSSDGFGTFGFDIHHRYSDPVKGILDLHGLPEKGSVEYYSAWTQRDRFIKGRRLAEADTANSTPLSFSGGNETFRLSSLGFLHYANVTVGTPGLSFLVALDTGSDLFWLPCDCSNCVRALETRSGRRINLNIYSPNTSSTGQIVPCNSTLCGQRRRCLSSQNACAYGVAYLSNNTSSSGVLVEDILHLETDNAQQKSVEAPIALGCGIRQTGAFLSGAAPNGLFGLGLENISVPSMLASKGLAANSFSMCFGPDGIGRIVFGDKGSPAQGETPLNLDQLHPTYNISLTGITVGNKITDVDFTAIFDSGTSFTYLNDPAYKVITENFDSQAKQPRIQPDGEIPFEYCYGLSANQTTFEVPDVNLTMKGGNQLFLFDPIIMLSLQDRSGAYCLAVVKSGDVNIIGQNFMTGYRVVFDREKMVLGWKPSDCYDSRGSNDKSTTLPVNKRNSTEAPSPSSVVPEATKGNGSGNEPATSFPSVQSSKPAANQAPAHFICQLMMALFSLFSYYLIIISS
SEQ 139
MAIHTSTLSISILVMLMFSVVSSSAAEDMSIISYNEKHHTNGESTVWRTDDEVMSLYESWLVEHKKVYNALGEKDKRFQIFKDNLRYIDEHNSVPDKSYKLGLTQFADLTNEEYKSIYLGTKPDGRSRLLNTQSDRYAPKVGDSLPDSVDWRKKGVLVDVKNQGQCGSCWAFSAVASIEAVNKIVTGNLISLSEQELVDCDTSDNQGCQGGLMDDAFKFVIQNGGIDTEEDYPYKAKDGKCDQARKNARVVTIDGYEDVPDNDEKALKKAVAGQPVSVAIEAGGKDFQHYKSGIFTGKCGAAVDHGVVAVGYGSENGMDYWIVRNSWGASWGEKGYLRMQRNIGNPKGLCGIATIASYPVKTGQNPPKPAPSPPPVKPPTQCDDYNECPAGTTCCCVYEYYKYCFAWGCCPMEGATCCKDHNSCCPHDYPVCNVKAGTCSISKNNPLGVKAMQHILAKPIGTFGNEGKKSPSS
SEQ 140
MEIKILLASLVIWYITCINVYADDMVRIELKRQSLDLSSISDARIYAKDLRGRNRNLAAPNDQIVYLKNYHDVQYFAEIGIGSPPQRFIVVFDTGSSNLWVPSSRCFFSIACYLRSRYKSRLSNTYTKIGKSSKIPFGTGSVHGFFSQDNVKVGGAVLKQQVFTEVTREGYLTLLRARFDGVLGLGFDQSTTSRNVTPVWYNMLLQHMVTKSIFSFWLNRDPTSKIAGEIIFGGMDWTHFRGQHTYVPVAQNGYWEIEIGDLFIGSNSTGLCKDGCPAIVDTGTSFIAGPTTILTQINHAIGAEGIISLECKKVVSSYGDSIWERLIAGLQPENVCNRIGLCTNNGSLCSSCEMIVFWIQVEIRKERSKEKAFQYANQLCEKLPNPGGKSFINCDVFALPHITFTIGDKSFPLSPDQYVIRVDDSQGVHCISGFTTLNAHPRRPLWVLGDAFLRAYHTVFDFGSSQIGFAESA
SEQ 141
MASIFALSLFFIIISFCITSITIPVQSDGHETFIIHVSKSDKPRVFATHHHWYSSIIRSVSQHPSKILYTYSRAAVGFSARLTAAQADQLRRIPGVISVLPDEVRHLHTTHTPTFLGLADSFGLWPNSDYADDVIIGVLDTGIWPERPSFSDEGLSPVPSSWKGKCATGPDFPETSCNKKIIGAQMFYKGYEASHGPMDESKESKSPRDTEGHGTHTASTAAGSVVANASFYQYAKGEARGMAIKARIAAYKICWKNGCFNSDILAAMDQAVNDGVHVISLSVGANGYAPHYLLDSIAIGAFGASEHGVLVSCSAGNSGPGAYTAVNIAPWILTVGASTIDREFPADVILGDNRIFGGVSLYSGDPLTDAKLPVVYSGDCGSKYCYPGKLDHKKVAGKIVLCDRGGNARVEKGSAVKQAGGVGMILLNLADSGEELVADSHLLPATMVGQKAGDKIRHYVKSDPSPTATIVFRGTVIGKSPAAPRVAAFSSRGPNHLTPEILKPDVIAPGVNILAGWTGSVGPTDLDIDTRRVEFNIISGTSMSCPHASGLAALLKRAHPKWTPAAVKSALMTTAYNLDNSGKVFTDLATGQESTPFVHGSGHVDPNRALDPGLVYDIETSDYVNFLCSIGYDGDDVAVFVRDSSRVNCSEQNLATPGDLNYPSFSVVFTGESNGVVKYKRVMKNVGKNTDAVYEVKVNAPSSVEVSVSPAKLVFSEEKKSLSYEISFKSKSSGDLEMVKGIESAFGSIEWSDGIHNVRSPIAVRWRHYSAASI
SEQ 142
MPSSLFLTLLLASISLSFSSTLNSNDDEFFLSSTPKFPLTMAEKLIRQLNLFPKHDINKAAATGDSEQRLFERKLNLSYVGNSGSTVQDLGHHAGYYRLPHTKDARMFYFFFESRSRKNDPVVIWLTGGPGCSSELAVFYENGPFKIADNMSLVWNDFGWDKVSNLIYVDQPTGTGFSYSSNDDDIRHDERGVSNDLYDFLQAFFKAHPQYAKNDFYITGESYAGHYIPAFASRVHQGNKNKEGIYVNLKGFAIGNGLTDPEIQYKAYTDYALDMKLIKKSDYNAIEKSYPKCQLAIKLCGKDGGTACMAAYLVCTSIFNKIMDIAGDKNYYDVRKRCEGDLCYDFSKMETFLNDQQVKKALGVGDIEFVSCSSEVYQAMQLDWMRNLEEGIPSLLEDGIKLLVYAGEYDLICNWLGNSRWVHAMKWSGQKAFGKATQVSFAVDGVEKGVQKNYGPLTFLKVHDAGHMVPMDQPKAALEMLHRWMQDKLSKQGHLAPM
SEQ 143
MLVISDCYINSCKAFNFVINLPVMGHSHSHSSHSHSHFHSSKSSDDQNMDMGESITTQTDVSFMLAKHVFSKEVKGDSNLVFSPLSIQIVLGLIAAGSKGPTKDQLLCFLKSKSIDELNSLYSHFVSVVFVDGSPNGGPRLSVVNGVWIDQTLPFKPSYKKVVDKVYKAASNSVDFQCKAAEVANQVNQWAKMKTNNLIKEILPHGTVNNMTRLIFANALYFKGVWNDKFNASETKDHKFHLLSGGSIKAPFMTSKNKQYAVAFDGFKVLGLHYKQGKDMRRFCMYLILPDARDELPALLDKISSEPGFIDHHIPFEKAKMRKFLIPKFKTTFGFEASKVLKGLGLTLPFSSGGLTEMVDSPLAGRLFVSQIFHKSFIEVNEEGTEAAAVTASVIMTKSLIIEKEMEFVADHPFLFLIRDESTGAVFFIGSVLNPLAG
SEQ 144
MLRIGPSLRTARKLLNRNLHFQSPIIAGDVAPVHHRRQELHRFVRRCNYSSTVGNTSASASFFSTLNNSNSSTTSTTPHVERAEENDSLQSNASEVEPVAAVEQRLSSGMVDAYLAIELALDSVVKIFTVSSSPNYFLPWQNKSQRETTGSGFVIRGKRILTNAHVVADHTFVLVRKHGSPTKYRATVQAVGHECDLAILVVESEEFWEGMNSLELGDVPFLQEAVAVVGYPQGGDNISVTKGVVSRVEPTQYVHGASQLLAIQIDAAINPGNSGGPAIMGDKVAGVAFQNLSGAENIGYIIPVPVIKHFIAGVEERGEYIGFCSLGLSCQPTENAQIREYFQMQSKLTGVLVSRINPLSDASRVLKKDDIILSFDGVPIANDGTVPFRNRERITFDHLVSMKKPNETAELKVLRNGKVHDFKITLHPLQPLVPVHQFDKLPSYFIFAGLVFIPLTQPFLHEYGEDWYNASPRRLCERALRELPKKPGEQFIILSQVLMDDINAGYERLAELQVKKVNGVEVLNLKHLRQLVEDGNQKNVRFDLDDEKVIVLNYESARIATSRILKRHRIPHAMSSDLTDDENAVELQSACSS
SEQ 145
ICREPPNELVRCGACGHAPPRLYACVTCAEVFCRVHAPSHPAGNAADPSLHCIAVDIDRAELFCCGCRDQVYNSDFDAAVALAQTEATVIGSIQDPPPHPESTRKRRRVEYKPWTPDVKEQVLIVGNSSPLPSQLGNDSTTPEVQWGLRGLNNLGNTCFMNSVLQALLHTPPLRNYFLSDKHNRYFCQRKNSSVITRSSSDNGNKNSTMLCLACDLDAMFSAVFSGDRTPISPAKFLYSWWKHASNLASYEQQDAHEFFISVLDGIHERMQNDKGKALSPGSGDCCIAHRVFSGILRSDVMCTACGFTSTTYDPCIDISLDLELSQGSSSKMTSKKSHNTHKKEAESGKFSQNGRISSLMGCLDHFTRPEKLGSDQKFFCQHCQVRQESLKQMSIRKLPLVSCFHIKRFEHSVIKKMSRKVDHYLQFPFSLDMSPYLSSSILRSRFGNRIFSFDGDEQDASCESSSEFELFAVITHTGKLDAGHYVTYLRLSNQWYKCDDAWITQVSENIVRAAQGYMMFYVQKMLYYKASEKQVS
SEQ 146
MEGSPVLGEHAELIGVLSRPLRQRATAAEIQMVIPWEAITSACGSLLKEELQTRRKIHFDNGNLISVKNESPSNNIRNGPSNDTREHLLIDPVPPSLIEKAMTSICLITVDDGAWASGVLLNKQGLLLTNAHLLEPWRFGKTSVNGSGYNTKSDVVLIPSDQSEHPGVEKFDIQRRNKHLIQKELKTPHFLVDNEQGSFRVNLAKTGSRIIRVRLDFMDPWVWTNAKVVHVSRGPLDVALLQLELVPDQLCPITADFMCPSPGSKAYILGHGLFGPRCDFLPSACVGAIAKVVEAKRSLLNQSSLGEHFPAMLETTAAVHPGGSGGAVVNSEGHMIALVTSNARHGGGTVIPHLNFSIPCAALEPIFKFVEDMQNLSLEYLDKPDEQLSSVWALTPPLSSKQSPSMLHLPMLPRGDSDGDTKGSKFAKFIADSEAMLKSATQLGKVERLSNKLVHSKL
SEQ 147
MLKALTSSCLQNRFHAVTTAFTPQVRRGTDSNTPLLRVLGSLRSSNRRGPYLSRRFFCSDSTDGSESNSEAAASEAKPAEKGGDADSKASAAIVPTVFKPEDCLTVLALPLPHRPLFPGFYMHIYVKDPKVLAALLESRKRQAPYAGAFLMKDEQGTDPNVVSASDTEKNIYELKGKDMLNRLHEVGTLAQITSIKDDQVILIGHRRIRMTEVVSEEPLTVKVDHLKEQPYNKDDDVIKATSFEVLSTLRDVLKTSSLWKDHVQTYIQ
SEQ 148
MERKHLWAALLLLAIACFVFPASSDSLLRISLKKRQLDISSLNVANVARLEDRYGKHVMKDIEKKKKKKKSDTNSDIVSLKNYLDAQYYGDISIGSPPQNFTVIFDTGSSNLWVPSSRCYFSIACWIHSKYKARKSSTYTKKGESCSIHYGSGSISGFLSQDNVQVGDLVVTDQVFIEATRESSVTFIVAKFDGILGLGFKEIAVGNTTPVWYNMVKQDLVKEPVFSFWLNRDINAKEGGELVFGGVDPKHFKDKHTYVPLTQKGYWQFKMGDFSIGNQSTGFCEGGCAAIVDSGTSLLAGPTAVVTQVNHAIGAEGVLSMECKETISQYGEMIWDLLVSGVTPDQICLQVGLCYLNGAQHLSSNIRSVVEKENEGSSIGEAPLCTACEMAVIWMQNQLKQKTTKESVLEYVNQLCEKLPSPMGQSVIDCNSISSMPNVTFNIGDKDFVLTPDQYILKTGEGIATICLSGFVALDVPPPRGPLWILGNVFMGVYHTVFDYGNLQLGFAEAA
SEQ 149
SRSYYNILLLQYLFLFVMALILGWKILFILLFVIIGMCTSQVTSRNIQALSMLEKHELWMSSHGRTYKNEAEKEKRLNIFKENVKFIESFNNNGTKKPYKLGINAFADLTAEEFLSYYTTGLKLSNSYSQIQSSFKYENLSDVPSVMDWRKSGAVTRIKHQGQCGCCWAFSAVAALEGANKLSTNNLISLSEQQLLDCTTENNGCNGGLMTTAYDFIIQNGGIATESNYPYEEYQDSCKSQEMNSAVKINRYETLPSTESALLKAVAKQPVSIGIAVNEDFHLYQNGVYNGNCEGQELNHAVTVIGYGTENDGTKYWLIKNSWGTSWGENGYMKIARDTGIEGGLCGITTLASYPVL
SEQ 150
MGLPEVVDVARNYAVMVRIQGPDPKGLKMRKHAFHLYNSGKTTLSASGMLLPSSFVNASVSKQIQGESKLHSFGGHFLVLTVASVIEPFVVQQDRGDISKDKPELIPGAQIDILWEGGNTLQNDIKVTNKEGLNWLPAELLRVVDIPVSSAAVQSLVEGSSSSIEHGWEVGWSLAAYGNSRQSFTNTKRTQVEKISFPSQTPMMEAQSSLPSVIGTSTTRIALLRVSSNPYEDLPALKVATWSRRGDLLLGMGSPFGILSPSHFFNSISVGSIANSYPPSPQNKALLIADIRCLPGMEGSPVLGEHAELIGVLSRPLRQRATAAEIQMVIPWEAITSACGSLLKEELQTRRKIHFGNGNLISVKKESFSNNIQDGHANDTQEHLQIDPVPPSLIEKAMTSICLIAVDDGAWASGVLLNKQGLLLTNAHLLEPWRFGKTSVNGSGYNTKSDVVLIPSDQSEHPGVEKFDIQRRNKHLIQKELKTPHFLVDNEQCSFRVNLANTGSRTIRVRLDFMDPWVWTNAKVVHVSRGPLDVALLQLELVPDQLCPIIVDFMCPSPGSKAYILGHGLFGPRCDFLPSACVGAIAKVVEAKRPLLNQSSLGGHFPAMLETTAAVHPGGSGGAVVNSEGHMIALVTSNARHGGGTVIPHLNFSIPCAALEPIFKFAEDMQNLSLEYLDKPDEQLSSVWALTPPLSSKQSPSMLHLPMLPRGDSDGDTKGSKFAKFIADSEAMLKSATQLGKVERLSNKLVHSKL
SEQ 151
MDNPSEDSSDSPQQQPESPVNDDQRVYLVPYRWWKEAQESSPSDGKSVTLYAAAPAPSYGGPMKIINNIFSPDVAFNLRREEESLSQSQENGEVGVSGRDYALVPGDIWLQALKWHSNSKAAAKNGKSFSATDEDIADVYPLQLRLSVLRETSSLGVRISKKDNTVECFKRACRIFSVDTEPLRIWDLSGQTALFFSDENNKILKDSQKQSEQDMLLELQVYGLSDSVKNKVKKDEMSMQYPNGSSFLMNGTGSGITSNLTRSSSSSFSGGPCEAGTLGLTGLQNLGNTCFMNSALQCLAHTPKLVDYFLGDYKREINHDNPLGMNGEIASAFGDLLKKLWAPGATPVAPRTFKLKLAHFAPQFSGFNQHDSQELLAFLLDGLHEDLNRVKNKPYVEAKDGDDRPDEEIADEYWNNHLARNDSIIVDVCQGQYRSTLVCPVCKKVSIMFDPFMYLSLPLPSTSMRSMTVTVIKNGSDIQISAFTITVSKDGRLEDLIRALSTACSLDADETLLVAEIYNNRIIRYLEEPADSLSLIRDGDRLVAYRLHKGTEEAPLVVFTHQQIDEHYIYGKLTSNMKTFGIPLAAHSRVLTGSDIRSLYLQILTPFLVHNTAQADNLNCDRSATEACTDSEVITDMEPGNSIVNGVPESIAEEDTAEPLDMEFQFYLSDDKATFKGSEIVMNEPLQSTDISGRLNVLVSWSPKILEQYNTGLFSSLPEVFKSGFFAKRPQESVSLYKCLEAFLKEEPLGPEDMWYCPACKQHRQATKKLDLWRLPEILVIHLKRFSYNRFLKNKLETYVDFPTHDLDLSSYLAYKDGKSSYRYMLYAISNHYGSMGGGHYTAFVHQGADRWYDFDDSHVYSISQDKLKTSAAYVLFYRRVEEI
SEQ 152
MASSSRVFVLLLLIIFNFLYISAQKTIKHKPFSMSFPLISTSLSHNSSSKALFLSSFMASNNRRQTQNTKTMSRIPSLNYKSTFKYSMALIVTLPIGTPPQNQQMVLDTGSQLSWIQCHKKIPKRPPPTTSFDPSLSSTFSVLPCTHPLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPPLILGCATESEDAEGILGMNLGRFSFASQAKVQKFSYCVPIRQGSHAVKPSGTFYLGQNPNSHTFQYINLLTFPQSQRMPNLDPLAFTVGMVGIKIGGKKLNISGRVFRPNAGGSGQTIIDSGTEYTFLVEEAYNKVREEIVRLVGPRLKKGYVYGGALDMCFDNRPIEIGRLIGDMTLQFENGVDILINKERMLDEVEGGIHCVGIGRSESLGIASNIIGNFHQQNLWVEFDMRNRRVGFGKGECSRQV
SEQ 153
YTIIIFSLNMKIFSIFSLLLLLLLPILASCHEKQVYIVYFGGHKGEKALHEIEENHHSYLMSVKESEEEARYSLIYSYKHSINGFAALLTPHEASKLSELEEVVSVYKSEPRKYRLQTTRSWEFSGVEESVQPNSLNKDNLLLKARYGKDVIIGVLDSGLWPESKSFSDEGLGPIPKSWKGICQSGDAFNSSNCNKKIIGARYYIKGYEQYYGPLNRTLDYLSPRDKDGHGTHTSSTAGGRKVPNVSAIGGFASGTASGGAPLARLAMYKVCWAIPKEGKEDGNTCFDEDMLAAMDDAIADGVDVISISIGTKEPQPFDQDSIAIGALYAVKKNIVVSCSAGNSGPAPSTLSNTAPWIITVGASSVDRAFLSPVILGNGKKFTGQTVTPYKLEKEMYPLVYAGQVINSNVTKDVAGQCLPGSLSPKKAKGKIVICLRGNGTRVGKGGEVKRAGGIGYILGNNKANGAELVADPHFLPATAVDYKSAMQILNYINSTKSPVAYIVPAKTVLHSKPAPYMASFTSRGPSAVAPDILKPDITAPGLNILAAWSGGSSPTKLDIDDRVVEYNIISGTSMSCPHVGGAAALLKAIHPTWSSAAIRSALITSAGLRNNVGEQITDASGKPADPFQFGGGHFRPSKAADPGLVYDASYQDYLLFLCASGIKDLDKSFKCPKKSHLPNNLNYPSLAIPNLNGTVTVSRRLTNVGAPKSVYFASAKPPLGFSVEISPPVLSFKHVGSKRTFTITVKVRSDMIDSIPKDQYVFGWYSWNDGIHNVRSPIAVKLA
SEQ 154
MATRRSSSSALTALAASRSRLLSRFRPAVSRLSQNTLLGTGRCPPPNSGFFVAETTAALWPNYNVLSKSFVHSYSTTAASSGQINNMDYTEMALEGIVGAVEAARTSKQQVVETEHLMKALLEQKDGLARRIFTKAGLDNSSVLQETDQFISQQPKVVGDTSGPILGSHLSSLLENAKKHKKEMGDSFVSVEHMLLSFLSDTRFGQKLFRNLQLTEKALKDAVNAVRGSQRVTDPNPEGKYEALEKYGNDLTELARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVIIGEPGVGKTAIAEGLAQRIVRGDVPEPLMNRKLMSLDMGALLAGAKYRGDFEERLKAVLKEVSSSNGQIILFIDEIHTVVGAGATSGAMDAGNLLKPMLGRGELRCIGATTLNEYRKYIEKDPALERRFQQVYCGQPSVEDAISILRGLRERYELHHGVKISDSALVSAAVLADRYITERFLPDKAIDLVDEAAAKLKMEITSKPTELDEIDRAVLKLEMEKLSLKNDTDKASKERLNKLESDLKSLKAKQKELNEQWEREKDLMTRIRSIKEEIDRVNLEMEAAEREYDLNRAAELKYGTLISLQRQLGEAEKNLADYRKSGSSLLREEVTDLDITEIVSKWTGIPLSNLQQSERDKLVFLENELHKRVVGQDMAVKSVADAIRRSRAGLSDPNRPIASFMFMGPTGVGKTELGKALAAYLFNTENALVRIDMSEYMEKHAVSRLVGAPPGYVGYEEGGQLTEVVRRRPYSVVLFDEIEKAHHDVFNILLQLLDDGRITDSQGRTVSFTNTVVIMTSNIGSHYILETLQNTRDSQEAVYDAMKKQVIELARRTFRPEFMNRIDEYIVFQPLDLKQVSRIVELQMRRVKDRLKQKKIDLHYTQEAISLLANMGFDPNYGARPVKRVIQQMVENEVAMGVLRGDFSEEDMIIVDADASPQGKDLLPEKRLLIRRIENGSNMDAMVAND
SEQ 155
VNVKCFFVSFFFSFSCMSLFFLQGWNFETFCLKTQSFAVTNKNHRPHLHSHHSSFLCFHTSYLLFFLILYIYIAKTTSRFAKTQQPPQKMSRFTMLVVLVLLLLCLCHLSVATIGSSSNKKSTYIVHVAKSQMPESFENHKHWYDSSLKSVSDSAEMLYVYNNVVHGFSARLTVQEAESLERQSGILSVLPEMKYELHTTRTPSFLGLDRSADFFPESNAMSDVIVGVLDTGVWPESKSFDDTGLGPVPDSWKGECESGTNFSSSNCNRKLIGARYFSKGYETTLGPVDVSKESKSARDDDGHGTHTATTAAGSIVQGASLFGYASGTARGMATRARVAVYKVCWIGGCFSSDILAAMDKAIDDNVNVLSLSLGGGNSDYYRDSVAIGAFAAMEKGILVSCSAGNAGPGPYSLSNVAPWITTVGAGTLDRDFPAYVSLGNGKNFSGVSLYKGDLSLSKMLPFVYAGNASNTTNGNLCMTGTLIPEKVKGKIVLCDRGINPRVQKGSVVKEAGGVGMVLANTAANGDELVADAHLLPATTVGQTTGEAIKKYLTSDPNPTATILFEGTKVGIKPSPVVAAFSSRGPNSITQEILKPDIIAPGVNILAGWTGGVGPTGLAEDTRRVGFNIISGTSMSCPHVSGLAALLKGAHPDWSPAAIRSALMTTAYTVYKNGGALQDVSTGKPSTPFDHGAGHVDPVAALNPGLVYDLRADDYLNFLCALNYTSIQINSIARRNYNCETSKKYSVTDLNYPSFAVVFLEQMTAGSGSSSSSVKYTRTLTNVGPAGTYKVSTVFSSSNSVKVSVEPETLVFTRVNEQKSYTVTFTAPSTPSTTNVFGRIEWSDGKHVVGSPVAISWI
SEQ 156
MLKALTSSCLQNRFHAVTTAFTPQVRRGTDSNTPLLRVLGSLRSSNRRVPYLSRRFFCSDSTDGSESNSEAAASEAKPAEEGGDADSKASAAMVPTVFKPEDCLTVLALPLPHRPLFPGFYMHIYVKDPKVLAALLESRKRQAPYAGAFLMKDEQGTDPNVVSASDTEKNIYELKGKDMLNRLHEVGTLAQITSIKDDQVILIGHRRIRMAEVVSEEPLTVKVDHLKEQPYNKDDDVIKATSFEVLSTLRDVLKTSSLWKDHVQTYIQHIGDFNYARLADFGAAISGANKLQCQQVLEELDVHKRLQLTLELVKKEMEISKIQESIARAIEEKISGEQRRYLLNEQLKAIKKELGLETDDKTALSAKFRERLEPNKEKIPVHVMQVIEEELTKLQLLEASSSEFNVTRNYLDWLTALPWGNYSDENFDVLRAEQILDEDHYGLTDVKERILEFIAVGKLRGTSQGKIICLSGPPGVGKTSIGRSIARALNRKFYRFSVGGLSDVAEIKGHRRTYIGAMPGKMVQCLKSVGTANPLVLIDEIDKLGRGHAGDPASAMLELLDPEQNANFLDHYLDVPIDLSKVLFVCTANVVEMIPNPLLDRMEVISIAGYITDEKMHIARDYLEKATRETCGIKPEQVEVTNSALLALIENYCREAGVRNLQKQIEKIYRKIALKLVREDGEIEPQNAEVGEVEAESIHLSDEIKSKEEIQAGAESANGSNDDKASENNAEAEAQGAPVNQTQKSANEDACLQDTQETEKATESEASKTVNKVVVDSPNLADYVGKPVFHAERIYDQTPVGVVMGLAWTSMGGSTLYIETSLVEQGEGKGALNVTGQLGDVMKESAQIAHTVARTILQEKEPDNQFFANSKLHLHVPAGATPKDGPSAGCTMITSLLSLAMKKPVKKDLAMTGEVTLTGKILPIGGVKEKAIAARRSDVKTIIFPSANRRDFDELAPNVKEGLDVHFVDDYKQIFDLAF
SEQ 157
MQFFRRNPSLHRISSRFLNQVVKTSAYSTKKVYNAGQPTAATHPQLMKEGEITPGITSEEYMQRRKKLLEFLPENSLAIVAAAPIKMMTDVVPYNFRQDADYLYITGCQQPGGVAVLGHDCGLCMFMPEQSPQDALWQGETAGVDAALQIFKADLAYPINRLPQILSRMIESSSTVFHNVKTRTSSYLELEAYKKAVSNYKVKDFSVYTHEARFVKSPAELKLMRDSASIACQALVQTMLYSKLFPDEGMLSAKFEYECRVRGAQRMAFNPVVGGGPNGSVVHYFRNDQKIEDGNLVLMDVGCELHGYVSDLTRVWPPFGKFSSVQEELYNLILETNKECVELCRPGTTIREIHHYSVETLRRGFKEIGILKNDRRGRYEMLNPTNIGHYLGMDVHDCSTIGNDRPLKPGVVITIEPGVYIPSCFDCPERFQGIGFRIEDEVLITESGYEVLTASIPKEIKHLESLLNNFGSGRGTEIRAALS
SEQ 158
LLTSHKNHIILLPFLLYKIFISLQKQTLMASSTRVFVLLLLIIFNFLYISAQKTIKHKPFSMSFPLTSTSLSHNSSSKALFLSSLLASNQRKQAPNTKTVSRIPSLNYKSTFKYSMALIVTLPIGTPPQNQQMVLDTGSQLSWIQCHKKIPKRPPPTTSFDPSLSSTFSVLPCTHPLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPPLILGCATESEDAEGILGMNLGRFSFASQAKVQKFSYCVPIRQGSHAVKPSGTFYLGQNPNSHTFQYINLLTFPQSQRMPNLDPLAFTVGMVGIKIGGKKLNISGRVFRPNAGGSGQTIIDSGTEYTFLVEEAYNKVREEIVRLVGPRLKKGYVYGGALDMCFDNRPMEIGRLIGDMTLQFENGVEILINKERMLDEVEGGIHCVGIGRSESLGIASNIIGNFHQQNLWVEFDMRNRRVGFGKGECSRQM
SEQ 159
MAALNFFIIFTSLVLPIASDPLLSTYVVHVDTKAKPSHYLTQDEWYNSVVESVLANKMDSDSTSPRLFYSYDVVLQGFAARLTDQESEKLNKFPEVIHIFKDQSRIKLDTTRSPNFLGLNTGYGLWPQSNFGDDVIIGLVDTGIWPESESFKDNGIGPIPTRWKGKCVDGIEFNATSSCNRKLIGARNFVKGVENDYHHQSARDQNGHGTHTASTAAGTEVNGANVFGFAKGKARGIASKARIAMYKACGSSSCAESDILAAIESAIKDGVDILSLSLGYDDAPFYENPVAIATFAAVKRNIFVASSAGNLGPYPFSVHNTAPWVTTVGAGSLDRDFPVEINLSNNKTFVGSSLYPGRISGKSYSLVYIENCSIMTIDRSKVERKIVVCNTSKIEALRNGILIQKAGGFGLIQLNLPTEGEGIRAMAYTLPSATLGYKEGIELLSYIKSNANPRAGFVRRKDTVIGKKVRAPIVASFSSRGPNVVVPEVLKPDLIAPGLNILAAWPGDISPTRLKMDPRRVKFNINSGTSMACPHIAGVAALVRAVHPDWSPAAIKSALMTTSTAFDNAQLPIIKHEDMELATPISIGAGHVNPESAIDPGLIYDTDTSDYINLLCSLNYTEKQMKLFTNESNPCSGFTGSPLDLNYPSLSVMFRPDSYVHVVKKTLTHVAVSKPEVYKVKIVNLNSEKVSLSIEPRKLIFNESLQKQSYVVKFESHYAFNSSRKIAEQMAFGSILWESEKHNVRSPFAVMWVQQNFNNSRLYK
SEQ 160
MEISKIQESIARAIEEKISGEQRRYLLNEQLKAIKKELGLETDDKTALSAKFRERLEPNKEKIPVHVMQVIEEELTKLQLLEASSSEFNVTRNYLDWLTALPWGSYSDENFDVLRAEQILDEDHYGLTDVKERILEFIAVGKLRGTSQGKIICLSGPPGVGKTSIGRSIARALNRKFYRFSVGGLSDVAEIKGHRRTYIGAMPGKMVQCLKSVGTANPLVLIDEIDKLGRGHAGDPASAMLELLDPEQNANFLDHYLDVPIDLSKVLFVCTANVVEMIPNPLLDRMEVISIAGYITDEKVHIARDYLEKATRETCGIKPEQVEVTDSALLALIENYCREAGVRNLQKQIEKIYRKIALKLVREDGEIEPQNAEVDEVKAESIHLSDEIKSKEEIQAGAESANGSNDDEASENNAEAEAQGAENQTQKSANEDTCLQDTQETEKATESEASKTVNKVVVDSPNLADYVGKPVFHAERIYDQTPVGVVMGLAWTSMGGSTLYIETSLVEQGEGKGALNVTGQLGDVMKESAQIAHTVARTILLEKEPDNQFFANSKLHLHVPAGATPKDGPSAGCTMITSLLSLAMKKPVKKDLAMTGEVTLTGKILPIGGVKEKAIAARRSDVKTIIFPSANRRDFDELAPNVKEGLDVHFVDDYKQIFDLAF
Claims (22)
1.一种突变型、非天然存在的或转基因烟草植物细胞,其包含:
(i)多核苷酸,其包括编码功能性蛋白酶的序列、由编码功能性蛋白酶的序列组成或基本上由编码功能性蛋白酶的序列组成,且与SEQ ID NO:1到SEQ ID No:80中的任一个具有至少95%的序列同一性;
(ii)由(i)中所示的所述多核苷酸编码的多肽;
(iii)多肽,其包括编码蛋白酶的序列、由编码蛋白酶的序列组成或基本上由编码蛋白酶的序列组成,且与SEQ ID NO:81到SEQ ID No:160具有至少95%的序列同一性;或
(iv)包含(i)中所示的所述经分离的多核苷酸的构建体、载体或表达载体,
且其中与所述蛋白酶的表达或活性未改变的对照烟草植物细胞相比,所述蛋白酶的表达或活性得以调节。
2.根据权利要求1所述的突变型、非天然存在的或转基因烟草植物细胞,其中所述蛋白酶的表达或活性与所述对照烟草植物细胞相比上调。
3.根据权利要求1所述的突变型、非天然存在的或转基因烟草植物细胞,其中所述蛋白酶的表达或活性与所述对照烟草植物细胞相比下调。
4.根据前述权利要求中任一项所述的突变型、非天然存在的或转基因烟草植物细胞,其中选自以下各项的蛋白酶的表达或活性经调节:
SEQ ID NO:1到16中的至少一者;或
SEQ ID NO:30到41中的至少一者;或
SEQ ID NO:17到22中的至少一者;或
SEQ ID NO:42到44中的至少一者;或
SEQ ID NO:45到61中的至少一者;或
SEQ ID NO:62到80中的至少一者;或
SEQ ID NO:23到29中的至少一者。
5.根据权利要求4所述的突变型、非天然存在的或转基因烟草植物细胞,其中选自SEQID NO:30到41中的至少一者的蛋白酶的表达或活性在东方型烟草中经调节。
6.根据权利要求4所述的突变型、非天然存在的或转基因烟草植物细胞,其中选自SEQID NO:17到22的蛋白酶的表达或活性在弗吉尼亚型烟草中经调节。
7.根据权利要求4所述的突变型、非天然存在的或转基因烟草植物细胞,其中选自SEQID NO:42到44中的至少一者的蛋白酶的表达或活性在白肋型烟草中经调节。
8.根据权利要求4所述的突变型、非天然存在的或转基因烟草植物细胞,其中选自SEQID NO:45到61中的至少一者的蛋白酶的表达或活性在弗吉尼亚或东方型烟草中经调节。
9.根据权利要求4所述的突变型、非天然存在的或转基因烟草植物细胞,其中选自SEQID NO:62到80中的至少一者的蛋白酶的表达或活性在白肋或东方型烟草中经调节。
10.根据权利要求4所述的突变型、非天然存在的或转基因烟草植物细胞,其中选自SEQID NO:23到29中的至少一者的蛋白酶的表达或活性在白肋或弗吉尼亚型烟草中经调节。
11.根据前述权利要求中任一项所述的突变型、非天然存在的或转基因烟草植物细胞,其中所述突变是杂合或纯合突变。
12.根据前述权利要求中任一项所述的突变型、非天然存在的或转基因烟草植物细胞,其中所述一种或多种蛋白酶的表达增加约10%到约1000%。
13.根据权利要求12所述的突变型、非天然存在的或转基因烟草植物细胞,其中所述一种或多种蛋白酶的表达增加至少10%、至少20%、至少25%、至少50%、至少100%、至少200%、至少500%、至少750%或高达1000%。
14.一种突变型、非天然存在的或转基因植物或其组分或部分,其包括根据前述权利要求中任一项所述的植物细胞。
15.一种植物材料,其包含来自根据权利要求14所述的植物的生物质、种子、茎、花或叶。
16.一种烟草植物,其包括根据权利要求1至13中任一项所述的植物细胞、根据权利要求14所述的植物的至少一部分或根据权利要求15所述的植物材料。
17.一种用于制备具有经调节水平的蛋白酶的烟草植物的方法,所述方法包括以下步骤:
(a)提供植物,所述植物包括(i)多核苷酸,其包括编码功能性蛋白酶的序列、由编码功能性蛋白酶的序列组成或基本上由编码功能性蛋白酶的序列组成,且与SEQ ID NO:1到SEQID No:80中的至少一个具有至少95%的序列同一性;
(b)将一个或多个突变插入至所述烟草植物的所述多核苷酸中以产生突变型烟草植物;以及
(c)烘烤所述烟草植物材料。
18.根据权利要求17所述的方法,其中步骤(b)中的所述烟草植物为突变型烟草植物,优选地,其中所述突变型烟草植物在一个或多个其它序列中包括一个或多个突变,所述一个或多个其它序列编码功能性蛋白酶且与SEQ ID NO:1到SEQ ID No:80中的至少一者具有至少95%序列同一性。
19.根据权利要求17或18所述的方法,其中烟草植物的细胞的基因组利用基因组编辑技术或基因组改造技术修饰,所述技术选自CRISPR/Cas技术、锌指核酸酶介导的诱变、化学或辐射诱变、同源重组、寡核苷酸引导的诱变和大范围核酸酶介导的诱变。
20.一种制造与对照植物材料相比风味概况改变的烘烤的植物材料、优选地烘烤的叶或花的方法,所述方法包括以下步骤:
(a)提供根据权利要求14所述的植物或根据权利要求15所述的植物材料;
(b)任选地自其收获所述植物材料;及
(c)烘烤所述植物材料一段时间,使得至少一种蛋白酶的水平与对照经烘烤植物材料相比经调节。
21.一种
(i)多核苷酸,其包括编码功能性蛋白酶的序列、由编码功能性蛋白酶的序列组成或基本上由编码功能性蛋白酶的序列组成,且与SEQ ID NO:1到SEQ ID No:80中的任一个具有至少95%的序列同一性;
(ii)由(i)中所示的所述多核苷酸编码的多肽;
(iii)多肽,其包括编码蛋白酶的序列、由编码蛋白酶的序列组成或基本上由编码蛋白酶的序列组成,且与SEQ ID NO:81到SEQ ID No:160具有至少95%的序列同一性;或
(iv)包含(i)中所示的所述经分离的多核苷酸的构建体、载体或表达载体,
的用途,其用于在烟草烘烤程序期间调节烟草中的一种或多种蛋白酶的表达或活性。
22.根据权利要求21所述的用途,其中所述烘烤程序可选自由以下组成的群组:空气烘烤、火烘烤、烟雾烘烤和烟道烘烤。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP14177715 | 2014-07-18 | ||
EP14177715.1 | 2014-07-18 | ||
PCT/EP2015/066341 WO2016009006A1 (en) | 2014-07-18 | 2015-07-16 | Tobacco protease genes |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106661556A true CN106661556A (zh) | 2017-05-10 |
Family
ID=51210347
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201580038165.0A Pending CN106661556A (zh) | 2014-07-18 | 2015-07-16 | 烟草蛋白酶基因 |
Country Status (12)
Country | Link |
---|---|
US (1) | US20170265516A1 (zh) |
EP (1) | EP3169149B1 (zh) |
JP (2) | JP2017529063A (zh) |
KR (1) | KR20170032317A (zh) |
CN (1) | CN106661556A (zh) |
AP (1) | AP2017009676A0 (zh) |
BR (1) | BR112017000932A2 (zh) |
CA (1) | CA2954828A1 (zh) |
MX (1) | MX2017000834A (zh) |
PH (1) | PH12016502546A1 (zh) |
RU (1) | RU2756102C2 (zh) |
WO (1) | WO2016009006A1 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111763687A (zh) * | 2019-03-12 | 2020-10-13 | 中国农业大学 | 一种基于基因编辑技术快速培育玉米单倍体诱导系的方法 |
CN114032324A (zh) * | 2021-11-23 | 2022-02-11 | 云南省烟草农业科学研究院 | 与烟草黑胫病1号生理小种抗性基因qBS1连锁的SSR标记 |
CN116590308A (zh) * | 2023-05-09 | 2023-08-15 | 西南大学 | 马铃薯耐旱性相关热激蛋白基因hsp101及其应用 |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106811453B (zh) * | 2017-01-05 | 2020-07-14 | 上海交通大学 | 百子莲组织蛋白酶b及其编码基因和探针及应用 |
JP2021519064A (ja) | 2018-03-28 | 2021-08-10 | フィリップ・モーリス・プロダクツ・ソシエテ・アノニム | 植物体における還元糖含有量の調節 |
GB201805949D0 (en) * | 2018-04-10 | 2018-05-23 | British American Tobacco Investments Ltd | Smoking article |
EP3632925A1 (en) | 2018-10-02 | 2020-04-08 | Universität für Bodenkultur Wien | Plant serine proteases |
WO2021063863A1 (en) | 2019-10-01 | 2021-04-08 | Philip Morris Products S.A. | Modulating sugar and amino acid content in a plant (sultr3) |
JP2022550383A (ja) | 2019-10-01 | 2022-12-01 | フィリップ・モーリス・プロダクツ・ソシエテ・アノニム | 植物における還元糖含有量の調節(inv) |
CN115747249A (zh) * | 2022-11-28 | 2023-03-07 | 湖南大学 | 烟草NtabCrRLK12基因在解除烟草连作障碍中的应用 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040106198A1 (en) * | 2002-07-16 | 2004-06-03 | Large Scale Biology Corporation | Inhibition of peptide cleavage in plants |
US20100012137A1 (en) * | 2006-12-15 | 2010-01-21 | U.S. Smokeless Tobacco Company | Tobacco plants having reduced nicotine demethylase activity |
CN103012571A (zh) * | 2012-12-05 | 2013-04-03 | 北京师范大学 | 降低烟草尼古丁含量的基因及应用 |
CN103403170A (zh) * | 2011-01-17 | 2013-11-20 | 菲利普莫里斯生产公司 | 植物中的蛋白质表达 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0865499B1 (en) * | 1995-09-14 | 2009-03-18 | Virginia Tech Intellectual Properties, Inc. | Production of lysosomal enzymes in plant-based expression systems |
AU2007299219A1 (en) * | 2006-04-05 | 2008-03-27 | Metanomics Gmbh | Process for the production of a fine chemical |
RU2324737C1 (ru) * | 2006-10-18 | 2008-05-20 | Институт цитологии и генетики Сибирского отделения Российской академии наук (СО РАН) | Способ получения трансгенных растений табака с повышенным содержанием пролина |
BRPI0911501A2 (pt) * | 2008-04-29 | 2015-07-28 | Monsanto Technology Llc | Genes e usos para melhoramento de plantas. |
-
2015
- 2015-07-16 CN CN201580038165.0A patent/CN106661556A/zh active Pending
- 2015-07-16 KR KR1020177001642A patent/KR20170032317A/ko not_active IP Right Cessation
- 2015-07-16 MX MX2017000834A patent/MX2017000834A/es unknown
- 2015-07-16 RU RU2017105148A patent/RU2756102C2/ru active
- 2015-07-16 BR BR112017000932A patent/BR112017000932A2/pt not_active Application Discontinuation
- 2015-07-16 JP JP2017502853A patent/JP2017529063A/ja active Pending
- 2015-07-16 US US15/325,997 patent/US20170265516A1/en not_active Abandoned
- 2015-07-16 AP AP2017009676A patent/AP2017009676A0/en unknown
- 2015-07-16 CA CA2954828A patent/CA2954828A1/en not_active Abandoned
- 2015-07-16 EP EP15738907.3A patent/EP3169149B1/en active Active
- 2015-07-16 WO PCT/EP2015/066341 patent/WO2016009006A1/en active Application Filing
-
2016
- 2016-12-20 PH PH12016502546A patent/PH12016502546A1/en unknown
-
2020
- 2020-11-02 JP JP2020183791A patent/JP2021045130A/ja active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040106198A1 (en) * | 2002-07-16 | 2004-06-03 | Large Scale Biology Corporation | Inhibition of peptide cleavage in plants |
US20100012137A1 (en) * | 2006-12-15 | 2010-01-21 | U.S. Smokeless Tobacco Company | Tobacco plants having reduced nicotine demethylase activity |
CN103403170A (zh) * | 2011-01-17 | 2013-11-20 | 菲利普莫里斯生产公司 | 植物中的蛋白质表达 |
CN103012571A (zh) * | 2012-12-05 | 2013-04-03 | 北京师范大学 | 降低烟草尼古丁含量的基因及应用 |
Non-Patent Citations (3)
Title |
---|
CATHERINE NAVARRE等: "Identification, gene cloning and expression of serine proteases in the extracellular medium of Nicotiana tabacum cells", 《PLANT CELL REP》 * |
杨昌达等: "《烟草生产与加工》", 30 April 1993, 贵州科技出版社 * |
赵娟等: "不同光质对烟草叶片生长发育过程中类半胱氨酸蛋白酶活性及其基因表达调控的影响", 《安徽农业科学》 * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111763687A (zh) * | 2019-03-12 | 2020-10-13 | 中国农业大学 | 一种基于基因编辑技术快速培育玉米单倍体诱导系的方法 |
CN111763687B (zh) * | 2019-03-12 | 2021-12-07 | 中国农业大学 | 一种基于基因编辑技术快速培育玉米单倍体诱导系的方法 |
CN114032324A (zh) * | 2021-11-23 | 2022-02-11 | 云南省烟草农业科学研究院 | 与烟草黑胫病1号生理小种抗性基因qBS1连锁的SSR标记 |
CN114032324B (zh) * | 2021-11-23 | 2023-09-01 | 云南省烟草农业科学研究院 | 与烟草黑胫病1号生理小种抗性基因qBS1连锁的SSR标记 |
CN116590308A (zh) * | 2023-05-09 | 2023-08-15 | 西南大学 | 马铃薯耐旱性相关热激蛋白基因hsp101及其应用 |
CN116590308B (zh) * | 2023-05-09 | 2024-03-29 | 西南大学 | 马铃薯耐旱性相关热激蛋白基因hsp101及其应用 |
Also Published As
Publication number | Publication date |
---|---|
JP2017529063A (ja) | 2017-10-05 |
WO2016009006A1 (en) | 2016-01-21 |
RU2756102C2 (ru) | 2021-09-28 |
BR112017000932A2 (pt) | 2017-11-14 |
MX2017000834A (es) | 2017-05-01 |
RU2017105148A (ru) | 2018-08-20 |
CA2954828A1 (en) | 2016-01-21 |
EP3169149C0 (en) | 2024-01-17 |
RU2017105148A3 (zh) | 2018-12-06 |
EP3169149B1 (en) | 2024-01-17 |
AP2017009676A0 (en) | 2017-01-31 |
US20170265516A1 (en) | 2017-09-21 |
PH12016502546A1 (en) | 2017-04-10 |
JP2021045130A (ja) | 2021-03-25 |
KR20170032317A (ko) | 2017-03-22 |
EP3169149A1 (en) | 2017-05-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10415050B2 (en) | Reduction of nicotine to nornicotine conversion in plants | |
CN103228671B (zh) | 植物中的重金属减少 | |
US10563215B2 (en) | Tobacco specific nitrosamine reduction in plants | |
US11213004B2 (en) | Reducing cadmium accumulation in field grown tobacco plants | |
CN106661556A (zh) | 烟草蛋白酶基因 | |
CN104080802B (zh) | 在植物中调节β大马酮 | |
JP6225108B2 (ja) | ニコチアナ・タバカムからのトレオニン合成酵素ならびにその方法および使用 | |
CN107074919A (zh) | 通过氯化物通道的异位表达调节植物中的生物质 | |
US11685929B2 (en) | Plants with shortened time to flowering | |
CN107920489A (zh) | 天冬酰胺含量减少的植物 | |
CN103958673A (zh) | 来自红花烟草的异丙基苹果酸合酶及其方法和用途 | |
EA041833B1 (ru) | Уменьшение накопления кадмия в выращенных в поле растениях табака | |
EP2586792A1 (en) | Modulating beta-damascenone in plants |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170510 |
|
RJ01 | Rejection of invention patent application after publication |